BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 005023
         (718 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359479833|ref|XP_002267103.2| PREDICTED: spermatogenesis-associated protein 20-like [Vitis
           vinifera]
          Length = 819

 Score = 1225 bits (3169), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 582/698 (83%), Positives = 633/698 (90%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFE+EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS
Sbjct: 122 STCHWCHVMEVESFENEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 181

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAW+ KRD+L +SGAFAIEQLSEAL
Sbjct: 182 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWENKRDVLVKSGAFAIEQLSEAL 241

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SA+ASSNKL D +PQ AL LCAEQL+ +YD  +GGFGSAPKFPRPVEIQ+MLYH KKLE+
Sbjct: 242 SATASSNKLADGIPQQALHLCAEQLAGNYDPEYGGFGSAPKFPRPVEIQLMLYHYKKLEE 301

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +GKSGEA+E  KMV F+LQCMA+GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 302 SGKSGEANEVLKMVAFSLQCMARGGVHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 361

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YLD FS+TKDVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE+E A RKKEGAFY+
Sbjct: 362 AYLDVFSITKDVFYSCVSRDILDYLRRDMIGPEGEIFSAEDADSAESEDAARKKEGAFYI 421

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WTSKEVED++GEHA LFK+HYY+KP+GNCDLSRMSDPHNEFKGKNVLIE N +SA ASKL
Sbjct: 422 WTSKEVEDVIGEHASLFKDHYYIKPSGNCDLSRMSDPHNEFKGKNVLIERNCASAMASKL 481

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           GMP+EKYL+ILG CRRKLFDVR  RPRPHLDDKVIVSWNGL ISSFARASKILKSEAE  
Sbjct: 482 GMPVEKYLDILGTCRRKLFDVRLNRPRPHLDDKVIVSWNGLAISSFARASKILKSEAEGT 541

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            F FPVVG D KEYMEVAE AASFIR+ LYDEQT RL+HSFRNGPSKAPGFLDDYAFLIS
Sbjct: 542 KFRFPVVGCDPKEYMEVAEKAASFIRKWLYDEQTRRLRHSFRNGPSKAPGFLDDYAFLIS 601

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLD+YEFG  T WLVWAIELQ+TQDELFLD+EGGGYFNT GEDPSVLLRVKEDHDGAEP
Sbjct: 602 GLLDIYEFGGNTNWLVWAIELQDTQDELFLDKEGGGYFNTPGEDPSVLLRVKEDHDGAEP 661

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVSVINLVRL S+VAGS  + +R+NAEH LAVFETRLKDMAMAVPLMCC ADM SVP
Sbjct: 662 SGNSVSVINLVRLTSMVAGSWFERHRRNAEHLLAVFETRLKDMAMAVPLMCCGADMFSVP 721

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VVLVGHKSSV+FE+MLAAAHA YD N+TVIHIDP +TE+M+FWE  NSN A MA+N
Sbjct: 722 SRKQVVLVGHKSSVEFEDMLAAAHAQYDPNRTVIHIDPTETEQMEFWEAMNSNIALMAKN 781

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
           NF+ DKVVALVCQNF+CS PVTD  SL+ LL  KPSS 
Sbjct: 782 NFAPDKVVALVCQNFTCSSPVTDSTSLKALLCLKPSSA 819


>gi|296086616|emb|CBI32251.3| unnamed protein product [Vitis vinifera]
          Length = 754

 Score = 1224 bits (3167), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 582/698 (83%), Positives = 633/698 (90%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFE+EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS
Sbjct: 57  STCHWCHVMEVESFENEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAW+ KRD+L +SGAFAIEQLSEAL
Sbjct: 117 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWENKRDVLVKSGAFAIEQLSEAL 176

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SA+ASSNKL D +PQ AL LCAEQL+ +YD  +GGFGSAPKFPRPVEIQ+MLYH KKLE+
Sbjct: 177 SATASSNKLADGIPQQALHLCAEQLAGNYDPEYGGFGSAPKFPRPVEIQLMLYHYKKLEE 236

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +GKSGEA+E  KMV F+LQCMA+GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 237 SGKSGEANEVLKMVAFSLQCMARGGVHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 296

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YLD FS+TKDVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE+E A RKKEGAFY+
Sbjct: 297 AYLDVFSITKDVFYSCVSRDILDYLRRDMIGPEGEIFSAEDADSAESEDAARKKEGAFYI 356

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WTSKEVED++GEHA LFK+HYY+KP+GNCDLSRMSDPHNEFKGKNVLIE N +SA ASKL
Sbjct: 357 WTSKEVEDVIGEHASLFKDHYYIKPSGNCDLSRMSDPHNEFKGKNVLIERNCASAMASKL 416

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           GMP+EKYL+ILG CRRKLFDVR  RPRPHLDDKVIVSWNGL ISSFARASKILKSEAE  
Sbjct: 417 GMPVEKYLDILGTCRRKLFDVRLNRPRPHLDDKVIVSWNGLAISSFARASKILKSEAEGT 476

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            F FPVVG D KEYMEVAE AASFIR+ LYDEQT RL+HSFRNGPSKAPGFLDDYAFLIS
Sbjct: 477 KFRFPVVGCDPKEYMEVAEKAASFIRKWLYDEQTRRLRHSFRNGPSKAPGFLDDYAFLIS 536

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLD+YEFG  T WLVWAIELQ+TQDELFLD+EGGGYFNT GEDPSVLLRVKEDHDGAEP
Sbjct: 537 GLLDIYEFGGNTNWLVWAIELQDTQDELFLDKEGGGYFNTPGEDPSVLLRVKEDHDGAEP 596

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVSVINLVRL S+VAGS  + +R+NAEH LAVFETRLKDMAMAVPLMCC ADM SVP
Sbjct: 597 SGNSVSVINLVRLTSMVAGSWFERHRRNAEHLLAVFETRLKDMAMAVPLMCCGADMFSVP 656

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VVLVGHKSSV+FE+MLAAAHA YD N+TVIHIDP +TE+M+FWE  NSN A MA+N
Sbjct: 657 SRKQVVLVGHKSSVEFEDMLAAAHAQYDPNRTVIHIDPTETEQMEFWEAMNSNIALMAKN 716

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
           NF+ DKVVALVCQNF+CS PVTD  SL+ LL  KPSS 
Sbjct: 717 NFAPDKVVALVCQNFTCSSPVTDSTSLKALLCLKPSSA 754


>gi|255559290|ref|XP_002520665.1| conserved hypothetical protein [Ricinus communis]
 gi|223540050|gb|EEF41627.1| conserved hypothetical protein [Ricinus communis]
          Length = 874

 Score = 1204 bits (3115), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/698 (82%), Positives = 637/698 (91%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYMT+VQALYGGGGWPLS
Sbjct: 62  STCHWCHVMEVESFEDESVAKLLNDWFVSIKVDREERPDVDKVYMTFVQALYGGGGWPLS 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPPED YGRPGFKT+LRKVKDAWDKKRD+L +SGAFAIEQLSEAL
Sbjct: 122 VFLSPDLKPLMGGTYFPPEDNYGRPGFKTLLRKVKDAWDKKRDVLIKSGAFAIEQLSEAL 181

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SASAS+NKLPD LPQNALR CAEQLS+SYD+RFGGFGSAPKFPRPVEIQ+MLYH+KKLED
Sbjct: 182 SASASTNKLPDGLPQNALRSCAEQLSQSYDARFGGFGSAPKFPRPVEIQLMLYHAKKLED 241

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           + K  +A EG KMV  +LQCMAKGGIHDH+GGGFHRYSVDERWHVPHFEKMLYDQGQLAN
Sbjct: 242 SEKVDDAKEGFKMVFSSLQCMAKGGIHDHIGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 301

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +YLDAFS+T DVFYS++ RDILDYLRRDMIG  GEIFSAEDADSAE EGA +K+EGAFYV
Sbjct: 302 IYLDAFSITNDVFYSFVSRDILDYLRRDMIGQKGEIFSAEDADSAEHEGAKKKREGAFYV 361

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT KE++DILGEHA LFK+HYY+KP GNCDLSRMSDPH EFKGKNVLIELND SA ASK 
Sbjct: 362 WTDKEIDDILGEHATLFKDHYYIKPLGNCDLSRMSDPHKEFKGKNVLIELNDPSALASKH 421

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+P+EKY +ILGE +R LFDVR++RPRPHLDDKVIVSWNGL IS+FARASKILK E+E  
Sbjct: 422 GLPIEKYQDILGESKRMLFDVRARRPRPHLDDKVIVSWNGLAISAFARASKILKRESEGT 481

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            +NFPVVG D +EY+EVAE+AA+FIR+HLY+EQT RLQHSFRNGPSKAPGFLDDYAFLIS
Sbjct: 482 RYNFPVVGCDPREYIEVAENAATFIRKHLYEEQTRRLQHSFRNGPSKAPGFLDDYAFLIS 541

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLDLYEFG G  WLVWA ELQNTQDELFLD+EGGGYFNT GEDPSVLLRVKEDHDGAEP
Sbjct: 542 GLLDLYEFGGGIYWLVWATELQNTQDELFLDKEGGGYFNTPGEDPSVLLRVKEDHDGAEP 601

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVS INL+RLAS+V GSKS+ YR NAEH LAVFETRLKDMAMAVPLMCCAADM+SVP
Sbjct: 602 SGNSVSAINLIRLASMVTGSKSECYRHNAEHLLAVFETRLKDMAMAVPLMCCAADMISVP 661

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VVLVGHK S + ++MLAAAH SYD NKTVIHIDP + EEM+FW ++NSN A MA+N
Sbjct: 662 SRKQVVLVGHKPSSELDDMLAAAHESYDPNKTVIHIDPTNNEEMEFWADNNSNIALMAKN 721

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
           NF+ADKVVA+VCQNF+CSPPVTDP SL+ LL +KP++ 
Sbjct: 722 NFTADKVVAVVCQNFTCSPPVTDPKSLKALLSKKPAAV 759


>gi|449436537|ref|XP_004136049.1| PREDICTED: spermatogenesis-associated protein 20-like [Cucumis
           sativus]
          Length = 855

 Score = 1191 bits (3082), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 561/717 (78%), Positives = 626/717 (87%), Gaps = 3/717 (0%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K     FL    +TCHWCHVMEVESFE++ VAKLLNDWFVSIKVDREERPD
Sbjct: 139 GEEAFAEAQKRNVPIFLSIGYSTCHWCHVMEVESFENKEVAKLLNDWFVSIKVDREERPD 198

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMTYVQALY GGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAWD
Sbjct: 199 VDKVYMTYVQALYSGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWD 258

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            KRD+L +SG FAIEQLSEAL+ +ASSNKLP+ELPQNAL LCAEQLS+SYD  FGGFGSA
Sbjct: 259 NKRDVLVKSGTFAIEQLSEALATTASSNKLPEELPQNALHLCAEQLSQSYDPNFGGFGSA 318

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFPRPVE Q+MLY++K+LE++GKS EA E   MV+F LQCMA+GGIHDHVGGGFHRYSV
Sbjct: 319 PKFPRPVEAQLMLYYAKRLEESGKSDEAEEILNMVIFGLQCMARGGIHDHVGGGFHRYSV 378

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DE WHVPHFEKMLYDQGQ+ NVYLDAFS+TKDVFYS++ RD+LDYLRRDMIG  GEI+SA
Sbjct: 379 DECWHVPHFEKMLYDQGQITNVYLDAFSITKDVFYSWVSRDVLDYLRRDMIGTQGEIYSA 438

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADSAE+EGATRKKEGAFYVWT KE++DILGEHA  FKEHYY+KP+GNCDLSRMSDPH+
Sbjct: 439 EDADSAESEGATRKKEGAFYVWTRKEIDDILGEHADFFKEHYYIKPSGNCDLSRMSDPHD 498

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           EFKGKNVLIE+   S  AS   MP+EKYL ILGECR+KLF+VR +RP+PHLDDKVIVSWN
Sbjct: 499 EFKGKNVLIEMKSVSEMASNHSMPVEKYLEILGECRQKLFEVRERRPKPHLDDKVIVSWN 558

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL ISSFARASKIL++E E   F FPVVG D KEY +VAE AA FI+  LYDEQTHRLQH
Sbjct: 559 GLTISSFARASKILRNEKEGTRFYFPVVGCDPKEYFDVAEKAALFIKTKLYDEQTHRLQH 618

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
           SFRNGPSKAPGFLDDYAFLI GLLDLYE+G G  WLVWAIELQ TQDELFLDREGGGY+N
Sbjct: 619 SFRNGPSKAPGFLDDYAFLIGGLLDLYEYGGGLNWLVWAIELQATQDELFLDREGGGYYN 678

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
           TTGED SV+LRVKEDHDGAEPSGNSVS INLVRL+S+V+GS+S+YYRQNAEH LAVFE R
Sbjct: 679 TTGEDKSVILRVKEDHDGAEPSGNSVSAINLVRLSSLVSGSRSNYYRQNAEHLLAVFEKR 738

Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
           LK+MA+AVPL+CCAA M S+PSRK VVLVGHK+S  FE  LAAAHASYD N+TVIH+DP 
Sbjct: 739 LKEMAVAVPLLCCAAGMFSIPSRKQVVLVGHKNSTQFETFLAAAHASYDPNRTVIHVDPT 798

Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
           D  E+ FWEE+N + A MA+NNF+ADKVVALVCQNF+C  P+TDP SLE +L EKPS
Sbjct: 799 DDTELQFWEENNRSIAVMAKNNFAADKVVALVCQNFTCKAPITDPGSLEAMLAEKPS 855


>gi|449498445|ref|XP_004160539.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
           20-like [Cucumis sativus]
          Length = 855

 Score = 1183 bits (3060), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 557/717 (77%), Positives = 622/717 (86%), Gaps = 3/717 (0%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K     FL    +TCHWCHVMEVESFE++ VAKLLNDWFVSIKVDREERPD
Sbjct: 139 GEEAFAEAQKRNVPIFLSIGYSTCHWCHVMEVESFENKEVAKLLNDWFVSIKVDREERPD 198

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMTYVQALY GGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAWD
Sbjct: 199 VDKVYMTYVQALYSGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWD 258

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            KRD+L +SG FAIEQLSEAL+ +ASSNKLP+ELPQNAL LCAEQLS+SYD  FGGFGSA
Sbjct: 259 NKRDVLVKSGTFAIEQLSEALATTASSNKLPEELPQNALHLCAEQLSQSYDPNFGGFGSA 318

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFPRPVE Q+MLY++K+LE++GKS EA E   MV+F LQCMA+GGIHDHVGGGFHRYSV
Sbjct: 319 PKFPRPVEAQLMLYYAKRLEESGKSDEAEEILNMVIFGLQCMARGGIHDHVGGGFHRYSV 378

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DE WHVPHFEKMLYDQG + NVYLDAFS+TKD  YS++ RD+LDYLRRDMIG  GEI+SA
Sbjct: 379 DECWHVPHFEKMLYDQGXITNVYLDAFSITKDXLYSWVSRDVLDYLRRDMIGTQGEIYSA 438

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADSAE+EGATR KEGAFYVWT KE++DILGEHA  FKEHYY+KP+GNCDLSRMSDPH+
Sbjct: 439 EDADSAESEGATRXKEGAFYVWTRKEIDDILGEHADFFKEHYYIKPSGNCDLSRMSDPHD 498

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           EFKGKNVLIE+   S  AS   MP+EKYL ILGECR+KLF+VR +RP+PHLDDKVIVSWN
Sbjct: 499 EFKGKNVLIEMKSVSEMASNHSMPVEKYLEILGECRQKLFEVRERRPKPHLDDKVIVSWN 558

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL ISSFARASKIL++E E   F FPVVG D KEY +VAE AA FI+  LYDEQTHRLQH
Sbjct: 559 GLTISSFARASKILRNEKEGTRFYFPVVGCDPKEYFDVAEKAALFIKTKLYDEQTHRLQH 618

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
           SFRNGPSKAPGFLDDYAFLI GLLDLYE+G G  WLVWAIELQ TQDELFLDREGGGY+N
Sbjct: 619 SFRNGPSKAPGFLDDYAFLIGGLLDLYEYGGGLNWLVWAIELQATQDELFLDREGGGYYN 678

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
           TTGED SV+LRVKEDHDGAEPSGNSVS INLVRL+S+V+GS+S+YYRQNAEH LAVFE R
Sbjct: 679 TTGEDKSVILRVKEDHDGAEPSGNSVSAINLVRLSSLVSGSRSNYYRQNAEHLLAVFEKR 738

Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
           LK+MA+AVPL+CCAA M S+PSRK VVLVGHK+S  FE  LAAAHASYD N+TVIH+DP 
Sbjct: 739 LKEMAVAVPLLCCAAGMFSIPSRKQVVLVGHKNSTQFETFLAAAHASYDPNRTVIHVDPT 798

Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
           D  E+ FWEE+N + A MA+NNF+ADKVVALVCQNF+C  P+TDP SLE +L EKPS
Sbjct: 799 DDTELQFWEENNRSIAVMAKNNFAADKVVALVCQNFTCKAPITDPGSLEAMLAEKPS 855


>gi|356570951|ref|XP_003553646.1| PREDICTED: spermatogenesis-associated protein 20-like [Glycine max]
          Length = 755

 Score = 1165 bits (3014), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 552/688 (80%), Positives = 608/688 (88%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYM+YVQALYGGGGWPLS
Sbjct: 56  STCHWCHVMEVESFEDEAVAKLLNDWFVSIKVDREERPDVDKVYMSYVQALYGGGGWPLS 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPP+DKYGRPGFKTILRK+K+AWD KRDML + G++AIEQLSEA+
Sbjct: 116 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRKLKEAWDSKRDMLIKRGSYAIEQLSEAM 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SAS+ S+KLPD +P +ALRLC+EQLS SYDS+FGGFGSAPKFPRPVEI +MLYHSKKLED
Sbjct: 176 SASSDSDKLPDGVPADALRLCSEQLSGSYDSKFGGFGSAPKFPRPVEINLMLYHSKKLED 235

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TGK   A+  QKMV F+LQCMAKGG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 236 TGKLDGANRIQKMVFFSLQCMAKGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 295

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RKKEGAFY+
Sbjct: 296 VYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARKKEGAFYI 355

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT KEV DILGEHA LF+EHYY+K +GNC+LS MSDPH+EFKGKNVLIE  + S  ASK 
Sbjct: 356 WTGKEVADILGEHAALFEEHYYIKQSGNCNLSGMSDPHDEFKGKNVLIERKEPSELASKY 415

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           GM +E Y  ILGECR KLF+VRS+RP+PHLDDKVIVSWNGL ISSFARASKILK E E  
Sbjct: 416 GMSIETYQEILGECRHKLFEVRSRRPKPHLDDKVIVSWNGLAISSFARASKILKGEVEGT 475

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            F FPVVG++ K Y+ +AE AA FI + LY+ +THRL HSFR+ PSKAP FLDDYAFLIS
Sbjct: 476 KFYFPVVGTEAKGYLRIAEKAAFFIWKQLYNVETHRLHHSFRHSPSKAPAFLDDYAFLIS 535

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLDLYEFG G  WL+WAIELQ TQD LFLDR GGGYFN TGED SVLLRVKEDHDGAEP
Sbjct: 536 GLLDLYEFGGGINWLLWAIELQETQDALFLDRTGGGYFNNTGEDSSVLLRVKEDHDGAEP 595

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVS INL+RLAS+VAGSK+++Y+QNAEH LAVFE RLKDMAMAVPLMCCAADML VP
Sbjct: 596 SGNSVSAINLIRLASMVAGSKAEHYKQNAEHLLAVFERRLKDMAMAVPLMCCAADMLHVP 655

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VV+VG ++S DFENMLAAAHA YD N+TVIHIDP + EEM FWE +NSN A MA+N
Sbjct: 656 SRKQVVVVGERTSGDFENMLAAAHALYDPNRTVIHIDPNNKEEMGFWEVNNSNVALMAKN 715

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLE 707
           NF+ DKVVALVCQNF+CSPPVTD  SLE
Sbjct: 716 NFAVDKVVALVCQNFTCSPPVTDHSSLE 743


>gi|115432144|gb|ABI97349.1| cold-induced thioredoxin domain-containing protein [Ammopiptanthus
           mongolicus]
          Length = 839

 Score = 1157 bits (2993), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 567/712 (79%), Positives = 621/712 (87%), Gaps = 3/712 (0%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F   ++     FL    +TCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPD
Sbjct: 119 GEEAFSEASRRDVPIFLSIGYSTCHWCHVMEVESFEDEEVAKLLNDWFVSIKVDREERPD 178

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKTILRKVK+AWD
Sbjct: 179 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRKVKEAWD 238

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            KRDML +SGAF IEQLSEALSAS+ S+KLPD +P  AL LC+EQLS SYDS+FGGFGSA
Sbjct: 239 SKRDMLIKSGAFTIEQLSEALSASSVSDKLPDGVPDEALNLCSEQLSGSYDSKFGGFGSA 298

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFPRPVE  +MLYHS+KLEDTGK G A+E QKMV F LQCMAKGGIHDH+GGGFHRYSV
Sbjct: 299 PKFPRPVEFNLMLYHSRKLEDTGKLGAANESQKMVFFNLQCMAKGGIHDHIGGGFHRYSV 358

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DE WHVPHFEKMLYDQGQLANVYLDAFS+TKD FYS I +DILDYLRRDMIGP GEIFSA
Sbjct: 359 DECWHVPHFEKMLYDQGQLANVYLDAFSITKDTFYSCISQDILDYLRRDMIGPEGEIFSA 418

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADSAE EGATRKKEGAFY+WTSKEVEDILG+HA LFKEHYY+K +GNCDLSRMSDPH+
Sbjct: 419 EDADSAEIEGATRKKEGAFYIWTSKEVEDILGDHAALFKEHYYIKQSGNCDLSRMSDPHD 478

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           EFKGKNVLIE  D+S  ASK GM +E Y  ILGECRRKLF+VRS+R RPHLDDKVIVSWN
Sbjct: 479 EFKGKNVLIERKDTSEMASKYGMSVETYQEILGECRRKLFEVRSRRSRPHLDDKVIVSWN 538

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL ISSFARASKILK EAE   FNFPVVG++ KEY+ +AE AA FIR+ LYD +THRL H
Sbjct: 539 GLAISSFARASKILKREAEGTKFNFPVVGTEPKEYLVIAEKAAFFIRKQLYDVETHRLHH 598

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
           SFRN PSKAPGFLDDYAFLISGLLDLYEFG G  WL+WA ELQ TQD LFLDR+GGGYFN
Sbjct: 599 SFRNSPSKAPGFLDDYAFLISGLLDLYEFGGGINWLLWAFELQETQDALFLDRDGGGYFN 658

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
             GEDPSVLLRVKEDHDGAEPSGNSVS INL+RLAS+VAGSK+  Y++NAEH LAVFE R
Sbjct: 659 NAGEDPSVLLRVKEDHDGAEPSGNSVSAINLIRLASMVAGSKAADYKRNAEHLLAVFEKR 718

Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
           LKDMAMAVPLMCCAADML VPSRK VV+VG +S  +FE+MLAAAHASYD N+TV+HIDP 
Sbjct: 719 LKDMAMAVPLMCCAADMLRVPSRKQVVVVGERSFEEFESMLAAAHASYDPNRTVVHIDPN 778

Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             EEM+FWE +NSN A MA+NN+  +KVVALVCQNF+CSPPVTD ++LE LL
Sbjct: 779 YKEEMEFWEVNNSNIALMAKNNYRVNKVVALVCQNFTCSPPVTDHLALEALL 830


>gi|356505532|ref|XP_003521544.1| PREDICTED: spermatogenesis-associated protein 20-like [Glycine max]
          Length = 809

 Score = 1157 bits (2992), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 560/698 (80%), Positives = 622/698 (89%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYM+YVQALYGGGGWPLS
Sbjct: 110 STCHWCHVMEVESFEDEAVAKLLNDWFVSIKVDREERPDVDKVYMSYVQALYGGGGWPLS 169

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPP+DKYGRPGFKTILRKVK+AWD KRDML +SG++AIEQLSEA+
Sbjct: 170 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRKVKEAWDSKRDMLIKSGSYAIEQLSEAM 229

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SAS+ S+KLPD +P +ALRLC+EQLS SYDS+FGGFGSAPKFPRPVEI +MLYHSKKLED
Sbjct: 230 SASSDSDKLPDGVPADALRLCSEQLSGSYDSKFGGFGSAPKFPRPVEINLMLYHSKKLED 289

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TGK G A+  Q+MV F+LQCMAKGGIHDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 290 TGKLGVANGSQQMVFFSLQCMAKGGIHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 349

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RKKEGAFY+
Sbjct: 350 VYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARKKEGAFYI 409

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WTSKEVED+LGEHA LF+EHYY+K  GNCDLS MSDPH+EFKGKNVLIE  + S  ASK 
Sbjct: 410 WTSKEVEDLLGEHAALFEEHYYIKQLGNCDLSGMSDPHDEFKGKNVLIERKEPSELASKY 469

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           GM +E Y  ILGECR KLF+VRS+RP+PHLDDKVIVSWNGL ISSFARASKILK EAE  
Sbjct: 470 GMSVETYQEILGECRHKLFEVRSRRPKPHLDDKVIVSWNGLAISSFARASKILKGEAEGT 529

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            F FPV+G++ KEYM +AE AASFIR+ LY+ +THRL HSFR+ PSKAP FLDDYAFLIS
Sbjct: 530 KFYFPVIGTEPKEYMGIAEKAASFIRKQLYNVETHRLHHSFRHSPSKAPAFLDDYAFLIS 589

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLDLYEFG G  WL+WAIELQ TQD LFLD+ GGGYFN TGED SVLLRVKEDHDGAEP
Sbjct: 590 GLLDLYEFGGGISWLLWAIELQETQDALFLDKTGGGYFNNTGEDASVLLRVKEDHDGAEP 649

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVS INL+RLAS+VAGSK+++Y++NAEH LAVFE RLKDMAMAVPLMCCAADML V 
Sbjct: 650 SGNSVSAINLIRLASMVAGSKAEHYKRNAEHLLAVFEKRLKDMAMAVPLMCCAADMLRVL 709

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VV+VG ++S DFENMLAAAHA YD N+TVIHIDP + +EM+FWE +NSN A MA+N
Sbjct: 710 SRKQVVVVGERTSEDFENMLAAAHAVYDPNRTVIHIDPNNKDEMEFWEVNNSNVALMAKN 769

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
           NF+ +KVVALVCQNF+CSP VTD  SL+ LL +KPSS+
Sbjct: 770 NFAVNKVVALVCQNFTCSPSVTDHSSLKALLSKKPSSS 807


>gi|224132400|ref|XP_002321330.1| predicted protein [Populus trichocarpa]
 gi|222862103|gb|EEE99645.1| predicted protein [Populus trichocarpa]
          Length = 756

 Score = 1154 bits (2985), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 567/683 (83%), Positives = 618/683 (90%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM+VESFEDE VA+LLND FVS+KVDREERPDVDKVYMT+VQALYGGGGWPLS
Sbjct: 61  STCHWCHVMKVESFEDEEVAELLNDSFVSVKVDREERPDVDKVYMTFVQALYGGGGWPLS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF+SPDLKPLMGGTYFPP+DKYGRPGFKTILRKVKDAW  KRD L +SGAFAIEQLSEAL
Sbjct: 121 VFISPDLKPLMGGTYFPPDDKYGRPGFKTILRKVKDAWFSKRDTLVKSGAFAIEQLSEAL 180

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SASASS KLPDEL QNAL LCAEQLS+SYDSR+GGFGSAPKFPRPVEIQ+MLYHSKKL+D
Sbjct: 181 SASASSKKLPDELSQNALHLCAEQLSQSYDSRYGGFGSAPKFPRPVEIQLMLYHSKKLDD 240

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            G   E+ +G +MV FTLQCMA+GGIHDH+GGGFHRYSVDERWHVPHFEKMLYDQGQL N
Sbjct: 241 AGNYSESKKGLQMVFFTLQCMARGGIHDHIGGGFHRYSVDERWHVPHFEKMLYDQGQLVN 300

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYLDAFS+T DVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE E A +KKEGAFY+
Sbjct: 301 VYLDAFSITNDVFYSSLSRDILDYLRRDMIGPEGEIFSAEDADSAEREDAKKKKEGAFYI 360

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WTS+E++D+LGEHA LFK+HYY+KP GNCDLSRMSDP +EFKGKNVLIEL D+SA A K 
Sbjct: 361 WTSQEIDDLLGEHATLFKDHYYVKPLGNCDLSRMSDPQDEFKGKNVLIELTDTSAPAKKY 420

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+PLEKYL+ILGECR+KLFD RS+ PRPHLDDKVIVSWNGL ISS ARASKIL  EAE  
Sbjct: 421 GLPLEKYLDILGECRQKLFDARSRGPRPHLDDKVIVSWNGLAISSLARASKILMGEAEGT 480

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            +NFPVVG D KEYM  AE AASFIRRHLY+EQ HRL+HSFRNGPSKAPGFLDDYAFLIS
Sbjct: 481 KYNFPVVGCDPKEYMTAAEKAASFIRRHLYNEQAHRLEHSFRNGPSKAPGFLDDYAFLIS 540

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLDLYE G G  WLVWA ELQN QDELFLDREGGGYFNT GEDPSVLLRVKEDHDGAEP
Sbjct: 541 GLLDLYEVGGGIHWLVWATELQNKQDELFLDREGGGYFNTPGEDPSVLLRVKEDHDGAEP 600

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVS INL+RLAS++ GSKS+YYRQNAEH LAVFE+RLKDMAMAVPLMCCAADM+SVP
Sbjct: 601 SGNSVSAINLIRLASMMTGSKSEYYRQNAEHLLAVFESRLKDMAMAVPLMCCAADMISVP 660

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           S K VVLVGHKSS++F+ MLAAAHASYD N+TVIHIDP D EEM+ WE++NSN A MARN
Sbjct: 661 SHKQVVLVGHKSSLEFDKMLAAAHASYDPNRTVIHIDPTDNEEMEIWEDNNSNIALMARN 720

Query: 680 NFSADKVVALVCQNFSCSPPVTD 702
           NF+ADKVVALVCQNF+CSPPVTD
Sbjct: 721 NFAADKVVALVCQNFTCSPPVTD 743


>gi|357511183|ref|XP_003625880.1| Spermatogenesis-associated protein [Medicago truncatula]
 gi|355500895|gb|AES82098.1| Spermatogenesis-associated protein [Medicago truncatula]
          Length = 809

 Score = 1145 bits (2962), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 561/708 (79%), Positives = 621/708 (87%), Gaps = 11/708 (1%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFEDEG+AKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPL+
Sbjct: 102 STCHWCHVMEVESFEDEGIAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLT 161

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVK+AW+ KRDML +SG FAIEQLSEAL
Sbjct: 162 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKEAWENKRDMLVKSGTFAIEQLSEAL 221

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S+S++S+KLPD + ++ALRLC+EQLS++YDS +GGFGSAPKFPRPVEI +MLY SKKLED
Sbjct: 222 SSSSNSDKLPDGVSEDALRLCSEQLSENYDSEYGGFGSAPKFPRPVEINLMLYKSKKLED 281

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH-----------VPHFE 248
           TGK   A++ QKMV FTLQCMAKGG+HDHVGGGFHRYSVDE WH           VPHFE
Sbjct: 282 TGKLDGANKSQKMVFFTLQCMAKGGVHDHVGGGFHRYSVDECWHDIYSLSSYTHAVPHFE 341

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYDQGQLANVYLDAFS+TKD FYS + RDILDYLRRDMIGP GEIFSAEDADSAE EG
Sbjct: 342 KMLYDQGQLANVYLDAFSITKDTFYSSLSRDILDYLRRDMIGPEGEIFSAEDADSAENEG 401

Query: 309 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
            TRKKEGAFYVWTSKEVED+LGEHA LF+EHYY+K  GNCDLS MSDPHNEFKGKNVLIE
Sbjct: 402 DTRKKEGAFYVWTSKEVEDLLGEHAALFEEHYYIKQMGNCDLSEMSDPHNEFKGKNVLIE 461

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
             DSS  ASK GM +E Y  ILGECRRKLF+VR KRP+PHLDDKVIVSWNGLVISSFARA
Sbjct: 462 RKDSSEMASKYGMSIETYQEILGECRRKLFEVRLKRPKPHLDDKVIVSWNGLVISSFARA 521

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
           SKILK EAE   FNFPVVG++ KEY+ +A+ AASFI+  LY+ +THRLQHSFRN PSKAP
Sbjct: 522 SKILKGEAEGIKFNFPVVGTEPKEYLRIADKAASFIKNQLYNTETHRLQHSFRNSPSKAP 581

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           GFLDDYAFLISGLLDLYEFG    WL+WAIELQ TQD LFLD++GGGYFN TGED SVLL
Sbjct: 582 GFLDDYAFLISGLLDLYEFGGEINWLLWAIELQETQDTLFLDKDGGGYFNNTGEDSSVLL 641

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           RVKEDHDGAEPSGNSVS +NL+RLAS+V+GSK+++Y++NAEH LAVFE RLKD AMAVPL
Sbjct: 642 RVKEDHDGAEPSGNSVSALNLIRLASLVSGSKAEHYKRNAEHLLAVFEKRLKDTAMAVPL 701

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           MCCAADML VPSRK VVLVG ++S +FE+ML AAHA YD N+TVIHIDP + EEMDFWE 
Sbjct: 702 MCCAADMLRVPSRKQVVLVGERTSEEFESMLGAAHALYDPNRTVIHIDPNNKEEMDFWEV 761

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
           +NSN A MA+NN+S  KVVALVCQNF+CS PVTD  SLE LL +KPSS
Sbjct: 762 NNSNIALMAKNNYSGSKVVALVCQNFTCSAPVTDHSSLEALLSQKPSS 809


>gi|147817761|emb|CAN68939.1| hypothetical protein VITISV_028994 [Vitis vinifera]
          Length = 1575

 Score = 1122 bits (2903), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 541/680 (79%), Positives = 589/680 (86%), Gaps = 21/680 (3%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CHVMEVESFE+EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP
Sbjct: 88  CHVMEVESFENEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 147

Query: 85  DLKPLMGGTYFPPEDKYGRPGFKTILR------------------KVKDAWDKKRDMLAQ 126
           DLKPLMGGTYFPP+DKYGRPGFKT+LR                  KVKDAW+ KRD+L +
Sbjct: 148 DLKPLMGGTYFPPDDKYGRPGFKTVLRMSIFVFVLAILLYLYSFRKVKDAWENKRDVLVK 207

Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
           SGAFAIEQLSEALSA+ASSNKL D +PQ AL LCAEQL+ +YD  +GGFGSAPKFPRPVE
Sbjct: 208 SGAFAIEQLSEALSATASSNKLADGIPQQALHLCAEQLAGNYDPEYGGFGSAPKFPRPVE 267

Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           IQ+MLYH KKLE++GKSGEA+E  KMV F+LQCMA+GG+HDH+GGGFHRYSVDE WHVPH
Sbjct: 268 IQLMLYHYKKLEESGKSGEANEVLKMVAFSLQCMARGGVHDHIGGGFHRYSVDECWHVPH 327

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           FEKMLYDQGQLAN YLD FS+TKDVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE+
Sbjct: 328 FEKMLYDQGQLANAYLDVFSITKDVFYSCVSRDILDYLRRDMIGPEGEIFSAEDADSAES 387

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
           E A RKKEGAFY+WTSKEVED++GEHA LFK+HYY+KP+GNCDLSRMSDPHNEFKGKNVL
Sbjct: 388 EDAARKKEGAFYIWTSKEVEDVIGEHASLFKDHYYIKPSGNCDLSRMSDPHNEFKGKNVL 447

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
           IE N +SA ASKLGMP+EKYL+ILG CRRKLFDVR  RPRPHLDDKVIVSWNGL ISSFA
Sbjct: 448 IERNCASAMASKLGMPVEKYLDILGTCRRKLFDVRLNRPRPHLDDKVIVSWNGLAISSFA 507

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           RASKILKSEAE   F FPVVG D KEYMEVAE AASFIR+ LYDEQT RL+HSFRNGPSK
Sbjct: 508 RASKILKSEAEGTKFRFPVVGCDPKEYMEVAEKAASFIRKWLYDEQTRRLRHSFRNGPSK 567

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
           APGFLDDYAFLISGLLD+YEFG  T WLVWAIELQ+TQ                GEDPSV
Sbjct: 568 APGFLDDYAFLISGLLDIYEFGGNTNWLVWAIELQDTQAWTLYPVPSP---ILGGEDPSV 624

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           LLRVKEDHDGAEPSGNSVSVINLVRL S+VAGS  + +R+NAEH LAVFETRLKDMAMAV
Sbjct: 625 LLRVKEDHDGAEPSGNSVSVINLVRLTSMVAGSWFERHRRNAEHLLAVFETRLKDMAMAV 684

Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
           PLMCC ADM SVPSRK VVLVGHKSSV+FE+MLAAAHA YD N+TVIHIDP +TE+M+FW
Sbjct: 685 PLMCCGADMFSVPSRKQVVLVGHKSSVEFEDMLAAAHAQYDPNRTVIHIDPTETEQMEFW 744

Query: 667 EEHNSNNASMARNNFSADKV 686
           E  NSN A MA+NNF+ DK+
Sbjct: 745 EAMNSNIALMAKNNFAPDKL 764


>gi|30679394|ref|NP_192229.3| uncharacterized protein [Arabidopsis thaliana]
 gi|332656888|gb|AEE82288.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 818

 Score = 1099 bits (2842), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 519/691 (75%), Positives = 596/691 (86%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFEDE VAKLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLS
Sbjct: 126 STCHWCHVMEVESFEDEEVAKLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLS 185

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+AL
Sbjct: 186 VFLSPDLKPLMGGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKAL 245

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SAS  ++KL D + + A+  CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL++
Sbjct: 246 SASTGADKLSDGISREAVSTCAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKE 305

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +GK+ EA E + MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 306 SGKTSEADEEKSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 365

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYLD FS+TKDV YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+
Sbjct: 366 VYLDGFSITKDVMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYI 425

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WTS E++++LGE+A LFKEHYY+K +GNCDLS  SDPHNEF GKNVLIE N++SA ASK 
Sbjct: 426 WTSDEIDEVLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKF 485

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            + +EKY  ILGECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES 
Sbjct: 486 SLSVEKYQEILGECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPEST 545

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            + FPVV S  ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLIS
Sbjct: 546 KYYFPVVNSQPEDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLIS 605

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLDLYE G G +WL WAI+LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEP
Sbjct: 606 GLLDLYENGGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEP 665

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVS INLVRLASIVAG K++ Y   A   LAVFE RL+++A+AVPLMCC+ADM+SVP
Sbjct: 666 SGNSVSAINLVRLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVP 725

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VVLVG KSS +  NML+AAH+ YD NKTVIHIDP+ ++E++FWEEHNSN A MA+ 
Sbjct: 726 SRKQVVLVGSKSSPELTNMLSAAHSVYDPNKTVIHIDPSSSDEIEFWEEHNSNVAEMAKK 785

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           N +++KVVALVCQ+F+CSPPV D  SL  LL
Sbjct: 786 NRNSEKVVALVCQHFTCSPPVFDSSSLTRLL 816


>gi|17064908|gb|AAL32608.1| predicted protein of unknown function [Arabidopsis thaliana]
 gi|34098807|gb|AAQ56786.1| At4g03200 [Arabidopsis thaliana]
          Length = 756

 Score = 1099 bits (2842), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 519/691 (75%), Positives = 596/691 (86%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFEDE VAKLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLS
Sbjct: 64  STCHWCHVMEVESFEDEEVAKLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLS 123

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+AL
Sbjct: 124 VFLSPDLKPLMGGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKAL 183

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SAS  ++KL D + + A+  CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL++
Sbjct: 184 SASTGADKLSDGISREAVSTCAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKE 243

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +GK+ EA E + MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 244 SGKTSEADEEKSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 303

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYLD FS+TKDV YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+
Sbjct: 304 VYLDGFSITKDVMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYI 363

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WTS E++++LGE+A LFKEHYY+K +GNCDLS  SDPHNEF GKNVLIE N++SA ASK 
Sbjct: 364 WTSDEIDEVLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKF 423

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            + +EKY  ILGECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES 
Sbjct: 424 SLSVEKYQEILGECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPEST 483

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            + FPVV S  ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLIS
Sbjct: 484 KYYFPVVNSQPEDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLIS 543

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLDLYE G G +WL WAI+LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEP
Sbjct: 544 GLLDLYENGGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEP 603

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVS INLVRLASIVAG K++ Y   A   LAVFE RL+++A+AVPLMCC+ADM+SVP
Sbjct: 604 SGNSVSAINLVRLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVP 663

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VVLVG KSS +  NML+AAH+ YD NKTVIHIDP+ ++E++FWEEHNSN A MA+ 
Sbjct: 664 SRKQVVLVGSKSSPELTNMLSAAHSVYDPNKTVIHIDPSSSDEIEFWEEHNSNVAEMAKK 723

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           N +++KVVALVCQ+F+CSPPV D  SL  LL
Sbjct: 724 NRNSEKVVALVCQHFTCSPPVFDSSSLTRLL 754


>gi|297813987|ref|XP_002874877.1| hypothetical protein ARALYDRAFT_911883 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320714|gb|EFH51136.1| hypothetical protein ARALYDRAFT_911883 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 812

 Score = 1092 bits (2824), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 516/691 (74%), Positives = 594/691 (85%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFEDE VAKLLND FVSIKVDREERPDVDKVYM++VQALYGGGGWPLS
Sbjct: 120 STCHWCHVMEVESFEDEEVAKLLNDSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLS 179

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPP D YGRPGFKT+L+KVKDAWD KRD L +SG +AIE+L++AL
Sbjct: 180 VFLSPDLKPLMGGTYFPPNDNYGRPGFKTLLKKVKDAWDSKRDTLVKSGTYAIEELTKAL 239

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SASA ++KL D + + A+ +CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLY+ KKL++
Sbjct: 240 SASAGADKLSDGISREAVSICAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYYFKKLKE 299

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +GK+ EA E Q MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 300 SGKTSEADEEQSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 359

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYLD F +TKDV YSY+ +DILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+
Sbjct: 360 VYLDGFIITKDVIYSYVAKDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYI 419

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W+S E++++LGE+A LFKEHYY+K +GNCDLS  SDPHNEF GKNVLIE N+ SA ASK 
Sbjct: 420 WSSDEIDEVLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNEMSAMASKF 479

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            + +EKY  ILGECR+KLFDVR  RP+PHLDDK+IVSWNGLVISSFARASK+LK+E ES 
Sbjct: 480 SLSVEKYQEILGECRKKLFDVRLNRPKPHLDDKIIVSWNGLVISSFARASKMLKAEPEST 539

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            + FPVV S  +EY+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLI+
Sbjct: 540 KYCFPVVNSQPEEYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLIA 599

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLDLYE G G +WL WAI+LQ TQDEL+LDREGG YFNT G+D SVLLRVKEDHDGAEP
Sbjct: 600 GLLDLYENGGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDSSVLLRVKEDHDGAEP 659

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVS INLVRLASIV G K+D Y   A   LAVFE RL++MA+AVPLMCCAADM+SVP
Sbjct: 660 SGNSVSAINLVRLASIVTGEKADSYLNTAHRLLAVFELRLREMAVAVPLMCCAADMISVP 719

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VVLVG KSS +  NML+AAH+ YD NKTVIHIDP++++EM+FWEE+NSN A MA+ 
Sbjct: 720 SRKQVVLVGSKSSPELNNMLSAAHSVYDPNKTVIHIDPSNSDEMEFWEEYNSNVAEMAKK 779

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           N +++KVVALVCQ+F+CSPPV D  SL  LL
Sbjct: 780 NRNSEKVVALVCQHFTCSPPVFDSSSLTRLL 810


>gi|319428654|gb|ADV56678.1| hypothetical protein [Phaseolus vulgaris]
          Length = 804

 Score = 1085 bits (2805), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 545/732 (74%), Positives = 603/732 (82%), Gaps = 48/732 (6%)

Query: 26  HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 85
           H+  VESFED  VAKLLNDWFVSIKVDREERPDVDK       ALYGGGGWPLSVFLSPD
Sbjct: 78  HLSLVESFEDAAVAKLLNDWFVSIKVDREERPDVDK-------ALYGGGGWPLSVFLSPD 130

Query: 86  LKPLMGGTYFPPEDKYGRPGFKTILR-------------KVKDAWDKKRDMLAQSGAFAI 132
           LKPLMGGTYFPP+DKYGRPGFKTILR             KVK AWD KRDML +SGAFAI
Sbjct: 131 LKPLMGGTYFPPDDKYGRPGFKTILRFLFVYSSVPAFSRKVKQAWDSKRDMLIKSGAFAI 190

Query: 133 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
           EQLSEA+S S++S+KLPD +P +ALRLC+EQLS  YDS+FGGFGSAPKFPRPVEI +MLY
Sbjct: 191 EQLSEAMSISSTSDKLPDGVPADALRLCSEQLSGGYDSKFGGFGSAPKFPRPVEINLMLY 250

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
           HSKKLE+TGK   A+  QKMVLF+LQCMAKGGIHDH+GGGFHRYSVDE WHVPHFEKMLY
Sbjct: 251 HSKKLEETGKLDGANGSQKMVLFSLQCMAKGGIHDHIGGGFHRYSVDECWHVPHFEKMLY 310

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           DQGQLANVYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RK
Sbjct: 311 DQGQLANVYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARK 370

Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           KEGAFY+W SKEV+DILGEHA LF+EHYY+K +GNCDLS MSDPHNEFK KNVLIE  + 
Sbjct: 371 KEGAFYIWASKEVQDILGEHAALFEEHYYIKQSGNCDLSGMSDPHNEFKEKNVLIERKEL 430

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           S  ASK GM +E Y  ILGECRRKLF+ RS+RP+PHLDDKVIVSWNGL +SSFARASKIL
Sbjct: 431 SELASKYGMSVETYQEILGECRRKLFEARSRRPKPHLDDKVIVSWNGLAVSSFARASKIL 490

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
           KSEAE   F FPVVG++ KEYM +AE AA FIR+ LYD +T RL HSFR  PSKAPGFLD
Sbjct: 491 KSEAEGTKFYFPVVGTEPKEYMRIAEKAAFFIRKELYDVETRRLYHSFRRSPSKAPGFLD 550

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFLISGLLDLYEFG G  WL+WAIELQ TQD LFLD+ GGGYFN TGEDPSVLLRVKE
Sbjct: 551 DYAFLISGLLDLYEFGGGVSWLLWAIELQETQDSLFLDKAGGGYFNNTGEDPSVLLRVKE 610

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-------------------- 592
           DHDGAEPSGNSVS INL+RLAS+V+GSK++ YR+NAEH L                    
Sbjct: 611 DHDGAEPSGNSVSAINLIRLASMVSGSKAENYRRNAEHLLVCKLLSLFPLKAFSSHICAN 670

Query: 593 --------AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
                   AVFE RLKDMAMAVPLMCCAADML VPSRK VV+VG ++S +FENML AAHA
Sbjct: 671 NGGMGLFEAVFEKRLKDMAMAVPLMCCAADMLRVPSRKQVVVVGGRTSEEFENMLTAAHA 730

Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPI 704
            YD N+TVIHIDP++ EEM+FWE +NSN + MA+NN++ +KVVALVCQNF+CSPP+TD  
Sbjct: 731 LYDPNRTVIHIDPSNKEEMEFWEVNNSNVSLMAKNNYAVNKVVALVCQNFTCSPPLTDRS 790

Query: 705 SLENLLLEKPSS 716
           SLE LL +KPSS
Sbjct: 791 SLEALLSKKPSS 802


>gi|319428671|gb|ADV56694.1| hypothetical protein [Phaseolus vulgaris]
          Length = 804

 Score = 1084 bits (2803), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 544/732 (74%), Positives = 603/732 (82%), Gaps = 48/732 (6%)

Query: 26  HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 85
           H+  VESFED  VAKLLNDWFVSIKVDREERPDVDK       ALYGGGGWPLSVFLSPD
Sbjct: 78  HLSLVESFEDAAVAKLLNDWFVSIKVDREERPDVDK-------ALYGGGGWPLSVFLSPD 130

Query: 86  LKPLMGGTYFPPEDKYGRPGFKTILR-------------KVKDAWDKKRDMLAQSGAFAI 132
           LKPLMGGTYFPP+DKYGRPGFKTILR             KVK AWD KRDML +SGAFAI
Sbjct: 131 LKPLMGGTYFPPDDKYGRPGFKTILRFLFVYSSVPAFSRKVKQAWDSKRDMLIKSGAFAI 190

Query: 133 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
           EQLSEA+S S++S+KLPD +P +ALRLC+EQLS  YDS+FGGFGSAPKFPRPVEI +MLY
Sbjct: 191 EQLSEAMSISSTSDKLPDGVPADALRLCSEQLSGGYDSKFGGFGSAPKFPRPVEINLMLY 250

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
           HSKKLE+TGK   A+  QKMVLF+LQCMAKGGIHDH+GGGFHRYSVDE WHVPHFEKMLY
Sbjct: 251 HSKKLEETGKLDGANGSQKMVLFSLQCMAKGGIHDHIGGGFHRYSVDECWHVPHFEKMLY 310

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           DQGQLANVYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RK
Sbjct: 311 DQGQLANVYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARK 370

Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           KEGAFY+W SKEV+DILGEHA LF+EHYY+K +GNCDLS MSDPHNEFK KNVLIE  + 
Sbjct: 371 KEGAFYIWASKEVQDILGEHAALFEEHYYIKQSGNCDLSGMSDPHNEFKEKNVLIERKEL 430

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           S  ASK GM +E Y  ILGECRRKLF+ RS+RP+PHLDDKVIVSWNGL +SSFARASKIL
Sbjct: 431 SELASKYGMSVETYQEILGECRRKLFEARSRRPKPHLDDKVIVSWNGLAVSSFARASKIL 490

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
           KSEAE   F FPVVG++ KEYM +AE AA FIR+ LYD +T RL HSFR  PSKAPGFLD
Sbjct: 491 KSEAEGTKFYFPVVGTEPKEYMRIAEKAAFFIRKELYDVETRRLYHSFRRSPSKAPGFLD 550

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFLISGLLDLYEFG G  WL+WAIELQ TQD LFLD+ GGGYFN TGEDPSVLLRVKE
Sbjct: 551 DYAFLISGLLDLYEFGGGISWLLWAIELQETQDSLFLDKAGGGYFNNTGEDPSVLLRVKE 610

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-------------------- 592
           DHDGAEPSGNSVS INL+RLAS+V+GSK++ Y++NAEH L                    
Sbjct: 611 DHDGAEPSGNSVSAINLIRLASMVSGSKAENYKRNAEHLLVCKLLVLFLLKAFSSHICAN 670

Query: 593 --------AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
                   AVFE RLKDMAMAVPLMCCAADML VPSRK VV+VG ++S +FENML AAHA
Sbjct: 671 NGGMGLFEAVFEKRLKDMAMAVPLMCCAADMLRVPSRKQVVVVGGRTSEEFENMLTAAHA 730

Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPI 704
            YD N+TVIHIDP++ EEM+FWE +NSN + MA+NN++ +KVVALVCQNF+CSPP+TD  
Sbjct: 731 LYDPNRTVIHIDPSNKEEMEFWEVNNSNVSLMAKNNYAVNKVVALVCQNFTCSPPLTDRS 790

Query: 705 SLENLLLEKPSS 716
           SLE LL +KPSS
Sbjct: 791 SLEALLSKKPSS 802


>gi|186511491|ref|NP_001118924.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332656889|gb|AEE82289.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 685

 Score = 1080 bits (2793), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 512/683 (74%), Positives = 588/683 (86%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           MEVESFEDE VAKLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLSVFLSPDLK
Sbjct: 1   MEVESFEDEEVAKLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLSVFLSPDLK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           PLMGGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+ALSAS  ++K
Sbjct: 61  PLMGGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKALSASTGADK 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
           L D + + A+  CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL+++GK+ EA 
Sbjct: 121 LSDGISREAVSTCAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKESGKTSEAD 180

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           E + MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD FS+
Sbjct: 181 EEKSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFSI 240

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           TKDV YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+WTS E+++
Sbjct: 241 TKDVMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYIWTSDEIDE 300

Query: 328 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 387
           +LGE+A LFKEHYY+K +GNCDLS  SDPHNEF GKNVLIE N++SA ASK  + +EKY 
Sbjct: 301 VLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKFSLSVEKYQ 360

Query: 388 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 447
            ILGECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES  + FPVV 
Sbjct: 361 EILGECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPESTKYYFPVVN 420

Query: 448 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 507
           S  ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLISGLLDLYE 
Sbjct: 421 SQPEDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLISGLLDLYEN 480

Query: 508 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 567
           G G +WL WAI+LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEPSGNSVS I
Sbjct: 481 GGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEPSGNSVSAI 540

Query: 568 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
           NLVRLASIVAG K++ Y   A   LAVFE RL+++A+AVPLMCC+ADM+SVPSRK VVLV
Sbjct: 541 NLVRLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVPSRKQVVLV 600

Query: 628 GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVV 687
           G KSS +  NML+AAH+ YD NKTVIHIDP+ ++E++FWEEHNSN A MA+ N +++KVV
Sbjct: 601 GSKSSPELTNMLSAAHSVYDPNKTVIHIDPSSSDEIEFWEEHNSNVAEMAKKNRNSEKVV 660

Query: 688 ALVCQNFSCSPPVTDPISLENLL 710
           ALVCQ+F+CSPPV D  SL  LL
Sbjct: 661 ALVCQHFTCSPPVFDSSSLTRLL 683


>gi|242059825|ref|XP_002459058.1| hypothetical protein SORBIDRAFT_03g045190 [Sorghum bicolor]
 gi|241931033|gb|EES04178.1| hypothetical protein SORBIDRAFT_03g045190 [Sorghum bicolor]
          Length = 821

 Score = 1004 bits (2596), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 488/691 (70%), Positives = 571/691 (82%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFE+E VAKLLNDWFVSIKVDREERPDVDKVYMTYV AL+GGGGWPLS
Sbjct: 121 STCHWCHVMEVESFENEEVAKLLNDWFVSIKVDREERPDVDKVYMTYVSALHGGGGWPLS 180

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVK+AW+ KR+ L +SG   IEQL +AL
Sbjct: 181 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKEAWETKREALERSGNLVIEQLRDAL 240

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S  ASS  +P++L   ++  C EQL+  YD +FGGFGSAPKFPRPVE  +MLY  +K  +
Sbjct: 241 STKASSQDVPNDLAAVSVDQCVEQLASRYDPKFGGFGSAPKFPRPVEDYIMLYKFRKHME 300

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            GK  EA   +KMV  TL CMA+GG+HDHVGGGFHRYSVDE WH+PHFEKMLYDQGQ+ N
Sbjct: 301 AGKESEALNIKKMVTHTLDCMARGGVHDHVGGGFHRYSVDECWHIPHFEKMLYDQGQIVN 360

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYLD F +T D +YS + RDILDYLRRDMIG  GEIFSAEDADSAE EGA RKKEGAFYV
Sbjct: 361 VYLDTFLITGDEYYSIVARDILDYLRRDMIGKEGEIFSAEDADSAEYEGAPRKKEGAFYV 420

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WTSKE+ED LGE+A LFK HYY+K +GNCDLS MSDPHNEF  KNVLIE   +S+ ASK 
Sbjct: 421 WTSKEIEDTLGENAELFKNHYYVKSSGNCDLSPMSDPHNEFSCKNVLIERKPASSMASKC 480

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G  L++Y  ILG+CR+KLF VRSKRPRPHLDDKVIVSWNGL IS+FARAS+ILKS     
Sbjct: 481 GKSLDEYSQILGDCRQKLFHVRSKRPRPHLDDKVIVSWNGLAISAFARASQILKSGPSGT 540

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
           +FNFPV G +  EY+EVAE+AA+FI+  LYD  + RL HS+RNGPSKAPGFLDDYAFLIS
Sbjct: 541 LFNFPVTGCNPVEYLEVAENAANFIKEKLYDASSKRLHHSYRNGPSKAPGFLDDYAFLIS 600

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLDLYEFG  T+WL+WA++LQ TQD+LFLD++GGGYFNT GEDPSVLLRVKED+DGAEP
Sbjct: 601 GLLDLYEFGGKTEWLLWAVQLQVTQDDLFLDKQGGGYFNTPGEDPSVLLRVKEDYDGAEP 660

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSV+ INL+RL+SI   SKS  Y+ + EH LAVFETRL+ +++A+PLMCCAADMLSVP
Sbjct: 661 SGNSVAAINLIRLSSIFDVSKSTGYKSSVEHLLAVFETRLRQLSIALPLMCCAADMLSVP 720

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VVLVG K S +F++M+AA  + YD N+TVI IDP +TEEM+FW+ +N++ A MAR+
Sbjct: 721 SRKQVVLVGQKGSEEFQDMVAATFSLYDPNRTVIQIDPRNTEEMEFWDCNNADIAQMARS 780

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +   +  VA VCQ+F CSPPVT P +L  LL
Sbjct: 781 SPLGEPAVAHVCQDFKCSPPVTSPGALRELL 811


>gi|357131648|ref|XP_003567448.1| PREDICTED: spermatogenesis-associated protein 20-like [Brachypodium
           distachyon]
          Length = 814

 Score =  994 bits (2570), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 481/691 (69%), Positives = 568/691 (82%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVMEVESFE+E VAK+LNDWFVSIKVDREERPDVDKVYMTYV ALYGGGGWPLS
Sbjct: 113 STCHWCHVMEVESFENEEVAKILNDWFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLS 172

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSP+LKPLMGGTYFPP+DKYGRPGFKT+LR+VK+AW+ KRD L Q+G   IEQL +AL
Sbjct: 173 VFLSPNLKPLMGGTYFPPDDKYGRPGFKTVLRRVKEAWETKRDALEQAGNVVIEQLRDAL 232

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SA A+S  +P+++    +  C E+L+ +YD +FGGFGSAPKFPRPVE  +MLY  +K  +
Sbjct: 233 SAKATSQDVPNDVAVVYVDTCVEKLASNYDPKFGGFGSAPKFPRPVEDCIMLYKFRKHME 292

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             +  E     KMV  TLQCMA+GG+HDHVGGGFHRYSVDE WHVPHFEKMLYDQGQ+AN
Sbjct: 293 ARRESEGQNILKMVTHTLQCMARGGVHDHVGGGFHRYSVDECWHVPHFEKMLYDQGQIAN 352

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYLD F +T D  YS + RDILDYLRRDMIG  GEIFSAEDADS+E EGA RKKEG+FYV
Sbjct: 353 VYLDTFLITGDECYSSVARDILDYLRRDMIGEEGEIFSAEDADSSEYEGAPRKKEGSFYV 412

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WTSKE+ED LGE A LFK HYY+K +GNCDLS MSDPHNEF GKNVLIE    S  ASK 
Sbjct: 413 WTSKEIEDTLGEDAELFKNHYYVKSSGNCDLSGMSDPHNEFSGKNVLIERKPGSLVASKS 472

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G  +++Y  ILG+CR+KLFDVRSKRPRPHLDDKVIVSWNGL IS+FARAS+ILKS +   
Sbjct: 473 GKSVDEYSQILGDCRQKLFDVRSKRPRPHLDDKVIVSWNGLAISAFARASQILKSGSIGT 532

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            F FPV G    EY++VAE AA+FI++ LYD  + RL HS+RNGP+KAPGFLDDYAFLI+
Sbjct: 533 RFYFPVTGCHPIEYLQVAEKAATFIKQKLYDASSKRLHHSYRNGPAKAPGFLDDYAFLIN 592

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLLD+YE+G  T+WL+WA++LQ  QD+LFLDR+GGGYFNT GEDPSVLLRVKED+DGAEP
Sbjct: 593 GLLDIYEYGGKTEWLLWAVQLQVIQDQLFLDRQGGGYFNTPGEDPSVLLRVKEDYDGAEP 652

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNS++ INL+RL+SI   +KS+ Y++N EH LAVFETRL+++ +A+PLMCCAADMLSVP
Sbjct: 653 SGNSMAAINLIRLSSIFDAAKSEGYKRNVEHLLAVFETRLRELGIALPLMCCAADMLSVP 712

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           SRK VVLVG K S +F++M+AA  +SYD N+TVI IDP +TEEM FWE +N+N A MAR+
Sbjct: 713 SRKQVVLVGDKGSTEFQDMVAATFSSYDPNRTVIQIDPRNTEEMGFWESNNANIAQMARS 772

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +     VVA VCQ+F CSPPVT P +L  LL
Sbjct: 773 SPPEKLVVAHVCQDFKCSPPVTSPGALRELL 803


>gi|222619828|gb|EEE55960.1| hypothetical protein OsJ_04681 [Oryza sativa Japonica Group]
          Length = 791

 Score =  958 bits (2476), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/710 (66%), Positives = 563/710 (79%), Gaps = 24/710 (3%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CHVMEVESFE++ +AK+LND FVSIKVDREERPDVDKVYMTYV ALYGGGGWPLSVFLSP
Sbjct: 69  CHVMEVESFENDEIAKILNDGFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLSVFLSP 128

Query: 85  DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 144
           +LKPLMGGTYFPP+DKYGR GFKTILRKVK+AW+ KRD L ++G   I+QL +ALSA AS
Sbjct: 129 NLKPLMGGTYFPPDDKYGRTGFKTILRKVKEAWETKRDALEKTGNVVIKQLRDALSAKAS 188

Query: 145 SNKLPDELPQNALRLCAE------------------------QLSKSYDSRFGGFGSAPK 180
           S  +P++L   ++  C E                        QL+ SYD +FGG+GSAPK
Sbjct: 189 SQDMPNDLAVVSVDNCVEKTRFKNRDKNNIRSSIADSQLISMQLAGSYDPKFGGYGSAPK 248

Query: 181 FPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
           FPRPVE  +MLY  +K  ++G+  E+    KM+  TLQCMA+GG+HDHVGGGFHRYSVDE
Sbjct: 249 FPRPVENCVMLYKFRKHLESGQVSESQNIMKMITHTLQCMARGGVHDHVGGGFHRYSVDE 308

Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAED 300
            WHVPHFEKMLYDQGQ+ANVYLD F +T D +YS + RDILDYLRRDMIG  GEI+SAED
Sbjct: 309 CWHVPHFEKMLYDQGQIANVYLDTFLITGDEYYSSVARDILDYLRRDMIGEEGEIYSAED 368

Query: 301 ADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEF 360
           ADSAE +GA RK+EGAFYVWT+KE+ED LGE++ LFK HYY+K +GNCDLSRMSDPH+EF
Sbjct: 369 ADSAEYDGAPRKREGAFYVWTNKEIEDTLGENSELFKNHYYVKSSGNCDLSRMSDPHDEF 428

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
           KGKNVLIE   +S  ASK G  +++Y  ILG+CR KLFDVRSKRPRPHLDDKVIVSWNGL
Sbjct: 429 KGKNVLIERKQASLMASKCGKSVDEYAQILGDCRHKLFDVRSKRPRPHLDDKVIVSWNGL 488

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
            IS+FARAS+ILKSE     F FP+ G + +EY+ VAE AA FI+  LYD  ++RL HS+
Sbjct: 489 AISAFARASQILKSEPTGTRFCFPITGCNPEEYLGVAEKAARFIKEKLYDSSSNRLNHSY 548

Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
           RNGP+KAPGFLDDYAFLI+GLLDLYE+G   +WL+WA  LQ  QDELFLD++GGGYFNT 
Sbjct: 549 RNGPAKAPGFLDDYAFLINGLLDLYEYGGKIEWLMWAAHLQVIQDELFLDKQGGGYFNTP 608

Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
           GEDPSVLLRVKED+DGAEPSGNSV+ INL+RL+SI   +KSD Y+ N EH LAVF+TRL+
Sbjct: 609 GEDPSVLLRVKEDYDGAEPSGNSVAAINLIRLSSIFDAAKSDGYKCNVEHLLAVFQTRLR 668

Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
           ++ +A+PLMCCAADMLSVPSRK VVLVG+K S +F +M+AAA ++YD N+TVI IDP +T
Sbjct: 669 ELGIALPLMCCAADMLSVPSRKQVVLVGNKESTEFRDMVAAAFSTYDPNRTVIQIDPRNT 728

Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           EEM FWE +N+  A MAR++      VA VCQ+F CSPPVT   +L  LL
Sbjct: 729 EEMGFWESNNAIIAQMARSSPPEKPAVAHVCQDFKCSPPVTSADALRVLL 778


>gi|218189686|gb|EEC72113.1| hypothetical protein OsI_05096 [Oryza sativa Indica Group]
          Length = 806

 Score =  947 bits (2448), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/725 (65%), Positives = 563/725 (77%), Gaps = 39/725 (5%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CHVMEVESFE++ +AK+LND FVSIKVDREERPDVDKVYMTYV ALYGGGGWPLSVFLSP
Sbjct: 69  CHVMEVESFENDEIAKILNDGFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLSVFLSP 128

Query: 85  DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 144
           +LKPLMGGTYFPP+DKYGRPGFKTILRKVK+AW+ K D L ++G   I+QL +ALSA AS
Sbjct: 129 NLKPLMGGTYFPPDDKYGRPGFKTILRKVKEAWETKCDALEKTGNVVIKQLRDALSAKAS 188

Query: 145 SNKLPDELPQNALRLCAE------------------------QLSKSYDSRFGGFGSAPK 180
           S  +P++L   ++  C E                        QL+ SYD +FGG+GSAPK
Sbjct: 189 SQDIPNDLAVVSVDNCVEKTRFKNRDKNNIRSSIADSQLISMQLAGSYDPKFGGYGSAPK 248

Query: 181 FPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
           FPRPVE  +MLY  +K  ++G+  E+    KM+  TLQCMA+GG+HDHVGGGFHRYSVDE
Sbjct: 249 FPRPVENCVMLYKFRKHLESGQVSESQNIMKMITHTLQCMARGGVHDHVGGGFHRYSVDE 308

Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAED 300
            WHVPHFEKMLYDQGQ+ANVYLD F +T D +YS + RDILDYLRRDMIG  GEI+SAED
Sbjct: 309 CWHVPHFEKMLYDQGQIANVYLDTFLITGDEYYSSVARDILDYLRRDMIGEEGEIYSAED 368

Query: 301 ADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEF 360
           ADSAE +GA RK+EGAFYVWT+KE+ED LGE++ LFK HYY+K +GNCDLSRMSDPH+EF
Sbjct: 369 ADSAEYDGAPRKREGAFYVWTNKEIEDTLGENSELFKNHYYVKSSGNCDLSRMSDPHDEF 428

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
           KGKNVLIE   +S  ASK G  +++Y  ILG+CR KLFDVRSKRPRPHLDDKVIVSWNGL
Sbjct: 429 KGKNVLIERKQASLMASKCGKSVDEYAQILGDCRHKLFDVRSKRPRPHLDDKVIVSWNGL 488

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSD---------------RKEYMEVAESAASFIR 465
            IS+FARAS+ILKSE     F FP+ G +                +EY+ VAE AA FI+
Sbjct: 489 AISAFARASQILKSEPTGTRFCFPITGCNFSLVKQSLGCACPYMPEEYLGVAEKAARFIK 548

Query: 466 RHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQD 525
             LYD  ++RL HS+RNGP+KAPGFLDDYAFLI+GLLDLYE+G   +WL+WA  LQ  QD
Sbjct: 549 EKLYDSSSNRLNHSYRNGPAKAPGFLDDYAFLINGLLDLYEYGGKIEWLMWAAHLQVIQD 608

Query: 526 ELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
           ELFLD++GGGYFNT GEDPSVLLRVKED+DGAEPSGNSV+ INL+RL+SI   +KSD Y+
Sbjct: 609 ELFLDKQGGGYFNTPGEDPSVLLRVKEDYDGAEPSGNSVAAINLIRLSSIFDAAKSDGYK 668

Query: 586 QNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHAS 645
            N EH LAVF+TRL+++ +A+PLMCCAADMLSVPSRK VVLVG+K S +F +M+AAA ++
Sbjct: 669 CNVEHLLAVFQTRLRELGIALPLMCCAADMLSVPSRKQVVLVGNKESTEFRDMVAAAFST 728

Query: 646 YDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPIS 705
           YD N+TVI IDP +TEEM FWE +N+  A MAR++      VA VCQ+F CSPPVT   +
Sbjct: 729 YDPNRTVIQIDPRNTEEMGFWESNNAIIAQMARSSPPEKPAVAHVCQDFKCSPPVTSADA 788

Query: 706 LENLL 710
           L  LL
Sbjct: 789 LRVLL 793


>gi|168008753|ref|XP_001757071.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691942|gb|EDQ78302.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 772

 Score =  891 bits (2303), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/714 (58%), Positives = 540/714 (75%), Gaps = 7/714 (0%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVMEVESFE+E +AKL N+WFV+IKVDREERPD
Sbjct: 42  GEEAFAKAREEDKPIFLSVGYSTCHWCHVMEVESFENEEIAKLQNEWFVNIKVDREERPD 101

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMTYVQA  GGGGWP+SVFL+P+LKP++GGTYFPP+DKYGRPGFKT+L++V++ W+
Sbjct: 102 VDKVYMTYVQASQGGGGWPMSVFLTPELKPIVGGTYFPPDDKYGRPGFKTVLKRVREVWE 161

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGS 177
            K+D+L +SG   ++QL+EA +A A S +L +  +P  A+ LCA QLSK +DS+ GGFG 
Sbjct: 162 SKKDVLRESGKQVVQQLAEATAAVAPSTELTESSVPAQAVTLCANQLSKGFDSKLGGFGG 221

Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
           APKFPRPVE+ +M+ + K+LE  GK   A++  +M LF+LQCMA GG+HDHVGGGFHRYS
Sbjct: 222 APKFPRPVEVALMMRNYKRLEQQGKEQYATKALEMALFSLQCMANGGMHDHVGGGFHRYS 281

Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
           VDE WHVPHFEKMLYD  QL NVYLDAF+++KD+ YSY+ RD+LDYL RDM  P G I+S
Sbjct: 282 VDEYWHVPHFEKMLYDNAQLVNVYLDAFAVSKDLTYSYVARDVLDYLIRDMTHPEGGIYS 341

Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDP 356
           AEDADSAET  +T+KKEG FY+WT +E+E++LG E A +F  +YY+K  GNCDLSRMSDP
Sbjct: 342 AEDADSAETTSSTKKKEGLFYIWTLQEIEEVLGKEQAQMFIAYYYVKAEGNCDLSRMSDP 401

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
           H EF GKNVLI+ ++    A+K G   E     LG+CR KL   RS+RP PHLDDKVIV+
Sbjct: 402 HGEFGGKNVLIKRSNVDI-ATKFGKMPEDVSQYLGQCRAKLHAYRSQRPHPHLDDKVIVA 460

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL IS+FARAS+IL +E     + FPV G   KEY+ VAE AA FI+  LY+E+T RL
Sbjct: 461 WNGLAISAFARASRILLNEPSGVRYEFPVTGCHPKEYLVVAERAAHFIKSKLYNEKTKRL 520

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
             S+RNGPSKAPGFLDDYAFLI+GLLDL+E G   KWL WA+ELQ++QDE FLD+EGG Y
Sbjct: 521 TRSYRNGPSKAPGFLDDYAFLIAGLLDLFECGGDYKWLQWALELQSSQDEQFLDKEGGAY 580

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           + T   DPS+L R+KED+DGAEPSGNSV+ INL+RL+S+V G  ++     AEH LAV+E
Sbjct: 581 YITPEGDPSILFRMKEDYDGAEPSGNSVAAINLLRLSSLVTGDLAESVHTTAEHLLAVYE 640

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
            R+K++AMAVPL+CCA D  SV +++ +++ G ++S D + ++ A HA +D ++ VI ID
Sbjct: 641 QRVKEVAMAVPLLCCAFDSFSVAAKRQIIIAGVRNSPDTDALMTACHAPFDPDRNVILID 700

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            ++ EE DFW+  NS   +MAR      + +A VCQNF+C  P  D ++LE LL
Sbjct: 701 ESNPEERDFWQSVNSTALAMARKAQDG-RALAYVCQNFTCQAPTGDHVALEQLL 753


>gi|4262148|gb|AAD14448.1| predicted protein of unknown function [Arabidopsis thaliana]
 gi|7270190|emb|CAB77805.1| predicted protein of unknown function [Arabidopsis thaliana]
          Length = 794

 Score =  872 bits (2253), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/660 (65%), Positives = 499/660 (75%), Gaps = 73/660 (11%)

Query: 51  VDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTIL 110
           VDREERPDVDK       ALYGGGGWPLSVFLSPDLKPLMGGTYFPP D YGRPGFKT+L
Sbjct: 206 VDREERPDVDK-------ALYGGGGWPLSVFLSPDLKPLMGGTYFPPNDNYGRPGFKTLL 258

Query: 111 RKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDS 170
           +KVKDAW+ KRD L +SG +AIE+LS+ALSAS  ++KL D + + AL+            
Sbjct: 259 KKVKDAWNSKRDTLVKSGTYAIEELSKALSASTGADKLSDGISREALK------------ 306

Query: 171 RFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
                                       ++GK+ EA E + MVLF+LQ MA GG+HDH+G
Sbjct: 307 ----------------------------ESGKTSEADEEKSMVLFSLQGMANGGMHDHIG 338

Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
           GGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD FS+TKDV YSY+ RDILDYLRRDMI 
Sbjct: 339 GGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFSITKDVMYSYVARDILDYLRRDMIA 398

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDL 350
           P G IFSAEDADS E EGA RKKEGAFY+WTS E++++LGE+A LFKEHYY+K +GNCDL
Sbjct: 399 PEGGIFSAEDADSFEFEGAKRKKEGAFYIWTSDEIDEVLGENADLFKEHYYVKKSGNCDL 458

Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
           S  SDPHNEF GKNVLIE N++SA ASK  + +EKY  ILGECRRKLFDVR KRP+PHLD
Sbjct: 459 SSRSDPHNEFAGKNVLIERNETSAMASKFSLSVEKYQEILGECRRKLFDVRLKRPKPHLD 518

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
           DK+IVSWNGLVISSFARASKILK+E ES  + FPVV S  ++Y+EVAE AA FIR +LYD
Sbjct: 519 DKIIVSWNGLVISSFARASKILKAEPESTKYYFPVVNSQPEDYIEVAEKAALFIRGNLYD 578

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
           EQ+ RLQHS+R GPSKAP FLDDYAFLISGLLDLYE G G +WL WAI+LQ TQ      
Sbjct: 579 EQSRRLQHSYRQGPSKAPAFLDDYAFLISGLLDLYENGGGIEWLKWAIKLQETQ------ 632

Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
                                +DHDGAEPSGNSVS INLVRLASIVAG K++ Y   A  
Sbjct: 633 --------------------AKDHDGAEPSGNSVSAINLVRLASIVAGEKAESYLNTAHR 672

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
            LAVFE RL+++A+AVPLMCC+ADM+SVPSRK VVLVG KSS +  NML+AAH+ YD NK
Sbjct: 673 LLAVFELRLRELAVAVPLMCCSADMISVPSRKQVVLVGSKSSPELTNMLSAAHSVYDPNK 732

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           TVIHIDP+ ++E++FWEEHNSN A MA+ N +++KVVALVCQ+F+CSPPV D  SL  LL
Sbjct: 733 TVIHIDPSSSDEIEFWEEHNSNVAEMAKKNRNSEKVVALVCQHFTCSPPVFDSSSLTRLL 792


>gi|302824870|ref|XP_002994074.1| hypothetical protein SELMODRAFT_163314 [Selaginella moellendorffii]
 gi|300138080|gb|EFJ04861.1| hypothetical protein SELMODRAFT_163314 [Selaginella moellendorffii]
          Length = 769

 Score =  868 bits (2243), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/721 (56%), Positives = 536/721 (74%), Gaps = 4/721 (0%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVMEVESFE E VAKLLNDWFVSIKVDREERPD
Sbjct: 47  GEEAFAKAKAEDKPIFLSVGYSTCHWCHVMEVESFESEEVAKLLNDWFVSIKVDREERPD 106

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YMT+VQA  GGGGWP+SVFL+P+LKP++GGTYFPPED YGRPGFKT+LR+VK+ WD
Sbjct: 107 VDKIYMTFVQASQGGGGWPMSVFLTPELKPIVGGTYFPPEDNYGRPGFKTVLRRVKENWD 166

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            ++ +L  +G   I+QL+EA++A A+S ++   + + A++LCA QL K +D++ GGFGSA
Sbjct: 167 SRKAVLRNAGDNVIQQLAEAMAACATSLQVSGGVAEQAVQLCASQLMKGFDAKLGGFGSA 226

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFPRPVE+ +ML + K+L+  GK+  + +  +M  F LQCMA+GG+HDHVGGGFHRYSV
Sbjct: 227 PKFPRPVELNLMLRYYKRLDQAGKASLSKKALEMASFNLQCMARGGMHDHVGGGFHRYSV 286

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+ WHVPHFEKMLYDQ QLAN YLD + +T+D  ++ + RDILDYL RDM  P G IFSA
Sbjct: 287 DDYWHVPHFEKMLYDQAQLANAYLDVYLVTRDTMHACVARDILDYLNRDMTHPEGGIFSA 346

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS E  G+++KKEGAFYVWT+KE+ED+LG + A +F  HYY++  GNC+LSRMSDPH
Sbjct: 347 EDADSLEPSGSSKKKEGAFYVWTAKEIEDVLGKDRAQIFAAHYYVREQGNCNLSRMSDPH 406

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
           NEF GKNVLIE    + + +K G  +E+  ++LG+CR  L   RSKRPRPHLDDKVIV+W
Sbjct: 407 NEFLGKNVLIERQSLADTVAKFGKTVEETADLLGQCRELLHAHRSKRPRPHLDDKVIVAW 466

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL IS+++RAS+ L++E E     FP +G D K+Y+ VAE  A F++  +Y+    RLQ
Sbjct: 467 NGLAISAYSRASRFLRAEPEGLKHYFPDMGCDPKDYLIVAERIAKFVKDKIYNASAKRLQ 526

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
            S+R  PS+APGFLDDYAFLI+GLLDLYE    TKWL W  ELQ  QD LFLD+EGGGYF
Sbjct: 527 RSYRKSPSQAPGFLDDYAFLIAGLLDLYEASGDTKWLAWVFELQEVQDHLFLDKEGGGYF 586

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
           +T   D S+L R+KED+DGAEPSGNSV+ INL+RLASI  G +   + + A+H LAVFE 
Sbjct: 587 STAEGDSSILFRMKEDYDGAEPSGNSVAAINLLRLASICHGEEGKLFLERAQHLLAVFEG 646

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
           ++K++AMAVPLMCCA D+L+VPS++ +++ G K+S +F+ ++  +H  +D + T+I IDP
Sbjct: 647 KVKELAMAVPLMCCAYDVLAVPSKRQILVAGAKTSGEFDALVTTSHLFFDPDSTIIQIDP 706

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
               +++FW+  N    +MA+      K VA VCQ+F C  PV+D  +LE LL +  S  
Sbjct: 707 ELPSDVEFWQAKNPMLLAMAQGKAPKSKAVAFVCQDFKCYAPVSDAAALERLLNKNKSKV 766

Query: 718 A 718
           A
Sbjct: 767 A 767


>gi|326515716|dbj|BAK07104.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 532

 Score =  722 bits (1864), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/521 (68%), Positives = 419/521 (80%)

Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
           MLY  +K  + G+  EA    KMV  TLQCMA+GG+HDHVGGGFHRYSVDE WHVPHFEK
Sbjct: 1   MLYKFRKHMEAGQKSEAENIMKMVTHTLQCMARGGVHDHVGGGFHRYSVDECWHVPHFEK 60

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
           MLYDQGQ+AN YLD + +T D +YS + RDILDYLRRDMIG  GEIFSAEDADSAE EG 
Sbjct: 61  MLYDQGQIANAYLDTYVITGDEYYSSVARDILDYLRRDMIGEDGEIFSAEDADSAEYEGD 120

Query: 310 TRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            RKKEG+FYVWTS+E+ED LGE+A LFK HYY+K +GNCDLS MSDPHNEF GKNVLIE 
Sbjct: 121 ARKKEGSFYVWTSQEIEDTLGENAELFKNHYYVKSSGNCDLSGMSDPHNEFSGKNVLIER 180

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              S  ASK G  +++Y  ILGECR+KLFDVRSKRPRPHLDDKVIVSWNGL IS+FARAS
Sbjct: 181 KPGSLMASKYGKSVDEYYGILGECRQKLFDVRSKRPRPHLDDKVIVSWNGLAISAFARAS 240

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
           +ILKS      F FPV G D  EY++VAE AA+FI+  LYD  + RL HS+RNGP+KAPG
Sbjct: 241 QILKSGPPGTKFYFPVTGCDPVEYLQVAEKAANFIKEKLYDAGSKRLHHSYRNGPAKAPG 300

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           FLDDYAFLI+GLLDL+E+G   +WL+WAIELQ  QDELFLD++GGGYFNT GEDPSVLLR
Sbjct: 301 FLDDYAFLINGLLDLFEYGGKMEWLLWAIELQVIQDELFLDKQGGGYFNTPGEDPSVLLR 360

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
           VKED+DGAEPSGNS++ IN+VRL+SI+  +KS+ Y++N EH LAVFETRLK++ +A+PLM
Sbjct: 361 VKEDYDGAEPSGNSMAAINMVRLSSILDAAKSEGYKRNVEHLLAVFETRLKELGIALPLM 420

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
           CCAADML+VPSRK VVLVG K+S +F++M+ AA  SYD N+TVI ID +  EEM FWE +
Sbjct: 421 CCAADMLTVPSRKQVVLVGDKASPEFQDMVVAAFLSYDPNRTVIQIDASKMEEMAFWESN 480

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           N+N A MAR++ S    VA VCQ F CSPPVT P +L  LL
Sbjct: 481 NANIAQMARSSPSGKPAVAHVCQEFKCSPPVTSPGALRELL 521


>gi|384252567|gb|EIE26043.1| hypothetical protein COCSUDRAFT_52662 [Coccomyxa subellipsoidea
           C-169]
          Length = 796

 Score =  688 bits (1776), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/713 (49%), Positives = 465/713 (65%), Gaps = 18/713 (2%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE E +AKL+ND FV+IKVD+EER DVD+VYMTYVQA  GGGGWP+SV
Sbjct: 72  TCHWCHVMERESFESEAIAKLMNDSFVNIKVDKEERSDVDRVYMTYVQATSGGGGWPMSV 131

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PDL+P +GGTY+PP+D YGRPGF T+L+++ D W  +++ + +  A  + QL+EA+ 
Sbjct: 132 FLTPDLQPFLGGTYYPPQDAYGRPGFSTVLKRIADVWRSRKNEVIEQSADTMRQLNEAIQ 191

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY-HSKKLED 199
                 +LP+      +  C   L+  +D   GGFG+APKFPRP EI ++L  H +  +D
Sbjct: 192 PQGGKAELPEGAAGRFIESCYSMLASRFDPTLGGFGAAPKFPRPAEINLLLVEHLRASQD 251

Query: 200 -------TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
                     SG   +   M   TLQ MA GG++DHVGGGFHRYSVDE WHVPHFEKMLY
Sbjct: 252 REASSATASSSGRRRDALGMAETTLQRMAAGGMYDHVGGGFHRYSVDEHWHVPHFEKMLY 311

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D GQLA  YLDA+  T DV Y+ + R ILDYL RDM  P G  +SAEDADS +  G  +K
Sbjct: 312 DNGQLAQTYLDAYRATGDVRYARVARGILDYLHRDMTHPEGGFYSAEDADSLDASG--KK 369

Query: 313 KEGAFYVWTSKEVEDILG---EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            EGAFYVW++ E++++LG   E   +FK+HYY+K +GN DLS  SD H EF G N LIE 
Sbjct: 370 SEGAFYVWSADEIDEVLGTDSERGRVFKQHYYVKASGNTDLSPRSDQHGEFTGLNCLIER 429

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
               A+A+K G+ +E+    L + R+ L + RS+RPRPHLDDKV+ +WNGL I +FA AS
Sbjct: 430 ESVKATATKFGLSVEETEGTLAKARQLLHERRSQRPRPHLDDKVVTAWNGLAIGAFANAS 489

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
           ++L +E +     FPV G   K+Y+  A  AA F+R  ++D    RL+ SF  GPS   G
Sbjct: 490 RVLANEPQPPTPLFPVEGRPAKDYLTDAIRAAEFVRDKVWDADARRLRRSFCRGPSDVGG 549

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           F DDYAFL+SGLLDL+      +WL +A++LQ  QDELF D   GGYF+TTGEDPS+LLR
Sbjct: 550 FADDYAFLVSGLLDLHAASGDAQWLQFALQLQAAQDELFWDDAAGGYFSTTGEDPSILLR 609

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
           +KED+DGAEP+ +S++  NL+RLA++     S+  R  A  + A F  RL +M++A+P M
Sbjct: 610 MKEDYDGAEPAPSSIAAANLLRLAALTDPDASEPLRARASAAAAAFRERLAEMSLAMPQM 669

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
           CCA  +L     + V++ G   + D E +L AA A +  +K VI IDP+D   ++FW  H
Sbjct: 670 CCALHLLDSGHLRQVIIAGRLGAADTEALLDAAQAIFAPDKAVIFIDPSDEASVEFWRGH 729

Query: 670 NSNNASMARN-NFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE---KPSST 717
           N    +M       AD    A VCQNF+C  P TDP  L+  L E    PS+T
Sbjct: 730 NPQALAMVEGAGLQADSSATAFVCQNFTCKAPTTDPQKLKAALGEARSAPSTT 782


>gi|302838582|ref|XP_002950849.1| hypothetical protein VOLCADRAFT_81232 [Volvox carteri f.
           nagariensis]
 gi|300263966|gb|EFJ48164.1| hypothetical protein VOLCADRAFT_81232 [Volvox carteri f.
           nagariensis]
          Length = 890

 Score =  611 bits (1575), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 322/742 (43%), Positives = 433/742 (58%), Gaps = 55/742 (7%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE E VA+LLN  F+SIKVDREERPDVD+VYMTYVQA+ G GGWP+SV
Sbjct: 76  TCHWCHVMERESFESEEVAELLNRDFISIKVDREERPDVDRVYMTYVQAVSGSGGWPMSV 135

Query: 81  FLSPDLKPLMGGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
           +L+P L+P  GGTY+PP+D++       PGF T+L ++   W   R  L      A    
Sbjct: 136 WLTPSLEPFYGGTYYPPKDRFVGGQLALPGFSTVLLRIGSLWRTNRQDLKSKVEAAAAPA 195

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
               +A+ +   LP  L   A+  C   L++ YD+ +GGFG APKFPRP EI ++L  + 
Sbjct: 196 GPTEAAANAGAALPPSLAAAAVDACGHDLARRYDAEYGGFGGAPKFPRPSEINLLLRAAV 255

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           +  + G    A   + M L +L  MA GG++D +GGGFHRYSVDE WHVPHFEKMLYD  
Sbjct: 256 RQMEQGDQLAAQRRRSMALHSLTAMASGGMYDQLGGGFHRYSVDELWHVPHFEKMLYDNP 315

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE---------- 305
           QLA  YL AF LT D  Y+ + R +LDYL RDM  PGG ++SAEDADS +          
Sbjct: 316 QLALSYLAAFQLTADKQYALVARGVLDYLLRDMTSPGGGLYSAEDADSEDPHSYMTSTTT 375

Query: 306 --------TEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDP 356
                    E  + +KEGAFY+W   EV  +LG E    F   Y +   GNC+ S  SDP
Sbjct: 376 AAAAAPAAMEAGSERKEGAFYIWDHSEVVSVLGPELGPFFCLVYGIDEEGNCNRSSRSDP 435

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPL----EKYLNILGECRRKLFDVRSKRPRPHLDDK 412
           H EF+GKNV       + +A++LG+P      +    L   R  L   R+ RPRP LDDK
Sbjct: 436 HGEFEGKNVPYIATQPAVAAARLGLPYGDDAAEAARRLSAAREALHAARASRPRPSLDDK 495

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
           ++ +WNG+ I +FA AS++L SE +     FP  G     Y++ A   A+F+R HL+D  
Sbjct: 496 IVTAWNGMGIGAFAVASRVLASEQQVERL-FPSEGRAPAAYLDAAVRVAAFVREHLWDPA 554

Query: 473 ----THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
                 RL+ S+  GPS   GF DDY+ L+SGLLDLYE G G +WL WA++LQ  QD+LF
Sbjct: 555 AGGGVGRLRRSYCKGPSAVAGFADDYSALVSGLLDLYECGGGREWLEWALQLQAVQDQLF 614

Query: 529 LDREGGGYFNT-----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV------- 576
            D + GGYF+T        DPS+ +R+K+D+DGAEP+ +SV+  NL+RLA ++       
Sbjct: 615 WDPQSGGYFSTPDPASADADPSIRIRIKDDYDGAEPTASSVAASNLLRLADMIQERPLYD 674

Query: 577 --AGSKSDY---YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 631
             A + + +   Y + A  +LA F  R+    +AVP MCCAA   S    + V++ G   
Sbjct: 675 TTASTTTGHAMPYDEAARRTLAAFSARITQAPLAVPQMCCAAHTFSKRPLRQVIVAGTAG 734

Query: 632 SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVC 691
           + D   +L A H+ Y  +K V+ +DP+D  +M FW +HN     M          V  +C
Sbjct: 735 ATDTGALLDAVHSPYCPDKVVLVMDPSDPRDMAFWRKHNPPAYDMV-----TQPAVVFIC 789

Query: 692 QNFSCSPPVTDPISLENLLLEK 713
           QNF+C  P TDP  +  LL ++
Sbjct: 790 QNFTCQAPTTDPARVRQLLAQR 811


>gi|260801315|ref|XP_002595541.1| hypothetical protein BRAFLDRAFT_56926 [Branchiostoma floridae]
 gi|229280788|gb|EEN51553.1| hypothetical protein BRAFLDRAFT_56926 [Branchiostoma floridae]
          Length = 741

 Score =  604 bits (1558), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 328/731 (44%), Positives = 438/731 (59%), Gaps = 56/731 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVME ESFE E V K++N+ FV++KVDREERPD
Sbjct: 43  GEDAFKKAKKENKPIFLSVGYSTCHWCHVMERESFESEEVGKIMNEHFVNVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM+++QA  GGGGWP+SV+L+PDLKP+ GGTYFPP+D  GRPGF TIL ++ + W 
Sbjct: 103 VDKVYMSFIQATSGGGGWPMSVWLTPDLKPIAGGTYFPPKDHMGRPGFSTILTRISEQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSE-ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
             +D L Q G   I+ L E ++SA  S+  LP    Q +++ C +QL  SYD  FGGFG 
Sbjct: 163 NNKDKLIQQGNMVIDALKELSVSAVDSTATLPG---QESVKKCLDQLDNSYDEEFGGFGH 219

Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
           APKFP+PV    +      ++ T    EA     M L TL+ MAKGG++DH+G GFHRYS
Sbjct: 220 APKFPQPVNFNFLFRVWSSMKGT---PEAQRALDMALETLRFMAKGGMYDHIGQGFHRYS 276

Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
            D  WHVPHFEKMLYDQGQLA  Y DA+ +TKD  ++ I RDIL Y+ RD+    G  +S
Sbjct: 277 TDRTWHVPHFEKMLYDQGQLAVAYCDAYQITKDPIFADIARDILLYVSRDLSDRQGGFYS 336

Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNC 348
           AEDADS    G   KKEGAF VW + E+ ++LGE          A LF +HY +  +GN 
Sbjct: 337 AEDADSLPNPGHKTKKEGAFCVWEADEIRNLLGEKLPHYDDMTFADLFAKHYNINRSGNV 396

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
              +  DPH E  GKNVLI       +A   G+   +   +LG+CR  LF VR KRP PH
Sbjct: 397 AFDQ--DPHGELAGKNVLIVRGSVENTAKAFGLEAAQVEEVLGKCRDILFKVRRKRPPPH 454

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
            DDK+I +WNGL+IS FARA+++L  EA               +Y++ A  AA F+R+ +
Sbjct: 455 RDDKMITAWNGLMISGFARAAQVL-GEA---------------QYLDRAVKAAKFVRKKM 498

Query: 469 YDEQTHRLQHSFRNGP---------SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 519
           YD+ T +L  S  + P         +   GF DDYAFLI GLLDLYE     +W+ WA +
Sbjct: 499 YDDSTGKLLRSCYHDPEMDRVTQIANPIDGFADDYAFLIRGLLDLYEASYNEEWVEWAAQ 558

Query: 520 LQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
           LQ  QDELF D EG  YF  +G DPSVL+R+KED DGAEPS NSVS  NL+RLAS     
Sbjct: 559 LQRKQDELFWDSEGLAYFTVSGADPSVLIRMKEDQDGAEPSANSVSAGNLLRLASF---H 615

Query: 580 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 639
             + +R  +   +  F  RL  + +A+P M  A  +    + K +++ G+    D + +L
Sbjct: 616 DDEGWRNKSVQLMTAFGARLAAIPLALPEMVSAL-IFYQQTPKQIIIAGNPRDRDTKALL 674

Query: 640 AAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPP 699
              H+S++ NK +I    AD +E  +  E     +++ + +    K  A VC+N++CS P
Sbjct: 675 QCVHSSFNPNKILI---IADGKEHGYLYEKLKVLSTLKKVD---GKATAYVCENYACSLP 728

Query: 700 VTDPISLENLL 710
           V   + L+ LL
Sbjct: 729 VNTVLELDELL 739


>gi|390355802|ref|XP_003728630.1| PREDICTED: spermatogenesis-associated protein 20
           [Strongylocentrotus purpuratus]
          Length = 671

 Score =  578 bits (1489), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 316/702 (45%), Positives = 421/702 (59%), Gaps = 52/702 (7%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+  + KL+N+ +VSIKVDREERPDVD+VYMT++QA  GGGGWP+SV+L+PDLK
Sbjct: 1   MERESFENVDIGKLMNEHYVSIKVDREERPDVDRVYMTFIQATAGGGGWPMSVWLTPDLK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           PLMGGTYFPP D++GRPGF TIL+ +   W + R+ L Q     IE L  A+   ++S+ 
Sbjct: 61  PLMGGTYFPPHDRFGRPGFPTILQSIARQWGENREALEQQSTKIIEALQAAVKVKSTSD- 119

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGE 205
            P  L    +  C +QL+ S+D+++GGFG APKFP+PV    +  LY S      G+S  
Sbjct: 120 -PSPLGTEVMEKCFKQLTDSFDNQYGGFGGAPKFPQPVNFNFLFRLYSSPP----GESEI 174

Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
              G KM L TL+ MAKGGIHDHV  GFHRYS D  WHVPHFEKMLYDQGQLA  YLDA+
Sbjct: 175 GERGLKMCLHTLKMMAKGGIHDHVSQGFHRYSTDRFWHVPHFEKMLYDQGQLAVAYLDAY 234

Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
            +TK+  ++ + RDIL+Y+ RD+    G  +SAEDADS      T KKEGAF VWT  EV
Sbjct: 235 QITKEAVFADVARDILEYVGRDLSDKAGGFYSAEDADSLPAADETHKKEGAFCVWTDTEV 294

Query: 326 EDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
              L +          A +F +HY +K  GN D  +  DPH E K +NVLI      ++A
Sbjct: 295 RTHLSDMVEGSDSVTLADVFCKHYDIKTGGNVDFEQ--DPHGELKDQNVLIARGSVDSTA 352

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           S LG+        L   RR L +VR +RPRPHLDDK++ +WNGL+IS F+RA ++L++  
Sbjct: 353 SMLGLTEGTVEAALETARRTLHEVRLERPRPHLDDKMLTAWNGLMISGFSRAGQVLQA-- 410

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH-RLQHSFRNG-------PSKAP 488
                          E+ + AE A +FIR+HLYD  T   L+ ++RN        P    
Sbjct: 411 --------------PEFTQRAEQAVTFIRQHLYDPSTGCLLRSAYRNKEGDIAQIPIPIQ 456

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           GF+DDY FLI GLLDLYE     +W+ WA +LQ   DEL  D E GGYF+TT +D S+LL
Sbjct: 457 GFVDDYCFLIRGLLDLYEANYDEQWIEWASQLQEKLDELLWDTENGGYFSTTDKDSSILL 516

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R+KED DGAEPS NSV+ +NL+RL+  +  ++ D Y++ A    +VF  RL+ + +A+P 
Sbjct: 517 RLKEDQDGAEPSANSVACMNLLRLSHYL--NRPD-YQEKASKLFSVFGERLQKIPIALPE 573

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           M  A  +    + K +++ G   + D   +L   H  Y  NK +I  D   T    F   
Sbjct: 574 MASAL-LFQESTAKQIIICGDPQAEDTRLLLQCVHTHYLPNKVLILTDEGQTS--GFLSS 630

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 ++ R +    K  A VC+N+ C  PV     L +LL
Sbjct: 631 RLDILKTLQRID---GKATAYVCENYQCQLPVNSVDDLSDLL 669


>gi|270011341|gb|EFA07789.1| hypothetical protein TcasGA2_TC005347 [Tribolium castaneum]
          Length = 804

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 316/728 (43%), Positives = 426/728 (58%), Gaps = 52/728 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCHVME ESFEDE VAK++N  F+++KVDREERPD
Sbjct: 100 GQEAFDRAKKENKLIFLSVGYSTCHWCHVMEKESFEDEEVAKIMNQHFINVKVDREERPD 159

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YM ++QA  GGGGWP+SVFL+P L+PL GGTYFPPEDKYGRPGFKT+L+ + + W 
Sbjct: 160 VDKLYMAFIQASVGGGGWPMSVFLTPTLEPLAGGTYFPPEDKYGRPGFKTVLKSIAEQWR 219

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            K+  +A SG +++E L +      S+ +  +   ++  + C  QLS SY+  FGGF + 
Sbjct: 220 TKQSAIANSGKYSLEVLRKVSEREISAKQDINVPGEDVWKKCLLQLSHSYEDDFGGFSAQ 279

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP+P  +  + +   +      S +      M L TL+ MA GGIHDHV  GF RYSV
Sbjct: 280 PKFPQPCNLNFLFHMYSR---DKHSEQGFRCLHMCLNTLRKMAYGGIHDHVNCGFARYSV 336

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+RWHVPHFEKMLYDQ QLA  Y DAF +TKD F++ + RDIL Y+ RD+  P G  + A
Sbjct: 337 DDRWHVPHFEKMLYDQAQLAVSYADAFVVTKDDFFAEVLRDILLYVSRDLSHPLGGFYGA 396

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-------HAILFKEHYYLKPTGNCDLS 351
           EDADS   EGA+ K+EGAF VW  +E+  +LGE       H  LF  HY +K  GN + +
Sbjct: 397 EDADSYPYEGASHKREGAFCVWEFEEISKLLGETKTDDISHRDLFIYHYNVKEDGNVNPA 456

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
           +  DPH+E + KN+L+       ++ K    +E    IL  C   L+  R KRP+PH+D 
Sbjct: 457 Q--DPHHELEKKNILVCFGSFEDTSRKFKTSVETVKEILKSCHEILYKERQKRPKPHVDT 514

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL+IS FA+A  +LK +                EY+  A  AA+FI++ LY+E
Sbjct: 515 KIVTSWNGLMISGFAKAGFVLKDQ----------------EYINRAILAATFIKKFLYNE 558

Query: 472 QTHRLQHSFRNG--------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
           Q   L      G        P+   GFLDDYAFLI GLLDLYE      WL WA  LQ  
Sbjct: 559 QDKTLLRCCYKGDNAKIVQTPTPVNGFLDDYAFLIRGLLDLYEASLDADWLSWAEVLQEQ 618

Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
           QD LF D +G GYF +   D S+L+R KED DGAEP GNS++V NL+RLA+ +   ++D 
Sbjct: 619 QDRLFWDTKGSGYFTSPANDSSILIRGKEDQDGAEPCGNSIAVHNLIRLAAYL--DRAD- 675

Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
            R  A  +L VF  RLK + +A+P M  A  +    S   V + G     + + ++    
Sbjct: 676 LRAKAGRTLTVFADRLKSIPVALPEMTSAL-LFYHNSPTQVFIAGPTEDNNTQALIDVVR 734

Query: 644 ASYDLNKTVIHID-PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           + +   + +   D P        +  H     S+AR      K  A VC+NF+CS PVT+
Sbjct: 735 SRFIPGRILAVTDGPGGL----LYRRHE----SLARLRPIQGKPAAYVCRNFACSLPVTE 786

Query: 703 PISLENLL 710
           P  L + L
Sbjct: 787 PEELASNL 794


>gi|348502030|ref|XP_003438572.1| PREDICTED: spermatogenesis-associated protein 20 [Oreochromis
           niloticus]
          Length = 748

 Score =  570 bits (1470), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 311/717 (43%), Positives = 428/717 (59%), Gaps = 52/717 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE + K+L++ FV IK+DREERPDVDKVYMT+VQA  GGGGWP+S
Sbjct: 62  STCHWCHVMERESFEDEEIGKILSENFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMS 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+L+P +GGTYFPP D+ GRPGFKT+L ++ D W   R  L  SG   IE L +  
Sbjct: 122 VWLTPELRPFIGGTYFPPRDRGGRPGFKTVLTRIIDQWQNNRPALESSGERIIEALKKGT 181

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           + +A++ + P   P  A R C +QL+ S++  +GGF  APKFP PV +  ++ +      
Sbjct: 182 TITANAGQSPPLAPDVANR-CFQQLAHSFEEEYGGFRDAPKFPSPVNLMFLISYWTVNRS 240

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T    E  E  +M L TL+ MA GGIHDH+  GFHRYS D  WHVPHFEKMLYDQ QLA 
Sbjct: 241 T---SEGVEALQMALHTLRMMALGGIHDHIAQGFHRYSTDSSWHVPHFEKMLYDQAQLAV 297

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y+ A  ++ + F++ + +D+L Y+ RD+    G  +SAEDADS    G   K+EGAF V
Sbjct: 298 AYITASQVSGEQFFAEVAKDVLLYVSRDLSDKSGGFYSAEDADSVPALGGPEKREGAFCV 357

Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           WT+ EV ++L             A +F  HY +K  GN  ++   DPH E +G+NVLI  
Sbjct: 358 WTASEVRELLPDVVEGAAGNATLADIFMHHYGVKEQGN--VAPEQDPHGELQGQNVLIVR 415

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                +A++ G+ +EK   +L   R K+ +VR  RPRPHLD K++ SWNGL++S++AR  
Sbjct: 416 YSVELTAARFGITVEKVNELLASARAKMAEVRKSRPRPHLDTKMLASWNGLMLSAYARVG 475

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------ 483
            +L                  K+ +E A  A  F++ HL+D +   +  S   G      
Sbjct: 476 AVLGD----------------KDLVERAVKAGGFLKEHLWDAKRQTILRSCYRGDQMEVQ 519

Query: 484 ---PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
              PS + GFLDDYAF+I GLLDLYE    T+WL WA ELQ  QD LF D +GGGYF + 
Sbjct: 520 QISPSIS-GFLDDYAFIICGLLDLYEATLQTEWLQWAEELQLRQDVLFWDDQGGGYFCSD 578

Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
             D +VLL++KED DGAEPS NSVS  NL+RL+      +   + Q ++  L  F  RL 
Sbjct: 579 PTDSTVLLQLKEDQDGAEPSANSVSAFNLLRLSHYTGRQE---WLQKSQQLLTAFSDRLT 635

Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
            + +A+P M  A  M    + K +V+ G + + D  ++LAA ++ + L   V+ +   +T
Sbjct: 636 TVPIALPEMVRAL-MAQHYTLKQIVICGQRDAPDTTSLLAAVNSLF-LPYKVLMLADGNT 693

Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
           E   F  +     +SM++    A    A VCQ+F+CS PVTDP  L  LLL+  + T
Sbjct: 694 E--SFLCQRLPVLSSMSQLRGVA---TAYVCQDFTCSLPVTDPQELRRLLLDGTTDT 745


>gi|363740931|ref|XP_420103.3| PREDICTED: spermatogenesis-associated protein 20 [Gallus gallus]
          Length = 737

 Score =  570 bits (1469), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 316/730 (43%), Positives = 430/730 (58%), Gaps = 53/730 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    +  +  FL    +TCHWCHVME ESF+++ + ++++  FV IKVDREERPD
Sbjct: 38  GQEAFDKAKRENKLIFLSVGYSTCHWCHVMEEESFKNQEIGEIMSKNFVCIKVDREERPD 97

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA  GGGGWP+SV+L+PDL+P +GGTYFPPED     GF+T+L ++ + W 
Sbjct: 98  VDKVYMTFVQATSGGGGWPMSVWLTPDLRPFVGGTYFPPEDSAHHVGFRTVLLRIAEQWR 157

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           + ++ L QS    +E L  +LS   + ++      Q  L  C +QLS SYD  +GGF   
Sbjct: 158 QNQEALLQSSQRILEAL-RSLSRVGTQDQQAAPPAQEVLTTCFQQLSGSYDEEYGGFSQC 216

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP PV +  +  +      T    E +   +M L TL+ MA GGIHDH+G GFHRYS 
Sbjct: 217 PKFPTPVNLNFLFTYWALHRTT---PEGARALQMSLHTLKMMAHGGIHDHIGQGFHRYST 273

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  WHVPHFEKMLYDQGQLA VY  AF ++ D F++ +  DIL Y  RD+  P G  +SA
Sbjct: 274 DRHWHVPHFEKMLYDQGQLAVVYSRAFQISGDEFFADVAADILLYASRDLGSPAGGFYSA 333

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDIL-------GEHAIL---FKEHYYLKPTGNC 348
           EDADS  T  ++ K+EGAF VW ++EV  +L        E   L   F  HY +K  GN 
Sbjct: 334 EDADSYPTATSSEKREGAFCVWAAEEVRALLPDPVEGAAEGTTLGDVFMHHYGVKEDGN- 392

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
            +S   DPH E +GKNVLI  +    +A+  G+   +   +L E RR+L   R++RPRPH
Sbjct: 393 -VSPRKDPHKELQGKNVLIAHSSPELTAAHFGLEPGQLSAVLQEGRRRLQAARAQRPRPH 451

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
           LD K++ SWNGL+IS FA+A  +L                 ++EY+  A  AA F+RRHL
Sbjct: 452 LDTKMLASWNGLMISGFAQAGAVLA----------------KQEYVSRAAQAAGFVRRHL 495

Query: 469 YDEQTHRLQHSFRNG------PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 520
           ++  + RL  S   G       S AP  GFL+DY F+I GL DLYE      WL WA++L
Sbjct: 496 WEPGSGRLLRSCYRGEADVVEQSAAPIHGFLEDYVFVIQGLFDLYEASLDQSWLEWALQL 555

Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
           Q+TQD+LF D +G  YF++   DPS+LLR+K+D DGAEP+ NSV+V NL+R AS    S 
Sbjct: 556 QHTQDKLFWDPKGFAYFSSEAGDPSLLLRLKDDQDGAEPAANSVTVTNLLRAASY---SG 612

Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 640
              + + A   LA F  RL+ + +A+P M  A  +    + K VV+ G     D + ML+
Sbjct: 613 HMEWVEKAGQILAAFSERLQKIPLALPEMARATAVFH-HTLKQVVICGDPQGEDTKEMLS 671

Query: 641 AAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
             H+++  NK +I    AD +   F        +S+ R      K  A VC NF+CS PV
Sbjct: 672 CVHSTFIPNKVLIL---ADGDGAGFLYRQLPFLSSLERKE---GKATAYVCSNFTCSLPV 725

Query: 701 TDPISLENLL 710
           T P +L+ LL
Sbjct: 726 TSPRALQELL 735


>gi|189240570|ref|XP_973977.2| PREDICTED: similar to predicted protein [Tribolium castaneum]
          Length = 754

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 316/746 (42%), Positives = 427/746 (57%), Gaps = 70/746 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCHVME ESFEDE VAK++N  F+++KVDREERPD
Sbjct: 32  GQEAFDRAKKENKLIFLSVGYSTCHWCHVMEKESFEDEEVAKIMNQHFINVKVDREERPD 91

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YM ++QA  GGGGWP+SVFL+P L+PL GGTYFPPEDKYGRPGFKT+L+ + + W 
Sbjct: 92  VDKLYMAFIQASVGGGGWPMSVFLTPTLEPLAGGTYFPPEDKYGRPGFKTVLKSIAEQWR 151

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            K+  +A SG +++E L +      S+ +  +   ++  + C  QLS SY+  FGGF + 
Sbjct: 152 TKQSAIANSGKYSLEVLRKVSEREISAKQDINVPGEDVWKKCLLQLSHSYEDDFGGFSAQ 211

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP+P  +  + +   +      S +      M L TL+ MA GGIHDHV  GF RYSV
Sbjct: 212 PKFPQPCNLNFLFHMYSR---DKHSEQGFRCLHMCLNTLRKMAYGGIHDHVNCGFARYSV 268

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+RWHVPHFEKMLYDQ QLA  Y DAF +TKD F++ + RDIL Y+ RD+  P G  + A
Sbjct: 269 DDRWHVPHFEKMLYDQAQLAVSYADAFVVTKDDFFAEVLRDILLYVSRDLSHPLGGFYGA 328

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-------HAILFKEHYYLKPTGNCDLS 351
           EDADS   EGA+ K+EGAF VW  +E+  +LGE       H  LF  HY +K  GN + +
Sbjct: 329 EDADSYPYEGASHKREGAFCVWEFEEISKLLGETKTDDISHRDLFIYHYNVKEDGNVNPA 388

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
           +  DPH+E + KN+L+       ++ K    +E    IL  C   L+  R KRP+PH+D 
Sbjct: 389 Q--DPHHELEKKNILVCFGSFEDTSRKFKTSVETVKEILKSCHEILYKERQKRPKPHVDT 446

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL+IS FA+A  +LK +                EY+  A  AA+FI++ LY+E
Sbjct: 447 KIVTSWNGLMISGFAKAGFVLKDQ----------------EYINRAILAATFIKKFLYNE 490

Query: 472 QTHRL--------------------------QHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
           Q   L                           +S    P+   GFLDDYAFLI GLLDLY
Sbjct: 491 QDKTLLRCCYKGDNAKIVQTVANLLSKSQPTLNSINRRPTPVNGFLDDYAFLIRGLLDLY 550

Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
           E      WL WA  LQ  QD LF D +G GYF +   D S+L+R KED DGAEP GNS++
Sbjct: 551 EASLDADWLSWAEVLQEQQDRLFWDTKGSGYFTSPANDSSILIRGKEDQDGAEPCGNSIA 610

Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
           V NL+RLA+ +   ++D  R  A  +L VF  RLK + +A+P M  A  +    S   V 
Sbjct: 611 VHNLIRLAAYL--DRAD-LRAKAGRTLTVFADRLKSIPVALPEMTSAL-LFYHNSPTQVF 666

Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHID-PADTEEMDFWEEHNSNNASMARNNFSAD 684
           + G     + + ++    + +   + +   D P        +  H     S+AR      
Sbjct: 667 IAGPTEDNNTQALIDVVRSRFIPGRILAVTDGPGGL----LYRRHE----SLARLRPIQG 718

Query: 685 KVVALVCQNFSCSPPVTDPISLENLL 710
           K  A VC+NF+CS PVT+P  L + L
Sbjct: 719 KPAAYVCRNFACSLPVTEPEELASNL 744


>gi|410895871|ref|XP_003961423.1| PREDICTED: spermatogenesis-associated protein 20-like [Takifugu
           rubripes]
          Length = 748

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 306/714 (42%), Positives = 421/714 (58%), Gaps = 50/714 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE + K+L+D FV IK+DREERPDVDKVYMT++QA  G GGWP+S
Sbjct: 62  STCHWCHVMERESFEDEEIGKILSDNFVCIKLDREERPDVDKVYMTFIQATSGSGGWPMS 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL+P +GGTYFPP D   RPG KT+L ++ D W   R  L  +G   +E L +  
Sbjct: 122 VWLTPDLRPFIGGTYFPPRDHGRRPGLKTVLMRIIDQWTNNRSALESNGNKILEALKKGT 181

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           + +A +   P   P +  + C +QL+ SY+  +GGF  +PKFP PV +  ++ +      
Sbjct: 182 AIAADAGTSPPFAP-DVTKRCFQQLANSYEEEYGGFRDSPKFPSPVNLMFLMSYWCMNRS 240

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T    E  E  +M L TL+ MA GGIHDHV  GFHRYS D  WHVPHFEKMLYDQ QLA 
Sbjct: 241 T---SEGVEALQMALHTLRMMALGGIHDHVSQGFHRYSTDSSWHVPHFEKMLYDQAQLAV 297

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y+ A  ++ + FY+ + +DIL Y+ RD+    G  +SAEDADS    G T K+EGAF +
Sbjct: 298 AYITASQVSGEQFYADVAKDILCYVSRDLSDKSGGFYSAEDADSLPHCGGTEKREGAFCI 357

Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           WT+ EV ++L             A +F  HY +K  GN  +S   DPH E +G+NVLI  
Sbjct: 358 WTASEVRELLPDVVEGTAGSATQADIFMHHYGVKEQGN--VSPEQDPHGELQGQNVLIVR 415

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                +A+  G+ +E+  N+L   R K+ ++R  RPRPHLD K++ SWNGL++S++AR  
Sbjct: 416 YSLELTAAHFGVSIEEVTNLLASARAKMAEIRKSRPRPHLDTKMLASWNGLMLSAYARVG 475

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------ 483
            +L  +A                 +E A  AA+F++ H++D +   L  S   G      
Sbjct: 476 AVLGDKA----------------LLERAVQAANFLQEHMWDPEQQTLLRSCYLGDDMELQ 519

Query: 484 --PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
                  GFLDDYAF+I GLLDL+E    T+WL WA ELQ  QD+LF D EGGGYF +  
Sbjct: 520 QISPPISGFLDDYAFIICGLLDLHEATLQTEWLRWAEELQLRQDKLFWDDEGGGYFCSDP 579

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
            D +VLLR+KED DGAEPS NSVS  NL+RL+      +   + Q +E  LA F  RL  
Sbjct: 580 SDFTVLLRLKEDQDGAEPSANSVSAFNLLRLSEYTGKQE---WLQKSERLLAAFTDRLTK 636

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
           + +A+P M  A  M    + K +V+ G + S D   +LA  ++ +  +K ++ ID    E
Sbjct: 637 VPIALPEMVRAL-MAQHYTLKKIVICGKRDSPDTVTLLATVNSLFLPHKVLMLID--GDE 693

Query: 662 EMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
           +    + H +  +   ++  +     A +C NF+CS PVTDP  L  LLL++ S
Sbjct: 694 DSSLQQRHPALYSITQQDGVA----TAYICHNFTCSLPVTDPQELRRLLLDETS 743


>gi|317419139|emb|CBN81176.1| Spermatogenesis-associated protein 20 [Dicentrarchus labrax]
          Length = 748

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 311/717 (43%), Positives = 427/717 (59%), Gaps = 52/717 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE + K+L+D FV IK+DREERPDVDKVYMT+VQA  GGGGWP+S
Sbjct: 62  STCHWCHVMERESFEDEEIGKILSDNFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMS 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+L+P +GGTYFPP D   RPG KT+L ++ + W   R  L  SG   +E L +  
Sbjct: 122 VWLTPELRPFIGGTYFPPRDHARRPGLKTVLTRIMEQWQNNRPALESSGERILEALKKGT 181

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           + +A+  + P   P  A R C +QL+ SY+  +GGF  APKFP PV +  ++ +      
Sbjct: 182 AVAANPGESPPLAPDVANR-CFQQLAHSYEEEYGGFRDAPKFPTPVNLMFLMSYWSVNRS 240

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T    E  E  +M L TL+ MA GGIHDHV  GFHRYS D  WHVPHFEKMLYDQ QLA 
Sbjct: 241 T---SEGVEALQMALHTLRMMALGGIHDHVAQGFHRYSTDSSWHVPHFEKMLYDQAQLAV 297

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y+ A  ++ +  ++ + +DIL Y+ RD+    G  +SAEDADS    G   K+EGAF V
Sbjct: 298 AYITASQVSGEQLFADVAKDILLYVTRDLSDKSGGFYSAEDADSVPASGGPEKREGAFCV 357

Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           WT+ EV ++L             A +F  HY +K  GN  ++   DPH E +G+NVLI  
Sbjct: 358 WTATEVRELLPDVVEGATGSATQADIFMHHYGVKVQGN--VAPEQDPHGELQGQNVLIVR 415

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                +A+  G+ +EK   +L   R K+ +VR  RP PHLD K++ SWNGL++S++AR  
Sbjct: 416 YSVELTAAHFGISVEKVNELLASARGKMAEVRKSRPCPHLDTKMLGSWNGLMLSAYARVG 475

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD-EQTHRLQHSFRNGPSKA- 487
            +L  +A                 +E A  A +F++ HL+D EQ   L+  +R    +  
Sbjct: 476 AVLGDKA----------------LLERAAQAGNFLKEHLWDAEQQTILRSCYRGDEMEVQ 519

Query: 488 ------PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
                  GFLDDYAF+I GLLDLYE    T+WL WA ELQ  QDELFLD +GGGYF++  
Sbjct: 520 QISPPISGFLDDYAFIICGLLDLYEATLQTEWLQWAEELQLRQDELFLDDQGGGYFSSDP 579

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
            D +VLL++KED DGAEPSGNSVS  NL+RL+      +   + Q ++  LA F  RL  
Sbjct: 580 SDNTVLLQLKEDQDGAEPSGNSVSASNLLRLSHYTGRQE---WLQRSQQLLAAFTDRLTR 636

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID-PADT 660
           + +A+P M     M    + K +V+ G + + D  ++LA  ++ +  +K ++  D  AD+
Sbjct: 637 VPIALPEMVRTL-MAQHYTLKQIVICGQRDAPDTASLLATINSLFLPHKVLMLTDGDADS 695

Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
               F  +     +SM++ +  A    A VCQ+F+CS PVTDP  L  LLL+  + T
Sbjct: 696 ----FLCQRLPVLSSMSQQDGVA---TAYVCQDFTCSLPVTDPQELRRLLLDGTTET 745


>gi|326672402|ref|XP_001920588.3| PREDICTED: spermatogenesis-associated protein 20 [Danio rerio]
          Length = 818

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 307/712 (43%), Positives = 419/712 (58%), Gaps = 52/712 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE + K+L+D FV IKVDREERPDVDKVYMT+VQA  GGGGWP+S
Sbjct: 140 STCHWCHVMERESFEDEEIGKILSDNFVCIKVDREERPDVDKVYMTFVQATSGGGGWPMS 199

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDLKP +GGTYFPP D   RPG KT+L ++ + W   R+ L  SG   +E L +  
Sbjct: 200 VWLTPDLKPFIGGTYFPPRDSGRRPGLKTVLLRIIEQWQTNRETLESSGERVLEALRKGT 259

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           + SAS  +     P  A R C +QL+ S++  +GGF  APKFP PV ++ ++        
Sbjct: 260 AISASPGETLPPGPDVANR-CYQQLAHSFEEEYGGFREAPKFPSPVNLKFLMSFWAV--- 315

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
              S E +E  +M L TL+ MA GGIHDHV  GFHRYS D  WHVPHFEKMLYDQGQLA 
Sbjct: 316 NRSSSEGAEALQMALHTLRMMALGGIHDHVAQGFHRYSTDSSWHVPHFEKMLYDQGQLAV 375

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y+ A+ ++ +  ++ + RD+L Y+ RD+    G  +SAEDADS  T  +T K+EGAF V
Sbjct: 376 AYITAYQVSGEQLFADVARDVLLYVSRDLSDKSGGFYSAEDADSFPTVESTEKREGAFCV 435

Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           WT+ E+ ++L             A +F  HY +K  GN D ++  DPH E +G+NVLI  
Sbjct: 436 WTAGEIRELLPDIVEGATGGATQADIFMHHYGVKEQGNVDPAQ--DPHGELQGQNVLIVR 493

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                +A+  G+ + +   +L E R KL +VR  RP PHLD K++ SWNGL++S FAR  
Sbjct: 494 YSVELTAAHFGISVNRLSELLSEARAKLAEVRRARPPPHLDTKMLASWNGLMLSGFARVG 553

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------ 483
            +L  +A                 +E AE AA F++ HL+DE   R+ HS   G      
Sbjct: 554 AVLGDKA----------------LLERAERAACFLQDHLWDEDGQRILHSCYRGNNMEVE 597

Query: 484 --PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
              S   GFLDDYAF++ GLLDL+E     +WL WA ELQ  QD+LF D +G GYF +  
Sbjct: 598 QVASPITGFLDDYAFVVCGLLDLFEATQKFRWLQWAEELQLRQDQLFWDSQGSGYFCSDP 657

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
            DP++LL +K+D DGAEPS NSVS +NL+RL+      + D+  Q +E  L  F  RL  
Sbjct: 658 SDPTLLLALKQDQDGAEPSANSVSAMNLLRLSHFTG--RQDWI-QRSEQLLTAFSDRLLK 714

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
           + +A+P M     M    + K +V+ G   + D  ++++  ++ + L   V+ +   +TE
Sbjct: 715 VPIALPDMVRGV-MAHHYTLKQIVICGLPDAEDTASLISCVNSLF-LPHKVLMLADGNTE 772

Query: 662 EMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
              +      +   +       D K  A VC+NF C+ PVT P  L  LL+E
Sbjct: 773 GFLY------DKLPILSTLVPQDGKATAYVCENFVCALPVTCPQELRRLLME 818


>gi|327264961|ref|XP_003217277.1| PREDICTED: spermatogenesis-associated protein 20-like [Anolis
           carolinensis]
          Length = 739

 Score =  560 bits (1444), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 307/731 (41%), Positives = 429/731 (58%), Gaps = 53/731 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCHVME ESF++E +A++LN+ FVSIKVDREERPD
Sbjct: 40  GQEAFDKAKKEDKLIFLSVGYSTCHWCHVMEHESFQNEEIAQILNENFVSIKVDREERPD 99

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP+SV+L+PDLKP +GGTYFPPED   + GF+T+L ++ + W 
Sbjct: 100 VDKVYMTFVQATSSGGGWPMSVWLTPDLKPFVGGTYFPPEDGIYQVGFRTVLIRILEQWK 159

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           + R  L ++    +  L   +       ++P  L +  +  C +QLS+SYD  +GGF   
Sbjct: 160 RNRAALLENSQKILSALLARVDVGVRGEEIPPSL-KEVMSRCFQQLSESYDEEYGGFSET 218

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP PV +  +  +      T    E +   +M L TL+ MA GGIHDH+  GFHRYS 
Sbjct: 219 PKFPTPVNMNFLFSYWALHRST---SEGARALQMALHTLKMMAYGGIHDHIAQGFHRYST 275

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+RWHVPHFEKMLYDQGQLA V+  AF ++ D F++ I  DIL Y  RD+    G  +SA
Sbjct: 276 DQRWHVPHFEKMLYDQGQLAVVFAKAFQISGDEFFADIVADILLYASRDLSDKSGGFYSA 335

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPTGNC 348
           EDADS  T  + +K+EGAF VWT++E+  +L +           A +F  HY +K  GN 
Sbjct: 336 EDADSYPTAKSEKKQEGAFCVWTAEEIRHLLPDLIEGSPERKSVADVFMHHYGVKEDGN- 394

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
            ++ M DPHNE KGKNVLI       +A++ G+ LE+   +L + R +L+  R++RPRPH
Sbjct: 395 -VNPMKDPHNELKGKNVLIVQYSLELTAARFGLGLEQLKTMLVKSRDQLYKARAQRPRPH 453

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
           LD K++ SWNGL+IS FA++  IL                 +KEY++ A + A F+R ++
Sbjct: 454 LDTKMLASWNGLMISGFAQSGAIL----------------GKKEYVDRAVNTADFLRNYM 497

Query: 469 YDEQTHRLQHSFRNG------PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 520
           ++    +L  S   G       S  P  GFL+DY F+I  L DLYE      WL WA++L
Sbjct: 498 FNASNGKLLRSCYQGKENSVDKSSVPIHGFLEDYVFVIQALFDLYEASLNPSWLEWAVQL 557

Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
           Q+ QDELF D +G  YF T   DPS+LLR+K+D DGAEPS NSV+V NL+R AS     +
Sbjct: 558 QHKQDELFWDPKGFAYFTTEASDPSLLLRMKDDQDGAEPSPNSVAVSNLLRAASYTGHKE 617

Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 640
              + + A   L+ F  RL  + + +P M  A     + ++K VV+ G     D   +L 
Sbjct: 618 ---WVKKAGQILSAFSERLLKIPVVLPEMARATAAFHL-TQKQVVICGDPKGEDTRELLH 673

Query: 641 AAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
             ++++  N+ +I    AD     F  +     +S+ + N    K  A +C+NF+CS PV
Sbjct: 674 CYYSTFTPNRVLIF---ADGNTTGFPYQQLGFLSSLEKKN---GKATAYLCENFACSLPV 727

Query: 701 TDPISLENLLL 711
           T    L  LLL
Sbjct: 728 TSSQELRCLLL 738


>gi|156368209|ref|XP_001627588.1| predicted protein [Nematostella vectensis]
 gi|156214502|gb|EDO35488.1| predicted protein [Nematostella vectensis]
          Length = 735

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 303/734 (41%), Positives = 418/734 (56%), Gaps = 60/734 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K ++  FL    +TCHWCHVME ESFEDE +AK+LN+ F+ +KVDREERPD
Sbjct: 37  GDEAFQKAKKEQKPIFLSVGYSTCHWCHVMERESFEDENIAKILNENFIPVKVDREERPD 96

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD+VYMTY+QA+ GGGGWP+S++L+PDLKP + GTYFPP D  GRPGF T+L  +   WD
Sbjct: 97  VDRVYMTYIQAMVGGGGWPMSLWLTPDLKPFVAGTYFPPNDMAGRPGFGTVLGHIIKQWD 156

Query: 119 KKRDMLAQSGAFAIEQLSE-ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
             +    Q     +  + E A      +  +P+   +  +    + +SKS+D   GGFG 
Sbjct: 157 TNKPKFTQQSTIVMNAILEHASEIGLDAKDMPN---KEVIEKLYQGMSKSFDEELGGFGG 213

Query: 178 APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
           APKFP+P     +  YH  K      + E      + L TL+CM KGGIHDHVG GFHRY
Sbjct: 214 APKFPQPATFNFLFKYHLLK----NGTEEGERALHICLKTLECMGKGGIHDHVGQGFHRY 269

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
           S D  WHVPHFEKMLYDQ Q+A  Y   + +TKD  ++  CRDIL Y+ RD+    G  +
Sbjct: 270 STDRFWHVPHFEKMLYDQAQIAAAYAMGYQMTKDEKFAETCRDILLYVMRDLSHKLGGFY 329

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-----------AILFKEHYYLKPT 345
           SAEDADS  +  AT+K EGAFYVW  +E++D+L +            + LF +HY ++  
Sbjct: 330 SAEDADSLPSPNATKKTEGAFYVWEEQELKDLLSDSLPTKGGGSILLSELFNKHYGVQAE 389

Query: 346 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 405
           GN  +    DPH E   KNVLI       +   L +  ++    L + R  LF+ R KRP
Sbjct: 390 GN--VKPHQDPHKELVKKNVLIVRGSLQDTIKDLDVEEDEAKEQLAKAREILFEERKKRP 447

Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
            PHLDDK+I SWNGL+IS FAR+ ++L  E                 Y+  A  AA F+R
Sbjct: 448 APHLDDKMITSWNGLMISGFARSGQVLGEEV----------------YILRAIKAAEFVR 491

Query: 466 RHLYDEQTHRLQHSFRNG--------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWA 517
            HLYD+ +  L  S   G         +   G+  DY +LI+GLLDLYE     +WL WA
Sbjct: 492 THLYDKSSGELLRSCYRGDKDSIAQIATPIKGYGCDYVYLINGLLDLYEASFDEQWLKWA 551

Query: 518 IELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 577
            ELQ+  DELFLD+E GGYF  T  D S+L+R+K++ DGAEPS NS++V+NL+RL + V 
Sbjct: 552 EELQDKADELFLDKEKGGYFEVTEADKSILVRLKDEQDGAEPSANSLAVMNLMRLGNFVD 611

Query: 578 GSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFEN 637
             +   YR  A+    V+E+RL+ + +A+P +       ++   K +++ G + + D + 
Sbjct: 612 CQR---YRDQAQRIFMVYESRLRQIPLALPELVSNFITHNL-GMKQIIIAGDRDADDTKL 667

Query: 638 MLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCS 697
           ++   H+ Y  NK ++  D  D     F     S   ++ R +    K  A VCQN++C 
Sbjct: 668 LMRCVHSHYIPNKVLLLCDGKDG----FLSTKLSVFKTLQRVD---GKATAYVCQNYTCQ 720

Query: 698 PPVTDPISLENLLL 711
            PVT    L  LL+
Sbjct: 721 LPVTSEEELTKLLV 734


>gi|241111177|ref|XP_002399229.1| spermatogenesis-associated protein, putative [Ixodes scapularis]
 gi|215492917|gb|EEC02558.1| spermatogenesis-associated protein, putative [Ixodes scapularis]
          Length = 745

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 309/717 (43%), Positives = 416/717 (58%), Gaps = 59/717 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+  +A+L+N+ FV++KVDREERPD+D+VYMTY+QA  GGGGWP+S
Sbjct: 65  STCHWCHVMERESFENADIARLMNEHFVNVKVDREERPDLDRVYMTYIQATSGGGGWPMS 124

Query: 80  VFLSPDLKPLMGGTYFPPEDKY-GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           V+L+PDLKP++GGTYFPP+D+Y GRPGFKT+L  + +   +  ++L Q+         EA
Sbjct: 125 VWLTPDLKPIVGGTYFPPDDRYFGRPGFKTLLAAIAEQGSRIVEILRQASDLRSSDEREA 184

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            +A+++S              C EQLS+SYD   GGFG APKFP+ V +  +L H+   +
Sbjct: 185 GAAASTSGSEAVPRASTVAATCFEQLSRSYDEAMGGFGKAPKFPQCVNLNFLLRHAVASQ 244

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           + G   EA+   +M + TL  MA+GGIHDHV  GFHRYS D  WHVPHFEKMLYDQ QLA
Sbjct: 245 EPG---EAARALEMCVNTLNKMARGGIHDHVAKGFHRYSTDGGWHVPHFEKMLYDQAQLA 301

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL+AF  T+D   + + RD+LDY+ RD+    G  +SAEDADS     +  KKEGAF 
Sbjct: 302 RAYLEAFQATRDPHLAQVARDVLDYVERDLSHQSGGFYSAEDADSLPEASSGEKKEGAFC 361

Query: 319 VWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           VW   EV  +L E          A LF  ++ ++  GN D   M DPH+E KGKNVL+  
Sbjct: 362 VWEEAEVRRLLPEPLPGCPGRTVADLFCRYFGVEAGGNVD--PMQDPHDELKGKNVLVVR 419

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
               + A + G+ L    ++L + RR L + R +RPRPHLDDK + +WNGL++S FA A+
Sbjct: 420 ESQESLAERFGLELPVLHSLLEDARRVLLEARQRRPRPHLDDKFLAAWNGLMVSGFATAA 479

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS---- 485
           K+L                DR+ Y   A  A +F+ +HLYDE    L  S   G      
Sbjct: 480 KVL---------------GDRR-YAGRALQAVAFLGQHLYDEDRKSLLRSAYRGEGGHVT 523

Query: 486 ----KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
                 PG L+DYAF + GLLD YE       L+ A ELQ+ QD  F D + GGYF ++G
Sbjct: 524 QTARPIPGVLEDYAFTVQGLLDTYEACFEAPCLLRAEELQDAQDARFWDPDQGGYFLSSG 583

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
           ED  +LLR+K+D DGAEPS NSVS+ NLVRL+ ++  +++D  R+ A+     +  RL  
Sbjct: 584 EDAHLLLRLKDDQDGAEPSPNSVSLSNLVRLSVLL--NRAD-LRERAQRLAEAYARRLSL 640

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
           + +A+P M C    L     + VV+ G K     + +L+     +    T I  D     
Sbjct: 641 LPLALPEMVCGLLRLQA-GPQEVVVAGGKDHPGTQELLSCLRGHFLPFLTTILAD----- 694

Query: 662 EMDFWEEHNSNNASMARNNFSADKVV-----ALVCQNFSCSPPVTDPISLENLLLEK 713
                 +   N       NF A K V     A VC+NF CS PVT  + LE LL +K
Sbjct: 695 ------QDPENPLRERLPNFDAYKCVDGKPTAYVCRNFVCSKPVTSAVELERLLQQK 745


>gi|321473187|gb|EFX84155.1| hypothetical protein DAPPUDRAFT_47524 [Daphnia pulex]
          Length = 661

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 290/622 (46%), Positives = 393/622 (63%), Gaps = 55/622 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA+L+N  F++IKVDREERPDVDK+YM++VQA+ G GGWP+S
Sbjct: 61  STCHWCHVMEKESFEDENVAELMNSEFINIKVDREERPDVDKMYMSFVQAITGRGGWPMS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKY-GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           V+++P+LKP+ GGTY+PP+D+Y G+PGFKTIL+ + + W +       SG    E++  A
Sbjct: 121 VWMTPELKPVYGGTYYPPDDRYYGQPGFKTILKSLAEQWKENPGKFKASG----EKIMTA 176

Query: 139 LSASASSNKLPDELPQ--NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           L+ S++  +  D++P   +   LC +QL  SY+ +FGGF  APKFP+PV + ++L     
Sbjct: 177 LARSSTLGR-GDQVPSAFDCGHLCFQQLRGSYEPKFGGFSKAPKFPQPVNMNLLLRWHVL 235

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            +D   S  A +   M L TL+ MAKGGI DHV  GF RYS DE+WHVPHFEKMLYDQ Q
Sbjct: 236 SDDAADSDLALD---MCLHTLRMMAKGGIFDHVRLGFARYSTDEKWHVPHFEKMLYDQAQ 292

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           LA VY DA+ LTKD  ++ +  DIL Y+  D+  P G  +SAEDADS    G+  K+EGA
Sbjct: 293 LALVYTDAYLLTKDQDFARVASDILTYVSNDLSDPSGGFYSAEDADSYPETGSDEKREGA 352

Query: 317 FYVWTSKEVEDILGEHAI------------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
           F VW+ KE++ +L                 +   H+ ++P+GN D     DPH+E KG+N
Sbjct: 353 FCVWSHKEIQSVLASQPAPSQVGPDVTVSDIVCYHFDIRPSGNVD--PYQDPHDELKGQN 410

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           VLI       +A+K G+ ++    +L      + + R +RPRPHLDDK++ SWNGL+IS+
Sbjct: 411 VLIIRGSDEETAAKFGLSMDVLRELLETALSTMREARQRRPRPHLDDKMLASWNGLMISA 470

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNG 483
            ARA +IL                 R  Y+E A  AA F+R+HLYD Q+ RL  S +R G
Sbjct: 471 LARAGQILG----------------RDTYVERAAKAAEFVRQHLYDGQSGRLLRSCYRGG 514

Query: 484 PSKAP----------GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
             +            GFLDDYAF+I GLLDLY      KW+ WA ELQ  QD+LF D   
Sbjct: 515 DGQQDAVSQNAEPIGGFLDDYAFVIRGLLDLYTACQDEKWIQWADELQQKQDQLFWDPSQ 574

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           GGYF++   DPS+L+R+KE+ DGAEPSGNS++V NL RLA  VA  +SD YR  A  +L 
Sbjct: 575 GGYFSSAAGDPSILIRLKEEQDGAEPSGNSIAVGNLERLA--VAVDRSD-YRDQARRTLC 631

Query: 594 VFETRLKDMAMAVPLMCCAADM 615
           +F+ RL  + +++P M  A  +
Sbjct: 632 LFQDRLAKIPVSLPEMVAALQL 653


>gi|193215110|ref|YP_001996309.1| hypothetical protein Ctha_1399 [Chloroherpeton thalassium ATCC
           35110]
 gi|193088587|gb|ACF13862.1| protein of unknown function DUF255 [Chloroherpeton thalassium ATCC
           35110]
          Length = 710

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 288/703 (40%), Positives = 406/703 (57%), Gaps = 56/703 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E +A++LN+ FVSIKVDREE PD+DKVYMTYVQA  G GGWP+S
Sbjct: 54  STCHWCHVMERESFENEEIARILNEHFVSIKVDREEHPDLDKVYMTYVQASTGSGGWPMS 113

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+LKP  GGTYFPP D YGRPGF ++L K+ ++W + R+ + Q+     EQL    
Sbjct: 114 VWLTPELKPFFGGTYFPPSDSYGRPGFGSMLLKIAESWQQSRERVLQAAGNISEQLQAFS 173

Query: 140 SASASSN-KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--LYHSKK 196
              A +  K+PDE    A +    Q    +D  +GGFG+APKFPRP  +  +   +H  K
Sbjct: 174 EMQAEAGAKVPDEA---AFQNTFAQFESVFDKDWGGFGNAPKFPRPAILNFLFTFFHQTK 230

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKM 250
            E            +M L TL+ MA GG+HDH+      GGGF RYS D  WHVPHFEKM
Sbjct: 231 NE---------AALRMALHTLRKMADGGMHDHISVPGKGGGGFARYSTDAYWHVPHFEKM 281

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD  QLA+ YLDA+ +T D F++   RDI +Y+  DM  P G  +SAEDADS     + 
Sbjct: 282 LYDNAQLASAYLDAYQITSDRFFADTARDIFNYVLCDMTAPEGGFYSAEDADSLAAPESP 341

Query: 311 RKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            K EGAFYVW   E++ +LG+ A  +F   Y + P GN  +    DPH EFKGKN+LI  
Sbjct: 342 EKTEGAFYVWERAEIDALLGDEASQIFSFIYGVHPGGNASV----DPHGEFKGKNILIRR 397

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              S +A + G        ++ + R +LFD R +RPRPH DDK++ +WNGL+IS+FA+  
Sbjct: 398 ATLSQAAQEFGKSEADIAEVMAKSRERLFDARLQRPRPHRDDKILTAWNGLMISAFAKGY 457

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L                D   Y+  A+ AA F+   LY+++T  L   +R+G S   G
Sbjct: 458 MVL----------------DEATYLHAAQKAADFVIEKLYNKETGGLLRRYRDGESAIDG 501

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
             DDYAF +  L+DLYE     K+L  A++L   Q+ LF D + GG+F++T E+ SV+ R
Sbjct: 502 KADDYAFFVQALIDLYEASFQFKYLSLALDLAEKQNALFYDAQNGGFFSSTSENKSVIFR 561

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
           +K+D DGAEPS NSV+ +NL+RL+ +   +  + +RQ AE ++  F   L +    +P M
Sbjct: 562 LKDDQDGAEPSANSVAALNLLRLSQM---ADREDFRQKAEATVNFFGKILSEAGNQMPQM 618

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             A   L     K ++L G   S +   +  A  + Y+  K ++H            EE 
Sbjct: 619 FAALSFLKQKP-KQIILTGAPDSPELRALRKAIDSVYEPVKVLLHAT----------EET 667

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
               + ++  +  + K  A +C N++C  P ++P  +   L+E
Sbjct: 668 AGLTSFLSSLSLGSQKPTAYICINYACRLPTSEPAKVREFLVE 710


>gi|357626408|gb|EHJ76509.1| hypothetical protein KGM_19065 [Danaus plexippus]
          Length = 813

 Score =  543 bits (1399), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 306/714 (42%), Positives = 409/714 (57%), Gaps = 55/714 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE E VAK++N+ F++IKVDREERPD+D+VYM +V A  GGGGWP+S
Sbjct: 133 STCHWCHVMERESFESEDVAKIMNEHFINIKVDREERPDLDRVYMLFVMATTGGGGWPMS 192

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDL+P+ GGTYFPPED++GRPGFKTIL  +   W + +    ++    ++ L    
Sbjct: 193 VFLTPDLRPVTGGTYFPPEDRWGRPGFKTILLSLAKKWKENQTQFLEASINIMDALQNIS 252

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +    +N +P E   N    C  +   +++  FGGFG+APKFP+   I   L+H    + 
Sbjct: 253 NVKVETNSVPGEATWNK---CVRRYITNFEPHFGGFGTAPKFPQ-ASIFNFLFHFYARDK 308

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             ++ E  +  +M L TL  ++KGGIHDHV  GF RYSVD  WHVPHFEKMLYDQ QL  
Sbjct: 309 --QNPEGKQCLEMCLHTLTKISKGGIHDHVASGFARYSVDNDWHVPHFEKMLYDQAQLMV 366

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y DA+  TK+ +Y+ + RDI+ Y+ RD+    G  +SAEDADS    GA +KKEGAF V
Sbjct: 367 AYTDAYLATKEEYYADVVRDIVKYVNRDLRHDLGGYYSAEDADSYPVFGADKKKEGAFCV 426

Query: 320 WTSKEVEDILGEHAI-------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           W   E+  ++G+  +       +F +++ ++ +GN  +S  SDPH E   KNVLI     
Sbjct: 427 WEYDEINSLIGDKKVGNVSYLEIFCDYFNVEESGN--VSPESDPHGELTNKNVLIIYGSE 484

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
             +ASK  +  ++   +L EC   L++ RSKRPRPHLD K++ SWNGL IS  A A +  
Sbjct: 485 EETASKFEITKDQLKQVLKECIDILYEARSKRPRPHLDTKMLCSWNGLAISGLAHAGQ-- 542

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF----------RN 482
                         G   K ++E A   A+FI+ HLYD++   L HS            N
Sbjct: 543 --------------GLGEKSFVEDAIKTANFIKEHLYDQENKTLLHSCYKAEDGNITQTN 588

Query: 483 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 542
            P K  GFLDDYAFLI GLLDLYE      WL WA ELQ  Q+ELF D + GGYF  + E
Sbjct: 589 PPIK--GFLDDYAFLIRGLLDLYEASLDLHWLNWARELQEKQNELFWDSDNGGYFTCSAE 646

Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS----DYYRQNAEHSLAVFETR 598
           D SV+LR+KED DGAEPSGNSVS  NL RLA+    S +    D  R  A+  L  F  R
Sbjct: 647 DTSVVLRLKEDQDGAEPSGNSVSCHNLQRLAAYADKSSAEEGGDRERDMAKKVLMAFAKR 706

Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
           L D   A P M  A  M    S   V++ G  S      ++ A  +     + +   DP 
Sbjct: 707 LIDSPTASPEMMSAL-MFFTDSPTQVLISGGCSDPRTLALVRAVRSRLLPGRVLAVADPK 765

Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
           D+           ++  ++R   + +   A VC+ ++CS PVT    LE LL E
Sbjct: 766 DSPA-------GMSDILLSRIRSTGEAPTAYVCRRYACSLPVTSVQQLETLLDE 812


>gi|328702149|ref|XP_001952649.2| PREDICTED: spermatogenesis-associated protein 20-like
           [Acyrthosiphon pisum]
          Length = 784

 Score =  543 bits (1399), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 318/748 (42%), Positives = 421/748 (56%), Gaps = 78/748 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F      ++  FL    +TCHWCHVME ESFE++ VA ++N+ +V+IKVDREERPD
Sbjct: 81  GDEAFEKARSEKKLIFLSVGYSTCHWCHVMEHESFENQDVAAVMNEHYVNIKVDREERPD 140

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW- 117
           VD++YMT+VQA  G GGWP+SVFL+PDLKP+ GGTY+PPED YGRPGFKTIL  +   W 
Sbjct: 141 VDQLYMTFVQAASGQGGWPMSVFLTPDLKPIGGGTYYPPEDAYGRPGFKTILLHMAKRWK 200

Query: 118 ----------DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS 167
                      K   +L  + AF I QL   LS     N      P+  +  C  QL + 
Sbjct: 201 SDSKSMLENSSKMMKILNDTTAFDI-QLGTELSNIMKPN------PKTWIT-CYSQLQRI 252

Query: 168 YDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHD 227
           YD  +GGFG  PKFP+P  +  + + S K+    KS E  +  +M L TLQ M  GGIHD
Sbjct: 253 YDDEWGGFGMPPKFPQPTILDFLFHISHKM---SKSYEGKKSLEMALETLQKMTMGGIHD 309

Query: 228 HVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD 287
           H+G GF RYS DE+WHVPHFEKMLYDQ QLA  Y  AF +TK   YS +  DIL Y+ RD
Sbjct: 310 HIGQGFARYSTDEKWHVPHFEKMLYDQAQLAVSYTTAFQITKHEQYSDVVHDILQYVSRD 369

Query: 288 MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH---------AILFKE 338
           +    G  +SAEDADS  T  +T+K+EGAF  WT +EV+ +L +          + LF  
Sbjct: 370 LSHKLGGFYSAEDADSLPTVDSTKKREGAFCTWTQEEVKTLLDQPLDSNPDIKLSELFCW 429

Query: 339 HYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLF 398
           H+ + P GN      SDPH E  G+NVLIE      +A K  + +E     L   +  LF
Sbjct: 430 HFSVLPNGNVRPD--SDPHGELLGQNVLIEFRSKENTAKKFQITVENVEKELKIAKSILF 487

Query: 399 DVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAE 458
           + R KRPRPHLD+K+I SWNGL+I+++ARA+  L  E                EY + A 
Sbjct: 488 EARKKRPRPHLDNKIITSWNGLMITAYARAASALNVE----------------EYKQRAI 531

Query: 459 SAASFIRRHLYDEQTHRLQHSFRNG-------PSKAPGFLDDYAFLISGLLDLYEFGSGT 511
            AA F++ H ++     L+  + N             GFL+DYAFLI GLLDLYE    +
Sbjct: 532 KAAEFLKTHAWNNSV-LLRSCYVNDIGDIANIEKPIAGFLNDYAFLIRGLLDLYECTLQS 590

Query: 512 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 571
           KWL WA ELQ  QDELF D+E  GY++++ +DPS++LR K DHDGAEPSGNS+S +NL+R
Sbjct: 591 KWLKWADELQEQQDELFWDKEKFGYYSSSDKDPSIILRFKSDHDGAEPSGNSISALNLLR 650

Query: 572 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 631
           L+ +   S+   YR   +     F  RL   + A+P +  A   L   S   V + G   
Sbjct: 651 LSILTEKSE---YRSKIDPLFLAFAGRLSGSSSALPALVSAL-TLHCDSITSVYVTGDLD 706

Query: 632 SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVC 691
           + + E +L+A    Y  N  + H D     E+       +    +A N     KV A VC
Sbjct: 707 NPELEALLSAIRQRYMPNLVLAHADENSLSEL-------AKGLGIAENG----KVAAYVC 755

Query: 692 QNFSCSPPVTDPISLENLL---LEKPSS 716
           +N +C+ PV     L  LL   +E P+S
Sbjct: 756 KNNTCNLPVHSTEELIALLDGRVESPAS 783


>gi|116626220|ref|YP_828376.1| hypothetical protein Acid_7180 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116229382|gb|ABJ88091.1| protein of unknown function DUF255 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 704

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 299/694 (43%), Positives = 412/694 (59%), Gaps = 42/694 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E +A LLN  +++IKVDREERPDVD++YMT+VQA  G GGWP+S
Sbjct: 49  STCHWCHVMERESFENEEIAALLNRDYIAIKVDREERPDVDRIYMTFVQATTGSGGWPMS 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+L+P  GGTYFPPE+++G PGF +IL ++   W   R  + +S    IEQL + +
Sbjct: 109 VWLTPELEPFFGGTYFPPENRWGHPGFGSILTQIAGVWRDNRPQVVESARDVIEQLKKHV 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             + S   +     Q  L        +++D+R GGFG+APKFPR V I   L     L  
Sbjct: 169 EVAPSHGGV--AFDQATLDSGFSVFRRTFDTRTGGFGAAPKFPR-VSIHHFL-----LRY 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             ++G   E   MVL TL+ MA+GG++D +GGGFHRYSVD+RW VPHFEKMLYDQ Q+A 
Sbjct: 221 YARTGN-KEALDMVLLTLREMARGGMNDQLGGGFHRYSVDDRWFVPHFEKMLYDQAQIAI 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAFY 318
            YL+AF +T D  Y+   R I DY+ RDM   GG  +SAEDADS  T E  T K EGAFY
Sbjct: 280 SYLEAFQVTGDAQYADTARAIFDYVLRDMTDSGGGFYSAEDADSIITPEQPTLKGEGAFY 339

Query: 319 VWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +W+ +E+  ++G  A   F   Y ++  GN +    +DPH EF GKN+L + +    +A 
Sbjct: 340 IWSMEEIHALVGAPASDWFCYRYGVREGGNVE----NDPHGEFTGKNILYQQHTLEQTAE 395

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
             G P  +    L    R L   R+KR RPHLDDK++ SWNGL+IS+FA+   +L+    
Sbjct: 396 HFGQPAGEMDATLDNAARILLQARAKRVRPHLDDKILTSWNGLMISAFAKGGAVLEEPRY 455

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
           +                  A  AA+F+   L D  +  L   +R G +  PGFLDDYAF 
Sbjct: 456 AEA----------------ARRAAAFVAGRLCDAASGTLLRRYREGDAAIPGFLDDYAFF 499

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           + GLLDLYE       L  AI L   Q ELF DRE G +F+T   DP ++LRVKED+DGA
Sbjct: 500 VQGLLDLYEAQFDLSHLQLAIRLTEKQLELFEDREAGAFFSTIDGDPELVLRVKEDYDGA 559

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
           EPSGNSVSV+NLVRLA I   +  D +RQ+A  +L+ F +RL    MAVP +  A + ++
Sbjct: 560 EPSGNSVSVMNLVRLAQI---TNRDQFRQSAGRALSAFASRLSVAPMAVPQLLAACEFVT 616

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
              R+ ++  G + S + + ML   H  +  N+ V+ +D A+  +        +      
Sbjct: 617 GQPRE-IIFAGTRDSAELQAMLHELHRRFIPNRVVLLVDSAEARKT------LAGGIPSI 669

Query: 678 RNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
            +   AD +  A VC++++C  PV+DP +   L+
Sbjct: 670 ESMLPADGRATAYVCRDYTCQLPVSDPANFAELI 703


>gi|340721576|ref|XP_003399194.1| PREDICTED: spermatogenesis-associated protein 20-like [Bombus
           terrestris]
          Length = 831

 Score =  540 bits (1391), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 298/718 (41%), Positives = 417/718 (58%), Gaps = 63/718 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF ++ +A+++N  F++IKVD+EERPD+DK+YMT++QA  G GGWP+S
Sbjct: 146 STCHWCHVMEKESFTNKEIAEIMNKNFINIKVDKEERPDIDKIYMTFIQATSGHGGWPMS 205

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+ DLKP++GGTYFPPED + + GFKTIL  V   W++ R  L + G+  +E L  ++
Sbjct: 206 VFLTADLKPIIGGTYFPPEDTFRQIGFKTILLSVAQKWNQSRSKLTEIGSTNLETLC-SI 264

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHS 194
           S   +S K+ D       ++C +Q    ++ +FGGFGS     +PKFP+PV +   L+H 
Sbjct: 265 SKIPNSLKVHDTPSLECSKICIQQFVNGFEPKFGGFGSTYNMQSPKFPQPVNLN-FLFHM 323

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
              +   +S        M ++TL+ M+ GGIHDHVG GF RY+ D  WHVPHFEKMLYDQ
Sbjct: 324 YARQPNVES--VRPCLHMSVYTLKKMSFGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQ 381

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           GQL   Y DA+ +TKD F++ I  DI  Y+ RD+    G  +SAEDADS  T  A  KKE
Sbjct: 382 GQLMKSYADAYLVTKDNFFAEIVDDIATYVIRDLRHKEGGFYSAEDADSYPTHDAHAKKE 441

Query: 315 GAFYVWTSKEVEDILGEHAI---------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
           GAFYVW++ E++ IL +            +F  H+ +  +GN  +    DPH E K KNV
Sbjct: 442 GAFYVWSAVEIKSILNKEVSDETHVKLSDIFCRHFNVNESGN--VKSHQDPHGEIKEKNV 499

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
           LI  N+   +A    +P+E+    L E    L+ VRS RPRPHLDDK+I +WNGL+IS  
Sbjct: 500 LIAYNEIEETARYFNLPVEETKMYLKEACSMLYKVRSARPRPHLDDKIITAWNGLMISGL 559

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNG- 483
           A                F     + K+Y+E A  AA FI+ +L+DE  + L HS +R+  
Sbjct: 560 A----------------FGGAAVNNKQYIERAADAAKFIKEYLFDETKNILLHSCYRDEK 603

Query: 484 ------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
                  +  PGFLDDYAF+I GLLDLYE     +WL +A +LQ+ QD+ F D + GGYF
Sbjct: 604 DTIIQISTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQHLQDQYFWDEKDGGYF 663

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
           +TT  DPS++LR+KE +DGAEPSGNS++  NL+RLA  +     D ++  A H   VF  
Sbjct: 664 STTSSDPSIILRLKEAYDGAEPSGNSIAAENLLRLADYLG---CDEFKDKAAHLFRVFRH 720

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENMLAAAHASYDLNKTV 652
            L    + VP       + S   R H     + +VG + + D + +L   +     N+ +
Sbjct: 721 LLMQSPVTVP------QLTSALVRYHDDAAQMYVVGKRGAKDTDELLRVIYKRLIPNRIL 774

Query: 653 IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           + IDP  T  +   +  +  N     N     +    VC++ +CS PVT P  L  LL
Sbjct: 775 LLIDPDKTNSLLLRKNQHLRNMKSVNN-----RATVYVCKHRTCSLPVTSPEQLATLL 827


>gi|345485510|ref|XP_001604421.2| PREDICTED: spermatogenesis-associated protein 20-like [Nasonia
           vitripennis]
          Length = 797

 Score =  537 bits (1384), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 309/729 (42%), Positives = 424/729 (58%), Gaps = 61/729 (8%)

Query: 11  KTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYM 64
           K RR   LI      +TCHWCHVME ESFE+  VAK++N +FV+IKVDREERPD+D+VYM
Sbjct: 98  KARREDKLIFLSVGYSTCHWCHVMEKESFENPEVAKIMNRYFVNIKVDREERPDIDRVYM 157

Query: 65  TYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML 124
           T++Q++ G GGWP+SVFL+PDL P+ GGTYFPP DKYG+PGF  IL  +   W + +  L
Sbjct: 158 TFIQSISGHGGWPMSVFLTPDLTPITGGTYFPPVDKYGQPGFSRILESIATKWIESKQDL 217

Query: 125 AQSGAFAIEQLSEALSASASSNKLPDE--LPQ-NALRLCAEQLSKSYDSRFGGFGSAPKF 181
            +SG+  ++ L +++ +     K P+E  +P  +    C +QL   ++  FGGF  APKF
Sbjct: 218 LKSGSKILQVLKKSVES-----KDPEEASVPSVDCANTCVKQLINGFEPSFGGFSRAPKF 272

Query: 182 PRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
           P+PV   ++     + + TG++G+  +   M + TL  MA GGIHDHVG GF RYSVD +
Sbjct: 273 PQPVNFNLLFLMYAR-DPTGETGK--QCLNMCVHTLTKMANGGIHDHVGQGFSRYSVDGK 329

Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
           WHVPHFEKMLYDQGQL   Y +A+  +KD  ++ I  DI+ Y+ RD+  P G  +SAEDA
Sbjct: 330 WHVPHFEKMLYDQGQLLRSYSEAYLASKDPLFAEIVNDIVTYVARDLRHPEGGFYSAEDA 389

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGE---------HAILFKEHYYLKPTGNCDLSR 352
           DS  +   T KKEGAFYVW  ++VE +L +          + LF  H+ +KP GN  + R
Sbjct: 390 DSFPSFEDTEKKEGAFYVWRYEDVESLLDKVISEKEGLTLSDLFCYHFNVKPEGN--VQR 447

Query: 353 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 412
             DPH E   +NVLI     + +A    + ++     L +    LF+ R+KRPRPHLDDK
Sbjct: 448 QQDPHGELMNQNVLIAFGSIAETAEHFKLSIDSVKAHLEKSISILFEERNKRPRPHLDDK 507

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
           ++ +WNGLVIS  + A+  L                D  +Y + AE AA FI R+LY++ 
Sbjct: 508 IVTAWNGLVISGLSHAASAL----------------DNPKYTKFAEDAARFIERYLYNKD 551

Query: 473 THRLQHSFRNGPSKA--------PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
              L  S   G S           GF  DYAF I GLLDLYE      WL +A ELQ+ Q
Sbjct: 552 DKVLLRSCYRGDSDQILQTSVPIKGFQVDYAFAIRGLLDLYEVSFNAHWLEFAEELQDIQ 611

Query: 525 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
           D LF D + GGYF+TT +D SV+LR+K+D DGAEPSGNSV+  NLVRLAS +   ++D  
Sbjct: 612 DSLFWDDKSGGYFSTTTDDRSVILRLKDDQDGAEPSGNSVACGNLVRLASYL--DRTD-L 668

Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
              AE  L+  +  L    +A P +  A   L + S   V ++G K + D + +L    +
Sbjct: 669 SSKAEKLLSSMQEILIQFPVACPELVTALVTL-IDSTTQVYIIGKKDTDDTKQLLKVLQS 727

Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPI 704
                K V+  D  + + + +  + N     M + N    +  A VC +  CS PVTDP 
Sbjct: 728 KLVPGKIVMLADGVNQDNVLY--KKNEVIGKMKQQN---GRATAYVCHHHICSLPVTDPK 782

Query: 705 SLENLLLEK 713
            LE+LL +K
Sbjct: 783 DLESLLDKK 791


>gi|427788829|gb|JAA59866.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 766

 Score =  536 bits (1380), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 311/735 (42%), Positives = 416/735 (56%), Gaps = 76/735 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE++ +AK++ND FV++KVDREERPDVD+VYMTY+QA  GGGGWP+S
Sbjct: 65  STCHWCHVMERESFENDDIAKIMNDNFVNVKVDREERPDVDRVYMTYIQATSGGGGWPMS 124

Query: 80  VFLSPDLKPLMGGTYFPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGA--FAI-EQL 135
           ++L+PDLKP++GGTYFPP+D+ YG+PGFKT+L  + + W K R  L   G   F I EQ 
Sbjct: 125 IWLTPDLKPVVGGTYFPPDDRYYGQPGFKTLLTSLAEQWRKNRTKLIDQGTRIFQILEQT 184

Query: 136 SE-----------ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP 184
           S+           +   S ++ K P     +    C  QL +SYD   GGFG APKFP+ 
Sbjct: 185 SDVRVFGGDGVPTSPRGSEANQKCP--FAPDVATTCYRQLERSYDVSMGGFGRAPKFPQC 242

Query: 185 VEIQMMLYHSKKLEDTGKSGEAS----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
           V +  +L +   L       EA     +  +M + TL+ MA+GGIHDH+G GFHRYS D 
Sbjct: 243 VNLNFLLRYRAVLLQGDPPPEAKTAVDKALEMTVHTLRMMAQGGIHDHIGKGFHRYSTDG 302

Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAED 300
           +WHVPHFEKMLYDQ QL   Y +A+ +T D   + + RDIL Y+ RD+  P G  +SAED
Sbjct: 303 KWHVPHFEKMLYDQAQLTRTYSEAYQVTHDRRLADVARDILCYVERDLSHPSGGFYSAED 362

Query: 301 ADSAETEGATRKKEGAFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLS 351
           ADS    G   K+EGAF VW   EV  +L E          A +   +Y ++ +GN D  
Sbjct: 363 ADSYPEHGDKEKREGAFCVWEESEVYRLLTEPLPSCPTKTVADIVCRYYDIRKSGNVD-- 420

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
            M DPH+E K KNVLI      + A+  G+ +     +L   R  LF+ R +RP+PHLDD
Sbjct: 421 PMQDPHDELKRKNVLIVRESKESVAACYGLEVGVLDALLERARETLFEARLRRPKPHLDD 480

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K + SWNGL+IS FA A++ L         N PV       Y++ A     FI++HLY+ 
Sbjct: 481 KFLTSWNGLMISGFAIAARTL---------NQPV-------YLDRALKCVEFIKKHLYNP 524

Query: 472 QTHRLQHS-FR-------NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
           +   L  S +R        G     G L+DYAFLI  LLD+YE       L+WA ELQ+ 
Sbjct: 525 KKKTLIRSAYRGEDGSVVQGSQPIDGVLEDYAFLIQALLDVYEASFDVSCLMWAEELQDK 584

Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
           QD LF D++  GYF + GEDP+V+LR+K+D DGAEPS NSVS+ NLVRL+ ++   + D 
Sbjct: 585 QDRLFWDKKDMGYFLSNGEDPTVVLRLKDDQDGAEPSSNSVSLNNLVRLSVLL---QRDE 641

Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
            RQ AE   +V+  R+  + +A+P M C    L     + VV+ G +     + +L+   
Sbjct: 642 LRQRAEKLASVYGQRMILVPLALPEMVCGLMRLQA-GPQEVVIAGPRDDPGTKELLSCLR 700

Query: 644 ASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA-----DKVVALVCQNFSCSP 698
             +    TVI  D           +   N       NF        K  A VCQ+F CS 
Sbjct: 701 RHFLPFVTVILAD-----------QDPENPLRKRLTNFDGYTCVNGKPAAYVCQDFQCSK 749

Query: 699 PVTDPISLENLLLEK 713
           PVT    LE LL  K
Sbjct: 750 PVTTAAELEALLTAK 764


>gi|340370640|ref|XP_003383854.1| PREDICTED: spermatogenesis-associated protein 20 [Amphimedon
           queenslandica]
          Length = 741

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 311/738 (42%), Positives = 431/738 (58%), Gaps = 65/738 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVME ESFE + VAK+LND FVSIKVDREERPD
Sbjct: 36  GEEAFTKSRNENKPIFLSVGYSTCHWCHVMERESFESDTVAKVLNDHFVSIKVDREERPD 95

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA  G GGWP+SVFL+P+LKP +GGTYFPPED +  P F TIL  V + W 
Sbjct: 96  VDKVYMTFVQATQGSGGWPMSVFLTPELKPFLGGTYFPPEDSFRSPSFLTILNAVHEQWT 155

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGS 177
           K  D + Q     ++ L  A++ S+S N    +LP  A ++  AE L+  +DS++GGFG 
Sbjct: 156 KDHDNIKQKMNPLMKALQAAVAGSSSLNP---QLPGTACIQKAAEMLADRFDSKYGGFGQ 212

Query: 178 APKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           + KFP+PV + ++L  Y      + G    AS     VLFTL+ M+ GG+HDH+G GFHR
Sbjct: 213 SMKFPQPVILDLLLRIYARYPSSEMGDGALAS-----VLFTLEAMSNGGMHDHIGQGFHR 267

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           YS D  WHVPHFEKMLYDQ QL   YL A+ +TKD  +     DIL+Y+ RD+    G  
Sbjct: 268 YSTDPYWHVPHFEKMLYDQAQLVVTYLSAYQITKDDKFKETAVDILEYVLRDLGDKDGGF 327

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPT 345
           +SAEDADS    G   KKEGAF VWT +E++ IL +           A LF   + +K  
Sbjct: 328 YSAEDADSYRCHGDKEKKEGAFCVWTWEEIQSILLDPLPGGDTDKTLADLFSSRFGVKKG 387

Query: 346 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 405
           GN   ++  DPH E   +NVLI        +S+  + +E+  ++L E + +L+ +R++RP
Sbjct: 388 GNVRPNQ--DPHGELINQNVLIIKKSFEELSSEFSLEVEQVKSLLMEAKDRLYKMRAERP 445

Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
           +PH DDK++ +WNGL++S+ +RAS++L                   EY+E A+SAASFIR
Sbjct: 446 KPHRDDKILTAWNGLMVSALSRASQVLGG----------------SEYLERAKSAASFIR 489

Query: 466 RHLYD-EQTHRLQHSFRN-----GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 519
             LYD E++  L++++R+       S   GF DDYAFLI GL+DLYE      WL WA+E
Sbjct: 490 DSLYDKEKSVLLRNAYRDENDVLSVSTVEGFADDYAFLIRGLIDLYEASHDPLWLKWALE 549

Query: 520 LQNTQDELFLD------REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 573
           LQ  QD LFLD       E GGYF+T+G D S+LLR+K+  DGAEPS NSVS  NL+RL+
Sbjct: 550 LQEQQDRLFLDIKGEEGEEKGGYFSTSGMDDSILLRMKDGEDGAEPSANSVSAENLLRLS 609

Query: 574 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADMLSVPSRKHVVLVGHKSS 632
           S    S+    R  +E+    F + + +   A+  +  A    L  P  K V++VG  S 
Sbjct: 610 SFFDKSE---LRSKSENIFKTFNSSMMEHPPAMAALIGAFISYLQKP--KQVIIVGLISG 664

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            D + +L+  H+ +  NKT+I  DP+    +         +  M       DK    +C+
Sbjct: 665 DDTQALLSCIHSHFIPNKTLILHDPSSPSPLLMESLPLLKDMIMVD-----DKATVYLCE 719

Query: 693 NFSCSPPVTDPISLENLL 710
           ++ C+ P      L++++
Sbjct: 720 DYKCAAPTNSSTVLKDMI 737


>gi|385648253|ref|NP_001245301.1| spermatogenesis-associated protein 20 isoform 2 precursor [Homo
           sapiens]
 gi|311033529|sp|Q8TB22.3|SPT20_HUMAN RecName: Full=Spermatogenesis-associated protein 20; AltName:
           Full=Sperm-specific protein 411; Short=Ssp411; Flags:
           Precursor
          Length = 786

 Score =  534 bits (1375), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 425/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD+LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSVPITDPCELRKLL 784


>gi|193787397|dbj|BAG52603.1| unnamed protein product [Homo sapiens]
          Length = 742

 Score =  533 bits (1374), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 303/736 (41%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GEEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + +D L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKDTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSVPITDPCELRKLL 740


>gi|84040225|gb|AAI11030.1| SPATA20 protein [Homo sapiens]
 gi|119615009|gb|EAW94603.1| spermatogenesis associated 20, isoform CRA_a [Homo sapiens]
          Length = 786

 Score =  533 bits (1374), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSVPITDPCELRKLL 784


>gi|385648255|ref|NP_001245302.1| spermatogenesis-associated protein 20 isoform 3 [Homo sapiens]
          Length = 742

 Score =  533 bits (1373), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 425/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD+LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSVPITDPCELRKLL 740


>gi|31542723|ref|NP_073738.2| spermatogenesis-associated protein 20 isoform 1 precursor [Homo
           sapiens]
 gi|19263653|gb|AAH25255.1| Spermatogenesis associated 20 [Homo sapiens]
          Length = 802

 Score =  533 bits (1373), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 425/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD+LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSVPITDPCELRKLL 800


>gi|119615011|gb|EAW94605.1| spermatogenesis associated 20, isoform CRA_c [Homo sapiens]
          Length = 742

 Score =  533 bits (1372), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSVPITDPCELRKLL 740


>gi|426347561|ref|XP_004041418.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Gorilla
           gorilla gorilla]
          Length = 786

 Score =  533 bits (1372), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 299/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ + +    G
Sbjct: 318 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +    P      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTSPGGTVDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M CA       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSMPITDPCELRKLL 784


>gi|41351283|gb|AAH65526.1| SPATA20 protein [Homo sapiens]
          Length = 742

 Score =  532 bits (1371), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GEEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSVPITDPCELRKLL 740


>gi|119615010|gb|EAW94604.1| spermatogenesis associated 20, isoform CRA_b [Homo sapiens]
          Length = 802

 Score =  532 bits (1371), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSVPITDPCELRKLL 800


>gi|158257042|dbj|BAF84494.1| unnamed protein product [Homo sapiens]
          Length = 742

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GEEAFDKARKESKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSVPITDPCELRKLL 740


>gi|426347557|ref|XP_004041416.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Gorilla
           gorilla gorilla]
          Length = 802

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 299/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ + +    G
Sbjct: 334 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +    P      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTSPGGTVDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M CA       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSMPITDPCELRKLL 800


>gi|426347555|ref|XP_004041415.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Gorilla
           gorilla gorilla]
          Length = 742

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 299/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ + +    G
Sbjct: 274 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +    P      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTSPGGTVDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M CA       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSMPITDPCELRKLL 740


>gi|426347559|ref|XP_004041417.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Gorilla
           gorilla gorilla]
          Length = 786

 Score =  531 bits (1369), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 299/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ + +    G
Sbjct: 318 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +    P      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTSPGGTVDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M CA       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSMPITDPCELRKLL 784


>gi|343958896|dbj|BAK63303.1| SPATA20 protein [Pan troglodytes]
          Length = 742

 Score =  531 bits (1369), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF+DE + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQDEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM +VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSMPITDPCELRKLL 740


>gi|114669341|ref|XP_001170552.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Pan
           troglodytes]
 gi|397493180|ref|XP_003817490.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Pan
           paniscus]
          Length = 786

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 301/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM +VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSMPITDPCELRKLL 784


>gi|410051894|ref|XP_003953187.1| PREDICTED: spermatogenesis-associated protein 20 [Pan troglodytes]
          Length = 786

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 301/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM +VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSMPITDPCELRKLL 784


>gi|403279582|ref|XP_003931326.1| PREDICTED: spermatogenesis-associated protein 20 [Saimiri
           boliviensis boliviensis]
          Length = 742

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNALLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + +DIL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKDILQYVTRSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT+ EV+ +L E  +          LF +HY L 
Sbjct: 334 GFYSAEDADSPPERG-MRPKEGAYYVWTANEVQQLLPEPVLGATEPLTSGQLFMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISSSQDPKGELQGQNVLTVRYSLELTAARFGLDVEGVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +           S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTSSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSMPITDPCELRKLL 740


>gi|10437433|dbj|BAB15051.1| unnamed protein product [Homo sapiens]
          Length = 786

 Score =  530 bits (1365), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 301/736 (40%), Positives = 422/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D  YS + + IL Y+ R +    G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDELYSDVAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           + RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LERHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSVPITDPCELRKLL 784


>gi|114669347|ref|XP_001170636.1| PREDICTED: spermatogenesis-associated protein 20 isoform 7 [Pan
           troglodytes]
 gi|397493176|ref|XP_003817488.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Pan
           paniscus]
          Length = 742

 Score =  530 bits (1365), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 301/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM +VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSMPITDPCELRKLL 740


>gi|114669339|ref|XP_511882.2| PREDICTED: spermatogenesis-associated protein 20 isoform 8 [Pan
           troglodytes]
 gi|397493178|ref|XP_003817489.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Pan
           paniscus]
 gi|410211920|gb|JAA03179.1| spermatogenesis associated 20 [Pan troglodytes]
 gi|410266782|gb|JAA21357.1| spermatogenesis associated 20 [Pan troglodytes]
 gi|410349593|gb|JAA41400.1| spermatogenesis associated 20 [Pan troglodytes]
          Length = 802

 Score =  530 bits (1364), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 301/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM +VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSMPITDPCELRKLL 800


>gi|449479427|ref|XP_002191427.2| PREDICTED: spermatogenesis-associated protein 20 [Taeniopygia
           guttata]
          Length = 753

 Score =  529 bits (1363), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 290/709 (40%), Positives = 397/709 (55%), Gaps = 64/709 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF+ + +  ++N+ FV IKVDREERPDVDKVYMT+VQA  GGGGWP+S
Sbjct: 89  STCHWCHVMEEESFKSKEIGDIMNEHFVCIKVDREERPDVDKVYMTFVQATSGGGGWPMS 148

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDLKP  GGTYFPPED     GF+T+L ++ + W + +D L  S    +E L    
Sbjct: 149 VWLTPDLKPFAGGTYFPPEDGVNHVGFRTVLLRIAEQWKENKDALLGSSQRILEALRHTS 208

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                    P    +  +  C +QLS+SYD  +GGF   PKFP PV +  +  +    + 
Sbjct: 209 EIRVQGQASPPP-AKEVMDTCFQQLSRSYDEEYGGFSKCPKFPSPVNLNFLFTYWALHQT 267

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T    E +   +M L TL+ MA GGIHDH+G GFHRYS+D+ WHVPHFEKMLYDQGQLA 
Sbjct: 268 T---PEGARALQMALHTLKMMALGGIHDHIGQGFHRYSIDQHWHVPHFEKMLYDQGQLAA 324

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  AF ++ D F++ + RDIL Y+ RD+    G  +SA+DADS  T  +  K+EGAF V
Sbjct: 325 IYSKAFQISGDEFFADVVRDILLYVSRDLSDQAGGFYSAQDADSYPTTTSREKREGAFCV 384

Query: 320 WTSKEVEDILGEH----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           W +KE+  +L +           A +F  HY +K  GN D +R  DP+ E KGKNVLI  
Sbjct: 385 WAAKELRALLPDPVEGATEGTTLADVFMHHYGVKEAGNVDPAR--DPYQELKGKNVLIVR 442

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                +A+K G+   +   +L EC+++L   R++RP+PHLD K++ +WNGL+IS FA+A 
Sbjct: 443 CAPELTAAKFGLEPGRLSTLLQECQQRLSSARAQRPQPHLDTKMLAAWNGLMISGFAQAG 502

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL--------QHSFR 481
             L  +                 Y+  A  AA+F+R HL+D  + +L         +S  
Sbjct: 503 AALSEQG----------------YVSRAAQAAAFLRTHLFDPDSGKLLRSCYQGMHNSVE 546

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
            G     GFL+DY F+I  L DLYE      WL WA+ LQ+ QD+LF D +G  YF+T  
Sbjct: 547 QGAVPIQGFLEDYVFVIQALFDLYEVSLEQGWLEWALHLQHMQDKLFWDPKGFAYFSTEA 606

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
            DPS+LLR+K+D DGAEP+ NSV+V NL               +Q     L     R+  
Sbjct: 607 SDPSLLLRLKDDQDGAEPAPNSVAVTNLRE------------KKQTRSEQL-----RVPM 649

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
           + + VP M     +    + K VV+ G     D + ML    + +  NK ++    AD +
Sbjct: 650 ITVVVPEMLRTTAVFH-HTLKQVVICGDPQGEDTKEMLHCVRSVFSPNKVLM---VADGD 705

Query: 662 EMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
              F        AS+ R +    K  A VC NF+CS PVT    L  +L
Sbjct: 706 NAGFLYRQLPFLASLERKD---GKATAYVCSNFTCSLPVTSVQELRGML 751


>gi|134085853|ref|NP_001076876.1| spermatogenesis-associated protein 20 [Bos taurus]
 gi|133777605|gb|AAI23690.1| SPATA20 protein [Bos taurus]
 gi|296476477|tpg|DAA18592.1| TPA: spermatogenesis associated 20 [Bos taurus]
          Length = 789

 Score =  529 bits (1363), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 420/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP+SV+L+PDL+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMSVWLTPDLQPFVGGTYFPPEDGLTRVGFRTVLMRIRDQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + +  L ++     ++++ AL A ++ +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 210 QNKSTLLENS----QRVTTALLARSAISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QL   Y  AF ++ D FYS + + IL Y+ R++    G
Sbjct: 321 HRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSEVAKGILQYVVRNLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGAFYVWT KEV+ +L E  +          L  +HY L 
Sbjct: 381 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEPVLGATEPLTSGQLLMKHYGLT 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 440 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S FA    +L  E    + N+ + G             A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RVINYAING-------------AKF 541

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 542 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 601

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 602 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 717

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    AD +   F         ++ R     D+  A VC+N 
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRRLE---DRATAYVCENQ 771

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+T+P  L  +L
Sbjct: 772 ACSMPITEPCELRKVL 787


>gi|440910483|gb|ELR60277.1| Spermatogenesis-associated protein 20 [Bos grunniens mutus]
          Length = 789

 Score =  529 bits (1362), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 302/736 (41%), Positives = 420/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP+SV+L+PDL+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMSVWLTPDLQPFVGGTYFPPEDGLTRVGFRTVLMRIRDQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + +  L ++     ++++ AL A ++ +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 210 QNKSTLLENS----QRVTTALLARSAISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QL   Y  AF ++ D FYS + + IL Y+ R++    G
Sbjct: 321 HRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSEVAKGILQYVVRNLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGAFYVWT KEV+ +L E  +          L  +HY L 
Sbjct: 381 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEPVLGATEPLTSGQLLMKHYGLT 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 440 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S FA    +L  E    + N+ + G             A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RVINYAING-------------AKF 541

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 542 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 601

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 602 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 717

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    AD +   F         ++ R     D+  A VC+N 
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRRLE---DRATAYVCENQ 771

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+T+P  L  +L
Sbjct: 772 ACSMPITEPCELRKVL 787


>gi|350406875|ref|XP_003487911.1| PREDICTED: spermatogenesis-associated protein 20-like [Bombus
           impatiens]
          Length = 831

 Score =  529 bits (1362), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 293/721 (40%), Positives = 410/721 (56%), Gaps = 63/721 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF ++ +A+++N  F++IKVD+EERPD+D++YMT++QA  G GGWP+S
Sbjct: 146 STCHWCHVMEKESFTNKEIAEIMNKNFINIKVDKEERPDIDRIYMTFIQATSGHGGWPMS 205

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+ DLKP++GGTYFPPED + + GFKTIL  V   W++ R  L + G+  +E L  ++
Sbjct: 206 VFLTTDLKPIVGGTYFPPEDTFRQTGFKTILLSVAQKWNQSRSKLTEIGSTNLETL-HSI 264

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHS 194
           S    S K+ D       ++C +QL   ++ +FGGFGS     +PKFP+PV     L+H 
Sbjct: 265 SKIPDSLKVHDIPSLECSKICIQQLVNEFEPKFGGFGSTYNMQSPKFPQPVNFN-FLFHM 323

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
              +   +S        M ++TL+ M+ GGIHDHVG GF RY+ D  WHVPHFEKMLYDQ
Sbjct: 324 YARQPNVES--VRPCLYMSVYTLKRMSFGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQ 381

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           GQL   Y DA+ +TKD +++ I  DI  Y+ RD+    G  +SAEDADS        KKE
Sbjct: 382 GQLMKSYADAYLVTKDNYFAEIVDDIATYVIRDLRHKEGGFYSAEDADSYPMHDTHAKKE 441

Query: 315 GAFYVWTSKEVEDILGEHAI---------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
           GAFYVW++ E++ +L +            +F  H+ +  +GN  +    DPH E   KNV
Sbjct: 442 GAFYVWSAMEIKSLLNKEVSDENHVKLSDIFCRHFNVNESGN--VKSHQDPHGEMGQKNV 499

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
           LI  N+   +A    +P+E+    L E    L+ VRS RPRPHLDDK+I SWNGL+IS  
Sbjct: 500 LIAYNEIEETARYFNLPIEETKMYLKEACSMLYKVRSARPRPHLDDKIITSWNGLMISGL 559

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS------ 479
           A                F     + K+Y+E A  AA FI+ +L+DE  + L HS      
Sbjct: 560 A----------------FGGAAVNNKQYIEHAADAAKFIKEYLFDETKNILLHSCYRDEK 603

Query: 480 --FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
                  +  PGFLDDYAF+I GLLDLYE     +WL +A +LQ+ QD+ F D   GGYF
Sbjct: 604 GTITQMSTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQHLQDQYFWDETNGGYF 663

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            TT  DPS++LR+KE +DGAEPSGNS++  NL+RLA  +     D ++  A      F  
Sbjct: 664 LTTSSDPSIILRLKEVYDGAEPSGNSIAAENLLRLADYLG---CDEFKDKAARLFGAFRY 720

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENMLAAAHASYDLNKTV 652
            L    +AVP       + S   R H     + +VG + + D + +L   +     N+ +
Sbjct: 721 LLMQRPVAVP------QLTSALVRYHDDAAQIYVVGKRGAKDTDELLRVIYKRLIPNRIL 774

Query: 653 IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
           + IDP +T  +   +  +  N     N     +    VC++ +CS PVT P  L  LL E
Sbjct: 775 LLIDPDETNSVLLRKNQHLRNMKSLNN-----RTTVYVCKHRTCSLPVTSPEQLATLLDE 829

Query: 713 K 713
           +
Sbjct: 830 Q 830


>gi|47211932|emb|CAF92441.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 833

 Score =  528 bits (1361), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 288/668 (43%), Positives = 387/668 (57%), Gaps = 69/668 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE + K+LND FV IK+DREERPDVDKVYMT+VQA  GGGGWP+S
Sbjct: 46  STCHWCHVMERESFEDEEIGKILNDNFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMS 105

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL+P +GGTYFPP D  GRPG KT+L ++ D W   R  L  +G   +E L +  
Sbjct: 106 VWLTPDLRPFIGGTYFPPRDHGGRPGLKTVLMRIIDQWRNNRPTLESNGNKILEALRKGT 165

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           + ++ +   P   P  A R C +QL+ SY+  +GGF  APKFP PV +  ++ +      
Sbjct: 166 AIASDAGSSPAFAPDVAKR-CFQQLANSYEEEYGGFREAPKFPSPVNLMFLMSYWCVNRS 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T    E  E  +M L TL+ MA GGI+DHV  GFHRYS D  WHVPHFEKMLYDQ QLA 
Sbjct: 225 TS---EGVEALQMALHTLRMMALGGINDHVSQGFHRYSTDSSWHVPHFEKMLYDQAQLAV 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y+ A   + + FY+ + +D+L Y+ RD+    G  +SAEDADSA   G   K+EGAF +
Sbjct: 282 AYITASQASGEQFYADVAKDVLRYVSRDLSDKSGGFYSAEDADSAPPSGGAEKREGAFCI 341

Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           WT+ EV ++L             A +F  HY +K  GN  +S   DPH E +G+NVLI  
Sbjct: 342 WTASEVRELLPDVVKGASASATQADIFMHHYGVKEQGN--VSPEQDPHGELQGQNVLIVR 399

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                +A+  G+ +E+   +L   R K+  VR  RPRPHLD K++ SWNGL++S++AR  
Sbjct: 400 YSLELTAAHFGISVEEVSALLASARAKMAAVRKSRPRPHLDTKMLASWNGLMLSAYARVG 459

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD-EQTHRLQHSF-------- 480
            +L                  K  +E A  AA+F++ HL+D EQ   L+  +        
Sbjct: 460 AVLGD----------------KTLLERAAQAANFLQEHLWDPEQQIVLRSCYLGDNMELQ 503

Query: 481 ----------------------RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAI 518
                                 R+ P    GFLDDYAF+I GLLDL+E    T+WL WA 
Sbjct: 504 QMTIKLNLPELSNENNYETVTQRSQPIS--GFLDDYAFIICGLLDLHEATLQTEWLRWAE 561

Query: 519 ELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
           ELQ  QD+LF D +GGGYF +   D +VLL++KED DGAEPS NSVS  NL+RL+     
Sbjct: 562 ELQLRQDKLFWDEQGGGYFCSDPSDSTVLLQLKEDQDGAEPSANSVSAFNLLRLSHYTGR 621

Query: 579 SKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM 638
            +   + Q ++  LA F  RL    +A+P M  A  M    + K +V+ G + S D   +
Sbjct: 622 QE---WLQKSQRLLAAFTDRLTRAPIALPEMVRAL-MAQHYTLKQIVICGQRDSPDTAAL 677

Query: 639 LAAAHASY 646
           L+  ++ +
Sbjct: 678 LSTVNSLF 685


>gi|297700798|ref|XP_002827419.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Pongo
           abelii]
          Length = 786

 Score =  528 bits (1360), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDMAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSMPITDPCELRKLL 784


>gi|402899623|ref|XP_003912790.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Papio
           anubis]
          Length = 786

 Score =  528 bits (1360), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 318 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LISYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTGSGGTVEHSSPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSMPITDPCELRKLL 784


>gi|297700802|ref|XP_002827421.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Pongo
           abelii]
          Length = 742

 Score =  528 bits (1360), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDMAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSMPITDPCELRKLL 740


>gi|402899619|ref|XP_003912788.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Papio
           anubis]
          Length = 742

 Score =  528 bits (1360), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LISYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGSGGTVEHSSPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSMPITDPCELRKLL 740


>gi|410298424|gb|JAA27812.1| spermatogenesis associated 20 [Pan troglodytes]
          Length = 802

 Score =  528 bits (1359), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 301/736 (40%), Positives = 422/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFLI---NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL     TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGSPTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM +VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSMPITDPCELRKLL 800


>gi|109114323|ref|XP_001099418.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Macaca
           mulatta]
          Length = 786

 Score =  528 bits (1359), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 318 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 539 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 769 ACSMPITDPCELRKLL 784


>gi|109114325|ref|XP_001099321.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Macaca
           mulatta]
          Length = 742

 Score =  528 bits (1359), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 218

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 274 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 333

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 334 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 495 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 725 ACSMPITDPCELRKLL 740


>gi|332246333|ref|XP_003272309.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
           20 [Nomascus leucogenys]
          Length = 802

 Score =  528 bits (1359), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEKESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLAPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L +S     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLESS----QRVTTALLARSEISVGDRQLPPSAATMSNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADS    G    KEGA+YVWT KE + +L E             L  +HY L 
Sbjct: 394 GFYSAEDADSPPERGMX-PKEGAYYVWTVKEFQQLLPEPVPGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD+K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDNKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLIRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD+LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M CA       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVRCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSMPITDPCELRKLL 800


>gi|355753994|gb|EHH57959.1| hypothetical protein EGM_07713, partial [Macaca fascicularis]
          Length = 777

 Score =  527 bits (1358), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 78  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 137

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 138 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 197

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 198 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 253

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 254 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 308

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 309 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 368

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 369 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 427

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 428 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 485

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 486 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 529

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 530 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 589

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 590 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 649

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 650 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 705

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 706 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 759

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 760 ACSMPITDPCELRKLL 775


>gi|402899621|ref|XP_003912789.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Papio
           anubis]
          Length = 802

 Score =  527 bits (1358), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LISYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTGSGGTVEHSSPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSMPITDPCELRKLL 800


>gi|297700800|ref|XP_002827420.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Pongo
           abelii]
          Length = 802

 Score =  527 bits (1358), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDMAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSMPITDPCELRKLL 800


>gi|109114321|ref|XP_001099622.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Macaca
           mulatta]
 gi|355568523|gb|EHH24804.1| hypothetical protein EGK_08527 [Macaca mulatta]
          Length = 802

 Score =  527 bits (1357), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F        +++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 785 ACSMPITDPCELRKLL 800


>gi|182413448|ref|YP_001818514.1| hypothetical protein Oter_1630 [Opitutus terrae PB90-1]
 gi|177840662|gb|ACB74914.1| protein of unknown function DUF255 [Opitutus terrae PB90-1]
          Length = 751

 Score =  527 bits (1357), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 310/746 (41%), Positives = 409/746 (54%), Gaps = 60/746 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F      ++  FL     TCHWCHVM  ESFE+E VA+LLN+ FV+IKVDREERPD
Sbjct: 27  GEAAFAKARAEQKPIFLSIGYATCHWCHVMAHESFENEAVAQLLNESFVAIKVDREERPD 86

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD+VYMTYVQA+ G GGWPLS +L+PDLKP  GGTYFPPED+ GR GF  ILR +   W 
Sbjct: 87  VDRVYMTYVQAMTGHGGWPLSAWLTPDLKPFFGGTYFPPEDRQGRAGFAAILRAIAHGWS 146

Query: 119 KKRDMLAQSGAFAIEQLSE--------------ALSASASSNKLPDELPQN-------AL 157
            +R+ L   G   I  L E                SA A      D L          A 
Sbjct: 147 TEREKLVAEGERVIAALREHQQSKTADVSKSTGGESAGAEIGSGIDALIHQLHERGAPAF 206

Query: 158 RLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA-SEGQKMVLFT 216
               +   +++D   GGFG APKFPR   +   L+ +  L+  G + EA +E  ++   T
Sbjct: 207 ERGFQYFYEAFDPEHGGFGGAPKFPRASNLS-FLFRAAALQ--GVASEAGAEAIRLASAT 263

Query: 217 LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYI 276
           LQ MA+GGIHDHVGGGFHRYSVDERW VPHFEKMLYDQ Q+A   L+A   T D  ++++
Sbjct: 264 LQAMARGGIHDHVGGGFHRYSVDERWFVPHFEKMLYDQAQIALNALEAKQATGDERFAWL 323

Query: 277 CRDILDYLRRDMIGPGGEIFSAEDADSAETEG----ATRKKEGAFYVWTSKEVEDILGEH 332
            RDIL Y+ RD+  P G  +SAEDADSA          +K EGAFYVW   E+E +LG+ 
Sbjct: 324 ARDILTYVLRDLAHPDGGFYSAEDADSAAANAEPGHGGKKVEGAFYVWAQSEIEQVLGDE 383

Query: 333 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 392
           A L  EH+ +KP GN  +    DPH EF GKNVL +    + +A    +  E     L  
Sbjct: 384 ARLVCEHFGVKPDGN--VPGQLDPHGEFTGKNVLAQAQPLATTAKAHELTPEMASERLQA 441

Query: 393 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 452
              +L  VR++RPRP  DDK+I +WNGL+IS+ A+A  +L+   ++A             
Sbjct: 442 ALERLRAVRAQRPRPLRDDKIITAWNGLMISALAKAHVVLELAEDAA----------ETL 491

Query: 453 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 512
           Y+  A   A F+ R L+D     L  S+R G S   GF +DYAF+I GLLDLYE G   +
Sbjct: 492 YLGAATRTAEFVERELFDRDRAILFRSWRGGRSAVEGFAEDYAFMIQGLLDLYEAGFDVR 551

Query: 513 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 572
           WL WA  LQ T D  F D E GGYFN+  +DP ++LR+KED+DGAEP+ +SV+ +NL+RL
Sbjct: 552 WLQWAERLQATMDARFWDAEHGGYFNSASDDPHLVLRLKEDYDGAEPAPSSVAAMNLLRL 611

Query: 573 ASIVAGSKSDY------YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
             ++    +        YR+    ++  F+ +      A+P M CA +   +P   HVVL
Sbjct: 612 GVMIERPGAAAAAGGIDYRERGLRTILAFQEQWSQTPQALPQMLCALERALMPP-AHVVL 670

Query: 627 VGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN--NASMARNNFSAD 684
            G      F  +L            ++    AD  E   W    +        RN     
Sbjct: 671 AGQPGDEAFRALLRVVQGRLGSQHVLL---VADGGEGQRWLSARAPWLTTMTPRNG---- 723

Query: 685 KVVALVCQNFSCSPPVTDPISLENLL 710
           +  A VC++F+C  PV  P +L +LL
Sbjct: 724 QATAYVCEDFTCQAPVESPAALRDLL 749


>gi|350590464|ref|XP_003483066.1| PREDICTED: spermatogenesis-associated protein 20-like [Sus scrofa]
          Length = 749

 Score =  526 bits (1354), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 300/736 (40%), Positives = 418/736 (56%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 50  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 109

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP+SV+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 110 VDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 169

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + +  L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 170 QNKKTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 225

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 226 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 280

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QL   Y  AF ++ D FYS + + IL Y+ R++    G
Sbjct: 281 HRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSDVAKGILQYVARNLSHRSG 340

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLK 343
             +SAEDADS    G  R KEGAFY+WT KEV+ +L EH            L  +HY L 
Sbjct: 341 GFYSAEDADSPPERG-MRPKEGAFYLWTVKEVQQLLPEHVPGATEPLTSGQLLMKHYGLT 399

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 400 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVQTLLNTGLEKLFQARKH 457

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S FA    +L  E    + N+ + G             A F
Sbjct: 458 RPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RLINYAING-------------AKF 501

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DY F++ GLLDLYE    + WL 
Sbjct: 502 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYTFVVRGLLDLYEASQESAWLE 561

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 562 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 621

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 622 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 677

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    AD +   F         ++ R     D+  A VC+N 
Sbjct: 678 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLGTLRRLE---DRATAYVCENQ 731

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+T+P  L  LL
Sbjct: 732 ACSMPITEPCELRKLL 747


>gi|344285393|ref|XP_003414446.1| PREDICTED: spermatogenesis-associated protein 20 [Loxodonta
           africana]
          Length = 789

 Score =  525 bits (1352), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 303/738 (41%), Positives = 422/738 (57%), Gaps = 65/738 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP+SV+L+P+L+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + R+ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 210 QNRNTLLENS----QRVTAALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S ++   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLSHRITQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +W VPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 321 HRYSTDRQWLVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVSRSLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGAFY+WT KE++ +L E  +          L  +HY L 
Sbjct: 381 GFYSAEDADSPPERG-MRPKEGAFYLWTVKEIQQLLPEPVLGASEPLTSGQLLTKHYGLT 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF VR  
Sbjct: 440 EAGN--ISPNQDPKGELQGQNVLNVRYSLELTAARFGLDVEAVRTLLNLGLEKLFQVRKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RPRPHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 498 RPRPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GMDR--LINCAINGAKF 541

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  T RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 542 LKRHMFDVATGRLMRTCYAGSGGTVEHSDPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 601

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 602 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 717

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F         ++ R     D+  A VC+N 
Sbjct: 718 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRRLE---DQATAYVCENQ 771

Query: 695 SCSPPVTDPISLENLLLE 712
           +CS P+T+P  L  LLL+
Sbjct: 772 ACSMPITEPCELRKLLLQ 789


>gi|449283068|gb|EMC89771.1| Spermatogenesis-associated protein 20, partial [Columba livia]
          Length = 682

 Score =  525 bits (1352), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 286/671 (42%), Positives = 395/671 (58%), Gaps = 53/671 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCHVME ESF+++ + ++++  FV IKVDREERPD
Sbjct: 44  GQEAFDKAKKENKLIFLSVGYSTCHWCHVMEEESFKNKEIGEIMSKNFVCIKVDREERPD 103

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+  A  GGGGWP+SV+L+PDLKP  GGTYFPPED   R GF+T+L ++ + W 
Sbjct: 104 VDKVYMTF--ATSGGGGWPMSVWLTPDLKPFAGGTYFPPEDGVHRVGFRTVLLRIAEQWK 161

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           + +D L +S    +E L           + P    +  +  C +QLS SYD  +GGF  +
Sbjct: 162 ENKDSLLESSRKILEALQHVSEIRVRGQESPPP-SKEVMATCFQQLSNSYDEDYGGFSKS 220

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP PV +   L+    L  T  + E +   +M L TL+ MA GGIHDH+  GFHRYS 
Sbjct: 221 PKFPSPVNLNF-LFTYWALHRT--TPEGARALQMALHTLKMMAHGGIHDHIDQGFHRYST 277

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+ WHVPHFEKMLYDQGQLA  Y  AF ++ D F++ + +DIL Y+ RD+    G  +SA
Sbjct: 278 DQHWHVPHFEKMLYDQGQLAATYSRAFQISGDQFFADVAQDILLYVSRDLSDQAGGFYSA 337

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPTGNC 348
           EDADS  T  +  K+EGAF VW ++E+  +L +             +F  HY +K TGN 
Sbjct: 338 EDADSYPTTASKEKREGAFCVWAAEEIRALLPDPVEGATEGTTLGDVFMHHYGVKETGN- 396

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
            +S M DPH E KGKNVLI       +A++ G+ L +   +L E R++L   R++RPRPH
Sbjct: 397 -VSPMQDPHQELKGKNVLIVRCSPEVTAAQFGLELGRLGAVLQEGRQRLSTARAQRPRPH 455

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
           LD K++ +WNGL+IS FA+A  +L                D++EY+  A  AA+F+R+HL
Sbjct: 456 LDTKMLAAWNGLMISGFAQAGTVL----------------DKQEYVSRAAQAAAFLRKHL 499

Query: 469 YDEQTHRLQHSFRNG------PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 520
           +D  + RL  S   G       S  P  GFL+DY F+I  L DLYE      WL WA++L
Sbjct: 500 FDPTSGRLLRSCYRGRDNTVEQSAVPIQGFLEDYVFVIQALFDLYEASLEQDWLEWALQL 559

Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
           Q+ QD+LF D +G  YF++   DPS+LLR+K D DGAEP+ NSV+V NL+R A   A  +
Sbjct: 560 QHMQDKLFWDSKGFAYFSSEAGDPSLLLRLKGDQDGAEPTANSVTVTNLLRAACYSAHME 619

Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 640
              + + A   LA F  RL+     +P+M  A  +    + K V++ G     D + ML 
Sbjct: 620 ---WVEKAGQILAAFSERLQK----IPIMARATAVFH-HTLKQVIICGDPQGEDTKEMLR 671

Query: 641 AAHASYDLNKT 651
             H+ +  NK 
Sbjct: 672 CVHSVFSPNKV 682


>gi|73966409|ref|XP_548202.2| PREDICTED: spermatogenesis-associated protein 20 [Canis lupus
           familiaris]
          Length = 789

 Score =  524 bits (1350), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 295/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E +  LLN+ FVS+KVDREERPD
Sbjct: 90  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISMGDRQVPPSAATMNSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R++    G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARNLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R +EGAFYVWT KEV+++L E  +          L  +HY L 
Sbjct: 381 GFYSAEDADSPPERG-MRPREGAFYVWTVKEVQNLLPEPVLGATEPLTSGQLLMKHYGLT 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ ++    +L     KLF  R  
Sbjct: 440 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L  E    + N+ + G             A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGYAVTGAVLGQE---RLINYAING-------------AKF 541

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 542 LKRHMFDVASGRLMRTCYAGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 601

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+R+  
Sbjct: 602 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRMHG 661

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKD 717

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    A+ +   F        +++ R     D+  A VC++ 
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ANGDPSSFLSRQLPFLSTLRRLE---DRATAYVCEDQ 771

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+T+P  L  LL
Sbjct: 772 ACSMPITEPCELRKLL 787


>gi|410349595|gb|JAA41401.1| spermatogenesis associated 20 [Pan troglodytes]
          Length = 802

 Score =  523 bits (1347), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 299/736 (40%), Positives = 421/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM +VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 163 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 394 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I  D   +  +  W    S   ++ R     D+  A VC+N 
Sbjct: 731 TKALVQCVHSVYIPNKVLILADGDPSSFLSHWLPFLS---TLRRQE---DQATASVCENQ 784

Query: 695 SCSPPVTDPISLENLL 710
           +CS  +TD   L  LL
Sbjct: 785 ACSMLITDTCELRKLL 800


>gi|380028980|ref|XP_003698161.1| PREDICTED: spermatogenesis-associated protein 20 [Apis florea]
          Length = 746

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 284/722 (39%), Positives = 415/722 (57%), Gaps = 65/722 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF+++ +A ++N  F++IKVD+EERPD+D++YMT+VQA  G GGWP+S
Sbjct: 61  STCHWCHVMEKESFKNKEIAIIMNKNFINIKVDKEERPDIDRIYMTFVQATTGHGGWPMS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP+ GGTYFPPED   + GFKTIL  +   W++ +  + ++G+  +E L + +
Sbjct: 121 VFLTPDLKPIFGGTYFPPEDTSRQTGFKTILLSIAQKWNQSKTKINEAGSTNLEIL-QNI 179

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHS 194
           S    ++KL D        +C +QL   ++ +FGGFGS     +PKFP+PV    + +  
Sbjct: 180 SKIPHTSKLHDIPSLECSEICIQQLENEFEPKFGGFGSIYNMQSPKFPQPVNFNFLFHMY 239

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            +  +   +  A     M ++TL+ M+ GGIHDHVG GF RY+ D  WHVPHFEKMLYDQ
Sbjct: 240 ARQPN---ADLARLCLHMCVYTLKKMSYGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQ 296

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            QL   Y DA+  TK+ +++ I  DI  Y+ RD+    G  +SAEDADS  T  A+ KKE
Sbjct: 297 AQLMKSYADAYLATKNNYFAEIVNDIATYVIRDLRHKEGGFYSAEDADSYPTYDASAKKE 356

Query: 315 GAFYVWTSKEVEDILGEHAIL-----------FKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
           GAFY+WT+ E++ +L +  +L           F  H+ +K  GN  +    DPH E +GK
Sbjct: 357 GAFYIWTAIEIKSLLNKELLLSNEKHIKLSDIFCHHFNIKELGN--IKSYQDPHGELEGK 414

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
           NVLI  N+   +A    +P+E+    L E    L+  RS RPRPHLDDK+I +WNGL+IS
Sbjct: 415 NVLIMYNEIEETAKHFNLPVEEVKMHLMEACSILYKARSTRPRPHLDDKIITAWNGLMIS 474

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS---- 479
             A                F     + K+Y++ A  A  FI+R+L+D+  + L HS    
Sbjct: 475 GLA----------------FGGTAVNNKQYVKYAVDAIKFIKRYLFDKTKNILLHSCYRD 518

Query: 480 ----FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
                    +  PGFLDDYAF+I GLLDLYE     +WL +A +LQ+ QD+ F D   GG
Sbjct: 519 EKNIITQMSTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQDLQDQFFWDETNGG 578

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           YF+TT  DPS++LR+KE +DGAEPSGNS++  NL+RLA  +  S+   ++  A      F
Sbjct: 579 YFSTTSNDPSIILRLKEAYDGAEPSGNSIAAENLLRLADYLGRSE---FKDKAVRLFGTF 635

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENMLAAAHASYDLNK 650
              L    +++P       ++S   R H     + +VG +++ D +++L+  +      +
Sbjct: 636 RHLLIKRPVSIP------QLVSALIRYHDDATQIYVVGKRNAKDTDDLLSVIYKRLIPGR 689

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            +  ID   T  + F +  +  N     N     +    +C++ +CS PVT+   L  LL
Sbjct: 690 ILFLIDHDKTNSILFRKNEHFRNMKPVNN-----QTTVYICKHCTCSLPVTNSEQLAILL 744

Query: 711 LE 712
            E
Sbjct: 745 DE 746


>gi|171910219|ref|ZP_02925689.1| hypothetical protein VspiD_03585 [Verrucomicrobium spinosum DSM
           4136]
          Length = 723

 Score =  522 bits (1345), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 288/693 (41%), Positives = 398/693 (57%), Gaps = 34/693 (4%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E  A++LN+ F+SIKVDREERPDVD  YMTY QA+ GGGGWPL+
Sbjct: 61  STCHWCHVMERESFENEETAQVLNEHFISIKVDREERPDVDLTYMTYAQAVSGGGGWPLN 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
           V+L+P+LKP   GTYFPPED+ GR GF+ +  K+ + W D +  ++ +SGA AI++L E 
Sbjct: 121 VWLTPELKPFFAGTYFPPEDRGGRMGFRALCLKIAEVWKDDRAGVMERSGA-AIQKLQEY 179

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +      +  P +     ++   + +S ++D   GGF  APKFPRPV + ++    K L 
Sbjct: 180 IEDEQKHHDAPFDA---VMKKAYDDVSNAFDYHEGGFSGAPKFPRPVTLNLLGRLKKHLA 236

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
              +  E++    M   TL CMA GGI DHVGGGFHRYSVD  WHVPH+EKMLYDQ QL 
Sbjct: 237 LKKEESESNWAVAMGKTTLTCMANGGIRDHVGGGFHRYSVDGYWHVPHYEKMLYDQAQLL 296

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y++    T    ++ I R+I++Y++RD+  P G  +SAEDADS   +  T K EGAFY
Sbjct: 297 TAYVEGHQHTGLKSFAAIAREIVEYVKRDLRHPEGAFYSAEDADSYTDDTRTTKGEGAFY 356

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VW + E++++LG E   +F+  Y  +  GN      SDPH E KG N L        +A 
Sbjct: 357 VWKAAEIDELLGKEEGSIFRYAYGARRDGNARPE--SDPHEELKGLNTLFRAYSPKKTAE 414

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
              +  +K   IL   R+ LF+ R KRP PHLDDKV+ +WNGL+IS  ARA+  L     
Sbjct: 415 YFKLEEDKVAEILERGRKVLFEAREKRPHPHLDDKVLTAWNGLMISGLARAAGAL----- 469

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      +   ++E+A  +A FI  HL D+ ++ L+ S+R G S   GF  DYA L
Sbjct: 470 -----------NEPSFLELATQSAQFIYDHLSDKGSN-LRRSWREGVSTVHGFASDYALL 517

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I GLLDLYE G   KWL WA  LQ   +  + D E GGYF+ +   P+ +L+VKED+D A
Sbjct: 518 IQGLLDLYEAGFDVKWLQWAAALQEEFETKYGDPEKGGYFSVSKAIPNSVLQVKEDYDSA 577

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
           EPS NSV+ +NL RLA ++A    +  R+     L +F   L++    VP M  A D  S
Sbjct: 578 EPSPNSVAAMNLFRLARMLA---REDLRERGAKVLRLFGKSLEESPFTVPAMVAALD-FS 633

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
                 +VL G K    F+ +  A  + Y  +  ++H D    +         + N ++ 
Sbjct: 634 HYGEVEIVLAGSKDDAGFQTLATAVRSRYLPHAVLLHADGGAGQAF-----LATRNEALG 688

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             N    +  A VC+N  C  PVT   +L+ +L
Sbjct: 689 AMNPVNGQAAAYVCRNRVCQSPVTTVEALKGIL 721


>gi|328781619|ref|XP_393124.4| PREDICTED: spermatogenesis-associated protein 20 [Apis mellifera]
          Length = 804

 Score =  522 bits (1344), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 283/716 (39%), Positives = 413/716 (57%), Gaps = 53/716 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCH+ME ESF+++ +A ++N  F++IKVD+EERPD+D++YMT+VQA  G GGWP+S
Sbjct: 120 STCHWCHIMEKESFKNKEIAIIMNKNFINIKVDKEERPDIDRIYMTFVQATTGHGGWPMS 179

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP+ GGTYFPPED   + GFKTIL  +   W++ +  + ++G+  +E L + +
Sbjct: 180 VFLTPDLKPIFGGTYFPPEDTSRQTGFKTILLSIAQKWNQSKTKINEAGSTNLEIL-QNI 238

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHS 194
           S    ++KL D       ++C +QL   ++ +FGGFGS     +PKFP+PV     L+H 
Sbjct: 239 SKIPHTSKLHDIPSLECSKICIQQLENEFEPKFGGFGSTYNMQSPKFPQPVNFN-FLFHM 297

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
              +  G    A     M ++TL+ M+ GGIHDHVG GF RY+ D  WHVPHFEKMLYDQ
Sbjct: 298 YARQPNGDL--ARLCLHMCVYTLKKMSYGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQ 355

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            QL   Y DA+  TK+ +++ I  DI  Y+ RD+    G  +SAEDADS  T  A+ KKE
Sbjct: 356 AQLMKSYADAYLATKNNYFAEIVNDIATYVIRDLRHKEGGFYSAEDADSYPTYDASAKKE 415

Query: 315 GAFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
           GAFYVWT+ E++ +L +          + +F  H+ +K  GN  +    DPH E +GKNV
Sbjct: 416 GAFYVWTAMEIKSLLNKELSDEKHIKLSDVFCHHFNIKELGN--IKSYQDPHGELEGKNV 473

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
           LI  N+   +A    +P+E+    L E    L+  RS RPRPHLDDK+I +WNGL+IS  
Sbjct: 474 LIMYNEIEETAKHFNLPVEEMKMHLMEACSILYKARSTRPRPHLDDKIITAWNGLMISGL 533

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS------ 479
           A                F     + K+Y+E A  A  FI+R+L+D+  + L HS      
Sbjct: 534 A----------------FGGTAVNNKQYIEYAVDAIKFIKRYLFDKTKNILLHSCYRDEK 577

Query: 480 --FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
                  +  PGFLDDYAF+I GLLDLYE     +WL +A +LQ+ QD+ F D    GYF
Sbjct: 578 NIITQMSTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQDLQDQFFWDETNAGYF 637

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
           +TT  D S++LR+KE +DGAEPSGNS++  NL+RLA  +  S+    +  A      F  
Sbjct: 638 STTSNDLSIILRLKEAYDGAEPSGNSIAAENLLRLADYLGRSE---LKDKAVRLFGTFRH 694

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            L    +++P +  A  +        + +VG +++ D +++L+  +      + +  ID 
Sbjct: 695 LLIKRPVSIPQLVSAL-IRYHDDTTQIYVVGKRNAKDTDDLLSVIYKRLIPGRILFLIDH 753

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
             T  + F +  +  N  +  N     +    +C++ +CS PVT+   L  LL E+
Sbjct: 754 DKTNSILFRKNEHFRNMKLVNN-----RTTVYICKHCTCSLPVTNSEQLAILLDEQ 804


>gi|328874248|gb|EGG22614.1| DUF255 family protein [Dictyostelium fasciculatum]
          Length = 815

 Score =  520 bits (1339), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 299/725 (41%), Positives = 423/725 (58%), Gaps = 58/725 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVME ESFE+  +A+++N+ FV+IKVDREERPD
Sbjct: 129 GTEAFEEAKKQDKLIFLSVGYSTCHWCHVMERESFENPDIARIMNELFVNIKVDREERPD 188

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +DK+YMTY+  ++G GGWP+SV+L+PDL PL GGTYF  +  +GRPGF    +++ + W 
Sbjct: 189 IDKLYMTYITEVFGHGGWPMSVWLTPDLAPLTGGTYFSSKASHGRPGFGVRCQQIANIWK 248

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K ++M    GA  I+ L E  S     N +   L    +  C   ++K +DS +GGF  A
Sbjct: 249 KDKEMAISRGASFIDYLKE--SKPKGDNNVA--LSNATITKCTGMITKQFDSVYGGFSDA 304

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFPR       +Y+  +L   G    +SE  + + FTL  MA GGIHDH+GGGFHRYSV
Sbjct: 305 PKFPR-----CSVYN--ELNVCG----SSEDLEQLDFTLLKMACGGIHDHLGGGFHRYSV 353

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
            E W VPHFEKMLYDQGQ+ANVY+DA+  TK+  +  +  DIL Y++RD+    G  +SA
Sbjct: 354 TEDWRVPHFEKMLYDQGQIANVYIDAYLRTKNPLFRQVVYDILHYVQRDLTDSQGGFYSA 413

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE--HAILFKEHYYLKPTGNCDLSRMSDP 356
           EDADS   E    K+EGAFYVWT +E+E +LG      +    + +KP+GN D S  SDP
Sbjct: 414 EDADSLNKE-TNEKQEGAFYVWTLQEIEKLLGSALDTEVVAYMFDVKPSGNVDPS--SDP 470

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS-KRPRPHLDDKVIV 415
           H E  GKN+L +++ +  +ASK     EK   I+   ++ L++ R+  R RPHLDDK+I 
Sbjct: 471 HGELTGKNILHKVHTTEETASKFNHTPEKIEEIVERSKKILYEYRTNNRVRPHLDDKIIT 530

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTH 474
           +WNGL+IS+FARA ++                   KE++  A+ A  FI+  +LY E   
Sbjct: 531 AWNGLMISAFARAYQVF----------------GEKEFLVSAQRAVEFIQSGNLYQESNQ 574

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
            L  ++R+GPS   GF DDYAFLI  LLDLYE       L WA++LQ  Q ELF D + G
Sbjct: 575 ILIRNYRHGPSNVEGFSDDYAFLIQALLDLYEASFDESHLRWALQLQKKQIELFWDEKEG 634

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           G+F T G DP++L R KE+HDGAEPS  SVS  NL+RL++++     D + + A+ ++  
Sbjct: 635 GFFTTNGRDPTLLSRQKEEHDGAEPSAQSVSSCNLLRLSNML---HLDEFEERAQKTMEG 691

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG-------HKSSVDFENMLAAAHASYD 647
               L+   + +P M CA   L  P  + + +VG       H S+   + ++   H    
Sbjct: 692 SSIYLEKAPLVMPQMVCALKYLIDPFYQ-ITVVGSLDPSSKHYSTT--QELVNVIHQKPI 748

Query: 648 LNKTVIHID-PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFS-CSPPVTDPIS 705
            NK ++ +D  AD ++  F  +    ++S+A+   S D+    VC N   C  P+    S
Sbjct: 749 PNKVLLFVDIDADMDKSIF--KQVDPDSSVAKYTLSNDQPTVYVCSNEEGCYAPINTIDS 806

Query: 706 LENLL 710
           + N L
Sbjct: 807 INNQL 811


>gi|226533705|ref|NP_001152785.1| spermatogenesis-associated protein 20 [Sus scrofa]
 gi|226354712|gb|ACO50965.1| spermatogenesis associated 20 [Sus scrofa]
          Length = 789

 Score =  520 bits (1339), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 298/736 (40%), Positives = 415/736 (56%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP+SV+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + +  L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 210 QNKKTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QL   Y  AF ++ D FYS + + IL Y+ R++    G
Sbjct: 321 HRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSDVAKGILQYVARNLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLK 343
             +SAEDADS    G  R KEGAFY+WT KEV+ +L EH            L  +HY L 
Sbjct: 381 GFYSAEDADSPPGRG-MRPKEGAFYLWTVKEVQQLLPEHVPGATEPLTSGQLLMKHYGLT 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+  E    +L     KLF  R  
Sbjct: 440 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDAEAVQTLLNTGLEKLFQARKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S FA    +L  E    + N+ + G             A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RLINYAING-------------AKF 541

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DY F++ GLLDLYE    + WL 
Sbjct: 542 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYTFVVRGLLDLYEASQESAWLE 601

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+ QD LF D  GGGYF +  E  + L LR+K+D DGAEPS N VS  NL+RL  
Sbjct: 602 WALRLQDMQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANFVSAHNLLRLHG 661

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKD 717

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    AD +   F         ++ R     D+  A VC+N 
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLGTLRRLE---DRATAYVCENQ 771

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+T+P  L  LL
Sbjct: 772 ACSMPITEPCELRKLL 787


>gi|395826687|ref|XP_003786547.1| PREDICTED: spermatogenesis-associated protein 20 [Otolemur
           garnettii]
          Length = 752

 Score =  520 bits (1339), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 295/738 (39%), Positives = 421/738 (57%), Gaps = 69/738 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ F+S+KVDREERPD
Sbjct: 53  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFISVKVDREERPD 112

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 113 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 172

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 173 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 228

Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  + ++  + +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 229 AEAPKFPTPVILNFLFFYWLNHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 283

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D F+S + + IL Y+ R +    G
Sbjct: 284 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSHAFQISGDEFFSDVAKGILQYVSRSLTHRFG 343

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             + AEDADS    G  R KEGAFYVWT KEV+ +L E             L  +HY L 
Sbjct: 344 GFYCAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEPIPGATEPLTSGQLLMKHYGLT 402

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  LS+  DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 403 EAGNIGLSQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 460

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD+K++ +WNGL++S +A    +L  E                + +  A S A F
Sbjct: 461 RPKPHLDNKMLAAWNGLMVSGYAVTGAVLGIE----------------KLINCATSGAKF 504

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  T RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 505 LKRHMFDVATGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 564

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 565 WALRLQDTQDRLFWDCQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 624

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
                    +       L  F  R++ + +A+P M      LS   +  K +V+ G + +
Sbjct: 625 FTGHRD---WMDKCVCLLTAFSERMRRVPVALPEM---VRTLSAHQQTLKQIVICGDRQA 678

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            D + ++   H+ Y  NK +I    +D +   F        +++ R     D+  A V +
Sbjct: 679 KDTKALVQCVHSMYIPNKVLIL---SDGDPSSFMSRQLPFLSTLRRLE---DRATAYVYE 732

Query: 693 NFSCSPPVTDPISLENLL 710
           N +CS P+T+P  L  LL
Sbjct: 733 NQACSMPITEPCELRKLL 750


>gi|189500022|ref|YP_001959492.1| hypothetical protein Cphamn1_1072 [Chlorobium phaeobacteroides BS1]
 gi|189495463|gb|ACE04011.1| protein of unknown function DUF255 [Chlorobium phaeobacteroides
           BS1]
          Length = 712

 Score =  519 bits (1337), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 287/703 (40%), Positives = 404/703 (57%), Gaps = 56/703 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE++ +A+LLN  FV +KVDREERPD+D++YMTYVQA  G GGWP+S
Sbjct: 54  STCHWCHVMERESFENDRIAELLNRAFVPVKVDREERPDIDRLYMTYVQATTGSGGWPMS 113

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDLKP  GG+YFPPED+YG+PGF ++L  ++ AW + R+    +     EQL EAL
Sbjct: 114 VWLTPDLKPFFGGSYFPPEDRYGKPGFHSLLLSIERAWKEDRNRFLSAAEGMTEQL-EAL 172

Query: 140 SASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           S        P+ +P  +      A+  +  +D   GGFG+APKFP+P  ++ +L +S   
Sbjct: 173 SLQK-----PETVPLDEQVFHHAAKTFAGMFDKEDGGFGNAPKFPQPSILEFLLAYSYF- 226

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKML 251
             TG      E ++MVL +L+ MA GGIHDH+      GGGF RYS D RWHVPHFEKML
Sbjct: 227 --TGN----QEAKEMVLLSLRKMASGGIHDHLGIKNLGGGGFARYSTDVRWHVPHFEKML 280

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD  QLA V  +A+ +T +  Y+ +  DIL+Y+  DM    G  +SAEDADS     +  
Sbjct: 281 YDNAQLAVVATEAYQITGENLYANLADDILNYVLCDMTDNKGGFYSAEDADSFPNSKSKA 340

Query: 312 KKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           KKEGAFY W+ +E+   L      +F   Y ++  GN     + DPH EF G+N+L   N
Sbjct: 341 KKEGAFYTWSIQEITAKLDPLETDIFCFIYGVESDGNA----LDDPHLEFTGRNILFARN 396

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
           D  A+A++  MP E    I  + R KLF  R+ RPRPHLDDK++ SWNGL+IS+ ++AS 
Sbjct: 397 DIEAAAAQFSMPSEIIREITDDAREKLFHSRNDRPRPHLDDKILTSWNGLMISALSKASC 456

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +L+S+                 Y++ A  AA FI  +LY     RL   +R+G +   G 
Sbjct: 457 VLRSQ----------------NYLDAALKAAEFILNNLYSTTDGRLLRRYRSGQAGIGGK 500

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
            DDY+F I GLLDLYE  S  ++L  A++L   Q ELF D + GG+FN   +D SV +R+
Sbjct: 501 ADDYSFFIQGLLDLYEASSEHRYLSNAVKLMEKQIELFFDDKSGGFFNAASDDSSVPIRM 560

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           KED+DGAEPS NS++  +L RLA ++     D +R+ A+ ++A F   LK+    +P + 
Sbjct: 561 KEDYDGAEPSPNSINTFSLYRLADMM---DRDDFREIADKTIAYFSKSLKENGRQLPCLL 617

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
             A ML     + V+L G + +   +N+       Y  +  +IH    + E  DF     
Sbjct: 618 KTA-MLPFYGTRQVILTGERHNETMKNLENTLGEMYLPDMFIIHASGNNAENTDF----- 671

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
                + +    +    A VC N +C+ P      L  +   K
Sbjct: 672 -----LKKITLKSTGNAAYVCSNQTCNLPAYSAKELRKIFSAK 709


>gi|348562581|ref|XP_003467088.1| PREDICTED: spermatogenesis-associated protein 20-like [Cavia
           porcellus]
          Length = 789

 Score =  519 bits (1337), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 299/740 (40%), Positives = 422/740 (57%), Gaps = 69/740 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME E+F++E +A+LLN+ FVS+KVDREERPD
Sbjct: 90  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEETFQNEEIARLLNEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L  S     ++++ AL A +  +    ++P  A  +   C +QL + YD  +GGF
Sbjct: 210 QNKNTLLDSS----QRVTTALLARSEISMGDRQMPPTAATMSSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   +   ++   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLGHRMAQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +W VPHFEKMLYDQGQLA  Y  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 321 HRYSTDRQWQVPHFEKMLYDQGQLAVSYSQAFQISGDEFYSDVAKGILQYVSRSLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADS    G  R KEGAFYVWT KEV+ +L E             L  +HY L 
Sbjct: 381 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQRLLPEAVPGATEPLTAGQLLIKHYGLT 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
            TGN +  +  D   E  G+NVL        +A++ G+ +E   ++L     KL   R +
Sbjct: 440 ETGNINTCQ--DSKGELHGQNVLTVRYSLELTAARFGLEVEAVRSLLTAGVDKLLQARKQ 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G D+   +  A + A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GIDK--LVHSATNCAKF 541

Query: 464 IRRHLYDEQTHRLQHSFRNGPSKAP--------GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  T RL+ +   G             GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 542 LKRHMFDVATGRLRRTCYAGTGTTVEHRDPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 601

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+ QD LF D +GGGYF +  E   S+ LRVK+D DGAEPS NSV+  NL+RL  
Sbjct: 602 WALRLQDAQDRLFWDSQGGGYFCSEAELGGSLPLRVKDDQDGAEPSANSVAAHNLLRLHG 661

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
                  D+  + A   L  F  R++ + +A+P M  A   LS   +  K +V+ G +++
Sbjct: 662 FTG--HKDWLDKCA-CLLTAFSERMRRVPVALPEMVRA---LSAHQQGLKQIVICGERTA 715

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            D   +L   HA Y  NK +I    AD +   F        +++ R     D+  A V +
Sbjct: 716 KDTRALLQCVHALYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DRATAYVYE 769

Query: 693 NFSCSPPVTDPISLENLLLE 712
           N +CS P+T+P  L+ LLL+
Sbjct: 770 NQACSMPITEPCELQKLLLQ 789


>gi|281208328|gb|EFA82504.1| DUF255 family protein [Polysphondylium pallidum PN500]
          Length = 863

 Score =  518 bits (1335), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 280/697 (40%), Positives = 404/697 (57%), Gaps = 41/697 (5%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    +  +  FL    +TCHWCHVME ESFEDE +AK++ND FV+IKVDREERPD
Sbjct: 140 GQEAFDAAKQQDKLIFLSVGYSTCHWCHVMERESFEDETIAKVMNDLFVNIKVDREERPD 199

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +DK+YMTY+    G GGWP+SV+L+PDL+P+ GGTYFPP  KYGR GF  I +K+   W 
Sbjct: 200 IDKIYMTYITETSGSGGWPMSVWLTPDLRPITGGTYFPPTTKYGRGGFPDICKKISTMWK 259

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             R  + +SGA  I  L E        NK    +  + L+ C  ++ K +D  FGGF  A
Sbjct: 260 DDRKRVLESGASFITYLKE---EKPKGNK-DAAISFDTLKTCHSEIVKRFDPEFGGFSEA 315

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFPR             L    +  E+    + + FTL+ M++GGI+DH+ GGFHRYSV
Sbjct: 316 PKFPRTSIFNF-------LHRVHRRFESDNTLEKLHFTLEKMSRGGIYDHLAGGFHRYSV 368

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
            E W VPHFEKMLYDQGQ+ +VYLDA+ ++K+  +  +   +++Y+ RD+    G  +SA
Sbjct: 369 TEDWKVPHFEKMLYDQGQIVSVYLDAYQISKNEHFKDVATGVIEYVLRDLTHVDGGFYSA 428

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDP 356
           EDADS + +G   K EGAFYVW   E++  + E + L  F   + + P GN  +S   DP
Sbjct: 429 EDADSLDDKG--EKTEGAFYVWDYSEIKKAVPEESDLEIFNFIFGISPNGN--VSASEDP 484

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
           H EF  KN++++ +     ++KL +P+E+    + + +  L  +R+KR RPHLDDK+I S
Sbjct: 485 HGEFLDKNIIMQFHTFEECSNKLNIPVEQVKQSIEKSKVSLLKLRAKRARPHLDDKIITS 544

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WN L+IS+ +++              F ++G  R  Y+E A+ +  FI+ +LY+ +   L
Sbjct: 545 WNALMISALSKS--------------FQLLGEQR--YLEAAKKSVHFIKTNLYNAEKQTL 588

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
             ++R GPSK  GF DDYAFLI  LLDLYE      +L WA+ELQ  QD+LF D+EG GY
Sbjct: 589 IRNYREGPSKVEGFTDDYAFLIQALLDLYECCFDIAYLEWAVELQAKQDKLFWDKEGHGY 648

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F+++G D S+L R+KE+HDGAEPS  SV+  NL+R+ +++     D Y  NA   L    
Sbjct: 649 FSSSGLDSSILSRLKEEHDGAEPSCQSVACNNLIRIGNML---HDDDYTDNALLLLESVS 705

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
             L    +  P M  +      P+         KSS +  ++L   H  Y  NK ++  D
Sbjct: 706 LYLHRAPIVFPQMVVSLANHLEPTYT-FSFAADKSSAELRSLLDTIHTFYMPNKVLLLKD 764

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQN 693
               ++M F+ E +  +A + +     DK    +C +
Sbjct: 765 TEHPQDMTFFSELD-QHAILLKYTKLYDKPTLYICSD 800


>gi|344252175|gb|EGW08279.1| Spermatogenesis-associated protein 20 [Cricetulus griseus]
          Length = 1263

 Score =  518 bits (1335), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 296/736 (40%), Positives = 420/736 (57%), Gaps = 65/736 (8%)

Query: 2    GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
            G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPD
Sbjct: 564  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPD 623

Query: 59   VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
            VDKVYMT+VQA   GGGWP++V+++P L+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 624  VDKVYMTFVQATSSGGGWPMNVWMTPSLQPFVGGTYFPPEDGLTRVGFRTVLTRIRDQWK 683

Query: 119  KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
            + ++ L ++     ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF
Sbjct: 684  QNKNTLLENS----QRVTTALLARSEISVGDRQVPPSAATMNTRCFQQLDEGYDEEYGGF 739

Query: 176  GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
              APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 740  AEAPKFPTPVILNFLFSYWLSHRLAQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 794

Query: 234  HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
            HRYS D +WH+PHFEKMLYDQ QLA VY  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 795  HRYSTDRQWHIPHFEKMLYDQAQLAVVYSQAFQISGDEFYSDVAKGILQYVTRSLSHRSG 854

Query: 294  EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
              +SAEDADSA   G  + KEGAFYVWT +E++ +L E             L  +HY L 
Sbjct: 855  GFYSAEDADSAPERG-MKPKEGAFYVWTVQEIQQLLPEPVGGASEPLTSGQLLMKHYGLS 913

Query: 344  PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
              GN + ++  DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 914  EAGNINSNQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVSTLLNTGLEKLFQARKH 971

Query: 404  RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
            RP+ HLD K++ +WNGL++S FA    +L              G D+   +  A + A F
Sbjct: 972  RPKAHLDSKMLAAWNGLMVSGFAVTGAVL--------------GMDK--LVTQATNGAKF 1015

Query: 464  IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
            ++RH++D  + RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 1016 LKRHMFDVASGRLKRTCYAGTGGSVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 1075

Query: 516  WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
            WA+ LQ+TQD LF D  GGGYF +  E  S L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 1076 WALRLQDTQDRLFWDSRGGGYFCSEAELGSDLPLRLKDDQDGAEPSANSVSAHNLLRLHG 1135

Query: 575  IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
               G K   +       L  F  R++ + +A+P M  A       + K +V+ G     D
Sbjct: 1136 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQETLKQIVICGDPQGKD 1191

Query: 635  FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
             + +L   H+ Y  NK +I    AD +   F        +++ R     D+  A + +N 
Sbjct: 1192 TKALLQCVHSIYLPNKVLIL---ADGDPSSFLSRQLPFLSNLRR---VEDRATAYIFENQ 1245

Query: 695  SCSPPVTDPISLENLL 710
            +CS P+T+P  L  LL
Sbjct: 1246 ACSMPITEPCELRKLL 1261


>gi|426237729|ref|XP_004012810.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
           20 [Ovis aries]
          Length = 795

 Score =  518 bits (1335), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 299/739 (40%), Positives = 412/739 (55%), Gaps = 67/739 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 92  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 151

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP+SV+L+P+L+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 152 VDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLMRIRDQWK 211

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           + +  L ++       L  A SA +  ++     P+ +   C +QL + YD  +GGF  A
Sbjct: 212 QNKSTLLENSQRVTTALL-ARSAISMGDRQXSAAPRPS--RCFQQLDEGYDEEYGGFAEA 268

Query: 179 PKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
           PKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRY
Sbjct: 269 PKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRY 323

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
           S D +WHVPHFEKMLYDQ QL   Y  AF ++ D FYS + + IL Y+ R++    G  +
Sbjct: 324 STDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSEVAKGILQYVARNLSHRSGGFY 383

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTG 346
           SAEDADS    G  R KEGAFYVWT KEV+ +L E  +          L  +HY L   G
Sbjct: 384 SAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEPVLGATEPLTSGQLLMKHYGLTEAG 442

Query: 347 NCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPR 406
           N  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  RP+
Sbjct: 443 N--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPK 500

Query: 407 PHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR 466
           PHLD K++ +WNGL++S FA    +L  E                  +  A + A F++R
Sbjct: 501 PHLDSKMLAAWNGLMVSGFAVTGAVLGQE----------------RVVSYAINGAKFLKR 544

Query: 467 HLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAI 518
           H++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL WA+
Sbjct: 545 HMFDVASGRLMRTCYAGAGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWAL 604

Query: 519 ELQNTQDELFLDREGGGYFNTTGEDPSVL-------LRVKEDHDGAEPSGNSVSVINLVR 571
            LQ+TQD LF D  GGGYF +  E  + L       LR+++D DGAEPS NSVS  NL+R
Sbjct: 605 RLQDTQDRLFWDSRGGGYFCSEAELGAGLPWGGGLPLRLEDDQDGAEPSANSVSAHNLLR 664

Query: 572 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 631
           L     G K   +       L  F  R++ + +A+P M  A       + K +V+ G   
Sbjct: 665 LHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQ 720

Query: 632 SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVC 691
           + D + +L   H+ Y  NK +I    AD +   F         ++ R     D+  A VC
Sbjct: 721 AKDTKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRRIE---DRATAYVC 774

Query: 692 QNFSCSPPVTDPISLENLL 710
           +N +CS P+T+P  L  LL
Sbjct: 775 ENQACSMPITEPCELRKLL 793


>gi|354478455|ref|XP_003501430.1| PREDICTED: spermatogenesis-associated protein 20 [Cricetulus
           griseus]
          Length = 789

 Score =  517 bits (1332), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 296/736 (40%), Positives = 420/736 (57%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPD
Sbjct: 90  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+++P L+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWMTPSLQPFVGGTYFPPEDGLTRVGFRTVLTRIRDQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISVGDRQVPPSAATMNTRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRLAQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA VY  AF ++ D FYS + + IL Y+ R +    G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLAVVYSQAFQISGDEFYSDVAKGILQYVTRSLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADSA   G  + KEGAFYVWT +E++ +L E             L  +HY L 
Sbjct: 381 GFYSAEDADSAPERG-MKPKEGAFYVWTVQEIQQLLPEPVGGASEPLTSGQLLMKHYGLS 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN + ++  DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 440 EAGNINSNQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVSTLLNTGLEKLFQARKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+ HLD K++ +WNGL++S FA    +L              G D+   +  A + A F
Sbjct: 498 RPKAHLDSKMLAAWNGLMVSGFAVTGAVL--------------GMDK--LVTQATNGAKF 541

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 542 LKRHMFDVASGRLKRTCYAGTGGSVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 601

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D  GGGYF +  E  S L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 602 WALRLQDTQDRLFWDSRGGGYFCSEAELGSDLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G     D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQETLKQIVICGDPQGKD 717

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    AD +   F        +++ R     D+  A + +N 
Sbjct: 718 TKALLQCVHSIYLPNKVLIL---ADGDPSSFLSRQLPFLSNLRR---VEDRATAYIFENQ 771

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+T+P  L  LL
Sbjct: 772 ACSMPITEPCELRKLL 787


>gi|301620517|ref|XP_002939623.1| PREDICTED: spermatogenesis-associated protein 20-like [Xenopus
           (Silurana) tropicalis]
          Length = 775

 Score =  517 bits (1331), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 280/651 (43%), Positives = 385/651 (59%), Gaps = 56/651 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE + ++LN+ F+ +KVDREERPDVDKVYMT++QA   GGGWP+S
Sbjct: 124 STCHWCHVMERESFEDEEIGRILNENFICVKVDREERPDVDKVYMTFLQATDSGGGWPMS 183

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL+P +GGTYFPPED   R  F+T+L ++ + W + R       AF  E+    L
Sbjct: 184 VWLTPDLRPFVGGTYFPPEDGVRRVSFRTVLLRIVEQWKENR-------AFLCERSERIL 236

Query: 140 SASASSNKL------PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--L 191
           S   SS+ +      P  LP    +LC +QL + +D  +GGFG  PKFP PV    +  L
Sbjct: 237 SVLQSSSDIDGAAEPPPSLPVQ--KLCFQQLERIFDEEYGGFGEFPKFPTPVNFSFLFCL 294

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           +   K      S E ++   M + TL+ M  GGIHDH+G GFHRYS D+ WHVPHFEKML
Sbjct: 295 WALSK-----GSPEGTQALHMAVHTLKWMMYGGIHDHIGKGFHRYSTDQTWHVPHFEKML 349

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YDQGQLA  Y +AF ++    +S    DIL Y+ +++    G  +SAEDADS     +  
Sbjct: 350 YDQGQLAVAYAEAFQISGKEIFSDAAHDILQYVLQNLSDDAGGFYSAEDADSLPNAQSKE 409

Query: 312 KKEGAFYVWTSKEVEDILGE--------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
           KKEGAF  WT+KE++ +L +           +F  HY +K  GN   S+  D H E +G+
Sbjct: 410 KKEGAFATWTAKEIQQLLPDMEEANGNTFGDIFMHHYGMKEEGNVSASQ--DIHGELQGQ 467

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
           NVLI  +    +A+K G+ + +   IL  CR +L+  R  RP P  D  ++ SWNGL++S
Sbjct: 468 NVLIVRSSLELTAAKFGLDVARVQTILSMCRDRLYKARRLRPPPQRDTNILASWNGLMLS 527

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
             AR   IL+ E                EY+E A+ AASF+  ++YD ++  L  SF  G
Sbjct: 528 GLARCGVILRDE----------------EYIERAKLAASFLHENMYDLKSGILLRSFYKG 571

Query: 484 PSK----APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 539
                   PGFLDDYAF++ GLLDLYE      +L WA++LQ+ QD+LF D +G GYF +
Sbjct: 572 HQPIADLVPGFLDDYAFMVRGLLDLYEACLDQFYLEWALQLQDRQDQLFWDAKGSGYFCS 631

Query: 540 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
              D S+LLR+K+D DGAEPSGNSVSV+NL+RLA     ++   + + +   LA F  RL
Sbjct: 632 DASDSSILLRLKDDQDGAEPSGNSVSVVNLLRLACYTGRTE---FTERSGQILAAFSERL 688

Query: 600 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
             +  ++P M    +M+   + K VV+ G K   +   +L AA + Y  NK
Sbjct: 689 LKVPASLPEM-VRGNMIYHQTVKQVVVCGDKEDPNTRELLEAAQSMYVPNK 738


>gi|116487451|gb|AAI25719.1| LOC779596 protein [Xenopus (Silurana) tropicalis]
          Length = 770

 Score =  516 bits (1329), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 283/672 (42%), Positives = 392/672 (58%), Gaps = 59/672 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    +  +  FL    +TCHWCHVME ESFEDE + ++LN+ F+ +KVDREERPD
Sbjct: 97  GQEAFSRAAREMKPIFLSVGYSTCHWCHVMERESFEDEEIGRILNENFICVKVDREERPD 156

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT++QA   GGGWP+SV+L+PDL+P +GGTYFPPED   R  F+T+L ++ + W 
Sbjct: 157 VDKVYMTFLQATDSGGGWPMSVWLTPDLRPFVGGTYFPPEDGVRRVSFRTVLLRIVEQWK 216

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKL------PDELPQNALRLCAEQLSKSYDSRF 172
           + R       AF  E+    LS   SS+ +      P  LP    +LC +QL + +D  +
Sbjct: 217 ENR-------AFLCERSERILSVLQSSSDIDGAAEPPPSLPVQ--KLCFQQLERIFDEEY 267

Query: 173 GGFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
           GGFG  PKFP PV    +  L+   K      S E ++   M + TL+ M  GGIHDH+G
Sbjct: 268 GGFGEFPKFPTPVNFSFLFCLWALSK-----GSPEGTQALHMAVHTLKWMMYGGIHDHIG 322

Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
            GFHRYS D+ WHVPHFEKMLYDQ QLA  Y +AF ++    +S    DIL Y+ +++  
Sbjct: 323 KGFHRYSTDQTWHVPHFEKMLYDQAQLAVAYAEAFQISGKEIFSDAAHDILQYVLQNLSD 382

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE--------HAILFKEHYYL 342
             G  +SAEDADS     +  KKEGAF  WT+KE++ +L +           +F  HY +
Sbjct: 383 DAGGFYSAEDADSLPNAQSKEKKEGAFATWTAKEIQQLLPDMEEANGNTFGDIFMHHYGM 442

Query: 343 KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS 402
           K  GN   S+  D H E +G+NVLI  +    +A+K G+ + +   IL  CR +L+  R 
Sbjct: 443 KEEGNVSASQ--DIHGELQGQNVLIVRSSLELTAAKFGLDVARVQTILSMCRDRLYKARR 500

Query: 403 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 462
            RP P  D K++ SWNGL++S  AR   IL+ E                 Y+E A+ AAS
Sbjct: 501 LRPPPQRDTKILASWNGLMLSGLARCGVILRDEG----------------YIERAKLAAS 544

Query: 463 FIRRHLYDEQTHRLQHSFRNGPSK----APGFLDDYAFLISGLLDLYEFGSGTKWLVWAI 518
           F+  ++YD ++  L  SF  G        PGFLDDYAF++ GLLDLYE      +L WA+
Sbjct: 545 FLHENMYDLKSGILLRSFYKGHQPIADLVPGFLDDYAFMVRGLLDLYEACLDQFYLEWAL 604

Query: 519 ELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
           +LQ+ QD+LF D +G GYF +   D S+LLR+K+D DGAEPSGNSVSV+NL+RLA     
Sbjct: 605 QLQDRQDQLFWDAKGSGYFCSDASDSSILLRLKDDQDGAEPSGNSVSVVNLLRLACYTGR 664

Query: 579 SKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM 638
           ++   + + +   LA F  RL  +  ++P M    +M+   + K VV+ G K   +   +
Sbjct: 665 TE---FTERSGQILAAFSERLLKVPASLPEM-VRGNMIYHQTVKQVVVCGDKEDPNTREL 720

Query: 639 LAAAHASYDLNK 650
           L AA + Y  NK
Sbjct: 721 LEAAQSMYVPNK 732


>gi|383859631|ref|XP_003705296.1| PREDICTED: spermatogenesis-associated protein 20 [Megachile
           rotundata]
          Length = 744

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 292/723 (40%), Positives = 406/723 (56%), Gaps = 68/723 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF ++ +A ++N  FV+IKVD  ERPD+DK+YM +VQA  G GGWP+S
Sbjct: 60  STCHWCHVMEKESFTNKEIADIMNKHFVNIKVDNGERPDIDKIYMAFVQATTGHGGWPMS 119

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP+ GGTYFPPED + + GFKTIL  + D W+  +  + + G+   + L +  
Sbjct: 120 VFLTPDLKPVFGGTYFPPEDTFRQTGFKTILLNIADKWNSLKTKITEVGSANFKTLKDIS 179

Query: 140 SASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGSA-----PKFPRPVEIQMM--L 191
               +S K   E+P      +CA QL+  ++  FGGF S+     PKFP+PV    +  +
Sbjct: 180 KVPQTSKK--HEVPSLECSNVCALQLASEFEPEFGGFTSSFDMHTPKFPQPVIFNFLFHM 237

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           Y     E+  KS        M ++TL+ +A GGIHDH+G GF RY+ D +WHVPHFEKML
Sbjct: 238 YARHPNEELAKS-----CLHMCVYTLKKIAFGGIHDHIGQGFSRYATDGKWHVPHFEKML 292

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YDQGQL   Y DA+  TKD +++ I  DI  Y+ RD+    G  +SAEDADS  T  A  
Sbjct: 293 YDQGQLMKSYADAYVTTKDNYFAEIVDDIAAYVIRDLRHQEGGFYSAEDADSYATSDAHE 352

Query: 312 KKEGAFYVWTSKEVEDILGEH--------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
           K EGAFYVWT+ E++ +L +         + +F  H+ +K +GN  +    DP  E  GK
Sbjct: 353 KLEGAFYVWTAAEIKSLLDKKVSSENIKLSDIFCHHFNVKESGN--VKGYQDPRGELTGK 410

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
           NVLI   D   +A      +E+  N L +    L++ R  RPRPHLDDK+I SWNGL+IS
Sbjct: 411 NVLIVYEDIDDTAKHFNCTVEEIKNYLKDACSILYEARQARPRPHLDDKIITSWNGLMIS 470

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRN 482
             A    ++                D K+Y+E A  AA FI+R+L+DE    L HS +RN
Sbjct: 471 GLAYGGAVV----------------DNKQYIEYATDAAKFIKRYLFDEAKDILLHSCYRN 514

Query: 483 GPSKAP-------GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
             +K         GFLDDYAF+I GLLDLYE G   +WL +A  LQ+ QD+L  D   GG
Sbjct: 515 AENKITQINEPIHGFLDDYAFVIKGLLDLYEAGFDEQWLEFAERLQDIQDKLLWDETSGG 574

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           YF TT +DPS+++R+KE HDGAEPSGNS+S  NL+RLA  +  S     +         F
Sbjct: 575 YFTTTSDDPSIIVRLKEAHDGAEPSGNSISAENLLRLAYYLGRSD---LKDKVVRLFGAF 631

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENMLAAAHASYDLNK 650
              L    +AVP       ++S   R H     + +VG + + D +++L   +      +
Sbjct: 632 RHLLTQRPIAVP------QLVSALVRYHDDATQIYVVGKRGAKDTDDLLRVIYKRLIPGR 685

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            ++ ID  + + +   +     N          D+    VC+  +CS PV++   LE LL
Sbjct: 686 ILMLIDHDEADSILLGKNERLRNMKPLN-----DQATVYVCKYRTCSLPVSNSKQLEKLL 740

Query: 711 LEK 713
            E+
Sbjct: 741 DEQ 743


>gi|324505187|gb|ADY42236.1| Unknown [Ascaris suum]
          Length = 775

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 299/740 (40%), Positives = 414/740 (55%), Gaps = 84/740 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       R  FL    +TCHWCHVM  ESFE++ +A +LN+ FVSIKVDREERPD
Sbjct: 81  GDEAFTKAKTLNRLIFLSVGYSTCHWCHVMAHESFENQTIADILNENFVSIKVDREERPD 140

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YMT++QA+ GGGGWP+SVFL+PDL P+ GGTYFPPED+YGRPGF +ILR + + W 
Sbjct: 141 VDKLYMTFIQAISGGGGWPMSVFLTPDLNPVTGGTYFPPEDRYGRPGFASILRTIAEKWQ 200

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            + D +   G FA   L+ A+  +  +N+      +N    C  +L+  +D  + GFG A
Sbjct: 201 LEGDQIRGQG-FA---LANAIKKAFLTNRETVPADENVALTCYTELADRFDETYKGFGGA 256

Query: 179 PKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
           PKFP+P E+  ML  Y + K    GK        KMV  TL+ MA+GGIHDH+G GFHRY
Sbjct: 257 PKFPKPAELDFMLSFYANNKSTTEGKL-----ALKMVGETLEAMARGGIHDHIGKGFHRY 311

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYIC-------RDILDYLRRDMI 289
           +VD  WHVPHFEKMLYDQ QL +VY +         YS +C        DI DY+ R++ 
Sbjct: 312 AVDAAWHVPHFEKMLYDQAQLLSVYAN---------YSLVCGQMKEIVEDIADYVYRNLT 362

Query: 290 GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG----------EHAILFKEH 339
            P G  +SA+DADS  +  A  K+EGAFYVWT +E++D L           + A  FK++
Sbjct: 363 HPEGGFYSAQDADSLPSHNAKAKREGAFYVWTEQEIDDALKDVTVNGDSSVDVATYFKQY 422

Query: 340 YYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFD 399
           + +K  GNC     +DPH E K +NVL   +    SA KLG+  +K   I+ + R+ L +
Sbjct: 423 FGVKANGNCPSD--TDPHGELKLQNVLAMKDSHKDSARKLGISEDKLTAIIEKARQVLVE 480

Query: 400 VRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAES 459
            R++RP PHLD K++ SWNGL+IS  +RAS                V + + E    A+ 
Sbjct: 481 ARAQRPEPHLDSKMLTSWNGLMISGLSRAS----------------VAAGKPELAGRAQK 524

Query: 460 AASFIRRHLYDEQTHRLQHSFRN---------GPSKAPGFLDDYAFLISGLLDLYEFGSG 510
              FI++++  E    L+ ++ +          P KA  F DDYAFLI GLLDLYE    
Sbjct: 525 VVEFIKKYMLSENGELLRTAYTDESGGVVHNSKPVKA--FADDYAFLIEGLLDLYEVTFD 582

Query: 511 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 570
              L +A ELQ   DE F D +    +  +  DPS++ R  EDHDGAEP+ NSV+ +NLV
Sbjct: 583 ENLLKFASELQKQFDERFWDTDNNAGYFLSETDPSIMTRFMEDHDGAEPATNSVAALNLV 642

Query: 571 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 630
           RLASI      + +R    + L     RL+     +P M  A    S P+   VV++G +
Sbjct: 643 RLASIF---DEERFRDRVANILESVSLRLRRYPSVLPKMVTALMRHSRPA-TLVVVIGKR 698

Query: 631 SSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE-EHNSNNASMARNNFSADKVVAL 689
                + ML      +  N+++I +D       D W  E N +  ++ R   S  K    
Sbjct: 699 DDPLTQQMLDEIKRHFIPNQSLISLDATK----DLWLIEQNDHFGTLLR---STTKPAVF 751

Query: 690 VCQNFSCSPPVTDPISLENL 709
           +C++F C+ P+T   SL++L
Sbjct: 752 ICEHFKCNQPIT---SLDDL 768


>gi|307166116|gb|EFN60365.1| Spermatogenesis-associated protein 20 [Camponotus floridanus]
          Length = 754

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 291/714 (40%), Positives = 410/714 (57%), Gaps = 56/714 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E +A+++N+ FV+IKVDREERPD+D++YMT+VQA  G GGWP+S
Sbjct: 66  STCHWCHVMEKESFENEDIARIMNENFVNIKVDREERPDIDRIYMTFVQAKSGHGGWPMS 125

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFLSPDL P+ GGTYFPP+ KYG  GFK++L  V   W +++  + +S A  +E+L + +
Sbjct: 126 VFLSPDLMPVTGGTYFPPDGKYGLIGFKSLLLAVAKEWTQQKSNIIKSAANIVERLKDIV 185

Query: 140 SASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYH 193
                  K  D  P      LC   L+  Y+ +FGGF S     +PKFP PV     L+ 
Sbjct: 186 ECKQGLKK-DDGFPTAECALLCVHLLANGYEPKFGGFSSRSWMNSPKFPEPVNFN-FLFS 243

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
           +  L  +  S    +  +M L TL  MA GGIHDHVG GF RYSVD  WHVPHFEKMLYD
Sbjct: 244 TYALSTS--SELRKQCLEMCLHTLTKMAYGGIHDHVGQGFSRYSVDGEWHVPHFEKMLYD 301

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q Q+   Y DA+ +TKD FYS I  DI  Y+ RD+    G  +SAEDADS     A+ K+
Sbjct: 302 QAQIIQAYADAYVITKDSFYSDIVDDIATYVVRDLRHKEGGFYSAEDADSLPEPQASAKR 361

Query: 314 EGAFYVWTSKEVEDIL-----GEHAILFKE----HYYLKPTGNCDLSRMSDPHNEFKGKN 364
           EGAFYVW  KEV+ +L     G   + F +    H+ +K  GN  + +  DPH E  GKN
Sbjct: 362 EGAFYVWPYKEVKTLLDKKIPGNDNVRFSDLICYHFNVKKEGN--VRKAQDPHGELTGKN 419

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           V I  +    +A   G+ +E   + + E  + LF+ RSKRPRPHLDDK++ +WNGL+IS 
Sbjct: 420 VFIVYDGIEQTAEHFGISVENTKSYIKEACQILFEERSKRPRPHLDDKIVTAWNGLMISG 479

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG- 483
           FARA   ++++                +Y+E+A  AA F++++L+D+    L  S   G 
Sbjct: 480 FARAGAAVRND----------------KYVELATDAAKFVKQYLFDKNKGVLLRSCYRGE 523

Query: 484 -----PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
                 +  P  GF DDYAF++ GLLDLYE     +WL +A ELQ+ QD LF D + GGY
Sbjct: 524 DDRIMQTSVPIHGFHDDYAFVVKGLLDLYEANFDAQWLEFAEELQDIQDRLFWDSQDGGY 583

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F+T  E+  ++LR+K+ HDGAEPS NS++  NL+RLA+ +  S+    +  A   L+ F 
Sbjct: 584 FSTV-ENSQMILRMKDAHDGAEPSSNSIACSNLLRLATYLDRSE---LKDKAGQLLSAFG 639

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
             L +M +  P +  A  +L   +   + + G   + D   ML          + ++  D
Sbjct: 640 KGLTEMPIMFPQLTLA--LLEYHNATQIYIAGRPDAEDTIEMLNVIRERVIPGRVLLLAD 697

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           P   + +         NA +++      +   LVC+  +CS P+T+P  L + L
Sbjct: 698 PEQQDNVLL-----RKNAVVSKLKPQKGRATVLVCRRQACSIPITNPSELASQL 746


>gi|307213879|gb|EFN89140.1| Spermatogenesis-associated protein 20 [Harpegnathos saltator]
          Length = 755

 Score =  514 bits (1324), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 292/719 (40%), Positives = 411/719 (57%), Gaps = 59/719 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E +A ++ND F++IKVDREERPD+D++YMT+VQA  G GGWP+S
Sbjct: 66  STCHWCHVMEKESFENEEIAHIMNDNFINIKVDREERPDIDRIYMTFVQAKSGHGGWPMS 125

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+L P+ GGTYFPP+D+YG  GFK++L +V   W ++++ + +SGA  + +L + +
Sbjct: 126 VFLAPNLTPVTGGTYFPPDDRYGLIGFKSLLLEVAKKWAQQKNDIIKSGANIVSRLKDMV 185

Query: 140 SASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMM--L 191
               S  K  D  P      LC   L+  Y+ +FGGFGS     APKFP PV    +  +
Sbjct: 186 ERRQSL-KEGDGFPTVECGFLCVHLLANGYEPKFGGFGSQFRMNAPKFPEPVNFNFLFSV 244

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           Y    L +  K     E  +M L TL  MA GGIHDHVG GF RYSVD  WHVPHFEKML
Sbjct: 245 YALSNLSELRK-----ECLEMCLHTLTKMAYGGIHDHVGQGFSRYSVDGEWHVPHFEKML 299

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YDQ Q+   Y DA+ +TKD FYS I  DI  Y+ RD+    G  +SAEDADS     ++ 
Sbjct: 300 YDQAQIIQAYADAYVITKDSFYSDIVDDIAKYVERDLRHKEGGFYSAEDADSLPESKSSA 359

Query: 312 KKEGAFYVWTSKEVEDIL-----GEHAILFKE----HYYLKPTGNCDLSRMSDPHNEFKG 362
           K+EGAFYVWT  EV+ +L     G + + F +    H+ +K  GN  + +  DPH E  G
Sbjct: 360 KREGAFYVWTYDEVKSLLNKKVPGRNNVRFFDLICYHFNVKKEGN--VRKAQDPHGELTG 417

Query: 363 KNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 422
           KNVLI       +A    + LE     + +    LF  RSKRPRPHLDDK++ +WNGL+I
Sbjct: 418 KNVLIAYEAVEKTAEHFNISLEDTKTYIKQACLILFKERSKRPRPHLDDKMVTAWNGLMI 477

Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF-- 480
           S FARA   +++                 +Y+E+A  AA F+ ++L+D+    L  S   
Sbjct: 478 SGFARAGAAVRNS----------------KYVELATDAAKFVEQYLFDKNKGTLLRSCYR 521

Query: 481 ----RNGPSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
               R   +  P  GF DDYAF++ GLLDLY+      WL  A +LQ+TQDELF D + G
Sbjct: 522 EEDDRIIQTSVPIYGFHDDYAFVVKGLLDLYQANFDVHWLELAEQLQDTQDELFWDSQDG 581

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           GYF+T  ED  ++LR+K+ HDGAEPS NS++  NL+RLA+ +  ++    ++ A   L  
Sbjct: 582 GYFSTV-EDSQMILRMKDAHDGAEPSSNSIACSNLLRLAAFLDRNE---LKEKAAQLLRA 637

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           F   L ++ +  P M  A  +L       + ++G   + D   ML            +  
Sbjct: 638 FGKGLTEIPIMFPQMTLA--LLDYHYTTQIYIIGKSDAEDTNEMLNVVRERLIPGMVLSL 695

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
           +D   +++   + +    N  +++      +    VC++ +CSPP T P  L +LL +K
Sbjct: 696 VDHERSQDNVLFRK----NTIISKMKPQNGRATVFVCRHHTCSPPTTSPRELASLLDDK 750


>gi|242004841|ref|XP_002423285.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212506287|gb|EEB10547.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 774

 Score =  513 bits (1321), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 296/738 (40%), Positives = 417/738 (56%), Gaps = 82/738 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVME ESFE+E +AK++N+ FV +KVDREERPD
Sbjct: 90  GNEAFSRAVKENKLIFLSVGYSTCHWCHVMEKESFENEEIAKIMNENFVCVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YM +VQ                   P+ GGTYFPP D + RPGFK++L  + + W 
Sbjct: 150 VDKLYMLFVQ-------------------PIFGGTYFPPSDFHERPGFKSVLLILAEQWR 190

Query: 119 KKRDMLAQSGAFAIEQLSEALSA-----SASSNKLPDELPQNALRLCAEQLSKSYDSRFG 173
           + R   +++G   ++ + ++ S      + S+   PD    + +  C   L KSY+  +G
Sbjct: 191 ENRQKFSENGRKIMDYIEQSSSLDNSILNPSAVNPPD---ISCIEKCYNSLFKSYEKNYG 247

Query: 174 GFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
           GF  APKFP  V +  +  LY  +   + GK+  A     M + TL+ MA GGIHDH+G 
Sbjct: 248 GFSEAPKFPHLVNLNFLFHLYAREPKSERGKTALA-----MCIHTLKMMANGGIHDHIGK 302

Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
           GF RYSVD +WHVPHFEKMLYDQGQLA  Y  A+  TK+ F+S +   IL Y+ RD+  P
Sbjct: 303 GFSRYSVDNKWHVPHFEKMLYDQGQLAVSYATAYLTTKNQFFSEVLEGILSYVDRDLSHP 362

Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE---------HAILFKEHYYL 342
            G  +SAEDADS     +T KKEGAFYVWT ++++  L +         +A +F E++ +
Sbjct: 363 DGGFYSAEDADSLSAPDSTEKKEGAFYVWTYEDIKKHLPQKIPESSELTYADVFCEYFNV 422

Query: 343 KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS 402
           K  GN + S+  DPHNE K +NVLI  +  +A A+K  +  E+   IL E ++ LF++R+
Sbjct: 423 KANGNVNPSK--DPHNELKNQNVLIITDSEAAVAAKFNLSEERVKQILDESKKILFNLRA 480

Query: 403 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 462
           KRPRPHLDDK++ SWNGL+IS +A+A ++L +                  Y++ A  AA 
Sbjct: 481 KRPRPHLDDKILTSWNGLMISGYAKAGQVLGNS----------------HYVQRAIGAAK 524

Query: 463 FIRRHLYDEQTHRL--------QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 514
           FIR+HLY   T  L         ++     +   GFLDDYAFLI GLLDLYE      W+
Sbjct: 525 FIRQHLYKNDTKTLLRSCYKSSDNTISQIATPINGFLDDYAFLIRGLLDLYEASFDPIWI 584

Query: 515 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 574
            WA  LQ TQD LF D  G GYF++   D S+L+R+KEDHDGAEP GNSVSV NL+RL +
Sbjct: 585 EWAESLQETQDTLFWDEGGAGYFSSPSGDSSILVRMKEDHDGAEPCGNSVSVSNLLRLGA 644

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
            +  ++   Y+  A   LA F +RLK M + +P M  A  +L       +++ G K+  D
Sbjct: 645 YLDKAE---YKDRAGKLLAAFTSRLKKMPVILPEMVSAL-LLYHDGPTQILITGKKTDPD 700

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
              +L    + +  N+ +  ID  D +E   +++++        +  S     A VC + 
Sbjct: 701 TAALLNVVQSRFIPNRILALID--DDKESILYKKNDIIRTIKPVHGHS----TAYVCHHH 754

Query: 695 SCSPPVTDPISLENLLLE 712
           +CS P+     L  LL E
Sbjct: 755 TCSLPINTREELAKLLDE 772


>gi|148683975|gb|EDL15922.1| spermatogenesis associated 20, isoform CRA_a [Mus musculus]
          Length = 745

 Score =  513 bits (1320), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 295/738 (39%), Positives = 418/738 (56%), Gaps = 69/738 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LLN+ F+ + VDREERPD
Sbjct: 46  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNENFICVMVDREERPD 105

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L ++ D W 
Sbjct: 106 VDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 165

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
             ++ L ++     ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF
Sbjct: 166 LNKNTLLENS----QRVTTALLARSEISVGDRQIPASAATMNSRCFQQLDEGYDEEYGGF 221

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 222 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIQDHVGQGF 276

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QL+ VY  AF ++ D FY+ + + IL Y+ R +    G
Sbjct: 277 HRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQISGDEFYADVAKGILQYVTRTLSHRSG 336

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  + +EGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 337 GFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQQLLPEPVVGASEPLTSGQLLMKHYGLS 395

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN + S+  DP+ E  G+NVL+       +A++ G+ +E    +L     KLF  R  
Sbjct: 396 EVGNINSSQ--DPNGELHGQNVLMVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 453

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+ HLD+K++ +WNGL++S FA     L  E   A                 A S A F
Sbjct: 454 RPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEKLVAQ----------------ATSGAKF 497

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 498 LKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 557

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD+LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL S
Sbjct: 558 WALRLQDTQDKLFWDPRGGGYFCSEAELGADLPLRLKDDQDGAEPSANSVSAHNLLRLHS 617

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
              G K   +       L  F  R++ + +A+P M      LS   +  K +V+ G   +
Sbjct: 618 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEM---VRTLSAQQQTLKQIVICGDPQA 671

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            D + +L   H+ Y  NK +I    AD +   F        +S+ R     D+    + +
Sbjct: 672 KDTKALLQCVHSIYVPNKVLIL---ADGDPSSFLSRQLPFLSSLRR---VEDRATVYIFE 725

Query: 693 NFSCSPPVTDPISLENLL 710
           N +CS P+TDP  L  LL
Sbjct: 726 NQACSMPITDPCELRKLL 743


>gi|351713578|gb|EHB16497.1| Spermatogenesis-associated protein 20, partial [Heterocephalus
           glaber]
          Length = 806

 Score =  513 bits (1320), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 292/738 (39%), Positives = 417/738 (56%), Gaps = 69/738 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME E+F++E + +LL++ FVS+KVDREE+PD
Sbjct: 109 GQEAFGKARKENKPIFLSVGYSTCHWCHMMEEETFQNEEIGRLLSEDFVSVKVDREEQPD 168

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 169 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 228

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + +  L +S     ++++ AL A +  +    + P  A  +   C +QL + YD  +GGF
Sbjct: 229 QNKSTLLESS----QRVTTALLARSEISMGDRQAPPLAATMNSRCFQQLDEGYDEEYGGF 284

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   +   +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 285 AEAPKFPIPVILSFLFSYWLGHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 339

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +W  PHFEKMLYDQ QLA  Y  AF ++ D FYS I + IL Y+ R +    G
Sbjct: 340 HRYSTDRQWQGPHFEKMLYDQAQLAVSYSQAFQISGDEFYSDIAKGILQYVDRSLSHRSG 399

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAED+DSA   G  + +EGAFY+WT +E++ +L E  +          L  +HY L 
Sbjct: 400 GFYSAEDSDSAPERG-MQPREGAFYMWTVRELQCLLPEPVVGASEPLTVGQLLTKHYGLT 458

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  L +  DP  E +G+NVL        +A++ G+ +E    +L     KLF VR +
Sbjct: 459 EAGNVSLCQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVRGLLTSGLDKLFQVRKQ 516

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L  E                  +  A ++A F
Sbjct: 517 RPKPHLDSKMLTAWNGLMVSGYAVTGAVLGIE----------------RLVNRATNSAKF 560

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  T RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 561 LKRHMFDVATGRLKRTCYAGTGASVEHSTPPRWGFLEDYAFVVRGLLDLYEASQESAWLE 620

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D  GGGYF +  E  P + LRVK+D DGAEPS NSV+  NL+RL  
Sbjct: 621 WALRLQDTQDRLFWDSRGGGYFCSEAELGPGLPLRVKDDQDGAEPSANSVAAHNLLRLHG 680

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
               ++   +       L  F  R++ + +A+P M      LS   +  K +V+ G   +
Sbjct: 681 F---TRHKDWLDKCVCLLTAFSERMRRVPVALPEM---VRTLSTHQQGLKQIVICGDAQA 734

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            D + +L   H+ Y  NK +I    AD     F        +++ R     D+  A VC+
Sbjct: 735 KDTKALLQCVHSLYIPNKVLIL---ADGGPSSFLSRQLPFLSTLRRLE---DRATAYVCE 788

Query: 693 NFSCSPPVTDPISLENLL 710
           N +CS P+T+P  L  LL
Sbjct: 789 NQACSMPITEPCELRKLL 806


>gi|46485467|ref|NP_659076.2| spermatogenesis-associated protein 20 [Mus musculus]
 gi|81912951|sp|Q80YT5.1|SPT20_MOUSE RecName: Full=Spermatogenesis-associated protein 20; AltName:
           Full=Sperm-specific protein 411; Short=Ssp411; AltName:
           Full=Transcript increased in spermiogenesis 78 protein
 gi|29748049|gb|AAH50788.1| Spermatogenesis associated 20 [Mus musculus]
          Length = 790

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 295/738 (39%), Positives = 418/738 (56%), Gaps = 69/738 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LLN+ F+ + VDREERPD
Sbjct: 91  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNENFICVMVDREERPD 150

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L ++ D W 
Sbjct: 151 VDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 210

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
             ++ L ++     ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF
Sbjct: 211 LNKNTLLENS----QRVTTALLARSEISVGDRQIPASAATMNSRCFQQLDEGYDEEYGGF 266

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 267 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIQDHVGQGF 321

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QL+ VY  AF ++ D FY+ + + IL Y+ R +    G
Sbjct: 322 HRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQISGDEFYADVAKGILQYVTRTLSHRSG 381

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  + +EGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 382 GFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQQLLPEPVVGASEPLTSGQLLMKHYGLS 440

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN + S+  DP+ E  G+NVL+       +A++ G+ +E    +L     KLF  R  
Sbjct: 441 EVGNINSSQ--DPNGELHGQNVLMVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 498

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+ HLD+K++ +WNGL++S FA     L  E   A                 A S A F
Sbjct: 499 RPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEKLVAQ----------------ATSGAKF 542

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 543 LKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 602

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD+LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL S
Sbjct: 603 WALRLQDTQDKLFWDPRGGGYFCSEAELGADLPLRLKDDQDGAEPSANSVSAHNLLRLHS 662

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
              G K   +       L  F  R++ + +A+P M      LS   +  K +V+ G   +
Sbjct: 663 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEM---VRTLSAQQQTLKQIVICGDPQA 716

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            D + +L   H+ Y  NK +I    AD +   F        +S+ R     D+    + +
Sbjct: 717 KDTKALLQCVHSIYVPNKVLIL---ADGDPSSFLSRQLPFLSSLRR---VEDRATVYIFE 770

Query: 693 NFSCSPPVTDPISLENLL 710
           N +CS P+TDP  L  LL
Sbjct: 771 NQACSMPITDPCELRKLL 788


>gi|194217119|ref|XP_001499729.2| PREDICTED: spermatogenesis-associated protein 20-like [Equus
           caballus]
          Length = 889

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 296/736 (40%), Positives = 417/736 (56%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPD
Sbjct: 190 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPD 249

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF T+L+++++ W 
Sbjct: 250 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFHTVLQRIREQWK 309

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 310 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 365

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 366 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 420

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R++    G
Sbjct: 421 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVTRNLSHRSG 480

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADS    G  R KEGAFYVWT KEV+ +L E             L  +HY L 
Sbjct: 481 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQQLLPEPVPGATEPLTSGQLLMKHYGLT 539

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E  G+NVL        +A++ G+ ++    +L     KLF  R  
Sbjct: 540 EAGN--ISSNQDPKGELHGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKH 597

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L  E    + N+ +             + A F
Sbjct: 598 RPKPHLDSKMLAAWNGLMVSGYAVTGAVLGLE---RLINYAI-------------NCAKF 641

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 642 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEATQESAWLE 701

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 702 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 761

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G   +  
Sbjct: 762 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKG 817

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    AD +   F        +++ R     D+  A +  + 
Sbjct: 818 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DRATAYIYGSQ 871

Query: 695 SCSPPVTDPISLENLL 710
            CS PVT+P  L  LL
Sbjct: 872 VCSLPVTEPCELRKLL 887


>gi|148683976|gb|EDL15923.1| spermatogenesis associated 20, isoform CRA_b [Mus musculus]
          Length = 796

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 295/738 (39%), Positives = 418/738 (56%), Gaps = 69/738 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LLN+ F+ + VDREERPD
Sbjct: 97  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNENFICVMVDREERPD 156

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L ++ D W 
Sbjct: 157 VDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 216

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
             ++ L ++     ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF
Sbjct: 217 LNKNTLLENS----QRVTTALLARSEISVGDRQIPASAATMNSRCFQQLDEGYDEEYGGF 272

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 273 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIQDHVGQGF 327

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QL+ VY  AF ++ D FY+ + + IL Y+ R +    G
Sbjct: 328 HRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQISGDEFYADVAKGILQYVTRTLSHRSG 387

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  + +EGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 388 GFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQQLLPEPVVGASEPLTSGQLLMKHYGLS 446

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN + S+  DP+ E  G+NVL+       +A++ G+ +E    +L     KLF  R  
Sbjct: 447 EVGNINSSQ--DPNGELHGQNVLMVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 504

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+ HLD+K++ +WNGL++S FA     L  E   A                 A S A F
Sbjct: 505 RPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEKLVAQ----------------ATSGAKF 548

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 549 LKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 608

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD+LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL S
Sbjct: 609 WALRLQDTQDKLFWDPRGGGYFCSEAELGADLPLRLKDDQDGAEPSANSVSAHNLLRLHS 668

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
              G K   +       L  F  R++ + +A+P M      LS   +  K +V+ G   +
Sbjct: 669 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEM---VRTLSAQQQTLKQIVICGDPQA 722

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            D + +L   H+ Y  NK +I    AD +   F        +S+ R     D+    + +
Sbjct: 723 KDTKALLQCVHSIYVPNKVLIL---ADGDPSSFLSRQLPFLSSLRR---VEDRATVYIFE 776

Query: 693 NFSCSPPVTDPISLENLL 710
           N +CS P+TDP  L  LL
Sbjct: 777 NQACSMPITDPCELRKLL 794


>gi|391227735|ref|ZP_10263942.1| thioredoxin domain containing protein [Opitutaceae bacterium TAV1]
 gi|391223228|gb|EIQ01648.1| thioredoxin domain containing protein [Opitutaceae bacterium TAV1]
          Length = 734

 Score =  509 bits (1310), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 295/730 (40%), Positives = 402/730 (55%), Gaps = 44/730 (6%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F      ++  FL    +TCHWCHVM  ESFE+E VA +LN  FVSIKVDREERPD
Sbjct: 27  GEEAFARARAEQKPIFLSIGYSTCHWCHVMARESFENEAVAAVLNKHFVSIKVDREERPD 86

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW- 117
           VDKVYM YVQA+ G GGWPLSV+L+PDLKP  GGTYFPPED+ GR G  ++L  +   W 
Sbjct: 87  VDKVYMAYVQAMTGHGGWPLSVWLAPDLKPFYGGTYFPPEDRSGRSGLLSVLDVIARGWN 146

Query: 118 --DKKRDMLAQS--------GAFAIEQLSEALSASASSNKLPD--ELPQNALRLCAEQLS 165
             D++R  +A+S        G +A +Q+         +  +P   E   +A   C  QL 
Sbjct: 147 DDDERRKFVAESSRVIDVLAGYYAGKQVR-----PDPATPMPPLYETGGDAFERCYLQLG 201

Query: 166 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGI 225
           +S+DS  GGFG APKFPR   +  +   +       ++G   E   M   TL+ M  GGI
Sbjct: 202 ESFDSTHGGFGGAPKFPRASNLDFLFRVAAIQGPETETGR--EAVSMAASTLRHMIAGGI 259

Query: 226 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLR 285
           HDHVGGGFHRYSVD+ W VPHFEKMLYDQ Q+A   LDA   T D  Y++  R  LDY+ 
Sbjct: 260 HDHVGGGFHRYSVDDAWFVPHFEKMLYDQAQIAVNLLDAALFTGDERYAWAARATLDYVL 319

Query: 286 RDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKP 344
           RD+  P G  FSAEDAD+A   GAT   EGAFYVWT+ E+   L  + A L + H  + P
Sbjct: 320 RDLTHPDGGFFSAEDADAAPAHGATEHVEGAFYVWTAGELRRALSPDAARLVESHLGINP 379

Query: 345 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 404
               ++    DPH E +GKN+L ++   + +A+ LG+        L      L  +R+ R
Sbjct: 380 GPEGNVPPTLDPHGELRGKNILRQVRPLAETAAALGLEPAAAAERLAAALETLQAIRAAR 439

Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
           PRPHLDDKVI +WNGL +S+FARA+    +           +   R  Y++ A  AA F+
Sbjct: 440 PRPHLDDKVITAWNGLALSAFARAATSPAA----------CLDDRRDRYLDAARRAARFV 489

Query: 465 RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
            R L D     L  ++R     + GF +DYA  I+GLLDL++      WL  A  LQ T 
Sbjct: 490 ERELCDAGRGVLYRAWRGERGASEGFAEDYACFIAGLLDLHDATFDAHWLRLAERLQQTM 549

Query: 525 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
           D  F D   GGYFN+   DP ++LR+KED+DGAEP+ +S++  NL RL+S++     +  
Sbjct: 550 DARFRDEVAGGYFNSPAGDPHIVLRLKEDYDGAEPAPSSIAAANLQRLSSLL---HDETL 606

Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
              A  ++     +      A+P M CA + +L+ P +  VV+ G  ++  F  ++A   
Sbjct: 607 HARAVDTVEALRGQWSQTPHALPAMLCALERILAEPVQ--VVIAGDPAAPGFRALVAVVR 664

Query: 644 ASYDLNK-TVIHIDPA--DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
           A     +  +I + PA     + D W    +      R      +  A VCQ+++C PPV
Sbjct: 665 AQATRRRPALIGLVPAGGSDADADLWLRARAPWLDGMRPA-DGGQAAAYVCQHYTCQPPV 723

Query: 701 TDPISLENLL 710
           T P +L  LL
Sbjct: 724 TTPEALRQLL 733


>gi|301781214|ref|XP_002926022.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
           20-like [Ailuropoda melanoleuca]
          Length = 785

 Score =  508 bits (1307), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 294/736 (39%), Positives = 413/736 (56%), Gaps = 69/736 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGW     L+P+L+P +GGTYFPPED   R GF T+L ++++ W 
Sbjct: 150 VDKVYMTFVQATSSGGGW----XLTPNLQPFVGGTYFPPEDGLTRVGFHTVLLRIREQWK 205

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + +  L ++     ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF
Sbjct: 206 QNKTTLLENS----QRVTTALLARSEISMGDRQVPPSAATMNSRCFQQLDEGYDEEYGGF 261

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 262 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 316

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R++    G
Sbjct: 317 HRYSTDRQWHIPHFEKMLYDQAQLAVAYTQAFQISGDEFYSDVAKGILQYVARNLSHRSG 376

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGAFYVWT  EV+ +L E  +          LF +HY L 
Sbjct: 377 GFYSAEDADSPPERG-MRPKEGAFYVWTVNEVQQLLPEPVLGATEPLTSGQLFMKHYGLT 435

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ ++    +L     KLF  R  
Sbjct: 436 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKH 493

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L  E                  +  A + A F
Sbjct: 494 RPKPHLDSKMLAAWNGLMVSGYAVTGAVLGLE----------------RLITCAINGAKF 537

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D    RL  +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 538 LKRHMFDVARGRLMRTCYAGPGGTVEHSNPPSWGFLEDYAFVVRGLLDLYEASQESSWLE 597

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 598 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 657

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 658 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 713

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    A+ +   F        +++ R     D+  A VC+N 
Sbjct: 714 TKALLQCVHSIYIPNKVLIL---ANGDPSSFLSRQLPFLSTLRRLE---DRATAYVCENQ 767

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+T+P  L  LL
Sbjct: 768 ACSMPITEPNELRKLL 783


>gi|126343214|ref|XP_001376429.1| PREDICTED: spermatogenesis-associated protein 20 [Monodelphis
           domestica]
          Length = 744

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 291/741 (39%), Positives = 421/741 (56%), Gaps = 73/741 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCHVME ESF+++ + ++L++ FVSIKVDREERPD
Sbjct: 45  GQEAFDKAKKENKPIFLSVGYSTCHWCHVMEEESFQNKDIGQILSEDFVSIKVDREERPD 104

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+PDL+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 105 VDKVYMTFVQATSSGGGWPMNVWLTPDLQPFVGGTYFPPEDGVTRVGFRTVLLRIREQWK 164

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + + ML  +     ++++ +L A +       ELP +A  +   C +QL + YD   GGF
Sbjct: 165 QNKAMLMANS----QRVTASLLARSEICMGDRELPPSASAVSNRCFQQLEEVYDEEHGGF 220

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
              PKFP PV +  +   + + ++   G        Q+M + TL+ MA GGI DHVG GF
Sbjct: 221 AEVPKFPTPVILSFLFSYWATHRMATDG-----FRAQQMAMHTLKMMANGGIRDHVGQGF 275

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA  Y+ AF ++ D F++ I +DIL Y+ +++    G
Sbjct: 276 HRYSTDRQWHIPHFEKMLYDQAQLAVAYIQAFQISGDEFFADIAKDILQYVSQNLSHQSG 335

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLK 343
              SAEDADS   EG  + KEGA+Y+W  KE++D+L +             LF +HY + 
Sbjct: 336 GFCSAEDADSM-PEGEKKPKEGAYYLWKVKEIKDLLPDPVEGSNEPLTLGQLFMKHYGIT 394

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +    DPH E +G+NVL        +A++ G+  E    +L   R KL   R +
Sbjct: 395 ENGN--IGSTQDPHGELQGQNVLTVRYSMDLTAARYGLEAEAVRTLLDIGREKLIQTRKR 452

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RPRP LD K++ +WNGL++S +A     L +E                E ++ A   A F
Sbjct: 453 RPRPRLDSKMLAAWNGLMVSGYAITGATLGNE----------------EMIKQAIDGAKF 496

Query: 464 IRRHLYDEQTHRLQHSFRNGP--------SKAPGFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RHL+D  + RL      G         S+  GFL+DYAF+I GLLDLYE    + WL 
Sbjct: 497 LKRHLFDVSSGRLIRGCYAGAGGTVEQSSSQWWGFLEDYAFVIRGLLDLYEASRESAWLE 556

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA++LQ+ QD+LF D +GGGYF    E  + L LR+K+D DG+EPS NSVS  NL+R+  
Sbjct: 557 WALKLQDMQDKLFWDTQGGGYFCNEVELRNDLPLRLKDDQDGSEPSANSVSAHNLLRIHG 616

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
                + DY  +  +  L  F  RL  + +A+P M  A  ++   + K VV+ G   + D
Sbjct: 617 YTG--RRDYMEKCVK-LLTAFSDRLWKVPVALPEMVRAL-IIQQQTVKQVVICGSPQTTD 672

Query: 635 FENMLAAAHASYDLNKTVIHI--DPAD--TEEMDFWEEHNSNNASMARNNFSADKVVALV 690
            + ++   H+ Y  NK +I    DP+     ++ F          +AR +    +  A V
Sbjct: 673 TQALINCVHSVYVPNKVLILTDGDPSSFLARQLPF----------LARFHKLEGRATAYV 722

Query: 691 CQNFSCSPPVTDPISLENLLL 711
           C+N + S PVT+P  L  LLL
Sbjct: 723 CENQAYSMPVTEPAELRKLLL 743


>gi|194336238|ref|YP_002018032.1| hypothetical protein Ppha_1140 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194308715|gb|ACF43415.1| protein of unknown function DUF255 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 737

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 288/712 (40%), Positives = 406/712 (57%), Gaps = 62/712 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+  +AKLLN  FV +KVDREE PD+D++YM+YVQA  G GGWP+S
Sbjct: 70  STCHWCHVMEDESFENPEIAKLLNAHFVPVKVDREELPDLDRLYMSYVQASTGRGGWPMS 129

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-MLAQSGAFAIEQLSEA 138
           V+L+P+L P  GG+YFPPE++YG PGFKTIL  +   W+ +R+ ++++SG+F        
Sbjct: 130 VWLTPELNPFYGGSYFPPEERYGMPGFKTILITITRYWENEREKIISESGSFFA------ 183

Query: 139 LSASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
            S  A S   P   P  + A + C E L  +YD  FGGFG APKFPRPV +  +  H+  
Sbjct: 184 -SLGAVSRTTPSSQPDAEMAQKKCFEWLEANYDPMFGGFGRAPKFPRPVLLNFLFNHAYH 242

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKM 250
             D        +  +M L TL  MA+GGIHDH+      GGGF RYS D+RWHVPHFEKM
Sbjct: 243 TGD-------KKALRMALHTLHKMAEGGIHDHLGIIGKGGGGFARYSTDQRWHVPHFEKM 295

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD  QLA   L+AF  + D FY     DI +Y+  DM  P G  +SAEDAD+  T G+ 
Sbjct: 296 LYDNAQLAISCLEAFQCSGDNFYKRTAEDIFNYVLCDMRSPQGGFYSAEDADTLLTHGSE 355

Query: 311 RKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
           +K+EGA Y+W++ E+ + L   E A +F   Y ++  GN +     DPH EF GKN+L++
Sbjct: 356 QKQEGALYLWSADEIRETLADEELATIFSFTYGIRDEGNAEY----DPHGEFNGKNILMQ 411

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
                  A   G  +E+    L + R KL+  RS+RPR  LDDK++ +WNGL+IS+ A+ 
Sbjct: 412 QATDEECADTFGKTVEEIRAALDDARTKLYHARSRRPRAFLDDKILTAWNGLMISALAKG 471

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
            ++L +E                 ++  A  AA+FI   LYD+   RL   +R+G +   
Sbjct: 472 YQVLHNET----------------FLAAAREAANFILETLYDQANGRLLRRYRDGNAAIA 515

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           G  +DYAFL+ GL DLYE  S  ++L  A++L   Q+ LF D   GGYF+T  +D +V L
Sbjct: 516 GKAEDYAFLVQGLTDLYEASSEVRYLQIALQLAEIQNTLFYDNAQGGYFSTAIDDHTVPL 575

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R+KE++DGAEPS NS+S +NL+RLA +      D+ R+ AE ++      L + + A+P 
Sbjct: 576 RIKEEYDGAEPSANSISTLNLLRLAEMTG--NEDFVRR-AEETIKSCRIMLAENSSALPQ 632

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           M  A +  +   + H+V  G   S     +    +  Y    T+ H   A  E    +  
Sbjct: 633 MLVAKN-FAEQRKVHLVFSGPLDSSSMNELRQTVYEQYLPGATMSH---ASKESAHIFPS 688

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL----LEKPSS 716
           H    A +A+ + +A      +C + SC PP  +P  L  +L    L +P S
Sbjct: 689 H---AAIIAKEDGNAK---VYICIDKSCQPPTENPERLAAMLDSQFLHRPDS 734


>gi|449543699|gb|EMD34674.1| hypothetical protein CERSUDRAFT_86096 [Ceriporiopsis subvermispora
           B]
          Length = 737

 Score =  506 bits (1303), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 296/728 (40%), Positives = 415/728 (57%), Gaps = 51/728 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    +  +  FL    + CHWCHV+  ESFEDE  AK++N+ +V+IKVDREERPD
Sbjct: 39  GQEAFDAAKRHNKPIFLSVGYSACHWCHVLAHESFEDEVTAKIMNEHYVNIKVDREERPD 98

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD++YMT++QA  GGGGWP+SV+L+P+L P   GTYFP      +  F+ +L K+ + W+
Sbjct: 99  VDRLYMTFLQATTGGGGWPMSVWLTPELHPFFAGTYFP------QGQFRQVLLKLAEVWN 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
                 A+ G   IEQL  A S  A S  +P  +   ++ +   +L K YDSR GGFG A
Sbjct: 153 NDPARCAEVGKSVIEQLRNA-SNIAPSASIPS-ISAASISIY-RRLEKRYDSRHGGFGGA 209

Query: 179 PKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
           PKFP+P +    L  Y +  + DT    +A + + M + T+  +  GGI D VGGGF RY
Sbjct: 210 PKFPQPSQTTHFLARYAALNMRDTTTKKDAEQARDMAVETMVKIYNGGIRDVVGGGFSRY 269

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSL-----TKDVFYSYICRDILDYLRRDMIGP 291
           SVDERWHVPHFEKMLYD+GQL +  ++   L      +      +  DI+ Y+ RD+  P
Sbjct: 270 SVDERWHVPHFEKMLYDEGQLLSSAIELSLLLPCDAPERTTLQLMAADIVTYVARDLRSP 329

Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
            G  +SAEDADS  +  +T KKEGAFYVWT+K+++D+LG  A  FK H+ ++  GNCD S
Sbjct: 330 EGGFYSAEDADSLPSSDSTVKKEGAFYVWTAKQLDDLLGAEAEAFKYHFGVEAKGNCDPS 389

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLD 410
              D   E KG+NVL   +    +A K G  +E+   +L     KL + R K RPRPHLD
Sbjct: 390 H--DIQGELKGQNVLYTAHTPEETAKKFGRSIEETGQLLKGSLAKLKEYRDKERPRPHLD 447

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
           DK++  WNGL+IS  ++AS++L    E +           ++ +++AE +A+FIR+ LYD
Sbjct: 448 DKILTCWNGLMISGLSKASEVLDESFELS-----------EKALQLAEDSATFIRQRLYD 496

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
           E T  L+ S+R GP    G  DDYAFLI GLLDLYE     ++ +WAI LQ  QDELF D
Sbjct: 497 ESTGELRRSYREGPGPT-GQADDYAFLIQGLLDLYEASGKEEYALWAIRLQEKQDELFWD 555

Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
            EGGGYF ++  DP +L+R+K+  DGAEPS  SV+  NL RL S  A  +   Y++ A  
Sbjct: 556 SEGGGYF-SSAPDPHILVRMKDPQDGAEPSAQSVAFWNLQRL-SHFAEDRHGAYQEKARG 613

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
            L      L     A+  M   A +L+    K  + V   S  +  + L A H+ +   +
Sbjct: 614 VLETDAQILGQAPYALAAMVSGA-LLAEKGLKQFI-VTKPSYSEAASFLKAVHSRFIPQR 671

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASM--------ARNNFSADKVVALVCQNFSCSPPVTD 702
            +IH+DP          E    NA++           +  A +    VC+NF+C  PV D
Sbjct: 672 VLIHLDPEHPP-----RELAEVNATLRALIEDVDTNKDGDAKRASVRVCENFACGLPVED 726

Query: 703 PISLENLL 710
              +E +L
Sbjct: 727 LEEVEKML 734


>gi|149053889|gb|EDM05706.1| spermatogenesis associated 20 [Rattus norvegicus]
          Length = 745

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 289/736 (39%), Positives = 417/736 (56%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E +  LLN+ FVS+ VDREERPD
Sbjct: 46  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNENFVSVMVDREERPD 105

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L ++ D W 
Sbjct: 106 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 165

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 166 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 221

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S ++   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 222 AEAPKFPTPVILNFLFSYWLSHRVTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 276

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QL+ VY  AF ++ D F+S + + IL Y+ R++    G
Sbjct: 277 HRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAFQISGDEFFSDVAKGILQYVTRNLSHRSG 336

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADS    G  + +EGA Y+WT KEV+ +L E             L  +HY L 
Sbjct: 337 GFYSAEDADSPPERG-VKPQEGALYLWTVKEVQQLLPEPVGGASEPLTSGQLLMKHYGLS 395

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN + ++  D + E  G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 396 EAGNINPTQ--DVNGEMHGQNVLTVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 453

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+ HLD+K++ +WNGL++S FA A  +L  E                + +  A + A F
Sbjct: 454 RPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME----------------KLVTQATNGAKF 497

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 498 LKRHMFDVSSGRLKRTCYAGAGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 557

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+ QD+LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 558 WALRLQDIQDKLFWDSHGGGYFCSEAELGTDLPLRLKDDQDGAEPSANSVSAHNLLRLHG 617

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
           +  G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 618 LT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDPQAKD 673

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    AD +   F        +++ R     D+    + +N 
Sbjct: 674 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLSNLRR---VEDRATVYIFENQ 727

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 728 ACSMPITDPCELRKLL 743


>gi|427779347|gb|JAA55125.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 816

 Score =  504 bits (1297), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 311/785 (39%), Positives = 417/785 (53%), Gaps = 126/785 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE++ +AK++ND FV++KVDREERPDVD+VYMTY+QA  GGGGWP+S
Sbjct: 65  STCHWCHVMERESFENDDIAKIMNDNFVNVKVDREERPDVDRVYMTYIQATSGGGGWPMS 124

Query: 80  VFLSPDLKPLMGGTYFPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGA--FAI-EQL 135
           ++L+PDLKP++GGTYFPP+D+ YG+PGFKT+L  + + W K R  L   G   F I EQ 
Sbjct: 125 IWLTPDLKPVVGGTYFPPDDRYYGQPGFKTLLTSLAEQWRKNRTKLIDQGTRIFQILEQT 184

Query: 136 SE-----------ALSASASSNKLPDELPQNALRLCAEQ---------LSKSYDSR-FGG 174
           S+           +   S ++ K P     +    C  Q         L ++ D R FGG
Sbjct: 185 SDVRVFGGDGVPTSPRGSEANQKCP--FAPDVATTCYRQLXGTRIFQILEQTSDVRVFGG 242

Query: 175 ----------------------------------------FGSAPKFPRPVEIQMMLYHS 194
                                                   FG APKFP+ V +  +L + 
Sbjct: 243 DGVPTSPRGSEANQKCPFAPDVATTCYRQLERSYDVSMGGFGRAPKFPQCVNLNFLLRYR 302

Query: 195 KKLEDTGKSGEAS----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
             L       EA     +  +M + TL+ MA+GGIHDH+G GFHRYS D +WHVPHFEKM
Sbjct: 303 AVLLQGDPPPEAKTAVDKALEMTVHTLRMMAQGGIHDHIGKGFHRYSTDGKWHVPHFEKM 362

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYDQ QL   Y +A+ +T D   + + RDIL Y+ RD+  P G  +SAEDADS    G  
Sbjct: 363 LYDQAQLTRTYSEAYQVTHDRRLADVARDILCYVERDLSHPSGGFYSAEDADSYPEHGDK 422

Query: 311 RKKEGAFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
            K+EGAF VW   EV  +L E          A +   +Y ++ +GN D   M DPH+E K
Sbjct: 423 EKREGAFCVWEESEVYRLLTEPLPSCPTKTVADIVCRYYDIRKSGNVD--PMQDPHDELK 480

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
            KNVLI      + A+  G+ +     +L   R  LF+ R +RP+PHLDDK + SWNGL+
Sbjct: 481 RKNVLIVRESKESVAACYGLEVGVLDALLERARETLFEARLRRPKPHLDDKFLTSWNGLM 540

Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-F 480
           IS FA A++ L         N PV       Y++ A     FI++HLY+ +   L  S +
Sbjct: 541 ISGFAIAARTL---------NQPV-------YLDRALKCVEFIKKHLYNPKKKTLIRSAY 584

Query: 481 R-------NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
           R        G     G L+DYAFLI  LLD+YE       L+WA ELQ+ QD LF D++ 
Sbjct: 585 RGEDGSVVQGSQPIDGVLEDYAFLIQALLDVYEASFDVSCLMWAEELQDKQDRLFWDKKD 644

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
            GYF + GEDP+V+LR+K+D DGAEPS NSVS+ NLVRL+ ++   + D  RQ AE   +
Sbjct: 645 MGYFLSNGEDPTVVLRLKDDQDGAEPSSNSVSLNNLVRLSVLL---QRDELRQRAEKLAS 701

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
           V+  R+  + +A+P M C    L     + VV+ G +     + +L+     +    TVI
Sbjct: 702 VYGQRMILVPLALPEMVCGLMRLQA-GPQEVVIAGPRDDPGTKELLSCLRRHFLPFVTVI 760

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSA-----DKVVALVCQNFSCSPPVTDPISLEN 708
             D           +   N       NF        K  A VCQ+F CS PVT    LE 
Sbjct: 761 LAD-----------QDPENPLRKRLTNFDGYTCVNGKPAAYVCQDFQCSKPVTTAAELEA 809

Query: 709 LLLEK 713
           LL  K
Sbjct: 810 LLTAK 814


>gi|373850029|ref|ZP_09592830.1| hypothetical protein Opit5DRAFT_0884 [Opitutaceae bacterium TAV5]
 gi|372476194|gb|EHP36203.1| hypothetical protein Opit5DRAFT_0884 [Opitutaceae bacterium TAV5]
          Length = 734

 Score =  504 bits (1297), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 293/730 (40%), Positives = 402/730 (55%), Gaps = 44/730 (6%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F      ++  FL    +TCHWCHVM  ESFE+E VA +LN+ FVSIKVDREERPD
Sbjct: 27  GEEAFARARAEQKPIFLSIGYSTCHWCHVMARESFENEAVAAVLNEHFVSIKVDREERPD 86

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYM YVQA+ G GGWPLSV+L+PDLKP  GGTYFPPED+ GR G  ++L  +   W+
Sbjct: 87  VDKVYMAYVQAMTGHGGWPLSVWLAPDLKPFYGGTYFPPEDRSGRSGLLSVLDVIIQGWN 146

Query: 119 ---KKRDMLAQS--------GAFAIEQLSEALSASASSNKLPD--ELPQNALRLCAEQLS 165
              ++R  +A+S        G +A +Q+         +  +P   E   +A   C  QL 
Sbjct: 147 DDGERRKFVAESSRVIDVLAGYYAGKQVR-----PDPATPMPPLYETGGDAFERCYLQLG 201

Query: 166 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGI 225
           +S+DS  GGFG APKFPR   +  +   +       ++G   E   M   TL+ M  GGI
Sbjct: 202 ESFDSTHGGFGGAPKFPRASNLDFLFRVAAIQGPETETGR--EAVSMAASTLRHMIAGGI 259

Query: 226 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLR 285
           HDHVGGGFHRYSVD+ W VPHFEKMLYDQ Q+A   LDA   T D  Y++  R  LDY+ 
Sbjct: 260 HDHVGGGFHRYSVDDAWFVPHFEKMLYDQAQIAVNLLDAALFTGDERYAWAARATLDYVL 319

Query: 286 RDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKP 344
           RD+  P G  FSAEDAD+A   GAT   EGAFYVWT+ E+   L  + A L + H  + P
Sbjct: 320 RDLTHPDGGFFSAEDADAAPAHGATEHVEGAFYVWTADELRRALSPDAARLVESHLGINP 379

Query: 345 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 404
               ++    DPH E +GKN+L ++   + +A+ LG+        L      L  +R+ R
Sbjct: 380 GSEGNVPPALDPHGELRGKNILRQVRPLAETAAALGLEPAAAAERLAAALETLQAIRTAR 439

Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
           PRPHLDDKVI +WNGL +S+FARA+    +           +   R  Y++ A  AA F+
Sbjct: 440 PRPHLDDKVITAWNGLALSAFARAATSPAA----------CLDDRRDRYLDAARRAARFV 489

Query: 465 RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
            R L D     L  ++R     + GF +DYA  I+GLLDL++      WL  A  LQ T 
Sbjct: 490 ERELCDAGRGVLYRAWRGERGASEGFAEDYACFIAGLLDLHDATFDAHWLRLAERLQQTM 549

Query: 525 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
           D  F D   GGYFN+   DP ++LR+KED+DGAEP+ +S++  NL RL+S++     +  
Sbjct: 550 DARFRDEIAGGYFNSPAGDPHIVLRLKEDYDGAEPAPSSIAASNLQRLSSLL---HDETL 606

Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
              A  ++     +      A+P M CA + +L+ P +  VV+ G  ++  F  ++A   
Sbjct: 607 HARAVDTVEALRGQWSQTPHALPAMLCALERILAEPVQ--VVIAGDPAAPGFRALVAVVR 664

Query: 644 ASYDLNK-TVIHIDPA--DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
           A     +  +I + PA     + D W    +      R      +  A VCQ+++C  PV
Sbjct: 665 AQATRRRPALIGLVPAGGSDADADLWLRARAPWLDGMRPA-DGGQAAAYVCQHYTCQSPV 723

Query: 701 TDPISLENLL 710
           T P +L  LL
Sbjct: 724 TTPEALRQLL 733


>gi|40786501|ref|NP_955434.1| spermatogenesis-associated protein 20 [Rattus norvegicus]
 gi|81871190|sp|Q6T393.1|SPT20_RAT RecName: Full=Spermatogenesis-associated protein 20; AltName:
           Full=Sperm-specific protein 411; Short=Ssp411
 gi|38156445|gb|AAR12892.1| sperm protein SSP411 [Rattus norvegicus]
          Length = 789

 Score =  503 bits (1296), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 288/736 (39%), Positives = 417/736 (56%), Gaps = 65/736 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E +  LLN+ FVS+ VDREERPD
Sbjct: 90  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNENFVSVMVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L ++ D W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S ++   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRVTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QL+ VY  AF ++ D F+S + + IL Y+ R++    G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAFQISGDEFFSDVAKGILQYVTRNLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADS    G  + +EGA Y+WT KEV+ +L E             L  +HY L 
Sbjct: 381 GFYSAEDADSPPERG-VKPQEGALYLWTVKEVQQLLPEPVGGASEPLTSGQLLMKHYGLS 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN + ++  D + E  G+NVL   +    + ++ G+ +E    +L     KLF  R  
Sbjct: 440 EAGNINPTQ--DVNGEMHGQNVLTVRDSLELTGARYGLEVEAVRALLNTGLEKLFQARKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+ HLD+K++ +WNGL++S FA A  +L  E                + +  A + A F
Sbjct: 498 RPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME----------------KLVTQATNGAKF 541

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 542 LKRHMFDVSSGRLKRTCYAGAGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 601

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+ QD+LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 602 WALRLQDIQDKLFWDSHGGGYFCSEAELGTDLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
           +  G K   +       L  F  R++ + +A+P M  A       + K +V+ G   + D
Sbjct: 662 LT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDPQAKD 717

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    AD +   F        +++ R     D+    + +N 
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLSNLRR---VEDRATVYIFENQ 771

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 772 ACSMPITDPCELRKLL 787


>gi|409047490|gb|EKM56969.1| hypothetical protein PHACADRAFT_92450 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 717

 Score =  501 bits (1290), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 284/700 (40%), Positives = 406/700 (58%), Gaps = 58/700 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHV+  ESFEDE  AKL+N+ +V++KVDREERPDVD++YMT++QA  GGGGWP+S
Sbjct: 62  SACHWCHVLAHESFEDEVTAKLMNERYVNVKVDREERPDVDRLYMTFLQATSGGGGWPMS 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL P   GTYFP      +  F+  L K+ + W++ R+ L +SG   IEQL  + 
Sbjct: 122 VWLTPDLHPFFAGTYFP------KGQFRQALEKLANFWEEDRERLVESGKGIIEQLKSSS 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE- 198
           +AS  S                ++L + YDS  GGFG APKFP P +    L     L  
Sbjct: 176 NASICSQ-------------VYKRLERLYDSVHGGFGGAPKFPSPSQTTHFLARLAALNI 222

Query: 199 -DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            D     EA + + M + T+  +  GGI D VGGGF RYSVD+ WHVPHFEKMLYD+ QL
Sbjct: 223 GDEKLKSEALKARDMAVQTMVKIYNGGIRDVVGGGFSRYSVDDHWHVPHFEKMLYDEAQL 282

Query: 258 ANVYLDAFSL-----TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
            +  L+   L      +      +  DI+ Y+ RD+    G  +SAEDADS  +  +T K
Sbjct: 283 LSSALELAQLLPIDSVECKTLEAMANDIIIYVSRDLRNSEGAFYSAEDADSLPSSDSTIK 342

Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           KEGAFYVWTS +++++LG+++ +FK HY +K  GNCD     D   E KG+NVL   +  
Sbjct: 343 KEGAFYVWTSAQLDELLGDNSDVFKFHYGVKSNGNCDPKH--DVQGELKGQNVLYTAHTV 400

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKI 431
             +A K G+P E+    L +C   L   R + RPRPHLDDK++  WNGL++S  A+AS++
Sbjct: 401 EDTARKFGIPAEQVQVTLDQCLAHLKRYRDENRPRPHLDDKILTCWNGLMLSGLAKASEV 460

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L+ +A +A              +++AE +A+FI++ LYDE+T  L+ S+R GP    G  
Sbjct: 461 LEGQAANA--------------LKLAEDSAAFIKKELYDEKTGELRRSYRQGPGPT-GQA 505

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           DDYAFLI GLLDLYE     +++ WAI LQ  QDELF D EGGGYF  +  DP +L+R+K
Sbjct: 506 DDYAFLIQGLLDLYEASGKEEYVTWAIRLQEKQDELFHDTEGGGYF-ASAPDPHILVRMK 564

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           +  DGAEPS  SV++ NL RLA   A  +   YR+ A+  L      L+    A+  M  
Sbjct: 565 DAQDGAEPSAVSVTLYNLNRLAHF-AEDRHGEYREKAQSILRSNSQLLEHAPFALATMVS 623

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA----DTEEMDFWE 667
           AA + +    +  ++ G  S+ D    L A   ++  ++ +IH+DP     +  +++   
Sbjct: 624 AA-LTAQRGYRQFIVSGEASNSDTTRFLHAIRHTFVPSRVLIHLDPQRPPRELAKLNGTL 682

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
               ++++ AR N         +C+NF+C  P+ DP  L+
Sbjct: 683 RALMDDSANARPNVR-------LCENFACGLPIYDPKELK 715


>gi|395536753|ref|XP_003770376.1| PREDICTED: spermatogenesis-associated protein 20 [Sarcophilus
           harrisii]
          Length = 744

 Score =  501 bits (1289), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 285/739 (38%), Positives = 413/739 (55%), Gaps = 69/739 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F       +  FL    +TCHWCHVME ESF ++ + ++L++ FVS+KVDREE PD
Sbjct: 45  GQEAFDKAKNENKPIFLSVGYSTCHWCHVMEEESFRNKEIGEILSEDFVSVKVDREEHPD 104

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+PDL+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 105 VDKVYMTFVQATSSGGGWPMNVWLTPDLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 164

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNA---LRLCAEQLSKSYDSRFGGF 175
           + + ML ++     ++++ +L A +       ELP  A    + C +QL + YD   GGF
Sbjct: 165 QNKAMLLENS----QRVTASLLARSEITVGDRELPPTASAVSKRCFQQLEEVYDEEHGGF 220

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
             APKFP PV +  +  +      T    E    Q+M + +L+ MA GGI DHVG GFHR
Sbjct: 221 AEAPKFPTPVILSFLFSYWAAHRMT---SEGFRAQQMAMHSLKMMANGGIRDHVGQGFHR 277

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           YS D +WH+PHFEKMLYDQ QLA  Y  AF ++ D  +S + + IL Y+ +++  P G  
Sbjct: 278 YSTDRQWHIPHFEKMLYDQAQLAVAYTQAFQVSGDELFSDVAKGILQYVSQNLSHPSGGF 337

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPT 345
           +SAEDADS   EG  + KEGA+Y+WT  E++D+L E             LF +HY +  T
Sbjct: 338 YSAEDADSV-PEGEVKPKEGAYYLWTVNEIKDLLPEPVEGATEPLSLGQLFMKHYGVTET 396

Query: 346 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 405
           GN  +    DP  E +G+NVL        +A++ G+  E    +L   R KL  +R +R 
Sbjct: 397 GN--IGSTQDPQGELQGQNVLTVRYSMDLTAARFGLEAETVRKLLDTGREKLVQIRKRRS 454

Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
           RP LD K++ +WNG+++S +A A  +L  E                E +  A   A F++
Sbjct: 455 RPRLDIKMLAAWNGMMVSGYAIAGAVLGKE----------------ELINQAIDGAKFLK 498

Query: 466 RHLYDEQTHRLQH--------SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWA 517
           RHL+D  + RL          +     S+  GFL+DYAF+I GLLDLYE    + WL WA
Sbjct: 499 RHLFDVSSGRLFRGCYATIGGTVEQSSSQFWGFLEDYAFVIRGLLDLYEASGESAWLEWA 558

Query: 518 IELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIV 576
           + LQ+ QD+LF D +GGGYF +  E    L LR+K+D DG+EPS NSVS  NL+R+ +  
Sbjct: 559 LRLQDMQDKLFWDTQGGGYFCSEAELGGNLPLRLKDDQDGSEPSANSVSAHNLLRIHAYT 618

Query: 577 AGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFE 636
              + D+  +  +  L  F  RL+ + +A+P M  A   +   + K +V+ G     D +
Sbjct: 619 G--RRDWMDKCVK-LLTAFSDRLRRVPVALPEMVRAL-CIQQQTIKQIVICGSPQGQDTK 674

Query: 637 NMLAAAHASYDLNKTVIHID--PAD--TEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            ++   H+ Y  NK +I  D  P+     ++ F          + R      +  A VC+
Sbjct: 675 ALIDCVHSIYVPNKVLILYDGEPSSFLARQLPF----------LVRLQKVDSQATAYVCE 724

Query: 693 NFSCSPPVTDPISLENLLL 711
           N + S PVT+P  L  LLL
Sbjct: 725 NQAYSLPVTEPAELRKLLL 743


>gi|431890790|gb|ELK01669.1| Spermatogenesis-associated protein 20 [Pteropus alecto]
          Length = 777

 Score =  498 bits (1282), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 293/736 (39%), Positives = 415/736 (56%), Gaps = 77/736 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPD
Sbjct: 90  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRIGFRTVLLRIREQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +   
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISTGDRQLPPSAATMNSRCFQQLDEGYDEEY--- 262

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
                    V +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 ---------VILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 308

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQGQLA  Y  AF ++ D FYS + + IL Y+ R++    G
Sbjct: 309 HRYSTDRQWHVPHFEKMLYDQGQLAVAYSQAFQISGDEFYSDVAKGILQYVSRNLSHRSG 368

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADS    G  R KEGAFYVWT KEV+ +L E             L  +HY L 
Sbjct: 369 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQQLLPESVHGATEPLTSGQLLMKHYGLT 427

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 428 EAGN--ISPNQDPKGELQGQNVLTVRYSLELTAARFGLDVEAIRTLLNTGLEKLFQARKH 485

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L  E    + N+             A + A F
Sbjct: 486 RPKPHLDSKMLAAWNGLMVSGYAITGAVLGME---RLVNY-------------ATNGAKF 529

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 530 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASLESAWLE 589

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD+LF D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 590 WALRLQDTQDKLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 649

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   + +     L  F  R++ + +A+P M  A  +    + K +V+ G   + D
Sbjct: 650 FT-GHKD--WMEKCVCLLTAFSERMRRVPVALPEMVRAL-LAHQQTLKQIVICGDPQAKD 705

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD +   F         ++ R     D+  A VC+N 
Sbjct: 706 TKALVQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRR---LEDRATAYVCENQ 759

Query: 695 SCSPPVTDPISLENLL 710
           +CS PVT+P  L  LL
Sbjct: 760 ACSMPVTEPSELRKLL 775


>gi|110598780|ref|ZP_01387040.1| Protein of unknown function DUF255 [Chlorobium ferrooxidans DSM
           13031]
 gi|110339607|gb|EAT58122.1| Protein of unknown function DUF255 [Chlorobium ferrooxidans DSM
           13031]
          Length = 712

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 276/669 (41%), Positives = 386/669 (57%), Gaps = 56/669 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  R  FL    +TCHWCHVME ESFE+  +A++LN +FV +KVDREE PD
Sbjct: 33  GEEAFEKAERENRPIFLSVGYSTCHWCHVMERESFENPDIAEVLNRYFVPVKVDREELPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM YVQ+  G GGWP+SV+L+PD  P  GG+YFPPED+YG  GFKTIL  +   W+
Sbjct: 93  LDRLYMEYVQSTTGRGGWPMSVWLTPDRNPFYGGSYFPPEDRYGMTGFKTILLSIASLWE 152

Query: 119 KKRDML--AQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG 176
              + +  A SG F+  Q      A++ +  LP E    A   C   L  ++D  +GGF 
Sbjct: 153 SDEEKIRDASSGFFSDLQ----AFAASRAAALPPE--DEAQHNCFRWLESTFDPVYGGFS 206

Query: 177 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------G 230
            APKFPRPV +  +  H+        SG  S+ ++M LFTL+ MA+GGIHDH+      G
Sbjct: 207 GAPKFPRPVLLNFLFSHAY------YSGN-SKAREMALFTLRRMAEGGIHDHISVTGKGG 259

Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
           GGF RYS DERWHVPHFEKMLYD  QLA  YL+AF  + +  +  +  DI +Y+  DM  
Sbjct: 260 GGFARYSTDERWHVPHFEKMLYDNAQLAVSYLEAFQCSGEPLFRSVAEDIFNYVLSDMTA 319

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNC 348
           P G  +SAEDADS E+E  T KKEGAFY+W + E+ + +G  E A +F   Y ++  GN 
Sbjct: 320 PEGGFYSAEDADSLESESGTEKKEGAFYLWRADELHEAIGNAEQAAIFSFVYGVRAEGNA 379

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
               ++DPH EF G+N+L++      +A + G    +  ++L E RRKL+  RS RPRP 
Sbjct: 380 ----LNDPHGEFTGRNILMQQVSVEETAVRFGKTAVEIRDVLDEARRKLYTARSGRPRPF 435

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
           LDDK++ SWN L+IS+ ++  ++L SE                E +  A  AA F+   L
Sbjct: 436 LDDKILTSWNALMISALSKGFRVLHSE----------------ECLTAARKAADFLLETL 479

Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
           YD ++ RL   +R+G +   G +DDYAF +  L+DLYE      +L  A+EL   Q  LF
Sbjct: 480 YDRRSCRLLRRYRDGSAAIAGKVDDYAFFVQALIDLYEASFEIVYLKAALELAEVQKTLF 539

Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            D   GGYF++  +D +V +R KE +DGAEPS NSV+ +NL+RL  +    K ++  Q A
Sbjct: 540 CDALHGGYFSSASDDQTVPVRQKESYDGAEPSANSVTALNLLRLGELTG--KEEFALQ-A 596

Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK---HVVLVGHKSSVDFENMLAAAHAS 645
           E   + F T L   + A+P M  A +     +RK    ++  G   + + E + A A   
Sbjct: 597 EELFSAFGTTLASQSHALPQMLVALNF----ARKRGCRILFSGDLHATEMERLRAVAGER 652

Query: 646 YDLNKTVIH 654
           Y     V+H
Sbjct: 653 YLPGTVVMH 661


>gi|395328680|gb|EJF61071.1| hypothetical protein DICSQDRAFT_161788 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 791

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 284/704 (40%), Positives = 400/704 (56%), Gaps = 63/704 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHV+  ESFEDE  AK++N+++V+IKVDREERPDVD++YMT++QA  GGGGWP+S
Sbjct: 112 SACHWCHVLAHESFEDEVTAKIMNEYYVNIKVDREERPDVDRLYMTFLQATTGGGGWPMS 171

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL P   GTYFPP +      F+ +L K+ + W++  +    SG   IE L ++ 
Sbjct: 172 VWLTPDLHPFFAGTYFPPGN------FRQVLIKLAEIWERDPERCIASGKQIIEVLQQSS 225

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-----YHS 194
            A+  S      L +  L     QL K +D++ GGFG APKFP P +    L     Y+ 
Sbjct: 226 KAAPESGVDVKPLAEKILT----QLQKRFDAKEGGFGRAPKFPSPSQTMYPLARIAAYYL 281

Query: 195 KKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
                T +  E++E  + M +FT+  +  GGI D VGGGF RYSVDERWHVPHFEKMLYD
Sbjct: 282 NNSSATAQEKESAEKARDMAVFTMTKIYNGGIRDVVGGGFSRYSVDERWHVPHFEKMLYD 341

Query: 254 QGQLANVYLDAFSLTKD-----VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           + QL +  L+ + L             + +DI+ Y+ RD+  P G  +SAEDADS  +  
Sbjct: 342 EAQLLSSALELYQLLPSGSHDKTTLELMAKDIVSYVARDLRSPQGGFYSAEDADSLPSHE 401

Query: 309 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
           +T KKEGAFYVWT+K+++++L   A LFK H+ +K  GNCD S   D   E KG+NVL  
Sbjct: 402 STVKKEGAFYVWTAKQLDELLDADAELFKYHFGVKAEGNCDPSH--DIQGELKGQNVLFT 459

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFAR 427
            +    +A K G   E+    L      L + R+K RPRPHLDDK++  WNGL+IS  ++
Sbjct: 460 AHTLEETAQKFGKAYEEVQKTLEVNLATLREYRNKHRPRPHLDDKILACWNGLMISGLSK 519

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
             ++L S +E A           K+ +++AE +A+F+R HLYDE++  L  S+R GP   
Sbjct: 520 TYEVLHSHSEIA-----------KKALQLAEDSATFLRAHLYDEKSGTLWRSYREGPGPT 568

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
            G  DDYAFLI GLLDLYE  +  ++L+WA+ LQ  QDELF D EGGGYF  +  D  +L
Sbjct: 569 -GQADDYAFLIQGLLDLYEASAKEEYLLWALRLQEKQDELFYDPEGGGYF-ASAPDEHIL 626

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
           +R+K+  DGAEPS  SV+V NL RLA     + S +  +    +LA     LK    A+ 
Sbjct: 627 VRMKDAQDGAEPSAVSVAVSNLQRLAHFAEDNHSAFTEKTTS-TLASNGQFLKQAPHALA 685

Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDF--------ENMLAAAHASYDLNKTVIHIDPAD 659
            M  AA            L G K  + F           L    +++  N+ +IH DP++
Sbjct: 686 YMVSAA------------LTGEKGYMQFIYEGTSQDSPFLKLIRSTFIPNRVLIHFDPSN 733

Query: 660 TEEMDFWEEHNSNNASMA---RNNFSADKVVALVCQNFSCSPPV 700
                   +HN +  S+           +   ++C+NF+C  P+
Sbjct: 734 PPRG--IAKHNGSVRSLVEELEKKEGEHRENVMICENFTCGLPI 775


>gi|392558461|gb|EIW51649.1| hypothetical protein TRAVEDRAFT_137028 [Trametes versicolor
           FP-101664 SS1]
          Length = 739

 Score =  494 bits (1271), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 297/737 (40%), Positives = 411/737 (55%), Gaps = 67/737 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIK-VDREERP 57
           G+ +F    K  +  FL    + CHWCHV+  ESFEDE  AK++N+ +V++K VDREERP
Sbjct: 36  GQEAFDKAKKENKPIFLSVGYSACHWCHVLAHESFEDEITAKMMNEHYVNVKKVDREERP 95

Query: 58  DVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           DVD++YMT++QA  GGGGWP+SV+L+PDL P   GTYFPP    GR  F+ IL ++ D W
Sbjct: 96  DVDRLYMTFLQASTGGGGWPMSVWLTPDLHPFFAGTYFPP----GR--FRQILDRLADVW 149

Query: 118 DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL------CAEQLSKSYDSR 171
              R+   +S    +E L E      SSN  P   PQ+++ L        ++L K +D  
Sbjct: 150 TYDRERCIESAGKVLETLKE------SSNIAPS--PQDSVELKPLPQEVFQRLQKRFDGV 201

Query: 172 FGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGE----ASEGQKMVLFTLQCMAKGGI 225
            GGFG APKFP P +    L  Y +  L D   S E    A   + M ++++  +  GGI
Sbjct: 202 NGGFGGAPKFPSPAQTTHFLARYAASHLSDLNASNEDKKNAQAARDMAVYSMIKIYNGGI 261

Query: 226 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL----TKD-VFYSYICRDI 280
            D VGGGF RYSVDERWHVPHFEKMLYD+ QL +  LD + L    ++D      + +DI
Sbjct: 262 RDVVGGGFSRYSVDERWHVPHFEKMLYDEAQLLSSSLDLYQLLTTPSRDKKTLELMAKDI 321

Query: 281 LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHY 340
           + Y+  D+  P G  +SAEDADS  T  +  KKEGAFYVWTS++++++LG  A LF+ H+
Sbjct: 322 VSYVANDLRSPEGGFYSAEDADSLPTHDSIVKKEGAFYVWTSEQLDELLGADAELFEYHF 381

Query: 341 YLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDV 400
            ++  GNCD     D   E KG+NVL   + S  +A K G  +E    ILG   + L D 
Sbjct: 382 GVEADGNCDPGH--DIQGELKGQNVLFTAHTSEETADKFGKSVEDTEKILGAGLKTLRDY 439

Query: 401 RSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAES 459
           R K RPRPHLDDK++  WNGL+IS  AR S++L  + + A            + +++AE+
Sbjct: 440 RDKHRPRPHLDDKILTCWNGLMISGLARTSEVLGHDKDVA-----------SKALDMAEA 488

Query: 460 AASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 519
           +A+FIR HL+DEQ+ +L  S+R GP    G  DDYAFLI G LDLYE  +  + L+WA+ 
Sbjct: 489 SAAFIRGHLFDEQSGKLWRSYREGPGPT-GQADDYAFLIQGFLDLYEASANEEHLLWALR 547

Query: 520 LQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
           LQ  QDELF D E GGYF  +  D  +L+R+K+  DGAEPS  SV++ NL RLA +    
Sbjct: 548 LQEKQDELFYDPEDGGYF-ASAPDEHILIRMKDAQDGAEPSAVSVTLANLQRLAHLAEDR 606

Query: 580 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 639
            +D Y   A+  L+     L     A+  M   A M    + K  +   H  +     +L
Sbjct: 607 HAD-YNAKAKSILSSNGQLLTRAPFALASMVSGAMM----ADKGYMQFIHTGASSTSPLL 661

Query: 640 AAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM------ARNNFSADKVVALVCQN 693
               +++  N+ +IHIDP +        E    N S+              K    +C+N
Sbjct: 662 ELTRSTFIPNRVLIHIDPKNLP-----RELAKVNGSIRSLIEELERTGGETKENVRICEN 716

Query: 694 FSCSPPVTDPISLENLL 710
           F+C  P+ D   L   L
Sbjct: 717 FTCGLPIEDVDDLRTRL 733


>gi|223935696|ref|ZP_03627612.1| protein of unknown function DUF255 [bacterium Ellin514]
 gi|223895704|gb|EEF62149.1| protein of unknown function DUF255 [bacterium Ellin514]
          Length = 701

 Score =  493 bits (1269), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 289/715 (40%), Positives = 401/715 (56%), Gaps = 72/715 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVME ESFE E + K LN+ FVSIKVDREERPD
Sbjct: 52  GEEAFAKARKENKPIFLSIGYSTCHWCHVMERESFEKEEIGKYLNEHFVSIKVDREERPD 111

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YMT+VQ+  G GGWPL+ FL+PDLKP  GGTYFPPE KYGRP F  +L+ +   W+
Sbjct: 112 VDKIYMTFVQSTSGQGGWPLNCFLTPDLKPFYGGTYFPPESKYGRPSFLDLLKHINQLWE 171

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +   +  S     EQL++ ++A  ++N L   L Q  L   A QL + YDSR GGFG A
Sbjct: 172 TRHGDVTNSAVQLHEQLAQ-MTAKETTNGL--ALTQAVLNKAAGQLKEMYDSRNGGFGDA 228

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP+P +   +L +       G      E   MVL T   MA+GGIHD +GGGF RY+V
Sbjct: 229 PKFPQPSQPAFLLRY-------GVHSNDQEAIAMVLNTCDHMARGGIHDQIGGGFARYAV 281

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D +W VPHFEKMLYD  QL N+YLDA+ ++ +  Y+   RD++ Y+ RDM    G  +SA
Sbjct: 282 DAKWLVPHFEKMLYDNAQLVNLYLDAYLVSGETRYADTARDVIGYVLRDMTHAEGGFYSA 341

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDP 356
           EDADS   EG    KEG FY WT  E+  +L   E  +  K   Y   T   +    SDP
Sbjct: 342 EDADS---EG----KEGKFYCWTRVELAKLLTPEEFNVAVK---YFGITEGGNFVDHSDP 391

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
                 +NVL  ++ +   A +   PL      L   ++K+F  RSKR RPHLDDK++ S
Sbjct: 392 -EPLPNQNVLSIVDSNLPRADE---PL------LQSAKQKMFAARSKRVRPHLDDKILAS 441

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL++S+ ARA  +L                  KEY+  AE   SF++  L+D +T  L
Sbjct: 442 WNGLMLSAIARAYAVLGD----------------KEYLTAAEHNLSFLQSKLWDAKTKTL 485

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
            H +R+G        + YAFL++G++DLYE     + L +AI L +     F D   GG+
Sbjct: 486 YHRWRDGERDTAQLHETYAFLLNGVVDLYEATLDPRHLEFAISLADAMIAKFYDPAEGGF 545

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           + + G  P ++LR+KED+DGAEPSGNSV+ + L++LA+I    ++D YR+ AE ++ +F 
Sbjct: 546 WQSAGA-PDLILRIKEDYDGAEPSGNSVATLTLLKLAAIT--DRAD-YRKAAEGTMRLFA 601

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HI 655
            RL+    AVP M  A D  S+   K VV+ G+++  + + +L AAH+ Y   K V+ ++
Sbjct: 602 DRLQRFPQAVPYMLMAVD-FSLQEPKRVVIAGNRAEPEAQKLLRAAHSVYQPAKVVLGNV 660

Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            P +                 AR   +       +C   +C  P +D   ++ LL
Sbjct: 661 GPVE---------------EFARTLPAKQGATVYICTAKACQAPTSDAAKVKQLL 700


>gi|320168532|gb|EFW45431.1| spermatogenesis-associated protein 20 [Capsaspora owczarzaki ATCC
           30864]
          Length = 832

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 293/783 (37%), Positives = 415/783 (53%), Gaps = 118/783 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME +SF + G+A ++N  FV+IKVDREERPDVD+VYM ++ A  G GGWP+S
Sbjct: 65  STCHWCHVMEEQSFMNPGIASIMNKNFVNIKVDREERPDVDRVYMAFITATTGHGGWPMS 124

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+L P+ GGTYFPPEDK+G PGF  +L K+   W  +RD +   G   ++ L + +
Sbjct: 125 VWLTPELTPIFGGTYFPPEDKWGTPGFPFLLAKIAALWSSRRDEILLKGRGIMQLLEQGI 184

Query: 140 SASASSNKLPDE---------LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
            A     +  +E           ++ L L   +  + +D + GGFG APKFPRPV +Q +
Sbjct: 185 DARLQPTEESNEGAVSDAKQDSARDWLELAFTKFEEEFDPQLGGFGGAPKFPRPVILQFL 244

Query: 191 L------------YHSKKLEDTGKSGEAS------------------------------- 207
           L              ++  + T     AS                               
Sbjct: 245 LNLYAHFSRVTASLKAQATDATPSPTSASPRLAGAPVAAAAATTLSASPKLKGSRRLSVA 304

Query: 208 -----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 262
                +  +M   TL  M +GG++DH+GGGFHRYSVD+ WHVPHFEKML+DQ QLA  Y 
Sbjct: 305 ERNCLQTMRMCTTTLDAMHRGGLYDHLGGGFHRYSVDQFWHVPHFEKMLFDQAQLALTYA 364

Query: 263 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 322
             F LT+   Y+ +CRD L Y+ RD+  P G  FSAEDADS  +  +  K EGA+YVW+ 
Sbjct: 365 MGFQLTRIPAYAQVCRDTLAYVLRDLAHPLGGFFSAEDADSLPSVTSESKSEGAYYVWSY 424

Query: 323 KEVEDILGE------------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           +E+   L +               +F   + ++P GN  + R S+PH E   KN L +  
Sbjct: 425 EEISTTLSQGDCAAGVASNATDLAVFCYAFGVRPQGN--IRRESNPHGELARKNHLFQEY 482

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
               +A    +PL    N L   R +L  +R+ RPRPHLDDK+I +WNGL+IS+ A+A  
Sbjct: 483 TLQETADHFHLPLADVANRLENARARLHGIRAARPRPHLDDKIIAAWNGLMISALAKAGG 542

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPG 489
           ++    E  +F            +  A+ AA F+R  +Y+ ++ +L  S+R+G  SK  G
Sbjct: 543 VV----EEPLF------------IHAAQKAARFLRGSMYNTESGQLVRSWRDGSASKVGG 586

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLL 548
           FL DYAF+I GLLDLYE    T WL WA++LQ+ QDELF D   GGGYF T+  DPS+L+
Sbjct: 587 FLSDYAFVIQGLLDLYEVDGDTTWLEWALQLQSKQDELFHDPNGGGGYFVTSTHDPSILV 646

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R+K + D AEP+GNS++ INL+RLA++V   +    R  A   +   +    +   A+P+
Sbjct: 647 RLKCEEDSAEPAGNSIAAINLLRLANLVNRPE---MRDRAAALITSHQFLFSNAPTALPM 703

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFEN-----------MLAAAHASYDLNKTVIHIDP 657
           M  A   L  P+ + VVLV   S  D                AA+ A+ +L   V+    
Sbjct: 704 MLSALQFLHSPNVQ-VVLVTKNSPTDVPKPKDEPTRPAAAASAASEAATELQSVVLSQCF 762

Query: 658 ADTEEMDFWEEHNSNNAS--MARNNFSA--------DKVVALVCQNFSCSPPVTDPISLE 707
              + +     H  ++AS    RN   A        ++  A VCQ+F+C  PVT    L 
Sbjct: 763 IPFKSI----VHLQSDASRRFLRNKLPAVDDYQMIDNQPTAYVCQSFACQAPVTSVRELR 818

Query: 708 NLL 710
            LL
Sbjct: 819 TLL 821


>gi|189346882|ref|YP_001943411.1| hypothetical protein Clim_1372 [Chlorobium limicola DSM 245]
 gi|189341029|gb|ACD90432.1| protein of unknown function DUF255 [Chlorobium limicola DSM 245]
          Length = 706

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 288/703 (40%), Positives = 393/703 (55%), Gaps = 69/703 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE+E  A+LLN  F+ +KVDREE PD+D++YMTYVQA  G GGWP+SV
Sbjct: 55  TCHWCHVMERESFENEETARLLNGSFIPVKVDREELPDLDRLYMTYVQASTGRGGWPMSV 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           +L+PDLKP  GG+YFPPED+YG PGF+T+L  +   W+     + ++     EQL    S
Sbjct: 115 WLTPDLKPFYGGSYFPPEDRYGMPGFRTVLTSIAQLWNTDPARITEASRIFFEQLQS--S 172

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           +    + LP++    A   C   L+ +YD   GGFG APKFPRP  +  +  H+     T
Sbjct: 173 SPMGKSGLPEK--GEAQEACFRWLASAYDPLRGGFGGAPKFPRPALLTFLFSHAFH---T 227

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQ 254
           G    AS    M L TL+ MA+GGIHDHV      GGGF RYS DERWH+PHFEKMLYD 
Sbjct: 228 GNREAAS----MALHTLKKMAEGGIHDHVHSMGKGGGGFARYSTDERWHLPHFEKMLYDN 283

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            QLA  YL+AF ++ +  ++ I  DI +Y+  DM  P G  +SAEDADS        K+E
Sbjct: 284 AQLAASYLEAFQISGETLFARIAEDIFNYILHDMQSPEGGFYSAEDADSFPDGETQEKRE 343

Query: 315 GAFYVWTSKEVEDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           GAFYVW+ KEV  +  E     LF   Y +KP GN       DPH EF GKNVL+E +  
Sbjct: 344 GAFYVWSWKEVMSLPAEPDKLELFARTYGMKPEGNVS----EDPHGEFGGKNVLMEQSAP 399

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
                      +  +  L E R+ L++ R +R RP LDDK+I SWNGL+IS+FA+  ++L
Sbjct: 400 EKHE-------KDTVAALDEVRQLLYEKRLQRSRPLLDDKIITSWNGLMISAFAKGYRVL 452

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
             E                EY+  A +AA FI  HLY+E   RL   +R+G +   G  +
Sbjct: 453 GHE----------------EYLRAARNAADFILVHLYEENEGRLLRRYRDGDAAITGKAE 496

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAF + GL+DLY+     ++L  A  L  T + LF D   GGYF+T  +D +V +R+KE
Sbjct: 497 DYAFFVRGLIDLYQACFDNRYLDAADRLCETCNRLFYDHADGGYFSTATDDNTVPVRLKE 556

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
           ++DGAEP+ +SV ++NL+ LA ++ G+++  Y   AE     F T L   + A+PLM  A
Sbjct: 557 EYDGAEPAASSVGILNLLDLA-VMTGNEA--YEGMAEACFRGFGTMLSHNSPALPLMLAA 613

Query: 613 ADMLSVPSRKH---VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
            +     +RK     VL G+  S   + +L   ++ Y    T++H   A           
Sbjct: 614 LNN----ARKGGILAVLAGNMQSPRMQELLKTLNSRYLPGLTLMHHASA----------- 658

Query: 670 NSNNASMARNNFSADKVVALV--CQNFSCSPPVTDPISLENLL 710
            S   S    +   +  +  V  C   +C  P T P +L+ LL
Sbjct: 659 GSLKGSEIPADIDPESAIPAVYLCIGHACRLPATTPEALDELL 701


>gi|66826709|ref|XP_646709.1| DUF255 family protein [Dictyostelium discoideum AX4]
 gi|60474801|gb|EAL72738.1| DUF255 family protein [Dictyostelium discoideum AX4]
          Length = 824

 Score =  489 bits (1260), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 277/701 (39%), Positives = 404/701 (57%), Gaps = 62/701 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWC+VME E FE+  +AK++N++ V+IK+DREERPD+DK+YMTY+  + G GGWP+S++
Sbjct: 140 CHWCNVMERECFENVEIAKVMNEYCVNIKIDREERPDIDKIYMTYLTEISGSGGWPMSIW 199

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+P L P+ GGTYF PE KYGRPGF  +++K+   W K R+M+ +     I+ L E    
Sbjct: 200 LTPQLHPITGGTYFAPEAKYGRPGFPDLIKKLDKLWRKDREMVQERADSFIKFLKEEKPM 259

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
              +N L  +     +  C +Q+ K YD   GG+  APKFPR     ++L   K  ED  
Sbjct: 260 GNINNALSSQ----TIEKCFQQIMKGYDPIDGGYSDAPKFPRCSIFNLLLMTLK--EDYS 313

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
           K  +     K+V FTL+ MA GG++D VGGGFHRYSV   W +PHFEKMLYD  QLA+VY
Sbjct: 314 K--QVGSLDKLV-FTLEKMANGGMYDQVGGGFHRYSVTSDWMIPHFEKMLYDNAQLASVY 370

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
           LDA+ +TK   +  + ++IL Y+   +    G  FSAEDADS   E    K+EGAFYVW+
Sbjct: 371 LDAYQITKSPLFERVAKEILHYVSTKLTHTLGGFFSAEDADSLNLE-INEKQEGAFYVWS 429

Query: 322 SKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI---ELNDSSASA 376
            ++++  + +     ++  H+ L   GN D     DPHNEFK KNV+     L +++A  
Sbjct: 430 YQDIKKAIQDKDDIEIYSFHHGLIENGNVD--PKDDPHNEFKDKNVITIVKSLKETAAYF 487

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
            K    +EK LN   + + KLF  R + +P+P LDDK+IVSWNGL++SSF +A ++ K E
Sbjct: 488 KKTQEEIEKSLN---QSKEKLFKFREQFKPKPQLDDKIIVSWNGLMVSSFCKAYQLFKDE 544

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE--------------QTHRLQHSFR 481
                           +Y+  A  +  FI+ HLYD                  RL  +++
Sbjct: 545 ----------------KYLNSAIKSIEFIKTHLYDSVGDDNDYDDEDDKLNNCRLIRNYK 588

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
           +GPSK   F DDY+FLI  LLDLY+     K L WA++LQ  QD LF D E GGY++T+G
Sbjct: 589 DGPSKIHAFTDDYSFLIQALLDLYQVTFDYKHLEWAMKLQKQQDNLFYDLENGGYYSTSG 648

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
            D S+L R+KE+HDGAEPS  S+SV NL++L SI   + ++ Y++ A+ +L      L+ 
Sbjct: 649 LDKSILSRMKEEHDGAEPSPQSISVSNLLKLYSI---TYNEAYKEKAKKTLENCSLYLEK 705

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVG----HKSSVDFENMLAAAHASYDLNKTVIHIDP 657
             +  P M C+   L + S   ++L      ++      ++L   H++Y  NK ++  D 
Sbjct: 706 APLVFPQMVCSL-YLYLNSINTIILSTNSNDNQQKQQLLSILDEIHSNYIPNKLILLNDH 764

Query: 658 ADTEEMDFWEEHNSN-NASMARNNFSADKVVALVCQNFSCS 697
           ++     F+E+  SN N S++   +  DK    +C    C+
Sbjct: 765 SNNSITQFFEKSTSNLNLSLSTPVY--DKTTFSLCNPNGCT 803


>gi|254445309|ref|ZP_05058785.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
 gi|198259617|gb|EDY83925.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
          Length = 715

 Score =  489 bits (1259), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 281/694 (40%), Positives = 403/694 (58%), Gaps = 41/694 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDEG+A  +ND FV++K+DREERPDVD++YM+YVQ+  G GGWP+S
Sbjct: 59  STCHWCHVMAHESFEDEGIAGRMNDLFVNVKLDREERPDVDRIYMSYVQSTTGSGGWPMS 118

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDLKP  GGTYFPPEDKYGR GF T++ ++   W  +R  L + G     + S+AL
Sbjct: 119 VWLTPDLKPFYGGTYFPPEDKYGRVGFLTLVERIGQLWRDERATLLEYG-----EKSQAL 173

Query: 140 SASASSNKLPDELPQ--NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            A ++S  L D + +   A+ LC EQL   YD ++GGFG APKFP P   QM+      +
Sbjct: 174 LADSASRNLSDGIGEAAGAIDLCLEQLDTEYDEQWGGFGGAPKFPMPGYFQML------V 227

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +   + G A    +M+  +L+ MA GGI DHVG GFHRYSVD+ WHVPH+EKMLYDQGQL
Sbjct: 228 DGISRRGNARL-TEMLAGSLEKMADGGIWDHVGSGFHRYSVDKYWHVPHYEKMLYDQGQL 286

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A +Y +A+ LT    ++ + + I+ Y+ RD+ G  GE+F+AEDADSA  + A++  EGAF
Sbjct: 287 AGIYAEAYRLTGRDSFAAVAKGIVRYVARDLQGAAGELFAAEDADSALPDDASKHGEGAF 346

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVW+  E++ +LGE A LF   Y +K  GN      SDPH E KG N L+ +        
Sbjct: 347 YVWSKAELDGLLGEDAALFASAYDVKAGGNARPE--SDPHGELKGMNTLMRVASDGELGK 404

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +  + +      LG C   LF+ R  RPRPHLDDK +VSWN L+IS    A K+ ++  +
Sbjct: 405 RFSLEVSAVRERLGACLGVLFEKRDGRPRPHLDDKALVSWNALMISG---ACKVYQACGD 461

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
           +             + +E+A+ AA F+   ++D    R    +R G  +  GF +DYA  
Sbjct: 462 A-------------DALELAKKAAVFLFAEMWDAGEGRFARVYRGGCGEQGGFAEDYAAA 508

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
               LDLYE      W+  A E+       F D + GG+F T   D +VL+R+++D+DGA
Sbjct: 509 AGACLDLYEATFDAVWVERAREVLQQLKLRFWDEQRGGFFATEVGDANVLVRLRDDYDGA 568

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
           EP+ +S++ + L+RLA+++   K    R     ++  F  + K    A+PLM  AA    
Sbjct: 569 EPAASSLAALALLRLAALLDDEK---LRVLGRETIEAFGEQWKRSPRAMPLMLVAASRF- 624

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           + S + +V+VG   + +   ++A A+        ++ +DPA    +   E    N    A
Sbjct: 625 LESDQQIVVVGDLEAAETRELIACANRWRASFSVLVGVDPA----VGLPEVFGGNEKLKA 680

Query: 678 RNNFS-ADKVVALVCQNFSCSPPVTDPISLENLL 710
               + A K +  VC+NF+C  PV    SLE +L
Sbjct: 681 MLEVAEAGKPLVYVCENFACKEPVGSVESLEGIL 714


>gi|451946132|ref|YP_007466727.1| thioredoxin domain-containing protein [Desulfocapsa sulfexigens DSM
           10523]
 gi|451905480|gb|AGF77074.1| thioredoxin domain-containing protein [Desulfocapsa sulfexigens DSM
           10523]
          Length = 710

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 279/696 (40%), Positives = 388/696 (55%), Gaps = 49/696 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  +SFED+ +A  LN +F+ IKVDREERPDVD++YM   QA+ G GGWP+S
Sbjct: 62  STCHWCHVMAHQSFEDQEIADFLNSYFIPIKVDREERPDVDQIYMAATQAMTGSGGWPMS 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL PD +P   GTYFPP   YGRPGF  IL+ +K AW   R+ L+ S     EQ++  L
Sbjct: 122 LFLFPDTRPFYAGTYFPPRADYGRPGFMEILQAIKTAWLTDRESLSLSA----EQVTSLL 177

Query: 140 SASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
               S  ++    P+ A L     QL +SYD ++GGFG APKFPRPV I  +L + K   
Sbjct: 178 RKDTSDGRVS---PEKAWLDKGFSQLEESYDPKYGGFGQAPKFPRPVVIDFLLRYYKS-- 232

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+       + M L TL+ MA GG++D +GGGFHRYSVD RW VPHFEKMLYDQ QL 
Sbjct: 233 -TGRKA----ARDMALVTLEQMAGGGMYDQIGGGFHRYSVDGRWRVPHFEKMLYDQSQLV 287

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL AF LT D  Y  I  ++L+Y+ RDM  P G  +SAEDADS          EGAFY
Sbjct: 288 FAYLSAFQLTGDSAYKEIVVEVLEYVLRDMRHPEGGFYSAEDADSVNPYNLEEHGEGAFY 347

Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +WT +E++ +L E  A L K +Y +K  GN     + DP  EF G+N+     + S  A 
Sbjct: 348 LWTEEEIDTLLTEKQAALIKAYYGVKAKGNA----LHDPQKEFTGRNIFYRDKELSEVAR 403

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           ++G+  E+  +IL + RR L   R  R  PHLDDK++ SWNGL+IS+FARA+ +L     
Sbjct: 404 EVGLSEEEARDILQDARRSLLSHRQDRTAPHLDDKILTSWNGLMISAFARAAMVLGE--- 460

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                        K Y+  A  A  F+   L  +    L   +R+G ++    LDDY+FL
Sbjct: 461 -------------KRYLAAANQATDFLLDRLTVD--GELVRRWRDGDARYAAGLDDYSFL 505

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           + GLLDLY     +  L  A++L      +F D +GG  F  T +   +L R++  +DGA
Sbjct: 506 VQGLLDLYLASHDSIRLQAAVDLTEKMIRIFADEKGG--FYDTPQSTQLLTRMRAAYDGA 563

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
           EPSGNSV+V+NL+RLA +   ++   +   A  S+  F   L     A+P+M  A D   
Sbjct: 564 EPSGNSVAVMNLLRLAGLTGNNE---WVALATESIESFGKTLSTYPPAMPMMLSAMD-FQ 619

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +   + +V+ G   + D   +L+  H+ Y  N  ++  D    ++  F         ++ 
Sbjct: 620 MDKPRQIVIAGTLEADDTRELLSEVHSRYLPNTLLLLADGGKNQQ--FLRGGLPFIGTVK 677

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
           + +    +  A VC++F+C  PV     L  LL EK
Sbjct: 678 KID---GRATAYVCEDFTCRIPVNTREGLRALLDEK 710


>gi|452825593|gb|EME32589.1| hypothetical protein Gasu_03590 [Galdieria sulphuraria]
          Length = 822

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 275/707 (38%), Positives = 395/707 (55%), Gaps = 57/707 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E +A +LN +FVS+KVDREERPDVD VYMT+VQA  G GGWP+S
Sbjct: 153 STCHWCHVMEKESFENEQIASILNTYFVSVKVDREERPDVDGVYMTFVQATNGNGGWPMS 212

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+PDL P +G TY PP+       F + L+++ + W   ++ + Q G+  +  L + L
Sbjct: 213 IFLTPDLVPFVGTTYLPPDR------FASALQQIAEKWRTSKEAIEQEGSRVLNALQQYL 266

Query: 140 SASASSNKLPDELPQNALRLCAEQ----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
            A    + L      N    C EQ      + +D  +GGFG+APKFPRPV    +   + 
Sbjct: 267 DAPRKDDSL------NITTSCLEQGYMEAKEMFDEEYGGFGTAPKFPRPVVYDFLF--TL 318

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
              D GK+  A +   M L TL  MAKGGIHDH+GGGFHRYSVD+ WHVPHFEKMLYDQ 
Sbjct: 319 YWFDGGKTERAKDCLNMALQTLSNMAKGGIHDHLGGGFHRYSVDQYWHVPHFEKMLYDQS 378

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG-GEIFSAEDADSAE-------TE 307
           QL   YLDA+ +TKD  +     DIL Y+ RDM     G  FSAEDADS E       + 
Sbjct: 379 QLLQSYLDAYLITKDESFRDTAIDILSYVLRDMTDKNTGAFFSAEDADSLEPFSTDSSSI 438

Query: 308 GATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
            +  KKEGAFY WT  E + ILG   + L  EH+ +KP GN      SDP  E  GKNVL
Sbjct: 439 NSETKKEGAFYTWTDFECKLILGPTTSKLISEHFDIKPEGNARPG--SDPFGELGGKNVL 496

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
                 +  +  +G+   +    + E ++KL++ R++R RPHLDDK+I SWN ++I S  
Sbjct: 497 YIAKSLTEVSKSMGVSEAEANVAIQEAKQKLWEQRNRRARPHLDDKIITSWNAMMIYSLV 556

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD---EQTHRLQHSFRNG 483
           +A  +L+ E                +Y++ A  AA+F++ ++ +   ++T  +  S+R G
Sbjct: 557 KAYIVLEDE----------------QYLQKAMDAATFLKSYMIETTSQETTLIYRSYREG 600

Query: 484 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 543
            S   GF++DYA  I   L ++E     +WL +AI+LQNTQD  F D   GGYF+T+ + 
Sbjct: 601 RSDVEGFVEDYAHTIRAFLSVFEATGNEEWLKYAIQLQNTQDATFYDEVNGGYFSTSSQA 660

Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 603
            ++LLR K+D+DG+EPS ++VS  NL RL +I   +K   Y +  + ++  F   +    
Sbjct: 661 KNILLRRKDDYDGSEPSPSAVSGWNLFRLGAITGDTK---YYEKFKSTINAFSIPVNKAP 717

Query: 604 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM 663
             VP M     +L   + + V++V +       +++ A  + ++ N+ +I + P +   +
Sbjct: 718 FGVPAMLINCCLLLKEATRVVLVVDNMKEPRTRDLVNAVVSRFEPNRVLIPLKPDNQRFL 777

Query: 664 DFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +S +  +       D   A VC   +C  PVT    L  LL
Sbjct: 778 ------SSLSTELKAMKMIEDSPTAYVCFGKTCKNPVTSKEELCALL 818


>gi|390463544|ref|XP_002748471.2| PREDICTED: spermatogenesis-associated protein 20 [Callithrix
           jacchus]
          Length = 783

 Score =  484 bits (1245), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 283/736 (38%), Positives = 406/736 (55%), Gaps = 84/736 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++              
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSE-------------- 148

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
                 T+V A   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 149 -----GTFVSATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 203

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 204 QNKNALLENS----QRVTTALLARSEISVGDRQLPPSAATVNSRCFQQLDEGYDEEYGGF 259

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 260 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 314

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + +DIL Y+ R +    G
Sbjct: 315 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKDILQYVTRSLSHRSG 374

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          LF +HY L 
Sbjct: 375 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATELLTSGQLFTKHYGLT 433

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 434 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLGVEAVRTLLNTGLEKLFQARKH 491

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+PHLD K++ +WNGL++S +A    +L              G DR   +  A + A F
Sbjct: 492 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 535

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 536 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 595

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 596 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 655

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  R++ + +A+P M  A       + K +V+ G + + D
Sbjct: 656 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 711

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + ++   H+ Y  NK +I    AD + + F        +++ R     D+  A VC+N 
Sbjct: 712 TKALVQCVHSVYIPNKVLIL---ADGDPLSFLSRQLPFLSTLRRLE---DQATAYVCENQ 765

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+TDP  L  LL
Sbjct: 766 ACSMPITDPCELRKLL 781


>gi|405953510|gb|EKC21160.1| Spermatogenesis-associated protein 20 [Crassostrea gigas]
          Length = 682

 Score =  481 bits (1237), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 282/708 (39%), Positives = 386/708 (54%), Gaps = 98/708 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E + ++LN+ FVSIKVDREERPDVD+VYMT++QA  GGGGWP+S
Sbjct: 60  STCHWCHVMERESFENEEIGRILNENFVSIKVDREERPDVDRVYMTFIQATVGGGGWPMS 119

Query: 80  VFLSPDLKPLMGGTYFPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           V+L+P+LKPL GGTYFPP+D+ YGRPGFKT+L  + + W  K  +L +  +  +  L E 
Sbjct: 120 VWLTPELKPLFGGTYFPPDDRYYGRPGFKTVLTSLAEQWKTKGPVLKEQSSVILRTLQEG 179

Query: 139 LSAS-ASSNKLPDELPQNALRLCAE----QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
            SAS A    LPD      L+ C E    QL +S+D   GGF   PKFP+PV    +   
Sbjct: 180 TSASEAQGQSLPD------LKDCTEKLYYQLERSFDQEDGGFSKEPKFPQPVNFNFLFRL 233

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
             K +D+  S  A+   +M  FTL  MAKGGI DH+                        
Sbjct: 234 YAKYKDSF-SDMANSSLEMATFTLNKMAKGGIFDHIS----------------------- 269

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
                        +TK   ++ + RDI +Y  RD++ P G  +SAEDADS  T  +  KK
Sbjct: 270 ------------KITKQDNFAEVVRDIAEYTMRDLLNPCGGFYSAEDADSLPTAESPEKK 317

Query: 314 EGAFYVWTSKEVEDILGEH-------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
           EGAF VWT ++++DIL E        A +F  H+ +K  GN D   M DPH+E   +NVL
Sbjct: 318 EGAFCVWTYQQIQDILKEKVKDNLSLAQIFCYHFNIKEKGNVD--PMQDPHDELLNQNVL 375

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
           I  +    +A K  +   +  ++L +CR  L+  R  RPRPHLDDK++ +WNGL+IS  +
Sbjct: 376 IVKDSVEETAQKFSLNPVEVKDVLEKCRTLLYKERQNRPRPHLDDKIVAAWNGLMISGLS 435

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           +A + L    ES              +++ A   ASF++ H+                S 
Sbjct: 436 KAGQAL---GESL-------------FVDQAVKTASFLQSHM---------------SSP 464

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
             GF+DDYA++I GLLDLYE     +W+ WA ELQ  Q+ LF D EGG YF+ +G D S+
Sbjct: 465 IEGFVDDYAYVIRGLLDLYEVCQDEQWVQWAEELQERQNGLFWDSEGGAYFSNSGRDASI 524

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           +LR+K+D DGAEP  NSVSV NLVRL +++       Y + A   L VF  RL  + +A+
Sbjct: 525 VLRLKDDQDGAEPCPNSVSVSNLVRLGALLNNQD---YTEKAVTILKVFYERLTKIPIAI 581

Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
           P M C   +L   + K +VLVG  +S D   +       Y  NK  I  D    + M   
Sbjct: 582 PEMVCGLILLQ-DTPKQIVLVGDPNSDDLTALKNCVAKHYLPNKITITCDGTSDKFMKAK 640

Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
            E  +   S+ + +    K  A VC+N++C  PVT    LE +L   P
Sbjct: 641 LEFLN---SLTKKD---GKATAYVCENYTCDLPVTSVADLERVLKVNP 682


>gi|170067981|ref|XP_001868692.1| spermatogenesis-associated protein 20 [Culex quinquefasciatus]
 gi|167863990|gb|EDS27373.1| spermatogenesis-associated protein 20 [Culex quinquefasciatus]
          Length = 763

 Score =  479 bits (1232), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 284/717 (39%), Positives = 390/717 (54%), Gaps = 75/717 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE E VA+++N+ FV++KVDREERPD+DK+YMT++  + G GGWP+S
Sbjct: 75  STCHWCHVMEKESFESEEVAEIMNENFVNVKVDREERPDIDKLYMTFILLINGSGGWPMS 134

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL P+ GGTYFPP+D++G PGF TIL K+K  W    + L ++G   I+ + + +
Sbjct: 135 VWLTPDLAPITGGTYFPPKDRWGMPGFTTILLKLKIKWATDGEDLKETGRSIIQAIQKNV 194

Query: 140 SASASSNKLPDELP---QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
                 +K   ELP   +   R       +++D  +GG    PKFP   ++  +++H   
Sbjct: 195 E---EKHKEEPELPLTVEEKFRQAIMIYRRNFDPVWGGSMGEPKFPEVSKLN-LIFHLHL 250

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
           L+       AS+   +VL TL  MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQ
Sbjct: 251 LD------PASKLLGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQ 304

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y + +  T+   Y  +   I  YL +D+  P G  +S EDADS     +  K EGA
Sbjct: 305 LLMAYANGYKATRKPLYLEVADSIFKYLCKDLRHPAGGFYSGEDADSLPAWDSKDKIEGA 364

Query: 317 FYVWTSKEVEDILGEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
           FY WT  E++D+   +              +F EHY ++PTGN + S  SDPH    GKN
Sbjct: 365 FYAWTFSEIKDLFNANLEKFGDLGKLNPVEVFTEHYDVQPTGNVEPS--SDPHGHLLGKN 422

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           +LI       +A KL    E    IL      L +VR KRPRPHLD K+I +WNGL++S 
Sbjct: 423 ILIVYGSLRETALKLDTSEEVVAKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSG 482

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A  S++  +              +R EY+EVA    +FIR +L+D +  +L  SF    
Sbjct: 483 LAELSRVKDA-------------PNRAEYLEVAAKLVAFIRENLFDAKAGKLLRSFYGDD 529

Query: 485 S------KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
           S      + P  GF+DDYAFLI GL+D Y     T  L WA ELQ  QD LF D   G Y
Sbjct: 530 SDKAKSLEVPIYGFIDDYAFLIKGLIDYYRASLDTSALRWARELQEIQDRLFWDDTSGAY 589

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH----SL 592
           F +     +V++R+KEDHDGAEP GNSV+  NL+ L         DY+ + A H     L
Sbjct: 590 FYSEANSANVVVRLKEDHDGAEPCGNSVAAHNLLLLG--------DYFAEGAFHERARKL 641

Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA-AAHASYDLNKT 651
             + + +      +P M  AA ++    R  ++++G K   D  N L  A    Y+    
Sbjct: 642 LDYFSNVAPFGYVLPKMMSAA-LMEEHGRDMLIVIGPKG--DQTNALVDAVRNFYNPGLV 698

Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFS--ADKVVALVCQNFSCSPPVTDPISL 706
           V+H+DP    E         + A    +NF    D   A +C +  C  P+TDP  L
Sbjct: 699 VVHLDPTKPSE---------HLAGKKLDNFKMIQDAPTAYICHDKICQLPLTDPDRL 746


>gi|225156854|ref|ZP_03724957.1| protein of unknown function DUF255 [Diplosphaera colitermitum TAV2]
 gi|224802800|gb|EEG21050.1| protein of unknown function DUF255 [Diplosphaera colitermitum TAV2]
          Length = 758

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 298/730 (40%), Positives = 404/730 (55%), Gaps = 59/730 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFE+E VA +LN+ FVSIKVDREERPDVD++YM YVQA+ G GGWPLS
Sbjct: 48  STCHWCHVMARESFENESVAAVLNEHFVSIKVDREERPDVDRIYMAYVQAMTGRGGWPLS 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLS- 136
            +L+PDLKP  GGTYFPP D+ GRPGF  +L  + +AW  + +R  L    A  I+ L+ 
Sbjct: 108 AWLTPDLKPFYGGTYFPPHDQQGRPGFLAVLHAITEAWSDEAERHKLVAESARVIQALTD 167

Query: 137 -----EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
                +  S  A +  L D    +A   C  QL +S+D   GGFG APKFPR   +   L
Sbjct: 168 YHAGKQHASVPAHTRPLHDRA-ADAFEHCFLQLRESFDPAHGGFGGAPKFPRASNLD-FL 225

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           +    ++ T +S    E  K+   TL+ M  GGIHDHVGGGFHRY+VDE W VPHFEKML
Sbjct: 226 FRVAAIQGT-QSEVGREAVKLATTTLRHMIAGGIHDHVGGGFHRYAVDETWLVPHFEKML 284

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA----ETE 307
           YDQ Q+A   LDA  +T D  Y+++ R  LDY+ RD+  P G  FSAEDADSA    + +
Sbjct: 285 YDQAQIAVNLLDAALVTGDERYAWVARSTLDYVLRDLRHPAGGFFSAEDADSAVPHDDGD 344

Query: 308 GATR----KKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMS------DP 356
            + R      EGAFYVWT+ E+  IL  + A  F  H+ +  + + + +         DP
Sbjct: 345 ASPRAHGNHAEGAFYVWTTAELRRILPSDTADRFILHFGVAGSHDANAAEAGNVPPAHDP 404

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
           H E  GKN+L      + +A+ LG+               L  VR+ RPRPHLDDK+I +
Sbjct: 405 HGELSGKNILHHTRPIAETAAALGLDPAALAAEFARALETLRAVRAARPRPHLDDKIITA 464

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE-YMEVAESAASFIRRHLYDEQTHR 475
           WNGL I++FARA+    +  +           DR+E Y++ A +AA FI R LYD+    
Sbjct: 465 WNGLAITAFARAAASPAACLD-----------DRREFYLDAALTAARFIERELYDDDGGD 513

Query: 476 ------LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 529
                 L  ++R+G   + GF +DYAFLI+GLLDL+E      WL  A  LQ T D LF 
Sbjct: 514 APARCILWRNWRDGRGASEGFAEDYAFLIAGLLDLHEATLDPHWLRRAARLQETMDHLFW 573

Query: 530 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
           D   GGYFNT    P ++LR+KED+DGAEP+  S++  NL RL+++    + D     A 
Sbjct: 574 DDAHGGYFNTPAGSPHLVLRLKEDYDGAEPAPGSIAAANLQRLSALF---QDDTLHARAV 630

Query: 590 HSLAVFETRLKDMAMAVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHA-SYD 647
            ++     + +    A+P +  A + +L  P++  ++L G   S DF  + A   A    
Sbjct: 631 RTVESLRGQWETTPHALPALLFALERILEEPAQ--IILAGDPRSHDFRALAAVLRARDKT 688

Query: 648 LNKTVIHIDPADTEEMDFWEEHNSNNA-------SMARNNFSADKVVALVCQNFSCSPPV 700
           L +  I   P  +  +   +  NS+ A        +A    S     A VC   +C PPV
Sbjct: 689 LRRHTILAAPL-SPALPTTDSPNSDEAWLLERAPWLAGMKPSDGCAAAYVCHGRTCHPPV 747

Query: 701 TDPISLENLL 710
           T P +L  LL
Sbjct: 748 TTPSALRQLL 757


>gi|21674102|ref|NP_662167.1| hypothetical protein CT1279 [Chlorobium tepidum TLS]
 gi|21647257|gb|AAM72509.1| conserved hypothetical protein [Chlorobium tepidum TLS]
          Length = 710

 Score =  474 bits (1221), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 281/719 (39%), Positives = 383/719 (53%), Gaps = 54/719 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +T R  FL    +TCHWCHVME ESFE+   A LLN  FV +K+DREE PD
Sbjct: 30  GEEAFSRARETGRPIFLSSGYSTCHWCHVMEHESFENAETAALLNRHFVPVKLDREEHPD 89

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD +YM +VQA  G GGWP+SV+++PDLKP  GG+YFP  +++G P F+++L  + + W+
Sbjct: 90  VDHLYMMFVQATTGRGGWPMSVWMTPDLKPFFGGSYFPATERWGMPSFRSVLEHLANLWE 149

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             R  L  S    ++QLS        +    DE+       C   L + +D+ +GGFG  
Sbjct: 150 HDRPRLLASAGSIMDQLSGLTRPQEGT----DEVTDAHASACLAALERGFDAEWGGFGGE 205

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH------VGGG 232
           PKFPRP  +  +  H+     TG          M L TL+ MA GGIHDH       GGG
Sbjct: 206 PKFPRPAVLSFLFSHAVA---TGN----RHALDMALLTLRKMAAGGIHDHLGVAGLGGGG 258

Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 292
           F RYS D  WHVPHFEKMLYD  QLA  YL+A+  + D  ++   RDI  Y+  DM  P 
Sbjct: 259 FARYSTDRFWHVPHFEKMLYDNAQLAASYLEAYQASGDELFANTARDIFHYVLCDMTSPE 318

Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
           G  +SAEDADS +  G+  K+EGAFY+WT +E+  +L  E A LF   Y ++  GN    
Sbjct: 319 GAFWSAEDADSLDPYGSGEKREGAFYLWTEQEITGLLDPEEATLFIATYGIRSDGNAPF- 377

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
              DPH EF GKN+LI     +  A    +P+E     L   R+KLF+ R KRPRP LDD
Sbjct: 378 ---DPHGEFTGKNILIRTMSDNELAGTFEIPIETVGKRLNSARKKLFEARKKRPRPGLDD 434

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL++S+ A+ S +L                     +E AE AA FI   L D 
Sbjct: 435 KILTSWNGLMLSALAKGSLVLGD----------------TTLLEAAERAARFILDTLCDS 478

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
           ++ +L   +R+G +   G   DYA LI GLLDLY     + WL  AI+L   Q E F D+
Sbjct: 479 KSGKLLRRYRDGQAAIEGKAADYACLILGLLDLYSASFDSDWLRAAIKLAEAQIERFFDQ 538

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           E G +++T  ED SV LR+ ED+D AEPS NSV+ +N +RLA+I      D +R  A  +
Sbjct: 539 EAGVFYSTAVEDHSVPLRMIEDNDNAEPSANSVNALNYLRLAAITG---RDEFRTIALRT 595

Query: 592 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 651
           +  F   L     A+PL+   A  ++  S   ++  G + +     ++A A        T
Sbjct: 596 IRHFSGTLDANPSALPLLLV-ARQIATASPVQIIFAGKRGNPALAKLVATAFRHNRPELT 654

Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           VIH D  +T E    E       + A       +  A +C   SC P + +  SL+  L
Sbjct: 655 VIHAD--ETCEALLPE-------AAAIGKMHKGEPAAYLCAGGSCQPAIRNAESLDAAL 704


>gi|193212931|ref|YP_001998884.1| hypothetical protein Cpar_1281 [Chlorobaculum parvum NCIB 8327]
 gi|193086408|gb|ACF11684.1| protein of unknown function DUF255 [Chlorobaculum parvum NCIB 8327]
          Length = 708

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 266/698 (38%), Positives = 396/698 (56%), Gaps = 51/698 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  +A  LN  FV +K+DREE PD+D+ YM +VQA     GWP+S
Sbjct: 51  STCHWCHVMERESFEDPEIAGFLNAHFVPVKLDREEHPDIDRFYMLFVQATTSNAGWPMS 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+++PD KP  GG+YFPP +++G P F+++L  +   W+  R  L  S    ++QL +  
Sbjct: 111 VWMTPDRKPFFGGSYFPPAERWGMPSFRSVLETLARMWEHDRPKLLASAGSIMDQLFDIA 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
              +    + D    +A R C E L++ +D+ +GGFG+APKFP+P  +  +  H+ +   
Sbjct: 171 KPQSGPGDVSD---AHAAR-CFEALAQRFDAEWGGFGNAPKFPQPSILGFLFSHAAR--- 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYD 253
           TG    A     M L TL+ MA GG+HD +      GGGF RYS D  WHVPHFEKMLYD
Sbjct: 224 TGNQTAAD----MALVTLRKMAAGGLHDQLGVTGRGGGGFARYSTDRFWHVPHFEKMLYD 279

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
             QLA  YL+A+ LT +  ++   RDI +Y+  DM  P G  +SAEDADS +  G+  K+
Sbjct: 280 NAQLAASYLEAYQLTGEALFADTARDIFNYVLCDMTSPEGGFWSAEDADSLDPNGSGEKR 339

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT +E+ ++L  + A+LF E Y ++P GN  +    DPH EF G+N+L      
Sbjct: 340 EGTFYVWTEEEIGNLLDPDEAVLFMEAYGVRPEGNAPV----DPHGEFIGRNILKRTASD 395

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
               ++ G+ +++    L E R KLF+ R  RPRP LDDK++V+WNG++IS+ A+ + +L
Sbjct: 396 EELTNRFGLSMDEASRRLKEARSKLFESRLTRPRPGLDDKILVAWNGMMISALAKGALVL 455

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
           +                 K+ +E AE AA FI   LYD  T +L   +R+G +   G   
Sbjct: 456 RD----------------KKLLEAAERAALFILGTLYDSATGKLLRRYRDGEAAIDGKAS 499

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYA +I  L+DLY+     ++L  AI L  TQ E F D++ G +++T  +D S  LR+ E
Sbjct: 500 DYACMIQALIDLYQASLDPEYLSTAIALAETQIERFFDQKQGVFYSTAFDDESAPLRMIE 559

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
           D+D AEPS NSVS  N +RLA++      D  R+ A  ++  F + L    +A+PLM  A
Sbjct: 560 DNDTAEPSPNSVSAFNYLRLAAMTG---RDELREIALRTINFFSSTLDANPVALPLMLAA 616

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
             M    +   +++ G +S    +  + AA   +    T++H +    E +++     S 
Sbjct: 617 RAMADT-APAQLIVSGKRSDPAIQRFVEAASRHFQPELTILHAN----ENVEWLP---SE 668

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             ++A+++    +  A +C    C P VT+P  L+ LL
Sbjct: 669 AVAIAKDHHG--QPAAWLCAKGQCYPAVTEPEELDTLL 704


>gi|330805805|ref|XP_003290868.1| hypothetical protein DICPUDRAFT_155404 [Dictyostelium purpureum]
 gi|325078993|gb|EGC32616.1| hypothetical protein DICPUDRAFT_155404 [Dictyostelium purpureum]
          Length = 740

 Score =  470 bits (1210), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 269/703 (38%), Positives = 400/703 (56%), Gaps = 46/703 (6%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWC VM  E FE+  ++K++ND F++IKVDREERPD+DK+YMT++    GGGGWP+S++
Sbjct: 64  CHWCSVMHKECFENPSISKVMNDLFINIKVDREERPDIDKLYMTFLTETTGGGGWPMSIW 123

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+P L+P+  GTYF PE K+GR  F  + +K+ + W   R+ + + G   IE L E    
Sbjct: 124 LTPSLQPISAGTYFAPEPKFGRAAFPELCKKLNEIWKNDRETVIERGNSFIEYLKEDKPK 183

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
               N L +E     +  C EQ+ K YD   GGF  APKFPR      +L  S   ++  
Sbjct: 184 GNLDNALSEE----TVSKCIEQILKGYDPDDGGFTDAPKFPRCSIFNFLL--SASTQEQL 237

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
           KS + S  +K+  FTL  MA GGI+D +G GFHRYSV   W +PHFEKMLYDQGQL  VY
Sbjct: 238 KSSKESILEKL-FFTLSKMAYGGIYDQIGFGFHRYSVTPDWKIPHFEKMLYDQGQLVPVY 296

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
           LD++ L+K+  +  I +  L Y++  +    G  FSAEDADS     +  K EGAFY+W 
Sbjct: 297 LDSYILSKNELFKNISKSTLKYVQNYLTHKDGGFFSAEDADSFNE--SNEKSEGAFYIWN 354

Query: 322 SKEVEDIL---GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
            ++++  L    E   ++   Y L   GN  ++   DPHNEF  KN+++ +  +  +A+ 
Sbjct: 355 FEDIKKALENDKEAIEIYSFIYGLVENGN--VNPKDDPHNEFIDKNIIMRIKSNQDAANY 412

Query: 379 LGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                ++  + L   R+KL   R   +PRP LDDK+IV+WNGL+IS+FARA +I      
Sbjct: 413 FKKSTKEIESSLESSRKKLLTYRDTFKPRPPLDDKIIVAWNGLMISAFARAYQI------ 466

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                FP    D + Y+E A+ A  FI+ +LY++ T  L  +F++ PS    F DDYA L
Sbjct: 467 -----FP----DEESYLESAKRATKFIKDNLYNQATKTLIRNFKDSPSLIHAFADDYASL 517

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHDG 556
           I GLLDLY+     ++L WAIELQ  QD+LF D +  GGYF+T+G+D S+L R+KE+HDG
Sbjct: 518 IQGLLDLYQCTFEIEYLEWAIELQEKQDQLFYDSQLPGGYFSTSGDDKSILHRLKEEHDG 577

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           AE S  S+SV NL++L S+    +   Y++ A  +L      L+   + +P M C+  ML
Sbjct: 578 AENSCQSISVSNLLKLYSVTYNQE---YKEKALATLDSCSLYLEKAPIVMPQMMCS--ML 632

Query: 617 SVPSRKHV-----VLVGHK----SSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
               +++      +++  K    +  D + +L   ++ +  NK +   D +D +++ F+ 
Sbjct: 633 LCKEKENTLNSINIVINSKEYNQTKNDLKQILKQVNSLFIPNKFITVKDISDQKQVQFFN 692

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           E  + N ++       DK    +C    CS    +   + N+L
Sbjct: 693 EK-TKNLNLINLKPVYDKPSLSLCNPNGCSISSNNLGQITNIL 734


>gi|158296880|ref|XP_317217.4| AGAP008252-PA [Anopheles gambiae str. PEST]
 gi|157014924|gb|EAA12337.5| AGAP008252-PA [Anopheles gambiae str. PEST]
          Length = 813

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 280/713 (39%), Positives = 380/713 (53%), Gaps = 64/713 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E VAK++N+ F++IKVDREERPD+DK+YM ++  + G GGWP+S
Sbjct: 122 STCHWCHVMEKESFENEEVAKIMNEHFINIKVDREERPDIDKLYMMFILLINGSGGWPMS 181

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL P+ GGTYFPP D++G PGF T+L K+   W   +D L  +G   IE +   +
Sbjct: 182 VWLTPDLAPVTGGTYFPPNDRWGMPGFTTVLTKLASKWSTDKDDLVTTGRSVIEAIRRNV 241

Query: 140 S---ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
               A    +    E  +   +       ++YD  +GG   APKFP   ++ +M +H   
Sbjct: 242 DHKRADEVEDATNMETLEAKFKQAVNMYQRNYDMVWGGSLGAPKFPEASKLNLM-FHLHV 300

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            E   K         +VL TL  MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQ
Sbjct: 301 QEPKHKV------LGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQ 354

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L ++Y + + LTK   Y  +   I  YL +D+  P G  +S EDADS  T  +  K EGA
Sbjct: 355 LLSLYANGYRLTKKPSYLAVADAIYRYLCKDLRHPAGGFYSGEDADSLPTAESEEKIEGA 414

Query: 317 FYVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKN 364
           FY WT  EV+++LG +   F E            HY +K  GN   S  SDPH    GKN
Sbjct: 415 FYAWTYDEVKELLGANGEKFGELGGVDPVAVYAAHYDVKEEGNVKPS--SDPHGHLLGKN 472

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           +LI       +A K    +E    IL      L +VR KRPRPHLD K++ +WNGLV+S 
Sbjct: 473 ILIVYGSVRETAEKFNTTVEIVERILKTGNELLHEVRDKRPRPHLDTKILCAWNGLVLSG 532

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG- 483
            ++ + +  +               R EY+  AE    FIR +LYD Q  +L  S   G 
Sbjct: 533 LSQLACVKDAPG-------------RSEYLATAEELVKFIRANLYDVQARKLLRSCYGGA 579

Query: 484 ----PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
                S+ P  GF+DDYAFLI GL+D Y        L WA ELQ+ QDELF D + G YF
Sbjct: 580 EESLASERPIYGFIDDYAFLIKGLIDYYVASLDEHALHWAKELQDIQDELFWDTKHGAYF 639

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN--AEHSLAVF 595
            +    P+V +R+KEDHDGAEP GNSV+  NL+ L        SDY+ +    E +  +F
Sbjct: 640 YSEANSPNVAVRLKEDHDGAEPCGNSVAAHNLLLL--------SDYFEEERLKEKARTLF 691

Query: 596 E--TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
           +           +P M  AA +L    R  +++VG +S  +   ++      Y     ++
Sbjct: 692 DYFAHTAHFGYVLPEMMSAA-LLEEQGRNTLIVVGPESP-EATALVDGVREFYIPGMIIV 749

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
            +   D          + +N  M +N        A +C N  C  PVT+P  L
Sbjct: 750 QLK-IDQPAHIVRRRKSLDNFKMVKN-----MPTAYICHNKVCHLPVTEPERL 796


>gi|156058630|ref|XP_001595238.1| hypothetical protein SS1G_03327 [Sclerotinia sclerotiorum 1980]
 gi|154701114|gb|EDO00853.1| hypothetical protein SS1G_03327 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 797

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 251/592 (42%), Positives = 356/592 (60%), Gaps = 27/592 (4%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCH+ME ESFE+E VA +LN  F+ IK+DREERPD+D++YM +VQA  G GGWPL+
Sbjct: 86  SSCHWCHIMERESFENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLN 145

Query: 80  VFLSPDLKPLMGGTYFPPEDKY----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
           VFL+P L+P+ GGTY+P   K      +  F  IL K+   W ++     Q  A  ++QL
Sbjct: 146 VFLTPSLEPVFGGTYWPGPSKTKAFEDQVDFLGILDKLSTVWSEQERRCRQDSAQILQQL 205

Query: 136 SEALSASASSNKLPDELPQNALRLCAE---QLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
            +  +    SN+L D +    + L  E     +KS+D + GGFGSAPKFP P ++  +L 
Sbjct: 206 KDFANEGTLSNRLGDAVDNIDIELLEEATQHFAKSFDKKNGGFGSAPKFPTPSKLAFLLR 265

Query: 193 HSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
            S+    + D     +    + + + TL+ MA+GGIHDH+G GF RYSV   W +PHFEK
Sbjct: 266 LSQFPQAVLDIVGIPDCENAKNIAITTLRKMARGGIHDHIGNGFARYSVTADWSLPHFEK 325

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
           MLYD  QL ++YLDAF L++D  +  +  DI DYL   +  P G  +S+EDADS    G 
Sbjct: 326 MLYDNAQLLHIYLDAFLLSRDPEFLGVAYDIADYLTITLFHPQGGFYSSEDADSYYKAGD 385

Query: 310 TRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
           T K+EGA+YVWT +E E+ILG EH  +    + +   GN  +++ +DPH+EF  +NVL  
Sbjct: 386 TEKREGAYYVWTKREFENILGTEHEPILSAFFNVTSHGN--VAQENDPHDEFMDQNVLAI 443

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFAR 427
            +  SA A++ GM   + + ++ E + KL   R + R +P +DDK+IVSWNG+ I + AR
Sbjct: 444 SSTPSALANQFGMKEAEIIKVIKEGKAKLRKRREADRVKPDMDDKIIVSWNGIAIGALAR 503

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
           AS ++        F+ PV   D   Y++ A   A FI+ +LYDE++  L   +R G    
Sbjct: 504 ASAVING------FD-PVKAQD---YLDAALKTAKFIKENLYDEKSKILYRIWREGRGDT 553

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
            GF DDYAFL+ GL+DLYE     KWL WA ELQ +Q   F D   GG+F+T    P+V+
Sbjct: 554 QGFADDYAFLMEGLIDLYEATFDEKWLQWADELQQSQINFFYDTNKGGFFSTIASAPNVI 613

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           LR+KE  D AEPS N  S  NL RL+SI+     + Y + A  ++  FE+ +
Sbjct: 614 LRLKEGMDSAEPSTNGTSSSNLYRLSSIL---NDESYAKKANETVKSFESEM 662


>gi|194334203|ref|YP_002016063.1| hypothetical protein Paes_1395 [Prosthecochloris aestuarii DSM 271]
 gi|194312021|gb|ACF46416.1| protein of unknown function DUF255 [Prosthecochloris aestuarii DSM
           271]
          Length = 720

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 285/703 (40%), Positives = 390/703 (55%), Gaps = 53/703 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE++ +A++LN  FV +K+DREERPD+D++YM YVQA  G GGWP+S
Sbjct: 56  STCHWCHVMERESFENDEIAQVLNHSFVPVKIDREERPDIDRLYMAYVQASTGSGGWPMS 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+LKP  GGTY+PPED++GRPGF ++L  + DAW + R  L        + +   L
Sbjct: 116 VWLTPELKPFYGGTYYPPEDRFGRPGFLSLLHSIADAWKEDRKKLEH----VADGIQSQL 171

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            + +++   P+ L +  L     Q+S  +D   GGF SAPKFPRP  +  +  ++     
Sbjct: 172 KSFSTAAPHPESLGEKVLDDAFMQISSHFDPVAGGFSSAPKFPRPSILTFLFNYAYF--- 228

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYD 253
           TG+     E   M L TL+ MA+GGIHDH+      GGGF RY+ D  WHVPHFEKMLYD
Sbjct: 229 TGR----EEASAMALLTLERMARGGIHDHLGVKGKGGGGFARYATDALWHVPHFEKMLYD 284

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
              LA  +L+AF LTK+  Y+    DI +Y+  DM  P G  +SAEDADS     +  K 
Sbjct: 285 NALLALSFLEAFQLTKETLYAQTAEDIFNYVLCDMTSPEGAFYSAEDADSFPDRESKTKI 344

Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT  E+ ++L      +F   Y +K  GN     + DPH  F+ KN+L    D 
Sbjct: 345 EGGFYVWTKTEIAELLDPLEEQIFSFRYGVKQNGNV----LEDPHGTFERKNILSLKADE 400

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
             +A    +P ++  N+      KLF  R +RPRP  DDK+I SWN L+IS+ A+ S++L
Sbjct: 401 ETTAKHFDLPTDQVANLSRSAIEKLFQARMRRPRPDRDDKIITSWNALMISALAKGSRVL 460

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
           ++                 +Y+  AE AA FI  +L++  T  L   +  G S   G  +
Sbjct: 461 QN----------------TDYLTAAEKAAGFIGDNLFENGTGNLLRRYCKGESGITGQAE 504

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFLI GLLDLYE       L  A EL   Q E F D E GG+FN + ++ SV +R+KE
Sbjct: 505 DYAFLIQGLLDLYEASFDDSLLHKAQELAERQCEHFYDDEHGGFFNASSQEASVPIRLKE 564

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
           D+DGAEPS NSVSV+N  RL  ++ G +  +Y   AE +L  F   L    M +P M   
Sbjct: 565 DYDGAEPSANSVSVMNFSRLW-LMTGKQ--HYLDIAEKTLYYFSAILAANGMQLPEMLAG 621

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
              L  PS   V+L G +S   F+ +  +    Y    TV+H     T+E        + 
Sbjct: 622 YARLLHPSNT-VILTGSQSDPAFKALKKSVEQLYLPGTTVMHA----TKEKPVSSIPGAE 676

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
            AS   N+       A +C+  SC  PVT P  + NLL  +PS
Sbjct: 677 TASEENNS-----AAAYICKGGSCRLPVTTPEEVTNLL--RPS 712


>gi|403182450|gb|EAT47160.2| AAEL001725-PA [Aedes aegypti]
          Length = 749

 Score =  467 bits (1201), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 272/712 (38%), Positives = 380/712 (53%), Gaps = 55/712 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E VA ++N+ F++IKVDREERPD+DK+YMT++  + G GGWP+S
Sbjct: 61  STCHWCHVMEKESFENEQVADIMNENFINIKVDREERPDIDKLYMTFILLINGSGGWPMS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL P+ GGTYFPP+D++G PGF TIL K+K+ W    + LA +G   I+ +   +
Sbjct: 121 VWLTPDLAPVTGGTYFPPKDRWGMPGFTTILLKLKNKWITDGEDLASTGKSIIDAIQRNV 180

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                        P+   R       +++D  +GG   APKFP   ++ ++ +   +   
Sbjct: 181 EEKHQEEAERVFTPEEKYRQAVTIYKRNFDPVWGGSLGAPKFPEVSKLNLIFHAHLQDPS 240

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T   G       +VL TL+ MA GGI+DHV GGF RYSVD++WHVPHFEKMLYDQGQL  
Sbjct: 241 TKILG-------VVLNTLEKMAAGGIYDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLM 293

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y + +  T+   Y  +   I  Y+ +D+  P G  +S EDADS  T  +T K EGAFY 
Sbjct: 294 AYANGYKTTRKPLYLEVADSIYRYISKDLQHPAGGFYSGEDADSLPTWESTDKIEGAFYA 353

Query: 320 WTSKEVEDILGEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
           WT  EV D+L  +              +F EHY ++ TGN + S  SDPH    GKN+ I
Sbjct: 354 WTFAEVRDLLKANLDKFGDIGKVDPVEVFTEHYDIQETGNVEPS--SDPHGHLLGKNIPI 411

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
                  +A K     E    IL      L +VR KRPRPHLD K+I +WNGL++S  ++
Sbjct: 412 VYGSVRETADKFETTAEVVGKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSGLSQ 471

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS-- 485
            S I  +              +R  Y++      SFIR +LYD Q  +L  S     S  
Sbjct: 472 LSCIKDA-------------PNRDNYLKSCSKLVSFIRENLYDVQARKLLRSCYGDESDQ 518

Query: 486 ----KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 539
               + P  GF+DDYAFLI GL+D Y     T  L WA ELQ  QDELF D + G YF +
Sbjct: 519 AKSLETPIYGFIDDYAFLIKGLIDYYRASLDTGALSWAKELQEIQDELFWDHKHGAYFYS 578

Query: 540 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
                +V++R+KEDHDGAEP GNSVS  NL+ L      +    +R+ A    + F + +
Sbjct: 579 EANSANVVVRLKEDHDGAEPCGNSVSAHNLIMLGDYFETAA---FREKANKLFSYF-SNV 634

Query: 600 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
                 +P M  A  +L    R  +V+VG     +   ++ A    Y     ++ +DP+ 
Sbjct: 635 TPFGYVLPEMMSAM-LLQENGRDMLVVVG-PDGPEATALVDAVRDFYMPGLLIVQLDPS- 691

Query: 660 TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
                   +H+    ++       +   A +C N  C  PVT+P  L + L+
Sbjct: 692 ------LPDHSLGGKTLKSFKMMNEAPTAYMCHNKVCQLPVTEPEKLADDLV 737


>gi|403418379|emb|CCM05079.1| predicted protein [Fibroporia radiculosa]
          Length = 791

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 294/757 (38%), Positives = 398/757 (52%), Gaps = 94/757 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHV+  ESFED+  A L+N+ +++IKVDREERPDVD++YMT++QA  GGGGWP+S
Sbjct: 62  SACHWCHVLAHESFEDKVTANLMNEHYINIKVDREERPDVDRLYMTFLQASSGGGGWPMS 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPG-FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           ++L+P+L P   G   P    Y  PG F+ +L K+ D W+   D    SG   IE L +A
Sbjct: 122 IWLTPELHPFFAGPSLPVPQTYFPPGRFRQVLYKLADIWESDPDRCRASGKQIIESLRDA 181

Query: 139 LSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMML------ 191
            +  + +    DELP  +L L    +L+K +D+R+GGF SAPKFP+P +    L      
Sbjct: 182 TNVKSGT----DELPVVSLALTVYARLAKRFDTRYGGFSSAPKFPQPSQTTQFLARYAAL 237

Query: 192 -YHSKK-----------------LEDTGKSG-----------------EASEGQKMVLFT 216
             HSK                   E  G+ G                 EA   + M   T
Sbjct: 238 RMHSKDSGAGEQKNADEVLKHLDAESLGEDGKDSKLSEPSSKPKSKQEEAEHARDMAAET 297

Query: 217 LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL--------- 267
           L  + KGGIHD V GGF RYSVDERWHVPHFEKMLYDQ QL    L+  SL         
Sbjct: 298 LVQIYKGGIHDVVEGGFARYSVDERWHVPHFEKMLYDQAQLLTSALELASLLPHSSDGPP 357

Query: 268 ---TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
              T+    + + R IL YL R +  P G  +SAEDADS     +T+ KEGAFY WT+ +
Sbjct: 358 LSSTRTTLLA-LARSILIYLPRHLTSPEGGFYSAEDADSLPAADSTKTKEGAFYTWTANQ 416

Query: 325 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
              ILGE A +    Y +K  GNCD   M D   E KG+NVL   +    +A K G P+E
Sbjct: 417 FSRILGEDAEVAVWAYGVKEDGNCD--PMHDIQGELKGQNVLFMAHTPEEAAEKFGRPVE 474

Query: 385 KYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
           +    L     KL   R + RPRPHLDDK++  WNGL+IS  ARA++  +          
Sbjct: 475 EVRCALQHSLDKLRAFRDENRPRPHLDDKILTCWNGLMISGLARATETFE---------- 524

Query: 444 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 503
              G +  + + +AE +A+F+R  LY+E +  L  S+R G +   G  DDYAFLI GLLD
Sbjct: 525 ---GEEAVQALTLAERSAAFLRAQLYNEASGELTRSWREG-AGPKGQADDYAFLIQGLLD 580

Query: 504 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 563
           LYE     ++++WAI LQ  QDELF D EG GYF  +  D  +L+R+K+  DGAEPS  S
Sbjct: 581 LYEACGKEEYVIWAIRLQEKQDELFFDAEGCGYF-ASAPDEHILIRMKDAQDGAEPSAVS 639

Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
           V++ NL+RL S  A  +   Y + A+  LA     L     A+  M  AA M      K 
Sbjct: 640 VTLSNLLRL-SHFAEDRHKEYDEKAKSILASNAQLLGAAPYALAAMVSAA-MCREKGYKQ 697

Query: 624 VVLVGHKSSVDFEN-MLAAAHASYDLNKTVIHIDPADTEE---------MDFWEEHNSNN 673
           ++L   +S   F +  L A    +  N+ +IH+DPA+                 + N++ 
Sbjct: 698 IILT--ESPASFPSPYLKAIRERFVPNRVLIHLDPANPPRKLAKVNGTLRSLLTDINTDR 755

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +  A    +   V   VCQNF+C  P+ D   L+  L
Sbjct: 756 SGNADARSAQPNV--RVCQNFTCGLPIRDMAELKAAL 790


>gi|157123455|ref|XP_001653842.1| hypothetical protein AaeL_AAEL001725 [Aedes aegypti]
          Length = 752

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 272/715 (38%), Positives = 380/715 (53%), Gaps = 58/715 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E VA ++N+ F++IKVDREERPD+DK+YMT++  + G GGWP+S
Sbjct: 61  STCHWCHVMEKESFENEQVADIMNENFINIKVDREERPDIDKLYMTFILLINGSGGWPMS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL P+ GGTYFPP+D++G PGF TIL K+K+ W    + LA +G   I+ +   +
Sbjct: 121 VWLTPDLAPVTGGTYFPPKDRWGMPGFTTILLKLKNKWITDGEDLASTGKSIIDAIQRNV 180

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                        P+   R       +++D  +GG   APKFP   ++ ++ +   +   
Sbjct: 181 EEKHQEEAERVFTPEEKYRQAVTIYKRNFDPVWGGSLGAPKFPEVSKLNLIFHAHLQDPS 240

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T   G       +VL TL+ MA GGI+DHV GGF RYSVD++WHVPHFEKMLYDQGQL  
Sbjct: 241 TKILG-------VVLNTLEKMAAGGIYDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLM 293

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y + +  T+   Y  +   I  Y+ +D+  P G  +S EDADS  T  +T K EGAFY 
Sbjct: 294 AYANGYKTTRKPLYLEVADSIYRYISKDLQHPAGGFYSGEDADSLPTWESTDKIEGAFYA 353

Query: 320 WTSKEVEDILGEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
           WT  EV D+L  +              +F EHY ++ TGN + S  SDPH    GKN+ I
Sbjct: 354 WTFAEVRDLLKANLDKFGDIGKVDPVEVFTEHYDIQETGNVEPS--SDPHGHLLGKNIPI 411

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
                  +A K     E    IL      L +VR KRPRPHLD K+I +WNGL++S  ++
Sbjct: 412 VYGSVRETADKFETTAEVVGKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSGLSQ 471

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS-- 485
            S I  +              +R  Y++      SFIR +LYD Q  +L  S     S  
Sbjct: 472 LSCIKDA-------------PNRDNYLKSCSKLVSFIRENLYDVQARKLLRSCYGDESDQ 518

Query: 486 ----KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 539
               + P  GF+DDYAFLI GL+D Y     T  L WA ELQ  QDELF D + G YF +
Sbjct: 519 AKSLETPIYGFIDDYAFLIKGLIDYYRASLDTGALSWAKELQEIQDELFWDHKHGAYFYS 578

Query: 540 TGEDPSVLLRVKE---DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
                +V++R+KE   DHDGAEP GNSVS  NL+ L      +    +R+ A    + F 
Sbjct: 579 EANSANVVVRLKEGKLDHDGAEPCGNSVSAHNLIMLGDYFETAA---FREKANKLFSYF- 634

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
           + +      +P M  A  +L    R  +V+VG     +   ++ A    Y     ++ +D
Sbjct: 635 SNVTPFGYVLPEMMSAM-LLQENGRDMLVVVG-PDGPEATALVDAVRDFYMPGLLIVQLD 692

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           P+         +H+    ++       +   A +C N  C  PVT+P  L + L+
Sbjct: 693 PS-------LPDHSLGGKTLKSFKMMNEAPTAYMCHNKVCQLPVTEPEKLADDLV 740


>gi|119357268|ref|YP_911912.1| hypothetical protein Cpha266_1460 [Chlorobium phaeobacteroides DSM
           266]
 gi|119354617|gb|ABL65488.1| protein of unknown function DUF255 [Chlorobium phaeobacteroides DSM
           266]
          Length = 720

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 279/723 (38%), Positives = 385/723 (53%), Gaps = 61/723 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVME ESFED   A LLN  FV +KVDREE PD
Sbjct: 33  GVEAFAKAKKESKPIFLSVGYSTCHWCHVMERESFEDPRTALLLNTNFVPVKVDREEYPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YMT+VQ+  G GGWP+SV+L+PDL P  GG+YFPP D+YG PGF T+L  +   W 
Sbjct: 93  LDRLYMTFVQSTTGRGGWPMSVWLTPDLDPFYGGSYFPPVDRYGMPGFNTLLTSIARLWQ 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS 177
                +    A   +QL+     SA S K    LP ++A   C   L  S+D  FGGFG+
Sbjct: 153 TDPQSILDRSALFFQQLN-----SAESVKTEGSLPSKDAANRCFRWLEDSFDRDFGGFGN 207

Query: 178 APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------G 230
           APKFPRPV +  +  YH      TG      +   M LFTL+ MA+GGIHDH+      G
Sbjct: 208 APKFPRPVLLDFLFNYHYH----TGN----EQALAMALFTLRKMAEGGIHDHLGIPEKGG 259

Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
           GGF RYS D  WH+PHFEKMLYD  QLA  ++ AF  + D FY+ +  DI +Y+  D+  
Sbjct: 260 GGFSRYSTDPFWHLPHFEKMLYDNAQLAISFVQAFQCSGDSFYAEVADDIFNYVLTDLAS 319

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI-LGEHAI-LFKEHYYLKPTGNC 348
             G  +SAEDADS   + ++  +EGAFY W+ +EV  +     +I LF   Y ++P GN 
Sbjct: 320 SEGAFYSAEDADSLPEQSSSVLEEGAFYRWSHEEVLRLPCSRRSIELFSRLYGIRPEGNV 379

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
               ++DPHNEF G N+L + +          M  ++    L E R  L + R  RPRP 
Sbjct: 380 ----LNDPHNEFAGLNILKKESSIEEIGRIFSMREKEVAEALEEVRLALHNARLARPRPF 435

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
           LDDK++ SWNGL+IS+ AR  ++                   K  +  A  A  F+   L
Sbjct: 436 LDDKILASWNGLMISALARGYRVFGD----------------KRLLLAANRATEFLLSTL 479

Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
           Y+  T +L   +RNG +   G  DDYAF + GLLDLYE     + +  AI L  T   LF
Sbjct: 480 YNRHTGKLLRRYRNGSAGIDGKADDYAFFVQGLLDLYEADFDPRHIETAIALTETVILLF 539

Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            D   GG+ +T  +D S+  R++E++DGAEP+ NSV  +NL+RL+ +    +   Y + A
Sbjct: 540 EDTIKGGFSSTASDDTSLPARMREEYDGAEPAANSVLAMNLLRLSEMTGEER---YNEKA 596

Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 648
           E+    F++ L   + A+P M  A +      +   +L G  +S   + +  A    Y  
Sbjct: 597 ENIFKAFDSILDTNSHALPAMLVALNFWE-QKKSLTILNGDPASPVMQELKRAPGRRYLP 655

Query: 649 NKTVIHIDPAD-TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
               IH       + +D  E+   + A + R         A VC + +C  PV+DPISL 
Sbjct: 656 GNVTIHASIRQVVKGLDVLEQIEESPA-IPR---------AYVCLDRACQLPVSDPISLM 705

Query: 708 NLL 710
            LL
Sbjct: 706 ALL 708


>gi|189195556|ref|XP_001934116.1| hypothetical protein PTRG_03783 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187979995|gb|EDU46621.1| hypothetical protein PTRG_03783 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 748

 Score =  460 bits (1183), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 273/698 (39%), Positives = 378/698 (54%), Gaps = 49/698 (7%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE++ VAKLLN+ F+ IK+DREERPDVD++YM YVQA  G GGWPL+ F
Sbjct: 69  CHWCHVMERESFENDEVAKLLNENFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNAF 128

Query: 82  LSPDLKPLMGGTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-- 135
           ++PDL+P+ GGTY+P P          GF  IL K++D W  +R    +S      QL  
Sbjct: 129 ITPDLEPIFGGTYWPGPGSTMAMGEHIGFVGILEKIRDVWRDQRQRCLESAKEITAQLRD 188

Query: 136 -SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
            +E  + S      P+ L  + L    E   K YD    GFG APKFP P  ++ +L  S
Sbjct: 189 FAEDGNISRKDGAAPEGLDLDTLDEAYEHFKKRYDKAHAGFGGAPKFPTPSNLRFLLKLS 248

Query: 195 K---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           +    + +   + + +  + M L TL  M KGGIHD +G GF RYSV + W +PHFEKML
Sbjct: 249 QYPSAVREVLSAKDCTHAKDMALATLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKML 308

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGAT 310
           YDQ QL  VYLDA+ +T+   +     DI  YL    M    G  FS+EDADS       
Sbjct: 309 YDQAQLLPVYLDAYLMTRSPEHLSAVHDIATYLTSPPMQAESGGFFSSEDADSLYRPNDK 368

Query: 311 RKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            K+EGAFYVWT KE + ILG+  A +   +Y ++  GN  ++   D H+E   +NVL   
Sbjct: 369 EKREGAFYVWTLKEFQQILGDRDAEILARYYNVQDEGN--VAPEHDAHDELINQNVLAVT 426

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARA 428
                 A + G+  ++   IL E R+KL D R+K RPRP LDDK++VSWNGL I + AR 
Sbjct: 427 TTKPDLAQQFGLSEDEVNKILEEGRQKLLDHRNKERPRPGLDDKIVVSWNGLAIGALART 486

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
           S  L S+  +            ++Y+  AE AA+F+R HLY+  +  L   +R GP  AP
Sbjct: 487 SAALSSQDPTR----------SQKYLAAAEKAATFLRAHLYNSTSKTLIRVYREGPGDAP 536

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           GF DDYA+LISGL+DLYE      +L WA +LQ TQ  +F D++  G+F+T  +   +++
Sbjct: 537 GFADDYAYLISGLIDLYEATFNDTYLQWADDLQQTQLAMFWDKQHLGFFSTPEDQKDLIM 596

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R+K+  D AEP  N VS  NL RL +++   + + Y + A  + + FE  +       P 
Sbjct: 597 RLKDGMDNAEPGTNGVSAQNLDRLGALL---EHEDYTKKARDTASAFEAEIMQHPFLFPT 653

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVD-----FENMLAAAHASYDLNKTVIHIDPADTEEM 663
           M  A  ++      H V+ G    VD     + N  A       L K V           
Sbjct: 654 MMDAV-VVGKLGISHSVITGEGKKVDEWLQRYRNRPAGLGTVSKLGKGV----------G 702

Query: 664 DFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
           ++ +  N    SM     +ADK   +VC+N +C   +T
Sbjct: 703 EWLKSRNPLVKSM-----NADKEGVMVCENGACREALT 735


>gi|386812871|ref|ZP_10100096.1| conserved hypothetical protein [planctomycete KSU-1]
 gi|386405141|dbj|GAB62977.1| conserved hypothetical protein [planctomycete KSU-1]
          Length = 704

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 271/715 (37%), Positives = 391/715 (54%), Gaps = 67/715 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFEDE VAK+LN+ FVSIKVDREERPD
Sbjct: 50  GEEAFQKAIRENKPVFLSIGYSTCHWCHVMEYESFEDEEVAKILNENFVSIKVDREERPD 109

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +Y+T  QA+ G GGWPL++FL+P+ KP   GTYFP  ++YG PGF  IL+K+ D W 
Sbjct: 110 LDNIYITVCQAMTGSGGWPLNLFLTPEKKPFFAGTYFPKTERYGNPGFIAILKKISDLWK 169

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGS 177
             ++ +  S     EQ+++ + ++A S   P E L +  L+    QL  ++DS +GGFGS
Sbjct: 170 TNKESVIASS----EQITKVIQSAAIST--PGEILTKETLQHAYAQLRDNFDSIYGGFGS 223

Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
           APKFP P     +L   K+  D           ++V  TL+ M +GGI+D +GGGFHRYS
Sbjct: 224 APKFPTPHNYTFLLRWWKRSND-------PTALEIVEKTLERMGRGGIYDQLGGGFHRYS 276

Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
            DE W VPHFEKMLYDQ   A  Y + +  T  VFY+   R I  Y+ RDM  P G  +S
Sbjct: 277 TDEYWLVPHFEKMLYDQALAAIAYTETYQATGKVFYADSVRGIFTYVLRDMTSPEGGFYS 336

Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDP 356
           AEDADS   EG     EG FYVWT  E+  ILGE    +F ++Y +   GN         
Sbjct: 337 AEDADS---EGV----EGKFYVWTPDEIIKILGEKEGNIFCDYYDVSKEGN--------- 380

Query: 357 HNEFKGKNVLIELNDSSASASKL-GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
              F+ KN+L  ++    + SK+ G+   +   +L   R KLF VR KR  PH DDK++ 
Sbjct: 381 ---FEEKNIL-HVDKPVDTFSKMRGIKPAELEEVLRTAREKLFSVREKRIHPHKDDKILT 436

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
           +WNGL+I++ A+ ++ L                +  +Y + A  AA FI   L  ++   
Sbjct: 437 AWNGLMIAALAKGAQAL----------------NEPKYTQAAMRAADFILNTL-RQKDGT 479

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L   +R+G +  PG+LDDYA+ + GL+DLYE     K+L  A EL N   E F D +GGG
Sbjct: 480 LLRRYRSGEASIPGYLDDYAYFVWGLIDLYEATFEVKYLKIARELNNHMIENFQDEKGGG 539

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           +F +  ++  ++ + KE +DGA PSGNSV++ N++RL  I   ++   + + AE  +  F
Sbjct: 540 FFFSGKKNEQLITQTKEIYDGATPSGNSVALFNILRLGRITGNTE---FEKIAEQIIRAF 596

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
              +K          CA D +  P+ K +V+ G   S D E +L      + L + V+ +
Sbjct: 597 GETIKQHPSGYTQFLCALDFVLGPT-KEIVIAGEPGSDDTERILREIGKRF-LPRKVLLL 654

Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            P+  + ++   E       +       +K  A +C N++C+ P  D   +  LL
Sbjct: 655 HPSKDKSIEDIAEF------IKEQKIVDNKATAYICINYACNAPTNDIHKIIQLL 703


>gi|423073704|ref|ZP_17062443.1| hypothetical protein HMPREF0322_01864 [Desulfitobacterium hafniense
           DP7]
 gi|361855545|gb|EHL07513.1| hypothetical protein HMPREF0322_01864 [Desulfitobacterium hafniense
           DP7]
          Length = 706

 Score =  457 bits (1177), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 280/721 (38%), Positives = 387/721 (53%), Gaps = 65/721 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVME ESFEDE VA+L+N +FV IKVDREERPD
Sbjct: 40  GEEAFAKAKAENKPIFLSIGYSTCHWCHVMERESFEDEEVAQLINRYFVPIKVDREERPD 99

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           VD +YM + QAL G GGWPL++FL+PD  KP   GTYFP E +YGRPG   +L ++ + W
Sbjct: 100 VDHIYMEFCQALTGSGGWPLTLFLTPDERKPFYAGTYFPKESRYGRPGILDLLSQLGELW 159

Query: 118 DKKRDML---AQSGAFAIEQLSEALSASASSNKLPDELP--QNALRLCAEQLSKSYDSRF 172
            K +  +   A S   A+    E   +S +  +  D +P  +  L    + L KS+D ++
Sbjct: 160 AKDQPKIRGSADSIYKAVTSREEPSVSSLTPAQQDDFIPWAKEILDTAFQTLQKSFDRQY 219

Query: 173 GGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 232
           GGFG APKFP P  +  +L ++    D G   EA +   MV  TL+ M +GGI DHVG G
Sbjct: 220 GGFGRAPKFPTPHHLTFLLRYA---HDHGDGLEAQQASLMVRTTLERMGQGGIFDHVGFG 276

Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 292
           F RYS D RW VPHFEKMLYD   LA  YL+ +    D +     R+I  Y+ RDM  P 
Sbjct: 277 FARYSTDRRWLVPHFEKMLYDNALLAIAYLETYQAEHDPYDGQKAREIFAYVLRDMTAPE 336

Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
           G  +SAEDADS   EG     EG FYVWT +E+ +ILG E   L+ + Y + P GN    
Sbjct: 337 GGFYSAEDADS---EGV----EGKFYVWTPQEIHEILGNEEGRLYCQAYGITPEGN---- 385

Query: 352 RMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
                   F+GK++   L+ D  A  S     L      L + R KLF VR +R  PH D
Sbjct: 386 --------FEGKSIPNLLDTDWEALESDWQQSLSALKERLEKSREKLFAVRKERIPPHKD 437

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
           DK++ SWNGL+I++ A+ +++L   A                Y E AE A  FIR++LY 
Sbjct: 438 DKILTSWNGLMIAALAKGTQVLGEPA----------------YAEAAEQAVYFIRKNLYA 481

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
            Q  RL   +R+G S   G+LDDYAFLI GL++LY+     + L +A++LQ  QDELF D
Sbjct: 482 NQ--RLLARYRDGDSAHLGYLDDYAFLIWGLIELYQASGQKEHLEFALQLQREQDELFWD 539

Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
               GYF T  +   +L+R KE +DGA PSGNS+S +NL+RLA +      +   + A  
Sbjct: 540 GAKSGYFLTGRDAEELLIRPKEIYDGATPSGNSISALNLIRLARLTGDGMLE---ERAYE 596

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
            +  F+  L            A       SR+ ++L G     + ENM       +    
Sbjct: 597 QINAFKATLAAYPSGYSAFLQAIQFALQESRE-IILAGSLQHPELENMKTMIFKEFRPYT 655

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           T+++ +   +E + + +++             ++KV A +CQN++C  PV     L  LL
Sbjct: 656 TLLYEEGTLSELIPWLKDY----------PLDSEKVTAYLCQNYACHKPVYQAEELLALL 705

Query: 711 L 711
           +
Sbjct: 706 I 706


>gi|330916342|ref|XP_003297383.1| hypothetical protein PTT_07767 [Pyrenophora teres f. teres 0-1]
 gi|311329963|gb|EFQ94518.1| hypothetical protein PTT_07767 [Pyrenophora teres f. teres 0-1]
          Length = 747

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 258/627 (41%), Positives = 353/627 (56%), Gaps = 29/627 (4%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME ESFE++ VA LLN+ F+ IK+DREERPDVD++YM YVQA  G GGWPL+ 
Sbjct: 67  ACHWCHVMERESFENDEVANLLNENFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNA 126

Query: 81  FLSPDLKPLMGGTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL- 135
           F++PDL+P+ GGTY+P P          GF  IL K++D W  +R    +S      QL 
Sbjct: 127 FITPDLEPIFGGTYWPGPGSTMAMGEHIGFVGILEKIRDVWRDQRQRCLESAKEITAQLR 186

Query: 136 --SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
             +E  + S      P+ L  + L    E   K YD    GFG APKFP P  ++ +L  
Sbjct: 187 DFAEDGNISRKDGAAPEGLDLDTLDEAYEHFKKRYDKAHAGFGGAPKFPTPSNLRFLLKL 246

Query: 194 SK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           S+    + +   + + +  + M L TL  M KGGIHD +G GF RYSV + W +PHFEKM
Sbjct: 247 SQYPSAVREVLGAKDCTHAKDMALATLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKM 306

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGA 309
           LYDQ QL  VYLDA+ +T+   +     DI  YL    M    G  FS+EDADS      
Sbjct: 307 LYDQAQLLPVYLDAYLMTRSPEHLSAVHDIAAYLTSPPMQAESGGFFSSEDADSLYRPND 366

Query: 310 TRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
             K+EGAFYVWT KE + ILG+  A +   +Y +K  GN  ++   D H+E   +NVL  
Sbjct: 367 KEKREGAFYVWTLKEFQQILGDRDAEILARYYNVKDEGN--VAPEHDAHDELINQNVLAI 424

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFAR 427
                  A + G+  ++  NIL E R+KL D R+K RPRP LDDK++VSWNGL I + AR
Sbjct: 425 TTTKPDLAQQFGLSEDEVNNILEEGRQKLLDHRNKERPRPGLDDKIVVSWNGLAIGALAR 484

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
            S  L S+  +            ++Y+  AE AASF+R HLY+  +  L   +R GP  A
Sbjct: 485 TSAALSSQDPTR----------SQKYLAAAEKAASFLRAHLYNPTSKTLIRVYREGPGDA 534

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
           PGF DDYA+LISGL+DLYE      +L WA +LQ TQ  +F D++  G+F+T  +   ++
Sbjct: 535 PGFADDYAYLISGLIDLYEATFNDTYLQWADDLQQTQLAMFWDKQHLGFFSTPEDQKDLI 594

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
           +R+K+  D AEP  N VS  NL RL +++   + + Y + A  + + FE  +       P
Sbjct: 595 MRLKDGMDNAEPGTNGVSAQNLDRLGALL---EHEDYTKKARDTASAFEAEIMQHPFLFP 651

Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVD 634
            M  A  ++      H V+ G    V+
Sbjct: 652 TMMDAV-VVGKLGNSHSVITGEGKKVE 677


>gi|431794219|ref|YP_007221124.1| thioredoxin domain-containing protein [Desulfitobacterium
           dichloroeliminans LMG P-21439]
 gi|430784445|gb|AGA69728.1| thioredoxin domain protein [Desulfitobacterium dichloroeliminans
           LMG P-21439]
          Length = 698

 Score =  454 bits (1167), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 277/720 (38%), Positives = 390/720 (54%), Gaps = 64/720 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F       R  FL    +TCHWCHVME ESFED  VA LLN +F++IKVDREERPD
Sbjct: 33  GQEAFAKAKTQNRPIFLSIGYSTCHWCHVMERESFEDHEVADLLNRYFIAIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW- 117
           VD +YM + QAL G GGWPL++ ++PD KP   GTYFP E +YGRPG   +L ++ + W 
Sbjct: 93  VDHIYMEFCQALIGSGGWPLTILMTPDQKPFYAGTYFPKESRYGRPGIIDVLHQLGELWR 152

Query: 118 --DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCA--EQLSKSYDSRFG 173
             +KK    A+S   A+    E  +AS  S++  D  P   + L A  +   +S+DS++G
Sbjct: 153 VDEKKVLSSAESIYTAVTTHKELPNASVVSSQEDDFRPWAKVILEAAFQTFQESFDSQYG 212

Query: 174 GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
           GF  APKFP P  +  +L ++    D G++ +A +   MV  TL  M +GGI+DH+G GF
Sbjct: 213 GFRQAPKFPTPHNLTFLLRYAY---DHGQAPKAQQATHMVRTTLDAMGQGGIYDHIGFGF 269

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
            RYS D+ W VPHFEKMLYD   LA  YL+++ +          R+I  Y+ RDM+ P G
Sbjct: 270 ARYSTDQHWLVPHFEKMLYDNALLAIAYLESYQVQHLPRDEQKVREIFAYVLRDMVSPEG 329

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSR 352
             +SAEDADS   EG     EG FYVWT +E+ ++LG  A  L+   Y +   GN     
Sbjct: 330 GFYSAEDADS---EGV----EGKFYVWTPQEIHELLGSEAGQLYCRAYDITRDGN----- 377

Query: 353 MSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                  F+GKN+   L+ + +A A +  +  E+    L E R+ LF  R KR  PH DD
Sbjct: 378 -------FEGKNIPNLLHTEWTALAEEFNLSREELSLQLEEARKVLFQAREKRIHPHKDD 430

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL+I++ A+ ++IL                D   Y + AE A SFI  +LY +
Sbjct: 431 KILTSWNGLMIAALAKGAQIL----------------DDTTYTDAAEKAVSFIINYLYPK 474

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
           Q  RL   +R+  S   G+LDDYAFLI GL++LY        L  A+ LQ  QDELFLD 
Sbjct: 475 Q--RLLARYRDRDSAHLGYLDDYAFLIWGLIELYSATGKKDHLGLALSLQKAQDELFLDT 532

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           E  GYF T  +   +L+R KE +DGA PSGNSVS  NL+RLA +       ++ + A   
Sbjct: 533 EQLGYFLTGHDAEELLIRPKEIYDGATPSGNSVSACNLIRLARLTGDI---HWEKRANEQ 589

Query: 592 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 651
           L  F++ L   +    +   A       SR+ +VL G     +   M       Y    T
Sbjct: 590 LMAFKSSLSTHSAGYTMFLQALQYALAQSRE-IVLAGPIQHAELSKMKELIFTEYRPYTT 648

Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           +++ +   +E + + +++  +          + +  A +CQN+SC  PV     L +LLL
Sbjct: 649 LLYQEGTLSELIPWLKDYPED----------SKQSTAYICQNYSCLRPVHTAAELPSLLL 698


>gi|195334316|ref|XP_002033829.1| GM21533 [Drosophila sechellia]
 gi|194125799|gb|EDW47842.1| GM21533 [Drosophila sechellia]
          Length = 808

 Score =  454 bits (1167), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 267/723 (36%), Positives = 375/723 (51%), Gaps = 75/723 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE    A ++N+ FV+IKVDREERPD+DK+YM ++    G GGWP+S
Sbjct: 122 STCHWCHVMEHESFESPETAAIMNENFVNIKVDREERPDIDKIYMQFLLMSKGSGGWPMS 181

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+L PL+ GTYFPP+ +YG P F  +L  +   W+  ++ L  +G+  +  L +  
Sbjct: 182 VWLTPNLAPLVAGTYFPPKSRYGMPSFNAVLNSIARKWETDKESLLTTGSSLLSALKKNQ 241

Query: 140 SASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMM 190
            ASA        +P+ A       E+LS++       +D   GGFGS PKFP    +  +
Sbjct: 242 DASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGFGSEPKFPEVPRLNFL 293

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
            +     +D        +   MV+ TL  + KGGIHDH+ GGF RY+  + WH  HFEKM
Sbjct: 294 FHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFARYATTQDWHNVHFEKM 346

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYDQGQL   + +A+ +T+D  Y      I  YL +D+  P G  ++ EDADS  T    
Sbjct: 347 LYDQGQLMVAFTNAYKVTRDEIYLGYADKIYKYLIKDLRHPLGGFYAGEDADSLPTHEDK 406

Query: 311 RKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHN 358
            K EGAFY WT  E++           DI  + A  ++  HY LKP GN  +   SDPH 
Sbjct: 407 VKVEGAFYAWTWDEIQAAFKDQAQRFDDITPDRAFEIYAYHYDLKPPGN--VPTYSDPHG 464

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
              GKN+LI       + +   +  +++  +L      L  +R KRPRPHLD K+I +WN
Sbjct: 465 HLTGKNILIVRGSEEDTCANFKLEADQFKKLLATTNDILHVIRDKRPRPHLDTKIICAWN 524

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GLV+S   +                    ++R++YM+ A+    F+R+ +YD +   L  
Sbjct: 525 GLVLSGLCKLGN--------------CYSANREQYMQTAKELLDFLRKEMYDPEQKLLIR 570

Query: 479 S----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
           S               S+  GFLDDYAFLI GLLD Y+       L WA  LQ+TQD+LF
Sbjct: 571 SCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDKLF 630

Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            D   G YF +  + P+V++R+KEDHDGAEPSGNSVS  NLV LA        D + Q A
Sbjct: 631 WDERNGAYFFSQQDAPNVIVRLKEDHDGAEPSGNSVSAHNLVLLAHYY---DEDAFLQKA 687

Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 648
              L  F   +     A+P M  A  +L   +   +V V    S D +  +      Y  
Sbjct: 688 GKLLNFF-ADVSPFGHALPEMLSA--LLMHENGLDLVAVVGPDSPDTQRFVEICRKFYIP 744

Query: 649 NKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLEN 708
           +  ++H+DP++ EE        SN     +      K    +CQ  +C  PVTDP  LE+
Sbjct: 745 SMIIVHVDPSNPEEA-------SNQRLQTKFKMVGGKTTVYICQERACRMPVTDPQQLED 797

Query: 709 LLL 711
            L+
Sbjct: 798 NLM 800


>gi|451845821|gb|EMD59132.1| hypothetical protein COCSADRAFT_41015 [Cochliobolus sativus ND90Pr]
          Length = 799

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 263/630 (41%), Positives = 360/630 (57%), Gaps = 37/630 (5%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE++ VAKLLN+ F+ IK+DREERPDVD++YM YVQA  G GGWPL+VF
Sbjct: 120 CHWCHVMERESFENDEVAKLLNEHFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNVF 179

Query: 82  LSPDLKPLMGGTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           ++PDL+P+ GGTY+P P          GF  IL+K++D W  +R    +S      QL +
Sbjct: 180 ITPDLEPIFGGTYWPGPGSTMAMGEHIGFIGILKKIRDVWRDQRQRCLESAKEITAQLRD 239

Query: 138 ALSASASSNKLPDELPQNALRL-----CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
                  S K  D  P   L L       E   K YD    GFG APKFP P  +  +L 
Sbjct: 240 FAEEGNISRK--DGAPNETLDLELLDEAYEHFKKRYDQVHAGFGGAPKFPTPSNLHFLLK 297

Query: 193 HSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
            S+    +++   + + +  + M L TL  M KGGIHD +G GF RYSV + W +PHFEK
Sbjct: 298 LSQYPNPVKEVLGAKDCTYAKDMALATLSAMNKGGIHDQIGNGFARYSVTKDWSLPHFEK 357

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEG 308
           MLYDQ QL  VYLDA+ +T+   +     DI  YL    M    G  +S+EDADS     
Sbjct: 358 MLYDQSQLLAVYLDAYLMTRSPEHLGAVHDIATYLTSPPMHAESGGFYSSEDADSLYRPN 417

Query: 309 ATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
              K+EGAFYVWT  E +DILGE  + +   +Y +K  GN  ++   D H+E   +NVL 
Sbjct: 418 DKEKREGAFYVWTLNEFQDILGERDSEILARYYNVKDEGN--VAPEHDAHDELINQNVLA 475

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFA 426
             + S+  A + G+  +K   IL E R+KL + R+K RPRP LDDK++VSWNGL I + A
Sbjct: 476 ITSTSADLAKQFGLSEDKVEKILTEGRQKLLEHRNKERPRPGLDDKIVVSWNGLAIGALA 535

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           R S  L S+  +            KEY+  AE AA+F+++HLY+ ++  L   +R GP  
Sbjct: 536 RTSAALASQDPAR----------SKEYLAAAEKAAAFLQKHLYNSESKTLIRVWREGPGD 585

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
           APGF DDYA+LISGL++LYE      +L WA +LQ TQ ++F D++  G+F+T  +   +
Sbjct: 586 APGFADDYAYLISGLINLYEATFNDSYLQWADDLQKTQLKMFWDKQHLGFFSTPEDQTDL 645

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           ++R+K+  D AEP  N VS  NL RL +++  S+   Y Q A  + + FE  +       
Sbjct: 646 IMRLKDGMDNAEPGTNGVSAQNLDRLGALLEDSE---YTQRARDTASAFEAEIMQHPFLF 702

Query: 607 PLM--CCAADMLSVPSRKHVVLVGHKSSVD 634
           P M     A  L +   +H V+ G    VD
Sbjct: 703 PSMMEAVVAGKLGI---RHAVITGDGQKVD 729


>gi|20129985|ref|NP_610953.1| CG8613 [Drosophila melanogaster]
 gi|7303195|gb|AAF58258.1| CG8613 [Drosophila melanogaster]
 gi|60677913|gb|AAX33463.1| RE10908p [Drosophila melanogaster]
          Length = 808

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 267/727 (36%), Positives = 376/727 (51%), Gaps = 83/727 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+   A ++N+ FV+IKVDREERPD+DK+YM ++    G GGWP+S
Sbjct: 122 STCHWCHVMEHESFENPETAAIMNENFVNIKVDREERPDIDKIYMQFLLMSKGSGGWPMS 181

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P L PL+ GTYFPP+ +YG P F T+L+ +   W+  ++ L  +G+  +  L +  
Sbjct: 182 VWLTPTLAPLVAGTYFPPKSRYGMPSFNTVLKSIARKWETDKESLLATGSSLLSALQKNQ 241

Query: 140 SASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMM 190
            ASA        +P+ A       E+LS++       +D   GGFGS PKFP    +  +
Sbjct: 242 DASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGFGSEPKFPEVPRLNFL 293

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
            +     +D        +   MV+ TL  + KGGIHDH+ GGF RY+  + WH  HFEKM
Sbjct: 294 FHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFARYATTQDWHNVHFEKM 346

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYDQGQL   + +A+ +T+D  Y      I  YL +D+  P G  ++ EDADS  T    
Sbjct: 347 LYDQGQLMMAFANAYKVTRDEIYLRYADKIHKYLIKDLRHPLGGFYAGEDADSLPTHEDK 406

Query: 311 RKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHN 358
            K EGAFY WT  E++           DI  E A  ++  HY LKP GN  +   SDPH 
Sbjct: 407 VKVEGAFYAWTWDEIQAAFKDQAQRFDDITPERAFEIYAYHYGLKPPGN--VPAYSDPHG 464

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
              GKN+LI       + +   +  +++  +L      L  +R KRPRPHLD K+I +WN
Sbjct: 465 HLTGKNILIVRGSEEDTCANFKLEEDRFKKLLATTNDILHVIRDKRPRPHLDTKIICAWN 524

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GLV+S   +                    ++R++YM+ A+    F+R+ +YD +   L  
Sbjct: 525 GLVLSGLCKLGN--------------CYSANREQYMQTAKELLDFLRKEMYDPEQKLLIR 570

Query: 479 S----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
           S               S+  GFLDDYAFLI GLLD Y+       L WA  LQ+TQD+LF
Sbjct: 571 SCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDKLF 630

Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            D   G YF +  + P+V++R+KEDHDGAEP GNSVS  NLV LA         YY +NA
Sbjct: 631 WDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSAHNLVLLAH--------YYDENA 682

Query: 589 ----EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
                  L  F   +     A+P M  A  +L   +   +V V    S D +  +     
Sbjct: 683 YLQKAGKLLNFFADVSPFGHALPEMLSA--LLMHENGLDLVAVVGPDSPDTQRFVEICRK 740

Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPI 704
            +  +  ++H+DP++ EE        SN     +      K    +C   +C  PVTDP 
Sbjct: 741 FFIPSMIIVHVDPSNPEEA-------SNQRLQTKFKMVGGKTTVYICHERACRMPVTDPQ 793

Query: 705 SLENLLL 711
            LE+ L+
Sbjct: 794 QLEDNLM 800


>gi|169597471|ref|XP_001792159.1| hypothetical protein SNOG_01521 [Phaeosphaeria nodorum SN15]
 gi|160707528|gb|EAT91170.2| hypothetical protein SNOG_01521 [Phaeosphaeria nodorum SN15]
          Length = 756

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 276/702 (39%), Positives = 378/702 (53%), Gaps = 48/702 (6%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE++ VA +LN  F+ IK+DREERPD+D++YM YVQA  GGGGWPL+ F
Sbjct: 68  CHWCHVMERESFENQEVADILNKNFIPIKIDREERPDIDRIYMNYVQATTGGGGWPLNAF 127

Query: 82  LSPDLKPLMGGTYFP-PEDKY---GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           ++PDL+P+ GGTY+P PE      G PGF  IL K++D W  +R     S      QL +
Sbjct: 128 ITPDLEPIFGGTYWPGPESTMAMEGHPGFVGILEKIRDVWQNQRQRCLDSAKEITAQLRD 187

Query: 138 ALSASASSNKLPDE-------LPQNALRLC----AEQLSKSYDSRFGGFGSAPKFPRPVE 186
                  S K   E       L  +A  +C     +   + YD    GFGSAPKFP P  
Sbjct: 188 FAEDGNISRKDGAEHDHLDLDLLDDAYEVCEADGPQHFKRRYDQAHAGFGSAPKFPTPSN 247

Query: 187 IQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
           +  +L    + K+      + + S  QKMVL TL  M KGGIHD +G GF RYSV + W 
Sbjct: 248 LHFLLKLNTYPKQTAQILTAEDISNAQKMVLATLDKMNKGGIHDQIGNGFARYSVTKDWS 307

Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDAD 302
           +PHFEKMLYDQ QL  VYLDA+  TK         DI  YL    M    G  FS+EDAD
Sbjct: 308 LPHFEKMLYDQAQLLPVYLDAYLATKRPEMLEAVHDIATYLTTPPMQAESGGFFSSEDAD 367

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
           S        K+EGAFYVWT KE ++ILG+  A +   +Y ++  GN  ++   D H+E  
Sbjct: 368 SLYRPSDKEKREGAFYVWTLKEFQEILGDRDAEILARYYNVRDEGN--VAPEHDAHDELI 425

Query: 362 GKNVL-IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
            +NVL I  N  +  A +  +  ++  +IL   R+KL D R+K RPRP LDDK++VSWNG
Sbjct: 426 NQNVLAINNNTPTDVAKQFALSEDELQSILRSGRQKLLDHRNKERPRPALDDKIVVSWNG 485

Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 479
           L I + AR +  + ++  S             +Y+  AE AA FI++ LY+  +  L   
Sbjct: 486 LAIGALARTAAAISAQDPSR----------SSQYLAAAEKAAHFIQKELYNPTSKTLTRV 535

Query: 480 FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 539
           +R GP  APGF DDYA+LISGL+DLYE       L WA ELQ TQ  +F D++  G+F+T
Sbjct: 536 YREGPGDAPGFADDYAYLISGLIDLYEATFNPSNLQWADELQQTQLSMFWDKQHLGFFST 595

Query: 540 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
                 +++R+K+  D AEP  N VS  NL RL +++  ++   Y + A  +++ FE  +
Sbjct: 596 PENQTDLIMRLKDGMDNAEPGTNGVSARNLDRLGALLEDAE---YVKKARDTVSAFEAEI 652

Query: 600 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
                  P M  A     +  R HVV+ G       E  L           T+  +   D
Sbjct: 653 MQHPFLFPSMLDAVVAGKLGMR-HVVVTGKGEKA--EQWLRRYRERPAGLSTISRV---D 706

Query: 660 TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
           T+  D+ ++ N    SM      A +   +VC+N +C   +T
Sbjct: 707 TDLGDWLKQRNPLVKSM-----DAGREGVMVCENGACKDGLT 743


>gi|218780669|ref|YP_002431987.1| hypothetical protein Dalk_2829 [Desulfatibacillum alkenivorans
           AK-01]
 gi|218762053|gb|ACL04519.1| protein of unknown function DUF255 [Desulfatibacillum alkenivorans
           AK-01]
          Length = 718

 Score =  450 bits (1158), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 279/714 (39%), Positives = 374/714 (52%), Gaps = 55/714 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVME ESFED   A LLN  F+ IKVDREERPD
Sbjct: 54  GDEAFEQAKKEDKPVFLSIGYSTCHWCHVMERESFEDPEAAALLNRHFICIKVDREERPD 113

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYM+  QA+ G GGWP+SVFL+PD +P   GTYFP ED  GRPG   +   + + W 
Sbjct: 114 IDHVYMSVTQAMTGAGGWPMSVFLTPDKEPFYAGTYFPKEDHMGRPGLMRLATLLGELWK 173

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +R         A +Q+ +ALS  A   K  +EL  + L      L  SYD + GGFG  
Sbjct: 174 NERSKALN----AAQQVVQALS-QAQPKKGREELGPHTLGKAFAGLKASYDVQQGGFGRG 228

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
            KFP P  +  +L + K+  D       +E   MV  TL  M  GGI+DHVG G HRY+ 
Sbjct: 229 NKFPTPHNLTFLLRYWKRTGD-------AEALAMVEKTLTAMRMGGIYDHVGFGIHRYAT 281

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W +PHFEKMLYDQ   AN  L+A+  T    Y+   R+I  Y+ RDM  P G  +SA
Sbjct: 282 DPNWLLPHFEKMLYDQALTANALLEAYQATGKEEYATNAREIFTYVLRDMTSPEGGFYSA 341

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG    +EG FYVWT+KE+ +ILG E   LF   + L   GN          
Sbjct: 342 EDADS---EG----EEGKFYVWTTKEITEILGKEDGALFISAFNLVKGGNF----FDQAT 390

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
            +  G ++     D    A+ LGM   +  + L + R  LF  R KR  P+ DDK++  W
Sbjct: 391 GQKTGDSIPHLQKDPGRLAADLGMEKAELESRLEKIRAALFAEREKRIHPYKDDKILTDW 450

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+I++ A+  +IL  E                +Y   A  AA FI   L D + H LQ
Sbjct: 451 NGLMIAALAKGGRILGDE----------------KYTLAAVRAADFILDALQDGEGH-LQ 493

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
             FR G +  PG LDDYAF++ GLL+LYE   G KWL  A+ L  T  +LF DR+ GG F
Sbjct: 494 KRFREGEAALPGLLDDYAFMVWGLLELYESTFGVKWLKKAVTLNETMLDLFWDRKNGGLF 553

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            +      + +R K+ HDGA+PSGNSV+ +NL+RLA I A  +    R+ AE  L  F  
Sbjct: 554 MSPVYGEKLFMRGKDLHDGAQPSGNSVAAVNLLRLAGITANEEC---REKAEAILQAFSG 610

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHID 656
           +++        +  A D +  P+ + +V+ G + + D   ML   +  +  NK  V   +
Sbjct: 611 QIEAQPYVYTHLLGALDFIIGPALE-IVICGDQGARDSTVMLDGVNQRFVPNKVLVFRPN 669

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             D +E+D    +    A +        K  A VCQ ++C  P TDP +L  +L
Sbjct: 670 TEDCKELDELAPYTREQACV------QGKATAYVCQGYTCQRPTTDPEALFRIL 717


>gi|333922724|ref|YP_004496304.1| hypothetical protein Desca_0499 [Desulfotomaculum carboxydivorans
           CO-1-SRB]
 gi|333748285|gb|AEF93392.1| hypothetical protein Desca_0499 [Desulfotomaculum carboxydivorans
           CO-1-SRB]
          Length = 692

 Score =  450 bits (1158), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 267/707 (37%), Positives = 381/707 (53%), Gaps = 65/707 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFE E VA++LN ++V+IKVDREERPD
Sbjct: 33  GEEAFEKAKRENKPVFLSIGYSTCHWCHVMERESFESEDVAEVLNKYYVAIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YMT  QAL G GGWPL++ ++PD KP   GTYFP    YG+PG   IL+++ D W 
Sbjct: 93  IDQIYMTVCQALTGQGGWPLNIIMTPDQKPFFAGTYFPKNSNYGKPGLIDILQQIADLWA 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K R  L       + +L+  +  + +  +L  E+   A RL A    + +DS +GGFG+ 
Sbjct: 153 KDRQQLLGISDQLMARLN--MKTATAPGQLSPEVLDKAYRLFA----RHFDSTYGGFGNP 206

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  + ++L   KK           +   MV  TL  M +GGI+DH+G GF RYS 
Sbjct: 207 PKFPTPHNLMLLLRCWKKTSQ-------KKALTMVEDTLDAMHRGGIYDHIGFGFSRYST 259

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D RW VPHFEKMLYD   LA  +L+ + + ++  +S + ++I  Y+ RDM  P G  +SA
Sbjct: 260 DRRWLVPHFEKMLYDNALLAIAFLETYQINRNPRFSRVAKEIFTYVLRDMTAPEGGFYSA 319

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG     EG FYVW  +EVE +LG+    LF  +Y + P GN          
Sbjct: 320 EDADS---EGV----EGKFYVWHPQEVEQVLGQIDGQLFCRYYDITPRGN---------- 362

Query: 358 NEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
             F+G ++   +N D    A +L + LE  ++ L +CR+ LF  R KR  PH DDK++ S
Sbjct: 363 --FEGASIPNLINQDPLKFAQELDITLEDLVDGLEKCRQLLFAQREKRVHPHKDDKILTS 420

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+I++ AR +++L  E                +Y + AE A  FI  +L      RL
Sbjct: 421 WNGLMIAALARGARVLGDE----------------KYSQAAEKAVDFIYHNL-QRADGRL 463

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              +R+G +  P +LDDYAFLI GLL+LYE     K L  A++L ++  +LF DR+ GG+
Sbjct: 464 LARYRDGEAAYPAYLDDYAFLIWGLLELYEATFDIKHLEQAVQLTDSMIDLFWDRQNGGF 523

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F    +   ++ R KE +DGA PSGNSV+ +NL RLA +   ++   Y + A   L VF 
Sbjct: 524 FFYGKDSEQLISRPKEIYDGAIPSGNSVATVNLFRLARLTGRNR---YEELATKQLQVFA 580

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
             L+   +       AA +   P  + +VL G +     + M+      + L   VI + 
Sbjct: 581 GELEHYPIGYSYFMIAAYLNQEPPTE-IVLSGKREDSALKQMIDVVQKEF-LPSAVIAVR 638

Query: 657 PADTEEMDFWEEHNSNNASMARNNFS-ADKVVALVCQNFSCSPPVTD 702
                              + ++    A K  A VC+NF+C PPVTD
Sbjct: 639 YEGEAAA-----QAEELVPLLKDRLPVAGKATAYVCKNFACQPPVTD 680


>gi|195430492|ref|XP_002063288.1| GK21469 [Drosophila willistoni]
 gi|194159373|gb|EDW74274.1| GK21469 [Drosophila willistoni]
          Length = 752

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 274/718 (38%), Positives = 372/718 (51%), Gaps = 65/718 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+   A ++N  FV+IKVDREERPD+DKVYM ++    G GGWP+S
Sbjct: 66  STCHWCHVMEHESFENPETAAVMNKHFVNIKVDREERPDIDKVYMQFLLLSKGSGGWPMS 125

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL PL  GTYFPP  ++G P F  +L  + + W   R+ L ++G+  ++ L +  
Sbjct: 126 VWLTPDLAPLAAGTYFPPHSRWGMPSFTKVLESIANKWQTDRESLLKAGSTVLKALQKNQ 185

Query: 140 SASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
            A+A +    +  P +A     E L+   + YD   GGFG  PKFP    +  + +    
Sbjct: 186 DAAAVAEAAFE--PGSAEEKLMEALNVHKQRYDQAHGGFGREPKFPEIPRLNFLFHAYLV 243

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            +D        +   MV+ TL  + +GGI+DHV GGF RY+    WH  HFEKMLYDQGQ
Sbjct: 244 TKDV-------DVLDMVMQTLDHIGRGGINDHVFGGFCRYATTRDWHNVHFEKMLYDQGQ 296

Query: 257 LANVYLDAFSLTK-DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           L   Y +A+ LT+ D+F SY  + I  YL +D+  P G  ++ EDADS  T   T K EG
Sbjct: 297 LMAAYANAYKLTRSDLFLSYADK-IYRYLIKDLRHPAGGFYAGEDADSLPTHQDTVKVEG 355

Query: 316 AFYVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGK 363
           AFY WT  E+++     A  F E            HY L+P GN  +   SDPH    GK
Sbjct: 356 AFYAWTWSEIQETFKSQAQCFGEVSPERAFEIYTFHYDLQPKGN--VPPASDPHGHLTGK 413

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
           N+LI       + S   + LE+   IL      L  VR KRPRPHLD K+I  WNGLV+S
Sbjct: 414 NILIVKGSEEDTCSNFNLELEQLQQILETANDILHSVRDKRPRPHLDTKIICGWNGLVLS 473

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS---- 479
             ++ +    ++              R EYM+ A+    F+RR +YD++   LQ S    
Sbjct: 474 GLSKLANCGTTK--------------RDEYMQTAKELVDFLRREMYDKERKLLQRSCYGS 519

Query: 480 ------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
                       +  GFLDDYAFLI GLLD Y+       L WA ELQ +QD+LF D++ 
Sbjct: 520 GVEDNTLEKNELQIEGFLDDYAFLIKGLLDYYKASLDLSVLSWAKELQESQDKLFWDQQN 579

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           G YF +    P+V++R+KEDHDGAEP GNSVS  NL  L+     S    Y + A   L 
Sbjct: 580 GAYFFSQQNAPNVIVRLKEDHDGAEPCGNSVSARNLTLLSHYYDESS---YLERAGKLLN 636

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            F   +     A+P M  A  +L       V +VG  SS D +  +      Y     ++
Sbjct: 637 FF-ADVSPFGHALPEMLSAL-LLHENGLDLVAVVGPDSS-DTKKFVEICRKFYIPGMIIL 693

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           H+DP   +  D   +   N   M        K    +C +  C  PVTDP+ LE  L+
Sbjct: 694 HVDPLHPD--DACNQRVQNKFKMVNG-----KTTVYICHDRVCRMPVTDPVQLEENLM 744


>gi|148656403|ref|YP_001276608.1| hypothetical protein RoseRS_2279 [Roseiflexus sp. RS-1]
 gi|148568513|gb|ABQ90658.1| protein of unknown function DUF255 [Roseiflexus sp. RS-1]
          Length = 700

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 265/697 (38%), Positives = 377/697 (54%), Gaps = 72/697 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME ESFEDE  A L+N  F+++KVDREERPD+D +YMT VQA+ G GGWP++V
Sbjct: 57  ACHWCHVMEHESFEDEETAALMNQHFINVKVDREERPDIDAIYMTAVQAMTGSGGWPMTV 116

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD  P   GTYFPPED++  P F+ +LR V +A+  +R+ L   G   +E++ EA+S
Sbjct: 117 FLTPDGVPFFAGTYFPPEDRWQMPSFRRVLRSVAEAYASRRNELLARGRELVERMREAIS 176

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                  L   +   A       L +++D  FGGFG APKFP+P+ ++ +L ++ +   T
Sbjct: 177 MHMPGGTLTPAVLDTAF----IGLQQAFDPAFGGFGRAPKFPQPMTLEFLLRYAVR---T 229

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+      G +M+  TL+ MA+GG++D +GGGFHRYSVD +W VPHFEKMLYD   LA V
Sbjct: 230 GR------GMEMLEMTLRRMAEGGMYDQLGGGFHRYSVDAQWLVPHFEKMLYDNALLARV 283

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL+ F  T +  Y  I  + LDY+ R+M  P G  FS +DADS  T  AT K EGAF+VW
Sbjct: 284 YLETFQATGNACYRRIAEETLDYMLREMHHPEGGFFSTQDADSLPTPDATHKHEGAFFVW 343

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  E+ + LG  AI+F   Y +   GN            F+GKN+L         A  +G
Sbjct: 344 TPAEIREALGTDAIVFSALYGVTDQGN------------FEGKNILHVRRSPDEVARVMG 391

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           MP+E+   I    RR LF+VR +RP P LDDKV+ +WNG+ I +FA  +           
Sbjct: 392 MPVEQIETIAARGRRILFEVRQRRPMPDLDDKVLTAWNGMAIRAFALGA----------- 440

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                V  DR++Y   A   A F+  +L       L+   R   +  P FL+DYA L  G
Sbjct: 441 -----VALDREDYRIAAVRCARFVLTNLRRADGELLRSWRRGVANPTPAFLEDYALLADG 495

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LL LYE      WL+ A  L ++  E F D   GG+++T      +++R ++  D A PS
Sbjct: 496 LLALYEATFDPHWLLEARALADSLLERFWDEGLGGFYDTGKNHEQLVIRPRDTGDNATPS 555

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM---------CC 611
           G+S +V  L+RLA I   ++   YR   E +L+V E+        VP+M           
Sbjct: 556 GSSAAVDVLLRLALIFDEAR---YR---ERALSVLES-------MVPVMQRYPTGFGRYL 602

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           AA   ++   + + L+G+    D + + A     +  N+ ++   P         E+   
Sbjct: 603 AAAEFALGQPREIALIGNPEDADTQALAAVVLKPFLPNRVIVLARPG--------EDPPR 654

Query: 672 NNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLE 707
             + +       D K  A VCQN++C  PVT+P +LE
Sbjct: 655 IPSPLLNGRGQIDGKATAYVCQNYACQLPVTEPSALE 691


>gi|410980751|ref|XP_003996739.1| PREDICTED: spermatogenesis-associated protein 20 [Felis catus]
          Length = 773

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 281/736 (38%), Positives = 389/736 (52%), Gaps = 81/736 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90  GPEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT++Q       W             +GG   PP   +        L +    W 
Sbjct: 150 VDKVYMTFIQVSSVSTYW------------AVGGXXXPPPTPHADLQVCPCLPQ----WK 193

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +   +   C +QL +SYD  +GGF
Sbjct: 194 QNKNTLLENS----QRVTAALLARSEISMGDRQLPPSGATMNSRCFQQLDESYDEEYGGF 249

Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +  +  S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 250 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 304

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QLA  Y  AF ++ D FYS + R IL Y+ R++    G
Sbjct: 305 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVARGILQYVARNLSHRSG 364

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
              SAEDADS    G  + KEGAFYVWT KEV+ +L E             L  +HY L 
Sbjct: 365 GFCSAEDADSPPERG-MQPKEGAFYVWTVKEVQQLLSEPVPGATEPLTSGQLLMKHYGLT 423

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN  +S   DP  E  G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 424 EAGN--ISPSQDPKGELHGRNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 481

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RPRPHLD K++ SWNGL++S FA    +L  E    + N+             A + A F
Sbjct: 482 RPRPHLDSKMLASWNGLMVSGFAVTGAVLGLE---RLINY-------------ATNGAKF 525

Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
           ++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLLDLYE    + WL 
Sbjct: 526 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 585

Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
           WA+ LQ+ QD LF D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL  
Sbjct: 586 WALRLQDAQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 645

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
              G K   +       L  F  RL+ + +A+P M  A       + K +V+ G   + D
Sbjct: 646 FT-GHKD--WMDKCVSLLTAFSERLRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKD 701

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            + +L   H+ Y  NK +I    A+ +   F        +++ R     D+  A VC+N 
Sbjct: 702 TKALLQCVHSIYIPNKVLIL---ANGDPSSFLSRQLPFLSTLRRLE---DRATAYVCENQ 755

Query: 695 SCSPPVTDPISLENLL 710
           +CS P+T+P  L  LL
Sbjct: 756 ACSVPITEPCELRKLL 771


>gi|89894906|ref|YP_518393.1| hypothetical protein DSY2160 [Desulfitobacterium hafniense Y51]
 gi|89334354|dbj|BAE83949.1| hypothetical protein [Desulfitobacterium hafniense Y51]
          Length = 699

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 277/720 (38%), Positives = 385/720 (53%), Gaps = 65/720 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVME ESFEDE VA+L+N +FV IKVDREERPD
Sbjct: 33  GEEAFAKAKAEDKPIFLSIGYSTCHWCHVMERESFEDEEVAQLINRYFVPIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           VD +YM + QAL G GGWPL++FL+PD  KP   GTYFP E +YGRPG   +L ++ + W
Sbjct: 93  VDHIYMEFCQALTGSGGWPLTLFLTPDERKPFYAGTYFPKESRYGRPGILDLLSQLGELW 152

Query: 118 DKKRDMLAQSGAFAIEQLS--EALSASASSNKLPDEL---PQNALRLCAEQLSKSYDSRF 172
            K +  +  S     + ++  E  S S+ +  L D+     +  L    + L KS+D ++
Sbjct: 153 AKDQPKIRGSADSIYKAVTSREEPSVSSLTPALQDDFIPWAKEILDTAFQTLQKSFDRQY 212

Query: 173 GGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 232
           GGFG APKFP P  +  +L ++    D     EA +   MV  TL+ M +GGI DHVG G
Sbjct: 213 GGFGRAPKFPTPHHLTFLLRYA---HDHSDGLEAQQAALMVRTTLERMGQGGIFDHVGFG 269

Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 292
           F RYS D  W VPHFEKMLYD   LA  YL+ +    D       R+I  Y+ RDM  P 
Sbjct: 270 FARYSTDRHWLVPHFEKMLYDNALLAIAYLENYQAQHDPHDEQKAREIFSYVLRDMTAPE 329

Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
           G  +SAEDADS   EG     EG FYVWT +E+ +ILG E   L+ + Y + P GN    
Sbjct: 330 GGFYSAEDADS---EGV----EGKFYVWTPQEIHEILGSEEGRLYCQAYGVSPEGN---- 378

Query: 352 RMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
                   F+GK++   L+ D  A  S+    LE     L + R KLF VR +R  PH D
Sbjct: 379 --------FEGKSIPNLLDTDWEALGSERQHSLEVLKRRLEKSREKLFAVRKERIPPHKD 430

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
           DK++ SWNGL+IS+ A+ +++L   A                Y E AE A  FIR++LY 
Sbjct: 431 DKILTSWNGLMISALAKGAQVLGEPA----------------YAEAAEQAVYFIRKNLYA 474

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
            Q  RL   +R+G S   G+LDDYAFLI GL++LY+     + L +A++LQ  QDELF D
Sbjct: 475 NQ--RLLARYRDGDSAHLGYLDDYAFLIWGLIELYQASGQKEHLEFALQLQREQDELFWD 532

Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
               GYF T  +   +L+R KE +DGA PSGNS+S +NL+RLA +      +   + A  
Sbjct: 533 GAKSGYFLTGRDAEELLIRPKEIYDGATPSGNSISALNLIRLARLTGDGMLE---ERAYE 589

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
            +  F+  L            A       SR+ ++L G     + +NM       +    
Sbjct: 590 QINAFKATLATYPSGYSAFLQAIQFALQESRE-IILAGSLQHPELKNMKTTIFKKFHPYT 648

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           T+++ +   +E + + +++             ++K+ A +CQN++C  PV     L  LL
Sbjct: 649 TLLYEEGTLSELIPWLKDY----------PLDSEKMTAYLCQNYACHKPVHKAEELSALL 698


>gi|194756922|ref|XP_001960719.1| GF13496 [Drosophila ananassae]
 gi|190622017|gb|EDV37541.1| GF13496 [Drosophila ananassae]
          Length = 797

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 274/737 (37%), Positives = 379/737 (51%), Gaps = 79/737 (10%)

Query: 11  KTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYM 64
           K RR + LI      +TCHWCHVME ESFE    A ++N+ FV+IKVDREERPD+DKVYM
Sbjct: 96  KARRENKLIFLSVGYSTCHWCHVMEHESFESPETAAIMNEHFVNIKVDREERPDIDKVYM 155

Query: 65  TYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML 124
            ++    G GGWP+SV+L+PDL PL+ GTYFPP+ +YG P F T+L+ +   W   ++ L
Sbjct: 156 QFLLMSKGSGGWPMSVWLTPDLAPLVAGTYFPPKTRYGMPSFTTVLQNIAKKWQTDKESL 215

Query: 125 AQSGAFAIEQLSEALSASASSNKLPDEL--PQNALRLCAEQLS---KSYDSRFGGFGSAP 179
            ++G+     L +AL  +  +  +P+    P +A    +E ++   + +D   GGFGS P
Sbjct: 216 IEAGS----TLVDALKRNQDAEAVPEAAFEPGSAEAKLSEAITVHKQRFDQTHGGFGSEP 271

Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           KFP    +  + +     +D        +   MVL +L  + +GGI+DH+ GGF RY+  
Sbjct: 272 KFPEVPRLNFLFHGYLVTKDV-------DVLDMVLQSLDHIGRGGINDHIFGGFARYATT 324

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
             WH  HFEKMLYDQGQL   Y +A+ LT+   +      I  YL +D+  P G  ++ E
Sbjct: 325 RDWHNVHFEKMLYDQGQLMAAYANAYKLTRSETFLGYADKIYKYLVKDLRHPLGGFYAGE 384

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKE------------HYYLKPTGN 347
           DADS  T   T K EGAFY WT +E++      A  F+             HY LKP GN
Sbjct: 385 DADSLPTHKDTVKVEGAFYAWTWEEIQSAFKNQAERFEGVSPERAFEIYSFHYGLKPQGN 444

Query: 348 CDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM---PLEKYLNILGECRRKLFDVRSKR 404
             +   SDPH    GKN+LI      A+ S   +   PLEK L+   +    L  +R +R
Sbjct: 445 --VPTYSDPHGHLTGKNILIVKGSDEATCSNFNLEAEPLEKLLDTANDI---LHVLRDQR 499

Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
           PRPHLD K+I +WNGLV+S  ++ +    ++              R+EYM+ A+    F+
Sbjct: 500 PRPHLDTKIICAWNGLVLSGLSKLANCGTAK--------------RQEYMQTAKELLEFL 545

Query: 465 RRHLYDEQTHRLQHS----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 514
           R+ +YD +   L  S               S+  GFLDDY+FLI GLLD Y+       L
Sbjct: 546 RKEMYDSERKLLLRSCYGVAVGDPRLEKNESEIEGFLDDYSFLIKGLLDYYKASLDLSAL 605

Query: 515 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 574
            WA ELQ TQD+LF D   G YF +  + P+V++R+K+DHDGAEP GNSVS  NL  L+ 
Sbjct: 606 NWAKELQETQDKLFWDERNGAYFFSQRDSPNVIVRLKDDHDGAEPCGNSVSARNLTLLSH 665

Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
                  D Y Q A   L  F   +     A+P M  A  +L       V +VG  S  D
Sbjct: 666 YY---DEDAYLQRAGKLLNFF-ADVSPFGHALPEMLSAL-LLHENGLDLVAVVGPDSE-D 719

Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
            E  +      Y     ++H+DP   +E        SN     +      K    +C + 
Sbjct: 720 TERFVEICRKFYIPGMIILHVDPQHPDEA-------SNQRVQKKFKMVNGKTTVYICHDR 772

Query: 695 SCSPPVTDPISLENLLL 711
            C  PVTDP  LE  L+
Sbjct: 773 VCRMPVTDPAQLEQNLM 789


>gi|414153807|ref|ZP_11410129.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
           = DSM 18033]
 gi|411454828|emb|CCO08033.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
           = DSM 18033]
          Length = 691

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 266/717 (37%), Positives = 384/717 (53%), Gaps = 69/717 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVME ESFE   VA++LN +FVSIKVDREERPD
Sbjct: 34  GEEAFAKAKAEDKPIFLSIGYSTCHWCHVMERESFESADVAEVLNKYFVSIKVDREERPD 93

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD++YM+  QAL G GGWPL+V ++P  KP   GTYFP E  YGRPG   IL ++   W+
Sbjct: 94  VDQIYMSVCQALTGSGGWPLTVIMTPQQKPFFAGTYFPKETNYGRPGLIEILTRIAWLWE 153

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +R  L   G    EQL+  L   A+ +  P +LP + L      L+++YD+ +GGFG+A
Sbjct: 154 HERPSLLAMG----EQLTAHLHQEAAVS--PGQLPADILDQAYRLLARNYDASYGGFGTA 207

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L +  K +         +   MV  TL  M +GGI+DH+G GF RYSV
Sbjct: 208 PKFPTPHNLMFLLRYYYKTKQ-------PQALTMVEETLDAMHRGGIYDHIGFGFARYSV 260

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D +W VPHFEKMLYD   LA  +L+ + +T ++ +  I ++I  Y+ RDM  P G  +SA
Sbjct: 261 DHKWLVPHFEKMLYDNALLALAFLETYQVTGNMRFGRIAKEIFAYVLRDMTSPEGGFYSA 320

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS  T       EG FY+W  +EV DILG+    +F  +Y +   GN          
Sbjct: 321 EDADSEGT-------EGKFYLWQPQEVVDILGQPDGEIFCRYYNITAQGN---------- 363

Query: 358 NEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
             F+G N+  LI   D    A++LG+ L   +  + +CR  LF  RSKR  P  DDK++ 
Sbjct: 364 --FEGSNIPNLIG-QDPRRFAAELGIELADLVKGMEKCRSLLFKARSKRVHPFKDDKILT 420

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
           +WNGL+I++ +R +++  SE                 Y   A  A +FI + L      R
Sbjct: 421 AWNGLMIAALSRGARVFHSEV----------------YRTAAVKAVNFINQRL-RRPDGR 463

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L   FR+G +  P +LDDYAFL  GLL+LYE    T +L  A+ L     ELFLD++ GG
Sbjct: 464 LLARFRDGEAAFPAYLDDYAFLAWGLLELYEATFDTDYLAEAVRLTEDMIELFLDQQHGG 523

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           +F    +   ++ R KE +DGA PSGNSV+ +NL+RLA +   + +D + + A   L  F
Sbjct: 524 FFFYGKDSEQLISRPKEIYDGALPSGNSVAAVNLIRLARL---TGNDRFAELAHRQLTGF 580

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-H 654
             +++           AA +L  P  + +VL G  +      M+     ++  +  ++  
Sbjct: 581 AQQVEQYPAGYSFFMIAAYLLQEPPLE-IVLTGEAADDSLRRMIQTVQRAFLPHGVIMAR 639

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
            + ADTEE        +    + R+    + +     C+NF+C  P+T+   L+  L
Sbjct: 640 YEGADTEE-------PARLLPLTRDKLPVNGQATVYFCENFTCRKPITELSQLQAAL 689


>gi|374856309|dbj|BAL59163.1| hypothetical conserved protein [uncultured candidate division OP1
           bacterium]
          Length = 683

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 254/690 (36%), Positives = 376/690 (54%), Gaps = 65/690 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME E FE+  +A+ LN+ FVSIKVDREERPD+D++YMT VQ L G GGWPL+
Sbjct: 51  SACHWCHVMERECFENPQIAQYLNEHFVSIKVDREERPDLDEIYMTAVQLLTGQGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP  GGTYFPPED++GRPGF T+L+ +   + K+R+ + +      EQL++ L
Sbjct: 111 VFLTPDLKPFFGGTYFPPEDRWGRPGFLTVLKAITALYQKEREKIVEQA----EQLTQYL 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A        + L ++ ++       +S+D   GGFG APKFP  +E+ ++L +  +  D
Sbjct: 167 QALQQPRPSSELLTRDLIQRAYLSALQSFDREHGGFGGAPKFPHSLELSLLLRYWHRTRD 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                  ++   +V F+L+ MA+GGI+D +GGGFHRYSVD +W VPHFEKMLYD   L  
Sbjct: 227 -------ADALHVVEFSLEQMARGGIYDQLGGGFHRYSVDAQWAVPHFEKMLYDNALLVW 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+A+ +T+   Y  +  + LDY+ R+M    G  F+++DADS +        EGAFY+
Sbjct: 280 TYLEAYQITQKALYRRVVEETLDYVLREMTSSAGGFFASQDADSPD-------GEGAFYL 332

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT +E+E +LG  A   K   Y    G   + R      EF               A+K+
Sbjct: 333 WTPEEIEAVLGA-ADGAKACEYFGVAGGASVLRSPYTLEEF---------------AAKM 376

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            M + +    L   + KLF  R +RP+P  D+K++ +WNGL+IS+  RA ++L  E    
Sbjct: 377 KMTISECEGWLARVKEKLFAAREQRPKPARDEKMLTAWNGLMISALVRAYQVLGHE---- 432

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       +Y+  A  AA F    LY +    L+HS ++G +K PG+LDDYAFLI 
Sbjct: 433 ------------KYLHAAHDAAHFCLNSLYRDGA--LKHSCKDGIAKIPGYLDDYAFLIL 478

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LLDLYE     +W+  A  L  T  E F D  GGG+F T+ +   + +R K  +DGA P
Sbjct: 479 ALLDLYESDFDLRWVHAAKTLSATLIEKFWDEHGGGFFFTSSDHEKLPVRPKSFYDGATP 538

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNS + + L+RL  +   +     R  AE +L +    ++    A+  M  A D    P
Sbjct: 539 SGNSAATMALLRLVELTGDAA---LRVKAEQTLRLCRDFMEQAPQALSYMLSALDFYLGP 595

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           + + + +VG +     +  + +  A +  NK V+  +P D E         +    + + 
Sbjct: 596 TTQ-IAIVGARGDARTQQFVESIRARFLPNKIVVVSEPGDGE--------RAALIPLVQG 646

Query: 680 NFSADKVVAL-VCQNFSCSPPVTDPISLEN 708
               +   A+ +C+N SC  P+T+   LE 
Sbjct: 647 KGLVNGAPAVYLCKNSSCQAPITEITELER 676


>gi|194883110|ref|XP_001975647.1| GG20445 [Drosophila erecta]
 gi|190658834|gb|EDV56047.1| GG20445 [Drosophila erecta]
          Length = 805

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 270/739 (36%), Positives = 384/739 (51%), Gaps = 71/739 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFE+   A  LN+ FVSIK+DREERPD
Sbjct: 101 GEEAFEKARRENKIIFLSVGYSTCHWCHVMEHESFENPDTAAFLNEHFVSIKLDREERPD 160

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +DK+YM ++    G GGWP++V+L+PDL PL+ GTYFP + +YG   F  +L+ +   W+
Sbjct: 161 IDKIYMKFLLMTKGSGGWPMNVWLTPDLVPLVAGTYFPHKPQYGMHSFIVVLKTIAKKWN 220

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGF 175
             ++ L  +G+  +  + E+ SA+  S K       +A+   +E ++   + +D  +GGF
Sbjct: 221 ADKEFLLTTGSSMLSTILESQSAAEVSFK-----EGSAIDKLSEAINIHKQRFDETYGGF 275

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           GS PKFP    I  + +     +D        +   MV+ TL  + KGGI+DH+ GGF R
Sbjct: 276 GSEPKFPEVPRINFLFHAYLVTKDV-------DVLDMVIETLNQIGKGGINDHIFGGFAR 328

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           Y+  E WH  HFEKMLYDQGQL   + +A+ +++D  +      I  YL +D+  P G  
Sbjct: 329 YATTEDWHNVHFEKMLYDQGQLMGAFANAYKVSRDETFLGYGDKIYKYLVKDLSHPMGGF 388

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLK 343
           ++ EDADS  T     K EGAFY WT  E++           DI  E A  ++  HY LK
Sbjct: 389 YAGEDADSLPTHEDKVKVEGAFYAWTWDEIQAAVQDQAQRFDDITAERAFEIYAYHYDLK 448

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
           P GN   S  SDPH    GKN+LI       + +   +  +K   +L      L  +R +
Sbjct: 449 PPGNVKAS--SDPHGHLTGKNILIIRGSEEDTCANFKLEADKLKKLLATTNDILHVLREQ 506

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RPRPHLD K+I +WNGLV+S   + +                  ++R++YM+ AE    F
Sbjct: 507 RPRPHLDTKIICAWNGLVLSGLCKLAN--------------CYSANREQYMQTAEKLLDF 552

Query: 464 IRRHLYDEQTHRLQHSF-----------RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 512
           +R+ +YD +  RL  S            +N P +  GFLDDYAFLI GLLD Y+      
Sbjct: 553 LRKEMYDPERKRLIRSCYGVAVGDETLEKNEP-QIDGFLDDYAFLIKGLLDYYKATLDVD 611

Query: 513 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 572
            L WA ELQ TQD LF D + G YF +  + P++++R KEDHDGAEP GNSVS  NLV L
Sbjct: 612 VLHWAKELQETQDTLFWDDQNGAYFFSQQDAPNIIMRYKEDHDGAEPCGNSVSAGNLVLL 671

Query: 573 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 632
           A     S    Y Q A   L  F   +     A+P M  A  +L   +   +V V    S
Sbjct: 672 AHYYDESA---YIQKAGKLLNFF-ADVSPFGHALPEMLSA--LLMYENGLDLVAVVGPDS 725

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
            D +  +      Y  +  ++H+DP++ EE+        N+    +      K    +C 
Sbjct: 726 PDTQRFVEICRKFYIPSMIIVHVDPSNPEEV-------LNHRLQKKFKMVGGKTTVYICH 778

Query: 693 NFSCSPPVTDPISLENLLL 711
             +C  PVTDP  LE+ L+
Sbjct: 779 ERACRMPVTDPQQLEDNLV 797


>gi|290982332|ref|XP_002673884.1| predicted protein [Naegleria gruberi]
 gi|284087471|gb|EFC41140.1| predicted protein [Naegleria gruberi]
          Length = 600

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 245/581 (42%), Positives = 338/581 (58%), Gaps = 52/581 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVME ESFE+E +A ++N  FV+IKVDREERPD
Sbjct: 38  GEEAFEKARNENKPIFLSIGYSTCHWCHVMEKESFENEEIAAIMNQNFVNIKVDREERPD 97

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKY--GRPGFKTILRKVKDA 116
           +D+VYMT+VQ   G GGWPLS FL+P LKP+ GGTYFPP++    G   F ++L K+ + 
Sbjct: 98  IDRVYMTFVQLTTGSGGWPLSCFLTPQLKPIFGGTYFPPKESIYRGNISFPSLLNKIHNM 157

Query: 117 WDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLS-------KSYD 169
           W  KR+ L   G   +  L +A +   +  + P +   + L+   E ++        S+D
Sbjct: 158 WTNKREALVSQGDKIVSVLKKAFTEKENEEE-PAKSADHILKFAHEYVASTVEDFLSSFD 216

Query: 170 SRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV 229
           + +GGF  APKFPRPV I  +L    + +D  +  +       V FTL  MA+GG++DH+
Sbjct: 217 TVYGGFSQAPKFPRPVVIDFLLRSYYEEKDDRRKLDIINS---VTFTLDKMARGGLYDHL 273

Query: 230 GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDM- 288
           GGGFHRYSVD  WHVPHFEKM+YDQGQLA V+ +A+  T++ +Y  I  +IL Y+ RDM 
Sbjct: 274 GGGFHRYSVDTYWHVPHFEKMMYDQGQLAIVFAEAYKATRNEYYKQILEEILLYIERDMS 333

Query: 289 IGPGGEI---FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG---------EHAILF 336
           +G   ++   FSAEDADS  T  +  K+EGAFY W  ++V DI+          + + +F
Sbjct: 334 LGESSDMIGFFSAEDADSLPTFDSKEKREGAFYAWDYQQVVDIIDNMVPHIGSVKPSDIF 393

Query: 337 KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG-MPLEKYLNILGECRR 395
              + LK  GN   S  SDPH E  G NVL        +  +   +P E   N++ +C+ 
Sbjct: 394 SFMFDLKQDGNVRQS--SDPHGELTGLNVLYMDKSLKETQDRFSTIPPESVANVIMDCKD 451

Query: 396 KLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 454
            LF  R+K +PRPHLDDK+I +WN  VIS+F+R++ +L                    Y+
Sbjct: 452 ILFKERNKMKPRPHLDDKIITAWNAYVISAFSRSALLLSEPG----------------YL 495

Query: 455 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSK---APGFLDDYAFLISGLLDLYEFGSGT 511
           ++AE AA+FI   LYD +T  L   F+    K     GFL DYA +IS L+DLYE     
Sbjct: 496 KIAERAANFIYEKLYDRETKVLHRIFKKNSEKERNIAGFLSDYANMISALIDLYEASGSI 555

Query: 512 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           KWL WA ELQ+ QD  F D+  GGYF   G DP+++ R+KE
Sbjct: 556 KWLNWAFELQDIQDSYFYDQTNGGYFEERGNDPTIIYRLKE 596


>gi|219669354|ref|YP_002459789.1| hypothetical protein Dhaf_3335 [Desulfitobacterium hafniense DCB-2]
 gi|219539614|gb|ACL21353.1| protein of unknown function DUF255 [Desulfitobacterium hafniense
           DCB-2]
          Length = 699

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 276/720 (38%), Positives = 386/720 (53%), Gaps = 65/720 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVME ESFEDE VA+L+N +FV IKVDREERPD
Sbjct: 33  GEEAFAKAKAEDKPIFLSIGYSTCHWCHVMERESFEDEEVAQLINRYFVPIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           VD +YM + QAL G GGWPL++FL+PD  KP   GTYFP E +YGRPG   +L ++ + W
Sbjct: 93  VDHIYMEFCQALTGSGGWPLTLFLTPDERKPFYAGTYFPKESRYGRPGILDLLSQLGELW 152

Query: 118 DKKRDMLAQSGAFAIEQLS--EALSASASSNKLPDEL---PQNALRLCAEQLSKSYDSRF 172
            K +  +  S     + ++  E  S S+ +  L D+     +  L    + L KS+D ++
Sbjct: 153 AKDQPKIRGSADSIYKAVTSREEPSVSSLTPALQDDFIPWAKEILDTAFQTLQKSFDRQY 212

Query: 173 GGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 232
           GGFG APKFP P  +  +L ++    D     EA +   MV  TL+ M +GGI DHVG G
Sbjct: 213 GGFGRAPKFPTPHHLTFLLRYA---HDHSDGLEAQQAALMVRTTLERMGQGGIFDHVGFG 269

Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 292
           F RYS D  W VPHFEKMLYD   LA  YL+ +    D       R+I  Y+ RDM  P 
Sbjct: 270 FARYSTDRHWLVPHFEKMLYDNALLAIAYLENYQAQHDPHDEQKAREIFSYVLRDMTAPE 329

Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
           G  +SAEDADS   EG     EG FYVWT +E+ +ILG E   L+ + Y + P GN    
Sbjct: 330 GGFYSAEDADS---EGV----EGKFYVWTPQEIHEILGSEEGRLYCQAYGVSPEGN---- 378

Query: 352 RMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
                   F+GK++   L+ D  A  S+    LE     L + R KLF VR +R  PH D
Sbjct: 379 --------FEGKSIPNLLDTDWEALGSERQHSLEVLKRRLEKSREKLFAVRKERIPPHKD 430

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
           DK++ SWNGL+I++ A+ +++L   A                Y E  E A  FIR++LY 
Sbjct: 431 DKLLTSWNGLMIAALAKGAQVLGEPA----------------YAEAVEQAVYFIRKNLYA 474

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
            Q  RL   +R+G S   G+LDDYAFLI GL++LY+     + L +A++LQ  QDELF D
Sbjct: 475 NQ--RLLARYRDGDSAHLGYLDDYAFLIWGLIELYQASGKKEHLEFALQLQREQDELFWD 532

Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
               GYF T  +   +L+R KE +DGA PSGNS+S +NL+RLA +    + +   + A  
Sbjct: 533 GAKSGYFLTGRDAEELLIRPKEIYDGATPSGNSISALNLIRLARLTGDGELE---KRAYE 589

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
            +  F+  L            A       SR+ ++L G     + +NM  A    +    
Sbjct: 590 QINAFKATLSTYPSGYSAFLQAIQFALQESRE-IILAGPLQHPELKNMKTAIFKKFHPYT 648

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           T+++ +   +E + + +++             ++K+ A +CQN++C  PV     L  LL
Sbjct: 649 TLLYEEGTLSELIPWLKDY----------PLDSEKMTAYLCQNYACHKPVHKAEELSALL 698


>gi|156742936|ref|YP_001433065.1| hypothetical protein Rcas_2990 [Roseiflexus castenholzii DSM 13941]
 gi|156234264|gb|ABU59047.1| protein of unknown function DUF255 [Roseiflexus castenholzii DSM
           13941]
          Length = 696

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 266/692 (38%), Positives = 376/692 (54%), Gaps = 58/692 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFEDE  A L+N +FV++KVDREERPDVD +YMT VQA+ G GGWP++VF
Sbjct: 58  CHWCHVMEHESFEDEETAALMNRYFVNVKVDREERPDVDSIYMTAVQAMTGSGGWPMTVF 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P   GTYFPPED++  P F+ +LR V +A+  +R+ L   G   +E++ E    
Sbjct: 118 LTPDGTPFFAGTYFPPEDRWQMPSFQRVLRSVAEAYATRRNDLLARGRELVERMRE---- 173

Query: 142 SASSNKLP-DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            AS  ++P   L   AL      L +++D  +GGFG APKFP+P+ ++ +L ++ +   T
Sbjct: 174 -ASMMQIPGSTLTPAALDSAFMGLQQAFDPEYGGFGRAPKFPQPMTLEFLLRYAAR---T 229

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+      G +M+  TL+ MA+GG++D +GGGFHRYSVD +W VPHFEKMLYD   LA V
Sbjct: 230 GR------GMEMLERTLRAMAEGGMYDQIGGGFHRYSVDAQWLVPHFEKMLYDNALLARV 283

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL+ F  T + FY  I  + L Y+ R+M  P G  FS +DADS  T  AT K EGAF+VW
Sbjct: 284 YLETFQATGNAFYRRIAEETLTYMLREMQHPDGGFFSTQDADSLPTADATHKHEGAFFVW 343

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  E+ + LG  A +F   Y +   GN            F+GKN+L      +  A  +G
Sbjct: 344 TPAEIREALGADATVFSALYGVTDRGN------------FEGKNILHVQRSPAEVARVMG 391

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           M +E+  +I    RR LF VR  RP+P LDDKV+ +WNG+ + +FA  + +L        
Sbjct: 392 MSVERVESIAERGRRVLFAVRQHRPKPELDDKVLTAWNGMALRAFALGAIVL-------- 443

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLIS 499
                   DR+EY   A   A F+ R L       L+ S+R G  +  P FL+DYA L  
Sbjct: 444 --------DREEYRTAAVRCAEFVLRELRRADGELLR-SWRQGVANPTPAFLEDYALLAD 494

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLL LYE     +WL+ A  L +   E F D   GG+++T      +++R ++  D A P
Sbjct: 495 GLLALYEATFDPRWLLEARALADALLERFWDDGIGGFYDTGSHHEQLVIRPRDTGDNATP 554

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSV 618
           SG+S +   L+RLA I    +   YR+ A   L+     ++           AA+  LS 
Sbjct: 555 SGSSAAADVLLRLALIFDEPR---YRERALTVLSAMAPLMERYPTGFGRYLAAAEFALSQ 611

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P  + + L+G   + D   + A A   +  N+ V+   P +       +     +  +A 
Sbjct: 612 P--REIALIGDPEAADTRALAAIALKPFLPNRVVVLARPGE-------DPPRIPSPLLAG 662

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +  A VCQN++C  PVT P  L   L
Sbjct: 663 RTPIDGRAAAYVCQNYACRLPVTKPADLAAQL 694


>gi|451995214|gb|EMD87683.1| hypothetical protein COCHEDRAFT_21080 [Cochliobolus heterostrophus
           C5]
          Length = 734

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 266/651 (40%), Positives = 368/651 (56%), Gaps = 40/651 (6%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +     K+ R  F+      CHWCHVME ESFE++ VA LLN+ F+ IK+DREERPD
Sbjct: 36  GQEAIGLAKKSNRLIFISIGYAACHWCHVMERESFENDEVANLLNEHFIPIKIDREERPD 95

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKYG---RPGFKTILRKVK 114
           VD++YM YVQA  G GGWPL+VF++PDL+P+ GGTY+P P          GF  IL+K++
Sbjct: 96  VDRIYMNYVQATTGSGGWPLNVFITPDLEPIFGGTYWPGPGSTMAMGEHIGFVGILKKIR 155

Query: 115 DAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRF-- 172
           D W  +R    +S      QL +       S K  D  P   L L  E L ++Y++    
Sbjct: 156 DVWRDQRQRCLESAKEITAQLRDFAEEGNISRK--DGAPNETLDL--ELLDEAYEASTTF 211

Query: 173 -GGFGSAPKFPRPVEIQMMLYHSKK---LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
              FG APKFP P  +  +L  S+    +++   + + +  + M L TL  M KGGIHD 
Sbjct: 212 ASSFGGAPKFPTPSNLHFLLKLSQYPNLVKEVLGAKDCTRAKDMALATLSAMNKGGIHDQ 271

Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-D 287
           +G GF RYSV + W +PHFEKMLYDQ QL  VYLDA+ +T+   +     DI  YL    
Sbjct: 272 IGNGFARYSVTKDWSLPHFEKMLYDQSQLLAVYLDAYLMTRSPEHLEAVHDIATYLTSPP 331

Query: 288 MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTG 346
           M    G  +S+EDADS        K+EGAFYVWT KE +DILGE  + +   +Y +K  G
Sbjct: 332 MHAESGGFYSSEDADSLYRPNDKEKREGAFYVWTLKEFQDILGERDSEILARYYNVKDEG 391

Query: 347 NCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RP 405
           N  ++   D H+E   +NVL   +  +  A + G+  EK   IL E R+KL + R+K RP
Sbjct: 392 N--VAPEHDAHDELINQNVLAITSTPADLAKQFGLSEEKVKRILTEGRQKLLEHRNKERP 449

Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
           RP LDDK++VSWNGL I + AR S  L S+  +            KEY+  AE AA+F++
Sbjct: 450 RPGLDDKIVVSWNGLAIGALARTSAALASQDPTR----------SKEYLAAAEKAAAFVQ 499

Query: 466 RHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQD 525
           +HLY  ++  L   +R GP  APGF DDYA+LISGL+DLYE      +L WA +LQ TQ 
Sbjct: 500 KHLYHSESKTLIRVWREGPGDAPGFADDYAYLISGLIDLYEATFNDSYLQWADDLQKTQL 559

Query: 526 ELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
           ++F D++  G+F+T  +   +++R+K+  D AEP  N VS  NL RL +++  S+   Y 
Sbjct: 560 KMFWDKQHLGFFSTPEDQTDLIMRLKDGMDNAEPGTNGVSAQNLDRLGALLEDSE---YT 616

Query: 586 QNAEHSLAVFETRLKDMAMAVPLM--CCAADMLSVPSRKHVVLVGHKSSVD 634
           Q A  + + FE  +       P M     A  L +    H V+ G+   VD
Sbjct: 617 QRARDTASAFEAEIMQHPFLFPSMMDAVVAGKLGI---THAVITGNGQKVD 664


>gi|308274671|emb|CBX31270.1| Spermatogenesis-associated protein 20 [uncultured Desulfobacterium
           sp.]
          Length = 633

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 238/572 (41%), Positives = 343/572 (59%), Gaps = 40/572 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF D  +AK++ND F+ IKVDREERPD+D++Y++ V AL G  GWPL+
Sbjct: 43  STCHWCHVMENESFTDHEIAKIMNDNFICIKVDREERPDLDRIYISAVTALTGSAGWPLN 102

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK---KRDMLAQSGAFAIEQLS 136
           VFL+P LKP  GGTYFP E  +G   +  +L ++   W      +D+++ S     E+++
Sbjct: 103 VFLTPKLKPFFGGTYFPAESNFGITSWPDLLNRITSVWKDPVVHKDIISSS-----EKIT 157

Query: 137 EALSASASSNKL---PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
           + +  + S +K+    ++  Q+ L    +  S SYD ++ GFG APKFP P  I+ +L +
Sbjct: 158 DIIIKNLSYDKVFSTAEKHKQSHLDDAFKYYSSSYDEKYAGFGKAPKFPSPSIIKFILAY 217

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
               +   +   A     M  +TL+ MAKGGI+D + GGFHRYS DE+WH+PHFEKMLYD
Sbjct: 218 FSYAKKINEPAVAKRTIDMADYTLKAMAKGGIYDQLRGGFHRYSTDEKWHIPHFEKMLYD 277

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE-------T 306
             QL NVYL+A+ +T D F++ I ++  DY+  DM    G  +SAEDADS         +
Sbjct: 278 NAQLVNVYLEAYQITSDKFFAQIAKETCDYILSDMTSSPGGFYSAEDADSYPGQISEKGS 337

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
           + A  K EGAFYVW+ KE++ IL E+ A +F   + +   GN       DPH  FK KN+
Sbjct: 338 DDAHNKVEGAFYVWSKKELDKILEENTAEIFSYFFGVMEEGNA----AHDPHGYFKKKNI 393

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
           L   +  + +A K  M  +K   I+ + + KL   RS R RPHLDDK++ SWNGL+IS+F
Sbjct: 394 LYVKHSINETAKKYNMAPDKVELIINDAKNKLLKARSSRERPHLDDKILTSWNGLMISAF 453

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
           A+A K+L              GSD+  Y++ A++AA FI  +LYD+ T +L   +R G  
Sbjct: 454 AKAYKVL--------------GSDK--YLQAAKNAAEFIISNLYDKNTGKLFRRWREGER 497

Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE-DP 544
              G   DYAF I GL+DLYE  S  KWL  A+ L     +LF D +  G++ T+ + D 
Sbjct: 498 AVLGMGSDYAFYICGLIDLYESDSDKKWLETAVMLSEEYIKLFYDEQFAGFYITSPDHDK 557

Query: 545 SVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
           ++++R K+D D   P+  SV++ NL+RL+ I 
Sbjct: 558 NLIIRAKDDSDSVIPAHGSVAIQNLLRLSKIT 589


>gi|341899864|gb|EGT55799.1| hypothetical protein CAEBREN_04954 [Caenorhabditis brenneri]
          Length = 731

 Score =  447 bits (1150), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 276/729 (37%), Positives = 383/729 (52%), Gaps = 65/729 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +T +  FL    +TCHWCHVME ESFE+E  AK+LN+ FV+IKVDREERPD
Sbjct: 45  GEEAFQKAKETNKPIFLSVGYSTCHWCHVMEKESFENENTAKILNENFVAIKVDREERPD 104

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YM +V A  G GGWP+SVFL+PDL P+ GGTYFPP+D  G  GF TIL  +   W 
Sbjct: 105 VDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEWQ 164

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K+ + L   GA  I+ L   +  S   N+  D                ++DSR GGFG A
Sbjct: 165 KEGENLRTRGAQIIKLLQPEMK-SGDVNRSED-----VFESIYSHKKSTFDSRLGGFGRA 218

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP+  +   ++  +        S E  E   M+  TL+ MA GGIHDH+G GFHRYSV
Sbjct: 219 PKFPKAPDFDFLIAFAS---SQSNSKEKQESIMMLQKTLESMADGGIHDHIGNGFHRYSV 275

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDILDYLRRDMIGPGGEIF 296
           D  WH+PHFEKM+YDQ QL   Y +   LT  K      +  DI +Y+++     GG  +
Sbjct: 276 DSEWHIPHFEKMIYDQSQLLASYSEFHRLTEKKHENIKLVINDIFEYMQKISHKDGG-FY 334

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
           +AEDADS  T  +T K EGAF  W   E++ +LGE  I       +F +++ ++  GN  
Sbjct: 335 AAEDADSLPTHESTEKVEGAFCAWERDEIKQLLGEKKIESASLFDVFVDYFDVEENGN-- 392

Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
           +++ SDPH E K KNVL +L      A+  G+ +E+  N + E R  L+  R+KRP PHL
Sbjct: 393 VAKSSDPHGELKNKNVLRKLLTDEECATNHGITVEQLKNGIDEAREILWIARTKRPSPHL 452

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           D K++ +W GL I+   +A +                 ++  +Y+E AE  A+F+ ++L 
Sbjct: 453 DSKMVTAWQGLAITGLVKAYQ----------------ATNEPKYVERAEKCAAFVEKYL- 495

Query: 470 DEQTHRLQHS--------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
            E+   L+ S           G  +   F DDYAFLI GLLDLY      ++L  +I+LQ
Sbjct: 496 -EENGELRRSVYLGDNGEVEQGNQRMKAFSDDYAFLIQGLLDLYTVAGKNEYLERSIKLQ 554

Query: 522 NTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 581
            T DE F    G GYF +   D  V +R+ ED DGAEP+  S++  NL+R   I+   ++
Sbjct: 555 KTCDEKFWS--GNGYFISEKSDEVVSVRMIEDQDGAEPTATSIASNNLLRFYDIL---EN 609

Query: 582 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 641
           + YR+ A         RL  + +A+P M  A     + S    VLVG   S         
Sbjct: 610 EEYRERANQCFRGASERLNKIPIALPKMAVALQRWQLGSTT-FVLVGDPVSELLTEARNQ 668

Query: 642 AHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
            +     N +V+HI      E D     +S+NA MA+      +    +C+ F C  PV 
Sbjct: 669 LNQKLINNLSVVHI----RSENDVSASGSSHNA-MAQ----GPQPAVYLCKGFVCGLPVR 719

Query: 702 DPISLENLL 710
               LE L 
Sbjct: 720 KIDKLEQLF 728


>gi|268530908|ref|XP_002630580.1| Hypothetical protein CBG13036 [Caenorhabditis briggsae]
          Length = 724

 Score =  447 bits (1149), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 281/731 (38%), Positives = 385/731 (52%), Gaps = 63/731 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    ++ +  FL    +TCHWCHVME ESFE+E  AKLLND FV+IKVDREERPD
Sbjct: 36  GEEAFQKARESNKPIFLSVGYSTCHWCHVMEKESFENENTAKLLNDNFVAIKVDREERPD 95

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YM +V A  G GGWP+SVFL+PDL P+ GGTYFPP+D  G  GF TIL  + + W 
Sbjct: 96  VDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHEEWQ 155

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K+ + L   GA  I+ L   L+ S   N+  D       R    +   S+DSR GGFG A
Sbjct: 156 KEGENLKARGAQIIKLLQPKLN-SGDVNRSED-----VFRAIFTRHQSSFDSRLGGFGGA 209

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP+P ++  ++  +   +    S  + E  KM+  TL+ MA GGIHDH+G GFHRYSV
Sbjct: 210 PKFPKPSDLDFLICMANT-DPILNSESSKESVKMIQKTLESMADGGIHDHIGNGFHRYSV 268

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF--YSYICRDILDYLRRDMIGPGGEIF 296
           D  WHVPHFEKMLYDQ QL   Y D + LT         I  DI  Y+++     GG  +
Sbjct: 269 DAEWHVPHFEKMLYDQSQLLATYSDFYRLTGRKLDNIKTIVDDIFQYMQKISHKDGG-FY 327

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
           SAEDADS     +T+K EGAF VW  +E++ +LGE  I       +F +  YL    N +
Sbjct: 328 SAEDADSLPRHDSTKKMEGAFCVWEKEEIKILLGEMKIGSANLVDVFND--YLDVEENGN 385

Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
           +SR SDPH E K KNVL +L      A    + +++ +  +   ++ L++ R+KRP PHL
Sbjct: 386 VSRSSDPHGELKNKNVLRKLLTDEECAINHDITVDELIEGMQRAKKILWEARTKRPSPHL 445

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           D K++ +W GL I+   +A +                 ++  +Y+E AE  A F++++L 
Sbjct: 446 DSKMVTAWQGLAITGLVKAYQ----------------ATNDTKYIERAEKCAEFVQKYL- 488

Query: 470 DEQTHRLQHSFRNGPS--------KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
             +   L+ S   GP+        +   F DDYAF+I  LLDLY       +L  AIELQ
Sbjct: 489 -AENGELKRSVYLGPTGEVEQGNQEMKAFSDDYAFMIQALLDLYTTLGKDDYLKNAIELQ 547

Query: 522 NTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 581
              D  F    G GYF +   D  V +R+ ED DGAEP+  S++  NL+R   I+   + 
Sbjct: 548 KICDSKFW--SGNGYFISEQTDEKVSVRMIEDQDGAEPTATSIASNNLLRFYDIL---ED 602

Query: 582 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 641
           + YR+ A         RL  + +A+P M  A +     S    VLVG   S         
Sbjct: 603 EEYREKAHQCFRGASERLNKVPIALPKMAVALNRWQKGSIT-FVLVGEPDSELLIETRKR 661

Query: 642 AHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
            +  +  N + +HI      E D      S+ A M      A      +C+ F CS PV 
Sbjct: 662 LNQKFIENFSAVHI----RSENDLGATGASHKA-MTEGPHPA----VYMCKGFVCSLPVR 712

Query: 702 DPISLENLLLE 712
           D   L+ +L E
Sbjct: 713 DIKGLDKMLNE 723


>gi|195029929|ref|XP_001987824.1| GH19740 [Drosophila grimshawi]
 gi|193903824|gb|EDW02691.1| GH19740 [Drosophila grimshawi]
          Length = 747

 Score =  447 bits (1149), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 272/718 (37%), Positives = 363/718 (50%), Gaps = 65/718 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED   A ++N  FV+IKVDREERPD+DKVYM ++    G GGWP+S
Sbjct: 61  STCHWCHVMEHESFEDADTAAVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+L PL  GTYFPP+ +YG P F  +L  +   W   R  L  +G+  ++ L    
Sbjct: 121 VWLTPELAPLAAGTYFPPKARYGMPSFTMVLESIAKKWQTDRAALQNAGSILMDALKANQ 180

Query: 140 SASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           +ASA      +  P +A    AE L+   + +D + GGFG  PKFP    +  + +    
Sbjct: 181 NASAVGEAAFE--PGSADAKLAEALNVHKQRFDQQHGGFGREPKFPEVSRLNFLFHAYLV 238

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            +D        +   MVL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQ
Sbjct: 239 SKDV-------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQ 291

Query: 257 LANVYLDAFSLTK-DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           L   + +A+ LT+ + F  Y  R I +YL +D+  P G  F+ EDADS  T   T K EG
Sbjct: 292 LMAAFANAYKLTRSEEFLGYADR-IYEYLLKDLRHPAGGFFAGEDADSLPTHKDTVKVEG 350

Query: 316 AFYVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGK 363
           AFY WT +EV+D        F +            HY +KP GN  +   SDPH    GK
Sbjct: 351 AFYAWTWQEVQDAFRAQKTHFNDVSPDRAFDIYSFHYDMKPGGN--VPPDSDPHGHLTGK 408

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
           NVLI       + S   + L++   +L      L  VR KRPRPHLD K+I SWNGLV+S
Sbjct: 409 NVLIVRGSEEDTCSNFNVELDQLKPLLRTANDILHAVRDKRPRPHLDTKIICSWNGLVLS 468

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL------- 476
             A+ +     +              R  Y++ A+    F+R HLYDE+   L       
Sbjct: 469 GLAKLANCGTGK--------------RNAYLKTAKELVQFLRTHLYDEEQQVLLRSCYGA 514

Query: 477 ---QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
               ++      +  GFLDDYAFLI GLLD Y+       L WA ELQ TQD+LF D + 
Sbjct: 515 GVQDNTLEQNAVRIEGFLDDYAFLIKGLLDYYKASLDMGALRWAKELQGTQDKLFWDEKN 574

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           G YF +  + P+V++R+KEDHDGAEP GNSV+  NL  L         D Y +  +  L 
Sbjct: 575 GAYFYSQQDAPNVIVRLKEDHDGAEPCGNSVTARNLTLLTHYY---DDDAYLKRTDKLLN 631

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            F   +     A+P M  A  ML       V +VG   S D    +      Y     ++
Sbjct: 632 YF-ADVSPFGHALPEMLSAL-MLHEHGLDLVAVVG-PDSPDTARFVEICRKFYVPGMIIV 688

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           H DP   +E         N     +      K    +C +  C  PVTDP  LE  L+
Sbjct: 689 HCDPQHPDEA-------CNQRLQTKFKMVNGKTTVYICHDRVCRMPVTDPAQLEENLM 739


>gi|323703366|ref|ZP_08115015.1| protein of unknown function DUF255 [Desulfotomaculum nigrificans
           DSM 574]
 gi|323531635|gb|EGB21525.1| protein of unknown function DUF255 [Desulfotomaculum nigrificans
           DSM 574]
          Length = 692

 Score =  446 bits (1148), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 265/707 (37%), Positives = 379/707 (53%), Gaps = 65/707 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFE E VA++LN ++V+IKVDREERPD
Sbjct: 33  GEEAFEKAKRENKPVFLSIGYSTCHWCHVMERESFESEDVAEVLNKYYVAIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YMT  QAL G GGWPL++ ++PD KP   GTYFP    YG+PG   IL+++ D W 
Sbjct: 93  IDQIYMTVCQALTGQGGWPLNIIMTPDQKPFFAGTYFPKNSNYGKPGLIDILQQIADLWA 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K R  L        +QL   L+   ++   P +L    L       ++ +DS +GGFG+ 
Sbjct: 153 KNRQQLLGIS----DQLMARLNMKTATA--PGQLSPEVLDKAYLLFARHFDSTYGGFGNP 206

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  + ++L   KK           +   MV  TL  M +GGI+DH+G GF RYS 
Sbjct: 207 PKFPTPHNLMLLLRCWKKTSQ-------KKALTMVEDTLDAMHRGGIYDHIGFGFSRYST 259

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D RW VPHFEKMLYD   LA  +L+ + + ++  +S + ++I  Y+ RDM  P G  +SA
Sbjct: 260 DRRWLVPHFEKMLYDNALLAIAFLETYQINRNPRFSRVAKEIFTYVLRDMTAPEGGFYSA 319

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG     EG FYVW  +EVE +LG+    LF  +Y + P GN          
Sbjct: 320 EDADS---EGV----EGKFYVWHPQEVEQVLGQIDGQLFCRYYDITPRGN---------- 362

Query: 358 NEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
             F+G ++   +N D    A +L + LE  ++ L +CR+ LF  R KR  PH DDK++ S
Sbjct: 363 --FEGASIPNLINQDPLKFAQELDITLEDLVDGLEKCRQLLFAQREKRVHPHKDDKILTS 420

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+I++ AR +++L  E                +Y + AE A  FI  +L      RL
Sbjct: 421 WNGLMIAALARGARVLGDE----------------KYSQAAEKAVDFIYHNL-QRADGRL 463

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              +R+G +  P +LDDYAFLI GLL+LYE     K L  A++L ++  +LF DR+ GG+
Sbjct: 464 LARYRDGEAAYPAYLDDYAFLIWGLLELYEATFDIKHLEQAVQLTDSMIDLFWDRQNGGF 523

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F    +   ++ R KE +DGA PSGNSV+ +NL RLA +   ++ + Y + A   L VF 
Sbjct: 524 FFYGKDSEQLISRPKEIYDGAIPSGNSVATVNLFRLARL---TERNRYEELATKQLQVFA 580

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
             L+   +       AA +   P  + +VL G +     + M+      + L   V+ + 
Sbjct: 581 GELEHYPIGYSYFMIAAYLNQEPPTE-IVLSGKREDSALKQMIDVVQKEF-LPSAVLAVR 638

Query: 657 PADTEEMDFWEEHNSNNASMARNNFS-ADKVVALVCQNFSCSPPVTD 702
                              + ++    A K  A VC+NF+C PPVTD
Sbjct: 639 YEGEAAA-----QAEELVPLLKDRLPVAGKATAYVCKNFACQPPVTD 680


>gi|347839355|emb|CCD53927.1| similar to DUF255 domain protein [Botryotinia fuckeliana]
          Length = 823

 Score =  446 bits (1148), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 241/592 (40%), Positives = 354/592 (59%), Gaps = 26/592 (4%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCH+ME ESFE+E VA +LN  F+ IK+DREERPD+D++YM +VQA  G GGWPL+
Sbjct: 82  SSCHWCHIMERESFENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLN 141

Query: 80  VFLSPDLKPLMGGTYF----PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
           VFL+P L+P+ GGTY+       D   +  F  IL K+   W ++     Q  A +++QL
Sbjct: 142 VFLTPSLEPVFGGTYWRGPSKTTDFEDQVDFLGILDKLSTVWSEQESRCRQDSAQSLQQL 201

Query: 136 SEALSASASSNKLP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML- 191
            +  +    SN+L    D +    L    E  + SYD   GGFGSAPKFP P +I  +L 
Sbjct: 202 KDFANEGTLSNRLGEGVDNIDLELLEEVTEHFASSYDKANGGFGSAPKFPTPSKIAFLLR 261

Query: 192 --YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
                + + D     +    +++ + TL+ MA+GGIHDH+G GF RYS    W +PHFEK
Sbjct: 262 LGQFPQAVVDIVGLPDCQNAREIAITTLRKMARGGIHDHIGNGFARYSATADWSLPHFEK 321

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
           MLYD  QL ++YLD F L++D  +  +  DI +YL   +    G  +S+EDADS    G 
Sbjct: 322 MLYDNAQLLHLYLDGFLLSRDPEFLGVAYDIANYLTTTLSHSEGGFYSSEDADSYYKNGD 381

Query: 310 TRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           + K+EGA+YVWT +E E+ILG    L    ++   TG+ ++ + +DPH+EF  +NVL   
Sbjct: 382 SEKREGAYYVWTKREFENILGSERGLILSAFF-NVTGHGNVGQENDPHDEFMDQNVLAIS 440

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARA 428
           +  SA AS+ G+   + + ++ E + +L   R + R +P +DDKV+VSWNG+ + + AR 
Sbjct: 441 STPSALASQFGIKESEIIKVIKEGKAQLRRRRETDRVKPAMDDKVVVSWNGIAVGALARL 500

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
           S ++        F+ PV     +EY++ A  AA+FI+++LYD++   L   +R G     
Sbjct: 501 SSVING------FD-PVKA---QEYLDAALKAATFIKKNLYDDKAKILYRIWREGRGDTQ 550

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVL 547
           GF DDYAFLI GL+DLYE     KWL WA ELQ +Q  LF D+ G G +F+TT   P+V+
Sbjct: 551 GFADDYAFLIEGLIDLYETTFDEKWLQWADELQQSQINLFYDKNGTGAFFSTTVSAPNVI 610

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           LR+K+  D +EPS N +S  NL RL+S+      + Y + A+ ++  FE  +
Sbjct: 611 LRLKDAMDSSEPSTNGISSSNLYRLSSMF---NDESYAKKAKETVKSFEAEM 659


>gi|195120756|ref|XP_002004887.1| GI20164 [Drosophila mojavensis]
 gi|193909955|gb|EDW08822.1| GI20164 [Drosophila mojavensis]
          Length = 747

 Score =  446 bits (1148), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 269/716 (37%), Positives = 362/716 (50%), Gaps = 61/716 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED   A+++N  FV+IKVDREERPD+DKVYM ++    G GGWP+S
Sbjct: 61  STCHWCHVMEHESFEDAATAEVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL+PL  GTYFPP+ +YG P F  +L  +   W   RD L ++G+  ++ +    
Sbjct: 121 VWLTPDLEPLAAGTYFPPKPRYGMPSFTMVLESIAKKWVADRDSLKKAGSTLLQAMQTNQ 180

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           SA  S+    +    +A    A  + K  +D +  GFG  PKFP    +  + +     +
Sbjct: 181 SAGTSAEMAFERGSGDAKLAEAVAVHKQRFDQQHAGFGREPKFPEVPRLNFLFHAYLVTK 240

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D        +   MVL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQL 
Sbjct: 241 DV-------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQLM 293

Query: 259 NVYLDAFSLTKDV-FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             Y +A+ LT+   F  Y  R I +YL +D+  P G  ++ EDADS  T   T K EGAF
Sbjct: 294 AAYANAYKLTRSKEFLGYADR-IYEYLIKDLRHPAGGFYAGEDADSLPTHEDTVKVEGAF 352

Query: 318 YVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKNV 365
           Y WT  EV+    +    FK+            HY LKP+GN  +S  SDPH    GKN+
Sbjct: 353 YAWTWDEVKQAFQKEESCFKDISAARAFEIYSFHYDLKPSGN--VSPSSDPHGHLTGKNI 410

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
           LI       + S   M LEK   +L      L  +R +RPRPHLD K+I  WNGLV+S  
Sbjct: 411 LIVRGSEEDTCSNFNMELEKLQQLLRTANEILHKIRDQRPRPHLDTKIICGWNGLVLSGL 470

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS------ 479
           A+ +    ++              R  Y+  A+    F+R+HLYDE    L  S      
Sbjct: 471 AKLANCGTAK--------------RDAYLATAKQLMEFVRKHLYDEDEKLLLRSCYGAGV 516

Query: 480 ----FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
                    ++  GFLDDYAFLI GLLD Y+     + L W+  LQ TQD+LF D + G 
Sbjct: 517 ADDTLEQNATRIEGFLDDYAFLIKGLLDYYKASLEMEALNWSKTLQETQDKLFWDEDKGA 576

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           YF +    P+V++R+KEDHDGAEP GNSV+  NL  L+      K   Y + A   L  F
Sbjct: 577 YFFSQQNAPNVIVRLKEDHDGAEPCGNSVAARNLTLLSHYYDDRK---YFERATKLLNYF 633

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
              +     A+P M  A  +L       V +VG   S D    +      Y     ++H 
Sbjct: 634 -ADVSPFGHALPEMLSAL-LLHENGLDLVAVVG-PDSEDTRRFVEIVRKFYVPGMIIVHC 690

Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           DP   +          N     +      K    +C +  C  PVTDP  LE  L+
Sbjct: 691 DPLHPDAA-------CNQRLQQKFKMVNGKTTVYICHDRVCRMPVTDPAQLEENLM 739


>gi|449300572|gb|EMC96584.1| hypothetical protein BAUCODRAFT_33944 [Baudoinia compniacensis UAMH
           10762]
          Length = 739

 Score =  446 bits (1148), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 270/706 (38%), Positives = 382/706 (54%), Gaps = 45/706 (6%)

Query: 11  KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
           +T R  F+    + CHWCHVM  ESF+D  +A+LLN+ F+ IK+DREERPD+D+ YM ++
Sbjct: 43  QTNRLLFVSIGYSACHWCHVMAHESFDDPRIAQLLNEHFIPIKIDREERPDIDRQYMDFL 102

Query: 68  QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PED---KYGRPGFKTILRKVKDAWDKKRDM 123
           QA  GGGGWPL+VF++PDL+P+ GGTY+P P+    + G  GF+ IL KV   W ++   
Sbjct: 103 QATSGGGGWPLNVFVTPDLEPIFGGTYWPGPKSERAQMGGTGFEQILVKVAQMWKEQESK 162

Query: 124 LAQSGAFAIEQLSEALSASASSNKL-------PDELPQNALRLCAEQLSKSYDSRFGGFG 176
           L ++G     QL E         +         D L  + +          +DS++GGFG
Sbjct: 163 LRENGKQITAQLKEFAQEGTLGGRTDGKTSDGDDGLELDLIEEAYNHYKGRFDSKYGGFG 222

Query: 177 SAPKFPRPVEIQMMLY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
           SAPKFP PV ++ ++    H   +++     E    + M + TL+CMAKGGI D VG GF
Sbjct: 223 SAPKFPTPVHLKALVRFGCHPHTVKEIVGDKEVKHARYMAVKTLECMAKGGIKDQVGHGF 282

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIGPG 292
            RYSV   W +PHFEKMLYD  QL  +YLDA+ LTK   +     D+  YL  + M    
Sbjct: 283 ARYSVTRDWSLPHFEKMLYDNAQLLPLYLDAYLLTKTDLFLETVHDVATYLTTEPMQSSL 342

Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
           G I ++EDADS  T     K+EGAFYVWT  E +++L  E A +   ++ ++P GN D  
Sbjct: 343 GGINASEDADSLPTAIDHHKREGAFYVWTLDEFKELLTDEEATVCARYWNVQPNGNVD-- 400

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLD 410
           R  D   E  G+N L    D+   AS+LGM   +   ++G  R+KL + R K RP P LD
Sbjct: 401 RRYDHQGELVGRNTLCVQYDTPDLASELGMSDSEVKRLIGSGRKKLLEYRDKNRPLPSLD 460

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
           DK++ +WNGL I   ARAS  L S A  +           + Y+  AE AA+ I++HL+D
Sbjct: 461 DKIVTAWNGLAIGGLARASAALSSMAPDSA----------QAYLAGAERAAACIKQHLFD 510

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
            +T  L+  +R GP +  GF DDYAFLISGLLDLYE      +L +A  LQ TQ +LF D
Sbjct: 511 AKTGTLRRVYREGPGETQGFADDYAFLISGLLDLYEATFDDSYLSFADTLQQTQVKLFWD 570

Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
                +F+T    P +L+R K+  D AEPS N VS  NL RL+S++   K   Y + A+ 
Sbjct: 571 DNKYAFFSTPANQPDILVRTKDAMDNAEPSTNGVSAQNLFRLSSLLNDEK---YEKMAKR 627

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
           ++A FE  +         M  +  + S    K +++VG       E  L  A  S   N 
Sbjct: 628 TVAAFEVEIGQHPGLFSGMMSSI-IASKLGMKGLMVVGEGEVA--EAALKKARESVRPNW 684

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSC 696
           TV+ +      E  +  + N         +    +V+  VC++ +C
Sbjct: 685 TVLRV--GGKAEAKWLRQRNE-----LLQDLDGSRVMVQVCEDGAC 723


>gi|283778260|ref|YP_003369015.1| hypothetical protein Psta_0467 [Pirellula staleyi DSM 6068]
 gi|283436713|gb|ADB15155.1| protein of unknown function DUF255 [Pirellula staleyi DSM 6068]
          Length = 709

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 270/703 (38%), Positives = 388/703 (55%), Gaps = 75/703 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE + +A  LN+ FV IKVDREERPD+D++YM  VQ + G GGWP+S
Sbjct: 58  SACHWCHVMEHESFESQEIADYLNEHFVCIKVDREERPDLDQIYMDAVQLMTGRGGWPMS 117

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM-LAQSGAFAIEQLSEA 138
           VFL+P+ KP  GGTY+PP D+ G PGF  ++R V DAW  +R+  L+Q+      +L++ 
Sbjct: 118 VFLTPEGKPFFGGTYWPPTDRQGMPGFSRVIRAVIDAWKNRREQALSQA-----TELTDH 172

Query: 139 LSASASSNKLPDELPQNALR--------LCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L + A+SN  P +LP +  R          A +LS+++DSR+GGFGSAPKFP  ++++++
Sbjct: 173 LGSLATSNT-PAQLPLSVSRSMVDGWMETAAARLSRAFDSRYGGFGSAPKFPHSMDLELL 231

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           L   ++           +  +M L TL+ M+ GGI+DH+GGGF RYSVDERW VPHFEKM
Sbjct: 232 LLEWQR-------SARVDVAEMTLVTLEKMSAGGIYDHLGGGFARYSVDERWLVPHFEKM 284

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD   L    + A+  T D  ++   R+  +YL RDM    G I+S EDADS   EG  
Sbjct: 285 LYDNSLLLRALVRAYQATGDAKFAATMRETCNYLLRDMTDELGGIYSTEDADS---EG-- 339

Query: 311 RKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
             +EG FYVW   E+ ++LG E    F + Y + P GN            F+    ++ L
Sbjct: 340 --EEGKFYVWKPAEIYEVLGPERGSRFCQVYDVAPGGN------------FEHGFSILNL 385

Query: 370 NDSSASASKLG-MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
           + S A  S+L  MPLE   N L E R  LFDVR KR  P  DDK++ SWN L I + A  
Sbjct: 386 SRSIADWSRLWEMPLEVLSNELAEDRAILFDVREKRVHPGKDDKILTSWNALAIDALAEV 445

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
           + +L                D   Y+  A+ AA F+ +HL D    RL H++R+G +K  
Sbjct: 446 AGVL----------------DEPRYLLAAQRAADFVLQHLRDSDG-RLLHTWRHGRAKLA 488

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
            +LDDYA+L+  L+ LYE    T+WL  A+EL +     F D E GG+F T  +  +++ 
Sbjct: 489 AYLDDYAYLVHALVSLYEADFHTRWLSAAVELADQMIAHFSDHERGGFFFTADDHEALIT 548

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R K+ HDG+ PSG+S++ + L RL  I        Y   +E ++      +     A  +
Sbjct: 549 RAKDMHDGSVPSGSSMAALALARLGKITGKQA---YLLASERAILAASGSVTANPTASAV 605

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
           M  AAD+L  P+ + +VL G ++ V +    L   +A   +   ++   P D        
Sbjct: 606 MIQAADLLVGPTSE-IVLAGPEAEVRETARALRKIYAPRKVVAALMTGLPVDA------- 657

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S  A + +   S+ ++   +CQNFSC  PVT   S+   L
Sbjct: 658 --SSPVAPLVQGKESS-QLSLYICQNFSCQAPVTGASSIAAAL 697


>gi|195485941|ref|XP_002091297.1| GE13577 [Drosophila yakuba]
 gi|194177398|gb|EDW91009.1| GE13577 [Drosophila yakuba]
          Length = 809

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 268/742 (36%), Positives = 377/742 (50%), Gaps = 74/742 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVME ESFE    A ++N+ FV+IKVDREERPD
Sbjct: 102 GEEAFEKARSENKIIFLSVGYSTCHWCHVMEHESFESPVTAAIMNEKFVNIKVDREERPD 161

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +DK+YM ++    G GGWP+SV+L+P L PL+ GTYFPP+ +YG P F  +L+ +   W+
Sbjct: 162 IDKIYMQFLLMSKGSGGWPMSVWLTPTLAPLVAGTYFPPKSRYGMPSFNAVLKSIAKKWE 221

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGF 175
             ++ L  +G+  +  L +   ASA +         +A+   +E ++   + +D   GGF
Sbjct: 222 TDKESLLTAGSTLLTALQKNQDASAVAEAAFG--VGSAIEKLSEAINVHKQRFDQTHGGF 279

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           GS PKFP    I  + +     +D       ++   MV+ TL  + KGGI+DH+ GGF R
Sbjct: 280 GSEPKFPEVPRINFLFHAYLVTKD-------ADVLDMVIETLTQIGKGGINDHIFGGFAR 332

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           Y+  E WH  HFEKMLYDQGQL   + +A+ +T+D  +      I  YL +D+  P G  
Sbjct: 333 YATTEDWHNVHFEKMLYDQGQLMAAFANAYKVTRDETFLGYADKIYKYLLKDLRHPLGGF 392

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLK 343
           ++ EDADS  T     K EGAFY WT  E++           DI  E A  ++  HY LK
Sbjct: 393 YAGEDADSLPTHEDNVKVEGAFYAWTWDEIQAAFKDQAQRLDDITPERAFEIYAYHYDLK 452

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
           P GN  +   SDPH    GKN+LI       S +   +  +K+  +L      L  VR +
Sbjct: 453 PPGN--VPAYSDPHGHLTGKNILIVRGSEEDSIANFSLEADKFKKLLATTNDILHVVREQ 510

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RPRPHLD K+I +WNGLV+S   +                    ++R +YM+ A+    F
Sbjct: 511 RPRPHLDTKIICAWNGLVLSGLCKLGN--------------CYSANRDQYMQTAKELLDF 556

Query: 464 IRRHLYDEQTHRLQHS----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 513
           +R+ +YD +   L  S               S+  GFLDDYAFLI GLLD Y+       
Sbjct: 557 LRKEMYDPEKKLLIRSCYGVAVGDETLEKNESQIDGFLDDYAFLIKGLLDYYKATLDVDV 616

Query: 514 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 573
           L WA  LQ+TQD+LF D   G YF +  + P+V++R+KEDHDGAEP GNSVS  NLV L 
Sbjct: 617 LHWAKALQDTQDKLFWDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSARNLVLLG 676

Query: 574 SIVAGSKSDYYRQNA----EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 629
                    YY +NA       L  F   +     A+P M  A  +L   +   +V V  
Sbjct: 677 H--------YYDENAYLQKAGKLLNFFADVSPFGHALPEMLSA--LLMHENGLDLVAVVG 726

Query: 630 KSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVAL 689
             S D +  +      Y  +  ++H+DP++  E        SN     +      K    
Sbjct: 727 PDSPDTQRFVEICRKFYIPSMIIVHVDPSNPGEA-------SNQRLQTKFKMVGGKTTVY 779

Query: 690 VCQNFSCSPPVTDPISLENLLL 711
           +C   +C  PVTDP  LE+ L+
Sbjct: 780 ICHERACRMPVTDPQQLEDNLM 801


>gi|407917811|gb|EKG11113.1| protein of unknown function DUF255 [Macrophomina phaseolina MS6]
          Length = 747

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 262/665 (39%), Positives = 356/665 (53%), Gaps = 35/665 (5%)

Query: 11  KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
           KT R  F+      CHWCHVME ESFE+  +A +LN  F+ +KVDREERPDVD++YM YV
Sbjct: 53  KTNRLLFVSIGYAACHWCHVMERESFENPEIANILNKNFIPVKVDREERPDVDRIYMNYV 112

Query: 68  QALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYG----RPGFKTILRKVKDAWDKKRDM 123
           QA  G GGWPL+VF++PDL+P+ GGTY+P           P F  IL ++KD W  +R  
Sbjct: 113 QATTGSGGWPLNVFITPDLEPIFGGTYWPGPGSTTVLGDHPSFLEILERIKDVWQTQRQK 172

Query: 124 LAQSGAFAIEQL----SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAP 179
             +S      QL     E   +      + D L    L       +  YD ++ GFG AP
Sbjct: 173 CLESAKEVTAQLREFAQEGTISKGGEGAVGDGLDLELLEEAYTHFANKYDKQYAGFGKAP 232

Query: 180 KFPRPVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
           KFP P  I  +L    + + +E      E +  ++M + TL+ MA+GGIHD +G GF RY
Sbjct: 233 KFPTPTNISFLLRLAQYPEAVEHVVGDRECAHAKEMAVETLRRMARGGIHDQIGNGFARY 292

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYL-RRDMIGPGGEI 295
           SV   W +PHFEKMLYDQ QL   YLDA  +T D        DI  YL    +  P G  
Sbjct: 293 SVTRDWSLPHFEKMLYDQSQLLTAYLDAHIITNDSELLDAAHDIATYLTTHPLQSPDGGF 352

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMS 354
           FS+EDADS        K+EGAFYVWT KE + ILGE  A +   +Y ++  GN  +S   
Sbjct: 353 FSSEDADSLYRPNDKEKREGAFYVWTRKEFKSILGEKDAEVCARYYNVRENGN--VSPEH 410

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKV 413
           D H+E   +NVL   +   A A + G+  ++   IL   RR+L + R+K RPRP LDDK+
Sbjct: 411 DAHDELINQNVLAISSTPDALAKEFGLSKDEVTKILESGRRRLLEHRNKERPRPGLDDKI 470

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
           +V WNGL I + AR S  L++              DR  Y+  AE A   I+  LY    
Sbjct: 471 VVGWNGLAIGALARFSAYLQASGSKE--------PDR--YISAAEKAVKLIKTKLYSAAD 520

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
             L+  +R GP +AP F DDYAFLISGL+DLYE      +L +A +LQ TQ +LF D   
Sbjct: 521 GTLKRVYREGPGEAPAFADDYAFLISGLIDLYEATFDDSYLEFADQLQRTQIKLFWDSTS 580

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           G +F+T      ++LR+KE  D AEPS N +S  NL RL +++   + DY ++ A+ +  
Sbjct: 581 GAFFSTAEGQADLILRLKEGMDNAEPSTNGISASNLYRLGALL--EEPDYTKR-AKETCE 637

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            FE  L       P M      L +   K +V+ G   +V  E  ++ A +  + N T+ 
Sbjct: 638 AFEAELMQHPFLFPSMLNGIVALRL-GMKSIVVSGSGENV--EKAISKARSRVNTNTTIA 694

Query: 654 HIDPA 658
            + P 
Sbjct: 695 RLGPG 699


>gi|195583350|ref|XP_002081485.1| GD11041 [Drosophila simulans]
 gi|194193494|gb|EDX07070.1| GD11041 [Drosophila simulans]
          Length = 808

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 268/723 (37%), Positives = 374/723 (51%), Gaps = 75/723 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE    A ++N+ FV+IKVDREERPD+DK+YM ++    G GGWP+S
Sbjct: 122 STCHWCHVMEHESFESPETAAIMNENFVNIKVDREERPDIDKIYMQFLLMSKGSGGWPMS 181

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+L PL+ GTYFPP+ +YG P F  +L+ +   W+  ++ L  +G+  +  L +  
Sbjct: 182 VWLTPNLAPLVAGTYFPPKSRYGMPSFNAVLKSIARKWETDKESLLSTGSSLLSALQKNQ 241

Query: 140 SASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMM 190
            ASA        +P+ A       E+LS++       +D   GGFGS PKFP    +  +
Sbjct: 242 DASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGFGSEPKFPEVPRLNFL 293

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
            +     +D        +   MV+ TL  + KGGIHDH+ GGF RY+  + WH  HFEKM
Sbjct: 294 FHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFARYATTQDWHNVHFEKM 346

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYDQGQL   + +A+ +T+D  Y      I  YL +D+  P G  ++ EDADS  T    
Sbjct: 347 LYDQGQLIVAFTNAYKVTRDEIYLGYADKIYKYLIKDLRHPLGGFYAGEDADSLPTHEDK 406

Query: 311 RKKEGAFYVWTSKEV-----------EDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHN 358
            K EGAFY WT  E+           EDI  E A  ++  HY LKP GN  +   SDPH 
Sbjct: 407 VKVEGAFYAWTWDEIQAAFKDQAQRFEDITPERAFEIYAYHYDLKPPGN--VPTYSDPHG 464

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
              GKN+LI       + +   +  +++  +L      L  +R KRPRPHLD K+I +WN
Sbjct: 465 HLTGKNILIVRGSEEDTCANFKLEADQFKKLLATTNDILHVIRDKRPRPHLDTKIICAWN 524

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GLV+S   +                    ++R++YM+ A+    F+R+ +YD +   L  
Sbjct: 525 GLVLSGLCKLGN--------------CYSANREQYMQTAKELLDFLRKEMYDPEQKLLIR 570

Query: 479 S----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
           S               S+  GFLDDYAFLI GLLD Y+       L WA  LQ+TQD+LF
Sbjct: 571 SCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDKLF 630

Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            D   G YF +  + P+V++R+KEDHDGAEP GNSVS  NLV LA        D + Q A
Sbjct: 631 WDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSAHNLVLLAHYY---DEDAFLQKA 687

Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 648
              L  F   +     A+P M  A  +L   +   +V V    S D E  +      Y  
Sbjct: 688 GKLLNFF-ADVSPFGHALPEMLSA--LLMHENGLDLVAVVGPDSPDTERFVEICRKFYIP 744

Query: 649 NKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLEN 708
           +  ++H+DP++ EE        SN     +      K    +C   +C  PVTDP  LE+
Sbjct: 745 SMIIVHVDPSNPEEA-------SNQRLQTKFKMVGGKTTVYICHERACRMPVTDPQQLED 797

Query: 709 LLL 711
            L+
Sbjct: 798 NLM 800


>gi|341876361|gb|EGT32296.1| hypothetical protein CAEBREN_30752 [Caenorhabditis brenneri]
          Length = 745

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 278/743 (37%), Positives = 386/743 (51%), Gaps = 79/743 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +T +  FL    +TCHWCHVME ESFE+E  AK+LN+ FV+IKVDREERPD
Sbjct: 45  GEEAFQKAKETNKPIFLSVGYSTCHWCHVMEKESFENENTAKILNENFVAIKVDREERPD 104

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YM +V A  G GGWP+SVFL+PDL P+ GGTYFPP+D  G  GF TIL  +   W 
Sbjct: 105 VDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEWQ 164

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K+ + L   GA  I+ L   +  S   N+  D       +        ++DSR GGFG A
Sbjct: 165 KEGENLRTRGAQIIKLLQPEIK-SGDVNRSED-----VFKSIYSHKKSTFDSRLGGFGRA 218

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP+  +   ++  +        S E  E   M+  TL+ MA GGIHDH+G GFHRYSV
Sbjct: 219 PKFPKAPDFDFLIAFAS---SQSNSEEKQESIMMLQKTLESMADGGIHDHIGNGFHRYSV 275

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS--YICRDILDYLRRDMIGPGGEIF 296
           D  WH+PHFEKM+YDQ QL   Y +  SLT+    S   +  DI +Y+++     GG  +
Sbjct: 276 DSEWHIPHFEKMIYDQSQLLASYSEFHSLTEKKHESIKLVINDIFEYMQKISHKDGG-FY 334

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
           +AEDADS  T  +T K EGAF  W   E++ +LGE  I       +F +++ ++  GN  
Sbjct: 335 AAEDADSLPTHESTEKVEGAFCAWERDEIKQLLGEKKIESASLFDVFVDYFDVEENGN-- 392

Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
           +++ SDPH E K KNVL +L      A+  G+ +E+  N + E R  L+  R+KRP PHL
Sbjct: 393 VAKSSDPHGELKNKNVLRKLLTDEECATNHGITVEQLKNGIDEAREILWIARTKRPSPHL 452

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           D K++ +W GL I+   +A +                 ++  +Y+E AE  A+F+ ++L 
Sbjct: 453 DSKMVTAWQGLAITGLVKAYQ----------------ATNEPKYLERAEKCAAFVEKYL- 495

Query: 470 DEQTHRLQHS--------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
            E+   L+ S           G  +   F DDYAFLI GLLDLY      ++L   IELQ
Sbjct: 496 -EENGELRRSVYLGDNGEVEQGNQRMKAFSDDYAFLIQGLLDLYTVAGKNEYLERCIELQ 554

Query: 522 NTQDELFLDREGGGYFNTTGEDPSVLLRVKE--------------DHDGAEPSGNSVSVI 567
            T DE F    G GYF +   D  V +R+ E              D DGAEP+  S++  
Sbjct: 555 KTCDEKFWS--GNGYFISEKSDEEVSVRMIEGKIILSNFYKKNFSDQDGAEPTATSIASN 612

Query: 568 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
           NL+R   I+   +++ YR+ A         RL  + +A+P M  A     + S    VLV
Sbjct: 613 NLLRFYDIL---ENEEYREKANQCFRGASERLNKIPIALPKMAVALQRWQLGSTT-FVLV 668

Query: 628 GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVV 687
           G  +S          +     N +V+HI   D    D     +S+NA MA+      +  
Sbjct: 669 GDPTSELLTEARNQLNQKLINNVSVVHIRSKD----DVSASGSSHNA-MAQ----GPQPA 719

Query: 688 ALVCQNFSCSPPVTDPISLENLL 710
             +C+ F C  PV     LE L 
Sbjct: 720 VYLCKGFVCGLPVRKIDKLEQLF 742


>gi|28210673|ref|NP_781617.1| thymidylate kinase [Clostridium tetani E88]
 gi|28203111|gb|AAO35554.1| thymidylate kinase [Clostridium tetani E88]
          Length = 713

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 264/720 (36%), Positives = 391/720 (54%), Gaps = 89/720 (12%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFEDE VAK+LND F+SIKVDREERPD
Sbjct: 69  GEEAFQKAKEEDKPIFLSIGYSTCHWCHVMERESFEDEEVAKVLNDNFISIKVDREERPD 128

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +YMT+ QA+ G GGWPL++ ++PD KP   GTYFP ED+YG  G   IL+++ + W 
Sbjct: 129 IDNIYMTFCQAVTGSGGWPLTIIMTPDKKPFFAGTYFPKEDRYGVRGLMYILKEMSNQWK 188

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             R+++  S    ++ +S+ +S S       ++L +  ++ C E L +SYD   GGF  A
Sbjct: 189 NNRELILNSSEKLLKDMSQYISVSQR-----EDLNKEVIKECFEVLKESYDPIHGGFYDA 243

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP   ++  +L + +  +D        E   +V  TL+ M KGGI DH+G GF RYS 
Sbjct: 244 PKFPTSHKLMFLLRYYRLYKD-------EEALNIVEKTLKSMYKGGIFDHIGYGFSRYST 296

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D++W VPHFEKMLYD   L   Y + + +TK+  Y  I    + Y+ RDM    G  +SA
Sbjct: 297 DDKWLVPHFEKMLYDNAMLTIAYAEMYQITKEELYKEIIEKTISYVIRDMKDKKGAFYSA 356

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG     EG FYVWT +E+EDILG E A LF ++Y +   GN          
Sbjct: 357 EDADS---EGV----EGKFYVWTLEEIEDILGKEDAKLFSKYYGITDRGN---------- 399

Query: 358 NEFKGKNV--LIELNDSSASASKLGMPLEKY----LNILGECRRKLFDVRSKRPRPHLDD 411
             F+G+N+  LIE             PLE       + L   R+ LF  R KR  PH D 
Sbjct: 400 --FEGENIPNLIE------------TPLEDLEPDVKDKLENIRKTLFINREKRIHPHKDT 445

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL+I++ A + ++LK                RK+Y+E AE A  FI ++L DE
Sbjct: 446 KILTSWNGLMIAALAYSGRVLK----------------RKDYIESAEEAVKFIMKNLIDE 489

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
              R+   +R+G     G L+DY+FLI  L++LY+    T+++  A+++     ELF D 
Sbjct: 490 NG-RIYVRYRDGERAHKGHLEDYSFLIWALIELYQSTFKTEYIEKALKINYDMIELFWDE 548

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           E  G+F+T  +   ++L++KE +D A PSGNSV++ N+VRL+ I   SK D   +  + +
Sbjct: 549 ENHGFFHTGKDGEELILKLKESYDSAIPSGNSVAMYNMVRLSRITGDSKLD---EIIQQN 605

Query: 592 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD-LNK 650
           L  F  R+K    +      +     + S + V++ G    + F+ M+   +  Y   + 
Sbjct: 606 LNYFSGRIKSTLESHTFFLISYMHYVLESEEIVIVKGEDEDI-FKAMIKVINEKYHPFSM 664

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            ++  +  +    +  E++N  N           K    +C+NF+C  P+   ISLE+L+
Sbjct: 665 NIVKDEKVEKLMPELKEKNNIQN-----------KTTVYICKNFACGNPI---ISLEDLI 710


>gi|374302064|ref|YP_005053703.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
 gi|332555000|gb|EGJ52044.1| protein of unknown function DUF255 [Desulfovibrio africanus str.
           Walvis Bay]
          Length = 691

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 278/712 (39%), Positives = 379/712 (53%), Gaps = 55/712 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F   T+  +  FL    +TCHWCHVME ESFED+ VAKLLN+ FV IKVDREERPD
Sbjct: 30  GEEAFRTATEQDKPVFLSIGYSTCHWCHVMERESFEDDEVAKLLNEAFVCIKVDREERPD 89

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYMT  Q + G GGWPL+V ++PD KP   GTYFP     GR G   ++ KV+D W 
Sbjct: 90  IDNVYMTVCQMMTGHGGWPLTVLMTPDKKPFFSGTYFPKSSLSGRMGLMELVPKVQDLWR 149

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +R+ L QS     E L   L   A   +L D +   A R    QLS+ +D  FGGFG A
Sbjct: 150 TRREDLVQSADKVTEAL-RGLERPAVGGELGDSVLFKAER----QLSERFDEAFGGFGGA 204

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P     +L   +    TG +   +    MV  TL  M +GGI+DH+G GFHRYS 
Sbjct: 205 PKFPTP---HNLLLLLRMFRRTGNARNLA----MVEKTLTTMRRGGIYDHLGYGFHRYST 257

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+RW +PHFEKMLYDQ QL   Y++A+ LT+   Y    ++I++Y+RRD+  P G  +SA
Sbjct: 258 DQRWLLPHFEKMLYDQAQLLMAYVEAYQLTRKPIYKRTAQEIVEYVRRDLQHPDGPFYSA 317

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADS   EG    +EG FYVW+ KE+  +LG+ A  F   Y + P GN     + +  +
Sbjct: 318 EDADS---EG----EEGKFYVWSEKEIRSVLGKKADPFIRAYDILPEGNF----LDEATH 366

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
              G NVL         A +LGM   +    L + RR LF VR +R RP  DDKV+  WN
Sbjct: 367 RRTGANVLHLQRPLDILAKELGMSELELETTLADQRRLLFHVRERRVRPLRDDKVLTDWN 426

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL+I++ + A+K L                D + ++  A +AA FI   +   +  RL H
Sbjct: 427 GLMIAALSMAAKAL----------------DEELFVRAATAAADFILSRM--RKDGRLLH 468

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
            FR+G       L DYAFLI GL++LYE G  ++ L  A++L    ++ F D + GGY+ 
Sbjct: 469 RFRDGEVAIEATLTDYAFLIWGLVELYEAGLDSRHLEAALDLTEIMNKQFWDPKDGGYYF 528

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
           T      +L+R K+  DGA PSGNSV++  L++L+ +               S A   T 
Sbjct: 529 TAESAEQLLVRQKDLFDGAIPSGNSVAMHVLLKLSRLTGRPNLANRAAAVARSAARQAT- 587

Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
             +  +    + C  D    PS   VV+VG +++ +   ML   HASY  NK ++  +  
Sbjct: 588 --EHPVGFTQLLCGVDFSIGPS-AEVVIVGKRNAPETRAMLRKLHASYIPNKVLLLREEG 644

Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           D       E   +     A       K  A VC+ FSC  PVT+P ++  LL
Sbjct: 645 D-------ERMPALAPFTAELVMQDGKATAYVCRGFSCELPVTEPQAMMELL 689


>gi|91201579|emb|CAJ74639.1| conserved hypothetical protein [Candidatus Kuenenia
           stuttgartiensis]
          Length = 729

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 264/714 (36%), Positives = 382/714 (53%), Gaps = 65/714 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F       +  FL    +TCHWCHVME ESFEDE VAK+LN+++V+IKVDREERPD
Sbjct: 75  GKEAFEKAKAESKVIFLSIGYSTCHWCHVMETESFEDEEVAKILNEYYVAIKVDREERPD 134

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYMT  QA+ G GGWPL++FL+ + K    GTYFP  ++ G PG   +L ++ + W+
Sbjct: 135 IDNVYMTVCQAMTGSGGWPLTLFLTSEGKSFYAGTYFPKTERLGNPGLIALLTQIANLWN 194

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             ++ +  S +  + +L +  +AS    K PD      L+   EQLS  +DS +GGFG++
Sbjct: 195 TNKESIIAS-SLQVTKLIDTETASKGEEK-PD---VRTLKTAYEQLSDRFDSLYGGFGTS 249

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P     +L   K+  +       +   +MV  +L+ MA+GGIHDH+GGGFHRYS 
Sbjct: 250 PKFPTPHNFTFLLRWWKRSNN-------AFALEMVEKSLELMARGGIHDHLGGGFHRYST 302

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DE W  PHFEKMLYDQ  LA  Y++ +  TK   YS I +DI DY+ RDM  P G  +SA
Sbjct: 303 DEYWLTPHFEKMLYDQALLAISYIETYQATKKDLYSAIAKDIFDYVLRDMTSPEGGFYSA 362

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGN--CDLSRMSDP 356
           EDADS   EG     EG FYVW  +E+++ LGE              GN  CD   +SD 
Sbjct: 363 EDADS---EGI----EGKFYVWKPEEIKEALGEK------------DGNIFCDFYDVSDI 403

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
            N F+ KN+L        +A    M  +     L   R+KL  +R KR +PH D K+I S
Sbjct: 404 GN-FEDKNILHADKPLHIAAKLENMSPDALEKRLANSRKKLLSIREKRIKPHKDTKIITS 462

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+IS+ +R ++ +                D  +Y  VA  AA FI   L  E    L
Sbjct: 463 WNGLMISALSRGAQAM----------------DEPKYTNVAMCAADFILNTLLQENKILL 506

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
           +  +  G S   GFLDDYAF ++GL+DLYE     K+L  A+++     + FLD   GG+
Sbjct: 507 RR-YCQGESAIAGFLDDYAFFVNGLIDLYEATFQEKYLQAALQINEEMIKNFLDENEGGF 565

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F +   +  +  + K+ +DGA PSGNS++++NL+RL  I        Y   A++ +  F 
Sbjct: 566 FLSGKSNEKLFTQTKDIYDGATPSGNSIALLNLLRLGRITGNPS---YEALADNLIKTFS 622

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
             +           CA D    P+ K +++ G +   D +++L    + +  NK V+ + 
Sbjct: 623 GTILQYPSGYTQFMCALDFALGPT-KEIIVAGEREGNDTKDILREIRSRFLPNK-VLLLH 680

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           P++     F EE       +        +    +C+N+SC  PV+D  ++  LL
Sbjct: 681 PSNG---IFIEEIAPYTKELIP---IEGRSTVYMCENYSCKKPVSDKNAVIQLL 728


>gi|333374035|ref|ZP_08465926.1| thymidylate kinase [Desmospora sp. 8437]
 gi|332968513|gb|EGK07575.1| thymidylate kinase [Desmospora sp. 8437]
          Length = 702

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 275/710 (38%), Positives = 382/710 (53%), Gaps = 65/710 (9%)

Query: 5   SFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDK 61
           +F    K  +  FL    +TCHWCHVME ESFED  VA+LLN  +++IKVDREERPDVD 
Sbjct: 49  AFAKARKEDKPIFLSIGYSTCHWCHVMERESFEDVEVAQLLNREYIAIKVDREERPDVDN 108

Query: 62  VYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR 121
           +YM+  QAL G GGWPL++ ++P+ +P   GTYFP +   G  G   IL +V  AW ++R
Sbjct: 109 IYMSVCQALTGHGGWPLTIIMTPEKEPFFAGTYFPKQAVQGMQGLMEILGQVARAWREER 168

Query: 122 DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKF 181
           + +  +G      +   L  S S +   +EL +        Q   +YD ++GGFG+APKF
Sbjct: 169 EQVLDAGRKITRAVQTQLKVSESGDLGKEELAE-----AYRQFKSTYDPQYGGFGTAPKF 223

Query: 182 PRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
           PRP ++  +L + K      +SGE      MV  TL  M +GGI+DHVG GF RY+VD  
Sbjct: 224 PRPHDLLFLLRYWK------ESGEPF-ALSMVEETLDGMRRGGIYDHVGFGFARYAVDRE 276

Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
           W VPHFEKMLYD   LA  YL+A+ +TK   Y+   R+I  Y+ R M  P G  +SAEDA
Sbjct: 277 WLVPHFEKMLYDNALLAYAYLEAYQVTKKDAYAGTAREIFTYVLRGMTSPEGGFYSAEDA 336

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSRMSDPHNEF 360
           DS   EG    +EG FYVW   EV+++LGE A  LF E Y + P GN +  +MS P+   
Sbjct: 337 DS---EG----EEGKFYVWNPSEVKEVLGEEAGELFCECYDITPHGNFE-QKMSIPN--- 385

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
           +  + L E+ D      + G  +E+    L   R KLF  R +R  PH DDK++ SWNGL
Sbjct: 386 RIHSSLQEIAD------RRGRDVEELREQLEVSREKLFRAREERVHPHKDDKILTSWNGL 439

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
           +I++ A+ +++L  E+                Y E AE AASFI   L DE+  RL   +
Sbjct: 440 MIAALAKGARVLGDES----------------YAEAAEKAASFILERLRDEKG-RLLARY 482

Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
           R+G +  PG++DDYAFL+ GL++LYE     ++L  A+EL     ELF D E GG + T 
Sbjct: 483 RDGEAAIPGYVDDYAFLVWGLIELYEATFRPRYLKSALELTREMLELFGDEEEGGLYFTG 542

Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
            +   +L R KE +DGA PSGNSV+ +NL RLA +   +     R+ A+  +  F   + 
Sbjct: 543 RDAEKLLTRTKEVYDGAVPSGNSVAALNLARLARLTGDTG---LREQADRQIRAFAGSVG 599

Query: 601 DMAMAVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
               A      A    L  P  K +V+ G     D E M+     ++ L + V+   P  
Sbjct: 600 QAPTAFSFFLTAVQFFLGTP--KEIVIAGPDGDHDTELMIRRVQQAF-LPEAVLLYKPEG 656

Query: 660 TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
                  EE       +A       +  A VC+N++C  P T   +LE L
Sbjct: 657 K-----GEEVTQLVPFLAEQGAIQGRATAYVCENYACMAPAT---TLEEL 698


>gi|332020712|gb|EGI61117.1| Spermatogenesis-associated protein 20 [Acromyrmex echinatior]
          Length = 746

 Score =  440 bits (1131), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 274/711 (38%), Positives = 386/711 (54%), Gaps = 69/711 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA--LYGGGGWP 77
           +TCHWCHVME ESF++E VAK++N+ +V+IKVDREERPD+D + M ++QA  L G GGWP
Sbjct: 63  STCHWCHVMEKESFKNEEVAKIMNENYVNIKVDREERPDIDMMCMMFIQASRLRGHGGWP 122

Query: 78  LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           L+VFL+PDL P+ GGTYF          F   L ++   W + RD + +S A   ++L E
Sbjct: 123 LNVFLTPDLMPITGGTYF------SCAMFTLYLTRIVKEWTEGRDKMVKSAAIVSDRLKE 176

Query: 138 ALSASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGSA-------PKFPRPVEIQM 189
            LS S    K  D +P  +   LCA  L   YD  +GGFGS+       PKFP P  +  
Sbjct: 177 -LSTSRHDIK-DDGVPAIDCAFLCAHVLLNIYDEEYGGFGSSSATNPNSPKFPEPTNLNF 234

Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
           +L     L  +    E S      L TL+ M+ GG+HDHVG GFHRY+VD RW VPHFEK
Sbjct: 235 LL-SMHVLSTSTMLVEMSLNAS--LNTLRKMSFGGLHDHVGKGFHRYTVDARWKVPHFEK 291

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
           MLYDQ QL   Y+DA+ +TKD F+S I  DI  Y+ R +    G  FSA DADS  T  A
Sbjct: 292 MLYDQAQLIQCYVDAYIITKDSFFSDIVDDIATYVLRMLTHMEGGFFSAVDADSLPTFDA 351

Query: 310 TRKKEGAFYVWTSKEVEDIL-----GEHAI----LFKEHYYLKPTGNCDLSRMSDPHNEF 360
             K+EGAFYVW+   ++ +L     G+  +    L   H+ ++  GN  + R  DPH E 
Sbjct: 352 PAKREGAFYVWSYDNLKALLKKKVPGKDNVTYFDLICRHFSVRKEGN--VERPQDPHGEL 409

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
            GKNVL   +    +A+   + +++    + E    L++ RS RP P LDDK++ SWNGL
Sbjct: 410 TGKNVLSMQSGIEDTANHFKLNVKETQKYIKEACTTLYEDRSHRPWPSLDDKMVTSWNGL 469

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS- 479
           +IS  ARA   +K+                K+Y+E A  AA+F+ ++L+++    L  S 
Sbjct: 470 MISGLARAGIAVKN----------------KDYVEAATEAATFVEKYLFNKDKRILLRSC 513

Query: 480 FRNGPSK-------APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
           +R    K        PGF +DYAF + GLLDLYE      W+ +A ELQ+ QD LF D E
Sbjct: 514 YRRRDDKIVQRSDPIPGFHEDYAFFVKGLLDLYEATFNPHWVEFAEELQDIQDRLFWDSE 573

Query: 533 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
            GGYF    E P +L R K+  DG++PSGNS++  NL+RLA  +     D  R  AE  L
Sbjct: 574 DGGYFAMAEESP-ILTRTKDSDDGSQPSGNSIACSNLLRLAIYL---DRDDLRHKAEKLL 629

Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 652
             F  +L +   A P M  A      P++ +V   G   + +   ML    +     + +
Sbjct: 630 CAFGNKLANCPAACPQMMLALIEFHHPTQIYV--AGKADAKETIEMLEIIRSRLIPGRVL 687

Query: 653 IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
           I    AD+E+   +      N  + R     ++    +C+++SC+ P+++P
Sbjct: 688 IL---ADSEDNVLFRR----NMIVKRMKPQKNRATVFICRDYSCTLPISNP 731


>gi|296415498|ref|XP_002837423.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633295|emb|CAZ81614.1| unnamed protein product [Tuber melanosporum]
          Length = 773

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 261/693 (37%), Positives = 376/693 (54%), Gaps = 63/693 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           C +  VME ESFE+E +A++LN+ F+ IK+DREERPD+D++YM +VQA  G GGWPL+VF
Sbjct: 109 CEYTIVMERESFENEEIARILNENFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLNVF 168

Query: 82  LSPDLKPLMGGTYFPPEDKYG----RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           L+PDL+P+ GGTY+P     G    + GF  +LRK+ + W ++ +    S +  + QL E
Sbjct: 169 LTPDLQPVFGGTYWPGPSAVGGMKDQLGFLEVLRKIANVWKEQHERCVASASDILNQLKE 228

Query: 138 ALSAS--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---Y 192
                   +  +  D L  + L    +     YD  +GGFG+APKFP PV +  +L    
Sbjct: 229 FTDEGLKGTGGEPGDGLELDLLEEAYQHFMARYDPLYGGFGNAPKFPTPVNLAFLLRLGT 288

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
               ++D     E    + MV+ TLQ MAKGGIHDH+G GF RYSV   W++PHFEKMLY
Sbjct: 289 FPATVQDIVGEMECENAKSMVIDTLQGMAKGGIHDHIGHGFSRYSVTANWNLPHFEKMLY 348

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATR 311
           DQ QL ++Y+DA+ +TK         DI +Y+  D +  P G  +S+EDADS   +  T 
Sbjct: 349 DQAQLLSIYIDAWLVTKSPAMLEAANDIAEYMCLDALKSPDGAFYSSEDADSLYRKADTE 408

Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           K+EGAFYVWT KE + +LGE  A +   ++ +   GN D +  +DPH+EF  +NVL   +
Sbjct: 409 KREGAFYVWTRKEFDVMLGEQDASICARYWNVHRDGNVDPA--NDPHDEFIAQNVLSVAS 466

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARAS 429
                +   GM  E+  NI+   R+KL   R K RPRP+LDDK++ +             
Sbjct: 467 TPEKLSKMYGMSAERITNIISSARQKLLQHRLKERPRPNLDDKIVTT------------- 513

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
                                + Y + AE A SFIR++LYDE+T  L+  +R+GP +A G
Sbjct: 514 ---------------------QLYKKNAEEAISFIRKNLYDEKTGILKRVYRDGPGEADG 552

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           F DDYAFLISGLL +YE     ++L WA  LQ  Q + F D E GG+F+T+     ++LR
Sbjct: 553 FADDYAFLISGLLCMYEATFDVEYLQWADALQQKQIDAFWDAENGGFFSTSEGASDLILR 612

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
           +K+  D  EPS N VS  NL RL +++   K + Y   A+ + + F T L    +  P +
Sbjct: 613 LKDGLDSQEPSTNGVSANNLFRLGTLLGDPKLEEY---AQQTCSAFSTEL----LQHPFL 665

Query: 610 CCA---ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
             +   A + S    + VVL G       E  L    +    N T++ +DPA  + +D+ 
Sbjct: 666 FSSLMPAIVASNLGMRSVVLAGDPKDPTIEKHLKRLRSKLLTNTTLVQLDPARGDSLDWL 725

Query: 667 EEHNSNNASMARNNFSAD---KVVALVCQNFSC 696
              N  +  +   N +A    K V  VC+   C
Sbjct: 726 LSRNKLHKELL--NVAAKGSGKPVVQVCEGTKC 756


>gi|108805332|ref|YP_645269.1| hypothetical protein Rxyl_2540 [Rubrobacter xylanophilus DSM 9941]
 gi|108766575|gb|ABG05457.1| protein of unknown function DUF255 [Rubrobacter xylanophilus DSM
           9941]
          Length = 685

 Score =  439 bits (1128), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 267/696 (38%), Positives = 386/696 (55%), Gaps = 65/696 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME ESFEDE  A+++N+ FV+IKVDREERPD+D +YM+ +QA+  GGGWP++
Sbjct: 51  SSCHWCHVMERESFEDEETARIMNEHFVNIKVDREERPDIDSIYMSALQAMTRGGGWPMT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+  P   GTYFPPE + G P FK +L  + DA+  +R+ + +S     E L  + 
Sbjct: 111 VFLTPEGVPFYAGTYFPPEPRGGMPSFKQVLLTLADAYRNRREEVLRSAESVREFLRAST 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A     +L +EL   A    AE L +  D RFGGFG APKFP+P+ ++++L H ++  D
Sbjct: 171 TAEMPRGRLREELLDGA----AEALMRQLDRRFGGFGGAPKFPQPMSLEVLLRHHRRTGD 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   E    V  TL+ MA+GGI+D +GGGFHRY+VD RW VPHFEKMLYD   L+ 
Sbjct: 227 -------REALAGVELTLRSMARGGIYDQLGGGFHRYAVDGRWLVPHFEKMLYDNALLSR 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +YL+A+  T D FY  I  + LDY+ RDM GP G  +SAEDADS   EG    +EG FYV
Sbjct: 280 LYLEAYQATGDGFYRRIAEETLDYVARDMRGPEGGFYSAEDADS---EG----EEGKFYV 332

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +E+ + LG E A L   ++ +   GN            F+G+NVL    +    A +
Sbjct: 333 WTPRELREALGSEDASLAAAYWGVTERGN------------FEGRNVLHVPREPEEVARE 380

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           +G+   +    + E RR+L + R +R RP  D+KV+ +WNGL++ SFA  +++L+     
Sbjct: 381 VGLSPGELGRRVREIRRRLLEARGRRVRPGRDEKVLAAWNGLMLRSFAFTARVLR----- 435

Query: 439 AMFNFPVVGSDRKEYMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      R++Y+ +A E+AA  + R L  E   RL  S+R+G ++  G+L+DYA +
Sbjct: 436 -----------REDYLRIACENAAFLLGRLLSPE--GRLLRSYRDGRARIAGYLEDYAMV 482

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             GL+ LYE    T+WL  AI L +  DELF D   G +F+       ++ R ++ +D A
Sbjct: 483 ADGLVSLYEATFETRWLREAISLADAMDELFWDESAGAFFDAPAGGEELVTRPRDVYDNA 542

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-L 616
            PSG SV+V   V L   +   + D YR+ AE +L      L+ M  A   +  A D  L
Sbjct: 543 TPSGTSVAVD--VLLRLALLLGRED-YRRRAEAALEGLSGLLEQMPAAFGRLLGALDFHL 599

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             P  + V +VG   + D   ++ A ++ Y  N+ VI   P          E  S    +
Sbjct: 600 GRP--REVAIVGRPDAPDTRALVDALYSVYLPNR-VIAGGPGG--------EDASLVPLL 648

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
                   +  A VC+ + C  P T+P  L   L E
Sbjct: 649 EGRGMVDGRATAYVCEGYVCKSPTTEPGELLRQLRE 684


>gi|298710386|emb|CBJ25450.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 808

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 296/782 (37%), Positives = 407/782 (52%), Gaps = 94/782 (12%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    +  +  FL    +TCHWCHVME ESFE + VAK+LN+ FVSIKVDREERPD
Sbjct: 48  GQEAFSRAKEEDKPIFLSVGYSTCHWCHVMERESFESQTVAKVLNENFVSIKVDREERPD 107

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD+ +MT+VQA  GGGGWP+SV+L+PDLKP +G TYFP         F +IL+ + D W 
Sbjct: 108 VDQCFMTFVQATSGGGGWPMSVWLTPDLKPFVGATYFPEMR------FVSILKTLADKWS 161

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL-----PQNALRLCAEQLSKSYDSRFG 173
             R+ + + G   +  L E LS +A+++  P         + A+R     L K +D   G
Sbjct: 162 SDREEVVKQGDHIVRLLQERLSETAAASGDPLAFLALDKSREAVREGVRVLDKGHDDVLG 221

Query: 174 GFGSAP---KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
           G+G      KFP+P  + ++L  + +LE  G S   +    MV  TL+ MAKGGI+D++ 
Sbjct: 222 GWGGGRGGMKFPQPSRMNLLL-RAHRLEGEG-SALGARALAMVETTLKAMAKGGIYDYLF 279

Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
            GF RYS D RWHVPHFEKMLYDQ QL   Y++AF +T D  Y+ + R +L Y+ RDM  
Sbjct: 280 DGFARYSTDPRWHVPHFEKMLYDQSQLVTAYVEAFQVTGDTAYADVARGVLRYVLRDMTD 339

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAI--------------L 335
            GG  +SAEDADS   EGAT KKEGAF VWT  ++  +L GE  +              L
Sbjct: 340 EGGGFYSAEDADSLPFEGATEKKEGAFCVWTEPDLRRLLDGEEGVALPGEGGQTVPVSSL 399

Query: 336 FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL--EKYLNILGEC 393
           F   Y ++P GN D +   D H E   +NVL +      +A  LG+    E+    +   
Sbjct: 400 FCRVYGVRPEGNVDPA--VDAHGELTSQNVLFKSETVRVAAEALGLTCSGEEAEAAMTGA 457

Query: 394 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 453
           R  L   R KRP PHLDDKV+ SWNGL+IS+ ARAS+          F+      +   Y
Sbjct: 458 RATLVAARRKRPAPHLDDKVLTSWNGLMISALARASQ---------AFSSSPPSEESLAY 508

Query: 454 MEVAESAASFIRRHLY------DEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYE 506
           +  A  AA F+R +LY       E    L  S+RNG  S   GF DDYAFLI GL+DLYE
Sbjct: 509 LGAATKAAEFVRENLYRSGSGDGETAGTLLRSWRNGRASPVEGFADDYAFLIRGLIDLYE 568

Query: 507 F----GSGTKWLVWAIELQNTQDELFL--DREGGGYFN-----TTGEDPS---------- 545
                 +G +WL WA ELQ   DE F      GGGY++     + GE             
Sbjct: 569 ADPRRDTGWRWLRWARELQAEMDEGFKCPSEAGGGYYSSRALESEGETKGDGETEGGSGS 628

Query: 546 --VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 603
             +  R++ D+DGAEP   SV+  NL+RL+    G +    R+ A   LA     L +  
Sbjct: 629 GVLPYRLRTDYDGAEPGAGSVAADNLLRLSGYFGGEEGKVLREKAAEQLAA-AFALPETP 687

Query: 604 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM 663
            A P +  A+ + ++   K V++ G  +  + + +++AA  S+  N  +I  D   +++ 
Sbjct: 688 QAYPEL-TASLVTALLGPKQVIISGDPAGAETQALMSAAQRSFCPNLVLIVEDSTTSDDR 746

Query: 664 DFWEEHNSNNAS-----MARNNFSA----------DKVVALVCQNFSCSPPVTDPISLEN 708
              EE            + R    A           +  A VC + +CS PV    +LE 
Sbjct: 747 GKEEEAGDGKTGDEPPPLFREILEAYGGGYSAGEGGQAAAYVCFDNTCSAPVHTVEALEK 806

Query: 709 LL 710
           LL
Sbjct: 807 LL 808


>gi|452985594|gb|EME85350.1| hypothetical protein MYCFIDRAFT_60228 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 784

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 262/663 (39%), Positives = 366/663 (55%), Gaps = 37/663 (5%)

Query: 11  KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
           KT R  F+    + CHWCHVM  ESF+D  +++LLN+ F+ +K+DREERPD+D+ YM ++
Sbjct: 94  KTNRLLFVSIGYSACHWCHVMAHESFDDPRISRLLNENFIPVKIDREERPDIDRQYMDFL 153

Query: 68  QALYGGGGWPLSVFLSPDLKPLMGGTYFP---PEDKYGRPGFKTILRKVKDAWDKKRDML 124
           QA  GGGGWP++VF++PDL+P+ GGTY+P    E      GF+ IL K+   W ++   +
Sbjct: 154 QATNGGGGWPMNVFVTPDLEPVFGGTYWPGPKSERLQAAGGFEDILIKIATTWKEQEARV 213

Query: 125 AQSGAFAIEQLSE-----ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAP 179
            QSG     QL E     ++          DEL  + L    +     YD +  GFG AP
Sbjct: 214 RQSGKEITRQLREFAQEGSIGGKNGRTDDEDELELDLLDDAFQHYKMRYDPKHHGFGGAP 273

Query: 180 KFPRPVEIQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           KFP PV I+ +L    Y S   E  G+  E  E + M + TL  MAKGGI D +G GF R
Sbjct: 274 KFPTPVHIRPLLRVAAYPSVVREIVGEK-ECVEARAMAVNTLAAMAKGGIKDQIGHGFAR 332

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGE 294
           YSV   W +PHFEKMLYD  QL  VYLDA+ LTK   +     DI  YL    M  P G 
Sbjct: 333 YSVTRDWSLPHFEKMLYDNAQLLPVYLDAYLLTKSPLFLETAIDIATYLTSPPMQSPLGG 392

Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRM 353
           I SAEDADS+ T     K+EGA+YVWT  E + +LG+  + +  +++ ++P GN D  + 
Sbjct: 393 ICSAEDADSSPTVSDKEKREGAYYVWTFDEFKQVLGDAQVDICAKYWNVRPEGNID--QR 450

Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDK 412
           SD   E  G+N L    D    A +LG+P ++   ++ + R+KL   R K RPRP LDDK
Sbjct: 451 SDAQGELAGQNTLCVQYDIPDLAKELGLPEDEVKQMILDGRQKLLAHREKTRPRPALDDK 510

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
           ++ SWNGL I   AR S +L+S A +              Y+  A  A + I+ HL+D  
Sbjct: 511 IVTSWNGLAIGGLARTSAVLQSSAPAQA----------TRYLSSAVRAVTCIQEHLFDPA 560

Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
           T  L+  +R GP +  GF DDYAF +SGLLDLYE    ++WL +A  LQ TQ++LF D  
Sbjct: 561 TGTLKRVYREGPGETQGFADDYAFFVSGLLDLYEATFDSRWLEFAETLQKTQNKLFWDDL 620

Query: 533 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
             G+F+T  + P +L+R K+  D AEPS N VS  NL RL S++  ++   Y +     +
Sbjct: 621 KYGFFSTPADQPDILIRTKDAMDNAEPSVNGVSAANLFRLGSLLNDAE---YEKMGRRVV 677

Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 652
           A FE  ++        M  +  + S    K +++VG   +   E  L  A  +   N T+
Sbjct: 678 ACFEVEIEQHPGLFSGMLSSV-VASKLGMKGLMIVGEGDAA--EAALKKARETVRPNYTI 734

Query: 653 IHI 655
           + I
Sbjct: 735 LRI 737


>gi|268316671|ref|YP_003290390.1| hypothetical protein Rmar_1111 [Rhodothermus marinus DSM 4252]
 gi|262334205|gb|ACY48002.1| protein of unknown function DUF255 [Rhodothermus marinus DSM 4252]
          Length = 699

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 265/691 (38%), Positives = 364/691 (52%), Gaps = 52/691 (7%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF+DE VA+LLND F++IKVDREERPD+D +YMT  Q + G GGWPL++ 
Sbjct: 50  CHWCHVMAHESFQDEEVARLLNDAFINIKVDREERPDIDHLYMTVCQMVTGHGGWPLTII 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           ++PD KP    TY P   +YGRPG   I+ ++K+AW + RD +  S       L + +S 
Sbjct: 110 MTPDKKPFFAATYIPKRSRYGRPGLLEIIPRIKEAWQQHRDEIIASAEKLTGTLQKVMSF 169

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            A S  +  E  + A R    +L   +D + GGFG APKFP P  +  +L +        
Sbjct: 170 EAPSQIIDAEWLEIAYR----RLDDIFDRKHGGFGHAPKFPTPHTLLFLLRYWH------ 219

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
           +SGEA   Q MV  TL  M  GGI+DHVG GFHRY+ DE W VPHFEKMLYDQ  L   Y
Sbjct: 220 RSGEAHALQ-MVEHTLVQMRLGGIYDHVGFGFHRYATDEAWRVPHFEKMLYDQALLTMAY 278

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
            +A+  T + FY    R+IL Y+ RD+  P G  +S+EDADS   EG    +EG FYVWT
Sbjct: 279 TEAYQATGNPFYERTAREILTYVLRDLRAPEGAFYSSEDADS---EG----EEGKFYVWT 331

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            +E+ ++LG E   L  E + + P GN +     +   E  GKN+L       A A + G
Sbjct: 332 VEELREVLGPELTPLAIELFNVDPEGNYE----EEATGERTGKNILYLSKPPEALARERG 387

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
              E+    L E R++LF  R++R RP  D+K++  WNGL+I++ ARA+++         
Sbjct: 388 WTPEELEAKLEEIRQRLFAYRARRVRPGRDEKILTDWNGLMIAALARAAQVF-------- 439

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   D   Y+E A SAA F+ R ++  +  RL H +R G +  PG LDDYAFL  G
Sbjct: 440 --------DEVAYVEAARSAADFLLRTMHTPEG-RLWHRYREGEAGIPGMLDDYAFLTWG 490

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LLDLYE    T +L  A+ L       F D  G  Y      +P +++R +E  D A PS
Sbjct: 491 LLDLYETTFETSYLETALALTEQMLAHFWDPRGAFYMTPDDGEP-MIVRPRETLDNALPS 549

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
           GN+V+++NLVRL  +   +    Y ++A+  +  F   +K        M  A D+   P 
Sbjct: 550 GNAVALMNLVRLGHMTGRTA---YEEHADAMIRFFSGPVKQQPPIFTGMLIAIDLAFGPI 606

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
            + +VL G         ML   H  Y   K ++   P +        E     A      
Sbjct: 607 YE-LVLAGEPDDPTLREMLRTIHRRYLPRKVLLLRRPGEA------GERLVRVAPFVAAQ 659

Query: 681 FSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
              D +  A VC ++ C  PVTDP +L   L
Sbjct: 660 LPVDGRATAYVCHDYRCEQPVTDPEALARQL 690


>gi|198457071|ref|XP_001360541.2| GA21208 [Drosophila pseudoobscura pseudoobscura]
 gi|198135846|gb|EAL25116.2| GA21208 [Drosophila pseudoobscura pseudoobscura]
          Length = 803

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 262/717 (36%), Positives = 365/717 (50%), Gaps = 63/717 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+   A ++N+ FV+IKVDREERPD+DK+YMT++Q   GGGGWP+S
Sbjct: 117 STCHWCHVMEHESFENLETAAVMNEHFVNIKVDREERPDIDKIYMTFLQMTKGGGGWPMS 176

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           ++L+PDL P+  GTYFPP  +YG P FKT+L  +   W   R  L +SG+  +  L +  
Sbjct: 177 IWLTPDLAPITAGTYFPPTGRYGMPSFKTVLLAIAQQWQTNRQTLIESGSSILNALKQNE 236

Query: 140 SASASSNKLPDELPQNALRLCAEQL---SKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
            ASA +    +  P +A    AE +    + +D   GGFG+ PKFP    +  + +    
Sbjct: 237 DASAVAEAAFE--PGSASAKLAEAIGVHKRRFDRTNGGFGTEPKFPEVPRLNFLFHAYLV 294

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            +D            +VL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQ
Sbjct: 295 SKDVSV-------LDLVLQTLDHIGRGGINDHIFGGFARYATTADWHNVHFEKMLYDQGQ 347

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+ LT+   +      I  Y+ +D+  P G  ++ EDADS      T K EGA
Sbjct: 348 LMAAYSNAYKLTRSATFLTYADKIYKYIMKDLRHPLGGFYAGEDADSLPDHKDTVKVEGA 407

Query: 317 FYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
           FY WT  E+E           D+L + A  ++  HY LKP GN  +   SDPH    GKN
Sbjct: 408 FYAWTWNEIEAAFKDQAKRFDDVLPKRAFEIYAFHYGLKPKGN--VPTHSDPHGHLTGKN 465

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           +LI       + S   +  EK   +L      L  +R +RPRPHLD K+I +WNGL++S 
Sbjct: 466 ILIVRGSDEETCSNFDLQPEKLDKLLETANDILHVLRDQRPRPHLDTKIICAWNGLMLSG 525

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----- 479
            ++ +     +              R+EY++ A+    F+R+ +YD +   L  S     
Sbjct: 526 LSKLANCGTVK--------------REEYIKAAKELVDFLRKEMYDPEQKLLVRSCYGVA 571

Query: 480 -----FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
                     S+  GFLDDYAFLI GLLD Y+       L WA ELQ TQD+LF D + G
Sbjct: 572 VGDPTLEKNESQIDGFLDDYAFLIKGLLDYYKASLDLSALRWAKELQETQDKLFWDEQNG 631

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
            YF +    P+V++R+KE  DGAEP GNSVS  NL  L+        + Y Q A   L  
Sbjct: 632 AYFFSQQNAPNVIVRLKEGDDGAEPCGNSVSARNLTLLSHYY---DEETYLQRAA-KLMN 687

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           F   +     A+P M  A  +L       V +VG  S  D +  +      +     ++H
Sbjct: 688 FFADVAPFGHALPEMLSAL-LLHENGLDLVAVVGPDSE-DTKRFVEICRKFFIPGMIILH 745

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           +DP   ++         N     +      K    +C +  C  PVTDP  LE  L+
Sbjct: 746 VDPLHPDDA-------CNQRVQKKFKMVNGKTTVYICHDRVCRMPVTDPTQLEENLM 795


>gi|391342665|ref|XP_003745636.1| PREDICTED: spermatogenesis-associated protein 20 [Metaseiulus
           occidentalis]
          Length = 728

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 284/731 (38%), Positives = 387/731 (52%), Gaps = 114/731 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E VAK+LND +VSIKVDREERPD+DK+YMTYVQ   G  GWPLS
Sbjct: 54  STCHWCHVMERESFENEEVAKILNDRYVSIKVDREERPDIDKIYMTYVQVTSGHSGWPLS 113

Query: 80  VFLSPDLKPLMGGTYFPPED-KYGRPGFKTILRKVKDAW------------DKKRDMLAQ 126
           V+L+P+LKP+ GGTYFPPED +YG  GFKTIL  + D W            D+   MLA+
Sbjct: 114 VWLTPELKPIFGGTYFPPEDNQYGLAGFKTILLMLDDKWHSSKNEKIKADSDRITAMLAR 173

Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV- 185
           +       L E L A+ S        P   ++ C+  L K       GF   P+FP+ V 
Sbjct: 174 AS-----NLRENLEAAESFQ------PSQCIKDCSLILQK----HLIGFVKEPRFPQCVN 218

Query: 186 -EIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 244
               M L+H +             G  +V   L+ MA GGIHDH+GGGFHRY+VD  W V
Sbjct: 219 GNFYMNLFHFQN---------NRMGVDIVERQLKEMATGGIHDHLGGGFHRYTVDAAWQV 269

Query: 245 PHFEKMLYDQGQLANVYLDAFSLTK-----DVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
           PHFEKMLYDQ Q+  +Y     +         F+  +   I DY+ RD+  P G  +SAE
Sbjct: 270 PHFEKMLYDQAQILALYCSYLRMPGIKPEIASFFGGVATGIADYVMRDLSHPQGGFYSAE 329

Query: 300 DADSAET-EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           DADS E+ + +  KKEGAFYVWT  E++ IL  + A +F E + +   GN       DPH
Sbjct: 330 DADSLESFDSSDHKKEGAFYVWTMAEIQKILSKKEAKVFCEFFGVDEQGNV------DPH 383

Query: 358 NEFKG----KNVLI---------ELNDSSASAS-KLGMPLEKYLNILGECRRKLFDVR-S 402
           ++ +G    +N L           +ND +     + G PL++   IL   +RKL   R  
Sbjct: 384 HDAQGELLNQNTLFYRYPDSYDQNINDMAKVIDLEDGDPLDE---ILESAKRKLLQRRLE 440

Query: 403 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 462
            RPRPHLD+K++ +WNGL+I++ A+AS +LK                R  Y E A  A  
Sbjct: 441 SRPRPHLDNKIVSAWNGLMIAALAKASVVLK----------------RPAYAERALKAVD 484

Query: 463 FIRRHLYDEQTHRLQHS-FRNGPSKA----------PGFLDDYAFLISGLLDLYEFGSGT 511
           FIR +L+D +  RL  S +  G   A          PG L+DYAF+ISGLL LY+     
Sbjct: 485 FIRANLFDRENQRLYRSAYTEGEGDAARVEQLEKPIPGVLEDYAFVISGLLQLYDATLDE 544

Query: 512 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 571
           + L++A  LQ++Q+  F D   GGYF  +G   +++  +K+DHDGAEPS NSVS+ NL+R
Sbjct: 545 QLLLFAKILQDSQNRQFWDETNGGYFLFSGGGSNIIYVLKDDHDGAEPSANSVSIANLIR 604

Query: 572 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 631
           L  I      + YR  A  ++ +F  RL  + +A+P M  +   L  P  K ++      
Sbjct: 605 LYHIF---DHEPYRTKANKTVKLFAERLSKVPIALPEMVSSLMYLVEPPTKIILSAEDDE 661

Query: 632 SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVC 691
             DF+ +       +      I        E+ F +E        A N     +V A VC
Sbjct: 662 ISDFKRVCDEEARGFS-----IVFAARSVSELGFTKEQYP-----AVNG----EVTAYVC 707

Query: 692 QNFSCSPPVTD 702
           ++ SC PP+ D
Sbjct: 708 KDLSCLPPIND 718


>gi|195150279|ref|XP_002016082.1| GL10685 [Drosophila persimilis]
 gi|194109929|gb|EDW31972.1| GL10685 [Drosophila persimilis]
          Length = 803

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 262/717 (36%), Positives = 365/717 (50%), Gaps = 63/717 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+   A ++N+ FV+IKVDREERPD+DK+YMT++Q   GGGGWP+S
Sbjct: 117 STCHWCHVMEHESFENLETAAVMNEHFVNIKVDREERPDIDKIYMTFLQMTKGGGGWPMS 176

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           ++L+PDL P+  GTYFPP  +YG P FKT+L  +   W   R  L +SG+  +  L +  
Sbjct: 177 IWLTPDLAPITAGTYFPPTGRYGMPSFKTVLLAIAQQWQTNRQTLIESGSSILNALKKNE 236

Query: 140 SASASSNKLPDELPQNALRLCAEQL---SKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
            ASA +    +  P +A    AE +    + +D   GGFG+ PKFP    +  + +    
Sbjct: 237 DASAVAEAAFE--PGSASAKLAEAIGVHKRRFDRTNGGFGTEPKFPEVPRLNFLFHAYLV 294

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            +D            +VL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQ
Sbjct: 295 SKDVSV-------LDLVLQTLDHIGRGGINDHIFGGFARYATTADWHNVHFEKMLYDQGQ 347

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+ LT+   +      I  Y+ +D+  P G  ++ EDADS      T K EGA
Sbjct: 348 LMAAYSNAYKLTRSATFLTYADKIYKYIMKDLRHPLGGFYAGEDADSLPDHKDTVKVEGA 407

Query: 317 FYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
           FY WT  E+E           D+L + A  ++  HY LKP GN  +   SDPH    GKN
Sbjct: 408 FYAWTWNEIEAAFKDQAKRFDDVLPKRAFEIYAFHYGLKPKGN--VPTHSDPHGHLTGKN 465

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           +LI       + S   +  EK   +L      L  +R +RPRPHLD K+I +WNGL++S 
Sbjct: 466 ILIVRGSDEETCSNFDLQPEKLDKLLETANDILHVLRDQRPRPHLDTKIICAWNGLMLSG 525

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----- 479
            ++ +     +              R+EY++ A+    F+R+ +YD +   L  S     
Sbjct: 526 LSKLANCGTVK--------------REEYIKAAKELVDFLRKEMYDPEQKLLVRSCYGVA 571

Query: 480 -----FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
                     S+  GFLDDYAFLI GLLD Y+       L WA ELQ TQD+LF D + G
Sbjct: 572 VGDPTLEKNESQIDGFLDDYAFLIKGLLDYYKASLDLSALRWAKELQETQDKLFWDEQNG 631

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
            YF +    P+V++R+KE  DGAEP GNSVS  NL  L+        + Y Q A   L  
Sbjct: 632 AYFFSQQNAPNVIVRLKEGDDGAEPCGNSVSARNLTLLSHYY---DEETYLQRAA-KLMN 687

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           F   +     A+P M  A  +L       V +VG  S  D +  +      +     ++H
Sbjct: 688 FFADVAPFGHALPEMLSAL-LLHENGLDLVAVVGPDSE-DTKRFVEICRKFFIPGMIILH 745

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           +DP   ++         N     +      K    +C +  C  PVTDP  LE  L+
Sbjct: 746 VDPLHPDDA-------CNQRVQKKFKMVNGKTTVYICHDRVCRMPVTDPTQLEENLM 795


>gi|195382934|ref|XP_002050183.1| GJ22002 [Drosophila virilis]
 gi|194144980|gb|EDW61376.1| GJ22002 [Drosophila virilis]
          Length = 747

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 261/718 (36%), Positives = 363/718 (50%), Gaps = 65/718 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED   A ++N  FV+IKVDREERPD+DKVYM ++    G GGWP+S
Sbjct: 61  STCHWCHVMEHESFEDADTAAVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL PL  GTYFPP+ +YG P F  +L  +   W   R  L ++G+  +E +    
Sbjct: 121 VWLTPDLAPLAAGTYFPPKARYGMPSFTMVLESIAKKWQTDRTSLKKAGSTLMEAMRANQ 180

Query: 140 SASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           +A   +    +  P +A    AE L+   + +D    GFG  PKFP    +  + +    
Sbjct: 181 NAGTDAEAAFE--PGSADAKLAEALAVHKQRFDQEHAGFGREPKFPEVPRLNFLFHAYLV 238

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            +D        +   MVL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQ
Sbjct: 239 SKDV-------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQ 291

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+ LT+   +      I +YL +D+  P G  ++ EDADS  T   T K EGA
Sbjct: 292 LMAAYANAYKLTRSKEFLRYADRIYEYLIKDLRHPAGGFYAGEDADSLPTHADTVKVEGA 351

Query: 317 FYVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKN 364
           FY WT  EV+         F +            HY +KP GN  +   SDPH    GKN
Sbjct: 352 FYAWTWDEVKQAFEAQQARFNDVSPARVFEIYCFHYGMKPAGN--VPPASDPHGHLTGKN 409

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           +LI       + S   + + +   +L      L  +R +RPRPHLD K+I  WNGLV+S 
Sbjct: 410 ILIVRGSEEDTCSNFNLEMAQLSQLLETANDILHKIRDQRPRPHLDTKIICGWNGLVLSG 469

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKE-YMEVAESAASFIRRHLYD-EQTHRLQHSFRN 482
            ++ +                 G+D+++ Y+  A+    F+R HLYD EQ   L+  +  
Sbjct: 470 LSKLAN---------------CGTDKRDAYLATAKQLMDFLRTHLYDGEQKLLLRSCYGA 514

Query: 483 G---------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
           G         P++  GFLDDYAFL+ GLLD Y+       L WA ELQ TQD+LF D + 
Sbjct: 515 GVQDNTLEQNPTRIEGFLDDYAFLVKGLLDYYKASLDMSALHWAKELQVTQDKLFWDEKN 574

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           G YF +    P+V++R+KEDHDGAEP GNSV+  NL  L+      +  Y ++ A+  L 
Sbjct: 575 GAYFFSQQNAPNVIVRLKEDHDGAEPCGNSVAARNLTLLSHYF--DEGTYLKRAAK--LL 630

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            +   +     A+P M  A  +L       V +VG   S D +  +      Y     ++
Sbjct: 631 NYFADVAPFGHALPEMLSAL-LLHENGLDLVAVVG-PDSPDTKRFVEIVRKFYVPGMIIV 688

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           H DP   +E         N     +      K    +C +  C  PVTDP  LE  L+
Sbjct: 689 HCDPQHPDEA-------CNQRLQQKFKMVNGKTTVYICHDRVCRMPVTDPAQLEENLM 739


>gi|384917096|ref|ZP_10017228.1| conserved hypothetical protein [Methylacidiphilum fumariolicum
           SolV]
 gi|384525484|emb|CCG93101.1| conserved hypothetical protein [Methylacidiphilum fumariolicum
           SolV]
          Length = 727

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 252/689 (36%), Positives = 366/689 (53%), Gaps = 37/689 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFE+  VA+LLN +++ +KVDREERPD+D+ YM +VQA  G GGWP+S
Sbjct: 47  STCHWCHVMAEESFENPTVAELLNAFYIPVKVDREERPDIDQFYMEFVQAFCGQGGWPMS 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDL+P  GGTYFP E K+GRPGF  +L+K+ + W   R  L Q G   + ++ E++
Sbjct: 107 VWLTPDLEPFFGGTYFPLESKWGRPGFIDLLKKIANLWQSHRSALQQQGQEILNKMRESI 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             S      P+ L Q A R   EQL  ++D  +GGF   PKFPRP  +   L+ +   ++
Sbjct: 167 LCSIEIESQPN-LTQIA-RKTVEQLWGNFDRVYGGFSPPPKFPRP-NLFFFLFRAGSFKE 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                + ++  KM LFTLQ M+ GGIHD + GGFHRYSVD +W +PHFEKMLYDQ  L +
Sbjct: 224 LPDPLQ-NKAMKMALFTLQKMSCGGIHDILEGGFHRYSVDAQWRLPHFEKMLYDQAHLGS 282

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+AF +T D  +      + +YL   +  P G  +SAEDADS  + G   K EGA+Y+
Sbjct: 283 AYLEAFQMTSDFLFKETATALFEYLFSHLYNPAGGFYSAEDADSLNSSG--EKAEGAYYL 340

Query: 320 WTSKEVEDILGEHAILFKEH-----YYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           WT +E+E IL E  ++ KE       +   T   +L+         + KN+L      SA
Sbjct: 341 WTMEELEKILEE--VVGKERSKVLASFFGATNQGNLAEGLGTEPSMRLKNMLFFSKPLSA 398

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A +L MP+E+  ++L + +  L + R KRP+P LDDK+I +WNG  IS+ A+A  +L  
Sbjct: 399 LAEELKMPIEETKDLLLKAKTALKEARLKRPKPFLDDKIITAWNGYAISALAKAYMVLAD 458

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                             Y+  A+  A FI  HL+D  +  L   +RNG    PGF  DY
Sbjct: 459 ----------------SRYLNEAKKTADFILEHLWDADSKILYRIYRNGRGSIPGFASDY 502

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A L + LLDL+E     KWL+ A   Q   +E F D     Y +   E  + +++ +E++
Sbjct: 503 ASLAASLLDLFEADQDEKWLLQAKMFQELLEEKFADPYRHQYLSRAVETAATIIQTREEY 562

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DGAEP+  S+S   L +L SI    K   +++  E         L+    A+P       
Sbjct: 563 DGAEPATLSLSAYALWKLFSITGEEK---WKKRLEELFNSAWPILERFPTALPYFLGVYL 619

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
             SVP  + +++VG K  +    +     +    N+  + +DP       F      +N 
Sbjct: 620 EYSVPPIE-IIIVGEKDDLKTRALFNTLSSVLIPNRLFLVLDPRQGVPRTFKSIDFYSNL 678

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDP 703
                 +     +A +C    CS P T+P
Sbjct: 679 LSVYPGYP----IAYICARGQCSLPQTEP 703


>gi|154303146|ref|XP_001551981.1| hypothetical protein BC1G_09593 [Botryotinia fuckeliana B05.10]
          Length = 753

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 243/610 (39%), Positives = 355/610 (58%), Gaps = 28/610 (4%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CH+ME ESFE+E VA +LN  F+ IK+DREERPD+D++YM +VQA  G GGWPL+VFL+P
Sbjct: 17  CHIMERESFENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLNVFLTP 76

Query: 85  DLKPLMGGTYF----PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            L+P+ GGTY+       D   +  F  IL K+   W ++     Q  A +++QL +  +
Sbjct: 77  SLEPVFGGTYWRGPSKTTDFEDQVDFLGILDKLSTVWSEQESRCRQDSAQSLQQLKDFAN 136

Query: 141 ASASSNKLP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHS 194
               SN+L    D +    L    E  + SYD   GGFGSAPKFP P +I  +L      
Sbjct: 137 EGTLSNRLGEGVDNIDLELLEEVTEHFASSYDKANGGFGSAPKFPTPSKIAFLLRLGQFP 196

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           + + D     +    +++ + TL+ MA+GGIHDH+G GF RYS    W +PHFEKMLYD 
Sbjct: 197 QAVVDIVGLPDCQNAREIAITTLRKMARGGIHDHIGNGFARYSATADWSLPHFEKMLYDN 256

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            QL ++YLD F L++D  +  +  DI +YL   +    G  +S+EDADS    G + K+E
Sbjct: 257 AQLLHLYLDGFLLSRDPEFLGVAYDIANYLTTTLSHSEGGFYSSEDADSYYKNGDSEKRE 316

Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           GA+YVWT +E E+ILG    L    ++   TG+ ++ + +DPH+EF  +NVL   +  SA
Sbjct: 317 GAYYVWTKREFENILGSERGLILSAFF-NVTGHGNVGQENDPHDEFMDQNVLAISSTPSA 375

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
            AS+ G+   + + ++ E + +L   R + R +P +DDKV+VSWNG+ + + AR S ++ 
Sbjct: 376 LASQFGIKESEIIKVIKEGKAQLRRRRETDRVKPAMDDKVVVSWNGIAVGALARLSSVIN 435

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                  F+ PV     +EY++ A  AA+FI+++LYD++   L   +R G     GF DD
Sbjct: 436 G------FD-PVKA---QEYLDAALKAATFIKKNLYDDKAKILYRIWREGRGDTQGFADD 485

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKE 552
           YAFLI GL+DLYE     KWL WA ELQ +Q  LF D+ G G +F+TT   P+V+LR+K+
Sbjct: 486 YAFLIEGLIDLYETTFDEKWLQWADELQQSQINLFYDKNGTGAFFSTTVSAPNVILRLKD 545

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP--LMC 610
             D +EPS N +S  NL RL+S+      + Y + A+ ++  FE  +       P  +  
Sbjct: 546 AMDSSEPSTNGISSSNLYRLSSMF---NDESYAKKAKETVKSFEAEMLQYPWLFPSFMPA 602

Query: 611 CAADMLSVPS 620
             A  L V S
Sbjct: 603 IVASHLGVKS 612


>gi|302814858|ref|XP_002989112.1| hypothetical protein SELMODRAFT_1701 [Selaginella moellendorffii]
 gi|300143213|gb|EFJ09906.1| hypothetical protein SELMODRAFT_1701 [Selaginella moellendorffii]
          Length = 354

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 206/330 (62%), Positives = 260/330 (78%), Gaps = 3/330 (0%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVMEVESFE E VAKLLNDWFVSIKVDREERPD
Sbjct: 25  GEEAFAKAKAEDKPIFLSVGYSTCHWCHVMEVESFESEEVAKLLNDWFVSIKVDREERPD 84

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA  GGGGWP+SVFL+P+LKP++GGTYFPPED YGRPGFKT+LR+VK+ WD
Sbjct: 85  VDKVYMTFVQASQGGGGWPMSVFLTPELKPIVGGTYFPPEDNYGRPGFKTVLRRVKENWD 144

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            ++ +L  +G   I+QL+EA++A A+S ++   + + A++LCA QL K +D++ GGFGSA
Sbjct: 145 SRKAVLRNAGDNVIQQLAEAMAACATSLQVSGGVAEQAVQLCASQLMKGFDAKLGGFGSA 204

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFPRPVE+ +ML + K+L+  GK+  + +  +M  F LQCMA+GG+HDHVGGGFHRYSV
Sbjct: 205 PKFPRPVELNLMLRYYKRLDQAGKASLSKKALEMASFNLQCMARGGMHDHVGGGFHRYSV 264

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+ WHVPHFEKMLYDQ QLAN YLD + +T+D  ++ + RDILDYL RDM  P G IFSA
Sbjct: 265 DDYWHVPHFEKMLYDQAQLANAYLDVYLVTRDTMHACVARDILDYLNRDMTHPEGGIFSA 324

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDI 328
           EDADS E  G+++KKEGAFYVWT+KEV ++
Sbjct: 325 EDADSLEPSGSSKKKEGAFYVWTAKEVRNL 354


>gi|167629725|ref|YP_001680224.1| thioredoxin [Heliobacterium modesticaldum Ice1]
 gi|167592465|gb|ABZ84213.1| conserved hypothetical protein containing a thioredoxin domain
           [Heliobacterium modesticaldum Ice1]
          Length = 687

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 279/714 (39%), Positives = 372/714 (52%), Gaps = 67/714 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFEDE VA  LN+ F+S+KVDREERPD
Sbjct: 34  GEEAFTRAKEQDKPVFLSVGYSTCHWCHVMERESFEDEEVAAYLNEHFISVKVDREERPD 93

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD +YMT  QA+ G GGWPL+V ++PD KP   GTYFP   + G  G   IL  V D W 
Sbjct: 94  VDHIYMTVCQAITGHGGWPLTVIMTPDKKPFFAGTYFPKRSRQGLAGLLDILEAVVDQWK 153

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             R  L  +G    + L   + A+ S+  L D    + LR  A  L K +D  +GGFG A
Sbjct: 154 NDRGKLVAAGDRVTQHLQREVQAN-SAGSLDD---ASILRGYA-WLQKRFDDVYGGFGHA 208

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L   K +        A E   MV  TL+ M  GGI+DH+G GF RYS 
Sbjct: 209 PKFPTPHNLLFLLRCDKLI-------NAKEALPMVEKTLRQMHAGGIYDHLGYGFSRYST 261

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DE+W VPHFEKMLYD  QLA  YL+A+ +T    Y+ + R+I  Y+ RDM  P G  +SA
Sbjct: 262 DEKWLVPHFEKMLYDNAQLAMAYLEAYQVTAKDEYAEVAREIFSYVLRDMHAPEGGFYSA 321

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG     EG FY+WT +EV++ILGE    LF + Y +   GN          
Sbjct: 322 EDADS---EGV----EGKFYLWTPQEVKEILGEETGKLFCQWYDITEKGN---------- 364

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
             F+G+N+   LN   A       P+  +  IL +   KLF  R KR  P  D+K++ +W
Sbjct: 365 --FEGQNI---LNRIDADRRPFTPPM-GWHQILTDAEEKLFVAREKRVHPLKDEKILTAW 418

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+I++ A   +IL                  + Y++ A  AA FI   L D++  RL 
Sbjct: 419 NGLMIAALAMGFRILYD----------------RSYLDAAIGAADFIWEKLRDDKG-RLL 461

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
             +R+G +   G++DDYAF+I  L++LY+  +   WL  A+ LQ  Q+ LF D + GGYF
Sbjct: 462 ARYRDGEAAYKGYIDDYAFMIWALIELYQADTNPLWLKRALTLQEDQNRLFWDPDQGGYF 521

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
               +   +L R KE +DGA PSGNSVS +NL+RLA I    ++ Y RQ AE  L  F  
Sbjct: 522 FYGSDSEELLTRPKEIYDGATPSGNSVSALNLLRLARITG--RNAYARQ-AETLLESFSG 578

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            +            A      P  K VV+V  +    F   L   H+ +   +TV     
Sbjct: 579 NINAQPAGHTFALMALLFARRPG-KEVVVVADRKRETFRQELERLHSPFS-PETVFLYRL 636

Query: 658 ADTEEMDFWEEHNSNNASMARNNF-SADKVVALVCQNFSCSPPVTDPISLENLL 710
           AD E  D      +  A    N     D     VC+NF+C PP T+P  +  +L
Sbjct: 637 ADREYKDL-----AELAPFVENMAPQGDSPTYYVCENFACKPPTTNPREVWEIL 685


>gi|392375956|ref|YP_003207789.1| hypothetical protein DAMO_2917 [Candidatus Methylomirabilis
           oxyfera]
 gi|258593649|emb|CBE69990.1| conserved protein of unknown function [Candidatus Methylomirabilis
           oxyfera]
          Length = 1103

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 250/694 (36%), Positives = 372/694 (53%), Gaps = 64/694 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
           + CHWCHVM  ESFE E +A+L+N +FV IKVDREERPD+D +YM    AL +G GGWP+
Sbjct: 63  SACHWCHVMAHESFESEQIAELMNRYFVCIKVDREERPDLDAIYMAATLALNHGQGGWPM 122

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +VFL+PDL+P   GTYFPP D  GRPGF TIL +V   W ++ D L        ++++E 
Sbjct: 123 TVFLTPDLQPFFAGTYFPPRDGLGRPGFPTILNRVAQVWREQPDALRTQS----DKITEG 178

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L  S S   LP  + +  +       + ++D  FGGFG+APKFP    + ++L H +   
Sbjct: 179 LRES-SRPSLPMPVGRAEIAAAVAHFAATFDPTFGGFGAAPKFPAATALSLLLRHHQHTG 237

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D       +   +MV  TL  MA+GGI+D +GGGF RYS DERW +PHFEKMLYD   LA
Sbjct: 238 D-------AHALQMVRTTLDAMARGGIYDQIGGGFARYSTDERWLIPHFEKMLYDNALLA 290

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL+AF +  D  Y  I  ++LDY+ R+M    G  +SA DADS   EG     EG FY
Sbjct: 291 RTYLEAFQVAGDPSYRQIATELLDYILREMTALEGGFYSATDADS---EGV----EGKFY 343

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT  E+E ILG E A  F  +Y + PTGN            ++G+++      ++  A+
Sbjct: 344 VWTPAEIEAILGQEEARRFCAYYDITPTGN------------WEGRSIPNIRRTAAQVAA 391

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           KLG+ +E+    +   + K+++ R KR  P LDDK++ +WNGL++S+ A   ++L     
Sbjct: 392 KLGVSVEELAASIDRTQPKVYEARRKRVPPGLDDKILTAWNGLMVSAMAEGYRVLGE--- 448

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                        + +++ A  AA F+   L      RL  ++R+G +    +L+DYA L
Sbjct: 449 -------------RRHLDAAVRAADFLLSTLLRPDG-RLLRTYRSGVAHLNAYLEDYACL 494

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             GL+DLYE G  T++L  A+ L       F D E G +  T+ +  +++LR +E  DGA
Sbjct: 495 CEGLIDLYEAGGETRYLREAVRLAERMPGDFADEESGAFHTTSRDHETLILRYREGTDGA 554

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V+   L RL+  +     + +R+ AE +++ +  ++     A        D+L 
Sbjct: 555 TPSGNAVAASALTRLSFHL---NREEWRRAAEQAISAYGQQIARYPHAFAKSLAVVDLL- 610

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +     + L+G+ +    E +       +  N+ + H DP          + N     + 
Sbjct: 611 LEGPVELCLIGNPAEAGCEALRREVGRHFIPNRIIAHHDPT---------KGNPPELPLL 661

Query: 678 RNNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
           R     D   AL +C+NF+C  P+TDP  +  LL
Sbjct: 662 RGKGLVDGRAALYLCRNFTCQAPITDPAQVAELL 695


>gi|374297486|ref|YP_005047677.1| thioredoxin domain-containing protein [Clostridium clariflavum DSM
           19732]
 gi|359826980|gb|AEV69753.1| thioredoxin domain protein [Clostridium clariflavum DSM 19732]
          Length = 680

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 261/696 (37%), Positives = 369/696 (53%), Gaps = 75/696 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  VA++LN +F+SIKVDREERPD+D +YM   QAL G GGWPL+
Sbjct: 53  STCHWCHVMERESFEDYEVAEILNKYFISIKVDREERPDIDHIYMNVCQALTGHGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +F++PD KP   GTYFP  D+ G  G  +IL  V +AW   R+ L +   + I  ++E  
Sbjct: 113 IFMTPDKKPFFAGTYFPKNDRMGMSGLMSILESVHNAWTTDREALLKESEYIINAINEHN 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKK 196
                 ++   EL ++ L     +L  ++D+ FGGFGSAPKFP P  +  +L   Y++K+
Sbjct: 173 ELLEQDHE--GELTEDILDKAYSELKFAFDNIFGGFGSAPKFPTPHNLFFLLRYWYNTKE 230

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                          MV  TL CM KGGI+DH+G GF RYS D +W VPHFEKMLYD   
Sbjct: 231 ----------EYALTMVEKTLACMHKGGIYDHIGFGFSRYSTDRKWLVPHFEKMLYDNAL 280

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L+  YL+A+  TK   Y+ I  +I  Y+ RDM  P G  +SAEDADS   EG     EG 
Sbjct: 281 LSIAYLEAYQATKKRDYADIAEEIFTYVLRDMTSPEGGFYSAEDADS---EGM----EGK 333

Query: 317 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FYVW+  EV+ +LGE H   + ++Y + P GN            F+G N+         +
Sbjct: 334 FYVWSMDEVKKVLGEQHGEKYCKYYDITPHGN------------FEGFNI--------PN 373

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
             K  +P E+    + ECR+KLF+ R KR  PH DDK++ SWNGL+I++ A   ++L  E
Sbjct: 374 LIKGNIPDEE-RPFIEECRKKLFEYREKRVHPHKDDKILTSWNGLMIAALAIGGRVLGKE 432

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                           +Y+  AE AA FI   L      RL   +R+G S  PG++DDYA
Sbjct: 433 ----------------KYITAAERAAKFISSKLVSNNG-RLLARYRDGESAFPGYVDDYA 475

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F I GL++LYE      +L  +++L +   + F D   GG F    +   ++ R KE +D
Sbjct: 476 FFIWGLIELYETTYKPVYLKQSLKLNDDLIKYFWDENNGGLFYYGSDSEQLITRPKETYD 535

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSVS +N +RLA +   S  +     A      F   +++ AM       A  +
Sbjct: 536 GAIPSGNSVSTLNFLRLARLTGRSDLE---DKAYIQFKTFSRNIENFAMGHSFFLTAL-L 591

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
            +    K VV+VG+   ++ ++M+      +      +    A +E  D         A 
Sbjct: 592 FAKSKSKEVVIVGN-DKLESDSMINIIREEFRPFTLSMFYSDAQSELKDI--------AP 642

Query: 676 MARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
              N  S + K  A +C+N++C  P+TD  S  N +
Sbjct: 643 FIENYRSVEGKTTAYICENYTCHDPITDVSSFRNAI 678


>gi|25147430|ref|NP_495615.2| Protein B0495.5 [Caenorhabditis elegans]
 gi|21264548|sp|Q09214.2|YP65_CAEEL RecName: Full=Uncharacterized protein B0495.5
 gi|351065503|emb|CCD61473.1| Protein B0495.5 [Caenorhabditis elegans]
          Length = 729

 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 266/727 (36%), Positives = 376/727 (51%), Gaps = 61/727 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F       +  FL    +TCHWCHVME ESFE+E  AK+LND FV+IKVDREERPD
Sbjct: 43  GQEAFQKAKDNNKPIFLSVGYSTCHWCHVMEKESFENEATAKILNDNFVAIKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YM +V A  G GGWP+SVFL+PDL P+ GGTYFPP+D  G  GF TIL  +   W 
Sbjct: 103 VDKLYMAFVVASSGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEWK 162

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K+ + L Q GA  I +L +  +AS   N+      +   +        S+DSR GGFG A
Sbjct: 163 KEGESLKQRGAQII-KLLQPETASGDVNR-----SEEVFKSIYSHKQSSFDSRLGGFGRA 216

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP+  ++  ++  +       +S +A +   M+  TL+ MA GGIHDH+G GFHRYSV
Sbjct: 217 PKFPKACDLDFLITFAAS---ENESEKAKDSIMMLQKTLESMADGGIHDHIGNGFHRYSV 273

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDILDYLRRDMIGPGGEIF 296
              WH+PHFEKMLYDQ QL   Y D   LT  K     ++  DI  Y+++     GG  +
Sbjct: 274 GSEWHIPHFEKMLYDQSQLLATYSDFHKLTERKHDNVKHVINDIYQYMQKISHKDGG-FY 332

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
           +AEDADS     ++ K EGAF  W  +E++ +LG+  I       +  +++ ++ +GN  
Sbjct: 333 AAEDADSLPNHNSSNKVEGAFCAWEKEEIKQLLGDKKIGSASLFDVVADYFDVEDSGN-- 390

Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
           ++R SDPH E K KNVL +L      A+   + + +    + E +  L++ R++RP PHL
Sbjct: 391 VARSSDPHGELKNKNVLRKLLTDEECATNHEISVAELKKGIDEAKEILWNARTQRPSPHL 450

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           D K++ SW GL I+   +A +                 ++  +Y++ AE  A FI + L 
Sbjct: 451 DSKMVTSWQGLAITGLVKAYQ----------------ATEETKYLDRAEKCAEFIGKFLD 494

Query: 470 DEQTHR------LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
           D    R             G  +   F DDYAFLI  LLDLY      ++L  A+ELQ  
Sbjct: 495 DNGELRRSVYLGANGEVEQGNQEIRAFSDDYAFLIQALLDLYTTVGKDEYLKKAVELQKI 554

Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
            D  F +  G GYF +   D  V +R+ ED DGAEP+  S++  NL+RL  I+   + + 
Sbjct: 555 CDVKFWN--GNGYFISEKTDEDVSVRMIEDQDGAEPTATSIASNNLLRLYDIL---EKEE 609

Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
           YR+ A         RL  + +A+P M  A     + S    VLVG   S       +  +
Sbjct: 610 YREKANQCFRGASERLNTVPIALPKMAVALHRWQIGSTT-FVLVGDPKSELLSETRSRLN 668

Query: 644 ASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
             +  N +V+HI           EE  S +    +      K    +C+ F C  PV   
Sbjct: 669 QKFLNNLSVVHIQS---------EEDLSASGPSHKAMAEGPKPAVYMCKGFVCDRPVKAI 719

Query: 704 ISLENLL 710
             LE L 
Sbjct: 720 QELEELF 726


>gi|406859397|gb|EKD12463.1| putative DUF255 domain-containing protein [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 820

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 243/594 (40%), Positives = 343/594 (57%), Gaps = 34/594 (5%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+E +A LLN  F+ +K+DRE RPD+D++YM +VQA  G GGWPL+VF
Sbjct: 105 CHWCHVMERESFENEEIATLLNTHFIPVKIDREVRPDIDRIYMNFVQATTGSGGWPLNVF 164

Query: 82  LSPDLKPLMGGTYFPP-------EDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           L+PDL+P+ GGTY+P        ED+     F  IL+K+   W ++ +   +     +EQ
Sbjct: 165 LTPDLEPVFGGTYWPGHSSGTAFEDQVD---FLGILQKLSSVWREQEERCRRDSKQILEQ 221

Query: 135 LSEALSASASSNKLPDELPQNA-----LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQM 189
           L    +     ++L D    +      L    +  S +YDS  GGFG APKFP P ++  
Sbjct: 222 LKSFAADGTFGSRLGDGEGGDGLDIELLEEAVQHFSSTYDSTNGGFGLAPKFPTPSKLSF 281

Query: 190 ML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           +L    +   + D   + E    Q M + TL+ MA+GG+HD VG GF RYSV   W +PH
Sbjct: 282 LLRLGQYPSIVVDVVGAPECRNAQSMAVTTLRKMARGGVHDQVGNGFARYSVTADWSLPH 341

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           FEKMLYD  QL +VYLDAF L++D     +  DI  YL  D+    G  +S++DADS   
Sbjct: 342 FEKMLYDNAQLLHVYLDAFLLSRDAELLGVVYDISTYLTTDLAHAEGGFYSSQDADSLYR 401

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
            G + K+EGAFYVWT +E E++LGE+  +     +   TG+ ++   +D H+EF  +NVL
Sbjct: 402 RGDSEKREGAFYVWTKREFENVLGENEPILSA--FFNVTGHGNVGPENDGHDEFLDQNVL 459

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSF 425
             ++  SA AS+ GM  E+ + I+   +  L   R K R RP LDDK++ SWNGL + + 
Sbjct: 460 AIVSTPSALASQFGMKEEEVVRIIKAGKAALRAHREKERVRPGLDDKIVTSWNGLAVGAL 519

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
           AR   + K         F    S+  E +  A  AA+FI+++LYD  +  L   +R G  
Sbjct: 520 ARTGGVFK--------GFDPAKSE--ELLGFAIKAATFIKQNLYDSSSKILYRIWREGRG 569

Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
              GF DDYAFL+ GL+DLYE     +WL WA ELQ TQ  LF D   GG+F+T+   P 
Sbjct: 570 DTEGFADDYAFLVEGLIDLYEATFDEEWLKWADELQQTQISLFFDVNIGGFFSTSSTAPH 629

Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           ++LR+K+  D +EPS N  S  NL RL+S++       Y + A+ +LA FE+ +
Sbjct: 630 LILRLKDGMDTSEPSTNGTSASNLYRLSSLL---NDLTYAEKAKQTLACFESEM 680


>gi|392411456|ref|YP_006448063.1| thioredoxin domain protein [Desulfomonile tiedjei DSM 6799]
 gi|390624592|gb|AFM25799.1| thioredoxin domain protein [Desulfomonile tiedjei DSM 6799]
          Length = 692

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 260/699 (37%), Positives = 371/699 (53%), Gaps = 65/699 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE  A  +N  FVSIKVDREERPD+D +YMT  Q + G GGWPL+
Sbjct: 49  STCHWCHVMEHESFEDEETAAAMNQSFVSIKVDREERPDLDNIYMTVCQMMTGSGGWPLN 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLS 136
           V L+PDLKP   GTYFP   ++G+ G   +  ++++ W  +R+ + +S      A+ Q+ 
Sbjct: 109 VVLTPDLKPFFAGTYFPKTSRFGKIGMVELSDRIREIWQTRRNDVLESADKVTNALRQMP 168

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           +A S S     L        L     +L K +D   GGF  APKFP P  +  +L + K+
Sbjct: 169 DASSGSVQGKAL--------LEQAFTELDKRFDPARGGFSPAPKFPTPHNLLFLLRYWKR 220

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             D        +  KMV  TL  +  GGI+DHVG GFHRYS D  W VPHFEKMLYDQ  
Sbjct: 221 TGD-------EKALKMVEKTLHALRLGGIYDHVGFGFHRYSTDTEWLVPHFEKMLYDQAL 273

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+  T + FY+   ++I+ Y+ RDM  P G  +SAEDADS   EG     EG 
Sbjct: 274 LTMAYTEAYQATGNEFYADTAKEIVTYVLRDMTSPQGGFYSAEDADS---EGV----EGK 326

Query: 317 FYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FYVWT +E+ED+LG+  A L+   Y  +P GN       +   +  G N+   L      
Sbjct: 327 FYVWTLREIEDVLGQKDAALYSAVYNFEPEGNFH----DEASGQATGANIPHLLARFEEI 382

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+   M   +  + L   R KLF  R +R  PH DDK++  WNGL+I++ A+A+++ ++ 
Sbjct: 383 AATRDMTPHELHDRLRAIREKLFSTRERRVHPHKDDKILTDWNGLMIAALAKAAQVFEN- 441

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                          +EY E A  AA F+   L DEQ  RL H FR+G +     +DD+A
Sbjct: 442 ---------------REYGEAARKAADFLLSTLRDEQG-RLLHRFRDGEAGLTAHVDDFA 485

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F + GLL+LYE     ++L  A+EL +   + F D E GG++ T  +  ++L+R KE +D
Sbjct: 486 FFVWGLLELYETVFEPQYLAAALELNDDLLKRFWDDERGGFYFTAMDAENLLVRTKEVYD 545

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSVS++NL+RL  + +  + +     AE     F   L+    A   M    + 
Sbjct: 546 GAVPSGNSVSLLNLLRLGRMTSNPELE---SKAEQIAKAFAGTLRQFPSAYTQMLVGLEF 602

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
                R + V++ +  + D   ML     ++  NK V+         M F +  + N   
Sbjct: 603 --AEGRTYEVVIANSGTEDVLPMLRIIRRNFLPNKVVL---------MRFRDGKHENLLR 651

Query: 676 MAR--NNFS--ADKVVALVCQNFSCSPPVTDPISLENLL 710
           + R  ++F+   +K  A VC N+ C  P T+P  +  LL
Sbjct: 652 VVRFDHDFALLENKTTAYVCVNYHCELPTTEPSRVLELL 690


>gi|308480509|ref|XP_003102461.1| hypothetical protein CRE_04116 [Caenorhabditis remanei]
 gi|308261193|gb|EFP05146.1| hypothetical protein CRE_04116 [Caenorhabditis remanei]
          Length = 746

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 276/743 (37%), Positives = 389/743 (52%), Gaps = 78/743 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    ++ +  FL    +TCHWCHVME ESFE+E  AK+LN+ F++IKVDREERPD
Sbjct: 45  GEEAFKKAKESNKPIFLSVGYSTCHWCHVMEKESFENENTAKILNENFIAIKVDREERPD 104

Query: 59  VDKVYMTYV---------------QALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGR 103
           VDK+YM +V               QA  G GGWP+SVFL+P+L P+ GGTYFPP+D  G 
Sbjct: 105 VDKLYMAFVVVYLNFCFTSSFSFFQAASGHGGWPMSVFLTPELHPITGGTYFPPDDNRGM 164

Query: 104 PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQ 163
            GF TIL  ++  W K+ D L + G   I +L +  +AS   NK      +   +     
Sbjct: 165 LGFSTILNMIQTEWKKEGDNLRKRGEQII-KLLQPETASGDVNK-----SEEVFQSIYSH 218

Query: 164 LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKG 223
              S+DSR GGFG APKFP+  ++  ++  S       KS E++    M+  TL+ MA G
Sbjct: 219 KQSSFDSRLGGFGGAPKFPKASDLDFLIAFSSADSCGDKSKEST---TMLQKTLESMADG 275

Query: 224 GIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDIL 281
           GIHDH+G GFHRYSVD  WHVPHFEKMLYDQ QL   Y D   LT  K+    ++  DI 
Sbjct: 276 GIHDHIGTGFHRYSVDGEWHVPHFEKMLYDQSQLLATYSDFHRLTGKKNENIKFVINDIF 335

Query: 282 DYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHY- 340
           +Y+++     GG  +SAEDADS     +  K EGAF VW  +E++ +L E  I   + + 
Sbjct: 336 EYMQKISHKEGG-FYSAEDADSLPKNDSKEKMEGAFCVWEKEEIKKLLCERKIGSADLFD 394

Query: 341 ----YLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRK 396
               Y     N ++ R SDPH E K KNVL +L      A+   + +E+    + E ++ 
Sbjct: 395 VVADYFDVEDNGNVPRSSDPHGELKNKNVLRKLLTDDECAANHSLTVEELKRGIEEAKQI 454

Query: 397 LFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEV 456
           L++ R+KRP PHLD K++ +W  L IS   +A +                 ++  +Y+E 
Sbjct: 455 LWEARTKRPSPHLDSKMVTAWQALAISGLVKAYQ----------------ATEDVKYIER 498

Query: 457 AESAASFIRRHLYDEQTHRLQHS--------FRNGPSKAPGFLDDYAFLISGLLDLYEFG 508
           AE  A+F+R++L  E+   L+ S           G      F DDYAF+I GLLDLY   
Sbjct: 499 AEKCAAFVRKYL--EENGELKRSVYLGVEGNIEQGHQNMKAFSDDYAFMIQGLLDLYTVL 556

Query: 509 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 568
              ++L  AIELQ T D+ F    G GYF +   D  V +R+ ED DGAEP+  S++  N
Sbjct: 557 GKNEYLEKAIELQKTCDQKFWS--GNGYFISEQADEGVSVRMVEDQDGAEPTATSIASNN 614

Query: 569 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 628
           L+RL  I+   ++D YR+ A         RL    +A+P M  A       S    VLVG
Sbjct: 615 LLRLHDIL---ENDEYREKANKCFRGASERLNKFPIALPKMAVALHRWQNGSTT-FVLVG 670

Query: 629 HKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVA 688
                +FE+ L    A   LN+ +I     +   +    E+    +  + N  S     A
Sbjct: 671 -----EFESEL-LVEARRRLNEKLIE----NLSVVHIRSENEIGASGPSHNAMSQGPQPA 720

Query: 689 L-VCQNFSCSPPVTDPISLENLL 710
           + +C+ F+C  P+    +L+ L 
Sbjct: 721 VYMCKGFACGLPIRSIDALDKLF 743


>gi|312385290|gb|EFR29828.1| hypothetical protein AND_00943 [Anopheles darlingi]
          Length = 874

 Score =  432 bits (1112), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 268/709 (37%), Positives = 372/709 (52%), Gaps = 69/709 (9%)

Query: 30  VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 89
           V+ F++E VA+++N+ F+++K+DREERPD+DK+YM ++  + G GGWP+SV+L+PDL P+
Sbjct: 186 VDCFQNEEVARIMNENFINVKLDREERPDIDKLYMMFILLINGSGGWPMSVWLTPDLAPI 245

Query: 90  MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 149
            GGTYFPP D++G PGF T+L K+   W   R+ L ++G   IE +   +     S    
Sbjct: 246 TGGTYFPPNDRWGMPGFTTVLTKLAAKWASDREDLVRTGRSVIEAIKRNVDQKQGSGNGD 305

Query: 150 DELPQNALRLCAEQL-----------SKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +E    A+    E L            ++YD  +GG   APKFP   ++ +M +H    E
Sbjct: 306 EEDGAAAVAAAGETLEAKFRQAINLYQRNYDPVWGGSLGAPKFPEAAKLNLM-FHLHVQE 364

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
              K         +VL TL  MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQL 
Sbjct: 365 PKHKI------LGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLL 418

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
           ++Y + + LT    Y  +   I  YL +D+  PGG  +S EDADS  T  +  K EGAFY
Sbjct: 419 SLYANGYRLTHKPLYLTVADAIYRYLCKDLRHPGGGFYSGEDADSLPTADSDVKVEGAFY 478

Query: 319 VWTSKEVEDILGEHAI-----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
            WT  EV++ L   A            ++ EHY +K TGN + +  SDPH    GKN+ I
Sbjct: 479 AWTYAEVKETLERGAAKFGDTTVSPIEVYAEHYDIKETGNVEPA--SDPHGHLLGKNIPI 536

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
                  +A K G   E    +L      L +VR +RPRPHLD K+I +WNGLV+S  + 
Sbjct: 537 VYGSVRETAEKCGTRPEIVERVLRVANELLHEVREQRPRPHLDTKIICAWNGLVLSGLSH 596

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNG--- 483
            + +  +              DR +Y+  AE    F+R +LYD Q  +L  S + NG   
Sbjct: 597 LACVHDA-------------PDRSKYLATAEELVKFVRANLYDVQARKLLRSCYGNGEET 643

Query: 484 -PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
             S+ P  GF+DDYAFLI GL+D Y        L WA ELQ+ QDELF D + G YF + 
Sbjct: 644 LASERPIYGFIDDYAFLIRGLIDYYVASLDEHRLHWAKELQDIQDELFWDPKHGAYFYSE 703

Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
              P V +R+KEDHDGAEP GNSV+  NL+ L       + +  ++ A    A F +   
Sbjct: 704 ANSPHVAVRLKEDHDGAEPCGNSVAGHNLLLLHDYF---EEERLKERARKLFAYF-SESS 759

Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI---DP 657
                +P M  AA  L     KH ++V    S +   ++ A    Y     ++ +    P
Sbjct: 760 PFGYVLPEMMSAA--LVEEHGKHTLIVVGPESPEATALVDAVRRFYIPGMIIVQLKIDKP 817

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
           A  E        + +N  M +N        A +C N  C  PVT+P  L
Sbjct: 818 AHIER----RRKSLDNFKMVKN-----MPTAYICHNRVCHLPVTEPERL 857


>gi|225181777|ref|ZP_03735215.1| protein of unknown function DUF255 [Dethiobacter alkaliphilus AHT
           1]
 gi|225167551|gb|EEG76364.1| protein of unknown function DUF255 [Dethiobacter alkaliphilus AHT
           1]
          Length = 697

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 262/694 (37%), Positives = 370/694 (53%), Gaps = 55/694 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA+ LN  FV IKVDREERPD+D +YM   QA+ G GGWPL+
Sbjct: 55  STCHWCHVMERESFEDEEVARELNRVFVCIKVDREERPDIDNIYMAVCQAMTGSGGWPLT 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + +SPD +P   GTYFP +  +GR G   + ++++  W   RD +  +        S   
Sbjct: 115 IVMSPDKRPFFAGTYFPKKTSFGRMGVIDLAQRIEMLWKTSRDKINSTAD------SVMT 168

Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           S  A S   P +LP + AL+    +L   +D   GGFG APKFP P  +  +L + K   
Sbjct: 169 SLQAMSKVTPGDLPGEEALQGGFAKLEGRFDPDHGGFGYAPKFPSPHNLTFLLRYWK--- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
              +SG A +  +MV  TL  MA+GG++DH+G GFHRYS D  W +PHFEKMLYDQ  LA
Sbjct: 226 ---RSGNA-KALEMVEKTLLAMARGGVYDHIGFGFHRYSTDREWLLPHFEKMLYDQALLA 281

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL+A+  T    Y+   R+I  Y+ RDM  P G  +SAEDADS   EG    +EG FY
Sbjct: 282 VTYLEAYQATGKEVYAQTAREIFGYVLRDMTSPQGGFYSAEDADS---EG----EEGKFY 334

Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VW + E+  ILGE  A +F   Y ++  GN       +   +  G N+          A 
Sbjct: 335 VWETNEIVHILGEADAAIFNAAYNIREDGNF----TDETTGKKTGANIPHLRKTYQELAQ 390

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +L +   +  + L   R+KLF VR KR  PH DDK++  WNGL+I++ A   +IL  E  
Sbjct: 391 ELSLEPNELKDRLEAMRQKLFAVRKKRIHPHKDDKILTDWNGLMIAALAMGGRILNDE-- 448

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y + A+ AA FI  HL  ++  RL   FR   +  P  LDDYAF 
Sbjct: 449 --------------NYNKSAKKAAGFILSHL--KKDGRLLKRFREDEASLPAHLDDYAFF 492

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           + GL++LYE    T +L  A+ L  T  + F D + G ++ T  +   VL+R +E +DGA
Sbjct: 493 VWGLIELYETTFDTDFLKEALSLNKTMIKHFWDHDNGSFYFTADDAEDVLVRHRELYDGA 552

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ +N +RL  I   ++ +   Q AE     F   ++ +      M  A + ++
Sbjct: 553 VPSGNSVAAMNNLRLGRITGNTELE---QIAEKIARAFTDEIEKVPQGYTQMLSAINFMA 609

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HIDPADTEEMDFWEEHNSNNASM 676
            PS + +V+ G   + D ++ML    +++  NK V+ H      +E++    +     S+
Sbjct: 610 GPSLE-IVIAGEAQAQDTKDMLQKLCSTFVPNKVVVLHPGGKKAKEIEELAPYTRRQQSI 668

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   K  A VC+NFSC  PVTD   + +LL
Sbjct: 669 ------EGKATAYVCRNFSCQAPVTDADKMLSLL 696


>gi|452845430|gb|EME47363.1| hypothetical protein DOTSEDRAFT_41782 [Dothistroma septosporum
           NZE10]
          Length = 734

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 253/607 (41%), Positives = 341/607 (56%), Gaps = 39/607 (6%)

Query: 11  KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
           +T R  F+    + CHWCHVM  ESF+D  +A+LLN++FV IK+DREERPD+D+ YM ++
Sbjct: 48  QTNRLLFVSIGYSACHWCHVMAHESFDDPRIAQLLNEYFVPIKIDREERPDIDRQYMDFL 107

Query: 68  QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKYGRPG---FKTILRKVKDAWDKKRDM 123
           QA  GGGGWPL+VF++PDL+P+ GGTY+P P     + G   F+ IL KV   W ++ + 
Sbjct: 108 QATSGGGGWPLNVFVTPDLEPIFGGTYWPGPRSDRAQMGGTTFEDILLKVSSMWKEQEER 167

Query: 124 LAQSGAFAIEQLSE-----ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           L  SG    +QL E      +          D L  + L    +   K YD +FGGFG+A
Sbjct: 168 LRASGKEITKQLREFAQEGHIGGRDGKGDDNDGLELDLLDDAFQHYKKRYDRKFGGFGAA 227

Query: 179 PKFPRPVEIQMMLY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           PKFP PV I+ +L+   + K++ +     E+ E + M + +L+ MAKGGI D +G GF R
Sbjct: 228 PKFPTPVHIRPLLHVACYPKEVREIVGEDESIEVRAMAVKSLENMAKGGIKDQIGHGFAR 287

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGE 294
           YSV   W +PHFEKMLYD  QL  VYL+A+ LTK   +     DI  YL    M    G 
Sbjct: 288 YSVTRDWSLPHFEKMLYDNAQLLPVYLEAYMLTKSQLFLETTHDIAKYLTSAPMASDLGG 347

Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRM 353
           I SAEDADS  T     K+EGA+YVWT  E + IL +  +     Y+ +K  GN D  + 
Sbjct: 348 ICSAEDADSLPTAIDHHKREGAYYVWTMDEFKKILTDEEVKVCSAYWGVKSEGNID--KQ 405

Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDK 412
            D   E  G+N L   ++ +  A +L M  E     L   R KL   R K RPRP LDDK
Sbjct: 406 HDIQGELVGQNTLCVQHEPAELARELSMSEEDVKRTLANGREKLLAYRQKDRPRPALDDK 465

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
           ++ SWNGL +   ARA          A    P       EY+  AE A + IR  L+DE+
Sbjct: 466 IVTSWNGLAVGGLARAG---------AALGVP-------EYIAAAEKAVNCIRAQLFDEK 509

Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
              L+  +R GP +  GF DDYAFLISGLLDLYE    ++WL +A  LQ TQ +LF D E
Sbjct: 510 AKTLKRVYREGPGETQGFADDYAFLISGLLDLYESTFDSQWLEFADILQQTQTKLFWDEE 569

Query: 533 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
             G+F+T    P +L R K+  D AEPS N VS +NL RL S++  +    Y +  + ++
Sbjct: 570 KFGFFSTPANQPDILFRTKDAMDNAEPSVNGVSAMNLFRLGSLLYDAT---YEKMGKRTV 626

Query: 593 AVFETRL 599
           A F+  +
Sbjct: 627 AAFDVEI 633


>gi|396464920|ref|XP_003837068.1| similar to DUF255 domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312213626|emb|CBX93628.1| similar to DUF255 domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 748

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 252/625 (40%), Positives = 342/625 (54%), Gaps = 28/625 (4%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE++ VAK+LN+ ++ IKVDREERPDVD++YM YVQAL G GGWPL+ F
Sbjct: 68  CHWCHVMERESFENQEVAKILNESYIPIKVDREERPDVDRIYMNYVQALTGRGGWPLNAF 127

Query: 82  LSPDLKPLMGGTYFP---PEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--- 135
           L+PDL+P+ GGTYF         G   F  +L K++D W  +R     S     ++L   
Sbjct: 128 LTPDLQPIFGGTYFAGPGSTTALGAQPFVAVLEKIRDLWTDQRQRCLDSAREETKKLIDF 187

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
           ++  + S       D L    L        + YD    GFG APKFP P  +Q +L  S+
Sbjct: 188 AQDGNISRQGGAEHDGLELELLDDALSHFKRKYDPVNAGFGDAPKFPTPSNLQFLLKLSR 247

Query: 196 ---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
               + +   + + +  + MVL TL  M KGGIHD +G GF RYSV + W +PHFEKMLY
Sbjct: 248 YPTAVTELLGADDCTLAKTMVLKTLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKMLY 307

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATR 311
           D  QL  V+LDA+ LTK   +     DI  YL    M    G  FS+EDADS        
Sbjct: 308 DHAQLLPVFLDAYLLTKSAAHLSAVHDIATYLTSPPMHAEHGGFFSSEDADSLYRPNDKE 367

Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           K+EGAFYVWT  E +DILGE  A +   +Y ++  GN       D H+E   +NVL    
Sbjct: 368 KREGAFYVWTLTEFQDILGERDAEILARYYNVRDEGNVHPEH--DAHDELINQNVLAIST 425

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARAS 429
             S  A + G+  E+   IL   R+KL   R K RPRP LDDK++VSWNGL I + AR +
Sbjct: 426 TPSDLAKQFGLSEEEVHRILTSGRQKLLFHRDKERPRPALDDKIVVSWNGLAIGALARTA 485

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
             L S   +A             Y+  AE AA+F++ +LYD  +  L   +R GP + PG
Sbjct: 486 AALSSSEPTASHT----------YLAAAEKAATFLKENLYDPSSQTLTRVYREGPGETPG 535

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           F DDYA+LISGL+DLY+      +L WA +LQ +Q  LF D +  G+F+T      +++R
Sbjct: 536 FADDYAYLISGLIDLYQTTFNDSYLQWADDLQQSQIRLFWDTKHLGFFSTPAGQSDLIMR 595

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
           +K+  D AEP  N VS  NL RL +++   + + Y + A  + + FE  L       P +
Sbjct: 596 LKDGMDNAEPGTNGVSAQNLDRLGALL---EDEAYSKRARETASAFEAELMQHPFLFPSL 652

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVD 634
             A  +  +  R H V+ G    V+
Sbjct: 653 MDAVVVGRLGIR-HSVITGEGRRVE 676


>gi|78043330|ref|YP_360543.1| hypothetical protein CHY_1723 [Carboxydothermus hydrogenoformans
           Z-2901]
 gi|77995445|gb|ABB14344.1| conserved hypothetical protein [Carboxydothermus hydrogenoformans
           Z-2901]
          Length = 686

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 263/696 (37%), Positives = 372/696 (53%), Gaps = 63/696 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA LLN  FV+IKVDREERPDVD++YMT  QA+ G GGWPL+
Sbjct: 50  STCHWCHVMERESFEDEEVADLLNKHFVAIKVDREERPDVDQIYMTACQAMTGQGGWPLT 109

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++P+ KP   GTYFP   K+GRPG   IL ++   W+  R+ L        ++L E +
Sbjct: 110 IIMTPEKKPFFAGTYFPKRSKWGRPGLMEILTEIVKLWETDREQLLTIS----KRLYEFM 165

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                S K   +L +  L     +    +DS +GGFG APKFP P  +  +L + K+   
Sbjct: 166 QTIPQSKK--GDLTEEVLEKAYREFLGRFDSEYGGFGPAPKFPTPHNLIFLLRYWKR--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+       +K    TL+ MA+GGI+DHVG GFHRYS D  W VPHFEKMLYD   LA 
Sbjct: 221 TGEEKALFMAEK----TLEAMARGGIYDHVGYGFHRYSTDREWLVPHFEKMLYDNALLAY 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+A+  TK   Y+ I R++  Y++R M  P    +SAEDADS   EG     EG +YV
Sbjct: 277 TYLEAYQATKKEKYARIAREVFTYVKRKMTSPERGFYSAEDADS---EGV----EGKYYV 329

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           WT  EV+ +LG E   LF   Y + P GN            F+GKN+  LI   D    A
Sbjct: 330 WTPDEVKKVLGPEEGELFCRVYDITPEGN------------FEGKNIPNLIH-TDIELVA 376

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            ++G    +    L   R+KL+  R KR  P  DDK++ SWNGL+I++ A+ +++L+ + 
Sbjct: 377 QEIGKSAAELTESLDRMRQKLYHEREKRVLPLKDDKILTSWNGLMIAALAKGARVLQDQ- 435

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                          E + +A +AA FI   L      RL   +R G +    +LDDYAF
Sbjct: 436 ---------------ELLNMAHNAAEFIFSKL-RRADGRLIARYREGEAAVLAYLDDYAF 479

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI GL++LYE      +L  A+EL     +LF D + GG F T  +   ++ R KE +DG
Sbjct: 480 LIWGLIELYEASFEVWYLKLAVELTREMLKLFWDEKHGGLFFTGADGEELITRPKEIYDG 539

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV+ +NL+RL+ ++     + + Q A   L+ F  ++ ++  A      A  + 
Sbjct: 540 ALPSGNSVAALNLLRLSRMLG---EEDFLQKAVEILSTFAGKVSEIPSAHSFYLLAY-LF 595

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT-EEMDFWEEHNSNNAS 675
            +   K +V+ G     D   M+   + +Y  N  V+     D  +E+     H ++  S
Sbjct: 596 YLGPVKEIVVAGEPDGEDTRAMIEKINLAYLPNSVVLFHPIGDAGQEIREIIPHIADKKS 655

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           +       ++    VC+NFSC  PV +   LE  L+
Sbjct: 656 LI-----GERATVYVCENFSCKAPVVEVEMLEEYLM 686


>gi|168186605|ref|ZP_02621240.1| thymidylate kinase [Clostridium botulinum C str. Eklund]
 gi|169295490|gb|EDS77623.1| thymidylate kinase [Clostridium botulinum C str. Eklund]
          Length = 693

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 251/690 (36%), Positives = 370/690 (53%), Gaps = 73/690 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVME ESFEDE VAKLLND ++SIKVDREERPDVD +YMT+ QA+ G GGWPL++
Sbjct: 62  SCHWCHVMEKESFEDEEVAKLLNDKYISIKVDREERPDVDNIYMTFCQAVTGSGGWPLTI 121

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP +  YGRPG   IL ++ D W+  RD +  +    +  + E  S
Sbjct: 122 IMAPDQKPFFAGTYFPKKRMYGRPGLIQILNQIADEWENNRDGVINASNELLNTMKEHTS 181

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              S      E+ +N L+   +++   YD  +GGFG APKFP P ++ ++L + K+  + 
Sbjct: 182 QDKSG-----EINENVLQDAIKEMKHYYDESYGGFGIAPKFPTPHKLMLLLTYYKEYNN- 235

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                      MV  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA V
Sbjct: 236 ------KIALHMVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYV 289

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y   + +T  +FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FY+W
Sbjct: 290 YTQTYQITGKLFYKEVAEKIFTYVLRDMTSPEGGFYSAEDADS---EGV----EGKFYLW 342

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  EVE+IL E A  F   Y +   GN            F+G N+           + +G
Sbjct: 343 TLHEVENILKEDAKEFCNTYDITKGGN------------FEGSNI----------PNLIG 380

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             LE   + L   R+KLF VR KR  P  DDK++ +WN L+IS+ A A ++ +++     
Sbjct: 381 KDLEN-TDKLENLRKKLFQVREKRVHPFKDDKILTAWNALMISALAYAGRVFENQ----- 434

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                      EY++ A+ A +FI  +L   +  RL   FR+G +    +++DY+FL+  
Sbjct: 435 -----------EYIDRAKEAYNFIENNLI-RKDGRLLARFRHGEAAYIAYIEDYSFLVWA 482

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LL+LYE    +K+L  A++  +   +LF D E  G+F++  +   ++L +K+ +D A PS
Sbjct: 483 LLELYEATFESKFLKEALQFTDEMIKLFWDEESYGFFHSGKDGEKLILNLKDSYDTAIPS 542

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
           GNSV+ +NL++L+ I   +      + A   L  F   +K+   +  +          PS
Sbjct: 543 GNSVAAMNLIKLSKITGDNS---LGEKAYKMLEGFGGNIKESLQSHSIFLMVYMNYIRPS 599

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
            K +++   K    F++M+   +  + +  T + ++  + E +           S+    
Sbjct: 600 -KQIIIASKKEDKVFKDMIREVNKRF-MPFTTVLLNDGNLENII---------PSIKDER 648

Query: 681 FSADKVVALVCQNFSCSPPVTDPISLENLL 710
              +K  A VC+NFSC+ PV +      LL
Sbjct: 649 KVDNKTTAYVCENFSCNRPVDNIKEFIKLL 678


>gi|134119086|ref|XP_771778.1| hypothetical protein CNBN2230 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50254378|gb|EAL17131.1| hypothetical protein CNBN2230 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 748

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 265/706 (37%), Positives = 389/706 (55%), Gaps = 41/706 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHV+  ESFEDE  AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S
Sbjct: 66  SACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMS 125

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +F++P L+P   GTYFP      RP F  +L K+ + W++ R+   + G   IE L +  
Sbjct: 126 IFMTPKLEPFFAGTYFP------RPNFHQLLNKIHEVWEEDREKCEKMGKGVIEVLKDMS 179

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA------PKFPR-PVEIQMMLY 192
               +S  L   L  +       QLS   D+R+GGF ++      PKFP   + ++ +  
Sbjct: 180 HTGRTSESLSQLLASSPASKLFSQLSTMNDTRYGGFTNSGSSTRGPKFPSCSITLEPLAR 239

Query: 193 HSKKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
            +       ++ E  E  ++M +  L+ M  GGI D VGGG  RYSVDE+W VPHFEKML
Sbjct: 240 LASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKML 299

Query: 252 YDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           YDQ QL +  LD   L     +D    Y +  DIL Y  RD+  P G  +SAEDADSAE 
Sbjct: 300 YDQAQLVSSCLDFARLYPVDHQDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEY 359

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
           +GA +K EGAFY+W   E++++LG+ A LF   + ++P GN D+  + D H E +GKN+L
Sbjct: 360 KGA-KKSEGAFYIWKKTEIDEVLGDDAPLFNSFFGVQPDGNVDI--IHDSHGEMRGKNIL 416

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
            +       A + G   ++   I+ +   KL   R +R RP LDDK++ +WNGL++++ +
Sbjct: 417 HQHKTYEEVALEFGKREDQAKGIIIQACEKLRLKREERERPGLDDKILTAWNGLMLTALS 476

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           +AS +L           P     R + +  A    +F++ H++D  T  L  S+R G  K
Sbjct: 477 KASTLL-----------PPSYGIRSQCLPAALGIVNFVKSHMWDSSTRTLTRSYREG--K 523

Query: 487 AP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
            P    DDYAFL+ GLL+LYE       +++A ELQ  QDELF D   GGYF  + ED  
Sbjct: 524 GPQAQTDDYAFLVQGLLNLYEATGDESHVLFAEELQKRQDELFWDDHDGGYF-ASAEDAH 582

Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
           VL+R+K+  DGAEPS  +VS  NL R + +++ S+ + Y   AE +       +     A
Sbjct: 583 VLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLS-SEFENYEARAEATFLSMGPLITQAPRA 641

Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
           V         L    R+ V+++G  S    +  L AA  +Y  N+ ++ I P +  +   
Sbjct: 642 VGYAVSGLIDLEKGYRE-VIVIGSASDEVVKKFLEAARKTYFSNQVIVQIQPENLPK-GL 699

Query: 666 WEEHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
            E++    A +       +K  +L VC+  +C  PV D    +NLL
Sbjct: 700 AEKNEVVKALVNDVESGKEKAASLRVCEGGTCGLPVKDLEGAKNLL 745


>gi|158521543|ref|YP_001529413.1| hypothetical protein Dole_1532 [Desulfococcus oleovorans Hxd3]
 gi|158510369|gb|ABW67336.1| protein of unknown function DUF255 [Desulfococcus oleovorans Hxd3]
          Length = 641

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 248/610 (40%), Positives = 338/610 (55%), Gaps = 50/610 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESF D   A L+N  FV +KVDREERPD+D++YMT V A+ G GGWPL+V
Sbjct: 55  TCHWCHVMAHESFSDPDTAALMNAHFVCVKVDREERPDIDRLYMTAVSAITGSGGWPLNV 114

Query: 81  FLSPD-LKPLMGGTYFPPEDKYGRPG------FKTILRKVKDAW---DKKRDMLAQSGAF 130
           FL P  L P  GGTYFPP     RPG      +  +L+++ DAW   DK+  +LA + + 
Sbjct: 115 FLEPHALAPFFGGTYFPP-----RPGRTLMITWPDLLQQIADAWENPDKRSSLLASADSI 169

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
               L  AL+ +       D       +   +  +  YDS+ GGFG APKFP P  I  +
Sbjct: 170 TTF-LESALTGTRHRPAEGDAELTGIYKKALDAFTGMYDSQSGGFGPAPKFPMPAIINFL 228

Query: 191 LY--HSKKLEDTG-KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +    D G  + +  +   M + TL  MA+GGI+D +GGGFHRYS DERWH+PHF
Sbjct: 229 LACAATDPAADLGLDTRQREKALGMAIHTLSAMARGGIYDQLGGGFHRYSTDERWHLPHF 288

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYIC--RDILDYLRRDMIGPGGEIFSAEDADSAE 305
           EKMLYD  QL     DA++LT++   S +C  R   DY+ ++M  P G  +SA+DADS E
Sbjct: 289 EKMLYDNAQLLACLADAYALTEN--NSLLCRARQTADYILKEMTHPEGGFYSAQDADSPE 346

Query: 306 TEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPH-NEFKGK 363
           + GA +K EGAFYVW ++E+E +L    A LF  H+ ++P GN     +S PH  EF  K
Sbjct: 347 SAGAGKKVEGAFYVWEAREIESLLDAPAAKLFMSHFGVRPEGN-----VSGPHAAEFSHK 401

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
           NVL        +A   G+  ++  ++L   R+ L   R  RP P  DDK+I +WNGL+IS
Sbjct: 402 NVLYGTGPVDQAAKTFGLSEQETQDLLQTARQTLLAHRKHRPAPDTDDKIITAWNGLMIS 461

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
             A+  ++ +                  +Y + A  AA FI+ HLYD QTH L   +R G
Sbjct: 462 GLAKLYRVTR----------------EAQYRDGAVKAARFIQTHLYDPQTHHLARIWRAG 505

Query: 484 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT-TGE 542
            ++  G  +DYAFL  GL+DLYE  +   WL WAI+L       F D + GG F T  G 
Sbjct: 506 EARIDGMAEDYAFLAQGLIDLYEANADAFWLAWAIDLSEEVLASFYDSKNGGIFMTGKGH 565

Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 602
           DP +LLR+KED D   PS  SV+  N  RL++     ++D +   A  ++      L++ 
Sbjct: 566 DPHLLLRMKEDTDNVMPSAGSVAARNFYRLSAYTG--RND-FSDAARATINALIPLLEEH 622

Query: 603 AMAVPLMCCA 612
             A PL+  A
Sbjct: 623 PSAAPLLLTA 632


>gi|253681418|ref|ZP_04862215.1| dTMP kinase [Clostridium botulinum D str. 1873]
 gi|253561130|gb|EES90582.1| dTMP kinase [Clostridium botulinum D str. 1873]
          Length = 671

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 246/683 (36%), Positives = 368/683 (53%), Gaps = 75/683 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVME ESFEDE VAK+LND ++SIKVDREERPDVD  YMT+ QA+ G GGWPL++
Sbjct: 54  SCHWCHVMEKESFEDEEVAKILNDKYISIKVDREERPDVDNTYMTFCQAVTGSGGWPLTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++P+ KP   GTYFP +  YGRPG   IL+++ D W   +D +  +    +  + E +S
Sbjct: 114 IMTPEQKPFFAGTYFPKKSMYGRPGIIQILKQISDEWKNNKDNIINTSNKLLNTMKERVS 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                    +E+ ++ L     +++  YD+++GGFG APKFP P ++ ++L + K   D 
Sbjct: 174 QDKW-----EEINESILHDAIMEMNYYYDNKYGGFGIAPKFPTPHKLMLLLIYYKVYNDK 228

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
              G       MV  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA V
Sbjct: 229 SALG-------MVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYV 281

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A+ +T   FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW
Sbjct: 282 YTEAYQVTGKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYVW 334

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           + +E++ ILGE A  F   Y +   GN            F+GKN+           + +G
Sbjct: 335 SLEEIQSILGEDAKEFCNTYDITEKGN------------FEGKNI----------PNLIG 372

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             LE  ++ L + R KLF VR KR  P  DDK++ +WN L+I S + A ++         
Sbjct: 373 KDLEN-IDKLKDLRNKLFKVREKRVHPFKDDKILTAWNALMIVSLSYAGRVF-------- 423

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   + KEY+  ++ A  FI  +L   +  RL   FR+G +    +L+DY+FL+  
Sbjct: 424 --------ENKEYINRSKKAYDFIENNLI-RKDGRLLARFRHGEAAYIAYLEDYSFLVWA 474

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           L++LYE    + +L  A+   +   +LF D E  G+F++  +   ++L +K+ +D A PS
Sbjct: 475 LMELYEATFESNYLKQALNFTDKMIKLFWDEESYGFFHSGRDGEKLILNLKDSYDTAIPS 534

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
           GNSV+ +NL++L+ I   +      + A      F   +K+   +  +   +      PS
Sbjct: 535 GNSVAAMNLIKLSKITGDNSLG---EKAYKMFQCFGGNIKESLQSHSIFLISYMNYIKPS 591

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
           R+ +V+   K    F+ M+   +  + +  T+I ++  + E          N     ++ 
Sbjct: 592 RQ-IVIASEKEDRLFKEMIKEVNKRF-MPFTIILLNDGNLE----------NIVPFIKDE 639

Query: 681 FSAD-KVVALVCQNFSCSPPVTD 702
              D K  A +C+NFSC+ PV +
Sbjct: 640 KKIDNKTTAYICENFSCNKPVYN 662


>gi|58262588|ref|XP_568704.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|57230878|gb|AAW47187.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 773

 Score =  430 bits (1105), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 265/706 (37%), Positives = 387/706 (54%), Gaps = 41/706 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHV+  ESFEDE  AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S
Sbjct: 91  SACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMS 150

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +F++P L+P   GTYFP      RP F  +L K+ + W++ R+   + G   IE L +  
Sbjct: 151 IFMTPKLEPFFAGTYFP------RPNFHQLLNKIHEVWEEDREKCEKMGKGVIEVLKDMS 204

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF---GSAPKFPRPVEIQMMLYHSKK 196
               +S  L   L  +       QLS   D+R+GGF   GS+ + P+     + L    +
Sbjct: 205 HTGRTSESLSQLLASSPASKLFSQLSTMNDTRYGGFTNSGSSTRGPKFPSCSITLEPLAR 264

Query: 197 LEDTGKSGEAS-----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           L      G  +     + ++M +  L+ M  GGI D VGGG  RYSVDE+W VPHFEKML
Sbjct: 265 LASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKML 324

Query: 252 YDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           YDQ QL +  LD   L     +D    Y +  DIL Y  RD+  P G  +SAEDADSAE 
Sbjct: 325 YDQAQLVSSCLDFARLYPVDHQDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEY 384

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
           +GA +K EGAFY+W   E++++LG+ A LF   + ++P GN D+  + D H E +GKN+L
Sbjct: 385 KGA-KKSEGAFYIWKKTEIDEVLGDDAPLFNSFFGVQPDGNVDI--IHDSHGEMRGKNIL 441

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
            +       A + G   ++   I+ +   KL   R +R RP LDDK++ +WNGL++++ +
Sbjct: 442 HQHKTYEEVALEFGKREDQAKGIIIQACEKLRLKREERERPGLDDKILTAWNGLMLTALS 501

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           +AS +L           P     R + +  A    +F++ H++D  T  L  S+R G  K
Sbjct: 502 KASTLL-----------PPSYGIRSQCLPAALGIVNFVKSHMWDSSTRTLTRSYREG--K 548

Query: 487 AP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
            P    DDYAFL+ GLL+LYE       +++A ELQ  QDELF D   GGYF  + ED  
Sbjct: 549 GPQAQTDDYAFLVQGLLNLYEATGDESHVLFAEELQKRQDELFWDDHDGGYF-ASAEDAH 607

Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
           VL+R+K+  DGAEPS  +VS  NL R + +++ S+ + Y   AE +       +     A
Sbjct: 608 VLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLS-SEFENYEARAEATFLSMGPLITQAPRA 666

Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
           V         L    R+ V+++G  S    +  L AA  +Y  N+ ++ I P +  +   
Sbjct: 667 VGYAVSGLIDLEKGYRE-VIVIGSASDEVVKKFLEAARKTYFSNQVIVQIQPENLPK-GL 724

Query: 666 WEEHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
            E++    A +       +K  +L VC+  +C  PV D    +NLL
Sbjct: 725 AEKNEVVKALVNDVESGKEKGASLRVCEGGTCGLPVKDLEGAKNLL 770


>gi|85858097|ref|YP_460299.1| thymidylate kinase [Syntrophus aciditrophicus SB]
 gi|85721188|gb|ABC76131.1| thymidylate kinase [Syntrophus aciditrophicus SB]
          Length = 691

 Score =  430 bits (1105), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 265/695 (38%), Positives = 365/695 (52%), Gaps = 63/695 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFE+E VA+LLN+ F+SIKVDREERPD+DK+YM   Q L GGGGWPL+
Sbjct: 58  STCHWCHVMAHESFENEEVARLLNESFISIKVDREERPDIDKLYMAVCQLLTGGGGWPLT 117

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD +P   GTY P E + G  G   ++  + + W K+R+ + ++      +++ AL
Sbjct: 118 ILMTPDRRPFYAGTYIPRESRSGMVGMLVLIPGLSEVWRKERNRILETAG----EITTAL 173

Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
                    P ELP    L    + L + +D+R+GGF SAPKFP       M  HS  L 
Sbjct: 174 QGMDQGG--PGELPLDRVLHEAYDDLRRRFDARYGGFDSAPKFP-------MAQHSFFLL 224

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
             G+  E S+   +V  TLQ M +GGI+D VG GFHRYS D +W +PHFEKMLYDQ  LA
Sbjct: 225 RYGRRQENSQALAIVEKTLQSMRRGGIYDAVGFGFHRYSTDAQWRLPHFEKMLYDQALLA 284

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +AF       Y    R+IL Y+ RDM  P G  +SAEDAD+A        +EGAFY
Sbjct: 285 MAYTEAFQAAGQSLYKKTAREILTYVLRDMTAPEGGFYSAEDADTA-------GEEGAFY 337

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS- 377
           +WT++E+  +L           Y  P G               GK  ++  + S    S 
Sbjct: 338 LWTAEELRQVLPTEEAELMIRVYAIPEG---------------GKPSVLHCSSSYPELSV 382

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
            L +P E+ L  L   R+KLF  R+KR RP  DDK++  WNGL+I++ ARA+ +      
Sbjct: 383 DLDLPEERLLERLESARQKLFLQRAKRIRPLRDDKILTDWNGLMIAAMARAAAV------ 436

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
              F  PV       Y++ A  A  FI  +L D +  RL H +R G +  P  LDDYAFL
Sbjct: 437 ---FEEPV-------YLQAAREAVRFILENLRDPRG-RLLHRWREGEAAMPAVLDDYAFL 485

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I GL++ YE       L  A+ L       F D   GGYF T  +  S+L+R KE +DGA
Sbjct: 486 IWGLIEAYEATFDANLLQTALSLDEELTAHFWDNASGGYFYTPDDGESLLVRQKESYDGA 545

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+++NL+RL+ +   +  +   + A  +   F   ++ ++ A      A D L+
Sbjct: 546 IPSGNSVAMLNLLRLSRLTGQAGLE---ERAVATAQAFADSIRSLSAAHTSFMVALDYLA 602

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
            PS   VV+ G     D  +ML     ++  + TV+ I   D  E             M 
Sbjct: 603 GPS-AEVVIAGSPEGTDTRDMLRELRRAFLPHVTVLLI--PDEGEKGMLAGVAEFTGGMT 659

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
           R +    +  A VC+NFSC  P TDP  +  LL E
Sbjct: 660 RID---GRATAYVCRNFSCRKPTTDPAEMTTLLRE 691


>gi|398407269|ref|XP_003855100.1| hypothetical protein MYCGRDRAFT_99250 [Zymoseptoria tritici IPO323]
 gi|339474984|gb|EGP90076.1| hypothetical protein MYCGRDRAFT_99250 [Zymoseptoria tritici IPO323]
          Length = 750

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 256/611 (41%), Positives = 331/611 (54%), Gaps = 38/611 (6%)

Query: 11  KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
           KT R  F+    + CHWCHVME ESF D  +A+LLN+ F+ IK+DREERPD+D+ YM ++
Sbjct: 48  KTNRLLFVSIGYSACHWCHVMEHESFSDSRIAQLLNEHFIPIKIDREERPDIDRQYMDFL 107

Query: 68  QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKYGR-----PGFKTILRKVKDAWDKKR 121
           QA  GGGGWPL+VF++PDL+P+ GGTY+P P  +  R       F+ +LRKV  AW ++ 
Sbjct: 108 QATSGGGGWPLNVFVTPDLEPIFGGTYWPGPNSERARSRAAGTTFEDVLRKVSTAWKEQE 167

Query: 122 DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL------CAEQLSKSYDSRFGGF 175
                +      QL E         +   +  +N            E     YD++ GGF
Sbjct: 168 QKCRANAKDITRQLREYAQEGMLGGRDGKQTDENDGLELDLLDDAYEHYKGRYDAKCGGF 227

Query: 176 GSAPKFPRPVEIQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
           G APKFP PV I+ +L    Y     E  G+  +  E ++M + TL+ MAKGGI D +G 
Sbjct: 228 GGAPKFPTPVHIKPLLRVANYPHVVREIVGEE-DCQEARRMAVHTLESMAKGGIKDQIGH 286

Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIG 290
           GF RYSV   W +PHFEKMLYD  QL  VYLDA+ LTK         DI  YL    M+ 
Sbjct: 287 GFARYSVTRDWSLPHFEKMLYDNAQLLPVYLDAWILTKSPLLLESVNDIATYLTSPPMVS 346

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCD 349
             G IFSAEDADS  T     K+EGAFYVW   E + IL E  +     Y+ ++  GN D
Sbjct: 347 ELGGIFSAEDADSLPTPQDKHKREGAFYVWMMDEFKSILSEEEVTVCAKYWGVQAQGNVD 406

Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPH 408
             R  D   E  G+N L    +    A +L    E+    +   R KL   R K RPRP 
Sbjct: 407 --RRFDLQGELVGQNTLCVQYEIPELAQELSKSEEQITQTIQSGRSKLLAHREKNRPRPA 464

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
           LDDK++ SWNGL I   AR S  L+           +       Y+  A  A + I+ HL
Sbjct: 465 LDDKIVTSWNGLAIGGLARTSSALRY----------ISPEPAAAYLAAALKATNCIKTHL 514

Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
           +D  T+ L+  +R GP + PGF DDYAFLISGLLDLYE    + WL WA  LQ TQ  LF
Sbjct: 515 FDPSTNALKRVYREGPGETPGFADDYAFLISGLLDLYEATWDSNWLQWADTLQQTQTRLF 574

Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            D E  G+F+T    P +L+RVK+  D AEPS N V+  NL RL S++  S+   Y + A
Sbjct: 575 WDEEKYGFFSTAASQPDILIRVKDAMDNAEPSVNGVASYNLFRLGSLLNDSE---YEKMA 631

Query: 589 EHSLAVFETRL 599
              +A FE  L
Sbjct: 632 RRIVACFEVEL 642


>gi|390559056|ref|ZP_10243426.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
 gi|390174366|emb|CCF82718.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
          Length = 685

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 255/691 (36%), Positives = 370/691 (53%), Gaps = 66/691 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFE+  +A ++N+ F++IKVDREERPD+D +YM  VQ L G GGWP++
Sbjct: 48  SSCHWCHVMAHESFENPDIAAIMNENFINIKVDREERPDLDAIYMAAVQMLSGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD++P   GTYFPPED+   PGF  IL  V DA+  +R+ + ++     ++L+   
Sbjct: 108 VFLTPDMRPFYAGTYFPPEDRPPMPGFARILDLVADAYRDRREDIDETAEQISDELNHHF 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A+  S  +   +  +  R    +L+  +D   GGFG+ PKFP  + ++ ML   +    
Sbjct: 168 QAAIESLAISPSILDDGAR----KLALQFDQSNGGFGNEPKFPPSMSLEFML---RTYVR 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG    +    +MV FTL  MA+GGI+D +GGGFHRYSVD  W VPHFEKMLYD   LA 
Sbjct: 221 TG----SKRALEMVTFTLDRMARGGIYDQIGGGFHRYSVDAIWLVPHFEKMLYDNALLAR 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y   +  T    Y  I      Y+ R+M+ P G  +SA+DADS   EG    +EG FY+
Sbjct: 277 IYTLGYQATGKDLYRRIAEQTFTYVLREMMSPEGGFYSAQDADS---EG----EEGKFYI 329

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +E E +LG   A + K ++ + P GN            F+GKN+L    +    A +
Sbjct: 330 WTPQEFETVLGRRDASIAKRYFGIMPDGN------------FEGKNILTAPREPERIAEQ 377

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G+ LE+  + + E R KL+  RS R  P  DDKV+ +WN L++ SFA  + +       
Sbjct: 378 FGISLEELESTIAEIRGKLYQARSTRVWPGRDDKVLTAWNALMLRSFAEGATVF------ 431

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                      R + +EVA   A FIR +LY  Q   L  ++  G +K  G+L+DYA+LI
Sbjct: 432 ----------GRADLLEVAVRNARFIRDNLY--QDGHLLRTYTAGQAKLNGYLEDYAYLI 479

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
             LL LYE      W+ WA EL +T  + F D E GG+F+T      ++ R KE  D A 
Sbjct: 480 DALLSLYEATFNASWIAWAQELTDTMVKEFWDHENGGFFSTGTSHEELVARPKELFDSAT 539

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR---LKDMAMAVPLMCCAADM 615
           PSGNSV+   L+RL+ ++   ++DY     E  +AV +      K+       +  A D 
Sbjct: 540 PSGNSVAADVLLRLSHLLG--RNDY----RERGMAVLKKHGMLAKEYPHGTARLLLAYD- 592

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
            ++ S + + LVG  S+   +++LA     Y  +K V    P   +E          +  
Sbjct: 593 FALSSPREIALVGDPSAEATQSLLAVVQQPYLPHKVVALRHPGRADEAAIIPLLEGRD-E 651

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISL 706
           + R      K  A VC+NF+C  PVT+P  L
Sbjct: 652 IER------KPAAYVCRNFTCERPVTEPAEL 676


>gi|321265830|ref|XP_003197631.1| DUF255 domain protein [Cryptococcus gattii WM276]
 gi|317464111|gb|ADV25844.1| DUF255 domain protein, putative [Cryptococcus gattii WM276]
          Length = 772

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 266/698 (38%), Positives = 382/698 (54%), Gaps = 41/698 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHV+  ESFEDE  AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S
Sbjct: 90  SACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMS 149

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++P L+P   GTYFP      RP F  +L+K+ + W++ R+   + G   IE L +  
Sbjct: 150 VFMTPKLEPFFAGTYFP------RPNFHQLLKKIHNVWEEDREKCEKMGKGVIEALKDMN 203

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA------PKFPR-PVEIQMMLY 192
               +S  L   L  +       QLS   D R+GGF +A      PKFP   + ++ +  
Sbjct: 204 DTGRTSESLSQLLSTSPASKLFAQLSTMNDPRYGGFTNAGSSTRGPKFPSCSITLEPLAR 263

Query: 193 HSKKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
            +       ++ E  E  ++M +  L+ M  GGI D VGGG  RYSVDE+W VPHFEKML
Sbjct: 264 LASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKML 323

Query: 252 YDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           YDQ QL +  LD   L      D    Y +  DIL Y  RD+  P G  +SAEDADSAE 
Sbjct: 324 YDQTQLVSSCLDFARLYPADHPDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEY 383

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
           +GA +K EGAFY+W   E++++LG+ A LF   + ++P GN D+  + D H E + KN+L
Sbjct: 384 KGA-KKSEGAFYIWKKSEIDEVLGDDAPLFNSFFGVEPDGNVDI--IHDSHGEMRDKNIL 440

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
            +       A + G   ++  +I+ +   KL   R +R RP LDDK++ +WNGL++++ +
Sbjct: 441 HQHKTYEEVALEFGKKEDEAKDIIVQACEKLRLKREERERPGLDDKILTAWNGLMLTALS 500

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           +AS +L    + +    P            A    +F++ H++D  T  L  S+R G  K
Sbjct: 501 KASTLLPPSYDISPQCLP-----------AALGIVNFVKSHMWDSSTRTLTRSYREG--K 547

Query: 487 AP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
            P    DDYAFLI GLL+LYE       +++A ELQ  QDELF D   GGYF T+ EDP 
Sbjct: 548 GPQAQTDDYAFLIQGLLNLYEATGDESHVLFAEELQKRQDELFWDDHDGGYF-TSAEDPH 606

Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
           VL+R+K+  DGAEPS  +VS  NL R + +++    D Y   AE +       +     A
Sbjct: 607 VLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLSSEFED-YEARAEATYLSMGPLIAQAPRA 665

Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
           V         L    R+ V++VG       +  L AA  +Y  N+ +IHI P +  +   
Sbjct: 666 VGYAVSGLIDLEKGYRE-VIIVGSTKDDVVKKFLKAARETYFSNQVIIHIQPENLPK-GL 723

Query: 666 WEEHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTD 702
            E++    A +       +K  +L VC+  +C  P  D
Sbjct: 724 AEKNEVVKALVNDIESGKEKGASLRVCEGGTCGLPAKD 761


>gi|134300686|ref|YP_001114182.1| hypothetical protein Dred_2853 [Desulfotomaculum reducens MI-1]
 gi|134053386|gb|ABO51357.1| protein of unknown function DUF255 [Desulfotomaculum reducens MI-1]
          Length = 690

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 258/716 (36%), Positives = 386/716 (53%), Gaps = 67/716 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFE E VAK+LN+ FVSIKVDREERPD
Sbjct: 33  GNEAFDMAKRVDKPIFLSIGYSTCHWCHVMERESFESEEVAKILNEHFVSIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM   Q+L G GGWPL++ ++PD KP   GTYFP + +YGRPG   IL  V   W 
Sbjct: 93  IDQIYMNVCQSLTGSGGWPLTIMMTPDQKPFFAGTYFPKQAQYGRPGITEILENVASLWK 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +R  L + G    ++L   + + AS+   P +LP + L       +++YD+ +GGFG+A
Sbjct: 153 NERQHLLEVG----DKLVSHMQSEASTA--PGQLPADILDKAYHIFAQNYDATYGGFGTA 206

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L +        K+GEA +   MV  TL  M +GGI+DH+G GF RYS 
Sbjct: 207 PKFPTPHNLMFLLRYWH------KTGEA-KALSMVEETLDAMHRGGIYDHIGFGFSRYST 259

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D++W VPHFEKMLYD   LA  + + + +T +  +  + ++I  Y+ RDM  P G  +SA
Sbjct: 260 DKKWLVPHFEKMLYDNALLALAFTETYQITGNPRFGRVAKEIFTYILRDMTSPEGGFYSA 319

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG     EG FYVW  +EV  +LG+    L+ ++Y +  TGN          
Sbjct: 320 EDADS---EGV----EGKFYVWRPEEVISLLGQVDGELYCQYYDITSTGN---------- 362

Query: 358 NEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
             F+G+++  LI   D    +  L + L   +  L  CR+ LF+ R+KR  P+ DDK++ 
Sbjct: 363 --FEGESIPNLIG-QDPFKFSQDLEITLGDLVEGLEACRKTLFEERAKRIHPYKDDKILT 419

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
           +WNGL+I++ AR +++ +S                K Y+E A +A  FI   L      R
Sbjct: 420 AWNGLMIAALARGAQVFQS----------------KRYLEAASNAMGFIFDRL-QRNDGR 462

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L   +R   +  P +LDDYAF+I GLL+LY+     + L  A+ L +   +LF D + GG
Sbjct: 463 LLARYREYEAAYPAYLDDYAFVIWGLLELYQATFEPRHLQNAVYLTDDMIDLFYDDKQGG 522

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           ++    +   ++ R K+ +DGA PSGNSV+ +NL +LA +   S+   Y + A   L VF
Sbjct: 523 FYFYGKDSEQLISRPKDIYDGAIPSGNSVATVNLFKLARLTGNSR---YEELANQQLQVF 579

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
              L             A +   P  + +V+ G K     + M+     ++  N +V+  
Sbjct: 580 ADELARYPAGYSFFMMGAYLQQEPPME-IVIAGTKEDPSLQQMINTLRQNFLPNASVLV- 637

Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
              D E  + W    S    + ++    + K  A VCQN +C  P+T+P +L+ ++
Sbjct: 638 -RYDDEFANKW----SPLLPLLKDKTPVNGKAAAYVCQNLACQAPLTEPEALQKMI 688


>gi|374994065|ref|YP_004969564.1| thioredoxin domain-containing protein [Desulfosporosinus orientis
           DSM 765]
 gi|357212431|gb|AET67049.1| thioredoxin domain-containing protein [Desulfosporosinus orientis
           DSM 765]
          Length = 702

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 263/730 (36%), Positives = 386/730 (52%), Gaps = 84/730 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFEDE VA LLN WF+SIKVDREERPD
Sbjct: 33  GEEAFTLSKRENKPIFLSIGYSTCHWCHVMERESFEDEAVAALLNRWFISIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD +YM + QAL G GGWPL++ ++P+ KP   GTYFP  + +G  G   +L +V   W 
Sbjct: 93  VDHMYMAFCQALTGSGGWPLTIIMTPEKKPFFAGTYFPKTEHHGYHGLMELLEQVGTLWR 152

Query: 119 KKRDML----------AQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSY 168
              + L           QSG    ++ S  +  S +++       ++ +      L +++
Sbjct: 153 TSENKLRESADQIVAAVQSGLALPKKASTPIDNSQNTSDSNKAWEKDVIDKAYAALEQNF 212

Query: 169 DSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
           D R+GGFG APKFP P  +  +L ++       ++   S    MV  TL  MA+GG++DH
Sbjct: 213 DPRYGGFGRAPKFPSPHTLTFLLRYA-------ENHPQSNALAMVRKTLNGMARGGMYDH 265

Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDM 288
           +G GF RYS DE+W +PHFEKMLYD   LA  YL++F +T    ++ + +DI  Y+ RDM
Sbjct: 266 IGFGFARYSTDEKWLIPHFEKMLYDNALLALAYLESFQVTHSPEHAKVAQDIFTYVLRDM 325

Query: 289 IGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGN 347
             P G  +SAEDAD+ +       +EG F+VWT +EVE +L  E A  +   Y +   GN
Sbjct: 326 TSPEGGFYSAEDADAED-------QEGKFHVWTPQEVEAVLDMETAQKYCSVYDISAKGN 378

Query: 348 CDLSRMSDPHNEFKGKNV--LIELN----DSSASASKLGMPLEKYLNILGECRRKLFDVR 401
                       F+GK++  L++ N    D  +S +++ +     +  L   R+ LF  R
Sbjct: 379 ------------FEGKSIPNLLQGNIHKLDQESSLAEVDV-----IKSLESARQALFSAR 421

Query: 402 SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAA 461
            KR  PH DDK++ SWNGL+I++ A+ +++L +                K Y+E  E AA
Sbjct: 422 EKRIHPHKDDKILTSWNGLMIAALAKGAQVLGN----------------KTYLEAGEKAA 465

Query: 462 SFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIEL 520
            FI  HL      RL   +R G S   G+LDDY+F I GLL+LY F SG   +L  A+ L
Sbjct: 466 DFILTHL-RRVDGRLLARYREGDSAILGYLDDYSFFIWGLLELY-FASGKPLFLQTALLL 523

Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
           Q  QD LF D + GGYF T  +   +L R KE +DGA PSGNS++ +NL+R   +  GSK
Sbjct: 524 QEEQDRLFFDTQRGGYFLTGSDGEKLLFRPKESYDGAIPSGNSITTLNLLRFGQLT-GSK 582

Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 640
             Y+++ AE  L  F T L+           A      P+++ ++L G   S +   M  
Sbjct: 583 --YWKEKAEQQLLDFRTVLEAHPSGYTAFLQALQFALHPTQE-LILAGSLDSEELSMMRN 639

Query: 641 AAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
              + +    +V++ + +  E + + E +            ++D+  A +CQNF+C  PV
Sbjct: 640 LFFSEFRPYASVLYQEGSLGELVPWIENY----------PLASDQTAAYLCQNFTCQQPV 689

Query: 701 TDPISLENLL 710
            +      LL
Sbjct: 690 YEVDQFARLL 699


>gi|322420309|ref|YP_004199532.1| hypothetical protein GM18_2810 [Geobacter sp. M18]
 gi|320126696|gb|ADW14256.1| protein of unknown function DUF255 [Geobacter sp. M18]
          Length = 742

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 254/685 (37%), Positives = 368/685 (53%), Gaps = 54/685 (7%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+ LN  F++IKVDREERPDVD VYMT V A+   GGWPL+V
Sbjct: 99  TCHWCHVMEEESFEDESVAEFLNGNFIAIKVDREERPDVDTVYMTAVHAMGLQGGWPLNV 158

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++PD KP  GGTY PP D  G  GF T+LR++++++D   D ++++G    E +   L+
Sbjct: 159 FVAPDRKPFYGGTYSPPNDYPGGLGFLTLLRRIRESFDSAPDRVSRAGVQLTEAVQTMLA 218

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            +       +  P  A+RL  ++    +D R GG   APKFP  + ++++L +  +  D 
Sbjct: 219 PAQGEESWQEISPDPAVRLYQDR----FDDRNGGLVGAPKFPSSLPLRLLLRYFLRTGD- 273

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                      MV  TL+ MA GGI+D  GGGFHRY+ D  W VPHFEKMLYD   L   
Sbjct: 274 ------RRSLSMVELTLRSMAAGGIYDQAGGGFHRYATDTSWLVPHFEKMLYDNALLTVS 327

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL+ +  T    ++ + R+IL YL+RDM  P G  +SA DADS    G   ++EG F+ W
Sbjct: 328 YLEGYQATGAAEFAAVAREILRYLQRDMQAPAGGFYSATDADSLSPGG--HREEGVFFTW 385

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           T +E+   LG E   L    Y +   GN            F+G+++L      +  A  L
Sbjct: 386 TPEELRGTLGPERGDLMAACYGVTQGGN------------FEGRSILHREKSIAELARAL 433

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +  ++    L +CR  L+  R+KRP P  D+K++ SWNGL IS+FA    IL       
Sbjct: 434 KLSEQELELTLADCRELLYRARAKRPLPLRDEKILASWNGLAISAFASGGLIL------- 486

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                    +  E ++VA  AA F+ +++      RL+HSF+ G +K   FLDDYAFLI+
Sbjct: 487 ---------NNAELVQVAVRAAGFMLQNMV--VNGRLRHSFQEGEAKGEAFLDDYAFLIA 535

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GL+DL+E      WL  A+EL     E F DRE GG+F T      ++ R K  +DG  P
Sbjct: 536 GLIDLFEASRDISWLERALELTAAVQEQFEDRESGGFFMTGPHHEELISREKPAYDGVIP 595

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSV ++NL+RL ++   ++       A ++LA F T+L +   A+  M  A + L   
Sbjct: 596 SGNSVMIMNLLRLNTLTGATR---LLDQARNALAAFATQLANSPAALSEMLLAIEYLQQT 652

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
            ++ V++         E  L     +   N+ ++ +   + EE+    +  +    +   
Sbjct: 653 PKEVVIVAPAGKPEAAEPFLEGLRRTLVPNRALVVV--CEGEEL----QRAARLIPLVEG 706

Query: 680 NFS-ADKVVALVCQNFSCSPPVTDP 703
             +  D+ VA +C N SC PP +DP
Sbjct: 707 KTAEGDRAVAYLCANRSCRPPTSDP 731


>gi|269926785|ref|YP_003323408.1| hypothetical protein Tter_1680 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269790445|gb|ACZ42586.1| protein of unknown function DUF255 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 686

 Score =  427 bits (1098), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 259/695 (37%), Positives = 377/695 (54%), Gaps = 62/695 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFE+  +AK++ND FV+IKVDREERPD+D +YM  VQA+ G  GWPL+
Sbjct: 48  SSCHWCHVMAHESFENPEIAKIMNDNFVNIKVDREERPDIDAIYMEAVQAMTGQAGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD KP  GGTYFPPED+ G PGFK +L  + + +  +R  + QS +   +QL +  
Sbjct: 108 VFLTPDGKPFFGGTYFPPEDRVGMPGFKRLLLWLSEVYHTRRQEIEQSASQIAQQLLQIS 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A   S+ +  E+ ++A     + L  S+D ++GGFG+APKFP+P+ ++ +L        
Sbjct: 168 RAELKSHDISLEILESA----CQSLKSSFDHQYGGFGTAPKFPQPMTVEYLL-------Q 216

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +    +  E   MV  TL  M+ GGIHDH+GGGFHRYSVD  W +PHFEKMLYDQ  +A 
Sbjct: 217 SFIRAQQKEYLDMVTLTLVRMSLGGIHDHLGGGFHRYSVDRTWLIPHFEKMLYDQALIAR 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL A+ +T + +Y  +    L Y+ +DM    G  +SA+DADS   EG    +EG +Y+
Sbjct: 277 AYLHAWQVTHNSWYLKVVNRTLQYVLKDMTSSQGGFYSAQDADS---EG----EEGKYYL 329

Query: 320 WTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+  E++ +L E  + L  EHY +  +GN            F+GKN+L         A  
Sbjct: 330 WSLDEIKRVLNEREVELVCEHYGVTASGN------------FEGKNILHIAKSIEDLARD 377

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             M L +   I+ E   KL   R +R  P  D KV+ SWN L+ ++ A        EA  
Sbjct: 378 HNMDLSEVEKIIDEASMKLLHYRDQRTPPAKDTKVVTSWNALMSTTLA--------EAGF 429

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
           AM N         EY+  ++  A F+  +L  +    L H++ +   K PGFL+DYA L 
Sbjct: 430 AMNN--------PEYIAASQRNAQFLLDNLVVDGL--LHHTYSDSKPKVPGFLEDYAALS 479

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
           + L+ LYE  S  KWL  A        + F   E G + +T+ +   + L+ +  +D A 
Sbjct: 480 NSLITLYEITSDGKWLESARRFVQDMIDSFWKEEIGTFSDTSIKHSDIFLQPRNLYDNAT 539

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNS++ + L+RLA I    + D YR+ A   +      +     A   M C A+ L  
Sbjct: 540 PSGNSLACMALLRLAVIF--DRQD-YREIASRVVRGLALVMSKHPTAFGHMLCVANTLLS 596

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           PS + +V++G K SV+ E +L     +Y  NK +I    + TEE    E   S+   +  
Sbjct: 597 PSVE-IVILGDKHSVNTEALLEVIRQTYIPNKILI----STTEE----EASRSDLPLLQG 647

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISL-ENLLLE 712
                +K  A VC+N++CS PV +P  L E L L+
Sbjct: 648 RTLRNNKPTAFVCRNYACSMPVNEPDELREQLTLQ 682


>gi|254442730|ref|ZP_05056206.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
 gi|198257038|gb|EDY81346.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
          Length = 727

 Score =  427 bits (1098), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 259/708 (36%), Positives = 371/708 (52%), Gaps = 72/708 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESF DE +A  LN+ +V IK+DREERPD+D VYMT+VQ L G GGWPL+
Sbjct: 71  STCHWCHVMNRESFSDEEIAAYLNEHYVCIKIDREERPDIDNVYMTFVQNLTGNGGWPLN 130

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGR-PGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSE 137
           V+LSPD KP  GGTYFPP D   R  GF  +++++ D W      +LA+S +  ++ L++
Sbjct: 131 VWLSPDKKPFFGGTYFPPRDDPSRGRGFLPLIQEINDFWIQDPTGVLARSQSI-VDTLNQ 189

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMM 190
             + + ++N       +NA  L  E+LS+S       +D +  GFG+  KFP P  + ++
Sbjct: 190 HSAQTLAANS------ENAASL--ERLSESITAFLFIFDEQNKGFGNDQKFPSPNTLSLL 241

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           L  +   E      + S  +++ L TL  M  GGI DH+GGGFHRY+VD  W +PHFEKM
Sbjct: 242 LRAAATPE--LHQEDRSLAKRLALETLDAMLAGGIRDHLGGGFHRYTVDAGWQLPHFEKM 299

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYDQ  +A+  +DA+ LT +  Y     + LDY+ RD+    G ++SAEDA+S + + + 
Sbjct: 300 LYDQALIASALVDAYQLTGEARYRQAATETLDYVLRDLRHENGGLYSAEDAESLDPDKSF 359

Query: 311 RKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
            K+EGA+Y WT+ + E +    E       H+ L+P GN        P   F G N L  
Sbjct: 360 AKREGAYYTWTTADFERLFPHEEKRAGLAAHFSLRPAGNAPYGNF--PREIFAGYNTLRI 417

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
             D+     +L   L             L   RS R RPHLDDK+I SWNGL IS+ ARA
Sbjct: 418 NPDAKIDPDQLAADLA-----------TLRQDRSTRARPHLDDKIITSWNGLAISALARA 466

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
             +                 +R +Y   A+ AA+F+  +LY  ++ +L   +R   S   
Sbjct: 467 GLVF----------------NRPDYTNAAQQAANFLLENLYQPESQQLLRLYRQDASPVA 510

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
            F +DYA+LI+GLLDLYE  +  +WL  A ELQ  Q++ F D E GGYF     D  V  
Sbjct: 511 AFAEDYAYLIAGLLDLYEADADHRWLQKAHELQLAQNQRFADTENGGYFLFEASDDIVFN 570

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R K+  D A PS NSVS  NL RLA     +    ++Q A  ++  F  +L      +P 
Sbjct: 571 RTKQAADTAIPSPNSVSAKNLARLAQFFDDAS---FQQQASQTINAFAPQLDSSGTTLPT 627

Query: 609 MCCAADMLSVPSRK-HVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD-----TEE 662
           +  A  +L V  +   +V+ G   +   + ML   +     ++T+++ D AD      + 
Sbjct: 628 LREA--ILFVGKKPLQIVIAGDPQTASAQAMLHEVNQRLLPSRTLLYADQADGQAYLGQH 685

Query: 663 MDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           ++F +   S N           K    VC+NF C  P  DP +L   L
Sbjct: 686 LEFIQTAKSYNG----------KATVFVCENFVCQMPTEDPQTLAKQL 723


>gi|402218687|gb|EJT98763.1| hypothetical protein DACRYDRAFT_110659 [Dacryopinax sp. DJM-731
           SS1]
          Length = 705

 Score =  427 bits (1098), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 254/617 (41%), Positives = 349/617 (56%), Gaps = 59/617 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TC WCHVME ESFE+E VAK++ND  V++KVDRE  PDVD+VYM YV A+ G GGWP+S
Sbjct: 46  STCRWCHVMERESFENEEVAKMMNDVCVNVKVDREVLPDVDRVYMNYVTAISGRGGWPMS 105

Query: 80  VFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           V+++PD K P  GGTYFPP+        + IL +VKD W  +RD L   G    + L E 
Sbjct: 106 VWITPDTKIPFFGGTYFPPQ------AMEQILTQVKDKWKNERDKLVPKGNSLSDILQEP 159

Query: 139 LSASASSNKLPDELPQNALRLCAEQ----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
            S ++ +      L Q  L L  ++    L + YD   GGFG APKFP       +   +
Sbjct: 160 ASPTSPA------LSQLGLPLLRDRGLAMLGQMYDRTHGGFGGAPKFPTQSRFSFLHLVA 213

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
              ED+      + G+KM  FTL+ MA GGIHD +G GFHRYSVD  WH+PHFE MLYD 
Sbjct: 214 YLAEDSN-----NLGRKMSAFTLKKMAMGGIHDQIGLGFHRYSVDAAWHIPHFEIMLYDN 268

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP---GGEIFSAEDADSAETEGATR 311
            QLA  YL  + LT D +Y  +   +L YL R ++     G    SAEDA+S E EG T 
Sbjct: 269 AQLAYHYLTYYVLTGDEYYRTVANGVLAYLDRVLLKKTDHGIAYMSAEDAESYEEEGDTI 328

Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           KKEGAFYVWT  ++   LGE     F +H+ +K  GN  L    DPH E +GKNVL+E  
Sbjct: 329 KKEGAFYVWTRAQITAALGEKDGDAFCDHFGVKEEGNVGLEH--DPHKELQGKNVLMEQR 386

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
            +  +A+ LG+  E+   I+   R  L + R KRP+PHLDDK+I SWNGL++ + A+A+ 
Sbjct: 387 SAEETATALGISTEEMEGIINRGREVLREERDKRPKPHLDDKIIASWNGLMLKTLAQAAL 446

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
            L S            G + +++       A F++  +  +   +L   +R   +   G 
Sbjct: 447 RLPS------------GPEPEKFYNQGIEVARFVQNQMIKD--GKLLRCYR---TNVQGV 489

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE-DPSVLLR 549
            +DYA +I+GLL LY+       L  A+ELQ+ QDELF D +  GYF +  + D S ++R
Sbjct: 490 CEDYASVINGLLALYQVKLEPWLLRIAVELQDKQDELFWDEKAWGYFASAEDSDASKIMR 549

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASI-------------VAGSKSDYYRQNAEHSLAVFE 596
           +K+DHDG EPS NS+S+ NLV L SI             ++ S+++ Y+  A+  +  F 
Sbjct: 550 LKDDHDGPEPSANSLSLHNLVTLDSICHATDPFALGIPNMSESRAERYQMYAQKMVTFFT 609

Query: 597 TRLKDMAMAVPLMCCAA 613
            RL     ++P M  AA
Sbjct: 610 PRLLTQPASMPEMVSAA 626


>gi|453087339|gb|EMF15380.1| hypothetical protein SEPMUDRAFT_147282 [Mycosphaerella populorum
           SO2202]
          Length = 800

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 250/597 (41%), Positives = 334/597 (55%), Gaps = 32/597 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF+D  +A+LLN+ F+ +K+DREERPD+D+ YM ++QA  GGGGWPL+
Sbjct: 121 SACHWCHVMAHESFDDPRIAQLLNENFIPVKIDREERPDIDRQYMDFLQATNGGGGWPLN 180

Query: 80  VFLSPD-LKPLMGGTYFPPEDK--YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           VF++P  L+P+ GGTY+P  ++    R GF+ I+ KV  AW ++     QS      QL 
Sbjct: 181 VFVTPGGLEPIFGGTYWPKRERAQQARTGFEDIILKVSTAWREQEQRCRQSAKDITRQLR 240

Query: 137 EALSASA----SSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           E     +      N+  D  EL  + L    +     YD + GGFG APKFP PV I+ +
Sbjct: 241 EFAQEGSIGGKDVNRTDDDAELELDLLDDAFQHYKMRYDDKHGGFGGAPKFPTPVHIRPL 300

Query: 191 L----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           L    Y +   E  G+  E  E + M L TL+ MAKGGI D +G GF RYSV   W +PH
Sbjct: 301 LRVASYPATVREIVGEE-ECIEARSMALMTLEKMAKGGIKDQIGHGFARYSVTRDWSLPH 359

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 305
           FEKMLYD  QL  VYLDA+ LTK   +  I +DI  YL    M    G I SAEDADS  
Sbjct: 360 FEKMLYDNAQLLAVYLDAYLLTKSPLFLEIVKDIATYLTSAPMQSELGGIHSAEDADSFP 419

Query: 306 TEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKN 364
           T     K+EGA+YVWT +E E +L E  +     Y+ +K  GN D  R  D   E   +N
Sbjct: 420 TINDKHKREGAYYVWTLEEFEQVLSEEEVKVCAKYWNVKAEGNVD--RRHDAQGELIKQN 477

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVIS 423
            L    +++  A +L M  +     +   R+ L   R + RP P LDDK++ SWNGL I 
Sbjct: 478 TLCVSRETAELAEELNMAEDDVKRAIDSGRQALLAYREANRPSPSLDDKIVTSWNGLAIG 537

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
           S ARA   L+  +       P  GS    Y+  A  AA  I+ HL+D  +  L+  +R G
Sbjct: 538 SLARAGAALREVS-------PEAGSS---YVSAARKAALCIQNHLFDAMSGTLRRVYREG 587

Query: 484 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 543
           P +  GF DDYAF ISGLLDLYE    + +L  A  LQ TQ++LF D E  G+F+T    
Sbjct: 588 PGETQGFADDYAFFISGLLDLYEATFDSDFLQLADTLQETQNKLFWDPEKYGFFSTPAHQ 647

Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
           P +L+R K+  D AEPS N VS  NL RL S++     + Y + A  ++A FE  ++
Sbjct: 648 PDILIRTKDAMDNAEPSVNGVSASNLFRLGSLL---NDEEYSKMARRTVACFEVEIE 701


>gi|410661555|ref|YP_006913926.1| Thymidylate kinase [Dehalobacter sp. CF]
 gi|409023911|gb|AFV05941.1| Thymidylate kinase [Dehalobacter sp. CF]
          Length = 741

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 271/751 (36%), Positives = 395/751 (52%), Gaps = 82/751 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFED+ VA +LN  ++ +KVDREERPD
Sbjct: 33  GEEAFQKAKEENKPVFLSIGYSTCHWCHVMERESFEDKEVAAILNRSYIPVKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YMTY Q + G GGWPL+V ++PD +P   GTYFP    YGRPG   IL +V + W 
Sbjct: 93  IDQLYMTYCQVMTGAGGWPLTVLMTPDKQPFFAGTYFPKHSHYGRPGLMDILSQVGELWQ 152

Query: 119 KKRDMLAQSGAFAIEQLSEAL----SASASSNKLPDELP---------------QNALRL 159
            ++D + Q+ A   E ++       +A+++  K    LP               +  L  
Sbjct: 153 TEKDKVIQTAAELYETVTRHYRGDKNATSAVPKNKQTLPFTEKEKDSGDIAIWGKTLLGK 212

Query: 160 CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQC 219
             E L   +DS++GGFGSAPKFP P  +  +L +S  +E+       S+   MV  TL  
Sbjct: 213 GYELLENKFDSKYGGFGSAPKFPAPHNLGFLLRYS--MEEP-----QSKALAMVEKTLDS 265

Query: 220 MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRD 279
           MA GGI DH+G GF RYS D  W VPHFEKMLYD   LA VYL+A+  TK+  Y  + ++
Sbjct: 266 MADGGIFDHIGFGFARYSTDHYWLVPHFEKMLYDNAGLALVYLEAYQRTKNQKYRRVAQN 325

Query: 280 ILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEH 339
           I  Y+ RDM    G  +SAEDADS   EG    +EG +Y+W+  E+   L +     ++ 
Sbjct: 326 IFGYVLRDMTSAEGGFYSAEDADS---EG----EEGKYYLWSKDEIRKTLQDGIESLQKE 378

Query: 340 YYL----KPTGN---------CDLSRMSDPHNEFKGKNVL-----IELNDSSASASKLGM 381
             L    KP            CD   ++D  N ++GKN+      + + D ++  S  G 
Sbjct: 379 RELKNGFKPLSKQKEEVADIYCDAYGITDEGN-YEGKNIPSRIFHVGVGDLTSRYSLTGD 437

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
            L + L+I   C   LF  R KR RP  DDK++VSWNGL+I + A+  ++L  +      
Sbjct: 438 ELGEMLDI---CNTILFSAREKRVRPAKDDKILVSWNGLMIGALAKGVQVLSGDLSWE-- 492

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
                 +D+K  +  AE+AA FIR  ++D +  RL   +R G +  PG+LDDYAFL+ GL
Sbjct: 493 ------NDKKSLLLTAENAAGFIRDKMFDSRG-RLLARYREGEAGIPGYLDDYAFLVHGL 545

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           L+LY     T++L  AI LQ  Q++LF D   GGY+ T  +   +LLR KE +DGA PSG
Sbjct: 546 LELYTACGKTEYLEQAIFLQEEQEKLFRDETNGGYYFTGCDAEELLLRPKEIYDGAMPSG 605

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
           NS+S  NL RL  +   SK   +++ AE  +  F T ++D          A    ++   
Sbjct: 606 NSMSACNLGRLWRLTGLSK---WQERAEKQINSFRTTVEDYPPGYTAFLQAI-QYTLNQG 661

Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNF 681
           + +VL G  ++   E M  A    +     V + D +  + +   +++      + R+  
Sbjct: 662 EELVLSGSSANQTLEKMQTAIFKDFHPYAAVAYNDGSLGQLIPRMDDY-----PVGRD-- 714

Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLLLE 712
               +   VC++F+C  PV  P  L  +L E
Sbjct: 715 ----LSVYVCRDFACREPVNTPEELAKILSE 741


>gi|410658568|ref|YP_006910939.1| Thymidylate kinase [Dehalobacter sp. DCA]
 gi|409020923|gb|AFV02954.1| Thymidylate kinase [Dehalobacter sp. DCA]
          Length = 741

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 271/751 (36%), Positives = 395/751 (52%), Gaps = 82/751 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFED+ VA +LN  ++ +KVDREERPD
Sbjct: 33  GEEAFQKAKEENKPVFLSIGYSTCHWCHVMERESFEDKEVAAILNRSYIPVKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YMTY Q + G GGWPL+V ++PD +P   GTYFP    YGRPG   IL +V + W 
Sbjct: 93  IDQLYMTYCQVMTGAGGWPLTVLMTPDKQPFFAGTYFPKHSHYGRPGLMDILSQVGELWQ 152

Query: 119 KKRDMLAQSGAFAIEQLSEAL----SASASSNKLPDELP---------------QNALRL 159
            ++D + Q+ A   E ++       +A+++  K    LP               +  L  
Sbjct: 153 TEKDKVIQTAAELYETVTRHYRGDKNATSAVPKNKQTLPFTEKEKDSGDIAIWGKTLLGK 212

Query: 160 CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQC 219
             E L   +DS++GGFGSAPKFP P  +  +L +S  +E+       S+   MV  TL  
Sbjct: 213 GYELLENKFDSKYGGFGSAPKFPAPHNLGFLLRYS--MEEP-----QSKALAMVEKTLDS 265

Query: 220 MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRD 279
           MA GGI DH+G GF RYS D  W VPHFEKMLYD   LA VYL+A+  TK+  Y  + ++
Sbjct: 266 MADGGIFDHIGFGFARYSTDHYWLVPHFEKMLYDNAGLALVYLEAYQRTKNQKYRRVAQN 325

Query: 280 ILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEH 339
           I  Y+ RDM    G  +SAEDADS   EG    +EG +Y+W+  E+   L +     ++ 
Sbjct: 326 IFGYVLRDMTSAEGGFYSAEDADS---EG----EEGKYYLWSKDEIRKTLQDGIESLQKE 378

Query: 340 YYL----KPTGN---------CDLSRMSDPHNEFKGKNVL-----IELNDSSASASKLGM 381
             L    KP            CD   ++D  N ++GKN+      + + D ++  S  G 
Sbjct: 379 RELKNGFKPLSKQKEEVADIYCDAYGITDEGN-YEGKNIPSRIFHVGVGDLTSRYSLTGD 437

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
            L + L+I   C   LF  R KR RP  DDK++VSWNGL+I + A+  ++L  +      
Sbjct: 438 ELGEMLDI---CNTILFSAREKRVRPAKDDKILVSWNGLMIGALAKGVQVLSGDLSWE-- 492

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
                 +D+K  +  AE+AA FIR  ++D +  RL   +R G +  PG+LDDYAFL+ GL
Sbjct: 493 ------NDKKSLLLTAENAAGFIRDKMFDSRG-RLLARYREGEAGIPGYLDDYAFLVHGL 545

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           L+LY     T++L  AI LQ  Q++LF D   GGY+ T  +   +LLR KE +DGA PSG
Sbjct: 546 LELYTACGKTEYLEQAIFLQEEQEKLFRDETNGGYYFTGCDAEELLLRPKEIYDGAMPSG 605

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
           NS+S  NL RL  +   SK   +++ AE  +  F T ++D          A    ++   
Sbjct: 606 NSMSACNLGRLWRLTGLSK---WQERAEKQINSFRTTVEDYPPGYTAFLQAI-QYALNQG 661

Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNF 681
           + +VL G  ++   E M  A    +     V + D +  + +   +++      + R+  
Sbjct: 662 EELVLSGSSANQTLEKMQTAIFKDFHPYAAVAYNDGSLGQLIPRMDDY-----PVGRD-- 714

Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLLLE 712
               +   VC++F+C  PV  P  L  +L E
Sbjct: 715 ----LSVYVCRDFACREPVNTPEELAKILSE 741


>gi|322794007|gb|EFZ17245.1| hypothetical protein SINV_09516 [Solenopsis invicta]
          Length = 891

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 279/784 (35%), Positives = 387/784 (49%), Gaps = 131/784 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA---------- 69
           +TCHWCHVME ESF++E VAK++N+ +V+IKVDREERPD+D + M ++QA          
Sbjct: 144 STCHWCHVMEKESFKNEEVAKIMNEHYVNIKVDREERPDIDMMCMMFIQASLYLVSGTTR 203

Query: 70  LYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA 129
           L G GGWPLSVFL+PDL P+ GGTYF          F   L ++   W   RD + +S  
Sbjct: 204 LRGHGGWPLSVFLTPDLMPITGGTYF------SSSMFTLYLTRIMKEWTDGRDKMIKSAT 257

Query: 130 FAIEQLSEALSASASSNKLP-----------------------DELPQ-NALRLCAEQLS 165
              E+L E L+ S    K+                        D +P  ++  LCA  L 
Sbjct: 258 TIAERLKE-LATSREDIKVSECYLKFLNYFNNVFYLLIFAIQDDGVPAIDSAFLCAHVLM 316

Query: 166 KSYDSRFGGFGSA-------PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQ 218
             YDS +GGFGS+       PKFP P  +  +L        T      S+     L TL+
Sbjct: 317 NIYDSEYGGFGSSSAINPNSPKFPEPSNLNFLLSMHVLTTSTMLVEMTSDA---CLNTLK 373

Query: 219 CMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICR 278
            M+ GGIHDH+G GFHRY+VD RW VPHFEKMLYDQ QL   Y DA+ +TKD FYS I  
Sbjct: 374 KMSYGGIHDHIGKGFHRYTVDARWKVPHFEKMLYDQAQLIQCYADAYLITKDSFYSDIVD 433

Query: 279 DILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI---- 334
           DI  Y+ R +    G  FSAEDADS  T  A+ K+EGAFYVWT   ++ +L +  +    
Sbjct: 434 DIATYVLRILQHMEGGFFSAEDADSLPTSDASAKREGAFYVWTYDRLKTLLKKEKVPGKD 493

Query: 335 ------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 388
                 L   H+ ++  GN +  +  DPH E  GKNV         +AS   + +E+   
Sbjct: 494 NVTYFDLICRHFSVRKEGNVESPQ--DPHGELTGKNVFSMQAGIEDTASHFKLSVEETQK 551

Query: 389 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 448
            L E    LF+ R+ RP P LDDK++ +WNGL+IS  ARA   +K+              
Sbjct: 552 HLKEACTILFEDRTHRPWPQLDDKMVTAWNGLMISGLARAGIAVKN-------------- 597

Query: 449 DRKEYMEVAESAASFIRRHLYDEQTHRLQHS----------------------------- 479
             K Y+E A  AA+F+ ++L+D++   L  S                             
Sbjct: 598 --KTYVEAATEAATFVEKYLFDKKKRILLRSCYRRRDDKIVQRQVLSLHQSVSRCEIYDA 655

Query: 480 -FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
            +R+ P   PGF +DYAF + GLLDLYE      W+ +A ELQ+ QD LF D + GGYF 
Sbjct: 656 IYRSTP--IPGFHEDYAFYVKGLLDLYEATFNPHWVEFAEELQDIQDRLFWDLQDGGYFA 713

Query: 539 TTGEDPSVLLRVKE---------DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
              E P +L R K+           DGA PS NS++  NL+RLA  +     D  R  AE
Sbjct: 714 MAEESP-ILTRTKDFKIPMSFVVADDGALPSSNSIACSNLLRLAIYL---DRDDLRNKAE 769

Query: 590 HSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN 649
             L  F  +L     A P M  A      P++ +V   G   + +   ML    +     
Sbjct: 770 KLLCAFGNKLVSCPAACPQMMLALIEYHHPTQIYV--TGKTDAKETNEMLEIIRSRLIPG 827

Query: 650 KTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
           + +I  D    + + F     + N  + R     D+ +  +C++++CS P++ P +L + 
Sbjct: 828 RVLILADAEQQDNVLF-----NRNMIVKRMKPQKDRAMVFICRDYTCSLPISSPSALISE 882

Query: 710 LLEK 713
           L +K
Sbjct: 883 LNKK 886


>gi|83816674|ref|YP_445669.1| hypothetical protein SRU_1548 [Salinibacter ruber DSM 13855]
 gi|83758068|gb|ABC46181.1| Protein of unknown function, DUF255 family [Salinibacter ruber DSM
           13855]
          Length = 701

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 259/695 (37%), Positives = 358/695 (51%), Gaps = 51/695 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED+ VA LLND FV IKVDREERPDVD +YM   Q + G GGWPL+
Sbjct: 48  STCHWCHVMERESFEDDDVAALLNDGFVPIKVDREERPDVDSIYMDVCQMMRGQGGWPLT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSE 137
           V L+PD KP    TY P E ++ + G   +L +VK  W  D +  +L  +     EQ+++
Sbjct: 108 VLLTPDRKPFFAATYLPKEGRFQQTGLMDLLPRVKQLWNSDDRAKLLDDA-----EQVTD 162

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            L          D      L   A QL++ +D   GGFGSAPKFP P  +  +L H  + 
Sbjct: 163 RLQRIGDDQTDGDAPGPTLLDDAARQLAQQFDRTHGGFGSAPKFPAPHNLLFLLRHWHR- 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG+    ++    V  TL  M  GG+ D VG GFHRYS D++W +PHFEKMLYDQ   
Sbjct: 222 --TGEQAALNQ----VTTTLDRMRWGGLFDQVGYGFHRYSTDQQWKLPHFEKMLYDQAMH 275

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+  T    Y    R++L Y+RRD+  P G  FSAEDADS   EG    +EGAF
Sbjct: 276 VLAYTEAYQATGTDRYERTAREVLTYVRRDLQAPDGGFFSAEDADSLNAEGDM--EEGAF 333

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW+ +++ + L    A L  + Y + P GN    R      E  GKNVL      +A+A
Sbjct: 334 YVWSIEDIREHLEPALADLVIDVYNMSPAGNYQEERT----GERTGKNVLHRDQSLAAAA 389

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            + GM ++   + L   RR L D RS+RPRP LDDKV+  WNGL+ ++ A+A+++     
Sbjct: 390 EQRGMEVDVLRDHLETARRVLLDARSERPRPGLDDKVLTDWNGLMTAALAKAARVF---- 445

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       D  ++ E A     F+   ++D    RL H +R G +     LDDYAF
Sbjct: 446 ------------DDAQFEEAAVQTGRFVLDTMHDADG-RLLHRYREGEAGIQATLDDYAF 492

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI GLL+LYE      WL  A+E      + F D EGGG++ T  +  ++++R KE +DG
Sbjct: 493 LIWGLLELYETTFDADWLRAAVEHMEAALDRFWDAEGGGFYMTPEDGEALIVRPKEANDG 552

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV ++NL+RLA      ++++  + A  S     T  +       ++      L
Sbjct: 553 ALPSGNSVQLMNLLRLARFTG--RTEFEERAAALSRWAGATARRRPTGFTAMLSGLHWAL 610

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             P  + VV+ G   S D   ++      Y      +   P D +         +  A  
Sbjct: 611 GTP--REVVVAGEPDSDDTNALIDVLRDDYTPTTVTLQRPPGDAD--------ITALAPF 660

Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
             +    D +  A VC+ F C  PVTDP +L   L
Sbjct: 661 TESQTPVDGRAAAYVCEAFRCEAPVTDPAALREQL 695


>gi|331269923|ref|YP_004396415.1| thymidylate kinase [Clostridium botulinum BKT015925]
 gi|329126473|gb|AEB76418.1| thymidylate kinase [Clostridium botulinum BKT015925]
          Length = 671

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 250/711 (35%), Positives = 379/711 (53%), Gaps = 78/711 (10%)

Query: 5   SFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDK 61
           +F    K  +  FL    ++CHWCHVME ESFEDE VAK+LND ++SIKVDREERPDVD 
Sbjct: 35  AFLKAKKEDKPIFLSIGYSSCHWCHVMEKESFEDEEVAKILNDKYISIKVDREERPDVDN 94

Query: 62  VYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR 121
            YMT+ Q++ G GGWPL++ ++P+ KP   GTYFP +  YGRPGF  IL+++ D W   +
Sbjct: 95  TYMTFCQSVTGSGGWPLTIIMTPEQKPFFAGTYFPKKSMYGRPGFIQILKQISDEWKSNK 154

Query: 122 DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKF 181
           + +  +    +  + E +S   S      E+ +  L+    +++  YD+++GGFG++PKF
Sbjct: 155 NNIINTSNELLNTMEEHISQDKSG-----EINETILQDAVIEMNYYYDNKYGGFGASPKF 209

Query: 182 PRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
           P P ++ ++L + K   +    G       MV  TL+CM KGGI DH+G GF RYS DE+
Sbjct: 210 PTPHKLMLLLINYKVYNNKNALG-------MVENTLKCMYKGGIFDHIGFGFSRYSTDEK 262

Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
           W VPHFEKMLYD   LA VY  A+ +T   FY  +   I  Y+ RDM  P G  +SAEDA
Sbjct: 263 WLVPHFEKMLYDNALLAYVYTQAYQVTGKSFYKEVAEKIFKYILRDMTSPEGGFYSAEDA 322

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
           DS   EG     EG FYVWT  E+E ILGE A  F   Y +   GN            F+
Sbjct: 323 DS---EGV----EGKFYVWTLHEIESILGEDAKEFCNIYNITKNGN------------FE 363

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
           G N+           + +G  L+  ++ L   R+KLF+VR KR  P  DDK++ +WN L+
Sbjct: 364 GSNI----------PNLIGKDLDD-IDKLESLRKKLFEVREKRIHPFKDDKILTAWNALM 412

Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
           I + A A ++ ++E                +Y+  A+ A +FI  +L   +  RL   FR
Sbjct: 413 IVALAYAGRVFENE----------------KYINRAKKAYNFIENNLI-RKDGRLLARFR 455

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
           +G +    +L+DY+FL+  L++LYE    +K+L  A+   +   +LF D E  G+F++  
Sbjct: 456 HGEAAYIAYLEDYSFLVWALMELYEATFDSKYLKQALHFTDEMIKLFWDEESYGFFHSGK 515

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
           +   ++L +K+ +D A PSGNS++ +NL++L+ I   +      + A   +  F   + +
Sbjct: 516 DGEKLILNLKDSYDMAIPSGNSIAAMNLIKLSKITGDNT---LAEKAYKMIEGFGGNIIE 572

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
              +  +   A      PS + +V+   K    F++M+   +  + +  T   ++  D E
Sbjct: 573 SIQSHSIFLMAYMNYIRPSTQ-IVIASEKQDELFKDMIREVNKRF-MPFTTTLLNDGDLE 630

Query: 662 EMDFWEEHNSNNASMARNNFSA-DKVVALVCQNFSCSPPVTDPISLENLLL 711
                     N     +N     +K  A VC+NFSC+ PV +      LL+
Sbjct: 631 ----------NVIPFIKNEKKIYNKTTAYVCENFSCNRPVDNVEDFIKLLI 671


>gi|365158244|ref|ZP_09354475.1| hypothetical protein HMPREF1015_02341 [Bacillus smithii 7_3_47FAA]
 gi|363621167|gb|EHL72387.1| hypothetical protein HMPREF1015_02341 [Bacillus smithii 7_3_47FAA]
          Length = 678

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 264/687 (38%), Positives = 371/687 (54%), Gaps = 76/687 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  VA+LLN +FV+IKVDREERPD+D VYMT  Q + G GGWPL+
Sbjct: 53  STCHWCHVMERESFEDPEVAELLNQYFVAIKVDREERPDIDSVYMTVCQMMTGQGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD KP   GTYFP   +YGRPG   IL ++  A+ +  D +A  G+  +E L E  
Sbjct: 113 VFLTPDKKPFYAGTYFPKNSQYGRPGMMDILPQLHRAYHQDPDRIADIGSRLVEALKE-- 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
               +  K   ++ + A+    EQL+  +DS +GGFG APKFP P ++  +   YH    
Sbjct: 171 ---EAGRKSEGDVTEEAVHKGFEQLAGKFDSLYGGFGEAPKFPSPHQLLFLFRYYHM--- 224

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                +GE S   KM   TL  MA GGI+DH+GGGF RYS D  W VPHFEKMLYD   L
Sbjct: 225 -----TGEES-ALKMAEKTLDSMAAGGIYDHIGGGFSRYSTDGMWLVPHFEKMLYDNALL 278

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ +TK+  Y  I  +I D++ R+M  P G  +SA DADS   EG    +EG F
Sbjct: 279 MYAYTEAYQITKNERYRRIVLEIADFVAREMTHPEGGFYSAIDADS---EG----EEGKF 331

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSAS 375
           YVW+ +E+ D+LGE    +F E Y++   GN            F+GKN+L  L  D    
Sbjct: 332 YVWSKEEIMDVLGEETGTIFSELYHVTDQGN------------FEGKNILHLLQTDLETI 379

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+   + +E+  N++ + ++ LF  R KR +PH+DDKV+ SWNGL+I++ A+A  +    
Sbjct: 380 AANHELSIEELENLMSKAKQFLFQAREKRVKPHVDDKVLTSWNGLMIAALAKAGSV---- 435

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                F+ P + S        A  A +F+ ++++ E+  RL   FR G +K  G+LDDYA
Sbjct: 436 -----FDDPGLLSQ-------ARKAMAFLEKYVWKEK--RLMARFREGEAKYRGYLDDYA 481

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FL+ G L+L+        L +AIEL+N   E F D E GG+F T  +   +L+R K  +D
Sbjct: 482 FLLWGTLELFLAEDDLHMLSFAIELKNALFERFWD-ENGGFFFTDRDGEELLVREKPGYD 540

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV+   L RLA +    +     +  E  +  F   L    +++  M  AA  
Sbjct: 541 GAYPSGNSVAAYQLWRLAKLTGDIE---LMKRVEMCVRSFSKELNAFPVSMLYMLEAAMA 597

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L    R+ V+++G   S                 + V+     +    D W  H      
Sbjct: 598 LFAQGRE-VIVIGSNGSE---------------KRAVLWRCREEFLPFDVWSGHRPEWLE 641

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
            A      D +V  +C+N +C  P+ D
Sbjct: 642 GAAKQKETDLLV-FICENQACKMPMED 667


>gi|296132106|ref|YP_003639353.1| hypothetical protein TherJR_0579 [Thermincola potens JR]
 gi|296030684|gb|ADG81452.1| protein of unknown function DUF255 [Thermincola potens JR]
          Length = 673

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 265/695 (38%), Positives = 376/695 (54%), Gaps = 77/695 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA +LN+ +VSIKVDREERPD+D +YM+  QA+ G GGWPL+
Sbjct: 52  STCHWCHVMERESFEDEEVAAILNEHYVSIKVDREERPDIDTIYMSVCQAMTGHGGWPLT 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V ++PD KP   GTYFP +   G PG   IL ++ D W +++  L +SG    E+++EA+
Sbjct: 112 VIMTPDKKPFFAGTYFPKKSSRGMPGLTDILIQIADLWRERKKELTESG----EKITEAV 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           ++   S+   D + +  L        +++D  +GGFG+APKFP P  +  +L + K    
Sbjct: 168 NSHLFSHTGGD-VSKEMLDKAFAYFEENFDRLYGGFGAAPKFPTPHNLTFLLRYWK---- 222

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
              +G A E   MV  TL  M +GGI+DH+G GF RYS D +W VPHFEKMLYD   LA 
Sbjct: 223 MSGNGAALE---MVEKTLDAMYRGGIYDHIGFGFARYSTDRKWLVPHFEKMLYDNALLAI 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+A+  T +  Y+    +I  Y++RDMI P G  +SAEDADS   EG    +EG FYV
Sbjct: 280 AYLEAYQATGNRKYAKTAEEIFTYVQRDMISPEGGFYSAEDADS---EG----EEGKFYV 332

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           WT +EV+++LG+     F   Y +   GN            F+ K++  LIE        
Sbjct: 333 WTPEEVKEVLGDTLGRYFCRDYDITAQGN------------FESKSIPNLIETG------ 374

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
                    Y+    E R+KLF  R +R  P  DDK++ +WNGL+I++ A  ++ L    
Sbjct: 375 ---------YVEGYEEARKKLFARREQRVHPFKDDKILTAWNGLMIAAMAYGARAL---- 421

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                         K+Y EVA  A +FI ++L  E   RL   FR+G +   G+LDDYA 
Sbjct: 422 ------------GEKKYAEVAAKAVNFINKNLRREDG-RLSARFRDGEAAFLGYLDDYAC 468

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
            + GL++LYE      +L  A+EL N   +LF D E GG F    +  +++ R KE +DG
Sbjct: 469 YVWGLIELYEATFEPAYLEQALELNNDMLKLFWDEENGGLFLYGNDAENLITRPKEIYDG 528

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A P+GNSV+ +NL RLA +    +     + A   L  F   + +  M       A   L
Sbjct: 529 ALPAGNSVAAVNLFRLARLTGDRQ---LAERAREQLKAFGGSVAESPMGHSHFLMAV-WL 584

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
            +     + +VG + + D E MLA  ++ +    TVI + P   E      E  +   + 
Sbjct: 585 DLTPPVDITVVGDRKAGDTEKMLATVNSRFMPEATVI-LKPPGPE-----GEKLAQAVAF 638

Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
            R+  + + K  A VC+N+SC PPVTD   LE LL
Sbjct: 639 LRDRQAVNGKATAYVCKNYSCHPPVTDADKLEKLL 673


>gi|87306323|ref|ZP_01088470.1| hypothetical protein DSM3645_08327 [Blastopirellula marina DSM
           3645]
 gi|87290502|gb|EAQ82389.1| hypothetical protein DSM3645_08327 [Blastopirellula marina DSM
           3645]
          Length = 688

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/700 (38%), Positives = 380/700 (54%), Gaps = 74/700 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE++ +A  LN+ FVSIKVDREERPD+D++YM  VQ L G GGWP+S
Sbjct: 48  SACHWCHVMEHESFENQEIADYLNEHFVSIKVDREERPDLDQIYMNAVQMLTGRGGWPMS 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM-LAQSGAFAIEQLSEA 138
           VFL+P LKP  GGTY+PP  + G PGF  +L+ V DAW+ +R + L QS  FA E+L E 
Sbjct: 108 VFLTPQLKPFFGGTYWPPTPRGGMPGFDQVLKAVMDAWENRRAIALEQSEKFA-ERLQEI 166

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
             A  S  ++   L  +A +     L   YD R GGFG APKFP  ++I++ L +S++  
Sbjct: 167 GQAEDSGEQIDLHLLDDAYKY----LESIYDFRHGGFGGAPKFPHTMDIEVCLRYSRR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                  +S   +M +  L  MA+GGI+DH+GGGF RYSVD RW VPHFEKMLYD   LA
Sbjct: 221 -----QPSSRALEMAIHNLDQMARGGIYDHLGGGFARYSVDARWLVPHFEKMLYDNALLA 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY+D +  T    ++ + R+  DY+   +    G   S EDADS   EG    +EG FY
Sbjct: 276 GVYIDGYRATGREDFARVARETCDYVLHYLTDEAGGFQSTEDADS---EG----EEGKFY 328

Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---IELNDSSA 374
           VWT +E+ DILGE     F E + +  +GN            F+GKN+L     + D  A
Sbjct: 329 VWTPQEIVDILGEGEGRRFCEIFDVSESGN------------FEGKNILNLPQSIEDWGA 376

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
           +++   + L + L++    R++L  VR KR RP  DDKV+VSWNGL+I S ARA+  L  
Sbjct: 377 ASNLDVVELRRELDV---ARQQLLQVRDKRIRPAKDDKVLVSWNGLMIDSLARAAGALSE 433

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                            +Y+  AE AA F+   + D+ + RL HS+R+G +K   +LDDY
Sbjct: 434 ----------------PKYLIAAERAADFVFDKMIDD-SGRLLHSYRHGVAKLAAYLDDY 476

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A L +  + LYE     +WL  AIEL N     F D  GGGY+ T  +   ++ R K+ +
Sbjct: 477 ANLANACISLYEASFAERWLKRAIELTNLMMRHFGDPVGGGYYFTADDHEKLIARNKDLY 536

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D + PSGNS++ + L+RL++++  ++       A  ++ V    +K    A   M  A D
Sbjct: 537 DNSVPSGNSMAAVVLLRLSALLGNTE---LLDEAVTTIRVAAPLMKKHPTATGQMLAAVD 593

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
               P+R+ VV+ G+  S      LA    SY  N  +  +           E+   + +
Sbjct: 594 RYLGPARE-VVIFGNADSGATHEFLAELRRSYTPNSAIACVSS---------EKALPSGS 643

Query: 675 SMA-----RNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
            +A     +           VC+NF+C  PVT   ++ +L
Sbjct: 644 PLAPIFAGKGPLPEADGTVYVCENFACQRPVTAAEAIADL 683


>gi|386002945|ref|YP_005921244.1| hypothetical protein Mhar_2269 [Methanosaeta harundinacea 6Ac]
 gi|357211001|gb|AET65621.1| hypothetical protein Mhar_2269 [Methanosaeta harundinacea 6Ac]
          Length = 698

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 268/723 (37%), Positives = 369/723 (51%), Gaps = 70/723 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVM  ESFEDE VA+LLN  FV IKVDREERPD
Sbjct: 29  GEEAFTRAEREDKPVFLSIGYSTCHWCHVMAAESFEDEEVARLLNATFVPIKVDREERPD 88

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYM   Q + G GGWPL+VFL+PD KP    TY P E ++GR G   ++ ++   W 
Sbjct: 89  LDAVYMAVAQMMTGSGGWPLTVFLTPDKKPFFAATYIPKESRFGRIGILDLIPRIGHLWK 148

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP-----QNALRLCAEQLSKSYDSRFG 173
            +R ML          LS A   +++  + P E+P     +  ++   + L   +D+  G
Sbjct: 149 NERAML----------LSSAEEVASALRRPPPEVPGLRLEEATIKAAYQGLVARFDAANG 198

Query: 174 GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
           GFG APKFP P     +L H ++  D G       G +M   TL+ M +GGI DH+GGGF
Sbjct: 199 GFGGAPKFPSPTTFLFLLRHWRRTGDPG-------GVQMTEVTLRAMRRGGIFDHLGGGF 251

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D  W +PHFEKMLYDQ  ++   L+A   T    Y+ I R++ DYL RD+  P G
Sbjct: 252 HRYSTDLHWRLPHFEKMLYDQAMISLACLEAHQATGKAEYATIAREVFDYLLRDLAAPEG 311

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSR 352
             +SAEDADS   EG    +EG FY+WT  EV  +L  + A L    ++L+  GN     
Sbjct: 312 GFYSAEDADS---EG----EEGRFYLWTLPEVRAVLDPDEAELAARIFHLQEEGNF---- 360

Query: 353 MSDPHNEFKGKNVL---IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
             +      GKNVL   I L D    A ++G+P+      L   R KLF  R  R RP  
Sbjct: 361 REEATGRLTGKNVLAMKIPLED---HAREMGIPVGDLREWLEAAREKLFAAREGRARPKK 417

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           DDK++  WNGL I++ AR +++L              G  R E  E A+ AA  +   + 
Sbjct: 418 DDKILADWNGLAIAALARGAQVL--------------GDRRLE--EAADRAADLVLHRMR 461

Query: 470 DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 529
           DE+  RL H +R G +   G LDDYA ++ GLL+LYE G   + L  A+ L     E F 
Sbjct: 462 DERG-RLLHRYRGGDAGILGNLDDYANMVWGLLELYEAGFRPERLEAALALARDMVERFR 520

Query: 530 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
           DR+GGG+F T  +   +++R K+ HDGA P+GN+V+  NL+RLA +    + +       
Sbjct: 521 DRDGGGFFFTPEDGEELIVRRKDGHDGALPAGNAVAAFNLLRLARMTGDPELEVI---GS 577

Query: 590 HSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN 649
             L  F  + +    A   +  A D    PS   VV+VG   S +   ML A  + +   
Sbjct: 578 EGLQAFAAQARGSPSAFLHLLSALDFALGPS-SEVVVVGEAGSPETAEMLKALRSRFLPR 636

Query: 650 KTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
           K V+     + + +    E     A M        +  A VC    C  P TDP ++  L
Sbjct: 637 KVVLGRPVGEDQRI---VELAGFTAEM---EALEGRTTAYVCSGRVCRQPTTDPAAVLKL 690

Query: 710 LLE 712
           L E
Sbjct: 691 LEE 693


>gi|221632535|ref|YP_002521756.1| hypothetical protein trd_0509 [Thermomicrobium roseum DSM 5159]
 gi|221156894|gb|ACM06021.1| Protein of unknown function, DUF255 family [Thermomicrobium roseum
           DSM 5159]
          Length = 687

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 253/706 (35%), Positives = 370/706 (52%), Gaps = 88/706 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME E FE+  +A+L N+ FV+IKVDREERPD+D++YM  +QA+ G GGWPL+
Sbjct: 48  SSCHWCHVMERECFENPEIAQLQNELFVNIKVDREERPDLDELYMNALQAMTGSGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG----AFAIEQL 135
           VFL+PD KP  GGTYFPPED+   P +  +L  V  A+ ++R  + ++     ++  +Q 
Sbjct: 108 VFLTPDGKPFYGGTYFPPEDRGQLPAWPRVLLAVAQAYRERRADVERAAEDLVSYLQQQS 167

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
              L A+    +  DE  +N        L   YD   GGFG+APKFP P++++ +L    
Sbjct: 168 RPPLQAAPLREQFLDEAARN--------LVPHYDREHGGFGTAPKFPSPLQLEFLL---- 215

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
               T +   A    +MVL TL  MA+GGIHD +GGGFHRY+VDE W VPHFEKMLYD  
Sbjct: 216 ---RTFRRAGAPRALEMVLQTLTAMARGGIHDQIGGGFHRYTVDEAWLVPHFEKMLYDNA 272

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            LA VY  A   + +     I  + L Y++R+M G  G  F+A+DADS E        EG
Sbjct: 273 LLARVYTLAHLASGNRLCRTIAEETLVYIQREMRGDHGAFFAAQDADSEE-------GEG 325

Query: 316 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           AFY+WT +E+  +LG + A L   ++ + P GN            F+GK++L    D   
Sbjct: 326 AFYLWTPEEIAAVLGNDDAGLACRYFGVTPRGN------------FEGKSILHVAEDPVT 373

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            AS+ G+ L++    +G  R +L++ R +RP P  D+KVIV+WN L I +FA A   L  
Sbjct: 374 IASEFGLSLDELEQRIGSIRARLYEARDQRPHPARDEKVIVAWNALAIRAFAEAGTAL-- 431

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                         DR +++ +AE AA+F+R  L+D +T  L H +  G ++ PGFLDDY
Sbjct: 432 --------------DRPDFVALAERAATFLRDQLWDGKT--LYHVWEEGEARFPGFLDDY 475

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A L++ L+ LYE      W+ WA +L       F+D   G +++T  +   +++R K   
Sbjct: 476 ADLVNALVSLYEATFDPFWIAWARQLTEAILAKFIDPVAGDFYDTASDGEQLIVRPKTFI 535

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSK---------SDYYRQNAEHSLAVFETRLK-DMAM 604
           D   PSGN  +   L+RL +++   +           Y +   EH +A  +  L  D A+
Sbjct: 536 DQGTPSGNGATAEALLRLGTLLGEHRFIDQARTLLERYAQLAVEHPIACGQLLLAMDFAL 595

Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
             P                V ++G  +  +   +L    ASY  N+ +    P D     
Sbjct: 596 GQPF--------------EVAIIGDPTQPETRALLRVVQASYLPNRVLALRRPED----- 636

Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             E   S    +A  +       A VC+NF+C  PVT P  L + L
Sbjct: 637 --EIAASIVPLLAERSLVDGHPAAYVCRNFACQRPVTTPQELASQL 680


>gi|302392081|ref|YP_003827901.1| hypothetical protein [Acetohalobium arabaticum DSM 5501]
 gi|302204158|gb|ADL12836.1| protein of unknown function DUF255 [Acetohalobium arabaticum DSM
           5501]
          Length = 686

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 259/691 (37%), Positives = 373/691 (53%), Gaps = 76/691 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA++LN  FV+IKVDREERPD+D +YMT  Q L G GGWPL+
Sbjct: 55  STCHWCHVMERESFEDEEVAEILNRSFVAIKVDREERPDIDNIYMTVCQTLTGRGGWPLT 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V ++P+ KP   GTYFP E   G+PG   IL +V+ AW KKR  L ++     E++  AL
Sbjct: 115 VIMTPEKKPFFAGTYFPKEAGRGQPGLMDILIRVEQAWKKKRQPLLETS----EEILSAL 170

Query: 140 SASASSNKLPDELPQNALRLCAE---QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
                ++K      +    L  E       ++D  +GGFG+APKFP P  +  +L + K 
Sbjct: 171 ERVNDTDKNDSASMEEMSGLAKEAFISFVANFDEDYGGFGTAPKFPTPHNLMFLLRYWK- 229

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 +GE  +  +MV  TL  M +GG++DH+G GF RYS DE+W VPHFEKMLYD   
Sbjct: 230 -----STGE-EKALEMVETTLDNMYRGGMYDHLGYGFARYSTDEKWLVPHFEKMLYDNAL 283

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           LA  YL+A+ +T    Y+ I R+I  Y+ RD+  P G  +SAEDADS        ++EG 
Sbjct: 284 LAVTYLEAYQITDKEDYADIAREIFTYVLRDLTSPEGGFYSAEDADS-------EREEGK 336

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LI--ELNDS 372
           FYVWT  E++ ILG       E +       C +  ++D  N F+GK++  LI  EL+ S
Sbjct: 337 FYVWTPNEIKKILGNKQ---GEEF-------CQVYNITDEGN-FEGKSIPNLIGTELDKS 385

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
                                R++LF  R KR  PH DDK++ SWNGL+I++ A  +++L
Sbjct: 386 EVDKK------------FAAERKELFKAREKRVHPHKDDKILTSWNGLMIAALAIGARVL 433

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
             E                 Y + A+ AA FI ++L  +   RL   +RNG +   G++D
Sbjct: 434 NDE----------------RYQQAAKEAAEFIWQNLRRDGNGRLLARYRNGEADYYGYVD 477

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAF I GL++LYE    T++L  A EL N   E F D+E GG +    +   +L R KE
Sbjct: 478 DYAFFIWGLIELYETTFETEYLEKAAELNNDLIEYFWDKEQGGLYFYGYDSEELLTRPKE 537

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
            +DGA PSGNSV+ +NL+RLA ++  ++ +   + A      F +R+ +  +A      +
Sbjct: 538 IYDGAIPSGNSVATLNLLRLAKLIGDTELE---EKARQQFEYFGSRITNKPIASSYFLLS 594

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
             + +    + +V+ G++     E M+   H  + L  TV  ++   T+E     +  S 
Sbjct: 595 W-LFAQNGGREIVIAGNREETVTEEMVQVLHQEF-LPFTVSLLNT--TQE----RKKLSE 646

Query: 673 NASMARNNFSADKV-VALVCQNFSCSPPVTD 702
               A +    DK   A +C+NF+C  PV D
Sbjct: 647 LVPFAADQMKVDKRPTAYICENFACQKPVID 677


>gi|294507561|ref|YP_003571619.1| hypothetical protein SRM_01746 [Salinibacter ruber M8]
 gi|294343889|emb|CBH24667.1| conserved hypothetical protein [Salinibacter ruber M8]
          Length = 701

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 257/691 (37%), Positives = 356/691 (51%), Gaps = 51/691 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED+ VA LLND FV IKVDREERPDVD +YM   Q + G GGWPL+
Sbjct: 48  STCHWCHVMERESFEDDDVAALLNDGFVPIKVDREERPDVDSIYMDVCQMMRGQGGWPLT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSE 137
           V L+PD KP    TY P E ++ + G   +L +V+  W  D +  +L  +     EQ+++
Sbjct: 108 VLLTPDRKPFFAATYLPKEGRFQQTGLMDLLPRVRQLWNSDDRAKLLDDA-----EQVTD 162

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            L          D      L   A QL++ +D   GGFGSAPKFP P  +  +L H  + 
Sbjct: 163 RLQRIGDDQTDGDAPGPTLLDDAARQLAQQFDRTHGGFGSAPKFPAPHNLLFLLRHWHR- 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG+    ++    V  TL  M  GG+ D VG GFHRYS D++W +PHFEKMLYDQ   
Sbjct: 222 --TGEQAALNQ----VTTTLDRMRWGGLFDQVGYGFHRYSTDQQWKLPHFEKMLYDQAMH 275

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+  T    Y    R++L Y+RRD+  P G  FSAEDADS   EG    +EGAF
Sbjct: 276 VLAYTEAYQATGTDRYERTAREVLTYVRRDLQAPDGGFFSAEDADSLNAEGDM--EEGAF 333

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW+ +++ + L    A L  + Y + P GN    R      E  GKNVL      +A+A
Sbjct: 334 YVWSIEDIREHLEPALADLVIDVYNMSPAGNYQEERT----GERTGKNVLHRDQSLAAAA 389

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            + GM  +   + L   RR L D RS+RPRP LDDKV+  WNGL+ ++ A+A+++     
Sbjct: 390 EQRGMEADVLRDHLDTARRVLLDARSERPRPGLDDKVLTDWNGLMTAALAKAARVF---- 445

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       D  ++ E A     F+   ++D    RL H +R G +     LDDYAF
Sbjct: 446 ------------DEAQFEEAAVQTGRFVLDTMHDADG-RLLHRYREGEAGIQATLDDYAF 492

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI GLL+LYE      WL  A+E      + F D EGGG++ T  +  ++++R KE +DG
Sbjct: 493 LIWGLLELYETTFDADWLRAAVEHMEAALDRFWDAEGGGFYMTPEDGEALIVRPKEANDG 552

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV ++NL+RLA      ++++  + A  S     T  +       ++      L
Sbjct: 553 ALPSGNSVQLMNLLRLARFTG--RTEFEERAAALSRWAGATARRRPTGFTAMLSGLHWAL 610

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             P  + VV+ G   S D   ++      Y      +   P D +         +  A  
Sbjct: 611 GTP--REVVVAGEPDSDDTNALIDVLRDDYTPTTVTLQRPPGDAD--------ITALAPF 660

Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISL 706
             +    D +  A VC+ F C  PVTDP +L
Sbjct: 661 TESQTPVDGRAAAYVCEAFRCEAPVTDPAAL 691


>gi|25326752|pir||A88216 protein B0495.5 [imported] - Caenorhabditis elegans
          Length = 722

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 262/727 (36%), Positives = 373/727 (51%), Gaps = 60/727 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F       +  FL    +TCHWCHVME ESFE+E  AK+LND FV+IKVDREERPD
Sbjct: 35  GQEAFQKAKDNNKPIFLSVGYSTCHWCHVMEKESFENEATAKILNDNFVAIKVDREERPD 94

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDK+YM +V A  G GGWP+SVFL+PDL P+ GGTYFPP+D  G  GF TIL  +     
Sbjct: 95  VDKLYMAFVVASSGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEVV 154

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           +KR    ++    I +L +  +AS   N+      +   +        S+DSR GGFG A
Sbjct: 155 EKRRREFETTRAQIIKLLQPETASGDVNR-----SEEVFKSIYSHKQSSFDSRLGGFGRA 209

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP+  ++  ++  +       +S +A +   M+  TL+ MA GGIHDH+G GFHRYSV
Sbjct: 210 PKFPKACDLDFLITFAAS---ENESEKAKDSIMMLQKTLESMADGGIHDHIGNGFHRYSV 266

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDILDYLRRDMIGPGGEIF 296
              WH+PHFEKMLYDQ QL   Y D   LT  K     ++  DI  Y+++     GG  +
Sbjct: 267 GSEWHIPHFEKMLYDQSQLLATYSDFHKLTERKHDNVKHVINDIYQYMQKISHKDGG-FY 325

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
           +AEDADS     ++ K EGAF  W  +E++ +LG+  I       +  +++ ++ +GN  
Sbjct: 326 AAEDADSLPNHNSSNKVEGAFCAWEKEEIKQLLGDKKIGSASLFDVVADYFDVEDSGN-- 383

Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
           ++R SDPH E K KNVL +L      A+   + + +    + E +  L++ R++RP PHL
Sbjct: 384 VARSSDPHGELKNKNVLRKLLTDEECATNHEISVAELKKGIDEAKEILWNARTQRPSPHL 443

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           D K++ SW GL I+   +A +                 ++  +Y++ AE  A FI + L 
Sbjct: 444 DSKMVTSWQGLAITGLVKAYQ----------------ATEETKYLDRAEKCAEFIGKFLD 487

Query: 470 DEQTHR------LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
           D    R             G  +   F DDYAFLI  LLDLY      ++L  A+ELQ  
Sbjct: 488 DNGELRRSVYLGANGEVEQGNQEIRAFSDDYAFLIQALLDLYTTVGKDEYLKKAVELQKI 547

Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
            D  F +  G GYF +   D  V +R+ ED DGAEP+  S++  NL+RL  I+   + + 
Sbjct: 548 CDVKFWN--GNGYFISEKTDEDVSVRMIEDQDGAEPTATSIASNNLLRLYDIL---EKEE 602

Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
           YR+ A         RL  + +A+P M  A     + S    VLVG   S       +  +
Sbjct: 603 YREKANQCFRGASERLNTVPIALPKMAVALHRWQIGSTT-FVLVGDPKSELLSETRSRLN 661

Query: 644 ASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
             +  N +V+HI           EE  S +    +      K    +C+ F C  PV   
Sbjct: 662 QKFLNNLSVVHIQS---------EEDLSASGPSHKAMAEGPKPAVYMCKGFVCDRPVKAI 712

Query: 704 ISLENLL 710
             LE L 
Sbjct: 713 QELEELF 719


>gi|125972813|ref|YP_001036723.1| hypothetical protein Cthe_0291 [Clostridium thermocellum ATCC
           27405]
 gi|281417012|ref|ZP_06248032.1| protein of unknown function DUF255 [Clostridium thermocellum JW20]
 gi|385779271|ref|YP_005688436.1| hypothetical protein Clo1313_1937 [Clostridium thermocellum DSM
           1313]
 gi|419721660|ref|ZP_14248818.1| hypothetical protein AD2_1363 [Clostridium thermocellum AD2]
 gi|419725407|ref|ZP_14252450.1| hypothetical protein YSBL_1257 [Clostridium thermocellum YS]
 gi|125713038|gb|ABN51530.1| hypothetical protein Cthe_0291 [Clostridium thermocellum ATCC
           27405]
 gi|281408414|gb|EFB38672.1| protein of unknown function DUF255 [Clostridium thermocellum JW20]
 gi|316940951|gb|ADU74985.1| hypothetical protein Clo1313_1937 [Clostridium thermocellum DSM
           1313]
 gi|380771156|gb|EIC05033.1| hypothetical protein YSBL_1257 [Clostridium thermocellum YS]
 gi|380782356|gb|EIC11996.1| hypothetical protein AD2_1363 [Clostridium thermocellum AD2]
          Length = 680

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 257/689 (37%), Positives = 367/689 (53%), Gaps = 77/689 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA++LN  FVSIKVDREERPD+D +YMT  QAL G GGWPL+
Sbjct: 53  STCHWCHVMESESFEDEEVAEILNKNFVSIKVDREERPDIDSIYMTACQALTGHGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTYFP +D+ G PG  +IL+ V + W  ++D LA+  +  +  +SE++
Sbjct: 113 IIMTPDKKPFFAGTYFPKKDRMGMPGLISILKSVHNTWVNEKDSLAKYSSKVVSVISESI 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 +   DE+ ++       Q    +D+ +GGFG+APKFP P  +  +L +  K   
Sbjct: 173 DDDYYYS--VDEITEDIFEDAFSQFKYDFDNIYGGFGNAPKFPMPHNLYFLLRYWHK--- 227

Query: 200 TGKSGEASEGQKMVLF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                 A E   +V+   TL  M  GGI+DH+G GF RYS DE+W VPHFEKMLYD   L
Sbjct: 228 ------AKEEYALVMVEKTLDSMYSGGIYDHIGFGFCRYSTDEKWLVPHFEKMLYDNALL 281

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A  YL+ +  TK+  Y+ I ++I  Y+ RDM  P G  +SAEDADS   EG    +EG F
Sbjct: 282 AIAYLETYQATKNKKYADIAKEIFTYVLRDMTSPEGGFYSAEDADS---EG----EEGKF 334

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+  E++++LGE     F ++Y +   GN            F+G N+   +N +    
Sbjct: 335 YIWSPTEIKEVLGESDGEKFCKYYNITEEGN------------FEGLNIPNLINSTIPDE 382

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            K  + L         CR+KLFD R KR  PH DDK++ +WNGL+I++ A   ++L  E 
Sbjct: 383 DKEFVEL---------CRKKLFDHREKRVHPHKDDKILTAWNGLMIAALAIGGRVLGIE- 432

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                          +Y   AE A+ FI   L      RL   +R+G +    +LDDYAF
Sbjct: 433 ---------------KYTLAAEKASEFIFSKLV-RPDGRLLARYRDGEAAFLAYLDDYAF 476

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI  L++LYE      +L  A+EL N   + F D + GG F    +   ++ R KE +DG
Sbjct: 477 LIWALIELYETTYKPMYLKKAMELTNDMIKYFWDNKKGGLFIYGSDSEQLITRPKEIYDG 536

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV+ +N +RL+ +    + +   + A    A+F +++  M         A  + 
Sbjct: 537 AIPSGNSVAALNFLRLSRLTGQQELE---EKAHQMFALFGSKIDSMPQGYAFFLTAM-LF 592

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           S      VVLVG     D +NML+     +    T I           + EEH      +
Sbjct: 593 SKSKSNEVVLVGSNEK-DTQNMLSILSEDFRPFTTSIL----------YSEEHKDLKELI 641

Query: 677 AR-NNFSA--DKVVALVCQNFSCSPPVTD 702
              +N++   +K  A VC+NF C  P+TD
Sbjct: 642 PFIDNYTTIENKPTAYVCENFVCHEPITD 670


>gi|408381411|ref|ZP_11178960.1| hypothetical protein A994_03123 [Methanobacterium formicicum DSM
           3637]
 gi|407815878|gb|EKF86441.1| hypothetical protein A994_03123 [Methanobacterium formicicum DSM
           3637]
          Length = 712

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 260/701 (37%), Positives = 366/701 (52%), Gaps = 58/701 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESF+D  +  LLN  FV +KVDREERPD+D VYMT  Q + G GGWPL+
Sbjct: 59  STCHWCHVMARESFQDPEIGDLLNQVFVPVKVDREERPDIDSVYMTVCQMITGSGGWPLT 118

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLS 136
           V ++PDLKP   GTYFP +      G + ++  V+D WD KR  L +S      +++Q+S
Sbjct: 119 VIMTPDLKPFFAGTYFPKDTGPRGTGLRDLILNVRDLWDNKRGELVKSAEELTHSLQQIS 178

Query: 137 EA-----LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
           E      +  S    +   EL +  L+   + LS ++D ++ GFG+  KFP P  +  +L
Sbjct: 179 EGPLPQTVKGSQGFPESSQELGEEILKQAYQSLSDNFDEKYTGFGNNQKFPTPHHLLFLL 238

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
            + K    TG+    +    MV  TL  M KGGI+DHVG GFHRY+VD +W VPHFEKML
Sbjct: 239 RYWKH---TGEDMALT----MVERTLDAMKKGGIYDHVGFGFHRYTVDRQWMVPHFEKML 291

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YDQ  LA  Y +AF  T    Y     ++L+Y+ RDM  P G  +SAEDADS   EG   
Sbjct: 292 YDQALLAIAYTEAFQATGKTQYRETAEEVLEYILRDMRSPEGGFYSAEDADS---EG--- 345

Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIEL 369
            +EG FY+WT  E+ D+LG +   LF E Y +   GN       D     K GKN+L   
Sbjct: 346 -EEGKFYLWTQDEIMDLLGSNDGALFSEIYSVSEEGN-----FKDEATRVKTGKNILHRT 399

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                 + KLG+  E+        R  LF  R  R  PH DDKV+  WNGLVI + A A 
Sbjct: 400 QTWDELSKKLGISTEELWWKTETARETLFHARKSRIHPHKDDKVLTDWNGLVIVALALAG 459

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
              K                R++Y+  A  A  FI   L+ +   RL+H +R+G +   G
Sbjct: 460 NSFK----------------REDYLMAAGDAVKFIMTKLHHQG--RLKHRWRDGEAAVDG 501

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
            LDDYA+LI GLL+LY+    +++L  A++L  T  E FLD + GG++ T+     +L+R
Sbjct: 502 NLDDYAYLIWGLLELYQATFQSEYLEIALKLNQTLLEHFLDHDNGGFYFTSDFTQKILVR 561

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            KE +D A PSGNSV ++NL + + I+     D     + H L  +   +   + +   M
Sbjct: 562 QKEAYDTALPSGNSVQMMNLEKFSLII----DDMKISESFHGLESYFASMITQSPSAFTM 617

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             +A +L +     VV+ G K S D + +L      Y L   ++ ++ +D   +      
Sbjct: 618 FLSAIILKIGPSFQVVICGEKDSPDTQVLLNTIQKEY-LPNVILILNSSDDSLI------ 670

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           N    S+        +  A VC N +C  PV +P  L N+L
Sbjct: 671 NQIVGSLEHKTIVNGQATAYVCGNGTCHAPVNNPDDLINIL 711


>gi|268325595|emb|CBH39183.1| conserved hypothetical protein, DUF255 family [uncultured archaeon]
          Length = 685

 Score =  423 bits (1088), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 263/708 (37%), Positives = 371/708 (52%), Gaps = 93/708 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFE++  A+LLN  F+ IKVDREERPD+D +YM  VQ + G GGWPLS
Sbjct: 50  STCHWCHVMARESFENKQTAELLNTNFICIKVDREERPDLDALYMKAVQMMAGTGGWPLS 109

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PDLKP  GGTYFPPE  +G P F  +L+ + D W +KR+ +  S     EQ++E L
Sbjct: 110 VFMTPDLKPFYGGTYFPPEPIHGLPAFNELLQTITDYWHEKRERILHSS----EQITEHL 165

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--------APKFPRPVEI-QMM 190
             S   N L +EL  + L    EQL+  +DS +GGFG+         PKFP P  +  ++
Sbjct: 166 RRSYQHNLLTEELSVDMLENAFEQLNLQFDSTYGGFGAEVAAWSVKKPKFPLPSYLFFLL 225

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           LYH +  E        S   KMV  TL  MA+GGI+D + GGFHRYS D RW VPHFEKM
Sbjct: 226 LYHHRTDE--------SYALKMVTKTLYEMARGGIYDQLAGGFHRYSTDNRWLVPHFEKM 277

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD   LA VYL A+ +T D F++ I  + LD++ R+M    G  +SA DADS +     
Sbjct: 278 LYDNALLAQVYLWAYQVTGDKFFAQIATETLDWVLREMTDSNGGFYSAIDADSEDI---- 333

Query: 311 RKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
              EGAFYVW+  E+  +L  EH  +F  +Y +   GN +            GK+VL   
Sbjct: 334 ---EGAFYVWSPSEIISVLSEEHGEVFCRYYGVTQQGNFE-----------GGKSVLHVA 379

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
           ND     +           I+   ++KL + R++R RP  DDK+I  WN L+IS+FA   
Sbjct: 380 NDEVNKDTA---------GIINRSKQKLLEARNRRIRPATDDKIITGWNSLMISAFALGY 430

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
           ++L+                 + +++ A SA  FI   L  E   +L   +R G +   G
Sbjct: 431 QVLRE----------------RRFLDAATSATQFILNKLNKEG--QLFRRYRAGEAAITG 472

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
            LDD+AFLI+ LLD+YE     KWL  A++  +   ELF D+   G+F     +  +   
Sbjct: 473 TLDDHAFLIAALLDIYEASFDLKWLREALQRNDRVVELFWDKANAGFFFNRYGETDLPAA 532

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
           +KE +DG  PSGNS++  NL+RLA++   + ++  R  A+     F  +L+   +    M
Sbjct: 533 IKEAYDGPIPSGNSIAAQNLIRLAAL---TDNEELRILAKDLFRTFGAQLEQSPLEHTQM 589

Query: 610 CCAADM-LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
            CA D  LS P +  VV+   K  ++     A   + + L   VI    +          
Sbjct: 590 LCALDFYLSSPMQ--VVIASQK--IEEVQAFAVEISRHFLPNQVIAFTSS---------- 635

Query: 669 HNSNNASMARNNFSADKV------VALVCQNFSCSPPVTDPISLENLL 710
             S+N    R     DKV         +C+N++C  P+TD   L  +L
Sbjct: 636 --SDNELSGRIPLITDKVAVQGKPTVYICENYACKAPITDLYDLRRVL 681


>gi|399888568|ref|ZP_10774445.1| hypothetical protein CarbS_08603 [Clostridium arbusti SL206]
          Length = 679

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 253/692 (36%), Positives = 364/692 (52%), Gaps = 70/692 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED  VA+LLN +F++IKVDREERPD+D +YM+  QA+ G GGWP+++
Sbjct: 54  TCHWCHVMEKESFEDNEVAELLNKYFIAIKVDREERPDIDNIYMSVCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++ D KP   GTY P + +YG  G   +L K+   W + ++ L +S    ++ L + + 
Sbjct: 114 IMTSDKKPFFAGTYLPKKTQYGHMGLMELLNKINKLWIEDKNKLVESSNNIVDFLQDQIV 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                     E+ +  +    E L  SY+  FGGF S+PKFP P  +  +L + +   D 
Sbjct: 174 HKKG------EISEKIVNDAYESLRDSYNPVFGGFSSSPKFPTPHNLNFLLRYYRAKGD- 226

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     +MV  TL  M  GGI DH+G GF RYSVD +W VPHFEKMLYD   LA +
Sbjct: 227 ------KYALQMVENTLNSMYSGGIFDHIGFGFSRYSVDSKWLVPHFEKMLYDNALLAII 280

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y + + +T    Y  I   IL+Y+ RDM    G  +SAEDADS   EG     EG FYVW
Sbjct: 281 YTETYQITHKDRYREIAMKILNYILRDMTSKQGGFYSAEDADS---EGV----EGKFYVW 333

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASK 378
             KE++ +LGE A  F EHY +K  GN            F+GKN+  LI  +        
Sbjct: 334 DKKEIKSVLGEDADFFNEHYNIKSKGN------------FEGKNIPNLIGEDLEELEDES 381

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           +   L+         + KLF  R KR  PH DDK++ SWNGL+I++ A A +        
Sbjct: 382 IKSKLDG-------LKEKLFSYREKRIHPHKDDKILTSWNGLMIAAMAYAGR-------- 426

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                 V G +R  Y E A  + SFI  +L + +  RL   +R+G +   G+LDDYAFL+
Sbjct: 427 ------VFGIER--YKEAASKSISFISHNLVNHKG-RLLCRYRDGEAANLGYLDDYAFLV 477

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL+++YE    + +L  AIEL +   + F D + GG F    +   ++L+ KE +DGA 
Sbjct: 478 FGLIEMYEATFESFYLRKAIELNDEMVKYFWDEQNGGLFFYGKDSEELILKTKEIYDGAI 537

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV+ +N++RL+ I    K +   Q A      F  ++ ++ +A  +   +A + S 
Sbjct: 538 PSGNSVAAMNIIRLSRITGDKKLE---QKAGEIFNTFAEKINEVPLAY-VNTISAFLTSK 593

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
            S  HVV+ G K   + + M+   +  +     +I  D  +++E+        NN  M +
Sbjct: 594 ISETHVVIAGDKDHTNTKAMINEINKKFLPFSEIIFND--ESKEIYKLIPFIKNNV-MVK 650

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           N     K  A VC+N SC  P  D     NL+
Sbjct: 651 N-----KTTAYVCKNNSCLAPTNDLQEFSNLI 677


>gi|405123962|gb|AFR98725.1| cold-induced thioredoxin domain-containing protein [Cryptococcus
           neoformans var. grubii H99]
          Length = 745

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 267/708 (37%), Positives = 389/708 (54%), Gaps = 42/708 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHV+  ESFEDE  AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S
Sbjct: 60  SACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMS 119

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +F++P L+P   GTYFP      RP F  +L K+ + W++ R+   + G   IE L +  
Sbjct: 120 IFMTPKLEPFFAGTYFP------RPNFHQLLNKIHEVWEEDREKCEKMGKGVIEALKDMS 173

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA------PKFPR-PVEIQMMLY 192
               +S  L   L  +       QLS   D+R+GGF +A      PKFP   + ++ +  
Sbjct: 174 DTGRTSESLSQLLSSSPASKLFAQLSTMNDTRYGGFTNAGSSTRGPKFPSCSITLEPLAR 233

Query: 193 HSKKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
            +       ++ E  E  ++M +  L+ M  GGI D VGGG  RYSVDE+W VPHFEKML
Sbjct: 234 LASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKML 293

Query: 252 YDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           YDQ QL +  LD   L     +D    Y +  DIL Y  RD+  P G  +SAEDADSAE 
Sbjct: 294 YDQAQLVSSCLDFARLYPANHQDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEY 353

Query: 307 EGATRK--KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
           +GA +    EGAFY+W   E+++ILG+ A LF   + ++P GN ++  + D H E +GKN
Sbjct: 354 KGAKKSVLPEGAFYIWKKTEIDEILGDDAPLFDSFFGVEPDGNVNI--IHDSHGEMRGKN 411

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           +L +       A + G   ++  +I+ E   KL   R +R RP LDDK++ +WNGL++++
Sbjct: 412 ILHQHKTYEEVALEFGKREDQAKDIIIEACEKLRLKREERERPGLDDKILTAWNGLMLTA 471

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            ++AS +L S    +    P            A    +F++ H++D  T  L  S+R G 
Sbjct: 472 LSKASTLLPSSYGISSQCLP-----------AALGIVNFVKSHMWDPSTRTLTRSYREG- 519

Query: 485 SKAP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 543
            K P    DDYAFLI GLL+LYE       +++A ELQ  QDELF D + GGYF  + ED
Sbjct: 520 -KGPQAQTDDYAFLIQGLLNLYEATGDESHVLFAEELQKRQDELFWDDDDGGYF-ASAED 577

Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 603
             VL+R+K+  DGAEPS  +VS  NL R + +++ S+ + Y   AE +       +    
Sbjct: 578 AHVLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLS-SEFENYEARAEATFLSMGPLITQAP 636

Query: 604 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM 663
            AV         L    R+ V+++G  +    +  L AA  +Y  N+ ++HI P    + 
Sbjct: 637 RAVGYAVSGLIDLEKGYRE-VIVIGSANDEMIKEFLKAARETYFSNQVIVHIQPEKLPK- 694

Query: 664 DFWEEHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
              E++    A +       +K  +L VC+  +C  PV D    +NLL
Sbjct: 695 GLAEKNEVVKALINDVESGKEKEASLRVCEGGTCGLPVKDLEGAKNLL 742


>gi|347754417|ref|YP_004861981.1| thioredoxin domain-containing protein [Candidatus
           Chloracidobacterium thermophilum B]
 gi|347586935|gb|AEP11465.1| Thioredoxin domain containing protein [Candidatus
           Chloracidobacterium thermophilum B]
          Length = 691

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 255/692 (36%), Positives = 366/692 (52%), Gaps = 58/692 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME E FE+  +A L+N+ FV+IKVDREERPD+D +YM  VQ + G GGWPL+
Sbjct: 56  SACHWCHVMEHECFENPSIAALMNELFVNIKVDREERPDLDTLYMNAVQLMTGRGGWPLT 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P  GGTYFPPED+   PGF  ILR V DA+ ++R  + QS A    +L    
Sbjct: 116 VFLTPDGEPFYGGTYFPPEDRGRMPGFPRILRSVADAYRQRRQDVRQSIAEITAELRRIH 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                +  L  E+  +A R    +LS  +D   GGFG APKFP  + +  +L + +    
Sbjct: 176 EPLDGARTLSPEILTDAYR----RLSTRFDHVHGGFGGAPKFPNSMLLSFLLRYWR---- 227

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
              +GE     +MV  +L  MA GG++DH+GGGFHRYS D++W VPHFEKMLYD   LA 
Sbjct: 228 --LTGEL-HALEMVELSLDKMASGGMYDHLGGGFHRYSTDDQWLVPHFEKMLYDNALLAR 284

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+A+  T    Y  I  + LDY+ R+M  P G  ++ +DADS   EG    +EG F+V
Sbjct: 285 TYLEAWQATGKPRYRQIVEETLDYVVREMTAPTGGFYATQDADS---EG----EEGRFFV 337

Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +E+  +L E  A L + ++ +   GN           E  GK VL         A  
Sbjct: 338 WTPEEINTLLDEADADLVRRYFDVTEEGNF----------EGTGKTVLSTPLPLETVARL 387

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             +  E   ++L   +R LF+ R +R +P  D+K + +WNGL++ SFARA+ +L      
Sbjct: 388 KEVTPEHLEHVLARAKRILFEAREQRVKPARDEKCLAAWNGLMLYSFARAAAVL------ 441

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     +R +Y  VAE  A+F+   +Y +    L  S ++G +K PG+ +DYA   
Sbjct: 442 ----------ERDDYRAVAERNAAFVLGTMYVDGI--LYRSHKDGQNKFPGYQEDYACYA 489

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL LYE     K+   A EL       F D +GGG+F T      ++ RVK+  D A 
Sbjct: 490 EGLLALYEATGNVKYFCAARELTEAMLAQFDDPQGGGFFFTGDRHEQLITRVKDVFDNAT 549

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV+V  L+RLA +    +   YR+ AEH L    + +  M      +  A D   +
Sbjct: 550 PSGNSVAVEVLLRLALLTGEQR---YRERAEHILQTLSSSMAKMPSGFGQLLGALDFY-L 605

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
            S + +V+VG   + +   +      ++  ++ V  ++P D        +H      +A+
Sbjct: 606 ASVREIVIVGPPDAAETRELRRVVEEAFRPHRVVALLNPEDG-------DHAQYVPLVAQ 658

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +  A VCQNF+C  PVT P +L   L
Sbjct: 659 RTMHNGQPTAYVCQNFTCQAPVTTPDALRAQL 690


>gi|118443135|ref|YP_878469.1| thymidylate kinase [Clostridium novyi NT]
 gi|118133591|gb|ABK60635.1| thymidylate kinase [Clostridium novyi NT]
          Length = 678

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 245/690 (35%), Positives = 375/690 (54%), Gaps = 73/690 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVME ESFEDE VA++LND ++SIKVDREERPDVD +YMT+ QA+ G GGWPL++
Sbjct: 61  SCHWCHVMENESFEDEEVAEILNDNYISIKVDREERPDVDNIYMTFCQAVTGSGGWPLTI 120

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD +P   GTYFP +  YGRPG   IL ++ D W+  ++ +  S    ++ L E   
Sbjct: 121 IMTPDQRPFFAGTYFPKKRMYGRPGLIQILNQIADEWEINKNNIINSSDELLKTLKEH-E 179

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A   S ++ +E+ Q+A+    E++   YD  +GGFG APKFP P ++ ++L + K+  D 
Sbjct: 180 AQDKSGEINEEVLQDAI----EEMKYYYDDVYGGFGIAPKFPTPHKLMLLLTYYKEYNDK 235

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                      +V  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA V
Sbjct: 236 NV-------LHIVEHTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYV 288

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A+ LT   FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FY+W
Sbjct: 289 YTEAYQLTGKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYLW 341

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
              E+E+IL E         Y K     D++R+ +    F+G N+           + +G
Sbjct: 342 KLNEIENILKED--------YKKFCNTYDITRVGN----FEGSNI----------PNLIG 379

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             +E  ++ L   R KLF +R KR  P  DDK++ +WN L+IS+ A   ++ ++      
Sbjct: 380 KDIEN-IDKLEYIREKLFQIREKRIHPFKDDKILTAWNALMISALAYGGRVFEN------ 432

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                     KEY++ A+ A  FI+ +L   +  RL   FR G +    +L+DY+FL+  
Sbjct: 433 ----------KEYIKRAKDAYDFIKNNLI-RKDGRLLARFRYGEAAYIAYLEDYSFLVWA 481

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           L++LYE    +K+L  A+  Q+   +LF D +  G+F++  +   ++L +K+ +D A PS
Sbjct: 482 LIELYEATFESKFLKEALYFQDEMIKLFWDEKSYGFFHSGKDGEKLILNLKDSYDTAIPS 541

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
           GNSV+ +NL++L+ I   +      + A   +  F   +K+   +  +   A      PS
Sbjct: 542 GNSVAAMNLIKLSKITGYNS---LVEKAYKMIKGFGGNIKESLQSHSVFLMAYMNYIRPS 598

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
           R+ +++  +K      +M+   +  + +  T + ++    E++           S+    
Sbjct: 599 RQ-IIIASNKEDKVLNDMIREVNKKF-MPFTTVLLNDGTLEDII---------PSIKNEK 647

Query: 681 FSADKVVALVCQNFSCSPPVTDPISLENLL 710
              +K  A VC+NFSC+ PV +      LL
Sbjct: 648 IIDNKTTAYVCENFSCNRPVNNVEDFRKLL 677


>gi|357039905|ref|ZP_09101696.1| hypothetical protein DesgiDRAFT_2812 [Desulfotomaculum gibsoniae
           DSM 7213]
 gi|355357268|gb|EHG05044.1| hypothetical protein DesgiDRAFT_2812 [Desulfotomaculum gibsoniae
           DSM 7213]
          Length = 688

 Score =  421 bits (1082), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 260/693 (37%), Positives = 367/693 (52%), Gaps = 55/693 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED+ VA  LN  FVSIKVDREERPD+D++YMT  QAL G GGWPL+
Sbjct: 48  STCHWCHVMERESFEDQEVADALNHHFVSIKVDREERPDIDQIYMTVCQALTGQGGWPLT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V ++PD KP   GTYFP   ++GR G   I+ +V D W   RD L Q+     EQ+    
Sbjct: 108 VIMTPDKKPFFAGTYFPKRSRWGRAGLLDIIEQVADKWTNDRDKLIQASDMITEQVQ--- 164

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                   L DE   +      +Q  +S+D ++GGFG APKFP P  +  ++ + K    
Sbjct: 165 --FTPGGYLADEPLADISARGYKQFRQSFDKQYGGFGLAPKFPTPHNLLFLMRYWK---- 218

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             ++GE +    M   TLQ + +GGI+DH+G GF RYS DE+W VPHFEKMLYD   LA 
Sbjct: 219 --QNGEEA-ALNMAKKTLQSIYRGGINDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAL 275

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            +L+ +  T++ FY+   R I  Y+ RDM  P G  +SAEDADS   EG     EG FYV
Sbjct: 276 AFLEVYQATQNDFYAGAARQIFTYVLRDMTHPEGGFYSAEDADS---EGV----EGKFYV 328

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+  EV  +LG E+  ++ + Y +  +GN +   +          N++  L +    A K
Sbjct: 329 WSPAEVYQVLGRENGDIYCKVYNITESGNFESKSIP---------NLISALPEE--HARK 377

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG+     L +L E R+KLF+ R++R  P  DDKV+ +WNGL++++ AR + +L      
Sbjct: 378 LGIETRALLQLLEESRQKLFNHRARRVHPFKDDKVLTAWNGLMMAALARGAAVL------ 431

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                   G  R  Y + A  A  FI RH    +  RL   +R+G S   G+LDDYAF+I
Sbjct: 432 --------GDVR--YRDAAVKAEQFI-RHKLQRRDGRLLARYRDGESDLNGYLDDYAFVI 480

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL+LY       +L  AI+L +   +LF D+E GG+F    +   ++ R KE +DGA 
Sbjct: 481 WGLLELYRATFQAVYLSRAIDLTHHVRDLFWDQEQGGFFFYGTDSEQLIARPKEIYDGAM 540

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV   NL++LA+I   S+ +   + AE  + +F                A    + 
Sbjct: 541 PSGNSVMAANLLQLAAITGNSELE---ELAERQIDIFAGTAAQHPRGYAYFLTALLFATG 597

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P+   +V+ G +       ML  A   Y     +I+    + +            A   R
Sbjct: 598 PT-SEIVITGQRDDPQVAEMLRLAQRQYAPGAVLIY--RPEGDGDQQDGGQIGKLAPFTR 654

Query: 679 NNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
              S D +  A VC++ +C  PVT+   L +LL
Sbjct: 655 EQKSIDGRATAYVCRDRACREPVTETEVLGSLL 687


>gi|298243436|ref|ZP_06967243.1| protein of unknown function DUF255 [Ktedonobacter racemifer DSM
           44963]
 gi|297556490|gb|EFH90354.1| protein of unknown function DUF255 [Ktedonobacter racemifer DSM
           44963]
          Length = 719

 Score =  421 bits (1081), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 257/706 (36%), Positives = 374/706 (52%), Gaps = 67/706 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE+  +A L+N  FVSIKVDREERPD+D +YM  VQA+   GGWP++
Sbjct: 66  SACHWCHVMERESFENPAIAALMNQHFVSIKVDREERPDIDNIYMQAVQAMTQQGGWPMT 125

Query: 80  VFLSPDLKPLMGGTYFPPEDK----YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
           VFL+PD +P  GGTYFPP+D+    Y  PGF+ +L  +   + ++R+ + +      + L
Sbjct: 126 VFLTPDGRPFYGGTYFPPDDRHHGQYVMPGFRRVLLSLAQLYAQEREKIEEQADELAQFL 185

Query: 136 --SEALSASASSNKLPDELPQNALRLCAEQ-LSKSYDSRFGGFGSAPKFPRPVEIQMM-- 190
              E +      N     LPQ  L + A Q L+  +D++ GGFG APKFP  + ++ +  
Sbjct: 186 RQREGMPLRRRENAT-QGLPQLDLLVVASQALANDFDAQHGGFGGAPKFPHSMALEFLLR 244

Query: 191 --LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
             L+ SK+    G+         MV  +L+ MAKGG++D +GGGFHRYSVD  W VPHFE
Sbjct: 245 VYLHRSKQELSLGQLPGNLTELGMVESSLEHMAKGGMYDQLGGGFHRYSVDAEWLVPHFE 304

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYD   L+  YL A+ +T   FY  I  + LDY+ R+M+ P G  +S +DADS   EG
Sbjct: 305 KMLYDNALLSCAYLAAYLVTGKPFYRRIVEETLDYVAREMVSPEGGFYSTQDADS---EG 361

Query: 309 ATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
                EG F++W   EVE +L    A +F  +Y +   GN            F+GKN+L 
Sbjct: 362 V----EGKFFLWQPAEVEALLNAPDAAIFMRYYDISARGN------------FEGKNILH 405

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
              +    A +L + + +   I+   R +LF  R  R +P  D+K++ SWNGL++ SFA 
Sbjct: 406 INVEVEQLAKELTLSVPEVEQIVKSGREQLFKARELRVKPGRDEKILTSWNGLMLRSFAE 465

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
           A++ L                 R +Y+E+A + A+F+ R L   Q  RL  ++++G ++ 
Sbjct: 466 AARHL----------------GRGDYLEIAINNANFLLRSL--RQDGRLLRTYKDGRARL 507

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
            G+L+DYAFL  GLL LY+     +W   A  L +    LF D + GG+F+T  +   ++
Sbjct: 508 KGYLEDYAFLADGLLALYQACFDPRWFAEARTLMDQAIALFADEQNGGFFDTGSDHEELV 567

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
            R K+  D A PSGNSV+   L+RLA++   S  D YR+ AE  L      L D+ +  P
Sbjct: 568 TRPKDIMDNATPSGNSVAADVLLRLAAL---SGEDAYRERAEAYL----QSLADVMVQHP 620

Query: 608 LM---CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
                   A   S+   + + L+G   + D + +L   +  Y  N  +    P D E + 
Sbjct: 621 QFFGQALGALDFSLTMAREIALLGSPEAADTQALLNVVNTRYLPNSVLACARPDDKEAI- 679

Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                      +A       K  A VCQNF+C  PVT   +L  LL
Sbjct: 680 ------RAVPLLAERTMQEGKATAYVCQNFACQAPVTTAEALRQLL 719


>gi|347753644|ref|YP_004861209.1| hypothetical protein Bcoa_3257 [Bacillus coagulans 36D1]
 gi|347586162|gb|AEP02429.1| hypothetical protein Bcoa_3257 [Bacillus coagulans 36D1]
          Length = 689

 Score =  420 bits (1080), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 267/703 (37%), Positives = 380/703 (54%), Gaps = 75/703 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+E VA++LN+ FV+IKVDREERPD+D +YM   Q + G GGWPLS
Sbjct: 53  STCHWCHVMERESFENEEVARILNEKFVAIKVDREERPDIDAIYMLVCQMMTGQGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+  P   GTYFP E +YG PGFK +L  +   + +  D +   G     Q+ +AL
Sbjct: 113 VFLTPEKVPFYAGTYFPRESRYGMPGFKEVLLYLSQQYTENPDRIKDVGV----QVKQAL 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            AS    K    L +  +    +   + +D R+GGFG APKFP P  +  +L ++K  E+
Sbjct: 169 EASREKGK-QTALTKETIGRAFQAYKQGFDPRYGGFGKAPKFPMPHSLVFLLMYAKFYEN 227

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                 A++       TL  +A+GGI+DH+G GF RYSVDE++ VPHFEKMLYD   L  
Sbjct: 228 RDALAMATK-------TLDGLARGGIYDHIGYGFSRYSVDEKFLVPHFEKMLYDNALLVL 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y DAF +TK+  Y  I  +I+ Y+ RDM  P G  +SAEDADS   EG    KEG FYV
Sbjct: 281 AYTDAFRMTKNAQYKKITEEIITYVLRDMAHPDGGFYSAEDADS---EG----KEGKFYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-AS 377
           WT  EV+D+LGE    LF + Y +   GN            F+GKN+  ++     S A 
Sbjct: 334 WTPAEVKDVLGEQLGTLFCQAYGITGQGN------------FEGKNIPNQITTHLESIAK 381

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K G+        L   R+ LF  R KR RP  DDK++ +WNGL+I++ A+A ++      
Sbjct: 382 KEGISPAALAEKLETARQSLFQHREKRVRPFRDDKILTAWNGLMIAALAKAGRV------ 435

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
              F+ P        Y++ AE A SFIR +L   Q  R+   +R+G  K  GF+D+YAFL
Sbjct: 436 ---FHQP-------SYVQAAEKAVSFIRDNLI--QNDRVMVRYRDGEVKNKGFIDEYAFL 483

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           + G ++LYE      +L  A +L     +LF D  GGG+F +  +D  +L+R KE +DGA
Sbjct: 484 LWGYMELYESTFAPFYLAEAKKLAGNMIDLFWDGHGGGFFFSGNDDEPLLVRQKESYDGA 543

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+   L+RL+ +      +   +  +    VF   + D   A  +M  A  M +
Sbjct: 544 LPSGNSVAACQLLRLSKLTGDFTLE---EKVQQLFQVFSKDIHDEPTAHAMMLQAG-MHA 599

Query: 618 VPSRKHVVLV---GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD----FWEEHN 670
             + K VV+V     K  VDF N +     ++    +V+ +   +  ++     F E++ 
Sbjct: 600 QQATKEVVIVMDDETKEVVDFINHI---QKNFYPGISVMVVKRREQAKLSKIASFIEDYA 656

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
             N           +    VC+NFSC+ P  D  +  +LL +K
Sbjct: 657 MING----------QPTIYVCENFSCNQPTNDFQTAMDLLFKK 689


>gi|410671814|ref|YP_006924185.1| hypothetical protein Mpsy_2614 [Methanolobus psychrophilus R15]
 gi|409170942|gb|AFV24817.1| hypothetical protein Mpsy_2614 [Methanolobus psychrophilus R15]
          Length = 703

 Score =  420 bits (1080), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 250/705 (35%), Positives = 375/705 (53%), Gaps = 53/705 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFED  VA+L+N+ FV IKVDREERPD
Sbjct: 38  GEEAFNKAKQDDKPIFLSIGYSTCHWCHVMERESFEDPQVAELMNEAFVPIKVDREERPD 97

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +YM+  QAL G GGWPLS+ ++PD KP M  TY P E +YG  G   I+  V + W 
Sbjct: 98  IDTIYMSVCQALTGRGGWPLSIIMTPDKKPFMAATYIPRESRYGMAGMLDIVPAVSNMWT 157

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           ++R+ L  +     E++  A+S  A  +     L ++ L    + L  S+D    GFG+A
Sbjct: 158 RQREELIANA----EEIVSAISGGARDSTEGPGLDESTLDRTYQLLRSSFDPSSAGFGNA 213

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  ++ +L + K+     K  +A E   M   TL+ M KGGI+DH+G GFHRYS 
Sbjct: 214 PKFPTPHHLKFLLRYWKR----SKEDKALE---MAEETLKAMRKGGIYDHIGFGFHRYST 266

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D RW VPHFEKMLYDQ  ++   ++ +  T++  Y     ++  Y+ RDM  P G  +SA
Sbjct: 267 DSRWLVPHFEKMLYDQALISIALVETYQATQNPEYRENAEEVFSYVLRDMHSPEGGFYSA 326

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS +       +EG FY+WT +E+ED+LGE  A LFKE ++  P GN  L   S  H
Sbjct: 327 EDADSED-------EEGRFYLWTEQELEDVLGEMDAGLFKEVFHTSPGGNF-LDEASMTH 378

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
               G+N+L        +A + G   +++   L   RRKLF+ R  R  P  DDK++  W
Sbjct: 379 T---GRNILHLEESLREAAERRGEDYDRFRQSLESSRRKLFEHREMRVHPSKDDKIMTDW 435

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           N L+I + ++A++                  D   Y + A   A FI   +      RL 
Sbjct: 436 NSLMIVALSKAARAF----------------DEPAYAQEAALTADFILSKMISPNG-RLF 478

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           H +R+G     GFLDDYAF I GL++LY+    T++L  A+   +     F D   GG+F
Sbjct: 479 HRYRDGEVAVEGFLDDYAFFIWGLIELYQATFNTEYLRNALRFNDQLILHFRDSIHGGFF 538

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
           +T  +   +++R KE +DGA PSGNSV  +NL+ L  I   +  +   + A   + +F  
Sbjct: 539 HTADDSEKLIMRSKEIYDGAIPSGNSVCALNLLHLGRITGNTDLE---KKAYEIMQLFSG 595

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
           ++  M +    + CA D  + PSR+ +V+ G   S + + +++  +  +  NK ++    
Sbjct: 596 QVSKMPVGYTQLMCALDFAAGPSRE-IVVAGDPESEETQGIISDINREFVPNKVILLKPE 654

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               E+    E+ S+ +          +    +C+N++C+ P TD
Sbjct: 655 GRETEISAIAEYVSDMS------MKDGRTTVHICRNYNCNLPSTD 693


>gi|407473332|ref|YP_006787732.1| thioredoxin domain-containing protein [Clostridium acidurici 9a]
 gi|407049840|gb|AFS77885.1| thioredoxin domain-containing protein [Clostridium acidurici 9a]
          Length = 682

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 251/697 (36%), Positives = 382/697 (54%), Gaps = 77/697 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED+ VA++LN +F+SIKVDREERPD+D +YM + QA+ G GGWP+++
Sbjct: 54  TCHWCHVMERESFEDDEVAEVLNKYFISIKVDREERPDIDSIYMNFCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP + GTY+P    +GR G   +L KV + W   +D L  S    +E +   + 
Sbjct: 114 IMTPDKKPFIAGTYYPKHSMHGRIGIIELLNKVNEKWKSNKDDLINSSEEILEFMKTNIV 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           AS   N L  E  +NA  L    L  S+D  +GGFG APKFP P  +  +L + K     
Sbjct: 174 ASEQGN-LDMEDIENAFNL----LKNSFDPEYGGFGKAPKFPTPHNLNFLLRYYK----- 223

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
              G+ S   ++V  TL+ M KGGI DH+G GF RYSVDE+W VPHFEKMLYD   LA  
Sbjct: 224 -VKGDES-ALEVVEKTLESMYKGGIFDHIGYGFARYSVDEKWLVPHFEKMLYDNALLAVA 281

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y++A+ +TK   Y  I   I +++ R+M    G  +SA DADS   EG     EG FY++
Sbjct: 282 YIEAYQITKRDLYKEIAEKIFEFIEREMTSEEGGFYSAIDADS---EGV----EGKFYLF 334

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
              E+ + LG E + LF  +Y +   GN            F+GKN+         +    
Sbjct: 335 DHSEISEQLGLEDSELFAHYYDITYDGN------------FEGKNI--------PNLIIT 374

Query: 380 GMPLEKYLNILGE----CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           G+P     ++L E    C +KL+  R+KR  PH DDK++ SWNGL+I + A   ++ K +
Sbjct: 375 GLPNMDTNSVLQERLRACIKKLYTYRNKRVYPHKDDKILTSWNGLMIGALALGGRVFKDD 434

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                           +Y+E AE +A+FI  +L D +  RL   +R+G +K   +L+DYA
Sbjct: 435 ----------------KYIERAERSANFILENLIDREG-RLLARYRDGETKYKAYLEDYA 477

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           +L+ GL++LY+     ++L  AI+L     +LF D   GG F    +   ++L+ KE +D
Sbjct: 478 YLVHGLIELYQSTFKMEYLEKAIKLNQDMLDLFWDDNEGGLFIYGKDSEQLVLQHKEIYD 537

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM--AVPLMCCAA 613
           GA+PSGNSV+ +NL+RL+ I+     +   + ++  L  F   +K+  +  +  LM C  
Sbjct: 538 GAQPSGNSVASLNLIRLSKILEDPSLE---EKSKAILKAFGGNVKNTVIGHSYLLMSC-- 592

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
            + ++ S + +V++G+K+  D + M+   + ++    TV+  + ++ EE++         
Sbjct: 593 -LFNIVSTQEIVILGNKNDSDTQEMIDKVNDNFTPFTTVVLSNNSE-EELNVI------- 643

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +       DK  A +C+NF+C+ P  D      LL
Sbjct: 644 PRLKDYKKVEDKTTAYICKNFTCNDPTADVEQFSGLL 680


>gi|345302921|ref|YP_004824823.1| hypothetical protein Rhom172_1056 [Rhodothermus marinus
           SG0.5JP17-172]
 gi|345112154|gb|AEN72986.1| protein of unknown function DUF255 [Rhodothermus marinus
           SG0.5JP17-172]
          Length = 699

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 263/690 (38%), Positives = 365/690 (52%), Gaps = 50/690 (7%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF+DE VA+LLND F++IKVDREERPD+D +YMT  Q + G GGWPL++ 
Sbjct: 50  CHWCHVMAHESFQDEEVARLLNDAFINIKVDREERPDIDHLYMTVCQMVTGHGGWPLTII 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           ++PD KP    TY P   +YGRPG   I+ ++K+AW + RD +  S       L + +S 
Sbjct: 110 MTPDKKPFFAATYIPKRSRYGRPGLLEIIPRIKEAWQQHRDEIIASAEKLTGTLQKVMSF 169

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            A S  +  E  + A R    +L   +D + GGFG APKFP P  +  +L +        
Sbjct: 170 EAPSQVIDAEWLEIAYR----RLDDIFDRKHGGFGHAPKFPTPHTLLFLLRYWH------ 219

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
           +SGEA   Q MV  TL  M  GGI+DHVG GFHRY+ DE W VPHFEKMLYDQ  L   Y
Sbjct: 220 RSGEAHALQ-MVEHTLVQMRPGGIYDHVGFGFHRYATDEAWRVPHFEKMLYDQALLTMAY 278

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
            +A+  T + FY    R+IL Y+ RD+  P G  +S+EDADS   EG    +EG FYVWT
Sbjct: 279 TEAYQATGNPFYERTAREILTYVLRDLRAPEGAFYSSEDADS---EG----EEGKFYVWT 331

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            +E+ + LG E A L  E + + P GN +     +   E  GKN+L       A A + G
Sbjct: 332 VEELREALGPELAPLAIELFNVNPEGNYE----EEATGERTGKNILYLTRPPKALARERG 387

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
              E+    L E R++LF  R++R RP  D+K++  WNGL+I++ ARA+++         
Sbjct: 388 WTPEELEAKLEEIRQRLFAYRAQRVRPGRDEKILTDWNGLMIAALARAAQVF-------- 439

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   D   Y+E A +AA F+ R +   +  RL H +R+G +  PG LDDYAFL  G
Sbjct: 440 --------DEAAYVEAARAAADFLLRTMRTPEG-RLWHRYRDGEAGIPGMLDDYAFLTWG 490

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LLDLYE      +L  A+ L +     F D   G ++ T  +  S+++R +E  D A PS
Sbjct: 491 LLDLYEATFEESYLETALALTDQTLAHFWDPR-GVFYMTPDDGESLIVRPRETLDNALPS 549

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
           GN+V+++NLVRL  +   +    Y ++A+  +  F   +K        M  A D+   P 
Sbjct: 550 GNAVALMNLVRLGHMTGRT---VYEEHADAMIRFFSGPVKQQPPIFTGMLVAIDLAFGPI 606

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
            + +VL G         ML   H  Y   K ++   P         E        +A   
Sbjct: 607 YE-LVLAGEPDDPTLREMLRTIHRRYLPRKVLLLRRPGAAG-----ERLVRLAPFVAAQA 660

Query: 681 FSADKVVALVCQNFSCSPPVTDPISLENLL 710
               +  A VC ++ C  PVTDP +L   L
Sbjct: 661 LLDGRATAYVCHDYRCEQPVTDPEALARQL 690


>gi|91204070|emb|CAJ71723.1| conserved hypothetical protein (thioredoxin) [Candidatus Kuenenia
           stuttgartiensis]
          Length = 758

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 259/720 (35%), Positives = 377/720 (52%), Gaps = 64/720 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVM  ESFED  VA+L+N+ F+ IKVDREERPD
Sbjct: 94  GPEAFEKARKENKPIFLSIGYSTCHWCHVMAHESFEDPEVARLMNEVFICIKVDREERPD 153

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +YM   Q + G GGWPL++ ++PD KP   GTY  P+  YGR G   ++ ++K+ W+
Sbjct: 154 IDNIYMRVCQMMTGSGGWPLTIVMTPDKKPFYAGTYI-PKKSYGRIGMLDLVPRIKELWN 212

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +   + +S       L +  S   S  +    L  + L+   E L++ +  + GGF ++
Sbjct: 213 IQHADIQKSANLITASLGQ-FSHDPSEAR----LDASTLKAAYELLARRFSEQHGGFSTS 267

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L + K       +GE +   +MV+ TL  M KGGI+DH+G GFHRYS 
Sbjct: 268 PKFPSPQNLLFLLRYWK------STGEGN-ALRMVVKTLHSMRKGGIYDHIGYGFHRYST 320

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W VPHFEKMLYDQ  LA  Y +A+  T    +    ++I  Y+ RDM  P G   SA
Sbjct: 321 DPEWLVPHFEKMLYDQAMLAMAYTEAYLATGRKEFGETAKEIFAYVMRDMTDPKGGFCSA 380

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG    KEG FYVWT +E+   L E  A L    + ++  GN          
Sbjct: 381 EDADS---EG----KEGKFYVWTEEEIRHALKEDDANLIINVFNIEKAGNFK-------- 425

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE----CRRKLFDVRSKRPRPHLDDKV 413
           +E  G+N    +     S +++ +  +  L+ L E     RRKLF VRSKR RPH DDK+
Sbjct: 426 DEIAGRNTGDNILHLKKSLAEIALENKTSLDELKERVETARRKLFAVRSKRIRPHKDDKI 485

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
           +  WNGL+I++ A+ ++                  D  EY+  A+ AA FI   +   Q 
Sbjct: 486 LTDWNGLMIAALAKGAQAF----------------DAPEYLAAAKRAADFILSDM-RRQD 528

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
            RL H +R G +  P F DDYAF I GLL+LYE      +L  A++L +   + F D + 
Sbjct: 529 GRLLHRYRGGQAGIPAFADDYAFFIWGLLELYETNFNVNYLRTALDLNSDMIKHFWDNQN 588

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           GG++ T  +   +++R KE +DGA PSGNSV+ +NL RLA I A  + +   + A  ++ 
Sbjct: 589 GGFYFTADDAEDLIVRQKEVYDGAIPSGNSVAALNLFRLARITADPELE---EKANKTML 645

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            F T +K M      M         P+ + +++ G+  +VD  +ML      +  NK V+
Sbjct: 646 AFSTEVKKMPAGYTQMMIGLSFGIGPAYE-IIIAGNPRAVDTRDMLNTLRRHFIPNKIVL 704

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
            + P D E  +      +  A    +    D K  A +C++++C  PVTD   +  LL E
Sbjct: 705 -LRPTDEETPEI-----TRIAKFTEHQSGIDGKATAYICRDYTCKMPVTDTKEMLKLLKE 758


>gi|392962639|ref|ZP_10328068.1| glycoside hydrolase family 76 [Pelosinus fermentans DSM 17108]
 gi|421053373|ref|ZP_15516355.1| glycoside hydrolase family 76 [Pelosinus fermentans B4]
 gi|421058355|ref|ZP_15521061.1| glycoside hydrolase family 76 [Pelosinus fermentans B3]
 gi|421066419|ref|ZP_15528029.1| glycoside hydrolase family 76 [Pelosinus fermentans A12]
 gi|421073618|ref|ZP_15534678.1| hypothetical protein FA11_0867 [Pelosinus fermentans A11]
 gi|392442414|gb|EIW20004.1| glycoside hydrolase family 76 [Pelosinus fermentans B4]
 gi|392444040|gb|EIW21515.1| hypothetical protein FA11_0867 [Pelosinus fermentans A11]
 gi|392451880|gb|EIW28849.1| glycoside hydrolase family 76 [Pelosinus fermentans DSM 17108]
 gi|392456062|gb|EIW32823.1| glycoside hydrolase family 76 [Pelosinus fermentans A12]
 gi|392460977|gb|EIW37218.1| glycoside hydrolase family 76 [Pelosinus fermentans B3]
          Length = 683

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 254/693 (36%), Positives = 366/693 (52%), Gaps = 67/693 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME E FED+ VA LLN  F++IKVDREERPDVD +YM+  QAL G GGWPL+
Sbjct: 51  SCCHWCHVMERECFEDQEVADLLNQHFIAIKVDREERPDVDGIYMSVCQALTGQGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++P+ KP   GTYFP   K GR G   +L  +   W+  R  + ++G   +  L    
Sbjct: 111 IIMAPNKKPFFAGTYFPKHRKMGRMGLLELLTTLHQHWENNRSEIIKAGNEIVSILQRPK 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            AS       + L Q  L     +L  SYDS+ GGFGSAPKFP P +I  +L + +  ++
Sbjct: 171 PASEEGQVGEELLKQAYL-----ELENSYDSQCGGFGSAPKFPTPHKITFLLRYWQHFKE 225

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   +   MV  TL  M +GGI+DH+G GF RYS D++W VPHFEKMLYD   L  
Sbjct: 226 -------PKALAMVEKTLMSMWQGGIYDHLGYGFARYSTDQKWLVPHFEKMLYDNALLCT 278

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+A+  T +  ++ I  +IL Y+ RDM+   G  +SAEDADS   EG     EG FYV
Sbjct: 279 SYLEAYQCTGNGEFARIAEEILTYVMRDMMDKSGGFYSAEDADS---EGV----EGKFYV 331

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASAS 377
           +T KEV +ILG E   LF + Y +   GN +            G ++   +  D    A 
Sbjct: 332 FTRKEVLEILGEEEGTLFADFYQISSQGNFE-----------HGTSIPNRIGRDLEEYAR 380

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K+   +E    +L + R KL+ VR KR  PH DDK++ +WNGL+I++FA+A+K+LK    
Sbjct: 381 KVKWTVESLSALLEQGREKLYHVREKRIHPHKDDKILTAWNGLMIAAFAKAAKVLK---- 436

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                       + +Y  VAE  A+FI   L  +   RL   +R G +    ++DDYAFL
Sbjct: 437 ------------QSKYANVAEQGAAFIYEKLM-KADGRLLARYREGEAAHQAYIDDYAFL 483

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  L+++YE     ++L  A+ L    + LF D   GG++    +   +++R KE +DGA
Sbjct: 484 LMALIEVYEATCNNQYLHRAVTLAKDMEALFGDNTEGGFYFYGNDGEELIVRPKEIYDGA 543

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ + L +L  I   +    +   AE  L+ F   +   A        A D   
Sbjct: 544 IPSGNSVAALALQKLGDI---TDDRGFSDIAERLLSSFAGEVSRYAAGYTYFMMAVDYYV 600

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
             + K +++ G K + D + ML   ++ + L  + I           F++ H+  N    
Sbjct: 601 ADNTK-IIIAGDKEAADTKAMLDVINSCF-LPSSAIR----------FYDRHSQENVEYK 648

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +    K  A +C+NF+C PP+TD   L NLL
Sbjct: 649 EID---HKATAYICRNFACQPPITDAEKLCNLL 678


>gi|306811901|gb|ADN05998.1| YyaL-like conserved hypothetical protein [uncultured Myxococcales
           bacterium]
          Length = 800

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 249/695 (35%), Positives = 367/695 (52%), Gaps = 57/695 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE +A  LN  F++IKVDREERPD+D VYM  V  L G GGWP++
Sbjct: 136 STCHWCHVMERESFEDEEIAAYLNRHFIAIKVDREERPDIDSVYMKAVTILTGRGGWPMT 195

Query: 80  VFLSPDLKPLMGGTYFPPEDKY--GRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLS 136
           V ++PD +P  GGTYFPP   +  GR G   IL  +   + ++  +++A++     ++LS
Sbjct: 196 VIMTPDKEPFFGGTYFPPRKGFRGGRAGLIDILADMLGLYRNEPTEVVARA-----QELS 250

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           + +  +A+    P       + + A+ L + +D   GGFG APKFP+P  + ++L ++++
Sbjct: 251 QRVEQAAAIKPGPGVPSDKVIVVAAQNLGRMFDPVDGGFGGAPKFPQPSRLSLLLRYARR 310

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             D G +        MV  TL  MA GGI+D VGGGFHRYS D +W VPHFEKMLYD  Q
Sbjct: 311 TRDKGATA-------MVATTLDKMAAGGIYDQVGGGFHRYSTDAQWLVPHFEKMLYDNAQ 363

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           LA VYL+A+  T D  Y  + R+ILDY+ R+M  P G  +SA DADS    G    +EG 
Sbjct: 364 LAVVYLEAWQHTGDSGYERVAREILDYVAREMTSPEGGFYSATDADSPTPSG--HDEEGW 421

Query: 317 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           F+ WT  E+E +LG   A +F   + +   GN            F+G+N+L  +      
Sbjct: 422 FFTWTPDELERLLGAGDAAVFSSAFGVTKPGN------------FEGRNILHRVKSDQEL 469

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           AS+LG+  ++   ++   +  L+D R+ RP P  D+K+I +WNG++ ++FA+A  +L +E
Sbjct: 470 ASELGLAPKRVGEMIRRAQSTLYDARASRPPPIRDEKIIAAWNGMMGAAFAKAGWML-AE 528

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
           A                Y+EVA  A  F+   +  +    L  ++R+G   +  FLDDYA
Sbjct: 529 A---------------RYVEVAARAVQFVLEQMRTKDGA-LVRTYRDGKKGSASFLDDYA 572

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F+++  LDLYE      W+  A+ELQ  QD  +LD + GGY+ T  +   +L+R K  +D
Sbjct: 573 FMVAASLDLYEATGDAAWIERAVELQTDQDLRYLDEQTGGYYLTAADGEVLLVREKPAYD 632

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A PSGNSV+  NL+RL       K   +R+ AE   A    ++       PL+  A D 
Sbjct: 633 RAVPSGNSVAANNLLRLHDFNGDPK---WRRRAERLFASLAFQVTRSPTGFPLLLVALDR 689

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
               +   V L+   +  +   + A    S+  NK    +   DTE      +  S    
Sbjct: 690 Y-YDTVLEVALIAPTNREEASLLNARLRKSFVPNKAFTVL--TDTEAT----QQESTIPW 742

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +        K  A VC+   C  P + P   +  L
Sbjct: 743 LEAKRAMGGKSTAYVCERGRCDLPTSKPQVFQKQL 777


>gi|188996723|ref|YP_001930974.1| hypothetical protein SYO3AOP1_0787 [Sulfurihydrogenibium sp.
           YO3AOP1]
 gi|188931790|gb|ACD66420.1| protein of unknown function DUF255 [Sulfurihydrogenibium sp.
           YO3AOP1]
          Length = 686

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 253/687 (36%), Positives = 358/687 (52%), Gaps = 65/687 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME ESFEDE VAK+LN+ FVSIKVDREERPD+D +YM       G GGWPL+
Sbjct: 51  SSCHWCHVMEKESFEDEEVAKILNENFVSIKVDREERPDIDSIYMNVCLMFNGSGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTYFP   + GR G   +L  V + W   ++ L Q     IE L    
Sbjct: 111 IIMTPDKKPFFAGTYFPKYSRPGRIGLVDLLTSVAEYWKNNKEDLIQRAEKVIEYLKNDF 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKK 196
              +      DE+ ++ +  C   L   +D  +GGF   PKFP P  I  +L   YH+K+
Sbjct: 171 KGKS------DEISKDIIDACYLDLKSRFDKEYGGFSIKPKFPTPHNILFLLRYYYHTKE 224

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
           +          E  KM   TL  M  GG++DHVG GFHRYS D  W +PHFEKMLYDQ  
Sbjct: 225 M----------EALKMAEKTLINMRLGGMYDHVGFGFHRYSTDREWLLPHFEKMLYDQAM 274

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+ LTK+ FY    ++ + Y+ RDM    G  +S+EDADS   EG    +EG 
Sbjct: 275 LTMAYTEAYQLTKNNFYKKTAQETIAYVLRDMTSKEGVFYSSEDADS---EG----EEGK 327

Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FY WT  E++++L +  + L  + + +K  GN     + +      G+N+L         
Sbjct: 328 FYTWTIDELKEVLNDEELSLVIKVFNVKEEGN----YLEEATGHLTGRNILYLKKPIREL 383

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+ L M  ++    L E R+KLFD R KR  P  DDKV+  WNGL+IS+ A+A K     
Sbjct: 384 ANDLNMNQDQLETKLEEIRKKLFDAREKRVHPQKDDKVLTDWNGLMISALAKAGK----- 438

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                      G + ++ +E A++AA FI   ++   T  L H +++G  K  G LDDYA
Sbjct: 439 -----------GFEDRDLIEKAKTAADFILNTMFKNDT--LYHLYKDGEVKVEGLLDDYA 485

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F   GL++LYE     K+L  A++L +   E F D E GG+F +      V++R KE  D
Sbjct: 486 FFSWGLIELYEATGDIKYLKSALKLTDLMIEKFYDFENGGFFLSPKNSKDVIVRPKEAFD 545

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSVS  NL RL  I    K   Y   A  +L  F   +K +     +      +
Sbjct: 546 GAIPSGNSVSAYNLYRLYLISGNEK---YYNFAIETLKAFGGEIKRLPSYHSMFNIVLML 602

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           +  P+ + VVL G     + E +L   +  +  NK +I ++  + +++     + S    
Sbjct: 603 VFYPTSE-VVLAG-----NCEKVLDKINTEFIPNKAIIFLNRENEKQLKELIPYTS---- 652

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
              N   +D+    VC+NFSC+ P  D
Sbjct: 653 ---NMILSDECDIYVCKNFSCNLPTKD 676


>gi|218887845|ref|YP_002437166.1| hypothetical protein DvMF_2759 [Desulfovibrio vulgaris str.
           'Miyazaki F']
 gi|218758799|gb|ACL09698.1| protein of unknown function DUF255 [Desulfovibrio vulgaris str.
           'Miyazaki F']
          Length = 756

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 267/738 (36%), Positives = 382/738 (51%), Gaps = 85/738 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED+ VA+LLND FV +KVDREERPD+D  YM   Q L G GGWPL+
Sbjct: 50  STCHWCHVMAHESFEDDEVARLLNDAFVCVKVDREERPDIDAAYMAACQMLTGSGGWPLT 109

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---S 136
           +   PD +P    TY P   + GR G   ++ +V + W  KRD +  S    +E +   +
Sbjct: 110 IIALPDGRPFFAATYLPKHSRPGRIGLMDLVPRVLEVWRHKRDDVLDSADSIVEHVRRHA 169

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           EA+    +  +LP       L    E ++  +D+  GGFG+APKFP P  +  +L  +++
Sbjct: 170 EAMLRPPADGRLPG---AGTLHAACEAMASEFDAVNGGFGTAPKFPSPHNLLFLLRWARR 226

Query: 197 ---------LEDTGK--SGEASEGQK---MVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
                    L   G   +GE S G K   M   TL+ + +GGIHDHVG GFHRYS D RW
Sbjct: 227 NGHAAGQPGLAQAGTVPTGEESGGAKALRMAAQTLRSIRRGGIHDHVGYGFHRYSTDARW 286

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
            +PHFEKMLYDQ  L   Y +A+  T D  +     +   Y+ RD+  P G  +SAEDAD
Sbjct: 287 LLPHFEKMLYDQAMLMLAYAEAWLATGDGEFRRTAEETAAYVLRDLASPEGAFYSAEDAD 346

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGN------------CDL 350
           S E +GA  + EG FY +T  ++E+      +       ++P G+             DL
Sbjct: 347 S-ELDGA--RGEGLFYTFTLADIEEACAPLDVRPGVRPAVRPDGDGGGGVNPASLSEADL 403

Query: 351 SRMS-----------DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFD 399
           +  +           +      G+NVL         A  LG+P  +    L   R  LFD
Sbjct: 404 TARAFGCTAYGNYEDEATRSRTGRNVLHLPRAPQELARDLGLPPREVEERLEAARAALFD 463

Query: 400 VRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAES 459
           +R++RPRPHLDDKV+  WNGL I++ +R ++                  D     E A +
Sbjct: 464 LRARRPRPHLDDKVLADWNGLAIAAMSRCAQAF----------------DAPHLAEAAAA 507

Query: 460 AASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 519
           AA F+   +   Q  RL H +R+G +  PG LDDYAF+I GL++LY      +WL  A+ 
Sbjct: 508 AADFVLARMV-TQEGRLLHRWRDGEAAVPGLLDDYAFMIWGLIELYGATGEVRWLRRALR 566

Query: 520 LQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
           LQ  QD  F D EGGGY+ T  +  ++L+R KE HDGA PSGN+ ++ NL+RLA ++   
Sbjct: 567 LQEVQDTFFHDAEGGGYWMTPADGDALLVRRKEGHDGALPSGNAAALFNLLRLALLLGRP 626

Query: 580 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 639
           +   Y + A   L  F T+++   +   +  C  D  ++   + V++ G     D E ML
Sbjct: 627 E---YGERARGVLRAFATQVRHHPVGSTMFLCGVD-FALSGGRSVIVAGEPDQPDTEAML 682

Query: 640 AAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA------DKVVALVCQN 693
           AA   +Y    TV+H+   D          N+ + + A   F+A      D+  A +C+N
Sbjct: 683 AAVRGTY-APTTVLHLRTTD----------NARDLA-ALVPFTAHLAPLEDRATAWLCEN 730

Query: 694 FSCSPPVTDPISLENLLL 711
           ++CSPP+TDP  L+  LL
Sbjct: 731 YACSPPITDPAELKARLL 748


>gi|366164964|ref|ZP_09464719.1| hypothetical protein AcelC_14944 [Acetivibrio cellulolyticus CD2]
          Length = 680

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 259/699 (37%), Positives = 370/699 (52%), Gaps = 81/699 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED+ VA  LN  F+SIKVDREERPD+D +YM   QAL G GGWPL+
Sbjct: 53  STCHWCHVMEKESFEDKEVADALNKNFISIKVDREERPDIDHIYMNVCQALTGHGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +F+SPD KP   GTYFP  ++ G PG  T+L  V DAW   RD+L +S     EQ+  AL
Sbjct: 113 IFMSPDKKPFFAGTYFPKNNRMGMPGLLTVLESVHDAWVSNRDILTRSS----EQILNAL 168

Query: 140 SASASSNKL--PD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
           S     N +  PD   EL ++       +    +D+ +GGFGSAPKFP P  +  +L + 
Sbjct: 169 S---DRNDILEPDSEEELSEDIFYEAFSEFKYDFDNNYGGFGSAPKFPTPHNLFFLLRYW 225

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
              +D           KMV  TL+ M KGGI+DH+G GF RYS D +W +PHFEKMLYD 
Sbjct: 226 YNTKD-------EYALKMVEKTLESMHKGGIYDHIGFGFSRYSTDRKWLIPHFEKMLYDN 278

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
             LA  YL+ +  TK   Y+ I ++I  Y+ RDM    G  +SAEDADS   EG    +E
Sbjct: 279 ALLAIAYLEVYQATKKSEYADIAKEIFTYVLRDMTSNEGGFYSAEDADS---EG----EE 331

Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDS 372
           G FY+W++ EV+ +LG       E Y       C L  ++  H  F+G N+  LI+ N +
Sbjct: 332 GKFYIWSANEVKTVLGNKD---GEKY-------CKLYDIT-AHGNFEGFNIPNLIKGNIA 380

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
                            + ECR+KLF+ R KR  P+ DDK++ SWNGL+I++ A   ++L
Sbjct: 381 QEDDG-----------FIEECRKKLFEFREKRVHPYKDDKILTSWNGLMIAAMAFGGRVL 429

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                         G D+  Y + AE A  FI   L      RL   +R+G S  P ++D
Sbjct: 430 --------------GVDK--YTKAAEKAVDFIFSKLISSDG-RLLARYRDGDSAFPAYVD 472

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFLI GL++LYE      +L  +++L +   + F D   GG F+   +   ++ R KE
Sbjct: 473 DYAFLIWGLIELYETTYKPIYLKRSLKLNDDLIKYFWDETNGGLFHYGSDSEQLITRPKE 532

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
            +DGA PSGNSV+ +N +RLA +   ++ +   + A +  A F   ++  A        A
Sbjct: 533 IYDGATPSGNSVATMNFLRLARLTGQAELE---EKAYNQFATFGRSIERFARGHSFFLSA 589

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
             + +    K VV+VG++ +++  +M++     +      +      T+ +D        
Sbjct: 590 L-LFAKSKSKEVVIVGNE-NLEESSMVSIIREDFRPFTLSMFYSNKHTDLIDL------- 640

Query: 673 NASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
            A    N  + + K  A VC+NF+C  P+TD     N +
Sbjct: 641 -APFIENYKTVEGKTTAYVCENFACQAPITDNSLFRNAI 678


>gi|219849212|ref|YP_002463645.1| hypothetical protein Cagg_2330 [Chloroflexus aggregans DSM 9485]
 gi|219543471|gb|ACL25209.1| protein of unknown function DUF255 [Chloroflexus aggregans DSM
           9485]
          Length = 693

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 250/692 (36%), Positives = 364/692 (52%), Gaps = 64/692 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESF D  +A + N++F++IKVDREERPD+D +YM   QAL G GGWPL+V
Sbjct: 55  ACHWCHVMAHESFADPEIAAIQNEYFINIKVDREERPDLDSIYMAAAQALTGRGGWPLNV 114

Query: 81  FLSPDLKPLMGGTYFPPE---DKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQ 134
           F  PD  P   GTYFPP+   ++Y  P ++ +L  + +A+  +RD L   AQ     I+ 
Sbjct: 115 FCLPDGTPFFAGTYFPPDAKANRYRMPSWRQVLLSIAEAYRTRRDDLTASAQELLNHIKL 174

Query: 135 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
           L++ L  +A+ ++         L   A +L + +D ++GGFG APKFP+P+ ++ +L   
Sbjct: 175 LAQPLPETATVDE-------ALLLEAAAKLEREFDPQYGGFGDAPKFPQPLVLEFLL--- 224

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
                T   G   +   M+  TL+ MA GG++D VGGGFHRYSVD RW VPHFEKMLYD 
Sbjct: 225 ----RTHLRGHV-QALPMLHQTLEQMAHGGMYDQVGGGFHRYSVDTRWLVPHFEKMLYDN 279

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
             LA VY  A  +T D F + I  +   YL RD+  P G  FS+EDADS    GA   +E
Sbjct: 280 ALLAEVYHLAALVTGDPFLAQIADETFAYLLRDLRHPEGAFFSSEDADSLPVPGAAHAEE 339

Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           GAFYVWT  E+   LG+ A +   +Y +   GN            F+GK++L     +SA
Sbjct: 340 GAFYVWTPDELRLALGDDATIVGAYYGVTRQGN------------FEGKSILYVPRSASA 387

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A++LG+P+E+    +   R  L   R +RPRP  D+K+I +WN L I + A AS  +  
Sbjct: 388 VAARLGVPVERVTETVERARPILRTFREQRPRPFRDEKIITAWNALAIRALATASARV-- 445

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                            EY+  A   A F+  +L      RL  S+++G     GFLDDY
Sbjct: 446 ----------------PEYLSAARQCADFLLANL-RRADGRLLRSWKDGRPGPAGFLDDY 488

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A L   LL+L+  G  T +L  AIEL     +LF D +   +F+T  + P+++ R ++  
Sbjct: 489 ALLCDALLELHAAGGETYYLATAIELAEAMLDLFWDAQSWMFFDTGRDQPALVTRPRDLS 548

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D A PSG S + + L+RL ++   + +D +   AE  L      L    +    M CAAD
Sbjct: 549 DNATPSGTSAATMALLRLYAL---TGNDLFATRAEQVLQQVAPMLIRFPLGFGRMLCAAD 605

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           ++  P R+ + ++G       + +LA A ++Y     + H +P D              A
Sbjct: 606 LMIGPIRE-LAIIGPSGHPATQALLAVARSAYRPRLVIAHAEPGDP--------IAEQVA 656

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
            +A       +  A +C+ F+C  PVT P +L
Sbjct: 657 LLAGRTLIDGQPTAYLCERFACRLPVTTPEAL 688


>gi|345856701|ref|ZP_08809173.1| hypothetical protein DOT_0529 [Desulfosporosinus sp. OT]
 gi|344330213|gb|EGW41519.1| hypothetical protein DOT_0529 [Desulfosporosinus sp. OT]
          Length = 652

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 246/696 (35%), Positives = 371/696 (53%), Gaps = 80/696 (11%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE++ VA +LN +F+SIKVDREERPDVD +YM + Q L G GGWPL++ ++PD K
Sbjct: 1   MERESFENDEVAGILNRYFISIKVDREERPDVDHLYMAFCQTLTGSGGWPLTIIMTPDKK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP  ++YGRPG   +  +V   W      L +S    +  +    +  + S+ 
Sbjct: 61  PFFAGTYFPKTERYGRPGLMELAEQVGTLWKTNEGKLRESSDEIVAAVHSQRTVPSKSSP 120

Query: 148 LPDELPQNA-------------LRLCAEQL--------SKSYDSRFGGFGSAPKFPRPVE 186
           LP  +  +               +  +EQL        ++S+D+R+GGFG APKFP P  
Sbjct: 121 LPSAVTNDPSLKDGNGPTSSEDFQTWSEQLIDKAYQVFAQSFDARYGGFGRAPKFPTPHT 180

Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           I  +L ++       +    S+  +MV  TL  MA+GGI+DHVG GF RYS DE+W VPH
Sbjct: 181 ISFLLRYA-------QDHPQSKALEMVRKTLDGMAQGGIYDHVGFGFARYSTDEKWLVPH 233

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           FEKMLYD   LA+ YL+++        +   ++I  Y+ RDM  P G  +SAEDAD+   
Sbjct: 234 FEKMLYDNALLASTYLESYQANHQPDDAQKAKEIFTYVLRDMTSPEGGFYSAEDADA--- 290

Query: 307 EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
           EG     EG F+VWT  E+E +LG + A ++   Y + P GN            F+GKN+
Sbjct: 291 EGV----EGKFHVWTRAEIETLLGKDTAAMYCAVYDITPEGN------------FEGKNI 334

Query: 366 L-IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
             + L +    A    +   + L IL + R+ LF  R KR  PH DDK++ +WNGL+I++
Sbjct: 335 PNLLLGNLEKIARNNSLAAAEVLQILEKARQTLFTAREKRIHPHKDDKILTAWNGLMIAA 394

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
           FA+ +++L   A                Y+E AE+AA F+  HL      RL   +R G 
Sbjct: 395 FAKGAQVLGIPA----------------YLEAAENAADFVLTHL-KRNDGRLLARYREGH 437

Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 544
           S   G+LDDYAF I GLL+LY       +L  A++LQ  Q+ LFLD E GGY+ T  +  
Sbjct: 438 SAYLGYLDDYAFFIGGLLELYSVSGKPHYLQVALQLQEEQERLFLDEEDGGYYLTGSDGE 497

Query: 545 SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 604
            +L R KE +DGA P+GNS++ +NL +LA +    +   + + AE  L VF + L++   
Sbjct: 498 ELLFRPKESYDGAIPAGNSITALNLFKLARLTGDER---WERKAEQQLLVFRSVLEEHPS 554

Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
                  A      PS++ ++L G  ++ +   M     +++    +V++ + +  E + 
Sbjct: 555 GYTAFLQALQFAVHPSQE-LILAGALNATELPEMRQIFFSAFRPYASVLYQEGSLPETVP 613

Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
           + +++  + +           + A +CQNF+C  PV
Sbjct: 614 WIQDYPIDPS----------HITAYLCQNFTCQRPV 639


>gi|20092523|ref|NP_618598.1| hypothetical protein MA3726 [Methanosarcina acetivorans C2A]
 gi|19917793|gb|AAM07078.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 697

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 250/713 (35%), Positives = 367/713 (51%), Gaps = 54/713 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVM  ESFEDE +A+L+N+ FVSIKVDREERPD
Sbjct: 33  GEEAFEKARKENKPIFLSIGYSTCHWCHVMAHESFEDEEIARLMNEAFVSIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +YMT  Q + G GGWPL++ ++P  KP   GTY P + ++ + G   ++ ++K+ WD
Sbjct: 93  IDNIYMTVCQIILGRGGWPLTIIMTPGKKPFFAGTYIPKKSRFNQTGMTELIPRIKEIWD 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           ++ + +  S       +   +  S           +  +      L  S+D  +GGFG A
Sbjct: 153 QQHEEVLDSAEKITSTIQNMIVESTGEGLG-----EEIIEEAYNDLLNSFDPEYGGFGRA 207

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P +I  +L + K+  D        E   MV  TL  M  GGI+DH+G GFHRYS 
Sbjct: 208 PKFPTPHKISFLLRYWKRSGD-------PEALDMVEHTLDNMRSGGIYDHLGSGFHRYST 260

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W +PHFEKMLYDQ   A  Y++A+ ++    Y      ILDY+ RD+  P G  +  
Sbjct: 261 DNMWLLPHFEKMLYDQALTAIAYIEAYQVSGKDLYKETAEGILDYVLRDLTSPEGGFYCG 320

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDAD    EG    +EG +Y+WT +EV  ILG E + L  + + LK  GN +     +  
Sbjct: 321 EDAD---VEG----EEGKYYLWTIEEVMSILGPEDSELIIKMFNLKRGGNFE----EEIR 369

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
               G N+   ++   + A++L +P+E+  + +   R KL   R +R RP LDDKV+  W
Sbjct: 370 GRKTGTNLFYMVHSPGSLAAELEIPVEEVESRVKSAREKLLKARYERKRPSLDDKVLTDW 429

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+I++FA+               F V G ++  Y++ AE AA F+   LY  +  RL 
Sbjct: 430 NGLMIAAFAKG--------------FQVFGEEK--YLKAAEKAADFLLETLYGPE-KRLH 472

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           H +R+G +   G  DDYAFLI GLL+LYE G   ++L  A+ L     E F D E GG++
Sbjct: 473 HRYRDGVAGISGTSDDYAFLIHGLLELYEAGFELRYLKSAVSLNRELLEHFWDPENGGFY 532

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            T  +   ++ R KE  D A PSGNS  ++NL+RL+ ++A    +   + A+     F  
Sbjct: 533 FTASDSEVLIFRKKEFTDAAIPSGNSFEMLNLLRLSRLIADPGME---ETADRLERAFSK 589

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            +K           A D    PS + V++ G + S D  NML    + +  NK ++    
Sbjct: 590 LIKKTPSGYTQFLSAFDFRLGPSYE-VIISGKRESPDTVNMLEELWSYFTPNKVLVFRPE 648

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            +  E+    E+      +        K  A VCQN+ C  P T+   +  LL
Sbjct: 649 GENPEIADLAEYTKEQLPI------EGKATAYVCQNYECQLPTTETREMLKLL 695


>gi|306811868|gb|ADN05966.1| YyaL-like conserved hypothetical protein [uncultured Myxococcales
           bacterium]
          Length = 800

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 247/694 (35%), Positives = 358/694 (51%), Gaps = 55/694 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE +A  LN  F++IKVDREERPD+D VYMT V  L G GGWP++
Sbjct: 136 STCHWCHVMERESFEDEEIAAYLNRHFIAIKVDREERPDIDSVYMTAVTILTGRGGWPMT 195

Query: 80  VFLSPDLKPLMGGTYFPPEDKY--GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           V ++P  +P  GGTYFPP   +   R G   IL  +   +  +   +        ++LS+
Sbjct: 196 VIMTPHKEPFFGGTYFPPRKGFRGNRAGLIDILTDMLSLYKNEPTQVVARA----QELSQ 251

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            +  +A+    P       + + A+ L + +D   GGFG APKFP+P  + +++ ++++ 
Sbjct: 252 RVEQAAAIKPGPGVPSDKMIVVAAQNLGRMFDPVDGGFGGAPKFPQPSRLSLLMRYARRT 311

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            D G +        MV  TL  MA GGI+D VGGGFHRYS D +W VPHFEKMLYD  QL
Sbjct: 312 RDEGATA-------MVTTTLDKMAAGGIYDQVGGGFHRYSTDAQWLVPHFEKMLYDNAQL 364

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A VYL+A+  T D  Y  + R+ILDY+ R+M  P G  +SA DADS    G    +EG F
Sbjct: 365 AVVYLEAWQHTGDSAYERVAREILDYVAREMTSPEGGFYSATDADSPTPSG--HDEEGWF 422

Query: 318 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           + WT  E+E +LG   A +    + +   GN            F+G+N+L  +       
Sbjct: 423 FTWTPGELERLLGAGDAAVVSSAFGVTERGN------------FEGRNILHRVKADQELG 470

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           S+LG+  ++   I+   R  L+D R+ RP P  D+K+I +WNG++ ++FA+A  +L +EA
Sbjct: 471 SELGLAPKRVGEIIRSARSTLYDARASRPPPIRDEKIIAAWNGMMGAAFAKAGWML-AEA 529

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y+EVA  A  F+   +  E    L  ++R G   +  FLDDYAF
Sbjct: 530 ---------------RYVEVAARAVGFVLAQMRAEGGA-LVRTYREGKKGSASFLDDYAF 573

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +++  LDLYE      W+  A+ELQ  QD  +LD + GGY+ T  +   +L+R K  +D 
Sbjct: 574 IVAACLDLYEATGDAAWIERAVELQTDQDLRYLDEQTGGYYLTAADGEVLLVREKPAYDR 633

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV+  NL+RL       K   +R+ AE   A    ++       PL+  A D  
Sbjct: 634 AVPSGNSVAANNLLRLHDFTGDPK---WRRRAERLFAWLAFQVTRSPTGFPLLLVALDRY 690

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
              +   V L+   S  +   + A    S+  NK    +  A+  + +      S    +
Sbjct: 691 -YDTVLEVALIAPASREEASVLDAQLRKSFVPNKAFTVLTDAEASQQE------STIPWL 743

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 A K  A VC+   C  P + P   +  L
Sbjct: 744 EAKRAMAGKSTAYVCERGRCELPTSKPQVFQKQL 777


>gi|315425009|dbj|BAJ46683.1| hypothetical conserved protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 692

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 267/700 (38%), Positives = 381/700 (54%), Gaps = 81/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME ESFEDE +A+LLN +FV +KVDREERPD+D+VYM  V  + G GGWPL+
Sbjct: 61  SSCHWCHVMEKESFEDEKIAELLNTFFVPVKVDREERPDIDEVYMKAVIMMTGHGGWPLT 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP  GGTYFPP  + G  G   ILR V + W K    + +    A EQ    L
Sbjct: 121 VFLTPDLKPFFGGTYFPPRRRGGLRGLDEILRGVAELWRKDPKQVME----AAEQNVSLL 176

Query: 140 SASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            +  ++ K  D  P + L + A + L+ S+DS +GGFG APKFP PV +  +  +S  LE
Sbjct: 177 KSFYTTEK-SDTTPSHNLVVTAFDILATSFDSLYGGFGGAPKFPMPVYLDFLQVYS-VLE 234

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                 +     +MV  TL+ MA+GG+ DH+GGGF RYS D  W VPHFEKMLYD   LA
Sbjct: 235 ------KEPAAVRMVSTTLENMARGGLRDHLGGGFFRYSTDRVWLVPHFEKMLYDNALLA 288

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY++ + +T D FY  I    LD+L  +M+ PGG  +SA DADS E        EG +Y
Sbjct: 289 RVYMNHYLITGDSFYREIGASTLDWLVSEMMNPGGGFYSAVDADSPE-------GEGEYY 341

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VW   E+E ILG E A +  + Y +  TGN +            GKN+L     ++  A+
Sbjct: 342 VWRRGELEQILGPELAKIAAKTYAVTDTGNFE-----------HGKNILTMRKRTAELAA 390

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +LG+       +L E + KL D R KRP P +DDK+I +WNG  +S+     +       
Sbjct: 391 ELGVDEPTLKQMLEEAKNKLLDARRKRPAPGVDDKIIAAWNGFAVSALCTGYR------- 443

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                     +  K Y++ A     FI  +++   T  L   ++NG S   GFLDDYA +
Sbjct: 444 ---------ATGEKRYLDAALKTIDFIISNMWLNNT--LHRIYKNGAS-INGFLDDYAAV 491

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           ++ LLD++E     ++L  A+++ N   ELF D   GG++ T  ED + + R+K+ +DGA
Sbjct: 492 VNALLDVFEVSFEPRYLAVAVDVANRMVELFWDNVDGGFYYTV-EDVAGVTRIKDAYDGA 550

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM-AMAVPLMCCAADML 616
            PSGN+++   L++L+ +   +K   Y Q  E +L  F +RL+   A    L+   A   
Sbjct: 551 TPSGNTLAAAALLKLSELTGETK---YLQYVEETLKCFASRLEAAPAEHTGLITVLAGFH 607

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           +  SR  VVLV  +S  +    LA  + ++   ++V+ +             HN N  ++
Sbjct: 608 T--SRMEVVLV-TESPQEARPYLAHLYRAFKPFRSVVVV-------------HNGNRDTL 651

Query: 677 AR-NNFSADK-----VVALVCQNFSCSPPVTDPISLENLL 710
            +     ADK     V A VC+N+SC  PVT   SLE  +
Sbjct: 652 QKYTRLVADKPAKGPVTAYVCENYSCRMPVT---SLEEFV 688


>gi|301061221|ref|ZP_07202007.1| conserved hypothetical protein [delta proteobacterium NaphS2]
 gi|300444689|gb|EFK08668.1| conserved hypothetical protein [delta proteobacterium NaphS2]
          Length = 694

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 258/699 (36%), Positives = 379/699 (54%), Gaps = 68/699 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFED   A++LND +VSIKVDREERPD+DK+YM+  QAL G GGWPLSV
Sbjct: 55  TCHWCHVMAHESFEDPETARILNDHYVSIKVDREERPDLDKIYMSVCQALTGRGGWPLSV 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+  P   GTYFP     G  GF  +L K+   W + R+ L  +G    ++++E L 
Sbjct: 115 FLTPERIPFFAGTYFPKIGHQGLIGFPELLLKLGKLWKEDRERLLTAG----DEITEHLR 170

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            S     +   L    L     QLS+S+D R+GGFG APKFP P ++  +L    + ++ 
Sbjct: 171 NSELGGSVEKSLDMEVLNKAGVQLSRSFDPRWGGFGGAPKFPSPHQLTFLLRRHVRSKN- 229

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                 +   +MV  TLQ M +GG+ DH+G GFHRYSVDE+W  PHFEKMLYDQ  LA  
Sbjct: 230 ------ARDLEMVEKTLQSMRRGGLFDHIGYGFHRYSVDEKWFAPHFEKMLYDQALLAMA 283

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A+ +T   FY+ + R+I  Y+ RDM  P G  +SAEDADS   EG     EG FY+W
Sbjct: 284 YTEAYQVTGKSFYARVAREIFTYVLRDMTSPEGGFYSAEDADS---EGV----EGLFYLW 336

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSR----MSDPHNEF-KGKNVLIELNDSSA 374
           T KEV++ILG E A LF +++ ++  GN +  R    M +P + F +G+N          
Sbjct: 337 TPKEVQEILGTESADLFCDYFDIRERGNFEEGRSIPHMREPLSTFAEGRN---------- 386

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
                 M +++ +++L + R KLF  R KR  P  DDK++ SWNGL+I++  +  + L  
Sbjct: 387 ------MGVKRLVSLLRQGREKLFSARQKRIHPLKDDKILTSWNGLMITALFKGYRALGD 440

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
            A                Y+  A+++  FI   L  E    L   +R G +   G+LDDY
Sbjct: 441 AA----------------YVTAAQNSLQFILNTLRKEDGC-LIRRYREGETAHAGYLDDY 483

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           AFL+  L++ YE       L  A+ L +T  +LF D E GG+F T  E+ +++ R ++  
Sbjct: 484 AFLVWALIEGYESTFNPNHLKTAMVLTHTMLDLFWDSENGGFFFTGRENETLIARSRDAQ 543

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DGA PSGNSV+ + L++L  +   +    + + A   +  F  ++     A   M  A D
Sbjct: 544 DGAIPSGNSVAALTLLQLGRLTGDTS---FEEKANALMQAFSGQMDAYPSAHTQMLQALD 600

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
            +  P+++ VV+ G +   + + ML     ++ L + V  +  ++ E      E  +  A
Sbjct: 601 FVIGPTQE-VVIAGTRHDRNTDVMLKVIQQNF-LPRQVALLVSSNEE-----RERVAGLA 653

Query: 675 SMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
              +     + K  A +C+  +C  PVTDP ++E  L E
Sbjct: 654 PYVKEMVPVEGKATAYICRRHACQAPVTDPEAMEKALNE 692


>gi|83590501|ref|YP_430510.1| hypothetical protein Moth_1665 [Moorella thermoacetica ATCC 39073]
 gi|83573415|gb|ABC19967.1| Protein of unknown function DUF255 [Moorella thermoacetica ATCC
           39073]
          Length = 752

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 273/752 (36%), Positives = 372/752 (49%), Gaps = 90/752 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVM  ESF DE VA LLND F++IKVDREERPD
Sbjct: 32  GEEAFARAKREDKPVFLSIGYSTCHWCHVMARESFNDEEVAALLNDSFIAIKVDREERPD 91

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D+VYM   QAL G GGWPL+VFL+P+ +P   GTYFP  ++YGRPG   +L+ +++ W 
Sbjct: 92  IDQVYMAACQALTGSGGWPLTVFLTPEKRPFYAGTYFPKHNRYGRPGLVELLKLIREKWA 151

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             R+ L +SGA  I+ ++   + +      P E     L    +QL   +D  +GGF  A
Sbjct: 152 THREELEESGAELIQHVAGQFAPTP-----PGEPGAQVLEKGWQQLRAGFDPLYGGFSEA 206

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P ++  +L + K+ ++ G          MV  TLQ M  GGI+DH+G GF RYS 
Sbjct: 207 PKFPSPHQLLFLLRYWKRYDEAG-------ALAMVEKTLQAMYCGGIYDHIGFGFARYST 259

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D RW VPHFEKMLYD   LA  YL+    T    YS++ R+I  ++ RDM  P G  +SA
Sbjct: 260 DRRWLVPHFEKMLYDNALLALAYLETRQATGKAVYSHVAREIFTWVLRDMTSPEGGFYSA 319

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
            DADS   EG    +EG FY+WT  +V ++LG     F   Y+   T   +    S P+ 
Sbjct: 320 LDADS---EG----EEGRFYLWTPDQVREVLGAKEGEFFCRYF-DITAGGNFEGRSIPNL 371

Query: 359 EFKGKNVLI------ELNDSSASASK------------------LGMPLEKYLNILGEC- 393
             +G+ +        E ND++    +                   G P E  L   G   
Sbjct: 372 IGRGEALFAAGTSGNESNDTAGDQRQPREQGGRAGGISGGGGCAKGSPEEDRLPGRGPTT 431

Query: 394 ---------------RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                          R KLF  R KR  PH DDK++ +WNGL+I++ AR + +L      
Sbjct: 432 LAGFGPATAARLAAAREKLFAAREKRVHPHRDDKILTAWNGLMIAALARGAWVL------ 485

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     D   Y   A  AA FI  HL D +  RLQ  +R G +  P +LDDYAFL 
Sbjct: 486 ----------DEPAYAAAAARAARFILTHLRDAEG-RLQARYREGQAAFPAYLDDYAFLT 534

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++LY+    T +L  A+ L     ELF D EGGGYF T      + +R +E +DGA 
Sbjct: 535 WGLIELYQATFETGYLREALALTRQMQELFRD-EGGGYFFTPHGAGELPVRPREVYDGAI 593

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV+ +NL+RLA I   S+ +   + A   +      + +         CA D    
Sbjct: 594 PSGNSVAALNLLRLARITGDSRLE---EEAAAQVRALAGTVAEYPRGYSFYLCALDFYLG 650

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P    +VL G + + D   +L    A+Y L   V+ + P   E     EE        A 
Sbjct: 651 PV-TEIVLAGERETEDTRALLRVLRAAY-LPSAVLVLRPGGREG----EEVTRLIPYTAG 704

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 K    +C+NF+C  PVT    LE  L
Sbjct: 705 QKPVNGKATLYLCRNFACRAPVTTAGELEQWL 736


>gi|385811559|ref|YP_005847955.1| thioredoxin domain-containing protein [Ignavibacterium album JCM
           16511]
 gi|383803607|gb|AFH50687.1| Thioredoxin domain protein [Ignavibacterium album JCM 16511]
          Length = 692

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 242/684 (35%), Positives = 369/684 (53%), Gaps = 54/684 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VAKL+ND F+SIKVDREERPD+D VYM   Q + GGGGWPL+
Sbjct: 51  STCHWCHVMERESFEDEEVAKLMNDTFISIKVDREERPDIDGVYMAVCQMITGGGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTYFP  +++GR G   ++ K+ D W  +R+ +  S     E++++++
Sbjct: 111 IVMTPDKKPFFAGTYFPKYNRFGRIGMLELITKLNDIWKNRREEVLNSA----EEITKSI 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +   S  K  +E+ +  L    ++ S+ +D  +GGFG+APKFP P  +  +L + ++ ++
Sbjct: 167 N-KISHKKSDEEIDEKILDKAFDEYSRRFDKEYGGFGNAPKFPTPHNLLFLLRYYRRTKN 225

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                      K+V  TL  M KGGI+D +G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 226 LS-------ALKIVEKTLTEMRKGGIYDQIGFGFARYSTDKYWLVPHFEKMLYDNALLLM 278

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            + +AF +T + FY     +I +Y+ RDM  P G  FSAEDADS   EG    +EG FY+
Sbjct: 279 AFSEAFQITGNDFYKTTSEEIAEYVLRDMTHPEGGFFSAEDADS---EG----EEGKFYL 331

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  E+ ++L  + A    + + ++P GN       +      G N+L         A+ 
Sbjct: 332 WTEVEIRELLTKDEADFIIKVFNIEPNGNW----YDEARGVRTGNNILHLKKSYKELAND 387

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           L M    ++  L   R+K+FD R KR  PH DDK++  WN L+IS+  ++S IL      
Sbjct: 388 LSMSENDFIKNLSSIRKKMFDWRKKRVHPHKDDKILTDWNSLMISALIKSSVIL------ 441

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     D+ ++++ A  A  F++++L+  ++ +L H FR   S   G +DDYAF I
Sbjct: 442 ----------DKNKFLQAAMKADKFVKKYLF--RSEKLLHRFRESESAIDGNIDDYAFFI 489

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              LDL+E  S  ++L+ AI L       F D + GGYF T+ +   +++R KE +DGA 
Sbjct: 490 QAQLDLFEATSEAEFLLTAIRLNEILFHKFWDDKSGGYFFTSEDSEKLIVRQKEIYDGAI 549

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV ++NL+RL  +   +    Y + A+  +  F + +  M        C  D LS 
Sbjct: 550 PSGNSVQLLNLLRLYELTGNA---VYYEIAQKQVKAFASEVSRMPSVFAQFLCGFDFLSG 606

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
            S + V+    K+  D   +       Y  +K +I ID ++ +++       S      +
Sbjct: 607 ASVQLVITAKDKNVAD--EIFKKLSREYFPSKVIIRIDNSNCQKL-------SEIIPHLK 657

Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
           +    +K     C++F C  P  +
Sbjct: 658 DYKVEEKPTIYFCRDFVCEKPTNN 681


>gi|159897570|ref|YP_001543817.1| hypothetical protein Haur_1041 [Herpetosiphon aurantiacus DSM 785]
 gi|159890609|gb|ABX03689.1| protein of unknown function DUF255 [Herpetosiphon aurantiacus DSM
           785]
          Length = 681

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 251/690 (36%), Positives = 365/690 (52%), Gaps = 64/690 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED   A ++N+ FV+IKVDREERPD+D +YM  VQA+   GGWP++
Sbjct: 48  SACHWCHVMAHESFEDPATAAVMNELFVNIKVDREERPDIDSLYMAAVQAMTRHGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD  P  GGTYFPPE ++  P F+ +L  V +A+  +R+ + QS     E L + L
Sbjct: 108 VFLTPDGAPFYGGTYFPPEPRHNMPSFQQVLHGVAEAYRDRREEVFQSAEQMREHLEDIL 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S      K    L ++ L + A++    +DSRFGG+G APKFP+ +   M+L    + ED
Sbjct: 168 SFDLEQVK----LSKSQLNVAAQRQMSQFDSRFGGYGGAPKFPQALIFGMVLRTWLRSED 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                + ++       TLQ MA GG++D +GGGF RYSVD +W VPHFEKMLYD   L+ 
Sbjct: 224 QDALNQVTQ-------TLQAMANGGMYDQLGGGFARYSVDAQWLVPHFEKMLYDNALLSQ 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +YL+ +  T D FY  I  + ++Y+ RDM  P G  ++AEDADS   EG    +EG FYV
Sbjct: 277 LYLETYQATHDPFYRRIAEESINYILRDMTSPDGGFYAAEDADS---EG----EEGKFYV 329

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+  E++ +L  E A L + ++ ++P GN            F+G  +L    D S  A +
Sbjct: 330 WSLAEIQQLLSPEDAALAQLYWNIQPEGN------------FEGHAILYVPQDPSVVAKE 377

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           L +        +   R  L   R+ R RP  D+K++ SWNG+++ S A A+ +L      
Sbjct: 378 LSISEADLAQRIAVIRATLLAQRNTRIRPGRDEKILASWNGMMLRSLAFAANVL------ 431

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     D  +Y   A   A FI   LY  Q  +L  S+++G +K  G+L+DYA + 
Sbjct: 432 ----------DNADYRAAAIRNAEFITSKLY--QNGQLYRSYKDGQAKFKGYLEDYACVA 479

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            G+L LYE     +WL  AIEL  +  E F D +   +F+T  +   ++ R ++ +D A 
Sbjct: 480 DGMLALYEATFDLRWLQVAIELAESMTERFWDAQQRSFFDTASDHEQLITRPRDLYDNAT 539

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           P+GNSV+V  L+RLA+++   +   YRQ AE  LA     L  +  A   +  AAD    
Sbjct: 540 PAGNSVAVDVLLRLATLLDRYE---YRQYAETVLANLSGALLQLPGAFGRLLAAADFALA 596

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN--ASM 676
             R+ V L+G  +   F+ +L A + +Y  NK V    P D         H +      +
Sbjct: 597 EPRE-VALIGDPADPAFKALLQATYRNYQPNKVVAACKPDD---------HAAQQLIPLL 646

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISL 706
           A       +  A VC   +C  P  DP  L
Sbjct: 647 AERPLLNQQATAYVCVRRACKLPTNDPNEL 676


>gi|315426698|dbj|BAJ48323.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
 gi|343485462|dbj|BAJ51116.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 692

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 265/699 (37%), Positives = 377/699 (53%), Gaps = 79/699 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME ESFEDE +A+LLN +FV +KVDREERPD+D+VYM  V  + G GGWPL+
Sbjct: 61  SSCHWCHVMEKESFEDEKIAELLNTFFVPVKVDREERPDIDEVYMKAVIMMTGHGGWPLT 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP  GGTYFPP  + G  G   ILR V + W K    + +    A EQ    L
Sbjct: 121 VFLTPDLKPFFGGTYFPPRRRGGLRGLDEILRGVAELWRKDPKQVME----AAEQNVSLL 176

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            +  ++ K       N +    + L+ S+DS +GGFG APKFP PV +  +  +S  LE 
Sbjct: 177 KSFYTTEKSVTTPSHNLVVTAFDILATSFDSLYGGFGGAPKFPMPVYLDFLQVYS-VLE- 234

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                + S   +MV  TL+ MA+GG+ DH+GGGF RYS D  W VPHFEKMLYD   LA 
Sbjct: 235 -----KESAAVRMVSTTLENMARGGLRDHLGGGFFRYSTDRVWLVPHFEKMLYDNALLAR 289

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY++ + +T D FY  I    LD+L  +M+ PGG  +SA DADS E        EGA+YV
Sbjct: 290 VYMNHYLITGDSFYREIGASTLDWLVSEMMNPGGGFYSAVDADSPE-------GEGAYYV 342

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W   E+  ILG E A +  + Y +  TGN +            GKN+L     ++  A++
Sbjct: 343 WRLGELGQILGPELAKIAAKTYAVTDTGNFE-----------HGKNILTMRKRTAELAAE 391

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG+       +L E + KL D R KRP P +DDK+I +WNG  +S+     +        
Sbjct: 392 LGVDEPTLKQMLEEAKNKLLDARRKRPAPGVDDKIIAAWNGFAVSALCTGYR-------- 443

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                    +  K Y++ A     FI  +++   T  L   ++NG S   GFLDDYA ++
Sbjct: 444 --------ATGEKRYLDAALKTIDFIISNMWLNNT--LHRIYKNGAS-INGFLDDYAAVV 492

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
           + LLD++E     ++L  A+++ N   ELF D   GG++ T  ED + + R+K+ +DGA 
Sbjct: 493 NALLDVFEVSFEPRYLAVAVDVANRMVELFWDNVDGGFYYTV-EDVAGVTRIKDAYDGAT 551

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM-AMAVPLMCCAADMLS 617
           PSGN+++   L++L+ +   +K   Y Q  E +L  F +RL+   A    L+   A   +
Sbjct: 552 PSGNTLAAAALLKLSELTGETK---YLQYVEETLKCFASRLEAAPAEHTGLITVLAGFHT 608

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
             SR  VVLV  +S  +    LA  +  +   ++V+ +             HN N  ++ 
Sbjct: 609 --SRMEVVLV-TESPQEARPYLAHLYREFKPFRSVVVV-------------HNGNRDTLQ 652

Query: 678 R-NNFSADK-----VVALVCQNFSCSPPVTDPISLENLL 710
           +     ADK     V A VC+N+SC  PVT   SLE  +
Sbjct: 653 KYTRLVADKPAKGPVTAYVCENYSCRMPVT---SLEEFV 688


>gi|328951864|ref|YP_004369198.1| hypothetical protein Desac_0120 [Desulfobacca acetoxidans DSM
           11109]
 gi|328452188|gb|AEB08017.1| protein of unknown function DUF255 [Desulfobacca acetoxidans DSM
           11109]
          Length = 693

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 250/696 (35%), Positives = 369/696 (53%), Gaps = 60/696 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  E FED  +A+L+N+WF++IKVDREERPD+D +YM  VQ + G GGWPL+
Sbjct: 53  STCHWCHVMAHECFEDPEIARLMNEWFINIKVDREERPDLDDIYMHAVQMITGRGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+LKP  GGTYFPP D+ G PGF  +L+ + D++  K+  +    A  +EQ    L
Sbjct: 113 VFLTPELKPFYGGTYFPPIDRGGLPGFPRLLQALHDSYKNKKSNIHNVIA-TLEQNMRIL 171

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           + + +S + P      AL    E     +D   GGF  APKFP   ++     H      
Sbjct: 172 ALTPASGQAPS---LAALDQLIEHNLADFDEGNGGFRGAPKFPPSQDLGFWACHYH---- 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             ++G+    Q + L TLQ MA+GG++D + GGFHRYSVD+ W +PHFEKMLYD  QLA 
Sbjct: 225 --RTGQPKVLQSLSL-TLQKMARGGLYDQLRGGFHRYSVDDVWLIPHFEKMLYDNAQLAR 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+A+ +T DVF + + +  LDY+  +M  P G  ++A+DADS   EG     EG F+V
Sbjct: 282 RYLEAYQITGDVFLAQVAQQTLDYVLAEMTAPEGVFYAAQDADS---EGV----EGRFFV 334

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +++ ++ G + A L    + +   GN +            G +VL    + +  A +
Sbjct: 335 WTPEQIAEVAGAQRAPLICAAFGVTQEGNFE-----------HGASVLHRPQNEAQLAEQ 383

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             + +++  ++L E RR+L+  R +R RPH D+K+I +WN L+IS+ A  S++L      
Sbjct: 384 FSLNMDEMRHVLTEARRRLWQGREQRVRPHRDEKIITAWNALMISALAYGSQVL------ 437

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     D + Y   A +AA FI     + Q  RL   +     +   FLDD+AF I
Sbjct: 438 ----------DNRTYRGAAITAAQFILGR--EAQAGRLLRIWAATDRQGSAFLDDFAFFI 485

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
           + LLDLYE      WL  A+ L    +  F DRE GGYF+T  +   +L+R K   D A 
Sbjct: 486 AALLDLYETDFSPAWLAAAVRLSKEVETSFYDREAGGYFSTPVDHEKLLVRPKNFFDLAI 545

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV V NL+RL         DY+ + A+ +L   +T + +    +  +  A +    
Sbjct: 546 PSGNSVMVHNLIRLHRFT--DNPDYFLR-AQETLTRLQTLMMENPRGLSHLAAATEDFLA 602

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P+   + LVG+ +      MLA  +  Y  ++ ++  DP   E +             AR
Sbjct: 603 PTLA-ITLVGNPTEPALAEMLAVVYRHYLPHRRLVVKDPESCEAL-------LEIVPAAR 654

Query: 679 NNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLEK 713
           +    D +  A VC   +C  PV     L+NLL  +
Sbjct: 655 HYDRIDGRPTAFVCHGQTCQAPVFSAGGLDNLLATR 690


>gi|148379048|ref|YP_001253589.1| hypothetical protein CBO1058 [Clostridium botulinum A str. ATCC
           3502]
 gi|153933571|ref|YP_001383431.1| hypothetical protein CLB_1099 [Clostridium botulinum A str. ATCC
           19397]
 gi|153935757|ref|YP_001386978.1| hypothetical protein CLC_1111 [Clostridium botulinum A str. Hall]
 gi|148288532|emb|CAL82612.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
           3502]
 gi|152929615|gb|ABS35115.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
           19397]
 gi|152931671|gb|ABS37170.1| conserved hypothetical protein [Clostridium botulinum A str. Hall]
          Length = 680

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 249/693 (35%), Positives = 358/693 (51%), Gaps = 72/693 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 53  TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +   A+ L  ++DS++GGFG+ PKFP    I  +L  Y+ KK E
Sbjct: 170 --FQDNHRQGELEEYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKKDE 227

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                        ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 228 KV---------LDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +WT +E+ DILG E   L+ + Y +   GN            F+ KN+   +N       
Sbjct: 332 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVD 379

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                LEK        R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++  
Sbjct: 380 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 430

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF 
Sbjct: 431 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 475

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  L++LYE      +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA
Sbjct: 476 LWALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V+ + L  L  I      D Y+   +     F T +K   M   L    A M +
Sbjct: 536 TPSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYN 591

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +   K + L  +K   DF   +   +  Y     V   D ++        E    N ++ 
Sbjct: 592 ISPVKEITLAYNKKDEDFYKFINEVNNRYIPFSIVTLNDKSN--------EIEKINKNIK 643

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 DK    +CQN++C  P+TD    ++LL
Sbjct: 644 DKIAIKDKTTVYICQNYACREPITDLEEFKSLL 676


>gi|416351321|ref|ZP_11681110.1| thymidylate kinase [Clostridium botulinum C str. Stockholm]
 gi|338196028|gb|EGO88249.1| thymidylate kinase [Clostridium botulinum C str. Stockholm]
          Length = 611

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 243/674 (36%), Positives = 360/674 (53%), Gaps = 75/674 (11%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFEDE VAK+LND ++SIKVDREERPDVD  YMT+ QA+ G GGWPL++ ++P+ K
Sbjct: 1   MEKESFEDEEVAKILNDKYISIKVDREERPDVDNTYMTFCQAVTGSGGWPLTIIMTPEQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP +  YGRPG   IL+++ D W   +D +  +    +  + E +S   S   
Sbjct: 61  PFFAGTYFPKKSMYGRPGIIQILKQISDEWKNNKDKIINTSNKLLNTMKERVSQDKS--- 117

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
             +E+  + L     +++  YD+++GGFG APKFP P ++ ++L + K   D    G   
Sbjct: 118 --EEINGSILHDAIMEMNYYYDNKYGGFGIAPKFPTPHKLMLLLIYYKVYNDKSALG--- 172

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
               MV  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA VY +A+ +
Sbjct: 173 ----MVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYVYTEAYQV 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T   FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW+ +E++ 
Sbjct: 229 TGKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYVWSLEEIQS 281

Query: 328 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 387
           ILGE A  F   Y +   GN            F+GKN+           + +G  LE  +
Sbjct: 282 ILGEDAKEFCNTYDITEKGN------------FEGKNI----------PNLIGKDLEN-I 318

Query: 388 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 447
           + L E R KLF VR KR  P  DDK++ +WN L+I S + A ++                
Sbjct: 319 DKLEELRNKLFKVREKRVHPFKDDKILTAWNALMIVSLSYAGRVF--------------- 363

Query: 448 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 507
            + KEY+  A+ A  FI  +L   +  RL   FR+G +    +L+DY+FL+  L++LYE 
Sbjct: 364 -ENKEYINRAKKAYDFIENNLI-RKDGRLLARFRHGEAAYIAYLEDYSFLVWALMELYEA 421

Query: 508 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 567
              + +L  A+   +   +LF D E  G+F++  +   ++L +K+ +D A PSGNSV+ +
Sbjct: 422 TFESNYLKQALNFTDKMIKLFWDEESYGFFHSGRDGEKLILNLKDSYDTAIPSGNSVTAM 481

Query: 568 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
           NL++L+ I   +      + A      F   +K+   +  +   +      PSR+ +V+ 
Sbjct: 482 NLIKLSKITGDNSLG---EKAYKMFQGFGGNIKESLQSHSIFLISYMNYIKPSRQ-IVIA 537

Query: 628 GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD-KV 686
             K    F+ M+   +  + +  T+I ++  + E          N     ++    D K 
Sbjct: 538 SEKEDRLFKEMIKKVNKRF-MPFTIILLNDGNLE----------NIVPFIKDEKKIDNKT 586

Query: 687 VALVCQNFSCSPPV 700
            A +C+NFSC+ PV
Sbjct: 587 TAYICENFSCNKPV 600


>gi|430746011|ref|YP_007205140.1| thioredoxin domain-containing protein [Singulisphaera acidiphila
           DSM 18658]
 gi|430017731|gb|AGA29445.1| thioredoxin domain protein [Singulisphaera acidiphila DSM 18658]
          Length = 701

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 250/683 (36%), Positives = 364/683 (53%), Gaps = 58/683 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE+   A L+N+ F+++KVDREERPDVD++YM  VQA+   GGWP+S
Sbjct: 66  SACHWCHVMEHESFENADTAALMNEHFINVKVDREERPDVDQIYMAAVQAMTDHGGWPMS 125

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP   GTYFPP D  G PGF  +L  V  AW ++RD +  S     +++    
Sbjct: 126 VFLTPDLKPFYCGTYFPPVDGRGMPGFPRVLYSVHRAWAERRDDILISAGDLTDRIRLMG 185

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
              A+S  L   L   A R     L++S+D+  GGFGSAPKFP P++++++L    +  +
Sbjct: 186 KIPAASGALESVLLDQAAR----GLARSFDTIHGGFGSAPKFPHPMDLKVLLRQHARTRE 241

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                  +   ++V  TL  MA+GGI+D + GGF RYS DERW  PHFEKMLYD   L++
Sbjct: 242 -------AHPLQIVRHTLDKMARGGIYDQLLGGFARYSTDERWLAPHFEKMLYDNALLSS 294

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYL+A  +T D  Y+ + R+ +DY+   M GP GEI+S EDADS   EG    +EG FYV
Sbjct: 295 VYLEAHQVTGDAEYARVARETMDYILERMTGPEGEIYSTEDADS---EG----EEGKFYV 347

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+  EV  ILG E A  F   Y +  +GN            ++ +N+L        +A++
Sbjct: 348 WSLAEVNQILGPERAKEFAAVYDVTESGN------------WEHQNILNLPMSVDQAATR 395

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG    +    L   R +L + R +R  P  D KV+ SWNGL++++ A  S+ILK E   
Sbjct: 396 LGRDERELQADLDRDRARLLEARDRRVPPGKDTKVLTSWNGLMLAALAEGSRILKDE--- 452

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         Y++ A  AA+F+   +   +  RL H++++G ++  G+LDDY+ LI
Sbjct: 453 -------------RYLDAATKAAAFLLDRMRTAEG-RLLHAYKDGRARFNGYLDDYSNLI 498

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL  LYE     +W+  A+EL     + F D E GG+F T      ++ R K+  D A 
Sbjct: 499 DGLTRLYEVSGEPRWIEAALELTAVMIDEFHDAEAGGFFYTGRSHEVLIARQKDFQDNAT 558

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN++    L+RL ++  G +S   R     +L   +  L    MA+     A D    
Sbjct: 559 PSGNAMVATALLRLGALT-GRES--LRTLGRSTLEAVQAYLDRAPMAMGQSLVALDFELA 615

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
             R+  V+ G     +F  ++ A +A +  +K V    PA  E+     E       +A 
Sbjct: 616 SPREFAVIAG-SDPAEFRRVMEAIYAPFLPHKVVA---PALAEKASALAE---TLPLLAD 668

Query: 679 NNFSADKVVALVCQNFSCSPPVT 701
                D+    +C+ F+C  PV 
Sbjct: 669 RPAQDDRTTTYICERFTCHAPVV 691


>gi|410721128|ref|ZP_11360472.1| N-acylglucosamine 2-epimerase [Methanobacterium sp. Maddingley
           MBC34]
 gi|410599579|gb|EKQ54125.1| N-acylglucosamine 2-epimerase [Methanobacterium sp. Maddingley
           MBC34]
          Length = 708

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 264/716 (36%), Positives = 370/716 (51%), Gaps = 59/716 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVM  ESF+D  +  LLN  FV +KVDREERPD
Sbjct: 44  GDEAFDKAKKEDKPIFLSIGYSTCHWCHVMARESFQDPEIGDLLNQVFVPVKVDREERPD 103

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYMT  Q + G GGWPL++ ++PDLKP   GTYFP +      G + ++  V D W+
Sbjct: 104 IDSVYMTVCQMITGSGGWPLTIIMTPDLKPFFAGTYFPKDTGPRGTGLRDLILNVHDLWE 163

Query: 119 KKRDMLAQSG---AFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
            KR+ L +S      +++Q+S       S +K  ++L    L    +   +++D  + GF
Sbjct: 164 NKREDLLKSAEDLTLSLQQISH-----RSPDKSGEQLNDGILNQTYQSQLENFDQEYAGF 218

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           G+  KFP P  +  +L + K       +GE  E   MV  TL  M KGGI+DHVG GFHR
Sbjct: 219 GTNQKFPTPHHLLFLLRYWK------HTGE-DEALTMVEKTLDAMRKGGIYDHVGFGFHR 271

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           Y+VD +W VPHFEKMLYDQ  L   Y +AF  T    Y     ++L+YL RDM  P    
Sbjct: 272 YTVDRKWVVPHFEKMLYDQALLVIAYTEAFQATGKTKYRETAEEVLEYLLRDMRSPEDGF 331

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMS 354
           +SAEDADS   EG    +EG FY+WT  E+ +ILG E   LF   Y +   GN       
Sbjct: 332 YSAEDADS---EG----EEGKFYLWTLDEIINILGPEEGELFSRVYSVSENGNFK----D 380

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
           +   E  GKN+L         + KL M  E+        R  LF  R  R  PH DDK++
Sbjct: 381 EATGEKTGKNILHRSQTWDELSKKLEMSPEELWWKTESARETLFQAREGRVHPHKDDKIL 440

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
             WNGLVI + A A K+                  R++Y+  A  A +FI   +   Q  
Sbjct: 441 TDWNGLVIVALALAGKVFG----------------REDYLLAATEAVNFIMTKI--NQQG 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
           RL H +R+G +   G LDDYA+LI GLL+LY+    +++L  A++L  T  E F D + G
Sbjct: 483 RLHHRWRDGEAAVDGNLDDYAYLIWGLLELYQATFNSEYLKTALKLNQTILEHFWDHDNG 542

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           G++ T+   P +L+R KE +D A PSGNSV ++NL +L  I      D + +   ++L  
Sbjct: 543 GFYFTSDYAPEILVRQKEAYDTALPSGNSVMMMNLEKLYLIT----EDIHIREISNALEK 598

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           + + + + + +   M  +A +L       + + G K S D + ML A +  Y  N  +I 
Sbjct: 599 YFSPMIEQSPSAFTMFLSAIILKRGPSFKIAITGEKDSADTKAMLNALYKKYLPNCMLI- 657

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +  +D   ++   E +  N  M  NN    K  A VC N +C  PV  P  L NLL
Sbjct: 658 LRSSDDAMINQIIESSETNIMM--NN----KATAYVCGNGTCHAPVNTPEDLVNLL 707


>gi|387817346|ref|YP_005677690.1| hypothetical protein H04402_01136 [Clostridium botulinum H04402
           065]
 gi|322805387|emb|CBZ02951.1| hypothetical protein H04402_01136 [Clostridium botulinum H04402
           065]
          Length = 680

 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 249/693 (35%), Positives = 358/693 (51%), Gaps = 72/693 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VAK+LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 53  TCHWCHVMERESFEDEEVAKVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +   A+ L  ++DS++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 170 --FQDNHREGELEEYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKK-- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +   +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +WT +E+ DILG E   L+ + Y +   GN            F+ KN+   +N       
Sbjct: 332 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVD 379

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                LEK        R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++  
Sbjct: 380 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 430

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF 
Sbjct: 431 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 475

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  L++LYE      +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA
Sbjct: 476 LWALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V+ + L  L  I      D Y+   +     F T +K   M   L    A M +
Sbjct: 536 TPSGNAVAALTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYN 591

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +   K + L  ++   DF   +   +  Y     V   D ++        E    N ++ 
Sbjct: 592 ISPVKEITLAYNEKDEDFYKFINEVNNRYIPFSIVTVNDKSN--------EIEKINKNIK 643

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 DK    +CQN++C  P+TD    ++LL
Sbjct: 644 DKIAIKDKSTVYICQNYACREPITDLEEFKSLL 676


>gi|373458119|ref|ZP_09549886.1| hypothetical protein Calab_1940 [Caldithrix abyssi DSM 13497]
 gi|371719783|gb|EHO41554.1| hypothetical protein Calab_1940 [Caldithrix abyssi DSM 13497]
          Length = 684

 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 260/702 (37%), Positives = 369/702 (52%), Gaps = 82/702 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE  A+L+N  FV+IKVDREERPD+D+ YM +VQ L G GGWPL+
Sbjct: 51  SACHWCHVMEKESFEDEETAQLMNRLFVNIKVDREERPDIDQHYMEFVQTLTGSGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P  GGTYFPPED+YG+P FK +L  V + + K R  L ++    ++++ E +
Sbjct: 111 VFLTPDGEPFYGGTYFPPEDRYGKPAFKKLLVMVSEYYHKNRQQLEEN----LDKIREIM 166

Query: 140 SASASSNK---LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           +      K   +PD     A     ++L++ YD+  GG G APKFP    +Q+     +K
Sbjct: 167 ARQRREIKGRHIPDT---EAWNQAVQRLTQFYDALNGGMGQAPKFP---AVQVFSLFLRK 220

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
               G      +  +M   TLQ MA GGI+D +GGGF RY+VDE+W VPHFEKMLYD  Q
Sbjct: 221 FAHHGD----KQFLRMAEHTLQRMANGGIYDQLGGGFARYAVDEKWRVPHFEKMLYDNAQ 276

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           LA++Y+DA+ LT++ FY  I R+ L+++RR++  P G  +S+ DADS   EG    +EG 
Sbjct: 277 LASLYIDAYRLTQNPFYLQIARETLEFVRRELTDPDGGFYSSLDADS---EG----QEGK 329

Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FY+W+  E+  ILG E   LF   + +   GN            F+G N+L         
Sbjct: 330 FYLWSKDEILKILGDETGRLFCARFGVTDGGN------------FEGSNILFVSKSFDEL 377

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A++     E+   ++ + R+K+   R +R RP LD K + SWNGL++S+FA A ++  + 
Sbjct: 378 AAEFKKTPEEIEALIRQARKKMLAEREQRIRPGLDYKALTSWNGLMLSAFAAAYQVTLNP 437

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y  V +    F+RR+LY  Q+ RL H +  G SK   F+DDYA
Sbjct: 438 T----------------YAAVIDKNIDFVRRNLY--QSGRLLHVYSKGQSKIDAFVDDYA 479

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY-FNTTGEDPSVLLRVKEDH 554
           +LI GLLD YE      +L  A+EL    ++LF D+  GGY F  TG+D +     K + 
Sbjct: 480 YLIQGLLDAYEALFDEHYLQMAVELTRRANDLFWDKRHGGYFFEATGKDQAK-RHFKSET 538

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D ++PS  +V + N +RL           Y Q AE  +  +  +  +   A      A D
Sbjct: 539 DASQPSPTAVMLHNQLRLFHFTG---EQLYLQTAEQLMRKYGQKALENPYAFASFLNALD 595

Query: 615 M-LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             LS P     +L+  K    F+       + Y  NK V+              +  S+ 
Sbjct: 596 FYLSQPLE---ILILKKDQQRFDAFQKLIFSRYLPNKVVL-------------VQTASSK 639

Query: 674 ASMARNNFSA-----DKVVALVCQNFSCSPPVTDPISLENLL 710
           ASM R           K  A VC   SCS PVT    L+ +L
Sbjct: 640 ASMGRPLLQGRESMEGKTTAFVCHGQSCSLPVTTVDGLKQIL 681


>gi|326203005|ref|ZP_08192872.1| glycoside hydrolase family 76 [Clostridium papyrosolvens DSM 2782]
 gi|325987082|gb|EGD47911.1| glycoside hydrolase family 76 [Clostridium papyrosolvens DSM 2782]
          Length = 672

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 260/693 (37%), Positives = 369/693 (53%), Gaps = 76/693 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA +LN  F+ IKVDREERPD+D +YM+  Q L G GGWPL+
Sbjct: 53  STCHWCHVMERESFEDEEVAHILNRDFICIKVDREERPDIDSIYMSVCQTLTGHGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFP ++  G  G  ++L  VK+AWD KR+ L +S    IE +S   
Sbjct: 113 VFLTPDRQPFYAGTYFPKDNSKGSIGLMSLLDSVKEAWDLKRESLLESAKNIIEHVSHEE 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S+  +       + ++ +    +    ++D ++GGFG++PKFP P  +  +L    +   
Sbjct: 173 SSDETI------ISKDIIHEAFKHFKYNFDIKYGGFGTSPKFPSPHTLLFLL----RYWY 222

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T K   A E   MV  TL+ M  GGI DH+G GF RYS D++W VPHFEKMLYD   LA 
Sbjct: 223 TEKEPFALE---MVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALLAI 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+S T +  Y    R ILDY++RDM    G  +SAEDADS   EG     EG FY+
Sbjct: 280 AYGEAYSATGNKNYEETSRQILDYVQRDMSSQLGAFYSAEDADS---EGF----EGKFYI 332

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASAS 377
           W+ +EV  +LG+     KE+        C+L  ++ P   F+G N+  LIE    S    
Sbjct: 333 WSQEEVMKVLGQKD--GKEY--------CNLFDIT-PSGNFEGLNIPNLIETGALSQQQK 381

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                         ECR+KLF+ R KR  P+ DDKV+ SWNGL+I++ A   +I      
Sbjct: 382 SFA----------EECRKKLFNHREKRVHPYKDDKVLTSWNGLMIAAMAYCGRIF----- 426

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                    G +R  Y+E A+    FI + L      RL   +R+G +  P +L+DYAFL
Sbjct: 427 ---------GEER--YIETAKRCVDFIYKKLI-RTDGRLLARYRDGEAMFPAYLEDYAFL 474

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           + GLL+LYE    T +L  A++L +    LF +      F    +   ++ R +E +DGA
Sbjct: 475 VWGLLELYEATFTTIYLKRALKLTDAMLNLFGENNSAALFLYGHDSEQLISRPRESYDGA 534

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ +NL+RLA I    +   Y   A+  +  F  ++K        M  ++ M S
Sbjct: 535 IPSGNSVAAMNLLRLARITGHHE---YENRAKAIMDFFNNQVKAAPTGHSYM-LSSYMYS 590

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           V      +++  ++S +  + L   +  + +  T+ +I P  TE   F  ++ S N    
Sbjct: 591 VSDNSSEIVITGENSKEMVDTLNRKYLPFAV--TISNISPELTEIAPFVGDYKSQNG--- 645

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                  K  A VC+NFSC  PVT P  L  +L
Sbjct: 646 -------KTAAYVCRNFSCMEPVTQPEKLSEVL 671


>gi|15607089|ref|NP_214471.1| hypothetical protein aq_2146 [Aquifex aeolicus VF5]
 gi|2984353|gb|AAC07873.1| hypothetical protein aq_2146 [Aquifex aeolicus VF5]
          Length = 692

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 247/704 (35%), Positives = 370/704 (52%), Gaps = 64/704 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFED  +A++LN++FV IKVDREERPD
Sbjct: 30  GEEAFKKAKEEDKPIFLSIGYSTCHWCHVMEKESFEDPEIAEILNNYFVPIKVDREERPD 89

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD  YM+  QA+ G GGWPL++ ++PD +P   GTY P E  +GRPG + +L  +++ W+
Sbjct: 90  VDAFYMSVCQAMTGTGGWPLTIIMTPDKEPFFAGTYIPKEGMFGRPGLRDLLLTIRELWE 149

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K R  +  +    ++ L EA   +  +     ++ +  +     +L  SYD  FGGFGSA
Sbjct: 150 KDRTKILNTAKHLVKALQEASRETQKA-----QIGEETIHRAFSELFSSYDEHFGGFGSA 204

Query: 179 PKFPRPVEIQMM--LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
           PKFP P  +  +   Y+  K E         +  KM+  TL  M  GGI+DHVG GFHRY
Sbjct: 205 PKFPTPHNLMFLGRYYYRYKRE---------QALKMIEKTLTNMRMGGIYDHVGFGFHRY 255

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
           S D  W +PHFEKMLYDQ  L   Y + + L K   +     +I+D+L+RDM+ P G  +
Sbjct: 256 STDREWILPHFEKMLYDQAMLLFAYTEGYQLLKKDLFKQTVYEIVDFLKRDMLSPEGAFY 315

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSD 355
           SA DADS   EG    +EG FY W+ +E++++L  E   L  + + L   GN     + +
Sbjct: 316 SAWDADS---EG----EEGKFYTWSFEELKEVLDPEELELAVKVFNLSQEGNY----LEE 364

Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
                 G+NVL         A +LG+  ++    L   R+KLF+ R KR +P  D+K++ 
Sbjct: 365 ATKVKTGRNVLYIGKSYEELAKELGISEKELKEKLERIRKKLFEAREKRVKPLRDEKILT 424

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
            WNGL I++ + A K+                   KE++++A+ AA F+ +++  E    
Sbjct: 425 DWNGLTIAALSYAGKVF----------------GEKEWIDLAKGAADFVLKNMRTENG-L 467

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L H +  G +K  GFL+DYA+ I GL++LYE    +K+L   I+LQ  Q + F D+E GG
Sbjct: 468 LLHRYMEGEAKYWGFLEDYAYFIWGLMELYEATLDSKYLEEVIKLQEIQIKHFWDKENGG 527

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           +F T      + +R KE +DGA PSGNSVS  NL+RL  +++ S+   Y +    +L  F
Sbjct: 528 FFQTPDFFTEIPVRKKEVYDGAIPSGNSVSAYNLIRLGRLISRSE---YEKYGTKTLEAF 584

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
              + +   A      A D++ V   K +V+V    S  + N+ A     Y  +  ++  
Sbjct: 585 SWEIANFPSAHTFSIIALDLI-VNGTKELVIVPTDDS--WRNLKAQLDKEYLPDLLILKK 641

Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPP 699
           D           E  S N    +      K    +C+N++C  P
Sbjct: 642 DKVI--------EKLSENLEQMKP--VEGKTTYYLCRNYTCESP 675


>gi|440792869|gb|ELR14077.1| Hypothetical protein ACA1_367000 [Acanthamoeba castellanii str.
           Neff]
          Length = 865

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 255/689 (37%), Positives = 353/689 (51%), Gaps = 104/689 (15%)

Query: 36  EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYF 95
           E +++LLND FVSIKVDREERPDVD++YMTYV A  G GGWPLSVFL+PDLKPL+GGTYF
Sbjct: 265 EKISRLLNDNFVSIKVDREERPDVDRLYMTYVTATTGHGGWPLSVFLTPDLKPLVGGTYF 324

Query: 96  PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS-ASASSNKLPDELPQ 154
           PP  KYGRPGF T++  V   W +K+D L          L E ++ A      + D+  +
Sbjct: 325 PPTSKYGRPGFDTLIHNVDKVWREKQDQLKAEADNTAHALQEYMTVAGKEVEGIDDDSIE 384

Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMV 213
            A     + L++SYD   GGF  APKFPR   +  +   +  + E    + +A++   M 
Sbjct: 385 IAYDAALKSLAESYDEEHGGFTRAPKFPRLATLNFLFRVYGHRKEGLELNEKATKAMDMA 444

Query: 214 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 273
           L TL  MA+GGI+DH+G           W VPHFEKMLYDQ QL   YL A+ +T +  +
Sbjct: 445 LVTLTKMARGGIYDHIGN----------WLVPHFEKMLYDQSQLTMAYLSAYQITDEPVF 494

Query: 274 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 332
           + +  D+L+Y+   +  P G  +SAEDADS  +  +  K EGAFYVW   EV   LGE  
Sbjct: 495 ADVAEDVLEYVTTKITSPEGAFYSAEDADSLVSPDSDEKVEGAFYVWEYDEVIKALGEQD 554

Query: 333 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 392
             +F   Y + P GN  +   +D   E K KNVL E   +  +A + G  ++    +  E
Sbjct: 555 GKIFAHRYGVLPEGN--VPAPADIQGELKHKNVLAEKLTAEETALEFGFKVDYVDKLTME 612

Query: 393 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 452
            + KL   R KRPRPHLDDK+I SWNGL+IS++ARAS++L                  K 
Sbjct: 613 SKAKLKHERDKRPRPHLDDKIITSWNGLMISAYARASEVLGD----------------KR 656

Query: 453 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 512
           Y E A   A FIR  LYD+Q                                       +
Sbjct: 657 YAESASKCAQFIRDQLYDDQ---------------------------------------E 677

Query: 513 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 572
            ++WA +               GYFNT  +DPS+L RV++D DGAEPS NS+S +NLVRL
Sbjct: 678 AILWARQ--------------RGYFNTVKDDPSLLARVRDDQDGAEPSSNSISAMNLVRL 723

Query: 573 ASIVAGSKSDYYRQNAEHSLA------VFETRL-----KDMAMAVPLMCCAADMLSVPSR 621
             +     SD + + AE + +      +   RL     KD  + VP M C+ D  S  + 
Sbjct: 724 WHMTG---SDDWYKKAEATFSSCKGPIITPLRLTVCPAKDAPLMVPQMLCSLD-FSRATA 779

Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNF 681
           K +V+ G  ++ D   +L    + +  N+ +++ D    E  DF   + +    M   + 
Sbjct: 780 KQIVIAGDPNAEDTAALLKEVRSQFIPNRVLLYAD--GREGQDFLSSYRALIKDMKPIDG 837

Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLL 710
           +A    A VC+NF+C  P   P  L + L
Sbjct: 838 AA---TAYVCENFTCKLPTNKPEKLRDAL 863


>gi|269836164|ref|YP_003318392.1| hypothetical protein Sthe_0131 [Sphaerobacter thermophilus DSM
           20745]
 gi|269785427|gb|ACZ37570.1| protein of unknown function DUF255 [Sphaerobacter thermophilus DSM
           20745]
          Length = 685

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 255/686 (37%), Positives = 363/686 (52%), Gaps = 68/686 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME ESFE+  +A L+N  F++IKVDREERPD+D VYM   Q + G GGWPL++
Sbjct: 49  ACHWCHVMERESFENPDIAALMNQHFINIKVDREERPDLDTVYMAAAQMMTGQGGWPLTI 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL PD KP   GTYFPPED+ G PGF  +L  V +A+  +R  L ++       L+E   
Sbjct: 109 FLMPDGKPFYAGTYFPPEDRSGMPGFPRVLLAVAEAYRNRRADLERAANDIQGHLTEHFR 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLED 199
            S     +   L    L   A  L++ +D   GGFG APKFP P+ ++ +L Y  +   D
Sbjct: 169 WSLPETAITPAL----LNEAASGLARQFDEANGGFGGAPKFPPPMALEFLLRYRLRTGSD 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T          ++V  TL+ MA+GGIHD VGGGFHRY+VD  W VPHFEKMLYD   LA 
Sbjct: 225 TAL--------RIVELTLERMARGGIHDQVGGGFHRYAVDATWLVPHFEKMLYDNALLAR 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y   +  T   FY+    D ++Y+ R+M  P G  +S +DADS   EG    +EG FYV
Sbjct: 277 LYTLTYQATGHPFYAATALDTIEYVLREMTSPDGGFYSTQDADS---EG----EEGKFYV 329

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +E+E +LG E A +   +Y + P GN            F+GK++L       + A+ 
Sbjct: 330 WTPEELEAVLGPEQAPIVARYYGVHPGGN------------FEGKSILHVPEAPESVAAA 377

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             + +++ + I+G  R KL+  R++R  P  D+K++  WNGL++ + A+A+  L      
Sbjct: 378 FDLTIDELVEIIGPAREKLYAARAQRVWPGRDEKILTDWNGLMLRALAQAAIALG----- 432

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                      R +  + A   A+F+  HLY  +  RL HS+++G +K  G+L DYA LI
Sbjct: 433 -----------RSDLRDAAVRNATFLHTHLY--RDGRLLHSYKDGEAKITGYLADYASLI 479

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
           +GLL LYE     +W+ WA +L +     F D EGG +F+T+ +D  ++ R K+  D A 
Sbjct: 480 AGLLALYEATFDVRWIAWARDLTDRAIADFWDNEGGAFFDTSADDAPLVARPKDAFDSAT 539

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---MCCAADM 615
           PSGNS+   +L+RL  +      D YRQ A   + V E R   +A   P        A  
Sbjct: 540 PSGNSLMAESLLRLGLL---LGEDDYRQRA---MTVLE-RFAALAAKAPTGFGQLLCAAD 592

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L++     + LVG         MLA     Y L   V+ +   D +  D  E        
Sbjct: 593 LALAEAHEIALVGDPQVPAMAEMLAVVQQPY-LPHQVVALRHPDQDGED--EVIPLLAGR 649

Query: 676 MARNNFSADKVVALVCQNFSCSPPVT 701
            AR+     +  A VC+N++C  PVT
Sbjct: 650 TARDG----QPTAYVCRNYACRQPVT 671


>gi|406878261|gb|EKD27217.1| hypothetical protein ACD_79C00804G0001 [uncultured bacterium]
          Length = 713

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 249/701 (35%), Positives = 370/701 (52%), Gaps = 66/701 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESF  + +A +LN  F+SIKVDREERPD+D VYM  VQ + G GGWPL+V
Sbjct: 55  TCHWCHVMEEESFSGKTIADILNRDFISIKVDREERPDIDSVYMNAVQKMTGSGGWPLNV 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++PD K   GGTYF PE        K IL  ++D W  KR+ + +     +  ++E   
Sbjct: 115 FITPDKKIFYGGTYFAPEQ------LKIILSSIEDLWKNKREKILKPSEELMNLMNEETL 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A   + ++ D +   A      Q    YDS +GGFG+ PKFP       +L +  + ++ 
Sbjct: 169 ARNHTTEVSDVVFNTAFEFLLSQ----YDSMYGGFGTFPKFPSSQTFSFLLRYYYRTKN- 223

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     +MV  ++  +  GGI+D +G G HRYS D++W +PHFEKMLYDQ  +  V
Sbjct: 224 ------KTALEMVKNSISHILDGGIYDQLGSGIHRYSTDQKWFLPHFEKMLYDQALITKV 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAFYV 319
           +L+ + +T++  Y+   RDIL+++ R+M  P G  +SA DADS    E + +K EGAFY+
Sbjct: 278 FLEIYQITREEKYAEAARDILEFVLREMTSPEGVFYSALDADSFNNDENSVKKTEGAFYI 337

Query: 320 WTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W  KE+  ILG     +F  +Y ++  GN      +D H EF  KNVL   N+ + +A  
Sbjct: 338 WEKKEIIRILGNKTGEIFCYYYGIQEDGNVS----NDSHGEFIRKNVLAVSNNLTNTAKH 393

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             M  ++  N L    + LF  R KRP+P LDDK++  WN L+IS+FA+   IL      
Sbjct: 394 FNMQHKEIENELNRSHQLLFHSREKRPKPFLDDKILTDWNALMISAFAKGGLIL------ 447

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     +   Y+  + ++A+F+   L  E+   L H +R+  +  PGFLDDYAF I
Sbjct: 448 ----------NEPRYVNASINSANFVLSRLKTEKG-TLLHRYRDQIAGIPGFLDDYAFFI 496

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT-TGEDPSVLLRVKEDHDGA 557
           + LLDLYE      +L  A+ L +   ELF D+  GG+F T  G +  +  R+KE +DGA
Sbjct: 497 NSLLDLYEATFEGIYLKEALALNDKMLELFEDKVNGGFFLTAVGTETILQNRIKEFYDGA 556

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNS+++INL++L+ I   ++ +  +Q+++ S+      L     A  LM   A   S
Sbjct: 557 YPSGNSIALINLIKLSRI---TQKNILKQSSKKSIDFISEALSKFPTAY-LMSLIALNNS 612

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +     +V+V + S        + +  +Y     +IH          F   HN N   + 
Sbjct: 613 LEPENEIVIVSNDSKDS-----SVSQINY-----LIHRFYLSGWSFLF---HNMNENDII 659

Query: 678 -------RN-NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                  RN    +DK    VC++  C PP+TD    + +L
Sbjct: 660 LSIVPRIRNYALISDKTTIYVCKDNICQPPITDIGRFQEIL 700


>gi|308069056|ref|YP_003870661.1| hypothetical protein PPE_02290 [Paenibacillus polymyxa E681]
 gi|305858335|gb|ADM70123.1| Conserved hypothetical protein [Paenibacillus polymyxa E681]
          Length = 688

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 257/696 (36%), Positives = 358/696 (51%), Gaps = 66/696 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE VA++LN  +VSIKVDREERPDVD +YM+  Q + G GGWPL+
Sbjct: 53  STCHWCHVMGRESFEDEEVAEVLNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEA 138
           + ++PD KP   GTY P E K+GR G   +L KV   W ++ + L + S     E   + 
Sbjct: 113 ILMTPDQKPFFAGTYLPKEQKFGRVGLLELLDKVGTRWKEQPEELVELSEQVLTEHERQD 172

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L A         EL + +L     + S ++D  +GGFG APKFP P  +  +L +++   
Sbjct: 173 LLAGYRG-----ELDEQSLNKAFHEYSHTFDKEYGGFGEAPKFPSPHNLSFLLRYAQH-- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG      +  +M   TL  M++GGI+DH+G GF RYSVDE+W VPHFEKMLYD   LA
Sbjct: 226 -TGN----QQALEMAEKTLDAMSRGGIYDHIGMGFSRYSVDEKWLVPHFEKMLYDNALLA 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+ +T    Y  I   I  YL RDM   GG  +SAEDADS   EG    +EG FY
Sbjct: 281 IAYTEAWQMTGKELYRRITEQIFTYLARDMTDAGGAFYSAEDADS---EG----EEGRFY 333

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
           VW   EV  +LG E A  F + Y + P GN            F+G N+  LI++N   A 
Sbjct: 334 VWDDSEVRAVLGDEDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAY 380

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
             K  +  ++    + E R KLF  R +R  PH DDK++ SWNGL+I++ A+A +     
Sbjct: 381 GIKHDLTEQELEQRVSELRAKLFAAREQRVHPHKDDKILTSWNGLMIAALAKAGQ----- 435

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                      G  R  Y E A  A +F+  HL  E   RL   +R+G +  PG++DDY 
Sbjct: 436 ---------AFGDMR--YTEQARKAETFLWNHLRQENG-RLLARYRDGEAAYPGYVDDYV 483

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F + GL++LY+      +L  A+ L     +LF D E  G F    +   ++ + KE  D
Sbjct: 484 FYVWGLIELYQATFDIVYLQRALTLNQNMIDLFWDEERDGLFFYGSDSEQLIAKPKEIDD 543

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNS++  N VRLA +   S+ + Y   A      F   +         +  A  +
Sbjct: 544 GAIPSGNSIAAYNFVRLARLTGESRLENY---AAKQFKAFGGMVAHYPSGHSALLSAL-L 599

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
            +  + K +V+VGH+        + A  A +  N  VI  D   +E         +   S
Sbjct: 600 YATGTTKEIVIVGHRDDPQTGQFIRAVRAGFRPNTVVILKDEGQSE--------IAETVS 651

Query: 676 MARN-NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             R+ +    K    VC++F+C  PVT    L+ LL
Sbjct: 652 YIRDYDLVEGKPAVYVCEHFTCQAPVTRLEDLKVLL 687


>gi|403068246|ref|ZP_10909578.1| hypothetical protein ONdio_01469 [Oceanobacillus sp. Ndiop]
          Length = 685

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 268/720 (37%), Positives = 372/720 (51%), Gaps = 77/720 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F       +  FL    +TCHWCHVM  ESFED  VA+LLN  ++SIKVDREERPD
Sbjct: 31  GKEAFERAKLENKPIFLSIGYSTCHWCHVMAHESFEDPEVAELLNAHYISIKVDREERPD 90

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYM   Q + G GGWPL++ ++PD  P   GTYFP E K+G PG    L ++   + 
Sbjct: 91  IDSVYMKVCQMMTGHGGWPLTIMMTPDKVPFYAGTYFPKESKHGMPGILEALSQLHKKYT 150

Query: 119 KKRDMLAQSGAFAIEQLSEALSASA---SSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
           K  D +A+      E ++ AL  S    S N+L  E  + A R    QL+K++D  +GGF
Sbjct: 151 KDPDHIAE----VTESVTAALQKSVTEKSENRLTSESTEKAYR----QLAKNFDFSYGGF 202

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           G APKFP+P  +  +L H     +T          KMV  TLQ MA GGI DH+G GF R
Sbjct: 203 GPAPKFPQPQNLFFLLKHYHFTGNTS-------ALKMVESTLQSMASGGIWDHIGYGFSR 255

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           YS DE+W VPHFEKMLYD   L  VY + + +TK+ FY  I   I+ ++ R+M    G  
Sbjct: 256 YSTDEKWLVPHFEKMLYDNALLLMVYTECYQITKNPFYRQISEQIIAFVSREMTSSDGAF 315

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMS 354
           +SA DADS   EG     EG +YVW ++E+ D+LGE    L+ + Y + P GN       
Sbjct: 316 YSAIDADS---EGI----EGKYYVWRNEEIYDVLGEELGELYSDIYGITPFGN------- 361

Query: 355 DPHNEFKGKNVLIELNDS-SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
                F+GKN+   +N S   +A   GM L    + L   R KL   R KR  PH+DDKV
Sbjct: 362 -----FEGKNIPNLINTSLEKTAKDNGMSLANLHSHLETARSKLLLAREKRTYPHVDDKV 416

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
           + +WNGL++++ A+A K L ++                 Y+E A  A  FI + LY  Q 
Sbjct: 417 LTAWNGLMVAALAKAGKALANDT----------------YIEKANRAIQFIEKKLY--QG 458

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
           +RL   FR+G +K   ++DDYAFL+ G ++LYE    T++L  A+ L     ELF D   
Sbjct: 459 NRLMARFRDGEAKFKAYIDDYAFLLWGYIELYEATYSTEYLQKAMALIEQMTELFWDEAN 518

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           GG++    +   ++ + KE +DGA PSGNS + + L R+A +   +    Y    E    
Sbjct: 519 GGFYFNGKDSEELISKEKEIYDGAIPSGNSTAALMLTRMAYLTGETA---YLDKTEEMYF 575

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            F       A A      +  +   P+ K VV++G       + +LA    +Y  N TV+
Sbjct: 576 TFYEDTHQYASASAFFMQSLFVTENPA-KEVVILGRSDDPARQKLLAKLQEAYIPNVTVL 634

Query: 654 HID--PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPIS-LENLL 710
             D   A      F  E+   N          D     VC+NF+C  P TD  S L+N+L
Sbjct: 635 AADHPSAFAVVAPFAAEYKQLN----------DSTTIYVCENFTCQQPTTDIDSALKNIL 684


>gi|237755775|ref|ZP_04584378.1| thymidylate kinase [Sulfurihydrogenibium yellowstonense SS-5]
 gi|237692063|gb|EEP61068.1| thymidylate kinase [Sulfurihydrogenibium yellowstonense SS-5]
          Length = 686

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 250/687 (36%), Positives = 353/687 (51%), Gaps = 65/687 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME ESFEDE VAK+LN+ +VSIKVDREERPD+D +YM       G GGWPL+
Sbjct: 51  SSCHWCHVMEKESFEDEEVAKILNENYVSIKVDREERPDIDSIYMNVCLMFNGSGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTYFP   + GR G   +L  V + W   ++ L Q     IE L +  
Sbjct: 111 IIMTPDKKPFFAGTYFPKYSRPGRIGLVDLLTSVAEYWKNNKEDLIQRAEKVIEYLKDDF 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKK 196
                   + DE+ ++ +  C   L   +D  +GGF   PKFP P  I  +L   YH+K+
Sbjct: 171 KG------IYDEISKDIIDACYFDLKSRFDREYGGFSIKPKFPTPHNIMFLLRYYYHTKE 224

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                     +E  KM   TL  M  GG++DH+G GFHRYS D  W +PHFEKMLYDQ  
Sbjct: 225 ----------TEALKMAEKTLINMRLGGMYDHIGFGFHRYSTDREWLLPHFEKMLYDQAM 274

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+ LTK+ FY    ++ + Y+ RDM    G  +S+EDADS   EG    +EG 
Sbjct: 275 LTMAYTEAYQLTKNNFYKKTAQETITYVLRDMTSKEGVFYSSEDADS---EG----EEGK 327

Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FY WT  E++++L +  + L  + + +K  GN     + +      G+N+L         
Sbjct: 328 FYTWTIDELKEVLNDEELSLVIKVFNVKEEGN----YLEEATGHLTGRNILYLKKPIREL 383

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+ L M  ++    L E RRKLFD R KR  P  DDKV+  WNGL+IS+ A+A K     
Sbjct: 384 ANDLNMNQDQLEAKLEEIRRKLFDAREKRVHPQKDDKVLTDWNGLMISALAKAGK----- 438

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                      G + K+ +E A+ AA FI   ++   T  L H +++G  K  G LDDY 
Sbjct: 439 -----------GFEDKDLIEKAKVAADFILNTMFKNDT--LYHLYKDGEIKVEGLLDDYT 485

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F   GL++L E     K+L  A++L +   E F D E GG+F +      V++R KE  D
Sbjct: 486 FFSWGLIELCEATGDIKYLKSALKLTDLMIEKFYDFENGGFFLSPKNSKDVIVRPKEAFD 545

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSVS  NL RL  I    K   Y   A  +L  F   +K +     +      +
Sbjct: 546 GAIPSGNSVSAYNLYRLYLISGNEK---YYNFAIETLKAFGGEIKRLPSYHSMFNIVLML 602

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           +  P+ + VVL G     + E +L   +  +  NK ++ ++  +       E+       
Sbjct: 603 VFYPTSE-VVLAG-----NCEKVLDKINTEFIPNKAIVFLNREN-------EKQIKELIP 649

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
              N   +D+    VC+NFSC+ P  D
Sbjct: 650 YTNNMILSDECDIYVCKNFSCNLPTKD 676


>gi|423680595|ref|ZP_17655434.1| hypothetical protein MUY_00405 [Bacillus licheniformis WX-02]
 gi|383441701|gb|EID49410.1| hypothetical protein MUY_00405 [Bacillus licheniformis WX-02]
          Length = 681

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 260/691 (37%), Positives = 368/691 (53%), Gaps = 75/691 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE VAKLLN+ FVSIKVDREERPDVD +YMT  Q + G GGWPL+
Sbjct: 49  STCHWCHVMAHESFEDEEVAKLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLN 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD KP   GTYFP   ++ RPGF  +++++ D + K R+ +        E+ +  L
Sbjct: 109 VFLTPDQKPFYAGTYFPKTSRFNRPGFVEVVKQLSDTFAKNREHVEDIA----EKAANNL 164

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLE 198
              A S+   D L ++ LR   +QL  S+D+ +GGFGSAPKFP P  +  +L YH     
Sbjct: 165 RIKAKSDA-GDSLGEDILRRTYQQLINSFDAAYGGFGSAPKFPIPHMLTFLLRYHQ---- 219

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
               SGE +     V+ TL  MA GGI+DHVG GF RYS D+ W VPHFEKMLYD   L 
Sbjct: 220 ---YSGEEN-ALYSVMKTLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNALLL 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+ +TK+  Y  I   I+ ++RR+M    G  +SA DAD   TEG     EG +Y
Sbjct: 276 IAYTEAYQITKNERYKQISEQIITFVRREMTDEKGAFYSALDAD---TEGV----EGKYY 328

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN----VLIELNDSS 373
           VW+ +EV + LG E   L+   Y +   GN            F+G N    +   L D  
Sbjct: 329 VWSKEEVLETLGDELGELYCAVYNITQEGN------------FEGHNIPNLIYTRLEDIK 376

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
               +  +  E+  N L E R KLF+ R +R  PH+DDKV+ SWN L+I+  A+A+K+  
Sbjct: 377 ---DEFALTDEELQNKLEEARTKLFEKRQERTYPHVDDKVLTSWNALMIAGLAKAAKV-- 431

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                  +N P       EY+E+A +AA FI   L   Q  R+   +R+G  K  GF+DD
Sbjct: 432 -------YNAP-------EYLEMARAAAEFIENKLI--QDGRIMVRYRDGEVKNKGFIDD 475

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAFL+   ++LYE       L  A +L+     LF D E GG++ T  +  ++++R KE 
Sbjct: 476 YAFLLWAYIELYEASLDLTDLRKAKKLEADMKGLFWDEEHGGFYFTGSDAEALIVRDKEV 535

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           +DGA PSGN V  + L RL  +  G  S      A    A F   +              
Sbjct: 536 YDGALPSGNGVLAVQLSRLGRLT-GDLS--LHDQAAKMFAAFHGDVSAYPSGHTNFLQGL 592

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE--MDFWEEHNS 671
               +P +K +V++G ++  D + +++A   ++  N  V+  +  D  +   DF  E+ +
Sbjct: 593 LSQFMP-QKEIVVLGKRNDPDRQKIVSALQQAFQPNYAVLAAESPDDFKGIADFAAEYKA 651

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
            +          +K    +C+NF+C  P T+
Sbjct: 652 VD----------NKTTVYICENFACRQPTTN 672


>gi|336113948|ref|YP_004568715.1| hypothetical protein BCO26_1270 [Bacillus coagulans 2-6]
 gi|335367378|gb|AEH53329.1| protein of unknown function DUF255 [Bacillus coagulans 2-6]
          Length = 629

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 257/698 (36%), Positives = 365/698 (52%), Gaps = 81/698 (11%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+E VA++LN+ FV+IKVDREERPD+D +YM   Q + G GGWPLSVFL+P+  
Sbjct: 1   MERESFENEEVARILNEKFVAIKVDREERPDIDAIYMLVCQMMTGQGGWPLSVFLTPEKV 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP E +YG PGFK +L  +   + +  D +   G     Q+ +AL AS    +
Sbjct: 61  PFYAGTYFPRESRYGMPGFKEVLHYLSQQYTENPDRIKDVGT----QVKQALEASREKGE 116

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
               L +       +   +++D R+GGFG APKFP P  +  +L ++K  E+      A+
Sbjct: 117 -QTALTKETTGRAFQTYKQAFDPRYGGFGKAPKFPMPHSLVFLLMYAKFYENRDALAMAT 175

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           +       TL  +A+GGI+DH+G GF RYSVDE++ VPHFEKMLYD   LA  Y DAF +
Sbjct: 176 K-------TLDGLARGGIYDHIGYGFSRYSVDEKFLVPHFEKMLYDNALLALAYTDAFRM 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           TK+  Y  I  +I+ Y+ RDM  P G  +SAEDADS   EG    +EG FYVWT KEV+D
Sbjct: 229 TKNARYKKITEEIIKYVLRDMAHPDGGFYSAEDADS---EG----EEGKFYVWTPKEVKD 281

Query: 328 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEK 385
           +LGE    LF + Y +   GN            F+GKN+  ++     + A K G     
Sbjct: 282 VLGEQLGTLFCQAYGITGQGN------------FEGKNIPNQITTHLETIAKKEGFSPAA 329

Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
               L   R+ LF  R KR RP  DDK++ +WNGL+I++ A+A ++    +         
Sbjct: 330 LAEKLETARQSLFQHREKRVRPFRDDKILTAWNGLMIAALAKAGRVFYQPS--------- 380

Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
                  Y++ AE A SFIR +L   Q  R+   +R+G  K  GF+D+YAFL+ G ++LY
Sbjct: 381 -------YVQAAEKAVSFIRDNLI--QNGRIMVRYRDGEVKNKGFIDEYAFLLWGYMELY 431

Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
           E      +L  A  L     +LF D  GGG+F +  +D  +L+R KE +DGA PSGNSV+
Sbjct: 432 ESTFAPFYLAEAKRLAGNMIDLFWDEHGGGFFFSGNDDEPLLVRQKESYDGALPSGNSVA 491

Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
              L+RLA +      +   +  +     F   + D   A  +M  A  M +  + K VV
Sbjct: 492 ACQLLRLAKLTGDFTLE---EKVQQMFQAFSKVIHDDPNAHAMMMQAV-MYAQQATKEVV 547

Query: 626 LV---GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR-NNF 681
           +V     + +VDF                + HI      E+ F          +++   F
Sbjct: 548 IVMDDETEKAVDF----------------IRHIQENFHPEISFMAVKRREKKKLSKIAPF 591

Query: 682 SAD------KVVALVCQNFSCSPPVTDPISLENLLLEK 713
             D      +    VC+NFSC+ P  D  +  +LL +K
Sbjct: 592 IEDYAMINGQPTIYVCENFSCNQPTNDFQTARDLLFKK 629


>gi|376259602|ref|YP_005146322.1| thioredoxin domain-containing protein [Clostridium sp. BNL1100]
 gi|373943596|gb|AEY64517.1| thioredoxin domain protein [Clostridium sp. BNL1100]
          Length = 673

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 265/695 (38%), Positives = 364/695 (52%), Gaps = 80/695 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA +LN  F+ IKVDREERPD+D +YM+  QAL G GGWPL+
Sbjct: 54  STCHWCHVMERESFEDEDVAHILNRDFICIKVDREERPDIDSIYMSVCQALTGHGGWPLT 113

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFP ED  G  G  ++L  VK+AWD KRD L +S    IE +S+  
Sbjct: 114 VFLTPDRQPFYAGTYFPKEDSRGFMGLMSLLGSVKEAWDNKRDKLLESAKSIIEHVSQ-- 171

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
                  K+ DE  + ++ +    +    ++DS++GGFG++PKFP P  +  +L    + 
Sbjct: 172 ------EKVSDEAKISKDIIHEAFKHFKYNFDSKYGGFGTSPKFPSPHTLLFLL----RY 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             T K   A E   MV  TL+ M  GGI DH+G GF RYS D++W VPHFEKMLYD   L
Sbjct: 222 WYTEKEPFALE---MVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALL 278

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A  Y +AFS T +  Y    R ILDY++RDM    G  +SAEDADS   EG     EG F
Sbjct: 279 AIAYGEAFSATGNKNYEETARQILDYVQRDMTSQFGAFYSAEDADS---EGV----EGKF 331

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           Y+W+ +E  D+LG       E Y       C L  ++   N F+G N+   +N       
Sbjct: 332 YIWSREEAIDVLGSKD---AEEY-------CRLFDITSSGN-FEGLNIPNLINS------ 374

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
             G   E+  +   +CR+KLF  R KR  P+ DDKV+ SWNGL+ ++ A   +I      
Sbjct: 375 --GTLTEQQKSFAEDCRKKLFSHREKRIHPYKDDKVLTSWNGLMTAAMAYCGRIF----- 427

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                    G DR  Y+E A+    FI + L      RL   +R+G +  P +L+DYAFL
Sbjct: 428 ---------GEDR--YIESAKRCVDFIYKKLI-RTDGRLLARYRDGEAVFPAYLEDYAFL 475

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           + GLL+LYE    T +L  A++L +    LF +    G F    +   ++ R +E +DGA
Sbjct: 476 VWGLLELYEATFTTIYLKRALKLTDAMLNLFGENNSAGLFLYGHDSEQLISRPRESYDGA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ +NL+RLA I    +   Y   A+  +  F  +++        M C+     
Sbjct: 536 IPSGNSVAAMNLLRLARITGHHE---YENRAKAIMDFFSNQVEVAPTGHSYMLCSYMYSV 592

Query: 618 VPSRKHVVLVGH--KSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
                 VV+ G   K  VD  N      A       + +I P  TE   +  ++ + N  
Sbjct: 593 SDVSSEVVIAGANGKELVDTINRKYLPFAV-----AISNISPELTEIAPYVGDYKAQNG- 646

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                    K  A VC+NFSC  P+T+   L  +L
Sbjct: 647 ---------KTAAYVCRNFSCMEPITEAEKLAEVL 672


>gi|440784088|ref|ZP_20961509.1| thioredoxin domain-containing protein [Clostridium pasteurianum DSM
           525]
 gi|440219124|gb|ELP58339.1| thioredoxin domain-containing protein [Clostridium pasteurianum DSM
           525]
          Length = 679

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 252/714 (35%), Positives = 365/714 (51%), Gaps = 69/714 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVM  ESFEDE VA++LN +FV+IKVDREERPD
Sbjct: 32  GEEAFNKADRENKPVFLSVGYSTCHWCHVMNRESFEDEEVAEILNKYFVAIKVDREERPD 91

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +YM+  QA+ G GGWPL++ ++ + KP   GTY P  +KYG+ G   +L KV   W 
Sbjct: 92  IDNIYMSVCQAITGSGGWPLTIIMTAEKKPFFAGTYLPKIEKYGQIGIIELLDKVNTMWI 151

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           +K+D L +S    ++ L           K+ +++   A       L  +YD  FGGF  +
Sbjct: 152 QKKDKLLESSNNIVDFLQN--DTVDKKGKINEDIIDEAYN----SLKNAYDPVFGGFSDS 205

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L + K   D        E  +MV  TL  M  GGI DH+G GF RYSV
Sbjct: 206 PKFPIPHNLSFLLRYYKIKGD-------REALQMVENTLDSMYSGGIFDHIGFGFARYSV 258

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D +W VPHFEKMLYD   LA VY + + +T    Y  I + I DY  RDM    G  +SA
Sbjct: 259 DSKWLVPHFEKMLYDNALLAIVYTETYQITHKNRYKEIVQKIFDYTLRDMTNEDGGFYSA 318

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADS   EG     EG FY+W   E+E+IL E A LF  +Y +K  GN           
Sbjct: 319 EDADS---EGV----EGKFYLWDKSEIENILEEDADLFNSYYNIKSKGN----------- 360

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
            F+G+N+   + +                N +   R KLF+ R KR  PH DDK++ +WN
Sbjct: 361 -FEGRNIPNLIGEDLEELENEETK-----NKINRLREKLFNYREKRVHPHKDDKILTAWN 414

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL+I++ A A K+ K EA                    A+ A+ FI  +L D +  RL  
Sbjct: 415 GLMIAAMAYAGKVFKIEAYKKA----------------AKKASDFILANLIDNRG-RLLC 457

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
            +R+G +   GFLDDYAF + GL++LYE      +L  A++L     + F D E  G+F 
Sbjct: 458 RYRDGETGNVGFLDDYAFFVFGLIELYEATFEVHYLKKAVDLNGEMIKYFWDEENSGFFF 517

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
              +   ++L+ KE +DGA PSGNSV+ +NL+RL+ I    + +   +      ++F  +
Sbjct: 518 YGKDSEELILKTKEIYDGALPSGNSVAAMNLIRLSRITGDVQLE---EKVAEIFSLFSEK 574

Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
           +  + +       A    +VP   H+V+ G K  V+ + ++   +  + L  +V+  D +
Sbjct: 575 INKVPLGYINTISAFLTNTVPDI-HIVIAGDKDDVNTKTLIDEINKRFLLFASVVFNDES 633

Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
           D        E +     +  N    +K  A VC+N +C  PV D     +L+ E
Sbjct: 634 D--------ELSKLIPYIEDNKVVNNKATAYVCKNKACLTPVNDVKEFMDLIEE 679


>gi|94985364|ref|YP_604728.1| hypothetical protein Dgeo_1263 [Deinococcus geothermalis DSM 11300]
 gi|94555645|gb|ABF45559.1| protein of unknown function DUF255 [Deinococcus geothermalis DSM
           11300]
          Length = 678

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 248/697 (35%), Positives = 351/697 (50%), Gaps = 68/697 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED   A+ +N  FV+IKVDREERPDVD VYMT  Q + G GGWP++
Sbjct: 47  STCHWCHVMAHESFEDPSTAEFMNKHFVNIKVDREERPDVDSVYMTATQLMTGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD KP   GTYFPPED+YG PGF+ +L  V  AW + RD L  +     + L+E +
Sbjct: 107 VFLTPDGKPFYAGTYFPPEDRYGMPGFRRLLASVAQAWAQDRDKLTGNA----QTLTEHI 162

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             ++   +   +LP + LR   + L + YD+  GGFGSAPKFP P  +  +L        
Sbjct: 163 REASRPRRGAGDLPTDFLRRGVDNLRRVYDADLGGFGSAPKFPAPTTLDFLLTQ------ 216

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   EG+ M L TL+ M +GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL  
Sbjct: 217 -------PEGRDMALHTLRMMGRGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLTR 269

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
             L A+  T D  ++ + R+ L YL R+M+ P G  FSA+DAD+   EG T       + 
Sbjct: 270 TLLRAWQFTGDPTFTRLARETLAYLEREMLAPQGGFFSAQDADTQGVEGLT-------FT 322

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASK 378
           WT +E+ ++LG           L+  G  +    +DPH  E+  +NVL  L   +  A  
Sbjct: 323 WTPQEIREVLGAGP---DTDLVLRVYGVTEEGNFADPHRPEYGRRNVLHVLTPPAELARD 379

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG   E     L   RRKL   R +RP+P  D KV+ SWNGL +++FA A +IL      
Sbjct: 380 LGESAEALSARLDAARRKLLTAREQRPQPGTDRKVLTSWNGLALAAFADAGRILGE---- 435

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         Y+E+A   A F+R+HL       L+H++++G ++  G L+D+A   
Sbjct: 436 ------------GHYLEIARRNADFVRQHLRLPDGT-LRHTYKDGEARVEGLLEDHALYG 482

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL+ LY+ G     L WA EL       F D E G + +T G   ++L R  +  D A 
Sbjct: 483 LGLVALYQAGGDLAHLAWARELWGIVRRDFWDGEAGLFRSTGGRAETLLTRQAQGFDAAV 542

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
            S N+ + +  + ++      +++   + A  ++  ++  +   A     +  AA  L+ 
Sbjct: 543 LSDNAAAALLGLWISRYFGDEEAE---RLARATVRTYQADMLAAAGGFGGLWQAAAFLAA 599

Query: 619 PSRKHVVLVGHKSS-VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           P +  V L+G  +     E ++A     +        I PA         EH      + 
Sbjct: 600 P-QVEVALIGTPAERAPLERVVARFPLPF------AAIAPA---------EHGEGLPVLE 643

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
                     A VC   +C  P  DP  L   L   P
Sbjct: 644 GRPGGG---TAYVCVGHACDLPTRDPEVLAGQLERLP 677


>gi|293376087|ref|ZP_06622338.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
 gi|292645289|gb|EFF63348.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
          Length = 672

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 249/693 (35%), Positives = 360/693 (51%), Gaps = 73/693 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA  LN+ F+SIKVDREERPD+D VYM+  QAL G GGWPL+
Sbjct: 51  STCHWCHVMEHESFEDEDVATYLNEHFISIKVDREERPDIDTVYMSICQALTGQGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +F++P  +    GTYFP   +YGRPGF  +L+ +   W+  R  +            +  
Sbjct: 111 IFMTPTQQAFYAGTYFPKTSRYGRPGFLDVLKNIDFNWNHHRAKVTDITKQIESHFKDLE 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 + L   + QN +     QL +SYD RFGGFG+APKFP P ++  +L + ++ +D
Sbjct: 171 GIETEGDSLSMAIIQNGVN----QLKQSYDPRFGGFGTAPKFPTPHKLMFLLRYDEQTKD 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                     Q MV  TL  M KGGI DH+G GF RYS DE W VPHFEKMLYD   L  
Sbjct: 227 KSV-------QDMVTQTLDHMYKGGIFDHLGYGFSRYSTDEIWLVPHFEKMLYDNALLMI 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  I     +Y+   +  P G  + AEDADS   EG    +EG FYV
Sbjct: 280 SYTEAYQVTREPRYLSIAMQTAEYVLTQLTSPEGGFYCAEDADS---EG----EEGKFYV 332

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           +T  E+  ILG E    F E Y +   GN            F+GKN+L  L+        
Sbjct: 333 FTPAEIIQILGHEKGHWFNEFYNVTEEGN------------FEGKNILNRLHHKK----- 375

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
               LE  +  L  CR  L   R +R   H DDK++ SWNGL+I++FA+           
Sbjct: 376 ----LELDIKELEACRETLLTYRLERTHLHKDDKILTSWNGLMIAAFAK----------- 420

Query: 439 AMFNFPVVGSDRKE-YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                 + G  +K  Y++ A  A  FI++HL+DE   RL   +R G S    +LDDYAFL
Sbjct: 421 ------LYGQTQKMIYLDAASKAVIFIKQHLFDET--RLLARYREGESHFKAYLDDYAFL 472

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             GL++L++  +  ++L  AI+L     +LF D E GG++ T  +  +++LR KE +DGA
Sbjct: 473 SYGLIELHQSTAEVEYLELAIQLNKEMLDLFKD-EAGGFYLTGHDAETLMLRPKELYDGA 531

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+  NL+RLA +   +    +   AE  +     ++K   M       AA    
Sbjct: 532 MPSGNSVAAYNLIRLAKLTGDT---LFETEAEKQIQYLAKQVKHYEMNHTFYLIAALFAL 588

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
             +++ ++ V  +  +  + +L   + +   N T++   P +  ++       S  A   
Sbjct: 589 SDTKELMITVTKQEQI--KEILKQLNETPHFNTTLLFKTPENQTQL-------SKLAPYT 639

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           ++    D+    +C N +C  P +   SL+N+L
Sbjct: 640 KDYPIVDQPTYYLCSNGTCQAPTSSLESLKNIL 672


>gi|220931972|ref|YP_002508880.1| putative glutamate--cysteine ligase/putative amino acid ligase
           [Halothermothrix orenii H 168]
 gi|219993282|gb|ACL69885.1| putative glutamate--cysteine ligase/putative amino acid ligase
           [Halothermothrix orenii H 168]
          Length = 691

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 252/691 (36%), Positives = 357/691 (51%), Gaps = 75/691 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESF+DE VA+LLN+ F+SIKVDREERPD+D VYM   QAL G GGWPL++
Sbjct: 57  TCHWCHVMERESFKDEEVARLLNENFISIKVDREERPDIDAVYMNVCQALTGSGGWPLTI 116

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            L+PD KP  GGTY P   + GR G   +L +V + W K  + + ++       +  +++
Sbjct: 117 LLTPDKKPFFGGTYIPKNSRGGRMGLIDLLSRVTELWSKNNEKIIKNADKITSSIQRSMT 176

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             +        L +N L    + L   +D  +GGFG+APKFP P ++  +L++  +    
Sbjct: 177 DDSYKGHKETSLGKNTLEKAFDDLKVVFDVEYGGFGTAPKFPIPHQLIFLLHYWYR---- 232

Query: 201 GKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                   G  M L+    TL  M  GGI DH+G GFHRYS D +W +PHFEKMLYDQ  
Sbjct: 233 -------TGNDMALYMVEKTLTAMRCGGIFDHIGYGFHRYSTDRKWILPHFEKMLYDQAL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+  T++  +    ++I+DY+RR++    G  +SA+D   AE+EG     EG 
Sbjct: 286 LTYSYSEAYLATENKKFLTTIKEIIDYVRRELKSDRGGFYSAQD---AESEGV----EGK 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           +Y W+ KE+E+ILG+ A  F E Y LK  GN     + +   +  GKNVL   N      
Sbjct: 339 YYTWSVKEIENILGKQADRFIETYSLKSDGNF----IDEATGKKTGKNVLYLRNYKEEVE 394

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
                          + R KLF VR +R  P  DDK++  WNGL+I+  ARA +      
Sbjct: 395 ELK------------KEREKLFKVRQRRRPPFKDDKILTDWNGLMIAGLARAGQ------ 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                      +   EY+ +A  AA FI  +LY    +RL H FR G     G L+DYAF
Sbjct: 437 ----------ATGEIEYITMAREAADFIINNLYSSD-NRLYHRFRKGEVSIKGNLNDYAF 485

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
            I GLL+LY+     K+L  A++L + Q   F D + GG++ T  ++  +L+R KE +DG
Sbjct: 486 FIWGLLELYQDTFEVKYLKKALKLIDQQLNYFWDNKNGGFYFTPDDEEEILVRQKEIYDG 545

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSVS+ NL R+  +   S    Y + AE+ L VF  ++K+   +  +     + L
Sbjct: 546 ATPSGNSVSIWNLYRIGHLTGNSD---YEEIAENILRVFSDKIKNDPASYSMALIGLNSL 602

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI----HIDPADTEEMDFWEE-HNS 671
             P    VV+VG K+      +L +    Y  N   +    H     TE   F E  H  
Sbjct: 603 LGPGYD-VVVVGDKNKAKTHKILYSLKNEYIPNVNTLFKPAHNGKILTELGPFIENYHMI 661

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           NN                VC+++SC  P  +
Sbjct: 662 NNLP-----------TIYVCKDYSCRRPTNN 681


>gi|398309078|ref|ZP_10512552.1| hypothetical protein BmojR_06022 [Bacillus mojavensis RO-H-1]
          Length = 689

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 247/687 (35%), Positives = 358/687 (52%), Gaps = 67/687 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIASLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   KY RPGF  +L  + + +   R+ +      A   L    
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKYNRPGFVDVLEHLSETFANDREHVEDIAENAANHLQTKT 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A  S       L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +    
Sbjct: 173 AAKTSEG-----LSESAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYYHT 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKDICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           W+ +E+   LGE    L+   Y +   GN            F+GKN+  LI        A
Sbjct: 334 WSKEEILKTLGEDLGTLYCSVYDITEKGN------------FEGKNIPNLIHTKREQIKA 381

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G+  E+    L + R KL   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +   
Sbjct: 382 DG-GLTEEELSRKLEDARLKLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVFQ--- 437

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                          +Y+ +AE A +FI  ++  +   R+   +R+G  K  GF+DDYAF
Sbjct: 438 -------------EPQYLSLAEDAITFIENNVIIDG--RVMVRYRDGEVKNKGFIDDYAF 482

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+   LDLYE      +L  A +L     +LF D E GG++ T  +  ++++R KE +DG
Sbjct: 483 LLWAYLDLYEASFDLSYLEKAKKLSEDMIDLFWDEEHGGFYFTGHDAEALIVREKEVYDG 542

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV+ + L+RL   V G  S    + AE   +VF+  ++           +    
Sbjct: 543 AVPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPEIEAYPSGHSFFMQSVLKH 599

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             P +K +V+ G     D + + +A   ++  N +++  +  D            + A  
Sbjct: 600 MTP-KKEIVIFGRPDDPDRKQITSALQQAFIPNDSILVAEHPD---------QCKDIAPF 649

Query: 677 ARN-NFSADKVVALVCQNFSCSPPVTD 702
           A +     D+    +C+NF+C  P TD
Sbjct: 650 AADYRIIDDQTTVYICENFACQQPTTD 676


>gi|440631885|gb|ELR01804.1| hypothetical protein GMDG_00904 [Geomyces destructans 20631-21]
          Length = 918

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 238/603 (39%), Positives = 340/603 (56%), Gaps = 39/603 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE++ VA +LN  F+ IK+DREERPD+D++YM +VQA  G GGWPL+
Sbjct: 96  SACHWCHVMEKESFENDEVAAILNKDFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLN 155

Query: 80  VFLSPDLKPLMGGTYF-------PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI 132
           VF++P L+P+ GGTY+       P  +      F  IL K+  AW ++        A  +
Sbjct: 156 VFVTPTLEPVFGGTYWHGPHSNTPQLELEDHVDFLRILGKLSQAWREQESRCRLDSAQIL 215

Query: 133 EQLSEALSASASSNKLPD---ELPQNALRL-----CAEQLSKSYDSRFGGFGSAPKFPRP 184
           +QL +  +A  +    P    E P   L L       + L  ++D+   GF +APKFP P
Sbjct: 216 QQL-KVFAAEGTLGGAPKTGAEPPAGGLDLDIIDEAYQHLVSTFDTTNSGFSAAPKFPTP 274

Query: 185 VEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
            ++  +L   +  + + D   + E    Q M L TL+ MA+GGIHDH+G GF RYSV   
Sbjct: 275 SKLAFLLRLPHFPQPVLDVVGAEEVKSAQFMALSTLRAMARGGIHDHIGHGFSRYSVTAD 334

Query: 242 WHVPHFEKMLYDQGQLANVYLDAF-SLTK-DVFYSYICRDILDYLRRDMIG-PGGEIFSA 298
           W +PHFEKMLYD  QL ++YLDAF  L K D     +  D+  YL    I  PGG  +S+
Sbjct: 335 WSLPHFEKMLYDNAQLLSLYLDAFLGLPKPDPELLGVVYDLAAYLLSPPIAAPGGGFYSS 394

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPH 357
           +DADS   +G    +EGA+YVWT++E+E +L   A  +    + + P GN   S   D H
Sbjct: 395 QDADSFYRKGDKETREGAYYVWTARELETLLPAGAYDIVAAFFGVNPDGNVAPSH--DVH 452

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVS 416
           +EF  +NVL   +  S  AS+ G+   + +  +   +R L   R ++R  P+LDDK++ +
Sbjct: 453 DEFINQNVLRIASTPSQLASQFGIAESEVVETIKSAKRTLLAHREAERVVPNLDDKIVCA 512

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNG+ I + AR    L+ E ++ M       S+R   ++ A  AA F+RR +YDE    L
Sbjct: 513 WNGIAIGALARTGASLR-EVDAQM-------SER--CLDAAIRAARFMRREMYDEDAKTL 562

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
           +  +R GP +  GF DDYAFL+ GLL+LYE     +W+ WA ELQ TQ+  FLD    G+
Sbjct: 563 RRVWRGGPGETAGFADDYAFLVEGLLELYEATFADEWVRWADELQATQNSHFLDPTASGF 622

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F T    P  +LR+K+  D +EPS N VS  NL RLAS++     D Y   A+ ++  FE
Sbjct: 623 FATAAAAPHTILRLKDGMDASEPSTNGVSASNLFRLASLLG---DDKYEALAKETVGAFE 679

Query: 597 TRL 599
             +
Sbjct: 680 AEI 682


>gi|298675032|ref|YP_003726782.1| hypothetical protein Metev_1104 [Methanohalobium evestigatum
           Z-7303]
 gi|298288020|gb|ADI73986.1| protein of unknown function DUF255 [Methanohalobium evestigatum
           Z-7303]
          Length = 728

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 250/709 (35%), Positives = 368/709 (51%), Gaps = 77/709 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED  +A++LND FV IKVDREERPD+D  YM   QAL G GGWPL++
Sbjct: 59  TCHWCHVMENESFEDPEIAQILNDNFVCIKVDREERPDIDSTYMDVCQALTGRGGWPLTI 118

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK-KRDMLAQSGAFAIEQLSEAL 139
            ++P+ KP    TY P E ++G  G   +L ++ D W K KR++++++     EQ++ ++
Sbjct: 119 IMTPEKKPFSAATYLPKESRFGLTGLIDLLPRISDMWSKQKRELVSRA-----EQITSSV 173

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
               + +    EL    L    E L ++YD  +GGFG+APKFP P  +  ++ + ++  +
Sbjct: 174 EEVFTKSPKTRELSNQELDSAYESLLENYDPEYGGFGNAPKFPSPHNLMFLMRYWERTSN 233

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                  ++  +MV  TL+ M  GGI+DH+G GFHRYS D  W +PHFEKMLYDQ  L+ 
Sbjct: 234 -------NKALEMVEKTLKNMRIGGIYDHIGFGFHRYSTDRYWMIPHFEKMLYDQALLSM 286

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y++ +  T  + Y    RD+  Y  RD+    G  +SA DADS   EG     EG FY 
Sbjct: 287 AYIEVYQATGKIEYKNTARDVFTYALRDLTSKEGGFYSAVDADS---EGV----EGKFYT 339

Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIE-------- 368
           WT  E+  IL +  A +    + +K  GN    +  +      GKN+  LIE        
Sbjct: 340 WTYDEIHKILSKSEANIVTNLFNIKKEGNFRDEKTGN----LTGKNIPHLIETPLYIDVE 395

Query: 369 -----------LNDSSASASKLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVI 414
                      LN++          L K +     L   RRKLF+ R  R  P  DDK++
Sbjct: 396 PDEELDEFHEKLNEAREKRGAWKRNLLKTIYSQRRLEVARRKLFEARENRVHPAKDDKIL 455

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
             WNGL+I++ ++ +++                 + KEY   A  AA FI +++ D  + 
Sbjct: 456 TDWNGLMIAALSKGAQVF----------------NDKEYANSARKAADFIIKNMSD-SSG 498

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
           +L H +R+G S   GF+DDYAFL  GL++LYE     K+L  A+E  N     F D   G
Sbjct: 499 QLMHRYRDGDSDIHGFIDDYAFLTWGLIELYETTFEVKYLEKALEFNNYLINHFWDDNNG 558

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           G++ T     + ++R KE +DGA PSGNSV+++NL+RL  +    + +   + A  S+  
Sbjct: 559 GFYFTPDNAETPIVRKKEIYDGASPSGNSVALMNLMRLGRMTGNPELE---KKASDSIKS 615

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           F   L    +A      A D +  PS + VV+ G   S D +NM+ +    + + + V+ 
Sbjct: 616 FSKSLSRNPIASTHSMQALDFVQGPSSE-VVITGDFQSEDTQNMINSLRTEF-IPRKVVL 673

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
             P   +  D       N A   R+  S + K  A +CQN+SCS P TD
Sbjct: 674 FKPDKVQSPDI-----VNIAGFTRDMDSQEGKATAYICQNYSCSSPKTD 717


>gi|325958772|ref|YP_004290238.1| hypothetical protein Metbo_1019 [Methanobacterium sp. AL-21]
 gi|325330204|gb|ADZ09266.1| hypothetical protein Metbo_1019 [Methanobacterium sp. AL-21]
          Length = 702

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 271/717 (37%), Positives = 365/717 (50%), Gaps = 61/717 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVM  ESFED  VA+LLN+ FV++KVDREERPD
Sbjct: 38  GDEAFEKAKKLDKPIFLSIGYSTCHWCHVMAHESFEDLEVAELLNNNFVAVKVDREERPD 97

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD VYM   Q + G GGWPL++ ++ D KP   GTYFP E  +G  G K +L  V D W 
Sbjct: 98  VDSVYMAACQIMTGTGGWPLTIIMTHDKKPFFAGTYFPKESSFGNIGLKDLLLNVMDIWR 157

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +R     SG    +Q+  AL    S N    +L    L    +QLSK +D   GGFG  
Sbjct: 158 DERKNALDSG----DQIFRALK-EMSVNTKGKQLDSTILEKTYDQLSKVFDVENGGFGDF 212

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
            KFP P  +  +L + K+   TG     +    MVL TL  MA GGI+DHVG GFHRYSV
Sbjct: 213 QKFPTPHSLMFLLRYWKR---TGNKHSLN----MVLKTLDEMAMGGIYDHVGFGFHRYSV 265

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+ W VPHFEKMLYDQ  +A +Y + +S T    Y    + I +Y+ RDM    G  +SA
Sbjct: 266 DKNWLVPHFEKMLYDQALIAMLYTEVYSATGKFEYKKTAQQIYEYVLRDMTDVEGGFYSA 325

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG     EG FY WT +E+  IL  + A L  E + +K  GN      +D +
Sbjct: 326 EDADS---EGV----EGKFYYWTYEELYSILDKDSADLITEVFNVKKDGN-----FNDGY 373

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
           +     N+L +  D    A   G+ +     ++ +   +LF VR KR  PH DDK++  W
Sbjct: 374 SNESINNILHKKRDYKKIAENKGLNISDLEELVDDILSELFLVREKRVHPHKDDKILTDW 433

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+I+S +RA ++ + E                +Y++ AE+  +FI    Y  Q +RL 
Sbjct: 434 NGLMIASLSRAFQVFEEE----------------KYVKAAENCVNFIMNKSY--QQNRLM 475

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           H FR+G S   G LDDY F+I GLL++Y       +L  A++L  T  E F D E GG++
Sbjct: 476 HMFRDGESAVYGNLDDYTFMIWGLLEIYMATFNVDYLEKAMDLNQTVVEHFWDEENGGFY 535

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFE 596
            T  ++  VL+R K+  D A PSGNSV  +NL+RL S      +D+ + +    L  VF 
Sbjct: 536 FTADDEEKVLIREKKTFDSAIPSGNSVEFLNLLRLGSFT----NDHNQMDTARKLETVFS 591

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
             +K             D    PS   VV+VG   S D   ML      Y  N T+I  D
Sbjct: 592 ETVKRSPTGHTQFISGVDFALGPSYS-VVIVGDGDSEDTIEMLRLRQL-YIPNTTIILKD 649

Query: 657 PADTEEMDFW-EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
                    W ++ NS +  + + +    K  A VC   SC  P      +  LL E
Sbjct: 650 SK-------WSDKTNSISEDIDKKSMINGKATAHVCSTGSCKLPTNKKSEMLKLLNE 699


>gi|421839588|ref|ZP_16273125.1| hypothetical protein CFSAN001627_27670 [Clostridium botulinum
           CFSAN001627]
 gi|409733965|gb|EKN35825.1| hypothetical protein CFSAN001627_27670 [Clostridium botulinum
           CFSAN001627]
          Length = 680

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 245/684 (35%), Positives = 353/684 (51%), Gaps = 70/684 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 53  TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 170 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +   +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           +WT +E+ DILGE      E Y       C +  ++   N F+ KN+   +N        
Sbjct: 332 LWTKEEIMDILGEEE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDN 380

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
               LEK        R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++   
Sbjct: 381 NKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--- 430

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +
Sbjct: 431 -------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFL 476

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
             L++LYE      +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA 
Sbjct: 477 WALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGAT 536

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN+V+ + L  L  I      D Y+   +     F T +K   M   L    A M ++
Sbjct: 537 PSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNI 592

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
              K + L  +K   DF   +   +  Y     V   D ++        E    N ++  
Sbjct: 593 SPVKEITLAYNKKDEDFYKFINEVNNRYIPFSIVTLNDKSN--------EIEKINKNIKD 644

Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
                DK    +CQN++C  P+TD
Sbjct: 645 KIAIKDKATVYICQNYACREPITD 668


>gi|226948333|ref|YP_002803424.1| hypothetical protein CLM_1215 [Clostridium botulinum A2 str. Kyoto]
 gi|226841180|gb|ACO83846.1| conserved hypothetical protein [Clostridium botulinum A2 str.
           Kyoto]
          Length = 680

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 245/684 (35%), Positives = 354/684 (51%), Gaps = 70/684 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 53  TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +   A+ L  ++D+++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 170 --FQDNHREGELEEYIIEEAAKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +   +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           +WT +E+ DILGE      E Y       C +  ++   N F+ KN+   +N        
Sbjct: 332 LWTKEEIMDILGEEE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDN 380

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
               LEK        R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++   
Sbjct: 381 NKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--- 430

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +
Sbjct: 431 -------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFL 476

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
             L++LYE      +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA 
Sbjct: 477 WALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGAT 536

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN+V+ + L  L  I      D Y+   +     F T +K   M   L    A M ++
Sbjct: 537 PSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNI 592

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
              K + L  ++   DF   +   +  Y     V   D ++        E    N ++  
Sbjct: 593 SPVKEITLAYNEKDEDFYKFINELNNRYIPFSIVTLNDKSN--------EIEKINKNIKD 644

Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
                DK    +CQN++C  P+TD
Sbjct: 645 KIAIKDKATVYICQNYACREPITD 668


>gi|423720021|ref|ZP_17694203.1| thioredoxin domain protein [Geobacillus thermoglucosidans
           TNO-09.020]
 gi|383366783|gb|EID44068.1| thioredoxin domain protein [Geobacillus thermoglucosidans
           TNO-09.020]
          Length = 637

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 257/694 (37%), Positives = 369/694 (53%), Gaps = 77/694 (11%)

Query: 18  LINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 77
           LI+TCHWCHVM  ESFEDE VAK+LN+ +VSIKVDREERPD+D VYM   Q + G GGWP
Sbjct: 2   LISTCHWCHVMAHESFEDEEVAKILNEKYVSIKVDREERPDIDSVYMRVCQMMTGQGGWP 61

Query: 78  LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           LSVFL+P+ KP   GTYFP + +YGRPGF  +L ++ D + +  D +        EQ++E
Sbjct: 62  LSVFLTPEGKPFYAGTYFPKQSRYGRPGFIELLTRLYDKYKENPDEIVHVA----EQVTE 117

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP-VEIQMMLYHSKK 196
           AL  SA ++   + LP  A+     QL   +D+ +GGFG APKFP P + + +M Y+  K
Sbjct: 118 ALRQSARASG-TERLPFAAIEKAYRQLLNGFDAVYGGFGGAPKFPIPHMLMFLMRYYQWK 176

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            +D            MV  TL  MA GGI+DH+G GF RYS D  W VPHFEKMLYD   
Sbjct: 177 RDD--------RALLMVEKTLNGMANGGIYDHIGYGFARYSTDAMWLVPHFEKMLYDNAL 228

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+ LTK   Y  I   I+++++R+M    G  +SA DADS   EG     EG 
Sbjct: 229 LVIAYTEAYQLTKKERYKEIAEQIIEFVKREMTSQDGAFYSAVDADS---EGV----EGK 281

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSA 374
           +YVWT  EV ++LG       E Y       C +  ++D  N F GKNV  LI       
Sbjct: 282 YYVWTPDEVVNVLGAE---LGELY-------CRVYDITDEGN-FAGKNVPNLIHARMERL 330

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A +  +  E+    L E R++L   RS R RPH+DDK++ +WN L+I++ A+A+K+   
Sbjct: 331 -ARRYRLTEEELRERLEEARKQLLAERSSRVRPHVDDKILTAWNALMIAALAKAAKVY-- 387

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                         +R++Y+++A+ A SFI  HL+  Q  RL   +R G  K  G +DDY
Sbjct: 388 --------------ERRDYLQMAKQALSFIETHLW--QNGRLMVRYRGGEVKHLGIIDDY 431

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A+L+   +++YE      +L  A         LF D + G +F T  +  ++++R KE +
Sbjct: 432 AYLVWAYVEMYEATLDLAYLQKAKTCAERMISLFWDEKHGAFFMTGNDAEALIIREKEIY 491

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DGA PSGNSV+ + ++RLA +          + AE    VF  +++              
Sbjct: 492 DGALPSGNSVAAVQMIRLARLTGDLA---LLEKAETMYKVFRRQVEAYESGHTFFLQGLL 548

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           ++  P+ + VVL G +     E  +     ++  N  ++              EH ++ A
Sbjct: 549 LIETPAAE-VVLFGKQGDEKREQFILKWQHAFAPNVFLLV------------AEHPADVA 595

Query: 675 SMARNNFSA------DKVVALVCQNFSCSPPVTD 702
            +A   F+A      D+    VC+NF+C  P TD
Sbjct: 596 GIA--PFAAEYEPLGDETTVYVCENFACQQPTTD 627


>gi|15896782|ref|NP_350131.1| hypothetical protein CA_C3546 [Clostridium acetobutylicum ATCC 824]
 gi|337738753|ref|YP_004638200.1| hypothetical protein SMB_G3587 [Clostridium acetobutylicum DSM
           1731]
 gi|384460264|ref|YP_005672684.1| hypothetical protein CEA_G3552 [Clostridium acetobutylicum EA 2018]
 gi|15026641|gb|AAK81471.1|AE007851_2 Highly conserved protein containing a domain related to cellulase
           catalitic domain and a thioredoxin domain [Clostridium
           acetobutylicum ATCC 824]
 gi|325510953|gb|ADZ22589.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018]
 gi|336292984|gb|AEI34118.1| hypothetical protein SMB_G3587 [Clostridium acetobutylicum DSM
           1731]
          Length = 677

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 246/694 (35%), Positives = 358/694 (51%), Gaps = 76/694 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED+ VA++LN  FVSIKVDREERPD+D++YM    A+ G GGWPL++
Sbjct: 56  TCHWCHVMERESFEDDDVAEVLNRSFVSIKVDREERPDIDEIYMNVCTAITGSGGWPLTI 115

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++P+ KP   GTY P  ++ G  G  ++L  ++  W + ++ L + G   +  L++   
Sbjct: 116 VMTPEQKPFFAGTYIPKNNRMGMQGLISLLENIEYQWKENQNELVEIGDKIVSSLNKDRK 175

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            +A       EL +  L     Q   ++D  +GGFGS PKFP P  +  ++ +    +D 
Sbjct: 176 TTAK------ELSEEVLEEAFSQFKYNFDRTYGGFGSEPKFPTPHNLIFLMRYFYASKD- 228

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                      M L TL  M +GGI+DH+G GF RYSVD++W VPHFEKMLYD   LA  
Sbjct: 229 ------KTSLNMALKTLDTMYRGGIYDHIGYGFSRYSVDKKWLVPHFEKMLYDNALLAYA 282

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +AF +TK+  Y  I   I  Y+ RDM    G  + AEDADS   EG     EG FYVW
Sbjct: 283 YTEAFKITKNDNYKNIVDQIFTYILRDMTSNEGGFYCAEDADS---EGV----EGKFYVW 335

Query: 321 TSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           + KE+ ++LGE     F +++ +  TGN            F+G+N+L     +     K+
Sbjct: 336 SKKEINNVLGEDDGKKFSKYFNVTDTGN------------FEGENIL-----NLIETEKI 378

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
               E     L  CR+KLFD R KR  P+ DDK++ SWNGL+I++ A   + LK+E    
Sbjct: 379 EFEDE----FLNSCRKKLFDYREKRIHPYKDDKILTSWNGLMIAALAFGGRSLKNEI--- 431

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                        Y+  AE A +FI   L D    RL   +R+G +   G+L DY+FLI 
Sbjct: 432 -------------YINAAEKAVTFIFTKLID-ANGRLLSRYRHGEASIKGYLTDYSFLIW 477

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GL++LYE    ++++  AI+L N   + F D +  G F    +   ++ R KE +DGA P
Sbjct: 478 GLIELYEATYKSEYIEKAIKLNNDLIKYFWDDKNKGLFLYGSDSEELISRPKEIYDGAIP 537

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSVS +N +RL+ +      +         L  F   ++   M       +   L   
Sbjct: 538 SGNSVSALNFIRLSRLTGSYDLE---DKCTEILQAFSEEIESYPMGYSFSLLSVLFLGKK 594

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYD-LNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           S K + LV +      +  L   +  Y+ L+  + +I+   T E          N S   
Sbjct: 595 S-KEITLVSNSYDNTSKEFLEVINDKYNPLSTFIYYIEGDKTLE----------NVSNFV 643

Query: 679 NNFSA--DKVVALVCQNFSCSPPVTDPISLENLL 710
           +++    DK    +C+NFSC+ PVT+   L+ LL
Sbjct: 644 SDYQPLNDKPTVYICENFSCNAPVTNISDLKKLL 677


>gi|345560346|gb|EGX43471.1| hypothetical protein AOL_s00215g207 [Arthrobotrys oligospora ATCC
           24927]
          Length = 758

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 257/698 (36%), Positives = 372/698 (53%), Gaps = 43/698 (6%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESF+D  VAK+LND F+ IK+DREERPD+D++YM YVQA  G GGWPL+VF
Sbjct: 68  CHWCHVMERESFQDAYVAKILNDNFIPIKIDREERPDIDRIYMNYVQATTGSGGWPLNVF 127

Query: 82  LSPDLKPLMGGTYFPPEDKYGRP------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
           L+P+L+P+ GGTY+P  +    P      GF  +L K+   W +++D    S    ++QL
Sbjct: 128 LTPNLEPVFGGTYWPGPNATDGPSMKDQIGFVEVLDKIVKVWKEQQDKCLASAKDILKQL 187

Query: 136 S----EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
                E L     +    + L  + L    +     YD+  GGFG+ PKFP P  +  +L
Sbjct: 188 KEFSDEGLKEQGGNQDGAEILEIDLLEEAYQHFLSRYDTTHGGFGTEPKFPTPTNLAFLL 247

Query: 192 YHSKKLEDTGKSGEASEGQK---MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
             S             E ++   M + TL+ M++GGIHDH+G GF RYSV   W +PHFE
Sbjct: 248 RLSSLSSVVEDVVGDVECERAKFMAVTTLRHMSRGGIHDHIGNGFERYSVTADWSLPHFE 307

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP----GGEIFSAEDADSA 304
           KMLYD  QL +VYLDA+ LTKD        D  DYL     GP     G  +SAEDADS 
Sbjct: 308 KMLYDNAQLISVYLDAYLLTKDREMLDAALDAADYL---CSGPLSHKDGGFYSAEDADSY 364

Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
             +G T K+EGAFYVW  KE   +LGE  A +  +++ ++  GN D +R  D H+EF  +
Sbjct: 365 ARKGDTEKREGAFYVWDKKEFIKVLGEQDAEVCSKYWGVRTDGNVDPAR--DIHDEFLHQ 422

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH-LDDKVIVSWNGLVI 422
           NVL      +   S LG+     +  +   R KL + R +      LDDK++  WNGL I
Sbjct: 423 NVLQISQTPAQIGSMLGLSETAIVEKIKNGRAKLREYRERERPRPILDDKILTGWNGLAI 482

Query: 423 SSFARASKILK-SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
           ++ +R +  L+  +AE + F           Y+  A  AA FIR++++D++T  L+  +R
Sbjct: 483 AALSRLAAALEIVDAEKSKF-----------YLNQAIRAAEFIRKNVFDQRTLGLKRVWR 531

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
             P     F DDYA+LI GL+ LYE      WL WA  LQ  Q +LF D   GG+F+T  
Sbjct: 532 ETPGATKAFADDYAYLIYGLISLYEATFDAGWLRWAHSLQAAQTKLFWDEAQGGFFSTER 591

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
           + P ++LR+K+  D AEPS N +S  NL +L S++  +   +    A  +   F T L  
Sbjct: 592 DAPDLILRLKDGLDSAEPSTNGISAANLYKLGSLLGDASFSFL---ASKTCNAFSTELMQ 648

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD-T 660
                  M  +   L++ +   V++ G KS        A        N ++I +DP + +
Sbjct: 649 HPFLFSTMLPSVVALNLGTGT-VIIAGKKSDPTISAYRAKLRTQLFTNTSIIVVDPTEKS 707

Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSP 698
           +++ ++   N     + ++  +A K +  VCQN +C P
Sbjct: 708 DDITWFTGKNEILKDILKS--AATKPIVQVCQNQTCVP 743


>gi|168182912|ref|ZP_02617576.1| dTMP kinase [Clostridium botulinum Bf]
 gi|182673930|gb|EDT85891.1| dTMP kinase [Clostridium botulinum Bf]
          Length = 682

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 244/693 (35%), Positives = 357/693 (51%), Gaps = 72/693 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 55  TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 115 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 171

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 172 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTKPKFPTAHYILFLLRYYYFKK-- 227

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                   ++   ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 228 -------DNKVLDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 281 MTYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 333

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +WT +E+ DILG E   L+ + Y +   GN            F+ KN+   +N       
Sbjct: 334 LWTKEEIMDILGEEEGELYCKIYNITSKGN------------FENKNIANLINTDLKIVD 381

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                LEK        R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++  
Sbjct: 382 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 432

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF 
Sbjct: 433 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 477

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  L++LYE      +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA
Sbjct: 478 LWALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGA 537

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V+ + L  L  I      D Y+   +     F   +K   M   L    A M +
Sbjct: 538 TPSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFAANIKSGPM-YHLFSVMAYMYN 593

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           V   K + L   +   DF   +   +  Y     +I  D ++        E    N ++ 
Sbjct: 594 VLPIKEITLTYREKDEDFYKFINEVNNRYIPFSIIILNDKSN--------EIEKINKNIK 645

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 DK    +CQN++C  P+TD    +++L
Sbjct: 646 DKIAIKDKTTVYICQNYACREPITDLEEFKSVL 678


>gi|116749973|ref|YP_846660.1| hypothetical protein Sfum_2547 [Syntrophobacter fumaroxidans MPOB]
 gi|116699037|gb|ABK18225.1| protein of unknown function DUF255 [Syntrophobacter fumaroxidans
           MPOB]
          Length = 684

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 249/685 (36%), Positives = 361/685 (52%), Gaps = 61/685 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA LLN+  V++KVDREERPD+D++YMT  QAL G GGWPLSV
Sbjct: 49  TCHWCHVMERESFEDEEVAALLNEHVVAVKVDREERPDIDQIYMTVCQALLGSGGWPLSV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++P+      G+YFP   + G  GF  ++R++   W   R+ L ++G    E +     
Sbjct: 109 FMTPEKNAFFAGSYFPKHARLGMAGFTDVIRRIVHMWKNDRERLLEAGRQITESIQPRPV 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            +  S   P+ L +   R     LS+++D+ +GGFGS PKFP P  +  +L   ++    
Sbjct: 169 QTVGSLPGPEVLEEAYSR-----LSRAFDATWGGFGSKPKFPTPHHLTFLLRWHRR---- 219

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                 S+   +V  TL  M  GGI D VG GFHRYSVDE+W VPHFEKMLYDQ  LA  
Sbjct: 220 ---NPWSDALAIVEKTLDGMRDGGIFDQVGFGFHRYSVDEKWLVPHFEKMLYDQAMLALA 276

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL+AF +T    +  + R+I +Y+ RDM  P G  +SAEDADS   EG     EG FYVW
Sbjct: 277 YLEAFQVTGRERHGRVAREIFEYVLRDMTDPDGGFYSAEDADS---EGV----EGRFYVW 329

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           T  EV  +LG E    F   + + P GN +  R S PH        L EL DS +   + 
Sbjct: 330 TPAEVNALLGNEIGETFCRFFDITPEGNFEDGR-SIPH--------LAELADSLSDRDEP 380

Query: 380 GM-PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           G+  LE   ++L + RR LF+ R  R  P  DDK++ SWNGL+I++ ++ S+ L      
Sbjct: 381 GIGGLE---DLLEKGRRLLFEARRMRVHPLKDDKILTSWNGLMIAALSKGSRALGD---- 433

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                       + Y   A  AA FI   +    + RL   +R G +    + DDYAF I
Sbjct: 434 ------------RSYALAASRAADFILDRM-RRDSGRLHRRYRKGEAAIHAYADDYAFFI 480

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++LYE     ++L  A++LQ+   +LF D   GG+F T  +  ++++R +E +DGA 
Sbjct: 481 WGLIELYEAAFDVRYLEEAVKLQDLMIDLFWDDAEGGFFFTPNDGENLIVREREIYDGAV 540

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PS NS + +NL+RL  +V   +   + + A+  L  F   ++D   A      A D  + 
Sbjct: 541 PSSNSAAALNLLRLGRMVGAVR---FEEKADRLLRRFSETVRDYPSAYTQFLHAVDFAAG 597

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV-IHIDPADTEEMDFWEEHNSNNASMA 677
           P+R+ VV+ G   +     M+    + +  N  V +   P     +     + +   +  
Sbjct: 598 PTRE-VVIAGSPDNATTAEMMKIVGSGFVPNTVVLLRGTPESGARLAELAPYTAGLVAPG 656

Query: 678 RNNFSADKVVALVCQNFSCSPPVTD 702
            N          +C+ F+C+ P+T+
Sbjct: 657 GNP------AVYICEKFACTSPITE 675


>gi|347733897|ref|ZP_08866951.1| hypothetical protein DA2_3260 [Desulfovibrio sp. A2]
 gi|347517453|gb|EGY24644.1| hypothetical protein DA2_3260 [Desulfovibrio sp. A2]
          Length = 781

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 258/734 (35%), Positives = 370/734 (50%), Gaps = 85/734 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED+ VA+LLND FV +KVDREERPD+D  YM   Q L G GGWPL+
Sbjct: 83  STCHWCHVMAHESFEDDEVARLLNDAFVCVKVDREERPDIDAAYMAACQMLTGTGGWPLT 142

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---S 136
           +   PD +P    TY P   + GR G   ++ +V   W  KR  +  S    +E +   +
Sbjct: 143 IIALPDGRPFFAATYLPKHSRPGRIGLMDLVPRVLAVWRDKRGEVLDSAESIVEHVRRHA 202

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           EA+    +  +LP       L    E ++  +D+  GGFGSAPKFP P  +  +L  +++
Sbjct: 203 EAMLRPPADGRLPG---AGTLHAACEAMASEFDAANGGFGSAPKFPSPHNLLFLLRWARR 259

Query: 197 --------------LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
                            T      ++  +M   TL+ + +GGIHDHVG GFHRYS D RW
Sbjct: 260 NGYGAGSGASGAAAPGATQDEPGGAKALRMAAQTLRAIRRGGIHDHVGYGFHRYSTDARW 319

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
            +PHFEKMLYDQ  L   Y +A+  T D  +     +   Y+ RD+    G  +SAEDAD
Sbjct: 320 LLPHFEKMLYDQAMLMLAYAEAWLATGDGEFRRTAEETAAYVLRDLTSSEGAFYSAEDAD 379

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILG-------------------EHAILFKEHYYLK 343
           S E +G   + EG FY +T  ++E                         A L    +   
Sbjct: 380 S-ELDGV--RGEGLFYTFTLADLEAACAPLDVGSGGDGGAEAGEGAISDADLAARAFGCT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN +     +      G+NVL       A A +LG+P  +    L   R  LFD+R+ 
Sbjct: 437 AYGNYE----DEATRSRTGRNVLHLPRSPEALARELGLPPREVEERLEAARAALFDLRTT 492

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RPRPHLDDKV+  WNGL I++ +R ++                  D     E A  AA F
Sbjct: 493 RPRPHLDDKVLADWNGLAIAAMSRCAQAF----------------DAPHLAEAAAVAADF 536

Query: 464 IRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
           +   +   +  RL H +R+G +  PG LDDYAF+I GL++LY      +WL  A+ LQ  
Sbjct: 537 VLTRMVTPEG-RLLHRWRDGEAAVPGLLDDYAFMIWGLVELYGATGEVRWLRRALRLQEV 595

Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
           QD  F D EGGGY+ T  +  ++L+R KE HDGA PSGN+ ++ NL+RL+ ++   +   
Sbjct: 596 QDTFFHDPEGGGYWMTPADGDALLVRRKEGHDGALPSGNAAALFNLLRLSLLLGRPE--- 652

Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
           Y + A   L  F T+++   +   +  C  D  ++   + V++ G     D E MLAA  
Sbjct: 653 YGERARGVLRAFATQVRHHPIGSTMFLCGVD-FALSGGRSVIVAGEPDQPDTEAMLAAVR 711

Query: 644 ASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA------DKVVALVCQNFSCS 697
            +Y    TV+H+  +D          N+ + + A   F+A      D+  A +C+N++CS
Sbjct: 712 GTY-APTTVLHLRTSD----------NARDLA-ALVPFTAHLAPVEDRATAWLCENYACS 759

Query: 698 PPVTDPISLENLLL 711
           PP+TDP  L+  LL
Sbjct: 760 PPITDPAELKARLL 773


>gi|419820995|ref|ZP_14344599.1| hypothetical protein UY9_06334, partial [Bacillus atrophaeus C89]
 gi|388474906|gb|EIM11625.1| hypothetical protein UY9_06334, partial [Bacillus atrophaeus C89]
          Length = 645

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 251/695 (36%), Positives = 371/695 (53%), Gaps = 84/695 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 11  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 70

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+         +E+++E  
Sbjct: 71  VFITPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSNTFANDREH--------VEEIAENA 122

Query: 140 SASASSNKLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           S S    K P+    L + AL    +QL   +D+ +GGFG APKFP P    M++Y  + 
Sbjct: 123 S-SHLQIKTPEGNGTLTKEALHRTFQQLMSGFDTVYGGFGQAPKFPMP---HMLMYLLRY 178

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            + TG+        K    TL  MA GGI+DHVG GF RYS D+ W VPHFEKMLYD   
Sbjct: 179 HQYTGQENALYNVTK----TLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNAL 234

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+ +T+D  Y +I   I+ +++R+M    G  +SA DAD   TEG     EG 
Sbjct: 235 LLTAYTEAYQVTQDSRYQHIVEQIITFIQREMTHEDGSFYSALDAD---TEGV----EGK 287

Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSS 373
           +YVW+  E+ + LG E   L+   Y +  +GN            F+G N+  LI      
Sbjct: 288 YYVWSKDEIIETLGDELGELYCAIYNITSSGN------------FEGHNIPNLIHTKLDK 335

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
             A +  +  ++    LGE R+KL   R  R  PH+DDKV+ SWN L+I+  A+A+K+ +
Sbjct: 336 VKA-EFDLNEQEINKQLGEARQKLLKKRETRTYPHVDDKVLTSWNALMIAGLAKAAKVFQ 394

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
           +                 EY+ +A++AA+FI + L  +   R+   +R+G  K  GF+DD
Sbjct: 395 A----------------PEYLNMAQAAAAFIEKKLIIDG--RVMVRYRDGEVKNKGFIDD 436

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAFL+   ++LYE G    +L  A +L     +LF D++ GG++ T  +  ++L+R KE 
Sbjct: 437 YAFLLWAYIELYEAGYDLAYLQKAKDLSAKMLDLFWDQKHGGFYFTGHDAEALLVREKEV 496

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           +DGA PSGNSV+ + L+RL  +  G  S    + AE   + F+  ++           + 
Sbjct: 497 YDGAVPSGNSVAAVQLLRLGQLT-GELS--LIEKAEKMFSAFKRDVEAYPSGHSFFMQSV 553

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
               +P +K +V+ G K     +++++A   ++  N +V+              EH    
Sbjct: 554 LTHMMP-KKEIVIFGRKDDSQRQHIISALQQAFQPNFSVL------------VAEHPDQC 600

Query: 674 ASMARNNFSAD------KVVALVCQNFSCSPPVTD 702
             +A   F+AD      K    +C+NF+C  P TD
Sbjct: 601 KDIA--PFAADYRIIDGKTTVYICENFACQQPTTD 633


>gi|237794355|ref|YP_002861907.1| thymidylate kinase [Clostridium botulinum Ba4 str. 657]
 gi|229263126|gb|ACQ54159.1| dTMP kinase [Clostridium botulinum Ba4 str. 657]
          Length = 682

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 244/693 (35%), Positives = 357/693 (51%), Gaps = 72/693 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 55  TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 115 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 171

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 172 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTKPKFPTAHYILFLLRYYYFKK-- 227

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                   ++   ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 228 -------DNKVLDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 281 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 333

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +WT +E+ DILG E   L+ + Y +   GN            F+ KN+   +N       
Sbjct: 334 LWTKEEIMDILGEEEGELYCKIYNITSKGN------------FENKNIANLINTDLKIVD 381

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                LEK        R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++  
Sbjct: 382 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 432

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF 
Sbjct: 433 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 477

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  L++LYE      +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA
Sbjct: 478 LWALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGA 537

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V+ + L  L  I      D Y+   +     F   +K   M   L    A M +
Sbjct: 538 TPSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFAANIKSGPM-YHLFSVMAYMYN 593

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           V   K + L   +   DF   +   +  Y     +I  D ++        E    N ++ 
Sbjct: 594 VLPIKEITLTYREKDEDFYKFINEVNNRYIPFSIIILNDKSN--------EIEKINKNIK 645

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 DK    +CQN++C  P+TD    +++L
Sbjct: 646 DKIAIKDKTTVYICQNYACREPITDLEEFKSVL 678


>gi|220927673|ref|YP_002504582.1| hypothetical protein Ccel_0215 [Clostridium cellulolyticum H10]
 gi|219998001|gb|ACL74602.1| protein of unknown function DUF255 [Clostridium cellulolyticum H10]
          Length = 673

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 261/701 (37%), Positives = 371/701 (52%), Gaps = 92/701 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA +LN  F+ IKVDREERPD+D +YM+  QAL G GGWPL+
Sbjct: 54  STCHWCHVMERESFEDEEVAHILNRDFICIKVDREERPDIDSIYMSVCQALTGHGGWPLT 113

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFP ED  G  G  ++L  VK+AWD KR+ L  S    I  +S+  
Sbjct: 114 VFLTPDKQPFYAGTYFPKEDSKGLMGLISLLGSVKEAWDNKREHLLVSAENIINHVSKES 173

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
            +  S      ++ Q A          ++DS++GGFG++PKFP P  +  +L  +++KK 
Sbjct: 174 ISKDSKIS--SDIIQEAF----AHFKYNFDSKYGGFGTSPKFPSPHTLLFLLRYWYTKK- 226

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                        +MV  TL+ M  GGI DH+G GF RYS D++W VPHFEKMLYD   L
Sbjct: 227 --------EPYALEMVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALL 278

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A  Y +A+S T +  Y    R ILDY++RDM    G  +SAEDADS   EG     EG F
Sbjct: 279 AIAYGEAYSATGNKNYEETARQILDYVQRDMSSQLGAFYSAEDADS---EGV----EGKF 331

Query: 318 YVWTSKEVEDILG-----EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELN 370
           Y+W+ +EV ++LG     E+  +F     + P+GN            F+G N+  LIE  
Sbjct: 332 YIWSKEEVINVLGSKDGEEYCRIFD----ISPSGN------------FEGLNIPNLIE-- 373

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
                    G   E+  +   +CR+KLF  R KR  P+ DDK++ +WNGL+ ++ A   +
Sbjct: 374 --------TGTLPEQQKSFAEDCRKKLFTHREKRIHPYKDDKILTAWNGLMTAAMAYCGR 425

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +L              G D+  Y+E A+    FI + L      RL   +R G +  P +
Sbjct: 426 VL--------------GEDK--YIESAKRCIDFISKKLV-RTDGRLLARYREGEAVFPAY 468

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYAFL+ GLL+LYE    T +L  A++L +    LF +    G F    +   ++ R 
Sbjct: 469 LEDYAFLVWGLLELYEATFTTLYLKRALKLTDAMLNLFGENNSTGLFLYGHDSEQLIARP 528

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           +E +DGA PSGNSV+ +NL+RLA I    +   Y   A+  +  F T++         M 
Sbjct: 529 RESYDGAIPSGNSVAAMNLLRLARITGRHE---YENRAKAIMDFFGTQINAAPTGHSYML 585

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY-DLNKTVIHIDPADTEEMDFWEEH 669
           C+  M SV      V++   + VD + ++   +  Y      + +I P  TE   F  ++
Sbjct: 586 CSY-MYSVSDISSEVVI---AGVDGKGLIDTFNNKYLPFAVAISNISPELTEIAPFIGDY 641

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            + N           K +A VC+NFSC  P+T+P  L  +L
Sbjct: 642 KAQNG----------KTMAYVCRNFSCMEPITEPKKLGEVL 672


>gi|168178477|ref|ZP_02613141.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916]
 gi|182670724|gb|EDT82698.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916]
          Length = 680

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 244/684 (35%), Positives = 353/684 (51%), Gaps = 70/684 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 53  TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 170 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +   +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           +WT +E+ DILGE      E Y       C +  ++   N F+ KN+   +N        
Sbjct: 332 LWTKEEIMDILGEEE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDN 380

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
               LEK        R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++   
Sbjct: 381 NKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--- 430

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +
Sbjct: 431 -------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFL 476

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
             L++LYE      +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA 
Sbjct: 477 WALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGAT 536

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN+V+ + L  L  I      D Y+   +     F T +K   M   L    A M ++
Sbjct: 537 PSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNI 592

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
              K + L  ++   DF   +   +  Y     V   D ++        E    N ++  
Sbjct: 593 SPVKEITLAYNEKDEDFYKFINELNNRYIPFSIVTLNDKSN--------EIEKINKNIKD 644

Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
                DK    +CQN++C  P+TD
Sbjct: 645 KIAIKDKATVYICQNYACREPITD 668


>gi|335040507|ref|ZP_08533634.1| hypothetical protein CathTA2_2248 [Caldalkalibacillus thermarum
           TA2.A1]
 gi|334179587|gb|EGL82225.1| hypothetical protein CathTA2_2248 [Caldalkalibacillus thermarum
           TA2.A1]
          Length = 715

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 256/705 (36%), Positives = 365/705 (51%), Gaps = 65/705 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFEDE +A +LN+ FVSIKVDREERPD
Sbjct: 56  GEEAFEKARREDKPVFLSIGYSTCHWCHVMERESFEDEEIADILNNHFVSIKVDREERPD 115

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD +YM   QAL G GGWPL++ + PD KP    TY P E K+GR G K IL+K+   W 
Sbjct: 116 VDAIYMAVCQALTGHGGWPLTIVMHPDQKPFFAATYLPKEGKWGRSGLKEILQKIHHLWL 175

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             R  L ++G   I+ + E  S    +     EL +  L     Q  +++D+ +GGFG A
Sbjct: 176 HDRKKLNEAGTNIIKAIQEMKSRPKGA-----ELTKEILHHAYAQFERTFDADYGGFGQA 230

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P     +L   +  + TG+     +  +M   +L+ M +GGI+DH+G GF RYSV
Sbjct: 231 PKFPLPHSYLFLL---RYWQMTGE----PKALEMTEKSLRAMHRGGIYDHLGYGFARYSV 283

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DE+W VPHFEKMLYD   LA  Y +A+  T++ +Y  +  +I +Y++R M  P G  +SA
Sbjct: 284 DEKWLVPHFEKMLYDNALLAYSYTEAYQATRNPYYKQVTEEIFEYVQRVMTSPEGGFYSA 343

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG     EG FYVWT +E+ ++L E  A LF           CD+  +++  
Sbjct: 344 EDADS---EGV----EGKFYVWTPEEIFEVLEETEAELF-----------CDIYDVTEQG 385

Query: 358 NEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
           N F+GKN+L  ++ D    A + G+   +    L   R KLF  R KR  PH DDK++ +
Sbjct: 386 N-FEGKNILHLIDVDLEQKAKQYGLSFAQLEQKLAAARHKLFLHREKRVHPHKDDKILTA 444

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+I++ A+AS                    R +Y+E+A  AA+ I RHL D +  RL
Sbjct: 445 WNGLMIAALAKASAAF----------------GRSDYLELARRAANMIERHLTDNEG-RL 487

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              +R+G +    ++DDYAF I  L +LY        L  A  L +   E F D++ GG+
Sbjct: 488 LARYRDGEAHYLAYIDDYAFFIWALHELYFASLDASCLQQAKSLLDQALERFWDKQNGGF 547

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F    +   ++   KE +DGA PSGN V   NLVR   +   S  D YR+ AE  L  F 
Sbjct: 548 FFYAKDAERLITNPKEIYDGATPSGNGVMAFNLVRHYLL---SGEDVYRETAEALLQAFG 604

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
            ++ +          A  +LS  +   +V+V  K    ++ M+     +Y     V++  
Sbjct: 605 QQINEYPSGHAFSLLALQLLS-GNHAELVIVEGKDRHTYDKMVETVQRAYLPLAVVLYKT 663

Query: 657 PADTEEMD-FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
               + ++     H    A   +  F         C NF+C  PV
Sbjct: 664 REQNQRLNALAPAHQDKQAVDGQTTFYH-------CVNFACRQPV 701


>gi|451344787|ref|YP_007443418.1| hypothetical protein KSO_000140 [Bacillus amyloliquefaciens IT-45]
 gi|449848545|gb|AGF25537.1| hypothetical protein KSO_000140 [Bacillus amyloliquefaciens IT-45]
          Length = 689

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 256/700 (36%), Positives = 368/700 (52%), Gaps = 78/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F+++KVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIAGMLNDKFIAVKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +  
Sbjct: 165 AAHLEVKVHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+ KE+ ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + 
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +L   LE       E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+     
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F+ P       +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI G L+LYE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DG
Sbjct: 482 LIWGYLELYEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS + + L+RL  +          + AE   +VF+  ++    +      +    
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAH 598

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           ++P +K +VL G K   D +  + A            H  PA T       EH    A +
Sbjct: 599 TMP-QKEIVLFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPDELAGI 645

Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTDPISLENLL 710
           +  +F+A       K    +C+NF+C  P TD     N+L
Sbjct: 646 S--DFAAGYQMIDGKTTVYICENFACRRPTTDIDEAMNIL 683


>gi|333987397|ref|YP_004520004.1| hypothetical protein MSWAN_1186 [Methanobacterium sp. SWAN-1]
 gi|333825541|gb|AEG18203.1| hypothetical protein MSWAN_1186 [Methanobacterium sp. SWAN-1]
          Length = 700

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 263/718 (36%), Positives = 371/718 (51%), Gaps = 66/718 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVM  ESFED  VA+L+N+ FV +KVDREERPD
Sbjct: 38  GDEAFKKAEKEDKPIFLSIGYSTCHWCHVMAHESFEDPEVAELINEVFVPVKVDREERPD 97

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD++YM   Q + G GGWPL++ ++PD KP   GTYFP E +YG  G K ++  V++ W 
Sbjct: 98  VDRIYMDVCQIMTGTGGWPLTIIMTPDKKPFFAGTYFPKESRYGSTGLKDLILNVEEIWK 157

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           + R  +  SG    EQ+   L    SS     E+    L    + LSK++D  +GGFG  
Sbjct: 158 ENRKDVLNSG----EQVFRVLK-DVSSTPRGGEIEAKILEKTYDTLSKTFDYEYGGFGDF 212

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
            KFP P  +  +L + K+   TG          MV  TL  M  GGI+DH+G GFHRYSV
Sbjct: 213 QKFPTPHNLMFLLRYWKR---TGNKNAVH----MVEKTLDSMYMGGIYDHLGFGFHRYSV 265

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W VPHFEKMLYDQ  ++ VY++AF  T +  Y  I   I  Y+ R+M  P G  +SA
Sbjct: 266 DPGWVVPHFEKMLYDQALISMVYIEAFQATGNEEYKRIAEQIFKYVFRNMKSPEGGFYSA 325

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDAD   TEG     EG FY+WT KE+ D L  + A L  + + +K  GN +   +    
Sbjct: 326 EDAD---TEGV----EGKFYLWTKKEIFDALDPDEAELICKIFNVKEAGNFEDETIG--- 375

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
            E  G N+L   +     A  LG+   +  + L   R KLF  R  R  P  DDK++  W
Sbjct: 376 -EETGANILYLKSSIGELAEGLGISRRELEDKLETSRMKLFQNRETRVHPQKDDKILADW 434

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+I++ A+A++                  D  +Y + AE AA+FI   +  E   RL 
Sbjct: 435 NGLMITALAKAAQAF----------------DDPKYSKAAEDAANFILDKMCKEG--RLF 476

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           H +R+  +  PG LDD+ F+I GLL+LYE     K+L  A++L     E F D + GG++
Sbjct: 477 HRYRDNEAAIPGNLDDHTFMIWGLLELYEAVFNVKYLKKALKLNKILIEHFWDEKDGGFY 536

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            T  +   VLL  K+ +DGA PSGNSV + NL++LA I    + +    + E +   F T
Sbjct: 537 FTANDSEHVLLWEKQTYDGALPSGNSVGIFNLIKLARITEDPELERRSIDLERA---FST 593

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
           +++   +       A D    PS + VV+VG   + D + M+ +  + +  NK  +  D 
Sbjct: 594 QIRRAPIVHTHFLEAIDFKVGPSYE-VVIVGDPEADDTKKMIQSIRSHFIPNKVFLLKDE 652

Query: 658 -----ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                ++  E   ++E    NA+            A +C   SC  P TD   + NLL
Sbjct: 653 NVPDISEIAESLKYKEPIKGNAT------------AYICTEGSCKSPSTDVRKVLNLL 698


>gi|424826571|ref|ZP_18251427.1| hypothetical protein IYC_01504 [Clostridium sporogenes PA 3679]
 gi|365980601|gb|EHN16625.1| hypothetical protein IYC_01504 [Clostridium sporogenes PA 3679]
          Length = 682

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 248/694 (35%), Positives = 360/694 (51%), Gaps = 74/694 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN+ F+SIKVDREERPDVD +YM++ QA  G GGWPL++
Sbjct: 56  TCHWCHVMERESFEDEDVAEILNNNFISIKVDREERPDVDNIYMSFCQAYTGSGGWPLTI 115

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   IL+ +   W + +  + +S    +EQ+     
Sbjct: 116 LMTPDKKPFFAGTYFPKWGKYNIPGIMDILKSINKLWHEDKSKILESSNRILEQIER--- 172

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N   DEL +  +   A+ L  ++DS++GGFG+ PKFP    I  +L  Y+ KK E
Sbjct: 173 --FQDNHGEDELEEYIIEEAAQTLIDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKKDE 230

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                        ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 231 KV---------LDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 281

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  Y  +   IL+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 282 MAYTEAYEATKNPLYKVVTEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKFY 334

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           +WT KE+ DILGE    F           C L  ++   N F+ KN+  LI+ +      
Sbjct: 335 LWTKKEIIDILGEEDGAFY----------CKLYDITSRGN-FENKNIANLIQTDLKDVDN 383

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           +K         + L   R KLF+ R KR  PH DDK++ SWN L+I +F RA +  K++ 
Sbjct: 384 NK---------DKLERIREKLFEYREKRIHPHKDDKILTSWNALMIIAFCRAGRSFKND- 433

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y+++A+ +A FI ++L DE    L    R+      GF+DDYAF
Sbjct: 434 ---------------NYIDIAKQSADFIIKNLMDENG-TLYARIRDEERGNEGFIDDYAF 477

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
            +  L++LYE      +L  +IE+ ++  +LF  +E GG++  +     +++R KE +DG
Sbjct: 478 FLWALIELYEASFDIYYLEKSIEVADSMIDLFWHKEKGGFYLYSKNSEKLIVRPKEIYDG 537

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGN+V+ + L  L  I      D Y+   +     F   +K   M   L    A M 
Sbjct: 538 AMPSGNAVASLALSLLYYITG---EDKYKNLVDEQFKFFAANIKSGPM-YHLFSVMAYMY 593

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           +V   K + L  ++    F   +   +  Y +  ++I ++    E     E+ N N    
Sbjct: 594 NVSPVKEITLAYNEKDEAFYEFINEFNNRY-IPFSIITLNDKSNE----IEKINKNLKDK 648

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           A      DK    +CQN++C  P+TD    +++L
Sbjct: 649 AP---IKDKTTVYICQNYACREPITDLEKFKSVL 679


>gi|311070619|ref|YP_003975542.1| hypothetical protein BATR1942_18470 [Bacillus atrophaeus 1942]
 gi|310871136|gb|ADP34611.1| hypothetical protein BATR1942_18470 [Bacillus atrophaeus 1942]
          Length = 687

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 251/695 (36%), Positives = 371/695 (53%), Gaps = 84/695 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+         +E+++E  
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSNTFANDREH--------VEEIAENA 164

Query: 140 SASASSNKLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           S S    K P+    L + AL    +QL   +D+ +GGFG APKFP P    M++Y  + 
Sbjct: 165 S-SHLQIKTPEGNGTLTKEALHRTFQQLMSGFDTVYGGFGQAPKFPMP---HMLMYLLRY 220

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            + TG+        K    TL  MA GGI+DHVG GF RYS D+ W VPHFEKMLYD   
Sbjct: 221 HQYTGQENALYNVTK----TLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNAL 276

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y +A+ +T+D  Y +I   I+ +++R+M    G  +SA DAD   TEG     EG 
Sbjct: 277 LLTAYTEAYQVTQDSRYQHIVEQIITFIQREMTHEDGSFYSALDAD---TEGV----EGK 329

Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSS 373
           +YVW+  E+ + LG E   L+   Y +  +GN            F+G N+  LI      
Sbjct: 330 YYVWSKDEIIETLGDELGELYCAIYNITSSGN------------FEGHNIPNLIHTKLDK 377

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
             A +  +  ++    LGE R+KL   R  R  PH+DDKV+ SWN L+I+  A+A+K+ +
Sbjct: 378 VKA-EFDLNEQEINKQLGEARQKLLKKRETRTYPHVDDKVLTSWNALMIAGLAKAAKVFQ 436

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
           +                 EY+ +A++AA+FI + L  +   R+   +R+G  K  GF+DD
Sbjct: 437 A----------------PEYLNMAQAAAAFIEKKLIIDG--RVMVRYRDGEVKNKGFIDD 478

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAFL+   ++LYE G    +L  A +L     +LF D++ GG++ T  +  ++L+R KE 
Sbjct: 479 YAFLLWAYIELYEAGYDLAYLQKAKDLSAKMLDLFWDQKHGGFYFTGHDAEALLVREKEV 538

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           +DGA PSGNSV+ + L+RL  +  G  S    + AE   + F+  ++           + 
Sbjct: 539 YDGAVPSGNSVAAVQLLRLGQLT-GELS--LIEKAEKMFSAFKRDVEAYPSGHSFFMQSV 595

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
               +P +K +V+ G K     +++++A   ++  N +V+              EH    
Sbjct: 596 LTHMMP-KKEIVIFGRKDDSQRQHIISALQQAFQPNFSVL------------VAEHPDQC 642

Query: 674 ASMARNNFSAD------KVVALVCQNFSCSPPVTD 702
             +A   F+AD      K    +C+NF+C  P TD
Sbjct: 643 KDIA--PFAADYRIIDGKTTVYICENFACQQPTTD 675


>gi|197119298|ref|YP_002139725.1| hypothetical protein Gbem_2926 [Geobacter bemidjiensis Bem]
 gi|197088658|gb|ACH39929.1| thioredoxin domain protein YyaL [Geobacter bemidjiensis Bem]
          Length = 746

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 252/700 (36%), Positives = 366/700 (52%), Gaps = 56/700 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+ LN  F++IKVDREERPDVD +YMT V A+   GGWPL+V
Sbjct: 98  TCHWCHVMEEESFEDEEVARFLNSNFIAIKVDREERPDVDTIYMTAVHAMGMQGGWPLNV 157

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F +PD KP  GGTYFPP D  G  GF ++L+++++ + +  D +  +G     QL+EA+ 
Sbjct: 158 FATPDRKPFYGGTYFPPRDYAGGIGFLSLLQRIRETYRQAPDRVTHAGV----QLTEAIR 213

Query: 141 ASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
              +   +  E PQN + L    E   + +D++ GG   APKF         L     L 
Sbjct: 214 GMLAP--MGGEPPQNEISLERVIEAYQERFDAKNGGVVGAPKF------PSSLPLGLLLR 265

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D  + G+ +    M  +TL+ MA GGI+D  GGGFHRY+ D  W +PHFEKMLYD  +LA
Sbjct: 266 DHLRRGDKN-SLFMAQYTLRRMAAGGIYDQAGGGFHRYATDSAWLIPHFEKMLYDNARLA 324

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL+ +  T D  ++ + R+IL YL+RDM+ P G  +SA DADS    G   ++EG F+
Sbjct: 325 AAYLEGYQATGDPQFAKVAREILRYLQRDMMSPQGAFYSATDADSLTESG--HREEGIFF 382

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
            WT +E++ +LG E A +    Y +   GN            F+G+++L         A 
Sbjct: 383 TWTPEELDAVLGTERARVVAACYGVTSEGN------------FEGRSILHREKSMQHLAE 430

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +L +P E+   +L E R +L+  R +RP P  D+K++ SWNGL IS+FAR   +L   A 
Sbjct: 431 ELMLPKEELERLLDEAREELYRARQRRPLPLRDEKILASWNGLAISAFARGGLVLNDPA- 489

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                           ++ A  AA+FI + +  ++  RL HS++ G +K  GFLDDYAF 
Sbjct: 490 ---------------LLDTARRAANFILQSMMSQE--RLCHSYQEGEAKGEGFLDDYAFF 532

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I+GL+DL+E      WL  A+E+     E F D E GG+F T      ++ R K  +DG 
Sbjct: 533 IAGLIDLFEATGELPWLKRALEVAQQVQEQFEDSETGGFFMTGPRHEELISREKPAYDGV 592

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV ++NL+RL ++       +    A+ +L  F  +L     A+  M  A D L 
Sbjct: 593 IPSGNSVMIMNLLRLNALTG---EQWMLDQAQRALDAFSIQLASAPTALSEMLLALDYLQ 649

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
              R+ V++           +L      +  N+ ++        E D  E+       + 
Sbjct: 650 DLPREIVIVAPQGKREAAGPLLEKLRGVFLPNRALVVFC-----EGDELEQAGELLPLVR 704

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
                    +A +C++ SC  P +DP      L E  S  
Sbjct: 705 EKKADGGLAMAYLCESRSCRRPTSDPEEFHRQLQETQSKV 744


>gi|435851537|ref|YP_007313123.1| thioredoxin domain protein [Methanomethylovorans hollandica DSM
           15978]
 gi|433662167|gb|AGB49593.1| thioredoxin domain protein [Methanomethylovorans hollandica DSM
           15978]
          Length = 717

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 246/687 (35%), Positives = 370/687 (53%), Gaps = 61/687 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  VA+L+N  F+ IKVDREERPD+D VYM   QA+ G GGWPL+
Sbjct: 65  STCHWCHVMEKESFEDPDVARLMNATFICIKVDREERPDIDSVYMAICQAITGRGGWPLT 124

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++P+ +P    TY P + ++G PG   ++  +   W ++++ + Q+      +L  AL
Sbjct: 125 ILMTPNKEPFFAATYIPKKSRFGNPGMLDLIPHIAKVWTQQQEDILQTA----RELKAAL 180

Query: 140 S---ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           S     AS+     E+ +  L     QL  ++D + GGFG APKFP P  +  +L + ++
Sbjct: 181 SPQMVQASAKSTGTEINEKTLHSGYSQLLSAFDWQAGGFGRAPKFPSPHNLTFLLRYWQR 240

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
              TGK     E  +MV  TL  M  GGI+DHVG GFHRYS D +W VPHFEKMLYDQ  
Sbjct: 241 ---TGK----LEALQMVTKTLDGMRGGGIYDHVGFGFHRYSTDGQWLVPHFEKMLYDQAM 293

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y + F +T    +  +  +I++Y+ RDM    G  + AEDADS   EG     EG 
Sbjct: 294 LIMAYTEGFQVTGIEDHRQVAAEIIEYVLRDMCSAEGAFYCAEDADS---EGM----EGK 346

Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNC--DLSRMSDPHNEFKGKNVLIELNDSS 373
           FY+W  +E+ D+L  E A L  + Y +   GN   ++S +S        +N+L       
Sbjct: 347 FYLWKKEEIYDLLPLEVANLVCKVYDISSEGNYKEEISGIS------TRQNILHLARPMQ 400

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
            +A +LG+ L++    L   R+ LF  R KR  P  DDKV+  WNGL+I++  +AS+   
Sbjct: 401 EAAQELGISLDELKAKLEPARKILFAAREKRVHPSKDDKVLTDWNGLMIAALCKASRAF- 459

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                          +R EY + A   A FI +H+      RL H +R+G +   GFL+D
Sbjct: 460 ---------------ERPEYAQAASRTADFILQHM-SSHDGRLLHRYRDGEASISGFLED 503

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAFL+ GL++LY+     K+L  A+ L + Q   F+D E GG+F+T  +  ++L R K+ 
Sbjct: 504 YAFLVWGLIELYQATFEKKYLEHALRLNSLQIRDFMDVE-GGFFHTANDSETLLFRNKDL 562

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           +DGA PSGNSVSV+NL++L+ +   +  +   + A  S+  F  ++  M MA      A 
Sbjct: 563 YDGAMPSGNSVSVLNLLKLSRLTGDTDLE---EKASTSMKAFSGQIDAMPMAYSQFLHAL 619

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
           D  + P+ + VV+ G     +   M++ A  S+  N  ++     +  E+     +  + 
Sbjct: 620 DFTAGPAYE-VVIAGDPDDPNTREMISLAGRSFLPNMVLLLQGKNNIGEL---APYTKDM 675

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPV 700
           ++  RN          +CQ +SCS P+
Sbjct: 676 SATDRN------ATVYICQGYSCSMPI 696


>gi|194017545|ref|ZP_03056156.1| YyaL [Bacillus pumilus ATCC 7061]
 gi|194010817|gb|EDW20388.1| YyaL [Bacillus pumilus ATCC 7061]
          Length = 687

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 253/688 (36%), Positives = 360/688 (52%), Gaps = 71/688 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+  Q + G GGWPL+V
Sbjct: 54  TCHWCHVMAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNV 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++PD KP   GTYFP    YGRPGF   L +++DA+   RD +      A   L    +
Sbjct: 114 FVTPDQKPFYAGTYFPKRSAYGRPGFIEALTQLRDAYHNDRDHIESLAEKATNNLRIKAA 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
               S      L Q A+     QL  S+D+  GGFGSAPKFP P    M+ +  +  E T
Sbjct: 174 GQTEST-----LTQEAIHKAYYQLMSSFDTLHGGFGSAPKFPAP---HMLSFLMRYYEWT 225

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+          V+ TL  MA GGI+DHVG GF RYS DE+W VPHFEKMLYD   L   
Sbjct: 226 GQEN----ALYAVMKTLDGMANGGIYDHVGSGFSRYSTDEKWLVPHFEKMLYDNALLMEA 281

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A+ LT+   Y  +   ++ +++RDM+ PGG  +SA DADS   EG    KEG +YVW
Sbjct: 282 YTEAYQLTQQPEYEKLVHRLIHFIKRDMMNPGGSFYSAIDADS---EG----KEGQYYVW 334

Query: 321 TSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           +  E+   LGE    LF   Y++   GN + + +  PH       +    +D  AS S  
Sbjct: 335 SKDEIMTHLGEDLGALFCAIYHITEEGNFEGANI--PH------TISTSFDDIKASFSID 386

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
              L+  L    E R  L  VR +RP P +DDKV+ SWN L+ISS A+A ++  +E    
Sbjct: 387 DHALQSKLQ---EARHILQSVRQQRPAPLVDDKVLTSWNALMISSLAKAGRVFGAE---- 439

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       E + +A+ A SF+  HL   Q  RL   +R G  K  GF++DYA ++ 
Sbjct: 440 ------------EAIRMAKQAMSFLETHLV--QHDRLMVRYREGDVKHLGFIEDYAHMLK 485

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             + LYE      WL  A  +     ELF D+E GG+F +  +  ++++R KE +DGA P
Sbjct: 486 AYMSLYEATFELAWLEKATAIAKNMFELFWDKEKGGFFFSGSDAEALIVREKEVYDGAMP 545

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCA---ADM 615
           SGNS ++  L+ L+ +         RQ+   +L  +F+    D++ + P    A     +
Sbjct: 546 SGNSTALKQLLMLSRLTG-------RQDWLDTLEQMFKAFYVDVS-SYPSGHTAFLQGLL 597

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
               +++ ++++G       E +L A      L K  +  D   T E     E  +  A 
Sbjct: 598 AQYATKREIIILGKNGDPQKEQLLQA------LQKRFMPFDIILTAETG---EELAKLAP 648

Query: 676 MARNNFSAD-KVVALVCQNFSCSPPVTD 702
             +N  + D K    +C+N+SC  P+T+
Sbjct: 649 FTKNYKTIDGKTTVYICENYSCRQPITN 676


>gi|435854108|ref|YP_007315427.1| thioredoxin domain protein [Halobacteroides halobius DSM 5150]
 gi|433670519|gb|AGB41334.1| thioredoxin domain protein [Halobacteroides halobius DSM 5150]
          Length = 681

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 247/701 (35%), Positives = 372/701 (53%), Gaps = 83/701 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF D+ VA +LN+ FVSIKVDREERPD+D +YM+  QA+ G GGWPL+
Sbjct: 53  STCHWCHVMERESFADQEVANVLNENFVSIKVDREERPDIDDIYMSVCQAMTGRGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA- 138
           V ++PD +P   GTYFP + K GRPG   IL ++   W  +++ + +S    ++ + +  
Sbjct: 113 VVMTPDKRPFFAGTYFPKQTKRGRPGLLKILDQITKKWSNQQEKILESSEELVQAIKQQD 172

Query: 139 ---LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
               +A+ SSN L D+L + A+      L  S+D+++GGFGSAPKFP P  +  +L +  
Sbjct: 173 MKKQAANFSSNDL-DKLVKEAV----SSLKSSFDAQYGGFGSAPKFPSPHNLMFLLRY-- 225

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
                GK     E   +V  TL  M +GGI+DH+G GF RY+ DE+W  PHFEKMLYD  
Sbjct: 226 -----GKIHNDQEVLSIVEKTLDSMYQGGIYDHIGYGFSRYATDEKWLAPHFEKMLYDNA 280

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            L  VYL+ + + +   Y+ I  +IL Y+ RDM    G  +SAEDADS   EG    +EG
Sbjct: 281 LLTIVYLEGYQVLEKEIYAKIAEEILAYINRDMTSSKGAFYSAEDADS---EG----EEG 333

Query: 316 AFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
            +Y+W   EV++ LG+     F + Y + P GN            F GKN+    N    
Sbjct: 334 KYYLWQPGEVKEALGDKLGSQFCQTYNIIPEGN------------FAGKNI---PNLIKT 378

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
              KL +  E       + R+KLF  R KR RP  DDK++ +WNGL+I +FA+A KIL  
Sbjct: 379 ERDKLKINHE-----FRKARKKLFLAREKRVRPAKDDKILTAWNGLMIVAFAKAGKIL-- 431

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                         D++EY+  A+ AA FI  +L  +   RL   +R G +   G+++DY
Sbjct: 432 --------------DKEEYLNYAKEAADFIWDNLIRKDDGRLLARYREGEADYLGYVNDY 477

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           AF I GL++LY+      +L  A+ L       F D+E GG++    +   ++ R K   
Sbjct: 478 AFYIWGLIELYQANFNANYLERALILNKDLIHFFWDQEDGGFYLYGSDGEKLITRPKRVR 537

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           DGA PSGNS++ +NL++L+ +V+  + SD  +Q  E+    F  +++    A      + 
Sbjct: 538 DGALPSGNSIATLNLLKLSKLVSNQELSDMAQQQFEY----FYNQVRKAPRAYSAFLISV 593

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM----DFWEEH 669
                P  K V++V  K   +   M+      ++    V+  D  + +++     + +++
Sbjct: 594 LFNQQPG-KEVIIVKAKEETE---MIDIFQQKFNPFSVVVVKDTKNNDKLIELISYIKDY 649

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
              N           +  A VC++FSC  PVT     + L+
Sbjct: 650 QVKNG----------ETTAYVCEDFSCLAPVTSRDKFKELI 680


>gi|302037753|ref|YP_003798075.1| hypothetical protein NIDE2440 [Candidatus Nitrospira defluvii]
 gi|300605817|emb|CBK42150.1| conserved protein of unknown function (modular protein) [Candidatus
           Nitrospira defluvii]
          Length = 1236

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 239/694 (34%), Positives = 358/694 (51%), Gaps = 64/694 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
           ++CHWCHVME ESFE+E +A+L+N  FV IKVDREERPD+D++YM    AL    GGWP+
Sbjct: 56  SSCHWCHVMERESFENEAIARLMNHHFVCIKVDREERPDLDEIYMQATLALNRNQGGWPM 115

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +VFL+PD KP   GTYFPPED++GRPGF T+L+K+ + W+K    +    A    +L + 
Sbjct: 116 TVFLTPDQKPFFAGTYFPPEDRWGRPGFPTLLKKIAEYWEKDHAGVVAQAATLTARLQDG 175

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
             A +     P  + +  L +   Q ++ +D++ GGFG APKFP    + ++L+   + +
Sbjct: 176 SHAPS-----PTTVGEAELDMAVTQFAEDFDAKLGGFGGAPKFPPATGLSLLLHCYHRTK 230

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D        +   MV  TL  MA GGI+D +G GF RYS D+RW VPHFEKMLYD   LA
Sbjct: 231 D-------PQTLTMVRTTLDAMAAGGIYDQIGDGFARYSTDDRWLVPHFEKMLYDNALLA 283

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY++AF +T D  Y  +  + LDY+ ++M  P G  +SA DADS   EG     EG F+
Sbjct: 284 RVYVEAFQVTADPNYRRVACETLDYILKEMTSPEGGFYSATDADS---EGV----EGKFF 336

Query: 319 VWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT  E+  +L   E       +Y + P GN            ++ KNVL      ++ A
Sbjct: 337 VWTPDEIRAVLSNEEDVRRICTYYDVTPAGN------------WEHKNVLHTAKPVASVA 384

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +LG+ +E     +   +  L+  R+KR  P LDDKVI +WNG++IS+ A A ++     
Sbjct: 385 KELGLTVEDLQATIDRVKPLLYAARAKRVPPGLDDKVITAWNGMMISAMAEAGRV----- 439

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F+ P        Y   AE A  F+   L  +   RL  ++R G +    +L+DYA+
Sbjct: 440 ----FDMP-------RYRAAAERACEFLLTTL-SKPDGRLLRTYRAGTAHLDAYLEDYAY 487

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
              GL+D YE G   ++L  A+ L       F D + GG+F T     ++++R +E  DG
Sbjct: 488 FAEGLIDTYEAGGHERYLSAAVRLAERILADFSDGQQGGFFTTATGHEALIVRSREGPDG 547

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGN+V+   L RL+        + +RQ A  ++  +  ++     A        D+L
Sbjct: 548 ATPSGNAVAAAALARLSYHFG---REDFRQAAAGAVRAYGRQIARYPRAFAKSLIVVDLL 604

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           +      + ++G     +   + AA   +Y  N+ +   +   +E           +  +
Sbjct: 605 T-SGPVEIAVIGAPDDSNTVALRAAVSRTYIPNRVIASRESQQSE---------PTHPLL 654

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   K    VC+NF+C  P+TDP  L   L
Sbjct: 655 HGKALVGGKSALYVCRNFACRRPITDPADLPTQL 688


>gi|163846817|ref|YP_001634861.1| hypothetical protein Caur_1244 [Chloroflexus aurantiacus J-10-fl]
 gi|222524638|ref|YP_002569109.1| hypothetical protein Chy400_1363 [Chloroflexus sp. Y-400-fl]
 gi|163668106|gb|ABY34472.1| protein of unknown function DUF255 [Chloroflexus aurantiacus
           J-10-fl]
 gi|222448517|gb|ACM52783.1| protein of unknown function DUF255 [Chloroflexus sp. Y-400-fl]
          Length = 693

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 248/694 (35%), Positives = 367/694 (52%), Gaps = 62/694 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF D  VA + N++F++IKVDREERPD+D +YM   QAL G GGWPL+VF
Sbjct: 56  CHWCHVMAHESFADPEVAAVQNEYFINIKVDREERPDLDNIYMAAAQALTGRGGWPLNVF 115

Query: 82  LSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
             PD  P   GTYFPP+ K  R   PG++ +L  V +A+  +R  +  S    +E +   
Sbjct: 116 CLPDGTPFFAGTYFPPDAKAARYRMPGWRQVLLSVAEAYKTRRADVTASAHELLEHIK-- 173

Query: 139 LSASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
                 +  LP+ LP  +  L   A Q+ + +D ++GGFG APKFP+PV ++ +L     
Sbjct: 174 ----LLTRPLPETLPLDEELLMAAAAQIGREFDPQYGGFGDAPKFPQPVVLEFLLR---- 225

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
              T   G+  +   M+  TL+ MA+GG++D VGGGFHRYSVDERW VPHFEKMLYD   
Sbjct: 226 ---THLRGDV-QALPMLQQTLEQMARGGMYDQVGGGFHRYSVDERWLVPHFEKMLYDNAL 281

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           LA VY  A  +T D F + I  +   Y+ RD+  P G  FS+EDADS  T GA+  +EGA
Sbjct: 282 LAEVYHLAAQVTGDTFLARIADETFTYMLRDLRHPDGAFFSSEDADSLPTPGASHAEEGA 341

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FYVWT  E+   LG+ A+L   +Y +   GN            F+G+++L     ++A A
Sbjct: 342 FYVWTPDELRAALGDDAVLVGAYYGVTRQGN------------FEGRSILHVPRPAAAVA 389

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           + LG+ +E+    +   R  L   R +RPRP  D+KVI +WN + I + A AS  + +  
Sbjct: 390 AMLGVSVERLEATVARARPILRTFRERRPRPFRDEKVITAWNAMAIRALAVASSRVPA-- 447

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y++ A   A F+  +L  +   RL  S+++G      FLDDYA 
Sbjct: 448 ----------------YLDAARQCADFLLTNLRRDDG-RLLRSWKDGRPGPAAFLDDYAL 490

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
               L++L+  G  T++L  AI+L +   +LF D + G +F+T  + P+++ R ++  D 
Sbjct: 491 FCDALIELHAAGGDTRYLATAIDLADAMIDLFWDDQAGMFFDTGRDQPALVTRPRDLSDN 550

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSG+S + + L+RL +I    +   Y   A  +L      LK   +    M CAAD+ 
Sbjct: 551 ATPSGSSAATVALLRLYAITGRER---YETRAMQTLQQTTPLLKRFPLGFGRMLCAADLA 607

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             P R+ + ++G       + MLA A ++Y     +    P D           + +  +
Sbjct: 608 LGPLRE-LAIIGPPDHPVTQAMLAVARSAYRPRLVIARAMPDDPV--------VTLSPLL 658

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   +  A +C+ F+C  PVT P +L+  L
Sbjct: 659 NDRPMVDGQPTAYLCEQFACQMPVTTPEALQAQL 692


>gi|387929306|ref|ZP_10131983.1| hypothetical protein PB1_12859 [Bacillus methanolicus PB1]
 gi|387586124|gb|EIJ78448.1| hypothetical protein PB1_12859 [Bacillus methanolicus PB1]
          Length = 685

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 236/564 (41%), Positives = 326/564 (57%), Gaps = 53/564 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA+LLN+ FVSIKVDREERPD+D +YM   Q + G GGWPLS
Sbjct: 53  STCHWCHVMERESFEDEEVARLLNERFVSIKVDREERPDIDSIYMNICQMMNGHGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP E +YG PGFK ++ ++ D + K RD + +  + A E L    
Sbjct: 113 VFMTPDQKPFFAGTYFPKESRYGVPGFKEVITQLHDQYMKNRDQIEKIASDAAEALKH-- 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           SA  SS +LP     + L    +QL+ S++S +GGFG APKFP P  +  +L + K    
Sbjct: 171 SARESSAELPS---ADVLHKTYQQLAGSFNSFYGGFGDAPKFPIPHNLMFLLKYYKW--- 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TGK        KMV  TL  MA GGI+DH+G GF RYSVD  W VPHFEKMLYD   L  
Sbjct: 225 TGKEM----ALKMVEKTLVSMANGGIYDHIGFGFARYSVDVMWLVPHFEKMLYDNALLLY 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +TK+  Y  I   I++++ R+M    G  FSA DADS   EG    +EG +YV
Sbjct: 281 TYSEAYQVTKNSKYKEIAEQIIEFITREMTNEEGAFFSAIDADS---EG----EEGKYYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           W+ +E+ D+LG+     F   Y +   GN            F+GKN+  LI  N    + 
Sbjct: 334 WSKEEILDVLGDKDGEFFCRVYDITSGGN------------FEGKNIPNLIHTN-IVKTV 380

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           ++ G+ LE+    L E R+KLF+ R +R  PHLDDK++ SWN L+I+  A+A +  ++  
Sbjct: 381 AEAGLNLEEGKAKLEESRQKLFEKRQERVYPHLDDKILTSWNALMIAGLAKAGQAFQN-- 438

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                         K ++E AE A  FI   L       L   +R+G SK   +LDD+AF
Sbjct: 439 --------------KNHVEKAEKALRFIEEKLV--VNGELMARYRDGESKFRAYLDDWAF 482

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+  LL+LYE     ++L  A        + F D + GG++ T  +  ++++R K+ +DG
Sbjct: 483 LLWALLELYEATFSMEYLDKARNTAEKMKKHFWDEQDGGFYFTRSDGEALIVREKQVYDG 542

Query: 557 AEPSGNSVSVINLVRLASIVAGSK 580
           A PSGNSV+ ++L+RL      +K
Sbjct: 543 ALPSGNSVAAVSLLRLGHFTGETK 566


>gi|345020399|ref|ZP_08784012.1| hypothetical protein OTW25_03576 [Ornithinibacillus scapharcae
           TW25]
          Length = 685

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 256/714 (35%), Positives = 372/714 (52%), Gaps = 78/714 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVM  ESFEDE VAKL+ND +++IKVDREERPD
Sbjct: 32  GEEAFEKAKQENKPIFLSIGYSTCHWCHVMAHESFEDEEVAKLINDHYIAIKVDREERPD 91

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD +YM   Q + G GGWPL++F++PD  P   GTYFP E KYGRPG K  L ++   + 
Sbjct: 92  VDSIYMKVCQMMAGHGGWPLTIFMTPDKIPFYAGTYFPKESKYGRPGIKEALEQLHIKYT 151

Query: 119 KKRDMLAQSGAFAIEQLSEALSASA---SSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
              + +A       E + EAL  +    S+N+L  E    A     +QL + +D  +GGF
Sbjct: 152 TDPEHIAD----VTESVREALDNTIREKSNNRLTIETVDQAF----QQLGRGFDFTYGGF 203

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
             APKFP+P   Q +L+  +    +GK+       KMV  TLQ MA GGI DH+G GF R
Sbjct: 204 WEAPKFPQP---QNLLFLMRYYHFSGKTA----ALKMVESTLQNMAAGGIWDHIGYGFAR 256

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           YS DE+W VPHFEKMLYD   L  VY + + +TK  FY  I   I+ +++R+M    G  
Sbjct: 257 YSTDEKWLVPHFEKMLYDNALLLMVYTECYQITKKPFYKNIAEQIITFIKREMTSKDGAF 316

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMS 354
           +SA DADS   EG     EG +YVW  +E+ DILGE    ++   Y + P GN       
Sbjct: 317 YSAIDADS---EGV----EGKYYVWADEEIYDILGEDLGEIYTTTYGITPFGN------- 362

Query: 355 DPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 412
                F+GKN+  LI  N  S  A +  + L +  + L   R  L   R KR  PH+DDK
Sbjct: 363 -----FEGKNIPNLIRANLESV-AEEFDLTLSELTSQLETARLTLLQEREKRVYPHVDDK 416

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
           V+ SWN ++I+  A+AS++ +++                +Y+ +A+ A SF+  ++  + 
Sbjct: 417 VLTSWNAMMIAGLAKASRVFQNQ----------------DYVTLAKRALSFLEENIVVDG 460

Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
              L   +R G +K   +LDDYA+LI   ++LY+      +L  A    N   ELF D  
Sbjct: 461 D--LMARYREGETKYHAYLDDYAYLIWAYIELYQLEFDLTYLSKAKAQLNIMIELFWDPH 518

Query: 533 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
            GG+F +   +  ++   KE +DGA PSGNSV+ + L ++AS+    + DY  +  E   
Sbjct: 519 HGGFFFSGKNNEKLISNDKEIYDGATPSGNSVAALMLGQMASLTG--EVDYLDKINEMYS 576

Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN-KT 651
             +E  +K  +  V  +     +L+    K VV++GH  +V  +  L      Y  N   
Sbjct: 577 TFYEDMMKQPSAGVFFLQSL--LLTENPTKEVVVLGHDENV--QEFLNHVQDKYAPNIAL 632

Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPIS 705
           ++ + P    E+  +    + N  M  N     +    VC+NF+C  P  D I+
Sbjct: 633 LVAVTPGQLIEVAPF----AANYKMVNN-----QTTIYVCENFACQQPTNDIIA 677


>gi|421729533|ref|ZP_16168663.1| hypothetical protein WYY_00569 [Bacillus amyloliquefaciens subsp.
           plantarum M27]
 gi|407076503|gb|EKE49486.1| hypothetical protein WYY_00569 [Bacillus amyloliquefaciens subsp.
           plantarum M27]
          Length = 689

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 254/692 (36%), Positives = 364/692 (52%), Gaps = 78/692 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +  
Sbjct: 165 AAHLEVKIHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+ KE+ ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + 
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +L   LE       E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+     
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F+ P       +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI G L+LYE G    +L  A  L     ELF D   GG+F T  +  ++L+R KE +DG
Sbjct: 482 LIWGYLELYEAGFHPSYLQKAKTLCTNMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS + + L+RL  +          + AE   +VF+  ++    +      +    
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAH 598

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           ++P +K +V+ G K   D +  + A            H  PA T       EH    A +
Sbjct: 599 TMP-QKEIVVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPDELAGI 645

Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTD 702
           +  +F+A       K    +C+NF+C  P TD
Sbjct: 646 S--DFAAGYQMIDGKTTVYICENFACRRPTTD 675


>gi|300855044|ref|YP_003780028.1| hypothetical protein CLJU_c18640 [Clostridium ljungdahlii DSM
           13528]
 gi|300435159|gb|ADK14926.1| conserved protein containing a thioredoxin domain [Clostridium
           ljungdahlii DSM 13528]
          Length = 675

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 248/702 (35%), Positives = 359/702 (51%), Gaps = 92/702 (13%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME  SFED  VA++LND F+SIKVDREERPD+D +YM   Q++ G GGWPL++
Sbjct: 54  TCHWCHVMEKGSFEDTEVAEMLNDSFISIKVDREERPDIDSIYMNVCQSITGSGGWPLTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP  ++ G  G  +IL  +K AW   R  L  +        ++ L 
Sbjct: 114 IMTPDQKPFFAGTYFPKNNRDGLMGLMSILDYIKKAWKNNRSELLNAS-------TQILD 166

Query: 141 ASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +  +SN+  +E + ++  +         +D  +GGFG  PKFP    +  +L +  K +D
Sbjct: 167 SLKNSNETSNETINEDIFQKTFLNFKYDFDPTYGGFGDFPKFPSAHNLLFLLRYFYKTKD 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                  S   +MV  TL CM KGGI+DH+G GF RYSVD +W VPHFEKMLYD   L  
Sbjct: 227 -------SSALEMVEKTLDCMRKGGIYDHIGFGFSRYSVDRKWLVPHFEKMLYDNALLII 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y++ F  T +  Y     +IL Y+ RDM    G  +SAEDADS   EG    +EG FYV
Sbjct: 280 AYIETFQATGNKKYCKTAEEILSYVLRDMTSNEGGFYSAEDADS---EG----EEGKFYV 332

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+ +E++DIL E  +  F  ++ +   GN            F+GKN+L  +N S      
Sbjct: 333 WSEEEIKDILQEEDSGKFCSYFNVTKGGN------------FEGKNILNLINSS------ 374

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             +P E  +  +  CR KLF  R KR  P+ DDK++ SWNGL+I + + A+++L      
Sbjct: 375 --IP-EDDMQFIENCREKLFAEREKRIHPYKDDKILTSWNGLMIGAMSIAARVL------ 425

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     +  +Y + A+ A  FI ++L  +   RL   +R+G +   G+LDDY+FLI
Sbjct: 426 ----------NNSKYTKAAKKAVDFIYKNLV-KSDGRLLARYRDGEASFLGYLDDYSFLI 474

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++LYE    T +L  A+EL     +LF D+E GG+F    +   ++ R KE +D A 
Sbjct: 475 WGLIELYETTYSTDYLKKALELNEDLLKLFWDKENGGFFLYGNDGEKLITRPKEIYDSAI 534

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV+ +NL+RL+ + +      +   A+     F   +     A      +      
Sbjct: 535 PSGNSVATLNLLRLSHLTSSYD---FEDKAKQLFDAFSREINSFPRACSFSLISLLFSKS 591

Query: 619 PSRKHVVLVG----------HKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           P R+ +V  G          H  +  F N    +    +LNK +  I P     +D    
Sbjct: 592 PIRQIIVSAGSNIEEGKQVVHMINEKF-NPFTISILYCNLNKDLSTISPIIKNYIDI--- 647

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
              NN           K    +C+NF+C  P+TD   L  +L
Sbjct: 648 ---NN-----------KTTTYICENFTCKKPITDINLLRKIL 675


>gi|444911449|ref|ZP_21231624.1| Thymidylate kinase [Cystobacter fuscus DSM 2262]
 gi|444718207|gb|ELW59023.1| Thymidylate kinase [Cystobacter fuscus DSM 2262]
          Length = 683

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 246/702 (35%), Positives = 369/702 (52%), Gaps = 78/702 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE +A+L+N+ F+++KVDREERPDVD++Y   VQ +  GGGWPL+
Sbjct: 48  SACHWCHVMAHESFEDEAIARLMNEGFINVKVDREERPDVDQLYQGVVQLMGQGGGWPLT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSE- 137
           VFL+PDL P  GGTYFPP+D+YGRPGF  +LR + +AW   R ++L+Q+  F  E L E 
Sbjct: 108 VFLTPDLVPFFGGTYFPPKDRYGRPGFPKVLRALSEAWATNRGELLSQAREFR-EGLGEL 166

Query: 138 ---ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
               L A+ ++ K P+++    L L      +  D   GGFG APKFP P+ + ++L   
Sbjct: 167 ALHGLDAAPAALK-PEDIVSMGLSLL-----ERMDGVNGGFGGAPKFPNPMNVALVLRAW 220

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           ++  + G+       ++ VL TL+ MA+GG++D +GGGFHRYSVDERW VPHFEKMLYD 
Sbjct: 221 RR--EPGQDAL----KQAVLLTLEKMARGGVYDQLGGGFHRYSVDERWAVPHFEKMLYDN 274

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            QL ++Y +A  +     +  +  +  +Y+RR+M    G  ++ +DAD   TEG    +E
Sbjct: 275 AQLLHLYAEAQQVEPRPLWRKVVEETAEYVRREMTDARGGFYATQDAD---TEG----EE 327

Query: 315 GAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           G F+VW  ++V ++L  E A L   H+ +   GN +            G+ VL       
Sbjct: 328 GRFFVWLPEQVREVLPPELAELALRHFRVTALGNFE-----------HGRTVLESAVSVE 376

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
           + A +L  P+E+  + L E RR+LF+ R +R +P  DDK++  WNGL+I   A A ++  
Sbjct: 377 SLAEELQRPVEEVASGLSEARRRLFEARERRVKPGRDDKILAGWNGLMIRGLAFAGRVF- 435

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                          DR +++E A  AA F+   L+D Q  RL  S++ G ++ PGF++D
Sbjct: 436 ---------------DRADWVESARKAADFVLAELWDGQ--RLSRSYQEGQARIPGFVED 478

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           Y  L +GL  LY+     ++L  A  L  T + LF D E G Y         +++     
Sbjct: 479 YGDLAAGLTALYQATFEPRYLEAAEALVRTAETLFWDEERGAYLTAPRTQGDLVVATYAT 538

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D A PSG S      V LA++ +  +   Y +  E  ++    +L+   M    +  AA
Sbjct: 539 FDNAFPSGASTLTEAQVALAALTSNKQ---YLELPERYVSRMGEQLRKNPMGYGHLALAA 595

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
           D L V     V   G + +V  E +LA +   Y                   W+   +  
Sbjct: 596 DAL-VDGAPSVTFAGTREAV--EPLLAVSRTVYAPTFGFT------------WKAPEAPV 640

Query: 674 ASMARNNF-----SADKVVALVCQNFSCSPPVTDPISLENLL 710
               R  F        +  A +C+NF+C PP+T+  +L   L
Sbjct: 641 PPSMRETFLGREPVGGRAAAYLCRNFACEPPLTEAGALAKRL 682


>gi|224368664|ref|YP_002602826.1| hypothetical protein HRM2_15540 [Desulfobacterium autotrophicum
           HRM2]
 gi|223691380|gb|ACN14663.1| conserved hypothetical protein [Desulfobacterium autotrophicum
           HRM2]
          Length = 766

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 249/710 (35%), Positives = 388/710 (54%), Gaps = 57/710 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  R  FL     TCHWCHVME ESFE+E +A+ LN+ ++ +KVDREERPD
Sbjct: 88  GDEAFETARKLNRPVFLSVGYATCHWCHVMEEESFENEEIARYLNENYLCVKVDREERPD 147

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPE--DKYGRPGFKTILRKVKDA 116
           +D +YM+ VQAL G GGWP++V+L+ D KP  GGTYFPP   D+    GF T+L K+  +
Sbjct: 148 IDSIYMSAVQALTGRGGWPMNVWLTCDRKPFYGGTYFPPRDGDRGADIGFLTLLEKLIQS 207

Query: 117 WDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG 176
           +  +   +  +G      + + +S    +     E  QNA+        +SYDSRFGG  
Sbjct: 208 FHAQDGRVENAGRQITAAIQQMMSPKPGTRLPGKETIQNAVSF----YRQSYDSRFGGLS 263

Query: 177 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
            +PKFP  + ++++L H++   +  K  + +   +M+  +L  MA GG++DHVGGGFHRY
Sbjct: 264 GSPKFPSSLPVRLLLRHNRNTFE--KVKQDTNILEMIDHSLAQMAGGGMYDHVGGGFHRY 321

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
           S DE W VPHFEKMLYD   LA VYL+A+  T +  +  +  +IL Y+ +DM    G  +
Sbjct: 322 STDEHWLVPHFEKMLYDNALLAVVYLEAWQATDNADFKRVVNEILSYVIQDMTSADGAFY 381

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSD 355
           SA DADS    G    +EG ++ WT +E++ ILG E++ + K +Y +  T N        
Sbjct: 382 SATDADSITPRG--HMEEGWYFTWTPEELDAILGKENSKIIKRYYSVGVTPN-------- 431

Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
               F+ +++L      + +AS L +  EK   I+   R  L+  R+KRP P  D+KV+ 
Sbjct: 432 ----FEKRHILHTTKSRAETASALNITEEKLAKIIETSRELLYLERNKRPAPLRDEKVLT 487

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
           +WN L+IS+FARA   L +                  Y++ A  AA FI  +LY +  +R
Sbjct: 488 AWNALMISAFARAGFTLNNTV----------------YIDQAVRAARFIMENLYID--NR 529

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L  S+++G ++   +L+DYAF I+ L+DLYE     +WL  A+EL +     + DR+ G 
Sbjct: 530 LFRSYKDGKARHNAYLEDYAFFIAALIDLYEATHDIEWLKKALELDDVLKTFYEDRKNGA 589

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YRQNAEHSLAV 594
           +F T+ +  +++ R K  +D A PSGN+++++NL+RL S      +DY Y+Q AE +L  
Sbjct: 590 FFMTSSDHEALISREKPYYDNATPSGNAIAILNLLRLHSFT----TDYRYKQRAEKALKF 645

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           F  RL     A+  M  A D     + K ++++      D  + L     +  +   ++ 
Sbjct: 646 FSERLNTAPSALSEMLLAIDYY-FDNPKEIIVIAPTEKPDAGDCLLETFRNLFIPNRILM 704

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDP 703
           +  AD ++       ++    +A+   + + K  A VC+N +C  P +DP
Sbjct: 705 V--ADEKQA----ADHAKIIPLAQGKKAINGKATAYVCENGTCKLPTSDP 748


>gi|153003852|ref|YP_001378177.1| hypothetical protein Anae109_0984 [Anaeromyxobacter sp. Fw109-5]
 gi|152027425|gb|ABS25193.1| protein of unknown function DUF255 [Anaeromyxobacter sp. Fw109-5]
          Length = 725

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 261/716 (36%), Positives = 366/716 (51%), Gaps = 79/716 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +T R  FL    +TCHWCHVME ESFEDE +A++LN+ +V IKVDREERPD
Sbjct: 71  GEEAFAEARRTGRPVFLSVGYSTCHWCHVMEGESFEDEEIARVLNERYVPIKVDREERPD 130

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPED-KYGRP-GFKTILRKVKDA 116
           VD +YMT VQ L GGGGWP+SV+L+P+ +P  GGTYFP  D   G P GF +ILR++ D 
Sbjct: 131 VDGLYMTAVQLLTGGGGWPMSVWLTPEKEPFFGGTYFPARDGDRGAPRGFLSILRELADL 190

Query: 117 WDKKRDMLAQSGAFAIEQLSEALSASAS-SNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
           + +    +  + +  +  +  AL+     +  +P     + L         ++D+  GG 
Sbjct: 191 YARDAGRVQAATSSLVGAVRAALAPRGEPAASVPG---ADVLEAAFRGFRDAFDAAHGGL 247

Query: 176 GSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 234
             APKFP  + ++ +L YH +  E        +E  +M   TL+ MA GG+HD +GGGFH
Sbjct: 248 RGAPKFPSSLPVRFLLRYHRRARE--------AEALRMATVTLERMAAGGLHDQIGGGFH 299

Query: 235 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 294
           RYS D  W VPHFEKMLYD   LA  Y +A+ +T     + + R  LDYL R+M  P G 
Sbjct: 300 RYSTDATWLVPHFEKMLYDNALLAVAYAEAWQVTGRRELARVVRQTLDYLGREMTSPEGG 359

Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMS 354
           ++SA DADS   EG    +EG F+VW + E+   LG  A  F   +     GN       
Sbjct: 360 LYSATDADS---EG----EEGRFFVWDAAELRQRLGADAERFMRFHGATDAGN------- 405

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
                F+G+NVL            +  P E     L   R  L+  R +RPRP  D+K++
Sbjct: 406 -----FEGRNVL-----------HVPRPDEDEWEALAPQRALLYAAREERPRPLRDEKIL 449

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQT 473
             WNGL IS+ A   ++L  E                 Y++ A SAA F+  R + D   
Sbjct: 450 AGWNGLAISALAFGGRVLGEE----------------RYVKAAASAAEFVLGRMIVD--- 490

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
            RL+ ++ +G +  PGFLDD+AF+  GLLDLYE     +WL  A+EL    + LF D  G
Sbjct: 491 GRLRRAWLDGAAGVPGFLDDHAFVAQGLLDLYEATFDARWLEAAVELSERLEVLFGDPRG 550

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           G +F T  +   +L R K  HDGAEPSG SV+++N +RL++    +  D +R  AE +L 
Sbjct: 551 GAWFGTAADHERLLAREKPTHDGAEPSGASVALVNALRLSAF---TTDDRWRVRAEGALR 607

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            +   L +   A   M  A D  +  +R+ VVLV  +     E  LA    S+  N+ + 
Sbjct: 608 HYGRALAEHPSAFTEMLLAVDFATDVARE-VVLVWPEEGPSPEPFLAVLRRSFLPNRALA 666

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFS-ADKVVALVCQNFSCSPPVTDPISLEN 708
                         E     A +A    +   +V A VC+   CS P   P  L +
Sbjct: 667 GAAEGAA------IERLGRVALVAAEKVALGGRVTAYVCERGQCSLPAIAPEKLAS 716


>gi|429507366|ref|YP_007188550.1| hypothetical protein B938_19420 [Bacillus amyloliquefaciens subsp.
           plantarum AS43.3]
 gi|429488956|gb|AFZ92880.1| hypothetical protein B938_19420 [Bacillus amyloliquefaciens subsp.
           plantarum AS43.3]
          Length = 689

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 253/692 (36%), Positives = 364/692 (52%), Gaps = 78/692 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIAGILNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +  
Sbjct: 165 AAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+ KE+ ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + 
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +L   LE       E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+     
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F+ P       +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI   L+LYE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DG
Sbjct: 482 LIWAYLELYEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS + + L+RL  +          + AE   +VF+  ++    +      +    
Sbjct: 542 AVPSGNSATAVQLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 598

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           ++P +K +V+ G K   D +  + A            H  PA T       EH    A +
Sbjct: 599 TMP-QKEIVVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPEELAGI 645

Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTD 702
           +  +F+A       K    +C+NF+C  P TD
Sbjct: 646 S--DFAAGYQMIDGKTTVYICENFACRRPTTD 675


>gi|384267593|ref|YP_005423300.1| hypothetical protein BANAU_3964 [Bacillus amyloliquefaciens subsp.
           plantarum YAU B9601-Y2]
 gi|380500946|emb|CCG51984.1| putative protein yyaL [Bacillus amyloliquefaciens subsp. plantarum
           YAU B9601-Y2]
          Length = 689

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 249/686 (36%), Positives = 361/686 (52%), Gaps = 66/686 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   KY RPGF  +L  + + +   R          +E ++E  
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKYNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +  
Sbjct: 165 AAHLEVKIHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +
Sbjct: 278 LPAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+ KE+ ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + 
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           ++L   LE       E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+     
Sbjct: 387 NELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F+ P       +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI   L+LYE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DG
Sbjct: 482 LIWAYLELYEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS + + L+RL  +          + AE   +VF+  ++    +      +    
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 598

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           ++P +K +V+ G K   D +  + A    +    T++  +  D        E    +   
Sbjct: 599 TMP-QKEIVVFGSKDDPDRKRFIEALQEHFTPAYTILAAEHPD--------ELKGISDFA 649

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTD 702
           A       K    +C+NF+C  P TD
Sbjct: 650 AGYQMIDGKTTVYICENFACRRPTTD 675


>gi|91772578|ref|YP_565270.1| hypothetical protein Mbur_0543 [Methanococcoides burtonii DSM 6242]
 gi|91711593|gb|ABE51520.1| Protein of unknown function DUF255 [Methanococcoides burtonii DSM
           6242]
          Length = 703

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 233/684 (34%), Positives = 355/684 (51%), Gaps = 51/684 (7%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESF ++ VAK++ND FVSIKVDREERPD+D VYM   Q + G GGWPL++
Sbjct: 56  TCHWCHVMAKESFRNKDVAKMMNDTFVSIKVDREERPDIDSVYMDICQKMNGSGGWPLTI 115

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++P+  P +  TY P +  +GR G   I+  ++  W ++ + + +        LSE   
Sbjct: 116 IMTPEKVPFIAATYIPLKSGFGRKGMLEIIPWIEHLWKEEHNKIVEQTELIKTALSE--- 172

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              S N   +E+ +  +      L+ ++D+  GGFG++PKFP P  I  +L + K     
Sbjct: 173 --KSENSHNEEVTEEIIHRTYTYLANNFDNENGGFGTSPKFPSPHNISYLLRYWK----- 225

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            ++G  +  Q MV  TLQ M KGGI+DH+G GFHRYS D  W VPHFEKMLYDQ  L   
Sbjct: 226 -RTGNPTALQ-MVERTLQAMRKGGIYDHIGFGFHRYSTDSSWLVPHFEKMLYDQALLIIA 283

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A+  T    YS    +I++Y+ RDM  P G  + A DADS E        EG FY W
Sbjct: 284 YTEAYQATNKEEYSNTANEIIEYILRDMTSPDGGFYCAGDADSEEV-------EGRFYTW 336

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
              E+E IL  E   +F++ + ++P GN        P+    GKN+L    D  +   + 
Sbjct: 337 ELSEIESILNREDHPIFRDAFNVRPEGNFLEESTHRPN----GKNILHLEKDLESIEKQY 392

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +  ++  +I+  CR++LF  R KR  P  DDK++  WNGL++++ + + +++ +     
Sbjct: 393 NITRKEIDHIIERCRKQLFSTREKRIHPSKDDKILTDWNGLMLAALSISGRVMGN----- 447

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                      K Y+++A+  A  +      E    L H++ +      GFLDDYAF   
Sbjct: 448 -----------KRYIDIAKRNADLLISERMKENG-ELYHNYSSNKEPTIGFLDDYAFFTW 495

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GL++LYE      +L  A++L +   E F D   GG+F+T+ +  ++L R KE +DGA P
Sbjct: 496 GLIELYEATFEVTYLAKALQLTDYMIENFKDTINGGFFHTSNKSETLLFRKKEVYDGAIP 555

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSV + NL++L+ +    + +     A  +   F + +  M           D+   P
Sbjct: 556 SGNSVEINNLLKLSKLTGNPELN---SEAIDTSNAFASTIYAMPFGYTHFIAGLDLALAP 612

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           S + +V+ G   S D + ML   +  +   KTVI     + +E++    + S   +  + 
Sbjct: 613 SVE-IVIAGELDSEDTQLMLNNINEEFIPGKTVIVKSEKNEKELERIAPYTSTLKTQNQ- 670

Query: 680 NFSADKVVALVCQNFSCSPPVTDP 703
                K  A VCQ   C+ P TDP
Sbjct: 671 -----KATAYVCQGHECTLPTTDP 689


>gi|153939114|ref|YP_001390416.1| hypothetical protein CLI_1150 [Clostridium botulinum F str.
           Langeland]
 gi|384461487|ref|YP_005674082.1| hypothetical protein CBF_1122 [Clostridium botulinum F str. 230613]
 gi|152935010|gb|ABS40508.1| conserved hypothetical protein [Clostridium botulinum F str.
           Langeland]
 gi|295318504|gb|ADF98881.1| conserved hypothetical protein [Clostridium botulinum F str.
           230613]
          Length = 680

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 245/693 (35%), Positives = 354/693 (51%), Gaps = 72/693 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 53  TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD  P   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 113 LMTPDKNPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKVLESSNRILEQIER--- 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 170 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +   +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   IL+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKILNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +WT +E+ DILG E   L+ + Y +   GN            F+ KN+   +N       
Sbjct: 332 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVD 379

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                LEK        R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++  
Sbjct: 380 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 430

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF 
Sbjct: 431 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 475

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  L++LYE      +L  +IE+ ++  +LF  +E GG++  +     +L+R KE +DGA
Sbjct: 476 LWALIELYEASFDIYYLEKSIEVADSMIDLFWHKENGGFYLYSKNSEKLLVRPKEIYDGA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V+ + L  L  I      D Y+   +     F T +K   M   L    A M +
Sbjct: 536 TPSGNAVASLALNLLYYITG---EDRYKYLVDKQFKFFATNIKSGPM-YHLFSVMAYMYN 591

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +   K + L   +   DF   +   +  Y     V   D ++        E    N ++ 
Sbjct: 592 ILPVKEITLAYREKDEDFYKFINEVNNRYIPFSIVTLNDKSN--------EIEKINKNIK 643

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 DK    +CQN++C  P+TD    + LL
Sbjct: 644 DKIAIKDKTTVYICQNYACREPITDLEEFKFLL 676


>gi|296330011|ref|ZP_06872495.1| hypothetical protein BSU6633_02824 [Bacillus subtilis subsp.
           spizizenii ATCC 6633]
 gi|305676735|ref|YP_003868407.1| hypothetical protein BSUW23_20330 [Bacillus subtilis subsp.
           spizizenii str. W23]
 gi|296153050|gb|EFG93915.1| hypothetical protein BSU6633_02824 [Bacillus subtilis subsp.
           spizizenii ATCC 6633]
 gi|305414979|gb|ADM40098.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii
           str. W23]
          Length = 695

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 245/692 (35%), Positives = 362/692 (52%), Gaps = 77/692 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 59  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 118

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 119 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 178

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A +        L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 179 AAKSGEG-----LSKSAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYDHN 230

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 231 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 286

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 287 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 339

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           W+ +E+   LG+   +L+ + Y +   GN            F+GKN+  LI        A
Sbjct: 340 WSKEEILKTLGDDLGMLYCQVYDITEEGN------------FEGKNIPNLIHTMQEQIKA 387

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G+  E+    L   R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +   
Sbjct: 388 DA-GLTKEELSLKLENARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ--- 443

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                          +Y+ +AE A +FI   L  +   R+   +R+G  K  GF+DDYAF
Sbjct: 444 -------------EPKYLSLAEDAITFIENQLIIDG--RVMVRYRDGEVKNKGFIDDYAF 488

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DG
Sbjct: 489 LLWAYLDLYEASFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDG 548

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV+ + L+RL   V G  S    + AE   +VF+  ++           +  + 
Sbjct: 549 AVPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDIEAYPSGHAFFMQSV-LK 604

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
            V  +K +V+ G       + +  A   ++  N +++              EH      +
Sbjct: 605 HVMPKKEIVIFGSADDPARKQITTALQKAFKPNDSIL------------VAEHPDQCKDI 652

Query: 677 ARNNFSAD------KVVALVCQNFSCSPPVTD 702
           A   F+AD      K    +C+NF+C  P T+
Sbjct: 653 AP--FAADYRIIDGKTTVYICENFACQQPTTN 682


>gi|170761713|ref|YP_001786452.1| thymidylate kinase [Clostridium botulinum A3 str. Loch Maree]
 gi|169408702|gb|ACA57113.1| thymidylate kinase [Clostridium botulinum A3 str. Loch Maree]
          Length = 682

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 244/685 (35%), Positives = 351/685 (51%), Gaps = 72/685 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+ LN  F+SIKVDREERPDVD +YM + QA  G GGWPL++
Sbjct: 55  TCHWCHVMERESFEDEEVAEALNKNFISIKVDREERPDVDNIYMNFCQAYTGSGGWPLTI 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   KY  PG   +LR + + W + ++ + +S     EQ+     
Sbjct: 115 IMTPDKKPFFAGTYFPKWGKYNIPGIMDVLRSISNLWREDKNKILESSNRISEQIER--- 171

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 172 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 227

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +   ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 228 -------DKKILDVINKTLTNMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   IL+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 281 MAYTEAYEATKNPLFKDITEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKFY 333

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +WT +E+ DILG E   L+ + Y +   GN            F+ KN+   +N    +  
Sbjct: 334 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKTVD 381

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                LEK        R KLF+ R KR  PH DDK++ SWN L+I +F++A + LK++  
Sbjct: 382 NNKDKLEK-------IREKLFEYREKRIHPHKDDKILTSWNALMIVAFSKAGRSLKND-- 432

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF 
Sbjct: 433 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 477

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  L++LYE      +L  +IE+ ++  +LF  +E GG++  +     +L+R KE +DGA
Sbjct: 478 LWALIELYEASFDIYYLEKSIEVADSMIDLFWHKESGGFYLYSKNSEKLLVRPKEIYDGA 537

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V+ + L  L  I      D Y+   +     F + +K   M   L    A M +
Sbjct: 538 TPSGNAVASLALNLLYYITG---EDRYKDLVDKQFKFFASNIKSGPM-YHLFSVMAYMYN 593

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           V   K + L   +   DF   +   +  Y     V   D ++        E    N ++ 
Sbjct: 594 VLPVKEITLAYREKDEDFYKFINEVNNRYIPFSIVTLNDKSN--------EIEKINKNIK 645

Query: 678 RNNFSADKVVALVCQNFSCSPPVTD 702
                 DK    +CQN++C  P+TD
Sbjct: 646 DKIAIKDKATVYICQNYACREPITD 670


>gi|73667810|ref|YP_303825.1| hypothetical protein Mbar_A0261 [Methanosarcina barkeri str.
           Fusaro]
 gi|72394972|gb|AAZ69245.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
          Length = 711

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 245/705 (34%), Positives = 361/705 (51%), Gaps = 54/705 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVM  ESFEDE +A+L+N  FV IKVDREERPD
Sbjct: 47  GEEAFEKARKENKPIFLSIGYSTCHWCHVMAHESFEDEEIARLMNRAFVCIKVDREERPD 106

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYMT  Q + G GGWPL++ ++PD+KP   GTY P   ++ + G   ++ ++++ W+
Sbjct: 107 IDNVYMTVCQIILGRGGWPLNIIMTPDMKPFFAGTYIPKNSRFSQTGMLELVPRIEEIWN 166

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           ++   + +S       +   +S  A        + ++ +    E+L  S+D+ +GGFG A
Sbjct: 167 RQHTEVLESADKITSTIQNMISEPAGEG-----IGESIMEEAYEELLTSFDNEYGGFGRA 221

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP   +I  +L + +      +SG   E   MV +TL+ M +GGIHDH+G GFHRYS 
Sbjct: 222 PKFPTSHKIFFLLRYWR------RSGN-PEALHMVEYTLENMYRGGIHDHLGSGFHRYST 274

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W VPHFEKMLYDQ  +A  Y + + +T    Y      ILDY+ RD+    G  +  
Sbjct: 275 DNVWIVPHFEKMLYDQALIATAYTEIYQVTGKRLYKEAAEGILDYVLRDLTSQEGGFYCG 334

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDAD    EG    +EG +Y+WT +EV  +L  E + L  + + L  TGN +     +  
Sbjct: 335 EDAD---VEG----EEGKYYLWTLEEVRTVLSPEESELITKVFNLSETGNFE----EEIR 383

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
               G N+        + A++L +P +   + +   + KL   R KR RP  DDK++  W
Sbjct: 384 GRKTGTNIFYMPRSLESLAAELNIPADDVDSRVKTAKAKLLLARDKRKRPAKDDKILTDW 443

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+I++ A+               F   G ++  Y++ AE AA FI + LY+    RL 
Sbjct: 444 NGLMIAALAKG--------------FQAFGEEK--YLKAAEKAADFILKVLYNPD-RRLL 486

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           H +R+G +   G  DDYAFLI GLL+LYE G    +L  A+ L     E F D   GG F
Sbjct: 487 HRYRDGKTGISGTADDYAFLIHGLLELYEAGFKLDYLKAALCLNREFLEHFWDPIQGGLF 546

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            T  +  +++ R KE  D A PSGNS+ ++NL+RL+ I A S+ +   Q  E +   F  
Sbjct: 547 FTADDSEALIFRKKEFSDAAIPSGNSIEMLNLLRLSRITADSELEDRAQGLERA---FSK 603

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            ++ +         A D    P+ + VV+VG   S D   ML      +  NK +I    
Sbjct: 604 LIQKIPSGYTQFLSALDFGLGPAYQ-VVIVGEHESPDTGQMLEELWTYFIPNKVLIFRPE 662

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               E+    ++      +        K  A VCQN+ C  P T+
Sbjct: 663 GKDPEITKLAKYTEGQVPI------DGKATAYVCQNYQCQLPTTE 701


>gi|385266996|ref|ZP_10045083.1| hypothetical protein MY7_3797 [Bacillus sp. 5B6]
 gi|385151492|gb|EIF15429.1| hypothetical protein MY7_3797 [Bacillus sp. 5B6]
          Length = 689

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 253/692 (36%), Positives = 363/692 (52%), Gaps = 78/692 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQH--------VEDIAENA 164

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +  
Sbjct: 165 AAHLEVKVHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+ KE+ ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E   +  + 
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--GTGLTG 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +L   LE       E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+     
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F+ P       +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI   L+LYE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DG
Sbjct: 482 LIWAYLELYEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS + + L+RL  +          + AE   +VF+  ++    +      +    
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 598

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           ++P +K +V+ G K   D +  + A            H  PA T       EH    A +
Sbjct: 599 TMP-QKEIVVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPEELAGI 645

Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTD 702
           +  +F+A       K    +C+NF+C  P TD
Sbjct: 646 S--DFAAGYQMIDGKTTVYICENFACRRPTTD 675


>gi|442804077|ref|YP_007372226.1| N-acylglucosamine 2-epimerase family protein [Clostridium
           stercorarium subsp. stercorarium DSM 8532]
 gi|442739927|gb|AGC67616.1| N-acylglucosamine 2-epimerase family protein [Clostridium
           stercorarium subsp. stercorarium DSM 8532]
          Length = 679

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 248/687 (36%), Positives = 366/687 (53%), Gaps = 77/687 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA +LN  FV+IKVDREERPD+D +YMT+ QA+ G GGWPL+
Sbjct: 57  STCHWCHVMERESFEDEEVADILNKHFVAIKVDREERPDIDHIYMTFCQAITGHGGWPLT 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTYFP  D++G PG  TIL+    AW++ +  L + G    EQ+  ++
Sbjct: 117 IIMTPDKKPFFAGTYFPKNDRHGMPGLVTILKSAHRAWEENKKDLERLG----EQILNSV 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             S  ++   + L +  +    +QL  S+D  +GGFG+APKFP P  +  +L +      
Sbjct: 173 -YSEDNDYQHEVLSETIIDDIYKQLESSFDPVYGGFGNAPKFPAPHNLLFLLRYWY---- 227

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
              +GE  +  +MV  TL  M KGGI+DH+G GF RYS D +W +PHFEKMLYD   LA 
Sbjct: 228 --ATGE-KKALEMVEKTLDSMHKGGIYDHIGFGFCRYSTDRKWLIPHFEKMLYDNALLAM 284

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+  TK   Y+ I  +I  Y+ RDM  P G  +SAEDADS   EG     EG FY 
Sbjct: 285 AYSEAYQATKKDKYARIAAEIYKYIERDMTSPEGAFYSAEDADS---EGV----EGFFYT 337

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +EV  +LG E    F   + + P+GN            F+G+N+   +N   + +  
Sbjct: 338 WTYEEVMSVLGDEDGKRFCGIFDITPSGN------------FEGRNIPNLINADPSDSDF 385

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           + +           CR+KLF+ R KR RP  DDK++ SWN L+ +S A   +ILK     
Sbjct: 386 IEI-----------CRKKLFETREKRIRPFKDDKILTSWNALMAASLAVGGRILKD---- 430

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                          + +A+ A SFI+  L  E   RL   +R+G +  P FLDDYA+L 
Sbjct: 431 ------------MNLINMAKKAVSFIKAKLVREDG-RLLARYRDGSADIPAFLDDYAYLQ 477

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              ++LY+      +L+ A+ +    + LFLD E GG+F    +   ++ R K+ +DGA 
Sbjct: 478 WAYIELYQSTHEPGYLIDAVSINEEINGLFLDDEKGGFFFYGNDAERLITRPKDAYDGAM 537

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV  +NL++L+ I        Y  + E+ +  F   +    +    M  +      
Sbjct: 538 PSGNSVMAMNLLKLSQITGDLS---YSDSFENQIDAFSGEISQNPLGYVYMLTSFLGYIQ 594

Query: 619 PSRKHVVLVGHKSS---VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           P ++ V LV  +S    + F N++   +  +    TV+ +  +  + ++    H  +  +
Sbjct: 595 PDQR-VFLVSDESESRLMPFINVINENYRPF----TVLILYGSRYKRLEDVIPHIKDYTA 649

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
                  A K  A VC+NF+C+ PV+D
Sbjct: 650 ------PAGKTAAYVCENFTCNEPVSD 670


>gi|295695073|ref|YP_003588311.1| hypothetical protein [Kyrpidia tusciae DSM 2912]
 gi|295410675|gb|ADG05167.1| protein of unknown function DUF255 [Kyrpidia tusciae DSM 2912]
          Length = 716

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 242/608 (39%), Positives = 328/608 (53%), Gaps = 52/608 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  VA+LLN  FV+IKVDREERPDVD +YM   QAL G GGWPL+
Sbjct: 53  STCHWCHVMERESFEDPEVAELLNRHFVAIKVDREERPDVDHLYMAACQALTGQGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P   GTYFP   +YGRPG   +L +V   W+K  D +  +G     Q+ EAL
Sbjct: 113 VFLTPEKEPFYAGTYFPKRSRYGRPGLMELLTRVAQLWEKGADRVKDAGRHLTGQIGEAL 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             +A       E+    L    EQL  SYD  FGGFG APKFPRP ++  +L +  +   
Sbjct: 173 GRAAQG-----EVDAGTLTRAFEQLLASYDHTFGGFGHAPKFPRPHDLLFLLRYGVR--- 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +G+     E   MV  TL+ M +GGI DHVG GF RYS D RW +PHFEKMLYD   L  
Sbjct: 225 SGR----REAFDMVQGTLEGMRRGGIWDHVGFGFARYSTDRRWLIPHFEKMLYDNALLVL 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+A+    D  ++   R+I+ Y+RR+M  PGG  +SAEDADS   EG    +EG FYV
Sbjct: 281 TYLEAYQALGDQRWAQTAREIVTYVRREMTDPGGGFYSAEDADS---EG----EEGKFYV 333

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASAS 377
           WT +E+ + +G E   +   ++ +   GN +            G++VL E++ D    A 
Sbjct: 334 WTPQEITEAVGPEDGEVLCRYFGVTEEGNFE-----------GGRSVLNEIDTDVDLLAR 382

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +LGM  E+    +      L  VR +R  PH DDK++ +WNGL+I++ AR +++L     
Sbjct: 383 ELGMTPEEIDRKVRRGLEILHSVRDRRVHPHKDDKILTAWNGLMIAALARGARVLGD--- 439

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         +Y+  A  AA ++ R L  +   RL   +R+G +   G+LDDYAF 
Sbjct: 440 -------------ADYLVSARRAAEWLWRTL-RQGDGRLLARYRDGEAGILGYLDDYAFY 485

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I GLL+LY+      WL  AI L      LF D + GG F T  +  ++  R K   DGA
Sbjct: 486 IWGLLELYQADGDVAWLRRAIRLAQDVRTLFWDEKEGGCFLTGSDAEALWSRPKTAEDGA 545

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV  ++L+ L  +        + + AE  L  F   +            A D   
Sbjct: 546 LPSGNSVLALDLLWLGRLTGDPA---WERWAEAQLRAFAGAVSRYPAGYTFFLTAWDFAL 602

Query: 618 VPSRKHVV 625
            PS + VV
Sbjct: 603 GPSEEIVV 610


>gi|421076735|ref|ZP_15537717.1| hypothetical protein JBW_0882 [Pelosinus fermentans JBW45]
 gi|392525347|gb|EIW48491.1| hypothetical protein JBW_0882 [Pelosinus fermentans JBW45]
          Length = 628

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 250/686 (36%), Positives = 358/686 (52%), Gaps = 65/686 (9%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME E FED+ VA LLN  F++IKVDREERPDVD +YM+  QAL G GGWPL++ ++PD K
Sbjct: 1   MERECFEDQEVADLLNQHFIAIKVDREERPDVDGIYMSVCQALTGQGGWPLTIIMAPDKK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   K GR G   +L  +   W+K R  + ++G   +  L      S     
Sbjct: 61  PFFAGTYFPKHRKMGRMGLLELLTTLHQHWEKNRSEILKAGNEIVNILQRPKPPSGEGQI 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
             D L Q  L     +L  SYD ++GGFGSAPKFP P +I  +L + +  ++        
Sbjct: 121 GEDLLKQAYL-----ELENSYDPQYGGFGSAPKFPTPHKITFLLRYWQHFKE-------P 168

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           +   MV  TL  M +GGI+DH+G GF RYS D++W VPHFEKMLYD   L   YL+A+  
Sbjct: 169 KALAMVEKTLMSMWQGGIYDHLGYGFARYSTDQKWLVPHFEKMLYDNALLCTSYLEAYQC 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T +  ++ I  DIL Y+ RDM+   G  +SAEDADS   EG     EG FYV+T K+V +
Sbjct: 229 TGNQEFARIAEDILTYVMRDMMDKNGGFYSAEDADS---EGV----EGKFYVFTRKQVVE 281

Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
           ILG E   LF + Y++   GN +    S  H    G+N+          A  +   +E  
Sbjct: 282 ILGEEEGALFADFYHISSHGNFEHG-TSILH--LIGRNL-------EEYARVVNKTVENL 331

Query: 387 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 446
             +L + R KL+ VR  R  P+ DDK++ +WNGL+I++FA+A+++LK             
Sbjct: 332 SEVLKKGREKLYQVREARIHPYKDDKILTAWNGLMIAAFAKAARVLK------------- 378

Query: 447 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 506
              + +Y +VAE   +FI   L      RL   +R G +    +LDDYAFL+  L+++YE
Sbjct: 379 ---QSKYAKVAEQGIAFIYEKLMGSNG-RLLARYREGEAAHLAYLDDYAFLLMALIEVYE 434

Query: 507 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 566
                 +L  A  L     ELF DR  GG++    +   ++ R KE +DGA PSGNSV+ 
Sbjct: 435 TTCNDYYLQQAAILAKDMGELFGDRTEGGFYFYGNDGEELIARPKEIYDGAIPSGNSVAA 494

Query: 567 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
             L +LA +   ++   +   AE  L  F   +   A        A D     + K +V+
Sbjct: 495 FALQKLADM---TEDRSFSDTAERLLGHFAGEVSRYAAGYTYFMMAVDYYLADNTK-IVI 550

Query: 627 VGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKV 686
           VG K + D ++M             VI+     +  M F++ H+  N      +    K 
Sbjct: 551 VGDKEAADTKSMF-----------DVINNCFLPSAAMRFYDRHSRENVEYKEID---HKA 596

Query: 687 VALVCQNFSCSPPVTDPISLENLLLE 712
            A +C+NF+C PP+T+   L NLL++
Sbjct: 597 TAYICKNFACQPPITNVEKLRNLLMK 622


>gi|253699928|ref|YP_003021117.1| hypothetical protein GM21_1299 [Geobacter sp. M21]
 gi|251774778|gb|ACT17359.1| protein of unknown function DUF255 [Geobacter sp. M21]
          Length = 750

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 250/698 (35%), Positives = 365/698 (52%), Gaps = 56/698 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE +A+ LN  F++IKVDREERPDVD VYMT V A+   GGWPL++
Sbjct: 98  TCHWCHVMEEESFEDEEIARFLNANFIAIKVDREERPDVDTVYMTAVHAMGMQGGWPLNI 157

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F +P+ KP  GGTYFPP D  G  GF ++LR++++ + +  D +  +G     QL+EA+ 
Sbjct: 158 FATPERKPFYGGTYFPPSDYAGGIGFLSLLRRIRETYQQAPDRVTHAGL----QLTEAIR 213

Query: 141 ASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
              +   +  E P+  + L    E   + +D++ GG   APKF         L     L 
Sbjct: 214 GILAP--MGGEPPEKEISLERVIEAYQERFDAKNGGVVGAPKF------PSSLPLGLLLR 265

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D  + GE +    M  +TL+ MA GGI+D  GGGFHRY+ D  W +PHFEKMLYD  +LA
Sbjct: 266 DYLRRGEKN-SLFMAQYTLRRMAAGGIYDQAGGGFHRYATDSTWLIPHFEKMLYDNARLA 324

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL+ +  T D  ++ + R+IL YL+RDM+ P G  +SA DADS    G   ++EG F+
Sbjct: 325 AAYLEGYQATGDRHFAQVAREILRYLQRDMMSPEGAFYSATDADSLTESG--HREEGIFF 382

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
            WT +E++  LG E A +    Y +   GN            F+G+++L         A 
Sbjct: 383 TWTPEELDAALGAERARVVAACYGVTDEGN------------FEGRSILHREKSMQHLAE 430

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +L +P E+   +L E R +L+  R +RP P  D+K++ SWNGL IS+FAR   +L + A 
Sbjct: 431 ELMLPKEELERLLDEAREELYLARQRRPLPLRDEKILASWNGLAISAFARGGLVLNAPA- 489

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                           ++ A  AA+F+  ++  ++  RL HS++ G +K  GFLDDYAF 
Sbjct: 490 ---------------LLDTARGAANFMLENMMSQE--RLCHSYQEGEAKGEGFLDDYAFF 532

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I+GL+DL+E      WL  A+E      E F D E GG+F T      ++ R K  +DG 
Sbjct: 533 IAGLIDLFEATGELPWLKRALEQARQVQEQFEDSETGGFFMTGPHHEELISREKPAYDGV 592

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV ++NL+RL ++            A+ +L  F T+L     A+  M  A D L 
Sbjct: 593 IPSGNSVMIMNLLRLNALTGEQGMP---DQAQRALDAFSTQLASAPTALSEMLLALDYLQ 649

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
              R+ V++           +L      +  N+ ++        E D  E+       + 
Sbjct: 650 DVPREIVIVAPQGKREAAGPLLEKLRGVFLPNRALVVFC-----EGDELEQAGELLPLVR 704

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
                  + +A +C++ SC  P +DP      L E  S
Sbjct: 705 EKKADGGRAMAYLCESRSCRRPTSDPEEFHRQLQETRS 742


>gi|310641971|ref|YP_003946729.1| cellulase catalitic domain protein and a thioredoxin domain protein
           [Paenibacillus polymyxa SC2]
 gi|386040955|ref|YP_005959909.1| hypothetical protein PPM_2265 [Paenibacillus polymyxa M1]
 gi|309246921|gb|ADO56488.1| cellulase catalitic domain protein and a thioredoxin domain protein
           [Paenibacillus polymyxa SC2]
 gi|343096993|emb|CCC85202.1| hypothetical protein PPM_2265 [Paenibacillus polymyxa M1]
          Length = 691

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 250/695 (35%), Positives = 357/695 (51%), Gaps = 64/695 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED+ VA++LN  +VSIKVDREERPDVD +YM+  + + G GGWPL+
Sbjct: 53  STCHWCHVMERESFEDQEVAEVLNQDYVSIKVDREERPDVDHIYMSICETMTGHGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEA 138
           + ++PD KP   GTY P E K+GR G   +L KV   W ++ D L + S     E   + 
Sbjct: 113 IMMTPDQKPFFAGTYLPKEQKFGRVGLLELLGKVGIRWKEQPDELMELSEQVLTEHERQD 172

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L A         EL    L     + S ++D  +GGFG APKFP P  +  +L +++   
Sbjct: 173 LLAGYRG-----ELDDQCLNKAFHEYSHTFDHEYGGFGEAPKFPSPHNLSFLLRYAQH-- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG      +  +MV  TL  M++GGI+DHVG GF RYSVDE+W VPHFEKMLYD   LA
Sbjct: 226 -TGN----QQALEMVEKTLDAMSRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNALLA 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+ +T    Y  I   I  Y+ RDM   GG  +SAEDADS   EG    +EG FY
Sbjct: 281 ITYTEAWQVTGKRLYRQITEQIFTYIARDMTDAGGAFYSAEDADS---EG----EEGRFY 333

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
           VW+  E++ +LG E A  F + Y + P GN            F+G N+  LI++N   A 
Sbjct: 334 VWSDSEIKAVLGDEDASFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAY 380

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
            +K  +   +    + E + KLF  R +R  P  DDK++ SWNGL+I++ A+A +     
Sbjct: 381 GNKHDLTEPELEQRVSELKDKLFTAREQRVHPQKDDKILTSWNGLMIAALAKAGQ----- 435

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                      G  R  Y E A  A +F+  HL  E   RL   +R+G +   G++DDYA
Sbjct: 436 ---------AFGDTR--YTEQARKAETFLWNHLRREDG-RLLARYRDGQAAYLGYVDDYA 483

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F + GL++LY+     ++L  A+ L     +LF D E  G F T  +   ++ R KE +D
Sbjct: 484 FYVWGLIELYQATFDVQYLQRALTLNQNMIDLFWDEERDGLFFTGSDSEQLISRPKEIYD 543

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNS++  N VRLA +   ++ + Y   A      F   +         +  A  +
Sbjct: 544 GAIPSGNSIAAHNFVRLARLTGETRLEDY---AAKQFKAFGGMVAHYPSGHSALLSAL-L 599

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
            +      +V+VG ++       +    A +  N  VI  D    E  +           
Sbjct: 600 YATGKTSEIVIVGQRNDPQTAQFVQEVQAGFRPNMVVIFKDKGQPEIAEI-------APY 652

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +   +    K    VC++F+C  PVT    L+++L
Sbjct: 653 IHDYDLVDGKPAVYVCEHFACQAPVTHIDDLKHML 687


>gi|374324300|ref|YP_005077429.1| hypothetical protein HPL003_22410 [Paenibacillus terrae HPL-003]
 gi|357203309|gb|AET61206.1| hypothetical protein HPL003_22410 [Paenibacillus terrae HPL-003]
          Length = 631

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 253/692 (36%), Positives = 361/692 (52%), Gaps = 74/692 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFEDE VA+LLN  +VSIKVDREERPDVD +YM+  Q + G GGWPL++ ++PD K
Sbjct: 1   MERESFEDEEVAELLNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLTILMTPDHK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTY P E K+GR G   +L KV   W ++ D L       +E   + L+     +K
Sbjct: 61  PFFAGTYLPKEQKFGRVGLMELLPKVAARWKEQPDEL-------VELSEQVLTEHERHDK 113

Query: 148 LPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
           L     EL +++L     Q S ++D  +GGFG APKFP P  +  +L +++    TG   
Sbjct: 114 LASYQGELDEHSLNKAFHQFSYAFDKDYGGFGEAPKFPSPHNLSFLLRYAQH---TGN-- 168

Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
              +  +M   TL  M +GGI+DHVG GF RY+VDE+W VPHFEKMLYD   LA  Y +A
Sbjct: 169 --QQALEMAEKTLDAMYRGGIYDHVGMGFSRYAVDEKWLVPHFEKMLYDNALLAIAYTEA 226

Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
           + +T    Y  I   I  Y+ RDM   GG  +SAEDADS   EG    +EG FYVW   E
Sbjct: 227 WQVTGKELYRRIAEQIFTYIARDMTDAGGAFYSAEDADS---EG----EEGKFYVWDESE 279

Query: 325 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGM 381
           V  ILG+  A  F + Y + P GN            F+G N+  LI++N   A   K  +
Sbjct: 280 VRAILGDKDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGIKHDL 326

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
             ++      E R KLF  R +R  PH DDK++ SWNGL+I++ A+A +           
Sbjct: 327 TEQELEQRASELRAKLFTTREQRTHPHKDDKILTSWNGLMIAALAKAGQAFGE------- 379

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
                     +Y E A+ A SF+  HL  +   RL   FR+G +  PG++DDYAF + GL
Sbjct: 380 ---------AQYTEQAQRAESFLWNHLRRDDG-RLLARFRDGDAAYPGYVDDYAFYVWGL 429

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           ++LY+     ++L  A+ L     +LF D E GG F    +   ++ + KE +DGA PSG
Sbjct: 430 IELYQATFDVQYLQRALTLNQDMIDLFWDEERGGLFFYGPDGEQLIAKPKEVYDGAIPSG 489

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
           NS++  NLVRLA ++  S+ + Y   +     VF   +         +  +  + +  + 
Sbjct: 490 NSIAAHNLVRLARLMGESRLEDY---SAKQFKVFGGLVVQYPTGYSALLSSL-LYATGTT 545

Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID---PADTEEMDFWEEHNSNNASMAR 678
           K +V+VGH+ +      + A  A +  N  VI  D   PA  + + +  ++   +     
Sbjct: 546 KEIVIVGHRDAPQTVQFIRAVQAGFRPNTVVILKDEGQPAIADIVPYIRDYTLVDG---- 601

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 K    VC++F+C  PVT    L+ LL
Sbjct: 602 ------KPAVYVCEHFACQAPVTRLDDLKALL 627


>gi|154688185|ref|YP_001423346.1| hypothetical protein RBAM_037900 [Bacillus amyloliquefaciens FZB42]
 gi|154354036|gb|ABS76115.1| YyaL [Bacillus amyloliquefaciens FZB42]
          Length = 689

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 252/692 (36%), Positives = 363/692 (52%), Gaps = 78/692 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +  
Sbjct: 165 AAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGYGFARYSTDNEWLVPHFEKMLYDNALL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ +T +  Y  I   I+ +++R+M    G  FSA DAD   TEG    +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVMFIQREMTHEDGSFFSALDAD---TEG----REGKY 330

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+ KE+ ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + 
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +L   LE       E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+     
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMITGLAKAAKV----- 434

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F+ P       +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI   L+LYE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DG
Sbjct: 482 LIWAYLELYEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS + + L+RL  +          + AE   +VF+  ++    +      +    
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 598

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           ++P +K +V+ G K   D +  + A            H  PA T       EH    A +
Sbjct: 599 TMP-QKEIVVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPEELAGI 645

Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTD 702
           +  +F+A       +    +C+NF+C  P TD
Sbjct: 646 S--DFAAGYQMIDGRTTVYICENFACRRPTTD 675


>gi|386760793|ref|YP_006234010.1| hypothetical protein MY9_4222 [Bacillus sp. JS]
 gi|384934076|gb|AFI30754.1| hypothetical protein MY9_4222 [Bacillus sp. JS]
          Length = 689

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 243/687 (35%), Positives = 362/687 (52%), Gaps = 67/687 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVENIAENAAKHLQTKT 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A     K  + L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 173 AA-----KTGEGLSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYYHN 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFVQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           W+ +E+   LG E   L+ + Y +   GN            F+GKN+  LI        A
Sbjct: 334 WSREEILKTLGDELGTLYCQVYDITEEGN------------FEGKNIPNLIHSKREQIKA 381

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G+  E+    L + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+     
Sbjct: 382 DA-GLTEEELRLKLEDARQRLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVY---- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       +  +Y+ +A+ A +FI  HL  +   R+   +R+G  K  GF+DDYAF
Sbjct: 437 ------------EEPKYLSLAQDAITFIENHLIIDG--RVMVRYRDGEVKNKGFIDDYAF 482

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+   LDLYE      +L  A +L +    LF D E GG++ +  +  ++++R KE +DG
Sbjct: 483 LLWAYLDLYEASFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFSGHDAEALIVREKEVYDG 542

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV+ + L+RL   V G  S    + AE   +VF+  +            +    
Sbjct: 543 AVPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDIDAYPSGHAFFMQSVLRH 599

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
            +P +K +V+ G       + ++     ++  N +++  +           E   + A  
Sbjct: 600 LMP-KKEIVIFGSADDPARKQIITELQKAFKPNDSILVAEQP---------EQCKDIAPF 649

Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTD 702
           A +    D K    +C+NF+C  P T+
Sbjct: 650 AADYRIIDGKTTVYICENFACQQPTTN 676


>gi|418030673|ref|ZP_12669158.1| hypothetical protein BSSC8_01020 [Bacillus subtilis subsp. subtilis
           str. SC-8]
 gi|351471732|gb|EHA31845.1| hypothetical protein BSSC8_01020 [Bacillus subtilis subsp. subtilis
           str. SC-8]
          Length = 664

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 239/691 (34%), Positives = 361/691 (52%), Gaps = 75/691 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 28  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 87

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 88  VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 147

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A +        L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 148 AAKSGEG-----LSESAISRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 199

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 200 TGQDNALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 255

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 256 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 308

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+ +E+   LG+    L+ + Y +   GN            F+GKN+   ++       +
Sbjct: 309 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 356

Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                EK L++ L + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +    
Sbjct: 357 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 412

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL
Sbjct: 413 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 458

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA
Sbjct: 459 LWAYLDLYEASFDLSFLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 518

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ + L+RL  +   S      + AE   +VF+  +            +     
Sbjct: 519 VPSGNSVAAVQLLRLGQVTGDSS---LIEKAETMFSVFKQHIDAYPSGHAFFMQSVLRHL 575

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +P +K +V+ G       + ++     ++  N +++              EH      +A
Sbjct: 576 MP-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA 622

Query: 678 RNNFSAD------KVVALVCQNFSCSPPVTD 702
              F+AD      K    +C+NF+C  P T+
Sbjct: 623 --PFAADYRIIDGKTTVYICENFACQQPTTN 651


>gi|163782790|ref|ZP_02177786.1| hypothetical protein HG1285_15681 [Hydrogenivirga sp. 128-5-R1-1]
 gi|159881911|gb|EDP75419.1| hypothetical protein HG1285_15681 [Hydrogenivirga sp. 128-5-R1-1]
          Length = 697

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 267/720 (37%), Positives = 380/720 (52%), Gaps = 59/720 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFEDE +A++LN+ +V IKVDREERPD
Sbjct: 31  GEEAFEKAEREDKPVFLSIGYSTCHWCHVMERESFEDEEIARILNENYVPIKVDREERPD 90

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD VYM+  Q + G GGWPL+V ++PD KP   GTYFP E  YGRPG + IL ++ + W 
Sbjct: 91  VDSVYMSVCQMMTGSGGWPLTVIMTPDKKPFFAGTYFPKEGMYGRPGLRDILLRIAELWR 150

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             R    Q    A EQ+ +AL+     + + + L ++ L     +L  +YD  +GGFG+A
Sbjct: 151 NDR----QKVLTAAEQVVDALAKGEEESYIGERLDESILHKGFAELYHTYDEAYGGFGNA 206

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L + ++   TG +G+A E   MV  TL+ M  GGI DHVG GFHRYS 
Sbjct: 207 PKFPIPHNLMFLLRYYRR---TG-NGKALE---MVKHTLKKMRLGGIWDHVGFGFHRYST 259

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W +PHFEKMLYD   L  VY +AF  T D F++ +  +I +YL+RDM+ P G  +SA
Sbjct: 260 DREWLLPHFEKMLYDNALLMLVYTEAFQATGDEFFAQVVEEIAEYLQRDMLSPEGAFYSA 319

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPH 357
           EDADS   EG    +EG FY WT  E+E++L E  +      + +   GN     + +  
Sbjct: 320 EDADS---EG----EEGKFYTWTLAELEELLTEEELGIALRLFGIAEEGNF----LEEAT 368

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
               GKNVL    +    A +LG   +     L E R KLF  R KR RP  D+KV+  W
Sbjct: 369 RRKVGKNVLHMKKELEKYAEELGYEPDVLKQKLEEIRSKLFKRREKRVRPLRDEKVLTDW 428

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL I++F++A                 V   RK+++ VA+  A F+   + D++  +L 
Sbjct: 429 NGLAIAAFSKAG----------------VALGRKDFLAVAKRTADFLLNTMVDDEG-KLL 471

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           H ++ G +  P FL+DYA+LI GL++LY+     ++L  A EL +   E F D E  G++
Sbjct: 472 HRYKEGEAGIPAFLEDYAYLIWGLMELYQGSFEGEYLKRAKELTDFALEHFWDEENLGFY 531

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            T      VL+R KE +DGA PSGNSV   NLVRL  ++   +   Y + A+ +L  F  
Sbjct: 532 QTPDFGERVLVRKKEIYDGATPSGNSVMAYNLVRLGRLLGLQE---YERRADQTLNAFSQ 588

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            +     A      A D+L V     +V VG +          A  +  +L +  +    
Sbjct: 589 VIASFPGAHTFSLLALDIL-VKGSFELVAVGDREE--------AIQSLLELERDFLPEGL 639

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
              ++    E   S +           +    +C+NFSC  P TD   + N L+ + S T
Sbjct: 640 FAVKD----ETLQSLSGFFDSLREMDGRTTYYLCRNFSCESPATDIEDIRNRLVPQESGT 695


>gi|452913203|ref|ZP_21961831.1| hypothetical protein BS732_1003 [Bacillus subtilis MB73/2]
 gi|452118231|gb|EME08625.1| hypothetical protein BS732_1003 [Bacillus subtilis MB73/2]
          Length = 664

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 240/686 (34%), Positives = 365/686 (53%), Gaps = 65/686 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 28  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 87

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 88  VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQ--- 144

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             + ++ K  + L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 145 --TKTAAKTGEGLSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYDHN 199

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 200 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 255

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 256 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 308

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+ +E+   LG+    L+ + Y +   GN            F+GKN+   ++       +
Sbjct: 309 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 356

Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                EK L++ L + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +    
Sbjct: 357 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 412

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL
Sbjct: 413 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 458

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA
Sbjct: 459 LWAYLDLYEASFDLSYLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 518

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ + L+RL   V G  S    + AE   +VF+  ++           +     
Sbjct: 519 VPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDIEAYPSGHAFFMQSVLRHL 575

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +P +K +V+ G       + ++A    ++  N +++  +           E   + A  A
Sbjct: 576 MP-KKEIVIFGSADDPARKQIIAELQKAFKPNDSILVAEQP---------EQCKDIAPFA 625

Query: 678 RNNFSAD-KVVALVCQNFSCSPPVTD 702
            +    D K    +C+NF+C  P T+
Sbjct: 626 ADYRIIDGKTTVYICENFACQQPTTN 651


>gi|321313642|ref|YP_004205929.1| hypothetical protein BSn5_11430 [Bacillus subtilis BSn5]
 gi|320019916|gb|ADV94902.1| hypothetical protein BSn5_11430 [Bacillus subtilis BSn5]
          Length = 689

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 239/691 (34%), Positives = 361/691 (52%), Gaps = 75/691 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A +        L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 173 AAKSGEG-----LSESAISRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 225 TGQDNALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+ +E+   LG+    L+ + Y +   GN            F+GKN+   ++       +
Sbjct: 334 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 381

Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                EK L++ L + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +    
Sbjct: 382 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 437

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL
Sbjct: 438 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 483

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA
Sbjct: 484 LWAYLDLYEASFDLSFLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 543

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ + L+RL  +   S      + AE   +VF+  +            +     
Sbjct: 544 VPSGNSVAAVQLLRLGQVTGDSS---LIEKAETMFSVFKQHIDAYPSGHAFFMQSVLRHL 600

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +P +K +V+ G       + ++     ++  N +++              EH      +A
Sbjct: 601 MP-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA 647

Query: 678 RNNFSAD------KVVALVCQNFSCSPPVTD 702
              F+AD      K    +C+NF+C  P T+
Sbjct: 648 --PFAADYRIIDGKTTVYICENFACQQPTTN 676


>gi|350268373|ref|YP_004879680.1| hypothetical protein GYO_4496 [Bacillus subtilis subsp. spizizenii
           TU-B-10]
 gi|349601260|gb|AEP89048.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii
           TU-B-10]
          Length = 689

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 247/692 (35%), Positives = 362/692 (52%), Gaps = 77/692 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A +        L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 173 AAKSGEG-----LSESAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T    E       V  TL  MA GGI+DH+G GF RYS DE W VPHFEKMLYD   L  
Sbjct: 225 T----EQENALYNVTKTLDSMANGGIYDHIGYGFARYSTDEEWLVPHFEKMLYDNALLLT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           W+ +E+   LG+    L+ + Y +   GN            F+GKN+  LI        A
Sbjct: 334 WSKEEILRTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKRKQIKA 381

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G+  E+    L   R+ L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +   
Sbjct: 382 DA-GLTEEELSLKLEGARQLLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ--- 437

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                          +Y+ +A+ A +FI  HL  +   R+   +R+G  K  GF+DDYAF
Sbjct: 438 -------------EPKYLSLAKDAITFIENHLIIDG--RVMVRYRDGEVKNKGFIDDYAF 482

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DG
Sbjct: 483 LLWAYLDLYEASFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDG 542

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV+ + L+RL   V G  S    + AE   +VF+  + D   +       + + 
Sbjct: 543 AVPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDI-DAYPSGHAFFMQSVLK 598

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
            V  +K +V+ G       + ++ A   ++  N +++              EH      +
Sbjct: 599 HVMPKKEIVIFGSADDPARKQIITALQKAFKPNDSIL------------VAEHPDQCKDI 646

Query: 677 ARNNFSAD------KVVALVCQNFSCSPPVTD 702
           A   F+AD      K    +C+NF+C  P T+
Sbjct: 647 AP--FAADYRIIDGKTTVYICENFACQQPTTN 676


>gi|307107988|gb|EFN56229.1| hypothetical protein CHLNCDRAFT_145019 [Chlorella variabilis]
          Length = 648

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 221/533 (41%), Positives = 305/533 (57%), Gaps = 37/533 (6%)

Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 271
           M  F+L+ MA GG+ DHVGGGFHRYSVDE WHVPHFEKMLYD  QLA  YL AF +T+D 
Sbjct: 114 MATFSLRQMAAGGMWDHVGGGFHRYSVDEYWHVPHFEKMLYDNPQLAATYLAAFQITRDA 173

Query: 272 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 330
            Y+ + R I DYL R M  PGG +F+AEDADS +      KKEG FYVW+ +E++ +LG 
Sbjct: 174 QYAGVARGIFDYLLRGMTHPGGGLFAAEDADSLDPASGD-KKEGWFYVWSWEELQQLLGP 232

Query: 331 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 390
           E A  F  HYY K  GNCDLS  SDPH EF G N LI+    + +A+            L
Sbjct: 233 EDAPAFCAHYYAKQGGNCDLSPRSDPHGEFVGLNCLIQRQSLAQTAAAAARGEADTAAAL 292

Query: 391 GECRRKLFDVRSKRPRPHLDDK-----------------------VIVSWNGLVISSFAR 427
             CR KLF  R +RPRPH DDK                       ++ +WNG+ IS++A 
Sbjct: 293 AACREKLFRARERRPRPHRDDKARARGRGGAWPRILSNPWQHRLLIVAAWNGMAISAYAL 352

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
           AS+IL  E   A   FPV G    +Y++ A  AA+F+R+HL+D +T RL+  F  GPS  
Sbjct: 353 ASRILPHEQPPAARCFPVEGRPPGDYLQAALQAAAFVRQHLWDGETGRLRRCFTTGPSAV 412

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
            GF DDYA++++GLLDL+          WA++LQ T DE+  D  GG YF+    D S+L
Sbjct: 413 EGFADDYAWMVAGLLDLHSTTGD-----WALQLQGTMDEVLWDEAGGAYFSGVAGDASIL 467

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
           LR+KED+DGAEP+ +S+++ NL RLA +    +S  +R+ A    A F  RL +  +A+P
Sbjct: 468 LRMKEDYDGAEPAASSIALANLWRLAGLCGTEESARWRERAAKCAAAFAERLGEAPVALP 527

Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
            M  +  +L++   + V++ G + + D + +L AA  S+  +  VI +DP  ++ MDFW 
Sbjct: 528 QMAASLHLLTLGHPRQVIIAGAQGAPDTQALLDAAFYSFTPDMVVIQLDPGSSQVMDFWR 587

Query: 668 EHNSNNASMAR--NNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
           + N    ++       + D   A + Q      P  DP  ++ +L E   S A
Sbjct: 588 QRNPEAVAVVEVMGMQAGDPATAFIYQA-----PTRDPEKVKQVLAEPRISAA 635



 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 36/63 (57%), Positives = 42/63 (66%), Gaps = 3/63 (4%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVME ESFE E  A L+N  FV++KVDREERPD
Sbjct: 42  GEEAFERARKEDKPIFLSVGYSTCHWCHVMERESFESEETAALMNQLFVNVKVDREERPD 101

Query: 59  VDK 61
           VDK
Sbjct: 102 VDK 104


>gi|407768088|ref|ZP_11115467.1| hypothetical protein TH3_01375 [Thalassospira xiamenensis M-5 = DSM
           17429]
 gi|407288801|gb|EKF14278.1| hypothetical protein TH3_01375 [Thalassospira xiamenensis M-5 = DSM
           17429]
          Length = 683

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 251/704 (35%), Positives = 369/704 (52%), Gaps = 80/704 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED+G+A L+N+ FV+IK+DREERPD+D VY   +  L   GGWPL++
Sbjct: 52  ACHWCHVMAHESFEDDGIAALMNELFVNIKLDREERPDLDSVYQNALALLGQQGGWPLTM 111

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW----DKKRDMLAQSGAFAIEQLS 136
           FL+PD +P  GGTYFP E +YGRPGF  +L+ V + +    D  R  +AQ G  A+ +++
Sbjct: 112 FLTPDGEPFWGGTYFPKEARYGRPGFGDVLKSVSEIYTQQPDNIRHNVAQIGQ-ALIKMN 170

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
              + S  S  + D+        C     +  D   GG   APKFP+P  + ++     +
Sbjct: 171 SGATGSMPSLAMIDQ--------CGHGCLQIMDGENGGTNGAPKFPQPSILALIWRVGVR 222

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             DT       + +++V  +L  M +GGI+DHVGGGF RY+VD++W VPHFEKMLYD  Q
Sbjct: 223 TNDT-------DLKRIVRHSLDRMCQGGIYDHVGGGFARYAVDDQWLVPHFEKMLYDNAQ 275

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L ++  D +  T +  Y     + +D++ RDM  PGG   ++ DADS   EG     EG 
Sbjct: 276 LIDLLCDVWRETGNPLYEARISETIDWILRDMRVPGGAFAASLDADS---EGV----EGK 328

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FYVW   E+  ILG  A LFK+ Y + P+GN            ++ KN+L      + + 
Sbjct: 329 FYVWDEAEINAILGNDAALFKDIYDVSPSGN------------WEHKNIL------NRTQ 370

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           S LG+        L E R KL  VR+KR  P  DDK +  WN + I++ A A+ + K   
Sbjct: 371 SGLGLADRTTEKKLSETRTKLLAVRNKRIWPGWDDKALTDWNAMTIAALAEAAMVFK--- 427

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKAPGFLDDY 494
                        R ++++ A+ A +F+   L   +++  R  HS+RNG ++  G L+DY
Sbjct: 428 -------------RADWLDYAKLAYNFVINSLMTGESNDRRFLHSYRNGKAQHAGMLEDY 474

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +I   L LYE      +L  A E     + LF D + GGYF +  +   +++R K   
Sbjct: 475 AHMIRAALRLYECFGEDAYLREATEWCEAVENLFADTK-GGYFQSASDADDLVVRQKPHM 533

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D A P+GNSV   NL RL ++   +K   YR  AE ++A F  RL +    +P +  AA+
Sbjct: 534 DNAVPAGNSVMAQNLARLYALTGDTK---YRDRAEITIAAFAGRLNEQFPNMPGLLLAAE 590

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           ML  P +  +VL+  + S  +  M  A  A+Y  N+ +  +  ADT+ +         + 
Sbjct: 591 MLQNPLQ--IVLIAKERSQMYMEMRRAIFAAYLPNRAITIL--ADTDALP--------DL 638

Query: 675 SMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
             A+   + D    A VCQ   CS PVT+   L  LL   P+ +
Sbjct: 639 HPAKGKTAIDGHETAYVCQGSVCSAPVTNVADLAKLLANLPNKS 682


>gi|16081134|ref|NP_391962.1| hypothetical protein BSU40820 [Bacillus subtilis subsp. subtilis
           str. 168]
 gi|221312064|ref|ZP_03593911.1| hypothetical protein Bsubs1_22036 [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|221316389|ref|ZP_03598194.1| hypothetical protein BsubsN3_21942 [Bacillus subtilis subsp.
           subtilis str. NCIB 3610]
 gi|221321302|ref|ZP_03602596.1| hypothetical protein BsubsJ_21895 [Bacillus subtilis subsp.
           subtilis str. JH642]
 gi|221325585|ref|ZP_03606879.1| hypothetical protein BsubsS_22051 [Bacillus subtilis subsp.
           subtilis str. SMY]
 gi|402778252|ref|YP_006632196.1| protein YyaL [Bacillus subtilis QB928]
 gi|586842|sp|P37512.1|YYAL_BACSU RecName: Full=Uncharacterized protein YyaL
 gi|467366|dbj|BAA05212.1| unknown [Bacillus subtilis]
 gi|2636629|emb|CAB16119.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
           str. 168]
 gi|402483431|gb|AFQ59940.1| YyaL [Bacillus subtilis QB928]
 gi|407962936|dbj|BAM56176.1| hypothetical protein BEST7613_7245 [Bacillus subtilis BEST7613]
 gi|407966948|dbj|BAM60187.1| hypothetical protein BEST7003_3986 [Bacillus subtilis BEST7003]
          Length = 689

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 241/686 (35%), Positives = 364/686 (53%), Gaps = 65/686 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A     K  + L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 173 AA-----KTGEGLSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYDHN 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+ +E+   LG+    L+ + Y +   GN            F+GKN+   ++       +
Sbjct: 334 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 381

Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                EK L++ L + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +    
Sbjct: 382 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 437

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL
Sbjct: 438 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 483

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA
Sbjct: 484 LWAYLDLYEASFDLSYLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 543

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ + L+RL   V G  S    + AE   +VF+  ++           +     
Sbjct: 544 VPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDIEAYPSGHAFFMQSVLRHL 600

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +P +K +V+ G       + ++A    ++  N +++  +           E   + A  A
Sbjct: 601 MP-KKEIVIFGSADDPARKQIIAELQKAFKPNDSILVAEQP---------EQCKDIAPFA 650

Query: 678 RNNFSAD-KVVALVCQNFSCSPPVTD 702
            +    D K    +C+NF+C  P T+
Sbjct: 651 ADYRIIDGKTTVYICENFACQQPTTN 676


>gi|354559793|ref|ZP_08979037.1| hypothetical protein DesmeDRAFT_2750 [Desulfitobacterium
           metallireducens DSM 15288]
 gi|353540319|gb|EHC09795.1| hypothetical protein DesmeDRAFT_2750 [Desulfitobacterium
           metallireducens DSM 15288]
          Length = 653

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 260/712 (36%), Positives = 373/712 (52%), Gaps = 93/712 (13%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFED  VA+LLN  F++IKVDREERPD+D +YM + QAL G GGWPL++ ++P+ +
Sbjct: 1   MERESFEDTEVAELLNRSFLAIKVDREERPDIDHLYMEFCQALTGSGGWPLTILMTPEKQ 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS----------- 136
           P   GTYFP    YGRPG   +L ++ + WDK  + L +S    ++ ++           
Sbjct: 61  PFFTGTYFPKSSHYGRPGLIDLLSQISELWDKDENKLRKSAEEIVKAITSHQKRSSEEVN 120

Query: 137 ----EALS----------ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFP 182
                AL           ASA      +EL + +     + L +++DSR+GGFG APKFP
Sbjct: 121 PVEVHALQGFLNVQNGGDASADFQSWANELIEQSY----QALIQNFDSRYGGFGQAPKFP 176

Query: 183 RPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
            P  +  +L ++K   D       S+ + M+   L  M +GGI+DH+G GF RYS D++W
Sbjct: 177 SPHNLTFLLRYAKDHPD-------SQAEAMIRKNLDTMGQGGIYDHIGFGFARYSTDQQW 229

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
            VPHFEKMLYD   LA  Y++A+   K+   +   ++IL Y+ RDM  P G  +SAEDAD
Sbjct: 230 LVPHFEKMLYDNALLAIAYIEAYQSQKEPRDAQKAQEILTYVLRDMTSPEGGFYSAEDAD 289

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
           S   EG     EG FYVWT +E+  +LGE  + LF + + + P GN            F+
Sbjct: 290 S---EGI----EGKFYVWTPEEITSVLGEKRSALFCDVFNITPEGN------------FE 330

Query: 362 GKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
           GK++   L+ D    A K  +  E    IL E R KL+  R  R  PH DDK++ SWNGL
Sbjct: 331 GKSIPNRLSGDIGELARKHHLNPETLNYILEEDRLKLWQSREHRIHPHKDDKILTSWNGL 390

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
           +I + A+  ++         FN      D K Y+  AE AA F+  +LY  +  RL   F
Sbjct: 391 MIVALAKGGQV---------FN------DNK-YILAAEQAAHFVLENLYPNE--RLLARF 432

Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
           R+G +   G+LDDYAF I GLL+LY     + +L  A+ LQ   + LF D E GGY+ T 
Sbjct: 433 RDGNAAYLGYLDDYAFFIWGLLELYTASGKSDYLKSALSLQEQLETLFKDEEAGGYYLTG 492

Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
            +   +LLR KE +DGA PSGNS++ +NL+ LA +    +   ++  AE  L  F + L 
Sbjct: 493 SDGEELLLRPKEIYDGALPSGNSITALNLLHLARLTGDER---WKLQAEKQLLSFRSTLT 549

Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM--LAAAHASYDLNKTVIHIDPA 658
                      A      PS++ ++LVG   S++ E +  L     +  L  + +     
Sbjct: 550 SNPAGYTAFLQALQYALHPSQE-LLLVG---SLNHEGISPLRQTFFTIFLPYSSLLYHEG 605

Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
              E+  W         +    F  +KV+A +C NF+C  PV  P  L+ LL
Sbjct: 606 RLGELLPW---------VKDYPFDPNKVLAYLCTNFTCQKPVESPEELKALL 648


>gi|21226721|ref|NP_632643.1| hypothetical protein MM_0619 [Methanosarcina mazei Go1]
 gi|20905010|gb|AAM30315.1| conserved protein [Methanosarcina mazei Go1]
          Length = 700

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 238/705 (33%), Positives = 358/705 (50%), Gaps = 54/705 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCH+M  ESFEDE VA L+N+ FVSIKVDREERPD
Sbjct: 36  GEEAFEKARKENKPVFLSIGYSTCHWCHMMAHESFEDEEVAGLMNEAFVSIKVDREERPD 95

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +YMT  Q + G GGWPL++ ++P  KP   GTY P   ++ + G   ++ ++K+ W+
Sbjct: 96  IDNIYMTVCQIILGRGGWPLNIIMTPGKKPFFAGTYIPKNTRFNQIGMLELVPRIKEIWE 155

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           ++ + +  S       + E +  S+        L +  +    E+L  S+D+ +GGF  A
Sbjct: 156 QQHEEVLDSAEKITSTIQEMIKESSGEG-----LGEEVIEEVYEELLSSFDTEYGGFSGA 210

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P +I  +L + ++  +        E   M  +TL  M +GGI+DH+G GFHRYS 
Sbjct: 211 PKFPTPHKISFLLRYWRRSRN-------PEALHMAEYTLDKMRRGGIYDHLGSGFHRYST 263

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W +PHFEKMLYDQ   A  Y +A+ +T    Y      ILDY+ RD+  P G  +  
Sbjct: 264 DSMWLLPHFEKMLYDQALTAIAYTEAYQVTGKDLYKETAEGILDYVLRDLTSPEGGFYCG 323

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           EDAD         ++EG +Y+WT +E+  IL  E + L  + + L+  GN +     +  
Sbjct: 324 EDAD-------VEREEGKYYLWTLEEIRSILDPEDSELIIKMFNLREEGNFE----EEIR 372

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
               G N+        + A+K+ +P+E+    +   R KL   R +R RP LDDK++  W
Sbjct: 373 GRETGTNLFYMARSPGSLAAKMKIPVEEVEKKVKAAREKLLKARYERKRPSLDDKILTDW 432

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+I++FA+               + V G  R  Y++ AE AA FI   LY      L 
Sbjct: 433 NGLMIAAFAKG--------------YQVFGEQR--YLKAAEKAADFILMALYS-PGDGLL 475

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           H +R+G +   G  DDYAFLI GLL+LYE G   ++L  A+ L +   E F D   GG +
Sbjct: 476 HRYRDGVAGISGTSDDYAFLIHGLLELYEAGFKMRYLKAAVSLNSELLECFWDPVNGGLY 535

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            T  +  +++ R KE  D A P+GNS  ++NL+RL+ I+A    +   + A+     F  
Sbjct: 536 FTANDSEALIFRKKEFMDSAIPTGNSFEMLNLLRLSRIIADPGLE---ETADKLERAFSK 592

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
           ++            A D    PS + V++ G   + D E ML    + +  NK +I    
Sbjct: 593 QIMKAPSGYTQFLSAFDFRLGPSYE-VIISGKAEASDTEQMLKELWSYFVPNKVLIFRPE 651

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
            +  E+    ++      +        K  A VCQN+ C  P T+
Sbjct: 652 REKPEITELAKYTEEQVPI------EGKATAYVCQNYECQLPTTE 690


>gi|406830400|ref|ZP_11089994.1| hypothetical protein SpalD1_02134 [Schlesneria paludicola DSM
           18645]
          Length = 883

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 240/617 (38%), Positives = 333/617 (53%), Gaps = 63/617 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    ++C+WCHVME + F +E +AK LN  FV IKVDREERPD
Sbjct: 92  GPEAFEKAKKEGKMIFLSVGYSSCYWCHVMERKVFMNEAIAKTLNQDFVCIKVDREERPD 151

Query: 59  VDKVYMTYVQALY------GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRK 112
           VD +YMT +Q  Y        GGWPLS+FL+PD KP+ GGTYFPPE   G  GF  IL K
Sbjct: 152 VDDIYMTALQVYYQAIKAPASGGWPLSMFLTPDGKPIAGGTYFPPEATEGNEGFPAILAK 211

Query: 113 VKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRF 172
           + D W    + +  +      +    +    S    P E+    +      ++ S+D  F
Sbjct: 212 LTDLWKNNHEQMVGNADIVANETRRLMRPKLSLK--PVEVNAKLVESVFAAVAGSFDPEF 269

Query: 173 GGFG------SAPKFPRPVEI---QMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKG 223
           GG          PKFP P ++   Q MLY S   ED           K++  TL  +A G
Sbjct: 270 GGIDFNPNRPDGPKFPTPTKLSFLQQMLYRSPN-EDV---------SKLLDVTLLQLACG 319

Query: 224 GIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDY 283
           GI DHVGGGFHRYSVD RW VPHFEKMLYDQ QLA+VY +A+  +    +  +  ++ ++
Sbjct: 320 GIRDHVGGGFHRYSVDRRWDVPHFEKMLYDQAQLADVYAEAYRTSHQPLHKQVAEELFEF 379

Query: 284 LRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLK 343
           + RD+  P G  +SA D   AET G     EG FYVW + E++ ILG  A  FKE Y +K
Sbjct: 380 VARDLTAPEGGFYSAID---AETNGI----EGEFYVWDATEIDHILGRSAAAFKEAYRVK 432

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
              + +   +     +   K   I+   + ASA+  G   +++ +     R+KL +VR+K
Sbjct: 433 ELSDFEHGNVLRLSQKRLPKAEAIKAVATPASAT--GSEKDEFTS----SRQKLLEVRNK 486

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           R +P  D+K++  WNGL+I ++ARA         +A  N P       EY+E+A  AA F
Sbjct: 487 RKKPLRDEKLLTCWNGLMIGAYARA---------AAPLNHP-------EYVEIAARAAEF 530

Query: 464 IRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
           I     D Q  RL H++ +G +K   +LDDYAFLI GL+ LY+     KWL  A +LQ+ 
Sbjct: 531 ILTKARDSQG-RLLHTYASGQAKLNAYLDDYAFLIDGLISLYDATEDVKWLKVAKQLQDD 589

Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
           Q  LFLD   GG+F T+     +L R K   DG  P+GNSVS  NL+RLA++   +K   
Sbjct: 590 QLRLFLDESNGGFFFTSHHHEELLTRTKNCFDGVVPAGNSVSARNLIRLAAL---TKISS 646

Query: 584 YRQNAEHSLAVFETRLK 600
           Y   A  ++ +F + ++
Sbjct: 647 YADEARATVELFASNIE 663


>gi|170757692|ref|YP_001780692.1| hypothetical protein CLD_3500 [Clostridium botulinum B1 str. Okra]
 gi|169122904|gb|ACA46740.1| conserved hypothetical protein [Clostridium botulinum B1 str. Okra]
          Length = 680

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 243/693 (35%), Positives = 353/693 (50%), Gaps = 72/693 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++
Sbjct: 53  TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD  P   GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+     
Sbjct: 113 LMTPDKNPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
                N    EL +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK  
Sbjct: 170 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +   +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK+  +  I   IL+Y+++ M    G  +SAEDADS   EG     EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKILNYVKKSMTSDEGGFYSAEDADS---EGV----EGKFY 331

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +WT +E+ DILG E   L+ + Y +   GN            F+ KN+   +N       
Sbjct: 332 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVD 379

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                LEK        R+KLF+ R KR  P+ DDK++ SWN L+I +F++A +  K++  
Sbjct: 380 NNKDKLEK-------MRKKLFEYREKRIHPYKDDKILTSWNALMIIAFSKAGRSFKND-- 430

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF 
Sbjct: 431 --------------NYIEIAKKSANFIIENLMDERG-TLYARIREGERGNEGFIDDYAFF 475

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  L++LYE      +L  +IE+ ++  +LF  +E GG++  +     +L+R KE +DGA
Sbjct: 476 LWALIELYEASFDIYYLEKSIEVADSMIDLFWHKENGGFYLYSKNSEKLLVRPKEIYDGA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V+ + L  L  I      D Y+   +     F T +K   M   L    A M +
Sbjct: 536 TPSGNAVASLALNLLYYITG---EDRYKYLVDKQFKFFATNIKSGPM-YHLFSVMAYMYN 591

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +   K + L   +   DF   +   +  Y     V   D ++        E    N ++ 
Sbjct: 592 ILPVKEITLAYREKDEDFYKFINELNNRYIPFSIVTLNDKSN--------EIEKINKNIK 643

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 DK    +CQN++C  P+ D    + LL
Sbjct: 644 DKIAIKDKTTVYICQNYACREPIADLEEFKFLL 676


>gi|119184130|ref|XP_001243004.1| hypothetical protein CIMG_06900 [Coccidioides immitis RS]
          Length = 797

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 244/611 (39%), Positives = 336/611 (54%), Gaps = 49/611 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN  FV IK+DREERPD+D+VYM YVQA+ G GGWPL+
Sbjct: 69  SACHWCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLN 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
           VFL+PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S    
Sbjct: 129 VFLTPDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEI 188

Query: 132 IEQLSEALSASASSNKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
             QL E  +   +  + P     ++L    L    +     YD   GGF  APKFP P  
Sbjct: 189 TRQLRE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPAN 247

Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
           +  +L    Y    ++  G+  E +   +MV  TL  MA+GGIHD +G GF RYSV   W
Sbjct: 248 LSFLLRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDW 306

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
            +PHFEKMLYDQ QL +VY+D F +T++        DI+ Y+    ++ P G   S+EDA
Sbjct: 307 SLPHFEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDA 366

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
           DS      T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R +DPH+EF
Sbjct: 367 DSFPNSNDTEKREGAFYVWTLKEMQQILGQRDAEVCAHHWGVLPDGN--VARGNDPHDEF 424

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
             +NVL         A   G+  ++ + ++   R+KL + R + R RP LDDK+IVSWNG
Sbjct: 425 INQNVLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNG 484

Query: 420 LVISSFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           L I + A+ S +L K +AE A                VAE AA FIR +L+D +T +L  
Sbjct: 485 LAIGALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWR 533

Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
            +R+G   + PGF DDYA+L SGL+ LYE      +L +A  LQ   +  FL        
Sbjct: 534 VYRDGRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTT 593

Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
             GY+    N  G+ P  L R+K   D A PS N V   NL+RLAS++   + D Y+  A
Sbjct: 594 PAGYYMTPQNMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALA 650

Query: 589 EHSLAVFETRL 599
            H+ + F   +
Sbjct: 651 RHTCSAFAAEM 661


>gi|392865908|gb|EAS31753.2| hypothetical protein CIMG_06900 [Coccidioides immitis RS]
          Length = 799

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 244/611 (39%), Positives = 336/611 (54%), Gaps = 49/611 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN  FV IK+DREERPD+D+VYM YVQA+ G GGWPL+
Sbjct: 69  SACHWCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLN 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
           VFL+PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S    
Sbjct: 129 VFLTPDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEI 188

Query: 132 IEQLSEALSASASSNKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
             QL E  +   +  + P     ++L    L    +     YD   GGF  APKFP P  
Sbjct: 189 TRQLRE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPAN 247

Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
           +  +L    Y    ++  G+  E +   +MV  TL  MA+GGIHD +G GF RYSV   W
Sbjct: 248 LSFLLRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDW 306

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
            +PHFEKMLYDQ QL +VY+D F +T++        DI+ Y+    ++ P G   S+EDA
Sbjct: 307 SLPHFEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDA 366

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
           DS      T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R +DPH+EF
Sbjct: 367 DSFPNSNDTEKREGAFYVWTLKEMQQILGQRDAEVCAHHWGVLPDGN--VARGNDPHDEF 424

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
             +NVL         A   G+  ++ + ++   R+KL + R + R RP LDDK+IVSWNG
Sbjct: 425 INQNVLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNG 484

Query: 420 LVISSFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           L I + A+ S +L K +AE A                VAE AA FIR +L+D +T +L  
Sbjct: 485 LAIGALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWR 533

Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
            +R+G   + PGF DDYA+L SGL+ LYE      +L +A  LQ   +  FL        
Sbjct: 534 VYRDGRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTT 593

Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
             GY+    N  G+ P  L R+K   D A PS N V   NL+RLAS++   + D Y+  A
Sbjct: 594 PAGYYMTPQNMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALA 650

Query: 589 EHSLAVFETRL 599
            H+ + F   +
Sbjct: 651 RHTCSAFAAEM 661


>gi|51892001|ref|YP_074692.1| hypothetical protein STH863, partial [Symbiobacterium thermophilum
           IAM 14863]
 gi|51855690|dbj|BAD39848.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM
           14863]
          Length = 623

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 254/681 (37%), Positives = 362/681 (53%), Gaps = 76/681 (11%)

Query: 27  VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 86
           +ME ESF D   A+++N  FV IKVDREERPD+D +Y T  Q +   GGWPLSV+L+P+ 
Sbjct: 1   MMERESFADPETAEIMNRHFVCIKVDREERPDLDDIYQTICQLVTRSGGWPLSVWLTPEQ 60

Query: 87  KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR---DMLAQSGAFAIEQLSEALSASA 143
           KP   GTYFPP ++YGRPGF+ +L  +  AW +KR   + +A+S A  I Q  E L    
Sbjct: 61  KPFYVGTYFPPVERYGRPGFRQVLLALAQAWREKRQEVEKVAESWARGIAQTDELLP--- 117

Query: 144 SSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGK 202
            +  +PD  L  +A R  AE++    D + GGFG APKFP  + + +ML H K   D   
Sbjct: 118 PAGPMPDHRLVADAARALAERI----DRQHGGFGGAPKFPNTMALDLMLRHWKATGD--- 170

Query: 203 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 262
                    +V  TL+ MA+GGI+D +GGGFHRYSVD RW VPHFEKMLYD   L  VYL
Sbjct: 171 ----DLFLHLVTLTLRKMAEGGIYDQLGGGFHRYSVDARWAVPHFEKMLYDNALLPAVYL 226

Query: 263 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 322
            A+  T +  +  I  + LDY+ R+M  P G  FS  DADS   EG    +EG +YVW  
Sbjct: 227 AAWQATGEPLFRRIVEETLDYVLREMTHPEGGFFSTTDADS---EG----EEGRYYVWDP 279

Query: 323 KEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
           +EV  +LG +   L   HY +   GN           E  GK VL     ++  AS LG+
Sbjct: 280 REVTAVLGPDLGALICRHYGVTEAGNF----------ERTGKTVLHIAEPAADLASSLGL 329

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
           P+E+    L E RR+L + RS+R  P  D+K++  WNGL+IS+ ARA +IL+        
Sbjct: 330 PVEEVERRLAEGRRRLLEARSRRVPPFRDEKILAGWNGLMISALARAGRILR-------- 381

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
                   R +Y E A  AA+F+   L D +   L+  +++G +  PG+L+D+AF+ +GL
Sbjct: 382 --------RPDYAEAARRAATFVLDRLADGEGGLLRR-YKDGHAGIPGYLEDHAFMAAGL 432

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           +DLYE     ++L  A+ L       F D  G  +   +G +P ++ R ++  D + PSG
Sbjct: 433 IDLYECTFDERFLQEAMRLTEETLRRFYDGSGSFHLTQSGAEP-LIHRPRDTTDQSVPSG 491

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSVPS 620
            +V+V+NL+RL       + D +R+ A+ +       +  +  A   +  A D+ L  P+
Sbjct: 492 AAVAVVNLLRLQPY---RRDDRFREVADTAFRAHRDLMARVPGATATLLQALDLYLDGPT 548

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
              V LVG       E  L A    Y+ N  +  I            E   ++A +    
Sbjct: 549 --EVTLVGDPP----EAWLEALGRRYEPNLVLTRI------------EAPRDDAPIWAGK 590

Query: 681 FSADKVVALVCQNFSCSPPVT 701
            +    VA VC+NF+CSPP T
Sbjct: 591 AAGTGPVAYVCRNFACSPPAT 611


>gi|430756760|ref|YP_007207432.1| hypothetical protein A7A1_1268 [Bacillus subtilis subsp. subtilis
           str. BSP1]
 gi|430021280|gb|AGA21886.1| Hypothetical protein YyaL [Bacillus subtilis subsp. subtilis str.
           BSP1]
          Length = 689

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 241/691 (34%), Positives = 362/691 (52%), Gaps = 75/691 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A +        L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 173 AAKSGEG-----LSESAISRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 225 TGQDNALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+ +E+   LG+    L+ + Y +   GN            F+GKN+   ++       +
Sbjct: 334 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 381

Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                EK L++ L + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +    
Sbjct: 382 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 437

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL
Sbjct: 438 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 483

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA
Sbjct: 484 LWAYLDLYEASFDLSYLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 543

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ + L+RL   V G  S    + AE   +VF+  +            +     
Sbjct: 544 VPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKLDIDAYPSGHAFFMQSVLRHL 600

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +P +K +V+ G       + ++     ++  N +++              EH      +A
Sbjct: 601 MP-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA 647

Query: 678 RNNFSAD------KVVALVCQNFSCSPPVTD 702
              F+AD      K    +C+NF+C  P T+
Sbjct: 648 --PFAADYRIIDGKTTVYICENFACQQPTTN 676


>gi|161528699|ref|YP_001582525.1| hypothetical protein Nmar_1191 [Nitrosopumilus maritimus SCM1]
 gi|160340000|gb|ABX13087.1| protein of unknown function DUF255 [Nitrosopumilus maritimus SCM1]
          Length = 675

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 249/686 (36%), Positives = 370/686 (53%), Gaps = 74/686 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFE+E VAK +N+ FV+IKVDREERPD+D +Y    Q   G GGWPLS
Sbjct: 49  SSCHWCHVMAHESFENEEVAKFMNENFVNIKVDREERPDIDDIYQKACQIATGQGGWPLS 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+PD KP   GTYFP  D YGRPGF +I R++  AW +K   + +S    ++ L++  
Sbjct: 109 IFLTPDQKPFYVGTYFPILDSYGRPGFGSICRQLSQAWKEKPKDIEKSADNFLDALNKTE 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             S SS     +L +  L   A  L +  DS +GGFGSAPKFP    +  +  ++K    
Sbjct: 169 KVSISS-----KLERTILDEAAMNLFQLGDSAYGGFGSAPKFPNAANVSFLFRYAKI--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +G S     G K    TL+ MA GGI D +GGGFHRYS D +W VPHFEKMLYD   +  
Sbjct: 221 SGLSKFTEFGLK----TLKKMANGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPV 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +AF +TKD FY  + +  LD++ R+M  P G  +SA DADS   EG     EG FYV
Sbjct: 277 NYAEAFQITKDPFYLDVLKKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKFYV 329

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W   E+++ILG+ A +F   Y     GN            ++G N+L    + S  A   
Sbjct: 330 WKKSEIKEILGDDADIFCLFYDATDGGN------------WEGNNILCNNLNISTVAFNF 377

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G   EK   IL  C +KL DVRSKR  P LDDK++VSWN L+I++FA+  ++        
Sbjct: 378 GTTEEKVREILQACSKKLLDVRSKRVAPGLDDKILVSWNSLMITAFAKGYRV-------- 429

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                   ++   Y++ A+   SFI  +L+     +L  +++N  +K  G+L+DY++ ++
Sbjct: 430 --------TNESRYLDAAKDCISFIENNLF--SGDKLLRTYKNKTAKIDGYLEDYSYFVN 479

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LLD++E     K+L  A++L +   E F D E   +F T+     +++R K ++D + P
Sbjct: 480 CLLDVFEIEPDPKYLKLALKLGHHLVEHFWDSENNSFFMTSDNHEKLIIRPKSNYDLSLP 539

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAAD 614
           SGNSVS   ++RL            +Q  + +  + E++ + MA   P     L+   + 
Sbjct: 540 SGNSVSAFVMLRLFHFSQE------QQFLDIATKIMESQAQ-MAAENPFGFGYLLNTISI 592

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
            L  P    + ++  ++S   +++L      Y  N  V+ I   ++ ++    E+     
Sbjct: 593 YLEKPVE--ITIINTENSQLCDSIL----LEYLPNSIVVTIQ--NSTQLSALSEY----P 640

Query: 675 SMARNNFSADKVVALVCQNFSCSPPV 700
             A  +F  +K  A VC+NF+CS P+
Sbjct: 641 FFAGKSFE-EKTSAFVCKNFTCSLPL 665


>gi|407462858|ref|YP_006774175.1| hypothetical protein NKOR_06800 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407046480|gb|AFS81233.1| hypothetical protein NKOR_06800 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 675

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 247/684 (36%), Positives = 366/684 (53%), Gaps = 70/684 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFE+E VA+ +N+ FV+IKVDREERPD+D +Y    Q   G GGWPLS
Sbjct: 49  SSCHWCHVMAHESFENEEVAQFMNENFVNIKVDREERPDIDDIYQKVCQIATGQGGWPLS 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+PD KP   GTYFP  D YGRPGF +I R++  AW +K   + +S    ++ L++  
Sbjct: 109 IFLTPDQKPFYVGTYFPVLDSYGRPGFGSICRQLAQAWKEKPHDIEKSANNFLDALNKTE 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             S      P +L +  L   A  L +  DS +GGFGSAPKFP    +  +  ++K    
Sbjct: 169 KIST-----PSKLERTILDEAAMNLFQLGDSTYGGFGSAPKFPNAANVSFLFRYAKL--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           +G S     G K    TL+ MA GGI D +GGGFHRYS D +W VPHFEKMLYD   +  
Sbjct: 221 SGLSKFTEFGLK----TLKKMANGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPV 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +AF +TKD FY  I +  LD++ R+M  P G  +SA DADS   EG     EG FYV
Sbjct: 277 NYAEAFQITKDPFYLDILKKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKFYV 329

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W   E+++ILG+ + +F  +Y +   GN            ++G N+L    + S  A   
Sbjct: 330 WKKSEIKEILGDDSDIFCLYYDVTDGGN------------WEGNNILCNNLNISTVAFNF 377

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+  EK   IL  C +KL DVRSKR  P LDDK++VSWN L+I++FA+  ++        
Sbjct: 378 GITEEKVREILQSCSKKLLDVRSKRIAPGLDDKILVSWNALMITAFAKGCRV-------- 429

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                   ++   Y+  A++  SFI  +L+     +L  +++N  +K  G+L+DY++ ++
Sbjct: 430 --------TNDSRYLNAAKTCISFIEDNLF--SGDKLLRTYKNKTAKIDGYLEDYSYFVN 479

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LLD++E     K+L  A++L +   + F D E   +F T+     +++R K ++D + P
Sbjct: 480 CLLDVFEIEPDPKYLKLALKLGHHLVDHFWDSENNSFFMTSDNHEKLIIRPKSNYDLSLP 539

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-MCCAADMLSV 618
           SGNSVS   ++RL  +    K        E +  + E++ + MA   P       + +S+
Sbjct: 540 SGNSVSAFAMLRLFHLSQEKKF------LEITEKIMESQAQ-MAAENPFGFGYLLNTISI 592

Query: 619 PSRKHVVLVGHKSSVDFEN--MLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
              K + +    + ++ EN  +  +    Y  N  V+ I   D           S     
Sbjct: 593 YLEKPIEI----TIINTENSPLCKSILLEYLPNSIVVTIQNPDQLSA------LSQYPFF 642

Query: 677 ARNNFSADKVVALVCQNFSCSPPV 700
           A  +F  DK    VC+NF+CS P+
Sbjct: 643 AGKSFE-DKTSVFVCKNFTCSLPL 665


>gi|375308642|ref|ZP_09773925.1| hypothetical protein WG8_2450 [Paenibacillus sp. Aloe-11]
 gi|375079269|gb|EHS57494.1| hypothetical protein WG8_2450 [Paenibacillus sp. Aloe-11]
          Length = 690

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 251/698 (35%), Positives = 356/698 (51%), Gaps = 70/698 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM+ ESFEDE +A++LN  +VSIKVDREERPDVD +YM+  Q + G GGWPL+
Sbjct: 55  SSCHWCHVMKRESFEDEEIAEILNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLT 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTY P E K+GR G   +L KV   W ++ + L       +E   + L
Sbjct: 115 ILMTPDQKPFFAGTYLPKEQKFGRVGLLELLDKVGTRWKEQPEEL-------VELSEQVL 167

Query: 140 SASASSNKLP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           +     + L     EL + +L     Q S ++D  +GGFG APKFP P  +  +L +++ 
Sbjct: 168 TEHERQDMLAGYRGELDEQSLNKAFHQYSHTFDKEYGGFGEAPKFPSPHILSFLLRYAQH 227

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
              TG      +  +MV  TL  M +GGI+DHVG GF RYSVDE+W VPHFEKMLYD   
Sbjct: 228 ---TGN----QQALEMVEKTLDAMYRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNAL 280

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           LA  Y + + +T    Y  I   I  Y+ R+M   GG  +SAEDADS   EG    +EG 
Sbjct: 281 LAIAYTETWQVTGKELYRQITEQIFTYIAREMTDAGGAFYSAEDADS---EG----EEGR 333

Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSS 373
           FYVW   EV  +LG E A  F + Y + P GN            F+G N+  LI++N   
Sbjct: 334 FYVWDDSEVRAVLGDEDASFFNDLYGITPYGN------------FEGHNIPNLIDIN-LE 380

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
           A   K  +  ++  + + E R KLF  R KR  PH DDK++ SWNGL+I + A+A +   
Sbjct: 381 AYGLKHDLTKQELEDRVRELRDKLFAAREKRVHPHKDDKILTSWNGLMIVALAKAGQAFG 440

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                              Y E A+ A SF+  HL      RL   +R+G +  PG+LDD
Sbjct: 441 DVT----------------YTERAQKAESFLWSHL-RRVDGRLLARYRDGDAAYPGYLDD 483

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAF + GL++LY+     ++L  A+ L     +LF D E  G F    +   ++ + KE 
Sbjct: 484 YAFYVWGLIELYQATFDVQYLQRALTLNQNMIDLFWDEEHHGLFFYGKDSEQLIAKPKEI 543

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           +DGA PSGNS++  NLVRLA +   ++ + Y   A      F   +         +  + 
Sbjct: 544 YDGAIPSGNSIAAHNLVRLARLTGEARLEDY---AAKQFKAFGGMVSYDPPGYSALLSSL 600

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
            + +  + K +V+VG +        + A  A +  N   I  D   +   D         
Sbjct: 601 -LYATGTTKEIVIVGQRDDPQTLQFIRAIQAGFRPNTVAILKDEGQSAIADI-------- 651

Query: 674 ASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
               R+    D K    VC++F+C  PV     L+ LL
Sbjct: 652 VPYIRDYTLVDGKPAVYVCEHFACQAPVMTLDDLKALL 689


>gi|157690983|ref|YP_001485445.1| thioredoxin [Bacillus pumilus SAFR-032]
 gi|157679741|gb|ABV60885.1| possible thioredoxin [Bacillus pumilus SAFR-032]
          Length = 687

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 250/691 (36%), Positives = 361/691 (52%), Gaps = 77/691 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+  Q + G GGWPL+V
Sbjct: 54  TCHWCHVMAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNV 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++PD KP   GTYFP    YGRPGF   L ++ DA+   RD         IE L+E  +
Sbjct: 114 FVTPDQKPFYAGTYFPKRSAYGRPGFIEALTQLLDAYHNDRD--------HIESLAEKAT 165

Query: 141 AS---ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            +    ++ +  + L Q  +     QL  S+D+  GGFG+APKFP P    M+ +  +  
Sbjct: 166 NNLRIKAAGQTENTLTQETIHKAYYQLMSSFDTLHGGFGTAPKFPAP---HMLSFLMRYY 222

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           E TG+        K    TL  +A GGI+DHVG GF RYS DE+W VPHFEKMLYD   L
Sbjct: 223 EWTGQENALYAVTK----TLDGIANGGIYDHVGSGFSRYSTDEKWLVPHFEKMLYDNALL 278

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ LT+   Y  +   ++ +++RDM+ P G  +SA DADS   EG    KEG F
Sbjct: 279 MEAYTEAYQLTQQPTYEKLVHRLIHFIKRDMMNPDGSFYSAIDADS---EG----KEGQF 331

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW+  E+   LGE    LF   Y++   GN +   +  PH       +    +D  AS 
Sbjct: 332 YVWSKDEIMTHLGEDLGALFCAVYHITDEGNFEGENI--PH------TISTSFDDIKASF 383

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           S     L+  L    E R  L  VR +RP P +DDKV+ SWN L+IS+ A+  ++     
Sbjct: 384 SIDDQTLQSKLQ---EARYILQSVRQQRPAPLVDDKVLTSWNALMISALAKTGRVF---- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       D +E + +A+ A SF+  HL   Q  RL   +R G  K  GF++DYA 
Sbjct: 437 ------------DAEEAIRMAKQAISFLETHLV--QHDRLMVRYREGDVKHLGFIEDYAH 482

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           ++   + LYE      WL  A  +     ELF D+E GG+F +  +  ++L+R KE +DG
Sbjct: 483 MLKAYMSLYEATFELAWLEKATAIAENMFELFWDKEKGGFFFSGSDAEALLVREKEVYDG 542

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCA--- 612
           A PSGNS ++ +L+ L+ +         RQN   +L  +F+    D++ + P    A   
Sbjct: 543 AMPSGNSTALKHLLILSRLTG-------RQNWLDTLEQMFQAFYVDVS-SYPSGHTAFLQ 594

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
             +    +++ ++++G       E +L A      L K  +  D   T E     +  + 
Sbjct: 595 GLLAQYATKREIIILGKNGDPQKEQLLQA------LQKRFMPFDIILTAETG---QELAK 645

Query: 673 NASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
            A   ++  + D K    +C+N+SC  P+TD
Sbjct: 646 LAPFTKDYKTIDGKTTVYICENYSCRQPITD 676


>gi|373849972|ref|ZP_09592773.1| hypothetical protein Opit5DRAFT_0827 [Opitutaceae bacterium TAV5]
 gi|372476137|gb|EHP36146.1| hypothetical protein Opit5DRAFT_0827 [Opitutaceae bacterium TAV5]
          Length = 785

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 248/717 (34%), Positives = 371/717 (51%), Gaps = 74/717 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  E+F    VA  LN+ F+ +K+DREERPD+D++Y+ +V    G GGWPL+
Sbjct: 112 STCHWCHVMRRETFSRADVAAFLNEHFIPVKLDREERPDIDRIYLAFVAGTTGRGGWPLN 171

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PDLKP +GGTY+PPED+ G+PGF T+ R   + W + R+ +A            A 
Sbjct: 172 VWLTPDLKPFLGGTYYPPEDQPGQPGFLTVARVAAEGWARDREKVAAH-----ADRIAAA 226

Query: 140 SASASSNKLPDE---------LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
            AS +    PD+         +   A    A QL + +D   GGFG   KFP   +I+ +
Sbjct: 227 LASLAGAAGPDQRSGRSGAATIDNAAWSAAAAQLFEEFDPEHGGFGRDAKFPHASKIRFL 286

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
              +  ++    +GEA+  +++   +L+ +  GG+ DH+GGGFHRY+VD  W +PHFEKM
Sbjct: 287 FRFA--VQPGVPAGEAARAREVAFASLEALTGGGLRDHLGGGFHRYTVDRGWRLPHFEKM 344

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYDQ  +A + +DA+ L+ D     + R+ L ++   +  P G  ++A DA+SA    A 
Sbjct: 345 LYDQALVAGLLVDAYQLSGDTRRFDLLRETLAFVEAALTSPDGAFYAALDAESALPGAAE 404

Query: 311 -RKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-- 366
             K EGAFY W+  E+   L  + A L    Y     GN   + + +       +NVL  
Sbjct: 405 GDKAEGAFYTWSLDEITAALPPDEAALVIARYGFTAEGNA--TSLEERAGVLHNRNVLVP 462

Query: 367 ------IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
                   +  +  +A KL   L+           +L  +RS R  P  D+K+I +WNG 
Sbjct: 463 ASSAAATAVTKAPGAAEKLSRALD-----------RLRAIRSTRQPPARDEKIITAWNGY 511

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
           +IS+ ARA +              V G  R  ++++A  AA+ + +  ++ +T  L+   
Sbjct: 512 MISALARAHQ--------------VTGESR--WLDLATRAATHLWQTAWNGKTATLRRI- 554

Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-----GGG 535
              P    GF +DYA  I GLLDLYE G   +WL  A+ LQ T D  F D       GGG
Sbjct: 555 -AAPGGGDGFAEDYAAFIQGLLDLYEAGFDPRWLDRALALQATLDTRFADPAPASAGGGG 613

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           YF T      VL+R+KED DGAEP+ +S++  NL RLA     +    Y   A   LA F
Sbjct: 614 YFGTAAGASGVLVRMKEDFDGAEPAASSLAADNLRRLAVFTGDAA---YEHRARAVLAAF 670

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSR-KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
             + +    A+P++  AA  L+  ++ + +V+ G   + D   +LA A   +    T++ 
Sbjct: 671 APQHRRAPAAMPVLLAAAFGLAEGAKPRQIVIAGRAGADDTRALLAEARRRFQPFATILL 730

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
              AD    D+  + N   A+M     SAD +  A VC+NF+C  PV+DP +L  LL
Sbjct: 731 ---ADGASGDWLAQRNEAVAAMR----SADGQATAFVCENFACDAPVSDPAALGRLL 780


>gi|167043013|gb|ABZ07725.1| putative protein of unknown function, DUF255 [uncultured marine
           microorganism HF4000_ANIW141A21]
          Length = 678

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 254/711 (35%), Positives = 380/711 (53%), Gaps = 83/711 (11%)

Query: 10  TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
           +K +R + +I      +TCHWCHVM  E+FE++  A++LN  F+ IKVDREERPD+D++Y
Sbjct: 39  SKAKRENKIIFLSIGYSTCHWCHVMAHETFENDEAAEILNQNFIPIKVDREERPDIDELY 98

Query: 64  MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-D 122
           M  V ++ G GGWPL+VFL+PDLKP  GGTY+P         FK++L  V + W+K+R D
Sbjct: 99  MKAVTSMGGQGGWPLTVFLTPDLKPFYGGTYYP------LSSFKSLLGSVTEIWNKQRKD 152

Query: 123 MLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFP 182
           +  Q+ +  +E L    +    S+    E P +A  L    L  S+D R+GGFG +PKFP
Sbjct: 153 VFGQANSI-VENLRRMYTPQEQSS--ISEYPIDAAYL---NLVDSFDDRWGGFGDSPKFP 206

Query: 183 RPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
            P  + ++L    +  D  K+ +A +   MV+ TL  M+ GGI DH+ GGFHRYSVD  W
Sbjct: 207 TPSNLILLL----RYYDRSKNHKALD---MVVKTLDAMSSGGIQDHLAGGFHRYSVDRMW 259

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
            + HFEKMLYD   L   YL+A+    +  +    R  L+++ R+M    G  +SA+DAD
Sbjct: 260 VISHFEKMLYDNALLTIAYLEAYRCKPNDAFEKTARMTLNWILREMQSKDGAFYSAQDAD 319

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
           S +        EGA+YVW+  E+ DILG ++ ++  E + +   GN +           K
Sbjct: 320 SPDG-------EGAYYVWSKAEISDILGPKNGMIVAEWFGVGDEGNFE-----------K 361

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
            K+VL    +    A K+G+  +K + ++ + +  L   RS R +P  DDK++ SWNGL 
Sbjct: 362 EKSVLTTRTNLDDLAKKVGLTPKKLVALMDKSKAALLQARSHRVKPSTDDKILTSWNGLT 421

Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
           IS+ A  +++L                DR EY+E A+ AASF+   L   +  RL   +R
Sbjct: 422 ISALALGAQVL---------------GDR-EYLEAAKRAASFLMETL--SEKGRLLRRYR 463

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
           +G +   G L+DYAF I GLLDLYE     KWL  A+ L +   ELF D   GG+F   G
Sbjct: 464 DGEAALGGTLEDYAFFIQGLLDLYEADLQIKWLQEAMRLADKMIELFWDDSSGGFF-FNG 522

Query: 542 EDPS--VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           +D S  +++++KE +DGA PSGNSV  + L++L      S+ D YR+    ++  F  R+
Sbjct: 523 KDSSDNMIVKIKEAYDGATPSGNSVGALALLKLGVF---SERDEYREKGVKTIMSFFGRI 579

Query: 600 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
           +   MA   M  A D     SR+ +++ G  +++   +ML      Y  NK V+ +    
Sbjct: 580 ESNPMAHSHMLSAVDFHLRGSRE-IIVAGSDANL-INDMLHEIWRRYIPNK-VLALSGKA 636

Query: 660 TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            E+             M +       V   +C+NF C  PV+    L  +L
Sbjct: 637 VEK----------TIPMVKGKIGT-PVSVYICENFVCKRPVSKLKELTAML 676


>gi|384170788|ref|YP_005552166.1| hypothetical protein BAXH7_04212 [Bacillus amyloliquefaciens XH7]
 gi|341830067|gb|AEK91318.1| hypothetical protein BAXH7_04212 [Bacillus amyloliquefaciens XH7]
          Length = 664

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 246/696 (35%), Positives = 357/696 (51%), Gaps = 70/696 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 28  STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 87

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  
Sbjct: 88  VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQH--------VEDIAENA 139

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+L+  +  
Sbjct: 140 AAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLLFLLRYY 196

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 197 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 252

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            + Y +A+ +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +
Sbjct: 253 LSAYTEAYQVTNNERYKQIATQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 305

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSA 374
           Y+W+ KE+ ++LG+    L+ + Y +   GN            F+G+N+  LI      A
Sbjct: 306 YIWSKKEIMNLLGDQLGSLYCKVYNITEQGN------------FEGENIPNLI-FTRREA 352

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
              + G+   +    L   R+KL + R  R  PH DDKV+ SWN L+I+  A+A+K+   
Sbjct: 353 ILEETGLTEHELTERLEGARKKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKVFHE 412

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                             ++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDY
Sbjct: 413 PG----------------FLSMAETAIRFLERHLIPDG--RVMVRYREGEVKNKGFIDDY 454

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           AFLI   L+LYE G    +L  A  L  +  +LF D   GG+F T  +  ++L+R KE +
Sbjct: 455 AFLIWAYLELYEAGFNPSYLKKAKTLCTSMLDLFWDERHGGFFFTGNDAETLLVREKEVY 514

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DGA PSGNS + + L+RL  +          + AE   +VF+  ++    +      +  
Sbjct: 515 DGAVPSGNSAAAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSV- 570

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           +  +  +K +V+ G K   D +  + A    +    T++  +          EE    + 
Sbjct: 571 LAHIMPQKEIVVFGSKDDPDRKWFIEALQEHFTPAYTILAAENP--------EELAGISD 622

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             A       K    +C+NF+C  P TD     N+L
Sbjct: 623 FAAGYEMIDGKTTVYICENFTCRRPTTDIDEAMNVL 658


>gi|443631576|ref|ZP_21115757.1| hypothetical protein BSI_08280 [Bacillus subtilis subsp.
           inaquosorum KCTC 13429]
 gi|443349381|gb|ELS63437.1| hypothetical protein BSI_08280 [Bacillus subtilis subsp.
           inaquosorum KCTC 13429]
          Length = 689

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 244/698 (34%), Positives = 364/698 (52%), Gaps = 89/698 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED  +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDAEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A +        L ++A      QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 173 AAKSGEG-----LSESATHRTFLQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIE------LN 370
           W+  E+   LG+    L+ + Y +   GN            F+GKN+  LI       + 
Sbjct: 334 WSKDEILKTLGDDLGTLYCQVYDITEKGN------------FEGKNIPNLIHTKREQLIA 381

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
           D+S +  +L + LE       + R++L  +R +R  PH+DDKV+ SWN L+I+  A+A+K
Sbjct: 382 DASLTKEELNLKLE-------DARQQLLKIREERTYPHVDDKVLTSWNALMIAGLAKAAK 434

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           + +                  +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF
Sbjct: 435 VYQ----------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGF 476

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           +DDYAFL+   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R 
Sbjct: 477 IDDYAFLLWAYLDLYEASFDLSYLRKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVRE 536

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           KE +DGA PSGNSV+ + L+RL   V G  S    + AE   +VF+  +           
Sbjct: 537 KEVYDGAMPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAESMFSVFKPDIDAYPSGHAFFM 593

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            +     +P +K +V+ G+      + ++ A   ++  N +++              EH 
Sbjct: 594 QSVLKHLMP-KKEIVIFGNADDPARKQIITALQKAFKPNDSIL------------VAEHP 640

Query: 671 SNNASMARNNFSAD------KVVALVCQNFSCSPPVTD 702
                +A   F+AD      K    +C+NF+C  P T+
Sbjct: 641 DECTDIAP--FAADYRIIDGKTTVYICENFACQQPTTN 676


>gi|407478214|ref|YP_006792091.1| hypothetical protein Eab7_2389 [Exiguobacterium antarcticum B7]
 gi|407062293|gb|AFS71483.1| Hypothetical protein Eab7_2389 [Exiguobacterium antarcticum B7]
          Length = 677

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 244/729 (33%), Positives = 375/729 (51%), Gaps = 97/729 (13%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F     T +  FL    +TCHWCHV+  ESFEDE  A++LN+ FVSIKVDREERPD
Sbjct: 28  GEEAFSLARATNKPIFLSIGYSTCHWCHVLAHESFEDEETARMLNERFVSIKVDREERPD 87

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YMT  Q + G GGWPLSVFLSPD  P   GTYFP   ++ RP F+ ++ ++ + + 
Sbjct: 88  IDQIYMTAAQLMNGQGGWPLSVFLSPDQTPFYIGTYFPKTPQFNRPSFRQVILQLSEHYR 147

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
              + + + G   I+ L++  SA  ++ +L D L  +      +Q  + +D + GGFG A
Sbjct: 148 TDPEKIKRVGNELIQALTDVTSAD-TTGQLDDTLIHDTF----DQAMRQFDVQNGGFGEA 202

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L       D  +  E     +MV+ TL  M  GGI D +G G  RY+V
Sbjct: 203 PKFPSPSLLTFLL-------DYYRFAEDETALQMVMRTLTAMRDGGITDQIGFGLCRYTV 255

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DERW VPHFEKMLYD    A + ++ + ++    +     ++  Y+ RD++ P G  +SA
Sbjct: 256 DERWDVPHFEKMLYDNALFATLCIETYQVSGRERFKQYAEEVFTYIERDLLSPDGAFYSA 315

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADS   EG    +EG FY +T  E+ D+LGE A LF   Y   P GN           
Sbjct: 316 EDADS---EG----REGTFYTFTYDELLDVLGEDA-LFPRFYQATPQGN----------- 356

Query: 359 EFKGKNVLIELNDSSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
            F G+NV    N S    A   G  ++K L  L + R+ L  VRS+R RP  DDK++ +W
Sbjct: 357 -FDGRNVFRRTNQSVQQFADDNGRTVQKTLFQLEQERQTLLHVRSQRIRPFRDDKILTAW 415

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           N L+IS++A+A ++                 D   Y +VA  A +F+  HL D+   RL+
Sbjct: 416 NALMISAYAKAGRVF----------------DDHHYTDVAIRALTFLETHLMDDD--RLR 457

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
             +R G  +  GFLDDY+FL    L+L++    T ++  A+ L +   + F D E G +F
Sbjct: 458 VRYREGHIQGNGFLDDYSFLTEAYLELHQTTQQTVYIQQALRLTDRMIQDFGD-EQGSFF 516

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            T+ E+ ++L+R K+ +DG +P+GNS +V+NL+RL+ +   +    YR+ A+H  +    
Sbjct: 517 FTSVEEETLLVRPKDIYDGVKPAGNSTAVLNLIRLSQLTGRTD---YRECAQHVFSALAL 573

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVG------------HKSSVDFENMLAAAHAS 645
            +         +  A     +  ++ ++L              HK  +   ++LA     
Sbjct: 574 EVASQPTGFASLLSAYVRTWLEPKELIMLTDSLETIGPFLADLHKRRLPELSVLAGK--- 630

Query: 646 YDLNKTVIHIDP--ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
               +T++ + P  AD + +D                    +  A +CQ+F C  P T+ 
Sbjct: 631 ---KETLLKVAPFIADYDLID-------------------SRPTAYLCQDFQCERPTTNL 668

Query: 704 ISLENLLLE 712
             L + ++E
Sbjct: 669 SELLHQIIE 677


>gi|255306584|ref|ZP_05350755.1| hypothetical protein CdifA_08327 [Clostridium difficile ATCC 43255]
          Length = 678

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 237/698 (33%), Positives = 364/698 (52%), Gaps = 83/698 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++
Sbjct: 54  TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +   
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
              +   L  E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D           KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L 
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             +LDA+ +TK   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
            +   E+ ++LGE   I F  ++ +  +GN            F+GK++  LI+       
Sbjct: 334 TFNPLEIIEVLGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK------- 374

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                   E++   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y+E +    +FI  +L +E + RL   +R+G S    +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FLI   ++LYE     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV + NL+RLA I   ++ +   + +   L ++   +K           +  M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589

Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             + S K ++ +  + S  + F+ +++             +  P  T     + E N+  
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
             +       DK+   VCQ+ SCS P+ D   L++++L
Sbjct: 638 GFLNNYRLKDDKISYYVCQSNSCSQPINDLQKLKDMIL 675


>gi|384161675|ref|YP_005543748.1| YyaL [Bacillus amyloliquefaciens TA208]
 gi|328555763|gb|AEB26255.1| YyaL [Bacillus amyloliquefaciens TA208]
          Length = 689

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 246/696 (35%), Positives = 357/696 (51%), Gaps = 70/696 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+L+  +  
Sbjct: 165 AAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLLFLLRYY 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            + Y +A+ +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +
Sbjct: 278 LSAYTEAYQVTNNERYKQIATQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSA 374
           Y+W+ KE+ ++LG+    L+ + Y +   GN            F+G+N+  LI      A
Sbjct: 331 YIWSKKEIMNLLGDQLGSLYCKVYNITEQGN------------FEGENIPNLI-FTRREA 377

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
              + G+   +    L   R+KL + R  R  PH DDKV+ SWN L+I+  A+A+K+   
Sbjct: 378 ILEETGLTEHELTERLEGARKKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKVFHE 437

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                             ++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDY
Sbjct: 438 PG----------------FLSMAETAIRFLERHLIPDG--RVMVRYREGEVKNKGFIDDY 479

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           AFLI   L+LYE G    +L  A  L  +  +LF D   GG+F T  +  ++L+R KE +
Sbjct: 480 AFLIWAYLELYEAGFNPSYLKKAKTLCTSMLDLFWDERHGGFFFTGNDAETLLVREKEVY 539

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DGA PSGNS + + L+RL  +          + AE   +VF+  ++    +      +  
Sbjct: 540 DGAVPSGNSAAAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSV- 595

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           +  +  +K +V+ G K   D +  + A    +    T++  +          EE    + 
Sbjct: 596 LAHIMPQKEIVVFGSKDDPDRKWFIEALQEHFTPAYTILAAENP--------EELAGISD 647

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             A       K    +C+NF+C  P TD     N+L
Sbjct: 648 FAAGYEMIDGKTTVYICENFTCRRPTTDIDEAMNVL 683


>gi|67517751|ref|XP_658661.1| hypothetical protein AN1057.2 [Aspergillus nidulans FGSC A4]
 gi|40747019|gb|EAA66175.1| hypothetical protein AN1057.2 [Aspergillus nidulans FGSC A4]
 gi|259488639|tpe|CBF88239.1| TPA: DUF255 domain protein (AFU_orthologue; AFUA_1G12370)
           [Aspergillus nidulans FGSC A4]
          Length = 774

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 240/602 (39%), Positives = 331/602 (54%), Gaps = 37/602 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA +LN+ F+ IKVDREERPDVD +YM YVQA  G GGWPL+
Sbjct: 66  SACHWCHVMEKESFMSQEVASILNESFIPIKVDREERPDVDDIYMNYVQATTGSGGWPLN 125

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPG-----FKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+P  +     G     F  IL K++D W  +R    +S     +Q
Sbjct: 126 VFLTPDLEPVFGGTYWPGPNAASLLGPETVSFIEILEKLRDVWQTQRQRCLESAKEITKQ 185

Query: 135 L---SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
           L   +E  + +   ++  ++L    L    +  +  YD   GGF  APKFP P  +  +L
Sbjct: 186 LREFAEEGTHTFQGDQSDEDLDVELLEEAYQHFASRYDINNGGFSRAPKFPTPANLSFLL 245

Query: 192 ---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
               +   + D     E      M + TL  MA+GGI DH+G GF RYSV   W +PHFE
Sbjct: 246 RLGIYPSAVTDIVGQEECENATAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHFE 305

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETE 307
           KMLYDQ QL +VY DAF +T +  +     D++ YL    I    G   S+EDADS  T 
Sbjct: 306 KMLYDQAQLLDVYADAFKITHNPEFLGAVYDLITYLTSAPIQSTTGGFHSSEDADSLPTP 365

Query: 308 GATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
             T K+EGAFYVWT KE+  +LG   A +   H+ +   GN  ++  +DPH+EF  +NVL
Sbjct: 366 NDTEKREGAFYVWTLKELTQVLGPRDAGVCARHWGVLSDGN--IAPENDPHDEFMDQNVL 423

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSF 425
                 S  A + G+  ++ + I+   R++L + R K R RP LDDK+IV+WNGL I + 
Sbjct: 424 SIKVTPSKLAKEFGLGEDEVVRIIKSGRQRLREYRDKNRVRPDLDDKIIVAWNGLAIGAL 483

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP- 484
           A+ S +L  E +S         S   +  E A  A +FI+  LYD+ T +L   +R+G  
Sbjct: 484 AKCS-VLFEEIDS---------SKSAQCREAAAKAINFIKETLYDKATGQLWRIYRDGSK 533

Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNTTG 541
              PGF +DYAFL SGLLD+YE      +L +A +LQ   +E FL   G    GY+ T  
Sbjct: 534 GTTPGFAEDYAFLTSGLLDMYEATFDDSYLQFAEQLQRYLNENFLAYAGSSPAGYYTTPS 593

Query: 542 E----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
                 P+ LLR+K   + A PS N V   NL+RL+SI+   + + YR  A  +   F  
Sbjct: 594 TSAPGSPATLLRLKTGTESAVPSVNGVIARNLLRLSSIL---EENSYRVLARQTCQSFAV 650

Query: 598 RL 599
            +
Sbjct: 651 EI 652


>gi|153953760|ref|YP_001394525.1| hypothetical protein CKL_1135 [Clostridium kluyveri DSM 555]
 gi|219854377|ref|YP_002471499.1| hypothetical protein CKR_1034 [Clostridium kluyveri NBRC 12016]
 gi|146346641|gb|EDK33177.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
 gi|219568101|dbj|BAH06085.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
          Length = 633

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 231/697 (33%), Positives = 367/697 (52%), Gaps = 74/697 (10%)

Query: 17  FLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGW 76
           +LI TCHWCHVM  ESF+D  VA++LN +F+S+KVDREERPDVD +YM   Q++ G GGW
Sbjct: 4   YLICTCHWCHVMAKESFQDNEVAEILNKYFISVKVDREERPDVDSIYMKVCQSITGSGGW 63

Query: 77  PLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           PL++ ++P+ KP   GTYFP  +     G   IL  ++ AW   +  L + G  ++  + 
Sbjct: 64  PLTIIMTPEQKPFFAGTYFPKNNVGEALGLIAILEYIQKAWKDNKAQLLKEGD-SLLDII 122

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             L+ ++S      EL Q+ L+    +  +++D+ +GGFG  PKFP    +  +L +  K
Sbjct: 123 NTLNKNSSG-----ELSQDILKKAFLEFKQNFDTLYGGFGGYPKFPSAHNLLFLLRYFHK 177

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            +D       +   +MV  TL+ M +GG++DH+G GF RYSVD +W +PHFEKMLYD   
Sbjct: 178 TKD-------AFALEMVEKTLESMYRGGMYDHIGYGFSRYSVDRKWLIPHFEKMLYDNAL 230

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           +A  YL+ F +T +  Y+ +  +I +Y+ RDM    G  +SAEDADS   EG    +EG 
Sbjct: 231 IAMAYLETFQVTGNKKYAKVAEEIFEYVLRDMTSKEGGFYSAEDADS---EG----EEGK 283

Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FY+W+ +E++DILG E    F  ++ +   GN            F+GKN+   + +S   
Sbjct: 284 FYMWSQEEIKDILGQEQGSKFCCYFNVTSQGN------------FRGKNIPNLIGNS--- 328

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                  LE+ +  +  CR KLF  R KR  PH DDK++ SWNGL+I++ A A ++L   
Sbjct: 329 ------ILEEDVQFIKNCREKLFKYREKRVHPHKDDKILTSWNGLMIAAMALAGRVL--- 379

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        +  +Y   A+ +  FI ++L   +  RL   +R G S   G+ DDYA
Sbjct: 380 -------------NNSKYTLAAKKSVDFIYKNLI-RKDGRLLARYREGDSSFLGYADDYA 425

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FLI GL++LYE     ++L  A+EL     E+F D E GG+F    +   +++R KE +D
Sbjct: 426 FLIWGLIELYETTYNPEYLKNALELNQNFLEIFWDSENGGFFLYGKDSEKLIIRPKEIYD 485

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G  P GNS + +NL+RL+ +    +   +    +     F   ++   ++      A   
Sbjct: 486 GPTPCGNSAAALNLLRLSYLATSYE---FEDKVKQLFENFADEIESSPISCSFSLVALLF 542

Query: 616 LSVPSRKHVVLVGH--KSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
              P R+ ++  G     +    +M+   ++ + ++    H++          +E  +  
Sbjct: 543 SKYPVRQIIISAGENINEARKVLDMINKKYSPFTVSVLYSHLN----------KELKNIC 592

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            S+ +      KV   VC+NF+C  P+T+   L+ +L
Sbjct: 593 PSIEQYIAIRGKVTVYVCENFTCKEPITNMDLLKEVL 629


>gi|425767540|gb|EKV06109.1| hypothetical protein PDIG_78870 [Penicillium digitatum PHI26]
 gi|425780454|gb|EKV18461.1| hypothetical protein PDIP_27280 [Penicillium digitatum Pd1]
          Length = 752

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 244/605 (40%), Positives = 330/605 (54%), Gaps = 40/605 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN+ FV IKVDREERPD+D +YM YVQA  G GGWPL+
Sbjct: 32  SACHWCHVMEKESFMSSEVASILNESFVPIKVDREERPDIDDIYMNYVQATTGSGGWPLN 91

Query: 80  VFLSPDLKPLMGGTYF--PPEDKYGRP---GFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+  P    +  P   GF  IL K++D W  ++     S     +Q
Sbjct: 92  VFLTPDLEPVFGGTYWQGPNSTTFTGPEAIGFVEILEKLRDVWQTQQQRCLDSAKEITKQ 151

Query: 135 LSEALSASASSNK------LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 188
           L E       S +        +++    L    +  +  YDS  GGFG APKFP P  + 
Sbjct: 152 LREFAEEGTHSQQGDRDDDNDEDMDIELLEEAYQHFASRYDSVNGGFGRAPKFPTPSNLS 211

Query: 189 MMLY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
            +L    +  ++ D     E  +   M + TL  MA+GGI DH+G GF RYSV   W +P
Sbjct: 212 FLLRLGAYPTQVMDVVGHDECEQATAMAVTTLVNMARGGIRDHIGHGFARYSVTTDWGLP 271

Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSA 304
           HFEKMLYDQ QL +VY+DAF LT D        D+  YL    I  P G  FS+EDADS 
Sbjct: 272 HFEKMLYDQAQLLDVYVDAFRLTHDPELLGAVYDLAAYLTSAPIQSPTGGFFSSEDADSY 331

Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
                T K+EGAFYVW+ KE+  +LG   A +  +H+ + P GN  +    DPH+EF  +
Sbjct: 332 PHPNDTEKREGAFYVWSLKELTSVLGPRDAPVCAKHWGVLPDGN--VPPEYDPHDEFMNQ 389

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVI 422
           NVL      S  A   G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I
Sbjct: 390 NVLSIRATPSKLAKDFGLSEEEVVKIIKSSKQKLHDYRERSRGRPDLDDKIIVAWNGLAI 449

Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 482
            + A+ S +L  E ES+   +           E A  A SFI+  L+D+ T +L   +R 
Sbjct: 450 GALAKCS-VLFEEIESSKAVY---------CREAAARAISFIKDKLFDKTTGQLWRIYRG 499

Query: 483 GP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG---GYFN 538
           G     PGF DDYA+L SGLLD+Y+      +L +A  LQ   +E FL + G    GY++
Sbjct: 500 GNRGDTPGFADDYAYLASGLLDMYDATYDDSYLQFAERLQKYLNEYFLAQSGSTATGYYS 559

Query: 539 T----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           T    T   P  LLR+K   + A PS N V   NL+RL++++   + + YR  A  +   
Sbjct: 560 TPSVITPGMPGPLLRLKTGTESATPSVNGVIARNLLRLSALL---EDESYRTLARQTCNT 616

Query: 595 FETRL 599
           F   +
Sbjct: 617 FAVEI 621


>gi|340345243|ref|ZP_08668375.1| Thioredoxin [Candidatus Nitrosoarchaeum koreensis MY1]
 gi|339520384|gb|EGP94107.1| Thioredoxin [Candidatus Nitrosoarchaeum koreensis MY1]
          Length = 675

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 220/561 (39%), Positives = 319/561 (56%), Gaps = 49/561 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE++ VAK +N+ FV+IKVDREERPD+D +Y    Q   G GGWPLS
Sbjct: 49  SACHWCHVMAHESFENDEVAKFMNENFVNIKVDREERPDLDDIYQKVCQIATGQGGWPLS 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+PD KP   GTYFP  D YGRPGF +I R++  AW +K   + +S    +  L +A 
Sbjct: 109 IFLTPDQKPFYVGTYFPVLDSYGRPGFGSITRQLAQAWKEKPKDIEKSADNFLSALQKAE 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +      K+P +L +  L   A  L +  D+ +GGFGSAPKFP    +  +  ++K    
Sbjct: 169 TV-----KIPSKLEKVILDEAAMNLFQLGDAAYGGFGSAPKFPNAANVSFLFRYAKL--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG     S+  +  L TL  MAKGGI D +GGGFHRYS D +W VPHFEKMLYD   +  
Sbjct: 221 TG----LSKFNEFALKTLNKMAKGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPV 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T+D FY  +    L ++ R+M    G  +SA DADS   EG     EG FYV
Sbjct: 277 NYAEAYQITQDQFYLEVLHKTLGFVLREMTSKEGGFYSAYDADS---EGV----EGKFYV 329

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W   E+++ILG+ A +F  +Y +   GN            ++G ++L    + SA A   
Sbjct: 330 WKKSEIKEILGDDAEIFCLYYDVTDGGN------------WEGNSILCNNINISAVAFHF 377

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           GMP EK   IL  C  KL +VRSKR  P LDDKV+ SWN L+I++FA+  ++        
Sbjct: 378 GMPEEKIKEILVRCSEKLLNVRSKRVPPGLDDKVLTSWNALMITAFAKGYRV-------- 429

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                   +   +Y++ A++  SFI   L D+   +L  +++N  +K  G+L+DY++  +
Sbjct: 430 --------TGETKYLDAAKNCVSFIETKLLDDT--KLLRTYKNNVAKIDGYLEDYSYFAN 479

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LLD++E     K+L  A++L +   + F D E   +F T+ +   +++R K ++D + P
Sbjct: 480 ALLDVFEIEPEAKYLNLAVKLGHHLVDHFWDPESSSFFMTSDDHEKLIIRPKSNYDLSLP 539

Query: 560 SGNSVSVINLVRLASIVAGSK 580
           SGNSVS   ++RL  +    K
Sbjct: 540 SGNSVSCFVMLRLYHLTQEEK 560


>gi|384177739|ref|YP_005559124.1| hypothetical protein I33_4252 [Bacillus subtilis subsp. subtilis
           str. RO-NN-1]
 gi|349596963|gb|AEP93150.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
           str. RO-NN-1]
          Length = 689

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 241/691 (34%), Positives = 361/691 (52%), Gaps = 75/691 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A +        L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +
Sbjct: 173 AAKSGEG-----LSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+        K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L  
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+ +E+   LG+    L+ + Y +   GN            F+GKN+   ++       +
Sbjct: 334 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKWEQIKE 381

Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                EK L++ L + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +    
Sbjct: 382 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 437

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         +Y+ +A+ A +FI   L  +   R+   +R G  K  GF+DDYAFL
Sbjct: 438 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRGGEVKNKGFIDDYAFL 483

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +   LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA
Sbjct: 484 LWAYLDLYEASFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 543

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+ + L+RL   V G  S    + AE   +VF+  +            +     
Sbjct: 544 VPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKLDIDAYPSGHAFFMQSVLRHL 600

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +P +K +V+ G       + ++     ++  N +++              EH      +A
Sbjct: 601 MP-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA 647

Query: 678 RNNFSAD------KVVALVCQNFSCSPPVTD 702
              F+AD      K    +C+NF+C  P T+
Sbjct: 648 --PFAADYRIIDGKTTVYICENFACQQPTTN 676


>gi|320031949|gb|EFW13906.1| DUF255 domain-containing protein [Coccidioides posadasii str.
           Silveira]
          Length = 799

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 243/611 (39%), Positives = 335/611 (54%), Gaps = 49/611 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN  FV IK+DREERPD+D+VYM YVQA+ G GGWPL+
Sbjct: 69  SACHWCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLN 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
           VFL+PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S    
Sbjct: 129 VFLTPDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEI 188

Query: 132 IEQLSEALSASASSNKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
             QL E  +   +  + P     ++L    L    +     YD   GGF  APKFP P  
Sbjct: 189 TRQLRE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPAN 247

Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
           +  +L    Y    ++  G+  E +   +MV  TL  MA+GGIHD +G GF RYSV   W
Sbjct: 248 LSFLLRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDW 306

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
            +PHFEKMLYDQ QL +VY+D F +T++        DI+ Y+    ++ P G   S+EDA
Sbjct: 307 SLPHFEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDA 366

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
           DS      T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R +DPH+EF
Sbjct: 367 DSFPNSNDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGNDPHDEF 424

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
             +NVL         A   G+  ++ + ++   R+KL + R + R RP LDDK+IVSWNG
Sbjct: 425 INQNVLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNG 484

Query: 420 LVISSFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           L I + A+ S +L K +AE A                VAE AA FIR +L+D +T +L  
Sbjct: 485 LAIGALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWR 533

Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
            +R+G   + PGF DDYA+L SGL+ LYE      +L +A  LQ   +  FL        
Sbjct: 534 VYRDGRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTT 593

Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
             GY+    N   + P  L R+K   D A PS N V   NL+RLAS++   + D Y+  A
Sbjct: 594 PAGYYMTPQNMPEDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALA 650

Query: 589 EHSLAVFETRL 599
            H+ + F   +
Sbjct: 651 RHTCSAFAAEM 661


>gi|255937427|ref|XP_002559740.1| Pc13g13260 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584360|emb|CAP92395.1| Pc13g13260 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 788

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 247/605 (40%), Positives = 329/605 (54%), Gaps = 40/605 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN+ FV IKVDREERPD+D VYM YVQA  G GGWPL+
Sbjct: 68  SACHWCHVMEKESFMSSEVASILNESFVPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 127

Query: 80  VFLSPDLKPLMGGTYF--PPEDKYGRP---GFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+P L+P+ GGTY+  P    +  P   GF  IL K++D W  ++     S     +Q
Sbjct: 128 VFLTPSLEPVFGGTYWQGPNSTTFRGPEAIGFVEILEKLRDVWQTQQQRCLDSAKEITKQ 187

Query: 135 LSEALSASASS------NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 188
           L E       +      N   +E+    L    +  +  YDS  GGFG APKFP P  + 
Sbjct: 188 LREFAEEGTHTQQGDRDNDKDEEMDIELLEEAYQHFASRYDSVNGGFGRAPKFPTPSNLS 247

Query: 189 MMLY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
            +L    +  ++ D     E  +   M + TL  MA+GGI DH+G GF RYSV   W +P
Sbjct: 248 FLLRLGAYPTQVMDVVGHDECEQATAMAVTTLVNMARGGIRDHIGHGFARYSVTADWGLP 307

Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSA 304
           HFEKMLYDQ QL +VY+DAF LT D        D+  YL    I  P G  FS+EDADS 
Sbjct: 308 HFEKMLYDQAQLLDVYVDAFRLTHDPELLGAVYDLSAYLTSAPIQSPTGGFFSSEDADSY 367

Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
                T K+EGAFYVW+ KE+  +LG   A +  +H+ + P GN  +    DPH+EF  +
Sbjct: 368 PHPNDTEKREGAFYVWSLKELTSVLGPRDAPVCAKHWGVLPDGN--VPPEYDPHDEFMNQ 425

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVI 422
           NVL      S  A   G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I
Sbjct: 426 NVLSIRATPSKLAKDFGLSEEEVVKIIKSSKQKLHDHREQTRGRPDLDDKIIVAWNGLAI 485

Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 482
            + A+ S +L  E ES         S      E A  A  FI+  L+D+ T +L   +R+
Sbjct: 486 GALAKCS-VLFEEIES---------SKAVHCREAAARAIGFIKDKLFDKATGQLWRIYRD 535

Query: 483 GP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFN 538
           G     PGF DDYA+L SGLLD+Y+      +L +A  LQ   +E FL + G    GY++
Sbjct: 536 GNRGDTPGFADDYAYLASGLLDMYDATYDDSYLQFAERLQKYLNEYFLAQSGSTAAGYYS 595

Query: 539 ----TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
               TT   P  LLR+K   + A PS N V   NL+RL++++ G +S  YR  A  +   
Sbjct: 596 TPSVTTPGMPGPLLRLKTGTESATPSVNGVIARNLLRLSALL-GDES--YRTLARQTCNT 652

Query: 595 FETRL 599
           F   +
Sbjct: 653 FAVEI 657


>gi|126699171|ref|YP_001088068.1| hypothetical protein CD630_15680 [Clostridium difficile 630]
 gi|115250608|emb|CAJ68432.1| conserved hypothetical protein [Clostridium difficile 630]
          Length = 678

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 236/698 (33%), Positives = 364/698 (52%), Gaps = 83/698 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++
Sbjct: 54  TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +   
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
              +   L  ++  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +
Sbjct: 174 VKNTEGDLSKDMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D           KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L 
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             +LDA+ +TK   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
            +   E+ ++LGE   I F  ++ +  +GN            F+GK++  LI+       
Sbjct: 334 TFNPLEIIEVLGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK------- 374

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                   E++   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y+E +    +FI  +L +E + RL   +R+G S    +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FLI   ++LYE     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV + NL+RLA I   ++ +   + +   L ++   +K           +  M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589

Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             + S K ++ +  + S  + F+ +++             +  P  T     + E N+  
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
             +       DK+   VCQ+ SCS P+ D   L++++L
Sbjct: 638 GFLNNYRLKDDKISYYVCQSNSCSQPINDLQKLKDMIL 675


>gi|303320203|ref|XP_003070101.1| hypothetical protein CPC735_032920 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240109787|gb|EER27956.1| hypothetical protein CPC735_032920 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 799

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 243/611 (39%), Positives = 335/611 (54%), Gaps = 49/611 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN  FV IK+DREERPD+D+VYM YVQA+ G GGWPL+
Sbjct: 69  SACHWCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLN 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
           VFL+PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S    
Sbjct: 129 VFLTPDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEI 188

Query: 132 IEQLSEALSASASSNKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
             QL E  +   +  + P     ++L    L    +     YD   GGF  APKFP P  
Sbjct: 189 TRQLRE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPAN 247

Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
           +  +L    Y    ++  G+  E +   +MV  TL  MA+GGIHD +G GF RYSV   W
Sbjct: 248 LSFLLRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDW 306

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
            +PHFEKMLYDQ QL +VY+D F +T++        DI+ Y+    ++ P G   S+EDA
Sbjct: 307 SLPHFEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDA 366

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
           DS      T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R +DPH+EF
Sbjct: 367 DSFPNSNDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGNDPHDEF 424

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
             +NVL         A   G+  ++ + ++   R+KL + R + R RP LDDK+IVSWNG
Sbjct: 425 INQNVLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNG 484

Query: 420 LVISSFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           L I + A+ S +L K +AE A                VAE AA FIR +L+D +T +L  
Sbjct: 485 LAIGALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWR 533

Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
            +R+G   + PGF DDYA+L SGL+ LYE      +L +A  LQ   +  FL        
Sbjct: 534 VYRDGRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTT 593

Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
             GY+    N   + P  L R+K   D A PS N V   NL+RLAS++   + D Y+  A
Sbjct: 594 PAGYYMTPQNMPEDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALA 650

Query: 589 EHSLAVFETRL 599
            H+ + F   +
Sbjct: 651 RHTCSAFAAEM 661


>gi|86157370|ref|YP_464155.1| hypothetical protein Adeh_0943 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85773881|gb|ABC80718.1| protein of unknown function DUF255 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 718

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 248/633 (39%), Positives = 351/633 (55%), Gaps = 70/633 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +T R  FL    +TCHWCHVME ESFEDE +A++LN+ +V+IKVDREERPD
Sbjct: 64  GDEAFEEARRTGRPVFLSVGYSTCHWCHVMERESFEDEEIARVLNERYVAIKVDREERPD 123

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRP--GFKTILRKVKDA 116
           VD VYMT VQ L G GGWP+SV+L+PD +P  GGTYFPP D    P  G  +IL ++ D 
Sbjct: 124 VDAVYMTAVQLLTGSGGWPMSVWLTPDREPFFGGTYFPPRDGVRGPARGLLSILHEIADL 183

Query: 117 WDKKRDML-AQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGG 174
           W +  D + + +GA      +    A  ++  +P   P ++A+ L    L +S+D R GG
Sbjct: 184 WARDPDRIRSATGALVEAVRTALAPAGPAAADVPGPEPIEHAVTL----LERSFDERHGG 239

Query: 175 FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 234
              APKFP  V ++++L H +      ++GE     +M   TL+ MA GG+HD VGGGFH
Sbjct: 240 LRRAPKFPSNVPVRLLLRHHR------RTGE-ERSLRMATVTLERMAAGGLHDQVGGGFH 292

Query: 235 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 294
           RYS D +W VPHFEKMLYD   LA  Y +A+  T    ++ + R  LDYL R++  P G 
Sbjct: 293 RYSTDAQWLVPHFEKMLYDNALLAVAYAEAWQATGRRDFARVTRQTLDYLLRELTSPEGG 352

Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMS 354
           ++SA DADS   EG    +EG F+ WT  E+ + LG+ A  F   + ++P GN       
Sbjct: 353 LYSATDADS---EG----EEGRFFTWTEAELREALGDRAEAFLRFHGVRPEGN------- 398

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
                F+G+NVL            +  P E         R  L+ +R +RPRP  D+KV+
Sbjct: 399 -----FEGRNVL-----------HVPAPDEDAWESFAPDRAALYALRERRPRPLRDEKVL 442

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
             WNGL IS+ A   ++L SEA                +++ A  AA F+   +  +   
Sbjct: 443 AGWNGLAISALALGGRVL-SEA---------------RWVDAAARAADFVLTRMVKDG-- 484

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
           RLQ S+  G +  P +L+D+AFL+ GLLDL+E     +WL  A++L   QD LF D  GG
Sbjct: 485 RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEASFDPRWLRSALQLAEAQDRLFGDPAGG 544

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           G+F +  +   +L R K  HDGAEPSG SV+ +N +RL +  +  +   +R+ A+ +L  
Sbjct: 545 GWFQSATDHERLLAREKPTHDGAEPSGASVAALNALRLEAFTSDPR---WRRAADGALRH 601

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
               L +  +A+  +  A D  S   R+ VVLV
Sbjct: 602 HARTLAEQPLAMSELLLALDFASDAVRE-VVLV 633


>gi|423090012|ref|ZP_17078355.1| hypothetical protein HMPREF9945_01541 [Clostridium difficile
           70-100-2010]
 gi|357557317|gb|EHJ38868.1| hypothetical protein HMPREF9945_01541 [Clostridium difficile
           70-100-2010]
          Length = 678

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 236/698 (33%), Positives = 363/698 (52%), Gaps = 83/698 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++
Sbjct: 54  TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +   
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
              +   L  E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D           KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L 
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             +LDA+ +TK   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
            +   E+ ++LGE     F  ++ +  +GN            F+GK++  LI+       
Sbjct: 334 TFNPLEIIEVLGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK------- 374

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                   E++   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y+E +    +FI  +L +E + RL   +R+G S    +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FLI   ++LYE     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV + NL+RLA I   ++ +   + +   L ++   +K           +  M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589

Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             + S K ++ +  + S  + F+ +++             +  P  T     + E N+  
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
             +       DK+   VCQ+ SCS P+ D   L++++L
Sbjct: 638 GFLNNYRLKDDKISYYVCQSNSCSQPINDLQKLKDMIL 675


>gi|187778206|ref|ZP_02994679.1| hypothetical protein CLOSPO_01798 [Clostridium sporogenes ATCC
           15579]
 gi|187775134|gb|EDU38936.1| hypothetical protein CLOSPO_01798 [Clostridium sporogenes ATCC
           15579]
          Length = 683

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 237/687 (34%), Positives = 351/687 (51%), Gaps = 74/687 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA++LN+ F+SIKVDREERPD+D +YM + QA  G GGWPL+
Sbjct: 55  STCHWCHVMERESFEDEDVAEILNENFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLT 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTYFP   K+  PG   IL+ +   W + ++ + +S    +EQ+    
Sbjct: 115 ILMTPDKKPFFAGTYFPKWGKHNIPGIMDILKSINKLWREDKNKVLESSNRILEQIER-- 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
                 N   DEL +  +   A+ L  ++DS++GGFG+ PKFP    I  +L  Y+ KK 
Sbjct: 173 ---FQDNHGEDELEEYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKK- 228

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                     +   ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L
Sbjct: 229 --------DKKVLDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALL 280

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           +  Y +A+  TK+  Y  +   IL+Y+++ M    G  +SAEDADS   EG     EG F
Sbjct: 281 SMAYTEAYEATKNPLYKVVTEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKF 333

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
           Y+WT KE+ DILGE    F           C L  ++   N F+ KN+  LI+ +     
Sbjct: 334 YLWTKKEIMDILGEEDGAFY----------CKLYDITSRGN-FEKKNIANLIQTDLKDVD 382

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
            +K         + L   R KLF+ R KR  PH DDK++ SWN L+I +F RA +  K++
Sbjct: 383 NNK---------DKLERIREKLFEYREKRIHPHKDDKILTSWNALMIIAFCRAGRSFKND 433

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y+++A+ +A FI ++L DE+   L    R       GF+DDYA
Sbjct: 434 ----------------NYIDIAKQSADFIIKNLMDEKG-TLYARIREEERGNEGFIDDYA 476

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F +  L++LYE      +L  +IE+ ++  +LF  +E GG++  +     +++R KE +D
Sbjct: 477 FFLWALIELYEASFDIYYLEKSIEVADSMIDLFWHKEKGGFYLYSKNSEKLIVRPKEIYD 536

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGN+V+ + L  L  I      D Y+   +     F   +K   M   L    A M
Sbjct: 537 GAMPSGNAVASLALSLLYYITG---EDKYKNLVDKQFKFFAANIKSGPM-YHLFSVIAYM 592

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
            ++   + + L   +    F   +   +  Y     +   D ++  E          N +
Sbjct: 593 YNISPVQEITLAYSEKDEAFYEFINELNNRYIPFSIITLNDKSNKIE--------KINKN 644

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
           +       DK    +CQ+++C  P+ D
Sbjct: 645 LKDKTPIKDKTTVYICQDYACKEPIMD 671


>gi|254975197|ref|ZP_05271669.1| hypothetical protein CdifQC_07775 [Clostridium difficile QCD-66c26]
 gi|255092587|ref|ZP_05322065.1| hypothetical protein CdifC_07992 [Clostridium difficile CIP 107932]
 gi|255314324|ref|ZP_05355907.1| hypothetical protein CdifQCD-7_08235 [Clostridium difficile
           QCD-76w55]
 gi|255517004|ref|ZP_05384680.1| hypothetical protein CdifQCD-_07809 [Clostridium difficile
           QCD-97b34]
 gi|255650105|ref|ZP_05397007.1| hypothetical protein CdifQCD_07959 [Clostridium difficile
           QCD-37x79]
 gi|260683234|ref|YP_003214519.1| hypothetical protein CD196_1491 [Clostridium difficile CD196]
 gi|260686830|ref|YP_003217963.1| hypothetical protein CDR20291_1466 [Clostridium difficile R20291]
 gi|306520110|ref|ZP_07406457.1| hypothetical protein CdifQ_08874 [Clostridium difficile QCD-32g58]
 gi|384360839|ref|YP_006198691.1| hypothetical protein CDBI1_07695 [Clostridium difficile BI1]
 gi|260209397|emb|CBA62859.1| conserved hypothetical protein [Clostridium difficile CD196]
 gi|260212846|emb|CBE04045.1| conserved hypothetical protein [Clostridium difficile R20291]
          Length = 678

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 236/698 (33%), Positives = 363/698 (52%), Gaps = 83/698 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++
Sbjct: 54  TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   +Y RPG   +L+ V + W+  RD+L +SG   IE L +   
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLKNVSEKWNTSRDILIKSGDEIIEALKDDFG 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
              +   L  E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D           KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L 
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             +LDA+ +TK   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
            +   E+ ++LGE     F  ++ +  +GN            F+GK++  LI+       
Sbjct: 334 TFNPLEIIEVLGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK------- 374

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                   E++   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y+E +    +FI  +L +E + RL   +R+G S    +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FLI   ++LYE     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV + NL+RLA I   ++ +   + +   L ++   +K           +  M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589

Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             + S K ++ +  + S  + F+ +++             +  P  T     + E N+  
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
             +       DK    VCQ+ SCS P+ D   L++++L
Sbjct: 638 GFLNNYRLKDDKTSYYVCQSNSCSQPINDLQKLKDMIL 675


>gi|189218169|ref|YP_001938811.1| Highly conserved protein containing a thioredoxin domain
           [Methylacidiphilum infernorum V4]
 gi|189185027|gb|ACD82212.1| Highly conserved protein containing a thioredoxin domain
           [Methylacidiphilum infernorum V4]
          Length = 724

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 232/645 (35%), Positives = 351/645 (54%), Gaps = 34/645 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFE+  VA+LLN +F+ IKVDREERPD+D+ YM +VQA  G GGWP++
Sbjct: 47  STCHWCHVMAKESFENPIVAQLLNSFFIPIKVDREERPDIDQFYMEFVQAFTGQGGWPMN 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+L+P  GGTYFP E K+G+PGF  IL+K+ + W   R +L Q G     ++ E +
Sbjct: 107 VWLTPNLEPFFGGTYFPLESKWGKPGFVDILKKIAELWQYNRSLLEQQGQEIFHKMREVI 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            +S      P+     A R   EQL  S+D   GGF  +PKFPRP  +   L+ +  L D
Sbjct: 167 QSSFEPKSPPNL--AIASRKAVEQLWGSFDRTHGGFSPSPKFPRP-SLFYFLFRAGSLAD 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             +  +    Q M L++LQ M+ GGIHD + GGFHRYSVDE+W +PHFEKMLYDQ  L  
Sbjct: 224 FSEDYKKKSLQ-MALYSLQKMSGGGIHDQLEGGFHRYSVDEKWRLPHFEKMLYDQATLGL 282

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YLDA+  T D  +      +++YL   +  P G  +SAEDADS    G  +++EGA+Y+
Sbjct: 283 SYLDAYQATDDPLFKDTFESLVEYLLSHLHHPSGGFYSAEDADSLNASG--QEEEGAYYL 340

Query: 320 WTSKE----VEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           WT +E    +E I+G+       H++     GN     +S+       KN+L+     S 
Sbjct: 341 WTFQELQQTLEPIVGKDRSKILAHFFGATEQGNLPGGLISE--EALAKKNILLMEKPLSD 398

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A +LG+ LE+   I+ + +  L   R KR +P LDDK+I +WNG  +S+ A+A      
Sbjct: 399 LAHELGISLEEAREIVLKAKEGLKKERLKRSKPFLDDKIICAWNGYTLSALAKA------ 452

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                   + V+G  R   +  A+  A+F+  +L+D  +  L   +RNG    PGF  DY
Sbjct: 453 --------YMVIGDGR--LINEAKKTATFLLENLWDPSSKTLYRIYRNG-RGTPGFSSDY 501

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A L   +L L+E     KWL  A   Q   +E F+D     Y     E  +  ++ +E++
Sbjct: 502 ASLALSMLHLFEADQDEKWLSLAKLFQELLEEKFVDPYRHNYMVEAVEISAKSIQTREEY 561

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DGAEP+  S++  +L++L ++    K   +R+  E   +     L+    A+P +     
Sbjct: 562 DGAEPATLSLAAHSLLKLYTLTGEEK---WRKRLEELFSYAWPILERFPTALPYLLGVYC 618

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
               P  + ++LVG K + + + +  +       N+ ++ +DP +
Sbjct: 619 EYRAPLVE-IILVGEKKNEETKRLFHSLSKLLIPNRLLVVLDPQE 662


>gi|394994118|ref|ZP_10386849.1| YyaL, partial [Bacillus sp. 916]
 gi|393805058|gb|EJD66446.1| YyaL, partial [Bacillus sp. 916]
          Length = 607

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 240/637 (37%), Positives = 346/637 (54%), Gaps = 58/637 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+
Sbjct: 23  STCHWCHVMAHESFEDEEIADMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 82

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD KP   GTYFP   KY RPGF  +L  + + +   R          +E ++E  
Sbjct: 83  VFVTPDQKPFYAGTYFPKTSKYNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 134

Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A       P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +  
Sbjct: 135 AAHLEVKVHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 191

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TGK  +A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L
Sbjct: 192 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 247

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y +A+ +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +
Sbjct: 248 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 300

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+ KE+ ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + 
Sbjct: 301 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 356

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +L   LE       E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+     
Sbjct: 357 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 404

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F+ P       +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAF
Sbjct: 405 ----FHEP-------DFLSMAETAIRFLERHLMPDA--RVMVRYREGEVKNKGFIDDYAF 451

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI   L+LYE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DG
Sbjct: 452 LIWAYLELYEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 511

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS + + L+RL  +  G  S    + AE   +VF+  ++    +      +    
Sbjct: 512 AVPSGNSAAAVQLLRLGRLT-GDIS--LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 568

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
           ++P +K +V+ G K   D +  + A    +    T++
Sbjct: 569 TMP-QKEIVVFGRKDDPDRKRFIEALQEHFTPAYTIL 604


>gi|255100682|ref|ZP_05329659.1| hypothetical protein CdifQCD-6_07712 [Clostridium difficile
           QCD-63q42]
          Length = 678

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 237/698 (33%), Positives = 362/698 (51%), Gaps = 83/698 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++
Sbjct: 54  TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +   
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
              +   L  E+  +++R+        YD  +GGFG+APKFP P  +  ++  Y  +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDENYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D           KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L 
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             +LDA+ +TK   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
            +   E+ ++LGE   I F  ++ +  +GN            F+GK++  LI+       
Sbjct: 334 TFNPLEIIEVLGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK------- 374

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                   E++   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y+E +    +FI  +L +E + RL   +R+G S    +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FLI   ++LYE     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV + NL+RLA I   ++ +   + +   L ++   +K           +  M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589

Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             + S K ++ +  + S  + F+ +++             +  P  T     + E N+  
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
             +       DK    VCQ+ SCS P+ D   L++++L
Sbjct: 638 GFLNNYILKDDKTSYYVCQSNSCSQPINDLQKLKDMIL 675


>gi|115491785|ref|XP_001210520.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114197380|gb|EAU39080.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 787

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 236/569 (41%), Positives = 323/569 (56%), Gaps = 37/569 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA +LN+ F+ IKVDREERPD+D VYM YVQA  G GGWPL+
Sbjct: 70  SACHWCHVMEKESFMSQEVASILNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 129

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKT-----ILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+P  +    PG +T     IL K++D W  ++    +S     +Q
Sbjct: 130 VFLTPDLEPVFGGTYWPGPNATTNPGHETIGFVDILEKLRDVWQTQQQRCRESAKDITKQ 189

Query: 135 L---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L   +E  + S   ++  DE L    L    +     YD+  GGF  APKFP P  +  +
Sbjct: 190 LREFAEEGTHSYQGDRAADEDLDIELLEEAYQHFVSRYDTAHGGFSKAPKFPTPANLSFL 249

Query: 191 L----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           L    Y S  ++  GK  E      M + TL  MA+GGIHDH+G GF RYSV   W +PH
Sbjct: 250 LRLGVYPSAVVDVVGKE-ECENATAMAVNTLINMARGGIHDHIGHGFARYSVTADWGLPH 308

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 305
           FEKMLYDQ QL +VY+DAF +T +        D++ YL    +    G   S+EDADS  
Sbjct: 309 FEKMLYDQAQLLDVYIDAFKITHNPELLGAVYDLVTYLTTAPLQSSTGAFHSSEDADSLP 368

Query: 306 TEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
               T K+EGAFYVWT KE+  +LG   A +   H+ + P GN  +S  +DPH+EF  +N
Sbjct: 369 MPNDTEKREGAFYVWTLKELTQVLGSRDAGVCARHWGVLPDGN--ISPANDPHDEFMNQN 426

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVIS 423
           VL      S  A + G+  ++ + IL   ++KL + R K R RP LDDK+IV+WNGL I 
Sbjct: 427 VLSIKVTPSKLAREFGLGEDEVVRILRSAKQKLREYREKNRVRPDLDDKIIVAWNGLAIG 486

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
           + A+AS +   + +S+M +         +  E A  A SFI+  L+++ T +L   +R+G
Sbjct: 487 ALAKASALF-DQIDSSMAS---------KCREAAARAVSFIKETLFEKSTGQLWRIYRDG 536

Query: 484 P-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT 539
                PGF DDYA+L SGLL++YE      +L +A +LQ   +E FL   G    GY++T
Sbjct: 537 SRGDTPGFADDYAYLTSGLLEMYEATFDDSYLQFAEQLQKYLNEKFLAYVGSTPAGYYST 596

Query: 540 ----TGEDPSVLLRVKEDHDGAEPSGNSV 564
               T   P  LLR+K   + A PS N V
Sbjct: 597 PSTMTPGMPGPLLRLKTGTESATPSINGV 625


>gi|404493392|ref|YP_006717498.1| thioredoxin domain-containing protein YyaL [Pelobacter carbinolicus
           DSM 2380]
 gi|77545446|gb|ABA89008.1| thioredoxin domain protein YyaL [Pelobacter carbinolicus DSM 2380]
          Length = 711

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 243/684 (35%), Positives = 352/684 (51%), Gaps = 61/684 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  VA++LN  F+ IKVDREERPD+D +YMT  Q + GGGGWPL+
Sbjct: 76  STCHWCHVMEQESFEDREVAEVLNKLFIPIKVDREERPDIDNLYMTACQLVTGGGGWPLN 135

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD  P    TY P   +   PG   IL K+   W   RD L Q+G    E L   +
Sbjct: 136 VFLTPDKAPFYAATYMPRRPRGQMPGIIAILTKIGAMWQSDRDQLLQTGREIGETL---I 192

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
              +S+  +   L +  L    E+   ++D   GGFG APKFP P  + ++ + +++   
Sbjct: 193 RLESSAAPVASSLTEAPLTEAFERFKANFDHERGGFGKAPKFPMPHNLSLLFHIAQRF-- 250

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G+ +  + M + TLQ +  GG++DH+G G HRYSVD  W VPHFEKMLYDQ  +  
Sbjct: 251 ----GQET-AEAMAIKTLQHIRLGGMYDHIGFGMHRYSVDAFWRVPHFEKMLYDQALVTL 305

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
             LDA+ +T D F+  +    + Y+ RD+  P G   S EDAD   TEGA    EG FY+
Sbjct: 306 AALDAYQVTHDTFFESLADQTMSYVLRDLSLPEGGFCSGEDAD---TEGA----EGTFYL 358

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT ++VE++LG + A +F   Y +   GN            F+G N+     D    A  
Sbjct: 359 WTPQQVEEVLGHQQATIFCTCYEISEAGN------------FEGSNIPRLEMDLKEWAQW 406

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G   ++   +L + RRKL   R  R RPH DDKV+V+WNGL I++ AR ++++      
Sbjct: 407 FGTDTDELGAVLEDGRRKLLQARKLRVRPHRDDKVLVAWNGLAIAAMARTARLI------ 460

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                        EY+E A  AA FI  ++ +E+   L+   R   +  P FL+DYA LI
Sbjct: 461 ----------GHPEYLEGATRAADFILSNMRNEEGRLLRRWRRG-QAGIPAFLEDYAALI 509

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++LY+ G   ++L  A++L     E F     G Y++T  +   VL+R +  HDGA 
Sbjct: 510 LGLIELYQAGFNARYLAEAVQLGRDMQERF-GTPDGVYYDTGTDAEEVLVRKRTLHDGAM 568

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
            SGNS++ + L+RL S+   +      ++AE  L     +  D   A   +  A D L++
Sbjct: 569 ISGNSMAAMALLRLGSL---TGEPALEEHAEKILLASSKQWTDAPTASGQLLMALD-LAL 624

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
             R+ +V+   K   +   M+ AAH  +  N  ++   P D           S    + R
Sbjct: 625 SQREVLVIAAPKDDPEGTRMVKAAHTGFRPNLIILWHTPDDNAL--------SEVTPLVR 676

Query: 679 -NNFSADKVVALVCQNFSCSPPVT 701
                  K  A +C+  +C  P T
Sbjct: 677 GKTMQNGKATAYLCRGQTCMAPAT 700


>gi|121701517|ref|XP_001269023.1| DUF255 domain protein [Aspergillus clavatus NRRL 1]
 gi|119397166|gb|EAW07597.1| DUF255 domain protein [Aspergillus clavatus NRRL 1]
          Length = 788

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 240/598 (40%), Positives = 329/598 (55%), Gaps = 35/598 (5%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHV+E ESF  + VA LLN+ F+ IKVDREERPD+D VYM YVQA  G GGWPLS
Sbjct: 69  SACHWCHVIEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLS 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP-----GFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+P  +          GF  IL K++D W  ++    +S      Q
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNSSTLSGPHTIGFVDILEKLRDVWKTQQQRCRESAKEITRQ 188

Query: 135 L---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L   +E  + S   ++  DE L    L    +  +  YD+  GGF  APKFP P  +  +
Sbjct: 189 LREFAEEGTHSQQGDREADEDLDIELLEEAYQHFASRYDAVNGGFSRAPKFPTPANLSFL 248

Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +   + D     E  +   M + TL  MA+GGI DH+G GF RYSV   W +PHF
Sbjct: 249 LRLKTYPSAVSDIVGQEECDKATTMAVSTLVSMARGGIRDHIGHGFARYSVTSDWSLPHF 308

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
           EKMLYDQ QL +VY+DAF +T +        D+  YL    I    G   S+EDADS   
Sbjct: 309 EKMLYDQAQLLDVYVDAFQITHNPELLGAVYDLATYLTTAPIQSSTGAFHSSEDADSLPA 368

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
              T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++   DPH+EF  +NV
Sbjct: 369 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQNV 426

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
           L      S  A + G+  E+ + I+   ++KL + R K R RP LDDK+IV+WNGL I +
Sbjct: 427 LSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKIIVAWNGLAIGA 486

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A+ S + + E ES         S   E  E A  A SFI+ +L+++ T +L   +R+G 
Sbjct: 487 LAKCSALFE-EIES---------SKAVECREAAARAISFIKENLFEKVTGQLWRIYRDGS 536

Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT- 539
               PGF DDYA+L  GLLD+YE      +L +A +LQ   +  FL   G    GY++T 
Sbjct: 537 RGDTPGFADDYAYLTQGLLDMYEATFEDSYLQFAEQLQRYLNRNFLAYIGSTPAGYYSTP 596

Query: 540 ---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
              T   P  LLR+K   + A PS N V   NL+RL++++   +     +   HS +V
Sbjct: 597 STMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALLEDEEYRTLARQTCHSFSV 654


>gi|258569036|ref|XP_002585262.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237906708|gb|EEP81109.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 818

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 240/611 (39%), Positives = 333/611 (54%), Gaps = 49/611 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+
Sbjct: 58  SACHWCHVMEKESFMSQEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLN 117

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
           VFL+PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S    
Sbjct: 118 VFLTPDLEPVFGGTYWPGPHSSSVPRLGGEEPITFVDILEKLRDVWNSQQLRCMESAKEI 177

Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCA-----EQLSKSYDSRFGGFGSAPKFPRPVE 186
             QL E  +   +  + PD   +  L +       +     YD   GGF  APKFP P  
Sbjct: 178 TRQLRE-FAEEGTHLRRPDSEGEEDLEVELLEEAYQHFVSRYDPVNGGFSRAPKFPTPAN 236

Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
           +  +L    Y    ++  G+  E +   +MV  TL  M +GGIHD +G GF RYSV   W
Sbjct: 237 LSFLLRLGRYPGAVMDIVGQE-ECARATEMVSKTLLQMVRGGIHDQIGHGFARYSVTADW 295

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
            +PHFEKMLYDQ QL +VY+D F  T+D        DI+ Y+    M+ P G   S+EDA
Sbjct: 296 SLPHFEKMLYDQAQLLDVYVDCFEATQDPELLGAVYDIVAYMTSPPMLSPEGAFHSSEDA 355

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
           DS  T   T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R  DPH+EF
Sbjct: 356 DSLPTPKDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGYDPHDEF 413

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNG 419
             +NVL         A  LG+  ++ + I+   R+KL + R ++R RP LDDKVIVSWNG
Sbjct: 414 INQNVLSIKATPRHIAKDLGLSEDEVVRIIKSSRKKLQEFRDTQRVRPDLDDKVIVSWNG 473

Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYM-EVAESAASFIRRHLYDEQTHRLQH 478
           L I + A+ S +L             +  D+ E+    A +AA+FI+  L+D  T +L  
Sbjct: 474 LAIGALAKCSVLLDR-----------IDPDKAEHCRRSAATAAAFIKEKLFDADTGQLWR 522

Query: 479 SFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
            +R+G   + PGF DDYA+L +GL+ LYE      +L +A +LQ   +  FL        
Sbjct: 523 VYRDGVRGETPGFGDDYAYLTAGLIQLYEATFDDSYLRFAEQLQKYMNTHFLAMAADGST 582

Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
             GY+    N  G+ P  L R+K   D A PS N V   NLVRL S++   + + Y   A
Sbjct: 583 PAGYYMTQENMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLVRLGSLL---EDESYSVLA 639

Query: 589 EHSLAVFETRL 599
           + + + F   +
Sbjct: 640 KQTCSAFAAEI 650


>gi|167043802|gb|ABZ08492.1| hypothetical protein ALOHA_HF4000APKG3D24ctg2g4 [uncultured marine
           crenarchaeote HF4000_APKG3D24]
          Length = 620

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 239/686 (34%), Positives = 364/686 (53%), Gaps = 69/686 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE +AK++N+ FV+IKVDREERPD+D +Y    Q   G GGWPLSVFL+P+ +
Sbjct: 1   MAHESFEDEEIAKIMNENFVNIKVDREERPDLDDIYQKVCQMSTGQGGWPLSVFLTPEQR 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFA--IEQLSEALSASAS 144
           P   GTYFP  D YGRPGF ++ R++  +W +K +D+   +  F   +++L +  + S  
Sbjct: 61  PFYVGTYFPAIDSYGRPGFGSLCRQMAQSWKEKPKDIEKAADNFMQNLDKLKQFPTPSEI 120

Query: 145 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
              + DE   N L++         D  +GGFG APKFP    +  M  +SK       SG
Sbjct: 121 DKSILDEAAINLLQIA--------DITYGGFGQAPKFPNASNLSFMFRYSKL------SG 166

Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
             S+ +K  L TL+ MAKGGI D +GGGFHRYS D RW VPHFEKMLYD   L  VY +A
Sbjct: 167 -ISKFEKFALLTLKKMAKGGIFDQIGGGFHRYSTDARWLVPHFEKMLYDNALLPIVYSEA 225

Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
           + +TKD F+  + R  LDY+ R+M    G  FSA+DAD+   EG T       +VW  +E
Sbjct: 226 YQITKDPFFENVVRKTLDYIIREMTSSDGMFFSAQDADTNGEEGQT-------FVWKKRE 278

Query: 325 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
           +E ILGE + +F  +Y +   GN            F+G  +L    ++S+   K G    
Sbjct: 279 IEKILGEDSEIFCIYYDVTDGGN------------FEGNTILANNINASSLGFKFGKSES 326

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
           +  NI+ +C  KL +VR+KR +P  DDKVI SWNGL+IS+F    +I             
Sbjct: 327 EIQNIILKCSDKLLEVRNKREQPGKDDKVITSWNGLMISAFLSGYQI------------- 373

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
              +D  +Y+++A+ +  F   +   ++ H L  +F+NG  K  G+LDDYA++ +  +D+
Sbjct: 374 ---TDNSKYLDMAKKSIDFFESNF--KENHILHRTFKNGEPKLNGYLDDYAYMANASIDM 428

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
           +E  S  K+L++A  L N     F D    G+F T+     +++R K ++D + PSGNSV
Sbjct: 429 FENTSDPKYLLFATNLANYLVTHFWDDSTHGFFFTSDNHEKLIIRPKNNYDLSMPSGNSV 488

Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
           +   L++L  I         +Q  E +  + E++    A   P        +     +  
Sbjct: 489 AACVLLKLYHITQD------KQFLEIAKKIIESQAT-AAAENPFAFGYLLNVLYLYYQKP 541

Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD 684
             +   +  +FE ++++    +     ++ +  A+   +D   ++    A  +   F  D
Sbjct: 542 TEITIINDKNFE-LVSSLRKKFLPESIMVLV--ANKNNLDALSKY----AFFSGKEFQDD 594

Query: 685 KVVALVCQNFSCSPPVTDPISLENLL 710
           K   +VC+NFSCS P++D   +E  L
Sbjct: 595 KTNVIVCKNFSCSLPLSDLSEIEKEL 620


>gi|423083522|ref|ZP_17072052.1| hypothetical protein HMPREF1122_03047 [Clostridium difficile
           002-P50-2011]
 gi|423088427|ref|ZP_17076810.1| hypothetical protein HMPREF1123_03965 [Clostridium difficile
           050-P50-2011]
 gi|357542999|gb|EHJ25034.1| hypothetical protein HMPREF1123_03965 [Clostridium difficile
           050-P50-2011]
 gi|357544282|gb|EHJ26286.1| hypothetical protein HMPREF1122_03047 [Clostridium difficile
           002-P50-2011]
          Length = 678

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 235/698 (33%), Positives = 361/698 (51%), Gaps = 83/698 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++
Sbjct: 54  TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   +Y RPG   +L  V + W+  RD+L +SG   I+ L +   
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIKALKDDFD 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
              +   L  E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D           KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L 
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             +LDA+ +TK   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKDGGFYSAQDADS---EG----EEGKFY 333

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
           ++   E+ ++LGE     F  ++ +  +GN            F+GK++  LI+       
Sbjct: 334 IFNPLEIIEVLGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK------- 374

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                   E++   + +   K+F+ R +R   H DDK++ SWN L+I +  +A   L+++
Sbjct: 375 ----NKEYERHNEKIADLSEKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLEND 430

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y+E +     FI  +L +E + RL   +R+G S    +LDDYA
Sbjct: 431 I----------------YLEYSNKCLDFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FLI   ++LYE     K+L  A+ L      LF D E  G++    +  +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNENCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV + NL+RLA I   S+ +   + +   L ++   +K           +  M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDSRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589

Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             + S K ++ +  + S  + F+ +++             +  P  T     + E N+  
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           + +       DK    VCQ+ SCS P+ D   L++++L
Sbjct: 638 SFLNNYRLKDDKTSYYVCQSNSCSQPINDLQKLKDMIL 675


>gi|448382091|ref|ZP_21561926.1| hypothetical protein C478_06099 [Haloterrigena thermotolerans DSM
           11522]
 gi|445662325|gb|ELZ15095.1| hypothetical protein C478_06099 [Haloterrigena thermotolerans DSM
           11522]
          Length = 731

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 240/696 (34%), Positives = 359/696 (51%), Gaps = 59/696 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEEESFADEAVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVRGQGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
            +L+P+ KP   GTYFP E K G+PGF  +  ++ D+W+ + D         Q    A +
Sbjct: 113 AWLTPEGKPFFIGTYFPREGKRGQPGFLDLCERISDSWESEEDREEMQHRAQQWTDAATD 172

Query: 134 QLSEAL-SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
           +L E   SA   +    +    + L   A+ + +S D ++GGFG+  KFP+P  ++++  
Sbjct: 173 RLEETPDSAGVDAGGAAEPPSSDVLEAAADAVLRSADRQYGGFGTGQKFPQPSRLRVL-- 230

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
            ++  + TG+     E ++++  TL  MA GG+ DHVGGGFHRY VD  W VPHFEKMLY
Sbjct: 231 -ARTYDRTGR----EEYREVLAETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKMLY 285

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D  ++   +L  + LT +  Y+    D L ++ R++    G  FS  DA S + E   R 
Sbjct: 286 DNAEIPRAFLAGYQLTGEDRYAETVADTLAFVDRELTHDEGGFFSTLDAQSEDPETGER- 344

Query: 313 KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           +EGAFYVWT +EV D++ +   A LF   Y +  +GN            F+G+N    + 
Sbjct: 345 EEGAFYVWTPEEVHDVIADETDASLFCARYDITESGN------------FEGQNQPNRIA 392

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
             S  AS+  +   + L  L   R++LF+ R +RPRP  D+K++  WNGL+IS++A A+ 
Sbjct: 393 RVSELASQFDLAESEVLKRLDSARKRLFEAREERPRPDRDEKILAGWNGLMISTYAEAAL 452

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +L              G D  EY E A  A  F+R  L+D+++ RL   ++ G  K  G+
Sbjct: 453 VL--------------GED--EYAETAVDALEFVRDRLWDDESQRLSRRYKAGDVKVDGY 496

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYAFL  G LD Y+       L +A+EL    +  F D + G  + T     S++ R 
Sbjct: 497 LEDYAFLARGALDCYQATGEVDHLAFALELARVIETEFWDADRGTLYFTPESGESLVTRP 556

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           +E  D + PS   V+V  L+ L    A    D     A   L     +L+  A+    +C
Sbjct: 557 QELGDQSTPSSTGVAVETLLALDEFAASEFGDI----AATVLETHANKLEANALEHATLC 612

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH- 669
            AAD L+  + +  V     ++ +       A AS  L   +  + P     ++ W E  
Sbjct: 613 LAADRLAAGALEVTV-----AADELPTEWREAFASQYLPDRLFALRPPTEAGLETWLETL 667

Query: 670 ---NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
              ++      R     +  +  VC++ +CSPP  D
Sbjct: 668 GLADAPPIWAGREARDGEPTL-YVCRDRTCSPPTHD 702


>gi|52078696|ref|YP_077487.1| hypothetical protein BL00131 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|319649027|ref|ZP_08003236.1| YyaL protein [Bacillus sp. BT1B_CT2]
 gi|52001907|gb|AAU21849.1| conserved protein YyaL [Bacillus licheniformis DSM 13 = ATCC 14580]
 gi|317389021|gb|EFV69839.1| YyaL protein [Bacillus sp. BT1B_CT2]
          Length = 625

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 254/698 (36%), Positives = 362/698 (51%), Gaps = 105/698 (15%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE VAKLLN+ FVSIKVDREERPDVD +YMT  Q + G GGWPL+VFL+PD K
Sbjct: 1   MAHESFEDEEVAKLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLNVFLTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   ++ RPGF  +++++ D + K R+ +        E+ +  L   A S+ 
Sbjct: 61  PFYAGTYFPKTSRFNRPGFVEVVKQLSDTFAKNREHVEDIA----EKAANNLRIKAKSDA 116

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEA 206
             D L ++ LR   +QL  S+D+ +GGFGSAPKFP P  +  +L YH         SGE 
Sbjct: 117 -GDSLGEDILRRTYQQLINSFDAAYGGFGSAPKFPIPHMLTFLLRYHQ-------YSGEE 168

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
           +     V+ TL  MA GGI+DHVG GF RYS D+ W VPHFEKMLYD   L   Y +A+ 
Sbjct: 169 N-ALYSVMKTLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNALLLIAYTEAYQ 227

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
           +TK+  Y  I   I+ ++RR+M    G  +SA DAD   TEG     EG +YVW+ +EV 
Sbjct: 228 ITKNERYKQISEQIITFVRREMTDEKGAFYSALDAD---TEGV----EGKYYVWSKEEVL 280

Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN----VLIELNDSSASASKLGM 381
           + LG E   L+   Y +   GN            F+G N    +   L D      +  +
Sbjct: 281 ETLGDELGELYCAVYNITQEGN------------FEGHNIPNLIYTRLEDIK---DEFAL 325

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
             E+  N L E R KLF+ R +R  PH+DDKV+ SWN L+I+  A+A+K+         +
Sbjct: 326 TDEELQNKLEEARTKLFEKRQERTYPHVDDKVLTSWNALMIAGLAKAAKV---------Y 376

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
           N P       EY+E+A +AA FI   L   Q  R+   +R+G  K  GF+DDYAFL+   
Sbjct: 377 NAP-------EYLEMARAAAEFIENKLI--QDGRIMVRYRDGEVKNKGFIDDYAFLLWAY 427

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           ++LYE       L  A +L+     LF D E GG++ T  +  ++++R KE +DGA PSG
Sbjct: 428 IELYEASLDLTDLRKAKKLEADMKGLFWDEEHGGFYFTGSDAEALIVRDKEVYDGALPSG 487

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS- 620
           N V  + L RL  +                    +  L D A A        D+ + PS 
Sbjct: 488 NGVLAVQLSRLGRLTG------------------DLSLHDQA-AKMFAAFHGDVSAYPSG 528

Query: 621 --------------RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE--MD 664
                         +K +V++G ++  D + +++A   ++  N  V+  +  D  +   D
Sbjct: 529 HTNFLQGLLSQFMPQKEIVVLGKRNDPDRQKIVSALQQAFQPNYAVLAAESPDDFKGIAD 588

Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           F  E+ + +          +K    +C+NF+C  P T+
Sbjct: 589 FAAEYKAVD----------NKTTVYICENFACRQPTTN 616


>gi|317030461|ref|XP_001392621.2| hypothetical protein ANI_1_728074 [Aspergillus niger CBS 513.88]
          Length = 791

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 236/580 (40%), Positives = 323/580 (55%), Gaps = 35/580 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA +LN  F+ IKVDREERPD+D VYM YVQA  G GGWPL+
Sbjct: 73  SACHWCHVMEKESFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 132

Query: 80  VFLSPDLKPLMGGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+P  +       G  GF  IL K+ D W  ++    +S     +Q
Sbjct: 133 VFLTPDLEPVFGGTYWPGPNSSTLTGNGTIGFVEILEKLSDVWQTQQLRCRESAKEITKQ 192

Query: 135 LSEALSASASS----NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L E       S     +  ++L    L    +     YD   GGF +APKFP P  +  +
Sbjct: 193 LREFAEEGTHSYQGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFL 252

Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +   + D     E ++   M + TL  MA+GGI DH+G GF RYSV   W +PHF
Sbjct: 253 LRLGIYPTAVADIVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHF 312

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
           EKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS  T
Sbjct: 313 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPT 372

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
              T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++  +DPH+EF  +NV
Sbjct: 373 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNV 430

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
           L      S  A   G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I +
Sbjct: 431 LSVKVTPSRLAKDFGLGEEEVVRIIRAAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGA 490

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A+ S + + E ES         S   +  E A  A +FI+ +L+++ T +L   +R+G 
Sbjct: 491 LAKCSALFE-EIES---------SKAVQCREAAAKAINFIKENLFEKPTGQLWRIYRDGG 540

Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT- 539
               PGF DDYA+LI GLLD+YE      +L +A +LQ   ++ FL   G    GY++T 
Sbjct: 541 RGNTPGFADDYAYLIGGLLDMYEATFDDSYLQFAEQLQKYLNDNFLAYVGTTPAGYYSTP 600

Query: 540 ---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
              T   P  LLR+K   + A P+ N V   NL+RL S++
Sbjct: 601 STMTSGAPGPLLRLKTGTESATPAVNGVIARNLLRLGSLL 640


>gi|397775180|ref|YP_006542726.1| hypothetical protein NJ7G_3432 [Natrinema sp. J7-2]
 gi|397684273|gb|AFO58650.1| hypothetical protein NJ7G_3432 [Natrinema sp. J7-2]
          Length = 732

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 235/697 (33%), Positives = 363/697 (52%), Gaps = 60/697 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF+DE VA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEEESFQDEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
            +L+P+ +P   GTYFP E + G+PGF+ + +++ D+W+   D         Q    A +
Sbjct: 113 AWLTPEGEPFFIGTYFPREGQRGQPGFRELCKRISDSWESDADREEMENRAQQWTDAATD 172

Query: 134 QLSEALSASASSN-KLPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMML 191
           +L E   A+     + P+    + L   A+ + +S D  +GGFGS+ PKFP+P  I+++ 
Sbjct: 173 RLEETPDAAGGGTVEAPEPPSSDVLETAADAVVRSADREYGGFGSSGPKFPQPSRIRVL- 231

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
             ++  + TG+     E ++++  TL  MA GG++DHVGGGFHRY VD  W VPHFEKML
Sbjct: 232 --ARTYDRTGR----DEYREVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKML 285

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD  ++   +L  + LT +  Y+ +  D L ++ R++    G  FS  DA SA  E   R
Sbjct: 286 YDNAEIPRAFLSGYQLTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSASPETGER 345

Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            +EGAFYVWT  EV D+L +   A LF   Y +   GN            F+G+N    +
Sbjct: 346 -EEGAFYVWTPAEVHDVLEDETDAALFCARYDITEAGN------------FEGRNQPNRV 392

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              S  A++  +   + L  L   R++LF+ R +RPRP+ D+K++  WNGL+IS++A A+
Sbjct: 393 ARVSELAAQFDLAEHEILKRLASARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAA 452

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L              G+D  +Y + A  A  F+R  L+D+   RL   +++G  K  G
Sbjct: 453 LVL--------------GAD--DYADTAVDALEFVRDELWDDDEQRLSRRYKDGDVKVDG 496

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           +L+DYAFL  G LD Y+       L +A+EL       F D + G  + T     +++ R
Sbjct: 497 YLEDYAFLARGALDCYQATGEVDHLAFALELARVIKAEFWDADRGTLYFTPESGEALVTR 556

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            +E  D + PS   V+V  L+ L    A    + +   A   L     +L+  A+    +
Sbjct: 557 PQELSDQSTPSATGVAVETLLALDEFAA----EDFEPIAATVLETHANKLETNALEHATL 612

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
           C AAD L   + + V +        + + L + +        +  + P   + +D W E 
Sbjct: 613 CLAADRLEAGALE-VTVAADDLPTAWRDRLTSQY----FPDRLFALRPPTEDGLDAWLET 667

Query: 670 ----NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               ++      R     +  +  VC++ +CSPP  D
Sbjct: 668 LGLADAPPIWAGREARDGEPTL-YVCRDRTCSPPSHD 703


>gi|238498046|ref|XP_002380258.1| DUF255 domain protein [Aspergillus flavus NRRL3357]
 gi|317141806|ref|XP_003189401.1| hypothetical protein AOR_1_504164 [Aspergillus oryzae RIB40]
 gi|220693532|gb|EED49877.1| DUF255 domain protein [Aspergillus flavus NRRL3357]
          Length = 787

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 233/568 (41%), Positives = 318/568 (55%), Gaps = 35/568 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN+ F+ IKVDREERPD+D +YM YVQA  G GGWPL+
Sbjct: 69  SACHWCHVMEKESFMSPEVATILNESFIPIKVDREERPDIDDIYMNYVQATTGSGGWPLN 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYG-----RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+P  +          GF  IL K+++ W  ++     S     +Q
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNSSTLLGNETIGFVDILEKLREVWQTQQQRCLDSAKEITKQ 188

Query: 135 L---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L   +E  + S   +K  DE L    L    +     YDS  GGF  APKFP P  +  +
Sbjct: 189 LREFAEEGTHSYQGDKEADEDLDIELLEEAYQHFVSRYDSVHGGFSRAPKFPTPANLSFL 248

Query: 191 LY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +   + D     E  +   M + TL  MA+GGI DH+G GF RYSV   W +PHF
Sbjct: 249 LRLGAYPNAVSDIVGREECEKATAMAVHTLISMARGGIRDHIGHGFARYSVTADWSLPHF 308

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
           EKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS  +
Sbjct: 309 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPS 368

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
              T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  +S  +DPH+EF  +NV
Sbjct: 369 PKDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVHPDGN--ISPENDPHDEFMNQNV 426

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
           L      S  A + G+  E+ + I+   +++L + R + R RP LDDK+IV+WNGLVI +
Sbjct: 427 LSVKVTPSKLAREFGLGEEEVVRIIRSAKQRLREYRERTRVRPDLDDKIIVAWNGLVIGA 486

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A+ S + +           +  S   +  E A  A SFI+ +L+D+ T +L   +R+G 
Sbjct: 487 LAKCSALFER----------IESSKAVQCREAAAKAISFIKNNLFDKATGQLWRIYRDGG 536

Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT- 539
               PGF DDYA+LISGLLD+YE      +L +A +LQ   +E FL   G    GY++T 
Sbjct: 537 RGDTPGFADDYAYLISGLLDMYEATFDDSYLQFAEQLQKYLNENFLAYVGSTPAGYYSTP 596

Query: 540 ---TGEDPSVLLRVKEDHDGAEPSGNSV 564
              T + P  LLR+K   + A PS N V
Sbjct: 597 SNMTSDMPGPLLRLKTGTESATPSVNGV 624


>gi|325845722|ref|ZP_08169003.1| hypothetical protein HMPREF9402_0744 [Turicibacter sp. HGF1]
 gi|325488252|gb|EGC90680.1| hypothetical protein HMPREF9402_0744 [Turicibacter sp. HGF1]
          Length = 614

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 242/685 (35%), Positives = 353/685 (51%), Gaps = 73/685 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFEDE VA  LN+ F+SIKVDREERPD+D VYM+  QAL G GGWPL++F++P  +
Sbjct: 1   MEHESFEDEDVATYLNEHFISIKVDREERPDIDTVYMSICQALTGQGGWPLTIFMTPTQQ 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
               GTYFP   +YGRPGF  +L+ +   W+  R  +            +        + 
Sbjct: 61  AFYAGTYFPKTSRYGRPGFLDVLKTIDFNWNHHRAKVTDITKQIASHFKDLEGIETEGDS 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
           L   + QN +     QL +SYD RFGGFG+APKFP P ++  +L + ++ +D        
Sbjct: 121 LSMAIIQNGVN----QLKQSYDPRFGGFGTAPKFPTPHKLMFLLRYDEQTKDKSV----- 171

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
             Q MV  TL  M KGGI DH+G GF RYS DE W VPHFEKMLYD   L   Y +A+ +
Sbjct: 172 --QDMVTQTLDHMYKGGIFDHLGYGFSRYSTDEIWLVPHFEKMLYDNALLMISYTEAYQV 229

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T++  Y  I     +Y+   +  P G  + AEDADS   EG    +EG FYV+T  E+  
Sbjct: 230 TREPRYLSIAMQTAEYVLTQLTSPEGGFYCAEDADS---EG----EEGKFYVFTPAEIIQ 282

Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
           ILG E    F E Y +   GN            F+GKN+L  L+            LE  
Sbjct: 283 ILGPEKGHWFNEFYNVTEEGN------------FEGKNILNRLHHKK---------LELD 321

Query: 387 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 446
           +  L  CR  L   R +R   H DDK++ SWNGL+I++FA+                 + 
Sbjct: 322 IKELEACRETLLTYRLERTHLHKDDKILTSWNGLMIAAFAK-----------------LY 364

Query: 447 GSDRKE-YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
           G  +K  Y++ A  A +FI++HL+DE   RL   +R G S    +LDDYAFL  GL++L+
Sbjct: 365 GQTQKMIYLDAASKAVTFIKQHLFDET--RLLARYREGESHFKAYLDDYAFLSYGLIELH 422

Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
           +  +  ++L  AI+L     +LF D E GG++ T  +  +++LR KE +DGA PSGNSV+
Sbjct: 423 QSTAEVEYLELAIQLNKEMLDLFKD-EAGGFYLTGHDAETLMLRPKELYDGAMPSGNSVA 481

Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
             NL+RLA +   +    +   AE  +     ++K   M       AA      +++ ++
Sbjct: 482 AYNLIRLAKLTGDT---LFETEAEKQIQYLAKQVKHYEMNHTFYLIAALFALSDTKELMI 538

Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADK 685
            V  +  +  + +L   + +   N T++   P +  ++       S  A   ++    D+
Sbjct: 539 TVPKQEQI--KEILKQLNETPHFNTTLLFKTPENQTQL-------SKLAPYTKDYPIGDQ 589

Query: 686 VVALVCQNFSCSPPVTDPISLENLL 710
               +C N +C  P +   SL+N+L
Sbjct: 590 PTYYLCSNGTCQAPTSSLESLKNIL 614


>gi|196232510|ref|ZP_03131362.1| protein of unknown function DUF255 [Chthoniobacter flavus Ellin428]
 gi|196223272|gb|EDY17790.1| protein of unknown function DUF255 [Chthoniobacter flavus Ellin428]
          Length = 428

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 197/366 (53%), Positives = 242/366 (66%), Gaps = 16/366 (4%)

Query: 11  KTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYM 64
           K RR H  I      +TCHWCHVM  ESFE+   AKL+N+ FV+IKVDREERPDVD+VYM
Sbjct: 57  KARREHKPIFLSIGYSTCHWCHVMAHESFENPATAKLMNENFVNIKVDREERPDVDRVYM 116

Query: 65  TYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML 124
           TYVQA  G GGWP+SVFL+PDLKP  GGTYFPPED+YGRPGF TIL+++ +AW    + +
Sbjct: 117 TYVQATTGSGGWPMSVFLTPDLKPFYGGTYFPPEDRYGRPGFPTILQRLAEAWKDDHEKV 176

Query: 125 AQSGAFAIEQLSE-ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 183
             +   AI  L++   S  A S  +  E    A+ L   QL++S+D   GGFG APKFPR
Sbjct: 177 LGAANDAIRALNDYTASGPAQSTAVGKE----AIALALNQLTRSFDDELGGFGGAPKFPR 232

Query: 184 PVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
           PV +  + +   +     + G+A+ G  M L TLQ MA GG+HDH+GGGFHRYSVD+ WH
Sbjct: 233 PVTLNFLFHVFAREGHESRDGKAALG--MALITLQKMADGGMHDHLGGGFHRYSVDKFWH 290

Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS 303
           VPHFEKMLYDQ QLA+ YLDAF +T D  Y    RDI DY+RRDM   GG  +SAEDADS
Sbjct: 291 VPHFEKMLYDQAQLASSYLDAFQVTHDTVYERTARDIFDYVRRDMTDAGGGFYSAEDADS 350

Query: 304 AETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKG 362
              +G     EGAFYVWT  E+  +LGE  A +F   Y +   GN      SDP  EF+G
Sbjct: 351 LLEKGKPEHSEGAFYVWTKDEIVHVLGEDAAAVFDRVYGVDAEGNA--PEGSDPQGEFRG 408

Query: 363 KNVLIE 368
           KN+LI+
Sbjct: 409 KNILIQ 414


>gi|70995702|ref|XP_752606.1| DUF255 domain protein [Aspergillus fumigatus Af293]
 gi|19309415|emb|CAD27314.1| hypothetical protein [Aspergillus fumigatus]
 gi|41581314|emb|CAE47963.1| hypothetical protein, conserved [Aspergillus fumigatus]
 gi|66850241|gb|EAL90568.1| DUF255 domain protein [Aspergillus fumigatus Af293]
          Length = 799

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 244/620 (39%), Positives = 334/620 (53%), Gaps = 55/620 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA LLN+ F+ IKVDREERPD+D VYM YVQA  G GGWPLS
Sbjct: 63  SACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLS 122

Query: 80  VFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+P+L+P+ GGTY+P  +     +    GF  IL K++D W  ++     S      Q
Sbjct: 123 VFLTPNLEPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAKEITRQ 182

Query: 135 LSEALSASASSN----KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L E       S     +  ++L    L    +  +  YD+  GGF  APKFP P  +  +
Sbjct: 183 LREFAEEGTHSQQGDRQAGEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPANLSFL 242

Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +   + D     E      M + TL  MA+GGI DH+G GF RYSV   W +PHF
Sbjct: 243 LRLKTYPSAVSDIVGQEECDRAAAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHF 302

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
           EKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS  T
Sbjct: 303 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDADSLPT 362

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
              T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++   DPH+EF  +NV
Sbjct: 363 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQNV 420

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
           L      S  A + G+  E+ + I+   ++KL + R K R RP LDDKVIV+WNGL I +
Sbjct: 421 LSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKVIVAWNGLAIGA 480

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A+ S + + E ES         S   +  E A  A +FI+ +L+++ T +L   +R+G 
Sbjct: 481 LAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRIYRDGS 530

Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN-------------TQDEL--- 527
             + PGF DDYA+LI GLLD+YE      +L +A +LQ+             TQ E    
Sbjct: 531 RGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTQAEYLND 590

Query: 528 -FLDREG---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
            FL   G    GY++T    T   P  LLR+K   + A PS N V   NL+RL++++   
Sbjct: 591 NFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALL--- 647

Query: 580 KSDYYRQNAEHSLAVFETRL 599
           + + YR  A  +   F   +
Sbjct: 648 EEEEYRTLARQTCLSFSVEI 667


>gi|325288476|ref|YP_004264657.1| hypothetical protein Sgly_0289 [Syntrophobotulus glycolicus DSM
           8271]
 gi|324963877|gb|ADY54656.1| protein of unknown function DUF255 [Syntrophobotulus glycolicus DSM
           8271]
          Length = 752

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 263/759 (34%), Positives = 383/759 (50%), Gaps = 92/759 (12%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL    +TCHWCHVME ESFED+ VA+ LN  F+++KVDREERPD
Sbjct: 34  GIEAFEKAAKENKPVFLSIGYSTCHWCHVMERESFEDKEVAEKLNKSFIAVKVDREERPD 93

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D  YMT+ QAL G GGWPL++ ++PD KP   GTYF      GR G   +L    + W 
Sbjct: 94  IDHTYMTFCQALTGAGGWPLTILMTPDKKPFFAGTYFAKNSGGGRVGLIDVLDYTSEKWK 153

Query: 119 KKRDMLA------------------QSGAFAIEQLSEALSASASSNKLPDEL---PQNAL 157
            +++ +                   Q   F  E L E +  + +  +  D++    +  +
Sbjct: 154 NEKEKILTSAEELYTVVSSHYGGKDQETVFKKEGLLEEVRYADARKQTKDDIMVWGKQMI 213

Query: 158 RLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTL 217
               E L+K++D +FGGFG APKFP P  +  ++       D           +MV  TL
Sbjct: 214 EKGYEMLAKTFDPKFGGFGHAPKFPSPHTLGFLMRCHLDRPD-------QNALEMVRKTL 266

Query: 218 QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYIC 277
             MA GGI+D +G GF RYS D  W VPHFEKMLYD   LA  YL+A+ LT +  Y  + 
Sbjct: 267 DLMADGGIYDQIGYGFSRYSTDRFWLVPHFEKMLYDNATLAYTYLEAYQLTHEQRYGQVA 326

Query: 278 RDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFK 337
           R+I  Y+ R+M  P G  +SAEDADS   EG    +EG +Y+WT +EV + L    +  +
Sbjct: 327 REIFSYVLREMCSPEGGFYSAEDADS---EG----EEGKYYIWTYQEVMETLTAELLRIQ 379

Query: 338 E-------------------HYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL-NDSSASAS 377
           E                   H  + P   C+  +++   N F+GKN+L  L +D    A 
Sbjct: 380 ENRASLDQPDGRDIFQSQFAHPDVLPGLYCEAYQITKEGN-FEGKNILNRLFSDWRDLAR 438

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K  +P ++++  +  C   L  VR +R RP  DDK++VSWNGL+I++ A+ +++L     
Sbjct: 439 KASIPFDEFVRAIRYCNTILLRVRERRVRPIRDDKILVSWNGLMIAALAKGAQVL----- 493

Query: 438 SAMFNFP----VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
               +FP     V  +   Y+  AE AA+FI  ++      RL   +R+G ++ P +LDD
Sbjct: 494 ----SFPDQTFAVHENASLYLTQAEKAANFIDDNMRSSDG-RLFARYRHGEAQYPAYLDD 548

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAF I GLL+LY       +L  AIELQ  Q+ LF D E GGYF T  +   +L R KE 
Sbjct: 549 YAFYIFGLLELYTACGKPVYLQRAIELQQQQENLFRDTEKGGYFFTGKDSEELLFRPKEV 608

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           +DGA PSGNS++V+NL +L  +   +K   ++  AE ++  F   +K+          A 
Sbjct: 609 YDGALPSGNSLAVLNLTKLWKMTGDNK---WKNIAEGNIQSFHAEMKEYP--------AG 657

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT--VIHIDPADTEEMDFWEEHNS 671
            +  + S +H +  G       E +L  A  +  LNK   V   D      + + E    
Sbjct: 658 HLAFLRSIQHYISDGD------ELILGGALNNEVLNKMKEVFFRDFRPYAVLLYHEGTVQ 711

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                       +K  A +C+NFSC  PV     L+++L
Sbjct: 712 ELVPELAGYPQQEKAAAYLCRNFSCLNPVFSVEELQHVL 750


>gi|448343975|ref|ZP_21532892.1| hypothetical protein C486_20033 [Natrinema gari JCM 14663]
 gi|445622058|gb|ELY75523.1| hypothetical protein C486_20033 [Natrinema gari JCM 14663]
          Length = 732

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 233/697 (33%), Positives = 363/697 (52%), Gaps = 60/697 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF+DE VA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEAESFQDEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
            +L+P+ +P   GTYFP E + G+PGF+ + +++ D+W+   D         Q    A +
Sbjct: 113 AWLTPEGEPFFIGTYFPREGQRGQPGFRELCKRISDSWESDADREEMENRAQQWTDAATD 172

Query: 134 QLSEALSASASSN-KLPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMML 191
           +L E   A+     + P+    + L   A+ + +S D  +GGFGS+ PKFP+P  I+++ 
Sbjct: 173 RLEETPDAAGGGTVEAPEPPSSDVLETAADAVVRSADREYGGFGSSGPKFPQPSRIRVL- 231

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
             ++  + TG+     E ++++  TL  MA GG++DHVGGGFHRY VD  W VPHFEKML
Sbjct: 232 --ARTYDRTGR----DEYREVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKML 285

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD  ++   +L  + LT +  Y+ +  D L ++ R++    G  FS  DA SA  E   R
Sbjct: 286 YDNAEIPRAFLSGYQLTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSASPETGER 345

Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            +EGAFYVWT  EV D+L +   A LF   + +   GN            F+G+N    +
Sbjct: 346 -EEGAFYVWTPAEVHDVLEDETDAALFCARFDITEAGN------------FEGRNQPNRV 392

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              S  A++  +   + L  L   R++LF+ R +RPRP+ D+K++  WNGL+IS++A A+
Sbjct: 393 ARVSELAAQFDLAEHEILKRLASARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAA 452

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L              G+D  +Y + A  A  F+R  L+D+   RL   +++G  K  G
Sbjct: 453 LVL--------------GAD--DYADTAVDALEFVRDELWDDDEQRLSRRYKDGDVKVDG 496

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           +L+DYAFL  G LD Y+       L +A+EL    +  F D + G  + T     +++ R
Sbjct: 497 YLEDYAFLARGALDCYQATGEVDHLAFALELARVIEAEFWDADRGTLYFTPESGEALVTR 556

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            +E  D + PS   V+V  L+ L    A    + +   A   L     +L+  A+    +
Sbjct: 557 PQELGDQSTPSATGVAVETLLALDEFAA----EDFEPIAATVLETHANKLETNALEHATL 612

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
           C  AD L   + + V +        + + L + +        +  + P   + +D W E 
Sbjct: 613 CLVADRLEAGALE-VTVAADDLPTAWRDRLTSQY----FPDRLFALRPPTEDGLDAWLET 667

Query: 670 ----NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               ++      R     +  +  VC++ +CSPP  D
Sbjct: 668 LGLADAPPIWAGREARDGEPTL-YVCRDRTCSPPSHD 703


>gi|300087365|ref|YP_003757887.1| hypothetical protein Dehly_0239 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
 gi|299527098|gb|ADJ25566.1| protein of unknown function DUF255 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
          Length = 669

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 247/694 (35%), Positives = 367/694 (52%), Gaps = 77/694 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE  A ++N  F++IKVDREERPD+D +YM  VQA+ G GGWP++
Sbjct: 48  SACHWCHVMAHESFEDEATAAVMNRHFINIKVDREERPDIDSIYMAAVQAMTGHGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD KP  GGTY+PPED++G P F  IL  V +A+ ++ D +A +    +  +++  
Sbjct: 108 VFLTPDGKPFYGGTYYPPEDRHGLPAFTRILEAVAEAYRERPDEVAATATRLVTAVADKP 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLE 198
              A  + L  EL   A     + L++ +D    GFG APKFP+P+ +  +L YH +   
Sbjct: 168 VGDAGESSLTVELLDRAF----QALTRDFDENHAGFGGAPKFPQPLVLDFLLRYHYRT-- 221

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                  ++   +MV  TL+ M +GG++DH+GGGFHRYSVD+ W VPHFEKMLYD   LA
Sbjct: 222 ------SSARALEMVEKTLEAMYRGGMYDHLGGGFHRYSVDDAWQVPHFEKMLYDNALLA 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE-IFSAEDADSAETEGATRKKEGAF 317
            VYL AF +T    Y  +  DILDY+  +M  P     +SA+DADS   EG    +EG +
Sbjct: 276 RVYLHAFQITGKAQYRLVTEDILDYVLEEMTDPATSGFYSAQDADS---EG----EEGRY 328

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+WT  E+E +LG E A +F   Y +   GN            F+G+N+L    + S  A
Sbjct: 329 YIWTPDEIESVLGRESAEIFGRRYGVTQAGN------------FEGRNILHLTGEFSVEA 376

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           S  G+  +         R +L   R KR  P  D K++VSWN +   + A A        
Sbjct: 377 SA-GVSAD---------RARLLAERRKRVPPGTDTKILVSWNAMTQLALASAG------- 419

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    V  DR +Y+  AE+ A+F+  +L D  + RL+H+     S A GFL+DYA 
Sbjct: 420 ---------VALDRPDYLAAAEANAAFLLDNLLD--SGRLRHTV----SVAEGFLEDYAL 464

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L   LL L++     +WL  A+ L     ELF D + G +++T  +   +  R +   DG
Sbjct: 465 LTESLLALHKATLTPRWLRQAMALGAAMVELFWDEDEGVFYDTPADAGQLFQRPRNFQDG 524

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSG SV+ + L+RL+ +   +    Y Q A  +L    + +    +   L   A D  
Sbjct: 525 AVPSGASVASLALLRLSRL---ADERSYWQTAGRALKGVSSFMGRYPLGFGLWLGALDFY 581

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             P ++ V ++G  +      ++A    ++  N  +  +D  D+E +       ++    
Sbjct: 582 LGP-QQEVAVIGPAADDASRRLVAVVGRAFRPNTVLAGLDAGDSEGI-------ASLPLF 633

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                +A +  A VC++F+C PPVT P+ LE +L
Sbjct: 634 QGRGQTAGQPTAWVCRSFTCYPPVTAPVDLEQVL 667


>gi|407465214|ref|YP_006776096.1| hypothetical protein NSED_06780 [Candidatus Nitrosopumilus sp. AR2]
 gi|407048402|gb|AFS83154.1| hypothetical protein NSED_06780 [Candidatus Nitrosopumilus sp. AR2]
          Length = 675

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 244/686 (35%), Positives = 362/686 (52%), Gaps = 74/686 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFE+E VAK +N+ F++IKVDREERPD+D +Y    Q   G GGWPLS
Sbjct: 49  SSCHWCHVMAHESFENEDVAKFMNENFINIKVDREERPDIDDIYQKVCQIATGQGGWPLS 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD KP   GTYFP  D YGRPGF +I R++  AW +K + +  S    I+ L++  
Sbjct: 109 VFLTPDQKPFYVGTYFPVLDSYGRPGFGSICRQLSQAWKEKPNDIETSAKRFIDALTK-- 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
              A + ++P +L +  L   A  L +  D+ +GGFGSAPKFP    I   L+   KL  
Sbjct: 167 ---AEAIQVPSKLERILLDEAAMNLFQLGDATYGGFGSAPKFPNAANIS-FLFRYAKLSG 222

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             K  E        L TL+ MA GGI D +GGGF RYS D +W VPHFEKMLYD   ++ 
Sbjct: 223 LTKFNE------FALKTLKKMANGGIFDQIGGGFSRYSTDAKWLVPHFEKMLYDNALISV 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +AF +TKD FY  + R  LD++ R+M  P G  +SA DADS   EG     EG +YV
Sbjct: 277 NYAEAFQITKDPFYLEVLRKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKYYV 329

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W   E+++ILG+ A LF  +Y +   GN            ++G N+L    + S  A   
Sbjct: 330 WKKSEIKEILGDDADLFCLYYDVTDGGN------------WEGNNILCNNLNISTVAFNF 377

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+   +   I+  C +KL  VRS R  P LDDK++VSWN L+I++ A+  ++        
Sbjct: 378 GISETEVKKIINLCSKKLLKVRSSRIPPGLDDKILVSWNSLMITALAKGYRV-------- 429

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                   +    Y+  A++  SFI  +L      +L  +++NG +K  G+L+DY++ I+
Sbjct: 430 --------TGDILYLNAAKNCISFIENNLL--VNDKLLRTYKNGTAKIDGYLEDYSYFIN 479

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LLD++E     K+L  +++L +     F D +   +F T+ +   +++R K ++D + P
Sbjct: 480 ALLDVFEIEPDEKYLKLSLKLAHHLVNHFWDSKNNNFFMTSDDHEKLIIRPKSNYDLSLP 539

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD----MAMAVPL-MCCAAD 614
           SGNSVS   L+RL           Y  + + +     T++ +    MA   P       +
Sbjct: 540 SGNSVSAFALLRL-----------YHLSQDSTFLKITTKIMESQAQMAAENPFGFGYLLN 588

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
            +S+  +K V +    + ++ EN         D     I I   D  ++    E+ S   
Sbjct: 589 TISMYIQKPVEI----TIINTENPKICESLLLDYLPNSIMITIRDASQL----ENLSEYP 640

Query: 675 SMARNNFSADKVVALVCQNFSCSPPV 700
             A  +F  DK    VC++F+CS P+
Sbjct: 641 FFAGKSFE-DKTTVFVCKDFTCSLPL 665


>gi|134077135|emb|CAK45476.1| unnamed protein product [Aspergillus niger]
          Length = 765

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 235/577 (40%), Positives = 322/577 (55%), Gaps = 39/577 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA +LN  F+ IKVDREERPD+D VYM YVQA  G GGWPL+
Sbjct: 57  SACHWCHVMEKESFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+P  +       G  GF  IL K+ D W  ++    +S     +Q
Sbjct: 117 VFLTPDLEPVFGGTYWPGPNSSTLTGNGTIGFVEILEKLSDVWQTQQLRCRESAKEITKQ 176

Query: 135 LSEALSASASS----NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L E       S     +  ++L    L    +     YD   GGF +APKFP P  +  +
Sbjct: 177 LREFAEEGTHSYQGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFL 236

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           L+   +        E ++   M + TL  MA+GGI DH+G GF RYSV   W +PHFEKM
Sbjct: 237 LHIVGR-------DECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHFEKM 289

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGA 309
           LYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS  T   
Sbjct: 290 LYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPTPND 349

Query: 310 TRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
           T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++  +DPH+EF  +NVL  
Sbjct: 350 TEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNVLSV 407

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFAR 427
               S  A   G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I + A+
Sbjct: 408 KVTPSRLAKDFGLGEEEVVRIIRAAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGALAK 467

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SK 486
            S + + E ES         S   +  E A  A +FI+ +L+++ T +L   +R+G    
Sbjct: 468 CSALFE-EIES---------SKAVQCREAAAKAINFIKENLFEKPTGQLWRIYRDGGRGN 517

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT---- 539
            PGF DDYA+LI GLLD+YE      +L +A +LQ   ++ FL   G    GY++T    
Sbjct: 518 TPGFADDYAYLIGGLLDMYEATFDDSYLQFAEQLQKYLNDNFLAYVGTTPAGYYSTPSTM 577

Query: 540 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
           T   P  LLR+K   + A P+ N V   NL+RL S++
Sbjct: 578 TSGAPGPLLRLKTGTESATPAVNGVIARNLLRLGSLL 614


>gi|442323509|ref|YP_007363530.1| hypothetical protein MYSTI_06573 [Myxococcus stipitatus DSM 14675]
 gi|441491151|gb|AGC47846.1| hypothetical protein MYSTI_06573 [Myxococcus stipitatus DSM 14675]
          Length = 697

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 248/704 (35%), Positives = 357/704 (50%), Gaps = 69/704 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE    A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+
Sbjct: 57  SACHWCHVMAHESFESPDTARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP  GGTYFPPED+YGRPGF  +L  ++DAW  KR+ + +  A   E L E  
Sbjct: 117 VFLTPDLKPFYGGTYFPPEDRYGRPGFPRLLMALRDAWKNKREDIHRQAAQFEEGLGEL- 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A+   +  P  L    +    ++++   DS  GGFG APKFP P+   ++L   ++   
Sbjct: 176 -AAYGLDAAPGVLSVEDVLSMGQRMALQVDSVHGGFGGAPKFPNPMNFSLLLRAWRR--- 231

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G     +  V  TL+ MA GGI+D +GGGFHRYSVD RW VPHFEKMLYD  QL +
Sbjct: 232 ----GGGDSLRDAVFLTLERMALGGIYDQLGGGFHRYSVDARWLVPHFEKMLYDNAQLMH 287

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +A  +     +  +  + ++Y+RR+M   GG  ++A+DADS   EG    +EG F+V
Sbjct: 288 LYSEAQQVAPRPLWRKVVEETVEYVRREMTDAGGGFYAAQDADS---EG----EEGKFFV 340

Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W  +E++ +L  E A L   H+ + P GN +            G  VL  +  +   A +
Sbjct: 341 WRPEEIQAVLPPERAELVMRHFRVTPLGNFE-----------HGATVLEVVVPAETLARE 389

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             + LE     L E R+ LF  R +R +P  DDK++  WNGL+I   A A+++       
Sbjct: 390 RSLSLEAVERELAETRQVLFQARERRVKPGRDDKILAGWNGLMIRGLALAARVF------ 443

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     DR ++  +A SAA F+   L+D    RL  S++ G ++  GFL+DY  L 
Sbjct: 444 ----------DRPDWTRLAVSAADFVLAKLWD--GTRLARSYQEGQARIDGFLEDYGDLA 491

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
           SGL  LY+     K+L  A  L    +ELF D E   Y         +++      D A 
Sbjct: 492 SGLTALYQATFDVKYLEAAKALVKRAEELFWDAEKQAYLTAPRGQKDLVVATYGLFDNAF 551

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSG S      V LA++   +  +++ +     +A     L   AM    +  AAD L +
Sbjct: 552 PSGASTLTEAQVALAAL---TGDEHHLELPSKYVARMREGLVANAMGYGHLGLAADSL-L 607

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
                V   G   +V    +L+AA+  Y           A T     W+E      ++ +
Sbjct: 608 DGGAGVTFSGSSDAV--APLLSAANHVY-----------APTFAFG-WKEEGRPVPALLK 653

Query: 679 NNFS-----ADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
             F      A K  A +C+ F+C  P TD  +L   L EKP   
Sbjct: 654 ELFEGREPVAGKGAAYLCRGFACELPRTDAKALAERLTEKPKGA 697


>gi|159131360|gb|EDP56473.1| DUF255 domain protein [Aspergillus fumigatus A1163]
          Length = 799

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 244/620 (39%), Positives = 333/620 (53%), Gaps = 55/620 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA LLN+ F+ IKVDREERPD+D VYM YVQA  G GGWPLS
Sbjct: 63  SACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLS 122

Query: 80  VFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+P+L P+ GGTY+P  +     +    GF  IL K++D W  ++     S      Q
Sbjct: 123 VFLTPNLDPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAKEITRQ 182

Query: 135 LSEALSASASSN----KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L E       S     +  ++L    L    +  +  YD+  GGF  APKFP P  +  +
Sbjct: 183 LREFAEEGTHSQQGDRQAGEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPANLSFL 242

Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +   + D     E      M + TL  MA+GGI DH+G GF RYSV   W +PHF
Sbjct: 243 LRLKTYPSAVSDIVGQEECDRAAAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHF 302

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
           EKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS  T
Sbjct: 303 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDADSLPT 362

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
              T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++   DPH+EF  +NV
Sbjct: 363 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQNV 420

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
           L      S  A + G+  E+ + I+   ++KL + R K R RP LDDKVIV+WNGL I +
Sbjct: 421 LSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKVIVAWNGLAIGA 480

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A+ S + + E ES         S   +  E A  A +FI+ +L+++ T +L   +R+G 
Sbjct: 481 LAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRIYRDGS 530

Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN-------------TQDEL--- 527
             + PGF DDYA+LI GLLD+YE      +L +A +LQ+             TQ E    
Sbjct: 531 RGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTQAEYLND 590

Query: 528 -FLDREG---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
            FL   G    GY++T    T   P  LLR+K   + A PS N V   NL+RL++++   
Sbjct: 591 NFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALL--- 647

Query: 580 KSDYYRQNAEHSLAVFETRL 599
           + + YR  A  +   F   +
Sbjct: 648 EEEEYRTLARQTCLSFSVEI 667


>gi|429217838|ref|YP_007179482.1| thioredoxin domain-containing protein [Deinococcus peraridilitoris
           DSM 19664]
 gi|429128701|gb|AFZ65716.1| thioredoxin domain protein [Deinococcus peraridilitoris DSM 19664]
          Length = 677

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 246/692 (35%), Positives = 361/692 (52%), Gaps = 69/692 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE VA  +N  FV+IKVDREERPDVD VYM+ VQA  G GGWP++
Sbjct: 47  STCHWCHVMAHESFEDETVAGFMNTHFVNIKVDREERPDVDAVYMSAVQATTGSGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL    +P   GTYFPP D +G P F  +L  V  AW+ +R  L Q+     E L++ L
Sbjct: 107 VFLDAQGRPFYAGTYFPPRDAHGMPSFSRVLAGVAQAWNGRRQDLMQNA----ETLTQHL 162

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             SA   +  + LP +       Q+ K +D+R GGFGSAPKFP P  +  +L        
Sbjct: 163 Q-SAGRREGSEALPADFTARGLAQVRKLFDARHGGFGSAPKFPAPTTLAYLLTQ------ 215

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   + + + L TLQ MA GG++D +GGGFHRYSVDERW VPHFEKMLYD  QLA 
Sbjct: 216 -------PQARDISLTTLQKMAAGGLYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLAR 268

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYL A+ LT +  ++   R+ L+YL R+M+ P G  +SA+DADS   EG     EG F+V
Sbjct: 269 VYLQAYQLTGEASFTQFARETLEYLEREMLSPEGGFYSAQDADS---EGI----EGKFFV 321

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASK 378
           WT +E++ ILG+ A L    + +   GN       DPH+ +F  ++VL  +   +  A +
Sbjct: 322 WTPQELQAILGDDAALAARFWGVTAEGN-----FMDPHHPDFGRRSVLSVVASPTELAEQ 376

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G+        L   RR+L++ R  R  P  D KV+ SWNGL + +FA A+++L+ E   
Sbjct: 377 FGLSEPDVRRRLEAARRRLWEERELRVHPGTDTKVLTSWNGLALGAFALAARVLREE--- 433

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         +++VA   A F+R HL  E    L+HS+++G ++  G L+D+A   
Sbjct: 434 -------------RFLDVARRNADFVRSHLRSEDA-TLRHSYKDGQARVQGLLEDHALYA 479

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++LY+       L WA EL N     F D+EGG +++T+    +++ R K+  D A 
Sbjct: 480 LGLIELYQASGHLPHLEWARELWNVVATEFWDQEGGAFWSTSARAETLITRQKDAFDSAV 539

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
            S N+ + +  + +       + +   + A  ++  F   +         +  A  +L+ 
Sbjct: 540 MSDNAAAALLGLWMGRYYGDPRGE---ELATRTIGTFAADMLAAPSGFGGLWQAHALLTA 596

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P  +  VL   ++   FE  LA     +        + P++              + +  
Sbjct: 597 PHVEVAVLGSSQARAPFEAELARHFLPF------AALAPSEA------------GSGLPV 638

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
               + + VA VC+NF+C  P  D  +L   L
Sbjct: 639 LEGRSGEGVAYVCRNFACDLPARDTATLGQQL 670


>gi|383762697|ref|YP_005441679.1| hypothetical protein CLDAP_17420 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381382965|dbj|BAL99781.1| hypothetical protein CLDAP_17420 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 689

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 244/692 (35%), Positives = 358/692 (51%), Gaps = 63/692 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE  A L+N+ FV+IKVDREERPD+D +YM  VQA+ G GGWP+S
Sbjct: 53  SACHWCHVMERESFEDEETAALMNELFVNIKVDREERPDLDAIYMDAVQAMTGQGGWPMS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PD KP  GGTYFP E +YG P F+ +LR V +A+ ++R+M+        E+L+  L
Sbjct: 113 VWLTPDGKPFYGGTYFPKEPRYGMPSFQQVLRAVAEAYRERREMVEGQA----ERLASML 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             +AS      EL +  L     Q+ + +D   GGFGS PKFP+P+ +   L    +   
Sbjct: 169 QRTASLRAEGGELGEEILEEALGQMRQYFDEEEGGFGSQPKFPQPMTLDFALTQYLR--- 225

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG      +   M   TL+ MA GGI+D +GGGFHRYSVD  W VPHFEKMLYD  QL  
Sbjct: 226 TGN----LDALYMAELTLEKMAHGGIYDQLGGGFHRYSVDAIWLVPHFEKMLYDNAQLLR 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL A+ +T+   +  +  + +DY+ R+M  P G  +SA+DADS   EG     EG F++
Sbjct: 282 TYLHAWQVTQRPLFRRVVEETIDYVLREMTAPDGGFYSAQDADS---EG----HEGKFFL 334

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+ +EVE +L  H A +F ++Y +   GN            F+GKN+L  +      A +
Sbjct: 335 WSQQEVESLLDPHTAAIFCDYYGVSAHGN------------FEGKNILSVVRSIEQVAQR 382

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             +   +  + L   R  LF  R KR +P  D+K++  WNGL+I + A    +L      
Sbjct: 383 FRIGEAEVEDALRRARAILFAHREKRIKPARDEKILTEWNGLMIHALAECGVVL------ 436

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     +R++ +  A  AA FI   +  +   RL  S+++G ++   +L+DYA LI
Sbjct: 437 ----------ERQDALAAAVRAAEFILAQM-SQPDGRLYRSYKDGRARFNAYLEDYASLI 485

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL+ LYE     +WL  A  L     E F D   GG+F T  +   ++ R K+  D A 
Sbjct: 486 RGLIALYEATFDLRWLGEATRLAQIMFEQFHD-PAGGFFQTGVDHEQLVARRKDFVDNAV 544

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNS++   L+RL+  +   +   YR  A   L + +  +         + C  D    
Sbjct: 545 PSGNSLAAEALLRLSVFLDKPE---YRTEAGRILLMMKDAMARQPTGFGRLLCVLDAYLS 601

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           PS++ + +VG +       +LA     +  +  +   +P          E  S    +  
Sbjct: 602 PSQE-IAIVGRRDDPATAALLAEVRRRFLPHAILALKEP----------EQESVLPLLQG 650

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 K  A VC+N++C  PVT   +L  +L
Sbjct: 651 RTLVDGKATAYVCENYACKLPVTSAEALAAML 682


>gi|222056570|ref|YP_002538932.1| hypothetical protein Geob_3488 [Geobacter daltonii FRC-32]
 gi|221565859|gb|ACM21831.1| protein of unknown function DUF255 [Geobacter daltonii FRC-32]
          Length = 705

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 247/685 (36%), Positives = 344/685 (50%), Gaps = 76/685 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFED  VAK LND FV+IKVDREERPD+D  +M   Q + G GGWPL+V
Sbjct: 80  TCHWCHVMAHESFEDREVAKALNDSFVAIKVDREERPDIDDQFMAVAQMISGSGGWPLNV 139

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF---AIEQLSE 137
            L+PD KP    TY P E + G PG   +L ++   W ++RD + +S +    ++E+L+ 
Sbjct: 140 LLTPDKKPFFAATYLPKERRMGVPGIIDLLERISRFWQRERDKVEESCSTIMASLERLNR 199

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
              A A       EL + A      QL+  YD  +GGFG APKFP P  I  +L      
Sbjct: 200 TEPAYAGG-----ELEEAAF----NQLAAMYDDDWGGFGQAPKFPMPHYISFLL------ 244

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
               K+G   E  +M   TL  M +GGI+D +G G HRYSVD +W VPHFEKMLYDQ  +
Sbjct: 245 -RCWKAGR-PEALQMAEHTLTRMRQGGIYDQLGFGIHRYSVDRQWLVPHFEKMLYDQALV 302

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A  + +AF  T   +Y  + R+IL+Y   +M G  G   SA+DAD   TEG    +EG F
Sbjct: 303 AIAFAEAFQATGKNYYREVVREILNYCLVEMTGIDGGFCSAQDAD---TEG----QEGKF 355

Query: 318 YVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W + EV+++LGE A  LF   + +   GN            F+GKN+L      ++ A
Sbjct: 356 YLWAAAEVKEVLGEEAARLFCRLFDITEKGN------------FEGKNILHLPVSIASFA 403

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            + G+  E +   L + R KL  VR KR RP  D KV+ +WNGL+I++ A+   +   E 
Sbjct: 404 DREGLIAESFKGELIKWRAKLLTVRQKRVRPLRDAKVLTAWNGLLIAALAKGYGVTGDET 463

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y+  AESA + I   L  ++  RL  S+  G +K P FL+DYAF
Sbjct: 464 ----------------YLRAAESAVTIILEKLQTKEG-RLSRSYHLGQAKIPAFLEDYAF 506

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L  GLL+LY+      +L  A+ L      LF    GGG+++   +   VL+R K  +DG
Sbjct: 507 LGWGLLELYQVSLHQGYLFQALRLARDMIRLF-SAPGGGFYDNGMDAEEVLIRQKNAYDG 565

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS++ +NL+RL  I+   K D      EH +  F             +  A D  
Sbjct: 566 AMPSGNSIAAMNLLRLGKIL---KDDSLETAGEHGVGAFLGNALQQPAGYLQLIMAHDYQ 622

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
               +  + L G +   +   +LA  +  +     + H +  D              A  
Sbjct: 623 HA-EKIEITLAGAREGAEIRALLATVNRHFIAGLVLRHAEDGD--------------AGA 667

Query: 677 ARNNFSADKVVALVCQNFSCSPPVT 701
                 A    A +C + +C PPVT
Sbjct: 668 GTMEAPAVGAAAYICASGACRPPVT 692


>gi|284045681|ref|YP_003396021.1| hypothetical protein Cwoe_4232 [Conexibacter woesei DSM 14684]
 gi|283949902|gb|ADB52646.1| protein of unknown function DUF255 [Conexibacter woesei DSM 14684]
          Length = 666

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 253/695 (36%), Positives = 351/695 (50%), Gaps = 81/695 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFED   A L+N+ FV IKVDREERPDVD +YM  VQA+ G GGWPL+
Sbjct: 48  SACHWCHVMERESFEDPQTAALMNERFVCIKVDREERPDVDAIYMDAVQAMTGHGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            F +P+  P   GTYFPP+ ++G P ++ +L  + DAW  +RD +       +  LS   
Sbjct: 108 AFATPEQVPFYAGTYFPPQPRHGLPSWRQVLEAISDAWRARRDEILAQNDRIVAHLSAGA 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             + S   +   L  +A+    + L  + D   GGFGSAPKFP+   I+++L        
Sbjct: 168 RLAPSGAMVDPGLLDDAV----DSLRMAADPVNGGFGSAPKFPQASVIELLL-------- 215

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             + GE    Q + L  L+ MA+GGIHD +GGGF RY+VD  W VPHFEKMLYD   LA 
Sbjct: 216 --RRGE----QTVALDALRAMARGGIHDQLGGGFSRYTVDAAWVVPHFEKMLYDNALLAR 269

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL  + ++ D     +C D LD+  R+M GP G   SA DADS   EG     EG FYV
Sbjct: 270 AYLHGWQVSGDPLLRQVCEDTLDWALREMRGPEGGFHSALDADS---EGV----EGKFYV 322

Query: 320 WTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           W+  E+   LG+  +  +    Y     GN            F+G N+L+    +SA+  
Sbjct: 323 WSLAELRSALGDDELYDVAVAWYGATVAGN------------FEGLNILVRAGSASAAE- 369

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
               P E     L E RR+L   RS R RP LDDK + SWN L+I++ A A  +L     
Sbjct: 370 ----PPE-----LPEIRRRLLAARSTRVRPGLDDKRLTSWNALMIAALAEAGAVL----- 415

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      +R +Y++ A   ASF+   L      RL  S+++G +  PG+L+D+A+ 
Sbjct: 416 -----------ERDDYLDAARGTASFLLDSLATSDG-RLLRSWKDGRATLPGYLEDHAYA 463

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  LL LYE     +W   A  L +     F D E GG+F T  +   ++ R K+  D  
Sbjct: 464 LEALLTLYEATFEERWFTAARALADATIAHFADAEHGGFFMTADDHEQLVARRKDLEDTP 523

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNS +   L+RLA +     +DY R+ AE  +A+        AMA   +  A D   
Sbjct: 524 IPSGNSAAAFGLLRLARLT--GSADYERE-AERVIALLHPLAAGHAMAFAHLLAAID-FQ 579

Query: 618 VPSRKHVVLVGHKSSVD-FENMLAAAHASYDLNKTVIHIDPA-DTEEMDFWEEHNSNNAS 675
           +     V +VG +++    E ++ A        K   H+  A  T E D   E +     
Sbjct: 580 LGEVHEVAIVGDRAAAKPLERVVRA--------KLRPHVVLAGGTGEGDRDAEASVVPLL 631

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             R+     K  A VC+ F+C  PVTDP +L  LL
Sbjct: 632 EGRHAVGG-KPAAYVCERFACRAPVTDPDALAELL 665


>gi|430745763|ref|YP_007204892.1| thioredoxin domain-containing protein [Singulisphaera acidiphila
           DSM 18658]
 gi|430017483|gb|AGA29197.1| thioredoxin domain protein [Singulisphaera acidiphila DSM 18658]
          Length = 811

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 237/608 (38%), Positives = 324/608 (53%), Gaps = 60/608 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++C+WCHVME E F+D  +AKL+N  FV IKVDREERPD+D++YM  +QA +G GGWP+S
Sbjct: 86  SSCYWCHVMERECFKDPQIAKLMNQKFVCIKVDREERPDIDQIYMAALQA-FGNGGWPMS 144

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+PD +P  GGTYFPP+D+ G  GF T+L  V DAW  ++  + +S     + +  +L
Sbjct: 145 MFLTPDGRPFFGGTYFPPKDRNGIRGFPTVLAGVADAWRDEKAQIEESADRLTDLVRRSL 204

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQMMLYH 193
           + S      P  L +       E+L++ +D  +GGFG        PKFP PV +  +L  
Sbjct: 205 AKSNDKRHAP--LTRAVAAQGREELTEQFDPEYGGFGFNPENARRPKFPEPVNLVFLLDE 262

Query: 194 SKKLEDTGKSGEASEGQK-------MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
            ++    GK     EGQ+       MVL TL  MA+GGI D + GG+HRY+    W VPH
Sbjct: 263 HRRGAAAGKK----EGQEASSNALAMVLKTLDQMARGGIRDQLAGGYHRYATSRYWIVPH 318

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           FEKMLYD  QLA+ +L AF LT D  +         ++ R M  P G  +SA D   AET
Sbjct: 319 FEKMLYDNAQLASTHLLAFELTADPRWRLEAESTFAFIARSMTSPEGGFYSAID---AET 375

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
           +G     EG +YVWT  EVE  LG       F + Y LK   N +           K + 
Sbjct: 376 DG----DEGQYYVWTRDEVEKTLGAGPDYEAFAQVYGLKREPNFE-----------KERY 420

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
           VL+E    +  A+ L          +   R KL  VR +RP P LDDKV+ SWNGL+I++
Sbjct: 421 VLLEPRSRADQAATLKTTPAALEATMAPLRAKLLAVRERRPAPLLDDKVLTSWNGLMIAA 480

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
           +A   +IL                   +Y + A+ AA FI   L      RL  S+R G 
Sbjct: 481 YADGFRILHD----------------AKYRQAADKAADFILAKLRSPDG-RLLRSYRLGQ 523

Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 544
           +K  G+L+DYAFL+ GLL L+      K L  A EL +     F D E GG+F T     
Sbjct: 524 AKLAGYLEDYAFLVHGLLRLHAATGDPKRLTQARELTDRMIADFSDPEEGGFFYTADGHE 583

Query: 545 SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 604
           S+L R K+ +DGA PSGNSV++ NLV LAS    ++   Y   A+ +L  F + L     
Sbjct: 584 SLLARPKDPYDGALPSGNSVAIRNLVALASATGEAR---YLDQAQKALDAFSSTLAQNPG 640

Query: 605 AVPLMCCA 612
           ++PL+  A
Sbjct: 641 SLPLLVVA 648


>gi|172058552|ref|YP_001815012.1| hypothetical protein Exig_2546 [Exiguobacterium sibiricum 255-15]
 gi|171991073|gb|ACB61995.1| protein of unknown function DUF255 [Exiguobacterium sibiricum
           255-15]
          Length = 677

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 239/709 (33%), Positives = 359/709 (50%), Gaps = 77/709 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHV+  ESFEDE  A++LND F+SIKVDREERPD
Sbjct: 28  GEEAFAAARSANKPIFLSIGYSTCHWCHVLAHESFEDEETARMLNDRFISIKVDREERPD 87

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YMT  Q + G GGWPLSVF+SPD  P   GTYFP   ++ RP F+ +L ++ + + 
Sbjct: 88  IDQIYMTAAQMMNGQGGWPLSVFMSPDQTPFYIGTYFPKTPQFNRPSFRQVLLQLSEHYR 147

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
              D + + G    +++ +AL+A  + +   D L +  +    +Q  + YD   GGFG+A
Sbjct: 148 TDPDKIKRVG----QEIIQALTAVTTFDS-EDPLDEALVHETFDQAMRQYDVENGGFGTA 202

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L       D  +  E     +MV+ TL  M  GGI DHVG G +RY+V
Sbjct: 203 PKFPSPSLLTFLL-------DYYRFAEDETALQMVMRTLTAMRDGGITDHVGFGLYRYTV 255

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DERW +PHFEKMLYD    A + ++ + ++    +     +I  Y+ RD+  P G  +SA
Sbjct: 256 DERWEIPHFEKMLYDNALFATLCIETYQVSGRERFKQYAEEIFAYIERDLSSPDGAFYSA 315

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADS   EG    +EG FY +T  E+ D+LG+ A+ F   Y   P GN           
Sbjct: 316 EDADS---EG----REGLFYTFTFDELTDLLGQDAV-FPLLYQATPQGN----------- 356

Query: 359 EFKGKNVLIELNDSSASASK-LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
            F+G+ V      S    S      ++  L  L + RR L   RS+R RP  DDKV+ SW
Sbjct: 357 -FEGRIVFRRTGQSIQQLSADRNTAVQDILIQLEQERRTLLLFRSQRTRPFRDDKVLTSW 415

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           N L+IS++A+A ++   E                 Y + A  A +F+  HL D+   RL 
Sbjct: 416 NALMISAYAKAGRVFNDE----------------RYTKFARQALTFLETHLMDDD--RLH 457

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
             +R G  +  G+LDDY+FL    L+L++      +L  AI L       F D E G +F
Sbjct: 458 VRYRQGHIQGNGYLDDYSFLTEAYLELHQTTQHIPYLKQAIRLTERMIGDFSD-EDGSFF 516

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            T+ ED ++L+R K+ +D  +P+GNS +V NL+RL+ +   +    YR  A+ + +   +
Sbjct: 517 FTSFEDETLLMRPKDVYDVVKPAGNSTAVSNLLRLSQLTGRTD---YRDQAQRNFSTLAS 573

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHV----VLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            +K            A +LSV +R  +    ++V  +S  D  + L   H       +++
Sbjct: 574 EIKSQPTGF------ASLLSVYTRTLMEPKELIVLTESYTDVASFLTQLHQRRLPELSLL 627

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                D  E+            +A  +    +  A +C +F C  P T+
Sbjct: 628 VGSKTDLLEI---------APFLATYDAPTQQPTAYLCHDFQCDRPTTN 667


>gi|212538503|ref|XP_002149407.1| DUF255 domain protein [Talaromyces marneffei ATCC 18224]
 gi|210069149|gb|EEA23240.1| DUF255 domain protein [Talaromyces marneffei ATCC 18224]
          Length = 783

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 238/612 (38%), Positives = 333/612 (54%), Gaps = 51/612 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LND F+ IKVDREERPD+D VYM YVQA  G GGWPL+
Sbjct: 68  SACHWCHVMEKESFMSTEVATILNDSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 127

Query: 80  VFLSPDLKPLMGGTYFP-----PEDKYGRP---GFKTILRKVKDAW--------DKKRDM 123
           VFL+PDL+P+ GGTY+P      + ++G     GF  IL K++D W        D  +++
Sbjct: 128 VFLTPDLEPVFGGTYWPGPQASSQSQWGAEGPIGFVDILEKLRDVWQTQQARCLDSAKEI 187

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 183
             Q   FA E       A      L  EL + A     +  +  YD  +GGFG APKF  
Sbjct: 188 TKQLREFAEEGTHTQQGAKGGGEDLEIELIEEAF----QHFASRYDPLYGGFGRAPKFHT 243

Query: 184 PVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
           P  +  ++    +   + D     E      M   TL  +A+GGI DH+G G  RYSV  
Sbjct: 244 PANLSFLIRLGMYPSAVSDIVGQDECVRATAMATNTLLNIARGGIRDHIGHGVARYSVTA 303

Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAE 299
            W +PHFEKMLYDQ QL +VY+DAF  T +        D++ YL  + I    G  +S+E
Sbjct: 304 DWLLPHFEKMLYDQAQLLDVYVDAFRATHEPELLGAVYDLVSYLTSEPIQASTGGYYSSE 363

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHN 358
           DADS  T   T K+EGAFYVWT KE++ +LG+  A +   H+ +   GN  ++  +DPH+
Sbjct: 364 DADSLPTPNDTEKREGAFYVWTMKELKQVLGQRDAGVCARHWGVLADGN--IAPENDPHD 421

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSW 417
           EF  +NVL      S  A + G+  E+ + I+   ++KL D R K R RP LDDK+IV+W
Sbjct: 422 EFMDQNVLSIKVTPSKLAKEFGLSEEEVIKIIKSGKQKLRDYREKIRVRPDLDDKIIVAW 481

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL I + A+AS +L+           +     ++  + A  A  FIR+ L++  + +L 
Sbjct: 482 NGLTIGALAKASVLLEE----------IDKVKAQQCRDSAHKAVEFIRKTLFEPSSGQLW 531

Query: 478 HSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG--- 533
             +R+G     PGF DDYAFL SGL+ +YE      +L +A +LQ   ++ F+   G   
Sbjct: 532 RIYRDGHRGNTPGFADDYAFLTSGLIAMYEATFDDSYLQFAEQLQKHLNQYFMAPGGESG 591

Query: 534 --GGYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
              GY+ T+ E    +P  LLR+K   D A PS N +   NLVRL +++   + D YR+ 
Sbjct: 592 TSAGYYTTSSEPISGEPGPLLRLKSGTDSATPSINGIIARNLVRLGTLL---EDDNYRRL 648

Query: 588 AEHSLAVFETRL 599
           A  + + F   L
Sbjct: 649 ARQTCSTFSVEL 660


>gi|255655589|ref|ZP_05400998.1| hypothetical protein CdifQCD-2_07782 [Clostridium difficile
           QCD-23m63]
 gi|296451580|ref|ZP_06893315.1| thymidylate kinase [Clostridium difficile NAP08]
 gi|296878837|ref|ZP_06902837.1| thymidylate kinase [Clostridium difficile NAP07]
 gi|296259645|gb|EFH06505.1| thymidylate kinase [Clostridium difficile NAP08]
 gi|296430109|gb|EFH15956.1| thymidylate kinase [Clostridium difficile NAP07]
          Length = 678

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 233/696 (33%), Positives = 356/696 (51%), Gaps = 79/696 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++
Sbjct: 54  TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD KP   GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +   
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
              +   L  E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D           KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L 
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             +LDA+ +T    Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY
Sbjct: 281 IAFLDAYKITNKELYKEIAMKTIDYVVREMQDKDGGFYSAQDADS---EG----EEGKFY 333

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
            +   E+ ++LGE     F  ++ +  +GN            F+GK++  LI+       
Sbjct: 334 TFNPLEIIEVLGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK------- 374

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                   E++   +    +K+F+ R +R   H DDK++ SWN L++ +  +A   LK++
Sbjct: 375 ----NKEYERHNEKIDNLSKKVFEYRKERTSLHKDDKILTSWNALMVVALTKAYSTLKND 430

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                            Y++ +     FI  +L +E + RL   +R+G S    +LDDYA
Sbjct: 431 M----------------YLDYSNKCLDFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FLI   ++LYE     K+L  A+ L  +  +LF D E  G++    +  +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCIDLFWDYEKSGFYIYGKDSENLIARPKDLYD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV + NL+RLA I   +K +   + +   L ++   +K           +  M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNKLE---EMSYKQLKLYVNNVKSSPTGYSFYMLSL-M 589

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
             + S K ++ +  K   D          ++  N T +            + E N+    
Sbjct: 590 FELYSTKEIICI-FKEDSDLSAFKELISENFIPNTTFLAKK---------YNEENTIIGF 639

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           +       DK    VCQ+ SCS P+ +   L++++L
Sbjct: 640 LNNYKLKEDKTSYYVCQSNSCSQPINNLQKLKDMIL 675


>gi|358371871|dbj|GAA88477.1| DUF255 domain protein [Aspergillus kawachii IFO 4308]
          Length = 784

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 235/580 (40%), Positives = 321/580 (55%), Gaps = 35/580 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA +LN  F+ IKVDREERPD+D VYM YVQA  G GGWPL+
Sbjct: 66  SACHWCHVMEKESFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 125

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP-----GFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+P  +          GF  IL K+ D W  ++    +S     +Q
Sbjct: 126 VFLTPDLEPVFGGTYWPGPNSSTLTGNETIGFVEILEKLSDVWQTQQLRCRESAKEITKQ 185

Query: 135 LSEALSASASS----NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L E       S     +  ++L    L    +     YD   GGF +APKFP P  +  +
Sbjct: 186 LREFAEEGTHSYQGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFL 245

Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +   + D     E ++   M + TL  MA+GGI DH+G GF RYSV   W +PHF
Sbjct: 246 LRLGIYPTAVADIVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHF 305

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
           EKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS  T
Sbjct: 306 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPT 365

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
              T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++  +DPH+EF  +NV
Sbjct: 366 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNV 423

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
           L      S  A   G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I +
Sbjct: 424 LSVKVTPSRLAKDFGLGEEEVVRIIRTAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGA 483

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A+ S + + E ES         S   +  E A  A SFI+ +L+++ T +L   +R+G 
Sbjct: 484 LAKCSALFE-EIES---------SKAVQCREAAAKAISFIKENLFEKSTGQLWRIYRDGG 533

Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT- 539
               PGF DDYA+LI GLLD+YE      +L +A +LQ   ++ FL   G    GY++T 
Sbjct: 534 RGNTPGFADDYAYLIGGLLDMYEATFDDSYLQFAEQLQKYLNDNFLAYVGTTPAGYYSTP 593

Query: 540 ---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
              T   P  LLR+K   +   P+ N V   NL+RL S++
Sbjct: 594 STMTSGAPGPLLRLKTGTESVTPAVNGVIARNLLRLGSLL 633


>gi|297622269|ref|YP_003703703.1| hypothetical protein [Truepera radiovictrix DSM 17093]
 gi|297163449|gb|ADI13160.1| protein of unknown function DUF255 [Truepera radiovictrix DSM
           17093]
          Length = 704

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 222/535 (41%), Positives = 303/535 (56%), Gaps = 50/535 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE+  +A L+N  FV++KVDREERPDVD VYM+ VQA+ G GGWP++V
Sbjct: 74  ACHWCHVMAHESFENPEIADLMNAHFVNVKVDREERPDVDAVYMSAVQAMTGSGGWPMTV 133

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            L+PD KP  GGTY+PPED+ G PGFK +L  + +AW  +RD + ++       L++   
Sbjct: 134 ALTPDGKPFFGGTYYPPEDRLGHPGFKRVLLSLAEAWRSRRDEVLRAAETLTNHLADLNK 193

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             A+    P  L +  L      L +++D + GGFG APKFP    +  +L   +     
Sbjct: 194 LPAAGEPSPGALGEEVLAEAVRALQRTFDPQHGGFGGAPKFPPHGALAFLLRRPE----- 248

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                  E ++M   TL  MA GGI D +GGGF RYSVD RW VPHFEKMLYD  QL  V
Sbjct: 249 ------PEAREMAYVTLDKMAAGGIFDQLGGGFARYSVDARWLVPHFEKMLYDNAQLVGV 302

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A++ T+   Y  +    L +++R++  P G  +SA DADS   EG    +EG FYVW
Sbjct: 303 YAEAYAQTRRARYREVVEATLAFVQRELTSPEGCFYSALDADS---EG----EEGKFYVW 355

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            + E  D+LGE A L K ++ +   GN            F+G+NVL   +  +A A + G
Sbjct: 356 RADEF-DVLGEDAALAKVYFGVSAAGN------------FEGRNVLFVPHPPAAVAERFG 402

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +        L   +R LF++RS+R RP LDDKV+ SWNGL+I +FARA ++L  +A    
Sbjct: 403 LSEAALAARLARVKRALFEIRSRRTRPGLDDKVLASWNGLMIGAFARAGRVLAEDA---- 458

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                       Y+E A  AA  +R  L  E   RL H+FR G +K  G L+DYA L  G
Sbjct: 459 ------------YLEAARRAARGVRSALLREG--RLWHTFRGGEAKVEGLLEDYALLGLG 504

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           LL+LY       WL+WA+EL       F D E GG+F+T  +  ++++R KE  D
Sbjct: 505 LLELYRATLEGPWLLWALELAEVIAARFTDPE-GGFFSTAADAEALVVRPKELFD 558


>gi|452209206|ref|YP_007489320.1| hypothetical protein MmTuc01_0632 [Methanosarcina mazei Tuc01]
 gi|452099108|gb|AGF96048.1| hypothetical protein MmTuc01_0632 [Methanosarcina mazei Tuc01]
          Length = 690

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 231/684 (33%), Positives = 348/684 (50%), Gaps = 51/684 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           N   WCH+M  ESFEDE VA L+N+ FVSIKVDREERPD+D +YMT  Q + G GGWPL+
Sbjct: 47  NKPDWCHMMAHESFEDEEVAGLMNEAFVSIKVDREERPDIDNIYMTVCQIILGRGGWPLN 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++P  KP   GTY P   ++ + G   ++ ++K+ W+++ + +  S       + E +
Sbjct: 107 IIMTPGKKPFFAGTYIPKNTRFNQIGMLELVPRIKEIWEQQHEEVLDSAEKITSTIQEMI 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             S+        L +  +    E+L  S+D+ +GGF  APKFP P +I  +L + ++  +
Sbjct: 167 KESSGEG-----LGEEVIEEVYEELLSSFDTEYGGFSGAPKFPTPHKISFLLRYWRRSRN 221

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   E   M  +TL  M +GGI+DH+G GFHRYS D  W +PHFEKMLYDQ   A 
Sbjct: 222 -------PEALHMAEYTLDKMRRGGIYDHLGSGFHRYSTDSMWLLPHFEKMLYDQALTAI 274

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T    Y      ILDY+ RD+  P G  +  EDAD         ++EG +Y+
Sbjct: 275 AYTEAYQVTGKDLYKETAEGILDYVLRDLTSPEGGFYCGEDAD-------VEREEGKYYL 327

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +E+  IL  E + L  + + L+  GN +     +      G N+        + A+K
Sbjct: 328 WTLEEIRSILDPEDSELIIKMFNLREEGNFE----EEIRGRETGTNLFYMARSPGSLAAK 383

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           + +P+E+    +   R KL   R +R RP LDDK++  WNGL+I++FA+           
Sbjct: 384 MKIPVEEVEKKVKAAREKLLKARYERKRPSLDDKILTDWNGLMIAAFAKG---------- 433

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
               + V G  R  Y++ AE AA FI   LY      L H +R+G +   G  DDYAFLI
Sbjct: 434 ----YQVFGEQR--YLKAAEKAADFILMALYS-PGDGLLHRYRDGVAGISGTSDDYAFLI 486

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL+LYE G   ++L  A+ L +   E F D   GG + T  +  +++ R KE  D A 
Sbjct: 487 HGLLELYEAGFKMRYLKAAVSLNSELLECFWDPVNGGLYFTANDSEALIFRKKEFMDSAI 546

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           P+GNS  ++NL+RL+ I+A    +   + A+     F  ++            A D    
Sbjct: 547 PTGNSFEMLNLLRLSRIIADPGLE---ETADKLERAFSKQIMKAPSGYTQFLSAFDFRLG 603

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           PS + V++ G   + D E ML    + +  NK +I     +  E+    ++      +  
Sbjct: 604 PSYE-VIISGKAEASDTEQMLKELWSYFVPNKVLIFRPEREKPEITELAKYTEEQVPI-- 660

Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
                 K  A VCQN+ C  P T+
Sbjct: 661 ----EGKATAYVCQNYECQLPTTE 680


>gi|327357546|gb|EGE86403.1| DUF255 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 833

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 242/638 (37%), Positives = 339/638 (53%), Gaps = 64/638 (10%)

Query: 11  KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
           K  R  FL    + CHWCHVME ESF    VA +LN  F+ IK+DREERPD+D+VYM YV
Sbjct: 59  KLNRMVFLSIGYSACHWCHVMEKESFMSPEVAAILNKSFIPIKLDREERPDIDEVYMNYV 118

Query: 68  QALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPG--------FKTILRKVKDAWDK 119
           QA  G GGWPL+VFL+PDL+P+ GGTY+P       P         F  IL K++D W  
Sbjct: 119 QATTGSGGWPLNVFLTPDLEPVFGGTYWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQT 178

Query: 120 KRDMLAQSGAFAIEQLSE-ALSASASSNKLPDELPQNALRLCA---EQLSKSYDSRFGGF 175
           ++    +S     +QL E A   + S  K  D      + L     +  +  +D   GGF
Sbjct: 179 QQLRCRESAKDITKQLREFAEEGTHSKQKAADADEDLEVELLEESYQHFASRFDPVNGGF 238

Query: 176 GSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 232
             APKF  P  +  ++  S+    + D     E S   +M   TL  M++GGIHD +G G
Sbjct: 239 SRAPKFATPANLSFLINLSRYPSAVSDIVGYDECSRALEMATKTLISMSRGGIHDQIGHG 298

Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGP 291
           F RYSV   W +PHFEKMLYDQ QL NVY+DAF    +        DI  Y+    ++ P
Sbjct: 299 FARYSVTADWSLPHFEKMLYDQAQLLNVYVDAFDSAHNPELLGAIYDIATYITSPPILSP 358

Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDL 350
            G  +S+EDADS  T   T K+EGAFYVWT KE + ILG+  A +   H+ + P GN  +
Sbjct: 359 TGGFYSSEDADSLPTPSDTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGN--V 416

Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHL 409
           +R +DPH+EF  +NVL      +  A + G+  E+ + I+   R KL + R SKR RP L
Sbjct: 417 ARGNDPHDEFINQNVLSIKVTPAKLAKEFGLSEEEVVKIIKASREKLREYRESKRVRPGL 476

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           DDK+IVSWNGL I + A+ S +L++          V  +  +E+   AE+AA FIR++L+
Sbjct: 477 DDKIIVSWNGLAIGALAKCSVVLEN----------VDRAKAQEFRLAAENAAKFIRQNLF 526

Query: 470 DEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
           D  + +L   +R+G     PGF DDY++L SGL+DLYE      +L +A +LQ   +  F
Sbjct: 527 DPASGQLWRIYRDGERGDTPGFADDYSYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYF 586

Query: 529 LDR---------------------EGGGYFNT------TGEDPSVLLRVKEDHDGAEPSG 561
           L +                        GY+ T          P+ L R+K   D + PS 
Sbjct: 587 LAQGPTPTPSPRTSITTESTPAPSSSTGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSP 646

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           N V   NL+RL++++   + D Y++ A  ++  F   +
Sbjct: 647 NGVIAQNLLRLSTLL---EDDTYKRLARETVNAFAVEI 681


>gi|225848123|ref|YP_002728286.1| thymidylate kinase [Sulfurihydrogenibium azorense Az-Fu1]
 gi|225644610|gb|ACN99660.1| thymidylate kinase [Sulfurihydrogenibium azorense Az-Fu1]
          Length = 684

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 243/695 (34%), Positives = 363/695 (52%), Gaps = 67/695 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME ESFEDE VA++LN +FV IKVDREERPD+D VYM       G GGWPL+
Sbjct: 51  SSCHWCHVMEKESFEDEEVAEILNKYFVPIKVDREERPDIDAVYMNVCMLFNGSGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
           + ++PD KP   GTYFP   +  R G   +L  V   W + K D++++S     E++   
Sbjct: 111 IIMTPDKKPFFAGTYFPKHSRPNRIGVVDLLLSVAKYWQENKEDLISRS-----EKVLGY 165

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSK 195
           L     SN    EL ++ +      L   +D+ +GGF + PKFP P  I  +L   YH+K
Sbjct: 166 LKEDNKSNY--GELKKDYIHAGFYDLKGRFDNTYGGFSNKPKFPTPHNIMFLLRYYYHTK 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           +           E  +MV  TL  M  GGI+DHVG GFHRYS D +W +PHFEKM YDQ 
Sbjct: 224 E----------EEALQMVEKTLTNMRLGGIYDHVGFGFHRYSTDRQWLLPHFEKMHYDQA 273

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            L   Y + + +TK   Y    ++I++Y+ RDM    G  FSAEDADS   EG    +EG
Sbjct: 274 MLLMAYTETYQITKKDLYKQTVQEIIEYVIRDMTNEEGVFFSAEDADS---EG----EEG 326

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FY WT +E++DIL E + L  + + +K  GN        P     G+N++         
Sbjct: 327 KFYTWTFQEIKDILKEESDLAIKIFNIKEEGNYLEEATGHP----TGRNIIYLSKTLRDY 382

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A  LG+        L + R+KLF  R KR  P  DDKV+  WNGL+I++ ++A K   ++
Sbjct: 383 AIDLGIDENTLKQKLEQIRKKLFKEREKRVHPLKDDKVLTDWNGLMIAALSKAGKAFSNQ 442

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                           +Y+  A+ AA FI  ++  +   +L H +++   K  G LDDYA
Sbjct: 443 ----------------DYISYAQKAADFIIHNMIIDG--KLYHLYKDKEVKIEGMLDDYA 484

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FL+ GL++LY+     K+L  A++L N   +   D + GG+F +  +D  +++  KE  D
Sbjct: 485 FLVWGLIELYQATGELKYLKTAVDLTNKAIQPLYDEKNGGFFLSKSQD--LIVNPKESFD 542

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNSV   NL RL  I A  + ++Y+++ E +L  F   +K +     +   A  M
Sbjct: 543 GAIPSGNSVMAYNLYRLYLITA--QEEFYKKSYE-TLTAFAGDIKRLPSYHTMFLIALMM 599

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
              P+ +  +++  K  ++  N L   +  +  N  +I   P + EE+       S  + 
Sbjct: 600 HFFPTSE--IVISGKGWIEALNQL---NREFLPNTVIIVKTPENKEEL-------SKISH 647

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             ++    +     +C+NF+C+ P  D   + N+L
Sbjct: 648 YTQSMEVPEDFYIYLCKNFACNLPTKDLEYVINML 682


>gi|119495483|ref|XP_001264525.1| hypothetical protein NFIA_013170 [Neosartorya fischeri NRRL 181]
 gi|119412687|gb|EAW22628.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 805

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 241/615 (39%), Positives = 335/615 (54%), Gaps = 52/615 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA LLN+ F+ IKVDREERPD+D VYM YVQA  G GGWPLS
Sbjct: 69  SACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLS 128

Query: 80  VFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+P+L+P+ GGTY+P  +     +    GF  IL K++D W  ++     S      Q
Sbjct: 129 VFLTPNLEPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAKEITRQ 188

Query: 135 L---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L   +E  + S   ++  DE L    L    +  +  YD+  GGF  APKFP P  +  +
Sbjct: 189 LREFAEEGTHSQQGDRQTDEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPANLSFL 248

Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +   + D     E  +   M + TL  MA+GGI DH+G GF RYSV   W +PHF
Sbjct: 249 LRLKTYPSAVSDIVGQEECDKAAAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHF 308

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
           EKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS  T
Sbjct: 309 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDADSLPT 368

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
              T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++   DPH+EF  +NV
Sbjct: 369 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQNV 426

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISS 424
           L      S  A + G+  E+ + I+   ++KL + R + R RP LDDKVIV+WNGL I +
Sbjct: 427 LSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYRETTRVRPDLDDKVIVAWNGLAIGA 486

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A+ S + + E ES         S   +  E A  A +FI+ +L+++ T +L   +R+G 
Sbjct: 487 LAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRIYRDGS 536

Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT-----------------QDE 526
             + PGF DDYA+LI GLLD+YE      +L +A +LQ+                   ++
Sbjct: 537 RGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTHAEYLND 596

Query: 527 LFLDREG---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
            FL   G    GY++T    T   P  LLR+K   + A PS N V   NL+RL++++   
Sbjct: 597 NFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALLEEE 656

Query: 580 KSDYYRQNAEHSLAV 594
           +     +   HS +V
Sbjct: 657 EYRTLARQTCHSFSV 671


>gi|46446752|ref|YP_008117.1| hypothetical protein pc1118 [Candidatus Protochlamydia amoebophila
           UWE25]
 gi|46400393|emb|CAF23842.1| conserved hypothetical protein [Candidatus Protochlamydia
           amoebophila UWE25]
          Length = 718

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 226/568 (39%), Positives = 320/568 (56%), Gaps = 54/568 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG-GWPLS 79
           TCHWCHVME ESFED  VA  +N  FVSIKVDREE P+VD +YM + Q++  G  GWPL+
Sbjct: 85  TCHWCHVMERESFEDIEVADSMNQTFVSIKVDREELPEVDSLYMEFSQSMMAGAAGWPLN 144

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAFAIEQLSEA 138
           V L+PDL+P    TY P    +G  G   +++++ + W  ++R+ +       +E  S+A
Sbjct: 145 VILTPDLQPFFATTYLPSHSSHGMMGLIDLIQRIAELWSSEEREKIITQAEKIVEVFSKA 204

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +  +     +PDE     + + A+ L K  D  +GG   APKFP   +   ML +   ++
Sbjct: 205 VHTTGED--IPDE---EQISITADLLYKMADPTYGGIKGAPKFPIGYQYSFMLRYYANMK 259

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D       S    +V  TL  + +GGI+DH+GGGF RYS+DE+W VPHFEKMLYD   LA
Sbjct: 260 D-------SRALFLVERTLDMLHRGGIYDHLGGGFSRYSIDEKWLVPHFEKMLYDNAILA 312

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL+A+ LTK   Y  + ++IL+Y+ RDM    G  +SAEDADS   EG     EG FY
Sbjct: 313 QSYLEAWQLTKKNLYKEVAQEILNYILRDMTYSDGGFYSAEDADS---EG----HEGFFY 365

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
            W  +EV++ILG+H+ LF E+Y +   GN            F+G+N+L    +    ASK
Sbjct: 366 TWKEEEVKEILGDHSQLFCEYYDITAEGN------------FEGRNILHTPLNLEEFASK 413

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
               +++   I    R+KL+  R KR  P  DDK++ SWNGL+I SFA A         +
Sbjct: 414 HQQDIDQLRIIFDNQRKKLWSAREKRIHPLKDDKILSSWNGLMIYSFAEA---------A 464

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
             F+ P+       Y+E A  AA FI+  L+  Q  +L   +R G +     LD+YAF+I
Sbjct: 465 FTFDCPL-------YLEAAVKAARFIKNKLWKNQ--KLLRRWREGQAMFQAGLDEYAFMI 515

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            G L L+E  +GT+WL WAIE+     + +   E G ++ T G D ++LLR  +  DGAE
Sbjct: 516 KGALSLFEANAGTEWLEWAIEMATLLKDQY-KAEEGAFYQTDGGDKNLLLRKCQFSDGAE 574

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQ 586
           PSGN+V   NL+RL  +   ++ DY  Q
Sbjct: 575 PSGNAVHCENLLRLYQLT--NEEDYLAQ 600


>gi|326474295|gb|EGD98304.1| hypothetical protein TESG_05683 [Trichophyton tonsurans CBS 112818]
 gi|326479253|gb|EGE03263.1| DUF255 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 774

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 227/605 (37%), Positives = 334/605 (55%), Gaps = 42/605 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+
Sbjct: 69  SACHWCHVMEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLN 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
           VFL+PDL+P+ GGTY+P  +    P        GF  +L K++D W+ ++    +S    
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEI 188

Query: 132 IEQLSEALS-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
             QL E        +  + ++  ++L  + L       +  YD+  GGF  +PKFP PV 
Sbjct: 189 TRQLREFAEEGIHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVN 248

Query: 187 IQMMLYHSKKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
           +  +L  S+  E   D     E ++  +M + T+  +A+GGI D +G GF RYSV   W 
Sbjct: 249 LSFLLRLSRYPEEVMDIVGREECAKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWS 308

Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDAD 302
           +PHFEKMLYDQ QL +V++D F  + +        D++ Y+    ++ P G  +S+EDAD
Sbjct: 309 LPHFEKMLYDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSPPILSPMGCFYSSEDAD 368

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
           S  +   T K+EGA+YVWT KE++ ILG+  A +   H+ + P GN  ++R++DPH+EF 
Sbjct: 369 SQPSPEDTEKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFM 426

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGL 420
            +NVL      +  A + G+  E+ + IL   R KL + R +KR RP LDDK+IV+WNGL
Sbjct: 427 NRNVLRIATTPAQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGL 486

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
           VI + A+ + +L+           +     K    +A +A  FI+ +L+D ++ +L   +
Sbjct: 487 VIGALAKCAILLED----------IDAEKSKHCRLMAGNAVKFIKENLFDAESGQLWRIY 536

Query: 481 R-NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG----- 534
           R +     PGF DDYA+LISGLL LYE       L +A +LQ   ++ F+          
Sbjct: 537 RADSRGDTPGFADDYAYLISGLLQLYEATFDDAHLQFADKLQQYLNKYFISVSASDSSIC 596

Query: 535 -GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
            G++ T  E     PS L R+K   D A PS N V   NL+RL+S++         +   
Sbjct: 597 TGFYMTPSEAVTDTPSALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDESYKLKARQTC 656

Query: 590 HSLAV 594
           H+ AV
Sbjct: 657 HAFAV 661


>gi|325107403|ref|YP_004268471.1| hypothetical protein Plabr_0826 [Planctomyces brasiliensis DSM
           5305]
 gi|324967671|gb|ADY58449.1| protein of unknown function DUF255 [Planctomyces brasiliensis DSM
           5305]
          Length = 686

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 227/607 (37%), Positives = 334/607 (55%), Gaps = 51/607 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE++ +A L+N WFV++KVDREERPD+D++YMT VQ + G GGWP+S
Sbjct: 52  SACHWCHVMERESFENDQIAALMNQWFVNVKVDREERPDIDQIYMTAVQLVTGQGGWPMS 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P  +P  GGTY+PP  ++G PGF  IL+K+   W++ R+     GA    +L  A+
Sbjct: 112 VFLAPSGEPFYGGTYWPPTSRHGMPGFADILQKIHQYWEEHREECLAKGA----ELVTAI 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                  +    L ++ LR    +L +S D + GGFG APKFP P++++++L   ++   
Sbjct: 168 DQLHHHEQEKSPLQEDLLRHAQHRLMQSADMQEGGFGHAPKFPHPIDLRVLLRSWRRF-- 225

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               GE  E + +V  TL  MA GGI+DH+ GGF RYS D  W VPHFEKMLYD  QLA 
Sbjct: 226 ----GEV-ESRNVVTLTLDKMADGGIYDHLAGGFARYSTDRYWLVPHFEKMLYDNSQLAT 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+ +  T +  Y+ + R+ LD++ RDM       +S  DADS   EG     EG FYV
Sbjct: 281 AYLEGYQATGEERYAEVVRETLDFVLRDMTSSEHGFYSTLDADS---EGV----EGKFYV 333

Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W+  EV+++L  + A  FK  Y +   GN            ++G N+L         A +
Sbjct: 334 WSEAEVDELLEAKAAEWFKHVYNVSAQGN------------WEGHNILHRTKPLQELAGE 381

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG   E     L + R  L  VR +R  P  D+K+IV+WNGL++S+FA+A +IL      
Sbjct: 382 LGTDRETLSASLMQSRETLLKVREQRIWPGRDEKIIVAWNGLMLSAFAQAGRIL------ 435

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                   G DR  Y + A +AA F+   L  E    L H  ++G ++  GFLDDYA L+
Sbjct: 436 --------GEDR--YTQAACNAADFLLDTLRREDG-SLWHCRKDGRNRFNGFLDDYACLV 484

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL DLY      K+L  A+EL +    LF D E   +  T  +   +++RV++ +D A 
Sbjct: 485 DGLNDLYLTTLEPKYLQAALELADVMQRLFYDDEQKAFHYTPSDHEELVVRVRDRYDSAI 544

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSG ++++  L++L  I    + DY  +  +  L      ++     +     A D+L  
Sbjct: 545 PSGTNLAIHALLKLGWIAG--REDYVTRAGD-CLDSVSGTMRQQPSGMGQAVVALDLLLG 601

Query: 619 PSRKHVV 625
           P+ + ++
Sbjct: 602 PTEEFIL 608


>gi|415885100|ref|ZP_11547028.1| hypothetical protein MGA3_07690 [Bacillus methanolicus MGA3]
 gi|387590769|gb|EIJ83088.1| hypothetical protein MGA3_07690 [Bacillus methanolicus MGA3]
          Length = 625

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 245/689 (35%), Positives = 366/689 (53%), Gaps = 74/689 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFEDE VAKLLN+ FVSIKVDREERPD+D +YM   Q + G GGWPLSVF++PD K
Sbjct: 1   MERESFEDEEVAKLLNERFVSIKVDREERPDIDSIYMNICQLMNGHGGWPLSVFMTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP E +YG PGFK ++ ++ D + K R  + +  + A E L +  SA  SS +
Sbjct: 61  PFFAGTYFPKESRYGVPGFKDVITQLYDQYMKNRSHIEKIASDAAEALKQ--SARESSAE 118

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
           LP     + L    +QL+ S++S +GGFG APKFP P  +  +L + K    TG      
Sbjct: 119 LPS---VDVLHKTYQQLAGSFNSVYGGFGDAPKFPIPHHLMFLLKYYKW---TG----TE 168

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
              KMV  TL  MA GGI+DH+G GF RYSVD  W VPHFEKMLYD   L   Y +A+ +
Sbjct: 169 MALKMVEKTLVSMANGGIYDHIGFGFARYSVDAMWLVPHFEKMLYDNALLLYTYSEAYQV 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           TK+  Y  I   I++++ R+M    G  FSA DADS   EG    +EG +YVW+ +E+ D
Sbjct: 229 TKNSKYKEIAEQIIEFITREMTNEEGAFFSAIDADS---EG----EEGKYYVWSKEEILD 281

Query: 328 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 385
           +LGE    F           C +  ++   N F+GKN+  LI  N    + ++ G+ LE+
Sbjct: 282 VLGEKDGEF----------YCKVYDITSGGN-FEGKNIPNLIHTN-MVKTFAEAGLKLEE 329

Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
               L E R+KLF+ R +R  PHLDDK++ SWN L+I+  A+A +  +++          
Sbjct: 330 GKAKLEESRQKLFEKRQERVYPHLDDKILTSWNALMIAGLAKAGQAFQNQ---------- 379

Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
                 +Y+E AE A  FI   L       L   +R+G SK   +LDD+AFL+   L+LY
Sbjct: 380 ------DYVEKAEKALRFIEEKLM--VNGELMARYRDGESKYSAYLDDWAFLLWAYLELY 431

Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
           E     ++L  A        +LF D + GG++ T  +  ++++R K+ +DGA PSGNSV+
Sbjct: 432 EATFSMEYLDKAQNTAEKMKKLFWDEQDGGFYFTRSDGEALIVREKQVYDGALPSGNSVA 491

Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
            +N +RL      +K   +    +     F+  ++        +  +  +   P  + V+
Sbjct: 492 AVNFLRLGHFTGETK---WFDVVDEIHRFFKDDVESYGPGHTFLLQSLLLKEFPMSEVVI 548

Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA-- 683
           +   +   +   ++  A+           I P  +       ++  +   + +  ++A  
Sbjct: 549 VGTPEKRSELAGIIQKAYTP--------EIAPVTS-------KNQEDLVKIYQRGYTATD 593

Query: 684 DKVVALVCQNFSCSPPVTDPISLENLLLE 712
             +   +C+NF+C  P+ D   LE++L E
Sbjct: 594 SDLTVYICENFTCQKPMND---LEDVLKE 619


>gi|451982157|ref|ZP_21930485.1| conserved hypothetical protein, contains Thioredoxin domain
           [Nitrospina gracilis 3/211]
 gi|451760626|emb|CCQ91765.1| conserved hypothetical protein, contains Thioredoxin domain
           [Nitrospina gracilis 3/211]
          Length = 727

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 235/697 (33%), Positives = 357/697 (51%), Gaps = 64/697 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFE E  AKL+N+ FV+IKVDREERPD+D +YM  V AL G GGWP+S
Sbjct: 53  SSCHWCHVMAHESFESEETAKLMNELFVNIKVDREERPDIDAIYMKSVIALNGHGGWPMS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P +GGTY+PPE K+ RPGF  +L++  D +  ++D +    A  +E+L+   
Sbjct: 113 VFLTPEQEPYLGGTYYPPEPKFNRPGFPQVLQQAADIYRNQKDRMKSVSARLMEKLTTPP 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                     D L   A+ L  E+    +D  +GGFGS  KFP P+   ++L H +K ED
Sbjct: 173 PIPQGQGAGTDALIPQAVELMKEK----FDETYGGFGSGMKFPEPMLYTLLLRHWQKRED 228

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                  ++   M   +L  MA+GG++D VGGGFHRYS D +W VPHFEKMLYD   LA 
Sbjct: 229 -------NDAILMADKSLTKMAEGGMYDQVGGGFHRYSTDRKWLVPHFEKMLYDNALLAR 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           ++++ F  TK   Y  I R++  Y+ R+M  P    +S++DAD       T   EG F+ 
Sbjct: 282 LFVEMFQATKQEIYERIAREVFHYIGREMTSPEWAFYSSQDAD-------TDAGEGHFFT 334

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT KEV DILG  H+ +F   Y +  TGN            F+ +NVL         +  
Sbjct: 335 WTMKEVLDILGPRHSKVFARVYGMTATGN------------FEKRNVLHIAETMEKVSES 382

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G+P+ +  +I+   R+ L + R KR  P  DDK++  WNG++I++FA  + + +     
Sbjct: 383 EGVPIFEVDHIIRNGRQTLLESRGKRQNPGRDDKILTGWNGMMIAAFAAGAVVFRDRV-- 440

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         Y + A  AA F+   ++ +   +L   +++G  +  G L+DYA+ I
Sbjct: 441 --------------YRDHAVQAARFLWDTMWKDG--KLFRVYKDGKVRVDGCLEDYAWFI 484

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL ++E     +W+  A  + +   + F D +  G+F T  +   ++ R+K   D A 
Sbjct: 485 EGLLGVFEATGEGEWIDKAQAVADALIDRFWDDKDNGFFMTAADQEKLITRLKNPEDEAI 544

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML-S 617
           PS N V+ + L +L  +      D Y +    ++  F  R++    A   +  A D + S
Sbjct: 545 PSANGVAALALAKLGRLTG---KDAYFEKGRDTVRAFADRIEHRPTAYTSLLAAMDFIES 601

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +P    V + G +    +  +L A +A Y  +K V+      T +   W E         
Sbjct: 602 LPM--EVTISGPEGDPQYGKLLEAVYADYRPDKLVVRYSGDATVQRVPWAE--------G 651

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
           R   S    V  VC+  +C PPV D  +L N +   P
Sbjct: 652 RGPVSGQPTV-YVCRQGTCYPPVHDAEALMNQMGRPP 687


>gi|327293790|ref|XP_003231591.1| hypothetical protein TERG_07891 [Trichophyton rubrum CBS 118892]
 gi|326466219|gb|EGD91672.1| hypothetical protein TERG_07891 [Trichophyton rubrum CBS 118892]
          Length = 774

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 226/605 (37%), Positives = 333/605 (55%), Gaps = 42/605 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+
Sbjct: 69  SACHWCHVMEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLN 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
           VFL+PDL+P+ GGTY+P  +    P        GF  +L K++D W+ ++    +S    
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNATPLPKLGGEDPVGFIDVLEKLRDVWNTQQLRCRESAKEI 188

Query: 132 IEQLSEALS-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
             QL E        +  + ++  ++L  + L       +  YD+  GGF  +PKFP PV 
Sbjct: 189 TRQLREFAEEGIHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVN 248

Query: 187 IQMMLYHSKKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
           +  +L  S+  E   D     E ++  +M + T+  +A+GGI D +G GF RYSV   W 
Sbjct: 249 LSFLLRLSRYPEEVMDIVGREECAKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWS 308

Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDAD 302
           +PHFEKMLYDQ QL +V++D F  + +        D++ Y+    ++ P G  +S+EDAD
Sbjct: 309 LPHFEKMLYDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSPPILSPKGCFYSSEDAD 368

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
           S  +   T K+EGA+YVWT KE++ ILG+  A +   H+ + P GN  ++R++DPH+EF 
Sbjct: 369 SQPSPEDTEKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFM 426

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGL 420
            +NVL      +  A + G+  E+ + IL   R KL + R +KR RP LDDK+IV+WNGL
Sbjct: 427 NRNVLRIATTPAQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGL 486

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
           VI + A+ + +L+           +     K    +A +A  FI+ +L+D ++ +L   +
Sbjct: 487 VIGALAKCAILLED----------IDAEKSKHCRLMAGNAVKFIKENLFDAESGQLWRIY 536

Query: 481 R-NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG----- 534
           R +     PGF DDYA+LISGLL LYE       L +A +LQ   ++ F+          
Sbjct: 537 RADSRGDTPGFADDYAYLISGLLQLYEATFDDAHLQYADKLQQYLNKYFISVSASDSSIC 596

Query: 535 -GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
            G++ T  E     P  L R+K   D A PS N V   NL+RL+S++         +   
Sbjct: 597 TGFYMTPSEAVTDTPGALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDESYKLKARQTC 656

Query: 590 HSLAV 594
           H+ AV
Sbjct: 657 HAFAV 661


>gi|149174989|ref|ZP_01853613.1| hypothetical protein PM8797T_11454 [Planctomyces maris DSM 8797]
 gi|148846326|gb|EDL60665.1| hypothetical protein PM8797T_11454 [Planctomyces maris DSM 8797]
          Length = 876

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 228/630 (36%), Positives = 340/630 (53%), Gaps = 58/630 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY------GG 73
           ++C+WCHVME   FE+  +AK +N+ FV+IKVDREERPD+D +YMT +   +        
Sbjct: 103 SSCYWCHVMERLVFENPEIAKYMNENFVNIKVDREERPDIDDIYMTSLSVYFHLIGAPDN 162

Query: 74  GGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIE 133
           GGWPLS+FL+PD +P  GGTYFPP D+ G+  F  +L+KV + W   +  + QS     +
Sbjct: 163 GGWPLSMFLTPDREPFAGGTYFPPTDQGGQMSFPRVLQKVNELWSGDKAKVQQSATIIAK 222

Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEI 187
           +++       ++  +P E     ++     ++ S+DS +GG        + PKFP   ++
Sbjct: 223 EVARLQKEEGATEAIPIE--DRLVKAGVRSINASFDSEYGGIDFSEVSPNGPKFPTSSKL 280

Query: 188 QMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
            ++ Y  + ++    S E++   K++  TL  MA GGI+DH+GGGFHRYS D  WHVPHF
Sbjct: 281 VLLQYDIESMDAESTSAESA---KVLYQTLDAMANGGIYDHLGGGFHRYSTDRYWHVPHF 337

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETE 307
           EKMLYD GQLA++Y  A+  T +  Y  +   I+D++ R++    G  +SA D   AET+
Sbjct: 338 EKMLYDNGQLASLYAKAYGQTGNEQYKQVAAGIIDFVLRELTDTQGGFYSALD---AETD 394

Query: 308 GATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
           G     EG  Y W+ +E+++IL E   LF E Y L           ++P   F+   VL 
Sbjct: 395 GV----EGEHYAWSQEELKEILDEGYPLFAEFYGL-----------NEP-VRFEHGYVLH 438

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
            +    A A K     E   + L   R+KL  VR++R     DDK++ SWNGL+I+  A 
Sbjct: 439 RVTTLKALAEKQKTTPEALESQLAAMRKKLHTVRNQRQPLLKDDKILTSWNGLMITGMAN 498

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
           A +ILK                R +Y   AE AA FI   + D+Q H L  S+R   ++ 
Sbjct: 499 AGRILK----------------RPDYTAAAEKAAQFILDQMRDKQGH-LYRSYRADQARL 541

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
             +LDDYAFL+ GLL LYE     +WL  A  L + Q +LF D++  G+F TT +   ++
Sbjct: 542 NAYLDDYAFLVQGLLALYEATGKQQWLDQAQALTDLQIKLFWDQKEHGFFFTTHDHEQLI 601

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA-MAV 606
            R K  +D A PSGNS+S  NL++L  +    K   YRQ+A+ +L +F   +K       
Sbjct: 602 ARTKNAYDAAIPSGNSISTRNLIQLTQLTGDPK---YRQHADQTLQLFGRVIKRYPNRCA 658

Query: 607 PLMCCAADMLSV-PSRKHVVLVGHKSSVDF 635
            L+    + L+  P++K   L+   S   F
Sbjct: 659 QLVQAVGEFLTTPPAQKQSALLAPTSDAGF 688


>gi|296816653|ref|XP_002848663.1| DUF255 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238839116|gb|EEQ28778.1| DUF255 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 781

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 232/610 (38%), Positives = 336/610 (55%), Gaps = 45/610 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+
Sbjct: 69  SACHWCHVMEKESFMSLEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLN 128

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
           VFL+PDL+P+ GGTY+P  +    P        GF  +L K++D W+ ++    +S    
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEI 188

Query: 132 IEQLSEALS-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
             QL E        A A+  +  ++L    L       +  YD+  GGF ++PKFP PV 
Sbjct: 189 TRQLREFAEEGTHLAQANKKEQMEDLEIELLEEAFVHFAARYDATNGGFSTSPKFPTPVN 248

Query: 187 IQMMLYHSKKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
           +  +L  S+  E   D     E ++  +M + TL  +A+GGI D +G GF RYSV   W 
Sbjct: 249 LSFLLRLSRYPEEVMDIVGREECTKATEMAVNTLIKVARGGIRDQIGYGFSRYSVTPDWS 308

Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDAD 302
           +PHFEKMLYDQ QL +VY+D F  + +        D++ Y+    ++ P G  +S+EDAD
Sbjct: 309 LPHFEKMLYDQAQLLDVYIDGFEASHEPELLGAIYDLVTYITSPPILSPMGCFYSSEDAD 368

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
           S  +   T K+EGA+YVWT KE++ ILG   A +   H+ + P GN  ++R++DPH+EF 
Sbjct: 369 SQPSPDDTDKREGAYYVWTLKELKQILGHRDADVCARHWGVLPDGN--VARVNDPHDEFM 426

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGL 420
            +NVL      +  A + G+  E+ + IL   R KL + R +KR RP LDDK+IVSWNGL
Sbjct: 427 NRNVLRIATTPAQVAKEFGLHEEETIRILKNSRVKLREYRETKRVRPELDDKIIVSWNGL 486

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
           VI + A+ + +L+           +     K    +A +A  FI+ +L D ++ +L   +
Sbjct: 487 VIGALAKCAILLED----------IDAEKSKHCKLMASNAVKFIKENLLDAESGQLWRIY 536

Query: 481 R-NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG----- 534
           R +     PGF DDYA+LISGL+ LYE      +L +A +LQ   ++ F+          
Sbjct: 537 RADSRGNTPGFADDYAYLISGLIQLYEATFDDSYLQFADKLQQYLNKYFISVSTSDSSIC 596

Query: 535 -GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
            GY+ T  E     PS L R+K   D A PS N V   NL+RL+S++   + + Y+  A 
Sbjct: 597 TGYYMTPSEAVTNTPSALFRLKTGTDSATPSTNGVIAQNLLRLSSLL---EDESYKVKAR 653

Query: 590 HSLAVFETRL 599
            +   F   +
Sbjct: 654 QTCNAFAVEI 663


>gi|404329401|ref|ZP_10969849.1| hypothetical protein SvinD2_04859 [Sporolactobacillus vineae DSM
           21990 = SL153]
          Length = 731

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 260/708 (36%), Positives = 362/708 (51%), Gaps = 82/708 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED+  A LLN+ +VSIKVDREERPD+D VYM   Q L G GGWPL+
Sbjct: 94  SACHWCHVMAGESFEDQETAALLNENYVSIKVDREERPDIDAVYMKVCQTLTGQGGWPLN 153

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD  P   GTYFP    YG P FK +LR++K  +D+  D +A  G+    Q+  AL
Sbjct: 154 VFLTPDQTPFYAGTYFPLHAAYGHPAFKDVLRELKKQYDQNPDKIAAIGS----QIMTAL 209

Query: 140 S-ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +  S S  KL DE     +R   E LS+++D RFGGFG APKFP P ++  +L       
Sbjct: 210 AKQSRSGRKLTDE----TVRKAYEALSENFDPRFGGFGDAPKFPAPHQLIFLLRFGSL-- 263

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TGK     +   M + TL+ +A+GGI DH+GGGF RY+ D +W VPHFEKMLYDQ  LA
Sbjct: 264 -TGK----KQAMDMAVRTLRALAEGGIRDHIGGGFCRYATDRQWQVPHFEKMLYDQAMLA 318

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             + +A+  T +  +  +   I DY  RD++ P G  + +EDADS   EG    +EG +Y
Sbjct: 319 AAFTEAYQATGEAAFRDVVATIFDYCERDLLSPAGGFYCSEDADS---EG----EEGKYY 371

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           +W   EV  +LG  A LF E Y++   GN      S PH    G ++        A A+ 
Sbjct: 372 LWNPGEVRAVLGADAGLFCEVYHITDAGN--FHGQSIPH--LSGSDL-----GRIAEANH 422

Query: 379 LGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           L +P    LN  L   R KLF  R KR  P  DDK++ SWN L+I+  A A ++L +   
Sbjct: 423 LSLPA---LNQQLAASRHKLFAARQKRVHPFKDDKILTSWNALMIAVLAEAGRVLHN--- 476

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                        K Y+ +A+S   FI  HL  + T  L   +R+  ++   +LDDYAFL
Sbjct: 477 -------------KHYVNLAKSCFHFIDTHLVQDST--LLARYRDEEARFSAYLDDYAFL 521

Query: 498 ISGLLDLYEFGSGTKWL----VWAIELQNTQDELFLDREGGGYFNTTGEDP--SVLLRVK 551
                 +YE      +L    VW   +       F+DRE GG+F    E+P  ++++R K
Sbjct: 522 TLACEAMYEATFDLTYLEKMKVWGDRMTGR----FMDREHGGFFM---EEPQSTLIIRNK 574

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           E +D A PSGNS +V+ L+RL+         +Y   A  + A     + +       M  
Sbjct: 575 EAYDSAVPSGNSAAVLALLRLSERTGDQNYIHYADQAFAAFA---DEVSEYPAGYTFMLS 631

Query: 612 AADM-LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
           A  + LS PS + V L G K        L ++   Y     +   DP            +
Sbjct: 632 ALMLRLSGPS-ELVALQGAKGEAAVAE-LRSSDLPYLPGLALYAGDPCRL---------S 680

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
           + N ++   +  A +     CQNF C  PVT+   L+  L ++   T+
Sbjct: 681 AFNENIGIYSPIAGRTTYFFCQNFICHLPVTEFAKLKTQLNDEAQKTS 728


>gi|242806544|ref|XP_002484765.1| DUF255 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218715390|gb|EED14812.1| DUF255 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 791

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 239/612 (39%), Positives = 335/612 (54%), Gaps = 51/612 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF    VA +LN+ F+ IKVDREERPD+D VYM YVQA  G GGWPL+
Sbjct: 70  SACHWCHVMEKESFMSTEVATILNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 129

Query: 80  VFLSPDLKPLMGGTYFP-----PEDKYGRP---GFKTILRKVKDAW--------DKKRDM 123
           VFL+PDL+P+ GGTY+P      + ++G     GF  IL K++D W        D  +++
Sbjct: 130 VFLTPDLEPVFGGTYWPGPHSSSQSQWGVEGPIGFVDILEKLRDVWQTQQARCLDSAKEI 189

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 183
             Q   FA E       A +    L  EL + A     +  +  YD  +GGFG APKFP 
Sbjct: 190 TKQLREFAEEGTHVQQGAKSGGEDLEIELIEEAF----QHFASRYDPVYGGFGRAPKFPT 245

Query: 184 PVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
           P  +  ++    +   + D     E      M   TL  +A+GGI DH+G G  RYSV  
Sbjct: 246 PANLGFLIRLGMYPTAVSDIVGQDECVRATAMATKTLLNIARGGIRDHIGHGVARYSVTT 305

Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAE 299
            W +PHFEKMLYDQ QL +VY+DAF  T +        D++ YL  + I    G  +S+E
Sbjct: 306 DWLLPHFEKMLYDQAQLLDVYVDAFRATHEPELLGAVYDLVSYLTSEPIQASTGGYYSSE 365

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHN 358
           DADS  +   T K+EGAFYVWT KE++ +LG+  A +   H+ +   GN  ++  +DPH+
Sbjct: 366 DADSLPSPNDTEKREGAFYVWTLKELKQVLGQRDAGVCARHWGVLADGN--IAPENDPHD 423

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSW 417
           EF  +NVL      S  A + G+  E+ + I+   ++KL + R K R RP LDDK+I +W
Sbjct: 424 EFMDQNVLSIKVTPSKLAKEFGLSEEEVIKIIKSGKQKLREYREKARVRPDLDDKIIAAW 483

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL I + A+AS IL  E ++            ++  + A+ A  FI+  L++  T +L 
Sbjct: 484 NGLAIGALAKAS-ILLEEIDTI---------KAQQCRDSAQRAVEFIKTTLFEPSTGQLW 533

Query: 478 HSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL-----DR 531
             +R+G     PGF DDYAFLISGL+ +YE      +L +A +LQ   ++ F+       
Sbjct: 534 RIYRDGSRGNTPGFADDYAFLISGLITMYEATFDDSYLQFAEQLQEHLNKYFIAPGDEPD 593

Query: 532 EGGGYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
              GY+ T+ E    +P  LLR+K   D A PS N +   NLVRL S++   + D YRQ 
Sbjct: 594 TYAGYYTTSSEPIPDEPGPLLRLKSGTDSATPSINGIIARNLVRLGSLL---EDDTYRQL 650

Query: 588 AEHSLAVFETRL 599
           A  + + F   L
Sbjct: 651 ARQTCSTFSVEL 662


>gi|390452556|ref|ZP_10238084.1| hypothetical protein PpeoK3_00885 [Paenibacillus peoriae KCTC 3763]
          Length = 628

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 247/692 (35%), Positives = 357/692 (51%), Gaps = 74/692 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFEDE +A++LN  +VSIKVDREERPDVD +YM+  Q + G GGWPL++ ++PD K
Sbjct: 1   MERESFEDEEIAEILNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLTILMTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTY P E K+GR G   +L KV   W ++ + L       +E   + L+     + 
Sbjct: 61  PFFAGTYLPKEQKFGRIGLLELLDKVGTRWKEQPEEL-------VELSEQVLTEHERQDM 113

Query: 148 LP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
           L     EL + +L     Q S ++D  +GGFG APKFP P  +  +L +++       SG
Sbjct: 114 LAGYRGELDEQSLNKAFHQYSHTFDKEYGGFGEAPKFPAPHNLSFLLRYAQ------HSG 167

Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
              +  +M   TL  M +GGI+DHVG GF RYSVDE+W VPHFEKMLYD   LA  Y + 
Sbjct: 168 N-QQALEMAEKTLDAMYRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNALLAIAYTET 226

Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
           + +T    Y  I   I  Y+ RDM   GG  +SAEDADS   EG    +EG FYVW   E
Sbjct: 227 WQVTGKGLYRQIAEQIFTYIARDMTDVGGAFYSAEDADS---EG----EEGRFYVWNEAE 279

Query: 325 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGM 381
           +  +LG+  A  F + Y + P GN            F+G N+  LI++N   A   K  +
Sbjct: 280 IRAVLGDRDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGLKHDL 326

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
             ++  + + E R KLF VR KR  PH DDK++ SWNGL+I++ A+A +           
Sbjct: 327 TKQELEDRVRELRDKLFAVREKRVHPHKDDKILTSWNGLMIAALAKAGQAFGD------- 379

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
              V+      Y E A+ A SF+  HL      RL   +R+G +  PG+LDDYAF + GL
Sbjct: 380 ---VI------YTERAQKAESFLWNHL-RRANGRLLARYRDGDAAYPGYLDDYAFYVWGL 429

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           ++LY+     ++L  A+ L     +LF D E  G F    +   ++ + KE +DGA PSG
Sbjct: 430 IELYQATFDVQYLQRALTLNQNMIDLFWDEEHHGLFFYGKDSEQLIAKPKEIYDGAIPSG 489

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
           NS++  NLVRLA +   ++ + Y   A      F   +     A   +  +  + +  + 
Sbjct: 490 NSIAAHNLVRLARLTGEARLEDY---AAKQFKAFGGMVSYDPSAYSALLSSL-LYATGTT 545

Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID---PADTEEMDFWEEHNSNNASMAR 678
           K +V+VG +        + A  A +  N  VI  D   PA  + + +  ++   +     
Sbjct: 546 KEIVVVGQRDDPQTLQFIRAIQAGFRPNTVVILKDAGQPAIADIVPYIHDYTLIDG---- 601

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 K    +C++F+C  PVT    L+ LL
Sbjct: 602 ------KPAVYMCEHFACQAPVTSLDDLKALL 627


>gi|350629727|gb|EHA18100.1| hypothetical protein ASPNIDRAFT_47529 [Aspergillus niger ATCC 1015]
          Length = 769

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 237/591 (40%), Positives = 324/591 (54%), Gaps = 46/591 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF  + VA +LN  F+ IKVDREERPD+D VYM YVQA  G GGWPL+
Sbjct: 60  SACHWCHVMEKESFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 119

Query: 80  VFLSPDLKPLMGGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
           VFL+PDL+P+ GGTY+P  +       G  GF  IL K+ D W  ++    +S     +Q
Sbjct: 120 VFLTPDLEPVFGGTYWPGPNSSTLTGNGTIGFVEILEKLSDVWQTQQLRCRESAKEITKQ 179

Query: 135 LSEALSASASS----NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           L E       S     +  ++L    L    +     YD   GGF +APKFP P  +  +
Sbjct: 180 LREFAEEGTHSYQGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFL 239

Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
           L    +   + D     E ++   M + TL  MA+GGI DH+G GF RYSV   W +PHF
Sbjct: 240 LRLGIYPTAVADIVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHF 299

Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
           EKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS  T
Sbjct: 300 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPT 359

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
              T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++  +DPH+EF  +NV
Sbjct: 360 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNV 417

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
           L      S  A   G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I +
Sbjct: 418 LSVKVTPSRLAKDFGLGEEEVVRIIRAAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGA 477

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
            A+ S + + E ES         S   +  E A  A +FI+ +L+++ T +L   +R+G 
Sbjct: 478 LAKCSALFE-EIES---------SKAVQCREAAAKAINFIKENLFEKPTGQLWRIYRDGG 527

Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDEL-----------FLDRE 532
               PGF DDYA+LI GLLD+YE      +L +A +LQ+ +  L           FL   
Sbjct: 528 RGNTPGFADDYAYLIGGLLDMYEATFDDSYLQFAEQLQSKRLALLTFLLEYLNDNFLAYV 587

Query: 533 G---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
           G    GY++T    T   P  LLR+K   + A P+ N V   NL+RL S++
Sbjct: 588 GTTPAGYYSTPSTMTSGAPGPLLRLKTGTESATPAVNGVIARNLLRLGSLL 638


>gi|14548135|gb|AAK66792.1|U40238_13 Highly conserved protein containing a thioredoxin domain
           [uncultured crenarchaeote 4B7]
          Length = 674

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 233/689 (33%), Positives = 364/689 (52%), Gaps = 68/689 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFE++ VAK++N+ FV+IKVDREERPD+D +Y    Q   G GGWPLSV
Sbjct: 49  SCHWCHVMAHESFENDDVAKIMNENFVNIKVDREERPDLDDIYQKICQMSTGQGGWPLSV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ KP   GTYFP  D YGRPGF ++ R++  AW++K   +  S    +  L++   
Sbjct: 109 FLTPEQKPFYVGTYFPVLDSYGRPGFGSLCRQLAQAWNEKPKDVGTSAEQFMSNLTKLEK 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            S        E+ ++ L   A  L +  D+ +GGFG APKFP    +  M  +SK     
Sbjct: 169 VSDGG-----EIEKSILDEAAVNLLQVADTNYGGFGQAPKFPNAANLSFMFRYSK----- 218

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
             SG  ++ Q+  L TL+ MAKGGI D +GGGFHRYS D RW VPHFEKMLYD   L  V
Sbjct: 219 -LSG-ITKFQEFALMTLKKMAKGGIFDQIGGGFHRYSTDARWLVPHFEKMLYDNALLPPV 276

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A+ +TKD FY  +    LDY+ R+M    G  +SA+DAD+   EG T       +VW
Sbjct: 277 YAEAYQITKDPFYLDVVTKTLDYIMREMTSASGLFYSAQDADTNGEEGQT-------FVW 329

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
             +E+E+ILG+ + +F  +Y +   GN            F+G  +L    + S+ + K  
Sbjct: 330 KKREIENILGDDSEIFCIYYDVTDGGN------------FEGNTILANNINISSLSFKFN 377

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
              ++   +L    +KL DVRS R +P  DDK+I SWN ++IS+FA+  +I         
Sbjct: 378 KTEDEITKLLKRSSKKLLDVRSNRDQPGTDDKIITSWNSMMISAFAKGYRI--------- 428

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH-SFRNGPSKAPGFLDDYAFLIS 499
                  S  ++Y+ VA +AA +          H   H +F+N   K  G+LDDY++L++
Sbjct: 429 -------SGNEKYLNVAVNAAKYFSEQF---SKHGFIHRTFKNDTPKLNGYLDDYSYLVN 478

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            L+D++E  S   +L  A ++ +   E F +     ++ T     S+++R K  +D + P
Sbjct: 479 SLIDVFEITSDAYFLDIAQKITHYMIEHFWNETEKSFYFTADTHESLIVRPKNYYDLSVP 538

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSV 618
           SGNSV+   L++L  +V   +   + + ++  L +  T   +   A   +    ++ L  
Sbjct: 539 SGNSVAANALLKLHHLVNDEE---FLKISKQILELNGTSAAENPFAFGYLLNVMNLYLKH 595

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P+   + ++  ++S     ++ + +  +     +I I   D E +    ++         
Sbjct: 596 PTE--ITIINSENS----EIVNSLYKKFIPEGIIIQI--KDEENLKLLSKY----PFFEG 643

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLE 707
             FS DK    +C+NF+CS P+++   +E
Sbjct: 644 KEFS-DKTSVTICKNFTCSLPLSELSKIE 671


>gi|448365504|ref|ZP_21553884.1| hypothetical protein C480_03514 [Natrialba aegyptia DSM 13077]
 gi|445655043|gb|ELZ07890.1| hypothetical protein C480_03514 [Natrialba aegyptia DSM 13077]
          Length = 717

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 244/693 (35%), Positives = 355/693 (51%), Gaps = 57/693 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF DE VA  LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMADESFADETVAAQLNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
            +L+PD KP   GTYFP E K G+PGF  IL  V ++W+  R+ +     Q  A A ++L
Sbjct: 113 AWLTPDGKPFYVGTYFPREAKRGQPGFLDILENVTNSWESDREEIENRADQWTAAATDRL 172

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
            E   A  +S     ++    L   A    +S D  FGGFGS  PKFP+P  ++++   +
Sbjct: 173 EETPDAVGASQPPSSDV----LEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---A 225

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           +  + TG+     E   +++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD 
Sbjct: 226 RAADRTGR----DEFSDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDN 281

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            ++   +L  +  T D  Y+ +  + LD++ R++    G  FS  DA S + E   R +E
Sbjct: 282 AEIPRAFLLGYQQTGDERYAEVVAETLDFVERELTHEAGGFFSTLDAQSEDPETGER-EE 340

Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           GAFYVWT  +V D+L +   A LF   Y +  +GN            F+GKN    +   
Sbjct: 341 GAFYVWTPDDVRDVLADETDAELFCSRYDITESGN------------FEGKNQPNRVASI 388

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
               ++  +P ++    L   RR LF+ R +RPRP+ D+KV+  WNGL+I++ A A+ +L
Sbjct: 389 DDLTNRSELPADETRERLESARRDLFEARERRPRPNRDEKVLAGWNGLMIATCAEAALVL 448

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                         G D  +Y E+A  A +F+R  L+D    RL   +++      G+L+
Sbjct: 449 --------------GED--DYAEMATDALAFVRDRLWDADEQRLSRRYKDHDVAIDGYLE 492

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL  G L  YE       L +A+EL    +  F D   G  + T     S++ R +E
Sbjct: 493 DYAFLARGALGCYEATGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQE 552

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D + PS   V+V  L+ L    AG   ++ R  A   L     RL+  ++    +C A
Sbjct: 553 LGDQSTPSAAGVAVETLLELDGF-AGESGEFERI-ATTVLETHANRLETNSLEHATLCLA 610

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW--EEHN 670
           AD L   + +  +     ++ D         AS  L   +    PA  +E++ W  E   
Sbjct: 611 ADRLESGALEVTI-----AADDLPAEFVEPFASRYLPDRLFARRPATDDELEPWLDELEL 665

Query: 671 SNNASMARNNFSADKVVAL-VCQNFSCSPPVTD 702
           ++  ++     + D    L VC++ +CSPP  D
Sbjct: 666 ADEPAIWAGREARDGEPTLYVCRDRTCSPPTHD 698


>gi|373488750|ref|ZP_09579414.1| protein of unknown function DUF255 [Holophaga foetida DSM 6591]
 gi|372005695|gb|EHP06331.1| protein of unknown function DUF255 [Holophaga foetida DSM 6591]
          Length = 660

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 235/556 (42%), Positives = 315/556 (56%), Gaps = 69/556 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE+  VA  LN  FV IKVDREERPD+D++YM  VQ L G GGWP+S
Sbjct: 48  SACHWCHVMERESFENADVAAFLNKHFVPIKVDREERPDLDELYMGAVQLLAGRGGWPMS 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEA 138
           V+L+P+L+P  GGTYFPP  + G PGF  +L  V   W ++R D+LAQ+G     +L  A
Sbjct: 108 VWLTPELEPFYGGTYFPPVSRGGMPGFLDVLEGVARVWQERRQDVLAQAG-----ELVAA 162

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L A       P    +  L +    LS S+D+R+GGFG APKFP    + ++L       
Sbjct: 163 LRAGRGIGGDPPG--EGLLEVAIRHLSYSFDARWGGFGGAPKFPPIPALTLLLGRGD--- 217

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +   M + TL  MA GGI DH+GGGF RYSVDERW VPHFEKML D  QLA
Sbjct: 218 --------PKALDMAIRTLDAMAAGGIRDHLGGGFARYSVDERWKVPHFEKMLCDNAQLA 269

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VYL+AF +T +V +    R+ILDY   +M    G  FS+EDADS   EG    +EG FY
Sbjct: 270 WVYLEAFRVTGEVRHGERAREILDYFLGEMRDASGGFFSSEDADS---EG----EEGRFY 322

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
            ++  EV+++LG  A LF   Y + P GN +            G+++L  +       S+
Sbjct: 323 TFSWGEVQEVLGPGADLFCRAYGVTPEGNFE-----------GGRSLLHRMEVGDFPESE 371

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           L +            R ++   R +R RPH DDK++V+WNGL +S+ A+ S +L      
Sbjct: 372 LAI-----------LRERIRLYRDRRVRPHRDDKILVAWNGLALSALAKGSALL------ 414

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                   G  R  Y+E AE+ A F++R L+ + T  L  ++R G    PGFL+DY  LI
Sbjct: 415 --------GEPR--YLEAAEACADFLQRELWRDGT--LLRTWRQGRGHTPGFLEDYGALI 462

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLLDLY+ G  ++WL WA EL     E F + E GG+F T   D  V+LR     D A 
Sbjct: 463 LGLLDLYQTGFHSRWLHWAQELGEALLERFHEAE-GGFFGTEALD--VILRQCPVFDHAI 519

Query: 559 PSGNSVSVINLVRLAS 574
           PSGN+++ + L+RL +
Sbjct: 520 PSGNALAALALLRLGN 535


>gi|417766154|ref|ZP_12414108.1| PF03190 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|400351608|gb|EJP03827.1| PF03190 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
          Length = 691

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 252/699 (36%), Positives = 363/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G IFSAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGIFSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|455791360|gb|EMF43176.1| PF03190 family protein [Leptospira interrogans serovar Lora str. TE
           1992]
          Length = 691

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 252/699 (36%), Positives = 363/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G IFSAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGIFSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAISLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|433591712|ref|YP_007281208.1| thioredoxin domain protein [Natrinema pellirubrum DSM 15624]
 gi|448334040|ref|ZP_21523224.1| hypothetical protein C488_11564 [Natrinema pellirubrum DSM 15624]
 gi|433306492|gb|AGB32304.1| thioredoxin domain protein [Natrinema pellirubrum DSM 15624]
 gi|445620768|gb|ELY74256.1| hypothetical protein C488_11564 [Natrinema pellirubrum DSM 15624]
          Length = 731

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 234/696 (33%), Positives = 356/696 (51%), Gaps = 59/696 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEEESFADEAVAEILNENFVPIKVDREERPDVDSIYMTVCQLVRGQGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
            +L+P+ KP   GTYFP + + G+PGF  + +++ D+W+ + D         Q    A +
Sbjct: 113 AWLTPEGKPFFIGTYFPRDGERGQPGFPDLCQRISDSWESEEDREEMQHRAQQWTDAAKD 172

Query: 134 QLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
           +L E   ++     +  E P  + L   A+ + +S D ++GGFG+  KFP+P  ++++  
Sbjct: 173 RLEETPDSAGVDAGVAAEPPSSDVLETAADAVLRSADRQYGGFGTGQKFPQPSRLRVL-- 230

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
            ++  + TG+     E ++++  TL  MA GG+ DHVGGGFHRY VD  W VPHFEKMLY
Sbjct: 231 -ARTYDRTGR----EEYREVLEETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKMLY 285

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D  ++   +L  + LT +  Y+    D L ++ R++    G  FS  DA S + E   R 
Sbjct: 286 DNAEIPRAFLAGYQLTGEDRYAETVADTLAFVDRELTHDEGGFFSTLDAQSEDPETGER- 344

Query: 313 KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           +EGAFYVWT +EV D++ +   A LF   Y +  +GN            F+G+N    + 
Sbjct: 345 EEGAFYVWTPEEVHDVIADETDASLFCARYDITESGN------------FEGQNQPNRIA 392

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
             S  AS+  +   + L  L   R++LF+ R +RPRP  D+K++  WNGL+IS++A A+ 
Sbjct: 393 RVSELASQFDLAESEVLKRLDSARKRLFEAREERPRPDRDEKILAGWNGLMISTYAEAAL 452

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +L              G D  EY E A  A  F+R  L+D ++ RL   ++ G  K  G+
Sbjct: 453 VL--------------GED--EYAETAVDALEFVRDRLWDTESQRLSRRYKAGDVKVDGY 496

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYAFL  G LD Y+       L +A+EL    +  F D + G  + T     S++ R 
Sbjct: 497 LEDYAFLARGALDCYQATGDVDHLAFALELARVIEAEFWDADRGTLYFTPESGESLVTRP 556

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           +E  D + PS   V+V  L+ L         D + + A   L      L+  A+    +C
Sbjct: 557 QELGDQSTPSSTGVAVETLLALDEFA----DDDFSEIAATVLETHANELEANALEHATLC 612

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH- 669
             AD     + +  V     ++ +       A AS      +  + P     ++ W E  
Sbjct: 613 IGADRFEAGALEVTV-----AADELPTEWREAFASRYFPDRLFALRPPTEAGLETWLETL 667

Query: 670 ---NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
              ++      R     +  +  VC++ +CSPP  D
Sbjct: 668 GLADAPPIWAGREARDGEPTL-YVCRDRTCSPPTHD 702


>gi|338812196|ref|ZP_08624385.1| hypothetical protein ALO_08830 [Acetonema longum DSM 6540]
 gi|337275852|gb|EGO64300.1| hypothetical protein ALO_08830 [Acetonema longum DSM 6540]
          Length = 633

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 243/685 (35%), Positives = 358/685 (52%), Gaps = 59/685 (8%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFED+ VA LLN  +++IKVDREERPDVD +YM   QAL G GGWPL++ ++PD  
Sbjct: 1   MERESFEDQEVADLLNQDYIAIKVDREERPDVDHIYMQVCQALTGQGGWPLTIMMTPDKS 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   K+GRPG   IL  +   W ++RD L        E++ +++ A    + 
Sbjct: 61  PFFAGTYFPKNSKWGRPGLMAILTALSQQWRQQRDSLNDYA----EEILKSIDAREPGSP 116

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
               L +  +      L++ +DS +GGF SAPKFP P  +  ++ + +       +GEA 
Sbjct: 117 Y-SLLSEEQVHAAFHGLARYFDSEYGGFSSAPKFPTPHNLLFLMRYWR------HTGEA- 168

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           +   MV  TLQ M +GGI+DH+G GF RYSVD +W VPHFEKMLYD   L  +Y +AF  
Sbjct: 169 KAMDMVEKTLQSMRRGGIYDHLGFGFARYSVDHQWLVPHFEKMLYDNALLCYIYAEAFQA 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T +  Y+ +  +I+ Y++RDM GP G  +SAEDADS   EG    +EG FY+WT +E+  
Sbjct: 229 TGNKEYAQVAEEIIAYVQRDMTGPAGGFYSAEDADS---EG----EEGKFYLWTKEEILR 281

Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEK 385
            LG     +F ++Y++   GN D            G ++L  +  +    A+K+GM  ++
Sbjct: 282 ALGWTQGTIFADYYHVTAEGNFD-----------AGSSILHTIGREPGEYAAKVGMKPDE 330

Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
           +  +L + R KL ++R++R  P  DDKV+ SWN L+I++ A+A+++L             
Sbjct: 331 FQAMLQDGREKLRELRNQRVHPFKDDKVLTSWNALMIAALAKAARVL------------- 377

Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
              D+ +Y+  A  A +FI  HL   Q  RL    R G S    +LDDYA+L+  +++LY
Sbjct: 378 ---DKPQYLFAASQALNFIEIHL-TRQDGRLLARHRAGESAYLAYLDDYAYLLWAVIELY 433

Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
           E      +L  A  L     ELF D + GG+F T  +   ++ R KE +DGA PSGNS +
Sbjct: 434 ETTLSAAYLEMAKGLAGNMVELFWDEKQGGFFFTGSDAEKLISRPKEIYDGATPSGNSAA 493

Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
              L+RLA I   +         E     F   +     A      A D   +P  ++++
Sbjct: 494 AYALLRLARITEDAD---LLTVVERLFEYFAGEVSQAPRAFTFFLMAFDYYLMPP-QNII 549

Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADK 685
           + G K  +   ++L  A   Y     ++   P   E +     H + + +  R+      
Sbjct: 550 IAGVKDDIATVSLLKQARKYYMPEVVLVLNSPDQAETL----RHTAPHVT-GRDRLDG-L 603

Query: 686 VVALVCQNFSCSPPVTDPISLENLL 710
             A VC  FSC  PVT    LE LL
Sbjct: 604 ATAYVCHKFSCQRPVTSVRDLERLL 628


>gi|448339114|ref|ZP_21528145.1| hypothetical protein C487_15484 [Natrinema pallidum DSM 3751]
 gi|445621085|gb|ELY74571.1| hypothetical protein C487_15484 [Natrinema pallidum DSM 3751]
          Length = 727

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 234/694 (33%), Positives = 357/694 (51%), Gaps = 59/694 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE VA+++N+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMAEESFEDEAVAEVINENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW------DKKRDMLAQSGAFAIE 133
            +L+P+ KP   GTYFP E + G+PGF+ + +++ D+W      ++  +   Q    A +
Sbjct: 113 AWLTPEGKPFFIGTYFPREGQRGQPGFRDLCQRISDSWESEEDREEMENRAQQWTDAAKD 172

Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
           QL E    +    + P     + L   A+ + +S D ++GGFGS  KFP+P  ++++   
Sbjct: 173 QLEETPDTAGVGAEPPS---SDVLETAADMVLRSADRQYGGFGSGQKFPQPSRLRVL--- 226

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
           ++  + TG+     E +++   TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD
Sbjct: 227 ARAYDRTGR----EEYREVFEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYD 282

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
             ++   +L  + LT +  Y+ +  + L+++ R++    G  FS  DA S   E   R +
Sbjct: 283 NAEIPRAFLSGYQLTGEDRYATVVSETLEFVDRELTHDEGGFFSTLDAQSESPETGER-E 341

Query: 314 EGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           EGAFYVWT  EV + L +   A LF   + +  +GN            F+G+N    +  
Sbjct: 342 EGAFYVWTPAEVHEALDDETDAALFCARFDISESGN------------FEGRNQPNRVAT 389

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
            S  A +  +   + L  L   R+ LF+ R +RPRP+ D+K++  WNGL+IS++A A+ +
Sbjct: 390 VSELADQFDLAEHEILKRLDSARQTLFEAREERPRPNRDEKILAGWNGLLISTYAEAALV 449

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L              G+D  +Y + A  A  F+R  L+DE   RL   +++G  K  G+L
Sbjct: 450 L--------------GAD--DYADTAVDALEFVRDRLWDEDDQRLSRRYKDGDVKVDGYL 493

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYAFL  G LD Y+       L +A+EL    +  F D + G  + T     S++ R +
Sbjct: 494 EDYAFLARGALDCYQATGEVDHLAFALELARVIEAEFWDADRGTLYFTPESGESLVTRPQ 553

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           E  D + PS   V+V  L+ L    A    D     A   L      L+  A+    +C 
Sbjct: 554 ELGDQSTPSATGVAVETLLALDEFAAEDFEDI----AATVLETHANELESNALEHATLCL 609

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHN 670
           AAD L+  + + V +        + + LA+ +        +  + P   + ++ W E   
Sbjct: 610 AADRLAAGALE-VTVAADDLPTAWRDRLASQY----YPDRLFALRPPTEDGLEAWLETLG 664

Query: 671 SNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
             NA    A      D+    VC+  +CSPP  D
Sbjct: 665 LENAPPIWADREARDDEPTLYVCRERTCSPPTHD 698


>gi|408403905|ref|YP_006861888.1| hypothetical protein Ngar_c12930 [Candidatus Nitrososphaera
           gargensis Ga9.2]
 gi|408364501|gb|AFU58231.1| protein of unknown function DUF255 [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 695

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 253/714 (35%), Positives = 362/714 (50%), Gaps = 101/714 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED+ +AK++N+ F++IKVDREERPD+D +Y    Q   G GGWPLS
Sbjct: 57  SACHWCHVMAHESFEDDEIAKIMNEHFINIKVDREERPDLDDIYQRVCQLATGTGGWPLS 116

Query: 80  VFLSPDLKPLMGGTYFPPE-DKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAF--AIEQL 135
           VFL+PD KP   GTYFP E   Y  PGFKTIL ++  A+  KK+++ A SG F  A+ Q 
Sbjct: 117 VFLTPDQKPFYVGTYFPKEGGHYNMPGFKTILLQLATAYKSKKQEIEAASGEFMDALAQT 176

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
           +  ++  A+       L ++ L   A  L +  D  +GGFG APKFP    +  +L   +
Sbjct: 177 ARDVALGAAGKA---SLERSILDEAAVGLLQMGDPIYGGFGQAPKFPNASNLMFLL---R 230

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             + +G S      +  V FT   MA GGIHD +GGGF RY+ D++W VPHFEKMLYD  
Sbjct: 231 YYDISGMSC----FKDFVAFTADKMAAGGIHDQLGGGFARYATDQKWLVPHFEKMLYDNA 286

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            LA +Y + + +TK   Y  I R  LD++ R+M  P G  +SA+DADS   EG    +EG
Sbjct: 287 LLAQLYSELYQITKAEKYLQITRKTLDFVIREMTHPEGGFYSAQDADS---EG----EEG 339

Query: 316 AFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
            FYVW+ KE+  ILG+ A   +F EHY +   GN            F+GKN+L      S
Sbjct: 340 KFYVWSKKEIASILGDQAATDIFCEHYGVTEGGN------------FEGKNILNVRVPVS 387

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
           +   + G   E+   I+ +   KLF  R KR RP  D+K++ SWNGL+IS FA+   I  
Sbjct: 388 SVGLRYGKTPEQTAQIIADASAKLFAAREKRVRPARDEKILTSWNGLMISGFAKGYGI-- 445

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                         +  ++Y++ A+ A  FI   +      RL H+F++G SK   +LDD
Sbjct: 446 --------------TGDQKYLQAAKDAVKFIETKIVTGDG-RLLHTFKDGKSKLNAYLDD 490

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAF   GLLDL+   S  ++L  A++  +     F D +    F T+ +   +++R K  
Sbjct: 491 YAFYTGGLLDLFAIDSRQEYLDKAVKYTDFMLAHFWDEKEENLFFTSDDHEKLIVRTKSF 550

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           +D A PSGNSV+  NL+RL          +Y QN  +                  + CA 
Sbjct: 551 YDLAIPSGNSVAASNLLRLY---------HYTQNNSY------------------LDCAV 583

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD-TEEMDFWEEH--- 669
            ++   ++        ++   F  ML   +        V  I   D + +M  W      
Sbjct: 584 KIMKASAKP-----AAENPFGFGQMLNTIYLYVKKPVEVTVITRNDHSSKMAEWLNQQFV 638

Query: 670 -NSNNASMARNNFSA------------DKVVALVCQNFSCSPPVTDPISLENLL 710
            +  NA ++ N  ++            D   A VC+NF+CS P+     LE  L
Sbjct: 639 PDGINAIVSTNELASLQKYAYFKGRVGDGETAFVCRNFTCSLPIKSQQELERQL 692


>gi|448363039|ref|ZP_21551643.1| hypothetical protein C481_13364 [Natrialba asiatica DSM 12278]
 gi|445647661|gb|ELZ00635.1| hypothetical protein C481_13364 [Natrialba asiatica DSM 12278]
          Length = 717

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 241/693 (34%), Positives = 355/693 (51%), Gaps = 57/693 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF DE VA  LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMADESFADEAVAAELNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
            +L+P+ KP   GTYFP E K G+PGF  +L  V ++W+  R+ +     Q  A A ++L
Sbjct: 113 AWLTPEGKPFYVGTYFPREAKRGQPGFLDVLENVTNSWESDREEIENRADQWTAAATDRL 172

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
            E   A  +S     ++    L   A    +S D  FGGFGS  PKFP+P  ++++   +
Sbjct: 173 EETPDAVGASQPPSSDV----LEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---A 225

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           +  + TG+     E  ++++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD 
Sbjct: 226 RATDRTGR----DEFSEVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDN 281

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            ++   +L  +  T D  Y+ +  + LD++ R++    G  FS  DA S + E   R +E
Sbjct: 282 AEIPRAFLLGYQQTGDERYAEVVAETLDFVERELTHDAGGFFSTLDAQSEDPETGER-EE 340

Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           GAFYVWT  EVE  + +   A LF+  Y +  +GN            F+G N    +   
Sbjct: 341 GAFYVWTPDEVEAAVTDETDAELFRSRYDITQSGN------------FEGTNQPNRVASI 388

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
              A +  +P ++  + L   RR LF  R +RPRP+ D+KV+  WNGL+I++ A A+ +L
Sbjct: 389 DELADRFDLPADEVEDRLESARRDLFQAREQRPRPNRDEKVLAGWNGLMIATCAEAALVL 448

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                         G D  +Y E+A  A +F+R  L+D    RL   +++      G+L+
Sbjct: 449 --------------GED--DYAEMATDALAFVRERLWDGDEKRLSRRYKDDDVAIDGYLE 492

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL  G L  YE       L +A+EL    +  F D   G  + T     S++ R +E
Sbjct: 493 DYAFLARGALGCYEATGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQE 552

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D + PS   V+V  L++L    AG   ++ R  A   L     RL+  ++    +C A
Sbjct: 553 LGDQSTPSAAGVAVETLLQLDGF-AGESGEFERI-ATTVLETHANRLETNSLEHATLCLA 610

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW--EEHN 670
           AD L   + +  +     ++ +         AS  L   +    PA  +E+  W  E   
Sbjct: 611 ADRLESGALEITI-----AADELPEAFVEPFASRYLPDRLFARRPATDDELAAWLDELEL 665

Query: 671 SNNASMARNNFSADKVVAL-VCQNFSCSPPVTD 702
           ++  ++     + D    L VC++ +CSPP  D
Sbjct: 666 ADEPAIWAGRATRDGEPTLYVCRDRTCSPPTHD 698


>gi|165970642|gb|AAI58572.1| Spata20 protein [Rattus norvegicus]
          Length = 550

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 200/487 (41%), Positives = 289/487 (59%), Gaps = 46/487 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E +  LLN+ FVS+ VDREERPD
Sbjct: 90  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNENFVSVMVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L ++ D W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +  +  S ++   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRVTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QL+ VY  AF ++ D F+S + + IL Y+ R++    G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAFQISGDEFFSDVAKGILQYVTRNLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADS    G  + +EGA Y+WT KEV+ +L E             L  +HY L 
Sbjct: 381 GFYSAEDADSPPERG-VKPQEGALYLWTVKEVQQLLPEPVGGASEPLTSGQLLMKHYGLS 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN + ++  D + E  G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 440 EAGNINPTQ--DVNGEMHGQNVLTVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           RP+ HLD+K++ +WNGL++S FA A  +L  E                + +  A + A F
Sbjct: 498 RPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME----------------KLVTQATNGAKF 541

Query: 464 IRRHLYD 470
           ++RH++D
Sbjct: 542 LKRHMFD 548


>gi|387900736|ref|YP_006331032.1| hypothetical protein MUS_4478 [Bacillus amyloliquefaciens Y2]
 gi|387174846|gb|AFJ64307.1| conserved hypothetical protein YyaL [Bacillus amyloliquefaciens Y2]
          Length = 629

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 244/678 (35%), Positives = 355/678 (52%), Gaps = 66/678 (9%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   KY RPGF  +L  + + +   R          +E ++E  +A      
Sbjct: 61  PFYAGTYFPKTSKYNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKI 112

Query: 148 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
            P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168

Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
           A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLPAYTEAY 225

Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
            +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278

Query: 326 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
            ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + ++L   LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGNELAERLE 334

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
                  E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
                  +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLEL 429

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
           YE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS 
Sbjct: 430 YEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489

Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
           + + L+RL  +  G  S    + AE   +VF+  ++    +      +    ++P +K +
Sbjct: 490 AAVQLLRLGRLT-GDIS--LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEI 545

Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD 684
           V+ G K   D +  + A    +    T++  +  D        E    +   A       
Sbjct: 546 VVFGSKDDPDRKRFIEALQEHFTPAYTILAAEHPD--------ELKGISDFAAGYQMIDG 597

Query: 685 KVVALVCQNFSCSPPVTD 702
           K    +C+NF+C  P TD
Sbjct: 598 KTTVYICENFACRRPTTD 615


>gi|338733047|ref|YP_004671520.1| hypothetical protein SNE_A11520 [Simkania negevensis Z]
 gi|336482430|emb|CCB89029.1| uncharacterized protein yyaL [Simkania negevensis Z]
          Length = 676

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 242/714 (33%), Positives = 360/714 (50%), Gaps = 78/714 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +  FL     TCHWCHVM  ESF +  +A L+N+ F+++KVDREE P+
Sbjct: 29  GDEAFEAAKKLDKPIFLSIGYATCHWCHVMSRESFANSEIATLMNETFINVKVDREELPE 88

Query: 59  VDKVYMTYVQALYGGG-GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           +D +YM + QAL   G GWPL++ L+P+LKP    TY PP  +    G K ++  +K  W
Sbjct: 89  IDSLYMEFAQALMASGSGWPLNLILTPELKPFYATTYMPPTTRQELMGIKELVSHIKQLW 148

Query: 118 DK-KRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG 176
              +R++L       ++    A S      +LP+E     L    EQ  ++ D  +GG  
Sbjct: 149 KSAERELLLDQAEKLVDLF--ARSVQTRGEELPNE---EHLDAAVEQFYEAVDPVYGGIK 203

Query: 177 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
            APKFP   +I   L H+++  D       S        TL  M +GGI+D VGGGF RY
Sbjct: 204 GAPKFPLGYQILFFLEHARREHD-------SRSLFFAELTLSMMHRGGIYDQVGGGFSRY 256

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
           SVDE+W +PHFEKMLYD   +A  +LDA+ LTK   Y  +C +ILDYL RDM   GG  +
Sbjct: 257 SVDEKWIIPHFEKMLYDNALMALAFLDAWKLTKKPLYRQVCEEILDYLLRDMQHQGGGFY 316

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSD 355
           SAED   AET+G    +EGA+Y W ++E++ +L    + LF E++ + P+GN        
Sbjct: 317 SAED---AETDG----EEGAYYTWHAQEIQKLLPPADLDLFCEYFDVTPSGN-------- 361

Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
               F GKNVL         A   G+        L  C   LFD R  R RP  DDK++V
Sbjct: 362 ----FGGKNVLYRTMTIQEFAELRGLDPLMIQTRLDSCLNLLFDARKGRKRPFKDDKILV 417

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
           +WN + I  F +A +  ++EA                Y++   +AASFIR++L+  +  +
Sbjct: 418 TWNAMAIDVFIKAGRAFQNEA----------------YLKSGLAAASFIRQNLW--KGGK 459

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L+  FR G +   G LDDYA+LI  L+ L E   G  WL WA+EL +  ++ F   EG  
Sbjct: 460 LKRRFREGQTDYEGGLDDYAYLIRALITLSEADLGNVWLQWALELADFLEKEFKADEGA- 518

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
            F  TG + S+LLR  E  D A+PSGN++   NL+RL+ +   +++   R  AE  L V 
Sbjct: 519 -FYQTGPEYSILLRRPELFDSAQPSGNAIHAENLIRLSQL---TQNRELRIQAEDILKVA 574

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
            + ++      P   C           H++ + H    +   ++ A      L + ++ +
Sbjct: 575 TSYIE----TYPQGACY----------HLIALQHYLDKEALTIVVALDEKESLKEEILEV 620

Query: 656 DPAD--TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
              +     + FW+ H  ++     N     K    +C++  C  P+T   +L+
Sbjct: 621 LSTEFIPHHVVFWKRH--SDKEFEENIPLEGKTTVYLCKHGKCEAPITSTDALQ 672


>gi|329765558|ref|ZP_08257134.1| hypothetical protein Nlim_0902 [Candidatus Nitrosoarchaeum limnia
           SFB1]
 gi|329137996|gb|EGG42256.1| hypothetical protein Nlim_0902 [Candidatus Nitrosoarchaeum limnia
           SFB1]
          Length = 675

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 218/561 (38%), Positives = 315/561 (56%), Gaps = 49/561 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE+E VAK +N+ F++IKVDREERPD+D +Y    Q   G GGWPLS
Sbjct: 49  SACHWCHVMAHESFENEDVAKFMNENFINIKVDREERPDLDDIYQKVCQIATGQGGWPLS 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD KP   GTYFP  D YGRPGF +I R++  AW +K   + +S    I  L +  
Sbjct: 109 VFLTPDQKPFYVGTYFPVLDSYGRPGFGSICRQLAQAWKEKSKDIEKSADKFIVALQK-- 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                + K+P +L +  L   A  L +  D+ +GGFGSAPKFP    +  +  ++K    
Sbjct: 167 ---TDTVKVPSKLDKTILDEAAMNLFQLGDAAYGGFGSAPKFPNAANVSFLFRYAKL--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG     S+  +  L TL  MA+GGI D +GGGFHRYS D +W VPHFEKMLYD   +  
Sbjct: 221 TG----LSKFNEFALKTLNKMARGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPV 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y++A+ +T+D FY  +    LD++ R+M    G  +SA DADS   EG     EG FYV
Sbjct: 277 NYVEAYQITQDPFYLEVLNKTLDFVLREMTAKNGGFYSAYDADS---EGI----EGKFYV 329

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W   +++ ILG+ + LF  +Y +   GN            ++G N+L    + SA +   
Sbjct: 330 WKKSDIKVILGDDSDLFCLYYDVTDGGN------------WEGNNILCNNINISAVSFHF 377

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           GMP EK   IL  C +KL   RS R  P LDDK++ SWN L+I++FA+   +        
Sbjct: 378 GMPEEKIKKILTMCSQKLLKSRSMRVAPGLDDKILTSWNALMITAFAKGYGV-------- 429

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                   +D  +Y++ A++   FI   L  +   +L  + +NG +K  G+L+DY++  +
Sbjct: 430 --------TDDLKYLDAAKNCIHFIETTLLVDD--KLLRTSKNGITKIDGYLEDYSYFAN 479

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LLD++E    +K+L  A++L N   + F D E   +F T+     +++R K ++D + P
Sbjct: 480 ALLDVFEVEPDSKYLDLALKLGNYLVDHFWDSESSSFFMTSDNHEKLIIRPKSNYDLSLP 539

Query: 560 SGNSVSVINLVRLASIVAGSK 580
           SGNSVS   ++RL  +    K
Sbjct: 540 SGNSVSCSVMLRLYHLTHDEK 560


>gi|418679291|ref|ZP_13240555.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
 gi|400320416|gb|EJO68286.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
          Length = 696

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 247/696 (35%), Positives = 360/696 (51%), Gaps = 68/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 63  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  + F ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG 
Sbjct: 294 FLEILAEYFLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 346

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    +   
Sbjct: 347 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 390

Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           S       K+L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +        
Sbjct: 391 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 444

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA
Sbjct: 445 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 493

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +
Sbjct: 494 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 551

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DG EPS NS    +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A  
Sbjct: 552 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYW 609

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
                SR+ V++   K+S    ++LA   + +  +     ++  + EE           +
Sbjct: 610 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 660

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           S+  +  S    +  VC+NFSC  P+ +   LE  +
Sbjct: 661 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 696


>gi|357632813|ref|ZP_09130691.1| hypothetical protein DFW101_0683 [Desulfovibrio sp. FW1012B]
 gi|357581367|gb|EHJ46700.1| hypothetical protein DFW101_0683 [Desulfovibrio sp. FW1012B]
          Length = 737

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 255/702 (36%), Positives = 345/702 (49%), Gaps = 65/702 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE +A L+    V++KVDREERPD+D +YMT+ QAL G GGWPL+
Sbjct: 80  STCHWCHVMEHESFEDEDIAALMRATVVAVKVDREERPDLDNLYMTFCQALTGRGGWPLN 139

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFP E  +GR G + +L++V  AW   R  +  +    ++ +   L
Sbjct: 140 VFLTPDGQPFFAGTYFPKESGFGRTGMRELLQRVHMAWTSNRQAVIGNATQILDAVRSQL 199

Query: 140 SA-SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            A  A     P E   +A R    +L+ +YD+  GGFG APKFP P  +  +L   ++  
Sbjct: 200 EARDAGETAEPGEAQLDAAR---NELAAAYDAANGGFGGAPKFPSPHNLLFLL---REFR 253

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+     E   MV  TL  M +GG+ D +G G HRYS D  W VPHFEKMLYDQ   A
Sbjct: 254 RTGR----EENLAMVTATLDAMRRGGVFDQIGLGLHRYSTDAHWFVPHFEKMLYDQALTA 309

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
               +A+  T D  +  + RDI +Y+ RD+ GP G  +SAEDADS   EG     EG FY
Sbjct: 310 MAATEAYLATGDAEWRRMARDIFEYVHRDLTGPDGAFYSAEDADS---EGV----EGKFY 362

Query: 319 VWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT  E+  +L G+ A LF + Y + P GN       +   +  G N+       +A A 
Sbjct: 363 VWTESEIRAVLAGDEAGLFMDVYGIAPGGNFH----DEATGQATGANIPFLEEPIAAVAG 418

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K G+   +  + L   R  L   R KR RP  DDKV+   NGL+I++ A+A++       
Sbjct: 419 KKGLGPAELASRLERSRELLLAARQKRVRPLCDDKVLTDMNGLMIAALAKAARAF----- 473

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      D +E    A+ A+ F+   +    + RL H  R G +   G LDDYAFL
Sbjct: 474 -----------DDEELAGRAKRASDFLLAKMLLPDS-RLLHRLRLGEAAVTGMLDDYAFL 521

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             GLL+LY+      +L  A+ L       F D   GG F T  +  ++LLR K  +D A
Sbjct: 522 AWGLLELYQTVFDPAYLAQAVALAKAMVRHFGD-AAGGLFLTPDDGEALLLRQKTYYDAA 580

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA--------MAVPLM 609
            PSGNSV+ + L  L           YR   E S     +RL   A              
Sbjct: 581 IPSGNSVAFLVLTTL-----------YRLTGEKSFMEEASRLARAAGPWVAGHPSGFTFF 629

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
            C    +  PS   V + G   + D   +  A    Y L +  + + PA  E  D  E  
Sbjct: 630 LCGLSQMLAPS-AEVTIAGDPDAPDTHALARALFERY-LPEVAVVLRPAGEEPND--EPD 685

Query: 670 NSNNASMARNNFS-ADKVVALVCQNFSCSPPVTDPISLENLL 710
               A   R      D+  A VC+  SC PP  DP ++  LL
Sbjct: 686 IVALAPFTRFQLPMGDRAAAHVCRAGSCQPPTPDPAAMLALL 727


>gi|225571461|ref|ZP_03780457.1| hypothetical protein CLOHYLEM_07559 [Clostridium hylemonae DSM
           15053]
 gi|225159937|gb|EEG72556.1| hypothetical protein CLOHYLEM_07559 [Clostridium hylemonae DSM
           15053]
          Length = 669

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 240/693 (34%), Positives = 346/693 (49%), Gaps = 94/693 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED+  A +LN+ F+SIKVDREERPD+D VYM+  QAL G GGWP+S
Sbjct: 61  STCHWCHVMAHESFEDKRTADILNENFISIKVDREERPDIDSVYMSVCQALTGSGGWPMS 120

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI------E 133
           +F++ + KP    TY PP+++YG  GF+ +L ++   W  K+  L +S    +      E
Sbjct: 121 IFMTAEQKPFYAATYIPPDNRYGMKGFRELLLEISGHWKYKKSELLESAEQILDHIDTKE 180

Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
           + ++  +           LP+ A    AE  ++++D ++GGFG+APKFP P  +  ++ +
Sbjct: 181 ERAKKKTLKRVGAGTDTTLPERA----AELFAQAFDEKYGGFGAAPKFPTPHNLLFLMIY 236

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
           S  L+D G S EA +       TL+ M +GGI DH+G GF RYS D  + VPHFEKMLYD
Sbjct: 237 S-SLQDAGMSYEAEK-------TLEQMRRGGIFDHIGYGFSRYSTDRFYLVPHFEKMLYD 288

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
              L   Y  A+ ++    +        +Y+ R+M GP GE +SA+DADS   EG    +
Sbjct: 289 NALLMIAYSAAYKVSGKTMFLETAEKTAEYILREMTGPDGEFYSAQDADS---EG----R 341

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG +YVW  +E+  ILG E    F  +Y +   GN            F+GKN+  EL+  
Sbjct: 342 EGLYYVWDEEEICGILGAERGTEFCRYYGITEEGN------------FEGKNIPNELDGK 389

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
             +            +   + R  L+D R +R R HLDDKV+ SWN L+IS+ A    +L
Sbjct: 390 EIT------------DRFHKERELLYDYRKRRARLHLDDKVLTSWNSLMISAMA----VL 433

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                     + V G +R  Y+E AE A  FI  +L D  T R+  S R G     GFLD
Sbjct: 434 ----------YRVTGKER--YLEAAERARRFIEHNLADGNTLRV--SCRGGSGSVKGFLD 479

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYA+  + LL LYE  S    L  A ++     + F D EGGG+F     + S++ R KE
Sbjct: 480 DYAYYTAALLSLYEAVSDVDHLTRAEQICREARQQFADEEGGGFFLYGSRNDSLITRPKE 539

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
            +DGA PSGNS    +LVRL  I    +   Y+  A+  LA      ++      +   A
Sbjct: 540 TYDGALPSGNSTMAYDLVRLYQITGNEE---YKDAAKRQLAFMSGEAQEYPAGYSMFLTA 596

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
             +   P +K  V++                     NK  I         +  + E N  
Sbjct: 597 LLLYENPPQKITVVLADGD-----------------NKEEI------MSRLPLYAEINIL 633

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPIS 705
           +           +    VC+N++C PP  + +S
Sbjct: 634 SGETREYKLLNGRTTYYVCKNYTCLPPSNELMS 666


>gi|383458464|ref|YP_005372453.1| hypothetical protein COCOR_06500 [Corallococcus coralloides DSM
           2259]
 gi|380730954|gb|AFE06956.1| hypothetical protein COCOR_06500 [Corallococcus coralloides DSM
           2259]
          Length = 696

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 241/703 (34%), Positives = 353/703 (50%), Gaps = 70/703 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE   +A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+
Sbjct: 56  SACHWCHVMAHESFEHPDIARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDL+P  GGTYFPP D+YGRPGF  +L  ++DAW+ K D + +      E L E  
Sbjct: 116 VFLTPDLRPFYGGTYFPPSDRYGRPGFPRLLTALRDAWENKADEIEEQAKRFQEGLGEL- 174

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            ++   +  P  L    +    + + K  D   GGFG APKFP P+ + ++L   ++   
Sbjct: 175 -STHGLDAAPAHLSAEDIVAMGQSMLKRMDPVNGGFGGAPKFPNPMNVALLLRAWRR--- 230

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G     +  V  TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL +
Sbjct: 231 ----GGGEPLKAAVFRTLERMALGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLLH 286

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +A  +     +  +  + ++Y+RR+M  P G  ++ +DADS   EG    +EG F+V
Sbjct: 287 LYSEAEQVESRPLWRKVVEETVEYVRREMTDPAGGFYATQDADS---EG----EEGKFFV 339

Query: 320 WTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           W  +EV   L  G+ A     H+ +KP GN +            G  VL  +      A 
Sbjct: 340 WHPEEVRAALSVGQQADTVLRHFGIKPGGNFE-----------HGATVLEVVVPVEQLAK 388

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           + G P+E     L E RR LF +R +R +P  DDK++  WNGL+I   A AS++      
Sbjct: 389 EQGRPVEAVEKELAEARRVLFLLREQRVKPGRDDKILAGWNGLMIRGLALASRVF----- 443

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      DR ++ ++A  AA F+   ++D +  RL  S+++G  +  GFL+DY   
Sbjct: 444 -----------DRPDWAKLAADAADFVLAKMWDGK--RLLRSYQHGQGRIDGFLEDYGDF 490

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
            SGL  LY+     K+L  A  L +   ELF D E   Y +       +++      D A
Sbjct: 491 ASGLTALYQATFDAKYLDAADALAHRAVELFWDEEKQAYLSAPRGQKDLVVAAFSLFDNA 550

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSG S      V L+++   +    +    EH +A    +L    M    +  AAD L 
Sbjct: 551 FPSGASTLTEAQVTLSAL---TGDVCHLDQPEHYVAKLHDQLVRNPMGYGHLGLAADSL- 606

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           V     V   G + +V    +LAAA+ +Y     V             W + ++   +  
Sbjct: 607 VDGASGVTFAGTREAV--APLLAAANRTY---APVFSFG---------WHDTSAPPPARL 652

Query: 678 RNNFSA-----DKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
           +  F        K  A +C+ F C  P+T+   L   L+  P 
Sbjct: 653 QELFEGRDPVEGKGAAYLCRGFVCERPITEQGLLAERLVAAPG 695


>gi|375364488|ref|YP_005132527.1| hypothetical protein BACAU_3798 [Bacillus amyloliquefaciens subsp.
           plantarum CAU B946]
 gi|371570482|emb|CCF07332.1| conserved hypothetical protein YyaL [Bacillus amyloliquefaciens
           subsp. plantarum CAU B946]
          Length = 629

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 248/692 (35%), Positives = 358/692 (51%), Gaps = 78/692 (11%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE +A +LND F+++KVDREERPDVD VYM   Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDEEIAGMLNDKFIAVKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A      
Sbjct: 61  PFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKV 112

Query: 148 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
            P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168

Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
           A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAY 225

Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
            +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278

Query: 326 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
            ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  +  +L   LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE 334

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
                  E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
                  +++ +AE+A  F+ RHL  +   R+   +R G  K  GF DDYAFLI G L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFNDDYAFLIWGYLEL 429

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
           YE G    +L  A  L     ELF D   GG+F T  +  ++L+R KE +DGA PSGNS 
Sbjct: 430 YEAGFHPSYLQKAKTLCTNMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489

Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
           + + L+RL  +          + AE   +VF+  ++    +      +    ++P +K +
Sbjct: 490 AAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAHTMP-QKEI 545

Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA- 683
           V+ G K   D +  + A            H  PA T       EH    A ++  +F+A 
Sbjct: 546 VVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPVELAGIS--DFAAG 591

Query: 684 -----DKVVALVCQNFSCSPPVTDPISLENLL 710
                 K    +C+NF+C  P TD     N+L
Sbjct: 592 YQMIDGKTTVYICENFACRRPTTDIDEAMNIL 623


>gi|80978835|gb|ABB54669.1| SSP411 [Homo sapiens]
          Length = 521

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 196/436 (44%), Positives = 270/436 (61%), Gaps = 30/436 (6%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87  GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W 
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262

Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +  +  S +L   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D FYS + + IL Y+ R +    G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
             +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +          L  +HY L 
Sbjct: 378 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN   S+  DP  E +G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 437 EAGNISPSQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 494

Query: 404 RPRPHLDDKVIVSWNG 419
           RP+PHLD K++ +WNG
Sbjct: 495 RPKPHLDSKMLAAWNG 510


>gi|120603287|ref|YP_967687.1| hypothetical protein Dvul_2244 [Desulfovibrio vulgaris DP4]
 gi|120563516|gb|ABM29260.1| protein of unknown function DUF255 [Desulfovibrio vulgaris DP4]
          Length = 715

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/704 (35%), Positives = 366/704 (51%), Gaps = 57/704 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED  VA+ LN+ FV +KVDREERPD+D +YM   Q L G GGWPL+
Sbjct: 59  STCHWCHVMAHESFEDAEVAQALNEGFVCVKVDREERPDIDALYMNACQMLTGTGGWPLT 118

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLS 136
           +F  PD  P    TY P   + GR G   ++ +V+D +  +R  +  S    A A+ + +
Sbjct: 119 IFALPDGTPFFAATYLPKRSRGGRAGLLDLIPRVRDIYATRRADVEASAADIAKAMRERA 178

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             L  S    + P       LR     L  ++D+  GGFG APKFP P  +  +L H ++
Sbjct: 179 AELLQSPPDGRTP---AAGTLRAAFNDLVANFDTAHGGFGGAPKFPSPHLLLFLLRHGRR 235

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             D       S  Q M L TL+ M +GG+ D +GGG HRYS D RW +PHFEKML+DQ  
Sbjct: 236 TGD-------SRSQDMALATLRGMLRGGLWDRLGGGIHRYSTDARWLLPHFEKMLHDQAM 288

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                 + +  T++           DY+ RDM   GG + +AEDADS   EG  +++EGA
Sbjct: 289 FMLATAETWLATREDDMREAALATADYILRDMALSGGGLAAAEDADSLTPEG--KRREGA 346

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSAS 375
           FY +T  EV +  G++A L    + +   GN       +     +G NVL + L D   +
Sbjct: 347 FYTFTFDEVREAAGDNADLAVRLFGITGEGNI----ADESTGRREGHNVLHLPLGDD--A 400

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+ LG+  ++      +    L  +R+ R RPH DDK++  WNGL I++ AR   +    
Sbjct: 401 ATTLGIDADELAFRHDDILAGLRSLRATRRRPHRDDKLLTDWNGLAIAALARCGHV---- 456

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDD 493
                F+ P           + ++AAS     L  + T    L HS   G    PGFLDD
Sbjct: 457 -----FDAP----------HLTDAAASLADAVLTLQHTPDGGLLHSRFEGTGSTPGFLDD 501

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP-SVLLRVKE 552
           YAF+I GLL+LY   +  +WL  AI LQ+ QD+ FLD   GGY++T  + P +  LR+KE
Sbjct: 502 YAFVIWGLLELYTATNQPQWLEEAIRLQHAQDDRFLDPVDGGYWHTPADAPRTAALRLKE 561

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             DGA PSGN+ +++NL+RLA ++  +    Y + A   +  F ++++   +   +  C 
Sbjct: 562 ARDGALPSGNAAALLNLLRLARLLGDAS---YEEKAHGLIRAFASQVRHNPLGAAMFLCG 618

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT-EEMDFWEEHNS 671
            D  ++   + V++ G   + D E ML A   SY  N TV+H+   +T E +       S
Sbjct: 619 VD-FALTGGRLVIIAGEAQAPDTEAMLDAVRRSYSPN-TVMHLRDGNTAERLAMLAPFTS 676

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISL-ENLLLEKP 714
           + A +        K  A +CQ+ +CS P+ DP +L E L   +P
Sbjct: 677 HLAPI------DGKTTAWLCQDNACSAPIQDPAALAERLAGARP 714


>gi|289582639|ref|YP_003481105.1| hypothetical protein Nmag_2991 [Natrialba magadii ATCC 43099]
 gi|448281932|ref|ZP_21473225.1| hypothetical protein C500_05433 [Natrialba magadii ATCC 43099]
 gi|289532192|gb|ADD06543.1| protein of unknown function DUF255 [Natrialba magadii ATCC 43099]
 gi|445577561|gb|ELY31994.1| hypothetical protein C500_05433 [Natrialba magadii ATCC 43099]
          Length = 722

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 242/692 (34%), Positives = 354/692 (51%), Gaps = 51/692 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 55  SACHWCHVMEDESFADEQVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLS 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLS 136
            +L+P+ KP   GTYFP   K G+PGF  IL  V ++W+  RD +   A+    A +   
Sbjct: 115 AWLTPEGKPFYVGTYFPKNAKRGQPGFLDILENVTNSWEGDRDEVENRAEQWTDAAKDRL 174

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
           E    S S+++ P     + L   A    +S D +FGGFGS  PKFP+P  ++++   + 
Sbjct: 175 EETPDSVSASQPPS---SDVLEAAANASLRSADRQFGGFGSDGPKFPQPSRLRVLARAAA 231

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           +   TG+     + Q + + TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  
Sbjct: 232 R---TGR----DDFQDVFVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNA 284

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            +   +L  +  T D  Y+ +  + L ++ R++    G  FS  DA S + +   R +EG
Sbjct: 285 AIPRAFLVGYQQTGDERYAEVVAETLTFVERELTHEEGGFFSTLDAQSEDPDTGER-EEG 343

Query: 316 AFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           +FYVWT  EV D+L     A LF + Y +  +GN            F+G N    +   S
Sbjct: 344 SFYVWTPDEVHDVLENETDADLFCDRYDITESGN------------FEGSNQPNRVASVS 391

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
             A++  +        L   R KLF  R +RPRP+ D+KV+  WNGL+I++ A A+ +L 
Sbjct: 392 DLAAEYDLDATDVRERLESAREKLFAAREQRPRPNRDEKVLAGWNGLMIATCAEAALVLG 451

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                        G D  EY  +A  A  F+R  L+DE   RL   +++      G+L+D
Sbjct: 452 G------------GEDGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDEDVAIDGYLED 499

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAFL  G L  YE       L +A++L    ++ F D + G  + T     S++ R +E 
Sbjct: 500 YAFLARGALGCYEATGEVDHLAFALDLARVIEDEFWDADRGTLYFTPESGESLVTRPQEL 559

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D + PS   V+V  L+ L   V   + D + + A   L     R++  ++    +C AA
Sbjct: 560 GDQSTPSAAGVAVETLLALEGFV--DQGDEFEEIATTVLETHANRIETNSLEHATLCLAA 617

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHNSN 672
           D L   + +  V     ++ D  +    A A   L   +    PA  +E++ W +E +  
Sbjct: 618 DRLESGALEITV-----AADDLPDEWREAFAGRYLPDRLFARRPATDDELESWLDELDLA 672

Query: 673 NAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
           +A    A    S  +    VC++ +CSPP  D
Sbjct: 673 DAPPIWAGREASDGEPTLYVCRDRTCSPPTHD 704


>gi|46579138|ref|YP_009946.1| hypothetical protein DVU0725 [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|387152533|ref|YP_005701469.1| hypothetical protein Deval_0667 [Desulfovibrio vulgaris RCH1]
 gi|46448551|gb|AAS95205.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|311232977|gb|ADP85831.1| hypothetical protein Deval_0667 [Desulfovibrio vulgaris RCH1]
          Length = 715

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/704 (35%), Positives = 366/704 (51%), Gaps = 57/704 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED  V++ LN+ FV +KVDREERPD+D +YM   Q L G GGWPL+
Sbjct: 59  STCHWCHVMAHESFEDAEVSQALNEGFVCVKVDREERPDIDALYMNACQMLTGTGGWPLT 118

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLS 136
           +F  PD  P    TY P   + GR G   ++ +V+D +  +R  +  S    A A+ + +
Sbjct: 119 IFALPDGTPFFAATYLPKRSRGGRAGLLDLIPRVRDIYATRRADVEASAADIAKAMRERA 178

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             L  S    + P       LR     L  ++D+  GGFG APKFP P  +  +L H ++
Sbjct: 179 AELLQSPPDGRTP---AAGTLRAAFNDLVANFDTAHGGFGGAPKFPSPHLLLFLLRHGRR 235

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             D       S  Q M L TL+ M +GG+ D +GGG HRYS D RW +PHFEKML+DQ  
Sbjct: 236 TGD-------SRSQDMALATLRGMLRGGLWDRLGGGIHRYSTDARWLLPHFEKMLHDQAM 288

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                 + +  T++           DY+ RDM   GG + +AEDADS   EG  +++EGA
Sbjct: 289 FMLATAETWLATREDDMREAALATADYILRDMALSGGGLAAAEDADSLTPEG--KRREGA 346

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSAS 375
           FY +T  EV +  G++A L    + +   GN       +     +G NVL + L D   +
Sbjct: 347 FYTFTFDEVREAAGDNADLAVRLFGITGEGNI----ADESTGRREGHNVLHLPLGDD--A 400

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+ LG+  E+      +    L  +R+ R RPH DDK++  WNGL I++ AR   +    
Sbjct: 401 ATTLGIDAEELAFRHDDILAGLRSLRATRRRPHRDDKLLTDWNGLAIAALARCGHV---- 456

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDD 493
                F+ P           + ++AAS     L  + T    L HS   G    PGFLDD
Sbjct: 457 -----FDAP----------HLTDAAASLADAVLTLQHTPDGGLLHSRFEGTGSTPGFLDD 501

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP-SVLLRVKE 552
           YAF+I GLL+LY   +  +WL  AI LQ+ QD+ FLD   GGY++T  + P +  LR+KE
Sbjct: 502 YAFVIWGLLELYTATNQPQWLEEAIRLQHAQDDRFLDPVDGGYWHTPADAPRTAALRLKE 561

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             DGA PSGN+ +++NL+RLA ++  +    Y + A   +  F ++++   +   +  C 
Sbjct: 562 ARDGALPSGNAAALLNLLRLARLLGDAS---YEEKAHGLIRAFASQVRHNPLGAAMFLCG 618

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT-EEMDFWEEHNS 671
            D  ++   + V++ G   + D E ML A   SY  N TV+H+   +T E +       S
Sbjct: 619 VD-FALTGGRLVIIAGEAQAPDTEAMLDAVRRSYSPN-TVMHLRDGNTAERLAMLAPFTS 676

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISL-ENLLLEKP 714
           + A +        K  A +CQ+ +CS P+ DP +L E L   +P
Sbjct: 677 HLAPI------DGKTTAWLCQDNACSAPIQDPAALAERLAGARP 714


>gi|448352262|ref|ZP_21541053.1| hypothetical protein C484_22028 [Natrialba taiwanensis DSM 12281]
 gi|445631642|gb|ELY84871.1| hypothetical protein C484_22028 [Natrialba taiwanensis DSM 12281]
          Length = 717

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 240/693 (34%), Positives = 346/693 (49%), Gaps = 57/693 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF DE VA  LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMADESFADEAVAAQLNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
            +L+P+ KP   GTYFP E K G+PGF  IL  V ++W+  R+ +     Q  A A ++L
Sbjct: 113 AWLTPEGKPFYVGTYFPREAKRGQPGFLEILENVTNSWENDREEIETRADQWTAAATDRL 172

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
            E   A  +S     ++    L   A    +S D  FGGFGS  PKFP+P  ++++   +
Sbjct: 173 EETPDAVGASQPPSSDV----LEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---A 225

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           +  + TG+     E   +++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD 
Sbjct: 226 RAADRTGR----DEFSDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDN 281

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            ++   +L  +  T D  Y+ +  + LD++ R+++   G  FS  DA S   E   R +E
Sbjct: 282 AEIPRAFLLGYQQTGDERYAEVVAETLDFVERELMHEAGGFFSTLDAQSEAPETGER-EE 340

Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           GAFYVWT  +V D+L +   A LF   Y +  +GN            F+G N    +   
Sbjct: 341 GAFYVWTPDDVRDVLADETDAELFCSRYDITESGN------------FEGTNQPNRVASI 388

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
              A +  +P ++    L   R   F  R +RPRP+ D+KV+  WNGL+I++ A A+ +L
Sbjct: 389 DELADRFDLPTDEVEERLDSARETAFQAREQRPRPNRDEKVLAGWNGLMIATCAEAALVL 448

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                         G D  +Y E+A  A +F+R  L+D    RL   +++      G+L+
Sbjct: 449 --------------GKD--DYAEMATDALAFVRDRLWDADEKRLSRRYKDDDVAIDGYLE 492

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL  G L  YE       L +A+EL    +  F D   G  + T     S++ R +E
Sbjct: 493 DYAFLARGALGCYEATGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQE 552

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D + PS   V+V  L+ L       ++D + + A   L     RL+  ++    +C A
Sbjct: 553 LGDQSTPSAAGVAVETLLELDGFAG--ETDEFERIATTVLETHANRLETNSLEHATLCLA 610

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEH 669
           AD L   + +  +     ++ D         AS  L   +    PA  +E+  W    E 
Sbjct: 611 ADRLESGALEVTI-----AADDLPEEFVEPFASRYLPDRLFARRPATDDELAAWLDELEL 665

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               A  A       K    VC++ +CSPP  D
Sbjct: 666 MDAPAIWAGREARDGKPTLYVCRDRTCSPPTHD 698


>gi|417784564|ref|ZP_12432270.1| PF03190 family protein [Leptospira interrogans str. C10069]
 gi|421127859|ref|ZP_15588077.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|421133342|ref|ZP_15593490.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|409952381|gb|EKO06894.1| PF03190 family protein [Leptospira interrogans str. C10069]
 gi|410022350|gb|EKO89127.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410434326|gb|EKP83464.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
          Length = 691

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KFSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|418670392|ref|ZP_13231763.1| PF03190 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|418689642|ref|ZP_13250763.1| PF03190 family protein [Leptospira interrogans str. FPW2026]
 gi|418725255|ref|ZP_13283931.1| PF03190 family protein [Leptospira interrogans str. UI 12621]
 gi|418729313|ref|ZP_13287860.1| PF03190 family protein [Leptospira interrogans str. UI 12758]
 gi|421118286|ref|ZP_15578631.1| PF03190 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|421121658|ref|ZP_15581951.1| PF03190 family protein [Leptospira interrogans str. Brem 329]
 gi|400361321|gb|EJP17288.1| PF03190 family protein [Leptospira interrogans str. FPW2026]
 gi|409961637|gb|EKO25382.1| PF03190 family protein [Leptospira interrogans str. UI 12621]
 gi|410010134|gb|EKO68280.1| PF03190 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|410345509|gb|EKO96605.1| PF03190 family protein [Leptospira interrogans str. Brem 329]
 gi|410753774|gb|EKR15432.1| PF03190 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|410775491|gb|EKR55482.1| PF03190 family protein [Leptospira interrogans str. UI 12758]
 gi|456824626|gb|EMF73052.1| PF03190 family protein [Leptospira interrogans serovar Canicola
           str. LT1962]
          Length = 691

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|294827769|ref|NP_711139.2| hypothetical protein LA_0958 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|386073252|ref|YP_005987569.1| hypothetical protein LIF_A0779 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|293385614|gb|AAN48157.2| conserved protein containing a thioredoxin domain [Leptospira
           interrogans serovar Lai str. 56601]
 gi|353457041|gb|AER01586.1| conserved protein containing a thioredoxin domain [Leptospira
           interrogans serovar Lai str. IPAV]
          Length = 714

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 78  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 137

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 138 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 197

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 198 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 255

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 256 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 308

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 309 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 361

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 362 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 409

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 410 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 459

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 460 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 509

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 510 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 567

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A   
Sbjct: 568 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 622

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 623 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 672

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 673 KFSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 711


>gi|168703256|ref|ZP_02735533.1| hypothetical protein GobsU_27241 [Gemmata obscuriglobus UQM 2246]
          Length = 698

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/694 (36%), Positives = 357/694 (51%), Gaps = 62/694 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVME ESFEDE  A ++N+ FV IKVDREERPD+D +YMT +Q +   GGGWPL
Sbjct: 53  SACHWCHVMEHESFEDEATAAIMNEHFVCIKVDREERPDLDTIYMTALQVMTREGGGWPL 112

Query: 79  SVFLSPDLKPLMGGTYFPPEDKY---GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
           SVFL+PDLKP   GTY+PP+D+Y   GRPGFK +L  + +AW  +RD + + G   +  L
Sbjct: 113 SVFLAPDLKPFFAGTYYPPDDRYAAQGRPGFKKLLLGIHNAWQTQRDRVHEIGTSVVGDL 172

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
               +   +   +  EL   A       L +SYD RFGGFGS PKFP  +E++++L  S 
Sbjct: 173 QRMGALGDADGPVAPELLAGA----LAALRRSYDPRFGGFGSQPKFPHALELKLLLRLSD 228

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           +  D            MV  TL  MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD  
Sbjct: 229 RFND-------PVALDMVKHTLTTMARGGIYDQLGGGFARYSVDAKWLVPHFEKMLYDNA 281

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            LA+   +A+  T D F+  I R+ LDY+ R+M   GG  FS +DADS   EG    +EG
Sbjct: 282 LLASALAEAYQRTGDPFFQQIGRETLDYVVREMWAEGGAFFSTQDADS---EG----EEG 334

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FYVW+  E+  +LG     F    +    G             F+G+N+L      +  
Sbjct: 335 KFYVWSLDELRAVLGAEDAEFACKVWGATRG-----------GNFEGRNILFRTLSDADE 383

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
               G   E +   L   +  L+  R+KR  P  D+K++ +WNGL+I++FA+        
Sbjct: 384 GKAHGTSEEAFRARLRAVKDTLYAARAKRVWPGRDEKILTAWNGLMIAAFAQ-------- 435

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                F     G D       A+     I R +        + +    P K  G+L+DYA
Sbjct: 436 -----FGMATGGEDAACAAVAADH----ILRTMRTADGRLYRTAGVGQPPKLSGYLEDYA 486

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FL   L+ LYE     KWL  A+EL     + F D  G G+F T  +   ++ R K+ HD
Sbjct: 487 FLADALVTLYEATFEVKWLRAALELAEALLKHFADPNGPGFFFTADDHEELIARTKDLHD 546

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G+ PSGN+V+V  L+RLA++    + D   + AE +L  +   + +   A   M  A D 
Sbjct: 547 GSTPSGNAVAVTVLLRLAALT--GRRD-LAEPAERTLRGYRETMAEHPAASGQMLIALDF 603

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
              P ++ V +VG +        + A  A++   + V   DPA            +  A+
Sbjct: 604 HLGPVQQ-VAIVGPEHDQATRRAIEAVRATFGPRRVVAFHDPASGAP-------PAELAT 655

Query: 676 MARNNFSADKVVAL-VCQNFSCSPPVTDPISLEN 708
           +     + D  V + VC+NF+C  P+T   ++E+
Sbjct: 656 LFEGKEALDGAVTVYVCENFACRAPLTGAEAIES 689


>gi|386875180|ref|ZP_10117368.1| lanthionine synthetase C-like protein, partial [Candidatus
           Nitrosopumilus salaria BD31]
 gi|386807022|gb|EIJ66453.1| lanthionine synthetase C-like protein, partial [Candidatus
           Nitrosopumilus salaria BD31]
          Length = 539

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 214/539 (39%), Positives = 304/539 (56%), Gaps = 49/539 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFE++ VAK +N+ FV+IKVDREERPD+D +Y    Q   G GGWPLS+
Sbjct: 50  SCHWCHVMAHESFENDEVAKFMNENFVNIKVDREERPDIDDIYQKVCQIATGQGGWPLSI 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD KP   GTYFP  D YGRPGF +I R++  AW +K   + +S     E    AL 
Sbjct: 110 FLTPDQKPFYVGTYFPVLDSYGRPGFGSICRQLSQAWKEKPKDIEKSA----ENFLNALH 165

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            + + +  P +L +  L   A  L +  D+ +GGFGSAPKFP    I  +  ++   E T
Sbjct: 166 KTETVHT-PSKLEKIILDEAAMNLFQLGDATYGGFGSAPKFPNAANISFLFRYA---ELT 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G     S+  +  L TL  MAKGGI D +GGGFHRYS D +W VPHFEKMLYD   +   
Sbjct: 222 G----LSKFNEFALKTLNKMAKGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPVN 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y++A+ +TKD FY  + +  LD++ R+M  P G  +SA DADS   EG     EG FYVW
Sbjct: 278 YVEAYQITKDPFYLEVLQKTLDFVLREMTTPEGGFYSAYDADS---EGV----EGKFYVW 330

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
              E+++ILG  A +F   Y +   GN            ++G  +L    + S  A   G
Sbjct: 331 KKSEIKEILGSDADIFCLFYDVTDGGN------------WEGNTILCNNLNISTVAFNFG 378

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
              ++  +IL  C  KL  VRS R  P LDDK++VSWN L+I++FA+             
Sbjct: 379 KSEQEIHDILNSCAEKLLKVRSTRISPGLDDKILVSWNSLMITAFAKG------------ 426

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
             + V G  R  Y+  A+   SFI ++L   +  +LQ +++N  +K  G+L+DY++ I+ 
Sbjct: 427 --YRVTGDQR--YLSAAKDCISFIEKNLLVGE--KLQRTYKNNTAKIDGYLEDYSYFINA 480

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           LLD++E  S  K+L  ++ L N   E F D +   +F T+     +++R K ++D + P
Sbjct: 481 LLDVFEIESDQKYLQLSLNLANYLLEHFWDSDANSFFMTSDNHEKLIIRPKSNYDLSLP 539


>gi|456972139|gb|EMG12591.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. LT2186]
          Length = 699

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 63  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 294 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 346

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 347 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 394

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 395 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 444

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 445 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 494

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 495 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSSAGVFFDTGIDGEVLLRRSVDGYD 552

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A   
Sbjct: 553 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 607

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 608 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 657

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 658 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 696


>gi|452857673|ref|YP_007499356.1| Uncharacterized protein yyaL [Bacillus amyloliquefaciens subsp.
           plantarum UCMB5036]
 gi|452081933|emb|CCP23707.1| Uncharacterized protein yyaL [Bacillus amyloliquefaciens subsp.
           plantarum UCMB5036]
          Length = 629

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 248/684 (36%), Positives = 357/684 (52%), Gaps = 78/684 (11%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDEEIAGILNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A      
Sbjct: 61  PFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQH--------VEDIAENAAAHLEVKV 112

Query: 148 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
            P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168

Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
           A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A 
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAC 225

Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
            +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278

Query: 326 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
            ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  +  +L   LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE 334

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
                  E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
                  +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLEL 429

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
           YE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS 
Sbjct: 430 YEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489

Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
           + + L+RL  +  G  S    + AE   +VF+  ++    +      +    ++P +K +
Sbjct: 490 AAVQLLRLGRLT-GDIS--LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEI 545

Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA- 683
           V+ G K   D +  + A            H  PA T       EH    A ++  +F+A 
Sbjct: 546 VVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPDELAGIS--DFAAG 591

Query: 684 -----DKVVALVCQNFSCSPPVTD 702
                 K    +C+NF+C  P TD
Sbjct: 592 YQLIDGKTTVYICENFACRRPTTD 615


>gi|226356002|ref|YP_002785742.1| hypothetical protein Deide_10920 [Deinococcus deserti VCD115]
 gi|226317992|gb|ACO45988.1| conserved hypothetical protein [Deinococcus deserti VCD115]
          Length = 696

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 214/549 (38%), Positives = 302/549 (55%), Gaps = 43/549 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE  A  +N+ FV +KVDREERPDVD VYMT  QA+ G GGWP++
Sbjct: 62  STCHWCHVMAHESFEDEATAAQMNEHFVCVKVDREERPDVDAVYMTATQAMTGQGGWPMT 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPP+D YG P F+ +L  + +AW   R+ L  +     + + EA 
Sbjct: 122 VFLTPDGEPFYAGTYFPPQDGYGLPSFRRLLASIANAWQNDREKLTGNARALTDHIREAS 181

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
               S   LP    Q A     ++L + +D+  GGFG APKFP P  ++ +L        
Sbjct: 182 RPRPSQGDLPAGFLQQA----PDKLRRVFDADLGGFGGAPKFPAPTLLEFLLTR------ 231

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   EG+ M L TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL  
Sbjct: 232 -------PEGRDMALHTLRRMAAGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLTR 284

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           V + A+  T D  ++ + R+ L YL R+M+ P G  +SA+DAD+    G     EG  + 
Sbjct: 285 VLVQAYQHTDDEDFARLARETLTYLEREMLSPAGGFYSAQDADTPTDHGGV---EGLTFT 341

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPH-NEFKGKNVLIELNDSSASASK 378
           WT  E+  +LG  + L +  Y +   GN       DPH  E+  +NVL         A  
Sbjct: 342 WTPAEIRAVLGGDSALIERVYGVTDQGN-----FLDPHRREYGSRNVLHLPTPLEQLARD 396

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG   + + + + + R +L + R +R +P  DDKV+ SWNGL +++FA A+++L      
Sbjct: 397 LGEDPQAFHSRVDQARARLLEAREQRTQPGTDDKVLTSWNGLALAAFADAARVL------ 450

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                   G  R  Y+E+A   A F+RR L       L+H+F++G ++  G L+D+A   
Sbjct: 451 --------GEPR--YLEIARQNAEFVRRELRLPDG-TLRHTFKDGQARVEGLLEDHALYG 499

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL+ L++ G     L WA EL       F D + G + +T G+   +L R  +  D A 
Sbjct: 500 LGLVALFQAGGDLGHLEWARELWTLVRRDFWDEDAGVFHSTGGQAEPLLSRQVQGFDSAV 559

Query: 559 PSGNSVSVI 567
            S N+ + +
Sbjct: 560 LSDNAAAAL 568


>gi|188585586|ref|YP_001917131.1| hypothetical protein Nther_0959 [Natranaerobius thermophilus
           JW/NM-WN-LF]
 gi|179350273|gb|ACB84543.1| protein of unknown function DUF255 [Natranaerobius thermophilus
           JW/NM-WN-LF]
          Length = 686

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 240/690 (34%), Positives = 350/690 (50%), Gaps = 84/690 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  +A +LN  F+SIKVDREERPD+D +YM+  QAL G GGWPL+
Sbjct: 56  STCHWCHVMEQESFEDHEIAGILNKNFISIKVDREERPDIDAIYMSACQALTGRGGWPLT 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+ D  P   GTYFP E++ G PG K IL KV   W   R  L   G    + +    
Sbjct: 116 VFLNHDKNPFYAGTYFPKENRLGMPGLKDILEKVSSKWQNDRYELINIGNEITQAVEHHF 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
              A     P  + + +L +   QL +++D  +GGFGSAPKFP P  +  +L  YH    
Sbjct: 176 FTHA-----PGNVTEESLHIAFSQLEENFDEEYGGFGSAPKFPSPHNLYFLLRYYHL--- 227

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG          MV  TL  M +GGI+DH+G GF RYS D++W VPHFEKMLYD   L
Sbjct: 228 --TGNES----ALHMVKKTLTSMYRGGIYDHIGYGFCRYSTDKKWLVPHFEKMLYDNALL 281

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A  YL+ + +T++ F+  I ++I  Y+ R++  P G  +SAEDADS   EG    +EG F
Sbjct: 282 AIAYLEVYEITRNNFFKEIAQEIFTYVSRELTSPEGGFYSAEDADS---EG----EEGKF 334

Query: 318 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YV+T +EV ++LGE     F + Y +   GN            F+  N +  L   +   
Sbjct: 335 YVFTPQEVIEVLGEVRGQEFCKQYNITANGN------------FEHGNSIPNLIGKNPEK 382

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L           +KLF+ R +R  P  DDK++ SWNGL+I++ A+ S++L  E 
Sbjct: 383 DEFQKDL-----------KKLFEYREQREHPFKDDKILTSWNGLMIAALAKGSRVLNDE- 430

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y+ +A+S+  FI ++L      RL   +R+G +  PGFLDDYA+
Sbjct: 431 ---------------RYLNMAQSSYRFIEKNLIT-NNQRLLTRYRDGEASIPGFLDDYAY 474

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+ GL++LY       +L  A+   +   +LF D++ GG +    +  +++ R KE  D 
Sbjct: 475 LVWGLIELYNASFEPYYLEKALIFNDEMIKLFWDQDQGGLYLYGHDSETLVSRPKEIDDS 534

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNSV+  NL+ L  +   +  +   + AE  +  F   +    +       A   L
Sbjct: 535 ALPSGNSVATRNLLELFHLTGKTSLE---ELAERQINSFGGSVNKSPIYYTHFLTAV-YL 590

Query: 617 SVPSRKHVVLVG-----HKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
            + + + + +V        +SV  E ++   H +  L            EE+        
Sbjct: 591 VLTTTEEITVVSDPEPDEATSVLVEALIKGFHPNRFLLVKTEDRKGRQLEEL-------- 642

Query: 672 NNASMARN-NFSADKVVALVCQNFSCSPPV 700
             A +  N N   +K    VC++F+C  PV
Sbjct: 643 --APIVNNRNQKDNKPTIYVCKDFTCLTPV 670


>gi|302342409|ref|YP_003806938.1| hypothetical protein Deba_0974 [Desulfarculus baarsii DSM 2075]
 gi|301639022|gb|ADK84344.1| protein of unknown function DUF255 [Desulfarculus baarsii DSM 2075]
          Length = 681

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/688 (36%), Positives = 353/688 (51%), Gaps = 59/688 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFED+ VA LLN  +V++KVDREERPD+D +YMT  QAL G GGWPL+ 
Sbjct: 49  TCHWCHVMAHESFEDQAVADLLNQHYVAVKVDREERPDLDAIYMTACQALSGAGGWPLTA 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAFAIEQLSEAL 139
            L+PD  P + GTYFP   + GRPG   IL +V   W+  +R  + Q+G    ++++ A+
Sbjct: 109 LLTPDGLPFIAGTYFPKTARLGRPGLLEILAEVARRWNGPERARMIQAG----QEVARAI 164

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
              A       +L   AL +   QL +S+D +FGGFG APKFP P  +  +L    +   
Sbjct: 165 QPQAGPKT---DLDPRALGMAYSQLRQSFDDQFGGFGQAPKFPTPHNLLFLLRWQAR--- 218

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                  S+   MV  TL  MA GG+ D VG GFHRYSVD  W  PHFEKMLYDQ  LA 
Sbjct: 219 ----NPGSDALAMVEKTLTAMADGGLFDQVGFGFHRYSVDRPWLTPHFEKMLYDQALLAM 274

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YL+A  LT    ++   R +  Y+   M GP G  ++AEDADS   EG     EG +YV
Sbjct: 275 AYLEAHQLTGREDFAATARQVFTYVLTRMTGPEGGFYAAEDADS---EGV----EGKYYV 327

Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +EV    G+    LF + + +   GN +    S PH     +  L +       A++
Sbjct: 328 WTPQEVLAAAGQADGRLFNDFHGITADGNFEHG-TSIPHR----RQSLADF------ATQ 376

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G+  ++    L   R  L   R +R  P  DDK+I +WNGL+I++ A+A + L  EA +
Sbjct: 377 HGLDADQAAQALERARLALLAARQQRIPPLKDDKIITAWNGLMIAALAKAGQALADEALT 436

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
           A              ++ A +               RL  S R+G +  PGFL+DYAF+I
Sbjct: 437 AAAA-----RAATFILQTARATGG------------RLARSQRDGQASGPGFLEDYAFMI 479

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++L+E       L  A+EL +   ELF D   GGYF +  +   +++R K+D+DGA 
Sbjct: 480 WGLIELFEATFELDHLEAALELTDKCCELFWDEADGGYFFSPADGEKLIMRDKDDYDGAT 539

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           P+GNS   +NL+RLA +    + +   Q    ++A    RL    MA  ++  A D    
Sbjct: 540 PAGNSTMTLNLLRLARLTGRRQLEDMAQQLMQTMAAQTMRLP---MAHTMLLMALDFAQG 596

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P+ K +V+ G K+    + M+A A   +   + ++   P   E         +     A 
Sbjct: 597 PT-KEIVICGAKNDPAAQAMIAKAQQKFIPARALLWRPPEGPEAARL----AALAPFTAG 651

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISL 706
                 +  A VCQ+  C+ PVTDP  L
Sbjct: 652 MTTVGGRATAYVCQDHVCARPVTDPDEL 679


>gi|418701443|ref|ZP_13262368.1| PF03190 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|410759525|gb|EKR25737.1| PF03190 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
          Length = 691

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/699 (35%), Positives = 361/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASEFSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  SDYYR+ AE     F   L   A+  P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALNYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|418710447|ref|ZP_13271218.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|410769383|gb|EKR44625.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
          Length = 691

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSSAGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|418715817|ref|ZP_13275928.1| PF03190 family protein [Leptospira interrogans str. UI 08452]
 gi|410788318|gb|EKR82040.1| PF03190 family protein [Leptospira interrogans str. UI 08452]
          Length = 691

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++A+   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAKETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|428281760|ref|YP_005563495.1| hypothetical protein BSNT_06256 [Bacillus subtilis subsp. natto
           BEST195]
 gi|291486717|dbj|BAI87792.1| hypothetical protein BSNT_06256 [Bacillus subtilis subsp. natto
           BEST195]
          Length = 629

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 235/690 (34%), Positives = 354/690 (51%), Gaps = 89/690 (12%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    +A +    
Sbjct: 61  PFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG- 119

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
               L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+     
Sbjct: 120 ----LSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHNTGQENALY 172

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
              K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +
Sbjct: 173 NVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQV 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+  
Sbjct: 229 TQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILK 281

Query: 328 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELN------DSSASASK 378
            LG+    L+ + Y +   GN            F+GKN+  LI         D+  +  +
Sbjct: 282 TLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKWEQIKADAGLTEKE 329

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           L + LE       E R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +     
Sbjct: 330 LSLKLE-------EARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ----- 377

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                        +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL+
Sbjct: 378 -----------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFLL 424

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA 
Sbjct: 425 WAYLDLYEASFDLSYLQKAKKLTDDIISLFWDEEHGGFYFTGHDAEALIVREKEVYDGAV 484

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV+ + L+RL  +   S      + AE   +VF+  +            +     +
Sbjct: 485 PSGNSVAAVQLLRLGQVTGDSS---LIEKAETMFSVFKPDIDAYPSGHAFFMQSVLRHLM 541

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P +K +V+ G       + ++     ++  N +++              EH      +A 
Sbjct: 542 P-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA- 587

Query: 679 NNFSAD------KVVALVCQNFSCSPPVTD 702
             F+AD      K    +C+NF+C  P T+
Sbjct: 588 -PFAADYRIIDGKTTVYICENFACQQPTTN 616


>gi|441505288|ref|ZP_20987276.1| Thymidylate kinase [Photobacterium sp. AK15]
 gi|441427143|gb|ELR64617.1| Thymidylate kinase [Photobacterium sp. AK15]
          Length = 732

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 242/688 (35%), Positives = 362/688 (52%), Gaps = 59/688 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED  VA LLN  FV+IKVDREERPD+D+++M   Q++ GGGGWPL+ 
Sbjct: 72  TCHWCHVMERESFEDTEVAALLNRDFVAIKVDREERPDIDQLHMAACQSMTGGGGWPLNC 131

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            L+P+ +     TY P + +YGRPG   ++  +  AW K+RD+L  +GA  + +  +ALS
Sbjct: 132 VLTPEGQVFYATTYLPKQGQYGRPGMMELIPTIALAWQKQRDVLL-NGAIQLNKQLQALS 190

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             +++  L + +   A  L  EQ   ++D   GGFG APKFP P +   +L +  +   T
Sbjct: 191 GVSAAGVLDENIEHQAY-LWFEQ---TFDPEHGGFGDAPKFPLPHQYFFLLRYWYR---T 243

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+    S    MV  +LQ M  GG+ DH+G GFHRYS D  W VPHFEKMLYDQ  L   
Sbjct: 244 GQRQALS----MVEESLQAMRLGGLFDHIGYGFHRYSTDNCWLVPHFEKMLYDQSLLLMA 299

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A++ T + FY     ++++YL+  M+ P G  FSAEDADS   EG    +EG FY+W
Sbjct: 300 YSEAYAATGNEFYKQTAEEVVEYLKSRMLHPDGGFFSAEDADS---EG----EEGKFYIW 352

Query: 321 TSKEVEDILGEHAILF-KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
             +E++ +L E  + + ++HY + P GN     + +      G N+L        SA K 
Sbjct: 353 RYEELKAVLEESELTWLEQHYCIFPQGN----YVDEVSGRMTGANILHLSMHPLVSADKK 408

Query: 380 G------MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
           G         E + N     R+KL+  R +R  P LDDKV+  WNGL I++ AR S ++ 
Sbjct: 409 GKVDHDKATPECWRNQWQLIRQKLYQHRERREHPLLDDKVLSDWNGLTIAALARCSLLI- 467

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                          D  + +E+A  A  FIR +L DE +H L   +RNG +  P  LDD
Sbjct: 468 ---------------DSSDCLEMARKAFEFIRLNLVDENSH-LMKRYRNGNAGLPAHLDD 511

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA LI   L+L++      +L  A+       + F D +  G++ T   +  + +R KE 
Sbjct: 512 YASLIWAALELHQATLNNDYLQQALNWTEMAVDKFWDSDNHGFYFTEA-NTDLAVRAKEI 570

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
           +DGA PSGN+V   NL  L  +   S+   ++      +A F  +L        L+  A 
Sbjct: 571 YDGAIPSGNAVMARNLAFLYRLTGESR---WQTKFNKLIAAFAPQLNRYPAGYTLLLTAV 627

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
           D+++ P  +H++  G   +   E++L      Y  N   + ++  D  +       N+  
Sbjct: 628 DLMNSPG-QHLLFSGAGVA---EDILRPLKGKYLPNTLWLAVNDKDRVQGG----KNTAV 679

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVT 701
            +  + +FS ++ V   CQ+ +C  P+T
Sbjct: 680 PASFKLSFSGNEPVLCFCQDSACELPIT 707


>gi|389572654|ref|ZP_10162736.1| yyaL [Bacillus sp. M 2-6]
 gi|388427679|gb|EIL85482.1| yyaL [Bacillus sp. M 2-6]
          Length = 627

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 240/684 (35%), Positives = 361/684 (52%), Gaps = 77/684 (11%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+  Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNVFVTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS---AS 144
           P   GTYFP    YGRPGF   L ++ DA+   RD         IE L+E  + +    +
Sbjct: 61  PFYAGTYFPKRSAYGRPGFIEALTQLLDAYHSDRD--------HIESLAEKATNNLRIKA 112

Query: 145 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
           + +  + L Q ++     QL  S+D+ +GGFGSAPKFP P    M+ +  +  E TG+  
Sbjct: 113 AGQTENTLTQESIHKAYYQLMSSFDTLYGGFGSAPKFPAP---HMLTFLMRYFEWTGQEN 169

Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
                 K    TL  MA GGI+DH+G GF RYS DE+W VPHFEKMLYD   L + Y +A
Sbjct: 170 ALYAVTK----TLNGMANGGIYDHIGSGFTRYSTDEKWLVPHFEKMLYDNALLIDAYTEA 225

Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
           + +T+   Y  + +D++ +++RDM+   G  +SA DADS   EG    KEG +YVWT KE
Sbjct: 226 YQITQHPEYEKLVQDLIQFIKRDMMNRDGSFYSAIDADS---EG----KEGQYYVWTKKE 278

Query: 325 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 383
           +   LG+    LF   Y++   GN +   +  PH       +    +D  A+ S   +  
Sbjct: 279 IMTHLGDDLGTLFCAVYHITEEGNFEGQNI--PH------TISTSFDDIKAAYS---IDD 327

Query: 384 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
           +   + L   R  L  VR +RP P +DDKV+ SWN L+IS+ A+A  +   E        
Sbjct: 328 QTLYSKLQSARNILLTVRQQRPAPLIDDKVLTSWNALMISALAKAGSVFHEE-------- 379

Query: 444 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 503
                   E + +A+ A SF+  HL   Q  RL   +R G  K  GF++DYA +++  + 
Sbjct: 380 --------EAIRMAKQAMSFLETHLV--QQERLMVRYREGDVKHLGFIEDYAHMLTAYMS 429

Query: 504 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 563
           LYE      WL  A  +     ELF D + GG+F +  +  ++++R KE +DGA PSGNS
Sbjct: 430 LYEATFDLDWLTKARAVGENMFELFWDEQIGGFFFSGSDAETLIVREKEVYDGAMPSGNS 489

Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCA--ADMLS-VP 619
            ++  L++L+ ++        RQ+   +L  +F     D++ + P    A    +LS   
Sbjct: 490 TALQQLLKLSRMIG-------RQDWIETLEKMFSAFYVDVS-SYPSGHTAFLQGLLSQYA 541

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           +++ ++++G K     E +L A      L K  +  D   T E     +  +  A  A++
Sbjct: 542 AKREIIILGKKGDPQKEQLLQA------LQKRFMPFDLILTAETG---QELARLAPFAKD 592

Query: 680 NFSA-DKVVALVCQNFSCSPPVTD 702
             +  D     +C+N+SC  P+T+
Sbjct: 593 YKTINDSTTVYICENYSCRQPITN 616


>gi|417761487|ref|ZP_12409496.1| PF03190 family protein [Leptospira interrogans str. 2002000624]
 gi|417772112|ref|ZP_12420002.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|417776397|ref|ZP_12424235.1| PF03190 family protein [Leptospira interrogans str. 2002000621]
 gi|418671976|ref|ZP_13233322.1| PF03190 family protein [Leptospira interrogans str. 2002000623]
 gi|418680449|ref|ZP_13241698.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|418703630|ref|ZP_13264514.1| PF03190 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|400327807|gb|EJO80047.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|409942568|gb|EKN88176.1| PF03190 family protein [Leptospira interrogans str. 2002000624]
 gi|409946069|gb|EKN96083.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|410573764|gb|EKQ36808.1| PF03190 family protein [Leptospira interrogans str. 2002000621]
 gi|410581098|gb|EKQ48913.1| PF03190 family protein [Leptospira interrogans str. 2002000623]
 gi|410766766|gb|EKR37449.1| PF03190 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|455668123|gb|EMF33372.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Fox 32256]
          Length = 691

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|452972836|gb|EME72663.1| hypothetical protein BSONL12_20380 [Bacillus sonorensis L12]
          Length = 627

 Score =  377 bits (969), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 247/695 (35%), Positives = 360/695 (51%), Gaps = 99/695 (14%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE VA+LLN+ FVSIKVDREERPDVD +YMT  Q + G GGWPL+VFL+P+ K
Sbjct: 1   MAHESFEDEEVAQLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLNVFLTPEQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   +Y RPGF  +L+++   + K RD +        E+ +  L   A SN 
Sbjct: 61  PFYAGTYFPKTSRYNRPGFVEVLKQLSATFAKNRDHVEDIA----EKAANNLRIKAKSNA 116

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEA 206
             + L ++ L+   +QL  S+D+ +GGFGSAPKFP P  +  +L YH         SGE 
Sbjct: 117 -GEALGEDILKRTYQQLINSFDTAYGGFGSAPKFPIPHMLTFLLRYHQ-------YSGEE 168

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
           +     V  TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ 
Sbjct: 169 N-ALYSVTKTLDSMANGGIYDHIGYGFARYSTDQEWLVPHFEKMLYDNALLLMAYTEAYQ 227

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
           +TK   Y  I   I+ ++RR+M    G  FSA DAD   TEG     EG +Y+W+  E+ 
Sbjct: 228 VTKRERYKRISEQIIAFIRREMTDERGAFFSALDAD---TEGV----EGKYYIWSKDEIT 280

Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA-SKLGMPLE 384
           + LG E   L+           C +  ++D  N F+G N+   +  S      +  +   
Sbjct: 281 ETLGDELGSLY-----------CAVYDITDEGN-FEGFNIPNLIYTSFEQVRDEFSLTET 328

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
           +  N L   R+KLF+ R  R  PH+DDKV+ SWN L+I+  A+ASK+ ++          
Sbjct: 329 ELQNKLEAARQKLFEKRRGRIYPHVDDKVLTSWNALMIAGLAKASKVFEA---------- 378

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
                  EY+E+A +A SFI   L   +  R+   +R+G  K  GF+DDYAFL+   L+L
Sbjct: 379 ------PEYLEMARTALSFIEDELI--KDGRVMVRYRDGEVKNKGFIDDYAFLLWSYLEL 430

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
           YE       L  A EL     +LF D + GG++ T  +  ++++R KE +DGA PSGN V
Sbjct: 431 YEASLNLPDLRKAKELAGDMIDLFWDEDHGGFYFTGKDAEALIVRDKEVYDGALPSGNGV 490

Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS---- 620
           + + L RL  +                L++ + R+ DM  A        D+ + PS    
Sbjct: 491 AAVQLFRLGRLTG-------------DLSLID-RVSDMFSAF-----HGDVSAYPSGHTN 531

Query: 621 -----------RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE--MDFWE 667
                      +K +V++G +   + +N++ A   ++  N  V+  +  D  +   DF  
Sbjct: 532 FLQSLLSQMMPQKEIVILGKRDDPNRQNIIRALQQAFQPNYAVLAAESPDDFKGIADFAA 591

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           ++ + +          DK    +C+NF+C  P  +
Sbjct: 592 DYKAID----------DKTTVYICENFACQKPTAN 616


>gi|220916114|ref|YP_002491418.1| hypothetical protein A2cp1_1001 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219953968|gb|ACL64352.1| protein of unknown function DUF255 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 718

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 244/632 (38%), Positives = 350/632 (55%), Gaps = 69/632 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +T R  FL    +TCHWCHVME ESFEDE +A++LN+ +V+IKVDREERPD
Sbjct: 64  GDEAFEEARRTGRPVFLSVGYSTCHWCHVMERESFEDEEIARVLNERYVAIKVDREERPD 123

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRP--GFKTILRKVKDA 116
           VD +YMT VQ L G GGWP+SV+L+PD +P  GGTYFPP D    P  GF +IL ++   
Sbjct: 124 VDAIYMTAVQLLTGSGGWPMSVWLTPDREPFFGGTYFPPRDGVRGPARGFLSILHEIAGL 183

Query: 117 WDKKRDML-AQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGG 174
           W++  D + + +GA      +    A  ++ ++P   P ++A+ L    L +S+D R GG
Sbjct: 184 WERDPDRIRSATGALVEAVRTALAPAGPAAAQVPGPEPIEHAVAL----LERSFDERHGG 239

Query: 175 FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 234
              APKFP  V ++++L H +      ++GEA    +M   TL+ MA GG+HD VGGGFH
Sbjct: 240 LRRAPKFPSNVPVRLLLRHHR------RTGEA-RSLRMATVTLERMAAGGLHDQVGGGFH 292

Query: 235 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 294
           RYS D  W VPHFEKMLYD   LA  Y +A+ +T    ++ + R  LDYL R++  P G 
Sbjct: 293 RYSTDAEWLVPHFEKMLYDNALLALAYAEAWQVTGRRDFARVTRQTLDYLLRELTSPEGG 352

Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMS 354
           ++SA DADS   EG    +EG F+ WT  E+ + LG+ A  F   + ++P GN       
Sbjct: 353 LYSATDADS---EG----EEGRFFTWTEAELREALGDRAEAFLRFHGVRPEGN------- 398

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
                F+G++VL            +  P E     L   R  L+ +R +RPRP  D+K++
Sbjct: 399 -----FEGRSVL-----------HVPAPDEDAWEALAPDRAALYALRERRPRPLRDEKIL 442

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
             WNGL IS+ A   + L                    +++ A  AA F+   L  +   
Sbjct: 443 AGWNGLAISALAFGGRALAE----------------PRWVDAAARAADFVLTRLVKDG-- 484

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
           RLQ S+  G +  P +L+D+AFL+ GLLDL+E     +WL  A EL   QD LF D EGG
Sbjct: 485 RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEATFDPRWLAAAAELAGAQDRLFGDPEGG 544

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           G+F +  +   +L R K  HDGAEPSG SV+ +N +RL +  +  +   +R+ A+ +L  
Sbjct: 545 GWFQSATDHERLLAREKPTHDGAEPSGASVAALNALRLEAFTSDPR---WRRAADGALRH 601

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
               L +  +A+  +  A D  S   R+ V++
Sbjct: 602 HARTLAEQPLAMSELLLALDYASDAVREVVLI 633


>gi|197121417|ref|YP_002133368.1| hypothetical protein AnaeK_1004 [Anaeromyxobacter sp. K]
 gi|196171266|gb|ACG72239.1| protein of unknown function DUF255 [Anaeromyxobacter sp. K]
          Length = 718

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 247/633 (39%), Positives = 350/633 (55%), Gaps = 70/633 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +T R  FL    +TCHWCHVME ESFEDE +A++LN+ +V+IKVDREERPD
Sbjct: 64  GDEAFEEARRTGRPVFLSVGYSTCHWCHVMERESFEDEEIARVLNERYVAIKVDREERPD 123

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRP--GFKTILRKVKDA 116
           VD +YMT VQ L G GGWP+SV+L+PD +P  GGTYFPP D    P  GF +IL ++   
Sbjct: 124 VDAIYMTAVQLLTGSGGWPMSVWLTPDREPFFGGTYFPPRDGVRGPARGFLSILHEIAGL 183

Query: 117 WDKKRDML-AQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGG 174
           W++  D + + +GA      +    A  ++ ++P   P ++A+ L    L +S+D R GG
Sbjct: 184 WERDPDRIRSATGALVEAVRTALAPAGPAAAEVPGPEPIEHAVAL----LERSFDERHGG 239

Query: 175 FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 234
              APKFP  V ++++L H +      ++GE     +M   TL+ MA GG+HD VGGGFH
Sbjct: 240 LRRAPKFPSNVPVRLLLRHHR------RTGE-ERSLRMATVTLERMAAGGLHDQVGGGFH 292

Query: 235 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 294
           RYS D  W VPHFEKMLYD   LA  Y +A+ LT    ++ + R  LDYL R++  P G 
Sbjct: 293 RYSTDAEWLVPHFEKMLYDNALLALAYAEAWQLTGRRDFARVTRQTLDYLLRELTSPEGG 352

Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMS 354
           ++SA DADS   EG    +EG F+ WT  E+ + LG+ A  F   + ++P GN       
Sbjct: 353 LYSATDADS---EG----EEGRFFTWTEAELREALGDRAEAFLRFHGVRPEGN------- 398

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
                F+G++VL            +  P E     L   R  L+ +R +RPRP  D+K++
Sbjct: 399 -----FEGRSVL-----------HVPAPDEDAWEALAPDRAALYALRERRPRPLRDEKIL 442

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
             WNGL IS+ A   + L                    +++ A  AA F+   L  +   
Sbjct: 443 AGWNGLAISALAFGGRALAE----------------PRWVDAAARAADFVLTRLVKDG-- 484

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
           RLQ S+  G +  P +L+D+AFL+ GLLDL+E     +WL  A EL   QD LF D EGG
Sbjct: 485 RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEATFDPRWLAAAAELAGAQDRLFGDPEGG 544

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           G+F +  +   +L R K  HDGAEPSG SV+ +N +RL +  +  +   +R+ A+ +L  
Sbjct: 545 GWFQSATDHERLLAREKPTHDGAEPSGASVAALNALRLEAFTSDPR---WRRAADGALRH 601

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
               L +  +A+  +  A D  S   R+ VVLV
Sbjct: 602 HARTLAEQPLAMSELLLALDCASDAVRE-VVLV 633


>gi|448397958|ref|ZP_21569896.1| hypothetical protein C476_03843 [Haloterrigena limicola JCM 13563]
 gi|445672174|gb|ELZ24751.1| hypothetical protein C476_03843 [Haloterrigena limicola JCM 13563]
          Length = 731

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 236/697 (33%), Positives = 347/697 (49%), Gaps = 60/697 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEAESFADEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVSGQGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
            +L+P+ KP   GTYFP E K G+PGF  +  ++ D+W    D         Q    A +
Sbjct: 113 AWLTPEGKPFFIGTYFPREGKRGQPGFLDLCERISDSWASAEDRPEMESRAEQWTDAAKD 172

Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMMLY 192
           +L E  +  A ++          L   A+ + +S D R GGFGS+ PKFP+P  ++++  
Sbjct: 173 RLEETPTEDADTDASAGPPSSEVLETAADAIVRSADRRCGGFGSSGPKFPQPSRLRVLAR 232

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
              + +D     E  E       TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLY
Sbjct: 233 AHDRTDDETAYREVLEE------TLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLY 286

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D  ++   +L  + LT +  Y+ +  D L+++ R++    G  FS  DA S   E   R 
Sbjct: 287 DNAEIPRAFLAGYQLTGENRYAEVVGDTLEFVERELTHDDGGFFSTLDAQSESPETGER- 345

Query: 313 KEGAFYVWTSKEVEDILGEH---AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           KEGAFYVWT  EV D++ EH   A LF + Y +  +GN            F+G++    +
Sbjct: 346 KEGAFYVWTPDEVHDVI-EHEPDAALFCKRYDITESGN------------FEGRSQPNRV 392

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              S  A    +   + L  L   R++LF+ R +RPRP+ D+K++  WNGL+IS++A A+
Sbjct: 393 TPVSELAVGFDLEESEVLKRLDAIRQRLFEAREERPRPNRDEKILAGWNGLMISTYAEAA 452

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L              G D  +Y E A  A  F+R  L+D    RL   ++ G     G
Sbjct: 453 LVL--------------GED--DYAETAVDALEFVRDRLWDADEQRLSRRYKGGDVAIDG 496

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           +L+DYAFL  G LD Y+       L +A+EL    +  F D + G  + T     S++ R
Sbjct: 497 YLEDYAFLARGALDCYQATGEVDHLAFALELARVIEVEFWDADHGTLYFTPASGESLVTR 556

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            +E  D + PS   V+V  L+ L        ++ + + A   L      L+  A+    +
Sbjct: 557 PQELSDQSTPSAAGVAVETLLSLDEFA----TEDFEEIAATVLETHANTLEANALEHATL 612

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
           C AAD L   + +  V     ++ D          S      +  + P   + ++ W + 
Sbjct: 613 CLAADRLESGALEVTV-----AADDLPATWRDRFTSRYFPDRLFALRPPTEDGLEAWLDR 667

Query: 670 ----NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               ++      R     +  +  VC+N +CSPP  D
Sbjct: 668 LDLADAPPIWAGREARDGEPTL-YVCRNRTCSPPTHD 703


>gi|418695562|ref|ZP_13256581.1| PF03190 family protein [Leptospira kirschneri str. H1]
 gi|409956647|gb|EKO15569.1| PF03190 family protein [Leptospira kirschneri str. H1]
          Length = 711

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 246/696 (35%), Positives = 360/696 (51%), Gaps = 68/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 78  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNL 137

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 138 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 197

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 198 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 255

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 256 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 308

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG 
Sbjct: 309 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 361

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    +   
Sbjct: 362 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 405

Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           S       K+L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +        
Sbjct: 406 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 459

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     +   R++++++AE   SFI ++L D +  R+   FR G S+  G+ +DYA
Sbjct: 460 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESRILGYSNDYA 508

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +
Sbjct: 509 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 566

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DG EPS NS    +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A  
Sbjct: 567 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSSALSYPFLLSAYW 624

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
                SR+ V++   K+S    ++LA   + +  +     ++  + EE           +
Sbjct: 625 SYKHHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 675

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           S+  +  S    +  VC+NFSC  P+ +   LE  +
Sbjct: 676 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 711


>gi|124504310|gb|AAI28719.1| Spata20 protein [Rattus norvegicus]
          Length = 550

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 199/487 (40%), Positives = 288/487 (59%), Gaps = 46/487 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL    +TCHWCH+ME ESF++E +  LLN+ FVS+ VDREERPD
Sbjct: 90  GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNENFVSVMVDREERPD 149

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L ++ D W 
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 209

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265

Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
             APKFP PV +  +  +  S ++   G     S  Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRVTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D +WH+PHFEKMLYDQ QL+ VY  AF ++ D F+S + + IL Y+ R++    G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAFQISGDEFFSDVAKGILQYVTRNLSHRSG 380

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
             +SAEDADS    G  + +EGA Y+WT KEV+ +L E             L  +HY L 
Sbjct: 381 GFYSAEDADSPPERG-VKPQEGALYLWTVKEVQQLLPEPVGGASEPLTSGQLLMKHYGLS 439

Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
             GN + ++  D + E  G+NVL        +A++ G+ +E    +L     KLF  R  
Sbjct: 440 EAGNINPTQ--DVNGEMHGQNVLTVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 497

Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
           R + HLD+K++ +WNGL++S FA A  +L  E                + +  A + A F
Sbjct: 498 RLKAHLDNKMLAAWNGLMVSGFAVAGSVLGME----------------KLVTQATNGAKF 541

Query: 464 IRRHLYD 470
           ++RH++D
Sbjct: 542 LKRHMFD 548


>gi|448345120|ref|ZP_21534020.1| hypothetical protein C485_05016, partial [Natrinema altunense JCM
           12890]
 gi|445636069|gb|ELY89233.1| hypothetical protein C485_05016, partial [Natrinema altunense JCM
           12890]
          Length = 589

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 212/568 (37%), Positives = 315/568 (55%), Gaps = 46/568 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF+DE VA+++N+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEEESFQDEAVAEVINENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
            +L+P+ KP   GTYFP E + G+PGF+ + +++ D+W+   D         Q    A +
Sbjct: 113 AWLTPEGKPFFIGTYFPREGQRGQPGFRDLCQRISDSWESDADREEMENRAQQWTDAATD 172

Query: 134 QLSEALSASASSN-KLPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMML 191
           +L E   A+  S  + P+    + L   A+ + +S D  +GGFGS+ PKFP+P  ++++ 
Sbjct: 173 RLEETPDAAGGSPVEAPEPPSSDVLETAADAVVQSADREYGGFGSSGPKFPQPSRLRVL- 231

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
             ++  + TG+     E +++   TL  MA GG+ DHVGGGFHRY VD  W VPHFEKML
Sbjct: 232 --ARTYDRTGR----EEYREVFEETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKML 285

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD  ++   +L  + LT +  Y+ +  D L ++ R++    G  FS  DA S   E   R
Sbjct: 286 YDNAEIPRAFLSGYQLTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSDSPETGER 345

Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            +EGAFYVWT  EV D+L +   A LF   Y +   GN            F+G+N    +
Sbjct: 346 -EEGAFYVWTPDEVHDVLEDETDAALFCARYDITEAGN------------FEGRNQPNRV 392

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              S  A++  +   + L  L   R++LF+ R +RPRP+ D+K++  WNGL+IS++A A+
Sbjct: 393 ARVSELAAQFDLADHEILKRLESARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAA 452

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L              G+D  +Y + A  A  F+R  L+DE   RL   +++G  K  G
Sbjct: 453 LVL--------------GAD--DYADTAVDALGFVRDELWDEDEQRLSRRYKDGDVKIDG 496

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           +L+DYAFL  G LD Y+       L +A+EL    +  F D + G  + T     +++ R
Sbjct: 497 YLEDYAFLARGALDCYQATGEVDHLAFALELARVIEAEFWDADSGTLYFTPESGEALVTR 556

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVA 577
            +E  D + PS   V+V  L+ L    A
Sbjct: 557 PQELGDQSTPSATGVAVETLLALDEFAA 584


>gi|45658527|ref|YP_002613.1| hypothetical protein LIC12692 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|45601770|gb|AAS71250.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
          Length = 716

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 80  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 139

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 140 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 199

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 200 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 257

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 258 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 310

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 311 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 363

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 364 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 411

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 412 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 461

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 462 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 511

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 512 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYD 569

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A   
Sbjct: 570 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 624

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 625 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 674

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 675 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 713


>gi|418741789|ref|ZP_13298163.1| PF03190 family protein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
 gi|410751237|gb|EKR08216.1| PF03190 family protein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
          Length = 688

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 245/696 (35%), Positives = 360/696 (51%), Gaps = 68/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ G+ + L ++ + +   GN            F+GKN+L E    +   
Sbjct: 339 FYIWDLEEFREVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 382

Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           S       K+L+ +L   + KL + RSKR RP  DDK++ SWNGL I +  +        
Sbjct: 383 SNFTEEESKHLDGVLTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 436

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA
Sbjct: 437 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 485

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +
Sbjct: 486 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 543

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DG EPS NS    +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A  
Sbjct: 544 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYW 601

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
                SR+ V++   K+S    ++LA   + +  +     ++  + EE           +
Sbjct: 602 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 652

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           S+  +  S    +  VC+NFSC  P+ +   LE  +
Sbjct: 653 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 688


>gi|421085457|ref|ZP_15546310.1| PF03190 family protein [Leptospira santarosai str. HAI1594]
 gi|421103567|ref|ZP_15564164.1| PF03190 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410366530|gb|EKP21921.1| PF03190 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410432093|gb|EKP76451.1| PF03190 family protein [Leptospira santarosai str. HAI1594]
          Length = 691

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYD 544

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A   
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688


>gi|448301393|ref|ZP_21491386.1| hypothetical protein C496_17562 [Natronorubrum tibetense GA33]
 gi|445584129|gb|ELY38453.1| hypothetical protein C496_17562 [Natronorubrum tibetense GA33]
          Length = 788

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 231/689 (33%), Positives = 347/689 (50%), Gaps = 51/689 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA LLN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 116 SACHWCHVMEDESFADEEVADLLNENFVPIKVDREERPDVDSIYMTVAQLVTGRGGWPLS 175

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            +L+P  KP   GTYFP E K G+PGF  +L ++ ++W++ RD +        +   + L
Sbjct: 176 AWLTPQGKPFYVGTYFPKEAKRGQPGFLDVLEQLANSWEQDRDEVENRAQQWTDAAKDRL 235

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
             +  S    +      L   A+   +S D + GGFGS  PKFP+P  + ++   ++  +
Sbjct: 236 EETPDSVAQAEPPSSEVLTTAADAALRSADRQHGGFGSGGPKFPQPSRLHVL---ARAYD 292

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+     + ++++  +L  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++ 
Sbjct: 293 RTGR----EQFREVLEESLDAMAAGGLYDHVGGGFHRYCVDADWTVPHFEKMLYDNAEIP 348

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             +L  + LT D  Y+ +  + L+++ R++    G  FS  DA S   +G   K+EG FY
Sbjct: 349 RAFLAGYQLTGDDRYAEVTAETLEFVDRELTHEEGGFFSTLDAQSKTEDG--EKEEGVFY 406

Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT  E+ ++L E   A LF   Y +  +GN            F+G N    +      A
Sbjct: 407 VWTPDEISEVLEEETDAELFCARYDITESGN------------FEGTNQPNRVRSIPDLA 454

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +  +  +     L   R+ LF+ R +RPRP+ D+KV+ SWNGL+I++ A A+ +L    
Sbjct: 455 DEFDLAEDDTEQRLESARKALFEARERRPRPNRDEKVLASWNGLLINTCAEAALVL---- 510

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                     G D  EY E+   A  F+R  L+D    RL   +++G  K  G+L+DYAF
Sbjct: 511 ----------GED--EYAEMGVDALDFVRERLWDADEGRLARRYKDGDVKVDGYLEDYAF 558

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L  G L  YE       L +A++L  T +  F D E G  + T     S++ R +E  D 
Sbjct: 559 LARGALRCYEATGDVDHLAFALDLARTIEAEFWDEERGTLYFTPESGESLVTRPQELDDQ 618

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           + PS   V++  L+ L    A      + + A   L     R++  ++    +C AAD L
Sbjct: 619 STPSATGVALETLLALDGFAADEN---FEKIASTVLETHANRIEANSLQHASLCLAADRL 675

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNN 673
              + + + +   +    + +  AA +        +  + P   E ++ W E        
Sbjct: 676 EAGALE-ITIAADELPAAWRDRFAAEYRP----DRLFALRPPTAEGLESWLEQLGLEEAP 730

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
           A  A       +    VC++ +CSPP  D
Sbjct: 731 AIWAGREARDGEPTLYVCRDRTCSPPTHD 759


>gi|456984461|gb|EMG20516.1| PF03190 family protein [Leptospira interrogans serovar Copenhageni
           str. LT2050]
          Length = 699

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 63  TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG 
Sbjct: 294 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 346

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    S   
Sbjct: 347 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 394

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +         
Sbjct: 395 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 444

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                    +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA 
Sbjct: 445 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 494

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +D
Sbjct: 495 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYD 552

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPS NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A   
Sbjct: 553 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 607

Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
               S KH    +VL+  K+S + ++MLA   + +  +  +  ++  + EE         
Sbjct: 608 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 657

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+  +  S    +  VC+NFSC  PV +   LE  +
Sbjct: 658 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 696


>gi|418686893|ref|ZP_13248057.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
 gi|410738600|gb|EKQ83334.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
          Length = 713

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 245/696 (35%), Positives = 360/696 (51%), Gaps = 68/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 80  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 139

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 140 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 199

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 200 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 257

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 258 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 310

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG 
Sbjct: 311 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 363

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ G+ + L ++ + +   GN            F+GKN+L E    +   
Sbjct: 364 FYIWDLEEFREVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 407

Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           S       K+L+ +L   + KL + RSKR RP  DDK++ SWNGL I +  +        
Sbjct: 408 SNFTEEESKHLDGVLTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 461

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA
Sbjct: 462 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 510

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +
Sbjct: 511 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 568

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DG EPS NS    +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A  
Sbjct: 569 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYW 626

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
                SR+ V++   K+S    ++LA   + +  +     ++  + EE           +
Sbjct: 627 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 677

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           S+  +  S    +  VC+NFSC  P+ +   LE  +
Sbjct: 678 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 713


>gi|398339915|ref|ZP_10524618.1| hypothetical protein LkirsB1_10954 [Leptospira kirschneri serovar
           Bim str. 1051]
          Length = 696

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 246/696 (35%), Positives = 358/696 (51%), Gaps = 68/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 63  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG 
Sbjct: 294 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 346

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    +   
Sbjct: 347 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 390

Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           S       K+L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +        
Sbjct: 391 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 444

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA
Sbjct: 445 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 493

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +
Sbjct: 494 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 551

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DG EPS NS    +LV+L+ +  G  SD YR+ AE     F   L   A+  P +  A  
Sbjct: 552 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALNYPFLLSAYW 609

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
                SR+ V++   K+S    ++LA   + +  +     ++  + EE           +
Sbjct: 610 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 660

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           S+  +  S    +  VC+NFSC  P+ +   LE  +
Sbjct: 661 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 696


>gi|335427892|ref|ZP_08554812.1| hypothetical protein HLPCO_03015 [Haloplasma contractile SSD-17B]
 gi|334893818|gb|EGM32027.1| hypothetical protein HLPCO_03015 [Haloplasma contractile SSD-17B]
          Length = 682

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 227/686 (33%), Positives = 352/686 (51%), Gaps = 70/686 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE +++LLN  F+SIKVDREERPD+D +YM   QAL G GGWPL+
Sbjct: 53  STCHWCHVMERESFEDEEISELLNKDFISIKVDREERPDIDHIYMEVCQALTGRGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++ D KP   GTYFP      + G   +L  +   W   +D +  S     + L++  
Sbjct: 113 IVMTADKKPFYAGTYFPKTTVGKQLGLTQLLPTITKQWKSNKDKILDSATEIYDVLNKYR 172

Query: 140 SASAS-SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
               S   KL  ++ +N  +     L  ++D+ +GGFG+APKFP P  +  +L++     
Sbjct: 173 EEQESVRGKLSLDVVENLFK----NLRGAFDNLYGGFGTAPKFPSPHNLLFLLHY----- 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
             G      +   MV  TL+ M KGGI+DH+G GF RYSVD +W VPHFEKMLYD   L 
Sbjct: 224 --GYINNNQDAVFMVERTLEQMYKGGIYDHIGYGFSRYSVDRKWLVPHFEKMLYDNALLT 281

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y++A+ L  D  Y  +  + L+Y+ R M    G  ++AEDADS   EG    +EG FY
Sbjct: 282 LAYIEAYQLKNDPLYKQVVEETLEYVSRVMTDKEGGFYTAEDADS---EG----EEGKFY 334

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSD-PHNEFKGKNVLIELNDSSASA 376
            +T  E++++L  E A    E+Y +   GN + + + +  H ++      ++L+D     
Sbjct: 335 TFTKNEIKELLDKEDATFIIEYYNISEEGNFERTNILNLIHKDY------LDLDDKERER 388

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
                        L + + +LF+ R KR  PH DDK++ SWN ++I+++ARA ++L ++A
Sbjct: 389 -------------LNKIKERLFNYRDKRVHPHKDDKILTSWNAMMITAYARAGRVLNNDA 435

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y+  A+    FI  HL DE   R+Q  +R+G +K  G++DDYA+
Sbjct: 436 ----------------YINKAKQGVQFISDHLIDENG-RIQARYRDGEAKFKGYIDDYAY 478

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L   L++L+   S   ++  A++L +   ELF D E  G++    +   +L+R KE +DG
Sbjct: 479 LNWALIELFLGTSDQTYIHQALKLTDDMIELFWDDEKDGFYYYGNDSEYLLMRNKEIYDG 538

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGNS++ +N ++L+ I    K   Y + A      F  ++K    +   M       
Sbjct: 539 AIPSGNSIATMNFIKLSEITDEIK---YEKYARKLFDAFAYKVKQSPSSHSYMLNTYLHA 595

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           S P  K V++  H      E     +H    L   +I      + +   + ++   N  +
Sbjct: 596 SHPKTKVVIVGKHDDPKLKEIKRKISHHYLPLGTVLILYKDLVSADDPIFGDYLVENKDI 655

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTD 702
           A            +CQ++SC  P+ D
Sbjct: 656 A----------CYICQDYSCDEPIYD 671


>gi|381206676|ref|ZP_09913747.1| hypothetical protein SclubJA_13745 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 693

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 244/696 (35%), Positives = 363/696 (52%), Gaps = 66/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED   A  LN  FV++KVDREERPD+D+V+M  + AL   GGWPL++
Sbjct: 52  TCHWCHVMERESFEDLETADYLNRNFVAVKVDREERPDIDQVFMDALHALGEQGGWPLNM 111

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F +PD +P  GGTYFPP+  YGR  F+ IL  ++  W +++  + ++     +Q++  L 
Sbjct: 112 FATPDGRPFTGGTYFPPKPMYGRQSFRQILESLRYYWQEEKAKIHETA----DQVTAYLR 167

Query: 141 ASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFG--SAPKFPRPVEIQMML-YHSKK 196
            + +   L + LPQ N +    +   +++DS  GGF      KFP  + +Q++L YH + 
Sbjct: 168 RAPAPQPLDEPLPQWNCVEETVQAYRQAFDSEDGGFALQRPNKFPPSMGLQLLLRYHLRT 227

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                          MV  TL  M  GGI+D VGGG  RYS D RW VPHFEKMLYD   
Sbjct: 228 --------RIPSDLFMVELTLFKMRNGGIYDQVGGGLCRYSTDYRWLVPHFEKMLYDNAL 279

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
            A   L+ F +T + FY  I  DI  Y+ RDM+       SAEDADS   EG     EG 
Sbjct: 280 FAQTSLECFQVTSNPFYREIAEDIFQYVTRDMMAESSAFCSAEDADS---EG----HEGL 332

Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FY+WT+ E +  +  +++     ++ + P GN            F+G+N+L     +   
Sbjct: 333 FYLWTADEFKKTVEDKYSDSLANYWNVTPQGN------------FEGRNILNVSQSTKVF 380

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
             +LG+   ++  I+   R  L DVR++R RP  DDK++VSWN L+ISSFA+A++IL   
Sbjct: 381 GEQLGLEENEWQTIIKSARSNLQDVRAQRIRPLKDDKILVSWNALMISSFAQAARIL--- 437

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        +  EY   A +A +FI  HL + Q  RL   +R+G +K P +L DYA
Sbjct: 438 -------------EHNEYGITANNALAFIEEHLIN-QEGRLLRRYRDGDAKFPAYLSDYA 483

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            L    LD+Y +    ++++ A    N  + LFL+ + G YF T  +   VL+R  + +D
Sbjct: 484 QLGLACLDIYAWNYEPQYVLKAHHWANEINRLFLNPD-GAYFETGFDAEEVLVRKADGYD 542

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           G EPSGN+ + +  ++LAS   GS      ++AE  L  F   L    +    M  A  +
Sbjct: 543 GVEPSGNTSTALLFLKLASFGMGSG---LLRDAERILHSFSPHLHQAGVNFSAMLNAL-I 598

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
            +      +V+ G +S+++ + +L     S+ L + V+   P+D        +  S    
Sbjct: 599 WARKGGTEIVVSGDESNLETKEVLQWLRQSF-LPEVVVAFIPSDD------PDPVSQQIP 651

Query: 676 MARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
           +A    S D +++  VCQ   C  PV D  SL+ L+
Sbjct: 652 IAEGRASLDERLLIHVCQGQLCHAPVQDLPSLKKLI 687


>gi|429193250|ref|YP_007178928.1| thioredoxin domain-containing protein [Natronobacterium gregoryi
           SP2]
 gi|448324467|ref|ZP_21513897.1| hypothetical protein C490_03868 [Natronobacterium gregoryi SP2]
 gi|429137468|gb|AFZ74479.1| thioredoxin domain protein [Natronobacterium gregoryi SP2]
 gi|445618899|gb|ELY72451.1| hypothetical protein C490_03868 [Natronobacterium gregoryi SP2]
          Length = 741

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 241/704 (34%), Positives = 357/704 (50%), Gaps = 64/704 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT    + G GGWPLS
Sbjct: 53  SACHWCHVMEEESFADEAVAEVLNENFVPIKVDREERPDVDSIYMTVCNLVTGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
            +L+P+ KP   GTYFP E K G+PGF  +L  + ++W+  R+ +     Q    A +QL
Sbjct: 113 AWLTPEGKPFYVGTYFPTEAKRGQPGFLDVLENITNSWENDREEVENRADQWTEAARDQL 172

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
            E  +  A S    D    + L   A+   +S D ++GGFGS  PKFP+P  +Q++   +
Sbjct: 173 EE--TPGAPSPGAADPPSSDLLERAADASLRSADRQYGGFGSDGPKFPQPSRLQVL---A 227

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           +  + TG      E ++++  TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD 
Sbjct: 228 RAYDRTGD----EEYRQVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDN 283

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            ++   +L  + LT +  Y+ +  + L ++ R++    G  FS  DA S + E   R +E
Sbjct: 284 AEIPRAFLAGYQLTGEERYAEVVHETLAFVDRELTHEDGGFFSTLDAQSEDPETGER-EE 342

Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           G FYVWT  EV D+L +   A LF  HY +  +GN            F+G N    +   
Sbjct: 343 GTFYVWTPAEVHDVLADETDADLFCAHYDITASGN------------FEGANQPNRVRSI 390

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           +  A +  +   +    L + R++LF+ R KRPRP+ D+KV+  WNGL+I++ A A+  L
Sbjct: 391 ADLAGEFDLAEHEVKQRLEDARQQLFETREKRPRPNRDEKVLAGWNGLMIATCAEAALTL 450

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
             E                 Y E+A  A  F+R  L+D++  RL   ++       G+L+
Sbjct: 451 GEE----------------RYAEMAVDALEFVRDRLWDDEEGRLSRRYKGEDVAIEGYLE 494

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL  G L  YE       L +A+EL    +E F D + G  + T     S++ R +E
Sbjct: 495 DYAFLARGALGCYEATGEVDHLAFALELGRAIEEEFWDADRGTLYFTPESGESLVTRPQE 554

Query: 553 DHDGAEPSGNSVSVINLVRLASIVA--GSKSDY---------YRQNAEHSLAVFETRLKD 601
             D + PS   V+V  L+ L       GSKS           Y + A   L+    RL+ 
Sbjct: 555 LGDQSTPSSAGVAVEILLALEKFAGSEGSKSPRGDGEVADADYEEIAATVLSTHANRLEA 614

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
            ++    +C AAD L   + +  V     ++ +       A A+      ++   P   +
Sbjct: 615 NSLQHATLCLAADHLESGALEVTV-----TADELPEEWREAFATQYFPDRLLARRPTTDD 669

Query: 662 EMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVTD 702
           +++ W +  S  A+    A       +    VC++ +CSPP  D
Sbjct: 670 DLEAWLDRLSLAAAPPIWAGREARDGEPTLYVCRDRTCSPPTHD 713


>gi|421131211|ref|ZP_15591395.1| PF03190 family protein [Leptospira kirschneri str. 2008720114]
 gi|410357462|gb|EKP04717.1| PF03190 family protein [Leptospira kirschneri str. 2008720114]
          Length = 696

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 245/696 (35%), Positives = 359/696 (51%), Gaps = 68/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 63  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG 
Sbjct: 294 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 346

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ G+ + L ++ + +   GN            F+GKN+L E    +   
Sbjct: 347 FYIWDLEEFREVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 390

Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           S       K+L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +        
Sbjct: 391 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 444

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA
Sbjct: 445 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 493

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +
Sbjct: 494 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 551

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DG EPS NS    +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A  
Sbjct: 552 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYW 609

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
                SR+ V++   K+S    ++LA   + +  +     ++  + EE           +
Sbjct: 610 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 660

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           S+  +  S    +  VC+NFSC  P+ +   LE  +
Sbjct: 661 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 696


>gi|386392363|ref|ZP_10077144.1| thioredoxin domain-containing protein [Desulfovibrio sp. U5L]
 gi|385733241|gb|EIG53439.1| thioredoxin domain-containing protein [Desulfovibrio sp. U5L]
          Length = 704

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 251/701 (35%), Positives = 343/701 (48%), Gaps = 67/701 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE +A L+    V++KVDREERPD+D +YMT+ QAL G GGWPL+
Sbjct: 51  STCHWCHVMEHESFEDEDIAALMRATVVAVKVDREERPDLDNLYMTFCQALTGRGGWPLN 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFP E  +GR G + +L++V  AW   R  +  +    ++ + + L
Sbjct: 111 VFLTPDGRPFFAGTYFPKESGFGRTGMRELLQRVHMAWTSNRQAVIGNATQILDAVRDQL 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A  +   +  E  Q  L     +L+ ++D+  GGFG APKFP P  +  +L   ++   
Sbjct: 171 EARDAGEAV--EPGQAQLGAARNELAAAFDTANGGFGGAPKFPSPHNLLFLLREYRR--- 225

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+     +   MV  TL  M +GG+ D +G G HRYS D RW VPHFEKMLYDQ   A 
Sbjct: 226 TGQ----EDNLAMVTATLDAMRRGGVFDQIGLGLHRYSTDARWFVPHFEKMLYDQALTAM 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
              +A+  T D     +  +I +Y+RRD+ GP G  +SAEDADS   EG     EG FYV
Sbjct: 282 AATEAYLATGDAGLRRMAMEIFEYVRRDLTGPDGAFYSAEDADS---EGV----EGRFYV 334

Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  E+  +L G+ A LF + Y + P GN       +   +  G N+       +A A K
Sbjct: 335 WTESEIRAVLPGDEAGLFMDVYGIAPGGNFH----DEATGQATGANIPFLEEPIAAVAGK 390

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G    +    L   R  L   R KR RP  DDKV+   NGL+I++ A+A++        
Sbjct: 391 RGQEPAELAARLERSRELLLAARQKRVRPLCDDKVLTDMNGLMIAALAKAARAF------ 444

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     D +E    A+ A+ F+   +    + RL H  R G +   G LDDYAFL 
Sbjct: 445 ----------DDEELAGRAKRASDFLLGKMLLPDS-RLLHRLRLGEAAVSGMLDDYAFLA 493

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL+LY+      +L  A+ L       F D   GG F T  +  ++LLR K  +D A 
Sbjct: 494 WGLLELYQTVFDPAYLAQAVALAKAMVRHFGD-AAGGLFLTPDDGEALLLRQKTYYDAAI 552

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA--------MAVPLMC 610
           PSGNSV+ + L  L           YR   E S     TRL   A               
Sbjct: 553 PSGNSVAFLVLTTL-----------YRLTGEKSFMEEATRLARAAGPWLAGHPSGFTFFL 601

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
           C    +  PS   V + G   + D + +  A    Y L +  + + PA        E   
Sbjct: 602 CGLSQMLAPS-AEVTIAGDPDAPDTQALARALFERY-LPEVAVVLRPAGG------EPDI 653

Query: 671 SNNASMARNNFS-ADKVVALVCQNFSCSPPVTDPISLENLL 710
              A   R      D+  A VC+  SC PP TDP ++  LL
Sbjct: 654 VALAPFTRFQLPMGDRAAAHVCRAGSCQPPTTDPAAMLALL 694


>gi|302390271|ref|YP_003826092.1| hypothetical protein Toce_1734 [Thermosediminibacter oceani DSM
           16646]
 gi|302200899|gb|ADL08469.1| conserved hypothetical protein [Thermosediminibacter oceani DSM
           16646]
          Length = 670

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 242/687 (35%), Positives = 349/687 (50%), Gaps = 90/687 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE V  +LN ++VSIKVDREE PDVD  YM   QAL G GGWPL+
Sbjct: 56  STCHWCHVMEKESFEDEEVGNILNRYYVSIKVDREEHPDVDNFYMEVCQALTGSGGWPLT 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD  P+   TY P ED YGRPG KT+L K+ + W K R+ L  +G   +  + +  
Sbjct: 116 IIMTPDKHPVFAATYLPKEDSYGRPGLKTVLFKINELWQKDRERLITTGREIVSSIKKLE 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
                      EL    +    E L  SYD ++GGF  APKFP P  +  +L  YH +K 
Sbjct: 176 RTGHG------ELDPGVIDKAFEILKASYDRKYGGFFGAPKFPMPGTLLFLLGYYHYRK- 228

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                     E  +MV  TL+ M KGGI+DH+G G  RYS D RW VPHFEKMLYD   +
Sbjct: 229 --------DPEALEMVENTLKNMYKGGIYDHIGFGLCRYSTDRRWLVPHFEKMLYDNALV 280

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           + V  +A+ + +D F+     +I+DY+ R++  P G  ++AEDADS   EG    +EG F
Sbjct: 281 SFVCAEAYKIARDEFFKTFALEIIDYVLRNLRNPEGGFYTAEDADS---EG----EEGRF 333

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           Y WT +E+  +LG+ A  F E Y +   GN            F+GKN+           +
Sbjct: 334 YTWTPQEIRHVLGDRADEFMESYNITERGN------------FEGKNI----------PN 371

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
            +G  L   ++   + R+KLF+ R +R +P  D+K++VS N L+I+S  R   I K+E  
Sbjct: 372 LIGRDLSCKMD--EDTRKKLFEYREQRVKPFRDEKILVSGNSLMIASLFRVYGITKNE-- 427

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y + AE A +FI  +       RL   +R G  KA    DDY+ L
Sbjct: 428 --------------NYRKEAEVALNFILENARGSDG-RLHVGYREGIMKAKATFDDYSHL 472

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  LL+ YE+   T +L  A  L +   +LF D+E GG++ T  +   +  R K+ +DGA
Sbjct: 473 LWALLEAYEYTLETSYLKKAKSLADEMIDLFYDKEAGGFYLTGSDVDHLPARAKDAYDGA 532

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNS++  +L RL+ ++  S  +   + A +   VF   + +  +       +  + +
Sbjct: 533 VPSGNSMAAFSLARLSRLLFDSGME---ELARNQYRVFARTISENPVYHTFFLYSF-IYA 588

Query: 618 VPSRKHVVLVGHKSSVDFENMLAA---AHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           V     V++ G +  + F N LA     +A +     +  I PA       +E +     
Sbjct: 589 VTGGTEVIIAGERPEM-FTNYLAENFFPYAVWAHADRLKEIVPA-------YENYGKIGG 640

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVT 701
             A          A VC+N SC  PVT
Sbjct: 641 RTA----------AYVCKNGSCKSPVT 657


>gi|150016393|ref|YP_001308647.1| hypothetical protein Cbei_1515 [Clostridium beijerinckii NCIMB
           8052]
 gi|149902858|gb|ABR33691.1| protein of unknown function DUF255 [Clostridium beijerinckii NCIMB
           8052]
          Length = 680

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 250/719 (34%), Positives = 356/719 (49%), Gaps = 83/719 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVM  ESFEDE +A ++ND F++IKVDREERPD
Sbjct: 32  GDEAFAKAKEEDKPIFLSIGYSTCHWCHVMAHESFEDEEIAGIMNDSFIAIKVDREERPD 91

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYMT  QAL G GGWPL+V ++PD KP   GTYFP + KY  PG   IL  +   W 
Sbjct: 92  IDSVYMTVCQALTGHGGWPLTVIMTPDQKPFFAGTYFPKKAKYNMPGLMDILNSINKQWK 151

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             +D L  SG   + +L        S  KL  +  +N       Q+  +++ ++GGFG A
Sbjct: 152 DNKDKLISSGDSILSELGGYFDGETSKLKLTSKTLKNGYN----QILHAFEEKYGGFGDA 207

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  I M L    K     K+ E +E       TL  M +GGI DH+G GF RYS 
Sbjct: 208 PKFPTP-HITMFLLRYYKSHKEIKALEMAEK------TLISMYRGGIFDHIGFGFSRYST 260

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D +W VPHFEKMLYD   L   YL+ + +TK+  Y  +   +L+Y+ R++    G  + A
Sbjct: 261 DNKWLVPHFEKMLYDNALLVISYLEGYEVTKNEIYKEVATKVLEYVFRELTSKNGGFYCA 320

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG    +EG +YV+   E+  +LGE     F +++ +   GN          
Sbjct: 321 EDADS---EG----EEGKYYVFEPLEILSVLGEEDGTYFNDYFDITSDGN---------- 363

Query: 358 NEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
             F+GK++  LI+  +   S  ++ +  E+ L             RS R   H DDK++ 
Sbjct: 364 --FEGKSIPNLIKNKNFHKSDDRIKLLSEQILQ-----------YRSDRTELHKDDKILT 410

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
           SWNGL+I++  +A K+++ E                 Y E A+ A  FI  +L DE   R
Sbjct: 411 SWNGLMIAALGKAYKVIEDE----------------RYFEYAKKAVEFIFNNLMDENK-R 453

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L   +R+  S+   +LDDYAFL  GL++LYE     ++L  AIE+      LF D E  G
Sbjct: 454 LLARYRDKDSRHKAYLDDYAFLCFGLIELYESSYDIEFLNKAIEINKDMINLFWDNEKDG 513

Query: 536 YFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           +F   GED   L+ R KE  DGA PSGNSV+  NL++LA +      +   + AE     
Sbjct: 514 FF-LYGEDSEKLIARPKELFDGAMPSGNSVAAYNLIKLARLTGDLTLE---EMAEKQFDF 569

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
               + +  +       AA      S++ V +   K   +    L +    ++L  T+I 
Sbjct: 570 ICGSVFNEEINHSFFLMAASFALNESQELVCVTNDKGEEEKIKDLLSERPIFNLT-TIIK 628

Query: 655 IDPADTEEMD---FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            D    E  D   F +E++  N          +K    +C+  SC  PV D   L  +L
Sbjct: 629 NDENRNEIEDLAPFLKEYDLIN----------EKSTYYLCKGKSCMAPVNDIDELRKML 677


>gi|435846903|ref|YP_007309153.1| thioredoxin domain protein [Natronococcus occultus SP4]
 gi|433673171|gb|AGB37363.1| thioredoxin domain protein [Natronococcus occultus SP4]
          Length = 732

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 244/702 (34%), Positives = 361/702 (51%), Gaps = 66/702 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEEESFADEEVAEVLNEEFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            +L+P+ KP   GTYFP   K G+PGF  ++  + D+W   R+         IE  +E  
Sbjct: 113 AWLTPEGKPFYVGTYFPKHSKRGQPGFLDLIEGLADSWKTDRE--------EIENRAEEW 164

Query: 140 SASASS--NKLPDEL------PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 190
           +A+A+    + PD +        + L   A+   +S D + GGFGS  PKFP+P  ++++
Sbjct: 165 TAAATDRLEETPDSIGAAEPPSSDVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL 224

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
              ++  + TG+     E ++++  +L  M +GG++DHVGGGFHRY VDE W VPHFEKM
Sbjct: 225 ---ARAYDRTGR----DEYREVLEGSLTAMIEGGLYDHVGGGFHRYCVDEDWTVPHFEKM 277

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD  ++    L  + LT D  Y+   RD L+++ R++    G  FS  DA S E     
Sbjct: 278 LYDNAEIPRALLAGYQLTGDERYADSVRDTLEFVSRELTHAEGGFFSTLDAQS-EDPATG 336

Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
            ++EGAF+VWT  EV ++LG+   A LF   Y +  +GN            F G+N    
Sbjct: 337 EREEGAFFVWTPAEVREVLGDETDAELFCARYDITESGN------------FGGQNQPNV 384

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
           +   S  A +  +  E     L + R +LF+ R +RPRP+ D+KV+ SWNGL+I++ A A
Sbjct: 385 VASISELAERFDLAAETVEQRLEDARAELFEAREERPRPNRDEKVLASWNGLMIATCAEA 444

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
              L              G DR  Y  +A  A  F+R  L+D +  RL   F++G     
Sbjct: 445 GLAL--------------GEDR--YAGMAVDALEFVRDRLWDAEEGRLSRRFKDGDVAVQ 488

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           G+L+DYAFL  G L  YE     + L +A+EL    +  F D E    + T     S++ 
Sbjct: 489 GYLEDYAFLARGALGCYEATGEVEHLAFALELARVIEAEFYDAERETIYFTPESGESLVT 548

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA-----EHSLAVFET---RLK 600
           R +E +D + PS   V+V  L+ L    AG  S   R++      E + +V  T   RL+
Sbjct: 549 RPQELNDQSTPSATGVAVETLLALDGF-AGEGSTSPREDGDAEFEEIAASVLRTHAGRLE 607

Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
             A+    +C AAD L   + + V +   +   ++    A+ +    L       +   +
Sbjct: 608 SNALQHATLCLAADRLESGALE-VTVAADEVPAEWRAAFASRYLPDRLFAPRPPTEDGLS 666

Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           E +D  E  ++      R     +  +  VC+N +CSPP  D
Sbjct: 667 EWLDELELESAPTIWAGREARDGEPTL-YVCRNRTCSPPTHD 707


>gi|239608009|gb|EEQ84996.1| DUF255 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 823

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 235/621 (37%), Positives = 331/621 (53%), Gaps = 61/621 (9%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CHVME ESF    VA +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+VFL+P
Sbjct: 66  CHVMEKESFMSPEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTP 125

Query: 85  DLKPLMGGTYFPPEDKYGRPG--------FKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           DL+P+ GGTY+P       P         F  IL K++D W  ++    +S     +QL 
Sbjct: 126 DLEPVFGGTYWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQTQQLRCRESAKDITKQLR 185

Query: 137 E-ALSASASSNKLPDELPQNALRLCA---EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
           E A   + S  K  D      + L     +  +  +D   GGF  APKF  P  +  ++ 
Sbjct: 186 EFAEEGTHSKQKAADADEDLEVELLEESYQHFASRFDPVNGGFSRAPKFATPANLSFLIN 245

Query: 193 HSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
            S+    + D     E S   +M   TL  M++GGIHD +G GF RYSV   W +PHFEK
Sbjct: 246 LSRYPSAVSDIVGYDECSRALEMATKTLISMSRGGIHDQIGHGFARYSVTADWSLPHFEK 305

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEG 308
           MLYDQ QL NVY+DAF    +        DI  Y+    ++ P G  +S+EDADS  T  
Sbjct: 306 MLYDQAQLLNVYVDAFDSAHNPELLGAIYDIATYITSPPILSPTGGFYSSEDADSLPTPS 365

Query: 309 ATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
            T K+EGAFYVWT KE + ILG+  A +   H+ + P GN  ++R +DPH+EF  +NVL 
Sbjct: 366 DTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGN--VARGNDPHDEFINQNVLS 423

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFA 426
                +  A + G+  E+ + I+   R KL + R SKR RP LDDK+IVSWNGL I + A
Sbjct: 424 IKVTPAKLAKEFGLSEEEVVKIIKASREKLREYRESKRVRPGLDDKIIVSWNGLAIGALA 483

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-S 485
           + S +L++          V  +  +E+   AE+AA FIR++L+D  + +L   +R+G   
Sbjct: 484 KCSVVLEN----------VDRAKAQEFRLAAENAAKFIRQNLFDPASGQLWRIYRDGERG 533

Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-------------- 531
             PGF DDY++L SGL+DLYE      +L +A +LQ   +  FL +              
Sbjct: 534 DTPGFADDYSYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYFLAQGPTPTPSPRTSITT 593

Query: 532 -------EGGGYFNT------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
                     GY+ T          P+ L R+K   D + PS N V   NL+RL++++  
Sbjct: 594 ESTPAPSSSTGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSPNGVIAQNLLRLSTLL-- 651

Query: 579 SKSDYYRQNAEHSLAVFETRL 599
            + D Y++ A  ++  F   +
Sbjct: 652 -EDDTYKRLARETVNAFAVEI 671


>gi|328950404|ref|YP_004367739.1| hypothetical protein Marky_0883 [Marinithermus hydrothermalis DSM
           14884]
 gi|328450728|gb|AEB11629.1| protein of unknown function DUF255 [Marinithermus hydrothermalis
           DSM 14884]
          Length = 667

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 229/577 (39%), Positives = 311/577 (53%), Gaps = 57/577 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVM  ESFED  VA+LLN  FV +KVDREERPD
Sbjct: 27  GEEAFARAQQEGKPIFLSVGYATCHWCHVMARESFEDPEVARLLNAHFVPVKVDREERPD 86

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD  YM  +QAL G GGWP+S+FL+P+ KP  GGTYFPP D+YG P F+ +L  V +AW 
Sbjct: 87  VDHAYMQALQALTGQGGWPMSLFLTPEGKPFYGGTYFPPTDRYGLPSFRRVLEAVAEAWT 146

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K+R+ +    A   +++++AL  +     LP +L   AL    E   +++D + GGFG A
Sbjct: 147 KRRNEIETHAAALAQRIAQAL--TNRPGDLPPQLHAKAL----EAYRQAFDPQHGGFGGA 200

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP    ++ +L  +         GEA+ G+ M+  TL  M  GG++D VGGGFHRY+V
Sbjct: 201 PKFPNAPALRYLLLQAWL-------GEAAAGE-MLRVTLDRMQAGGVYDQVGGGFHRYAV 252

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W VPHFEKMLYD  QLA VYL AF L  D  Y    R+ LDYL R+M    G  ++A
Sbjct: 253 DAVWRVPHFEKMLYDNAQLARVYLGAFRLFGDARYRRTARETLDYLLREMQDAAGGFYAA 312

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           +D   AE+EG    +EG +YVW   E+  +LG        ++ +   GN           
Sbjct: 313 QD---AESEG----EEGRYYVWRIPELRAVLGADFEAAARYFGVSDAGN----------- 354

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
            ++GKN+L         A +LG+    +   L   + +L + R +R RP  DDK++  WN
Sbjct: 355 -WEGKNILEARYPEPLLAQELGLDAAGFEAWLASVKARLLEARLRRVRPLTDDKILADWN 413

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL +++FA A + L              G  R  Y+E A   A F+   LY  Q   L+H
Sbjct: 414 GLALAAFAEAGRWL--------------GEAR--YLEAARKNAEFVLGALY--QDGLLRH 455

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
           ++R G      +L D A    GLL L+E     +WL  A  L     E F D E GG+F+
Sbjct: 456 AWRRGRLGRHAYLSDQAHYGLGLLALFEATGEMRWLEAARVLAEGILEHFRDPE-GGFFD 514

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASI 575
               +P  L R K+  DGA PSGN+ +   LVRLA +
Sbjct: 515 ALEANP--LGRPKDVFDGAWPSGNAAAAELLVRLARL 549


>gi|403389033|ref|ZP_10931090.1| hypothetical protein CJC12_14629 [Clostridium sp. JC122]
          Length = 593

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 219/593 (36%), Positives = 325/593 (54%), Gaps = 60/593 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  E FED+ VAK+LND F+SIKVDREERPDVD +YMT  QA  GGGGWPL++
Sbjct: 55  TCHWCHVMAHECFEDDEVAKILNDNFISIKVDREERPDVDSIYMTVCQAFTGGGGWPLNL 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++PD KP   GTYFP   KY  PGF  IL  + D W   ++ +  +    I QL  A  
Sbjct: 115 FITPDQKPFYAGTYFPKHAKYNVPGFMDILSSISDQWKSDKERIIDASEEVINQLENAFQ 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            + + +++  ++ +     C E     +D   GGF  APKFP P ++  +L +  KLE+ 
Sbjct: 175 PTTTDDEIGKDIIEGGYLWCLE----FFDVVNGGFDKAPKFPTPHKLMFLLKYY-KLENE 229

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            K+ E      MV  TL  M +GGI DH+G GF RYS D++W VPHFEKMLYD   L   
Sbjct: 230 PKALE------MVEKTLNQMYRGGIFDHIGYGFSRYSTDDKWLVPHFEKMLYDNALLTMA 283

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL+ +S+TK  FY  +    +DY+ R++    G  + A+DADS   EG     EG FYV+
Sbjct: 284 YLETYSITKKEFYKNVAIKTMDYVLRELTSDEGGFYCAQDADS---EG----DEGKFYVF 336

Query: 321 TSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
              E+ ++LGE     F  ++ +  +GN            F+GK++   L ++S      
Sbjct: 337 NPLEICEVLGEDDGKYFNNYFDITTSGN------------FEGKSIANLLKNNSFENDD- 383

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
               EK    + + R+K+F+ R +R   H D+K++ SWN L+I++FA+A  ILK E    
Sbjct: 384 ----EK----INDLRKKVFNYRLERTTLHKDEKILTSWNALMITAFAKAYSILKDE---- 431

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       +Y++V + A +FI  +L + + +RL   +++G      +L+DYAFLI 
Sbjct: 432 ------------KYLKVCKDAIAFIENNLVN-KDNRLLARYKDGDVAYFSYLEDYAFLIW 478

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             ++LYE  +  ++L  AI L +   + F D    G+F    +   ++ R KE +DGA P
Sbjct: 479 SFIELYEGTNEKEYLEKAISLNSEMIDKFWDENSSGFFLYGKDSEKLIARPKEIYDGAIP 538

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
           SGNSV+   LV+L+ I   +K    +    + L  F + +K+  ++  +   A
Sbjct: 539 SGNSVAAYVLVKLSKI---TKDKILKDITYNQLKYFSSTVKNSPISYTMYLIA 588


>gi|410462713|ref|ZP_11316275.1| thioredoxin domain containing protein [Desulfovibrio magneticus
           str. Maddingley MBC34]
 gi|409984165|gb|EKO40492.1| thioredoxin domain containing protein [Desulfovibrio magneticus
           str. Maddingley MBC34]
          Length = 697

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 251/694 (36%), Positives = 353/694 (50%), Gaps = 53/694 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE +A L+N   VSIKVDREERPD+D +YM+   AL G GGWPL+
Sbjct: 52  STCHWCHVMERESFEDEDIAALMNAVAVSIKVDREERPDLDTLYMSVCHALTGRGGWPLT 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFP E  YGR G + +L++V  +W   R  +  +    ++ + E L
Sbjct: 112 VFLTPDKEPFFAGTYFPKESAYGRTGLRELLQRVHMSWKGNRQAVVNNAGQIMDAVREQL 171

Query: 140 SASASSNKL-PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +A+A +    P E   +A R    QLS  +D+R GGFG APKFP P  +  +L   +   
Sbjct: 172 TAAAGAASAEPGEAVLDAAR---AQLSGIFDARNGGFGGAPKFPSPHNLLFLLREYR--- 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
              ++G+AS  + MV  TL  M +GG++DHVG G HRY+ D +W +PHFEKMLYDQ    
Sbjct: 226 ---RTGDAS-CRDMVCRTLDAMRRGGVYDHVGFGLHRYATDAQWFLPHFEKMLYDQALTV 281

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
              ++A+  + D  +  +  +IL+Y+RRD+  P G   SAEDADS   EG     EG FY
Sbjct: 282 MACVEAYQASGDAAHKTMALEILEYVRRDLTSPEGLFHSAEDADS---EGV----EGKFY 334

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VW++ E+  +LG+ A L          GN       +   E  G N+L        +A++
Sbjct: 335 VWSAAELRRLLGDEAALVMAAMGATEEGNAH----DEATGETTGSNILHLPRPLDETAAQ 390

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG+ +E     L ECRR L   R KR RP  DDKV+   NGL++++ A+A++    E  +
Sbjct: 391 LGLTVEALTTRLEECRRILLVEREKRVRPLCDDKVLTDNNGLMLAALAKAARAFDDEELA 450

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                          +  AES  + + R        RL H  R+G +   GFLDDY FL 
Sbjct: 451 G------------RAVTAAESLLTRLTR-----PNGRLLHRLRDGEAAIDGFLDDYVFLA 493

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++LY+    T +L  A+ L     + F D   GG+F T  +   +L+R K   D A 
Sbjct: 494 WGLVELYQTVFDTAYLHRAVALLRAVADHFADPAEGGFFVTPDDGEQLLVRQKVFFDAAV 553

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADMLS 617
           PSGNSV+   L  L  +   +    +++ A         RL D A       C  + +L 
Sbjct: 554 PSGNSVAYFVLTTLFRL---TGDPVFKEQATALARAMAPRLADHAAGHAFFLCGLSQVLG 610

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
            PS   V L G  +  D + +  A    Y L +  + + P      D  E      A   
Sbjct: 611 KPS--EVTLAGDPAGPDTQALARAVFGRY-LPEVAVVLRP------DEGEPDIVALAPFT 661

Query: 678 RNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
           R     D +  A VC+  SC P   D  ++  LL
Sbjct: 662 RYQLPLDGRTAAHVCRAGSCQPATADVETMLKLL 695


>gi|383625377|ref|ZP_09949783.1| hypothetical protein HlacAJ_18680 [Halobiforma lacisalsi AJ5]
 gi|448700355|ref|ZP_21699463.1| hypothetical protein C445_15926 [Halobiforma lacisalsi AJ5]
 gi|445779895|gb|EMA30810.1| hypothetical protein C445_15926 [Halobiforma lacisalsi AJ5]
          Length = 746

 Score =  374 bits (960), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 241/699 (34%), Positives = 346/699 (49%), Gaps = 60/699 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA LLND FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 57  SACHWCHVMEEESFADEDVADLLNDHFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLS 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA----FAIEQL 135
            +L+P+ KP   GTYFP E K G+PGF  IL  V D+W+  R+ +          A ++L
Sbjct: 117 AWLTPEGKPFYVGTYFPKESKRGQPGFVDILENVIDSWETDREEIENRAQKWTDAARDEL 176

Query: 136 SEAL------SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQ 188
            E         A+ + +  P     + L   A+   +S D  +GGFGS  PKFP+P  ++
Sbjct: 177 EETPGTGGPGDAAVAESTEPTPPSSDLLETTADAAVRSADRGYGGFGSDGPKFPQPSRLR 236

Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
           ++   S +   TG  GE    ++++  TL  MA GG++DHVGGGFHRY VD  W VPHFE
Sbjct: 237 VLARASDR---TG--GETY--REVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFE 289

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYD  ++   +L  + LT D  Y+ +  + L ++ R++    G  F+  DA S + E 
Sbjct: 290 KMLYDNAEIPRAFLTGYRLTGDDRYAEVVEETLAFVDRELTHDEGGFFATLDAQSEDPET 349

Query: 309 ATRKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
             R +EGAFYVWT  EV D+L +   A LF E Y +  +GN            F+G+N  
Sbjct: 350 GER-EEGAFYVWTPDEVRDVLEDETDAELFCERYDITASGN------------FEGENQP 396

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
             +   +  A    +   +    L + R +LF  R +RPRP+ D+KV+  WNGL+I++ A
Sbjct: 397 NRVRSVADLAESFDLEESEVRERLADARERLFAAREERPRPNRDEKVLAGWNGLMIATCA 456

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
            A+  L              G D  EY  +A  A  F+R  L+D    RL   +++    
Sbjct: 457 EAAMTL--------------GED--EYATMAVDALEFVRERLWDADERRLSRRYKDDDVA 500

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
             G+L+DYAFL  G L  Y+       L +A++L    +  F D E G  + T      +
Sbjct: 501 IDGYLEDYAFLARGALACYQATGDVDHLAFALDLAREIEGEFWDEEAGTLYFTPESGEDL 560

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           + R +E  D + PS   V+V  L+ L S V  +    Y + AE  L     RL+   +  
Sbjct: 561 VTRPQELGDQSTPSAAGVAVETLLALESFVPDAD---YAELAETVLGTHVDRLEGSPLQH 617

Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
             +C  AD L   + + V +   +   ++    A  H        +I   P   + ++ W
Sbjct: 618 ATLCLGADRLESGALE-VTVAAEEVPDEWREAFATGH----YPDRLIARRPPTEDGLEAW 672

Query: 667 EEH---NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
            +           A      D+    VC+  +CSPP  D
Sbjct: 673 LDRLGLEDAPPIWAGREARDDEPTLYVCRGRTCSPPTHD 711


>gi|297566141|ref|YP_003685113.1| hypothetical protein [Meiothermus silvanus DSM 9946]
 gi|296850590|gb|ADH63605.1| protein of unknown function DUF255 [Meiothermus silvanus DSM 9946]
          Length = 665

 Score =  374 bits (960), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 227/583 (38%), Positives = 309/583 (53%), Gaps = 62/583 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED   A+LLN++FV +KVDREE PDVD VYM  +QAL G GGWP+S+
Sbjct: 49  TCHWCHVMERESFEDPETAQLLNEFFVPVKVDREELPDVDHVYMMALQALTGSGGWPMSL 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PDLKP  GGTYFPPED++G P F  +L+ +   W  +R+ +  S     + L + L 
Sbjct: 109 FLTPDLKPFYGGTYFPPEDRHGLPSFARVLKTIASTWQNRREEVLGSADELTQHLHKLL- 167

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                  LP +L   AL+    QL++++D+  GGFG APKFP+   +  +L  + K +  
Sbjct: 168 -VPRGGPLPQDLHAQALK----QLARAHDATHGGFGGAPKFPQAPTLTYLLALAWKGDPL 222

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                      M+  TL  MA+GGI+D VGGGFHRY+VD  W VPHFEKMLYD  QLA V
Sbjct: 223 AWG--------MLELTLDKMAEGGIYDQVGGGFHRYAVDGIWRVPHFEKMLYDNAQLAWV 274

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL    LT    Y  +  + LDYL R+M  P G  +SA+DADS   EG     EG FYVW
Sbjct: 275 YLGMSRLTGKTLYRRVTLETLDYLLREMQHPEGGFYSAQDADS---EGV----EGKFYVW 327

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           + +EV  +LG  A    + + +   GN            ++G NVL       A   +LG
Sbjct: 328 SEQEVRAVLGSDAEAALKLFGVSQAGN------------WEGVNVLEARYPEPALRQELG 375

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +    +   L E + KL+  R +R  P  DDK++  WNGL + +FA A +IL  EA    
Sbjct: 376 LDEATFARWLEEVKAKLYQARRQRIPPLTDDKILADWNGLALRAFAAAGRILGKEA---- 431

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                       Y+E A   A F+   +  +    L+HS+R G  +   +L D A    G
Sbjct: 432 ------------YLEAARKNAEFVTSRMMRDGL--LRHSWRGGKLRPEAYLSDQASYGLG 477

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LL+ Y+     +WL  A  L       F D   GG+F+ +G    + LR K+  DG  P 
Sbjct: 478 LLETYQATGEMRWLEAARTLAEGILTHFRD-PNGGFFDASGG--GLPLRAKDVFDGPYPG 534

Query: 561 GNSVSVINLVRLASI--------VAGSKSDYYRQNAEHSLAVF 595
           GNS +   L+RLA++         A    +++ Q   HS + F
Sbjct: 535 GNSAAAELLIRLAALYEREDWAEAARGAIEFHAQGLAHSPSAF 577


>gi|421092713|ref|ZP_15553445.1| PF03190 family protein [Leptospira borgpetersenii str. 200801926]
 gi|410364564|gb|EKP15585.1| PF03190 family protein [Leptospira borgpetersenii str. 200801926]
 gi|456889958|gb|EMG00828.1| PF03190 family protein [Leptospira borgpetersenii str. 200701203]
          Length = 700

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 252/713 (35%), Positives = 362/713 (50%), Gaps = 71/713 (9%)

Query: 10  TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
           TK R    LI       TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D++Y
Sbjct: 46  TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 105

Query: 64  MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
           M  + A+   GGWPL++FL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W +KR  
Sbjct: 106 MDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKVWSEKRQE 165

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS--APK 180
           L  + +     L ++    A   +    LP ++            YD+ FGGF +    K
Sbjct: 166 LIVASSELSRYLKDSGEGRAIEKQEEGSLPSKDCFNFGFSLYESYYDAEFGGFKTNHVNK 225

Query: 181 FPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           FP  + +  +L YH         S    +  +MV  TL  M +GGI+D VGGG  RYS D
Sbjct: 226 FPPSMGLSFLLRYH--------HSSGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTD 277

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
            RW VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG I SAE
Sbjct: 278 HRWMVPHFEKMLYDNSLFLETLVECSQVSKKISAESFALDVISYLHRDMRIVGGGICSAE 337

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
           DADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN            
Sbjct: 338 DADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTNKGN------------ 378

Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           F+GKN+L E       A+KL     K ++ +L   R KL + RSKR RP  DDK++ SWN
Sbjct: 379 FEGKNILHE--SYGGEATKLSEEEWKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWN 436

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL I + A+A                 +   R++++++AE   SFI R+L D    R+  
Sbjct: 437 GLYIKALAKAG----------------IAFRREDFLKLAEETYSFIERNLIDPDG-RILR 479

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
            FR+G S   G+ +DYA +IS  + L+E G G ++L  A+        LF  R   G F 
Sbjct: 480 RFRDGESGILGYSNDYAEMISSSIVLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 537

Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  YR+ AE   + F  
Sbjct: 538 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTK 595

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            L   +++ P +  A       S K +VL+  K +   +++LAA    +  +     ++ 
Sbjct: 596 ELSTHSLSYPHLLSAYWTYRYHS-KEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNE 653

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            + EE           +++  +  S    +  VC+NFSC  PV++   L+  +
Sbjct: 654 NELEEA-------RKLSALFDSRDSGGNALVYVCENFSCKLPVSNLADLQKWI 699


>gi|397690129|ref|YP_006527383.1| Thioredoxin domain protein [Melioribacter roseus P3M]
 gi|395811621|gb|AFN74370.1| Thioredoxin domain protein [Melioribacter roseus P3M]
          Length = 690

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 241/681 (35%), Positives = 347/681 (50%), Gaps = 72/681 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFEDE VA+LLN  F+SIKVDREERPD+D +YM   Q + G GGWPLS+
Sbjct: 67  TCHWCHVMAHESFEDEEVAELLNKNFISIKVDREERPDIDSIYMASCQLITGRGGWPLSI 126

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD KP   GTYFP    YGR GF  +L ++ D W+K R++L ++       +++   
Sbjct: 127 FLTPDGKPFYAGTYFPKYSYYGRIGFVDLLNRIIDLWNKDRNVLLRTSDEITAAINKHFE 186

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           +SA      D +   A     E L  ++D  +GGFGSAPKFP P  +  +L  +    D 
Sbjct: 187 SSAKE-AFDDSVVDKAF----ETLKLNFDPEYGGFGSAPKFPSPHNLLFLLDRNNPQAD- 240

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     +MV  TL  M KGGI D +G GFHRYS D +W +PHFEKM+YDQ  L   
Sbjct: 241 ----------EMVQKTLTEMRKGGIFDQLGFGFHRYSTDGKWFLPHFEKMIYDQASLIEA 290

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y  AF+ T D  Y+    +I ++++ +M    G  +SA DADS   EG    +EG FY+W
Sbjct: 291 YAYAFAKTGDALYADTINEIYEFIKNEMTSHEGAFYSALDADS---EG----EEGKFYLW 343

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           TS+E+  + G+   + KE +     GN      ++ +    GKN+L           K G
Sbjct: 344 TSEEIRSVAGDDYEIAKEIFNFTDEGN----HRNESNGNSTGKNILFLRKRPDKLYEKYG 399

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
               KY +I    R  L + R KR  P  D+K++  WN +VISS A A  I++++   A 
Sbjct: 400 RS--KYDSI----RINLLEARKKRIPPMRDEKILTDWNAMVISSLANAGSIIENDDMVAW 453

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                           AE A   + +H +      L H   N  +   GFLDDYA+LI  
Sbjct: 454 ----------------AERAYQCLMKHAF--VNGELYHYPENNIT---GFLDDYAYLIKA 492

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LDLY      ++L  A+EL +   E F D+ EGG +FN  G +    +RVK+ +DGA P
Sbjct: 493 ALDLYRATLNEEYLFNALELNDLLSENFEDKSEGGYFFNKAGANT---IRVKDAYDGAVP 549

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNS+ + NL+ L   + G+ S  YR +AE+S+  F + L   ++           L   
Sbjct: 550 SGNSIQLSNLIELY-FITGNNS--YRLSAENSIKTFSSGLNKSSIGYTYFLRGIKKLYSK 606

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
               +++ G K+  +F   L+    + DL    +H+   + E +            +   
Sbjct: 607 DTSLLLIAGKKTGREF---LSRLRKNTDL--YYLHVAEDNVERLI------KRAPWIEIY 655

Query: 680 NFSADKVVALVCQNFSCSPPV 700
              ++K V  +C++F+C  P 
Sbjct: 656 KLDSEKTVYYLCRDFTCGIPT 676


>gi|448318308|ref|ZP_21507834.1| hypothetical protein C492_17600 [Natronococcus jeotgali DSM 18795]
 gi|445599332|gb|ELY53367.1| hypothetical protein C492_17600 [Natronococcus jeotgali DSM 18795]
          Length = 721

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 244/711 (34%), Positives = 355/711 (49%), Gaps = 65/711 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF DE VA+LLN+ FV IKVDREERPDVD +YMT  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFADEEVAELLNEEFVPIKVDREERPDVDSIYMTVCQLVSGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+ KP   GTYFP   K G+PGF  +L  + D+W+  R+         IE  +E  
Sbjct: 113 VWLTPEGKPFYVGTYFPKRSKRGQPGFLDLLEGLADSWETDRE--------EIENRAEEW 164

Query: 140 SASASS--NKLPDEL------PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 190
           +A+A     + PD +          L   A+   +S D + GGFGS  PKFP+P  ++++
Sbjct: 165 TAAARDRLEETPDSIGAAEPPSSEVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL 224

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
              ++  + TG      E ++++  +L  M +GG++DHVGGGFHRY VD  W VPHFEKM
Sbjct: 225 ---ARAFDRTGN----DEYREVLEGSLTAMIEGGLYDHVGGGFHRYCVDADWTVPHFEKM 277

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD  ++    L  + LT D  Y+   R+ L+++ R++    G  FS  DA S + E   
Sbjct: 278 LYDNAEIPRALLAGYRLTGDERYADYVRETLEFVSRELTHAEGGFFSTLDAQSEDPETGE 337

Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
           R +EGAFYVWT  EV D+LG    A LF   Y +  +GN            F+G++    
Sbjct: 338 R-EEGAFYVWTPAEVRDVLGSETDADLFCARYDITESGN------------FEGQSQPNL 384

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
               S  A +  +   +    L   RR+LF+ R +RPRP+ D+KV+  WNGL+I++ A A
Sbjct: 385 AASISELADRFDLEEREVEERLESARRELFEAREERPRPNRDEKVLAGWNGLMIATCAEA 444

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
           +  L              G DR  Y  +A  A  F+R  L++    RL   F++G     
Sbjct: 445 ALAL--------------GEDR--YAGMAVDALEFVRDRLWNADEGRLSRRFKDGDVAVQ 488

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           G+L+DYAFL  G L  YE       L +A+EL    +  F D E G  + T     S++ 
Sbjct: 489 GYLEDYAFLARGALGCYEATGEVDHLAFALELARAIEAEFYDAERGTLYFTPESGESLVT 548

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R +E +D + PS   V+V  L+ L  +    + D + + A   L     RL+  A+    
Sbjct: 549 RPQELNDQSTPSATGVAVETLLALGDVAG--EDDGFEEIATSVLRTHAGRLESNALEHAT 606

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +C AAD L       V +   +    +     + +    L   +    P   + ++ W +
Sbjct: 607 LCLAADRLEA-GPLEVTVAAEEVPAAWRERFGSRY----LPDRLFAPRPPTEDGLESWLD 661

Query: 669 H---NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
                +  A  A       +    VC+N +CSPP  D     + L E  +S
Sbjct: 662 ELGLEAAPAIWAGREARDGEPTLYVCRNRTCSPPTRDVDEALDWLAESEAS 712


>gi|421108799|ref|ZP_15569331.1| PF03190 family protein [Leptospira kirschneri str. H2]
 gi|410006082|gb|EKO59855.1| PF03190 family protein [Leptospira kirschneri str. H2]
          Length = 688

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 245/696 (35%), Positives = 358/696 (51%), Gaps = 68/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM   GG I SAED+DS   EG    +EG 
Sbjct: 286 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDSDS---EG----EEGL 338

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    +   
Sbjct: 339 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 382

Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           S       K+L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +        
Sbjct: 383 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 436

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA
Sbjct: 437 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 485

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +
Sbjct: 486 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 543

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DG EPS NS    +LV+L+ +  G  SD YR+ AE     F   L   A+  P +  A  
Sbjct: 544 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSSALIYPFLLSAYW 601

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
                SR+ V++   K+S    ++LA   + +  +     ++  + EE           +
Sbjct: 602 SYKHHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 652

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           S+  +  S    +  VC+NFSC  P+ +   LE  +
Sbjct: 653 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 688


>gi|407980032|ref|ZP_11160833.1| thioredoxin [Bacillus sp. HYC-10]
 gi|407413294|gb|EKF35013.1| thioredoxin [Bacillus sp. HYC-10]
          Length = 627

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 240/684 (35%), Positives = 357/684 (52%), Gaps = 77/684 (11%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+  Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNVFVTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS---AS 144
           P   GTYFP    YGRPGF   L ++ DA+   RD         IE L+E  + +    +
Sbjct: 61  PFYAGTYFPKRSAYGRPGFIEALTQLLDAYHNDRD--------HIESLAEKATNNLRIKA 112

Query: 145 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
           + +  + L Q ++     QL  S+D+ +GGFGSAPKFP P    M+ +  +  E TG+  
Sbjct: 113 AGQTENTLTQESIHKAYYQLMSSFDTLYGGFGSAPKFPAP---HMLSFLMRYFEWTGQEN 169

Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
                 K    TL  MA GGI+DH+G GF RYS DE+W VPHFEKMLYD   L + Y +A
Sbjct: 170 ALYAVTK----TLNGMANGGIYDHIGSGFTRYSTDEKWLVPHFEKMLYDNALLIDAYTEA 225

Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
           + +T+   Y  + +D++ +++RDM+   G  +SA DADS   EG    KEG +YVWT +E
Sbjct: 226 YQITQHPEYEKLVQDLIQFIKRDMMNRDGSFYSAIDADS---EG----KEGQYYVWTKEE 278

Query: 325 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 383
           +   LG+    LF   Y++   GN +   +  PH       +    +D  A+ S     L
Sbjct: 279 IMTHLGDDLGTLFCAVYHITEEGNFEGQNI--PH------TISTSFDDIKAAYSIDDKTL 330

Query: 384 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
              L      R  L  VR +RP P +DDKV+ SWN L+IS+ A+A  +   E        
Sbjct: 331 HSKLQ---SARHILLTVRQQRPAPLIDDKVLTSWNALMISALAKAGSVFHVE-------- 379

Query: 444 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 503
                   E + +A+ A SF+  HL   Q  RL   +R G  K  GF++DYA +++  + 
Sbjct: 380 --------EAIRMAKQAMSFLETHLV--QQERLMVRYREGDVKHLGFIEDYAHMLTAYMS 429

Query: 504 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 563
           LYE      WL  A        ELF D + GG+F +  +  ++++R KE +DGA PSGNS
Sbjct: 430 LYEATFDLDWLTKARAAAENMFELFWDEQIGGFFFSGSDAEALIVREKEVYDGAMPSGNS 489

Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCA--ADMLS-VP 619
            ++  L++L+ ++        RQ+   +L  +F     D++ + P    A    +LS   
Sbjct: 490 TALQKLLKLSRMIG-------RQDWIETLEKMFSAFYVDVS-SYPSGHTAFLQGLLSQYA 541

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
            ++ ++++G K     E +L A      L K  +  D   T E     +  +  A  A++
Sbjct: 542 VKREIIILGEKGDPQKEQLLQA------LQKRFMPFDLILTAETG---QELARLAPFAKD 592

Query: 680 NFSA-DKVVALVCQNFSCSPPVTD 702
             +  D     +C+N+SC  P+T+
Sbjct: 593 YKTINDSTTVYICENYSCRQPITN 616


>gi|261200020|ref|XP_002626411.1| DUF255 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239594619|gb|EEQ77200.1| DUF255 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 823

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 234/621 (37%), Positives = 331/621 (53%), Gaps = 61/621 (9%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CHVME ESF    VA +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+VFL+P
Sbjct: 66  CHVMEKESFMSPEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTP 125

Query: 85  DLKPLMGGTYFPPEDKYGRPG--------FKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           DL+P+ GGTY+P       P         F  IL K++D W  ++    +S     +QL 
Sbjct: 126 DLEPVFGGTYWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQTQQLRCRESAKDITKQLR 185

Query: 137 E-ALSASASSNKLPDELPQNALRLCA---EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
           E A   + S  K  D      + L     +  +  +D   GGF  APKF  P  +  ++ 
Sbjct: 186 EFAEEGTHSKQKAADADEDLEVELLEESYQHFASRFDPVNGGFSRAPKFATPANLSFLIN 245

Query: 193 HSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
            S+    + D     E +   +M   TL  M++GGIHD +G GF RYSV   W +PHFEK
Sbjct: 246 LSRYPSAVSDIVGYDECARALEMATKTLIYMSRGGIHDQIGHGFARYSVTADWSLPHFEK 305

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEG 308
           MLYDQ QL NVY+DAF    +        DI  Y+    ++ P G  +S+EDADS  T  
Sbjct: 306 MLYDQAQLLNVYVDAFDSAHNPELLGAIYDIATYITSPPILSPTGGFYSSEDADSLPTPS 365

Query: 309 ATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
            T K+EGAFYVWT KE + ILG+  A +   H+ + P GN  ++R +DPH+EF  +NVL 
Sbjct: 366 DTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGN--VARGNDPHDEFINQNVLS 423

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFA 426
                +  A + G+  E+ + I+   R KL + R SKR RP LDDK+IVSWNGL I + A
Sbjct: 424 IKVTPAKLAKEFGLSEEEVVKIIKASREKLREYRESKRVRPGLDDKIIVSWNGLAIGALA 483

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-S 485
           + S +L++          V  +  +E+   AE+AA FIR++L+D  + +L   +R+G   
Sbjct: 484 KCSVVLEN----------VDRAKAQEFRLAAENAAKFIRQNLFDPASGQLWRIYRDGERG 533

Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-------------- 531
             PGF DDY++L SGL+DLYE      +L +A +LQ   +  FL +              
Sbjct: 534 DTPGFADDYSYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYFLAQGPTPTPSPRTSTTT 593

Query: 532 -------EGGGYFNT------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
                     GY+ T          P+ L R+K   D + PS N V   NL+RL++++  
Sbjct: 594 ESTPAPSSSTGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSPNGVIAQNLLRLSTLL-- 651

Query: 579 SKSDYYRQNAEHSLAVFETRL 599
            + D Y++ A  ++  F   +
Sbjct: 652 -EDDTYKRLARETVNAFAVEI 671


>gi|15805870|ref|NP_294568.1| hypothetical protein DR_0844 [Deinococcus radiodurans R1]
 gi|6458560|gb|AAF10421.1|AE001938_7 conserved hypothetical protein [Deinococcus radiodurans R1]
          Length = 690

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 241/688 (35%), Positives = 338/688 (49%), Gaps = 67/688 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFE+E  A  +N  FV+IKVDREERPDVD VYM   QAL G GGWP++
Sbjct: 62  STCHWCHVMAHESFENERTAAFMNAHFVNIKVDREERPDVDAVYMAATQALTGQGGWPMT 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPP++  G P F  +L  + D W  +RD    +     + L+E +
Sbjct: 122 VFLTPDAEPFYAGTYFPPQEGMGMPSFMRVLASIDDVWQNRRDQALGNA----QALTEHV 177

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             ++   +   ELP  AL    E  ++ YD++FGGFG APKFP P  +  +L        
Sbjct: 178 RGASQPTRREGELPGGALARAVENAARLYDAQFGGFGRAPKFPAPSTLDFLLTQ------ 231

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   +G++M L TL+ M  GGI+D +GGGFHRYSVD +W VPHFEKMLYD  QL  
Sbjct: 232 -------PQGREMALHTLRMMGAGGIYDQLGGGFHRYSVDAQWLVPHFEKMLYDNAQLVR 284

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
             L A+ LT +  ++ + R+ L YL R+M+ P G  +SA+DAD+    G     EG  + 
Sbjct: 285 TLLRAYQLTGEDDFARLARETLAYLEREMLAPDGGFYSAQDADTPTEHGGV---EGLTFT 341

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKG-KNVLIELNDSSASASK 378
           WT  E+  +LGE A L    + +   GN       DPH    G +NVL       A A +
Sbjct: 342 WTPDEIRAVLGEDADLALRSFNVTAQGN-----FRDPHQPAYGSRNVLHTPTPLPALARE 396

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG   +     L   R KLF  R  RP+PH DDKV+ SWNGLV+++ A A++IL  E   
Sbjct: 397 LG---DDAAQRLQAARAKLFAARQVRPQPHTDDKVLTSWNGLVLAALADAARILGEE--- 450

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                        +Y+++A   A F+ R L       L+H+F++G +   G L+D+A   
Sbjct: 451 -------------KYLDLARRNADFVHREL-RLPGGTLRHTFKDGRASVEGLLEDHALYG 496

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL+ L++ G     L WA EL N     F D   G ++++ G   ++L R     D A 
Sbjct: 497 LGLVALFQAGGDLAHLHWARELWNIVRRDFWDEGAGVFYSSGGHAETLLTRQASFFDSAI 556

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
            S N+ + +  V +      ++++     A  ++  F   L      +  +   A  L  
Sbjct: 557 LSDNAAAALLGVWMNRYFGDAEAEAI---ARRTVQSFHAELLAAPTGLGGLWQVAAFLEA 613

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P  +  V+         E  LA     +        + PAD        E        AR
Sbjct: 614 PHTEIAVIGTPAERQPLERELAWHFLPF------TALAPAD--------EGGDLPVLEAR 659

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISL 706
                    A VC N +C  P  DP  L
Sbjct: 660 PGGGQ----AYVCVNHACQLPTRDPAEL 683


>gi|448310353|ref|ZP_21500197.1| hypothetical protein C493_01015 [Natronolimnobius innermongolicus
           JCM 12255]
 gi|445608208|gb|ELY62067.1| hypothetical protein C493_01015 [Natronolimnobius innermongolicus
           JCM 12255]
          Length = 729

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 236/694 (34%), Positives = 356/694 (51%), Gaps = 59/694 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA +LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEEESFADEAVADVLNEHFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
            +L+P+ KP   GTYFP E+K G+PGF  + R++ D+W    D         Q    A +
Sbjct: 113 AWLTPEGKPFFVGTYFPKEEKRGQPGFLDLCRRISDSWSSPEDRPEMENRAEQWTDAAKD 172

Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
           +L E   + A +     E+    L   A+   +S D + GGFGS  PKFP+P  ++++  
Sbjct: 173 RLEETPDSVAGAEPPTSEV----LTAAADAAVRSADHQHGGFGSGGPKFPQPSRLRVL-- 226

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
            ++  + TG+     E + ++  +L  MA GG++DHVGGGFHRY VD  W VPHFEKMLY
Sbjct: 227 -ARAYDRTGE----GEYRAVLEESLDAMAAGGLYDHVGGGFHRYCVDADWTVPHFEKMLY 281

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D  ++   +L  + LT D  Y+ +  + L+++ R++   GG  FS  DA S + E   R 
Sbjct: 282 DNAEIPRAFLAGYQLTGDERYAEVVAETLEFVDRELTHEGGGFFSTLDAQSEDPETGER- 340

Query: 313 KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           +EGAF+VWT  E+ DIL +   A LF E Y +  +GN            F+G+N    + 
Sbjct: 341 EEGAFFVWTPDEIRDILDDETTAELFCERYDVTESGN------------FEGQNQPNRVR 388

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
              + A    +  ++    L + R ++F+ R +RPRP+ D+KV+ SWNGL+I++ A A+ 
Sbjct: 389 SIDSLAEAYDLAEDELRERLEDAREQVFEAREERPRPNRDEKVLASWNGLMIATCAEAAL 448

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +L  +A                Y E+   A  F+R  L+D    RL+  +++G     G+
Sbjct: 449 VLGEDA----------------YAEMGVDALEFVRDRLWDADEGRLRRRYKDGDVAIQGY 492

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYAFL  G L  YE       L +A+EL  + +  F D + G  + T     S++ R 
Sbjct: 493 LEDYAFLARGALGCYEATGDVDHLAFALELARSIEAEFWDADAGTLYFTPESGESLVTRP 552

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           +E  D + PS   V+V  L+ L     G   D     A   L      ++  A+    +C
Sbjct: 553 QELDDQSTPSATGVAVETLLAL----DGFADDDLESIAVGVLRTHANEIQTNALQHASLC 608

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            AAD L   + + + +   +   ++ + +A A   Y  ++ +    P +    ++ E  N
Sbjct: 609 LAADRLEAGALE-ITVAADELPDEWRDRVADA---YRPDRLIARRPPTEDGLEEWLEALN 664

Query: 671 --SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                A  A       +    VC+N +CSPP  D
Sbjct: 665 LAEPPAIWAGREARDGEPTLYVCRNRTCSPPTHD 698


>gi|448305439|ref|ZP_21495370.1| hypothetical protein C495_14092 [Natronorubrum sulfidifaciens JCM
           14089]
 gi|445588825|gb|ELY43066.1| hypothetical protein C495_14092 [Natronorubrum sulfidifaciens JCM
           14089]
          Length = 727

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 233/704 (33%), Positives = 350/704 (49%), Gaps = 49/704 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF D+ VA+LLN+ FV IKVDREERPDVD +YMT  Q +   GGWPLS
Sbjct: 53  SACHWCHVMEDESFADDEVAELLNENFVPIKVDREERPDVDSIYMTVCQLVTSRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            +L+P+ KP   GTYFP E K G+PGF  IL ++ + W+  R+ +        +  ++ L
Sbjct: 113 AWLTPEGKPFHIGTYFPKESKRGQPGFLDILERLAETWETDREEVENRAQQWTDAATDQL 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
             +  +    +    + L   A+   +S D ++GGFGS  PKFP+P  ++++   ++  +
Sbjct: 173 EETPDTVAAAEPPSSDVLETAADTALRSADRQYGGFGSGGPKFPQPSRLRVL---ARAFD 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+    SE  +++  +L  M  GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++ 
Sbjct: 230 RTGQ----SEYLEVLEESLDAMIDGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIP 285

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
              L  + LT +  Y+    + L ++ R++    G  FS  DA S + E   R +EGAF+
Sbjct: 286 RALLAGYQLTGEERYAETVAETLAFVDRELTHDDGGFFSTLDAQSKDPETGER-EEGAFF 344

Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT +EV ++L +   A LF E Y +  +GN            F+G+N    +   S+ A
Sbjct: 345 VWTPEEVSEVLEDQTTAELFCERYDITESGN------------FEGQNQPNRVQSISSLA 392

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
               +  ++    L   R +LF+ R +RPRP+ D+KV+ SWNGL+I+++A A+ +L    
Sbjct: 393 EAFDLEEQEVETRLEAARERLFEAREQRPRPNRDEKVLASWNGLMIATYAEAALVL---- 448

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                     G D  EY E A  A  F+R  L+D    RL   +++G     G+L+DYAF
Sbjct: 449 ----------GDD--EYAETAVDALEFVRDRLWDADEKRLSRRYKDGDVAVDGYLEDYAF 496

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L    +  YE       L +A+EL  T +  F D E G  + T     S++ R +E +D 
Sbjct: 497 LARAAVGCYEATGEVDHLAFALELARTIEAEFWDAEAGTLYFTPESGESLVTRPQELNDQ 556

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           + PS   V+V  L+ L      S+   +   A   L     R++   +    +C AAD L
Sbjct: 557 STPSAAGVAVETLLALDRFAVDSEE--FEAIASTVLETHANRIEANPLQHASLCLAADRL 614

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNN 673
              + +  V          +      H        +  + P   + ++ W E        
Sbjct: 615 ESGALEITVAADELPDAWRDRFAETYHPD-----RLFALRPPTDDGLEAWLEQLGLADAP 669

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
           A  A       +    VC+  +CSPP  D       L E  S+T
Sbjct: 670 AIWAGREARDGEPTLYVCRGRTCSPPTNDVEDALEWLGENTSAT 713


>gi|308513297|ref|NP_952224.2| thioredoxin domain-containing protein YyaL [Geobacter
           sulfurreducens PCA]
 gi|409911713|ref|YP_006890178.1| thioredoxin domain-containing protein YyaL [Geobacter
           sulfurreducens KN400]
 gi|41152670|gb|AAR34547.2| thioredoxin domain protein YyaL [Geobacter sulfurreducens PCA]
 gi|298505285|gb|ADI84008.1| thioredoxin domain protein YyaL [Geobacter sulfurreducens KN400]
          Length = 710

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 240/695 (34%), Positives = 346/695 (49%), Gaps = 79/695 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESF+D+ VA +LN  +V +KVDREERPD+D  +M   Q + G GGWPL++
Sbjct: 79  TCHWCHVMAAESFDDDEVAAVLNREYVPVKVDREERPDIDDTFMRVAQMMNGSGGWPLTI 138

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD +P    TY P   + G PG   +L K+ + W ++RD++ Q+ +  ++ LS   S
Sbjct: 139 IMTPDRQPFFAATYIPRRSRGGMPGLIDLLEKIAEVWRQRRDVVRQNCSAIMDALSRFNS 198

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              ++ +  DE P +  R   +QL+  YD  FGGFG APKFP  + +  +L + ++  D 
Sbjct: 199 VRPAAAE--DEAPLHGAR---QQLADIYDKEFGGFGGAPKFPMAMNLSFLLRYGQRYGD- 252

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                  E   M   TL  MA+GGI DH+GGGFHRY+VD RW VPHFEKMLYDQ      
Sbjct: 253 ------GEAVAMATDTLTAMAQGGIWDHLGGGFHRYTVDGRWLVPHFEKMLYDQALCTLA 306

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
            ++A  +T +  +  + ++   ++ R++  P G  +SA DADS   EG    +EGA Y+W
Sbjct: 307 LVEAAQVTGNSVFRELAKETCGFVLRELSAPAGGFYSALDADS---EG----REGACYLW 359

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           T  +V DILG     LF   Y +   GN            F+G NVL       A A   
Sbjct: 360 TPAQVRDILGVADGELFCRLYAVTAWGN------------FEGANVLHLPLAPDAFARDE 407

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+   +    + +    L + R +RPRP  D+K+I  WNGL+I++ AR   I   E    
Sbjct: 408 GVDPLRLQEKIAQWHILLLEARERRPRPFRDEKIITGWNGLMIAALARTFLICGDEL--- 464

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDDYAFL 497
                         +E AE A   +RR   D +T   RL  S   G +  PGFL+DYAF 
Sbjct: 465 -------------LLEGAERA---VRRVCIDLRTPAGRLVRSCHRGEASGPGFLEDYAFF 508

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I GLL+L+E     + L  A  L +    LF D  GGG F+T  +  ++L+R K   DGA
Sbjct: 509 IRGLLELHEATLDPRHLALARSLAHDMLRLFGD-SGGGLFDTGSDAETILVRGKGALDGA 567

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL--AVFETRLKDMAMAVPLMCCAADM 615
            PSGN+++   L+RL  I      D   + A   +  A      +  A  + L+C   ++
Sbjct: 568 IPSGNAMAASVLIRLGRIT----GDGVFEEAGRGIIRAFLAGAARQPAAHIHLLCALGEL 623

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L+ P               FE ++AAA   + + + +  +       +   E   +  A 
Sbjct: 624 LADP---------------FEVVIAAATRPHAVRELLCILGGRLIPGLVLMEREENAPAR 668

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 S    +A VC    C PPVT P  LE +L
Sbjct: 669 EGGGGGS----IARVCAGRVCLPPVTAPEGLEEIL 699


>gi|448307474|ref|ZP_21497369.1| hypothetical protein C494_07045 [Natronorubrum bangense JCM 10635]
 gi|445595646|gb|ELY49750.1| hypothetical protein C494_07045 [Natronorubrum bangense JCM 10635]
          Length = 727

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 229/689 (33%), Positives = 349/689 (50%), Gaps = 49/689 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT  Q +   GGWPLS
Sbjct: 53  SACHWCHVMESESFADEEVAEMLNENFVPIKVDREERPDVDSIYMTVCQLVTSRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            +L+P+ KP   GTYFP E K G+PGF  IL ++ + W+  RD +        +  ++ L
Sbjct: 113 AWLTPEGKPFHIGTYFPKESKRGQPGFLDILERLAETWETDRDEVENRAQQWTDAATDQL 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
             +  +    +    +AL   A+   +S D ++GGFGS  PKFP+P  ++++   ++  +
Sbjct: 173 EETPDTVAAAEPPSSDALEAAADTAVRSADRQYGGFGSGGPKFPQPSRLRVL---ARAFD 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+     E  +++  +L  M  GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++ 
Sbjct: 230 RTGR----EEYLEVLEESLDAMIDGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIP 285

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
              L  + LT +  Y+    + L+++ R++    G  FS  DA S ++E   R +EGAF+
Sbjct: 286 RALLAGYQLTDEERYAETVAETLEFVERELTHDEGGFFSTLDAQSEDSETGER-EEGAFF 344

Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT +EV ++L +   A LF   Y +  +GN            F+G+N    +   S+ A
Sbjct: 345 VWTPEEVSEVLADETDADLFCARYDITESGN------------FEGQNQPNRVQSISSLA 392

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +  +        L   R +LF+ R +RPRP+ D+KV+ SWNGL+I+++A A+ +L    
Sbjct: 393 GEFDLEESDVETRLEAARERLFEAREQRPRPNRDEKVLASWNGLMIATYAEAALVL---- 448

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                     G D  EY E A  A  F+R  L+D    RL   +++G     G+L+DYAF
Sbjct: 449 ----------GDD--EYAETAVDALEFVRDRLWDADEKRLSRRYKDGDVAVDGYLEDYAF 496

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L    +  YE       L +A+EL  + +  F D E G  + T     S++ R +E +D 
Sbjct: 497 LARAAVGCYEATGEVDHLAFALELARSIEAEFWDAEAGTLYFTPESGESLVTRPQELNDQ 556

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
             PS   V+V  L+ L      S++  +   A   L     R++   +    +C AAD L
Sbjct: 557 PTPSAAGVAVETLLALDGFAGDSEA--FEAIASTVLETHANRIEANPLQHASLCLAADRL 614

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNN 673
              + +  V         + +  A    +Y  ++      P + + ++ W E        
Sbjct: 615 ESGALEITVAADELPDA-WRDRFA---ETYRPDRLFARRPPTE-DGLEAWLEQLGLADAP 669

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
           A  A       +    VC+  +CSPP  D
Sbjct: 670 AIWAGREARDGEPTLYVCRGRTCSPPTRD 698


>gi|115372663|ref|ZP_01459970.1| thymidylate kinase [Stigmatella aurantiaca DW4/3-1]
 gi|310823874|ref|YP_003956232.1| hypothetical protein STAUR_6648 [Stigmatella aurantiaca DW4/3-1]
 gi|115370384|gb|EAU69312.1| thymidylate kinase [Stigmatella aurantiaca DW4/3-1]
 gi|309396946|gb|ADO74405.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 694

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 235/701 (33%), Positives = 345/701 (49%), Gaps = 69/701 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED  +A ++N  F++IKVDREERPD+D++Y   VQ +  GGGWPL+
Sbjct: 57  SACHWCHVMAHESFEDPAIASVMNAHFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDL+P  GGTYFPP+DKYGRPGF  +L  + DAW  +R+ +    A   E L E  
Sbjct: 117 VFLTPDLRPFYGGTYFPPQDKYGRPGFPKVLESLHDAWMNQREKVLGQAADFREGLGEL- 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A+      P  L    +    E++ +  D   GGFG APKFP P+ +  +L   ++   
Sbjct: 176 -ATYGLEAAPAALSVEDVLKMGERMLRHVDPVNGGFGGAPKFPNPMNVSFLLRAWRR--- 231

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G     +   L TL+ MA GG++D +GGGFHRY+VD+RW VPHFEKMLYD  QL +
Sbjct: 232 ----GGPEPLKDAALRTLERMALGGVYDQLGGGFHRYAVDDRWRVPHFEKMLYDNAQLLH 287

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +   +     +  +  +  +Y+RR+M    G  ++A+DADS   EG    +EG F+V
Sbjct: 288 LYAEGEQVESRPLWRKVVEETAEYVRREMTDARGGFYAAQDADS---EG----EEGRFFV 340

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  +V  +L  EHA L   H+ + P GN +           +G  VL      +  A +
Sbjct: 341 WTPAQVCSVLTPEHANLLLRHFRITPQGNFE-----------QGATVLEVAVPVAQIAHE 389

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G+  E     L   R  LF +R +R +P  DDK++  WNGL+I   A AS++       
Sbjct: 390 RGLSQEALERTLTAAREALFGIREQRVKPGRDDKILSGWNGLMIRGLAFASRVF------ 443

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                      R E+ ++A  +A F+  H++D    RL  S+  G  +  GFL+DY    
Sbjct: 444 ----------GRPEWAQLAAGSADFVLTHMWD--GTRLSRSYEEGGGRIDGFLEDYGDFA 491

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL  LY+     K+L  A  L      LF D E   Y +       +++      D A 
Sbjct: 492 VGLTALYQATFEAKYLEAASALVKRAVALFWDEEKQAYLSAPKGQKDLVVATYSLFDNAF 551

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSG S      V LA++  G KS  + +  E  L+     L+D  +    +  AAD   +
Sbjct: 552 PSGASTLTEAQVALAALT-GDKS--HLELPERYLSRMRKALEDNPLGYGHLALAADTF-L 607

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
                +   G +  V    +L  A  ++     V             W+E  +   ++ +
Sbjct: 608 DGGAGITFAGTREQV--APLLEVAQRAFAPTFAV------------GWKEAGAPVPAVLK 653

Query: 679 NNFSADKVV-----ALVCQNFSCSPPVTDPISLENLLLEKP 714
             F   + V     A VC+ F+C  P+T+P  L+  L  +P
Sbjct: 654 ELFEGREPVEGKGAAYVCRGFACERPLTNPEQLKARLGARP 694


>gi|407772664|ref|ZP_11119966.1| hypothetical protein TH2_02165 [Thalassospira profundimaris WP0211]
 gi|407284617|gb|EKF10133.1| hypothetical protein TH2_02165 [Thalassospira profundimaris WP0211]
          Length = 679

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 236/701 (33%), Positives = 358/701 (51%), Gaps = 76/701 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDEG+A L+N+ F++IK+DREERPD+D +Y   +  L   GGWPL++
Sbjct: 52  ACHWCHVMAHESFEDEGIAALMNELFINIKLDREERPDLDALYQNALALLGQQGGWPLTM 111

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P  GGTYFP E +YGRPGF  +L+ V   + +K D +  +    + Q+S AL 
Sbjct: 112 FLTPDGEPFWGGTYFPKEARYGRPGFGDVLKTVAKIYAEKPDDVRHN----VSQISNALI 167

Query: 141 A--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
              SA+   +P       +  C     +  D   GG   APKFP+P  +  +     + +
Sbjct: 168 KMNSAAVGAVPS---LEMIDRCGHGCLQIMDGENGGTSGAPKFPQPSLLSYIWRTGVRTD 224

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D G        +++V  +L  M +GGI+DH+GGG  RY+VD++W VPHFEKMLYD  QL 
Sbjct: 225 DDGL-------KRIVKHSLDRMCQGGIYDHLGGGLARYAVDDQWLVPHFEKMLYDNAQLI 277

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
           ++  D + +  +  Y+    + + ++ R+M  PGG   ++ DADS   EG     EG FY
Sbjct: 278 DLLCDVWRVDPNPLYAKRVEETIGWILREMRIPGGAFTASLDADS---EGV----EGKFY 330

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VW+  E++ ILG +A LFK+ Y +   GN            ++G  +L      + +AS 
Sbjct: 331 VWSEDEIDQILGANADLFKKFYDVSKDGN------------WEGHTIL------NRTASG 372

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           L +  +     L E R KL   R+KR RP  DDK +  WN + I++FA A+         
Sbjct: 373 LELADDATEEKLAELRAKLLAERAKRIRPGWDDKALTDWNAMTIAAFAEAAMTFH----- 427

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                      R ++++ A+ A  F+   L   +  R  HS+R+G  +  G L+DYA +I
Sbjct: 428 -----------RADWLDYAKLAYGFVINTLM--KGDRFLHSYRDGRVQHAGMLEDYAHMI 474

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              L LYE      +L  AI      + LF D + GGYF +  +   +++R K   D A 
Sbjct: 475 RAALRLYECFGEDAYLNEAIRWSAAVETLFADAK-GGYFQSASDASDLVVRQKPFMDNAV 533

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN++   NL +L ++   ++   YR  AE +LA F  R+ +    +P +  AA+ML  
Sbjct: 534 PSGNAIMAQNLAKLYALTGDTQ---YRDQAEITLAAFGGRIGEQFPNMPGLMMAAEMLQN 590

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P +  +VL+    S  + +M  A   +Y  N+ +  +   D             +   A+
Sbjct: 591 PVQ--IVLIAKDRSQTYLDMRRAIFGAYLPNRAITILSDGDPLP----------DGHPAQ 638

Query: 679 NNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
              + D K  A +CQ   CS PVT    L  +L + P+  A
Sbjct: 639 GKTAIDGKETAYICQGPVCSAPVTGVEELTEMLADLPAKAA 679


>gi|359728137|ref|ZP_09266833.1| hypothetical protein Lwei2_14957 [Leptospira weilii str.
           2006001855]
          Length = 724

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 257/718 (35%), Positives = 368/718 (51%), Gaps = 82/718 (11%)

Query: 10  TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
           TK R  + LI       TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D++Y
Sbjct: 71  TKAREQNKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 130

Query: 64  MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
           M  + A+   GGWPL++FL+PD KP+ GGTYFPPE +YGR  F  IL  ++  W++KR  
Sbjct: 131 MDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPRYGRKSFLEILNILRKVWNEKR-- 188

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDE---LP-QNALRLCAEQLSKSYDSRFGGFGS-- 177
             Q    A  +LS  L  S     +  +   LP +N            YD+ FGGF +  
Sbjct: 189 --QELIVASSELSRYLKDSGEGRAIEKQEGSLPSENCFDSGFSLYESYYDAEFGGFKTNH 246

Query: 178 APKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
             KFP  + +  +L  YHS        SG      +MV  TL  M +GGI+D +GGG  R
Sbjct: 247 VNKFPPSMGLSFLLRYYHS--------SGNP-RALEMVENTLLAMKQGGIYDQIGGGLCR 297

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           YS D  W VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG I
Sbjct: 298 YSTDHHWMVPHFEKMLYDNSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGI 357

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSD 355
            SAEDADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN        
Sbjct: 358 CSAEDADS---EG----EEGLFYIWDFEEFREVCGEDSQILEKFWNVTKKGN-------- 402

Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVI 414
               F+GKN+L E     + A+K      K ++ +L   R KL + RSKR RP  DDK++
Sbjct: 403 ----FEGKNILHE--SYRSEATKFSEEEWKRIDSVLERGRAKLLERRSKRVRPLRDDKIL 456

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
            SWNGL I + A+A                 V   R++++++AE   SFI ++L D    
Sbjct: 457 TSWNGLYIKALAKAG----------------VAFQREDFLKLAEETYSFIEKNLIDPNG- 499

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
           R+   FR+G S   G+ +DYA +IS  + L+E G G ++L  A+     +D + L R   
Sbjct: 500 RILRRFRDGESGILGYSNDYAEMISSSIALFEAGCGIRYLKNAVLWM--EDAIRLFRSPA 557

Query: 535 GYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           G F  TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  Y + AE    
Sbjct: 558 GVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSARYGEFAESIFL 615

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF-ENMLAAAHASYDLNKTV 652
            F   L   +++ P +  A       S K +VL+  +   DF +++LAA    +  +  +
Sbjct: 616 YFTKELSTNSLSYPHLLSAYWTYRRHS-KEIVLI--RKDTDFGKDLLAAIQTRFLPDSVL 672

Query: 653 IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             ++  + EE           +++  +  S    +  VC+NFSC  PV+D   L+  +
Sbjct: 673 AVVNENELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPVSDLADLKKWI 723


>gi|322371783|ref|ZP_08046326.1| hypothetical protein ZOD2009_19818 [Haladaptatus paucihalophilus
           DX253]
 gi|320548668|gb|EFW90339.1| hypothetical protein ZOD2009_19818 [Haladaptatus paucihalophilus
           DX253]
          Length = 713

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 238/700 (34%), Positives = 348/700 (49%), Gaps = 70/700 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE VA+LLN+ FV IKVDREERPD+D +YM+  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMEEESFEDEDVAELLNEHFVPIKVDREERPDIDAIYMSICQQVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            +L+PD KP   GTYFP   + GRPGF  +L  VK+ W +  + +   G    EQ ++A+
Sbjct: 113 AWLTPDGKPFYVGTYFPKRSQQGRPGFIDLLENVKNTWQENPEEMKNRG----EQWTDAI 168

Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKL 197
                S    D+ P    L   AEQ  ++ D  +GGFG   PKFP+P  + ++L   +  
Sbjct: 169 EGELESTPEADDAPGPELLGSAAEQTVRTADREYGGFGRGGPKFPQPARLHLLL---RAY 225

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           + TG    A++ + + +  L  MA GG++DH+GGGFHRY+ D +W VPHFEKMLYD  +L
Sbjct: 226 DRTG----ATQYRDVAVEALDAMADGGMYDHIGGGFHRYATDRKWTVPHFEKMLYDNAEL 281

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              YL  + LT D  Y+ + R+    L R+M  P G  +S  DA S +  G    +EG F
Sbjct: 282 PRAYLAGYQLTGDERYAELVRETFASLEREMRHPEGGFYSTLDARSEDEAG--NYEEGPF 339

Query: 318 YVWTSKEV---------EDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
           YVWT  +V         +DI  E  A +  E Y +  +GN            F+GK VL 
Sbjct: 340 YVWTPSDVYEAVEDERDDDIDTETRADIVCERYGVTQSGN------------FEGKTVLT 387

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
              D    A K  +  ++  ++L + R  +F+ R +R RP  D+K++  WNGL+I++ A 
Sbjct: 388 LTTDVPDLAEKYDVSEDEVRDVLADARHSMFEAREERERPPRDEKILAGWNGLLIAALAE 447

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
              +L                  + Y ++A  A  F+R  L+DE   +L   F++     
Sbjct: 448 GGFVLD-----------------EHYTDLAADALDFVREKLWDEADAKLSRRFKDEDVAI 490

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
            G+L+DYAFL  G   LYE       L +A++L    +  F D E    + T      ++
Sbjct: 491 DGYLEDYAFLARGAFALYESTGNPDHLEFALDLARAIEREFWDAERETLYFTPESGERLV 550

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM---AM 604
            R +E  D + PS   V+   L  L+              AE   AV ET  + +     
Sbjct: 551 ARPQELADQSTPSSLGVATDVLAVLSEFAPDEAF------AEIPEAVLETHARTVESNPF 604

Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
               +  AAD  +  S + + + G +    + + LA  +    L   V+   P   + + 
Sbjct: 605 QYATLVLAADRNATGSLE-LTVAGDELPEAWHDQLAETY----LPMRVLTRRPPTEDGVA 659

Query: 665 FWEEH--NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
            W E     N   +  +  SA +    VC++F+CSPPVTD
Sbjct: 660 AWCEKLGVENVPPIWADRESAGEPTLYVCRSFTCSPPVTD 699


>gi|320102044|ref|YP_004177635.1| hypothetical protein Isop_0491 [Isosphaera pallida ATCC 43644]
 gi|319749326|gb|ADV61086.1| protein of unknown function DUF255 [Isosphaera pallida ATCC 43644]
          Length = 723

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 241/739 (32%), Positives = 365/739 (49%), Gaps = 80/739 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    + CHWCHVME ESFE   +A L+N WFV+IKVDREERPD
Sbjct: 40  GEEAFAKAKAENKPIFLSVGYSACHWCHVMERESFESPTIAALMNQWFVNIKVDREERPD 99

Query: 59  VDKVYMTYVQAL-YGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           +D++YM  VQAL  G GGWP+SVF++P+ +P  GGTY+PP D  G PGF  IL  +  AW
Sbjct: 100 IDQIYMAAVQALNQGHGGWPMSVFMTPEGEPFFGGTYYPPHDARGMPGFPRILEGLATAW 159

Query: 118 DKKRDMLAQSGAFAIEQLSE------------ALSASASSNKLPDELPQNALRLCAEQLS 165
            ++   + ++ A  +E L +            AL   A+ ++  D L    +   A  L 
Sbjct: 160 REREPEVREAAARLVEHLRKRNEPMPPLIKGPALDHPAADDR--DGLDPGWIAEAARALG 217

Query: 166 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGI 225
           + +DSR+GGFGSAPKFP P++++++L H ++++D            MV+ TL  M++GGI
Sbjct: 218 RVFDSRYGGFGSAPKFPHPMDLKLLLRHHQRVQD-------PRALAMVIQTLDHMSRGGI 270

Query: 226 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLR 285
           +DH+GGGF RY+ DERW VPHFEKMLYD   L +   +      D   + +  + LDYL 
Sbjct: 271 YDHLGGGFARYATDERWLVPHFEKMLYDNALLISALAETIQCRPDPTLARVVVETLDYLA 330

Query: 286 RDMIGP--GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYL 342
             M GP      F+ EDADS   EG     EG +YVW+  E+ + LGE    LF E Y +
Sbjct: 331 ERMTGPPEAPGFFATEDADS---EGV----EGKYYVWSRDEMLETLGEPLGSLFAEVYDV 383

Query: 343 KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS 402
              GN            ++G ++L         A +LG P ++    L + R  L   R 
Sbjct: 384 TEAGN------------WEGHSILNLPEPLDRVAQRLGRPTDQLAAELAQARALLKARRD 431

Query: 403 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 462
           +R  P  D K++ SWNGL++++ A A+ ++                DR +++E AE AA 
Sbjct: 432 RRIPPGKDTKILTSWNGLMLAAIAEAAWVV----------------DRPDHLERAEKAAG 475

Query: 463 FIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN 522
           F+  HL  +   RL H F++G ++  G+L+DYA+LI GL  L +    T+W+  A +L  
Sbjct: 476 FLLDHLR-QPDGRLFHVFKDGRARFNGYLEDYAYLIDGLTRLGQVTGTTRWIREARDLSR 534

Query: 523 TQDELFLDR--EGGGYFNTTG-EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
              E F D   +G G F  TG    +++ R ++  D A PS  +++V  L+RLA++   +
Sbjct: 535 LMIEEFGDEVIDGVGGFAFTGVRHETLVARPRDLFDNATPSAAAMAVTALLRLAAL---T 591

Query: 580 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD-FENM 638
                R      L      +K    A      A D         +V+ G     D    +
Sbjct: 592 DDQALRGRGLAGLRALAPLMKHAPTAAAQSLIALDFALRDPEIALVVPGQLDPSDTLAQV 651

Query: 639 LAAAHASYDLNKTVI--HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSC 696
           L   H  +   + ++   +DP    ++            +   +   D V   +C+  +C
Sbjct: 652 LRLLHRDFQPGRLLLVRSLDPPHPHDLHLL-------PPLQGRDHPHDHVTLYLCRGQTC 704

Query: 697 SPPVTDPISLENLLLEKPS 715
             P+    ++   L   P+
Sbjct: 705 QAPLVGVEAIAQALTSPPT 723


>gi|188475827|gb|ACD50089.1| hypothetical protein [uncultured crenarchaeote MCG]
          Length = 684

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 248/710 (34%), Positives = 365/710 (51%), Gaps = 94/710 (13%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A +LN+ FV +KVDREERPD+D +YM    AL G GGWP+SV
Sbjct: 49  ACHWCHVMAHESFEDELTASILNENFVCVKVDREERPDLDAIYMRATVALSGSGGWPMSV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PDL+P   GTYFPP  +Y  PGF  +LR +  AW  ++          I  ++  + 
Sbjct: 109 FLTPDLRPFYAGTYFPPARRYNLPGFPELLRALAQAWGTRQQ--------EIHAVAARVD 160

Query: 141 ASASSNKLPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            S S+  LP  L    Q  L      L +  D + GG+G+APKFP+P+ I+++L     L
Sbjct: 161 QSLSTPDLPSHLGVVSQQLLEQAESWLVRHADRQHGGWGAAPKFPQPMAIELLL-----L 215

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +     G  ++G  +   +LQ MA+GG++D +GGGF RYS D  WHVPHFEKMLYD  QL
Sbjct: 216 QAAADPGAHADGLAVATQSLQAMARGGMYDVLGGGFSRYSTDTTWHVPHFEKMLYDNAQL 275

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A  YL AF +T +  +  +  + LD++ R+M  P G  +S+ DADS   EG    +EG +
Sbjct: 276 ALAYLHAFLVTGETSFRQVAAETLDFVAREMTHPEGGFYSSLDADS---EG----REGKY 328

Query: 318 YVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           YVWT  E+ +++G+ ++  LF   Y     G    S         +G+ +L    + +  
Sbjct: 329 YVWTQAEIREVIGDPSMTELFLAAY---DAGTAPAS---------QGEIILQRAPNDANL 376

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           +++      +   +L   R +LF  R  RPRP LDDKVIV+WNGL++ +FA+A++     
Sbjct: 377 SARFDKSASEIEELLQRARARLFRARQARPRPGLDDKVIVAWNGLMLQAFAQAARC---- 432

Query: 436 AESAMFNFPVVGSDRKE-YMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGFLDD 493
                  F   GS   + Y+EVA   A+F+  +L +  Q HR+   +R G +    FL+D
Sbjct: 433 -------FGGAGSGTGDMYLEVATRNAAFLLGNLRNHGQLHRI---WRRGKTGQHVFLED 482

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG--GGYFNTTGEDPSVLLRVK 551
           YA LI GLLDLY+      W + A +L    DE+ L      GG+F+T  +    L+R  
Sbjct: 483 YAALILGLLDLYQADFSNAWFIAARQL---ADEMLLRFAAPDGGFFDTPDDSKPPLIRPM 539

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           E  DGA P+G +++   L++LA++   +    YR +AE +L +      +  ++      
Sbjct: 540 ELQDGATPAGGALATEALLKLAALTGEAT---YRDHAERTLPLGLANAAESPLSYARWLA 596

Query: 612 AAD----------MLSVPSRKHVVLVGHKSSVDFEN-MLAAAHASYDLNKTVIHIDPADT 660
           AA           +L  PS   V  +G  +S    + M+AA+          +  D    
Sbjct: 597 AAALALAGPRQLALLFPPSANPVAFLGVVNSAFRPHWMVAASPYPPPTGAPPLLQD---- 652

Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                                 A+   A VC++F+C  P+TDP  L  LL
Sbjct: 653 ------------------RPVVANLPTAFVCRDFACLRPITDPAELPALL 684


>gi|403380657|ref|ZP_10922714.1| hypothetical protein PJC66_12642 [Paenibacillus sp. JC66]
          Length = 547

 Score =  370 bits (950), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 226/594 (38%), Positives = 327/594 (55%), Gaps = 53/594 (8%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE VA  LN  ++++KVDREERPDVDK+YM+  QA+ G GGWPL+V ++PD K
Sbjct: 1   MAQESFEDEKVAAWLNAHYIAVKVDREERPDVDKLYMSVCQAMTGQGGWPLTVLMTPDKK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   +YG+PG   I+ +V   W ++R+ L        E+++E +  +     
Sbjct: 61  PFFVGTYFPKTSQYGKPGVIDIVSQVHQKWTEQREELLDIA----EEIAETVR-NRQETA 115

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
           L  EL  + L +  E  S+++DS++GGFG APKFP P ++  +L + K+   TG+     
Sbjct: 116 LSGELSADMLDMAYELFSQAFDSQYGGFGDAPKFPSPHQLSFLLRYYKR---TGEQDALD 172

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
             +K    TL+ M +GG++DH+G GF R S DERW VPHFEKMLYD   LA VYL+A+ +
Sbjct: 173 MAEK----TLEGMHRGGMYDHIGYGFARCSADERWLVPHFEKMLYDNALLAAVYLEAYEV 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T    Y+ I   I  Y++RDM    G  FSAE + S   EGA    E  FY+WT +EV  
Sbjct: 229 TGKQEYAEIAEQIFAYVKRDMTSSEGFFFSAEGSHS---EGA----EEQFYLWTPEEVNA 281

Query: 328 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG-MPLEK 385
           +LGE    LF + + ++  G  D            G +V   L  + ++ ++L  M   +
Sbjct: 282 VLGEEDGELFCDVFDIQEDGPVD------------GYSVPNLLGLTRSTFARLQRMDPAE 329

Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
               L   R KLF  R +R RPH DDK++ +WNGL+I + A+ +K+L+            
Sbjct: 330 RERRLERSRVKLFQHRERRARPHKDDKMLTAWNGLMIMALAKGAKVLQ------------ 377

Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
               + E+ + A+ A  FI + L  E   RL   +R+G +  P +LDDYAFL+ GL++LY
Sbjct: 378 ----KAEHADAAQKAVGFILQRLVREDG-RLLARYRDGDAAIPAYLDDYAFLVWGLIELY 432

Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
           E    T++L  A+        LF D E GG++ +  +   +L R KE HDG  PSGNS +
Sbjct: 433 EATRETEYLHQAVRFNQEMIRLFWDDESGGFYFSGIDGEKLLARSKEIHDGDMPSGNSAA 492

Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
            +NL+RLAS+   +K     Q A   L  F   ++       +  CA D +  P
Sbjct: 493 AMNLLRLASLTEDTK---LLQLAHRQLRSFAAVVEQYPAGFSMYLCALDSILPP 543


>gi|448373972|ref|ZP_21557857.1| hypothetical protein C479_01326 [Halovivax asiaticus JCM 14624]
 gi|445660649|gb|ELZ13444.1| hypothetical protein C479_01326 [Halovivax asiaticus JCM 14624]
          Length = 760

 Score =  369 bits (948), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 238/718 (33%), Positives = 348/718 (48%), Gaps = 65/718 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA +LN+ FV IKVDREERPDVD +YMT  QA+ G GGWPLS
Sbjct: 53  SACHWCHVMEAESFADETVATVLNEGFVPIKVDREERPDVDSIYMTVCQAVTGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG----AFAIEQL 135
            +L+PD +P   GTYFP E + G PGF  + R+++ +W + RD +        A A ++L
Sbjct: 113 AWLTPDGRPFYVGTYFPREAQRGTPGFLELCRQIRVSWSENRDEIESRADEWTAMAADRL 172

Query: 136 SEALSASASSNKLP---------------DELPQNALRLCAEQLSKSYDSRFGGFG-SAP 179
             A +A   S+  P               D    +AL    E   ++ D   GGFG   P
Sbjct: 173 DSAAAAGNESSSTPAPISADTGSPIDGGLDADGPDALERVGEAALRASDDEHGGFGRGGP 232

Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           KFP+P  ++ +L    +L+    + +    ++     L  M  GG++DHVGGGFHRY VD
Sbjct: 233 KFPQPRRVESLL----RLD---AAHDRPNARETATRALDAMCSGGLYDHVGGGFHRYCVD 285

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
           E W VPHFEKMLYD   +    L  + +T D  Y+   R+ +D+L R++  P G  +S  
Sbjct: 286 EDWTVPHFEKMLYDNAAIPRALLAGYQVTGDDRYARTVRETVDFLERELRHPEGGFYSTL 345

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI------LFKEHYYLKPTGNCDLSRM 353
           DA S ETE   R +EGAFYVWT  E+E  + E  +      LF   + +  +GN      
Sbjct: 346 DAQS-ETESGER-EEGAFYVWTPAEIESAVAEAGLSDESGALFCNRFGVTDSGN------ 397

Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
                 F+G  VL         A+  G+      + L   R  +F+ R+ RPRP  D+K+
Sbjct: 398 ------FEGSTVLTVEASIEDLATDYGLAPSTVEDRLDAARTAVFEARATRPRPPRDEKI 451

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVG------SDRKEYMEVAESAASFIRRH 467
           +  WNGL I   A AS +L +    A  N    G      S    Y ++A  A +F+R +
Sbjct: 452 LAGWNGLAIDMLAEASIVLGTSGREAATNAASAGGASDGPSGDDRYAQLATDALAFVRTN 511

Query: 468 LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDEL 527
           L+D+ T RL    R+G     G+L+DYAFL  G L  YE     + L +A++L       
Sbjct: 512 LWDDDTGRLARRVRDGDVGIDGYLEDYAFLARGALTCYEATGEVEPLAFALDLARAIRRD 571

Query: 528 FLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
           F D      + T     S+L+R +E  D + PS   V+V  L  L    A    + + + 
Sbjct: 572 FWDESAETLYFTPERGESLLVRPQELGDQSTPSPTGVAVEILAMLDPFTA----EPFGEM 627

Query: 588 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 647
           A   ++   T +++       +  A D+++      V  V     +++E  L   +    
Sbjct: 628 ARRVVSTHATEIEESPFEYVSLSLAQDLVTH-GPLEVTTVADGRPMEWERTLGRTY---- 682

Query: 648 LNKTVIHIDPADTEEMDFWEE---HNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           L + ++   PA +  +D W +    ++     A     AD+    VC +  CSPP  D
Sbjct: 683 LPRRLLAPRPASSAMLDDWLDVIGLDTVPPIWADREQRADEPTVYVCADRVCSPPEHD 740


>gi|116327565|ref|YP_797285.1| hypothetical protein LBL_0795 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|116120309|gb|ABJ78352.1| Conserved hypothetical protein containing a thioredoxin domain
           [Leptospira borgpetersenii serovar Hardjo-bovis str.
           L550]
          Length = 692

 Score =  369 bits (948), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 253/713 (35%), Positives = 361/713 (50%), Gaps = 71/713 (9%)

Query: 10  TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
           TK R    LI       TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D++Y
Sbjct: 38  TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 97

Query: 64  MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
           M  + A+   GGWPL++FL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W +KR  
Sbjct: 98  MDALHAMDQQGGWPLNIFLTPDGKPIAGGTYFPPEPVYGRKSFLEVLNILRKVWSEKRQE 157

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGS--APK 180
           L  + +     L ++    A   +    LP          L +S YD+ FGGF +    K
Sbjct: 158 LIVASSELSRYLKDSGEGRAIEKQEEGSLPSKDCFNSGFSLYESYYDAEFGGFRTNHVNK 217

Query: 181 FPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           FP  + +  +L YH         S    +  +MV  TL  M +GGI+D VGGG  RYS D
Sbjct: 218 FPPSMGLSFLLRYH--------HSSGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTD 269

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
            RW VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG I SAE
Sbjct: 270 HRWMVPHFEKMLYDNSLFLETLVECSQVSKKISAESFALDVISYLHRDMRIVGGGICSAE 329

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
           DADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN            
Sbjct: 330 DADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTNKGN------------ 370

Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           F+GKN+L E       A+KL     K ++ +L   R KL + RSKR RP  DDK++ SWN
Sbjct: 371 FEGKNILHE--SYGGEATKLSEEEWKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWN 428

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL I + A+A                 +   R++++++AE   SFI R+L D    R+  
Sbjct: 429 GLYIKALAKAG----------------IAFQREDFLKLAEETYSFIERNLIDPDG-RILR 471

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
            FR+  S   G+ +DYA +IS  + L+E G G ++L  A+        LF  R   G F 
Sbjct: 472 RFRDSESGILGYSNDYAEMISSSIVLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 529

Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  YR+ AE   + F  
Sbjct: 530 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTK 587

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            L   +++ P +  A       S K +VL+  K +   +++LAA    +  +     ++ 
Sbjct: 588 ELSTHSLSYPHLLSAYWTYKYHS-KEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNE 645

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            + EE           + +  +  S    +  VC+NFSC  PV++   L+  +
Sbjct: 646 NELEEA-------RKLSVLFDSRDSGGNALVYVCENFSCKLPVSNLADLQKWI 691


>gi|420158002|ref|ZP_14664826.1| PF03190 family protein [Clostridium sp. MSTE9]
 gi|394755349|gb|EJF38596.1| PF03190 family protein [Clostridium sp. MSTE9]
          Length = 685

 Score =  369 bits (948), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 244/706 (34%), Positives = 356/706 (50%), Gaps = 73/706 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G ++F    +  +  FL    +TCHWCHVM  ESFED+ VA+ LN  FV IKVDREERPD
Sbjct: 33  GEQAFEKAKREDKPIFLSIGYSTCHWCHVMAHESFEDDEVAEALNQGFVCIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYMT  QA+ G GGWP+++ ++P+ +P   GTY P    +   G   +L  +++ W 
Sbjct: 93  IDAVYMTVCQAMTGSGGWPMTILMTPEQRPFWAGTYLPKMSTFRSTGLLELLAFIREQWS 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
             R  L  +G      L E    S  S K   +L    LR    QLS SYDSR+GGFG A
Sbjct: 153 TNRQQLLNAGEEITNYLREQSGPSLGSAKPELDL----LRGAVAQLSASYDSRWGGFGGA 208

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L +S  + +  KS      Q M  +TL  M +GG+ DH+GGGF RYS 
Sbjct: 209 PKFPAPHNLLFLLRYS--VLEREKS-----AQSMAEYTLSQMFRGGLFDHIGGGFSRYST 261

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D +W VPHFEKMLYD   LA  YL+A+++T    Y  + +  LDY+ R++    G  +  
Sbjct: 262 DVKWLVPHFEKMLYDNALLAYTYLEAYAVTGRPLYRSVAKRTLDYVLRELTDEQGGFYCG 321

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
           +DADS   +G     EG +YV+T +EV+ +LG E   LF   + +   GN          
Sbjct: 322 QDADS---DGV----EGKYYVFTPQEVQGVLGKEDGELFCSRFGVTEAGN---------- 364

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
             F+GK++   L+ S+          E+  +I   C+R L++ R +R R H DDKV+ SW
Sbjct: 365 --FEGKSIPNLLDFSAYD--------EEDPHIAQLCQR-LYEYRLERTRLHRDDKVLTSW 413

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           N L+I++ A+A  +L                D  EY++ A+ A  F+   L DE+  RL 
Sbjct: 414 NALMIAALAKAGWLL----------------DEPEYLQAAQKAQRFLEEKLVDERG-RLL 456

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
             +R G +   G LDDYAF    LL+LY       +L+ A ++     ELF D E GG +
Sbjct: 457 LRWREGEAANDGQLDDYAFYAFSLLELYRSSFDCTYLLRAAQIAEQILELFSDAEQGGLY 516

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            T  +   ++ R KE +DGA PSGNSV+    VRLA++    +   +RQ  E  +     
Sbjct: 517 LTAKDSEQLISRPKEVYDGAIPSGNSVAGEVFVRLAALTGEER---WRQAGERQIRFLTG 573

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLV-GHKSSVDFENMLAAAHASYDLNKTVIHID 656
            +K+      +   A   +  PS++ V    G ++  +  + L      + L    + + 
Sbjct: 574 WIKEYPAGYGMSLIALSSVLYPSQELVCTAQGEEAFQEVRDFL----RRHSLPSLTVLLK 629

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
            A  E     +E  +            D V   +CQN +C+ PV +
Sbjct: 630 CAKNE-----QELAAAAPFTVEYPLPQDGVRYYLCQNGTCAAPVQE 670


>gi|418738150|ref|ZP_13294546.1| PF03190 family protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|410746324|gb|EKQ99231.1| PF03190 family protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
          Length = 692

 Score =  369 bits (947), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 253/713 (35%), Positives = 361/713 (50%), Gaps = 71/713 (9%)

Query: 10  TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
           TK R    LI       TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D++Y
Sbjct: 38  TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 97

Query: 64  MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
           M  + A+   GGWPL++FL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W +KR  
Sbjct: 98  MDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKVWSEKRQE 157

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGS--APK 180
           L  + +     L ++    A   +    LP          L +S YD+ FGGF +    K
Sbjct: 158 LIVASSELSRYLKDSGEGRAIEKQEEGSLPSKDCFNSGFSLYESYYDAEFGGFKTNHVNK 217

Query: 181 FPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           FP  + +  +L YH         S    +  +MV  TL  M +GGI+D VGGG  RYS D
Sbjct: 218 FPPSMGLSFLLRYH--------HSSGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTD 269

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
            RW VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG I SAE
Sbjct: 270 HRWMVPHFEKMLYDNSLFLETLVECSQVSKKISAESFALDVISYLHRDMRIVGGGICSAE 329

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
           DADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN            
Sbjct: 330 DADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTNKGN------------ 370

Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           F+GKN+L E       A+KL     K ++ +L   R KL + RSKR RP  DDK++ SWN
Sbjct: 371 FEGKNILHE--SYGGEATKLSEEEWKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWN 428

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL I + A+A                 +   R++++++AE   SFI R+L D    R+  
Sbjct: 429 GLYIKALAKAG----------------IAFRREDFLKLAEETYSFIERNLIDPDG-RILR 471

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
            FR+G S   G+ +DYA +IS  + L+E G G ++L  A+        LF  R   G F 
Sbjct: 472 RFRDGESGILGYSNDYAEMISSSIVLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 529

Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  YR+ AE   + F  
Sbjct: 530 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTK 587

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            L   +++ P +  A         K +VL+  K +   +++LAA    +  +     ++ 
Sbjct: 588 ELSTHSLSYPHLLSAYWTYRY-HFKEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNE 645

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            + EE           + +  +  S    +  VC+NFSC  PV++   L+  +
Sbjct: 646 NELEEA-------RKLSVLFDSRDSGGNALVYVCENFSCKLPVSNLADLQKWI 691


>gi|405355793|ref|ZP_11024905.1| Thymidylate kinase [Chondromyces apiculatus DSM 436]
 gi|397091065|gb|EJJ21892.1| Thymidylate kinase [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 696

 Score =  369 bits (946), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 243/697 (34%), Positives = 345/697 (49%), Gaps = 69/697 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE    A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+
Sbjct: 57  SACHWCHVMAHESFESPDTARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP  GGTYFPP+DKYGRPGF  +L  ++DAW+ K+D + +  A   E L E  
Sbjct: 117 VFLTPDLKPFYGGTYFPPQDKYGRPGFPRLLMALRDAWENKQDEVQRQSAQFEEGLGEL- 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            AS      P  L    +    + ++K  D+  GGFG APKFP P+   +ML   ++   
Sbjct: 176 -ASYGLEAAPAVLTVADVVAMGQGMAKQVDAVNGGFGGAPKFPNPMNFALMLRAWRR--- 231

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G  +  +  V  TL+ MA+GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL +
Sbjct: 232 ----GGGAALKDAVFLTLERMARGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLLH 287

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  A  +     +  +  + ++Y+RR+M   GG  ++A+DADS   EG    +EG F+V
Sbjct: 288 LYAQAQQVEPRPLWRKVVEETVEYVRREMTDAGGGFYAAQDADS---EG----EEGKFFV 340

Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W  +EV   L E  A L   H+ +KP GN +            G  VL  +    A A +
Sbjct: 341 WKPEEVRAALPEAQAELVLRHFGIKPGGNFE-----------HGATVLEVVVPVDALAKE 389

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G   +   + L   R+ LF  R +R +P  DDK +  WNGL+I   A AS++       
Sbjct: 390 RGGAEDVVASELAAARKTLFAAREQRVKPGRDDKQLSGWNGLMIRGLALASRVF------ 443

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     DR E+   A  AA F+    +D    RL  S++ G ++  GFL+DY  L 
Sbjct: 444 ----------DRPEWARWAADAADFVLEKAWD--GTRLARSYQEGQARIDGFLEDYGNLA 491

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
           SGL  LY+     K+L  A  L     +LF D E   Y         +++      D A 
Sbjct: 492 SGLTALYQATFDVKYLEAADALVRRAVDLFWDAEKAAYLTAPRGQKDLVVATYGLFDNAF 551

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSG S      V LA++    +   + +  E  ++     L    M    +  AAD L +
Sbjct: 552 PSGASTLTEAQVELAALTGDKR---HLELPERYVSRMHDGLVRNPMGYGYLGLAADAL-L 607

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
                V L G +   D   + +A   ++    +V             W+       ++ +
Sbjct: 608 EGAAAVTLAGSRE--DVAPLRSALDHAFIPTVSV------------GWKAMGQPVPALLK 653

Query: 679 NNFSADKVV-----ALVCQNFSCSPPVTDPISLENLL 710
             F   + V     A +C+ F C  PVT+P  L   L
Sbjct: 654 ELFEGREPVKGKGAAYLCRGFVCELPVTEPDVLSQRL 690


>gi|116331824|ref|YP_801542.1| hypothetical protein LBJ_2312 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
 gi|116125513|gb|ABJ76784.1| Conserved hypothetical protein containing a thioredoxin domain
           [Leptospira borgpetersenii serovar Hardjo-bovis str.
           JB197]
          Length = 692

 Score =  369 bits (946), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 252/713 (35%), Positives = 361/713 (50%), Gaps = 71/713 (9%)

Query: 10  TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
           TK R    LI       TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D++Y
Sbjct: 38  TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 97

Query: 64  MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
           M  + A+   GGWPL++FL+PD +P+ GGTYFPPE  YGR  F  +L  ++  W +KR  
Sbjct: 98  MDALHAMDQQGGWPLNIFLTPDGRPIAGGTYFPPEPVYGRKSFLEVLNILRKVWSEKRQE 157

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGS--APK 180
           L  + +     L ++    A   +    LP          L +S YD+ FGGF +    K
Sbjct: 158 LIVASSELSRYLKDSGEGRAIEKQEEGSLPSKDCFNSGFSLYESYYDAEFGGFRTNHVNK 217

Query: 181 FPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           FP  + +  +L YH         S    +  +MV  TL  M +GGI+D VGGG  RYS D
Sbjct: 218 FPPSMGLSFLLRYH--------HSSGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTD 269

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
            RW VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG I SAE
Sbjct: 270 HRWMVPHFEKMLYDNSLFLETLVECSQVSKKISAESFALDVISYLHRDMRIVGGGICSAE 329

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
           DADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN            
Sbjct: 330 DADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTNKGN------------ 370

Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           F+GKN+L E       A+KL     K ++ +L   R KL + RSKR RP  DDK++ SWN
Sbjct: 371 FEGKNILHE--SYGGEATKLSEEEWKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWN 428

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL I + A+A                 +   R++++++AE   SFI R+L D    R+  
Sbjct: 429 GLYIKALAKAG----------------IAFQREDFLKLAEETYSFIERNLIDPDG-RILR 471

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
            FR+  S   G+ +DYA +IS  + L+E G G ++L  A+        LF  R   G F 
Sbjct: 472 RFRDSESGILGYSNDYAEMISSSIVLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 529

Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  YR+ AE   + F  
Sbjct: 530 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTK 587

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            L   +++ P +  A       S K +VL+  K +   +++LAA    +  +     ++ 
Sbjct: 588 ELSTHSLSYPHLLSAYWTYKYHS-KEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNE 645

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            + EE           + +  +  S    +  VC+NFSC  PV++   L+  +
Sbjct: 646 NELEEA-------RKLSVLFDSRDSGGNALVYVCENFSCKLPVSNLADLQKWI 691


>gi|418746293|ref|ZP_13302623.1| PF03190 family protein [Leptospira santarosai str. CBC379]
 gi|410792840|gb|EKR90765.1| PF03190 family protein [Leptospira santarosai str. CBC379]
          Length = 699

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 252/711 (35%), Positives = 360/711 (50%), Gaps = 73/711 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVME ESFE+  VA  LN  FVSIKVDREERPD
Sbjct: 41  GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 100

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM  + A+   GGWPL+VFL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W+
Sbjct: 101 IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWN 160

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
           +KR  L      A  +LS+ L  S     +  +   LP       A  L +S YDS FGG
Sbjct: 161 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 216

Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
           F +    KFP  + +  +L YH        +S    +  +M   TL  M +GGI+D VGG
Sbjct: 217 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 268

Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
           G  RYS D RW VPHFEKMLYD        ++  S++K +       D++ YL RDM   
Sbjct: 269 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLVECSSVSKKISAKSFALDVISYLHRDMRNE 328

Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
            G I SAEDADS   EG    +EG FYVW  +E  ++ GE + + ++ + +   GN    
Sbjct: 329 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 377

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                   F+GKN+L E +  S +A        +  ++L   R KL + RSKR RP  DD
Sbjct: 378 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 428

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL   +  +A                 V   +++++++AE   SFI R+L D 
Sbjct: 429 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 471

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
              R+   FR+G S   G+ +DYA +I+  + L+E G G ++L  A+        LF  R
Sbjct: 472 SNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 529

Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
              G F  TG D  VLLR   D +DG EPS NS  V +LV+L+  + G  S  YR+ AE 
Sbjct: 530 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGVDSARYRKFAES 587

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
             + F   L   ++  P +  A       S K +VL+  K +   +++LA     +  + 
Sbjct: 588 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 645

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
            +  ++  + EE           +++  +  S    +  VC+NFSC  P+ 
Sbjct: 646 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 689


>gi|448359615|ref|ZP_21548265.1| hypothetical protein C482_16798 [Natrialba chahannaoensis JCM
           10990]
 gi|445642250|gb|ELY95319.1| hypothetical protein C482_16798 [Natrialba chahannaoensis JCM
           10990]
          Length = 811

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 217/604 (35%), Positives = 317/604 (52%), Gaps = 43/604 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA+ LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 55  SACHWCHVMEDESFADEQVAEALNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLS 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLS 136
            +L+P+ KP   GTYFP   K G+PGF  IL  V ++W++ RD +   A+    A +   
Sbjct: 115 AWLTPEGKPFYVGTYFPKNAKRGQPGFLDILENVTNSWERDRDEVENRAEQWTNAAKDRL 174

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
           E    + S+++ P     + L   A    +S D +FGGFGS  PKFP+P  ++++   + 
Sbjct: 175 EETPDTVSASQPPS---SDVLDAAANASFRSADRQFGGFGSDGPKFPQPSRLRVLARAAD 231

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           +        E  + Q +++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  
Sbjct: 232 RT-------EREDFQDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNA 284

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            +   +L  +  T D  Y+ +  + L ++ R++    G  FS  DA S + +   R +EG
Sbjct: 285 AIPRAFLIGYQQTGDERYAEVVAETLAFVERELTHEEGGFFSTLDAQSEDPDTGER-EEG 343

Query: 316 AFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
            FYVWT  E+ D+L     A LF + Y +  +GN            F+G N    +   S
Sbjct: 344 TFYVWTPDEIHDVLENETTADLFCDRYDITESGN------------FEGSNQPNRVRSVS 391

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
             A++  +      + L   R +LF  R +RPRP+ D+KV+  WNGL+I++ A A+ +L 
Sbjct: 392 DLAAEYDLEAPDVQDRLESAREELFAAREQRPRPNRDEKVLAGWNGLMIATCAEAALVLG 451

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                        G D  EY  +A  A  F+R  L+DE   RL   +++G     G+L+D
Sbjct: 452 G------------GEDGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDGDVAIDGYLED 499

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAFL    L  YE       L +A++L    ++ F D + G  + T     S++ R +E 
Sbjct: 500 YAFLARAALGCYEATGEVDHLAFALDLARVIEDEFWDADRGTLYFTPESGESLVTRPQEL 559

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D + PS   V+V  L+ L       + D + + A   L     R++  ++    +C AA
Sbjct: 560 GDQSTPSAAGVAVETLLALEGFA--DQGDEFEEIATTVLETHANRIETNSLEHATLCLAA 617

Query: 614 DMLS 617
           D L+
Sbjct: 618 DRLA 621


>gi|448328363|ref|ZP_21517675.1| hypothetical protein C489_04491 [Natrinema versiforme JCM 10478]
 gi|445615887|gb|ELY69525.1| hypothetical protein C489_04491 [Natrinema versiforme JCM 10478]
          Length = 729

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 238/718 (33%), Positives = 362/718 (50%), Gaps = 68/718 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEDESFEDEAVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            +L+P+ KP   GTYFP E K G+PGF  +  ++ D+W+ + D          EQ ++A 
Sbjct: 113 AWLTPEGKPFFVGTYFPREGKQGQPGFLDLCERISDSWESEEDRAEMEN--RAEQWTDA- 169

Query: 140 SASASSNKLPDEL---------PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
            A     + PD             + L   A+ + +S D + GGFGS  KFP+P  ++++
Sbjct: 170 -AKDQLEETPDAAGAGTGAAPPSSDVLETAADMVLRSADRQHGGFGSGQKFPQPSRLRVL 228

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
              ++  + TG+     E  ++   TL  MA GG++DHVGGGFHRY VD  W VPHFEKM
Sbjct: 229 ---ARAYDRTGR----EEYLEVFEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKM 281

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD  ++   +L  + LT +  Y+ +  + L+++ R++    G  FS  DA S E+    
Sbjct: 282 LYDNAEIPRAFLSGYQLTGEDRYATVVSETLEFVDRELTHDEGGFFSTLDAQS-ESPETG 340

Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
             +EGAFYVWT ++V + L     A LF   + +  +GN            F+G+N    
Sbjct: 341 EHEEGAFYVWTPEDVHEALESETDAALFCARFDISESGN------------FEGRNQPNR 388

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
           +   S  A +  +   + L  L   R+ LF+ R +RPRP  D+KV+  WNGL+IS++A A
Sbjct: 389 VATVSELADQFDLEESEILKRLDSARQTLFEAREERPRPARDEKVLAGWNGLLISTYAEA 448

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
           + +L              G+D  +Y   A  A  F+R  L++E   RL   +++G  K  
Sbjct: 449 ALVL--------------GAD--DYAATAVDALEFVRDRLWNEADQRLSRRYKDGDVKVD 492

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           G+L+DYAFL  G LD Y+       L +A+EL    +  F D + G  + T     S++ 
Sbjct: 493 GYLEDYAFLARGALDCYQATGEVAHLAFALELARVIEAEFWDEDRGTLYFTPESGESLVT 552

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R +E  D + PS   V+V  L+ L         + +   A   L     +L+  A+    
Sbjct: 553 RPQELGDQSTPSATGVAVEVLLALDEFA----DEDFEDIAATVLETHANKLESSALEHAT 608

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +C AAD L+  + + V +   +   ++    A+ +    L   +    P     +D W E
Sbjct: 609 LCLAADRLAAGALE-VTVAADELPTEWREGFASRY----LPDRLFARRPPTEAGLDDWLE 663

Query: 669 H---NSNNASMARNNFSADKVVALVCQNFSCSPP---VTDPISL--ENLLLEKPSSTA 718
               +      A       +    VC++ +CSPP   VT+ +    EN  +E  S+++
Sbjct: 664 TLGLDDAPPIWAGREARDGEPTLYVCRDRTCSPPTHEVTEALEWLGENAAVEGSSASS 721


>gi|108757716|ref|YP_634091.1| hypothetical protein MXAN_5954 [Myxococcus xanthus DK 1622]
 gi|108461596|gb|ABF86781.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
          Length = 696

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 246/699 (35%), Positives = 347/699 (49%), Gaps = 73/699 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE    A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+
Sbjct: 57  SACHWCHVMAHESFESPETARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA-QSGAFAIEQLSEA 138
           VFL+PDLKP  GGTYFPP+D+YGRPGF  +L  ++DAW+ K+D +  QSG F  E L E 
Sbjct: 117 VFLTPDLKPFYGGTYFPPQDRYGRPGFPRLLMALRDAWENKQDEVQRQSGQFE-EGLGEL 175

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
             A+      P  L    +    ++++K  D+  GGFG APKFP P+   +ML   ++  
Sbjct: 176 --ATYGLEAAPAVLTAADVVGMGQRMAKQVDAVHGGFGGAPKFPNPMNFALMLRAWRR-- 231

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                G  +  +  V  TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL 
Sbjct: 232 -----GGGAPLKDAVFLTLERMALGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLL 286

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
           ++Y  A  +     +  +  + + Y+RR+M   GG  ++A+DADS   EG    +EG F+
Sbjct: 287 HLYAQAQQVEPRQLWRKVVEETVAYVRREMTDAGGGFYAAQDADS---EG----EEGKFF 339

Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VW  +EV   L E  A L   H+ +KP GN +            G  VL  +   S  A 
Sbjct: 340 VWRPEEVRAALPEAQAELVLRHFGIKPGGNFE-----------HGATVLEVVVPVSELAR 388

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           + G+  +     L   ++ LFD R +R +P  DDK++  WNGL+I   A AS++      
Sbjct: 389 ERGVSEDAMERELAAAKQTLFDARERRVKPGRDDKLLSGWNGLMIRGLALASRVF----- 443

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                       R E+ + A  AA F+    +D    RL  S++ G ++  GFL+DY  L
Sbjct: 444 -----------GRPEWAKWAADAADFVLEKAWD--GTRLARSYQEGQARIDGFLEDYGDL 490

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
            SGL  LY+     K+L  A  L     +LF D E   Y         +++      D A
Sbjct: 491 ASGLTALYQATFDVKYLEAADALVRRAVDLFWDAEKAAYLTAPRGQRDLVVATYGLFDNA 550

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSG S      V LA++    +   + +  E  +A     L    M    +  AAD L 
Sbjct: 551 FPSGASTLTEAQVELAALTGDKQ---HLELPERYVARMHDGLVRNTMGYGYLGLAADAL- 606

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF-WEEHNSNNASM 676
                   L G  S       +  A AS D+      +D A    +   W+       ++
Sbjct: 607 --------LEGAAS-------VTVAGASDDVAPLRAAMDRAFAPTVALAWKAPGQPVPAL 651

Query: 677 ARNNFSA-----DKVVALVCQNFSCSPPVTDPISLENLL 710
            +  F        +  A +C+ F C  PVT+P  L   L
Sbjct: 652 LQGTFEGREPVKGRAAAYLCRGFVCELPVTEPDVLTQRL 690


>gi|304314907|ref|YP_003850054.1| hypothetical protein MTBMA_c11480 [Methanothermobacter marburgensis
           str. Marburg]
 gi|302588366|gb|ADL58741.1| conserved hypothetical protein [Methanothermobacter marburgensis
           str. Marburg]
          Length = 677

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 209/558 (37%), Positives = 317/558 (56%), Gaps = 53/558 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED  +A +LN+ FV++KVDREERPD+D +YM   Q + G GGWPL+
Sbjct: 53  STCHWCHVMARESFEDPEIADILNENFVAVKVDREERPDIDAIYMKVCQMMTGTGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++P+ +P   GTYFPP+D+ G PG +TIL +V   W    D + ++    +  L +++
Sbjct: 113 IIMTPEGEPFFAGTYFPPDDRGGVPGLRTILERVVLLWKNDPDGIVKTARDVVSALKKSV 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLE 198
              A ++KL  E    A     E L +++D+R GGFGS  KFP P  I  +L YH ++ +
Sbjct: 173 ---AKASKLKPETVDAAY----EYLRRNFDTRNGGFGSYQKFPTPHNIYFLLRYHLRRGD 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D        E  +MV  TL+ M  GGI+D +G GFHRY+V+  W VPHFEKMLYDQ  + 
Sbjct: 226 D--------EALRMVNLTLRRMRYGGIYDQLGYGFHRYAVEPTWTVPHFEKMLYDQALIL 277

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL+AF +T D  Y     +I++Y+  ++  P G  +SAED   AE+EG     EG +Y
Sbjct: 278 KAYLEAFQVTCDDLYKKTALEIVEYVLGNLQSPEGAFYSAED---AESEGV----EGKYY 330

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           +W + E+ ++LG+ A +   ++ +   GN           + +G+N+L  +      A +
Sbjct: 331 LWRASEIREVLGDDANVVMRYFNVLEDGNF--------AGDVRGENIL-HIGSPWRVADE 381

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             + L++   I+   RR L + R +RP P LDDK++  WNGL++ + A   +IL SE   
Sbjct: 382 FNLTLDELNEIIENARRHLLERRMERPTPALDDKILTDWNGLMLGALAACGRILDSE--- 438

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                        E +  AE    FI  +L+ +    L H +R+  +   G LDDYAFLI
Sbjct: 439 -------------EALAAAERCLKFIMDNLHVDG--ELLHRYRDSEAGIDGKLDDYAFLI 483

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL+L++      ++  A+EL  + ++ F   +GG Y     +DP +++R  +  DGA 
Sbjct: 484 WGLLELHDATFREGYVEMALELSESLEDRFGAPDGGFYLT---DDPKLIVRPMDATDGAI 540

Query: 559 PSGNSVSVINLVRLASIV 576
           PSGNSV ++NL+RL  I+
Sbjct: 541 PSGNSVQMLNLLRLGGIL 558


>gi|448688002|ref|ZP_21693970.1| thioredoxin [Haloarcula japonica DSM 6131]
 gi|445779793|gb|EMA30709.1| thioredoxin [Haloarcula japonica DSM 6131]
          Length = 717

 Score =  368 bits (945), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 230/690 (33%), Positives = 354/690 (51%), Gaps = 60/690 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFEDE +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +
Sbjct: 58  CHWCHVMEEESFEDEAIAEQLNEDFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEAL 139
           L+P+ +P   GTYFPPE+K G+PGF  +L+++ D+W   ++R+ +        E +   L
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRARQWTEAIESDL 177

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
            A+ +    P++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +   
Sbjct: 178 EATPAD---PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAHA 231

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D G+     +   +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++ 
Sbjct: 232 DGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIP 287

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA---ETEGATRKKEG 315
             +L  +       Y+ + R+  ++++R++  P G  FS  DA+SA   E EG T  +EG
Sbjct: 288 RAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPIDEPEGET--EEG 345

Query: 316 AFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
            FYVWT ++V D + +   A +F +++ +   GN            F+G  VL      S
Sbjct: 346 LFYVWTPEQVRDAVDDETDAEIFCDYFGVTARGN------------FEGATVLAVRKPVS 393

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
             A +     +K    L     + F+ R++RPRP  D+KV+  WNGL+I + A  + +L 
Sbjct: 394 VLAEEYDQSEDKITASLQRALNQTFEARTERPRPARDEKVLAGWNGLMIRTLAEGAIVLD 453

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                             +Y +VA  A SF+R HL++E  +RL   +++G     G+L+D
Sbjct: 454 D-----------------QYADVAADALSFVREHLWNEDENRLNRRYKDGDVAIDGYLED 496

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAFL  G L L+E     + L +A++L     E F D E G  F T     S++ R +E 
Sbjct: 497 YAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQEL 556

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D + PS   V+V  L+ L+     S  D + + AE  +     R+    +    +  A 
Sbjct: 557 TDQSTPSSTGVAVDLLLSLSHF---SDDDRFEEVAERVIRTHADRVSSNPLQHASLTLAT 613

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW----EEH 669
           D     + + + LVG +S  D+ +      A   + + ++   PAD    + W    E  
Sbjct: 614 DTYEQGALE-LTLVGDRS--DYPSEWTETLAERYVPRRLLAHRPADEGRFEQWLDALELD 670

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPP 699
            S      R      K     C+NF+CSPP
Sbjct: 671 ESPPIWAGREQIDG-KPTVYACRNFACSPP 699


>gi|284164956|ref|YP_003403235.1| hypothetical protein Htur_1677 [Haloterrigena turkmenica DSM 5511]
 gi|284014611|gb|ADB60562.1| protein of unknown function DUF255 [Haloterrigena turkmenica DSM
           5511]
          Length = 733

 Score =  368 bits (945), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 230/695 (33%), Positives = 354/695 (50%), Gaps = 57/695 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFED+ VA +LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEDESFEDDEVAAVLNENFVPIKVDREERPDIDSIYMTVAQLVSGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
            +L+P+ KP   GTYFP E +  +PGF  + +++ D+W+   D         Q    A +
Sbjct: 113 AWLTPEGKPFFVGTYFPKESQRNQPGFLELCQRISDSWESGEDREEMEHRADQWTEAAKD 172

Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
           +L E    + ++    +      L   A+   +S D ++GGFGS  PKFP+P  + ++  
Sbjct: 173 RLEETPDDAGTAGGAAEPPSSEVLETAADAALRSADRQYGGFGSGGPKFPQPSRLHVL-- 230

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
            ++  + TG+     E  ++V  +L  MA GG++DHVGGGFHRY VD+ W VPHFEKMLY
Sbjct: 231 -ARAYDRTGR----EEYLEVVEESLDAMAAGGLYDHVGGGFHRYCVDKDWTVPHFEKMLY 285

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D  ++   +L  + LT +  Y+ +  + L +L R++    G  FS  DA S + E   R 
Sbjct: 286 DNAEIPRAFLAGYQLTGEERYAEVVDETLAFLERELTHDEGGFFSTLDAQSEDPETGER- 344

Query: 313 KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           +EG FYVWT  EV ++L +   A LF   Y +  +GN            F+G+N    + 
Sbjct: 345 EEGVFYVWTPDEVSEVLEDETTADLFCARYDITESGN------------FEGRNQPNRVR 392

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
              + A +  +   +  + L + R +LF+ R +RPRP+ D+KV+  WNGL+I++ A A+ 
Sbjct: 393 SLESLADEYDLAEAEIEDRLEDAREQLFEAREQRPRPNRDEKVLAGWNGLMINACAEAAL 452

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
                         VVG+D  EY + A  A  F+R  L+DE   RL   F++G  K  G+
Sbjct: 453 --------------VVGND--EYADQAVDALEFVRDRLWDEDEQRLSRRFKDGNVKVDGY 496

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYAFL  G L  Y+       L +A++L  T +  F D E G  + T     S++ R 
Sbjct: 497 LEDYAFLARGALGCYQATGDVDHLGFALDLARTIEAEFWDEEQGTIYFTPESGESLVTRP 556

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           +E  D + PS   V+V  L+ L         D + + A   L     +++  ++    +C
Sbjct: 557 QELTDQSTPSAAGVAVETLLALDEFA----EDDFGEIAATVLETHANKIEANSLEHASLC 612

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH- 669
            AAD L   + + V +   +   ++ +  A  +        +  + P   E ++ W +  
Sbjct: 613 LAADRLEAGALE-VTVAADELPAEWRDRFADEYHP----DRLFALRPPTAEGLEAWLDQL 667

Query: 670 --NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                 A  A       +    VC++ +CSPP  D
Sbjct: 668 GLEEPPAIWAGREARDGEPTLYVCRDRTCSPPTHD 702


>gi|359683227|ref|ZP_09253228.1| hypothetical protein Lsan2_00420 [Leptospira santarosai str.
           2000030832]
          Length = 691

 Score =  368 bits (944), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 246/707 (34%), Positives = 355/707 (50%), Gaps = 65/707 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVME ESFE+  VA  LN  FVSIKVDREERPD
Sbjct: 33  GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM  + A+   GGWPL+VFL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W 
Sbjct: 93  IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWS 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS- 177
           +KR  L  + +   + L ++    A   +  D   +N            YDS FGGF + 
Sbjct: 153 EKRQELVVASSELSQYLKDSGEGRAVEKQEGDLPSENCFDSAFSLYESYYDSEFGGFKTN 212

Query: 178 -APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
              KFP  + +  +L YH        +S    +  +M   TL  M +GGI+D VGGG  R
Sbjct: 213 HVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCR 264

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           YS D RW VPHFEKMLYD        ++  S++K +       D++ YL RDM    G I
Sbjct: 265 YSTDPRWTVPHFEKMLYDNSLFLETLVECSSVSKKISAKSFALDVISYLHRDMRNEDGGI 324

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSD 355
            SAEDADS   EG    +EG FYVW  +E  ++ GE + + ++ + +   GN        
Sbjct: 325 CSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN-------- 369

Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
               F+GKN+L E +  S +A        +  ++L   R KL + RSKR RP  DDK++ 
Sbjct: 370 ----FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILT 424

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
           SWNGL   +  +A                 V   +++++++AE   SFI R+L D    R
Sbjct: 425 SWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID-PNGR 467

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           +   FR+G S   G+ +DYA +I+  + L+E G G ++L  A+        LF  R   G
Sbjct: 468 ILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAG 525

Query: 536 YFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
            F  TG D  VLLR   D +DG EPS NS  V +LV+L+  + G  S  YR+ AE   + 
Sbjct: 526 VFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGVDSARYRKFAESIFSY 583

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           F   L   ++  P +  A       S K +VL+  K +   +++LA     +  +  +  
Sbjct: 584 FTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDSVLAV 641

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
           ++  + EE           +++  +  S    +  VC+NFSC  P+ 
Sbjct: 642 VNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 681


>gi|388254779|gb|AFK24895.1| protein of unknown function DUF255 [uncultured archaeon]
          Length = 691

 Score =  368 bits (944), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 220/557 (39%), Positives = 310/557 (55%), Gaps = 48/557 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED+ VAK++N+ F++IKVDREERPD+D +Y    Q   G GGWPLS
Sbjct: 55  SACHWCHVMAHESFEDDEVAKIMNEHFINIKVDREERPDLDDIYQRVCQLATGTGGWPLS 114

Query: 80  VFLSPDLKPLMGGTYFPPE-DKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSE 137
           VFL+ D KP   GTYFP E  +Y  PGFKTIL ++  A+  KK+++ A SG F +  L++
Sbjct: 115 VFLTSDQKPFYVGTYFPKEGGRYNMPGFKTILLQLATAYKSKKQEIEAASGEF-MGALAQ 173

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
                AS       L ++ +   A  L +  D  +GGFG APKFP P  +  +L +    
Sbjct: 174 TAKDIASGMAEKASLERSIIDEAAMGLLQMGDPIYGGFGQAPKFPNPTNLMFLLRYYN-- 231

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                SG  +  +  V FT   MA GGIHD +GGGF RY+ D++W +PHFEKMLYD   L
Sbjct: 232 ----LSG-LNRFKDFVAFTADKMAAGGIHDQLGGGFARYATDQKWLIPHFEKMLYDNALL 286

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A +Y + + +TK   Y  I R  LD++ R+M+ P G  +SA DADS   EG    +EG F
Sbjct: 287 AQLYSELYQITKADKYVQITRKTLDFVSREMMHPEGGFYSALDADS---EG----EEGKF 339

Query: 318 YVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           Y+W  KE+  ILG+     +F EHY +   GN            F+G+N+L      +  
Sbjct: 340 YIWQKKEIASILGDQVATDIFCEHYGVTEGGN------------FEGQNILNVRVPLANV 387

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
             + G   E+   I+ +   KLF  R KR RP  D+K++ SWNGL+IS FA+   I    
Sbjct: 388 GLRYGKTPEQAAQIIADASAKLFTAREKRVRPGRDEKILTSWNGLMISGFAKGYSI---- 443

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                       +   +Y++ A++A  FI   +      RL  +F++G SK   +LDDYA
Sbjct: 444 ------------TGDAKYLQAAKNAVDFIEAKI-AAGDGRLLRTFKDGHSKLNAYLDDYA 490

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F +SGLLDL+   S   +L  AI   +   + F D + G  F T+ +   +++R K  +D
Sbjct: 491 FYVSGLLDLFAVDSKQAYLDKAIMHTDFMLKHFWDEKEGNLFFTSDDHEKLIVRTKSFYD 550

Query: 556 GAEPSGNSVSVINLVRL 572
            A PSGNS++  +L+RL
Sbjct: 551 LAIPSGNSMAAADLLRL 567


>gi|16768044|gb|AAL28241.1| GH13403p [Drosophila melanogaster]
          Length = 629

 Score =  368 bits (944), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 231/669 (34%), Positives = 330/669 (49%), Gaps = 83/669 (12%)

Query: 78  LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           +SV+L+P L PL+ GTYFPP+ +YG P F T+L+ +   W+  ++ L  +G+  +  L +
Sbjct: 1   MSVWLTPTLAPLVAGTYFPPKSRYGMPSFNTVLKSIARKWETDKESLLATGSSLLSALQK 60

Query: 138 ALSASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQ 188
              ASA        +P+ A       E+LS++       +D   GGFGS PKFP    + 
Sbjct: 61  NQDASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGFGSEPKFPEVPRLN 112

Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
            + +     +D        +   MV+ TL  + KGGIHDH+ GGF RY+  + WH  HFE
Sbjct: 113 FLFHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFARYATTQDWHNVHFE 165

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYDQGQL   + +A+ +T+D  Y      I  YL +D+  P G  ++ EDADS  T  
Sbjct: 166 KMLYDQGQLMMAFANAYKVTRDEIYLRYADKIHKYLIKDLRHPLGGFYAGEDADSLPTHE 225

Query: 309 ATRKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDP 356
              K EGAFY WT  E++           DI  E A  ++  HY LKP GN  +   SDP
Sbjct: 226 DKVKVEGAFYAWTWDEIQAAFKDQAQRFDDITPERAFEIYAYHYGLKPPGN--VPAYSDP 283

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
           H    GKN+LI       + +   +  +++  +L      L  +R KRPRPHLD K+I +
Sbjct: 284 HGHLTGKNILIVRGSEEDTCANFKLEEDRFKKLLATTNDILHVIRDKRPRPHLDTKIICA 343

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGLV+S   +                    ++R++YM+ A+    F+R+ +YD +   L
Sbjct: 344 WNGLVLSGLCKLGN--------------CYSANREQYMQTAKELLDFLRKEMYDPEQKLL 389

Query: 477 QHS----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDE 526
             S               S+  GFLDDYAFLI GLLD Y+       L WA  LQ+TQD+
Sbjct: 390 IRSCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDK 449

Query: 527 LFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
           LF D   G YF +  + P+V++R+KEDHDGAEP GNSVS  NLV LA         YY +
Sbjct: 450 LFWDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSAHNLVLLAH--------YYDE 501

Query: 587 NA----EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 642
           NA       L  F   +     A+P M  A  +L   +   +V V    S D +  +   
Sbjct: 502 NAYLQKAGKLLNFFADVSPFGHALPEMLSA--LLMHENGLDLVAVVGPDSPDTQRFVEIC 559

Query: 643 HASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
              +  +  ++H+DP++ EE        SN     +      K    +C   +C  PVTD
Sbjct: 560 RKFFIPSMIIVHVDPSNPEEA-------SNQRLQTKFKMVGGKTTVYICHERACRMPVTD 612

Query: 703 PISLENLLL 711
           P  LE+ L+
Sbjct: 613 PQQLEDNLM 621


>gi|422002946|ref|ZP_16350180.1| hypothetical protein LSS_05548 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417258416|gb|EKT87804.1| hypothetical protein LSS_05548 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 691

 Score =  367 bits (943), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 252/711 (35%), Positives = 360/711 (50%), Gaps = 73/711 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVME ESFE+  VA  LN  FVSIKVDREERPD
Sbjct: 33  GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM  + A+   GGWPL+VFL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W+
Sbjct: 93  IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWN 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
           +KR  L      A  +LS+ L  S     +  +   LP       A  L +S YDS FGG
Sbjct: 153 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 208

Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
           F +    KFP  + +  +L YH        +S    +  +M   TL  M +GGI+D VGG
Sbjct: 209 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 260

Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
           G  RYS D RW VPHFEKMLYD        ++  S++K +       D++ YL RDM   
Sbjct: 261 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLVECSSVSKKISAKSFALDVISYLHRDMRNE 320

Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
            G I SAEDADS   EG    +EG FYVW  +E  ++ GE + + ++ + +   GN    
Sbjct: 321 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 369

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                   F+GKN+L E +  S +A        +  ++L   R KL + RSKR RP  DD
Sbjct: 370 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 420

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL   +  +A                 V   +++++++AE   SFI R+L D 
Sbjct: 421 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 463

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
              R+   FR+G S   G+ +DYA +I+  + L+E G G ++L  A+        LF  R
Sbjct: 464 SNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 521

Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
              G F  TG D  VLLR   D +DG EPS NS  V +LV+L+  + G  S  YR+ AE 
Sbjct: 522 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGIDSARYRKFAES 579

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
             + F   L   ++  P +  A       S K +VL+  K +   +++LA     +  + 
Sbjct: 580 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 637

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
            +  ++  + EE           +++  +  S    +  VC+NFSC  P+ 
Sbjct: 638 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 681


>gi|295667924|ref|XP_002794511.1| spermatogenesis-associated protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226285927|gb|EEH41493.1| spermatogenesis-associated protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 791

 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 219/536 (40%), Positives = 304/536 (56%), Gaps = 36/536 (6%)

Query: 11  KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
           K  R  FL    + CHWCHVME ESF    +A +LN  F+ IK+DREERPD+D+VYM YV
Sbjct: 58  KLNRLIFLSIGYSACHWCHVMEKESFMSPEIAAILNKSFIPIKLDREERPDIDEVYMNYV 117

Query: 68  QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDK 119
           QA  G GGWPL+VFL+PDL+P+ GG+Y+P P           G+  F  IL K++D W  
Sbjct: 118 QATTGSGGWPLNVFLTPDLEPVFGGSYWPGPHSNALPTLGGEGQITFVDILEKLRDVWHT 177

Query: 120 KRDMLAQSGAFAIEQLSEALSASASSNKLPD-----ELPQNALRLCAEQLSKSYDSRFGG 174
           ++    +S     +QL E  +   + +K  D     +L    L    +  +  YD+  GG
Sbjct: 178 QQLRCRESAKDITKQLRE-FAEEGTHSKQSDVETEEDLEIELLEEAYQHFASRYDAVNGG 236

Query: 175 FGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
           F  APKFP PV +  +++ S+    + D     E S   ++ + TL  M++GGIHD +G 
Sbjct: 237 FSEAPKFPTPVNLSFLVHLSRYPSAVADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGH 296

Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIG 290
           GF RYSV   W +PHFEKMLYDQ QL +VY+DAF    D        DI  Y+    M+ 
Sbjct: 297 GFARYSVTADWSLPHFEKMLYDQAQLLDVYVDAFDSAYDPELLGAMYDIATYITSPPMLS 356

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCD 349
           P G   S+EDADS  +   T K+EGAFYVWT KE++ ILG+  A +   H+ +   GN  
Sbjct: 357 PTGGFHSSEDADSRPSPNDTEKREGAFYVWTLKELKQILGQRDADVCARHWGVLADGN-- 414

Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPH 408
           ++R++DPH+EF  +NVL      S  A + G+  ++ + I+   R KL + R SKR RP 
Sbjct: 415 VARINDPHDEFINQNVLSIQVTPSKLAKEFGLGEDEVVRIIKRSREKLREYRESKRVRPD 474

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
           LDDK+IV+WNGL I + A+ S +L++      + F             AE A  FI+ +L
Sbjct: 475 LDDKIIVAWNGLAIGALAKCSVVLENLDRDKAYQF----------RRAAEEAVRFIKHNL 524

Query: 469 YDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
           +DEQT +L   +R G     PGF DDYA+LISGL++LYE       L +A +LQ+ 
Sbjct: 525 FDEQTGQLWRIYRGGVRGDTPGFADDYAYLISGLINLYEATFDDSHLQFAEQLQHA 580


>gi|239906990|ref|YP_002953731.1| hypothetical protein DMR_23540 [Desulfovibrio magneticus RS-1]
 gi|239796856|dbj|BAH75845.1| hypothetical protein [Desulfovibrio magneticus RS-1]
          Length = 697

 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 243/692 (35%), Positives = 342/692 (49%), Gaps = 49/692 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE +A L+N   VS+KVDREERPD+D +YM+   AL G GGWPL+
Sbjct: 52  STCHWCHVMERESFEDEDIAALMNAVVVSVKVDREERPDLDALYMSVCHALTGRGGWPLT 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFP E  YGR G + +L++V   W   R  +  +    ++ + E L
Sbjct: 112 VFLTPDKEPFFAGTYFPKESAYGRTGLRELLQRVHMFWKGNRQAVVNNAGQIMDAVREQL 171

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A+A +     E  Q AL     QL+  +D+R GGFG APKFP P  +  +L   ++  D
Sbjct: 172 AAAAGTASA--EPGQAALDAARTQLAGIFDARNGGFGGAPKFPSPHNLLFLLREYRRTGD 229

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                     + M   TL  M +GG++D VG G HRY+ D  W +PHFEKMLYDQ     
Sbjct: 230 V-------SCRDMACRTLVAMRRGGVYDQVGFGLHRYATDAHWFLPHFEKMLYDQALTVM 282

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
             ++A+  + DV +  +  +IL+Y+RRD+  P G  +SAEDADS   EG     EG FYV
Sbjct: 283 ACVEAYQASGDVAHKTMALEILEYVRRDLTSPEGLFYSAEDADS---EGV----EGKFYV 335

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W++ E+  +LG+ A L          GN       +   E  G N+L        +A++L
Sbjct: 336 WSAAELRRLLGDEAALIMAAMGATEEGNAH----DEATGETTGANILHLPRPLDETAARL 391

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+  E     L  CR  L   R KR RP  DDKV+   NGL++++ A+A++    E  + 
Sbjct: 392 GLTAEILAERLEACRHVLLAEREKRVRPLCDDKVLTDNNGLMLAALAKAARAFDDEDLAG 451

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                         +  AE+  S + R     Q  RL H  R+  +   G LDDY FL  
Sbjct: 452 ------------RAVTAAEALLSRLAR-----QNGRLLHRLRDDEAAIDGLLDDYVFLAW 494

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GL++LY+    T +L  A+EL     E F D   GGYF    +   +L+R K   D A P
Sbjct: 495 GLVELYQTVFDTAYLRRAVELMKAVAEHFADPNEGGYFLAPDDGEQLLVRQKIFFDAAVP 554

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGNSV+   L  L  +        +++ A         RL D A       C    + + 
Sbjct: 555 SGNSVAYFVLTTLFRLTGDPA---FKEQATALARAMAPRLADHAAGYAFFLCGLSQV-LG 610

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
               V L G  +  D + +  A    Y L +  + + P D +E D      +  A   R 
Sbjct: 611 QASEVTLAGDPAGPDTQTLARAIFERY-LPEVAVVLRP-DEDEPDI-----AALAPFTRY 663

Query: 680 NFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
               D +  A VC+  SC PP  +  ++  LL
Sbjct: 664 QLPLDGRAAAHVCRAGSCQPPTAEVETMLKLL 695


>gi|432330863|ref|YP_007249006.1| thioredoxin domain protein [Methanoregula formicicum SMSP]
 gi|432137572|gb|AGB02499.1| thioredoxin domain protein [Methanoregula formicicum SMSP]
          Length = 708

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 255/709 (35%), Positives = 360/709 (50%), Gaps = 59/709 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVM  ESFED  VA+LLN  F+++KVDREERPD
Sbjct: 38  GEEAFLRAAREDKPVFLSIGYATCHWCHVMAHESFEDLEVAELLNRDFIAVKVDREERPD 97

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D  YM   Q L G GGWPL++ ++P+ KP    TY P E ++  PG   +L ++  AW 
Sbjct: 98  IDSTYMQVCQMLSGQGGWPLTIVMTPEKKPFFAATYLPKERRFAVPGLLDLLPRIAKAWR 157

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGS 177
           ++R  L QS     E +++AL    ++   P+  P  A L    E L   +D  +GGF  
Sbjct: 158 EQRGELLQSA----ESITQALETRDAAPAGPE--PDAALLDEGYEDLLLRFDPGYGGFSG 211

Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
           APKFP P  +  +L + K+   TGK         MV+ TL     GGIHDH+GGGFHRYS
Sbjct: 212 APKFPTPHTLLFLLRYWKR---TGK----KRALDMVVKTLDAFRDGGIHDHIGGGFHRYS 264

Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
            D +W VPHFEKMLYDQ  L   Y +AF  T++  Y       + Y+ RD+  P G  FS
Sbjct: 265 TDAQWRVPHFEKMLYDQALLVIAYTEAFQATRNYRYRETAMSTVRYVLRDLTDPEGAFFS 324

Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDP 356
           AEDADS       R  EGAFY+WT  E+E +L  + A +    + ++  GN        P
Sbjct: 325 AEDADS-------RGGEGAFYLWTMGELEAVLEKDDAAIAGRVFNVRDEGN-----FLSP 372

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
            +    +N+L       A  S  G+  E+    +   R +LF  R KR RP  DDKV++ 
Sbjct: 373 EST-GAENILFRTRTDEALVSVTGIHQEELDERIASIRERLFAAREKRERPRRDDKVLLD 431

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+I++ A+A++   +            G  R       E   S +R         RL
Sbjct: 432 WNGLMIAALAKAARAFGN------------GECRTAAERAMECILSRMR-----TGDGRL 474

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
            H +R+G    PGF DDYAFL   L++LYE     ++L  A+ +  T  + FLDRE GG+
Sbjct: 475 YHRYRDGERAIPGFADDYAFLGLALIELYECTFDPRYLAEALAIMKTFRDHFLDRENGGF 534

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F T G+  ++L+R K  +DGA PS NSV+   L+RL+ +   ++ +        S   F 
Sbjct: 535 FFTAGDAEALLVRDKVIYDGAVPSANSVACEVLLRLSRLTGTTEHEDLAAALARS---FA 591

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
            R+++   A     CA +    PS + +V+ G   S   +  LAA  + Y  + TVIH  
Sbjct: 592 GRVRESPSAFCWFLCAIERAVGPS-QDIVIAGDSGSPAVQEFLAAVRSRYLPHCTVIHKP 650

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADK--VVALVCQNFSCSPPVTDP 703
            +D + +   E            N  AD+    A +C   +CS P+TDP
Sbjct: 651 ASDPDTIAALEALTPFT-----RNILADRNTPAAYLCSGSTCSLPITDP 694


>gi|398331059|ref|ZP_10515764.1| hypothetical protein LalesM3_03040 [Leptospira alexanderi serovar
           Manhao 3 str. L 60]
          Length = 699

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 256/717 (35%), Positives = 363/717 (50%), Gaps = 80/717 (11%)

Query: 10  TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
           TK R    LI       TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D++Y
Sbjct: 46  TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 105

Query: 64  MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
           M  + A+   GGWPL++FL+PD KP+ GGTYFPPE +YGR  F  IL  ++  W +KR  
Sbjct: 106 MDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPRYGRKSFLEILNILRKVWKEKRQE 165

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDE---LP-QNALRLCAEQLSKSYDSRFGGFGS-- 177
           L      A  +LS  L  S     +  +   LP +N            YD+ FGGF +  
Sbjct: 166 L----IVASSELSRYLKDSGEGRAIEKQEGSLPSENCFDSGFSLYESYYDAEFGGFKTNH 221

Query: 178 APKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
             KFP  + +  +L  YHS        SG  S   +MV  TL  M +GGI+D +GGG  R
Sbjct: 222 VNKFPPSMGLSFLLRYYHS--------SGNPS-ALEMVENTLLAMKQGGIYDQIGGGLCR 272

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           YS D  W VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG I
Sbjct: 273 YSTDHHWMVPHFEKMLYDNSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGI 332

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSD 355
            SAEDADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN        
Sbjct: 333 CSAEDADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTKKGN-------- 377

Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVI 414
               F+GKN+L E     + A+K      K ++ +L   R KL + R+KR RP  DDK++
Sbjct: 378 ----FEGKNILHE--SYRSEATKFSEEEWKRIDSVLERGRAKLLERRNKRVRPLRDDKIL 431

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
            SWNGL I + A+A                 V   R++++++AE   SFI R+L D  + 
Sbjct: 432 TSWNGLYIKALAKAG----------------VAFQREDFLKLAEETYSFIERNLID-PSG 474

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
           R+   FR+  S   G+ +DYA +IS  + L+E G G ++L  A+        LF  R   
Sbjct: 475 RILRRFRDKESGILGYSNDYAEMISSSIALFEAGCGIRYLKNAVLWMEEAIRLF--RSPA 532

Query: 535 GYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           G F  TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  YR+ AE    
Sbjct: 533 GVFFDTGNDGEVLLRRSVDSYDGVEPSANSSLAYSLVKLS--LFGIDSVRYREFAESIFL 590

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            F   L   +++ P +  A       S K +VL+  K +   + +LAA    +  +    
Sbjct: 591 YFTKELSTYSLSYPHLLSAYWTYRHHS-KEIVLI-RKDTDSGKELLAAIQTRFLPDSVFA 648

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            ++  + EE           +++  +  S    +  VC+NFSC  PV++   L+  +
Sbjct: 649 VVNENELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPVSNLADLKKWI 698


>gi|379010883|ref|YP_005268695.1| thymidylate kinase YyaL [Acetobacterium woodii DSM 1030]
 gi|375301672|gb|AFA47806.1| thymidylate kinase YyaL [Acetobacterium woodii DSM 1030]
          Length = 686

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 245/699 (35%), Positives = 348/699 (49%), Gaps = 74/699 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  VA+ LN +F+SIKVDREERPD+D++YMT+ Q   G GGWPL+
Sbjct: 56  STCHWCHVMEKESFEDAEVAEYLNKYFISIKVDREERPDIDQIYMTFSQVSTGQGGWPLN 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+ + KP    TY P   +YG PG   +L  ++  W +  + +  S A  +  L   L
Sbjct: 116 VFLTAERKPFYVTTYLPKRSRYGHPGLMDVLVGIEGQWRQNNEEIIYS-ADKMTSLLNDL 174

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 NKL   +  +A     E    S+D R+GGFG APKFP P       +H   L  
Sbjct: 175 EIRKDENKLKRTIFFDAYDFFDE----SFDDRYGGFGKAPKFPTP-------HHLFYLLR 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             ++    +   MV  TL+ M +GG+ DH+G GF RYS DE+W VPHFEKMLYD   L  
Sbjct: 224 CYQAFNQPDALVMVEKTLKQMYQGGLFDHIGFGFSRYSTDEQWLVPHFEKMLYDNALLVM 283

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y + + +T +  Y  I +  + Y+ RD+    G  F AEDADS   EG    +EG FYV
Sbjct: 284 IYAETYQVTGNPLYKKIAQKTITYVNRDLRSEEGGFFCAEDADS---EG----EEGRFYV 336

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
           W+ ++VE ILG + A +F + Y +   GN            F GKN+  +I ++     A
Sbjct: 337 WSMEKVEKILGKKRAAVFFKFYPMTAKGN------------FDGKNIPNMIPVDLDLIEA 384

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           +     LEK   +L E +  LF+ R KR  PH DDK++ +WNGL+I++ A A +I     
Sbjct: 385 NP---ELEK---VLDEMKADLFNQREKRIHPHKDDKILTAWNGLMITALAMAGRIF---- 434

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       D+ EY+  AE   +FI   +   +  RL   +R G +K   +LDDYA 
Sbjct: 435 ------------DQPEYLIQAEETMAFIENKM-TRRNGRLYARYRLGEAKILAYLDDYAS 481

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHD 555
           +I G L+LY+    T++L  AI        +F D  G  G+F    +   ++ R KE +D
Sbjct: 482 VIWGYLELYQATFKTEYLEKAILRAVDMINIFGDDFGMSGFFQYGNDAEKLIARPKEIYD 541

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A+PSGN+++   L++L  I    K   Y        A F   L    MA  +M CA   
Sbjct: 542 NAQPSGNALAACCLLKLGKITGEQK---YIDIVNGMFAYFAGNLNQAPMASTMMLCAKLF 598

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA--DTEEMDFWEEHNSNN 673
              P+ + VV  G++       M      +  LNK  +       +  E D      + N
Sbjct: 599 HEQPTTE-VVFAGYEKDPTIRAM------NQRLNKLFLPFSVVLFNKSEKDL----KTIN 647

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
           A          +  A VC+N+ C  PV D  S   ++ E
Sbjct: 648 AFAVNQQMIHGQPTAYVCKNYRCEEPVNDLESFLKIIEE 686


>gi|397780504|ref|YP_006544977.1| hypothetical protein BN140_1338 [Methanoculleus bourgensis MS2]
 gi|396939006|emb|CCJ36261.1| putative protein yyaL [Methanoculleus bourgensis MS2]
          Length = 719

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 242/707 (34%), Positives = 360/707 (50%), Gaps = 56/707 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    + CHWCHVME ESF D  VAKLLND FV IKVDREERPD
Sbjct: 44  GEEAFLRAKEEAKPIFLSIGYSACHWCHVMEEESFADPMVAKLLNDVFVCIKVDREERPD 103

Query: 59  VDKVYMTYVQALYGGG-GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           +D++Y+     L G   GWPL++F++ D +P    +Y P E +YG  G   ++ ++   W
Sbjct: 104 IDQIYIDAAHVLSGVAVGWPLTIFMTHDGRPFFAASYIPKESRYGMTGLVDLIPRISRIW 163

Query: 118 DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
             +R  L Q+G+    ++ EAL ++A +     EL +  L    + L + +D   GGFG 
Sbjct: 164 QTRRQELEQTGS----RVLEALQSAARTPPGESELSEATLDDAYDTLFRLFDGENGGFGD 219

Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
           APKFP P  +  +L +  +   TGK+        MV  TL  M +GGI DH+G GFHRY+
Sbjct: 220 APKFPAPHNLIFLLRYGHR---TGKT----PAYTMVEKTLHAMRRGGIFDHIGWGFHRYT 272

Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
            D  W VPHFEKMLYDQ  L   Y +A+  T    ++   R+ + Y+ R+M  P G  +S
Sbjct: 273 TDAEWLVPHFEKMLYDQALLIMAYTEAYLATGREEFARTARETIAYVLREMTDPDGGFYS 332

Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDP 356
           AEDADS   EG     EG FY+WT   +  +LGE     F   + +   GN     +  P
Sbjct: 333 AEDADS---EGV----EGKFYIWTKAGILQVLGEEDGERFSRIFGVTEPGNY----LEQP 381

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
                G+NVL      ++ A +  MP E     + + R++LF  R +R RP  DDK++  
Sbjct: 382 GARRTGQNVLRLRRPLASWAHEFSMPEEDLAWFVEDARQRLFAAREERARPAKDDKILTD 441

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+I++ A A++                  D  EY+  AE AA+F+   L      RL
Sbjct: 442 WNGLMIAALATAARAF----------------DDPEYLAAAEKAAAFVLTRLRGPDG-RL 484

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
            H +RNG +     LDDYAF++  L+++YE      +L  A++L       + D + GG+
Sbjct: 485 LHRYRNGEAGITATLDDYAFMLWALIEVYEASFAPGYLRTAVKLARDLSARYWDCDHGGF 544

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F T  +D  + +R K   DGA PSGNSV++  L  L  + A  +   + + A     VF 
Sbjct: 545 FFTP-DDVEIAVRQKPVFDGATPSGNSVAMYALFLLGRMTANLE---FEEMANRIRRVFA 600

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
             +++  +A        + +  P+ + V++ G + + D   M+ A  + Y  +  VI   
Sbjct: 601 DTVRESPIAYSYFLTGLEFMLGPNVE-VIISGVRDAEDTRAMIQAIRSRYTPDAVVI-FR 658

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
           P+D EE +      +  A   R+  + + K  A VC N++C  PVTD
Sbjct: 659 PSDEEEPEI-----TKVAGFTRDIVTIEGKATAYVCTNYACDIPVTD 700


>gi|410450937|ref|ZP_11304964.1| PF03190 family protein [Leptospira sp. Fiocruz LV3954]
 gi|410015249|gb|EKO77354.1| PF03190 family protein [Leptospira sp. Fiocruz LV3954]
          Length = 691

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 252/711 (35%), Positives = 359/711 (50%), Gaps = 73/711 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVME ESFE+  VA  LN  FVSIKVDREERPD
Sbjct: 33  GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM  + A+   GGWPL+VFL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W+
Sbjct: 93  IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWN 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
           +KR  L      A  +LS+ L  S     +  +   LP       A  L +S YDS FGG
Sbjct: 153 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 208

Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
           F +    KFP  + +  +L YH        +S    +  +M   TL  M +GGI+D VGG
Sbjct: 209 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 260

Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
           G  RYS D RW VPHFEKMLYD         +  S++K +       D++ YL RDM   
Sbjct: 261 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLAECSSVSKKISAKSFALDVISYLHRDMRNE 320

Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
            G I SAEDADS   EG    +EG FYVW  +E  ++ GE + + ++ + +   GN    
Sbjct: 321 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 369

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                   F+GKN+L E +  S +A        +  ++L   R KL + RSKR RP  DD
Sbjct: 370 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 420

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL   +  +A                 V   +++++++AE   SFI R+L D 
Sbjct: 421 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 463

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
              R+   FR+G S   G+ +DYA +I+  + L+E G G ++L  A+        LF  R
Sbjct: 464 SNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 521

Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
              G F  TG D  VLLR   D +DG EPS NS  V +LV+L+  + G  S  YR+ AE 
Sbjct: 522 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGVDSARYRKFAES 579

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
             + F   L   ++  P +  A       S K +VL+  K +   +++LA     +  + 
Sbjct: 580 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 637

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
            +  ++  + EE           +++  +  S    +  VC+NFSC  P+ 
Sbjct: 638 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 681


>gi|433638443|ref|YP_007284203.1| thioredoxin domain protein [Halovivax ruber XH-70]
 gi|433290247|gb|AGB16070.1| thioredoxin domain protein [Halovivax ruber XH-70]
          Length = 759

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 244/713 (34%), Positives = 350/713 (49%), Gaps = 56/713 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA +LN+ FV IKVDREERPDVD +YMT  QA+ G GGWPLS
Sbjct: 53  SACHWCHVMEAESFADETVAAVLNEGFVPIKVDREERPDVDSIYMTVCQAVTGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
            +L+PD +P   GTYFP E + G PGF  + R+++ +W + RD +     +  A A ++L
Sbjct: 113 AWLTPDGRPFYVGTYFPREAQRGTPGFVELCRQIRVSWSENRDEIEARANEWAAMATDRL 172

Query: 136 SEALSASASSNKLPDELPQ---------------NALRLCAEQLSKSYDSRFGGFG-SAP 179
             A      S   P+ +                 + L    E   ++ D   GGFG   P
Sbjct: 173 DSA-DGGGESASTPEPISADTDSPIDVGLDADGPDGLERVGEAALRASDDEHGGFGRGGP 231

Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           KFP+P  ++ +     +L+ T     A E        L  M  GG++DHVGGGFHRY VD
Sbjct: 232 KFPQPRRVEALF----RLDATHDRPTAHE---TATRALDAMCTGGLYDHVGGGFHRYCVD 284

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
           E W VPHFEKMLYD   +  V L  + +T D  Y+   R+ +D+L R++  P G  +S  
Sbjct: 285 EDWTVPHFEKMLYDNAAIPRVLLAGYQVTGDDRYARTVRETVDFLERELRHPEGGFYSTL 344

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
           DA S ETE   R +EGAFYVWT  E+E  + E A L  E   L     CD   ++D  N 
Sbjct: 345 DAQS-ETESGER-EEGAFYVWTPAEIESAVAE-AGLSDESGAL----FCDRFGVTDSGN- 396

Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 419
           F+G  VL         A+  G+      + L   R  +F+ R+ RPRP  D+K++  WNG
Sbjct: 397 FEGSTVLTVEASIEDLATDYGLAPSTVEDRLDAARTAVFEARATRPRPPRDEKILAGWNG 456

Query: 420 LVISSFARASKILKSEAESAMFNFP--VVGSDR----KEYMEVAESAASFIRRHLYDEQT 473
           L I   A AS +L +    A  +    V  SD       Y ++A  A +F+R HL+D+ T
Sbjct: 457 LAIDMLAEASIVLGTSGREAAIDAASDVASSDEPSGDDRYAQLATDALAFVRTHLWDDDT 516

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
            RL    R+G     G+L+DYAFL  G L  YE     ++L +A++L       F D   
Sbjct: 517 GRLARRVRDGDVGIDGYLEDYAFLARGALTCYEATGEVEFLAFALDLARAIRRDFWDESA 576

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YRQNAEHSL 592
              + T     S+L+R +E  D + PS   V+V  L  L    A    +  +R  + H+ 
Sbjct: 577 ETLYFTPERGESLLVRPQELGDQSTPSPTGVAVEILALLDPFTAEPFGEMAHRVVSTHAT 636

Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 652
            + E+  + +++++      A  L       V  V     +++E  L   +    L + +
Sbjct: 637 EIEESPFEYVSLSL------AQSLVTHGPLEVTTVADGRPMEWERTLGRTY----LPRRL 686

Query: 653 IHIDPADTEEMDFWEE---HNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           +   PA +  +D W +    ++     A     AD+    VC +  CSPP  D
Sbjct: 687 LAHRPASSAMLDDWLDVIGVDTVPPIWADREQRADEPTVYVCADRVCSPPEHD 739


>gi|421111206|ref|ZP_15571685.1| PF03190 family protein [Leptospira santarosai str. JET]
 gi|410803388|gb|EKS09527.1| PF03190 family protein [Leptospira santarosai str. JET]
          Length = 699

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 252/711 (35%), Positives = 360/711 (50%), Gaps = 73/711 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVME ESFE+  VA  LN  FVSIKVDREERPD
Sbjct: 41  GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 100

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM  + A+   GGWPL+VFL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W+
Sbjct: 101 IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWN 160

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
           +KR  L      A  +LS+ L  S     +  +   LP       A  L +S YDS FGG
Sbjct: 161 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 216

Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
           F +    KFP  + +  +L YH        +S    +  +M   TL  M +GGI+D VGG
Sbjct: 217 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 268

Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
           G  RYS D RW VPHFEKMLYD        ++  S++K +       D++ YL RDM   
Sbjct: 269 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLVECSSVSKKISAKSFALDVISYLHRDMRNE 328

Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
            G I SAEDADS   EG    +EG FYVW  +E  ++ GE + + ++ + +   GN    
Sbjct: 329 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 377

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                   F+GKN+L E +  S +A        +  ++L   R KL + RSKR RP  DD
Sbjct: 378 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 428

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL   +  +A                 V   +++++++AE   SFI R+L D 
Sbjct: 429 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 471

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
              R+   FR+G S   G+ +DYA +I+  + L+E G G ++L  A+        LF  R
Sbjct: 472 PNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 529

Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
              G F  TG D  VLLR   D +DG EPS NS  V +LV+L+  + G  S  YR+ AE 
Sbjct: 530 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGIDSARYRKFAES 587

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
             + F   L   ++  P +  A       S K +VL+  K +   +++LA     +  + 
Sbjct: 588 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 645

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
            +  ++  + EE           +++  +  S    +  VC+NFSC  P+ 
Sbjct: 646 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 689


>gi|296121436|ref|YP_003629214.1| hypothetical protein Plim_1180 [Planctomyces limnophilus DSM 3776]
 gi|296013776|gb|ADG67015.1| protein of unknown function DUF255 [Planctomyces limnophilus DSM
           3776]
          Length = 707

 Score =  367 bits (941), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 237/699 (33%), Positives = 350/699 (50%), Gaps = 76/699 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE+  +A+LLN WFVSIKVDREERPD+D++YM  V A+   GGWP+S
Sbjct: 50  SACHWCHVMEHESFENPRIAELLNQWFVSIKVDREERPDLDQIYMAAVIAMTQQGGWPMS 109

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P   P  GGTYFPP  +YGRPGF  +L  + DAW+ +R+++ +  +    QL+  +
Sbjct: 110 VFLTPQGHPFYGGTYFPPTSRYGRPGFAEVLAAIHDAWENRREVVTEQAS----QLTMTV 165

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
               S  + P  L +N L      L +  D   GGFG APKFP  +++++ +  + +  D
Sbjct: 166 HDQLSERQEPTTLHENLLEKAGRTLVRVCDRVNGGFGHAPKFPHAMDLRLAMRLAHRF-D 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T ++ E +E        L  MAKGGIHDH+GGGF RYS DE W VPHFEKMLYD   L  
Sbjct: 225 TTETAEVAE------LGLTAMAKGGIHDHLGGGFARYSTDEIWLVPHFEKMLYDNALLLQ 278

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI----FSAEDADSAETEGATRKKEG 315
            YLD +   K  FY    + I+ Y+ R+M  P  E+     +A+DADS   EG    +EG
Sbjct: 279 AYLDGWQFNKTDFYRRTAQSIVHYVLREMQVPRAELPGGFCAAQDADS---EG----EEG 331

Query: 316 AFYVWTSKEVEDIL------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            F+VW+  E+ D+L       + + LF+  Y +   GN            ++G N+L   
Sbjct: 332 RFFVWSQSEIRDVLSGSELGNDDSRLFERAYGVTSGGN------------WEGHNILNLP 379

Query: 370 NDSSASASKLGM---PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
              +A   +LGM    LE+ L++L   R KLF+ R  R  P  D+K+IV+WNGL+IS+ A
Sbjct: 380 KTIAALGRELGMAETALEQKLSLL---RTKLFEHRKNRIAPGRDEKLIVAWNGLMISALA 436

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           RA  +L  +               +  +++AES              + L HS + G  K
Sbjct: 437 RAGLVLDDQEALQAAQ-----RAARVILDMAESL------------PYGLPHSIQKGQPK 479

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
              +LDDY   +  L++L+       WL  A+ L +     F D E GG++ T+ +   +
Sbjct: 480 HGAYLDDYGCFLEALIELFLADGDPSWLSRAVPLIDRLVNEFHDDEQGGFYFTSSQAEKL 539

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           + R ++  D   PSGN+     L++   I   ++S+   + A   L      ++   MA 
Sbjct: 540 ISRSRDFQDNVTPSGNAAVANALLKFGRITGDARSE---ELAHEVLQAASGLMQQSTMAT 596

Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENML---AAAHASYDLNKTVIHIDPADTEEM 663
                A D    PS + V +    +S      L   A    +++L    +       +  
Sbjct: 597 AHSLAALDWWLGPSYECVYVPAETTSTTDSEPLKQDAVQRVAHELYLPNVLFLTGRAQ-- 654

Query: 664 DFWEEHNSNNASMARNNFS-ADKVVALVCQNFSCSPPVT 701
             WE   +  A + +   + A + V  VCQ   C  PV 
Sbjct: 655 --WE--GTLAAGLVQGRLAPASEPVLYVCQKGVCQLPVV 689


>gi|456873671|gb|EMF89033.1| PF03190 family protein [Leptospira santarosai str. ST188]
          Length = 691

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 252/711 (35%), Positives = 358/711 (50%), Gaps = 73/711 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVME ESFE+  VA  LN  FVSIKVDREERPD
Sbjct: 33  GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 92

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM  + A+   GGWPL+VFL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W 
Sbjct: 93  IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWS 152

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
           +KR  L      A  +LS+ L  S     +  +   LP       A  L +S YDS FGG
Sbjct: 153 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 208

Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
           F +    KFP  + +  +L YH        +S    +  +M   TL  M +GGI+D VGG
Sbjct: 209 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 260

Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
           G  RYS D RW VPHFEKMLYD         +  S++K +       D++ YL RDM   
Sbjct: 261 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLAECSSVSKKISAKSFALDVISYLHRDMRNE 320

Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
            G I SAEDADS   EG    +EG FYVW  +E  ++ GE + + ++ + +   GN    
Sbjct: 321 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 369

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                   F+GKN+L E +  S +A        +  ++L   R KL + RSKR RP  DD
Sbjct: 370 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 420

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ SWNGL   +  +A                 V   +++++++AE   SFI R+L D 
Sbjct: 421 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 463

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
              R+   FR+G S   G+ +DYA +I+  + L+E G G ++L  A+        LF  R
Sbjct: 464 SNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 521

Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
              G F  TG D  VLLR   D +DG EPS NS  V +LV+L+  + G  S  YR+ AE 
Sbjct: 522 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGVDSARYRKFAES 579

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
             + F   L   ++  P +  A       S K +VL+  K +   +++LA     +  + 
Sbjct: 580 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 637

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
            +  ++  + EE           +++  +  S    +  VC+NFSC  P+ 
Sbjct: 638 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 681


>gi|87310211|ref|ZP_01092343.1| hypothetical protein DSM3645_14105 [Blastopirellula marina DSM
           3645]
 gi|87287201|gb|EAQ79103.1| hypothetical protein DSM3645_14105 [Blastopirellula marina DSM
           3645]
          Length = 637

 Score =  366 bits (939), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 219/564 (38%), Positives = 311/564 (55%), Gaps = 56/564 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME ESF DE +AK LN+ F+ IKVDREERPD+D VYMT VQ +  GGGWPLS
Sbjct: 71  SSCHWCHVMEHESFTDEEIAKFLNEHFICIKVDREERPDIDHVYMTAVQIMTRGGGWPLS 130

Query: 80  VFLSPDLKPLMGGTYFPPE--DKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           VFL+P+ KP  GGTY+P    D+  + GF T++ +V   W++K   L +SG    + + E
Sbjct: 131 VFLTPEGKPFYGGTYWPARDGDRDAQVGFLTVIDRVAQFWEEKEADLRKSGDGLSDLVKE 190

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQMML 191
           AL    +    P  L +  L      +++++D+  GGF       + PKFP P  +Q +L
Sbjct: 191 ALRPRVTLQ--PLTLDEQLLATADAAIAETFDAEHGGFNFSADDPNQPKFPEPATLQYLL 248

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
             +       +SG A E QKM+  TL  +A GGI DH+GGG HRYSVD  W +PHFEKML
Sbjct: 249 ARA-------RSGSA-EAQKMLTTTLDGIAAGGIRDHIGGGLHRYSVDRFWRIPHFEKML 300

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD  QLA++Y +A+ LT +  Y  +  +  D++ R+M GP G+ +SA DADS   EG   
Sbjct: 301 YDNAQLASLYAEAYQLTGNPQYRRVAAETCDFVLREMTGPDGQFYSAIDADS---EG--- 354

Query: 312 KKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
            +EG +Y W+  E+  IL    + L K  Y L  + N            F+    + EL 
Sbjct: 355 -EEGKYYRWSQAELTAILSPAQLELAKSVYGLGGSPN------------FEEVYFVPELQ 401

Query: 371 DSSASASK-LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              A   + L +  ++    L   R  L   R+KR  P +D K + +WNGL+I+  A A 
Sbjct: 402 APIAELPQNLKLDADQLQTRLQTLRETLLAARAKRTPPAIDTKALTAWNGLMIAGLADAG 461

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
           +IL+                R++Y++ A  +A FI  ++      RL  SF++G +K   
Sbjct: 462 RILQ----------------RQDYLDAAARSADFILANVTSADG-RLLRSFKDGQAKITA 504

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           ++DDYA L+ GL+ L+E     KWL  A  L   Q ELF D   GG++ T  +   V++R
Sbjct: 505 YVDDYAMLVDGLIALHEATGEPKWLDAAERLTKQQIELFGDPRLGGFYFTAADAEEVIVR 564

Query: 550 VKEDHDGAEPSGNSVSVINLVRLA 573
            K   D A P+GNSV+  NL+ LA
Sbjct: 565 GKIATDNAIPAGNSVAAGNLLYLA 588


>gi|448355570|ref|ZP_21544321.1| hypothetical protein C483_16206 [Natrialba hulunbeirensis JCM
           10989]
 gi|445635098|gb|ELY88270.1| hypothetical protein C483_16206 [Natrialba hulunbeirensis JCM
           10989]
          Length = 722

 Score =  365 bits (938), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 238/696 (34%), Positives = 351/696 (50%), Gaps = 59/696 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS
Sbjct: 55  SACHWCHVMEDESFADEQVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLS 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML-------AQSGAFAI 132
            +L+P+ KP   GTYFP   K G+PGF  IL  + ++W   RD +         +    +
Sbjct: 115 AWLTPEGKPFYVGTYFPKNAKRGQPGFLDILENLTNSWAGDRDEIENRAEQWTDAAKDRL 174

Query: 133 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMML 191
           E+  +A+SAS   +        + L   A    +S D +FGGFGS  PKFP+P  ++++ 
Sbjct: 175 EETPDAVSASQPPSS-------DVLEAAANASLRSADRQFGGFGSDGPKFPQPSRLRVL- 226

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
             ++  + TG+     E Q +++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKML
Sbjct: 227 --ARAADRTGR----DEFQDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKML 280

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD  ++   +L  +  T D  Y+ +  + L ++ R++    G  FS  DA S E E    
Sbjct: 281 YDNAEIPRAFLIGYQQTGDERYAEVVAETLAFVARELTHEEGGFFSTLDAQSEEPE-TGE 339

Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
           ++EGAFYVWT  E+ D+L     A LF + Y +  +GN            F+G      +
Sbjct: 340 REEGAFYVWTPDEIHDVLENETTADLFCDRYDITESGN------------FEGSTQPNRV 387

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              S  A++  +        L   R KLF  R +RPRP+ D+KV+  WNGL+I++ A A+
Sbjct: 388 RSVSDLAAEYDLEAADVRARLESAREKLFAAREQRPRPNRDEKVLAGWNGLMIATCAEAA 447

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L                D  EY  +A  A  F+R  L+DE   RL   +++G     G
Sbjct: 448 LVLGG------------SEDGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDGDVAIDG 495

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           +L+DYAFL    L  YE       L +A++L    ++ F D + G  + T     S++ R
Sbjct: 496 YLEDYAFLARAALGCYEATGEVDHLAFALDLARIIEDEFWDADRGTLYFTPESGESLVTR 555

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            +E  D + PS   V+V  L+ L       + D + + A   L     R++  ++    +
Sbjct: 556 PQELGDQSTPSAAGVAVETLLALEGF--ADQDDEFEEIATTVLETHANRIETNSLEHATL 613

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW--E 667
           C AAD L   + +  V     ++ D       A A   L   +    PA  +E++ W  E
Sbjct: 614 CLAADRLESGALEITV-----AADDLPAAWREAFAGRYLPDRLFARRPATDDELESWLTE 668

Query: 668 EHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTD 702
              ++   +     + D    L VC++ +CSPP  D
Sbjct: 669 LDLADAPPIWAGREARDGEPTLYVCRDRTCSPPTHD 704


>gi|74318745|ref|YP_316485.1| hypothetical protein Tbd_2727 [Thiobacillus denitrificans ATCC
           25259]
 gi|74058240|gb|AAZ98680.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
           25259]
          Length = 673

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 239/688 (34%), Positives = 351/688 (51%), Gaps = 73/688 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  + FED  V  ++N  FV+IKVDREERPD+D++Y T  Q L   GGGWPL
Sbjct: 48  SACHWCHVMAHDCFEDAEVGAVMNRLFVNIKVDREERPDLDQIYQTAHQLLAQRGGGWPL 107

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSE 137
           +VFL+PD  P   GTYFP   +Y  PGF  ++  V  AW  +R ++LAQ+ A     L++
Sbjct: 108 TVFLTPDQTPFFAGTYFPKTARYQLPGFPELMENVAHAWHARRGEVLAQNDAVRA-ALAQ 166

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           + S  A+S   P  L    L      L++++D  +GGF  APKFPRP E+  +L  ++  
Sbjct: 167 SQSQPAASASTP--LTAAPLEQGVRDLAQAFDPVWGGFSRAPKFPRPGELFFLLRRAQ-- 222

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                 G  ++ ++M LFTL+ MA GG+ D +GGGF RYSVDE W +PHFEKMLYD G L
Sbjct: 223 ------GGDAKAREMALFTLRKMASGGVVDQLGGGFCRYSVDEEWAIPHFEKMLYDNGPL 276

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++Y DA++L  +  +      I+ +L R+M  P G  +SA DADS   EG     EG F
Sbjct: 277 LHLYADAWALRGETLFRETAEGIVAWLLREMRAPEGGFYSALDADS---EG----HEGKF 329

Query: 318 YVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           YVW+ +EV+ +L   E+A+    + +  P           P+ E    N L         
Sbjct: 330 YVWSREEVKSLLTPDEYAVAAPFYGFDAP-----------PNFENTSWNPL-RARPLEEI 377

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+ LG+        +   RRKLF  R  R RP  DDK + SWN L+I   A A +++   
Sbjct: 378 AAALGLFPTDAEARVAAARRKLFAARESRIRPGRDDKQLTSWNALMIGGLAHAGRVMA-- 435

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                         R E++  A +A  F+RR+L+  +  RL+ +F+ G ++   +LDDYA
Sbjct: 436 --------------RPEWVAEAHAAIDFLRRNLW--RDGRLRATFKRGEARLNAYLDDYA 479

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FL+  LL+  +       + WA EL +     F DRE GG+F T+ +  ++L R K  +D
Sbjct: 480 FLVDALLETMQAAYREADMAWAQELADALLAHFEDREAGGFFFTSHDHEALLTRPKPGYD 539

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A PSGN V+   L RL  ++  ++   Y   +   L +F  ++    +A P +    D 
Sbjct: 540 NATPSGNGVAAFALQRLGHLLGETR---YLDASARCLRLFLPQVVQQPIAHPTLLAVLDE 596

Query: 616 LSVPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
              P R  +VL G  + V ++   LA    + D+   +                 N   A
Sbjct: 597 ALRPPRV-IVLRGPDTPVQEWAANLAPRLGARDMLLAL----------------PNGEGA 639

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTD 702
             A     A +  A +C   +C PP+T+
Sbjct: 640 PGALAKPEAPQPTAWICSGTACQPPITE 667


>gi|291614213|ref|YP_003524370.1| hypothetical protein Slit_1752 [Sideroxydans lithotrophicus ES-1]
 gi|291584325|gb|ADE11983.1| protein of unknown function DUF255 [Sideroxydans lithotrophicus
           ES-1]
          Length = 676

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 237/704 (33%), Positives = 366/704 (51%), Gaps = 87/704 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
           + CHWCHVM  ESFEDE VA ++N+ F++IKVDREERPD+D++Y    Q L    GGWPL
Sbjct: 48  SACHWCHVMAHESFEDEAVAAVMNELFINIKVDREERPDLDQIYQNAHQLLSRRSGGWPL 107

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           ++FL+PD  P   GTYFP + +YG PGF  +++ +  A+ ++R  LA+ G    +Q+  A
Sbjct: 108 TMFLAPDGTPFYSGTYFPKQARYGLPGFPALIQDIAHAYKEQRGELAEQG----KQIVAA 163

Query: 139 LSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           L+A        D  L  + +     Q S+++D   GGFG APKF  P E+ ++L  +   
Sbjct: 164 LAAWQPEKSATDSTLDASPIATSIRQHSENFDRVNGGFGGAPKFLHPAELDLLLQQTHAT 223

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            D       ++ + +VLFTLQ MA+GG++D +GGGF RYSVD  W +PHFEKMLYD G L
Sbjct: 224 HD-------AQTRHIVLFTLQQMAQGGLYDQLGGGFCRYSVDAEWDIPHFEKMLYDNGLL 276

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +Y DA+  + D F++ I      ++ R+M  P G  +++ DADS         +EG F
Sbjct: 277 LGLYSDAWLSSSDPFFARIVEQTAAWVMREMQSPQGGYYASLDADS-------EHEEGKF 329

Query: 318 YVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLS----RMSDPHNEFKGKNVLIELND 371
           YVW   ++ D+L   E+A L + HY L  T N +      R+S P  E            
Sbjct: 330 YVWQRNDIRDLLSAAEYA-LIQPHYGLDSTPNFENHAWNLRVSQPLGEI----------- 377

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
               A KLG+  E+   +L   + KLF  R +R RP  D+K++ SWNGL+I+  A+A++I
Sbjct: 378 ----AQKLGLGEEQAAMLLAAAKTKLFAAREQRIRPGRDEKILGSWNGLMIAGMAKAARI 433

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
                             R++++  A+ A  F+R  L+  Q  RL  + ++G +    +L
Sbjct: 434 FG----------------REDWLHSAQQAMDFVRTTLW--QDGRLLATHKDGKTHLNAYL 475

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           DD+A+L++  L+L +    +  L +A+++ +     F D   GG+F T+ +  +++ R K
Sbjct: 476 DDHAYLLNAALELLQAEFRSPDLSFAVQIADALLARFEDVRNGGFFFTSHDHEALIQRNK 535

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
              D A PSGN ++   L+RLA +    +   Y   AE  L +F   ++  A     +C 
Sbjct: 536 TAQDNATPSGNGIATQGLLRLAELTGDIR---YTDAAERCLKLFFPIMQRAAGQFSSLCT 592

Query: 612 A-ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
           A  + L  PS   +VL G  + ++     AA  A Y     +I +              N
Sbjct: 593 ALGEALQPPSM--LVLCG--AEIETAAWRAAVAAKYLPGLMIIVL--------------N 634

Query: 671 SNNASM--ARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
            + AS+  + +   +    A +C    C PP+T   SL+ LL E
Sbjct: 635 GDEASLPSSLDKPRSATTTAWLCHGTQCLPPIT---SLDELLTE 675


>gi|448393368|ref|ZP_21567693.1| hypothetical protein C477_15875 [Haloterrigena salina JCM 13891]
 gi|445663783|gb|ELZ16525.1| hypothetical protein C477_15875 [Haloterrigena salina JCM 13891]
          Length = 730

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 230/701 (32%), Positives = 355/701 (50%), Gaps = 70/701 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFED+ VA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEDESFEDDDVAEVLNENFVPIKVDREERPDIDSIYMTVAQLVSGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK--------KRDMLAQSGAFA 131
            +L+P+ KP   GTYFP E +  +PGF  + +++ D+W+         + D   ++    
Sbjct: 113 AWLTPEGKPFFVGTYFPKESQRNQPGFLELCQRISDSWESEDREEMEHRADQWTEAAKDR 172

Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 190
           +E+  +   A+  + + P       L   A  + +S D ++GGFGS  PKFP+P  + ++
Sbjct: 173 LEETPDGAGAAGGAAEPPS---SEVLETAANAVLRSADRQYGGFGSGGPKFPQPSRLHVL 229

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
              ++  + TG+     E  +++  TL  MA GG+ DHVGGGFHRY VD+ W VPHFEKM
Sbjct: 230 ---ARAYDRTGR----EEYLEVIEETLDAMAAGGLSDHVGGGFHRYCVDKDWTVPHFEKM 282

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD  ++   +L  + LT D  Y+ +  + LD+L R++    G  FS  DA S E     
Sbjct: 283 LYDNAEIPRAFLAGYQLTGDERYAEVVEETLDFLERELTHDEGGFFSTLDAQS-EDPATG 341

Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
            ++EGAFYVWT  EV ++L +   A LF   Y +  +GN            F+G+N    
Sbjct: 342 EREEGAFYVWTPGEVSEVLEDETTADLFCARYDITESGN------------FEGRNQPNR 389

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
           +    + A +  +   +    L + R  LF+ R +RPRP+ D+KV+  WNGL+I++ A A
Sbjct: 390 VRSLESLAEEYDLEQSEIEERLEDARETLFEAREERPRPNRDEKVLAGWNGLMINACAEA 449

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
           + +L              G DR  Y E A  A  F+R  L+D    RL   F++G  K  
Sbjct: 450 ALVL--------------GEDR--YAEQAVDALEFVRDRLWDADEQRLSRRFKDGDVKVD 493

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           G+L+DYAFL  G L  Y+       L +A++L  T +  F D E G  + T      ++ 
Sbjct: 494 GYLEDYAFLARGALGCYQATGDVDHLAFALDLARTIEAEFWDEEQGTIYFTPESGEPLVT 553

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R +E  D + PS   V+V  L+ L         D   + A   L     +++  ++    
Sbjct: 554 RPQELTDQSTPSAAGVAVETLLALDEFA----EDDLERIAATVLETHANKIEANSLEHAS 609

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +C AAD L   + + V +   +   ++ +  A  +    L      + P   + ++ W +
Sbjct: 610 LCLAADRLEAGALE-VTVAADELPDEWRDRFAEEYHPGRL----FALRPPTEDGLEAWLD 664

Query: 669 HNSNNAS-------MARNNFSADKVVALVCQNFSCSPPVTD 702
             + + +        ARN     +    VC++ +CSPP  D
Sbjct: 665 ELALDEAPPIWAGREARNG----EPTLYVCRDRTCSPPTHD 701


>gi|448627283|ref|ZP_21671896.1| thioredoxin [Haloarcula vallismortis ATCC 29715]
 gi|445759112|gb|EMA10399.1| thioredoxin [Haloarcula vallismortis ATCC 29715]
          Length = 733

 Score =  365 bits (937), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 234/710 (32%), Positives = 356/710 (50%), Gaps = 78/710 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+E +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +
Sbjct: 58  CHWCHVMEEESFENEAIAEQLNEHFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
           L+PD +P   GTYFPPE+K G+PGF  +L+++ D+W   +++ +M   AQ    AIE   
Sbjct: 118 LTPDGEPFYVGTYFPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRAQQWTEAIESDL 177

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
           EA  A       P++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +
Sbjct: 178 EATPAD------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
              D G+     +   +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  
Sbjct: 229 AHADGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA----------- 304
           ++   +L  +       Y+ + R+  ++++R++  P G  FS  DA+SA           
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPHSESRSDSEQ 344

Query: 305 ------ETEGATRKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDP 356
                   E     +EG FYVWT ++V D + +   A +F ++Y +   GN         
Sbjct: 345 SSGESPRDEPGGETEEGLFYVWTPEQVHDAVDDETDAEVFCDYYGVTERGN--------- 395

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
              F+G  VL      +  A +     ++    L     + F+ R  RPRP  D+KV+  
Sbjct: 396 ---FEGATVLAVRKPVAVLAEEYEQSEDEITASLQRALNQTFEARKDRPRPARDEKVLAG 452

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+I + A  + +L                  ++Y +VA  A SF+R HL+DE   RL
Sbjct: 453 WNGLMIRTLAEGAIVLD-----------------EQYADVAADALSFVREHLWDEDERRL 495

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              +++G     G+L+DYAFL  G L L+E     + L +A++L     E F D E G  
Sbjct: 496 NRRYKDGDVAIDGYLEDYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTL 555

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F T     S++ R +E  D + PS   V+V  L+ L+     S +D +   AE  L    
Sbjct: 556 FFTPTGGESLVARPQELTDQSTPSSTGVAVDLLLSLSHF---SDNDRFESVAERVLRTHA 612

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
            R+    +    +  A D     + + + LVG +S+  +    A   A + + + ++   
Sbjct: 613 DRVSSNPLQHASLTLATDTYEQGALE-LTLVGDQSA--YPGEWAETLAEHYIPRRLLAHR 669

Query: 657 PADTEEMDFWEEHNSNNAS----MARNNFSADKVVALVCQNFSCSPPVTD 702
           PAD  E + W +    + S      R     +  V   C+NF+CSPP  D
Sbjct: 670 PADDSEFEQWLDALGLDESPPIWAGREQVDGEPTV-YACRNFACSPPKHD 718


>gi|150400057|ref|YP_001323824.1| hypothetical protein Mevan_1315 [Methanococcus vannielii SB]
 gi|150012760|gb|ABR55212.1| protein of unknown function DUF255 [Methanococcus vannielii SB]
          Length = 687

 Score =  365 bits (936), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 237/713 (33%), Positives = 366/713 (51%), Gaps = 58/713 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL    +TCHWCHVM  +SFED  VA  LN  F+SIKVDREERPD
Sbjct: 28  GEEAFKKAKLENKPIFLSIGYSTCHWCHVMAKDSFEDFDVADTLNKNFISIKVDREERPD 87

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +Y+   Q + G GGWPL++ ++PD KP    T+   E ++G PG   +L  + + W 
Sbjct: 88  LDDIYLKTCQLMTGSGGWPLTIIMTPDKKPFFAATFISKEPRFGSPGIIDLLEGISELWA 147

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            K D + +     +  L E +S + S  KL ++L + A      QL + YD  +GGFG  
Sbjct: 148 IKHDEIVKRSDEILIHL-ENISKTTSKGKLDEKLLEKAFL----QLKEIYDKNYGGFG-V 201

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP    I  ++ + KK   TG      E  +M + TL  M  GGI+DH+  GFHRY+V
Sbjct: 202 PKFPTAHLIIFLIKYWKK---TGN----DEALEMAIKTLDKMKMGGIYDHISYGFHRYAV 254

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DE W +PHFEKMLYDQ  ++  YL+++  T++  +  I  ++ +Y+ + +  P    +SA
Sbjct: 255 DEMWKLPHFEKMLYDQALISMAYLESYRATRNEEHKKIVSEVFEYVLKVLKSPEKAFYSA 314

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPH 357
           E+   AE+EG     EG FY W   E++ IL      +FK+ Y +KP GN  L   ++  
Sbjct: 315 EN---AESEGI----EGKFYTWNITEIDQILRNSENNIFKKVYNIKPEGNY-LGESTEAT 366

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
           N   G N+L         AS++ M  E+   IL + R+KL D    R RP  D K++  W
Sbjct: 367 N---GTNILYMERSIQEIASEMEMWPEEVDQILEKARKKLLDALENRKRPSKDYKILADW 423

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+I+S ++A +I K+E                EY++ +E A SF+   +   +  +L 
Sbjct: 424 NGLMIASLSKAGRIFKNE----------------EYIKASEDAMSFLLSKMVINE--KLY 465

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           HS+     K PGFLDDYAF+  GL++LY      ++L  A +      ELF   E GG+ 
Sbjct: 466 HSYIENELKVPGFLDDYAFITWGLIELYFATFNIEYLKKARDFAEKTLELFW--EDGGFN 523

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
             + E    + +V+  +DGA PSG S+  +NL++L+ I+   + D Y +           
Sbjct: 524 FASKEVNDNIFKVRNIYDGAIPSGTSIMALNLLKLSHIL---RIDKYHEKVYELFENSAE 580

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
           ++         M  A +  + P+   V +VG   +   + ++   +  Y  N +++ I P
Sbjct: 581 KISKSPFTYLQMLSAYNFDNDPT--DVSIVGDLENKTTKEIIDEINRVYRPNMSLLFI-P 637

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +D+E +   E+     AS  +   ++   V  +C+  SC  P T+P  + NLL
Sbjct: 638 SDSERLKKLEKI----ASFVKEYPTSKDPVVYICKKDSCLNPETNPSQILNLL 686


>gi|448562484|ref|ZP_21635442.1| thioredoxin domain containing protein [Haloferax prahovense DSM
           18310]
 gi|445718802|gb|ELZ70486.1| thioredoxin domain containing protein [Haloferax prahovense DSM
           18310]
          Length = 709

 Score =  365 bits (936), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 234/697 (33%), Positives = 350/697 (50%), Gaps = 74/697 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+ KP   GTYFPPE + G PGF+ ++    ++W   RD +A       EQ + A+
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDRDEIANRA----EQWTSAI 168

Query: 140 SAS-ASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKK 196
           +     +  +P E P  + L    +   +  D   GGFG   PKFP+P  I  +L     
Sbjct: 169 TDRLEETPDVPGEAPGSDVLDSTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL----- 223

Query: 197 LEDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
                  G A  G++  L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLY
Sbjct: 224 ------RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLY 277

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           DQ  LA+ YLDA  LT +  Y+ +  +  +++RR++    G  F+  DA S         
Sbjct: 278 DQAGLASRYLDAARLTGNESYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------G 330

Query: 313 KEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           +EG FYVWT  +V D+L E  A LF + Y + P GN            F+ K  ++ ++ 
Sbjct: 331 EEGTFYVWTPDDVRDLLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSA 378

Query: 372 SSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
           ++A  A +  +   +  + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S 
Sbjct: 379 TTAELADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSV 438

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +L+ ++         + SD       A  A  F+R  L+D++T  L     NG  K  G+
Sbjct: 439 VLEDDS---------LASD-------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGY 482

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYAFL  G  DLY+       L +A++L       F D + G  + T     S++ R 
Sbjct: 483 LEDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRP 542

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           +E  D + PS   V+    + L      +    +   A+  L  F  R++   +    + 
Sbjct: 543 QEPTDQSTPSSLGVATSLFLDLEQFAPDAD---FGDVADAVLGSFANRVRGSPLEHVSLA 599

Query: 611 CAADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-E 667
            AA+  +  VP    + +   + S ++   LA+ +    L   V+   P   EE+D W +
Sbjct: 600 LAAEKAASGVP---ELTIAADEVSDEWRETLASRY----LPGLVVSRRPGTDEELDAWLD 652

Query: 668 EHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
           E   + A    A    +  +     C+NF+CS P  D
Sbjct: 653 ELGLDEAPPIWAGREMADGEPTVYACENFTCSAPTHD 689


>gi|394990058|ref|ZP_10382890.1| hypothetical protein SCD_02483 [Sulfuricella denitrificans skB26]
 gi|393790323|dbj|GAB72529.1| hypothetical protein SCD_02483 [Sulfuricella denitrificans skB26]
          Length = 681

 Score =  364 bits (935), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 240/688 (34%), Positives = 358/688 (52%), Gaps = 73/688 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           +TCHWCHVM  ESFED+  A L+N  +++IKVDREERPD+D++Y +    L G  GGWPL
Sbjct: 48  STCHWCHVMAHESFEDQTTADLINRDYIAIKVDREERPDLDQIYQSAHNLLTGKSGGWPL 107

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           ++FL+PD  P  GGTYFPPE +Y RPGFK +L KV  A+ ++R  +AQ        L E+
Sbjct: 108 TLFLTPDQTPFYGGTYFPPEARYNRPGFKDLLPKVAQAYRERRHDIAQQNI----SLRES 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L++     +   E     L     QL K++D   GGFG APKFPRP EI   L      E
Sbjct: 164 LASGGPVPQAGIEPNPAPLAGAQSQLEKNFDPVHGGFGGAPKFPRPSEIAFCLRRYAAEE 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           +       ++  +M   TL+ +A GGI+D +GGGF RYSVDERW +PHFEKMLYD G L 
Sbjct: 224 N-------AQALEMARQTLRKIADGGINDQLGGGFCRYSVDERWLIPHFEKMLYDNGPLL 276

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            +Y +A+  + D  +  +  + + +L R+M  P G  +SA DADS          EG FY
Sbjct: 277 ELYANAWCCSGDERFRRVAEETVAWLEREMRAPQGGFYSALDADSEHV-------EGKFY 329

Query: 319 VWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT +EV   L   E+A+L + HY L    N + S     H  F   + L ++      A
Sbjct: 330 VWTPQEVAATLSADEYAVLSR-HYGLDQPANFEGS-----HWHFYVAHPLDQV------A 377

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +L + L+    +L   R KL  +R++R RP  D+K++ SWN L+I   A A +      
Sbjct: 378 RELSVELDDAWRLLESARTKLIALRAQRVRPGRDEKILTSWNALMIKGLAHAGRTF---- 433

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                        R++++ +A+ A  FI   L+  + +RL  S+++G S   G+LDDYAF
Sbjct: 434 ------------GREDWIALAQQATDFIHAELW--RNNRLLASWKDGKSNLGGYLDDYAF 479

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+  L++L +    T  L +A EL       F D + GG++ T  +  +++ R K   D 
Sbjct: 480 LLDALVELLQARFRTADLTFACELAEALLVRFEDCDQGGFYFTAHDHETLIFRPKTGFDN 539

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM-AMAVPLMCCAADM 615
           A PSGN+V+   L RL  ++  ++   Y   AE +L +F  ++    A  +  +    + 
Sbjct: 540 ATPSGNAVAAFALQRLGHLLGETR---YLAAAERALKLFYPQIASQPAGFMSFLSVLEEY 596

Query: 616 LSVPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           L  P  +  VL G    V  ++  LA     Y  +  V+ +    ++EM+          
Sbjct: 597 LDPP--QIAVLRGPAEQVAAWQQTLA---KEYRPSTMVLAL----SDEME--------KL 639

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTD 702
             + +  +   V A VCQ+  C P ++D
Sbjct: 640 PGSLDKPATSVVNAWVCQSVKCLPAISD 667


>gi|456865795|gb|EMF84112.1| PF03190 family protein [Leptospira weilii serovar Topaz str.
           LT2116]
          Length = 716

 Score =  364 bits (935), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 250/714 (35%), Positives = 360/714 (50%), Gaps = 74/714 (10%)

Query: 10  TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
           TK R    LI       TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D++Y
Sbjct: 63  TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 122

Query: 64  MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
           M  + A+   GGWPL++FL+PD KP+ GGTYFPPE +YGR  F  IL  ++  W +KR  
Sbjct: 123 MDALHAMDQQGGWPLNMFLTPDGKPITGGTYFPPEPRYGRKSFLEILNILRKVWSEKRQE 182

Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKF 181
           L  + +     L ++    A   ++     +N            YD+ FGGF +    KF
Sbjct: 183 LIVASSELSRYLKDSGEGRAIEKQVGSLPSENCFDSGFSLYESYYDAEFGGFKTNHVNKF 242

Query: 182 PRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           P  + +  +L  YHS        SG      +MV  TL  M +GGI+D +GGG  RYS D
Sbjct: 243 PPSMGLSFLLRYYHS--------SGNP-RALEMVENTLLAMKQGGIYDQIGGGLCRYSTD 293

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
             W VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG I SAE
Sbjct: 294 HHWMVPHFEKMLYDNSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGICSAE 353

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
           DADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN            
Sbjct: 354 DADS---EG----EEGLFYIWDFEEFREVCGEDSQILEKFWNVTKKGN------------ 394

Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           F+GKN+L E     + A+K      K ++ +L   R KL + RSKR RP  DDK++ SWN
Sbjct: 395 FEGKNILHE--SYRSEATKFSEEEWKRIDSVLERGRAKLLERRSKRVRPLRDDKILTSWN 452

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL I + A+A                 V   R++++++AE   SFI ++L D    R+  
Sbjct: 453 GLYIKALAKAG----------------VAFQREDFLKLAEETYSFIEKNLIDPNG-RILR 495

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
            FR+  S   G+ +DYA +IS  + L+E G G ++L  A+        LF  R   G F 
Sbjct: 496 RFRDNESGILGYSNDYAEMISSSIALFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 553

Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  Y + AE     F  
Sbjct: 554 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSARYGEFAESIFLYFTK 611

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF-ENMLAAAHASYDLNKTVIHID 656
            L   +++ P +  A       S K +VL+  +   DF +++LAA    +  +     ++
Sbjct: 612 ELSTNSLSYPHLLSAYWTYRRHS-KEIVLI--RKDTDFGKDLLAAIQTRFLPDSVFAVVN 668

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             + EE           +++  +  S    +  VC+NFSC  PV++   L+  +
Sbjct: 669 ENELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPVSNLADLKKWI 715


>gi|114778919|ref|ZP_01453713.1| hypothetical protein SPV1_12250 [Mariprofundus ferrooxydans PV-1]
 gi|114550835|gb|EAU53402.1| hypothetical protein SPV1_12250 [Mariprofundus ferrooxydans PV-1]
          Length = 685

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 242/695 (34%), Positives = 335/695 (48%), Gaps = 77/695 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  VA++LN +F++IKVDREERPD+D VYM   Q +   GGWPL+
Sbjct: 62  STCHWCHVMEHESFEDPQVAEVLNRYFIAIKVDREERPDIDAVYMHAAQLMNVSGGWPLN 121

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + L+PD KP    TY P E ++GR G   + ++V   W + R  +  S       L++++
Sbjct: 122 LLLTPDKKPFYAATYLPKEGRFGRMGLIELAQRVGVMWKQDRQRIEASANSISSALTDSI 181

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A A +  +   L   A R  A++    +D   GGFG AP FP P  +  +L +      
Sbjct: 182 -AVAKTGAMDMALVDAAYRDTAQR----FDKGSGGFGGAPLFPSPQRLLFLLRY------ 230

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            G   +  +   MV  +L  M +GGIHD +GGGFHRYS D  W +PHFEKML DQ  L  
Sbjct: 231 -GILKDQPQALTMVKESLTAMQRGGIHDQLGGGFHRYSTDAHWLLPHFEKMLSDQAMLMM 289

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y + +  T D  ++   RD  +YL RDM       ++AEDADS   EG    +EG FY+
Sbjct: 290 AYAEGWKATGDASFAATARDTAEYLLRDMRDKQDGFYTAEDADS---EG----EEGRFYL 342

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W++ E+   LG  A  F + Y ++  GN       +  +E  G N+L    +   +A   
Sbjct: 343 WSADEIRHALGRRADAFMQAYGVEADGNFS----DEASHEKTGANILHRTGEMDPAA--- 395

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
                         R KL   R+KR RP  DDKV+  WNGL I++ A   +IL       
Sbjct: 396 ----------FAAEREKLLASRAKRVRPFRDDKVLADWNGLTIAALAITGRIL------- 438

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                    D   Y+E A  AA FI  +L  +    L H +R G +   G LDDY  ++ 
Sbjct: 439 ---------DEPRYIEAATKAADFILHNLRRDDGS-LLHRWRRGEAGIAGQLDDYTDMVW 488

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GL +LYE     +WL  A+ L +     F   EGGG++     D  ++ R  +  DGA P
Sbjct: 489 GLTELYEATFDARWLKQALALNHIMLSRF-KAEGGGFYQVERSD-DLIARPMQGFDGALP 546

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGN+V++ NL+RL+ +   +             A       DMA   P          + 
Sbjct: 547 SGNAVAMHNLLRLSRLTGDAAL-------AKQAAAVAGHFSDMAEQAPSGLLHLLSAELL 599

Query: 620 SR---KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           +    K VVLVG +SS     MLA  H  Y  N  V+  D A TEE+          A  
Sbjct: 600 AESPGKEVVLVGDRSSAGAGAMLAVLHERYRPNTVVLWHD-AQTEEL----------APF 648

Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
            R   +   KV   VC+N+ C  P   P  +  LL
Sbjct: 649 TRGQKAVQGKVTVYVCENYRCKLPSNAPAVVRELL 683


>gi|256419531|ref|YP_003120184.1| hypothetical protein Cpin_0485 [Chitinophaga pinensis DSM 2588]
 gi|256034439|gb|ACU57983.1| protein of unknown function DUF255 [Chitinophaga pinensis DSM 2588]
          Length = 680

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 215/563 (38%), Positives = 300/563 (53%), Gaps = 55/563 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE E  A+++N+ F++IK+DREERPD+D +YM  VQA+ G GGWPL+VF
Sbjct: 49  CHWCHVMERESFEHEETARIMNEHFINIKIDREERPDLDHIYMDAVQAMTGSGGWPLNVF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P  GGTYFPP   + RP +  +L  +  A+ ++R+ L        + L   + A
Sbjct: 109 LTPDKLPFYGGTYFPPVKAFNRPSWTDVLLALSQAFKERREDLETQAQNMRDHL---VQA 165

Query: 142 SASSNKLP--DELPQNALRLCAE------QLSKSYDSRFGGFGSAPKFPRPVEIQMML-Y 192
           S  S K P  D +P   L   A+       + +  D  +GGFGSAPKFP    IQ +L Y
Sbjct: 166 SGFSGKAPGQDLVPHEELFTKAQCETIFNNMMQQGDKVWGGFGSAPKFPGTFIIQYLLRY 225

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
           H         S    +  +  L +L  M +GGI+D +GGGF RYS D +W  PHFEKMLY
Sbjct: 226 H--------HSFNEPKALEQALLSLDKMIRGGIYDQLGGGFARYSTDAKWLAPHFEKMLY 277

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D   L +V  +A+ LT +  Y+    D L ++ R+M   GG  +SA DADS   EG    
Sbjct: 278 DNALLVDVLSEAYQLTGNELYARTIADTLGFVAREMTDAGGGFYSALDADS---EGV--- 331

Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
            EG FY W+ +E+E ILG  A LF   Y +   GN            ++  N+L     +
Sbjct: 332 -EGKFYTWSKEEIEHILGTDAALFCAFYDVTEEGN------------WEETNILWVTKPA 378

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           +  A++ G+  E     L   R KL  VR+KR RP LDDK+I+ WN L+I +  +A    
Sbjct: 379 AVFAAEQGITEEALERSLAISREKLMAVRAKRIRPGLDDKIILGWNALMIHACCKA---- 434

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                     +  +G +R  Y E+  +A  F   HL +       H+F+ G +K P FLD
Sbjct: 435 ----------YAALGIER--YREMGVNAMKFCLEHLQNTDKQSFFHTFKGGVAKYPAFLD 482

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYA+++  L+ L E     +WL  A EL       F D  G  ++ T      V++R KE
Sbjct: 483 DYAWMVRALIALQEVSGEPEWLSKAKELTEYVVNNFSDEGGIYFYYTEAGQTDVIVRKKE 542

Query: 553 DHDGAEPSGNSVSVINLVRLASI 575
            +DGA PSGN+V   NL+ L+ +
Sbjct: 543 VYDGATPSGNAVMAANLLYLSVV 565


>gi|448666501|ref|ZP_21685146.1| thioredoxin domain-containing protein [Haloarcula amylolytica JCM
           13557]
 gi|445771632|gb|EMA22688.1| thioredoxin domain-containing protein [Haloarcula amylolytica JCM
           13557]
          Length = 717

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 228/691 (32%), Positives = 351/691 (50%), Gaps = 56/691 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+E +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +
Sbjct: 58  CHWCHVMEEESFENEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSEAL 139
           L+P+ +P   GTYFPPE+K G+PGF  +L+++ D+W   ++R+ +        E +   L
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLADSWADPEQREEMENRARQWTEAIESDL 177

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
            A+ ++   P++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +   
Sbjct: 178 EATPAN---PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAYS 231

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D G+    +    +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++ 
Sbjct: 232 DGGQQDHLN----VVQETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIP 287

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKEGAF 317
             +L  +       Y+ + R+  ++++R++  P G  FS  DA+S   E      +EG F
Sbjct: 288 RAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESIPPEDPDGDSEEGLF 347

Query: 318 YVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           YVWT ++V D + +   A +F           CD   +++P N F+G  VL      S  
Sbjct: 348 YVWTPEQVHDAVDDETDADIF-----------CDYYGVTEPGN-FEGATVLAVRKPVSVL 395

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A +     ++    L     + F+ R +RPRP  D+K++  WNGL+I + A  + +L   
Sbjct: 396 AEEYERSEDEITAGLQRALNETFEARKERPRPARDEKILAGWNGLMIRALAEGAIVLDD- 454

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                           EY +VA  A SF+R HL+DE   RL   +++G     G+L+DYA
Sbjct: 455 ----------------EYADVAADALSFVREHLWDETEQRLNRRYKDGDVAIDGYLEDYA 498

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FL  G L L+E       L +A++L     E F D + G  F T     S++ R +E  D
Sbjct: 499 FLGRGALTLFEATGDVDHLAFAMDLGQAITEAFWDDDEGTLFFTPTGGESLVARPQELTD 558

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            + PS   V+V  L+ L+     S  D + + AE  L     R+    +    +  A D 
Sbjct: 559 QSTPSSTGVAVDLLLSLSHF---SDDDRFEEVAERVLRTHADRVSSNPLQHASLTLATDT 615

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW----EEHNS 671
               + + + LVG +S  D+ +      A   + + ++   PAD    + W    E   +
Sbjct: 616 YEQGALE-LTLVGDQS--DYPSEWTETLAERYVPRRLLAHRPADEGRFEQWLDALELDEA 672

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                 R     D  V   C+NF+CSPP  D
Sbjct: 673 PPIWAGREPVDGDPTV-YACRNFACSPPKHD 702


>gi|126180264|ref|YP_001048229.1| hypothetical protein Memar_2324 [Methanoculleus marisnigri JR1]
 gi|125863058|gb|ABN58247.1| protein of unknown function DUF255 [Methanoculleus marisnigri JR1]
          Length = 721

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 244/714 (34%), Positives = 357/714 (50%), Gaps = 55/714 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    + CHWCHVME ESF D+ VAKLLND FV IKVDREERPD
Sbjct: 47  GEEAFSRAREEGKPIFLSIGYSACHWCHVMEEESFADQQVAKLLNDVFVCIKVDREERPD 106

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D+VYM    AL G GGWPL++ ++ D KP    +Y P E +YG  G   ++ ++   W 
Sbjct: 107 IDQVYMAAAHALTGAGGWPLTILMTADKKPFFAASYIPKESRYGMTGLLDLIPRISKVWQ 166

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +R  L  +G    +Q+ +AL ++A +     EL +  L        + +D   GGFG A
Sbjct: 167 TQRQGLENAG----DQVLQALQSAARTPPEEGELAEAVLDEAYNMFFRVFDGENGGFGDA 222

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           P+FP P  +  +L +  +   TGK         MV  TL  M +GGI D VG GFHRYS 
Sbjct: 223 PRFPTPHNLIFLLRYGNR---TGK----EPAYTMVEKTLHAMRRGGIFDQVGYGFHRYST 275

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W VPHFEKMLYDQ  L   Y +A+  T    ++   R+ + Y+ R+M  P G  +SA
Sbjct: 276 DAEWFVPHFEKMLYDQALLVMAYTEAYLATGREEFARTARETIAYVLREMTDPDGGFYSA 335

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPH 357
           EDADS   EG    +EG FY+WT  E+  +LGE     F   + +   GN        P 
Sbjct: 336 EDADS---EG----EEGKFYLWTKDEILGVLGEEDGERFSRIFNVTEPGNY----REQPG 384

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
            +  G+N+L      ++ A +   P +     + E R+KL   R +R RP  DDK++  W
Sbjct: 385 GKRTGRNILRLRRPLASWAHEFETPEDDLAWSVEEGRQKLLAARKQRVRPGRDDKILTDW 444

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           N L+I++ A+A++                  D  +Y+  AE AA+F+  +L  E   RL 
Sbjct: 445 NALMIAALAKAARAF----------------DEPDYLAAAERAAAFVLANLRREDG-RLL 487

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           H +R G +     LDDYAF+I  L+++YE      +L  A++L       + D   GG+F
Sbjct: 488 HRYRGGEAGLAATLDDYAFMIWALIEVYEASFAPGYLKTAVDLSRDLIARYWDCNEGGFF 547

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
               +D  V +R K  +DGA PSGNSV++  L  L  + A  + +   + AE    VF  
Sbjct: 548 FVP-DDGDVPVRQKPVYDGAIPSGNSVAMYALFVLGRMTANLELE---ETAERIRRVFAG 603

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            + +   A        + +  P+ + V++ G   + D   M+ A  + Y  +  +I   P
Sbjct: 604 TVSESPTACSHFLTGLEFMLGPNFE-VIISGVPDAEDTRAMIGAIRSHYAPDAVII-FRP 661

Query: 658 ADTEEMDFWEEHNSNNASMARN-NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +D EE +  E      A   R+     +K  A VC N++C  P TDP  +  L+
Sbjct: 662 SDEEEPEIVE-----VAGFTRDIVMIEEKATAYVCTNYACDIPTTDPDEMVRLV 710


>gi|226291405|gb|EEH46833.1| DUF255 domain-containing protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 804

 Score =  362 bits (929), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 234/585 (40%), Positives = 323/585 (55%), Gaps = 40/585 (6%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CHVME ESF    +A +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+VFL+P
Sbjct: 67  CHVMEKESFMSPEIAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTP 126

Query: 85  DLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           DL+P+ GG+Y+P P           G+  F  IL K++D W  ++    +S     +QL 
Sbjct: 127 DLEPVFGGSYWPGPHSNALPTLGGEGQITFVDILEKLRDVWHTQQLRCRESAKDITKQLR 186

Query: 137 EALSASASSNKLPD-----ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
           E  +   + +K  D     +L    L    +  +  YD+  GGF  APKFP PV +  ++
Sbjct: 187 E-FAEEGTHSKQSDVEAEEDLEIELLEEAYQHFASRYDAVNGGFSEAPKFPTPVNLSFLV 245

Query: 192 YHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
           + S+    + D     E S   ++ + TL  M++GGIHD +G GF RYSV   W +PHFE
Sbjct: 246 HLSRYPGAVADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGHGFARYSVTADWSLPHFE 305

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETE 307
           KMLYDQ QL +VY+DAF    D        DI  Y+    M+ P G   S+EDADS  + 
Sbjct: 306 KMLYDQAQLLDVYVDAFDSAYDPELLGAMYDIATYITSPPMLSPTGGFHSSEDADSRPSP 365

Query: 308 GATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
             T K+EGAFYVWT KE++ ILG+  A +   H+ +   GN  +SR++DPH+EF  +NVL
Sbjct: 366 NDTEKREGAFYVWTLKELKQILGQRDADVCARHWGVLADGN--VSRINDPHDEFINQNVL 423

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSF 425
                 S  A + G+  ++ + I+   R KL + R SKR RP LDDK+IV+WNGL I + 
Sbjct: 424 SIQVTPSKLAKEFGLGEDEVVRIIKGSREKLREYRESKRVRPDLDDKIIVAWNGLAIGAL 483

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP- 484
           A+ S +L++      + F             AE A  FI+ +L+DEQT +L   +R G  
Sbjct: 484 AKCSVVLENLDRDKAYQF----------RRAAEEAVRFIKHNLFDEQTGQLWRIYRGGVR 533

Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ---NTQDELFLDREGGGY----F 537
              PGF DDYA+LISGL++LYE       L +A +LQ    T   LF       +     
Sbjct: 534 GDTPGFADDYAYLISGLINLYEATFDDSHLQFAEQLQRYYTTPSTLFYSPSSSDFSTPTS 593

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD 582
             T   P  LLR+K   D A PS N V   NL+RL++++ G   D
Sbjct: 594 PNTPTLPPPLLRLKPGTDAATPSPNGVIARNLLRLSALLDGGDVD 638


>gi|240276138|gb|EER39650.1| DUF255 domain-containing protein [Ajellomyces capsulatus H143]
 gi|325089996|gb|EGC43306.1| DUF255 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 766

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 232/615 (37%), Positives = 320/615 (52%), Gaps = 76/615 (12%)

Query: 11  KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
           K  R  FL    + CHWCHVME ESF    VA +LN  F+ IK+DREERPD+D VYM YV
Sbjct: 57  KLNRMIFLSIGYSACHWCHVMEKESFMSPEVAAILNKAFIPIKLDREERPDIDDVYMNYV 116

Query: 68  QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDK 119
           QA  G GGWPL+VFL+PDL+P+ GGTY+P P           G+  F  IL K++D W  
Sbjct: 117 QATTGSGGWPLNVFLTPDLEPVFGGTYWPGPHSSASSTLGGEGQVTFIDILEKLRDVWQT 176

Query: 120 K--------RDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSR 171
           +        +D+  Q   FA E      S + +  +   +L    L    +  +  YD  
Sbjct: 177 QQLRCRESAKDITRQLQEFAEEGTYSKQSGAGADGEE--DLEVELLEEAYKHFASRYDPV 234

Query: 172 FGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
            GGF  APKFP P  +  ++  S+    + D     E +   +M + TL  +++GGIHDH
Sbjct: 235 NGGFSRAPKFPTPANLSFLVNLSRFSNAVADIVGYEECAHALEMAIKTLISISRGGIHDH 294

Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-D 287
           +G GF RYSV   W +PHFEKMLYDQ QL  VY DAF    D        DI  Y+    
Sbjct: 295 IGHGFARYSVTADWSLPHFEKMLYDQAQLLRVYTDAFDSAHDPELLGAMYDIAAYITSPP 354

Query: 288 MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTG 346
           ++ P     S+EDADS  T   T K+EGAFYVWT KE + ILG+  A +   H+ + P G
Sbjct: 355 VLSPTSGFHSSEDADSLPTPSDTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDG 414

Query: 347 NCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRP 405
           N +  R++DPH+EF  +NVL         A + G+  E+ + I+     KL + R SKR 
Sbjct: 415 NVE--RVNDPHDEFINQNVLHIQTTPGKLAKEFGLSEEEVVRIIKASTEKLREYRESKRV 472

Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
           RP LDDK+IV+WNGL I + A+ S +L +          V     +E+   AE+AA FIR
Sbjct: 473 RPALDDKIIVAWNGLAIGALAKCSVVLDN----------VDRIKAQEFRLAAENAAKFIR 522

Query: 466 RHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
           + L+D  + +L   +R       PGF DDYA+LISGL+DLYE      +L +A +LQ+  
Sbjct: 523 QSLFDPASGQLWRIYRGEERGDTPGFADDYAYLISGLIDLYEATFDDSYLQFAEQLQH-- 580

Query: 525 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
                                           + PS N V   NL+RL++++   + D Y
Sbjct: 581 -------------------------------ASTPSPNGVIARNLLRLSTLL---EDDTY 606

Query: 585 RQNAEHSLAVFETRL 599
           R+ A  +++ F   +
Sbjct: 607 RRLARDTVSAFAVEI 621


>gi|330508169|ref|YP_004384597.1| hypothetical protein MCON_2284 [Methanosaeta concilii GP6]
 gi|328928977|gb|AEB68779.1| protein of unknown function (DUF255) [Methanosaeta concilii GP6]
          Length = 710

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 259/726 (35%), Positives = 359/726 (49%), Gaps = 78/726 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVM  ESFED  VA+LLN  F+ IKVDREERPD
Sbjct: 43  GEEAFEAARREDKPIFLSVGYSTCHWCHVMAHESFEDPNVARLLNQSFICIKVDREERPD 102

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D++YM    A+ G GGWPL+V ++PD KP    TY P +   G  G   ++ +VK+ WD
Sbjct: 103 IDQIYMAAAIAVSGRGGWPLTVMMTPDKKPFFAATYIPKKGHMGLTGLMELIAQVKEMWD 162

Query: 119 KKRDMLAQSGAFAIEQLSEALS---ASASSNKLPDELP-----QNALRLCAEQLSKSYDS 170
             R+ L  S    ++ L    S   A        D L       + L      LS  YD 
Sbjct: 163 NDRESLMSSANIIVDHLKGRQSGRGAGVQKEAHKDSLSGSPFDSSLLSRGYSALSSIYDP 222

Query: 171 RFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
             GGFG+APKFP P  I  +L   K+ ++           +M   TLQ M  GGI+DHVG
Sbjct: 223 ENGGFGTAPKFPTPHHILFLLRCWKRTKNILP-------LEMAKTTLQGMRMGGIYDHVG 275

Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
            GFHRYS D  W VPHFEKMLYDQ  LA  Y +A+  T +  Y+   R+IL+Y+ RDM  
Sbjct: 276 FGFHRYSTDPEWFVPHFEKMLYDQALLAMAYAEAYQATGEEEYAQTVREILEYILRDMTS 335

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCD 349
           P G  +SAEDADS   EG    +EG FY WT+ E+++ LGE    L    + +  +GN +
Sbjct: 336 PEGGFYSAEDADS---EG----EEGKFYTWTAVELKESLGEEDFRLLIRLFDVYESGNYE 388

Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
             R           N+L + +  S +AS L +P E+  +   +   +L+  R KR  P  
Sbjct: 389 GER-----------NILRQRSSFSDAASVLKIPEEELYHRSSDMISRLYLAREKRVHPLK 437

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           DDK++  WNGL+I++ ARA+  L+                  +    A  AA F+   + 
Sbjct: 438 DDKILTDWNGLMIAALARAAGALQD----------------PDLATAASRAADFLLEVMR 481

Query: 470 DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 529
             +  RL H +R G +     LDDYAFLI GL++LYE     K+L  A+ L    D+ F 
Sbjct: 482 TPEG-RLMHRYRQG-ADIQANLDDYAFLIWGLIELYEATFDVKYLKAAVHLNEIMDKHFW 539

Query: 530 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
           D E GG+F T  +   +L+R KE +DGA PSGNS++++NL+RL  +   +       + E
Sbjct: 540 DGEAGGFFFTADDGEELLVRKKEYYDGALPSGNSIALLNLLRLLHLTGDT-------SLE 592

Query: 590 HSLAVFETRLKDMAMAVPL----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHAS 645
              A+          A PL    + CA D    P+ + V LVG       + MLAA    
Sbjct: 593 EKAALLARSALPAVSAQPLGYTMLLCALDYALGPTYE-VALVGSLEDGGLKEMLAAIRIR 651

Query: 646 YDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPI 704
           +  NK V+    ++   +          A   R+      K  A VC +  C  P T+  
Sbjct: 652 FLPNKAVVLASGSEIVML----------APFTRDLVPVKGKAAAYVCSDHVCQLPATNAA 701

Query: 705 SLENLL 710
            L  LL
Sbjct: 702 ELMALL 707


>gi|410941737|ref|ZP_11373531.1| PF03190 family protein [Leptospira noguchii str. 2006001870]
 gi|410783286|gb|EKR72283.1| PF03190 family protein [Leptospira noguchii str. 2006001870]
          Length = 698

 Score =  362 bits (928), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 242/699 (34%), Positives = 357/699 (51%), Gaps = 75/699 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  +  +   GGWPL++
Sbjct: 63  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHEMEQQGGWPLNM 122

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ KP+ GGTYFPPE KYGR GF  +L  ++  W +KR  L  + +    +LS+ L 
Sbjct: 123 FLTPEGKPITGGTYFPPESKYGRKGFLEVLNIIQKVWTEKRSELIAAAS----ELSQYLK 178

Query: 141 ASASSNKLPDE---LPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMMLYHSK 195
            SA S     E      N            YDS+FGGF +    KFP  + +  +L +  
Sbjct: 179 DSAESKSRAQETDFTSANCFDSGFLLYENYYDSQFGGFKTNQVNKFPPNMGLGFLLRYY- 237

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
                  S +     +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD  
Sbjct: 238 ------LSSKNPRALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNS 291

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
               +  +   ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG
Sbjct: 292 LFLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEG 344

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E N   ++
Sbjct: 345 LFYIWDLEEFREVCGEDSFLLEKFWNVSKEGN------------FEGKNILHE-NFRGSN 391

Query: 376 ASKLGMPLEKYLNILGECRR---KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
            ++     E++  + G   R   KL + RSKR RP  DDK++ SWNGL I +  +     
Sbjct: 392 FTE-----EEFKQLDGALLRGKAKLLERRSKRIRPFRDDKILTSWNGLYIKALVKTG--- 443

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                        +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +
Sbjct: 444 -------------IAFQREDFLKLAEETYSFIEKNLIDSKG-RMLRRFREGESGILGYSN 489

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DY+ +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   
Sbjct: 490 DYSEMIASSIVLFEAGRGIRYLRNAVLWMEEVIRLF--RSSAGVFFDTGIDGEVLLRRSV 547

Query: 553 D-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           D +DG EPS NS    +L++L+ +  G  S+ Y + AE     F   L   A++ P +  
Sbjct: 548 DGYDGVEPSANSSLAHSLIKLSFL--GVNSERYLEIAESIFVYFRKELYSYALSYPYLLS 605

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           A       S K +VL+  K+S   +++ A+  + +  +  +  ++  + EE         
Sbjct: 606 AYWSYKHHS-KEIVLI-RKNSEAGKDLFASIRSRFLPDSVLAIVNEDELEEA-------R 656

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +S+     S    +  VC+NFSC  P+ +   LE  +
Sbjct: 657 KLSSLFDFKDSGGNALVYVCENFSCKLPIDNVSDLEKYM 695


>gi|392380898|ref|YP_005030094.1| conserved protein of unknown function; putative Thioredoxin and
           glycosidase domains [Azospirillum brasilense Sp245]
 gi|356875862|emb|CCC96610.1| conserved protein of unknown function; putative Thioredoxin and
           glycosidase domains [Azospirillum brasilense Sp245]
          Length = 672

 Score =  362 bits (928), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 240/698 (34%), Positives = 348/698 (49%), Gaps = 80/698 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE+  +A L+N+ FV+IKVDREERPDVD++Y + +  L   GGWPL++
Sbjct: 50  ACHWCHVMAHESFENPEIAGLMNELFVNIKVDREERPDVDQIYQSALAMLGQQGGWPLTM 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P  GGTYFPP  +YGRPGF  +LR V + +  K + + ++    +  L +AL 
Sbjct: 110 FLTPEAEPFWGGTYFPPASRYGRPGFPDVLRGVAETYRNKPENVTRN----VAALKDALG 165

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             A  N+   E+    L   A++L +  D   GG G APKFP+ V I  +L+  +    T
Sbjct: 166 KLA-ENRAAGEVDLAMLDQIADRLVREVDPFHGGIGHAPKFPQ-VPIFTLLW--RAWLRT 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           GK       ++ V  TL  M++GGI+DH+GGGF RYSVDE W VPHFEKMLYD  QL ++
Sbjct: 222 GK----EPYREAVTNTLAHMSQGGIYDHLGGGFARYSVDEMWLVPHFEKMLYDNAQLLDL 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
               +   ++  +    R+ + ++ R+MI  GG   + +DADS   EG    +EG FY+W
Sbjct: 278 MTLVWQAEREPLFETRIRETVGWVLREMIAEGGGFAATQDADS---EG----EEGLFYIW 330

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-----IELNDSSAS 375
             +E++ +LG  A +FK  Y + P GN            ++G  +L     IE  D+   
Sbjct: 331 NEEEIDRLLGPGAEVFKRAYGVTPQGN------------WEGATILNRLHRIEALDAETE 378

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+            L E R  L+  R KR +P  DDKV+  WNGL+I++ A+A  +    
Sbjct: 379 AT------------LAEQRAILWREREKRIKPGWDDKVLADWNGLMIAALAQAGMVF--- 423

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        D   ++  A+SA +F+R  + ++   RL HS+R G  K    LDDYA
Sbjct: 424 -------------DEPAWIAAAQSAYAFVRDRMTEDG--RLLHSWRAGQLKHRATLDDYA 468

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +    L L+E       L  A       D  F D + GGYF T  +   +++R K   D
Sbjct: 469 HMARAALALHEATGDAGALEQARAWVRVLDAHFWDAQAGGYFYTADDADDLIVRTKSAGD 528

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A PSGN      L  LA++   +    YR+ A+   A F   L      +P    AA++
Sbjct: 529 AATPSGNGTM---LAVLATLHHRTGEAAYRERADALAAAFSGELSRNFFPLPTYLNAAEL 585

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L       +V+VG   + D    L  A     L   ++ + P  T   D    H ++   
Sbjct: 586 LQ--KALQIVIVGDPQASD-TAALRRAVLDRPLPDRILSVLPPGT---DLPAGHPAHGKG 639

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
           M           A VC   +CSPPVT P +L   L  +
Sbjct: 640 M-----QGGVATAYVCTGMTCSPPVTTPDALAAALTRR 672


>gi|320334089|ref|YP_004170800.1| hypothetical protein [Deinococcus maricopensis DSM 21211]
 gi|319755378|gb|ADV67135.1| hypothetical protein Deima_1486 [Deinococcus maricopensis DSM
           21211]
          Length = 674

 Score =  361 bits (927), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 250/711 (35%), Positives = 337/711 (47%), Gaps = 110/711 (15%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFED   A  +N+ FV++KVDRE+RPDVD VYM  VQA+ G GGWP++V
Sbjct: 48  TCHWCHVMAHESFEDAQTAAFMNEHFVNVKVDREQRPDVDAVYMRAVQAMTGAGGWPMTV 107

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P   GTYFPP D YG P F+T+L  V +AW  +RD L    A A+ +   A+S
Sbjct: 108 FLAPDRRPFYAGTYFPPRDAYGMPSFRTVLASVANAWADRRDQL-LGNADALTEHVRAMS 166

Query: 141 A--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           A   A+   LP++     L    +   +++D+R GGFGSAPKFP P  +  +L       
Sbjct: 167 APKPAADGALPEDFAPRGL----DNARRTFDARHGGFGSAPKFPAPTFLTYLLTQ----- 217

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                    +G+ M + TL  M +GG+ D +GGGFHRYSVDERW VPHFEKMLYD  QL 
Sbjct: 218 --------PDGRDMAVRTLDAMMRGGLMDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLV 269

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL A  +T    +    R  L Y+ R+++ P G    A+DAD    EG     EG F+
Sbjct: 270 RAYLRAHVVTGRADFLDTARATLAYMERELLTPEGGFACAQDADQ---EGI----EGKFF 322

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASAS 377
           VWT +E  D+LG  A L   HY +   GN       DPH+  F  ++VL  + D    A 
Sbjct: 323 VWTPQEFRDLLGADADLALRHYGVTDAGN-----FQDPHHPAFGRRSVLSVVTDVPELAR 377

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
              +  +     LG  R  LF  R  R  P LDDKV+ SWNGL + +FA A ++      
Sbjct: 378 AFSLGEDDVRARLGRARETLFSARRARAHPGLDDKVLTSWNGLALMAFADAYRL------ 431

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                     +    Y++VA   A F+R  L       L H++R   +   G L+D A  
Sbjct: 432 ----------TGETHYLDVARRNADFVRARLTAPDGAPL-HAYR---ADVRGLLEDAALY 477

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR-VKEDHDG 556
             GL+ LY      + L WA  L +       D +  G F ++G D   L+    E  D 
Sbjct: 478 GLGLVALYAAAGNLEHLQWARALWDRARRDHWD-DAAGVFYSSGPDAEALVAPTTETFDA 536

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A  S N+         A+ + G   D Y    E   A    R+        L   A DML
Sbjct: 537 AIMSDNA---------AACLLGLHIDRY--FGEDEGARITARV--------LAGTANDML 577

Query: 617 SVPS------RKH---------VVLVGH-KSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
           + PS      + H         + L+G  +    FE  LAA    +      + + PA+ 
Sbjct: 578 THPSGFGGLWQAHAHLHAPHVEIALLGTPEQRAPFERALAAQDLPF------VTVAPAER 631

Query: 661 -EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
              +   E    N              VA VC+NF+C  P  DP +    L
Sbjct: 632 GGGLPLLEGREGNG-------------VAYVCRNFTCDLPARDPAAFTAQL 669


>gi|398348235|ref|ZP_10532938.1| hypothetical protein Lbro5_13624 [Leptospira broomii str. 5399]
          Length = 669

 Score =  361 bits (926), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 258/723 (35%), Positives = 358/723 (49%), Gaps = 78/723 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    +  +  FL     TCHWCHVME ESFEDE  A +LN +FVSIKVDREERPD
Sbjct: 10  GKDAFLKAKEEDKMIFLSIGYATCHWCHVMEKESFEDEATAAVLNQYFVSIKVDREERPD 69

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD++YM  + A+   GGWPL++FL+ + KP+ GGTYFPP  KYGR  F  +L  + + W 
Sbjct: 70  VDRIYMDALHAMNQQGGWPLNMFLTSEGKPITGGTYFPPVAKYGRKSFVEVLNILANLWK 129

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQL--------SKSYDS 170
           +K+  L      A E+L++ L  S  S  L +   Q+A +L ++++         + YD 
Sbjct: 130 EKKGELID----ASEELTQYLKESEESKALNE---QSAFQLPSKKVFENAFGMYDRFYDP 182

Query: 171 RFGGFGS--APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
            F GF S    KFP  + +  +L   K       +GE  +  +MV  TL  M KGGI+D 
Sbjct: 183 EFAGFKSNVTNKFPPSMGLFFLLRFYK------STGE-PKALEMVEETLVAMRKGGIYDQ 235

Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDM 288
           +GGG  RYS D +W VPHFEKMLYD        ++ F  T  V Y     D+L+YL RDM
Sbjct: 236 IGGGISRYSTDHKWLVPHFEKMLYDNSLFLEALVECFQTTGHVKYKEAAYDVLEYLSRDM 295

Query: 289 IGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNC 348
              GG I SAEDADS   EG    +EG FY+W   E  ++ G  AIL +E + +   GN 
Sbjct: 296 RLQGGGIASAEDADS---EG----EEGLFYLWKRNEFHEVCGSDAILLEEFWNVTEIGN- 347

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
                      F+G N+L E +  +  A   G+  E+ + I+   R+KL   RS R RP 
Sbjct: 348 -----------FEGSNILHE-SFRTNFARLHGLEQEELIEIVDRNRKKLLARRSDRIRPL 395

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
            DDKV++SWN L + +  +A+                      E + +AE    FI  +L
Sbjct: 396 RDDKVLLSWNCLYVKAATKAAMAFGD----------------GELLRLAEETFRFIENNL 439

Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
             E   RL   FR+G ++   +  DYA  I   L L++ G G ++L  AI  +  +D + 
Sbjct: 440 VREDG-RLLRRFRDGEARFLAYSGDYAEFILASLWLFQAGKGIRYLTLAI--RYAEDAVR 496

Query: 529 LDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
           L R   G F  TG D   LLR   D +DG EPS NS        L+ +  G +SD Y   
Sbjct: 497 LFRSPAGVFFDTGSDADDLLRRNVDGYDGVEPSANSSFAFAFTILSRL--GVESDKYSDF 554

Query: 588 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 647
           A+   + F+  L+   M  P M  A  + +  S++  V+  + +  D   +     A + 
Sbjct: 555 ADAIFSYFKVELETHPMNYPYMLSAYWLKNSASKELAVV--YSTQEDLFPVWQGIGAMF- 611

Query: 648 LNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
           L +TV      D E      E       + RN  S   V A  CQ F C  PV+D ISL 
Sbjct: 612 LPETVFAW-ATDKE-----AEEVGEKILLLRNRVSGGSVKAYYCQGFQCDLPVSDWISLR 665

Query: 708 NLL 710
             L
Sbjct: 666 EKL 668


>gi|219852761|ref|YP_002467193.1| hypothetical protein Mpal_2172 [Methanosphaerula palustris E1-9c]
 gi|219547020|gb|ACL17470.1| protein of unknown function DUF255 [Methanosphaerula palustris
           E1-9c]
          Length = 714

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 242/692 (34%), Positives = 341/692 (49%), Gaps = 64/692 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESF D  VA LLND++++IKVDREERPD+D+VYM   Q + G GGWPL++
Sbjct: 74  TCHWCHVMAEESFMDLKVAALLNDYYIAIKVDREERPDIDQVYMAVCQMMTGSGGWPLTI 133

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD +P    TY P   ++   G   +L  V   W +K   L +     +E L +   
Sbjct: 134 IMTPDRRPFFAATYIPKMSRFRGTGMLDLLPMVAQVWREKPGDLIEVATQVVEALHQPAR 193

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A A      D L      L A     ++D   GGFG APKFP P  +  +L + +     
Sbjct: 194 AGAGPEPTIDLLIAGYRGLAA-----TFDPVRGGFGDAPKFPAPHNLLFLLRYWR----- 243

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            +SGE      MV  TLQ M  GGI+DH+ GGFHRYS D  W VPHFEKMLYDQ  L   
Sbjct: 244 -RSGEPV-ALAMVEQTLQAMRHGGIYDHLAGGFHRYSTDGGWKVPHFEKMLYDQAMLVMA 301

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +AF  T +  Y       + Y+ RD++   G   +A+DADS   EG    +EG +Y+W
Sbjct: 302 YTEAFLATGNREYRKTAEATIQYVLRDLVTREGGFAAAQDADS---EG----EEGRYYLW 354

Query: 321 TSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASK 378
           T  EV  +L +  A  F   Y +   GN      +DP N +  G+NVL    D+      
Sbjct: 355 TLAEVRGLLTQDEAATFTTAYQMTERGN-----FTDPSNPKLTGRNVLYRSPDA------ 403

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
              PL+     L     KL   R +R  P  DDKV+  WNGL+I++ ARA +        
Sbjct: 404 ---PLQDPDLHLVAADAKLAAARRERVPPLTDDKVLTGWNGLMIAALARAGRAFGV---- 456

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                        +Y++VA  AA F+   + D Q  RL H +R+G     G  +DYA LI
Sbjct: 457 ------------ADYIDVAGRAADFLLGTMRD-QGGRLLHRYRDGEVAISGQAEDYAALI 503

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLLDLY+     ++L  A+E+         D  GGG+F+   +   +++R KE +DGA 
Sbjct: 504 WGLLDLYQATFTVRYLADAVEVMKEFTARCWDPAGGGFFSAAEDATDLIVRQKEQYDGAM 563

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PS NSV+ ++L+ LA +   +    Y + AE  L  F T + + +  +     A    ++
Sbjct: 564 PSANSVAFMDLLLLARL---TGEPAYEEQAEE-LGRFMTGVVEQSPLIATFFLAGLDFAL 619

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
              + VV+VG + +VD   M+ A    + L  T +   PA     D         ASM R
Sbjct: 620 GPAQEVVIVGDEGAVDTTAMVRALAERF-LPSTTVQFKPAAAGAEDL-TTVAPFTASMER 677

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            +    +    VC   SC+PP    + +E +L
Sbjct: 678 KD---GRATVYVCSGQSCAPPA---VGVEAML 703


>gi|392955811|ref|ZP_10321341.1| hypothetical protein A374_03694 [Bacillus macauensis ZFHKF-1]
 gi|391878053|gb|EIT86643.1| hypothetical protein A374_03694 [Bacillus macauensis ZFHKF-1]
          Length = 679

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 239/709 (33%), Positives = 348/709 (49%), Gaps = 83/709 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    + ++  FL    +TCHWCHVM+ ESF+D  VA LLN+ FV+IKVDREERPD
Sbjct: 28  GEEAFEKARREKKPVFLSIGYSTCHWCHVMKKESFDDHEVAALLNERFVAIKVDREERPD 87

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D+VYM   Q L G GGWPL+VFL+ D +P   G YFP ED+YG PGFK+++ ++ + + 
Sbjct: 88  LDQVYMAVCQGLTGQGGWPLNVFLTADQRPFYAGVYFPKEDRYGSPGFKSVITQLSEKYT 147

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           ++ + +        ++L+E+L         P  L +  L  C  QL + +DS +GGF  A
Sbjct: 148 ERHEEIHDYS----KRLTESLQRKMKQE--PTALQETILHTCFNQLGQMFDSIYGGFSQA 201

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  +L +       G+        +MV  TL  MA GGI+D +G GF RY+V
Sbjct: 202 PKFPAPTILTYLLRY-------GQWQGNDLALQMVERTLDAMADGGIYDQIGYGFSRYAV 254

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D+ W VPHFEKMLYD   L   Y++A+ +TK   Y  I  +I+ Y+   M    G  + A
Sbjct: 255 DQMWLVPHFEKMLYDNALLLIAYVEAYQVTKKPRYQQIAAEIIQYVTTVMRDEQGGFYCA 314

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADS   EG    +EG +YV++  E+E  L +           + +  C L  ++D  N
Sbjct: 315 EDADS---EG----EEGKYYVFSKTEIERQLPQE----------QASAFCALYDITDEGN 357

Query: 359 EFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
            F+G NV  LI        A  LG+  EK   ++ + R+ L+  R  R  PH DDK++ S
Sbjct: 358 -FEGNNVPNLIHQRKERI-AQTLGITEEKLSTLVEQARQTLYRYRETRIPPHKDDKILTS 415

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WN L+I   A+A+                   D   Y E A+SA SFI + L      R+
Sbjct: 416 WNALMIVGLAKAA----------------AAWDEPAYREHAKSALSFIEKELVIHD--RV 457

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              +R G  +  GF+DDYAFL    L++YE     +++  A  L      LF D   GG+
Sbjct: 458 MVRYREGDVQGKGFIDDYAFLAWAYLEMYEATFDDRYISKAQTLTQDMLSLFWDESHGGF 517

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           +    +   +++  KE +DGA PSGN V+   L +L  + A  +   Y +  E    VF 
Sbjct: 518 YYAGNDAEQLIVTGKEAYDGAMPSGNGVAAYVLWKLGKLTADPQ---YDEKLEALFDVFS 574

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HI 655
           + L         +     ML+      VVLV  +  V        A +   L KT + H+
Sbjct: 575 SDLSHYPTGHTQLLQVW-MLTQMKTAEVVLVAEQEQV--------ASSLRTLQKTFLPHV 625

Query: 656 -----DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPP 699
                DP +           +   S    + +    +  VC+NF C  P
Sbjct: 626 VWFLQDPRE----------RAAFTSFQLVDRTKKHPMIYVCENFHCQRP 664


>gi|421098293|ref|ZP_15558964.1| PF03190 family protein [Leptospira borgpetersenii str. 200901122]
 gi|410798561|gb|EKS00650.1| PF03190 family protein [Leptospira borgpetersenii str. 200901122]
          Length = 691

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 243/690 (35%), Positives = 350/690 (50%), Gaps = 72/690 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNI 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD KP+ GGTYFPPE  YGR  F  +L  ++  W++KR  L  + +    +LS+ L 
Sbjct: 115 FLTPDGKPITGGTYFPPEPMYGRKSFLEVLNILRKVWNEKRQELIAASS----ELSQYLK 170

Query: 141 ASASSNKLPDE----LPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML-YH 193
            S     +  +      +N            YD+ FGGF +    KFP  + +  +L YH
Sbjct: 171 DSGERRTIEKQEGGLSSENCFDSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYH 230

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
                   +S       +MV  TL  M +GGI+D VGGG  RYS D  W VPHFEKMLYD
Sbjct: 231 --------RSSGNPRALEMVENTLLAMKQGGIYDQVGGGLCRYSTDFYWMVPHFEKMLYD 282

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
                   ++   ++K +       D++ YL RDM    G I SAEDADS   EG    K
Sbjct: 283 NSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVDGGICSAEDADS---EG----K 335

Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           EG FY+W  +E  ++ GE + + ++ + +   GN            F+GKN+L E     
Sbjct: 336 EGLFYIWGLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILYE--SYR 381

Query: 374 ASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           + A+KL     K ++ +L   R KL + R+KR RP  DDK++ SWNGL I +  +A    
Sbjct: 382 SEATKLSEEEWKQIDSVLERGRAKLLERRNKRVRPLRDDKILTSWNGLYIKALTKAG--- 438

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                        V   R++++ +AE   SFI R+L D  + R+   FR+G S   G+ +
Sbjct: 439 -------------VAFQREDFLRLAEETYSFIERNLID-PSGRMLRRFRDGESGILGYSN 484

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYA +I+  + L+E G G ++L  A+        LF  R   G F   G D  VLLR   
Sbjct: 485 DYAEMITSSIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDAGSDGEVLLRRSV 542

Query: 553 D-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           D +DG EPS NS    +LV+L+  + G  S  YR+ AE     F   L   +++ P +  
Sbjct: 543 DGYDGVEPSANSSLAYSLVKLS--LFGIDSVRYRKFAESIFLYFTKELSTNSLSYPHLLS 600

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           A       S K +VL+  K S   +++LA     +  +     I+  + EE         
Sbjct: 601 AYWTYRHHS-KEIVLI-RKDSDSGKDLLAEIQTKFLPDSVFAVINEDELEEA-------R 651

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVT 701
             +++  +  S    +  +C+NFSC  PV+
Sbjct: 652 KLSTLFDSRDSGGNALVYICENFSCKLPVS 681


>gi|239627004|ref|ZP_04670035.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
 gi|239517150|gb|EEQ57016.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
          Length = 638

 Score =  360 bits (924), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 213/560 (38%), Positives = 298/560 (53%), Gaps = 63/560 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+EG+A +LN  ++ IKVDREERPDVD VYM+  QA+ G GGWPL+
Sbjct: 7   STCHWCHVMERESFENEGIAGILNRDYICIKVDREERPDVDSVYMSVCQAMNGQGGWPLT 66

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD +P   GTYFPP+ +YGR G + +L  V   W   R+ L + GA  IE   +  
Sbjct: 67  IIMTPDCRPFFSGTYFPPKARYGRVGLEELLAAVSAQWKGGRERLLE-GAGRIEAFLKEQ 125

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             +  S +   E+   A RL        +D + GGFG APKFP P  I  ++ +  +   
Sbjct: 126 EQADVSAEPGLEVVHRAFRL----FGDGFDKKNGGFGQAPKFPTPHNIMFLMEYGVRENK 181

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            G          M + TL  M +GGI DH+GGGF RYS DE+W VPHFEKMLYD   LA 
Sbjct: 182 PGAV-------DMAMDTLVQMYRGGIFDHIGGGFSRYSTDEQWLVPHFEKMLYDNALLAM 234

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y  A+ LT    Y+ + + IL Y+  ++    G  +  +DADS          EG +YV
Sbjct: 235 AYAKAYGLTGRGLYARVVQRILGYVEAELTHASGGFYCGQDADSDGV-------EGRYYV 287

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL-NDSSASAS 377
           +T +E++ +LG E    F   + +   GN            F+GKN+   L N+   +A 
Sbjct: 288 FTPEEIKQVLGPEDGADFCSQFGITGIGN------------FEGKNIPNLLGNEDYETAG 335

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K               RRKL++ R +R   H DDK++VSWNG +I + A A  +L +   
Sbjct: 336 KEA------------SRRKLYEYRIRRAHLHKDDKILVSWNGWMICACAMAGAVLGA--- 380

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         +Y+++A  A +FIR HL  +   RL   +R+G +   G LDDYA  
Sbjct: 381 -------------GQYVDMAVRAEAFIRTHLVKD--GRLLVRYRDGDAAGQGKLDDYACY 425

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  LL+LYE   GT +L  A+    T    F DRE GG++    +   +++R KE +DGA
Sbjct: 426 VLALLELYEVTFGTGYLEQAVYWAKTMVLQFFDRERGGFYLYAEDGEQLIVRTKEAYDGA 485

Query: 558 EPSGNSVSVINLVRLASIVA 577
            PSGNS +   L +LA I  
Sbjct: 486 VPSGNSAAARVLQQLAQITG 505


>gi|448435859|ref|ZP_21586927.1| hypothetical protein C472_11724 [Halorubrum tebenquichense DSM
           14210]
 gi|445683294|gb|ELZ35694.1| hypothetical protein C472_11724 [Halorubrum tebenquichense DSM
           14210]
          Length = 739

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 241/719 (33%), Positives = 350/719 (48%), Gaps = 83/719 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA ++ND FV IKVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAGVINDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLS 136
            + +P+ KP   GTYFPPE +  +PGF+ +  ++ D+W   +++ +M  ++  +A     
Sbjct: 113 AWCTPEGKPFYVGTYFPPEARQNQPGFRDLCERIADSWSDPEQREEMKRRADQWAESARD 172

Query: 137 EALSASASSNKLP----DELPQNA--LRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
           E  S        P    D  P     L   A    +SYD  +GGFGS   KFP P  I +
Sbjct: 173 ELESVPTPDAPGPDGEGDASPPGGDLLESAAASALRSYDDEYGGFGSGGAKFPMPGRIDL 232

Query: 190 MLY-HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
           ++  +++   D   S  A         TL  M++GG++D +GGGFHRY+VD  W VPHFE
Sbjct: 233 LMRAYARSGRDALLSAAAG--------TLDGMSRGGMYDQIGGGFHRYAVDREWTVPHFE 284

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYD  +L   YLD + L  D  Y+ +  + L +L R++    G  FS  DA S   E 
Sbjct: 285 KMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHDDGGFFSTLDARSRPPE- 343

Query: 309 ATRKK---------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHN 358
            +R+          EGAFYVWT +EV+ +L E A  L  E Y ++  GN +         
Sbjct: 344 -SRRDDDGHEAGDVEGAFYVWTPEEVDAVLDEPAASLAAERYGIRSGGNFE--------- 393

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
             +G  V          A+   +  E     L E R  LFD R  RPRP  D+KV+ SWN
Sbjct: 394 --RGTTVPTTAASVEELAADRDLSPEAVRQALTEARTALFDARESRPRPARDEKVLASWN 451

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRL 476
           G  IS+FA A+  L                  + Y ++A  A  F R  LY  D +T  L
Sbjct: 452 GRAISAFADAAGTLG-----------------EPYADIAREALGFCRDRLYDADAETGAL 494

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              + +G  + PG+LDDYAFL  G LD Y      + L +A+EL     + F D + G  
Sbjct: 495 ARRWLDGDVRGPGYLDDYAFLARGALDTYAATGDLEPLGFALELAEALVDEFYDADDGTI 554

Query: 537 FNT---------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YYRQ 586
           + T         T +   ++ R +E  D + PS   V+   L    +++ G ++D  +R+
Sbjct: 555 YFTRDPEGDGGQTDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGRFRE 610

Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 646
            A   +     R++   +A   +  AAD++       V +   +   ++   L   +   
Sbjct: 611 IARRVVTTHADRIRGGPLAHASLVRAADLVET-GGVEVTIAADEVPDEWRETLGERY--- 666

Query: 647 DLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVTD 702
            L   ++   PA    +D W +      +    A  + + D+  A VCQ+F+CSPP TD
Sbjct: 667 -LPNALVAPRPATAAGLDEWLDRLDMAEAPPIWADRSATDDEPTAYVCQDFTCSPPRTD 724


>gi|320160551|ref|YP_004173775.1| hypothetical protein ANT_11410 [Anaerolinea thermophila UNI-1]
 gi|319994404|dbj|BAJ63175.1| hypothetical protein ANT_11410 [Anaerolinea thermophila UNI-1]
          Length = 684

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 243/689 (35%), Positives = 347/689 (50%), Gaps = 74/689 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED  +A++LN  FVSIKVDREERPDVD +YM  V AL G GGWPLSV
Sbjct: 49  ACHWCHVMAHESFEDPQIAEILNQHFVSIKVDREERPDVDGIYMNAVIALTGQGGWPLSV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ KP  GGTYFPP  ++G P F+ +L     AW+  RD L ++G    EQL++ + 
Sbjct: 109 FLTPEGKPFYGGTYFPPTPRHGLPAFRDVLHAALQAWENDRDDLFKAG----EQLAQHIH 164

Query: 141 ASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           A      +P   L  N L      L  SYD R+GG+G+AP+FP+P+ ++ +L    +  +
Sbjct: 165 AMNDWGSVPGLVLRANLLEQVTHALLASYDRRYGGWGNAPRFPQPMALEFLLLQVTRGNE 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   +  K V   LQ M++GG++D +GGGF RYS D  W VPHFEKMLYD  Q+++
Sbjct: 225 --------DALKPVEHNLQVMSRGGLYDIIGGGFARYSTDNHWLVPHFEKMLYDNAQISS 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYL A  L K+ ++  I    LD+L  +M  P G  FS+ DADS   EG    +EG FY+
Sbjct: 277 VYLHAGMLEKNPWFLRIATQTLDFLLEEMRHPLGGFFSSLDADS---EG----EEGKFYL 329

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLS--RMSDPHN-EFKGKNVLIELNDSSASA 376
           W   E+  I             L+P G  D S    + P N  F+GK +L    D     
Sbjct: 330 WDFDELRQI-------------LEPAGQWDFSCQVFNLPRNGNFEGKIILQIQEDWERLP 376

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            K G+    +L  +   R  L+  RS R RP  DDKVIVSWNG  + + A A++ L    
Sbjct: 377 EKTGLSETDFLKQMDTVRALLYQKRSLRVRPSTDDKVIVSWNGFALRALAEAARYL---- 432

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       +R +Y+  A+  A F+  +LY  +   L  ++R G  +    L+DYA 
Sbjct: 433 ------------NRPDYLHAAQQNAHFLLENLYTPRG--LMRTWREGSPRQIALLEDYAS 478

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           LI GLL LY+      W  WA++L       + D   GG+++T  +   +++R K+  D 
Sbjct: 479 LIIGLLALYQSDDNIVWYEWAVKLGEEMISRYRD-PAGGFYDTRDDQQDLIIRPKDFQDN 537

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A P GNS++   L+ L    +G  S Y  Q A     + +  L     A      A D  
Sbjct: 538 ATPCGNSLASYALLLLYEF-SGDDSIY--QLATRVFPLLQDSLVKYPTAFGFWLQAIDWA 594

Query: 617 SVPSRKHVVLVGHKSSVD---FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             PSR+ V L+  ++  +   F+N+L   +             P        ++   +  
Sbjct: 595 MGPSRQ-VALLAPRTLEELQPFKNILWETYR------------PRLVCASSTFQPATNAP 641

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
           A +   +    +V A +C+ F C  P +D
Sbjct: 642 ALLQERSVLNGEVTAYLCEGFVCLQPTSD 670


>gi|262197654|ref|YP_003268863.1| hypothetical protein [Haliangium ochraceum DSM 14365]
 gi|262081001|gb|ACY16970.1| protein of unknown function DUF255 [Haliangium ochraceum DSM 14365]
          Length = 681

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 247/707 (34%), Positives = 364/707 (51%), Gaps = 86/707 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED  +A ++N+ FV++K+DREERPDVD VYM  +Q L  GGGWPLS 
Sbjct: 49  ACHWCHVMAHESFEDAEIAAVMNELFVNVKIDREERPDVDAVYMNALQILGEGGGWPLSA 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SE 137
           F +PD KP   GTYFPP+D+YGRPGF ++LR +   ++ +RD + Q+    ++ L    E
Sbjct: 109 FCTPDGKPYFLGTYFPPQDRYGRPGFASVLRTMAKVFEDQRDKVDQNTEAIVDGLRRVDE 168

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
                A S ++   L  + L     QL++  D + GG GS PKFP      +       L
Sbjct: 169 HFRRGALSGEV-GALRADLLITAGRQLAQRSDPQHGGLGSKPKFPSSTTHAL-------L 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
              G+    +  ++  L   + MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD GQL
Sbjct: 221 ARAGRLAFGAPAREAFLKQARSMARGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNGQL 280

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +Y DA+++ +D  ++ +  + + +L  +M  P G +++++DADS   EG    +EG +
Sbjct: 281 LGIYGDAYAMDQDPAFARVIDETITWLEDEMQHPSGALYASQDADS---EG----EEGKY 333

Query: 318 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELND 371
           YVWT +E+  +LG   AI F+  Y +  TGN +     LSR+SDP  +          +D
Sbjct: 334 YVWTPEEIRAVLGPVDAIFFERAYGVSETGNFEHGTTVLSRVSDPGGD----------SD 383

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
            +A AS                  +L   R +R  P  D KV+  WNGL +    RA   
Sbjct: 384 EAALASAR---------------ARLLAARKQRVAPETDTKVLAGWNGLAVRGAVRA--- 425

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
                      +   G+ R   + +A   A F+  H+  E   RL   F++G +K  G L
Sbjct: 426 -----------WETTGNARA--LALAVRVAEFLAGHMLHEGGTRLWRVFKDGSTKLDGTL 472

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL-DREGGG-YFNTTGEDPSVLLR 549
           DDYAF+  G L L E     +W      L +T  E F  +R+G G ++ T G+D  ++ R
Sbjct: 473 DDYAFVAHGFLHLAEATGDARWWRHGAALIDTILERFYEERDGVGIFYMTPGDDTLLVHR 532

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            + + D A P+G SV+V  L+RLA +    ++      AE  LA    +  +   A   +
Sbjct: 533 PESNSDHAIPAGASVAVACLLRLAQVAEDKRA---LDIAERYLAGRVPQAGENPFAFSRL 589

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             A D+        VV+V      D   +LAAA   Y   + ++   PA  E    W   
Sbjct: 590 LSALDLY---LHGQVVVVSAGEGAD--ELLAAARRVYAPARMLV---PALAES---W--- 635

Query: 670 NSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
            + ++ +A  + +AD +  A VC+  +CS PV+D  +L  LL   P+
Sbjct: 636 -AADSLLAGKDAAADGRAQAYVCRGQTCSAPVSDAQALRELLTATPA 681


>gi|325283375|ref|YP_004255916.1| hypothetical protein Deipr_1147 [Deinococcus proteolyticus MRP]
 gi|324315184|gb|ADY26299.1| hypothetical protein Deipr_1147 [Deinococcus proteolyticus MRP]
          Length = 679

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 239/692 (34%), Positives = 347/692 (50%), Gaps = 89/692 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFE+E  A L+N+ FV+IKVDREERPDVD +YM   QA+ G GGWP++
Sbjct: 57  STCHWCHVMAHESFENEATAGLMNERFVNIKVDREERPDVDGIYMAATQAMTGQGGWPMT 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL    +P   GTY+PP +  G P F+ ++  V DAW  +R  L ++ A A+ +  +A+
Sbjct: 117 VFLDHQRRPFHAGTYYPPHEGLGLPSFRRVMTAVSDAWQNRRADL-EANAQALTEHIQAM 175

Query: 140 SA--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           S   SA   + P EL Q  L L    L + +D   GGFG APKFP P  +  +L      
Sbjct: 176 SEPRSAGGQEWPAELLQAPLDL----LPQVFDPVHGGFGGAPKFPAPTTLDFLL------ 225

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
               KSG+  +GQ+M L TL+ M +GGI+D +GGGFHRYSVD +W VPHFEKMLYD  QL
Sbjct: 226 ----KSGD-EQGQQMALHTLRQMGRGGIYDQLGGGFHRYSVDAQWLVPHFEKMLYDNAQL 280

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
               L A+ ++ D  ++   R+ L YL R+M  P G  +SA+DAD+   EG T       
Sbjct: 281 TRTLLAAYQVSGDPAFAEAARETLRYLEREMRHPSGSFYSAQDADTEGVEGLT------- 333

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           + WT  E++ +LG E A      Y +   GN +     DPH    G+  ++         
Sbjct: 334 FTWTPAELQAVLGAEDAEWLARFYGVTEGGNFE-----DPHRRDAGRRTVL--------- 379

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           S++G    +  + L E R +L   R +RP+PH DDKV+ SWNGLV+++ A AS+IL    
Sbjct: 380 SRVGELTPEQRSRLPELRARLLTAREERPQPHRDDKVLTSWNGLVLAALADASRILGE-- 437

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
                           ++E+A   A+++R  +  +    L H++ +G + +  G L+D+A
Sbjct: 438 --------------PHWLELARQNAAWVRETM-RQPDGTLWHTWLDGHAPSVEGLLEDHA 482

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
               GL+ LY+     ++L WA EL       F D   G + ++ G+  ++L R     D
Sbjct: 483 LYGLGLVALYQASGELEYLTWARELWTVVQRDFWDDAAGLFRSSGGKAEALLTRQSSAFD 542

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA--VFETRLKDMAMAVPLM---C 610
            A  S N+ + +  + +          YY      +LA     + L DM  A   M    
Sbjct: 543 SAIISDNAAAALLALWI--------DRYYGDPQAQALAHRTVSSHLADMVQAPHGMGGLW 594

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            AA ML  P  +  ++     S +    L AA A + L    + + PA T       EH 
Sbjct: 595 QAAAMLRAPHTELAII----GSAEERAPLEAAAARFLL--PYVALAPAPTPAGLPVLEHR 648

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               +            A +C N +C  P  D
Sbjct: 649 EGGGT------------AYLCVNRACQLPTQD 668


>gi|448321193|ref|ZP_21510673.1| hypothetical protein C491_09424 [Natronococcus amylolyticus DSM
           10524]
 gi|445604053|gb|ELY58004.1| hypothetical protein C491_09424 [Natronococcus amylolyticus DSM
           10524]
          Length = 724

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 219/600 (36%), Positives = 314/600 (52%), Gaps = 41/600 (6%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF DE VA LLN+ F+ IKVDREERPDVD +YMT  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMEEESFADEEVADLLNEEFIPIKVDREERPDVDSIYMTVCQLVSGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            +L+P+ KP   GTYFP   K G+PGF  +L  + D+W+  R+ +            + L
Sbjct: 113 AWLTPEGKPFYVGTYFPKRSKRGQPGFLDLLEGLADSWETDREEIESRADEWTAAARDQL 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
             +  S    +    + L   A+   +S D + GGFGS  PKFP+P  ++++   ++  +
Sbjct: 173 EETPDSIGAAEPPSSDVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL---ARAYD 229

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+     E ++++  +L  M +GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++ 
Sbjct: 230 RTGR----DEYREVLEGSLTAMIEGGLYDHVGGGFHRYCVDADWTVPHFEKMLYDNAEIP 285

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
              L  + LT D  Y+   R+ L+++ R++    G  FS  DA S + E   R +EGAF+
Sbjct: 286 RALLAGYRLTGDERYAGYVRETLEFVSRELTHDEGGFFSTLDAQSEDPETGER-EEGAFF 344

Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT  EV ++LG+   A LF   Y +  +GN            F+G++        S  A
Sbjct: 345 VWTPAEVREVLGDETDADLFCARYDITESGN------------FEGQSQPNLAASISELA 392

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +  +   +    L   R+KLF+ R +RPRP+ D+KV+  WNGL+IS+ A A+  L    
Sbjct: 393 DRFDLEEREVEERLESARQKLFEAREERPRPNRDEKVLAGWNGLMISTCAEAALAL---- 448

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                     G DR  Y E+A  A  F+R  L+D    RL   +++G     G L+DYAF
Sbjct: 449 ----------GEDR--YAEMATDALEFVRDRLWDADEGRLSRRYKDGDVAVQGNLEDYAF 496

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L  G L  YE       L +A+EL    +  F D E    + T     S++ R +E  D 
Sbjct: 497 LARGALGCYEATGEVDHLAFALELARGIEAEFYDAERETLYFTPESGESLVTRPQELTDQ 556

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           + P+   V+V  L+ L       + D +   A   L     RL+  A+    +C AAD L
Sbjct: 557 STPAAAGVAVETLLALEGFA--DEDDEFEGIAASVLGTHAGRLESNALQHVTLCLAADRL 614


>gi|448585374|ref|ZP_21647767.1| thioredoxin domain containing protein [Haloferax gibbonsii ATCC
           33959]
 gi|445726074|gb|ELZ77691.1| thioredoxin domain containing protein [Haloferax gibbonsii ATCC
           33959]
          Length = 709

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 231/693 (33%), Positives = 348/693 (50%), Gaps = 66/693 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+ KP   GTYFPPE + G PGF+ ++    ++W   RD +        EQ + A+
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDRDEIENRA----EQWTSAI 168

Query: 140 SAS-ASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKK 196
           +     +  +P E P  + L    +   +  D   GGFG   PKFP+P  I  +L   + 
Sbjct: 169 TDRLEETPDVPGEAPGSDVLDSTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL---RG 225

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
              TG+     E   +   +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  
Sbjct: 226 YAVTGR----REALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAG 281

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           LA+ YLDA  LT +  Y+ +  +  +++RR++    G  F+  DA S         +EG 
Sbjct: 282 LASRYLDAARLTGNESYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GEEGT 334

Query: 317 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FYVWT  +V D+L E  A LF + Y + P GN            F+ K  ++ ++ ++A 
Sbjct: 335 FYVWTPDDVRDLLPELDADLFCDRYGVTPGGN------------FERKTTVLNVSATTAE 382

Query: 376 -ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A +  +   +  + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +L+ 
Sbjct: 383 LAEEYELDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVVLED 442

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
           ++         + SD       A  A  F+R  L+D++T  L     NG  K  G+L+DY
Sbjct: 443 DS---------LASD-------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYLEDY 486

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           AFL  G  DLY+       L +A++L       F D + G  + T     S++ R +E  
Sbjct: 487 AFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPT 546

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D + PS   V+    + L      +    +   A+  L  F  R++   +    +  AA+
Sbjct: 547 DQSTPSSLGVATSLFLDLEQFAPDAD---FGGVADAVLGSFANRVRGSPLEHVSLALAAE 603

Query: 615 MLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHNS 671
             +  VP    + +   +   ++   LA+ +    L   V+   P   EE+D W +E   
Sbjct: 604 KAASGVP---ELTIAADEVPDEWRETLASRY----LPGLVVSRRPGTDEELDAWLDELGL 656

Query: 672 NNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
           + A    A    +  +     C+NF+CS P  D
Sbjct: 657 DEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689


>gi|325262773|ref|ZP_08129509.1| dTMP kinase [Clostridium sp. D5]
 gi|324031867|gb|EGB93146.1| dTMP kinase [Clostridium sp. D5]
          Length = 668

 Score =  359 bits (921), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 240/720 (33%), Positives = 359/720 (49%), Gaps = 95/720 (13%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  R  FL    +TCHWCHVM  ESFEDE VA++LN  ++ IKVDREERPD
Sbjct: 28  GPEAFQKAKQEDRPVFLSIGYSTCHWCHVMAHESFEDEQVAEVLNSQYICIKVDREERPD 87

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D VYM+  QA+ G GGWPL+  L+P+ +P   GTYFP   +YG PG   +L ++   W 
Sbjct: 88  IDSVYMSACQAVTGAGGWPLTAILTPEQQPFFLGTYFPKHPRYGHPGLIELLEEIGSLWR 147

Query: 119 KKRDMLAQSGAFAIEQLSEALS-ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
           + R+ L ++G    +Q++E +S    +S  +PD   +  L+   E   + YDSR+GGFG 
Sbjct: 148 ENRNKLIEAG----QQITEFISIPDHASGSIPD---KKGLKRAFELYRRQYDSRWGGFGK 200

Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGE-ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
           APKFP P        H+          E   E  +M   TL  MA GG++D +GGGF RY
Sbjct: 201 APKFPAP--------HNLLFLLHYSLLENEQEALEMAEHTLTAMAHGGMNDQIGGGFSRY 252

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
           S DE+W VPHFEKMLYD   LA  YL+A+ + K   Y+   R  LDY+ R++ GP G+ +
Sbjct: 253 STDEKWLVPHFEKMLYDNALLAIAYLEAYHIKKRELYADTARRTLDYVLRELTGPSGQFY 312

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSD 355
             +DADS   EG     EG +Y ++ +E+  +LG+     F   Y +  +GN        
Sbjct: 313 CGQDADS---EGI----EGKYYFFSPEEIMSVLGDGDGEEFCRIYDITASGN-------- 357

Query: 356 PHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
               F+G+++  LI  ++    A  + +              ++++ R  R   H DDKV
Sbjct: 358 ----FEGRSIPNLIGQSELPWRADDIRL-------------NRIYNYRRNRTLLHRDDKV 400

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
           I+SWN  ++ + A+A++IL              G  R  Y + A +   FI+ H+ D+ +
Sbjct: 401 ILSWNSWMMIAMAKAAQIL--------------GDTR--YKDAAIAVHRFIQAHMTDD-S 443

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
            RL H +R G +   G LDDYA     LL+LY       +L  A        ELF DRE 
Sbjct: 444 RRLYHRWREGEAAIEGQLDDYAVYGLALLELYRTAYEPVYLEEAAFFAGQMAELFEDREN 503

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           GGYF T  +  +++ R KE +DGA PSGNS + + L +LA       + ++++  E  + 
Sbjct: 504 GGYFLTASDTEALITRPKETYDGAVPSGNSAAAVLLSQLAHYTC---TPFWQEALERQIN 560

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF--ENMLAAAHASYDLNKT 651
                + +          A      PS++ +         +   E +L        LN++
Sbjct: 561 FLAGVVNEYPSGHSFGLQALMSALYPSQELICATSDNGMPEILKEYLLRVP----VLNRS 616

Query: 652 VIHIDPADTEEMD----FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
           VI   P + EE++    F +E+              +  +  +CQN  C+ PV+D   LE
Sbjct: 617 VILKTPENKEELEKAVPFLKEY----------PVPEEGAMFYLCQNGRCTAPVSDLRKLE 666


>gi|448570870|ref|ZP_21639381.1| thioredoxin domain containing protein [Haloferax lucentense DSM
           14919]
 gi|448595768|ref|ZP_21653215.1| thioredoxin domain containing protein [Haloferax alexandrinus JCM
           10717]
 gi|445722788|gb|ELZ74439.1| thioredoxin domain containing protein [Haloferax lucentense DSM
           14919]
 gi|445742222|gb|ELZ93717.1| thioredoxin domain containing protein [Haloferax alexandrinus JCM
           10717]
          Length = 703

 Score =  358 bits (920), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 239/708 (33%), Positives = 346/708 (48%), Gaps = 96/708 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+ KP   GTYFPPE + G PGF+ ++    ++W   RD +          +++ L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDVVESFAESWRTDRDEIENRADQWTSAITDRL 172

Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 197
             +  +   P E P  + L    +   +  D   GGFG   PKFP+P  I  +L      
Sbjct: 173 EETPDT---PGEAPGSDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL------ 223

Query: 198 EDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
                 G A  G++  L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYD
Sbjct: 224 -----RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYD 278

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q  LA+ YLDA  LT +  Y+ +  +  +++RR++    G  F+  DA S         +
Sbjct: 279 QAGLASRYLDAARLTGNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GE 331

Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT  +V D+L E  A LF + Y + P GN            F+ K  ++ ++ +
Sbjct: 332 EGTFYVWTPADVRDLLPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSAT 379

Query: 373 SAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
           +A  A +  +   +  + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +
Sbjct: 380 TADLADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVV 439

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L+ ++ +A                 A  A  F+R  L+D++T  L     NG  K  G+L
Sbjct: 440 LEDDSLAAD----------------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYL 483

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYAFL+ G  DLY+       L +A++L       F D + G  + T     S++ R +
Sbjct: 484 EDYAFLVRGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQ 543

Query: 552 EDHDGAEPSGNSVSVINLVRL------------ASIVAGSKSDYYRQNA-EH-SLAVFET 597
           E  D + PS   V+    + L            A  V GS ++  R +  EH SLA+   
Sbjct: 544 EPTDQSTPSSLGVATSLFLDLKQFAPDAGFGEVADAVLGSFANRVRGSPLEHVSLALAAE 603

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
           +    A  VP +  AAD   VP      L                 AS  L   V+   P
Sbjct: 604 K---AASGVPELTVAAD--EVPDEWRATL-----------------ASRYLPGLVVSRRP 641

Query: 658 ADTEEMDFW-EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
               E+D W +E   + A    A    +  +     C+NF+CS P  D
Sbjct: 642 GTDAELDAWLDELGLDEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689


>gi|399574327|ref|ZP_10768086.1| hypothetical protein HSB1_01250 [Halogranum salarium B-1]
 gi|399240159|gb|EJN61084.1| hypothetical protein HSB1_01250 [Halogranum salarium B-1]
          Length = 723

 Score =  358 bits (920), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 242/705 (34%), Positives = 347/705 (49%), Gaps = 67/705 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE VA +LND FV IKVDREERPD+D+VY T  Q + G GGWPLS
Sbjct: 53  SACHWCHVMADESFEDEAVADVLNDEFVPIKVDREERPDLDRVYQTICQLVSGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+ KP   GTYFPP+ + G PGF  +LR + ++WD + D          +Q + AL
Sbjct: 113 VWLTPEGKPFYVGTYFPPQARQGAPGFLDLLRNISNSWDSEEDRAEMEN--RADQWTTAL 170

Query: 140 SASASSNKLP-DELPQ-NALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMMLYHSK 195
               +    P DE P  + L   A+   +  D   GGFGS   PKFP P  I ++L   +
Sbjct: 171 DDQLADTPDPADETPDVDVLGTAAQAALRGADREHGGFGSGEGPKFPHPGRIDLLL---R 227

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             + +G+     E   +   TL  MA GG++D VGGGFHRY+VD  W VPHFEKMLYD  
Sbjct: 228 TYDRSGR----GETLNVATETLDAMANGGLYDQVGGGFHRYTVDRSWTVPHFEKMLYDNA 283

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA-------DSAETEG 308
           +L   YL  + +T +  Y+ I ++   ++ R++  P G  FS  DA       +SAE+  
Sbjct: 284 ELPKSYLAGYQVTGEPRYARIAQETFAFVERELTHPDGGFFSTLDAQSEGFDDESAESAD 343

Query: 309 A-------TRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
                     ++EGAFYVWT ++V ++L E  A LF + Y +   GN +           
Sbjct: 344 GDDSEGGEAEREEGAFYVWTPEQVHEVLDEEDAELFCDRYGITKRGNFE----------- 392

Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
            G +VL         A +  +        L   R  LF+ R +RPRP  D+KV+  WNGL
Sbjct: 393 HGTSVLNISTPVEELAEEYDIDRADVSERLTNARVALFEAREERPRPPRDEKVLAGWNGL 452

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
           +ISSFA  +++L      A                 AE A SF+R HL+D+   RL   F
Sbjct: 453 MISSFAMGARVLDPALAGA-----------------AERALSFVREHLWDDDAKRLSRRF 495

Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
           ++   K  G+L+DYAFL  G  +LY+       L +A++L    +  F D E G  + T 
Sbjct: 496 KDQDVKGDGYLEDYAFLARGAFELYQATGDVDHLAFALDLARVIEAEFWDDEKGTLYFTP 555

Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
                ++ R +E  D + PS   V+   LV L      S +D +   AE  L     R++
Sbjct: 556 ASGEQLVTRPQELTDSSTPSSLGVATDLLVDLDHF--DSDAD-FGDIAERVLKTHADRIR 612

Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
              +    +  AA+  +    +  + V      D+  +LA  +    L   V+   P   
Sbjct: 613 GSPLEHVSLALAAEKFARGGLELTLAVDELPD-DWWEVLAGRY----LPGAVVSQRPHSD 667

Query: 661 EEMDFWEE---HNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           +E+D W +    +      A  +    K     C++F+CSPP TD
Sbjct: 668 DELDEWLDVLGLDEVPPIWAGRDGKNGKATVYACESFACSPPQTD 712


>gi|374376399|ref|ZP_09634057.1| protein of unknown function DUF255 [Niabella soli DSM 19437]
 gi|373233239|gb|EHP53034.1| protein of unknown function DUF255 [Niabella soli DSM 19437]
          Length = 687

 Score =  358 bits (920), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 236/698 (33%), Positives = 350/698 (50%), Gaps = 75/698 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME ESFED   A L+N+ F++IKVDREERPD+D +YM  VQ + G GGWPL+V
Sbjct: 49  ACHWCHVMERESFEDAATAALMNEHFINIKVDREERPDIDHIYMDAVQTMTGSGGWPLNV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD KP  GGTY+PP     RP +K +L  V DA+  KR  + Q      +QL +A S
Sbjct: 109 FLTPDKKPFYGGTYYPPVSYANRPSWKDVLTAVSDAFQNKRTAIQQQAEGLTQQLVDANS 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                    D L       C+  L ++ D+ +GGFG APKFP+   I+ +L +    +D 
Sbjct: 169 FGIGDGSGADFLRDEVDAACSAILKQA-DTSWGGFGRAPKFPQTQTIRFLLRYHYAEKDR 227

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
             S  A    +  L +L  M +GGI+D VGGGF RY+ D  W  PHFEKMLYD   L   
Sbjct: 228 PDSF-ADNALQQALLSLDKMMEGGIYDQVGGGFARYATDTEWLAPHFEKMLYDNALLVVT 286

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
             +A+ +T+D  Y       + ++ R++    G  ++A DADS   EG    +EG FYVW
Sbjct: 287 LSEAYQVTRDERYRGCIEQTIAFIERELTDASGGFYAALDADS---EG----EEGKFYVW 339

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNC---DLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           + KE+E++L E A LF  +Y +  +GN    ++ R+  P  EF   N   E+N++   A 
Sbjct: 340 SKKEIEELLREDADLFCRYYDITESGNWEGKNILRILTPLKEFAATN---EINETLLEA- 395

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                      +L + R +L   R+ R RP LDDK+I+ WN L+ +++++A +   +EA 
Sbjct: 396 -----------LLEKGRLQLLVARAHRIRPALDDKIILGWNALMNTAYSKAFEATGNEA- 443

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y++ A     F+  + ++       H ++ G +K P FLDDYA+L
Sbjct: 444 ---------------YLQRATDNMRFL-LNAFENTDGSFAHVWKAGVAKYPAFLDDYAYL 487

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I  LL L    +   +L  A  L     E F + E G +F T      V+LR KE +DGA
Sbjct: 488 IEALLQLARVTADYSYLEKARALCQGIQEHFAESETGYFFYTPQNQGDVILRKKEVYDGA 547

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V   NL+ L+      +   +R  AE  +     +L +  +  P     A ML+
Sbjct: 548 TPSGNAVMAANLLHLSVCFDLPE---WRVQAEQMI----VQLANAIIKYP-TSFGAWMLA 599

Query: 618 V----PSRKHVVLVG-HKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
                   K + L+G +KSS+  + +L      + L   +I   P          +  + 
Sbjct: 600 FYRVQQGSKEIALIGDYKSSL--QELL-----HHFLPGAIIMAGPNADAHYPLLADKRAG 652

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           N            ++  +C++++C  PV +   L NLL
Sbjct: 653 N-----------PLLIYLCEHYACRQPVDNLTELFNLL 679


>gi|421090081|ref|ZP_15550882.1| PF03190 family protein [Leptospira kirschneri str. 200802841]
 gi|410001344|gb|EKO51958.1| PF03190 family protein [Leptospira kirschneri str. 200802841]
          Length = 711

 Score =  358 bits (920), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 237/696 (34%), Positives = 353/696 (50%), Gaps = 68/696 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++
Sbjct: 78  TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 137

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  
Sbjct: 138 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 197

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
           + A   +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS  
Sbjct: 198 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 255

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                 SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD   
Sbjct: 256 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 308

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
              +  +   ++K +       DI+ YL RDM   GG I        +  +  + ++EG 
Sbjct: 309 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGI-------CSAEDADSEEEEGL 361

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FY+W  +E  ++ GE + L ++ + +   GN            F+GKN+L E    +   
Sbjct: 362 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 405

Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           S       K+L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +        
Sbjct: 406 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 459

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA
Sbjct: 460 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 508

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +
Sbjct: 509 EMIASSIVLFEAGRGVRYLQNAVFWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 566

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           DG EPS NS    +LV+L+ +  G  SD YR+ AE     F   L   A+  P +  A  
Sbjct: 567 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALNYPFLLSAYW 624

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
                SR+ V++   K+S    ++LA   + +  +     ++  + EE           +
Sbjct: 625 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 675

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           S+  +  S    +  VC+NFSC  P+ +   LE  +
Sbjct: 676 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 711


>gi|338532946|ref|YP_004666280.1| hypothetical protein LILAB_16495 [Myxococcus fulvus HW-1]
 gi|337259042|gb|AEI65202.1| hypothetical protein LILAB_16495 [Myxococcus fulvus HW-1]
          Length = 696

 Score =  358 bits (920), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 241/698 (34%), Positives = 344/698 (49%), Gaps = 71/698 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE    A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+
Sbjct: 57  SACHWCHVMAHESFESPETARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PDLKP  GGTYFPP+D+YGRPGF  +L  ++DAW+ K+D + +  A   E L E  
Sbjct: 117 VFLTPDLKPFYGGTYFPPQDRYGRPGFPRLLGALRDAWENKQDEVQRQAAQFEEGLGEL- 175

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A+   +  P  L    +    + ++K  D   GGFG APKFP P+   +ML   ++   
Sbjct: 176 -ATYGLDAAPSALTAADVVAMGQGMAKQVDPAHGGFGGAPKFPNPMNFALMLRAWRR--- 231

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G  +  +  V  TL+ MA GGI+D +GGGFHRYSVD RW VPHFEKMLYD  QL +
Sbjct: 232 ----GGGAPLKDAVFLTLERMALGGIYDQLGGGFHRYSVDARWRVPHFEKMLYDNAQLLH 287

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  A  +     +  +  + + Y+RR+M   GG  ++A+DADS   EG    +EG F+V
Sbjct: 288 LYAQAQQVEPRPLWRKVVEETVAYVRREMTDAGGGFYAAQDADS---EG----EEGKFFV 340

Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W  +EV   L E  A L   H+ +KP GN +            G  VL  +   +  A +
Sbjct: 341 WRPEEVRAALPEAQAELVLRHFGIKPEGNFE-----------HGATVLEVVVPVAELARE 389

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G+  +     L   R+ LF+ R +R +P  DDK++  WNGL+I   A A+++       
Sbjct: 390 RGLSEDAVARALAAARQTLFEARERRVKPGRDDKLLSGWNGLMIRGLALAARVF------ 443

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     +R E+   A  AA F+    +D    RL  S++ G ++  GFL+DY  L 
Sbjct: 444 ----------ERPEWATWAAEAADFVLAKAWD--GTRLARSYQEGQARIDGFLEDYGDLA 491

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
           SGL  LY+     K+L  A  L      LF D E   Y         +++      D A 
Sbjct: 492 SGLTALYQATFDVKYLEAADALVRRAVALFWDAEKAAYLTAPRGQKDLVVATYGLFDNAS 551

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSG S      V LA++  G K   + +  E  +A     L   AM    +  AAD L  
Sbjct: 552 PSGASTLTEAQVELAALT-GDKQ--HLELPERYVARMREGLVRNAMGYGYLGLAADAL-- 606

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF-WEEHNSNNASMA 677
                         ++    +  A AS D+      +D A    +   W+       ++ 
Sbjct: 607 --------------LEGAAAVTVAGASDDVAPLCAAVDHAFAPTVALSWKAPGQPVPALL 652

Query: 678 RNNFSADKVV-----ALVCQNFSCSPPVTDPISLENLL 710
           +  F   + V     A +C+ F C  PVT+P  L   L
Sbjct: 653 QATFEGREPVKGRAAAYLCRGFVCELPVTEPDVLAQRL 690


>gi|410724261|ref|ZP_11363459.1| thioredoxin domain containing protein [Clostridium sp. Maddingley
           MBC34-26]
 gi|410602266|gb|EKQ56747.1| thioredoxin domain containing protein [Clostridium sp. Maddingley
           MBC34-26]
          Length = 617

 Score =  358 bits (918), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 240/689 (34%), Positives = 349/689 (50%), Gaps = 78/689 (11%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE VAK++ND FV++KVDREERPDVD VYMT  QAL G GGWPL++ ++PD K
Sbjct: 1   MAHESFEDEEVAKIMNDNFVAVKVDREERPDVDSVYMTVCQALTGHGGWPLTIIMTPDQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTY+P + KY  PG   IL  V   W + ++ L  +    + +L +      S  +
Sbjct: 61  PFYAGTYYPKKSKYNIPGLMDILNAVVKQWSEDKNKLISTSDGILSELGQYFEGETSCVE 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
           L  +  +N       QL +++D  +GGFG APKFP P +I  +L + K  ++  K+ E +
Sbjct: 121 LTSKTLENGYN----QLLQTFDKNYGGFGEAPKFPTPHKIMFLLRYYKNHKNI-KALEIA 175

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           E       TL  M +GG+ DH+G GF RYS D +W VPHFEKMLYD   L   YL+ + +
Sbjct: 176 EK------TLVSMYRGGMFDHIGYGFSRYSTDNKWLVPHFEKMLYDNALLILAYLEGYEI 229

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           TK+  Y  +    L+Y+ R++    G  + AEDADS   EG    +EG +YV+   E+  
Sbjct: 230 TKNELYKDVATKALEYIFRELSNKEGGFYCAEDADS---EG----EEGKYYVFEPSEILR 282

Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLE 384
           +LG E    F +++ +   GN            F+GK++  LI+ N+   +  K      
Sbjct: 283 VLGDEDGTYFNDYFDITLNGN------------FEGKSIPNLIKNNEFDKTNDK------ 324

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
               I   C + L   RS R + H DDK++ SWNGL+I++ A+A K+++ E         
Sbjct: 325 ----IKALCEQVLL-YRSDRYKLHKDDKILTSWNGLMIAALAKAYKVIEDE--------- 370

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
                   Y E A+ A +FI   L DE  +RL   +R   S+   +LDDYAFL  GL++L
Sbjct: 371 -------RYFEYAKKAVNFIFEKLMDEN-NRLLARYREEESRHKAYLDDYAFLCFGLIEL 422

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNS 563
           YE      +L  A+++       F D +  G++   GED   L+ R KE  DGA PSGNS
Sbjct: 423 YESSFDISFLSKALDINKNMINFFWDYKNYGFY-LYGEDSEQLIARPKELFDGAMPSGNS 481

Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
           V+  NL++LA I   S  +   + A   L      +    +       AA      S++ 
Sbjct: 482 VAAYNLIKLARITGDSNLE---EMAGKQLNFICGSILREEINHSFFLLAASFALSESKEL 538

Query: 624 VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE--MDFWEEHNSNNASMARNNF 681
           V L+  KS  +    L +  A ++L   +   +  D  E  + F +E+          +F
Sbjct: 539 VCLIKDKSEEEKIKDLLSEKAIFNLTTIIKTNENKDEIEKLIPFVKEY----------DF 588

Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLL 710
             DK    +C+  SC  PV D   L NLL
Sbjct: 589 INDKSTYYLCKGKSCLAPVNDIDELINLL 617


>gi|433424873|ref|ZP_20406585.1| thioredoxin domain containing protein [Haloferax sp. BAB2207]
 gi|432197957|gb|ELK54295.1| thioredoxin domain containing protein [Haloferax sp. BAB2207]
          Length = 703

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 228/694 (32%), Positives = 343/694 (49%), Gaps = 68/694 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+ KP   GTYFPPE + G PGF+ ++    ++W   RD +          +++ L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDVVESFAESWRTDRDEIENRADQWTSAITDRL 172

Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 197
             +  +   P E P  + L    +   +  D   GGFG   PKFP+P  I  +L      
Sbjct: 173 EETPDT---PGEAPGSDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL------ 223

Query: 198 EDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
                 G A  G++  L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYD
Sbjct: 224 -----RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYD 278

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q  LA+ YLDA  LT +  Y+ +  +  +++RR++    G  F+  DA S         +
Sbjct: 279 QAGLASRYLDAARLTGNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GE 331

Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT  +V D+L E  A LF + Y + P GN            F+ K  ++ ++ +
Sbjct: 332 EGTFYVWTPADVRDLLPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSAT 379

Query: 373 SAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
           +A  A +  +   +  + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +
Sbjct: 380 TADLADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVV 439

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L+ ++ +A                 A  A  F+R  L+D++T  L     NG  K  G+L
Sbjct: 440 LEDDSLAAD----------------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYL 483

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYAFL  G  DLY+       L +A++L       F D + G  + T     S++ R +
Sbjct: 484 EDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQ 543

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           E  D + PS   V+    + L      +    + + A+  L  F  R++   +    +  
Sbjct: 544 EPTDQSTPSSLGVATSLFLDLEQFAPDAG---FGEVADAVLGSFANRVRGSPLEHVSLAL 600

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHN 670
           AA+  +    +  V     ++ +  +   A  AS  L   V+   P    E+D W +E  
Sbjct: 601 AAEKAASGVPELTV-----AADEIPDEWRATLASRYLPGLVVSRRPGTDAELDAWLDELR 655

Query: 671 SNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
            + A    A    +  +     C+NF+CS P  D
Sbjct: 656 LDEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689


>gi|385803931|ref|YP_005840331.1| hypothetical protein Hqrw_2868 [Haloquadratum walsbyi C23]
 gi|339729423|emb|CCC40679.1| YyaL family protein [Haloquadratum walsbyi C23]
          Length = 768

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 236/717 (32%), Positives = 347/717 (48%), Gaps = 92/717 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+ VA +LND FV IKVDREERPD+D++Y T  Q + GGGGWPLSV+
Sbjct: 55  CHWCHVMAEESFEDDTVATILNDSFVPIKVDREERPDLDRIYQTICQLVTGGGGWPLSVW 114

Query: 82  LSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           L+PD KP   GTYFP  ++  R   PGF  I +    AW+  R  L        + L + 
Sbjct: 115 LTPDGKPFYVGTYFPKTERSDRGDTPGFLEICQSFATAWENDRSELESRANQWADTLQDR 174

Query: 139 LSASASSNKLPDEL------------PQNA-----------LRLCAEQLSKSYDSRFGGF 175
           L    +++   D              PQ             L   +    ++ D+ +GGF
Sbjct: 175 LEVDTNADTSIDVDDDDDVPAPDIASPQTDSDADDDSTMDLLTSVSTAAIRATDNEYGGF 234

Query: 176 GS-APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
           GS  PKFP+P  I+ ++  H++   +T      +        TL  MA GGI+DHVGGGF
Sbjct: 235 GSRGPKFPQPGRIEALIRAHAETNRETALDAATA--------TLDAMAAGGIYDHVGGGF 286

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRY+ D +W VPHFEKMLYD  +L+ VYL A+  T    Y+ +  +   +L R++  P G
Sbjct: 287 HRYATDRKWTVPHFEKMLYDNAELSRVYLSAYQHTGRDRYARVAHETFAFLSRELQHPEG 346

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLS 351
             +S  D   A++EG    +EG FYVWT + + + + +  I  +  + + +   GN    
Sbjct: 347 GFYSTLD---AQSEG----EEGRFYVWTPETIRNAITDQQIADIAIDRFGVTEGGN---- 395

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                   F+G  VL      S  A+K  +  ++ ++ L + R  LFD R  R RP+ D+
Sbjct: 396 --------FEGSTVLTATASVSQLATKYSLTTDEIMSQLADARDSLFDARMDRERPNRDE 447

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ +WNGL ISS AR   IL++E                +Y E+A  A SFIR HL+D 
Sbjct: 448 KILTAWNGLAISSLARGGLILETE----------------QYTELANDALSFIRTHLWDS 491

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
            + RL   +++G     G+LDDYAFL  G  DLY+     + L +A+ L  +  ELF D 
Sbjct: 492 DSGRLSRRYKDGDVDETGYLDDYAFLARGAFDLYQTTGAVEHLSFAVTLAESIVELFYDT 551

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
            G   + T  +  S++ R ++  D +  S   ++V  L  +    +   S         +
Sbjct: 552 AGETLYLTPEDAESLVARPQDLRDQSTSSSAGIAVQTLNAVDPFTSTDFSGI-------A 604

Query: 592 LAVFETRLKDMAMAVPL----MCCAADMLSVPSRKH-VVLVGHKSSVDFENMLAAAHASY 646
            AV +T   D     PL    +  AAD     +R H  V++ H +  +    + +  AS 
Sbjct: 605 GAVIDTH-ADEIRGRPLEHISLAMAADSR---ARGHDEVVIAHDTDTELSQPIRSDIAST 660

Query: 647 DLNKTVIHIDPADTEEMDFWEEH---NSNNASMARNNFSADKVVALVCQNFSCSPPV 700
            L    +   PA    ++ W +    +S  A  A  +    K     C   +CSPP 
Sbjct: 661 YLPGVPLSQRPATVSGLESWTDELGLDSPPAIWAGRHQRDSKATIYACSGRACSPPT 717


>gi|225679668|gb|EEH17952.1| DUF255 domain-containing protein [Paracoccidioides brasiliensis
           Pb03]
          Length = 865

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 214/525 (40%), Positives = 299/525 (56%), Gaps = 33/525 (6%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CHVME ESF    +A +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+VFL+P
Sbjct: 67  CHVMEKESFMAPEIAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTP 126

Query: 85  DLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           DL+P+ GG+Y+P P           G+  F  IL K++D W  ++    +S     +QL 
Sbjct: 127 DLEPVFGGSYWPGPHSNALPTLGGEGQITFVDILEKLRDVWHTQQLRCRESAKDITKQLR 186

Query: 137 EALSASASSNKLPD-----ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
           E  +   + +K  D     +L    L    +  +  YD+  GGF  APKFP PV +  ++
Sbjct: 187 E-FAEEGTHSKQSDVEAEEDLEIELLEEAYQHFASRYDAVNGGFSEAPKFPTPVNLSFLV 245

Query: 192 YHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
           + S+    + D     E S   ++ + TL  M++GGIHD +G GF RYSV   W +PHFE
Sbjct: 246 HLSRYPGAVADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGHGFARYSVTADWSLPHFE 305

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETE 307
           KMLYDQ QL +VY+DAF    D        DI  Y+    M+ P G   S+EDADS  + 
Sbjct: 306 KMLYDQAQLLDVYVDAFDSAYDPELLGAMYDIATYITSPPMLSPTGGFHSSEDADSRPSP 365

Query: 308 GATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
             T K+EGAFYVWT KE++ ILG+  A +   H+ +   GN  ++R++DPH+EF  +NVL
Sbjct: 366 NDTEKREGAFYVWTLKELKQILGQRDAEVCARHWGVLADGN--VARINDPHDEFINQNVL 423

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSF 425
                 S  A + G+  ++ + I+   R KL + R SKR RP LDDK+IV+WNGL I + 
Sbjct: 424 SIQVTPSKLAKEFGLGEDEVVRIIKGSREKLREYRESKRVRPDLDDKIIVAWNGLAIGAL 483

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP- 484
           A+ S +L++      + F             AE A  FI+ +L+DEQT +L   +R G  
Sbjct: 484 AKCSVVLENLDREKAYQF----------RRAAEEAVRFIKHNLFDEQTGQLWRIYRGGVR 533

Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 529
              PGF DDYA+LISGL++LYE       L +A +LQ   ++ FL
Sbjct: 534 GDTPGFADDYAYLISGLINLYEATFDDSHLQFAEQLQQYLNKHFL 578


>gi|448604533|ref|ZP_21657700.1| thioredoxin domain containing protein [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|445743942|gb|ELZ95422.1| thioredoxin domain containing protein [Haloferax sulfurifontis ATCC
           BAA-897]
          Length = 708

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 233/698 (33%), Positives = 351/698 (50%), Gaps = 76/698 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAEVLNEQFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
           V+L+P+ KP   GTYFPPE + G PGF+ ++    ++W   RD +   A+    AI ++L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDRDEIENRAEQWTSAITDRL 172

Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
            E   ++  A  +++ D   Q ALR          D   GGFG   PKFP+P  I  +L 
Sbjct: 173 EETPDVAGEAPGSEVLDTTVQAALR--------GADRDHGGFGGDGPKFPQPGRIDALL- 223

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
             +    +G+     E   +   +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLY
Sbjct: 224 --RGYAVSGR----HEALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLY 277

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           DQ  LA  YLDA  LT +  Y+ +  +  +++RR++    G +F+  DA S         
Sbjct: 278 DQAGLAARYLDAARLTGNESYATVAAETFEFVRRELTHDDGGLFATLDAQSG-------G 330

Query: 313 KEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           +EG FYVWT  +V  +L E  A LF + Y + P GN            F+ K  ++ ++ 
Sbjct: 331 EEGTFYVWTPDDVRGLLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSA 378

Query: 372 SSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
           ++A  A +  +   +  + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ + 
Sbjct: 379 TTADLADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGAV 438

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +L+ ++                  + A  A  F+R  L+D++T  L     NG  K  G+
Sbjct: 439 VLEDDS----------------LADDARRALDFVRERLWDDETATLSRRVMNGEVKGDGY 482

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYAFL  G  DLY+       L +A++L       F D + G  + T     S++ R 
Sbjct: 483 LEDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRP 542

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           +E  D + PS   V+    + L      +  D + + A+  L  F  R++   +    + 
Sbjct: 543 QEPTDQSTPSSLGVATSLFLDLEQF---APEDGFGEVADAVLGSFANRVRGSPLEHVSLA 599

Query: 611 CAADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-E 667
            AA+  +  VP    + +   +   ++   LA+ +    L   V+   P   EE+D W +
Sbjct: 600 LAAEKAASGVP---ELTIAADEVPDEWRETLASRY----LPGLVVSRRPGTDEELDAWLD 652

Query: 668 EHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVTD 702
           E   + A      R     D  V   C+NF+CS P  D
Sbjct: 653 ELGLDEAPPIWAGREAADGDPTV-YACENFTCSAPTHD 689


>gi|292655805|ref|YP_003535702.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
 gi|448289792|ref|ZP_21480955.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
 gi|291370452|gb|ADE02679.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
 gi|445581309|gb|ELY35670.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
          Length = 703

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 240/708 (33%), Positives = 344/708 (48%), Gaps = 96/708 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+ KP   GTYFPPE + G PGF+ I+    ++W   R+ +          +++ L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDIVESFAESWLTDREEIENRAEQWTSAITDRL 172

Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 197
             +  +   P E P  + L    +   +  D   GGFG   PKFP+P  I  ML      
Sbjct: 173 EETPDT---PGEAPGSDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDAML------ 223

Query: 198 EDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
                 G A  G++  L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYD
Sbjct: 224 -----RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYD 278

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q  LA+ YLDA  LT +  Y+ +  +  +++RR++    G  F+  DA S         +
Sbjct: 279 QAGLASRYLDAARLTGNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GE 331

Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT  +V D+L E  A LF + Y + P GN            F+ K  ++ ++ +
Sbjct: 332 EGTFYVWTPDDVRDLLPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSAT 379

Query: 373 SAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
           +A  A +  +   +  + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +
Sbjct: 380 TADLADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVV 439

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L+ ++ +A                 A  A  F+R  L+D +T  L     NG  K  G+L
Sbjct: 440 LEDDSLAAD----------------ARRALDFVRERLWDAETATLSRRVMNGEVKGDGYL 483

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYAFL  G  DLY+       L +A++L       F D + G  + T     S++ R +
Sbjct: 484 EDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQ 543

Query: 552 EDHDGAEPSGNSVSVINLVRL------------ASIVAGSKSDYYRQNA-EH-SLAVFET 597
           E  D + PS   V+    + L            A  V GS ++  R +  EH SLA+   
Sbjct: 544 EPTDQSTPSSLGVATSLFLDLEQFAPDAGFGEVADAVLGSFANRVRGSPLEHVSLALAAE 603

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
           +    A  VP +  AAD   VP      L                 AS      V+   P
Sbjct: 604 K---AASGVPELTVAAD--EVPDEWRATL-----------------ASRYFPGLVVSRRP 641

Query: 658 ADTEEMDFW-EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
              EE+D W +E   + A    A    +  +     C+NF+CS P  D
Sbjct: 642 GTDEELDAWLDELGLDEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689


>gi|398337804|ref|ZP_10522509.1| hypothetical protein LkmesMB_20984 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 630

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 242/687 (35%), Positives = 346/687 (50%), Gaps = 62/687 (9%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE++ +A  LN  ++SIKVDREERPD+D+++M  + A+   GGWPL++FL+PD K
Sbjct: 1   MERESFENQTIADYLNSHYISIKVDREERPDIDRIFMDALHAMDQQGGWPLNMFLTPDGK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P+ GGTYFPPE +YGR  F  +L  ++  W  KR  L  +     + L E+    AS  +
Sbjct: 61  PITGGTYFPPEQRYGRKSFLEVLNVIQGVWSGKRQELIAASTELAQYLKESGEGRASEKQ 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKSG 204
                P+N+           YD +FGGF +    KFP  + +  +L YH         S 
Sbjct: 121 ESGFPPENSFDAGYSLYESYYDPQFGGFKTNHVNKFPPSMGLSFLLRYH--------HSS 172

Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
                 +MV  TL  M +GGI+D VGGG  RYS D  W VPHFEKMLYD        ++ 
Sbjct: 173 GNPRALEMVENTLLAMKQGGIYDQVGGGLCRYSTDHHWLVPHFEKMLYDNSLFLESLVEY 232

Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
             ++K +       D+++YL RDM   GG I SAEDADS   EG    +EG FY+W   E
Sbjct: 233 SQVSKKIPAESFALDVIEYLHRDMRISGGGICSAEDADS---EG----EEGLFYIWDLAE 285

Query: 325 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
             ++ GE + L ++ + +   GN            F+GKN+L E +  SA A      L+
Sbjct: 286 FREVCGEDSSLLEKFWNVTEKGN------------FEGKNILHE-SYRSAVAKLDAEELK 332

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
           +    L   R+KL + RSKR RP  DDK++ SWNGL I +  +A    +           
Sbjct: 333 RIDAALDRGRKKLLERRSKRIRPLRDDKILTSWNGLYIKALVKAGAAFQ----------- 381

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
                R+E++ +AE   SFI ++L D    R+   FR+G S   G+ +DYA +I+  + L
Sbjct: 382 -----REEFLRLAEETYSFIEKNLID-SNGRILRRFRDGESGILGYSNDYAEMIAASIAL 435

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNS 563
           +E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS
Sbjct: 436 FEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSANS 493

Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
               +LV+L+  + G  SD YR+ AE     F   L   A++ P +  A       S K 
Sbjct: 494 SLSYSLVKLS--LLGVHSDRYREIAESIFLYFTKELSTHALSYPFLLSAYWSYKNHS-KE 550

Query: 624 VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA 683
           +VL+  K+S   +++LAA    +  N  V  +   + E+           +S+     S 
Sbjct: 551 IVLI-RKNSDAGKDLLAAIGKKFLPNSVVAVVSEDELEDA-------RKLSSLFDARDSG 602

Query: 684 DKVVALVCQNFSCSPPVTDPISLENLL 710
              +  VC+NF+C  PV +   LE  L
Sbjct: 603 GDALVYVCENFACKLPVNNVADLEKFL 629


>gi|302497930|ref|XP_003010964.1| hypothetical protein ARB_02862 [Arthroderma benhamiae CBS 112371]
 gi|291174510|gb|EFE30324.1| hypothetical protein ARB_02862 [Arthroderma benhamiae CBS 112371]
          Length = 714

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 223/614 (36%), Positives = 331/614 (53%), Gaps = 60/614 (9%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESF    VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+VFL+PDL+
Sbjct: 1   MEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLE 60

Query: 88  PLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           P+ GGTY+P  +    P        GF  +L K++D W+ ++    +S      QL E  
Sbjct: 61  PVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEITRQLREFA 120

Query: 140 S-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
                 +  + ++  ++L  + L       +  YD+  GGF  +PKFP PV +  +L  S
Sbjct: 121 EEGTHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVNLSFLLRLS 180

Query: 195 KKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           +  E   D     E  +  +M + T+  +A+GGI D +G GF RYSV   W +PHFEKML
Sbjct: 181 RYPEEVMDIVGREECVKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWSLPHFEKML 240

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGAT 310
           YDQ QL +V++D F  + +        D++ Y+    ++ P G  +S+EDADS  +   T
Sbjct: 241 YDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSTPILSPMGCFYSSEDADSQPSPEDT 300

Query: 311 RKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            K+EGA+YVWT KE++ ILG+  A +   H+ + P GN  ++R++DPH+EF  +NVL   
Sbjct: 301 EKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFMNRNVLRIA 358

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARA 428
              +  A + G+  E+ + IL   R KL + R +KR RP LDDK+IV+WNGLVI + A+ 
Sbjct: 359 TTPTQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGLVIGALAKC 418

Query: 429 SKILKS-EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-NGPSK 486
           + +L+  +AE +           K   ++A +A  FI+ +L+D ++ +L   +R +    
Sbjct: 419 AILLEDIDAEKS-----------KHCRQMASNAVKFIKENLFDAESGQLWRIYRADSRGD 467

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ--------------NTQ--DELFLD 530
            PGF DDYA+LISGLL LYE       L +A +LQ              N +  ++ F+ 
Sbjct: 468 TPGFADDYAYLISGLLQLYEATFDDAHLQFADKLQLCGKGKGVWLTARLNAEYLNKYFIS 527

Query: 531 REGG------GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
                     G++ T  E     P  L R+K   D A PS N V   NL+RL+S++    
Sbjct: 528 VSASDSSICTGFYMTPSEAVTDTPGALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDES 587

Query: 581 SDYYRQNAEHSLAV 594
                +   H+ AV
Sbjct: 588 YKLKARQTCHAFAV 601


>gi|162450797|ref|YP_001613164.1| hypothetical protein sce2525 [Sorangium cellulosum So ce56]
 gi|161161379|emb|CAN92684.1| hypothetical protein sce2525 [Sorangium cellulosum So ce56]
          Length = 716

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 246/719 (34%), Positives = 351/719 (48%), Gaps = 84/719 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME ESFEDE +A+ +ND FV+IKVDREERPD+D +Y   VQ +   GGWPL+V
Sbjct: 52  ACHWCHVMERESFEDEAIARHMNDLFVNIKVDREERPDLDHIYQLVVQLMGRSGGWPLTV 111

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P   GTYFPP+D  G PGF  +L K+ DA+  +RD + Q      E +  A  
Sbjct: 112 FLTPDQRPFFAGTYFPPKDALGMPGFPKVLDKIADAFRNRRDDVEQQAQEITEAIERAQR 171

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A A +  +      + LR  + QL    D R GG GS PKFP  + + ++L       D 
Sbjct: 172 APARAAGVAAPASSDLLRRASRQLLARLDPRHGGIGSRPKFPNTMALDVLLRRGVLESDR 231

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                A+EG   V  TL  M  GGI DH+ GGFHRYS DERW VPHFEKMLYD   L  +
Sbjct: 232 ----VAAEG---VELTLDRMRDGGIWDHLRGGFHRYSTDERWLVPHFEKMLYDNALLLRL 284

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y D F   K   Y+   R+I+ YL  +M  P G  ++++DADS   EG    +EG F+VW
Sbjct: 285 YADGFRAFKKPIYAETAREIVGYLFAEMRDPEGGFYASQDADS---EG----REGKFFVW 337

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRM----SDPHN-EFKGKNVLIELNDSSAS 375
           T +++ D +GE  + +            D++R+    S+  N E  G  VL +      +
Sbjct: 338 TLEQLRDAVGEDQLAY------------DMARLVFGISEEGNFEDSGATVLSQHRTLEQA 385

Query: 376 ASKL-----GMP---LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
           A+ +     G P   L++  + L   R  +   R  RPRP  DDKV+ SWNGL+I + A 
Sbjct: 386 AAVIDDGAGGGPSTHLDRCRDALARARVAMLAARDARPRPARDDKVLASWNGLLIGALAD 445

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSK 486
           A + L                D   +++ A  A + + R L   +  R+    ++G P+ 
Sbjct: 446 AGRAL----------------DEPAWVDAAARAFALLERKLL--RGGRVGRYLKDGAPAG 487

Query: 487 A---------------PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
           A               PGFLDD A+L +  LDLYE  S  +++  A  + +       D 
Sbjct: 488 ANREHGGSGAAVGDVRPGFLDDQAYLGNAALDLYEATSDPRYVDVARAIADAMIAHHWDE 547

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
              G+F T  +  +++ R ++ +D A PS  S++ +  +RL+ I      + Y   AE  
Sbjct: 548 AAPGFFFTPDDGDALIARTQDIYDQAAPSAASMAALLCLRLSEIA----DERYLSPAERQ 603

Query: 592 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 651
           L V      + A  +    C  D L+  +   VV+VG   S     +   A   Y  N+ 
Sbjct: 604 LDVLAPTALENAFGLGQTVCVLDRLTRGA-VTVVVVGEAGSASAAELTREAFKVYLPNRA 662

Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           ++ +DPA  E     E       +        D  VA  C+  +CS PVT    L+ LL
Sbjct: 663 IVLVDPARPESAAAVEVVAEGKPA------RPDGAVAYACRGRTCSAPVTTAADLKALL 715


>gi|110668468|ref|YP_658279.1| thioredoxin domain-containing protein [Haloquadratum walsbyi DSM
           16790]
 gi|109626215|emb|CAJ52671.1| YyaL family protein [Haloquadratum walsbyi DSM 16790]
          Length = 768

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 235/717 (32%), Positives = 346/717 (48%), Gaps = 92/717 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+ VA +LND FV IKVDREERPD+D++Y T  Q + GGGGWPLSV+
Sbjct: 55  CHWCHVMAEESFEDDTVATILNDSFVPIKVDREERPDLDRIYQTICQLVTGGGGWPLSVW 114

Query: 82  LSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           L+PD KP   GTYFP  ++  R   PGF  I +    AW+  R  L        + L + 
Sbjct: 115 LTPDGKPFYVGTYFPKTERSDRGDTPGFLEICQSFATAWENDRSELESRANQWADTLQDR 174

Query: 139 LSASASSNKLPDEL------------PQNA-----------LRLCAEQLSKSYDSRFGGF 175
           L    + +   D              PQ             L   +    ++ D+ +GGF
Sbjct: 175 LEVDTNVDTNIDVDDDDDVPAPDIASPQTDSDADDDSTMDLLTSVSTAAIRATDNEYGGF 234

Query: 176 GS-APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
           GS  PKFP+   I+ ++  H++   +T      +        TL  MA GGI+DHVGGGF
Sbjct: 235 GSRGPKFPQTGRIEALIRAHAETNRETALDAATA--------TLDAMAAGGIYDHVGGGF 286

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRY+ D +W VPHFEKMLYD  +L+ VYL A+  T    Y+ +  +   +L R++  P G
Sbjct: 287 HRYATDRKWTVPHFEKMLYDNAELSRVYLSAYQHTGRDRYARVAHETFAFLSRELQHPEG 346

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLS 351
             +S  D   A++EG    +EG FYVWT + + + + +  I  +  + + +   GN    
Sbjct: 347 GFYSTLD---AQSEG----EEGRFYVWTPETIRNAITDQQIADIAIDRFGVTEGGN---- 395

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                   F+G  VL      S  A+K  +  ++ ++ L + R  LFD R  R RP+ D+
Sbjct: 396 --------FEGSTVLTATASVSQLATKYSLTTDEIMSQLADARDSLFDARMDRERPNRDE 447

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ +WNGL ISS AR   IL++E                +Y E+A  A SFIR HL+D 
Sbjct: 448 KILTAWNGLAISSLARGGLILETE----------------QYTELANDALSFIRTHLWDS 491

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
            + RL   +++G     G+LDDYAFL  G  DLY+     + L +A+ L  +  ELF D 
Sbjct: 492 DSGRLSRRYKDGDVDETGYLDDYAFLARGAFDLYQTTGAVEHLCFAVTLAESIVELFYDA 551

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
            G   +    +  S++ R ++  D + PS   ++V  L  +    +   S         +
Sbjct: 552 AGETLYLAPEDAESLVARPQDLRDQSTPSSAGIAVQTLNAVDPFTSTDFSGI-------A 604

Query: 592 LAVFETRLKDMAMAVPL----MCCAADMLSVPSRKH-VVLVGHKSSVDFENMLAAAHASY 646
            AV +T   D     PL    +  AAD     +R H  V++ H +  +   ++ +  AS 
Sbjct: 605 GAVIDTH-ADEIRGRPLEHISLAMAADSR---ARGHDEVVIAHDTDTELSQLIRSDIAST 660

Query: 647 DLNKTVIHIDPADTEEMDFWEEH---NSNNASMARNNFSADKVVALVCQNFSCSPPV 700
            L    +   PA    ++ W +    +S  A  A  +    K     C   +CSPP 
Sbjct: 661 YLPGVPLSQRPATVSGLESWTDELGLDSPPAIWAGRHQRDSKATIYACSGRACSPPT 717


>gi|283778697|ref|YP_003369452.1| hypothetical protein Psta_0907 [Pirellula staleyi DSM 6068]
 gi|283437150|gb|ADB15592.1| protein of unknown function DUF255 [Pirellula staleyi DSM 6068]
          Length = 667

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 234/614 (38%), Positives = 324/614 (52%), Gaps = 74/614 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT----YVQALYG--G 73
           ++CHWCHVME ESF D  +AKLLN+ F+ IKVDREERPD+D +YMT    Y+Q   G  G
Sbjct: 81  SSCHWCHVMERESFLDPEIAKLLNENFICIKVDREERPDIDTIYMTAVQTYLQLTTGRRG 140

Query: 74  GGWPLSVFLSPDLKPLMGGTYFPPED--KYGRPGFKTILRKVKDAWDKKRDMLAQSGA-- 129
           GGWP++VFL+P+  P  GGTYFP  D  + G  GF T+  KV + W K+   L       
Sbjct: 141 GGWPMTVFLTPEGNPFFGGTYFPARDGDREGMTGFLTLSSKVSEMWKKEPVKLGDDATTL 200

Query: 130 --FAIEQLS--EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAP 179
             F  +QL   + L A     KL   + +         L+  +D R+GGFG        P
Sbjct: 201 ARFIKDQLEGPKLLLAVVLDTKLTTSVEKG--------LAAQFDERYGGFGFDEIEWQRP 252

Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           KFP P  +Q +L   KK         ASE + M++ TL  MA GGI+DHVGGGFHRYSVD
Sbjct: 253 KFPEPSNLQFLLEIVKKTP-------ASESRAMLVHTLDRMAMGGIYDHVGGGFHRYSVD 305

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
             W +PHFEKMLYD GQL  VY +A++LT D  Y  I R+  +++ R+M    G  ++A 
Sbjct: 306 RMWRIPHFEKMLYDNGQLLTVYSEAYALTGDENYQRIARETAEFMLREMRDTSGGFYAAL 365

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
           D   AETEG     EG FY W   EVE +L       KE + L  +    LSR  +    
Sbjct: 366 D---AETEGV----EGKFYRWDKAEVEKLLT------KEEFELY-SAVYGLSRAPNFEET 411

Query: 360 FKGKNVLIELNDSSASASKLG-MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           F     +I+L D+    +K   + +EK +N L     KL   R+ R RP  D K++   N
Sbjct: 412 F----YVIQLRDTLVDIAKTREITVEKLVNDLRPIHAKLLAARNARKRPLTDTKILAGEN 467

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
           GL I+  A A K+LK                   Y E A +AA+ +   +   +  RL  
Sbjct: 468 GLAITGLATAGKLLKE----------------PRYTEAAATAATLVLSKMTAPE-GRLFR 510

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
           ++    +K   +L DY+ L+ GLL L+E     +WL  AI+L + Q ELF D   GG++ 
Sbjct: 511 TYSGEKAKLNAYLSDYSMLVEGLLALHEATGEQRWLDEAIKLTDQQVELFHDVPRGGFYF 570

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
           T+ +  S+L RVKE  D A P+GNSV+ +NLV+L  I   ++   Y + AE ++     +
Sbjct: 571 TSKDHESLLARVKETVDSAMPAGNSVAAVNLVKLVKITGKNE---YLKLAEGAIQSAAGQ 627

Query: 599 LKDMAMAVPLMCCA 612
           +++     P +  A
Sbjct: 628 MQENPTVSPRLATA 641


>gi|448448658|ref|ZP_21591316.1| hypothetical protein C470_01183 [Halorubrum litoreum JCM 13561]
 gi|445814276|gb|EMA64242.1| hypothetical protein C470_01183 [Halorubrum litoreum JCM 13561]
          Length = 740

 Score =  355 bits (912), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 236/721 (32%), Positives = 344/721 (47%), Gaps = 86/721 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA ++N+ FV IKVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
            + +P+ KP   GTYFPPE +   PGF+ +  ++ D+W          ++ D  A+S   
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARD 172

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
            +E +    +  +           + L   A    + YD   GGFGS   KFP P  I +
Sbjct: 173 ELESVPTPEAVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDL 232

Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
           ++              A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VP
Sbjct: 233 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 281

Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
           HFEKMLYD  +L   YLD + L  D  Y+ +  + L +L R++   GG  FS  DA S  
Sbjct: 282 HFEKMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRP 341

Query: 306 TEG--------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDP 356
            EG        +    EGAFYVWT +EV+ +L E A  L KE Y ++  GN +       
Sbjct: 342 PEGRRGDDTGDSDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE------- 394

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
               +G  V          A+      ++    L   R  LFD R +RPRP  D+KV+ +
Sbjct: 395 ----RGTTVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVLAA 450

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTH 474
           WNG  IS+FARA   L                  + Y E+A  A  F R  LYD   +T 
Sbjct: 451 WNGRAISAFARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESETG 493

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
            L   + +G  + PG+LDDYAF+  G LD+Y      + L +A+EL +   + F D + G
Sbjct: 494 ALARRWLDGDVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDADDG 553

Query: 535 GYFNTTGEDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YY 584
             + T   D           ++ R +E  D + PS   V+   L    +++ G ++D   
Sbjct: 554 TIYFTRDRDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGEL 609

Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
           R+ AE  +     R++   +    +  AA+++       V +   +   D+   L   + 
Sbjct: 610 REIAERVVTTHADRIRGSPLEHASLVRAANVVET-GGIEVTIAADEVPDDWRETLGERY- 667

Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVT 701
              L   ++   PA  + +D W +     A+    A    +  +  A VC+ F+CSPP T
Sbjct: 668 ---LPGALVAPRPATEDGLDEWLDRLDMTAAPPIWADRGATDGEPTAYVCEGFTCSPPRT 724

Query: 702 D 702
           D
Sbjct: 725 D 725


>gi|448529052|ref|ZP_21620367.1| hypothetical protein C467_01076 [Halorubrum hochstenium ATCC
           700873]
 gi|445709758|gb|ELZ61582.1| hypothetical protein C467_01076 [Halorubrum hochstenium ATCC
           700873]
          Length = 744

 Score =  355 bits (912), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 239/721 (33%), Positives = 347/721 (48%), Gaps = 85/721 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA ++ND FV IKVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAGVINDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
            + +P+ KP   GTYFP E +  +PGF+ +  ++ D+W          ++ D  A+S   
Sbjct: 113 AWCTPEGKPFYVGTYFPLEARRNQPGFRDLCERIADSWSDPEQREEMRRRADQWAESARD 172

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
            +E +    +A               L   A    + YD  +GGFGS   KFP P  I +
Sbjct: 173 ELESVPTPDAADPDGEGDASPPGDGLLESAAASALRGYDDEYGGFGSGGAKFPMPGRIDL 232

Query: 190 MLY-HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
           ++  +++   D   S  A         TL  MA+GG++D +GGGFHRY+VD  W VPHFE
Sbjct: 233 LMRAYARSGRDALLSAAAG--------TLDGMARGGMYDQIGGGFHRYAVDREWTVPHFE 284

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYD  +L   YLD + LT D  Y+ +  + L +L R++    G  FS  DA S   E 
Sbjct: 285 KMLYDNAELPMAYLDGYRLTGDPAYARVASESLAFLDRELRRDDGGFFSTLDARSRPPE- 343

Query: 309 ATRKK----------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPH 357
            +R+           EGAFYVWT +EV+ +L E A  L KE Y ++P GN +        
Sbjct: 344 -SRRDGNESEEGEDVEGAFYVWTPEEVDAVLDEPAASLVKERYGIRPGGNFE-------- 394

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
              +G  V          A+   +  E+    L E R  LFD R  RPRP  D+KV+ SW
Sbjct: 395 ---RGTTVPTLAASVDELAADRDLSPEEVREALTEARTALFDARESRPRPARDEKVLASW 451

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTHR 475
           NG  IS+FA A+  L                  + Y ++A  A  F R  LYD   +T  
Sbjct: 452 NGRAISAFADAAGTLG-----------------EPYADIAREALDFCRDRLYDPEAETGA 494

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L   + +G  + PG+LDDYAFL  G LD+Y      + L +A+EL       F D + G 
Sbjct: 495 LARRWLDGDVRGPGYLDDYAFLARGALDVYAATGDLEPLGFALELAEALVAEFYDADDGT 554

Query: 536 -YFNTT---------GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YY 584
            YF  +         G+   ++ R +E  D + PS   V+   L    +++ G ++D  +
Sbjct: 555 IYFTRSLDGRESGGDGDAGPLMARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGRF 610

Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
           R  A   +     R++   +    +  AAD++       V +   +   ++   L   + 
Sbjct: 611 RDVARRVVTTHADRIRGGPLEHASLVRAADLVET-GGIEVTVAADEVPDEWRETLGERY- 668

Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVT 701
              L   ++   PA    +D W +      +    A  + +  +  A VC++F+CSPP T
Sbjct: 669 ---LPSALVAPRPATEAGLDEWLDRLDMAEAPPIWAGRDATDGEPTAYVCRDFTCSPPRT 725

Query: 702 D 702
           D
Sbjct: 726 D 726


>gi|358063474|ref|ZP_09150085.1| hypothetical protein HMPREF9473_02147 [Clostridium hathewayi
           WAL-18680]
 gi|356698267|gb|EHI59816.1| hypothetical protein HMPREF9473_02147 [Clostridium hathewayi
           WAL-18680]
          Length = 682

 Score =  355 bits (912), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 216/580 (37%), Positives = 305/580 (52%), Gaps = 64/580 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ SF    +  +  FL    +TCHWCHVME ESFE+EG+A ++N  FV +KVDREERPD
Sbjct: 36  GKESFEKAEREDKPIFLSIGYSTCHWCHVMEEESFENEGIAGIMNREFVCVKVDREERPD 95

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD VYM+  QA+ G GGWPL++ ++P+ +P   GTY PP  +YGR G   +L  V   W 
Sbjct: 96  VDSVYMSVCQAMTGQGGWPLTIIMTPECRPFFAGTYLPPVRRYGRMGLAELLNSVAKQWK 155

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           + R  L +S     EQ+ +A     +   +  E+ +  +    +QL +S+D   GGFG A
Sbjct: 156 ENRQQLFRSA----EQI-QAFLRQQTEMDVEGEVSKALVSQGYQQLERSFDEIHGGFGGA 210

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P       +H   L D G   +  E   MV  TL  M +GGI DH+GGGF RYS 
Sbjct: 211 PKFPTP-------HHLLFLMDYGVRRDVPEAFYMVDRTLVQMYRGGIFDHIGGGFSRYST 263

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DERW VPHFEKMLYD   L   Y  A+ +T    Y+ +   IL Y++ ++   GG  +  
Sbjct: 264 DERWLVPHFEKMLYDNALLTLAYAKAYGITGKKLYAEVAGRILGYVKAELTDEGGGFYCG 323

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
           +DADS          EG +YV+T +E+  +LG      F   Y +  +GN          
Sbjct: 324 QDADSDGV-------EGKYYVFTPEEIRAVLGNADGERFLARYGMTGSGN---------- 366

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
             F+GK +   L D      ++  P         E  R+L++ R  R R H DDK++VSW
Sbjct: 367 --FEGKWI-PNLLDYQGDLEEM-QP---------EKDRRLYEYRLARARLHKDDKILVSW 413

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NG +I++  RA  +L+ +A                Y+E+A  A +F+R  L  +   RL 
Sbjct: 414 NGWMITACGRAGAVLEEDA----------------YVEMAVRAEAFLREKLVKD--GRLM 455

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
             +R+G +   G LDDYA     L++LYE    T +L  A EL +   E F D E GG++
Sbjct: 456 VRYRDGEAAGEGKLDDYACYCQALVELYEVTYETDYLRRARELADVMVEQFFDGERGGFY 515

Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 577
               +   +++R KE +DGA PSGNSV+ + L +L  I  
Sbjct: 516 LYAKDGEELIVRTKETYDGAMPSGNSVAALVLEQLGRITG 555


>gi|83649209|ref|YP_437644.1| hypothetical protein HCH_06582 [Hahella chejuensis KCTC 2396]
 gi|83637252|gb|ABC33219.1| Highly conserved protein containing a thioredoxin domain [Hahella
           chejuensis KCTC 2396]
          Length = 762

 Score =  355 bits (912), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 222/599 (37%), Positives = 320/599 (53%), Gaps = 68/599 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF++E VA+ LN +F+ IKVDRE+RPD+D++YMT VQ + G GGWP+S
Sbjct: 80  STCHWCHVMEEESFDNEEVAQTLNGYFIPIKVDREQRPDLDEIYMTAVQIITGHGGWPMS 139

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            FL+P+  P  G TYFP      RP F  +LRKV + W+++++ L + G     +LSEA+
Sbjct: 140 SFLTPEGNPFFGATYFP------RPRFINLLRKVHELWEEQQENLLEQG----RRLSEAV 189

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S       + + L +N +    E+L    D  +GGFGS PKFP+   +  +L     +E 
Sbjct: 190 SVYLRPKPISETLAENLIETAMEKLIGYSDREWGGFGSEPKFPQEPNLLFLL---DIIER 246

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             +  +      +V   L  +  GG++D  GGGFHRY+VD+RW VPHFEKMLY+Q QLA 
Sbjct: 247 DSRPLDRQPAWTVVKTALDALLAGGVYDQAGGGFHRYAVDQRWLVPHFEKMLYNQAQLAR 306

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            ++ A+ L++D  Y  ICR+ LDY+ R+M  P G  +SA DADS   EG    +EG ++V
Sbjct: 307 CFIRAYKLSQDPEYLRICRETLDYVLREMRSPEGVFYSATDADS---EG----EEGKYFV 359

Query: 320 WTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W  +E+  +L    +   E  Y +   GN            F+G N+L        SA+ 
Sbjct: 360 WAYQELSQLLDTPGLALAEQVYGVTRKGN------------FEGANILYLPRPLQKSAAT 407

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG+  E+ L  L + +  L   RS+R  P  DDKVI  WNG++I++ A  + I    A  
Sbjct: 408 LGLTYEELLQQLADLKAILLQTRSQRVPPLRDDKVITEWNGMMIAALAETAAITGISA-- 465

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDDYAF 496
                         Y + A  AA+ + R    E    HR+  S  N PS     L+DY  
Sbjct: 466 --------------YGDAAVIAANQLWRSQRGEDGLFHRI--SLDNLPSDD-ALLEDYVH 508

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT--TGEDPSVLLRVKEDH 554
            + GLL LY++     WL     L  T +E FLD E GG+F T  + + P +L+R K   
Sbjct: 509 YMEGLLQLYDYTHDHLWLERLEALTTTLEEQFLDAEQGGFFITPQSAQGP-LLVRSKHCS 567

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSK---SDYYRQN-AEHSLAVFETRLKDMAMAVPLM 609
           D A  SGNS       +LAS++A  +    D   Q  AE+ +A F  ++    ++ P+ 
Sbjct: 568 DNATISGNS-------QLASVLAALRLRTGDLNVQRMAENQIAAFTGQINRHPLSAPVF 619


>gi|313126304|ref|YP_004036574.1| hypothetical protein Hbor_15590 [Halogeometricum borinquense DSM
           11551]
 gi|448286147|ref|ZP_21477382.1| hypothetical protein C499_05218 [Halogeometricum borinquense DSM
           11551]
 gi|312292669|gb|ADQ67129.1| hypothetical protein containing a thioredoxin domain
           [Halogeometricum borinquense DSM 11551]
 gi|445575198|gb|ELY29677.1| hypothetical protein C499_05218 [Halogeometricum borinquense DSM
           11551]
          Length = 725

 Score =  355 bits (911), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 233/700 (33%), Positives = 338/700 (48%), Gaps = 81/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED+ VA +LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFEDDDVAAVLNESFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           V+L+P  KP   GTYFP E++  R   PGF  + R   +AW+  R+ +          + 
Sbjct: 113 VWLTPQGKPFYVGTYFPKEERRDRGNVPGFLDLCRSFAEAWENDREEIENRAQQWTAAIQ 172

Query: 137 EALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
           + L A+      P E P    L   A+   +  D  +GGFGS  PKFP+P  ++ +L   
Sbjct: 173 DQLEATPDD---PGESPGTEILGEVAKAALRGADREYGGFGSGGPKFPQPGRVEALLRSY 229

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
                   SGE  E   + + TL  MA GG++DHVGGGFHRY+ D +W VPHFEKMLYD 
Sbjct: 230 V------HSGE-DEPLTVAMETLDAMAGGGMYDHVGGGFHRYATDRQWTVPHFEKMLYDN 282

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            ++  VYL A  LT    Y+ + R+  D++ R++  P G  FS  DA S         +E
Sbjct: 283 AEIPRVYLAAHRLTGRADYAEVARETFDFVARELRHPDGGFFSTLDAQSG-------GEE 335

Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           G FYVWT ++V + L +   A +F ++Y +   GN +            G  VL      
Sbjct: 336 GTFYVWTPEQVHEALADETRAEVFCDYYGVTSGGNFE-----------NGTTVLTVSATV 384

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
            + A + G+  ++  + L   R  LFD R  R RP  D+KV+  WNGL+ISS A+ + +L
Sbjct: 385 DSVADEHGLTTDEVTDHLDAARETLFDTRESRTRPPRDEKVLAGWNGLMISSLAQGALVL 444

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                              EY E+A  A  F R HL+DE   RL   F++G  K  G+L+
Sbjct: 445 GD-----------------EYAELAADALGFAREHLWDESEGRLSRRFKDGDVKGEGYLE 487

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL  G  DLY+       L +A+EL       F D   G  + T  +  +++ R +E
Sbjct: 488 DYAFLARGAFDLYQATGDVDHLAFAVELAREIVASFYDDAAGTLYFTPDDGEALVTRPQE 547

Query: 553 DHDGAEPSGNSVSVINLVRL--------ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 604
             D + PS   V+   L+ L         + VAGS  D +             R++   +
Sbjct: 548 LQDQSTPSSVGVATSLLLDLDAFAPDADFAAVAGSVLDTHAD-----------RIRGRPL 596

Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
               +  AA+  +      +V+ G      F   LA  +    +   V+ I P   +++ 
Sbjct: 597 EHVSLALAAEKRAR-GGSEIVVAGDSLPDSFRQSLAERY----VPDAVLSIRPPTDDDLT 651

Query: 665 FWEE----HNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
            W +     ++      R     +  V   C+  +CSPP 
Sbjct: 652 PWLDTLGVEDAPPVWQGREMRDGEPTV-YACEGRACSPPT 690


>gi|395645901|ref|ZP_10433761.1| hypothetical protein Metli_1447 [Methanofollis liminatans DSM 4140]
 gi|395442641|gb|EJG07398.1| hypothetical protein Metli_1447 [Methanofollis liminatans DSM 4140]
          Length = 690

 Score =  355 bits (911), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 237/692 (34%), Positives = 342/692 (49%), Gaps = 65/692 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED GVA++LN+ FV++KVDREERPD+D VYM    AL G GGWPL+
Sbjct: 55  STCHWCHVMAEESFEDAGVAEVLNEGFVAVKVDREERPDIDAVYMQVCLALTGRGGWPLT 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD  P    TY P E + G  G   +L+K++  W+ +RD L  S      ++ + L
Sbjct: 115 IVMTPDRLPFFAATYLPKETRLGVTGLIDVLKKIRHLWETRRDDLVGSA----REIVDDL 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A AS   L  +     LR    ++ + YD  +GGF  +PKFP P    M+++  +    
Sbjct: 171 GAGAS---LRGKAETALLREGYAEMKRRYDPSYGGFDRSPKFPSP---HMIIFLIRYWHW 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG     +  ++    TL+ +  GGI D +G G HRY+ D +W VPHFEKMLYDQ  LA 
Sbjct: 225 TGDPMALAMAEQ----TLREVRGGGIFDQIGFGVHRYATDRKWLVPHFEKMLYDQAMLAL 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            + +A   T D FY     +I  Y++RD+  P G  ++AEDADS   EG     EG FY+
Sbjct: 281 AFTEAHMATGDAFYLSAADEIFTYVQRDLASPEGAFYTAEDADS---EGV----EGKFYL 333

Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT++EV   + GE A LF E Y +   G+ D+     PH     + +          +  
Sbjct: 334 WTAEEVRSAVGGEDAALFIEAYGIG-EGSGDI-----PHRAVSPQVL----------SRT 377

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G+P ++    L   R KL  VR  R RPH D+K+++ WN L++++ ARA +        
Sbjct: 378 TGIPEDEIRRRLEAVREKLLSVRKGRARPHRDEKILLDWNALMVAALARAGRY------- 430

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                    S R  Y+  A+ AA  +   L       L H + +G +   G L DYA+L+
Sbjct: 431 ---------SGRTGYVAAAQGAAGVLLDRLRRPDGG-LLHRYMDGEAAVSGMLADYAYLV 480

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
             L ++YE     + L  A  L +   E F D  GGG++  + +   ++LR KE HDGA 
Sbjct: 481 WALAEVYEASFDPEILREACRLADAMIERFGDPSGGGFYTVSADGEQLILRQKEIHDGAL 540

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNS+++  LV L  +   S+   Y + +  S   F         A      A    S 
Sbjct: 541 PSGNSMALFALVTLFRLTGLSR---YWEASSSSFDAFAGDAGRNPSAHAWYMAALLAAST 597

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
            S   +V+ G         ML    +SY  N TV+     D    D   E   + A M+ 
Sbjct: 598 KS-DELVIAGEGDDPATRKMLDLVASSYRPNLTVLL---KDRRSADVLAEVAPHTALMSA 653

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 K  A +C+  +C  PVT P  L+ +L
Sbjct: 654 QG---GKATAYLCRGTACEQPVTSPEDLDKIL 682


>gi|448414488|ref|ZP_21577557.1| hypothetical protein C474_02196 [Halosarcina pallida JCM 14848]
 gi|445682054|gb|ELZ34478.1| hypothetical protein C474_02196 [Halosarcina pallida JCM 14848]
          Length = 725

 Score =  355 bits (910), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 230/695 (33%), Positives = 343/695 (49%), Gaps = 71/695 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE VA++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMAEESFEDEAVARVLNESFVPVKVDREERPDLDRIYQTICQLVSGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           V+L+P+ KP   GTYFP E++  R   PGF  +     +AW+  R+ +        EQ +
Sbjct: 113 VWLTPEGKPFYVGTYFPKEERRDRGNVPGFLDLCESFANAWETDREEIENRA----EQWT 168

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKS----YDSRFGGFGS-APKFPRPVEIQMML 191
           +AL       + PDE+ +        +++K+     D  +GGFGS  PKFP+P  I+ +L
Sbjct: 169 DALKDQL--EETPDEVGEAPGTEVLGEVTKAALRGADREYGGFGSGGPKFPQPGRIEALL 226

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
                      SGE  E   + +  L  MA GG++DHVGGGFHRY+ D +W VPHFEKML
Sbjct: 227 RSYV------HSGE-EEPLDVAMEALDAMAGGGMYDHVGGGFHRYATDRQWTVPHFEKML 279

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD  ++  VYL A  LT    Y+ + R+  D++ R++  P G  +S  DA S        
Sbjct: 280 YDNAEIPRVYLAAHRLTGREAYADVARETFDFVARELRHPDGGFYSTLDAQS-------D 332

Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            +EG FYVWT +EV + L +   A +F ++Y +   GN +            G  VL   
Sbjct: 333 GEEGTFYVWTPEEVRETLDDETRADVFCDYYGVTADGNFE-----------NGTTVLTVS 381

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                 A + G+  E+ ++ L   R  LF+ R  R RP  D+KV+  WNGL++SS A+ S
Sbjct: 382 APIDEVAEERGLTTEEAVDHLDAARETLFEARESRTRPPRDEKVLAGWNGLMVSSLAQGS 441

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L                   EY E+A  A  F+R HL+D    RL   F++G  K  G
Sbjct: 442 LVLGD-----------------EYAELAADALGFVREHLWDSDEKRLSRRFKDGDVKGDG 484

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           +L+DYAFL  G  DLY+       L +A++L     E F D   G  + T  +  +++ R
Sbjct: 485 YLEDYAFLARGAFDLYQATGDVDHLAFAVDLSRALVESFYDESAGTLYFTPADGETLVTR 544

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            +E  D + PS   V+   L+ L S    +    +   A   L     R++   +    +
Sbjct: 545 PQELQDQSTPSSVGVAASLLLDLDSFAPDAD---FASVAGSVLDTHADRIRGRPLEHVSL 601

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW--- 666
             A++  +    + VV     S+    +    A A+  +  +V+ + P   +E+  W   
Sbjct: 602 ALASEKRARGGSEIVV-----SADALPDSFREALATRYVPGSVLSVRPPTDDELAPWLDV 656

Query: 667 -EEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
            +   +      R     +  V   C+  +CSPP 
Sbjct: 657 LDLTEAPPVWKGREMRDGEPTV-YACEGRACSPPA 690


>gi|448639421|ref|ZP_21676747.1| thioredoxin [Haloarcula sinaiiensis ATCC 33800]
 gi|445762700|gb|EMA13918.1| thioredoxin [Haloarcula sinaiiensis ATCC 33800]
          Length = 717

 Score =  355 bits (910), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 226/693 (32%), Positives = 349/693 (50%), Gaps = 60/693 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFEDE +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +
Sbjct: 58  CHWCHVMEEESFEDEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
           L+P+ +P   GTYFPPE+K G+PGF  +L+++ ++W   +++ +M   AQ    AIE   
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLANSWSDPEQREEMENRAQQWTEAIESDL 177

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
           EA  A       P++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +
Sbjct: 178 EATPAD------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
              D G+     +   +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  
Sbjct: 229 AYSDGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKE 314
           ++   +L  +       Y+ + R+  ++++R++  P G  FS  DA+SA  +      +E
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPPDDPDGDSEE 344

Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           G FYVWT +EV + + +   A +F +++ +   GN            F+G  VL      
Sbjct: 345 GLFYVWTPEEVHEAVDDETDAEVFCDYFGVTERGN------------FEGATVLAVRKPV 392

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           +  A +     +     L     + F  R  RPRP  D+KV+  WNGL+I + A  + +L
Sbjct: 393 AVLAEEYDRSEDDITASLQRALNETFKARKSRPRPARDEKVLAGWNGLMIRALAEGAIVL 452

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                              +Y +VA  A SF+R+HL+D    RL   +++      G+L+
Sbjct: 453 DD-----------------QYADVAADALSFVRKHLWDADAGRLNRRYKDDDVAIDGYLE 495

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL  G L L+E     + L +A++L     E F D E G  F T     S++ R +E
Sbjct: 496 DYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQE 555

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D + PS   V+V  L+ L+     S+ D +   AE  +     R+    +    +  A
Sbjct: 556 LTDQSTPSSTGVAVDLLLSLSHF---SEDDRFESVAERVIRTHADRVSSNPLQHASLTLA 612

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEH 669
            D     + + + LVG +S  D+        A   + + ++   PA+    + W    E 
Sbjct: 613 TDTYEQGALE-LTLVGDQS--DYPTEWTETLAEQYIPRRLLAHRPAEKSRFEQWLDTLEV 669

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           + +    A      D+     C+NF+CSPP  D
Sbjct: 670 DESPPIWAGRTQVDDRPTVYACRNFACSPPKHD 702


>gi|317122770|ref|YP_004102773.1| hypothetical protein [Thermaerobacter marianensis DSM 12885]
 gi|315592750|gb|ADU52046.1| hypothetical protein Tmar_1963 [Thermaerobacter marianensis DSM
           12885]
          Length = 738

 Score =  355 bits (910), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 262/728 (35%), Positives = 362/728 (49%), Gaps = 101/728 (13%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME E FED  +A+ +N  FV++KVDREERPD+D+VY T  Q L  GGGWPL+V
Sbjct: 55  ACHWCHVMERECFEDPAIAEQMNRGFVNVKVDREERPDLDQVYQTAAQILGSGGGWPLTV 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PDLKP   GTYFPPED++G PGF  +L  V DA+  +RD + +     +E L  +  
Sbjct: 115 FLTPDLKPFFAGTYFPPEDRHGLPGFPKVLDAVLDAYRHRRDDVERVANRVVEILRRSAG 174

Query: 141 ASASSNKLPDELPQNA-----LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---- 191
              ++ +     P        ++  A ++++ YD ++GGFG APKFP    + ++L    
Sbjct: 175 GPGAAEEPAGAAPAREAARQWIQRAATRIARRYDPQYGGFGRAPKFPHATGLAVLLRAGV 234

Query: 192 --------------YHSKKLEDTGKSGEA-------SEGQK----MVLFTLQCMAKGGIH 226
                           S     T +SG A        E  +    M L TLQ MA GG+ 
Sbjct: 235 ARTPGGPGPSGTTGSGSSGSPGTARSGTADLVAGDVPENPRRHLDMALHTLQAMALGGLF 294

Query: 227 DHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR 286
           DH+ GGFHRY+ D  W +PHFEKMLYDQ QL  +YLDA+ LT D FY+ + R  L ++  
Sbjct: 295 DHLAGGFHRYATDRAWLIPHFEKMLYDQAQLVPLYLDAYRLTGDPFYAGVARQTLHFVLD 354

Query: 287 DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKP 344
           +M  P G   S  DADS   EG    +EGA+YVWT  ++ + LG  + A L    + +  
Sbjct: 355 EMTAPEGGFISTLDADS---EG----REGAYYVWTPDQLREALGDPDEAALAARWFGVTE 407

Query: 345 TGNCDLSRMSDPHNEFKGKNVL---IELNDSSASASKLGMPLEKYLNILGECRRKLFDVR 401
            GN +            G  VL   +   D  A A + G   ++    L   RR+L D R
Sbjct: 408 EGNFE-----------DGTTVLYRAVADQDLPALAREWGTNRDELQRRLESIRRRLLDAR 456

Query: 402 SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAA 461
            +R  P  DDK++V WNGL+I++FA+A+ +L                D   Y   A  AA
Sbjct: 457 RRRTPPGRDDKILVGWNGLMIAAFAQAAPVL----------------DEPGYAAAARRAA 500

Query: 462 SFIRRHLYDEQTH-RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 520
            FI   L   + H RL H++R  P   PGFL DYAFLI GLL L+      +WL  A  L
Sbjct: 501 EFILGTL--RRPHGRLLHAYRGRPLDVPGFLPDYAFLIGGLLALHAADGDPRWLEEADRL 558

Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
                E F D   G +++   E  + L+R  E  D A P+G++ +   L RLA I   + 
Sbjct: 559 ARPMIETFWDDAAGVFYDAPEEAGTPLVRPVELFDQALPAGSAAAATVLARLAVI---TG 615

Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVP-LMCCAADMLSVPSRKHVVLVGHKSS---VDFE 636
            + YR+ AE  L        +  +A+   +   AD L       V LVG  ++    ++ 
Sbjct: 616 DEEYRRIAEAYLRRAAALAAEQPLAMASTVLLQADQLE--GYTEVTLVGDPAAPVLAEWR 673

Query: 637 NMLAAAHASYDLNKTVIHIDPAD--TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
             L    A + L   V+ + P D  TE    WE  +  +           + VA VC+NF
Sbjct: 674 RRL----AGFYLPGLVLTVRPPDAGTERRAVWEGRDPVDG----------RPVAYVCRNF 719

Query: 695 SCSPPVTD 702
           SCS P TD
Sbjct: 720 SCSLPQTD 727


>gi|448658484|ref|ZP_21682884.1| thioredoxin [Haloarcula californiae ATCC 33799]
 gi|445761209|gb|EMA12458.1| thioredoxin [Haloarcula californiae ATCC 33799]
          Length = 717

 Score =  355 bits (910), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 226/693 (32%), Positives = 349/693 (50%), Gaps = 60/693 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFEDE +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +
Sbjct: 58  CHWCHVMEEESFEDEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
           L+P+ +P   GTYFPPE+K G+PGF  +L+++  +W   +++ +M   AQ    AIE   
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLSGSWSDPEQREEMENRAQQWTEAIESDL 177

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
           EA  A       P++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +
Sbjct: 178 EATPAD------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
              D G+     +   +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  
Sbjct: 229 AYADGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKE 314
           ++   +L  +       Y+ + R+  ++++R++  P G  FS  DA+SA  +      +E
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPPDDPDGDSEE 344

Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           G FYVWT +EV + + +   A +F +++ +   GN            F+G  VL      
Sbjct: 345 GLFYVWTPEEVHEAVDDETDAEVFCDYFGVTERGN------------FEGATVLAVRKPV 392

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           +  A +     +     L     + F+ R  RPRP  D+KV+  WNGL+I + A  + +L
Sbjct: 393 AVLAEEYDRSEDDITASLQRALNETFEARKSRPRPARDEKVLAGWNGLMIRALAEGAIVL 452

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                              +Y +VA  A SF+R+HL+D    RL   +++      G+L+
Sbjct: 453 DD-----------------QYADVAADALSFVRKHLWDADAGRLNRRYKDDDVAIDGYLE 495

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL  G L L+E     + L +A++L     E F D E G  F T     S++ R +E
Sbjct: 496 DYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQE 555

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D + PS   V+V  L+ L+     S+ D +   AE  +     R+    +    +  A
Sbjct: 556 LTDQSTPSSTGVAVDLLLSLSHF---SEDDRFESVAERVIRTHADRVSSNPLQHASLTLA 612

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEH 669
            D     + + + LVG +S  D+        A   + + ++   PA+    + W    E 
Sbjct: 613 TDTYEQGALE-LTLVGDQS--DYPTEWTETLAEQYIPRRLLAHRPAEKSRFEQWLDTLEV 669

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           + +    A      D+     C+NF+CSPP  D
Sbjct: 670 DESPPIWAGRTQVDDRPTVYACRNFACSPPKHD 702


>gi|336477876|ref|YP_004617017.1| hypothetical protein [Methanosalsum zhilinae DSM 4017]
 gi|335931257|gb|AEH61798.1| protein of unknown function DUF255 [Methanosalsum zhilinae DSM
           4017]
          Length = 704

 Score =  355 bits (910), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 231/698 (33%), Positives = 347/698 (49%), Gaps = 62/698 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED  +A ++N  F+ IKVDREERPD+D +YM   Q +    GWP++
Sbjct: 55  STCHWCHVMEEESFEDPKIADMMNRTFICIKVDREERPDIDSMYMKICQQMTERCGWPMT 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V ++P   P    TY P +      G   ++ ++ + W  ++D +        ++L+   
Sbjct: 115 VIMTPGKVPFFISTYVPKKSGLAGIGMADLIPQIAEIWKTRQDEIVNKTEEIKQRLNRIT 174

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A   +  +    P++ ++     L+  YD  +GGFG APKFP P  I  +L H     +
Sbjct: 175 AAPEGAEYIS---PKDVIQKGYHLLAHYYDQNYGGFGRAPKFPAPHNIMFLLRHWNYTGN 231

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           T       +  KM   TL  M  GGI DHVG GFHRYS DE+W +PHFEKML DQ  LA 
Sbjct: 232 T-------DALKMAETTLTSMQLGGIFDHVGYGFHRYSTDEKWKLPHFEKMLNDQALLAL 284

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+  T    Y    R IL Y+ RDM    G  +SAEDADS   EG     EG FY+
Sbjct: 285 AYTEAYQATGKKVYENTARKILRYVLRDMRSEKGGFYSAEDADS---EGV----EGKFYL 337

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  E+  IL  E A L    + +K  GN       +   +  G N+L    ++S     
Sbjct: 338 WTEDEIRYILTPEEADLVCRVFNVKREGNF----AEESTGKLTGNNILYMKGETSEIVEP 393

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                E+   +L +   KL++VRS R  P  DDK++  WNGL+I++ A+A         S
Sbjct: 394 TEKENEEIQKLLNQALDKLYEVRSARVHPLKDDKILTDWNGLMIAALAKA---------S 444

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
             F  P       EY+E A++   FI  ++YD  + +L H +    +   GF+DDYA  +
Sbjct: 445 GAFQEP-------EYVEYAKTCTKFILDNMYD-GSGKLLHRYHRENAGIDGFVDDYAAFV 496

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG-YFNTTGEDPSVLLRVKEDHDGA 557
            GL++LYE     K+L  A+E+ +     F D +G G YF +      +++R  E  D +
Sbjct: 497 WGLIELYEATFEEKYLQKALEINDYFISHFQDEKGRGFYFTSNDRSGDLIVRSMEICDTS 556

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNS++V+N++RLA +      +     A   LA     +    ++   +  A    S
Sbjct: 557 MPSGNSMAVLNILRLAKMTGDHNLESVASEAIRHLAA---AISHNPISSTYLLSAFYFAS 613

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD-----TEEMDFWEEHNSN 672
            P  + V+     ++ D   M+ A   ++ + + V  + PAD     TE + + +E    
Sbjct: 614 EPGCEVVIAAEIDNAKD---MIEALQTNF-IPQCVYLLRPADSSESFTETIGYLKEMKGI 669

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           N   A          A VC+N++CS PVTD + + +L+
Sbjct: 670 NGRPA----------AYVCRNYTCSSPVTDAVEMMDLI 697


>gi|118575698|ref|YP_875441.1| thioredoxin [Cenarchaeum symbiosum A]
 gi|118194219|gb|ABK77137.1| thioredoxin [Cenarchaeum symbiosum A]
          Length = 676

 Score =  354 bits (909), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 238/700 (34%), Positives = 349/700 (49%), Gaps = 84/700 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE+E +A ++N+ F++IKVDREERPD+D +Y    Q   G GGWPLS
Sbjct: 52  SACHWCHVMAHESFENENIADIMNENFINIKVDREERPDIDDIYQKGCQLATGQGGWPLS 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            FL+PD KP   GTY PP   +GR GF++ILR++  AW +K   +  +    +E L    
Sbjct: 112 AFLTPDRKPFYIGTYIPPSSSHGRNGFESILRQLSQAWKEKPGDIKGTAEKFLETLRGGE 171

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A+A     P E  ++ L   A  L +  D+  GGFG APKFP    I  +  +      
Sbjct: 172 RATA-----PAEPDRSVLDEAAVNLLQMADTTHGGFGRAPKFPGSANISFLFRY------ 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            GK    S+  +  L TL  MA+GGI D VGGGFHRYS DERW  PHFEKMLYD   +  
Sbjct: 221 -GKLSGISKFTRFALLTLDRMARGGIFDQVGGGFHRYSTDERWLAPHFEKMLYDNALIPV 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y +A+ +T    Y  I    LDY+ R++  P G  +S++DAD   TEG    +EG +YV
Sbjct: 280 NYAEAYQVTGSPAYLRIMEKTLDYVLRELSSPEGGFYSSQDAD---TEG----EEGRYYV 332

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W+ KEV++ILG  A  F   Y +   GN            ++GK +L      SA A + 
Sbjct: 333 WSKKEVKEILGADADAFCMFYDVTDGGN------------WEGKTILYNGAAPSAVAFQC 380

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+ + +   I+     KL + RS R  P LDDKV+ SWN L++++ AR  +         
Sbjct: 381 GITVGELDGIIERSAAKLLEARSGRVPPGLDDKVLASWNSLMVTALARGYR--------- 431

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR---LQHSFRNGPSKAPGFLDDYAF 496
                   S    Y++ A     FI     D + HR   L  +++ G ++ PG+LDD+A+
Sbjct: 432 -------ASGEARYLDAARRCLGFI-----DAKMHRDGALMRTYK-GEARIPGYLDDHAY 478

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
               LLD +E  +  ++L  A E+ +   + F D E GG+F T+     +++R +  +D 
Sbjct: 479 YGCALLDAFEVDAEERYLRRASEIGSHLVQNFWDEERGGFFMTSDVHEGLIVRPRSGYDL 538

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADM 615
           + PSGNS +   ++RL          Y+    E  L   E  +   A A      A   M
Sbjct: 539 SLPSGNSAAAHLMLRL----------YHLTGDESCLKTAERTMSSQAQAAAENPFAFGHM 588

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L+V    H++     + +D    +    A   L + ++ I+ A   ++D          +
Sbjct: 589 LNV-MYMHILGPAEITVLDKGGEIPRGLAEKFLPEALL-INVASQGQLD----------A 636

Query: 676 MARNNFSADK-----VVALVCQNFSCSPPVTDPISLENLL 710
           ++R  F A K       A +C+N +CS P      +E LL
Sbjct: 637 LSRYPFFAGKSFGGNSTAYICRNKTCSAPQDTMNGVEALL 676


>gi|448424193|ref|ZP_21582319.1| hypothetical protein C473_04874 [Halorubrum terrestre JCM 10247]
 gi|445682858|gb|ELZ35271.1| hypothetical protein C473_04874 [Halorubrum terrestre JCM 10247]
          Length = 742

 Score =  354 bits (909), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 236/723 (32%), Positives = 344/723 (47%), Gaps = 88/723 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA ++N+ FV IKVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
            + +P+ KP   GTYFPPE +   PGF+ +  ++ D+W          ++ D  A+S   
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARD 172

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
            +E +    +  +   +       + L   A    + YD   GGFGS   KFP P  I +
Sbjct: 173 ELESVPTPEAVGSDGEETASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDL 232

Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
           ++              A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VP
Sbjct: 233 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 281

Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
           HFEKMLYD  +L   YLD + L  D  Y+ +  + L +L R++   GG  FS  DA S  
Sbjct: 282 HFEKMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRP 341

Query: 306 TEG----------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMS 354
            EG               EGAFYVWT +EV+ +L E A  L KE Y ++  GN +     
Sbjct: 342 PEGRRGDDTGDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE----- 396

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
                 +G  V          A+      ++    L   R  LFD R +RPRP  D+KV+
Sbjct: 397 ------RGTTVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVL 450

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQ 472
            +WNG  IS+FARA   L                  + Y E+A  A  F R  LYD   +
Sbjct: 451 AAWNGRAISAFARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESE 493

Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
           T  L   + +G  + PG+LDDYAF+  G LD+Y      + L +A+EL +   + F D +
Sbjct: 494 TGALARRWLDGDVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDAD 553

Query: 533 GGGYFNTTGEDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD- 582
            G  + T   D           ++ R +E  D + PS   V+   L    +++ G ++D 
Sbjct: 554 DGTIYFTRDRDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDG 609

Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 642
             R+ AE  +     R++   +    +  AA+++       V +   +   D+   L   
Sbjct: 610 ELREIAERVVTTHADRIRGSPLEHASLVRAANVVET-GGIEVTIAADEVPDDWRETLGER 668

Query: 643 HASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPP 699
           +    L   ++   PA  + +D W +     A+    A    +  +  A VC+ F+CSPP
Sbjct: 669 Y----LPGALVAPRPATEDGLDEWLDRLDMTAAPPIWADRGATDGEPTAYVCEGFTCSPP 724

Query: 700 VTD 702
            TD
Sbjct: 725 RTD 727


>gi|335436727|ref|ZP_08559519.1| hypothetical protein HLRTI_06517 [Halorhabdus tiamatea SARL4B]
 gi|335437369|ref|ZP_08560149.1| hypothetical protein HLRTI_09692 [Halorhabdus tiamatea SARL4B]
 gi|334896155|gb|EGM34310.1| hypothetical protein HLRTI_09692 [Halorhabdus tiamatea SARL4B]
 gi|334897442|gb|EGM35575.1| hypothetical protein HLRTI_06517 [Halorhabdus tiamatea SARL4B]
          Length = 715

 Score =  354 bits (909), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 234/700 (33%), Positives = 349/700 (49%), Gaps = 70/700 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+  A +LN+ FV IKVDREERPDVD++Y T  Q L   GGWPLSV+
Sbjct: 55  CHWCHVMAEESFEDDETAAVLNENFVPIKVDREERPDVDRIYQTLAQLLDQQGGWPLSVW 114

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD +P   GTYFPP+ + GRPGF  +L  ++  W+  R+ + Q      + +S  L  
Sbjct: 115 LTPDGRPFYVGTYFPPDSRGGRPGFAELLEDLQATWENDREGIEQRADQWADAISGELEG 174

Query: 142 S--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
           +  A+ +   DEL    LR  A+   ++ D   GGFGS  PKFP+P  +Q++L    +  
Sbjct: 175 TPDAARDTAGDEL----LRSGADAAVRTADREQGGFGSGGPKFPQPGRLQLLLRADARFG 230

Query: 199 DT----GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           D     G++ EA+E + ++  TL  M  GG++DHVGGGFHRY+ D  W VPHFEKMLYD 
Sbjct: 231 DARREEGENAEATEYRSILTETLDAMVDGGLYDHVGGGFHRYATDRSWTVPHFEKMLYDN 290

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            ++  V L+A+  T D  Y+ + R+  D+L R++  P G  +S  DA S   EG    +E
Sbjct: 291 AEIPRVLLEAYRATGDERYARVARETFDFLDRELGHPEGGFYSTLDARS---EG----EE 343

Query: 315 GAFYVWTSKEVEDILGEHA--ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           G FYVWT  +V +++ +     L  E Y +   GN +            G+ VL      
Sbjct: 344 GKFYVWTPAQVREVIDDETDVSLVCERYGITEEGNFE-----------DGQTVLTIAASV 392

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
              A++ G+   +    L   R +LFD RS+R RP  D+K++  WNGL IS+ A  S  L
Sbjct: 393 DELAARSGLGAGEVRERLDRAREELFDARSERTRPPRDEKILAGWNGLAISALAEGSLTL 452

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                         G+D   +++ A  A  F+R  L+D+    L+  + +G  +  G+L+
Sbjct: 453 --------------GND---FLDRAVDALEFVRETLWDDDAGLLKRRYIDGDVRVDGYLE 495

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS----VLL 548
           DYAFL  G LD Y        L +A++L    +  F D++ G  + T     S    +L 
Sbjct: 496 DYAFLARGALDCYGASGDLDHLAFALDLAREIETRFFDKDVGTLYFTEAPGESRETDLLA 555

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R +E  D + PS   V+V  LV L   V       + +  E + AV ET    +A A PL
Sbjct: 556 RPQELTDRSTPSSAGVAVDVLVTLDEFVP------HDRFGEIASAVLETHHSAIA-AEPL 608

Query: 609 ----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
               +  A D  +  S   + +   +    + + +   +    L   V+   P     ++
Sbjct: 609 QHASLVLAGDRDANGS-TELTVASDEIPAAWRDRIGETY----LPARVLARRPPTEAGLE 663

Query: 665 FWEEHN--SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
            W E         +     + +      C++F+CS P+ D
Sbjct: 664 TWLEQFELGEAPPIFAGRLAEEDATIYACRDFTCSRPLHD 703


>gi|336254491|ref|YP_004597598.1| hypothetical protein Halxa_3105 [Halopiger xanaduensis SH-6]
 gi|335338480|gb|AEH37719.1| protein of unknown function DUF255 [Halopiger xanaduensis SH-6]
          Length = 730

 Score =  354 bits (909), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 233/698 (33%), Positives = 347/698 (49%), Gaps = 65/698 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF+DEGVA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS
Sbjct: 53  SACHWCHVMEEESFQDEGVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVSGRGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK--------KRDMLAQSGAFA 131
            +L+P+ KP   GTYFP E + G+PGF  +  ++ D+W+         + D   ++    
Sbjct: 113 AWLTPEGKPFFIGTYFPREGQRGQPGFLDLCERISDSWNSEDREEMEHRADQWTEAAKDR 172

Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 190
           +E   E   A  ++     E+    L   A    +S D  +GGFGS  PKFP+P  +Q +
Sbjct: 173 LEDTPEGAGAGGAAEPPSSEV----LETAASAALRSADREYGGFGSDGPKFPQPARLQAL 228

Query: 191 LYHSKKLEDTGKSGEASEGQKMVL-FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
              ++  + TG+     E  + VL  TL  MA GG++DHVG GFHRY VD  W VPHFEK
Sbjct: 229 ---ARAYDRTGR-----EAYREVLEETLDAMAAGGLYDHVGSGFHRYCVDRDWTVPHFEK 280

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
           MLYD  ++   +L  + LT D  Y+ +  + L ++ R++    G  FS  DA S + E  
Sbjct: 281 MLYDNAEIPRAFLTGYQLTGDERYAEVVAETLAFVDRELTHEEGGFFSTLDAQSEDPETG 340

Query: 310 TRKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
            R +EGAFYVWT  EV + L +   A LF + Y +  +GN            F+G+N   
Sbjct: 341 ER-EEGAFYVWTPDEVREALEDETTADLFCDRYDITESGN------------FEGRNQPN 387

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
            +      A +  +   +    L   R +LF  R  RPRP+ D+KV+  WNGL+I++ A 
Sbjct: 388 RVRPIDDLADEYDLEESEVQKRLETAREQLFAAREGRPRPNRDEKVLAGWNGLMIATCAE 447

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
           A+ +L              G D  +Y ++A  A  F+R  L++E   RL   +++G  K 
Sbjct: 448 AALVL--------------GDD--QYADMAVDALDFVRDRLWNESEQRLNRRYKDGDVKV 491

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
            G+L+DYAFL  G L  YE       L +A+EL    +  F D + G  + T     S++
Sbjct: 492 DGYLEDYAFLARGALGCYEATGEVDHLRFALELARVVEAEFWDADRGTLYFTPESGESLV 551

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
            R +E  D + P+   V+V  L+ L         + +   A   L     +++  ++   
Sbjct: 552 TRPQELGDQSTPAATGVAVEVLLALDEFT----DEDFEGIAATVLETHANKIEANSLEHT 607

Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW- 666
            +C AAD L   + +  V     ++ D  +      AS      +    PA  E ++ W 
Sbjct: 608 TLCLAADRLESGALEVTV-----AADDLPDEWRDRFASRYFPDRLFARRPATEEGLEDWL 662

Query: 667 EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
           +E     A    A       +    VC++ +CSPP  D
Sbjct: 663 DELGLEEAPPIWAGREARDGEPTLYVCRDRTCSPPTHD 700


>gi|448506299|ref|ZP_21614409.1| hypothetical protein C465_02621 [Halorubrum distributum JCM 9100]
 gi|448525080|ref|ZP_21619498.1| hypothetical protein C466_12493 [Halorubrum distributum JCM 10118]
 gi|445699949|gb|ELZ51967.1| hypothetical protein C465_02621 [Halorubrum distributum JCM 9100]
 gi|445700052|gb|ELZ52067.1| hypothetical protein C466_12493 [Halorubrum distributum JCM 10118]
          Length = 742

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 236/723 (32%), Positives = 343/723 (47%), Gaps = 88/723 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA ++N+ FV IKVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
            + +P+ KP   GTYFPPE +   PGF+ +  ++ D+W          ++ D  A+S   
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARD 172

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
            +E +    +  +           + L   A    + YD   GGFGS   KFP P  I +
Sbjct: 173 ELESVPTPETVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDL 232

Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
           ++              A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VP
Sbjct: 233 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 281

Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
           HFEKMLYD  +L   YLD + L  D  Y+ +  + L +L R++   GG  FS  DA S  
Sbjct: 282 HFEKMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRP 341

Query: 306 TEG----------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMS 354
            EG               EGAFYVWT +EV+ +L E A  L KE Y ++  GN +     
Sbjct: 342 PEGRRGDDTGDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE----- 396

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
                 +G  V          A+      ++    L   R  LFD R +RPRP  D+KV+
Sbjct: 397 ------RGTTVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVL 450

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQ 472
            +WNG  IS+FARA   L                  + Y E+A  A  F R  LY  D +
Sbjct: 451 AAWNGRAISAFARAGDTLG-----------------EPYAEIAREALEFCRERLYDADRE 493

Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
           T  L   + +G  + PG+LDDYAF+  G LD+Y      + L +A+EL +   + F D +
Sbjct: 494 TGALARRWLDGDVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDAD 553

Query: 533 GGGYFNTTGEDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD- 582
            G  + T   D           ++ R +E  D + PS   V+   L    +++ G ++D 
Sbjct: 554 DGTIYFTRDRDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDG 609

Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 642
             R+ AE  +     R++   +    +  AA+++       V +   +   D+   L   
Sbjct: 610 ELREIAERVVTTHADRIRGSPLEHASLVRAANVVET-GGIEVTIAADEVPDDWRETLGER 668

Query: 643 HASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPP 699
           +    L   ++   PA  + +D W +     A+    A    +  +  A VC+ F+CSPP
Sbjct: 669 Y----LPGALVAPRPATEDGLDEWLDRLDMTAAPQIWADRGATDGEPTAYVCEGFTCSPP 724

Query: 700 VTD 702
            TD
Sbjct: 725 RTD 727


>gi|53803351|ref|YP_114889.1| hypothetical protein MCA2477 [Methylococcus capsulatus str. Bath]
 gi|53757112|gb|AAU91403.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath]
          Length = 679

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 238/696 (34%), Positives = 350/696 (50%), Gaps = 76/696 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
           + CHWCHVM  ESFEDE  A+++N  FV+IKVDREERPD+D++Y T  Q L   GGGWPL
Sbjct: 53  SACHWCHVMAHESFEDEATAEVMNRLFVNIKVDREERPDLDRIYQTVHQLLSRRGGGWPL 112

Query: 79  SVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           +V L+P DL P   GTYFP E +YG P F ++L  +   + + R  LA++G    E L E
Sbjct: 113 TVCLNPHDLVPFFTGTYFPKEPRYGMPAFVSVLHHLAAFYAEHRGDLARNGQVLREAL-E 171

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           A+        +PD      L    + L  S+D+  GGFG APKFPR  +++++L      
Sbjct: 172 AMGREGDGALMPD---AGLLARATQALRTSFDASHGGFGGAPKFPRTADLELLLRSD--- 225

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                     EG +M+  TL  MA+GGI+DH+GGGF RYSVDERW +PHFEKMLYD G L
Sbjct: 226 ---------GEGVEMLRTTLDGMARGGIYDHLGGGFARYSVDERWEIPHFEKMLYDNGPL 276

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +Y    + T D  Y+ +     +++ R+M  P G  ++A DADS   EG     EG F
Sbjct: 277 LELYARMAAQTGDPAYAVVATGTAEWVIREMQSPEGGYYAALDADS---EGG----EGRF 329

Query: 318 YVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W  +EV+ +L  +  ++F   Y L    N            F+G   L       A A
Sbjct: 330 YLWDRQEVQGLLSADEYLVFSLRYGLDGPPN------------FEGHWHLRVARSLEAVA 377

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           +  G   ++   +L   R +L   R +R RP  DDKVI +WNGL++     A ++L    
Sbjct: 378 AATGKGGDEVTRLLESARTRLRRAREQRVRPGRDDKVIAAWNGLMVRGMTVAGRLLG--- 434

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                        R ++ME A+ A  F+RR +  +   RL   +R+G ++   +LDD+AF
Sbjct: 435 -------------RADFMESADRALGFVRRTM--DAGGRLMSVYRDGRARFDAYLDDHAF 479

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+   L++ +    T  L WA+ L +   E F D E GG+F T  +  +++ R K   D 
Sbjct: 480 LLDAALEILQTRWSTDDLEWAVSLADRLLERFEDAEHGGFFFTAADHETLIQRPKPWMDE 539

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAADM 615
           + PSGN V++  L+RLA +   S+   Y   AE  L      +     A   LM    + 
Sbjct: 540 SMPSGNGVAIRALIRLAGLTGESR---YADAAERGLRAAHGAMARYPHAHCALMNAVREW 596

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L+ P    V+L G + ++        A A     + +++  P+D   +          ++
Sbjct: 597 LTPPPL--VILRGGREALK----QWCAKAREAAPEALVYAIPSDAVGL---------PSA 641

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           +A         VA VC+   C+ P TD +   N +L
Sbjct: 642 LAARMPGPGGPVAYVCRGRVCAAP-TDSLGTLNEIL 676


>gi|448479213|ref|ZP_21604065.1| hypothetical protein C462_01682 [Halorubrum arcis JCM 13916]
 gi|445822491|gb|EMA72255.1| hypothetical protein C462_01682 [Halorubrum arcis JCM 13916]
          Length = 742

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 236/723 (32%), Positives = 343/723 (47%), Gaps = 88/723 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA ++N+ FV IKVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
            + +P+ KP   GTYFPPE +   PGF+ +  ++ D+W          ++ D  A+S   
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARD 172

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
            +E +    +  +           + L   A    + YD   GGFGS   KFP P  I +
Sbjct: 173 ELESVPTPEAVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDL 232

Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
           ++              A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VP
Sbjct: 233 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 281

Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
           HFEKMLYD  +L   YLD + L  D  Y+ +  + L +L R++   GG  FS  DA S  
Sbjct: 282 HFEKMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRP 341

Query: 306 TEG----------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMS 354
            EG               EGAFYVWT +EV+ +L E A  L KE Y ++  GN +     
Sbjct: 342 PEGRRGDDTGDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE----- 396

Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
                 +G  V          A+      ++    L   R  LFD R +RPRP  D+KV+
Sbjct: 397 ------RGTTVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVL 450

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQ 472
            +WNG  IS+FARA   L                  + Y E+A  A  F R  LYD   +
Sbjct: 451 AAWNGRAISAFARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESE 493

Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
           T  L   + +G  + PG+LDDYAF+  G LD+Y      + L +A+EL +   + F D +
Sbjct: 494 TGALARRWLDGDVRGPGYLDDYAFVACGALDVYAATGDPEPLGFALELADALVDEFYDAD 553

Query: 533 GGGYFNTTGEDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD- 582
            G  + T   D           ++ R +E  D + PS   V+   L    +++ G ++D 
Sbjct: 554 DGTIYFTRDRDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDG 609

Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 642
             R+ AE  +     R++   +    +  AA+++       V +   +   D+   L   
Sbjct: 610 ELREIAERVVTTHADRIRGSPLEHASLVRAANVVET-GGIEVTIAADEVPDDWRETLGER 668

Query: 643 HASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPP 699
           +    L   ++   PA  + +D W +     A+    A    +  +  A VC+ F+CSPP
Sbjct: 669 Y----LPGALVAPRPATEDGLDEWLDRLDMTAAPPIWADRGATDGEPTAYVCEGFTCSPP 724

Query: 700 VTD 702
            TD
Sbjct: 725 RTD 727


>gi|448540737|ref|ZP_21623658.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-646]
 gi|448549039|ref|ZP_21627815.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-645]
 gi|448555786|ref|ZP_21631715.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-644]
 gi|445708890|gb|ELZ60725.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-646]
 gi|445713728|gb|ELZ65503.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-645]
 gi|445717309|gb|ELZ69027.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-644]
          Length = 703

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 227/694 (32%), Positives = 343/694 (49%), Gaps = 72/694 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y    Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAEVLNEEFVPVKVDREERPDLDRIYQNICQQVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+P+ KP   GTYFPPE + G PGF+ I+    ++W   RD +          +++ L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDIVESFAESWRTDRDEIENRADQWTSAITDRL 172

Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 197
             +  +   P E P  + L    +   +  D   GGFG   PKFP+P  I  +L      
Sbjct: 173 EETPDT---PGEAPGSDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL------ 223

Query: 198 EDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
                 G A  G++  L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYD
Sbjct: 224 -----RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYD 278

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q  LA+ YLDA  LT +  Y+ +  +  +++RR++    G  F+  DA S         +
Sbjct: 279 QAGLASRYLDAARLTGNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GE 331

Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT  +V D+L E  A LF + Y + P GN            F+ K  ++ ++ +
Sbjct: 332 EGTFYVWTPDDVRDLLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSAT 379

Query: 373 SAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
           +A    +  +   +  + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +
Sbjct: 380 TAELVDEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVV 439

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L+ ++         + SD       A  A  F+R  L+D++T  L     NG  K  G+L
Sbjct: 440 LEDDS---------LASD-------ARRALDFVRERLWDDETETLSRRAMNGEVKGDGYL 483

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYAFL  G  DLY+       L +A++L       F D + G  + T     S++ R +
Sbjct: 484 EDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQ 543

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           E  D + PS   V+    + L      +    + + A+  L  F  R++   +    +  
Sbjct: 544 EPTDQSTPSSLGVATSLFLDLEQFAPNAD---FGEVADAVLGSFANRVRGSPLEHVSLAL 600

Query: 612 AADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EE 668
           AA+  +  VP    + +   +   ++   LA+ +    L   V+   P    E+D W +E
Sbjct: 601 AAEKAASGVP---ELTVAADEVPDEWRATLASRY----LPGLVVSRRPGTDAELDAWLDE 653

Query: 669 HNSNNAS--MARNNFSADKVVALVCQNFSCSPPV 700
              + A    A    +  +     C+NF+CS P 
Sbjct: 654 LGLDEAPPIWAGREAADGEPTVYACENFTCSAPT 687


>gi|448726262|ref|ZP_21708672.1| hypothetical protein C448_06453 [Halococcus morrhuae DSM 1307]
 gi|445795880|gb|EMA46400.1| hypothetical protein C448_06453 [Halococcus morrhuae DSM 1307]
          Length = 709

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 228/688 (33%), Positives = 338/688 (49%), Gaps = 53/688 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF+D  VA+ LN  FV IKVDREERPD+D++Y T    + G GGWPLS
Sbjct: 51  SACHWCHVMADESFDDPVVAERLNKDFVPIKVDREERPDLDRLYQTVAAMVSGQGGWPLS 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PD +P   GTYFP + K G+PGF  +L  + D+WD +R+ +        + ++  L
Sbjct: 111 VWLTPDGRPFYVGTYFPRKAKRGQPGFLDLLDSIADSWDDEREDIEGRADQWADAMAGEL 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             +  S   P E+    L   A++     D   GGFG   KFP+   + +++   +  E 
Sbjct: 171 EGTPDS---PGEVSPGLLETAAQRAVSDADREHGGFGRGQKFPQTGRLHLLM---QAYER 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+       +++ +  L  MA GG+ DH GGGFHRY  D  W VPHFEKMLYD  +L  
Sbjct: 225 TGRDA----FREVAVEALDAMADGGLRDHAGGGFHRYVTDREWTVPHFEKMLYDNAELVR 280

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y+  + LT +  Y+ I R+ L ++ R++  P G  FS  DA S     +   +EGAFYV
Sbjct: 281 AYIAGYRLTGEERYAEIARETLGFVERELRHPDGGFFSTLDAQSEGE--SGEHEEGAFYV 338

Query: 320 WTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           WT  EV + + +   A LF E Y +   GN +            GK VL         A 
Sbjct: 339 WTPPEVHEAIDDEFAADLFCERYGITEAGNFE-----------DGKTVLTLDTAIDGLAD 387

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           + G   E+    L   R  +F  R+ R RP  D+KV+  WNGL+IS+FA A   L     
Sbjct: 388 EHGTTTEEIEADLERAREAIFAARTDRDRPARDEKVLAGWNGLMISAFAEAGLALD---- 443

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                        + Y E A +A  F+R  L+DE   +L   F+ G  K  G+L+DYAFL
Sbjct: 444 -------------ETYGETAVAALDFVREQLWDEDEQQLARRFKGGEVKIDGYLEDYAFL 490

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L+ YE     ++L +A++L       F D E G  + T     S++ R +E  D +
Sbjct: 491 ARGALNCYEATGEVEYLTFALDLGRAVVREFFDAEEGTLYFTPQSGESLVARPQELDDQS 550

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PS   V+V  L+ L+    G +   + + AE  L      ++   +    +  AAD  +
Sbjct: 551 TPSSTGVAVDTLLALSQFAPGEE---FGEIAETVLETHAESIEASPLRRASLALAADRHT 607

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS-- 675
             S + + +V  +   ++   +   +    L K ++   P    E+D W +  S + +  
Sbjct: 608 AGSLE-LTIVADELPTEWRERIGRTY----LPKRLLARRPPTDAELDGWLDRLSLDDAPP 662

Query: 676 -MARNNFSADKVVALVCQNFSCSPPVTD 702
             A       +  A VC+ F+CSPP T+
Sbjct: 663 IWADRTGENGEPTAYVCRAFTCSPPQTE 690


>gi|225559995|gb|EEH08277.1| DUF255 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 804

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 226/593 (38%), Positives = 313/593 (52%), Gaps = 71/593 (11%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           CHVME ESF    VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+VFL+P
Sbjct: 112 CHVMEKESFMSPEVAAILNKAFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTP 171

Query: 85  DLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
           DL+P+ GGTY+P P           G+  F  IL K++D W  ++    +S      QL 
Sbjct: 172 DLEPVFGGTYWPGPHSSASSTLGGEGQVTFIDILEKLRDVWQTQQLRCRESAKDITRQLQ 231

Query: 137 EALSASASSNKL-------PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQM 189
           E  +   + +KL        ++L    L    +  +  YD   GGF  APKFP P  +  
Sbjct: 232 E-FAEEGTYSKLRGAGADEEEDLEVELLEEAYKHFASRYDPVNGGFSRAPKFPTPANLSF 290

Query: 190 MLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           ++  S+    + D     E +   +M + TL  +++GGIHDH+G GF RYSV   W +PH
Sbjct: 291 LVNLSRFPSAVADIVGYEECAHALEMAIKTLISISRGGIHDHIGHGFARYSVTTDWSLPH 350

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 305
           FEKMLYDQ QL  VY DAF    D        DI  Y+    ++ P G   S+EDADS  
Sbjct: 351 FEKMLYDQAQLLGVYTDAFDSAHDPELLGAMYDIAAYITSPPVLSPTGGFHSSEDADSLP 410

Query: 306 TEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
           T   T K+EGAFYVWT KE + ILG+  A +   H+ + P GN +  R++DPH+EF  +N
Sbjct: 411 TPSDTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGNVE--RVNDPHDEFINQN 468

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVIS 423
           VL         A + G+  E+ + I+     KL + R SKR RP LDDK+IV+WNGL I 
Sbjct: 469 VLNIQTTPGKLAKEFGLSEEEVVRIIKASTEKLREYRESKRVRPALDDKIIVAWNGLAIG 528

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
           + A+ S +L +          V     +E+   AE+AA FIR+ L+D  + +L   +R  
Sbjct: 529 ALAKCSVVLDN----------VDRIKAQEFRLAAENAAKFIRQSLFDPASGQLWRIYRGE 578

Query: 484 P-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 542
                PGF DDYA+LISGL+DLYE      +L +A +LQ+                    
Sbjct: 579 ERGDTPGFADDYAYLISGLIDLYEATFDDSYLQFAEQLQH-------------------- 618

Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
                         + PS N V   NL+RL++++   + D YR+ A  +++ F
Sbjct: 619 -------------ASTPSPNGVIARNLLRLSTLL---EDDTYRRLARDTVSAF 655


>gi|448624555|ref|ZP_21670503.1| thioredoxin domain containing protein [Haloferax denitrificans ATCC
           35960]
 gi|445749760|gb|EMA01202.1| thioredoxin domain containing protein [Haloferax denitrificans ATCC
           35960]
          Length = 703

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 233/701 (33%), Positives = 348/701 (49%), Gaps = 82/701 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
           V+L+P+ KP   GTYFPPE + G PGF+ ++    ++W   R+ +   A+    AI ++L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDREEIENRAEQWTSAITDRL 172

Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
            E   ++  A  +++ D   Q ALR          D   GGFG   PKFP+P  I  +L 
Sbjct: 173 EETPDVAGEAPGSEVLDTTVQAALR--------GADRDHGGFGGDGPKFPQPGRIDALL- 223

Query: 193 HSKKLEDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
                      G A  G++  L     +L  MA GG+ DH+GGGFHRY VD  W VPHFE
Sbjct: 224 ----------RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFE 273

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYDQ  LA  YLDA  LT +  Y+ +  +   ++RR++    G  F+  DA S     
Sbjct: 274 KMLYDQAGLAARYLDAARLTGNESYATVAAETFAFVRRELTHDDGGFFATLDAQSG---- 329

Query: 309 ATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
               +EG FYVWT  +V ++L E  A LF + Y + P GN            F+ K  ++
Sbjct: 330 ---GEEGTFYVWTPDDVRELLPELDADLFCDRYGVTPGGN------------FENKTTVL 374

Query: 368 ELNDSSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
            ++ ++A  A +  +   +    L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA
Sbjct: 375 NVSATTADLAEEYDLAESEVEARLEKARKALFAAREGRDRPARDEKVLAGWNGLMISAFA 434

Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           + S +L+ ++                  + A  A  F+R  L+D++T  L     NG  K
Sbjct: 435 QGSVVLEDDS----------------LADDARRALDFVRERLWDDETETLSRRVMNGEVK 478

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
             G+L+DYAFL  G  DLY+       L +A++L       F D + G  + T     S+
Sbjct: 479 GDGYLEDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESL 538

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           + R +E  D + PS   V+    + L      +  D +   A+  L  F  R++   +  
Sbjct: 539 VTRPQEPTDQSTPSSLGVATSLFLDLEQF---APEDGFGDVADAVLGSFANRVRGSPLEH 595

Query: 607 PLMCCAADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
             +  AA+  +  VP    + +   +   ++   LA+ +    L   V+   P   EE+D
Sbjct: 596 VSLALAAEKAASGVP---ELTVAADEVPDEWRETLASRY----LPGLVVSRRPGTDEELD 648

Query: 665 FW-EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
            W +E   + A    A    +  +     C+NF+CS P  D
Sbjct: 649 AWLDELGLDEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689


>gi|403747071|ref|ZP_10955267.1| hypothetical protein URH17368_2612 [Alicyclobacillus hesperidum
           URH17-3-68]
 gi|403120377|gb|EJY54770.1| hypothetical protein URH17368_2612 [Alicyclobacillus hesperidum
           URH17-3-68]
          Length = 628

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 241/693 (34%), Positives = 341/693 (49%), Gaps = 68/693 (9%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFEDE VA+ LN  ++SIKVDREERPD+D +YMTY QA+ G GGWPL+V L+PD  
Sbjct: 1   MAHESFEDEQVAQYLNQHYISIKVDREERPDIDHIYMTYCQAVTGEGGWPLTVILTPDGH 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFP   +YGRPG   ILR ++  WD++R+ L  + A  + ++    +A      
Sbjct: 61  PFFAGTYFPKNARYGRPGLLEILRVMRQKWDEEREKLVSASAELVTRMQPIFAA------ 114

Query: 148 LPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
           +P E+  ++A R  A  L + +D  +GGFG APKFP   ++  +L +S+   D G     
Sbjct: 115 MPGEVDGKHAARQAASTLRERFDHAYGGFGDAPKFPAFHQVMFLLRYSRFASDQG----- 169

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
              ++M L TL  + +GGI DHVGGG  RYS D  W VPHFEKMLYD       Y +A+ 
Sbjct: 170 --ARQMALDTLDAIMRGGIADHVGGGIARYSTDAFWRVPHFEKMLYDNALAITAYTEAYQ 227

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
           +T++  Y      I+ +L R++    G  +SA DADS   EG    +EG FYVW  ++V 
Sbjct: 228 VTRNPRYRRFVEQIVTFLERELTSREGAFYSALDADS---EG----QEGRFYVWRPEDVT 280

Query: 327 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEK 385
             LG+      E Y       C    ++D  N F+G +V   ++ D  A AS   M   +
Sbjct: 281 AALGDED---GEWY-------CAFYDITDEGN-FEGYSVPNYVDRDIPAFASARNMSEGE 329

Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
               L E  RKL++ R  R  P LDDK++ +WN L IS  A+A  +   E          
Sbjct: 330 LWQWLDEANRKLYEWREHREHPGLDDKILTAWNALAISGLAKAGAVFADE---------- 379

Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
                  ++ +A  A   +   L  +   RL   +R+  +    + DD+A+LI+  LDLY
Sbjct: 380 ------HWLGLAVRAVQALETLLVRKPDGRLLARYRDQDAAVFAYADDHAYLIAAYLDLY 433

Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
           E      +L  A   Q+  D LF D EG GYF    +   ++ + K  +DGA PS NSV+
Sbjct: 434 EATLDPFYLRRAQHWQSVLDTLFWDSEGSGYFLYGRDAERLIAQPKTVYDGATPSANSVA 493

Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
             NL RL ++V     + Y    +  L  F T L + A    L    A ML       VV
Sbjct: 494 AHNLQRLYALVG---DEAYADRLDRLLHAFGTWLME-APVDHLWLVTAAMLRDLGTTEVV 549

Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADK 685
                   D   M  A H ++ L + V+    A           N  NA       +AD+
Sbjct: 550 WSSVPGRGDVRAMATAFHLAF-LPEAVLLTPSA---------RPNGENAYPP----AADE 595

Query: 686 VVALVCQNFSCSPPVTD-PISLENLLLEKPSST 717
            +  VC++F C  P  D   ++ NL+   P  T
Sbjct: 596 ALVYVCRHFHCERPEADVAATIANLVANPPRLT 628


>gi|55377924|ref|YP_135774.1| thioredoxin [Haloarcula marismortui ATCC 43049]
 gi|55230649|gb|AAV46068.1| thioredoxin domain containing protein [Haloarcula marismortui ATCC
           43049]
          Length = 733

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 229/709 (32%), Positives = 352/709 (49%), Gaps = 76/709 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFEDE +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +
Sbjct: 58  CHWCHVMEEESFEDEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
           L+P+ +P   GTYFPPE+K G+PGF  +L+++  +W   +++ +M   AQ    AIE   
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLSGSWSDPEQRAEMENRAQQWTEAIESDL 177

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
           EA  A       P++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +
Sbjct: 178 EATPAD------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
              D G+     +   +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  
Sbjct: 229 AYADGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA----------E 305
           ++   +L  +       Y+ + R+  ++++R++  P G  FS  DA+SA          +
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPHSESRSDSEQ 344

Query: 306 TEGATRK-------KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDP 356
           + G + +       +EG FYVWT ++V D + +   A +F ++Y +   GN         
Sbjct: 345 SSGESPRDDPDGETEEGLFYVWTPEQVHDAVDDETDADIFCDYYGVTEQGN--------- 395

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
              F+G  VL         A +     ++    L     + F+ R  RPRP  D+KV+  
Sbjct: 396 ---FEGATVLAVRKPVPVLAEEYERSEDEITASLQRALNETFEARKDRPRPARDEKVLAG 452

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+I + A  + +L                   +Y +VA  A SF+R HL+D    RL
Sbjct: 453 WNGLMIRALAEGAIVLDD-----------------QYADVAADALSFVREHLWDADAGRL 495

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              +++      G+L+DYAFL  G L L+E     + L +A++L     E F D E G  
Sbjct: 496 NRRYKDDDVAIDGYLEDYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTL 555

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F T     S++ R +E  D + PS   V+V  L+ L+     S+ D +   AE  +    
Sbjct: 556 FFTPTGGESLVARPQELTDQSTPSSTGVAVDLLLSLSHF---SEDDRFESVAERVIRTHA 612

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
            R+    +    +  A D     + + V LVG +S  D+        A   + + ++   
Sbjct: 613 DRVSSNPLQHASLTLATDTYEQGALE-VTLVGDQS--DYPTEWTETLAEQYIPRRLLAHR 669

Query: 657 PADTEEMDFW---EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           PA+    + W    E + +    A      D+     C+NF+CSPP  D
Sbjct: 670 PAEKSRFEQWLDTLEVDESPPIWAGRTQVDDRPTVYACRNFACSPPKHD 718


>gi|448729708|ref|ZP_21712022.1| hypothetical protein C449_08002 [Halococcus saccharolyticus DSM
           5350]
 gi|445794670|gb|EMA45214.1| hypothetical protein C449_08002 [Halococcus saccharolyticus DSM
           5350]
          Length = 721

 Score =  352 bits (904), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 236/689 (34%), Positives = 346/689 (50%), Gaps = 54/689 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE VA+ LND FV IKVDREERPD+D++Y T    + G GGWPLS
Sbjct: 52  SACHWCHVMEDESFEDEAVAERLNDDFVPIKVDREERPDLDRLYQTICGMVSGQGGWPLS 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
           V+L+PD +P   GTYFP + K G+PGF  +L  + ++W D + D+  ++  +A     E 
Sbjct: 112 VWLTPDGRPFYVGTYFPRDAKRGQPGFLDLLDSIAESWEDDREDVEGRADQWAGAMAGE- 170

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
               A+  +  D    + L   A+Q  +S D  +GGFG   KFP+   + +++   +  E
Sbjct: 171 --LEATPEQPGDPPGSDLLETAAQQAVESADREYGGFGRGQKFPQTGRLHLLM---RAAE 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG++       ++   TL  MA GG+ DHVGGGFHRY+ D  W VPHFEKMLYD  +L 
Sbjct: 226 RTGRAV----FDEVARETLDAMADGGLRDHVGGGFHRYTTDREWTVPHFEKMLYDNAELV 281

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL  +  T+   Y+ + R+ L ++ R++  P G  FS  DA S +  G    +EGAFY
Sbjct: 282 RAYLAGYRRTEAERYAEVARETLGFVERELHHPDGGFFSTLDAQSEDESG--EHEEGAFY 339

Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT  EV D + +   A LF E Y +  TGN +            G  VL    D    A
Sbjct: 340 VWTPDEVHDAVDDEFAADLFCERYGVTETGNFE-----------DGTTVLTLSADIEDLA 388

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +     E+    L   R  +F  R++R RP  D+K++  WNGL+IS+FA A   L +  
Sbjct: 389 DEHDTTAEEIEAELERARETVFAARAERARPARDEKILAGWNGLMISAFAEAGLTLDA-- 446

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           + + A +A  FIR HL+D++  RLQ  +++   K  G+L+DYAF
Sbjct: 447 ---------------RFADTAVTALDFIREHLWDDEEKRLQRRYKDEDVKIDGYLEDYAF 491

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L  G L+ YE       L +A++L  T +  F D E    + T     S++ R +E  D 
Sbjct: 492 LARGALNCYEATGDVDHLAFALDLARTIETEFWDSEEETLYFTPQTGESLVARPQELDDQ 551

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           + PS   V+V  L+ L      +  D +   A  SL      ++   +    +  AAD  
Sbjct: 552 STPSSTGVAVDVLLALDHF---TPDDRFEGIATTSLETHAKTVESSPLRRASLALAADRH 608

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNN 673
           +  S +  V+         E +      SY L + ++   P   +E+  W +    +   
Sbjct: 609 AAGSLEWTVVSDGVPDAWRERI----GRSY-LPRRLLARRPPSDKELATWCDRLGLDDPP 663

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
           A  A  +    +  A VC++F+CSPP TD
Sbjct: 664 AIWADRDQRDGEPTAYVCRSFTCSPPQTD 692


>gi|337293410|emb|CCB91399.1| uncharacterized protein yyaL [Waddlia chondrophila 2032/99]
          Length = 691

 Score =  352 bits (903), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 213/555 (38%), Positives = 295/555 (53%), Gaps = 53/555 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLS 79
           TCHWCHVME ESF++  VA+ LN  F++IKVDREE P+VD++YM + QAL     GWPL+
Sbjct: 55  TCHWCHVMEEESFQNLEVAEQLNRAFINIKVDREELPEVDQLYMDFAQALMPNSAGWPLN 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
           VFL+PDL P    TY PP +  G PG   +++ + + W  K  D +       ++   + 
Sbjct: 115 VFLTPDLLPFFATTYLPPRNASGLPGMIDLIQHIHELWIGKGHDQILMQAQQIVDLFQQN 174

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +        LPD   +  + L  + L +  D  +GG   APKFP   +  + L H   LE
Sbjct: 175 IQVYGID--LPD---RKCVPLAVDTLLQISDPVWGGVKGAPKFPIGYQY-VFLMHYSALE 228

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
             G+         +V  TL+ M +GGI+DH+G GF RYS+DE+W +PHFEKMLYD   LA
Sbjct: 229 KDGRP------MFLVEKTLELMYRGGIYDHLGSGFSRYSIDEQWQIPHFEKMLYDNALLA 282

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK   +  +C +++DY+   + G  G   SAEDADS   EG     EG FY
Sbjct: 283 ECYCEAWKATKRSLHRRVCCEVIDYVLSKLTGEQGAFLSAEDADS---EGV----EGKFY 335

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
            WT  E++D+LG + + LF   Y    TGN            F+GKN+L         AS
Sbjct: 336 TWTMDEIDDVLGSDDSELFCSVYGATATGN------------FEGKNILHLPALLEHYAS 383

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
              M   +    + E + KL+ VR KR  P  DDKV+ SWNGL+I S   A K  +    
Sbjct: 384 DNQMDHFELEARIAELKEKLYKVREKRGHPLKDDKVLSSWNGLMIHSIVEAGKAFEI--- 440

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y++    AA FI  HL+  +  RL   +R G     G LDDYAF+
Sbjct: 441 -------------SRYVDAGRRAARFIYGHLW--KNGRLLRRYREGKVDFSGGLDDYAFM 485

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I   L L+E G GT+WL WA  ++    + F   EGG ++ T G+DP++++R     DGA
Sbjct: 486 IRASLTLFEAGCGTEWLEWAFSMERVLRDAF-KAEGGAFYQTDGKDPNLIIRQCLFADGA 544

Query: 558 EPSGNSVSVINLVRL 572
           EPSGN+V   NL+R+
Sbjct: 545 EPSGNAVHCENLLRI 559


>gi|344211988|ref|YP_004796308.1| thioredoxin domain-containing protein [Haloarcula hispanica ATCC
           33960]
 gi|343783343|gb|AEM57320.1| thioredoxin domain-containing protein [Haloarcula hispanica ATCC
           33960]
          Length = 717

 Score =  352 bits (903), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 227/693 (32%), Positives = 349/693 (50%), Gaps = 60/693 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+E +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +
Sbjct: 58  CHWCHVMEEESFENEAIAEQLNEHFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
           L+P+ +P   GTYFPPE+K G+PGF  +L+++ D+W   +++ +M   AQ    AIE   
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRAQQWTEAIESDL 177

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
           EA  A+      P++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +
Sbjct: 178 EATPAN------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
              D G+    +    +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  
Sbjct: 229 AHADGGQEDYLT----VVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKE 314
           ++   +L  +       Y+ + R+  ++++R++  P G  FS  DA+S   E      +E
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESVPPEDPDGDSEE 344

Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           G FYVWT ++V D + +   A +F           CD   +++P N F+G  VL      
Sbjct: 345 GLFYVWTPEQVHDAVDDETDADIF-----------CDYYGVTEPGN-FEGATVLAVRKPV 392

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           S  A +     ++    L     + F+ R +RPRP  D+KV+  WNGL+I + A  + +L
Sbjct: 393 SVLAEEYEQSEDEITASLQRALNETFEAREERPRPARDEKVLAGWNGLMIRALAEGAIVL 452

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                                      A SF+R HL+D    RL   +++G     G+L+
Sbjct: 453 DDAYADVA-----------------ADALSFVREHLWDADAERLNRRYKDGDVAIDGYLE 495

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL  G L L+E     + L +A++L     E+F D + G  F T     S++ R +E
Sbjct: 496 DYAFLGRGALTLFEATGNVEHLAFAMDLGQAITEVFWDDDEGTLFFTPTGGESLVARPQE 555

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D + PS   V+V  L+ L+     S  D +   AE  +     R+    +    +  A
Sbjct: 556 LTDQSTPSSTGVAVDLLLSLSHF---SDDDRFETVAERVIRTHADRVSSNPLQHASLTLA 612

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEH 669
            D     + + + LVG +S  D+ +      A   + + ++   PAD    + W    E 
Sbjct: 613 TDTYEQGALE-LTLVGDQS--DYPSEWTETLAQRYVPRRLLAHRPADDTGFEQWLDALEL 669

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
           + +    A      D+     C+NF+CSPP  D
Sbjct: 670 DESPPIWAGREQVDDEPTVYACRNFACSPPKHD 702


>gi|347735180|ref|ZP_08868108.1| hypothetical protein AZA_58766 [Azospirillum amazonense Y2]
 gi|346921671|gb|EGY02301.1| hypothetical protein AZA_58766 [Azospirillum amazonense Y2]
          Length = 686

 Score =  352 bits (902), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 232/684 (33%), Positives = 332/684 (48%), Gaps = 72/684 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE++ ++ L+ND F++IKVDREERPDVD+VY   +  L   GGWPL++F
Sbjct: 59  CHWCHVMAHESFENQAISSLMNDLFINIKVDREERPDVDQVYQQALSLLGQQGGWPLTMF 118

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+P  +P  GGTYFPP  +YGRPGF  +L+ V + + +    ++++    ++ L +AL+ 
Sbjct: 119 LTPKGEPFWGGTYFPPATRYGRPGFPDVLQGVAETYAQDPGKVSRN----VKALGDALAR 174

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            +  N   D +   +L   A++L +  D   GG   APKFP+P    ++     +   T 
Sbjct: 175 LSRGNP-GDAVTVGSLNAVADRLVREVDPFLGGINGAPKFPQPSIFDLLWRAHLRTART- 232

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                 + +  V+ TL  MA GGI+DH+ GGF RYS DE+W VPHFEKMLYD  QL  + 
Sbjct: 233 ------DLRDAVITTLTHMANGGIYDHLAGGFARYSTDEQWLVPHFEKMLYDNAQLVALM 286

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
              +  T+D       R+ + ++  +M  PGG   +  DADS   EG    +EG FYVWT
Sbjct: 287 TQVWQGTRDPLLEVRVRETVGWVLNEMKVPGGAFGATLDADS---EG----EEGRFYVWT 339

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
             E++ +LGE A LF  HY +   GN            ++G  +   LN  +  A     
Sbjct: 340 KAEIDRLLGEDAELFCAHYDVTELGN------------WEGHTI---LNRRTPLA----- 379

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA--ESA 439
           P     N L   R +L   R+ R RP  DDKV+  WNGL+I++ ARA  + +     E+A
Sbjct: 380 PGSAEENRLAHARARLLKARALRIRPGWDDKVLADWNGLMIAALARAGFVFEQPGWIEAA 439

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
           +            Y  V  S       H   +   RL HS R G ++  G L+DYA +  
Sbjct: 440 I----------DAYRHVVTSLG-----HTGRDGLDRLYHSGRGGRARHAGLLEDYANMGK 484

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             L L+E      +L  A    +T D  F D   GGY+ T  +   +L+R +   D A P
Sbjct: 485 AALTLHEITGDVAFLDQAARWTDTLDRHFWDAADGGYYTTADDVGDLLVRPRHAQDNAVP 544

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           +GN   + NL RL  +   +  D YR  A+  ++ F   L      +      A+ L   
Sbjct: 545 AGNGTQLGNLTRLWLL---TGQDRYRAQADTLMSAFSGELGRNFFPLSTFLNMAETLL-- 599

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           +  H VLVG     D E   A   A       V  + P      +  E H +   +M   
Sbjct: 600 NGMHAVLVGEGD--DLEPFNAVLRAQSRPTLVVSRLAPG----QNLPEPHPAAGKAMVDG 653

Query: 680 NFSADKVVALVCQNFSCSPPVTDP 703
                +  A VCQ+  CS PVT P
Sbjct: 654 -----RATAYVCQDMRCSLPVTTP 672


>gi|282889930|ref|ZP_06298465.1| hypothetical protein pah_c008o011 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|338175432|ref|YP_004652242.1| hypothetical protein PUV_14380 [Parachlamydia acanthamoebae UV-7]
 gi|281500123|gb|EFB42407.1| hypothetical protein pah_c008o011 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|336479790|emb|CCB86388.1| uncharacterized protein yyaL [Parachlamydia acanthamoebae UV-7]
          Length = 692

 Score =  352 bits (902), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 218/585 (37%), Positives = 310/585 (52%), Gaps = 63/585 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL     TCHWCHVME ESFE+  VA+ LN+ F++IKVDREE P+
Sbjct: 33  GDEAFLAAKEADKPIFLSVGYATCHWCHVMEQESFENLEVAQALNEAFINIKVDREELPE 92

Query: 59  VDKVYMTYVQALY-GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           VD +YM + Q++  G  GWPL+V L+PDL P    TY PP + +G  G   ++ ++ +AW
Sbjct: 93  VDSLYMEFAQSMMSGAAGWPLNVILTPDLYPFFAATYLPPVNSHGLIGMLELVERIHEAW 152

Query: 118 --DKKRDMLAQSGAFAIEQLSEALS--ASASSNKLPDELPQNALRLCAEQLSKSYDSRFG 173
             D++  +L QS     E++ E        S   LP   P   +    E L K  D   G
Sbjct: 153 QGDERERILMQS-----EKIVEVFEQHVHTSGELLP---PPEVIEKTIEMLIKLADPVNG 204

Query: 174 GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
           G   APKFP   +   +L +S + +D       S    +V  TL+ M +GGI+DH+GGGF
Sbjct: 205 GMKGAPKFPIAYQSVFLLRYSMEKKD-------SRPLFLVERTLEMMRRGGIYDHLGGGF 257

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
            RYSVDE W +PHFEKMLYD   LA+ Y +A+  T++  Y  +C +IL Y+ RDM    G
Sbjct: 258 SRYSVDEAWQIPHFEKMLYDNALLADCYFEAWQATQNPQYKKVCEEILHYVLRDMSHFRG 317

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWT--SKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
             +SAEDADS   EG     EG FY WT    E        + LF  ++ + P GN    
Sbjct: 318 GFYSAEDADS---EG----HEGRFYTWTLEEVEELLGGENESELFVHYFDITPEGN---- 366

Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
                   F+G+NVL         A K+GM  ++   +  E +  L+  R KR  P  DD
Sbjct: 367 --------FEGRNVLHTPLSLEEFAKKMGMDAQQLDLLFTEQKHILWKAREKRVHPFKDD 418

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
           K++ +WNGL+I + A A                    D++ ++  A+++A FI+  L++E
Sbjct: 419 KILTAWNGLMIQAMAEAG---------------CAFCDQR-FLSAAQNSAKFIKAKLWNE 462

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
             H L   +R+  +     LD+YAFLI  LL L+E G GT+WL WA+EL       F   
Sbjct: 463 --HGLLRRWRDDEAMFSAGLDEYAFLIRSLLTLFEAGCGTEWLQWALELNEILKNQF-KA 519

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
             G Y+ T G+D S+++R  +  DGAEPSGN++   NL+RL  + 
Sbjct: 520 LNGAYYQTNGQDLSLVIRKCQFSDGAEPSGNAIQCENLLRLYQLT 564


>gi|10438196|dbj|BAB15192.1| unnamed protein product [Homo sapiens]
          Length = 491

 Score =  352 bits (902), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 209/518 (40%), Positives = 283/518 (54%), Gaps = 48/518 (9%)

Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 271
           M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ D 
Sbjct: 1   MALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDE 60

Query: 272 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 331
           FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+YVWT KEV+ +L E
Sbjct: 61  FYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPE 119

Query: 332 HAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
             +          L  +HY L   GN  +S   DP  E +G+NVL        +A++ G+
Sbjct: 120 PVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGL 177

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
            +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A    +L         
Sbjct: 178 DVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------- 228

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDD 493
                G DR   +  A + A F++RH++D  + RL  +   GP      S  P  GFL+D
Sbjct: 229 -----GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLED 281

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKE 552
           YAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+
Sbjct: 282 YAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKD 341

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
           D DGAEPS NSVS  NL+RL     G K   +       L  F  R++ + +A+P M  A
Sbjct: 342 DQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRA 398

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
                  + K +V+ G + + D + ++   H+ Y  NK +I    AD +   F       
Sbjct: 399 LSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPF 454

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            +++ R     D+  A VC+N +CS P+TDP  L  LL
Sbjct: 455 LSTLRRLE---DQATAYVCENQACSVPITDPCELRKLL 489


>gi|418720670|ref|ZP_13279866.1| PF03190 family protein [Leptospira borgpetersenii str. UI 09149]
 gi|410742944|gb|EKQ91689.1| PF03190 family protein [Leptospira borgpetersenii str. UI 09149]
          Length = 631

 Score =  352 bits (902), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 242/689 (35%), Positives = 351/689 (50%), Gaps = 65/689 (9%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+PD K
Sbjct: 1   MEKESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P+ GGTYFPPE  YGR  F  +L  ++  W +KR  L  + +     L ++    A   +
Sbjct: 61  PITGGTYFPPEPGYGRKSFLEVLNILRKVWSEKRQELIVASSELSRYLKDSGEGRAIEKQ 120

Query: 148 LPDELPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKS 203
               LP          L +S YD+ FGGF +    KFP  + +  +L YH         S
Sbjct: 121 EEGSLPSKDCFNSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYH--------HS 172

Query: 204 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 263
               +  +MV  TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD        ++
Sbjct: 173 SGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTDHRWMVPHFEKMLYDNSLFLETLVE 232

Query: 264 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 323
              ++K +       D++ YL RDM   GG I SAEDADS   EG    +EG FY+W  +
Sbjct: 233 CSQVSKKISAESFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYIWDFE 285

Query: 324 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 383
           E  ++ GE + + ++ + +   GN            F+GKN+L E       A+KL    
Sbjct: 286 EFREVCGEDSRILEKFWNVTNKGN------------FEGKNILHE--SYGGEATKLSEEE 331

Query: 384 EKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 442
            K ++ +L   R KL + RSKR RP  DDK++ SWNGL I + A+A              
Sbjct: 332 WKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWNGLYIKALAKAG------------- 378

Query: 443 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 502
              +   R++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +IS  +
Sbjct: 379 ---IAFRREDFLKLAEETYSFIERNLIDPDG-RILRRFRDGESGILGYSNDYAEMISSSI 434

Query: 503 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 561
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 435 VLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEPSA 492

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
           NS    +LV+L+  + G  S  YR+ AE   + F   L   +++ P +  A       S 
Sbjct: 493 NSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTKELSTHSLSYPHLLSAYWTYRYHS- 549

Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNF 681
           K +VL+  K +   +++LAA    +  +     ++  + EE           +++  +  
Sbjct: 550 KEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNENELEEA-------RKLSALFDSRD 601

Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLL 710
           S    +  VC+NFSC  PV++   L+  +
Sbjct: 602 SGGNALVYVCENFSCKLPVSNLADLQKWI 630


>gi|389847202|ref|YP_006349441.1| hypothetical protein HFX_1748 [Haloferax mediterranei ATCC 33500]
 gi|448614853|ref|ZP_21663881.1| hypothetical protein C439_01752 [Haloferax mediterranei ATCC 33500]
 gi|388244508|gb|AFK19454.1| highly conserved protein containing a thioredoxin domain [Haloferax
           mediterranei ATCC 33500]
 gi|445752940|gb|EMA04359.1| hypothetical protein C439_01752 [Haloferax mediterranei ATCC 33500]
          Length = 703

 Score =  352 bits (902), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 232/698 (33%), Positives = 350/698 (50%), Gaps = 76/698 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPEIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
           V+L+P  KP   GTYFPPE + G PGF+ ++    ++W   RD +   A+    AI ++L
Sbjct: 113 VWLTPQGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDRDEIENRAEQWTHAITDRL 172

Query: 136 SEALSASASS--NKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
            E    +  +  +++ D+  Q ALR        + D   GGFGS  PKFP+P  I  +L 
Sbjct: 173 EETPDTTGETPGSEILDQTVQAALR--------AADRDHGGFGSGGPKFPQPGRIDALL- 223

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
             +    TG+     +   + +  L  MA GG+ DH+GGGFHRY VD +W VPHFEKMLY
Sbjct: 224 --RGYAITGR----RQALDVAVEALDAMANGGLRDHLGGGFHRYCVDRQWTVPHFEKMLY 277

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           DQ  LA+ YLDA+ LT +  Y+ + R+  +++RR++    G  F+  DA S         
Sbjct: 278 DQAGLASRYLDAYRLTGNESYATVARETFEFVRRELSHDDGGFFATLDAQSG-------G 330

Query: 313 KEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           +EG FYVWT ++V   L E  A LF + Y + P GN            F+ K  ++ ++ 
Sbjct: 331 EEGTFYVWTPEDVRSHLPELEADLFCDRYGVTPGGN------------FENKTTVLNVSA 378

Query: 372 SSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
           ++A  A +  +   +    L E   +LF  R+ R RP  D+KV+  WNGL+IS+FA+ + 
Sbjct: 379 TTADLAEEYDLTESEVEERLEEAHEELFAARTDRERPARDEKVLAGWNGLMISAFAQGAV 438

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
            L  ++                  + A  A  F+R HL+DE +  L     NG  K  G+
Sbjct: 439 ALTDDS----------------LADDARRALDFVREHLWDEASETLSRRVMNGEVKGDGY 482

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYAFL  G  DLY+     + L +AI+L       F D   G  + T     +++ R 
Sbjct: 483 LEDYAFLARGAFDLYQATGDLEPLSFAIDLARATHREFYDDAAGTLYFTPESGEALVTRP 542

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           +E  D + PS   V+    + L      +    +   A+  L  F  R++   +    + 
Sbjct: 543 QEATDQSTPSSLGVATSLFLDLEHFAPDAG---FGDAADAVLESFANRVRGSPLEHVSLV 599

Query: 611 CAADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-- 666
            AA+  +  VP    + +   +   ++   +A+ +    L   V+   PA  +E+D W  
Sbjct: 600 LAAEKAASGVP---ELTVAADEMPDEWRETIASRY----LPGLVVSRRPATDDELDAWLD 652

Query: 667 --EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
             E   +     AR     +  V   C+NF+CS P  D
Sbjct: 653 ELELDEAPPIWAAREATDGEPTV-YACENFTCSAPTHD 689


>gi|300710941|ref|YP_003736755.1| hypothetical protein HacjB3_07890 [Halalkalicoccus jeotgali B3]
 gi|448296966|ref|ZP_21487016.1| hypothetical protein C497_14832 [Halalkalicoccus jeotgali B3]
 gi|299124624|gb|ADJ14963.1| hypothetical protein HacjB3_07890 [Halalkalicoccus jeotgali B3]
 gi|445580643|gb|ELY35021.1| hypothetical protein C497_14832 [Halalkalicoccus jeotgali B3]
          Length = 709

 Score =  351 bits (901), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 235/688 (34%), Positives = 343/688 (49%), Gaps = 55/688 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE +AK LN+ FV IKVDREERPD+D +Y T  Q +   GGWPLS
Sbjct: 51  SACHWCHVMEEESFEDEDIAKQLNENFVPIKVDREERPDLDSIYQTICQLVTRRGGWPLS 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PD +P   GTYFP E + G PGF  +L  + ++W+  R+ +        +Q + A+
Sbjct: 111 VWLTPDGRPFYVGTYFPRESRRGTPGFGDLLGNLAESWEGDREEIENRA----DQWTRAI 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKLE 198
           +          E P+  L   A+   +  D   GGFG + PKFP+   ++++L   +  +
Sbjct: 167 TDQLEEVPEAGERPEGVLIEAADAALRGADREHGGFGQNGPKFPQTARLEVLL---RAYD 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+        ++V  TL  M   G++D +GGGFHRY+ D  W VPHFEKMLYD  +L 
Sbjct: 224 RTGR----GPYDEVVRETLDAMGSRGMYDQLGGGFHRYATDREWVVPHFEKMLYDNAELP 279

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL  + +T    Y+ I R+ L ++ R++  P G  +S  DA S + E   R +EGAFY
Sbjct: 280 RSYLAGYRVTGQERYARIVRETLAFVERELGHPDGGFYSTLDAQSEDPETGER-EEGAFY 338

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT   VE++L  E A LF E Y +   GN            F+GK VL       + A 
Sbjct: 339 VWTPAAVEEVLDEERAALFCERYGVDKRGN------------FEGKTVLTLARSVGSLAE 386

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           + G+  ++  + L E  R+LF+ R +RPRP  D+KV+  WNGL+ISSFA A   L     
Sbjct: 387 EYGLDEDEVEDRLVEAERRLFEAREERPRPRRDEKVLAGWNGLMISSFAEAGLTLD---- 442

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                    GS    Y + A  A  F+R  L+D +  RL   F++   K  G+L+DYAFL
Sbjct: 443 ---------GS----YAKRAAEALEFVREQLWDTEGKRLSRRFKDREVKIDGYLEDYAFL 489

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G  D Y+     + L +A++L    +  F D E    + T      ++ R +E +D +
Sbjct: 490 ARGAFDTYQATGDVEHLKFALDLARAIEREFWDEERETLYFTPEAGEELVARPQELNDQS 549

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PS   V+   L+ L+          +    E  LA    R++   +    +   AD   
Sbjct: 550 TPSSLGVACDVLLSLSQFADAD----FEGIVERVLARHGDRIRGNPLEHATLALVADRFE 605

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHNSNNAS- 675
             S + V +       ++   L  A+    L   V+   P   E ++ W +E     A  
Sbjct: 606 NGSLE-VTVAADVLPTEWRERLGEAY----LPGRVLARRPPTEEGLEGWLDELGLEEAPP 660

Query: 676 -MARNNFSADKVVALVCQNFSCSPPVTD 702
             A       +  A VC++F+CSPPVTD
Sbjct: 661 IWADREAREGEATAYVCRSFTCSPPVTD 688


>gi|448677622|ref|ZP_21688812.1| thioredoxin [Haloarcula argentinensis DSM 12282]
 gi|445773297|gb|EMA24330.1| thioredoxin [Haloarcula argentinensis DSM 12282]
          Length = 717

 Score =  351 bits (900), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 225/696 (32%), Positives = 346/696 (49%), Gaps = 66/696 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+E +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +
Sbjct: 58  CHWCHVMEEESFENEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEAL 139
           L+P+ +P   GTYFPPE+K G+PGF  +L+++  +W   ++R+ +        E +   L
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLSGSWSDPEQREEMENRARQWTEAIESDL 177

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
            A+ +    P++  ++ ++       +  D + GG+GS  PKFP+   +  +L       
Sbjct: 178 EATPAD---PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL------- 227

Query: 199 DTGKSGEASEGQK----MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
                  A  GQ+    +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD 
Sbjct: 228 ----RAHAGGGQEDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDN 283

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA---ETEGATR 311
            ++   +L  +       Y+ + R+  ++++R+M  P G  FS  DA+SA   E EG T 
Sbjct: 284 AEIPRAFLAGYQAIGSERYASVVRETFEFVQREMQHPEGGFFSTLDAESAPIDEPEGET- 342

Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            +EG FYVWT ++V + + +   A +F +++ +   GN            F+G  VL   
Sbjct: 343 -EEGLFYVWTPEQVHEAVDDETDAEIFCDYFGVTERGN------------FEGATVLAVR 389

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
              S  A +     ++    L     + F+ R  RPRP  D+KV+  WNGL+I + A  +
Sbjct: 390 KPVSVLAEEYDQSEDEITGSLQRALNEAFEARENRPRPARDEKVLAGWNGLMIRTLAEGA 449

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L                           A SF+R +L+D+   RL   +++G     G
Sbjct: 450 IVLDDAYADVA-----------------ADALSFVREYLWDDDAGRLNRRYKDGDVAIDG 492

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           +L+DYAFL  G L L+E     + L +A++L     E F D E G  F T     S++ R
Sbjct: 493 YLEDYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVAR 552

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            +E  D + PS   V+V  L+ L+     S  D +   AE  +     R+    +    +
Sbjct: 553 PQELTDQSTPSSTGVAVDLLLSLSHF---SDDDRFESVAERVIRTHADRVSSNPLQHASL 609

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             A D     + + + LVG +S  D+        A   + + ++   PAD +  + W + 
Sbjct: 610 TLATDTYEQGALE-LTLVGDQS--DYPTEWTETLAERYVPRRLLAHRPADEDRFEQWLDT 666

Query: 670 NSNNAS---MARNNFSADKVVALVCQNFSCSPPVTD 702
              N S    A      D+     C+NF+CSPP  D
Sbjct: 667 LGLNESPPIWAGRTQVDDRPTVYACRNFACSPPKHD 702


>gi|297621186|ref|YP_003709323.1| thymidylate kinase [Waddlia chondrophila WSU 86-1044]
 gi|297376487|gb|ADI38317.1| putative thymidylate kinase [Waddlia chondrophila WSU 86-1044]
          Length = 691

 Score =  350 bits (898), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 212/555 (38%), Positives = 294/555 (52%), Gaps = 53/555 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLS 79
           TCHWCHVME ESF++  VA+ LN  F++IKVDREE P+VD++YM + QAL     GWPL+
Sbjct: 55  TCHWCHVMEEESFQNLEVAEQLNRAFINIKVDREELPEVDQLYMDFAQALMPNSAGWPLN 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
           VFL+PDL P    TY PP +  G PG   +++ + + W  K  D +       ++   + 
Sbjct: 115 VFLTPDLLPFFATTYLPPRNASGLPGMIDLIQHIHELWIGKGHDQILMQAQQIVDLFQQN 174

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +        LPD   +  + L  + L +  D  +GG   APKFP   +  + L H   LE
Sbjct: 175 IQVYGID--LPD---RKCVPLAVDTLLQISDPVWGGVKGAPKFPIGYQY-VFLMHYSALE 228

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
             G+         +V  TL+ M +GGI+DH+G GF RYS+DE+W +PHFEKMLYD   LA
Sbjct: 229 KDGRP------MFLVEKTLELMYRGGIYDHLGSGFSRYSIDEQWQIPHFEKMLYDNALLA 282

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y +A+  TK   +  +C +++DY+   + G  G   SAEDADS   EG     EG FY
Sbjct: 283 ECYCEAWKATKRSLHRRVCCEVIDYVLSKLTGEQGAFLSAEDADS---EGV----EGKFY 335

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
            WT  E++D+LG + + LF   Y     GN            F+GKN+L         AS
Sbjct: 336 TWTMDEIDDVLGSDDSELFCSVYGATAIGN------------FEGKNILHLPALLEHYAS 383

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
              M   +    + E + KL+ VR KR  P  DDKV+ SWNGL+I S   A K  +    
Sbjct: 384 DNQMDHFELEARIAELKEKLYKVREKRGHPLKDDKVLSSWNGLMIHSIVEAGKAFEI--- 440

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y++    AA FI  HL+  +  RL   +R G     G LDDYAF+
Sbjct: 441 -------------SRYVDAGRRAARFIYGHLW--KNGRLLRRYREGKVDFSGGLDDYAFM 485

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I   L L+E G GT+WL WA  ++    + F   EGG ++ T G+DP++++R     DGA
Sbjct: 486 IRASLTLFEAGCGTEWLEWAFSMERVLRDAF-KAEGGAFYQTDGKDPNLIIRQCLFADGA 544

Query: 558 EPSGNSVSVINLVRL 572
           EPSGN+V   NL+R+
Sbjct: 545 EPSGNAVHCENLLRI 559


>gi|448738600|ref|ZP_21720623.1| hypothetical protein C451_13731 [Halococcus thailandensis JCM
           13552]
 gi|445801484|gb|EMA51818.1| hypothetical protein C451_13731 [Halococcus thailandensis JCM
           13552]
          Length = 709

 Score =  350 bits (898), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 226/689 (32%), Positives = 341/689 (49%), Gaps = 55/689 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF+D  VA+ LN+ FV IKVDREERPD+D++Y T    + G GGWPLS
Sbjct: 51  SACHWCHVMADESFDDPAVAEQLNEEFVPIKVDREERPDLDRLYQTVAAMVSGRGGWPLS 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PD +P   GTYFP E K G+PGF  +L  + D+W+ +R+ +        +Q ++A+
Sbjct: 111 VWLTPDGRPFYVGTYFPREAKRGQPGFLDLLDSIADSWNDEREDIESRA----DQWADAM 166

Query: 140 SAS-ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +     +   P E+    L   A++     D   GGFG   KFP+   + +++   +  E
Sbjct: 167 AGELEGTPDTPGEVSPGLLETAAQRAVSEADREHGGFGRGQKFPQTGRLHLLM---QAHE 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+       +++ +  L  +A GG+ DH GGGFHRY  D  W VPHFEKMLYD  +L 
Sbjct: 224 RTGRDA----FREVAVEALDAIADGGLRDHAGGGFHRYVTDREWTVPHFEKMLYDNAELV 279

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL  + LT +  Y+ I R+ L ++ R++  P G  FS  DA S     +   +EGAFY
Sbjct: 280 RAYLAGYRLTGEERYAEIARETLGFVERELRHPDGGFFSTLDAQSEGE--SGEHEEGAFY 337

Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT +EV + + +   A LF E Y +   GN +            GK VL         A
Sbjct: 338 VWTPQEVHEAVDDEFAADLFCERYGITEAGNFE-----------NGKTVLTIDTTIDGLA 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            + G   E+    L   R  +F  R+ R RP  D+K++  WNGL+IS+FA A   L    
Sbjct: 387 DEHGTTTEEIEADLERAREAIFAARADRERPARDEKILAGWNGLMISAFAEAGLALD--- 443

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                         + Y E A +A  F+   L+DE   +L   F++G  K  G+L+DYAF
Sbjct: 444 --------------ETYSETAVAALGFVHEQLWDEDEQQLARRFKDGEVKIDGYLEDYAF 489

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L  G L+ YE       L +A++L       F D E G  + T     S++ R +E  D 
Sbjct: 490 LARGALNCYEATGEVAQLEFALDLGRAIVREFFDGEEGTLYFTPRSGESLVARPQELDDQ 549

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           + PS   V+V  L+ L+     +  + +   AE  L      ++   +    +  AAD  
Sbjct: 550 STPSSTGVAVDTLLALSQF---APDEEFEDVAETVLETHAESIEASPLRRASLALAADRH 606

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS- 675
           +  S + + +V  +   ++   +  A+    L K ++   P+   E+D W +  S + + 
Sbjct: 607 TAGSLE-LTVVADELPGEWRERIGRAY----LPKRLLARRPSTNAELDDWLDRLSVDDAP 661

Query: 676 --MARNNFSADKVVALVCQNFSCSPPVTD 702
              A       +  A VC+ F+CSPP T+
Sbjct: 662 PIWAERTGEDGEPTAYVCRAFTCSPPQTE 690


>gi|448502781|ref|ZP_21612730.1| hypothetical protein C464_11620 [Halorubrum coriense DSM 10284]
 gi|445693844|gb|ELZ45985.1| hypothetical protein C464_11620 [Halorubrum coriense DSM 10284]
          Length = 745

 Score =  350 bits (897), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 251/747 (33%), Positives = 349/747 (46%), Gaps = 106/747 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA ++ND FV IKVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAAVVNDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
            + +P+ KP   GTYFPPE +  +PGF+ +  ++ D+W          ++ D   QS   
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRRNQPGFRGLCERIADSWSDPEQREEMKRRADQWTQSARD 172

Query: 131 AIEQLSEALSASAS--SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEI 187
            +E +       AS   + L D     ALR         YD  +GGFGS   KFP P  I
Sbjct: 173 ELESVPTPAEGDASPPGSDLLDTAAAAALR--------GYDEEYGGFGSGGAKFPMPGRI 224

Query: 188 QMMLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWH 243
            +++              A  G+  +L     TL  MA GG++D VGGGFHRY+VD +W 
Sbjct: 225 DLLM-----------RAYAGRGRDALLSAATGTLDGMADGGMYDQVGGGFHRYAVDRQWT 273

Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA-- 301
           VPHFEKMLYD  +L   YLD + LT D  Y+ +  + L +L R++   GG  FS  DA  
Sbjct: 274 VPHFEKMLYDNAELPMAYLDGYRLTGDPRYARVASESLAFLDRELRHEGGGFFSTLDARS 333

Query: 302 --------DSAETEGATRKK--------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKP 344
                   DS   E A            EGAFYVWT +EV+ +L E A  L K+ Y ++ 
Sbjct: 334 RRPASRGSDSEADEEADVDAGNVGGDDVEGAFYVWTPEEVDAVLDEPAASLAKDRYGIRS 393

Query: 345 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 404
            GN +           +G  V          A+   +  E     L E R  LFD R  R
Sbjct: 394 GGNFE-----------RGTTVPTIAASVEGLAADRDLSPEAVRETLVEARTALFDARESR 442

Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
           PRP  D+KV+ SWNG  IS+FARA   L                  + Y E+A  A  F 
Sbjct: 443 PRPARDEKVLASWNGRAISAFARAGDSLG-----------------EPYAEIAREALDFC 485

Query: 465 RRHLYDEQTHRLQHSFR--NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN 522
           R  LYD        + R  +G  + PG+LDDYAFL  G LD Y      + L +A++L  
Sbjct: 486 RERLYDADADAGALARRWLDGDVRGPGYLDDYAFLARGALDTYAATGDPEPLGFALDLAG 545

Query: 523 TQDELFLDREGGGYFNT------TGEDPS----VLLRVKEDHDGAEPSGNSVSVINLVRL 572
              E F D + G  + T      T +D +    ++ R +E  D + PS   V+   L  L
Sbjct: 546 ALVEEFYDADDGTIYFTRDLDDGTADDRADAGPLIARPQEFTDRSTPSSLGVAAETLALL 605

Query: 573 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 632
               A  +   +R+ AE  +     R++   +    +  AAD++       V +   +  
Sbjct: 606 DGFRADGE---FREIAERVVTTHGDRIRGSPLEHASLVRAADLVET-GGIEVTIAAAEVP 661

Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVAL 689
            ++   L   +    L   ++   P     +D W +      +    A  + +  +  A 
Sbjct: 662 REWRETLGERY----LPGALVAPRPLTETGLDEWLDRLGMAEAPPIWADRDATDGEPTAY 717

Query: 690 VCQNFSCSPPVTD-PISLENLLLEKPS 715
           VC+ F+CSPP TD   +LE L   +PS
Sbjct: 718 VCEGFTCSPPRTDLDAALEWLETREPS 744


>gi|448469568|ref|ZP_21600250.1| hypothetical protein C468_14982 [Halorubrum kocurii JCM 14978]
 gi|445808905|gb|EMA58956.1| hypothetical protein C468_14982 [Halorubrum kocurii JCM 14978]
          Length = 740

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 239/721 (33%), Positives = 349/721 (48%), Gaps = 86/721 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE +A +LND FV +KVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESIAAVLNDEFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
            + +P+ +P   GTYFPPE +  +PGF+ +  ++ D+W         +++ D    S   
Sbjct: 113 AWCTPEGEPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMERRADQWTTSARD 172

Query: 131 AIEQLSE-ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQ 188
            +E + + +L+  A  ++ P     N L   A    + YD  +GGFGS   KFP P  I 
Sbjct: 173 ELESVPDPSLAGDAGGSEAPG---PNLLDEAAAAAVRGYDDEYGGFGSGGAKFPMPGRID 229

Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
           +++   +    TG+    +        TL  MA+GG++D +GGGFHRY+VD +W VPHFE
Sbjct: 230 VLM---RAYARTGRDAALT----AATGTLDGMARGGMYDQIGGGFHRYAVDRQWTVPHFE 282

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS----- 303
           KMLYD  +L   YLDA  LT D  Y+ +  + L ++ R++    G  FS  DA S     
Sbjct: 283 KMLYDNAELPMAYLDAHRLTGDASYARVASETLGFIDRELRHDDGGFFSTLDARSRPPES 342

Query: 304 ----AETEGATRKK-----EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRM 353
               A ++G+   +     EGAFYVWT  EV+  L E A  L KE Y +   GN +    
Sbjct: 343 RRGNAGSDGSDAAEDVADVEGAFYVWTPGEVDAALDEPAASLAKERYGIASGGNFE---- 398

Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
                  +G  V          A +  M        L   R  LF+ R  RPRP  D+KV
Sbjct: 399 -------RGTTVPTIAASVPELADQRDMSTADVREALTAARVALFEARESRPRPARDEKV 451

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
           + SWNG  IS+FA A ++L                  K Y ++A  A +F R  LYDE+T
Sbjct: 452 LASWNGRAISAFAAAGQVLG-----------------KPYADIASDALAFCRERLYDEET 494

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
             L   + +G  + PG+LDD+AFL  G LD Y        L +A++L  T    F D + 
Sbjct: 495 GGLARRWLDGDVRGPGYLDDHAFLARGALDAYSATGDPAALGFALDLAETVVSDFYDADD 554

Query: 534 GG-YFN------TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YR 585
           G  YF       T   D ++  R +E  D + PS   V+   L    +++ G ++D  + 
Sbjct: 555 GTIYFTRDPDEETEQGDDTLFARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDREFA 610

Query: 586 QNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD-FENMLAAAHA 644
             AE  +     R++   +    +  AAD ++  S    V V   +  D +   LA  + 
Sbjct: 611 DVAERVVTTHADRIRASPLEHVSLVRAADRVA--SGGIEVTVAADAVPDAWRETLAERY- 667

Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVT 701
              L   ++   P   + +  W +    + +    A  +    +  A VC+  +CSPP T
Sbjct: 668 ---LPGALVAPRPPTEDGLAAWLDRLGMDEAPPIWADRDAVDGEPTAYVCEGRTCSPPET 724

Query: 702 D 702
           D
Sbjct: 725 D 725


>gi|448491519|ref|ZP_21608359.1| hypothetical protein C463_07017 [Halorubrum californiensis DSM
           19288]
 gi|445692519|gb|ELZ44690.1| hypothetical protein C463_07017 [Halorubrum californiensis DSM
           19288]
          Length = 746

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 242/729 (33%), Positives = 348/729 (47%), Gaps = 96/729 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA ++ND FV +KVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAGVVNDSFVPVKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
            + +P+ KP   GTYFPPE +   PGF+ +  ++ D+W          ++ D   QS   
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWTQSARD 172

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRF-GGFGSAPKFPRPVEIQM 189
            +E +        S  +       + L   A    + YD  + G  G   KFP P  I +
Sbjct: 173 ELESVPNP-DTPGSDGEAASPPGDDLLDTAAAAALRGYDEEYGGFGGGGAKFPMPGRIDL 231

Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
           ++              A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VP
Sbjct: 232 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 280

Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA---- 301
           HFEKMLYD  +L   YLD + L+ D  Y+ +  + L +L R++   GG  FS  DA    
Sbjct: 281 HFEKMLYDNAELPMAYLDGYRLSGDPAYARVAGESLAFLDRELRHEGGAFFSTLDARSRP 340

Query: 302 --------DSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSR 352
                   DS E +G     EGAFYVWT +EV+ +L E A  L K+ Y ++  GN +   
Sbjct: 341 PESRRDGSDSDEGDGEG-DVEGAFYVWTPEEVDAVLDEPAASLAKKRYGIRSGGNFE--- 396

Query: 353 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 412
                   +G  V          A+   +  EK   IL E R  LFD R  RPRP  D+K
Sbjct: 397 --------RGTTVPTLAASVEELAADRDLSPEKVREILTEARTTLFDARESRPRPARDEK 448

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD-- 470
           V+ SWNG  IS+FARA   L                  +EY E+A  A  F    LYD  
Sbjct: 449 VLASWNGRAISAFARAGDTLG-----------------EEYAEIAREALDFCHERLYDAE 491

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT-QDELF- 528
            +T  L   + +G  + PG+LDDYAFL  G LD+Y      + L +A+EL +   DE + 
Sbjct: 492 NETGALARRWLDGDVRGPGYLDDYAFLARGALDVYAATGDPEPLGFALELADALVDEFYD 551

Query: 529 -----------LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 577
                      LD EG G  +   +   ++ R +E  D + PS   V+   L    +++ 
Sbjct: 552 ADDGTIYFTRDLDGEGAGGGSRNADSGPLIARPQEFTDRSTPSSLGVAAETL----ALLD 607

Query: 578 GSKSD-YYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFE 636
           G ++D  +R+ AE  L     R++   +    +  AAD++       V +   +   ++ 
Sbjct: 608 GFRTDGEFREIAERVLTTHADRIRGSPLEHASLVRAADVVET-GGIEVTIAADEVPDEWR 666

Query: 637 NMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQN 693
             L   +    L   ++   PA  + +D W +      +    A  + +  +  A VC+ 
Sbjct: 667 ETLGERY----LPGALVAPRPATEDGLDAWLDALGMAEAPPIWADRDATDGEPTAYVCEG 722

Query: 694 FSCSPPVTD 702
           F+CSPP TD
Sbjct: 723 FTCSPPRTD 731


>gi|448439398|ref|ZP_21588039.1| hypothetical protein C471_00950 [Halorubrum saccharovorum DSM 1137]
 gi|445691449|gb|ELZ43640.1| hypothetical protein C471_00950 [Halorubrum saccharovorum DSM 1137]
          Length = 751

 Score =  348 bits (894), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 250/734 (34%), Positives = 355/734 (48%), Gaps = 101/734 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA +LN+ FV +KVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAAVLNEEFVPVKVDREERPDVDSAFMTVSQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
            + +P+ +P   GTYFPPE +  +PGF+ +  ++ D+W          ++ D    S   
Sbjct: 113 AWCTPEGEPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMQRRADQWTTSARD 172

Query: 131 AIEQLSEALSASAS-------SNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSA-PKF 181
            +E + +A +  A        ++    E P  + L   A    + YD  +GGFGS   KF
Sbjct: 173 ELESVPDAEAGPAGGADDAGGTDGADGEAPGPDLLDEAAAAAIRGYDDEYGGFGSGGAKF 232

Query: 182 PRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
           P P  I +++    +   TG+    +        TL  MA+GG++D +GGGFHRY+VD +
Sbjct: 233 PMPGRIDVLMRAYAR---TGRDAALT----AATGTLDGMARGGMYDQIGGGFHRYAVDRQ 285

Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
           W VPHFEKMLYD  +L   +LDA  LT D  Y+ +  + L +L R++    G  FS  DA
Sbjct: 286 WTVPHFEKMLYDNAELPMAFLDAARLTGDASYARVASETLGFLDRELRHDDGGFFSTLDA 345

Query: 302 DSAETEGATRKK----------------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKP 344
            S   E  TR+                 EGAFYVWT  EV+ +L E A  L KE Y ++ 
Sbjct: 346 RSRPPE--TRRGGVGSDGSDGSGHAADVEGAFYVWTPGEVDAVLDEPAASLAKERYGIES 403

Query: 345 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 404
            GN +           +G  V          A    M  E     L E R  LF+ R  R
Sbjct: 404 GGNFE-----------RGTTVPTVAASIEELADDHDMSPEAVREALTEARVALFEARESR 452

Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
           PRP  D+KV+ SWNG  IS+FA A ++L                  + Y ++A  A +F 
Sbjct: 453 PRPARDEKVLASWNGRAISAFAAAGQVLG-----------------EPYADIAGDALAFC 495

Query: 465 RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
           R +LYDE T  L   + +G  + PG+LDD+AFL  G LD+Y        L +A++L  T 
Sbjct: 496 RENLYDESTGDLARRWLDGDVRGPGYLDDHAFLARGALDVYAATGDPDALGFALDLAETV 555

Query: 525 DELFLDREGGGYFNT------TGED--PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
              F D E G  + T       GED   ++  R +E  D + PS   V+   LV    ++
Sbjct: 556 VADFYDDEDGTIYFTRDPDEAAGEDGDDTLFARPQEFTDRSTPSSLGVAAETLV----LL 611

Query: 577 AGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL----MCCAADMLSVPSRKHVVLVGHKSS 632
            G ++D  R+ AE + AV  T   D   A PL    +  AAD ++  S    V V  +S 
Sbjct: 612 DGFRTD--REFAEVAEAVVTTH-ADRIRASPLEHVSLVRAADRVA--SGGIEVTVAAESV 666

Query: 633 VD-FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNNASMARNNFSADKVVA 688
            D +   L   +    L   ++   P   + +  W +    +      A  + +  +  A
Sbjct: 667 PDAWRETLGERY----LPGALVAPRPPTEDGLAVWLDRLDMDEAPPVWADRDAADGEPTA 722

Query: 689 LVCQNFSCSPPVTD 702
            VC+  +CSPP TD
Sbjct: 723 YVCEGRTCSPPETD 736


>gi|118579433|ref|YP_900683.1| hypothetical protein Ppro_0998 [Pelobacter propionicus DSM 2379]
 gi|118502143|gb|ABK98625.1| protein of unknown function DUF255 [Pelobacter propionicus DSM
           2379]
          Length = 705

 Score =  348 bits (894), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 223/697 (31%), Positives = 331/697 (47%), Gaps = 78/697 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLS 79
           TCHWCHVM  ESFED  VA ++N   + +KVDREERPD+D +YMT  + L G G GWPL+
Sbjct: 80  TCHWCHVMARESFEDPEVAAIINRHLIPVKVDREERPDIDSLYMTAARILTGSGAGWPLT 139

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+P+ KP    TY P     G  G    + K+ + W+  RD++ ++    +  L E +
Sbjct: 140 IFLTPERKPFYCATYIPKTGSNGVLGIVETVEKISEIWNTNRDLINENSDTVVRALREIV 199

Query: 140 ---SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
              SA     ++ DE            L   YD   GGFG   KFP P  +  +L   ++
Sbjct: 200 APVSADTDFGRVLDE--------AQASLQGMYDYLNGGFGGGAKFPLPHNLSFLLRMWRR 251

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            ++        + ++MV +TL+ M  GGI+D +G GFHRY+VD  W VPHFEKMLYDQ  
Sbjct: 252 TQN-------QDIEEMVAYTLRMMRDGGIYDQLGFGFHRYAVDPEWRVPHFEKMLYDQAL 304

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           +A   L+AF    D F   +  +I  ++  ++  P G   S   ADS          EG 
Sbjct: 305 IAITCLEAFQAYGDEFLKDMAMEIFSFVFDELTSPDGGFCSGLGADSG-------GGEGY 357

Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           +Y+W+  E++  L GE + LF E + +  TGN            F+G N+L +    +  
Sbjct: 358 YYLWSRGEIDRNLDGETSRLFCEAFGVTDTGN------------FEGGNILYQPRSVALL 405

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A + G+   +    L   R KL +VR++R RP  D+K++V+WNGL++++ AR + +    
Sbjct: 406 ARENGLDAGELDRRLETARAKLLEVRAERVRPFRDEKILVAWNGLMVAALARGAAV---- 461

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                       S  +  +E A SA  FI R+L+     RL  S+    +  P FL+DYA
Sbjct: 462 ------------SGEQRLLEAARSAVRFIARNLH-TPAGRLLRSYHQSVASVPAFLEDYA 508

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FL  G+++LY+       L  A+ L     +LF D   G +++T  E   VL+R+K  HD
Sbjct: 509 FLCWGMVELYQVDGDPVMLQGALGLARGMLDLFSDAVTGAFYDTASEAEQVLVRMKNAHD 568

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA PSGNS++ + L++L  I      +      E  L  +   L +  +A   M  A D 
Sbjct: 569 GAIPSGNSIACLCLLKLGKICG---DEALTHAGERCLVSWMGSLAEQPIAHIQMVTALDF 625

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
              P  + + L+G +       +L   H  +     +      D   M            
Sbjct: 626 FLGPDVE-ITLIGDRDKPGVRELLNVIHRYFIPGLVLRFKGDGDVYPM------------ 672

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
                       A VC   +C PPV D   LE LL E
Sbjct: 673 ------VGGLPTAYVCARGACRPPVNDAAQLEQLLSE 703


>gi|386856660|ref|YP_006260837.1| hypothetical protein DGo_CA1452 [Deinococcus gobiensis I-0]
 gi|380000189|gb|AFD25379.1| hypothetical protein DGo_CA1452 [Deinococcus gobiensis I-0]
          Length = 680

 Score =  348 bits (893), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 207/545 (37%), Positives = 281/545 (51%), Gaps = 46/545 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE  A  +N  FV+IKVDREERPD+D VYM   QAL G GGWP++
Sbjct: 47  STCHWCHVMAHESFEDEATAAQMNAGFVNIKVDREERPDIDAVYMAATQALTGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-MLAQSGAFAIEQLSEA 138
           VFL+PD +P   GTYFPP +  G P F  +L  V  AW  +RD ML  +     + L+  
Sbjct: 107 VFLTPDAEPFYAGTYFPPREGLGMPSFGRVLGSVSGAWTTQRDKMLGNA-----QALTAH 161

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +  +++  +  D LP  A  L  E L + YD+  GGFG APKFP P  +  +L  S    
Sbjct: 162 IQEASAPRRGEDPLPDGATGLAVEHLRRVYDADLGGFGGAPKFPSPATLDFLLTQSA--- 218

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                     G+ M L TL+ M  GGIHD +GGGFHRYSVD +W VPHFEKMLYD  QLA
Sbjct: 219 ----------GRDMALHTLRRMGAGGIHDQLGGGFHRYSVDAQWLVPHFEKMLYDNAQLA 268

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
              L AF ++ D  ++ + R  L YL R+M+   G  FSA+DAD+    G     EG  +
Sbjct: 269 RTLLRAFQVSGDGAFADLARTTLGYLEREMLSAEGGFFSAQDADTPTDHGGV---EGLTF 325

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASAS 377
            WT  E+ ++LG           L+  G  +     DPH  E+  +NVL      S    
Sbjct: 326 TWTPAEIREVLGAGG---DTDLALRAYGVTEEGNFLDPHRPEYGRRNVLHLPTPVSQLTR 382

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
            LG  +   L             R++   P  DDKV+ SWNGL +++FA A+++L     
Sbjct: 383 DLGPDVPTRLEAARAHLLAARQARTQ---PGTDDKVLTSWNGLALAAFADAARVLGD--- 436

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                         + +EVA   A F+RR L       L+H++++G ++  G L+D+   
Sbjct: 437 -------------TQLLEVARRNADFVRRELRLPDG-TLRHTYKDGQARVEGLLEDHVLY 482

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             GL+ L++ G     L WA EL       F D E G + +  G   ++L R  +  D A
Sbjct: 483 ALGLVALFQAGGDLAHLHWARELWTVVRRDFWDAEAGVFHSAGGRAETLLTRQAQGFDSA 542

Query: 558 EPSGN 562
             S N
Sbjct: 543 ILSDN 547


>gi|257051594|ref|YP_003129427.1| hypothetical protein Huta_0507 [Halorhabdus utahensis DSM 12940]
 gi|256690357|gb|ACV10694.1| protein of unknown function DUF255 [Halorhabdus utahensis DSM
           12940]
          Length = 717

 Score =  348 bits (892), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 205/568 (36%), Positives = 298/568 (52%), Gaps = 48/568 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A +LN+ FV IKVDREERPDVD++Y T  Q L   GGWPLSV
Sbjct: 54  ACHWCHVMAEESFEDEATAAVLNENFVPIKVDREERPDVDRIYQTLAQLLGQQGGWPLSV 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           +L+PD +P   GTYF P+ + GRPGF  +L  +K+ W+  RD + Q      + +S  L 
Sbjct: 114 WLTPDGRPFYVGTYFAPDSRGGRPGFADLLEDLKETWENDRDGIEQRADQWADAISGELE 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMML-----YHS 194
            + +     D      LR  A+   ++ D   GGFGS  PKFP+P  +Q++L     + S
Sbjct: 174 GTPTPADPSDVRSDELLRAGADAAVRTADREQGGFGSGGPKFPQPGRLQLLLRADARFGS 233

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           ++  D G   +  E + ++  +L  M  GG++DHVGGGFHRY+ D  W VPHFEKMLYD 
Sbjct: 234 ERSAD-GDGADPGEYRAVLTESLDAMVDGGLYDHVGGGFHRYATDRSWTVPHFEKMLYDN 292

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            ++    ++ + +T D  Y+ +  +  ++L R++  P G  +S  DA S   EG    +E
Sbjct: 293 AEIPRALIEGYRVTGDERYARVAGETFEFLDRELGHPEGGFYSTLDARS---EG----EE 345

Query: 315 GAFYVWTSKEVEDILGEHA--ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           G FYVWT +EV   +G+     L  + Y +   GN +            G+ VL      
Sbjct: 346 GKFYVWTPEEVRAAVGDETDVSLVLDRYGITEDGNFE-----------DGQTVLTIAASV 394

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
              A++ G+ ++   + L   R +LFD RS+R RP  D+K++  WNGL IS+ A  S  L
Sbjct: 395 DELAAQSGLEVDDVQDRLDRAREQLFDARSERTRPPRDEKILAGWNGLAISALAEGSLAL 454

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
           +                  + ++ A  A  F+R  L+DE +  L+  F +G  +  G+L+
Sbjct: 455 ED-----------------DILDRAVDALEFVRETLWDEDSGLLKRRFIDGDVRVEGYLE 497

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT--TGEDPS--VLL 548
           DYAFL  G LD Y+       L +A++L    +  F D + G  + T   G D    +L 
Sbjct: 498 DYAFLARGALDCYQASGDPDQLAFALDLAEEIESRFFDEDAGTLYFTEEAGSDAGTDLLA 557

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIV 576
           R +E  D + PS   V+V  LV L   V
Sbjct: 558 RPQELTDRSTPSSAGVAVDVLVTLDEFV 585


>gi|374293368|ref|YP_005040403.1| hypothetical protein AZOLI_3026 [Azospirillum lipoferum 4B]
 gi|357425307|emb|CBS88194.1| conserved protein of unknown function; putative Thioredoxin and
           glycosidase domains [Azospirillum lipoferum 4B]
          Length = 683

 Score =  348 bits (892), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 234/697 (33%), Positives = 344/697 (49%), Gaps = 75/697 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE+  +A L+N+ FV+IKVDREERPD+D +Y + +  L   GGWPL++F
Sbjct: 56  CHWCHVMAHESFENPEIAGLMNELFVNIKVDREERPDLDTIYQSALALLGQQGGWPLTMF 115

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD +P  GGTYFPP  +YGR GF  +LR +   +  ++D + ++    ++ L  ALS 
Sbjct: 116 LTPDAEPFWGGTYFPPAPRYGRAGFPDVLRGIAGTYANEQDKVGKN----VDALKSALS- 170

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
               N+    +    L   A++L +  D   GG G+APKFP+ V +  +L+  +  + TG
Sbjct: 171 GMGENRSAGAVDAGVLDQVAQRLLREVDPIHGGIGTAPKFPQ-VPLFELLW--RAWQRTG 227

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
           +       ++ V  TL  MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD  +L ++ 
Sbjct: 228 R----EPFREAVTHTLANMAQGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNAELLDLM 283

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
              +  T+D       R+ + +L R+MI  GG   +  DADS   EG    +EG FY+W 
Sbjct: 284 TLVWQETRDPLLETRIRETVGWLLREMIADGGGFAATLDADS---EG----EEGLFYIWN 336

Query: 322 SKEVEDIL-----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
            +EV+ +L      +    FK  Y + P GN +   +    N   G    + L D +  A
Sbjct: 337 EEEVDRLLTPALGADGLATFKHVYEVLPQGNWEGVTIL---NRLGG----LSLADDATEA 389

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           +            L + R  L   R+KR RP  DDKV+  WNGL+I++   A+       
Sbjct: 390 T------------LAKGREILLRARAKRVRPGWDDKVLADWNGLMIAALTHAALA----- 432

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       D  E+++ A  A +F+R  +  ++  RL HS+R+G  K  G LDDYA 
Sbjct: 433 -----------LDEPEWLDAAGRAFAFVRDRM--DKNGRLCHSWRHGQGKHTGMLDDYAH 479

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +    L L+E       L  A     T D  F D   GGYF T  +   +++R K   D 
Sbjct: 480 MARAALALHEATGDPAALDQAKLWVATLDAHFWDGANGGYFFTADDAEGLIVRTKTAFDN 539

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGN      L  LA++   +  D YR+ A+   A F   L      +     + +++
Sbjct: 540 ATPSGNGTM---LAVLATLFQRTGEDAYRERADALAAAFSGELTRNFFPLTTFLNSVELM 596

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           + P +  +V+VG   + + E +          N+ +  + P      D    H +    M
Sbjct: 597 TAPLQ--IVVVGPPKAAETEALRRTVLDHSLPNRILTVLAPG----ADLPANHPAQGKGM 650

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
                      A VC+  +CS PVT P  L  LL  K
Sbjct: 651 RDG-----AATAYVCRGMTCSAPVTAPADLAALLSTK 682


>gi|163786447|ref|ZP_02180895.1| hypothetical protein FBALC1_14717 [Flavobacteriales bacterium
           ALC-1]
 gi|159878307|gb|EDP72363.1| hypothetical protein FBALC1_14717 [Flavobacteriales bacterium
           ALC-1]
          Length = 705

 Score =  348 bits (892), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 224/689 (32%), Positives = 353/689 (51%), Gaps = 84/689 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE++ VA+L+N+ F+SIKVDREERPDVD++YM+ VQ + G GGWPL+  
Sbjct: 81  CHWCHVMEEESFENDSVARLMNENFISIKVDREERPDVDQIYMSAVQLMTGSGGWPLNCI 140

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTYF       +P +  IL  +   +    + +    A+A E+L+E +  
Sbjct: 141 TLPDGRPVFGGTYFT------KPQWTKILEDMSSLYKTNPEKVI---AYA-EKLTEGVKN 190

Query: 142 SASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +   N   + +  N L++    ++L KS D + GG  +APKFP P  +  +L +S + +D
Sbjct: 191 ADLINVNKEGIQFNKLQIESTVDELKKSLDFKLGGQKNAPKFPMPSNLDFLLRYSFQNDD 250

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   + Q+ V+ +L  MA GGI+D +GGGF RYSVD+RWH+PHFEKMLYD  QL +
Sbjct: 251 -------KDLQQFVMTSLNKMANGGIYDQIGGGFSRYSVDDRWHIPHFEKMLYDNAQLVS 303

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  A+  TK+  +  I  + L+++ R++    G  +S+ DADS   EG    +EG FY 
Sbjct: 304 LYSKAYQFTKNEDFKTIVTETLNFIDRELTQEEGAFYSSLDADSKTKEGEL--EEGVFYT 361

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRM----SDPHNEF-KGKNVLIELNDSSA 374
           WT  +++  LGE   LFK +Y +  TG  +  +     +   NEF K  N+ I+      
Sbjct: 362 WTKDDLKTELGEDFDLFKSYYNINATGKWEKDQFILYKTKTDNEFIKTNNITIK------ 415

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
                    E +  +L   ++KL++VR+KR RP LDDK + SWN L++ ++  A ++   
Sbjct: 416 ---------ELHSKVLA-WKKKLYEVRAKRERPRLDDKALTSWNALMLKAYVDAYRVF-- 463

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                         +++ Y++ A   A FI+ +   +    L H+++N  S   GF +DY
Sbjct: 464 --------------NKQSYLDKAIDNAKFIKENQI-QNNGSLFHNYKNKKSTIEGFSEDY 508

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A  I+  ++LY+     +WL  A EL +     F ++E   ++ T+  + +++ R  E  
Sbjct: 509 AHTITAYIELYQATFNEQWLNTAKELMDYAIAHFSNKETSMFYFTSDNETNLITRKTEVF 568

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D   PS NSV    L +L          YY   A   LA      K M     L     D
Sbjct: 569 DNVIPSSNSVLADCLFKLGH--------YYSNKAYTDLA------KQM-----LSNVYDD 609

Query: 615 MLSVPS--RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
           +   PS     + L  + ++  +E  ++ + A   L +  +   P     +     + S+
Sbjct: 610 IEKAPSAYTNWLKLYLNYANPYYEVAISGSEADSKLKELNMFYLP----NILISGSNKSS 665

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVT 701
           N  + +N F  D+    VC N +C  PVT
Sbjct: 666 NLPLLKNKFIEDETFIYVCVNGTCKLPVT 694


>gi|168702337|ref|ZP_02734614.1| hypothetical protein GobsU_22617 [Gemmata obscuriglobus UQM 2246]
          Length = 793

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 234/633 (36%), Positives = 322/633 (50%), Gaps = 66/633 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K ++  FL    + CHWCHVME ESF    VAK+LN  FV IKVDREERPD
Sbjct: 64  GPEAFERAKKEKKLIFLSIGYSACHWCHVMERESFSRADVAKILNANFVCIKVDREERPD 123

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPED-KYGR---PGFKTILRKVK 114
           VD +YMT +      GGWPL++FL+PD KP+ G TYFPP+D K G    PGFKT+L KV 
Sbjct: 124 VDDIYMTALNTTGEQGGWPLNMFLTPDGKPIFGATYFPPDDRKIGDDTVPGFKTVLNKVM 183

Query: 115 DAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGG 174
           + +DK R  L +      +   EAL A++ +  L   +P     +     +   D   GG
Sbjct: 184 E-FDKDRADLEKQADRVAKATVEALDANSRAIAL---VPLKRDLVSDGLDAFDIDPEHGG 239

Query: 175 FGS------APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
            GS        KFPRP     +L  +KK    G    A    K+   TL  + +GGI+DH
Sbjct: 240 TGSKKRDYKGTKFPRPPVWGFVLTQTKK---PGNERLA----KLTHNTLAKILEGGIYDH 292

Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDM 288
           +GGGFHRYS +  W VPHFEKMLYD  QL  +Y +A++L     Y  +  + L+++RR+M
Sbjct: 293 LGGGFHRYSTERTWTVPHFEKMLYDNAQLVELYSEAYALAPRPEYKRVVAETLEFVRREM 352

Query: 289 IGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNC 348
             P    +SA DADS +       KEG FYVWT+ EV  +LG  A    +   +K     
Sbjct: 353 TAPEKGFYSALDADSND-------KEGEFYVWTADEVAKVLGTDA----DTAIVKAVYGV 401

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
                 D  +  +    L E+      A +L +  +  L  L   ++KLFD R+KR RP 
Sbjct: 402 TAPNFEDKFHILRLPKPLAEI------AKELKLTEDALLTKLEPLKKKLFDHRAKRERPF 455

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
           LD KVI +WNG +I+ +ARA  + K  A                Y+  A  AA F+   L
Sbjct: 456 LDTKVITAWNGQMIAGYARAGGVFKEPA----------------YVRAAADAADFLLTKL 499

Query: 469 YDEQTHRLQHSFRNGPSKAP-----GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
            D+   RL   +   P   P      FLDDYA+LI GLL+L++     KWL  A  L + 
Sbjct: 500 RDKD-GRLYRMYAAAPGGKPAPKGAAFLDDYAYLIHGLLNLHDATGEPKWLDAAKGLTDL 558

Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
             + + D   GG++ T  +   +  R K+ +DG +PSGNS    NL+RL +    +K + 
Sbjct: 559 AVKHYADPVNGGFYFTAADGEKLFARAKDSYDGVQPSGNSQMARNLLRLGT---KTKDEG 615

Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           YR     ++  F   L+    ++PLM    D L
Sbjct: 616 YRDRGIRTVKAFSFALRTAPTSMPLMLRTLDEL 648


>gi|448410530|ref|ZP_21575235.1| hypothetical protein C475_12927 [Halosimplex carlsbadense 2-9-1]
 gi|445671566|gb|ELZ24153.1| hypothetical protein C475_12927 [Halosimplex carlsbadense 2-9-1]
          Length = 719

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 225/697 (32%), Positives = 335/697 (48%), Gaps = 60/697 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESF DE +A+LLN+ FV IKVDREERPD+D +YM+  Q + G GGWPL+ +
Sbjct: 57  CHWCHVMEEESFADEDIAELLNENFVPIKVDREERPDIDSIYMSICQQVSGRGGWPLNAW 116

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P   GTYFPPE K G PGF+ +L  + ++W    D           Q ++A++ 
Sbjct: 117 LTPDGDPFYVGTYFPPEPKRGAPGFRQLLDDISESWADSEDRAEMED--RARQWTDAIAN 174

Query: 142 S-ASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
              ++   P + P ++ L   A    +  D  FGG+G   KFP+P  +++++        
Sbjct: 175 DLETTPDQPGDAPGEDVLDTTASAALRGADREFGGWGKGQKFPQPGRLRVLMR------- 227

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             +SG     +++V  TL  M  GG++DHVGGGFHRY+ D  W VPHFEKMLYD  +LA 
Sbjct: 228 AHRSGGRDAYREVVGETLDAMGDGGLYDHVGGGFHRYTTDREWVVPHFEKMLYDNAELAR 287

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           V+L  +  T    Y    R+ L+++ R++  P G  +S  DA+S        ++EGAFY 
Sbjct: 288 VFLTGYQFTGRERYRETARETLEFVERELTHPDGGFYSTLDAESEGE--EGEREEGAFYA 345

Query: 320 WTSKEVEDILGEH--------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
           WT   V+D + E+              A +F+E Y +  TGN +            G+ V
Sbjct: 346 WTPDGVDDAVAEYGPEHGVPGEQASLAAEIFRERYGVTATGNFE-----------GGETV 394

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
           L       + A   G+ L    ++L      +F  R +RPRP  D+KV+  WNGL++S+F
Sbjct: 395 LTRSASVESLADDYGLSLGDAEDLLDAATTAVFAAREERPRPPRDEKVLAGWNGLMVSAF 454

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
           A A+ +                 D + +   A  A  F R HL+D  + RL   F++G  
Sbjct: 455 AEAAVV-----------------DDESWAGTATEALDFARDHLWDADSGRLSRRFKDGDV 497

Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
              G+L+DYAFL  G  D Y+     + L +A+EL  T +  F D E    + T     S
Sbjct: 498 DIRGYLEDYAFLARGAFDTYQATGEVEHLAFALELARTIETEFWDAEEETLYFTPQSGES 557

Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
           ++ R +E  D + PS   V+   L+ L   V     D +   A   LA    R++     
Sbjct: 558 LVARPQELADQSTPSSAGVAAELLLALDHFV---DHDRFETVASGVLATHGGRVESNPQQ 614

Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
            P +  AAD     + + + L        +   LA  +    L       D A    +D 
Sbjct: 615 HPSLALAADAYRSGAHE-LTLAADPLPESWRETLAETYIPRRLLAPRPPTDDALAAWLDA 673

Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
            E  ++     +R     +  V   C++ +CSPP  D
Sbjct: 674 LELADAPPIWASREARDGEPTV-YACRSRTCSPPTQD 709


>gi|448608928|ref|ZP_21660207.1| hypothetical protein C440_00355 [Haloferax mucosum ATCC BAA-1512]
 gi|445747305|gb|ELZ98761.1| hypothetical protein C440_00355 [Haloferax mucosum ATCC BAA-1512]
          Length = 702

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 235/707 (33%), Positives = 345/707 (48%), Gaps = 95/707 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A++LN+ F+ +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPEIAEVLNEHFIPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
           V+L+P  KP   GTYFPPE + G PGF+ ++    + W   RD +   A+    AI ++L
Sbjct: 113 VWLTPQGKPFFVGTYFPPEPRRGAPGFRDLVESFAETWQTDRDEIENRAEQWTHAITDRL 172

Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
            E       A  +++ D+  Q ALR                    PKFP+P  I  +L  
Sbjct: 173 EETPDTPGEAPGSEILDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDAIL-- 222

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
            +    TG+     E   + +  L  MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYD
Sbjct: 223 -RGYAITGR----REALDVAVEALDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYD 277

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q  LA  YLDA+ LT +  Y+ + R+  +++RR++    G  F+  DA S         +
Sbjct: 278 QAGLAARYLDAYRLTGNESYAAVARETFEFVRRELSHDDGGFFATLDAQS-------DGE 330

Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT + V   L E  A LF + Y + P GN            F+ K  ++ ++ +
Sbjct: 331 EGTFYVWTPEAVRSHLPELEADLFCDRYGVTPGGN------------FENKTTVLNVSAT 378

Query: 373 -SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
            S  A++  +  ++  + L E ++ LF  R+ R RP  D+KV+  WNGL+IS+FA+ +  
Sbjct: 379 LSDLAAEYDLSEDEVEDHLEEAKKTLFAARADRERPARDEKVLAGWNGLMISAFAQGAVA 438

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L+ ++ +A                 A  A  F+R HL+DE +  L     NG  K  G+L
Sbjct: 439 LEDDSLAAD----------------ARRALDFVREHLWDEASETLSRRVMNGEVKGDGYL 482

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYAFL  G  DLY+     + L +AI+L    +  F D   G  + T     +++ R +
Sbjct: 483 EDYAFLARGAFDLYQATGDLEPLSFAIDLARATNREFYDAAAGTLYFTPESGEALVTRPQ 542

Query: 552 EDHDGAEPSGNSVSVINLVRL------------ASIVAGSKSDYYRQNA-EHSLAVFETR 598
           E  D + PS   V+    + L            A  V  S ++  R +  EH   V  T 
Sbjct: 543 EATDQSTPSSLGVATSLFLDLEHFAPDAGFGEAADAVLESYANRIRGSPLEHVSLVLAT- 601

Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
            +  A  VP +  AAD +    R+ +                   AS  L   V+   PA
Sbjct: 602 -EKAASGVPELTAAADEMPDEWRETL-------------------ASRYLPGLVVSRRPA 641

Query: 659 DTEEMDFW-EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
             +E+D W +E   + A    A    +  K     C++F+CS P  D
Sbjct: 642 TDDELDVWLDELELDEAPPIWAAREATDGKPTVYACESFTCSAPTHD 688


>gi|431930442|ref|YP_007243488.1| thioredoxin domain-containing protein [Thioflavicoccus mobilis
           8321]
 gi|431828745|gb|AGA89858.1| thioredoxin domain protein [Thioflavicoccus mobilis 8321]
          Length = 683

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 230/685 (33%), Positives = 343/685 (50%), Gaps = 63/685 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESFED   A L+N  FV+IKVDREERPD+D++Y T  Q L    GGWPL
Sbjct: 54  SACHWCHVMAHESFEDPATAALMNRLFVNIKVDREERPDLDRIYQTAHQLLSSRAGGWPL 113

Query: 79  SVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           +VFL+P+ L+P   GTYFP E ++G P F+ +L  V+ A+ ++R+ + +     +  L+E
Sbjct: 114 TVFLTPETLEPFFCGTYFPREPRHGLPAFRQLLEGVERAFREQREAIREQSQGLMAALAE 173

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
               +  +  +PD  P    R    QL+ S+D+  GGFG APKFPR  +++++L H    
Sbjct: 174 L---APRAGAIPDSAPLEGAR---RQLAASFDAARGGFGGAPKFPRVPDLELLLRHWAAT 227

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +  G+    +    MV FTL+ M  GGI+D VGGGF+RYSVD+ W +PHFEKMLYD  QL
Sbjct: 228 DAAGQPD--ARALAMVTFTLERMIAGGINDQVGGGFYRYSVDDAWMIPHFEKMLYDNAQL 285

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +  DA+  T +  +        D++  +M    G  +SA DADS   EG    +EG +
Sbjct: 286 LALCCDAWQATSEPVFRAAAEATADWVIGEMQSDEGGYYSALDADS---EG----QEGRY 338

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT +E+E  L           Y           +  P N F+G+  L      +  A 
Sbjct: 339 YVWTREELEGTLAPEEFAAFAARY----------GLDGPAN-FEGRWHLHAQAMPAEVAG 387

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +LG+ + +   ++   RRKL +VR  R RP  D+KV+ +WN L+I   ARA+++L     
Sbjct: 388 RLGLTVAQVEGLIDGARRKLLEVRRARVRPACDEKVLTAWNALMIKGMARAARVLA---- 443

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                       R +Y+  AE A   +R  L+  +  RL  S+ +G +  P +LDD+A L
Sbjct: 444 ------------RPDYLASAERALGLVRSTLW--RDGRLLASYMDGTAHLPAYLDDHAML 489

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I  LL+L +       L +AIEL       F D   GG+F T  +  +++ R K   D +
Sbjct: 490 IDALLELLQVRWRRDDLRFAIELAEILLARFEDSGEGGFFFTASDHETLIHRPKPLADES 549

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            P+GN+V+     RL  ++   +   Y + A   LAV    ++    A   +  A D   
Sbjct: 550 LPAGNAVAARVFQRLGHLLGEPR---YLEAAARVLAVAGGDMRRAPYAHASLLMALDEHL 606

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
            P    VV        +    LA    +Y   ++ + I PAD +++        N ASM 
Sbjct: 607 EPGETVVV---RAPPTELPPWLAELQQTYRPRRSALGI-PADEQDL------PGNLASMG 656

Query: 678 RNNFSADKVVALVCQNFSCSPPVTD 702
                     A +C+   C  P+ +
Sbjct: 657 ----PGPGARAYLCRGTHCEAPIEE 677


>gi|372487318|ref|YP_005026883.1| thioredoxin domain-containing protein [Dechlorosoma suillum PS]
 gi|359353871|gb|AEV25042.1| thioredoxin domain-containing protein [Dechlorosoma suillum PS]
          Length = 682

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 242/704 (34%), Positives = 352/704 (50%), Gaps = 82/704 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  E F D  VA  +N  F++IKVDREERPD+D+VY T  Q L G  GGWPL
Sbjct: 48  SACHWCHVMAHECFADATVAAEMNRLFINIKVDREERPDLDQVYQTAHQMLVGRPGGWPL 107

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           ++FL+PD  P  GGTYFP E ++G P F  +L  V  A+ +K+  +A+ G    E     
Sbjct: 108 TMFLTPDAMPFFGGTYFPREPRHGLPAFVEVLHSVARAFTEKQSEIAEQGRTMREAFGST 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L  +     L +  P   L     +L  +YD R GGFG APKFPRP  +  +L       
Sbjct: 168 LPRAVRGEPLFNADP---LAQAVAELDTNYDRRRGGFGGAPKFPRPAALDFLLRRHAATG 224

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D    G       M L TL+ MA+GGIHDH+GGGF+RYSVD +W +PHFEKMLYD  QL 
Sbjct: 225 DPHARG-------MALTTLERMAEGGIHDHLGGGFYRYSVDAQWSIPHFEKMLYDNAQLL 277

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
           ++Y +A++L++   +      I+ +L+ +M  PGG   +A DADS   EG    +EG FY
Sbjct: 278 HLYAEAWALSRKQVFRQAAEGIVAWLQHEMALPGGAFAAALDADS---EG----EEGRFY 330

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSR----MSDPHNEFKGKNVLIELNDSSA 374
           +WT++EV      HA+L        P    D++     +  P N    +  L ++     
Sbjct: 331 LWTAREV------HALL--------PPQQWDVASIHWGLDGPPNFEDAEWHLRQVQPLEQ 376

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A +L +   +    L   R  L   R++R RP  DDKV+   N L I   ARA++    
Sbjct: 377 VAERLRLTPGEARQQLEGARHTLLAARNERIRPGRDDKVLTGCNALAIKGLARAARAF-- 434

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                          R E++ +A  AA F++R L+ +   RL  ++++G ++ P +LDD+
Sbjct: 435 --------------GRPEWLGLACGAADFLQRELWRDG--RLLAAWKDGRARLPAYLDDH 478

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           AFL+  +L+L + G        A+ L +   + F DRE GG+F T  +  +++ R K   
Sbjct: 479 AFLLEAMLELLQAGWRDADYRCAVALADALLQHFEDREEGGFFFTAHDHETLIYRTKPVE 538

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-LMCCAA 613
           D A PSGN V+   L RLA +   S    Y   A  +LA+F   L+    A P L+    
Sbjct: 539 DHATPSGNGVAAFALGRLALL---SGEPRYAAAARRALALFLPDLRQHPGAHPGLLNVLG 595

Query: 614 DMLSVPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
           D LS P+    VL G  + +  +++ +    A +     ++ + P   +E          
Sbjct: 596 DELSPPAL--AVLQGPAAELARWQDEIGRLPAPW-----LLAVAPTGGDER--------- 639

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL--LEKP 714
                      ++V A VC   +C PP+     LE LL  L KP
Sbjct: 640 --PPPLRKPETERVNAWVCAGVTCLPPID---GLEALLGMLAKP 678


>gi|359690220|ref|ZP_09260221.1| hypothetical protein LlicsVM_17604 [Leptospira licerasiae serovar
           Varillal str. MMD0835]
 gi|418751442|ref|ZP_13307728.1| PF03190 family protein [Leptospira licerasiae str. MMD4847]
 gi|418758573|ref|ZP_13314755.1| PF03190 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
 gi|384114475|gb|EIE00738.1| PF03190 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
 gi|404274045|gb|EJZ41365.1| PF03190 family protein [Leptospira licerasiae str. MMD4847]
          Length = 695

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 248/704 (35%), Positives = 349/704 (49%), Gaps = 76/704 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE  A++LN  +VSIKVDREERPDVD++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFEDETTAEVLNRDYVSIKVDREERPDVDRIYMDALHAMGQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE--- 137
           FL+P+ KP+ GGTYFPP  KYGR  F  +L  +   W  K++ L ++     + L E   
Sbjct: 115 FLTPEGKPITGGTYFPPVPKYGRKSFTEVLGILTGLWKDKKEELLEASEDLTKHLKESEE 174

Query: 138 --ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF--GSAPKFPRPVEIQMML-Y 192
             AL+ +A  +    E+ +N   L      + YD  + GF   S  KFP  + +  +L Y
Sbjct: 175 TRALAGTADISSPGSEVFENGFLL----YDRLYDPEYAGFKSNSVNKFPPSMGLSFLLRY 230

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
           H        KS    +  +MV  TL  M KGGI+D +GGG  RYS D  W VPHFEKMLY
Sbjct: 231 H--------KSTGEPKALEMVEETLTAMKKGGIYDQIGGGLCRYSTDHHWLVPHFEKMLY 282

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D        ++ +    +  Y     D+++YL RDM  PGG I SAEDADS   EG    
Sbjct: 283 DNSLFLEALVECYQAVGEEKYKDYAYDVIEYLHRDMRLPGGGIASAEDADS---EG---- 335

Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           +EG FY+WT +EV ++ G+ + L  E + +   GN            F+ KN+L E    
Sbjct: 336 EEGLFYLWTKEEVREVCGQDSSLLDEFWNITEKGN------------FEEKNILHE--SF 381

Query: 373 SASASKL-GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
             + S+L G+   +   I+   R+KL + RS R RP  DDK++ SWN L I +  +A+  
Sbjct: 382 RMNFSRLHGLEPSELEEIVSRNRKKLLEKRSTRIRPLRDDKILFSWNCLYIKALTKAAMA 441

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
                               + +  AE    F+ ++L  E   RL   FR G +K   + 
Sbjct: 442 FGD----------------GDLLREAEETYKFLEKNLIREDG-RLLRRFREGEAKILAYS 484

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
            DYA  +   L L++ G G ++L  +I  + T++ + L R   G F  +G D   LLR  
Sbjct: 485 TDYAEFVLASLYLFQAGKGFRYLENSI--RYTEEAIRLFRSPAGVFFDSGIDGEALLRRT 542

Query: 552 ED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
            D +DG EPS NS      V L S +    S+ Y Q A+   + F+  L+   M+ P M 
Sbjct: 543 VDGYDGVEPSANSSFATAFV-LLSKLGVVDSEKYLQYADSIFSYFKPELEAYPMSYPYML 601

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLA--AAHASYDLNKTVIHIDPADTEEMDFWEE 668
            A  +   P R+  V+   +     E +L       S  L +TV+ +   D E      E
Sbjct: 602 SALWLRKSPGRELAVVYSSQ-----EELLPFWKGVGSLFLPETVL-VWANDKE-----AE 650

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
            N     + +N  S   V A VC  F C  PV+D  SL   L+E
Sbjct: 651 ENGEKFLLLKNRNSGGGVKAYVCVGFHCELPVSDWPSLRARLVE 694


>gi|209966075|ref|YP_002298990.1| hypothetical protein RC1_2806 [Rhodospirillum centenum SW]
 gi|209959541|gb|ACJ00178.1| conserved hypothetical protein [Rhodospirillum centenum SW]
          Length = 688

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 233/699 (33%), Positives = 345/699 (49%), Gaps = 80/699 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED  +A ++ND FV++KVDREERPDVD++Y + +  L   GGWPL++F
Sbjct: 53  CHWCHVMAHESFEDPTIAAMMNDLFVNVKVDREERPDVDQIYQSALGLLGQQGGWPLTMF 112

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF---AIEQLSEA 138
           L+P+ +P  GGTYFPPE ++GRPGF  +L  V   + ++ D + ++      A+ +L++ 
Sbjct: 113 LTPEGEPFWGGTYFPPERRWGRPGFPDVLLGVSTTYRQEPDKVVRNTTALKDALHRLAQN 172

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
              +     L DE+        A +L +  D   GG GSAPKFP+   ++++    K+  
Sbjct: 173 RPGAGVDVDLLDEV--------AARLVQEVDPVHGGIGSAPKFPQTGIVELLWRAWKR-- 222

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+     + +  V+ TL  M++GGI+DH+GGG+ RYS D+ W VPHFEKMLYD  QL 
Sbjct: 223 -TGR----EDCRAAVVTTLTQMSQGGIYDHLGGGYARYSTDQEWLVPHFEKMLYDNAQLI 277

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIG----PGGEIFSAE-DADSAETEGATRKK 313
           ++    +  T+D  +    R+ + ++ R+M+     P G  F+A  DADS   EG    +
Sbjct: 278 DLLTTVWQDTRDPLFEARVRETVGWVLREMVSEPGRPVGGGFAATLDADS---EG----E 330

Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           EG FYVWT  EV+ +LG+ A  F   Y +   GN            ++G  +L  L    
Sbjct: 331 EGRFYVWTWAEVDRLLGDRAETFARAYDVTERGN------------WEGTTILNRLKRPE 378

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
                 G P E+    L E R  LF  R  R RP  DDKV+  WNGL+I++ ARA  +  
Sbjct: 379 P-----GTPAEE--GALAEMRAVLFQARGARVRPGWDDKVLADWNGLMIAALARAGAVF- 430

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                          D  +++  A  A  F+R H+ D    RL HS+R G  +  G LDD
Sbjct: 431 ---------------DEPDWIAAARRAYDFVRTHMQDAD-GRLWHSWRAGTLRHRGTLDD 474

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
            A +    L L+E       +  A       D  F D E GGYF T  +   +++R +  
Sbjct: 475 QAAMARAALALFEVTGDGTCVEQARRWAAVADAQFWDTESGGYFLTAADATDLIVRPRNA 534

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D A PSGN   +  L RL  I   +  + +R+ A+  +  F    +      PL     
Sbjct: 535 QDNAVPSGNGTMLGVLARLWLI---TGEEGWRRRADALVTAFGG--EPGRNFFPLATFLN 589

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
           ++  +     VV+ G  ++ D   +L A H +      +  + P         + H +  
Sbjct: 590 NVELLHRAVQVVVAGDPAAADTGALLRAVHGAGLPTLVLTPVTPGTALP----DGHPAAG 645

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
             M        +  A VC+  +CS PVTDP +L  LL E
Sbjct: 646 KGMV-----GGRAAAYVCRAMACSLPVTDPAALAALLRE 679


>gi|222479721|ref|YP_002565958.1| hypothetical protein Hlac_1296 [Halorubrum lacusprofundi ATCC
           49239]
 gi|222452623|gb|ACM56888.1| protein of unknown function DUF255 [Halorubrum lacusprofundi ATCC
           49239]
          Length = 744

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 240/724 (33%), Positives = 343/724 (47%), Gaps = 88/724 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE +A +LN+ FV +KVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESIAAVLNEKFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
            + +P  KP   GTYFPPE +  +PGF+ +  ++ D+W          ++ D    S   
Sbjct: 113 AWCTPKGKPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRADQWTTSARD 172

Query: 131 AIEQLSEALSAS-ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQ 188
            +E + E  +A  AS          + L   A    + YD  +GGFGS   KFP P  I 
Sbjct: 173 ELESVPEPDAAGDASGTGGAGPPGPDLLDEAAAAAIRGYDDEYGGFGSGGAKFPMPGRID 232

Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
           ++L    +       G+A+        TL  MA+GG++D +GGGFHRY+VD +W VPHFE
Sbjct: 233 VLLRAYAR-----SGGDAA--LTAATGTLDGMARGGMYDQIGGGFHRYAVDRQWTVPHFE 285

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYD  +L   YLD + LT D  Y+ +  + L +L R++    G  FS  DA S   E 
Sbjct: 286 KMLYDNAELPMAYLDGYRLTGDASYARVASETLGFLDRELRHDDGGFFSTLDARSRPPEN 345

Query: 309 --------------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRM 353
                              EGAFYVWT  EV+ +L E A  L K+ Y ++  GN +    
Sbjct: 346 RRGNAGSDESDDADDVADVEGAFYVWTPAEVDAVLDEPAASLAKDRYGIRSGGNFE---- 401

Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
                  +G  V       +  A +  M  E     L   R  LF+ R  RPRP  D+KV
Sbjct: 402 -------RGTTVPTIAASIAELADEHDMSTEAVREALTAARVALFEARESRPRPARDEKV 454

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
           + SWNG  IS+FA A ++L                  + Y ++A  A SF R  LYDE+T
Sbjct: 455 LASWNGRAISAFATAGQVLG-----------------EPYADIASDALSFCRERLYDEET 497

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
             L   + +G  + PG+LDD+AFL  G LD+Y      + L +A++L  T    F D   
Sbjct: 498 ETLARRWLDGDVRGPGYLDDHAFLARGALDVYSVTGDPEALGFALDLAATVVSDFYDEAD 557

Query: 534 GGYFNTT--------GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
           G  + T         G D ++  R +E  D + PS   V+   L    +++ G ++D  R
Sbjct: 558 GTIYFTRDPDGNAGHGGDDTLFARPQEFTDQSTPSSLGVAAETL----ALLDGFRTD--R 611

Query: 586 QNAEHSLAVFETRLKDMAMAVPL----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 641
           + AE +  V  T   D   A PL    +  AAD ++    +  + V        E +   
Sbjct: 612 EFAEVAETVVTTH-ADRIRASPLEHVSLVRAADRVASGGIEVTIAVDAVPDAWRETL--- 667

Query: 642 AHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSP 698
                 L   ++   P   + +  W +    + +    A  +    +  A VC+  +CSP
Sbjct: 668 --GERYLPGALVAPRPPTEDGLAAWLDRLDMDEAPPIWADRDAVDGEPTAYVCEGRTCSP 725

Query: 699 PVTD 702
           P TD
Sbjct: 726 PETD 729


>gi|448591505|ref|ZP_21650993.1| hypothetical protein C453_10720 [Haloferax elongans ATCC BAA-1513]
 gi|445733479|gb|ELZ85048.1| hypothetical protein C453_10720 [Haloferax elongans ATCC BAA-1513]
          Length = 702

 Score =  346 bits (888), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 228/696 (32%), Positives = 339/696 (48%), Gaps = 73/696 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A+ LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAETLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
           V+L+P  KP   GTYFPPE + G PGF+ ++    ++W   RD +   AQ    AI +QL
Sbjct: 113 VWLTPQGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWQTDRDEIENRAQQWTSAIHDQL 172

Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
            +       A  +++ D+  Q ALR                    PKFP+P  I  +L  
Sbjct: 173 EDTPDTPGEAPGSEILDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDSLL-- 222

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
            +    TG+     E   + + +L  MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYD
Sbjct: 223 -RGYAITGR----REALDVAVESLDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYD 277

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q  L   YLD + LT    Y+ +  +  +++RR++    G  F+  DA S         +
Sbjct: 278 QAGLVPRYLDTYRLTGTEAYADVAVETFEFVRRELSHDDGGFFATLDAQSG-------GE 330

Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT  EV  +L E  A LF + Y + P GN            F+ K  ++ ++ +
Sbjct: 331 EGTFYVWTPDEVRSLLPELEADLFCDRYGITPGGN------------FENKTTVLNVSAT 378

Query: 373 -SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
            S  A +  +  ++  + L E R+ LF  RS R RP  D+K+I  WNGL+IS+FA+ +  
Sbjct: 379 VSDLAEEYDLSEDEVEDKLAEARKALFAARSGRERPARDEKIIAGWNGLMISAFAQGAVA 438

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L+ ++                  + A  A  FIR HL+D     L     NG  K  G+L
Sbjct: 439 LEDDS----------------LADDARRALDFIREHLWDADAEHLSRRVMNGEVKGDGYL 482

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYAFL  G  DLY+     + L +A++L       F D   G  + T     +++ R +
Sbjct: 483 EDYAFLARGAFDLYQATGDVEPLAFALDLGRAIHREFYDDAAGTLYFTPESGEALVTRPQ 542

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           E  D + PS   V+    + L      +    + + A+  L     R++   +    +  
Sbjct: 543 EATDQSTPSSLGVATSLFLDLEHFAPDAG---FGEAADAVLETHANRIRGSPLEHVSLAL 599

Query: 612 AADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EE 668
           AA+  +  VP    + +   +   ++   LA+ +    L   V+   PA  +E+D W +E
Sbjct: 600 AAEKAASGVP---ELTIAADEIPAEWRETLASRY----LPGLVVAPRPATDDELDAWLDE 652

Query: 669 HNSNNASMARNNFSAD--KVVALVCQNFSCSPPVTD 702
              + A        AD  +     C+NF+CS P  D
Sbjct: 653 LELDEAPPIWAAREADGGEPTVYACENFTCSAPTHD 688


>gi|418753914|ref|ZP_13310150.1| PF03190 family protein [Leptospira santarosai str. MOR084]
 gi|409965755|gb|EKO33616.1| PF03190 family protein [Leptospira santarosai str. MOR084]
          Length = 630

 Score =  346 bits (888), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 241/682 (35%), Positives = 346/682 (50%), Gaps = 70/682 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+  VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL+VFL+PD K
Sbjct: 1   MERESFENPTVADYLNSHFVSIKVDREERPDIDRIYMDALHAMNQQGGWPLNVFLTPDGK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P+ GGTYFPPE  YGR  F  +L  ++  W++KR  L      A  +LS+ L  S     
Sbjct: 61  PITGGTYFPPEPGYGRKSFLEVLNILRKIWNEKRQEL----VVASSELSQYLKDSGEGRA 116

Query: 148 LPDE---LPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDT 200
           +  +   LP       A  L +S YDS FGGF +    KFP  + +  +L YH       
Sbjct: 117 VEKQEGNLPSENCFDSAFSLYESYYDSEFGGFKTNHVNKFPPSMGLSFLLRYH------- 169

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            +S    +  +M   TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD       
Sbjct: 170 -RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCRYSTDPRWTVPHFEKMLYDNSLFLET 228

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
            ++  S++K +       D++ YL RDM    G I SAEDADS   EG    +EG FYVW
Sbjct: 229 LVECSSVSKKISAKSFALDVISYLHRDMRNEDGGICSAEDADS---EG----EEGLFYVW 281

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
             +E  ++ GE + + ++ + +   GN            F+GKN+L E +  S +A    
Sbjct: 282 DLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILRE-SYPSGAAKFSE 328

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
               +  ++L   R KL + RSKR RP  DDK++ SWNGL   +  +A            
Sbjct: 329 EEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILTSWNGLYTKALTKAG----------- 377

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                V   +++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +I+ 
Sbjct: 378 -----VAFQKEDFLKLAEETYSFIERNLID-SNGRILRRFRDGESGILGYSNDYAEMIAS 431

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 559
            + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EP
Sbjct: 432 SIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEP 489

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           S NS  V +LV+L+  + G  S  YR+ AE   + F   L   ++  P +  A       
Sbjct: 490 SANSSLVYSLVKLS--LFGVDSARYRKFAESIFSYFTKELSSYSLGYPHLLSAYWTYRFH 547

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           S K +VL+  K +   +++LA     +  +  +  ++  + EE           +++  +
Sbjct: 548 S-KEIVLI-RKDADSGKDLLAEIQTKFLPDSVLAVVNEDELEEA-------RKLSALFDS 598

Query: 680 NFSADKVVALVCQNFSCSPPVT 701
             S    +  VC+NFSC  P+ 
Sbjct: 599 RDSGGNALVYVCENFSCKLPIA 620


>gi|409730794|ref|ZP_11272353.1| hypothetical protein Hham1_16314 [Halococcus hamelinensis 100A6]
 gi|448723490|ref|ZP_21706008.1| hypothetical protein C447_10082 [Halococcus hamelinensis 100A6]
 gi|445787756|gb|EMA38495.1| hypothetical protein C447_10082 [Halococcus hamelinensis 100A6]
          Length = 719

 Score =  346 bits (887), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 204/556 (36%), Positives = 298/556 (53%), Gaps = 44/556 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA+ LN+ FV IKVDREERPD+D++Y T +  + G GGWPLS
Sbjct: 52  SSCHWCHVMADESFEDERVAERLNEDFVPIKVDREERPDLDRLYQTVIGMVSGRGGWPLS 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+L+PD +P   GTYFPPE K G+PGF  +L  + +AW+ +R+ +        +Q ++A+
Sbjct: 112 VWLTPDGRPFYIGTYFPPEAKRGQPGFLDLLDSITEAWETEREDIEGRA----DQWADAM 167

Query: 140 SASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +    +   P + P +  L   A    ++ D  +GG G   KFP+   +++++  + +++
Sbjct: 168 TGELEATPEPGDPPGSELLETAARSAVRNADREYGGSGRGQKFPQTGRLRLLMEAADRID 227

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D      A E        L  MA GG+ DHVGGGFHRY+ D  W VPHFEKMLYD  +L 
Sbjct: 228 DEEFGTVAREA-------LDAMADGGLRDHVGGGFHRYTTDREWTVPHFEKMLYDNAELV 280

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YLD + L  D  Y+ + R+ L ++ R++  P G  FS  DA S +  G   ++EGAFY
Sbjct: 281 RAYLDGYRLFGDERYAEVARETLGFVERELTSPEGGFFSTLDAQSVDESG--EREEGAFY 338

Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT  EV D +G+   A LF E Y +  +GN +            G  VL    D    A
Sbjct: 339 VWTPDEVHDAVGDDRAAELFCERYGISESGNFE-----------NGTTVLTLAADVQGLA 387

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +    +E+    L   R  +F  R++R RP  D+KV+  WNGL++++FA A   L    
Sbjct: 388 DEYDTTVEEVEADLERAREAVFAARAERSRPDRDEKVLAGWNGLMVAAFAEAGLALD--- 444

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           + E A +A  F+R  L++E+  RL   +++G  K  G+L+DYAF
Sbjct: 445 --------------PRFAETAVAALDFVREELWNEEEERLSRRYKDGEVKIDGYLEDYAF 490

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L  G L  YE       L +A++L    +  F D E G  + T     S++ R +E  D 
Sbjct: 491 LARGALACYEATGDVHHLGFALDLARAIESEFWDPEEGTLYFTPSSGESLVARPQELDDQ 550

Query: 557 AEPSGNSVSVINLVRL 572
           + PS   V+V  L+ L
Sbjct: 551 STPSSTGVAVETLLAL 566


>gi|441496345|ref|ZP_20978578.1| Thymidylate kinase [Fulvivirga imtechensis AK7]
 gi|441439862|gb|ELR73159.1| Thymidylate kinase [Fulvivirga imtechensis AK7]
          Length = 680

 Score =  346 bits (887), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 208/560 (37%), Positives = 296/560 (52%), Gaps = 57/560 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME ESFE++ +A ++N+ F+SIK+DREERPDVD++YM  VQA+   GGWPL+
Sbjct: 58  SSCHWCHVMERESFENDSIAAIMNEHFISIKIDREERPDVDQIYMDAVQAMGQSGGWPLN 117

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+ D KP  GGTYFPPE       +  +L++V   +++KR  + +S     +QL+ A+
Sbjct: 118 VFLTSDQKPFYGGTYFPPE------SWAQLLKQVARVYNEKRSEVEESA----DQLTNAI 167

Query: 140 SASASSN-KLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
           + S     +L D   E     L    E+LS  +D   GGF  APKFP P     +L +  
Sbjct: 168 ATSEVIKFRLKDNGTEYTTTTLEKMYEKLSMKFDGNKGGFKGAPKFPMPGNWLFLLRYYN 227

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
              D        E  + +  TL  +A+GGI+D +GGGF RYSVD  W VPHFEKMLYD G
Sbjct: 228 ATND-------QEALRQLEVTLSEIARGGIYDQIGGGFARYSVDADWLVPHFEKMLYDNG 280

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           QL ++Y +A++ TK   Y  +    +D+L R+M    G  +SA DADS   EG    +EG
Sbjct: 281 QLVSLYAEAYTATKLELYKEVVYQTIDWLEREMTSKEGGFYSALDADS---EG----EEG 333

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FYVWT  EVE +LG  A L   +Y ++  GN +           +GKN+L         
Sbjct: 334 KFYVWTKDEVEHVLGAEANLIMSYYNIEKEGNWE-----------EGKNILHMHVSDEEF 382

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A +  + + +    + +    L + RSKR RP LDDKV+  WNGL+      A       
Sbjct: 383 AKRHDLGVAELKEKVWKADELLLEERSKRVRPGLDDKVLAGWNGLMQKGLVDA------- 435

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                     V     +++++A   A F+ +H+  +   RL  SF++G +   G+L+DYA
Sbjct: 436 ---------YVAFGEPKFLDLALRNAHFLDQHMIHD--FRLNRSFKSGKASIDGYLEDYA 484

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F+I     LYE     +WL  A  L +   E F D     +F T      ++ R KE  D
Sbjct: 485 FVIDAYTALYEATFDEQWLKKAKGLMDYTIEHFYDNSEKLFFFTDDRSEKLIARKKEVFD 544

Query: 556 GAEPSGNSVSVINLVRLASI 575
              P+ NS   +NL RL  I
Sbjct: 545 NVIPASNSQMALNLYRLGKI 564


>gi|436836357|ref|YP_007321573.1| protein of unknown function DUF255 [Fibrella aestuarina BUZ 2]
 gi|384067770|emb|CCH00980.1| protein of unknown function DUF255 [Fibrella aestuarina BUZ 2]
          Length = 682

 Score =  346 bits (887), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 231/690 (33%), Positives = 341/690 (49%), Gaps = 72/690 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE+E +AK++N+ FV IKVDREERPDVD VYM  VQA+   GGWPL+
Sbjct: 47  SACHWCHVMERESFENEQIAKIMNERFVCIKVDREERPDVDAVYMEAVQAMGVQGGWPLN 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL PD +P  G TY PP++      +  ++  V+ A+D+ RD L +S     E L+ + 
Sbjct: 107 VFLMPDARPFYGLTYAPPQN------WANLMVGVRQAFDENRDELLRSAEGFAEHLNTSE 160

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S             Q  +     +L+  +D+  GG G APKFP P     +L ++     
Sbjct: 161 STRFQLQTAEPVYAQETVETMYRKLATRFDTELGGTGRAPKFPMPSIYTFLLRYAD---- 216

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
              +G+ S  Q++ L TL  MA GGI+D +GGGF RYS D+ W  PHFEKMLYD  QL  
Sbjct: 217 --LTGDPSAFQQLTL-TLNRMALGGIYDQLGGGFARYSTDKHWFAPHFEKMLYDNAQLLT 273

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +AF++T    Y +     +++L R+++ P G  +SA DADS   EG     EG FY 
Sbjct: 274 LYSEAFAMTGSALYRFTVYHTIEFLERELLSPDGGFYSALDADS---EGI----EGKFYT 326

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W++ E++ ILG+    F + Y + P GN D+      H   +  N+L     + A A +L
Sbjct: 327 WSADELQSILGDDYDWFAQLYTITPEGNWDIG-----HGHGR-TNILHRTETNPAFADQL 380

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G    +    L   + KL  VRS+R RP LDDK++ SWNGL +     A ++        
Sbjct: 381 GWTAAELNERLTTAKEKLLAVRSQRVRPGLDDKLLCSWNGLALKGLVSAYRV-------- 432

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT-HRLQHSFRNGP-----SKAPGFLDD 493
            FN P       E++ +A   A FI++ L D +   RL HS++ GP     ++  GFL+D
Sbjct: 433 -FNEP-------EFLSMALRLAFFIKQKLTDGRNGGRLWHSYKTGPDGVGRARQLGFLED 484

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA +I G + LY+     +WL  A  L       F D +    F T      ++ R KE 
Sbjct: 485 YAAVIDGYVALYQATFADEWLTEADRLTQYVLAHFNDPDEPLLFFTDKSGEELIARKKEL 544

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D   P+ NS+   NL  L+ ++   +   Y +  +  L + +  L +    V  +   A
Sbjct: 545 FDNVIPASNSIMAQNLYTLSLLLERPE---YAERVDQMLGLIQPLLDN---EVNYLTNWA 598

Query: 614 DMLSVPSR--KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
            + ++  R    + +VG     D +       A +  NK +   D              S
Sbjct: 599 SLYTLRVRPTAEIAIVGP----DAQEFRRDIDAKFFPNKVLAGTD------------SRS 642

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVT 701
           +   +A+      +    VC N +C  PVT
Sbjct: 643 SLPLLAQRGPIDGQTAIYVCYNRACQLPVT 672


>gi|148264330|ref|YP_001231036.1| hypothetical protein Gura_2283 [Geobacter uraniireducens Rf4]
 gi|146397830|gb|ABQ26463.1| protein of unknown function DUF255 [Geobacter uraniireducens Rf4]
          Length = 700

 Score =  345 bits (886), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 238/695 (34%), Positives = 340/695 (48%), Gaps = 78/695 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME E+FED  VA + N +F+ IKVDREERPD+D+ YM   Q + G GGWPL++
Sbjct: 79  TCHWCHVMEHEAFEDREVAAVFNRFFICIKVDREERPDIDEQYMAVAQMMTGSGGWPLNI 138

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++P+ KP    TY P   + G PG   IL +V + W  +R  L Q     IE L+    
Sbjct: 139 FMTPEKKPFFAATYMPRTPRMGMPGIIQILERVAELWRTERQKLEQDSDVTIEALTHHFQ 198

Query: 141 ASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
               S  LPD  L QNA     +QL++ YD  +GGFG+ PKFP P+ +  +L   K    
Sbjct: 199 PHPGS--LPDMVLVQNAY----QQLTEMYDDLWGGFGNVPKFPMPLYLTFLLRFWK---- 248

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             +SG  +    MV  TL+ + +GGI+D +G GFHRY+VD +W VPHFEKMLYDQ  +A 
Sbjct: 249 --RSGNGAS-LAMVEHTLRMLRQGGIYDQIGFGFHRYAVDRQWLVPHFEKMLYDQALIAI 305

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            YLDAF  T   FY  +  ++  Y+  +M  P G  F+ +DAD   TEG    +EG +Y+
Sbjct: 306 GYLDAFQATAVPFYRQVAEEVFAYVLGEMTSPEGGFFAGQDAD---TEG----EEGNYYI 358

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  E+   +G + A +F           C L  +++  N F+G+N+L         A++
Sbjct: 359 WTPAEIAAAIGHDEAQVF-----------CRLFDVTEKGN-FEGRNILHLPVPPETFAAR 406

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             +  E     L   R  L  VR  R RP  D+KV+ +WNGL+I++ AR   +       
Sbjct: 407 EAILTEVLTADLERWRHTLLKVRGNRIRPFRDEKVLTAWNGLMIAALARGYAL------- 459

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                    S  + ++  A+ AA+FI   L      RL  SF  G +  P FLDDYAF +
Sbjct: 460 ---------SGEERFLAAAKRAAAFIGTRL-TSPGGRLMRSFHLGEASVPAFLDDYAFFV 509

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGA 557
            GL++L++     ++L  A  L +    LF   +GG Y   TG D   L  +++   DG 
Sbjct: 510 WGLIELHQVTLEPEFLDSARFLADEMLRLFHSGKGGLY--ETGLDSEQLPVIRQSARDGV 567

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGNSV+  +L RL  I    +   + ++ E  +  F   +    +A      A+D   
Sbjct: 568 LPSGNSVAAFDLFRLGRITGDGR---FLESGEAVVRTFMGDVTRQPLASLNFLSASDYHL 624

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
            P    V L G++  +    ML A H  +  N  + +                       
Sbjct: 625 GPEVT-VTLAGNREELG--GMLDAVHRRFIPNLALRY------------------GGEGG 663

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
            +        A VC   +C P VT   +L  LL E
Sbjct: 664 ESPTVGGLPTAYVCAKGACRPSVTRADALGALLDE 698


>gi|298206807|ref|YP_003714986.1| hypothetical protein CA2559_01090 [Croceibacter atlanticus
           HTCC2559]
 gi|83849439|gb|EAP87307.1| hypothetical protein CA2559_01090 [Croceibacter atlanticus
           HTCC2559]
          Length = 681

 Score =  345 bits (886), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 216/686 (31%), Positives = 350/686 (51%), Gaps = 70/686 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFED  +A+++N  F++IKVDREERPDVD+VYM  +Q + G GGWPL++ 
Sbjct: 56  CHWCHVMEHESFEDISIAEVMNANFINIKVDREERPDVDQVYMKALQLMTGQGGWPLNIV 115

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ G TY P      +  +K  L ++ D +    + +        E+LS+ ++ 
Sbjct: 116 ALPDGRPIWGATYLP------KKQWKGSLHQLADLYRSNSEHMITYA----EKLSKGMAQ 165

Query: 142 SASSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            +   K     ++ +  L+   +  S  +D  +GG   +PKF  P   Q +L ++ + +D
Sbjct: 166 VSLVTKTDSNTDISKAFLKDSLQTWSNQFDYTYGGTQRSPKFMMPNNYQFLLRYAHQTKD 225

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                        V+ TL  ++ GG++DH+GGGF RY+VD +WHVPHFEKMLYD  QL +
Sbjct: 226 KSL-------LDYVILTLNKISYGGVYDHIGGGFSRYAVDSKWHVPHFEKMLYDNAQLVS 278

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  A++LTKD +Y  +  + L+++  ++    G  +S+ DADS  TEG  + +EGAFYV
Sbjct: 279 LYSKAYTLTKDPWYKTVVTNTLNFIETELTRDNGSFYSSLDADSLNTEG--KLEEGAFYV 336

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT  E++ +L E   LF+ +Y +   G+ +       HN +    VLI    +S  A+  
Sbjct: 337 WTKAELKSLLNEDYPLFEAYYNINEYGHWE-------HNNY----VLIRTKSNSEIANDF 385

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +P+      L   +  L + R KR +P LDDK + SWN L+I+ +  A K  +      
Sbjct: 386 SIPISTLDKKLTSWKALLNNNRQKRAQPRLDDKSLTSWNALMINGYIDAYKAFQIN---- 441

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       +Y+E+A  A++FI   +  ++   L HS+    +K  G+L+DYAF I 
Sbjct: 442 ------------DYLEIALKASNFILDKML-QKDGSLTHSYNKNEAKINGYLEDYAFTIE 488

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             + L+E    +KWL  A EL     + F D E   ++  +  D +++ R  E  D   P
Sbjct: 489 AFISLFEVTFNSKWLSKAEELTTYALKHFYDEEQHIFYFNSNLDDALVTRPIEQQDNVIP 548

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           + NS    NL +L+ ++ G KS  Y++ AE  L       K  A             S P
Sbjct: 549 ASNSTMAKNLFKLSHLL-GIKS--YKEIAEQQLKTVLQDAKTYASGYSNWLDVIMNFSFP 605

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
             + +V+ G  +S   +++        +LN     I  A  +E        +N+  + +N
Sbjct: 606 YHE-IVITGKNASNYVKDL--------NLNYIPNSITAATEKE--------NNDLLIFKN 648

Query: 680 NFSADKVVALVCQNFSCSPPVTDPIS 705
            +  ++ +  VC++ +C+ P TD +S
Sbjct: 649 RYVDEQTLIYVCKDNTCNVP-TDKVS 673


>gi|338741363|ref|YP_004678325.1| hypothetical protein HYPMC_4552 [Hyphomicrobium sp. MC1]
 gi|337761926|emb|CCB67761.1| conserved protein of unknown function [Hyphomicrobium sp. MC1]
          Length = 682

 Score =  345 bits (886), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 231/696 (33%), Positives = 340/696 (48%), Gaps = 74/696 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED   A+++ND FV+IKVDREERPD+D +YM  +  L   GGWPL++F
Sbjct: 51  CHWCHVMAHESFEDPETARVMNDLFVNIKVDREERPDIDAIYMGALHRLGEQGGWPLTMF 110

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L  + KP  GGTYFP E +YGRP F T+L ++ +A+  + + +A++    +  L E  S 
Sbjct: 111 LDSEAKPFWGGTYFPRESRYGRPSFVTVLLRIAEAYQSQPENVAKNTEALVAALKEEAST 170

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
           +      PD +P    R     ++++ D   GG   APKFP+     ++   + +  D  
Sbjct: 171 TDRVEAGPD-VPDLVAR-----ITRAVDRDHGGINGAPKFPQWNIFWLLWRGAMRFGD-- 222

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                 + ++ V+ TL+ + +GGI+DH+GGGF RYSVD  W VPHFEKMLYD   L ++ 
Sbjct: 223 -----EDAKQAVITTLRNICQGGIYDHLGGGFARYSVDPFWLVPHFEKMLYDNALLIDLI 277

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
            + +  T+D  +     + + +L+R+MIG  G   ++ DADS   EG    +EG FYVW 
Sbjct: 278 TEVWRETQDPLFKIRIAETVAWLKREMIGEAGGFAASLDADS---EG----EEGKFYVWH 330

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            KE+ D+LG E A +F + Y +   GN             +G  +L  L   S S+ +  
Sbjct: 331 KKEIVDVLGPEDAAIFGKVYGVTRDGNFSEHAAITASGRIEGPTILNRLESQSFSSDEAE 390

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             L        E R KL   R+ R RP  DDK++  WNGL+I++ +RA+ +         
Sbjct: 391 ARLS-------EMRAKLLTRRAGRVRPGWDDKILADWNGLMIAAMSRAAIVF-------- 435

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   D+ E++ +AE+A + +   L      RL HS+R G +KAP    DYA +I  
Sbjct: 436 --------DQPEWLGMAEAAFTCVATKL-SAGGDRLYHSYRGGLAKAPATASDYANMIWA 486

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            L LYE  S  ++L  A       D  + D + GGYF    +   V++R+K   D A PS
Sbjct: 487 ALRLYEATSSDRYLSQAQRWAAVLDTHYWDGDSGGYFTAADDTSDVVVRLKSASDDATPS 546

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCA--ADMLS 617
            N++ + NL+ LA++      D          A   TR+   A+A  P   C   A    
Sbjct: 547 ANAIQLSNLITLAAMTGDLTYD--------DRAAELTRVFSGAVARAPTGHCGLIAAGFD 598

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS--NNAS 675
           +     V ++G   S              DL K + +I       + F  E  S    ++
Sbjct: 599 LGRLVQVAVIGEGRS--------------DLQKALTNISVPGA--VSFISETGSFTEGSA 642

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
           +A       K  A VC    C  PV D   L   LL
Sbjct: 643 LAGKASIGGKSTAYVCVGPVCGMPVQDAQELRKELL 678


>gi|312115384|ref|YP_004012980.1| hypothetical protein Rvan_2669 [Rhodomicrobium vannielii ATCC
           17100]
 gi|311220513|gb|ADP71881.1| hypothetical protein Rvan_2669 [Rhodomicrobium vannielii ATCC
           17100]
          Length = 685

 Score =  345 bits (885), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 231/699 (33%), Positives = 349/699 (49%), Gaps = 85/699 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE E  A+L+N  F++IKVDREERPDVD +YMT +Q L   GGWPL++F
Sbjct: 51  CHWCHVMAHESFEKEDTAELMNRLFINIKVDREERPDVDTLYMTALQELGEQGGWPLTMF 110

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P  GGTYFP + ++G+P FK +L  V   + ++++ +AQ+ A+  ++L+  L+ 
Sbjct: 111 LTPDGMPFFGGTYFPDKSRFGKPSFKDVLVNVARVYAQEKETIAQNTAYLKQRLTPRLNY 170

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-----YHSKK 196
            A+      E  +  L   A +   + D   GG   APKFP     Q +      Y+ K 
Sbjct: 171 GAAP-----EFSEEQLAAIAAKFIGAIDPTNGGLRGAPKFPNTTIFQFLWRAGLRYNLKT 225

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             +  K+            TL  + +GGI+DH+GGGF RY+VDERW VPHFEKMLYD   
Sbjct: 226 CIEEVKN------------TLLHICQGGIYDHLGGGFSRYTVDERWLVPHFEKMLYDNAL 273

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L     + +  T+         + + +L+RDMI PGG   ++ DADS   EG    +EG 
Sbjct: 274 LIEFMTEVWKETQSDRLKTRVAETIGWLKRDMIVPGGAFAASYDADS---EG----EEGK 326

Query: 317 FYVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           FYVWT++E+ DIL  GE A +F + Y +   GN            ++GK +L  L     
Sbjct: 327 FYVWTAREITDILGHGEEAAIFAQTYDVTEGGN------------WEGKTILNRLK---- 370

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
           + + L    E+ ++   ECR KLF  R +R +P  DDKV+  WNGL I + ARA      
Sbjct: 371 ALALLNGGEERAMD---ECRAKLFAERERRVKPGWDDKVLADWNGLAIRALARAGDAFA- 426

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                          + +++ +A  A  F++  +   +  RL HS+R+G  K P    DY
Sbjct: 427 ---------------QPDWIVLAADAYGFVKSRMI--ENGRLFHSWRDGKLKGPATAADY 469

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +IS  L L++     ++L  A+E     +  + D E GGY+    +   ++LR     
Sbjct: 470 ANIISAALVLHQVTGEPRYLDDAVEWTAIMNRHY-DAEQGGYYFAADDTSDLILRPLSAS 528

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D A P+ N+  + NL  L ++   +    Y + A+  L  F+   + MA+    +   A 
Sbjct: 529 DDAVPNANATMLQNLADLYTLTGDAA---YLKRADGLLTAFQGAAQTMAIGYTGLLSGA- 584

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
            L++ S + + + G ++  D      A         TV  ++P          + N   +
Sbjct: 585 -LTLISPQSIAIAGDRAGPDAAAWRRALAEVSLPGATVQWVNP----------DENLPAS 633

Query: 675 SMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
           S A    + D K  A +C    CS P+TDP  L++ L E
Sbjct: 634 SPAFGKKAIDGKTTAYICFGPRCSEPITDPAILKDRLKE 672


>gi|448474014|ref|ZP_21601982.1| hypothetical protein C461_06214 [Halorubrum aidingense JCM 13560]
 gi|445818294|gb|EMA68153.1| hypothetical protein C461_06214 [Halorubrum aidingense JCM 13560]
          Length = 735

 Score =  345 bits (885), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 235/720 (32%), Positives = 352/720 (48%), Gaps = 86/720 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+ +A +LND FV +KVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDDSIAAVLNDQFVPVKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
            + +P+ KP   GTYFPPE +  +PGF+ +  ++ D+W          ++ +    S   
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRAEQWTTSARD 172

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
            +E + E   A  + +  P     + L   A    + YD  +GGFGS   KFP P  I +
Sbjct: 173 ELESVPEPGDADDADDTGPSG--SDLLEEAAAAAIRGYDDEYGGFGSGGAKFPMPGRIDL 230

Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
           ++  + +   +     A+        TL  MA+GG++D +GGGFHRY+VD +W +PHFEK
Sbjct: 231 LMRAAARSGRSAALTAATG-------TLDGMARGGVYDQIGGGFHRYAVDRQWTIPHFEK 283

Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS------ 303
           MLYD  +L  VYLD + LT D  Y+ +  + L +L R++    G  FS  DA S      
Sbjct: 284 MLYDNAELPMVYLDGYRLTGDPSYARVASESLGFLDRELRHADGGFFSTLDARSRPPAGR 343

Query: 304 ---------AETEGATRKKEGAFYVWTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSRM 353
                     + EG     EGA+YVWT +EV+ +L E A  L K  + ++  GN +    
Sbjct: 344 GGGRGNDEGGDGEGDAPAVEGAYYVWTPEEVDAVLDEPASSLAKARFGIRSGGNFE---- 399

Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
                  +G  V          A +   P ++   IL + R  LF+ R  RPRP  D+KV
Sbjct: 400 -------RGTTVPTVAASIEELADEYDRPADEVREILTDARVALFEARETRPRPARDEKV 452

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
           + SWNG  IS+FARA  +L                    Y  +A  A +F R  LYDE T
Sbjct: 453 LASWNGRAISAFARAGDVLG-----------------DSYAAIASDALAFCRDRLYDEDT 495

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
             L   + +G  + PG+LDDYAFL  G LD+Y      + L +A++L  +  + F +   
Sbjct: 496 GELARRWLDGDVRGPGYLDDYAFLARGALDVYAATGDPEPLGFALDLAESLVDAFYEAAD 555

Query: 534 GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YRQNA 588
           G  + T      +D ++  R +E  D + PS   V+   L    +++ G ++D  +R+ A
Sbjct: 556 GTIYFTRDPDASDDDTLFARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDREFREIA 611

Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHAS--- 645
           E  +     R++    A PL   +     V + +HV   G + ++  + + AA   +   
Sbjct: 612 EAVVTTHADRIR----ASPLEHVSL----VRAAEHVETGGVEVTIAADEVPAAWRETLGE 663

Query: 646 -YDLNKTVIHIDPADTEEMDFWEEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
            Y     V    P D     + ++   + A    A  +    +  A VC+ F+CSPP TD
Sbjct: 664 RYLPGALVAPRPPTDAGLAAWLDDLGLDEAPPIWADRDALDGEPTAYVCEGFACSPPRTD 723


>gi|417781210|ref|ZP_12428962.1| PF03190 family protein [Leptospira weilii str. 2006001853]
 gi|410778461|gb|EKR63087.1| PF03190 family protein [Leptospira weilii str. 2006001853]
          Length = 630

 Score =  345 bits (884), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 243/694 (35%), Positives = 354/694 (51%), Gaps = 76/694 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+PD K
Sbjct: 1   MEKESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P+ GGTYFPPE +YGR  F  IL  ++  W++KR  L      A  +LS  L  S     
Sbjct: 61  PITGGTYFPPEPRYGRKSFLEILNILRKVWNEKRQEL----IVASSELSRYLKDSGEGRA 116

Query: 148 LPDE---LP-QNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLED 199
           +  +   LP +N            YD+ FGGF +    KFP  + +  +L  YHS     
Sbjct: 117 IEKQEGSLPSENCFDSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYYHS----- 171

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
              SG      +MV  TL  M +GGI+D +GGG  RYS D  W VPHFEKMLYD      
Sbjct: 172 ---SGNP-RALEMVENTLLAMKQGGIYDQIGGGLCRYSTDHHWMVPHFEKMLYDNSLFLE 227

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
             ++   ++K +       D++ YL RDM   GG I SAEDADS   EG    +EG FY+
Sbjct: 228 TLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYI 280

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W  +E  ++ GE + + ++ + +   GN            F+GKN+L E     + A+K 
Sbjct: 281 WDFEEFREVCGEDSQILEKFWNVTKKGN------------FEGKNILHE--SYRSEATKF 326

Query: 380 GMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                K ++ +L   R KL + RSKR RP  DDK++ SWNGL I + A+A          
Sbjct: 327 SEEEWKRIDSVLERGRAKLLERRSKRVRPLRDDKILTSWNGLYIKALAKAG--------- 377

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                  V   R++++++AE   SFI ++L D    R+   FR+G S   G+ +DYA +I
Sbjct: 378 -------VAFQREDFLKLAEETYSFIEKNLIDPNG-RILRRFRDGESGILGYSNDYAEMI 429

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGA 557
           S  + L+E G G ++L  A+     +D + L R   G F  TG D  VLLR   D +DG 
Sbjct: 430 SSSIALFEAGCGIRYLKNAVLWM--EDAIRLFRSPAGVFFDTGSDGEVLLRRSVDGYDGV 487

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
           EPS N     +LV+L+  + G  S  Y + AE     F   L   +++ P +  A     
Sbjct: 488 EPSANGSLAYSLVKLS--LFGIDSARYGEFAESIFLYFTKELSTNSLSYPHLLSAYWTYR 545

Query: 618 VPSRKHVVLVGHKSSVDF-ENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             S K +VL+  +   DF +++LAA    +  +  +  ++  + EE           +++
Sbjct: 546 RHS-KEIVLI--RKDTDFGKDLLAAIQTRFLPDSVLAVVNENELEEA-------RKLSTL 595

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +  S    +  VC+NFSC  PV++   L+  +
Sbjct: 596 FDSRDSGGNALVYVCENFSCKLPVSNLADLKKWI 629


>gi|404447779|ref|ZP_11012773.1| hypothetical protein A33Q_00490 [Indibacter alkaliphilus LW1]
 gi|403766365|gb|EJZ27237.1| hypothetical protein A33Q_00490 [Indibacter alkaliphilus LW1]
          Length = 674

 Score =  345 bits (884), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 236/694 (34%), Positives = 346/694 (49%), Gaps = 89/694 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE  A+L+N +FV IK+DREERPD+D +YM  VQA+   GGWPL+
Sbjct: 47  SACHWCHVMEKESFEDEATAQLMNQYFVCIKIDREERPDLDNIYMDAVQAMGLQGGWPLN 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIE-QLSE 137
           VFL P+ KP  GGTYFP         +K +L+ + +A+ +  D LA+S   F    Q SE
Sbjct: 107 VFLMPNQKPFYGGTYFP------NAQWKALLQNIGEAYQEHYDQLAKSAEEFGNSLQTSE 160

Query: 138 ALSASASSNKL---PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
            L    S       P EL + A++L   Q    +D  +GG    PKFP P     ++ ++
Sbjct: 161 FLKYGLSHGTFQLDPKELAE-AIKLLENQ----FDLDWGGMNRKPKFPMPAIWSFVMDYA 215

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
                  KS E    +  V FTL+ +  GGI+DH+ GGF RYSVD  W  PHFEKMLYD 
Sbjct: 216 -----LAKSDEVLLAK--VFFTLKKIGMGGIYDHLRGGFARYSVDGEWFAPHFEKMLYDN 268

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           GQL ++Y  A++++ + FY     + + +L+ +M+   G  ++A+DADS   EG     E
Sbjct: 269 GQLLDLYSKAYAVSGEYFYKEKILETIAWLKSEMLHKEGGFYAAQDADS---EGV----E 321

Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           G FY WT +E+E I+GE    F + Y LK  GN +            G N+L +      
Sbjct: 322 GKFYTWTYEELESIVGEDLHWFAKLYNLKYQGNWE-----------DGVNILFQTESYEK 370

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A    +  E Y+  L E + KL  VR++R  P LDDK++  WNGL+IS    A   L  
Sbjct: 371 LAESSELSEEGYIQRLNEIKAKLLSVRNQRIFPGLDDKILSGWNGLMISGLVSAYTSLGD 430

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
           E                E +E++ + A+FI   +Y ++   L  S++NG +  P FL+DY
Sbjct: 431 E----------------EALELSLNNATFILDKMYKDKV--LYRSYKNGHAYTPAFLEDY 472

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +I G + LY+    +KWL+ A EL +   E F D E G ++    +   ++   KE  
Sbjct: 473 AAVIRGFISLYQATLDSKWLLKAKELSDKVIEAFYDEEEGFFYFNNPQAEKLIANKKELF 532

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-- 612
           D   P+ NS+   NL+ L+        D Y   A++ L      +K + +  P   C   
Sbjct: 533 DNVIPASNSIMARNLLDLSMFFY---EDNYAAIAKNMLGT----MKKLIIKEPGFLCNWA 585

Query: 613 ---ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
               DML +P +  V +VG  +    +   A  ++ + L+ +               E+ 
Sbjct: 586 SLYLDML-LP-KAEVAIVGEGAEKLGQEFFAKRNSGFILSAS---------------EKT 628

Query: 670 NSNNASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
           N+    +       D   +  VC N SC  PV+D
Sbjct: 629 NTEIPLLEGKKPDTDGNALIYVCFNRSCQRPVSD 662


>gi|77166007|ref|YP_344532.1| hypothetical protein Noc_2549 [Nitrosococcus oceani ATCC 19707]
 gi|254436399|ref|ZP_05049905.1| conserved hypothetical protein [Nitrosococcus oceani AFC27]
 gi|76884321|gb|ABA59002.1| Protein of unknown function DUF255 [Nitrosococcus oceani ATCC
           19707]
 gi|207088089|gb|EDZ65362.1| conserved hypothetical protein [Nitrosococcus oceani AFC27]
          Length = 694

 Score =  345 bits (884), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 234/693 (33%), Positives = 351/693 (50%), Gaps = 58/693 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESFED   A ++N +F++IKVDREERPD+D++Y    Q L G  GGWPL
Sbjct: 53  SACHWCHVMAHESFEDSETAAVMNQYFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPL 112

Query: 79  SVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           ++FL P    P  GGTYFPPE+++G PGFK +L++V + +  +R+ +       ++   +
Sbjct: 113 TMFLEPIKQAPFFGGTYFPPEERHGLPGFKDLLQRVAEYFHTRREAIQSQNERLLDAFGD 172

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            L A   + ++ + L +  L+    QL++++DSR GGF  APKFP P  I+  L  ++  
Sbjct: 173 -LDARLPAAEV-EGLNRAPLQAAHRQLAQAFDSRHGGFRGAPKFPNPSSIERCLRDARGE 230

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             T    E  +   M   TL+ MA+GGI+D +GGGF RYSVDE W +PHFEKMLYD GQL
Sbjct: 231 HLT--EDEKQQALTMARLTLEQMAQGGIYDQLGGGFCRYSVDEEWRIPHFEKMLYDNGQL 288

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +Y DA+ L     +  I  +   +  R+M  P G  +S+ DADS   EG     EG F
Sbjct: 289 LVLYRDAYRLWGSGLFRRILEETGHWAVREMQSPEGGYYSSLDADS---EG----HEGKF 341

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT ++V  +LGE        Y+           +  P N F+G   L       A A 
Sbjct: 342 YVWTREQVRALLGEEEYALAARYF----------GLDQPAN-FEGYWHLYAATVPEALAQ 390

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           ++ +P       L   ++KLF  R  R RP  DDK++ +WNGL+I   A A + L     
Sbjct: 391 EMKVPAPGLQEQLTAAKQKLFAAREARIRPGRDDKILTAWNGLMIKGMAAAGQALAQ--- 447

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                 PV       ++  AE A  F+R HL+  Q  RL  S+++G ++  G+LDDYAFL
Sbjct: 448 ------PV-------FIASAERAVDFVRAHLW--QKGRLLVSYKDGRAQHRGYLDDYAFL 492

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  LL+L +       L +A++L     E F D+  GG++ T  +   ++ R     D A
Sbjct: 493 LDALLELLQVRWRDGDLSFAVDLAEAVLERFEDKAQGGFYFTADDHEILIHRPVPLMDDA 552

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            P+GN V   +L+RL  ++   +   Y + AE +L      ++    A   +    +   
Sbjct: 553 TPAGNGVLAWSLLRLGHLLGEVR---YLKAAESTLKAAWKSIQQTPHAHCSLLKTLEEWL 609

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +P +  V+L G     + E   A A A Y   +  + I P + +++            + 
Sbjct: 610 IPPQI-VILRG--GGEELETWRAVAAAEYAPRRVALAI-PLEAQDLP---------GILG 656

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   V A VC   +CS P+T   +L+  L
Sbjct: 657 EYRPQGTAVTAYVCSGHTCSAPLTRREALKEHL 689


>gi|418053652|ref|ZP_12691708.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
           1NES1]
 gi|353211277|gb|EHB76677.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
           1NES1]
          Length = 677

 Score =  345 bits (884), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 209/596 (35%), Positives = 315/596 (52%), Gaps = 72/596 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED G A+++N+ FV+IKVDREERPD+D +YM  +  L   GGWPL++F
Sbjct: 51  CHWCHVMAHESFEDSGTAEVMNELFVNIKVDREERPDIDAIYMGALHRLGEQGGWPLTMF 110

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L  D KP  GGTYFP E +YGRP F T+L ++ +A+  + D         I + +EAL A
Sbjct: 111 LDSDAKPFWGGTYFPREARYGRPAFVTVLLRIAEAYQNQPDN--------IRKNTEALLA 162

Query: 142 SASSNKLPDELPQNALRLCAEQ----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +   +  P+E   +A R   +     ++++ D   GG   APKFP+     ++   + + 
Sbjct: 163 ALKES--PNETSADASRPMTKDVVAAIARAVDREHGGLSGAPKFPQWSVFWLLWRGAIRY 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +D          Q+ V+ TL+ + +GGI+DH+GGGF RYSVDE W VPHFEKMLYD   L
Sbjct: 221 DD-------PNAQEAVVTTLRHICQGGIYDHLGGGFARYSVDEFWLVPHFEKMLYDNALL 273

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++  + +  T+D  +     + + +L+R+MIG  G   ++ DADS   EG    +EG F
Sbjct: 274 IDLLTEVWRETQDPIFKTRIAETVTWLKREMIGEAGGFAASLDADS---EG----EEGKF 326

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW++ E+ED+LG E A  F   Y + P GN            F+G  +L  LN      
Sbjct: 327 YVWSAAEIEDVLGAEDAAFFSRVYGVTPEGN------------FEGHTILNRLN------ 368

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
             L +   +    L + R KL + R+ R RP  DDK++  WNGL+I++ +RA+ + +   
Sbjct: 369 -SLALLTNEEEAHLAKLRAKLLERRASRIRPGWDDKILADWNGLMIAALSRAAVVFEC-- 425

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                          +++ +AE A   I   L      RL H++R G +KAP    DYA 
Sbjct: 426 --------------SDWLALAERAFDCIVTKLAAPDG-RLFHAYRKGLAKAPAIASDYAN 470

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           + S  L L+      ++L  A +     D+ + D + GGYF    +   V++R+K   D 
Sbjct: 471 MTSAALRLFAATGSERYLEHARQWTRILDKHYWDVQRGGYFTAADDTGDVVVRLKVASDD 530

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
           A PS N++ + NL+ LA++          Q+ E +  + E     MA+  P+  CA
Sbjct: 531 AAPSANAIQLSNLIALAAVTGDV------QHHERARQLLEAFAPAMALG-PIGHCA 579


>gi|448731719|ref|ZP_21714012.1| hypothetical protein C450_00645, partial [Halococcus salifodinae
           DSM 8989]
 gi|445805618|gb|EMA55820.1| hypothetical protein C450_00645, partial [Halococcus salifodinae
           DSM 8989]
          Length = 580

 Score =  345 bits (884), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 200/565 (35%), Positives = 295/565 (52%), Gaps = 43/565 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE VA+ LND FV IKVDREERPD+D++Y T    + G GGWPLS
Sbjct: 52  SACHWCHVMEDESFEDERVAERLNDEFVPIKVDREERPDLDRLYQTICGMVSGQGGWPLS 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEA 138
           V+L+PD +P   GTYFP ++K G+PGF  +L  + ++W+  R D+  ++  +A     E 
Sbjct: 112 VWLTPDGRPFYVGTYFPRDEKRGQPGFLDLLDSIAESWENDREDIEGRADQWAGAMAGEL 171

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            +      ++PD    + L   A+Q  ++ D  +GGFG   KFP+   + +++   +  E
Sbjct: 172 EATPEQPGEVPD---SDLLETAAQQAVENADREYGGFGHGQKFPQTGRLHLLM---RAAE 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG+        ++    L  M++GG+ DH GGGFHRY+ D  W VPHFEKMLYD  +L 
Sbjct: 226 RTGRES----FDEVAHEALDAMSEGGLRDHAGGGFHRYTTDREWTVPHFEKMLYDNAELT 281

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             YL  +  T    Y+ + R+ L ++ R++  P G  FS  DA S +  G   ++EGAFY
Sbjct: 282 RAYLAGYRRTGAERYAEVARETLGFVERELRHPDGGFFSTLDAQSEDESG--EREEGAFY 339

Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           VWT   V D + +   A LF E Y +   GN +            GK VL    +    A
Sbjct: 340 VWTPNGVHDAVDDEFAADLFCERYGVTEAGNFE-----------DGKTVLTVSTEIEDLA 388

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +     E+    L   R  +F  R++R RP  D+KV+  WNGL+IS+FA A   L +  
Sbjct: 389 DEHDTTTEEVSAELERAREAVFAARAERERPERDEKVLAGWNGLMISAFAEAGLALDA-- 446

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           + + A +   F+  HL++++  RLQ  +++G  K  G+L+DYAF
Sbjct: 447 ---------------RFADTAVAGIEFVHEHLWNDEKRRLQRRYKDGDVKIEGYLEDYAF 491

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L  G L+ YE       L +A++L    +  F D +    + T     S++ R +E  D 
Sbjct: 492 LARGALNCYEATGEVDHLAFALDLARAIETEFWDSDEETLYFTPQTGESLVARPQELDDQ 551

Query: 557 AEPSGNSVSVINLVRLASIVAGSKS 581
           + PS   V+V  L+ L    A   S
Sbjct: 552 STPSSTGVAVDVLLALDHFAADRPS 576


>gi|317470765|ref|ZP_07930149.1| hypothetical protein HMPREF1011_00496 [Anaerostipes sp. 3_2_56FAA]
 gi|316901754|gb|EFV23684.1| hypothetical protein HMPREF1011_00496 [Anaerostipes sp. 3_2_56FAA]
          Length = 679

 Score =  344 bits (883), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 236/699 (33%), Positives = 349/699 (49%), Gaps = 85/699 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVME ESFED  VA+LLN  F+SIKVDREERPD+D VYM+  QA+ G GGWP+SV
Sbjct: 53  SCHWCHVMEEESFEDHEVAELLNKHFISIKVDREERPDIDSVYMSVCQAMTGSGGWPMSV 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++PD KP    TY P   +Y   G   +L ++   W + R+ L + G    + L+    
Sbjct: 113 FMTPDQKPFFAATYLPKTSRYHLTGLMDLLPRISLLWKQDRERLLKIGNEITDHLNTDQR 172

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            S + + L +++P  AL      L+ S+D+  GGFG+APKFP P  +  ++   K   D 
Sbjct: 173 PSETVS-LSEDVPAQAL----ADLNASFDNVNGGFGTAPKFPTPAVLLFLIQQYKLCGD- 226

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                  +   M   TL  M +GGI DH+GGGF RYS D+RW VPHFEKMLYD   L   
Sbjct: 227 ------KDSLAMAEHTLLRMYRGGIFDHIGGGFSRYSTDDRWLVPHFEKMLYDNALLLEA 280

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A++  ++  +  I   ++  +  ++  P G  + ++DADS   EG    +EG +Y +
Sbjct: 281 YAEAYACCENPLFPEIADAVVSCVLNELSHPDGGFYCSQDADS---EG----EEGKYYTF 333

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           T  EV  +LG E+  LF           C L  ++D  N F+GK++   L  S       
Sbjct: 334 TRDEVLHVLGEENGSLF-----------CSLYDITDRGN-FEGKSIPNLLKQSPFPNDHE 381

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G         L   +R L+  R KR     D K++ SWN L+IS+  +AS+I        
Sbjct: 382 G---------LKRMKRTLYLYRKKRTSLSTDKKILTSWNCLMISALTKASRIF------- 425

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                     R++++  A+ A SF+ +HL  +   RL   + +G +   G L+DYAF   
Sbjct: 426 ---------GREKFLAAAQKAESFLDKHLRKDDG-RLFLRWCDGEAAYDGQLEDYAFYSL 475

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            +L LY      ++L  A++  +    LF DRE GG+F  + E  +++L+ KE +DGA P
Sbjct: 476 SMLSLYRSTFLEEYLEKAVQAADLMISLFFDREHGGFFLYSSESEALILKPKELYDGAMP 535

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV- 618
           SGNS ++  L  L+ I   S    YR   + + + F   L     A    C A  +LS  
Sbjct: 536 SGNSAALHVLFILSKITGKS---IYRDCMDQTFSYFSPELSVHPSAY---CYALSVLSSQ 589

Query: 619 --PSRKHVVLVGHKS-SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
             PSR+ V+    +S    F  +L+       +N   + +           E++    A+
Sbjct: 590 FHPSRQLVITTKKESLPKKFMELLSKPQ----MNDFTVLVKT---------EQNKDTLAA 636

Query: 676 MA----RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +A         ADK    +C+  +C  PV D  SLE LL
Sbjct: 637 IAPFTKEYPVLADKTSCYLCRGGACQAPVFDAESLETLL 675


>gi|288956849|ref|YP_003447190.1| hypothetical protein AZL_000080 [Azospirillum sp. B510]
 gi|288909157|dbj|BAI70646.1| hypothetical protein AZL_000080 [Azospirillum sp. B510]
          Length = 685

 Score =  344 bits (882), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 232/691 (33%), Positives = 335/691 (48%), Gaps = 80/691 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE+  +A L+N+ F++IKVDREERPD+D +Y + +  L   GGWPL++F
Sbjct: 51  CHWCHVMAHESFENPEIAGLMNELFINIKVDREERPDLDTIYQSALALLGQQGGWPLTMF 110

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD +P  GGTYFPP  +YGR GF  +LR +   +  + D + ++    +E L  AL+ 
Sbjct: 111 LTPDAEPFWGGTYFPPAQRYGRAGFPDVLRGIAGTYTDEPDKVGKN----VEALRSALAG 166

Query: 142 SAS--SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                S      +    L   A++L +  D   GG GSAPKFP+ V +  +L+ + +   
Sbjct: 167 IGENRSAGAAGTIDAGMLDQVAQRLLREVDPIHGGIGSAPKFPQ-VPLFELLWRAWR--R 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+       +  V  TL  MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD  +L +
Sbjct: 224 TGR----EPFRDAVTHTLANMAQGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNAELLD 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +    +  T+D       R+ + +L R+MI  GG   +  DADS   EG    +EG FY+
Sbjct: 280 LMTLVWQETRDPLLETRIRETVGWLLREMIAEGGGFAATLDADS---EG----EEGLFYI 332

Query: 320 WTSKEVEDILG-----EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           W  +EV+ +LG     +    FK  Y + P GN            ++G  +L  L   + 
Sbjct: 333 WREEEVDRLLGPALGADGLATFKRVYEVLPQGN------------WEGVTILNRLGGLTP 380

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
           +        E    +L + R  L   R+KR RP  DDKV+  WNGL+I++   A+     
Sbjct: 381 AD-------ESTEAMLAKGREALSRARAKRVRPGWDDKVLADWNGLMIAALTHAALA--- 430

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                         D  E+++ A  A +F+R  +  +   RL HS+R+G  K  G LDDY
Sbjct: 431 -------------LDEPEWLDAAGRAFAFVRDRM--DSGGRLCHSWRHGQGKHAGMLDDY 475

Query: 495 AFLISGLLDLYEFGSGTKWL----VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           A +    L L+E       L    VWA  L    D  F D   GGYF T  +   +++R 
Sbjct: 476 AHMARAALALHEATGDPAALDQAKVWAAAL----DAHFWDDANGGYFFTADDAEGLIVRT 531

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           K  +D A PSGN      L  L  +   +  D YR  AE     F   L      +P   
Sbjct: 532 KTAYDNATPSGNGTM---LAVLTILFQRTGEDAYRDRAEALATAFSGELTRNFFPLPTFL 588

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A ++++ P    +V+VG   + + E +          N+ +  + P      D    H 
Sbjct: 589 NAVELMTAP--LQIVIVGPPRTAETEALRRTVLDRSLPNRILTVLAPKGDFPADLPAGHP 646

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVT 701
           +    M           A VC+  +CS PVT
Sbjct: 647 AQGKGMRDGT-----ATAYVCRGMTCSAPVT 672


>gi|355673311|ref|ZP_09058908.1| hypothetical protein HMPREF9469_01945 [Clostridium citroniae
           WAL-17108]
 gi|354814777|gb|EHE99376.1| hypothetical protein HMPREF9469_01945 [Clostridium citroniae
           WAL-17108]
          Length = 688

 Score =  344 bits (882), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 225/632 (35%), Positives = 326/632 (51%), Gaps = 97/632 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED+ +A++LN  FV +KVDREERP++D VYM+  QA+ G GGWPL+
Sbjct: 48  STCHWCHVMAHESFEDKEIARILNTHFVPVKVDREERPEIDMVYMSVCQAMTGRGGWPLT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ------------- 126
           + ++PD KP   GTY PP  +YG  G   +L KV   W+  R+ L Q             
Sbjct: 108 IIMTPDKKPFFAGTYLPPRSRYGMTGLTELLEKVSGLWETDREQLLQMSRQVMSLIHGRE 167

Query: 127 -SGAFAIEQLSEALSASASS-NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP 184
            +GA  +    + +  + ++ ++  D +         ++LS  +D + GGFG APKFP P
Sbjct: 168 GNGADGMGTAGDGMDGTGTAGDRTEDSVSWELAHEGFKELSAMFDKKHGGFGRAPKFPAP 227

Query: 185 VEIQ-MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
             +  +M+Y++ + ED            M   TL  MA+GGIHD +GGGF RYS DE W 
Sbjct: 228 HNLLFLMMYYAARDED--------HAMDMAEQTLTAMARGGIHDQIGGGFSRYSTDEAWL 279

Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS 303
           VPHFEKMLYD   LA  YL+ + LT + +Y  I   IL Y+ R++    G  +  +DADS
Sbjct: 280 VPHFEKMLYDNALLALAYLEGYRLTDNPYYRQIAERILIYVERELSDSDGGFYCGQDADS 339

Query: 304 AETEGATRKKEGAFYVWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNEFK 361
              EG     EG FYV++  E+  IL        F + + +   GN            F+
Sbjct: 340 ---EGV----EGKFYVFSKDEIRQILDTPREYDDFCQWFGITEKGN------------FE 380

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
           GKN+   L++     +            +G   +K++D R KR   H DDK++ SWN ++
Sbjct: 381 GKNIPNLLHNPGYKDT---------FPFMGPVCKKVYDHRIKRMALHRDDKILTSWNSMM 431

Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
           I+++A+A  +L                D+K Y + A +A  F+ +HL DE  HR+   +R
Sbjct: 432 ITAYAKAGLLL----------------DQKAYEKKARNAQMFVEQHLVDE-NHRMFVRYR 474

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD-REGGGYFNTT 540
           +G    PG LDDYA+   GLL LYE      +L  A++      +LF D R+GG YF   
Sbjct: 475 DGERAFPGNLDDYAYYCLGLLALYEATLEVDYLELALKRAAQMADLFWDSRQGGFYF--Y 532

Query: 541 GEDPSVLL-RVKEDHDGAEPSGNSVSVINLV-----------------RLASIVAGSKSD 582
           G D   L+ R KE +DGA PSGNS +   L+                 +LA + AG+K  
Sbjct: 533 GRDVQELIHRPKEIYDGAVPSGNSAAAHVLLALASLTAEPRWQEFADRQLAFLAAGAKG- 591

Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
            Y      SL  F   +K ++++  L+C +AD
Sbjct: 592 -YPSAHCFSLMAF---MKALSISRELVCVSAD 619


>gi|257092092|ref|YP_003165733.1| hypothetical protein CAP2UW1_0453 [Candidatus Accumulibacter
           phosphatis clade IIA str. UW-1]
 gi|257044616|gb|ACV33804.1| protein of unknown function DUF255 [Candidatus Accumulibacter
           phosphatis clade IIA str. UW-1]
          Length = 734

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 226/638 (35%), Positives = 335/638 (52%), Gaps = 71/638 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  R  FL    +TCHWCHVME ESFEDE +A+ LN  +V+IKVDREERPD
Sbjct: 72  GDEAFAEARRLGRPVFLSIGYSTCHWCHVMEAESFEDEAIARFLNRHYVAIKVDREERPD 131

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPED--KYGRPGFKTILRKVKDA 116
           +D VYM+ VQ L G GGWP+SV+L+   +P  GGTYFPP D  + G+ GF  +L  + D 
Sbjct: 132 IDAVYMSAVQQLTGAGGWPMSVWLTAAREPFFGGTYFPPRDGGRDGQRGFLPLLGALSDT 191

Query: 117 WDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE--LPQ-NALRLCAEQLSKSYDSRFG 173
           + +  + + Q+    +E +   +  +  +        LP  + +        +S+D+R G
Sbjct: 192 FHRDPERVGQACTALVEAIRHDMQGAYGTGGADAAIGLPAGDVIDATVAHYRQSFDARHG 251

Query: 174 GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
           G   APKFP  + ++++L + ++  D       ++  +M   TL+ MA GG++D +GGGF
Sbjct: 252 GLSRAPKFPSHIPVRLLLRYHQRTGD-------ADALRMATLTLEKMAAGGLYDQLGGGF 304

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRYS D RW VPHFEKMLYD   L   Y +AF +T    ++ + R+  DY+ R+M   GG
Sbjct: 305 HRYSTDVRWLVPHFEKMLYDNALLVVAYAEAFQVTDRADFARVARETCDYILREMTDAGG 364

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVE---DILGEHAIL--FKEHYYLKPTGNC 348
             +SA DADS   EG    +EG F+VW   E+    D LG+      F  HY + P GN 
Sbjct: 365 GFYSATDADS---EG----EEGRFFVWREDEIRRELDALGDGDTTEHFLAHYDVHPGGN- 416

Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
                      ++G  +L            +  P E     L   R +L+ VR++R  P 
Sbjct: 417 -----------WEGHTIL-----------NVPRPDEAAWEALAAARARLYAVRARRTPPL 454

Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
            D+K++  WNGL+IS+ A A ++L                D   Y+  A  AA F+  HL
Sbjct: 455 RDEKILAGWNGLMISALAVAGRVL----------------DAPRYVAAAVRAADFVLTHL 498

Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
                  L+ SF++G ++   FLDD+AFL +GL+DLYE     + L  A+ L  T + LF
Sbjct: 499 RGADGG-LRRSFKDGQARQAAFLDDHAFLAAGLIDLYEATFDVRHLRDALALAETTEHLF 557

Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            D   G +F ++    S++ R K  +DGAEPSG SV+++N +RL  +   +  + +RQ A
Sbjct: 558 AD-PAGAWFMSSEAHESLIAREKPAYDGAEPSGTSVALLNALRLGVL---TDDERWRQIA 613

Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
           E  L      L +  +A+     A D L+   R+  V+
Sbjct: 614 ERGLRAHARVLGERPIAMTEALLAVDFLATTPRQIAVV 651


>gi|303245350|ref|ZP_07331634.1| protein of unknown function DUF255 [Desulfovibrio fructosovorans
           JJ]
 gi|302493199|gb|EFL53061.1| protein of unknown function DUF255 [Desulfovibrio fructosovorans
           JJ]
          Length = 702

 Score =  343 bits (881), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 241/693 (34%), Positives = 333/693 (48%), Gaps = 50/693 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE +A L+    V+IKVDREERPD+D +YMT+ QAL G GGWPL+
Sbjct: 51  STCHWCHVMERESFEDEDIAALMRAIVVAIKVDREERPDLDTLYMTFCQALTGRGGWPLN 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFP E  +GR G + +L++V  AW   R  +  + A  +  + + +
Sbjct: 111 VFLTPDGEPFFAGTYFPKESGFGRTGMRELLQRVHMAWKSNRQAVIGNAAQLLGAVRDQI 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +A   +     E     L     +L+ S+D   GGFGSAPKFP P     +L   ++   
Sbjct: 171 TARDGTGAA--EPGTVELEAATGELAASFDVENGGFGSAPKFPAP---HNLLLLLREYRR 225

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG      +   MV  TL  M +GG++DHVG GFHRYS D  W VPHFEKMLYDQ     
Sbjct: 226 TGN----KDLLAMVTATLSAMRRGGVYDHVGFGFHRYSTDAGWLVPHFEKMLYDQALCVM 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
             ++A+  T +V+      + L+Y+RRD+  P G  +SAEDADS   EG     EG FYV
Sbjct: 282 ACVEAWQATGEVWLKDTALEALEYVRRDLTSPDGVFYSAEDADS---EGV----EGKFYV 334

Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  E+ + L  E A L  + Y ++ TGN       +      G N+L        +A+ 
Sbjct: 335 WTEAEIREALPPEDAQLVVDVYGVEATGNF----RDEATGVATGTNILHLPRSLEDAAAG 390

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G  +      L  CR  L  VR KR RP  DDKV+   NG           +      +
Sbjct: 391 RGTSVAALAARLETCRAALLAVREKRARPLCDDKVLTDNNG---------LMLAALAKAA 441

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
             FN      D         +A   + +    E   RL H  R G +   G LDDYAF  
Sbjct: 442 RAFN------DEALAARAVAAADFLLEKMALPED--RLLHRLRQGEAAVAGMLDDYAFFA 493

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++LY+     ++L  A  L       F D   GG+F +  +  S+LLR K  +D A 
Sbjct: 494 WGLVELYQTVFAPRYLERAAALAKAMIAHFGD-GAGGFFLSPDDGESLLLRQKTFYDAAV 552

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGNSV+   L  L  +  G KS  +R+ A         R+ +         C+   +  
Sbjct: 553 PSGNSVAFFVLTTLFRLT-GEKS--FREEAAKLAKAAGGRVAEHPSGYAFFLCSLSQMLA 609

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P+   V L G   + D + +       Y L +  + + PA  ++ +      +  A   R
Sbjct: 610 PA-AEVTLAGDPDAADTQVLARTIFDRY-LPEVAVVLRPAGEDDPEI-----AAIAPFTR 662

Query: 679 NNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
                D   A  VC+  SC PP  D  +L  L+
Sbjct: 663 FQLPLDGAAAAHVCRAGSCQPPTADAATLLELI 695


>gi|451980948|ref|ZP_21929330.1| conserved hypothetical protein, contains Thioredoxin domain
           [Nitrospina gracilis 3/211]
 gi|451761870|emb|CCQ90575.1| conserved hypothetical protein, contains Thioredoxin domain
           [Nitrospina gracilis 3/211]
          Length = 697

 Score =  343 bits (881), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 236/695 (33%), Positives = 341/695 (49%), Gaps = 64/695 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED  +A+ LN  FV IKVDREERPDVD +YM  VQA    GGWPL+V
Sbjct: 54  TCHWCHVMERESFEDPEIAEYLNAHFVPIKVDREERPDVDSIYMKSVQAFGQQGGWPLNV 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++PD  P  GGTY+P   +YG P F  +L  +   W ++ + + +     I  L +   
Sbjct: 114 FVTPDGVPFYGGTYYPSVGRYGLPSFLEVLTFLDKTWREEPEKVEKQSTALINYLKDVSK 173

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGG--FGSAPKFPRPVEIQMMLYHSKKLE 198
              ++    D+L  +      E  ++SYD    G  F    KFP  + + ++L H  +  
Sbjct: 174 QEQNTEGTVDDLGFHGENKTREFYTQSYDRLHHGFLFQQQNKFPPSMGLSLLLRHHHRTG 233

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D       +   +MV  TL+ M +GGI+D +GGG  RYS D +W VPHFEKMLYD G   
Sbjct: 234 D-------ALSLEMVENTLRAMKQGGIYDQIGGGLARYSTDHQWLVPHFEKMLYDNGLFV 286

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
              ++ + +T    ++    D+L Y+ RDM    G  +SAEDADS   EG     EG FY
Sbjct: 287 TALIETYQVTGKREFADYANDVLQYIDRDMTSAEGAFYSAEDADS---EGV----EGKFY 339

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT +E+E +LG E A +   +Y + P GN            ++GKN+L         A 
Sbjct: 340 VWTQEEIEKVLGRETASIAIPYYNVLPNGN------------WEGKNILHVKRPPEQIAK 387

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
            LG+PL+     + E R KL  VRS+R RP LDDK++ SWNGL+I + A+  ++L     
Sbjct: 388 DLGLPLDHVEAKIAEAREKLLAVRSQRIRPLLDDKILTSWNGLMIRAMAQVGRVL----- 442

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      D  + +  AE A  FI  +L   +  +L   +R G ++  G+L DY  +
Sbjct: 443 -----------DDADRIAKAEKALHFIWNNLRTPEG-KLLRRWREGEARYDGYLCDYTSI 490

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
                DLYE      ++  A  L  T +E F ++  G Y+ T  +   +++R    +DG 
Sbjct: 491 ALACCDLYEATYNPDYINKAEALMKTVEEKFGNQ--GAYYETASDAEELIVRQVSGYDGV 548

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
           EPSGNS + + L++LA++      DY R+ AE     F   + +  +    M  A   L 
Sbjct: 549 EPSGNSSAAMALLKLAALT--QNVDYERR-AEKIFLAFSDEVTEYGINSSFMMQALH-LY 604

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHID-PADTEEMDFWEEHNSNNAS 675
           +   K V + G  S    +         +  N      +D  AD + +            
Sbjct: 605 LGGCKQVAVRGVNSDKGLDAFWPLMRRRFFPNAVFAFSLDGDADAQRVPL---------- 654

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +A       K  A VCQ+ SC PPVT    L+NL+
Sbjct: 655 LAGKESLQGKTTAYVCQHGSCLPPVTQVTELKNLV 689


>gi|363583054|ref|ZP_09315864.1| hypothetical protein FbacHQ_16672 [Flavobacteriaceae bacterium
           HQM9]
          Length = 705

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 226/700 (32%), Positives = 349/700 (49%), Gaps = 82/700 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFED  VA ++N  FV+IK+DREERPD+D+VYM+ VQ + G GGWPL+V 
Sbjct: 80  CHWCHVMEHESFEDSTVAAVMNTNFVNIKIDREERPDIDQVYMSAVQLMTGRGGWPLNVI 139

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTYFP ++  G       L++++  ++     L +       +L+E + +
Sbjct: 140 ALPDGRPVWGGTYFPKDEWMGA------LKQIQKIYEDNPAKLEEYAT----KLTEGIQS 189

Query: 142 SASSNKLPDEL--PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            +     P+ L   ++ +       +K +D + GG   APKF  P     +L ++ +   
Sbjct: 190 VSLVKPNPNTLIFEKDTIENAVANWAKKFDYKKGGLDYAPKFMMPNNYHFLLRYAHQ--- 246

Query: 200 TGKSGEASEGQK-MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                 A+E  K  V+ TL  ++ GG++DHVGGGF RYS DE+WHVPHFEKMLYD  QL 
Sbjct: 247 -----SANEKLKEYVITTLNQISYGGVYDHVGGGFARYSTDEKWHVPHFEKMLYDNAQLV 301

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
           ++Y DA+ +TK+ +Y  +  + LD++ R++    G  +S+ DADS    G  + +EGAFY
Sbjct: 302 SLYSDAYLITKNDWYKQVVYETLDFVARELTNDEGAFYSSLDADSLTPSG--KLEEGAFY 359

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VW    +E  LGE   LFK++Y +   G  +       HN +    VLI     +    K
Sbjct: 360 VWQKPALETALGEDFPLFKDYYNINTYGLWE-------HNNY----VLIRKESDANFVEK 408

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             M ++ +L    + ++ L  +RSKR RP LDDK + SWN L++  +A A ++       
Sbjct: 409 HEMEMDAFLQKQKKWKQLLLGIRSKRERPRLDDKTLTSWNALMLKGYADAYRVF------ 462

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIR-RHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                     D  ++++ A + A FI+ + L  + + +L H+++NG S   G+L+DYA  
Sbjct: 463 ----------DNAKFLKAALANAEFIKTKQL--KGSGQLMHNYKNGKSTINGYLEDYAAT 510

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I   + LY+     +WL  + ++ +     F D     YF T+ ED +++ R  E  D  
Sbjct: 511 IEAFIALYQVTFDQQWLDLSKKMIDYVHTHFYDSASEMYFFTSDEDAALVTRNIESSDNV 570

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA----VPLMCCAA 613
            P+ NS+   NL  L+     S  DY  + +   L   +T + +        + LM    
Sbjct: 571 IPASNSIMAKNLYHLSHYY--SNKDYLVR-SRKMLHNIQTNITEYPSGYSNWLDLMLNFT 627

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
           D         VV++G  +    E    A    Y  NK +     A T+            
Sbjct: 628 DDFY-----EVVIIGAAA----EEKRVAVQQKYYPNKIMAGSATASTQ------------ 666

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
             +  N FS       +C N +C  PVT+     NLL EK
Sbjct: 667 -PLLLNRFSDTDTHIFICVNNACKYPVTEVSEAFNLLNEK 705


>gi|448455362|ref|ZP_21594542.1| hypothetical protein C469_02259 [Halorubrum lipolyticum DSM 21995]
 gi|445813964|gb|EMA63937.1| hypothetical protein C469_02259 [Halorubrum lipolyticum DSM 21995]
          Length = 747

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 240/729 (32%), Positives = 349/729 (47%), Gaps = 95/729 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA +LN+ FV +KVDREERPDVD  +MT  Q + GGGGWPLS
Sbjct: 53  SSCHWCHVMAEESFEDESVAAVLNESFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
            + +P+ +P   GTYFPPE +  +PGF+ +  ++ D+W          ++ D    S   
Sbjct: 113 AWCTPEGEPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRADQWTTSARD 172

Query: 131 AIEQL---------SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APK 180
            +E +          +A   S +    PD L + A         + YD  +GGFGS   K
Sbjct: 173 ELESVPDSGPVGGAGDAGDMSGAEAPGPDLLDEAAAAAI-----RGYDDEYGGFGSGGAK 227

Query: 181 FPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
           FP P  I ++L    K   TG++   +        TL  MA+GG++D VGGGFHRY+VD 
Sbjct: 228 FPMPGRIDVLLRAYAK---TGRNAALT----AATGTLDGMARGGMYDQVGGGFHRYAVDR 280

Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAED 300
           +W VPHFEKMLYD  +L   YLDA  LT D  Y+ +  + L +L R++    G  FS  D
Sbjct: 281 QWTVPHFEKMLYDNAELPMAYLDAHRLTGDASYARVANETLGFLDRELRHDEGGFFSTLD 340

Query: 301 ADS---------AETEGATRKK-----EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPT 345
           A S         A ++G+ R       EGAFYVWT  EV+ +L E A  L K+ Y ++  
Sbjct: 341 ARSRPPASRRGDAGSDGSGRDDDANDVEGAFYVWTPGEVDAVLDEPAASLAKDRYGIESG 400

Query: 346 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 405
           GN +           +G  V       +  A    M  +     L   R  LF+ R  RP
Sbjct: 401 GNFE-----------RGTTVPTIAASVAELAEAHDMSTDDVRETLTAARVALFEARESRP 449

Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
           RP  D+KV+ SWNG  IS+FA A ++L                  + Y ++A  A +F R
Sbjct: 450 RPARDEKVLASWNGRAISAFAAAGRVLG-----------------EPYADIASDALAFCR 492

Query: 466 RHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQD 525
             LYDE+T  L   + +G  + PG+LDD+AFL  G LD Y      + L +A++L  T  
Sbjct: 493 ERLYDEETGALARRWLDGDVRGPGYLDDHAFLARGALDAYSATGDPEALGFALDLAETIV 552

Query: 526 ELFLDREGGG-YFN-----TTG--EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 577
             F D E G  YF      T G   D ++  R +E  D + PS   V+   L    +++ 
Sbjct: 553 SDFYDEEDGTIYFTRDPDETAGGDGDDTLFARPQEFTDRSTPSSLGVAAETL----ALLD 608

Query: 578 GSKSDY-YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFE 636
           G ++D  + + AE  +     R++   +    +  AAD ++      V +        + 
Sbjct: 609 GFRTDREFAEVAERVVTTHADRIRASPLEHVSLVRAADRVAS-GGIEVTVATDAVPEAWR 667

Query: 637 NMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQN 693
             L   +    L   ++   P   + +  W +    + +    A  +    +  A VC+ 
Sbjct: 668 ETLGERY----LPGALVAPRPPTEDGLAAWLDRLGMDEAPPIWADRDAVDGEPTAYVCEG 723

Query: 694 FSCSPPVTD 702
            +CSPP TD
Sbjct: 724 RTCSPPETD 732


>gi|76802617|ref|YP_327625.1| hypothetical protein NP3966A [Natronomonas pharaonis DSM 2160]
 gi|76558482|emb|CAI50074.1| YyaL family protein [Natronomonas pharaonis DSM 2160]
          Length = 698

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 231/692 (33%), Positives = 330/692 (47%), Gaps = 62/692 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF+D   A +LN+ FV IKVDREERPDVD VYM   Q + G GGWPLSV+
Sbjct: 50  CHWCHVMADESFDDPDTADVLNEHFVPIKVDREERPDVDNVYMQVCQMVRGSGGWPLSVW 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEAL 139
           L+P+ KP   GTYFPPE     PGFK++L  + +AWD  ++R  L Q      +Q + ++
Sbjct: 110 LTPEGKPFHVGTYFPPEPTKNTPGFKSVLEDIAEAWDDTERRQQLEQQA----DQWATSI 165

Query: 140 SASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           S+       P   P  +  L   A     + D   GG+G   KFP P  I ++L   ++ 
Sbjct: 166 SSELEDTPEPVAEPPGEEFLDTAANAAVGNADREHGGWGRGQKFPHPGRIHLLLCAYQQT 225

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +       A E       TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++
Sbjct: 226 DRETYRDVAVE-------TLDAMASGGLYDHVGGGFHRYCVDREWTVPHFEKMLYDNAEI 278

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              +L  + +T D  Y+ I  +   ++ R++  P G  +S  DA+S ++ G   ++EGAF
Sbjct: 279 PRAFLAGYQVTGDDRYAEIVAETFAFVDRELTHPDGGFYSTLDAESEDSTGT--REEGAF 336

Query: 318 YVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           YVWT + V   +     A LF E Y +   GN +               VL E       
Sbjct: 337 YVWTPEVVAAAVDNETDAELFCERYGVTDAGNFE-----------NATTVLTESRPPEEL 385

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A++  M        +   R +LF+ R++R RP  D+KV+  WNGL+IS+ A  + +L   
Sbjct: 386 AAERVMDTATVEERIERAREQLFESRAERSRPPRDEKVLAGWNGLMISALAEGALVLD-- 443

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                           EY + A +A SF R  L+DE    L   F  G     G+L DYA
Sbjct: 444 ---------------PEYADDAAAALSFCREQLWDETEEVLNRRFEGGTVGIDGYLQDYA 488

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG-YFNTTGEDPSVLLRVKEDH 554
           FL  G LDLY+     + L +A+ L       F D + G  YF   G D S+L R ++  
Sbjct: 489 FLGRGALDLYQATGDVEQLSFALSLGRVIQSEFYDADAGTLYFTAEGGD-SLLARPQQLA 547

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAA 613
           D + PS   V+V  L RLA+    +  D     AE  +    + L+   ++   L+  A 
Sbjct: 548 DSSTPSSTGVAVELLSRLAAFDPDAGFD---DVAETVIETHASTLESNPLSHTSLVAAAH 604

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEHN 670
           D  S   R  + +        +   LA  +    L   ++   P   + +D W    + +
Sbjct: 605 D--SAAGRIELTVAAADLPETWRTSLAETY----LPGRLLSRRPPTDDGLDPWLAALDVD 658

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                 A  +    +     C++F+CSPP  D
Sbjct: 659 DVPPIWANRDAKDGEPTVYACRSFTCSPPKHD 690


>gi|375150037|ref|YP_005012478.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361064083|gb|AEW03075.1| hypothetical protein Niako_6853 [Niastella koreensis GR20-10]
          Length = 685

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 204/571 (35%), Positives = 295/571 (51%), Gaps = 71/571 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME ESFE+E  A ++N  F+++K+DREERPD+D +YM  VQA+ G GGWPL++
Sbjct: 52  ACHWCHVMEKESFENEETASMMNAHFINVKIDREERPDLDHIYMDAVQAMTGSGGWPLNI 111

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-----------MLAQSGA 129
           FL+PD +P  GGTYFPP+  Y RP +  +L  V +AW +KRD            + QS +
Sbjct: 112 FLTPDGRPFYGGTYFPPKAIYNRPSWHDVLTGVANAWTEKRDDIDAQATNLTGHIVQSNS 171

Query: 130 FAIEQLSEALSASA-SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 188
           F  + +   ++  A  S ++ D +  N +         + D   GGFGSAPKFP+   I 
Sbjct: 172 FGQQAVEGDINMDALFSKEIADTMFNNIM--------GTADKEEGGFGSAPKFPQTFTIG 223

Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
            +L +  K  +     +A         +L  M +GG++DH+GGGF RYS D  W VPHFE
Sbjct: 224 YLLRYYHKTGNEQALAQAC-------LSLDKMIRGGLYDHLGGGFARYSTDREWLVPHFE 276

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYD   L +V  DA+ LT+   Y     + L ++ R++  P    +SA DADS   EG
Sbjct: 277 KMLYDNALLVSVLCDAWQLTQQPLYKQAVEETLAFVERELHSPEKGFYSALDADS---EG 333

Query: 309 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGN---CDLSRMSDPHNEFKGKNV 365
                EG FYVW+  E+E IL + A +F   Y +   GN    ++  +  P  +F   N 
Sbjct: 334 V----EGKFYVWSKPEIEAILQQDAAVFCAFYDVTEGGNWEHTNILNIRKPLKQFAADN- 388

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
                          +P  +   +L + R KL   R+ R RP LDDK+++ WN L+ +++
Sbjct: 389 --------------NIPEARLQELLQQGREKLLQHRAGRIRPQLDDKILLGWNALMNTAY 434

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
           ++A  +         F  P       +Y EVAE    FI    +        H+++   +
Sbjct: 435 SKAYSV---------FGNP-------QYAEVAEENMKFIMNR-FTRDGLEFFHTYKKEIA 477

Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE-DP 544
           + P FLDDYA+LI  L+ L E      +L  A  L     + F   EG GYF  T +   
Sbjct: 478 RYPAFLDDYAYLIQALIHLQEITGKAAYLYKAKALTQQVIDQF-SEEGTGYFFYTHQGQQ 536

Query: 545 SVLLRVKEDHDGAEPSGNSVSVINLVRLASI 575
            V++R KE +DGA PSGN++   NL  L  +
Sbjct: 537 DVIVRKKEVYDGAIPSGNAIMAFNLQYLGVV 567


>gi|392399485|ref|YP_006436086.1| thioredoxin domain-containing protein [Flexibacter litoralis DSM
           6794]
 gi|390530563|gb|AFM06293.1| thioredoxin domain protein [Flexibacter litoralis DSM 6794]
          Length = 712

 Score =  343 bits (879), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 223/705 (31%), Positives = 344/705 (48%), Gaps = 77/705 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+E VAK +N+ F+ IKVDREERPDVD +YM  VQ +   GGWPL+VF
Sbjct: 49  CHWCHVMEHESFENEDVAKAMNENFICIKVDREERPDVDAIYMEAVQMMGVSGGWPLNVF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+ D KP  GGTYFP ++      +  I+ ++   +  KR+ + +S     + LS +   
Sbjct: 109 LTSDAKPFWGGTYFPAKE------WIDIVEQIGKTYKNKRNEVEESANKVTKVLSISTLE 162

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
             +   + D    + L    + L K +D+ FGG G APKFP P     +L +   L+   
Sbjct: 163 RYNLKDVSD-FDDSILAKAFQSLEKKFDTEFGGIGEAPKFPMPSYYLFLLRYYDYLDKNN 221

Query: 202 KSGEASEGQK-----MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
           +    +   K      +  TL  M +GGI+D +GGGF RYSVD+ W  PHFEKMLYD  Q
Sbjct: 222 QDQNITNPTKNKILSQIHLTLNKMDQGGIYDQIGGGFARYSVDKEWFAPHFEKMLYDNAQ 281

Query: 257 LANVYLDAFSLTKDVFYSYICRDIL----DYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           L ++Y +A+++T+D    ++ ++I+    ++L R++    G  ++A DADS   EG    
Sbjct: 282 LLSLYAEAYTITEDKVQKHVYKEIIEQTTEFLTRELQDKNGGFYAALDADS---EG---- 334

Query: 313 KEGAFYVWTSKEVEDILGEHAI-----------LFKEHYYLKPTGNCDLSRMSDPHNEFK 361
           KEG FY WT  E+E +   H             LFK++Y +   GN        PH   +
Sbjct: 335 KEGKFYTWTIDEIEQVFTNHTFSTSINQEEDLQLFKKYYSITAIGN-----WQSPHAT-E 388

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
           G N+L   N     A +  + L      + E +  L ++R  +  P LDDK++ SWN L+
Sbjct: 389 GANILYRNNTDEEFAQENNIELNNLKCKVKEWQNYLLEIRKTKVSPSLDDKILTSWNALL 448

Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH-----RL 476
           I  F  +   L                + K+Y+ +A   A FI ++L+D+Q       +L
Sbjct: 449 IKGFCNSYSSL----------------NDKKYLNLALQTAEFIEKNLFDKQNTKNNKLKL 492

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG-G 535
            H+F++G ++  GFL+DYA LI   + LY+     KWL+ A EL       F D+E    
Sbjct: 493 HHTFKDGTAEIDGFLEDYALLIESYIALYQVCFDEKWLLRADELTKYVFTNFYDKEEKLF 552

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           YF    E   ++ + KE  D    S NSV   NL  L  ++   +++ Y++ ++  L+  
Sbjct: 553 YFTNQNESEKLVAQKKELFDNVISSSNSVMATNLYFLGILL---ENNLYKETSKEMLSKV 609

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
            + +      V            P+   + +VG K    ++ +L    + Y  NK ++  
Sbjct: 610 ASLIAAEPRHVSNWASLFTYFLTPT-PEIAIVGEK----YQEVLQEISSFYIPNKVIV-- 662

Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
               +EE    E   S+   +       ++    VC+N  C  PV
Sbjct: 663 -ATKSEE----EGQKSSLPLLEMRPVMNNQTTIYVCKNKMCQLPV 702


>gi|291295832|ref|YP_003507230.1| hypothetical protein [Meiothermus ruber DSM 1279]
 gi|290470791|gb|ADD28210.1| protein of unknown function DUF255 [Meiothermus ruber DSM 1279]
          Length = 672

 Score =  343 bits (879), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 227/622 (36%), Positives = 319/622 (51%), Gaps = 71/622 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F       +  FL     TCHWCHVME ESFED  VA+ LN  FV IKVDREERPD
Sbjct: 27  GEEAFAKARAENKPIFLSVGYATCHWCHVMERESFEDPEVAQFLNAHFVPIKVDREERPD 86

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW- 117
           VD+VYM+ +QA+ G GGWP+++FL PDL+P  GGTY+PPED+ G P F+ +L  V +AW 
Sbjct: 87  VDQVYMSALQAMTGSGGWPMNMFLMPDLRPFFGGTYWPPEDRQGFPSFRRVLAGVHNAWL 146

Query: 118 DKKRDMLAQSGAFAIEQLSEAL--SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
            +++++L  +     EQL+  L          LPD+L   AL      LS+ +D   GGF
Sbjct: 147 HQQKEVLENA-----EQLTTYLQDQLKPRGGALPDDLHSTAL----AGLSRIFDPAHGGF 197

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           G APKFP+   +  +L  +    +           K +  TL  MA+GG++D VGGGFHR
Sbjct: 198 GGAPKFPQSPALGYLLTQAWLGHEA--------AWKHLQLTLDRMAEGGLYDQVGGGFHR 249

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDA-----FSLTKDVFYSYICRDILDYLRRDMIG 290
           Y+VD  W VPHFEKMLYD  QLA +Y  A      SL +   Y  I ++ LDY+ R++ G
Sbjct: 250 YTVDHIWRVPHFEKMLYDNAQLARLYAAASRMPQASLEQARRYQRIAQETLDYVLRELTG 309

Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDL 350
           P G  +SA+DADS   EG     EG FYVW ++E   +LG  A      + +   GN   
Sbjct: 310 PEGGFWSAQDADS---EGV----EGKFYVWQAEEFRRVLGAEAEAAMLLFGVSEAGN--- 359

Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
                    ++  NVL      +A    LG+  E +   +   R +L+  R +R  P  D
Sbjct: 360 ---------WEHTNVLERRIPDAALMQHLGLGPEAFERWVQSVRHRLYAARQQRTPPLTD 410

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
           DKV+  WNGL++ + A   + L                +   Y+E A   A+F+ + +Y 
Sbjct: 411 DKVLADWNGLMLRALADVGRWL----------------EEPRYIEAARKNAAFVMQEMYR 454

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
           +    L+HS+R G  K   +L D A    GLL L+E      WL  A +L       F  
Sbjct: 455 DGL--LRHSWRQGQLKPQAYLSDQAHYGLGLLALFEATGEVGWLEGARQLAEAILTHF-- 510

Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
           +E  G F  +  D ++ +   + +DG  PSGN+V+   L RLA++    + D++ Q A  
Sbjct: 511 KEPTGAFRDS-LDQTLPVVALDAYDGPYPSGNAVAAELLFRLAALY--ERPDWH-QAALT 566

Query: 591 SLAVFETRLKDMAMAVPLMCCA 612
           ++     RL   A   P M  A
Sbjct: 567 TVESNAQRLLHNAFGFPAMLQA 588


>gi|257076883|ref|ZP_05571244.1| thymidylate kinase [Ferroplasma acidarmanus fer1]
          Length = 638

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 214/572 (37%), Positives = 302/572 (52%), Gaps = 63/572 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVME ESF D  VAK +N  FV IKVDREE PDVD +YMT+ Q + G GGWPL+V
Sbjct: 48  SCHWCHVMEQESFTDPEVAKRMNSTFVCIKVDREEMPDVDSLYMTFSQVMTGTGGWPLNV 107

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            L+PD KP+   TY P   +    G   +   +   W  KR  + ++G  AI +L     
Sbjct: 108 ILTPDRKPIFAFTYIPRVSRNNMIGIMELAENIDYLWKNKRGEMEKNGDEAISRLRNM-- 165

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                N  P +  + A+    E L ++YDS +GGFG+APKFP    I  +L + K     
Sbjct: 166 ERKEENNSPVDYKK-AIEATYESLKRNYDSEYGGFGNAPKFPSFHNIIFLLNYYKA---H 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           GK     E  +MV  +L+ M  GG++DHVGGGFHRYS D  + +PHFEKM YDQ      
Sbjct: 222 GK----EEALEMVKHSLRMMYIGGMYDHVGGGFHRYSTDPFFRIPHFEKMTYDQAMAIIA 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y  A+ +T D FY  +  +I  +L+++M   G   ++A DADS   EG    +EG +Y W
Sbjct: 278 YSYAYDVTGDTFYKNVVYEIYKFLKQEMFSRG--FYTAMDADS---EG----QEGKYYTW 328

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T +E+ +  G+    F   + + P GN       D ++   G+N+L    D        G
Sbjct: 329 TYEELVENAGKK---FVYDFNILPEGN-----FYDANSRQTGRNILYMGRDIQ------G 374

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
            P   Y N L   ++     R KR +P  DDK++   NGLVI + + AS I         
Sbjct: 375 DPTTLYKNELEALKKS----REKRIKPLTDDKILTDINGLVIKALSIASMIF-------- 422

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   + K+ +  AE +A FI   +Y ++  +L HS+RNG S   G LDDY+F++SG
Sbjct: 423 --------NDKDMLNTAEGSADFIMNDMYTDK--KLMHSYRNGKSSINGMLDDYSFMVSG 472

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LL LYE      +L +A +LQ T  + F D+  GG++N  G   ++L+R+KE +D A PS
Sbjct: 473 LLSLYEASLNDIYLDYARDLQKTIMDTFYDKTSGGFYNGMG---NLLVRLKESYDNAIPS 529

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
           G S  + N++    I      D YR   E S+
Sbjct: 530 GFSFEIGNMIVFNYI-----DDKYRVELEKSI 556


>gi|258405434|ref|YP_003198176.1| hypothetical protein Dret_1310 [Desulfohalobium retbaense DSM 5692]
 gi|257797661|gb|ACV68598.1| protein of unknown function DUF255 [Desulfohalobium retbaense DSM
           5692]
          Length = 615

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 206/569 (36%), Positives = 300/569 (52%), Gaps = 45/569 (7%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME E FED  VA +LN   V IKVDREERPD+D  YM+  QAL G GGWPL++
Sbjct: 53  TCHWCHVMERECFEDTEVAHILNTVCVPIKVDREERPDLDTFYMSCCQALSGRGGWPLNL 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P    TY P + ++ +PG   +L  V++ W + R+ + QS    +  + +  S
Sbjct: 113 FLTPDGRPFFAATYIPKQSRFSQPGLLDLLVSVQEDWVRNREQIEQSATRLVSHIHDLFS 172

Query: 141 ASASSNKLPDELPQNAL-RLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            S+        LP+NA+     ++L +++D  FGGFG APKFP P  +  +L      +D
Sbjct: 173 DSSGP------LPENAIFEQAVQELRQNHDDDFGGFGKAPKFPTPHVLLFLLRLYDLSQD 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                       MV  TL+ + +GGI DH+GGGFHRYS D  WH+PHFEKMLYDQ  L  
Sbjct: 227 RSLL-------NMVDSTLEAICRGGIRDHIGGGFHRYSTDRAWHLPHFEKMLYDQALLLM 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
              +  + T+   +      + +Y+   +    G ++  EDAD   TEG    +EGAFY 
Sbjct: 280 ALAEGHARTRRDLFRREAVAVAEYMLERLHDGDGGLYCGEDAD---TEG----EEGAFYQ 332

Query: 320 WTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  E+E  L      + +    ++  GN     + +   +  GKNVL  + D++ +A +
Sbjct: 333 WTETELEAALPPDTFRVVQTVAGIRSDGNI----LDEATRQRTGKNVLARVADTADAAER 388

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG+  E+           L  +R++RP+P LDDK + SWNGL +++ AR+  +L  E   
Sbjct: 389 LGLSEEQVRLEWHRAMATLGGLRAQRPQPFLDDKQLTSWNGLAVAALARSGILLGEE--- 445

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                          +  A   A ++   +  E   RL H  RN  +  PGFL+DYA+ I
Sbjct: 446 -------------HLIAAARETADWVLETMQPEPG-RLWHRARNRHAGIPGFLEDYAYFI 491

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL+L +   G  +   A+ L +T    F D + GG+F T       LLR+K+  D A 
Sbjct: 492 WGLLELVQTSEGQDYRRIALRLADTVLSEFADLKEGGFFQTHAAAQEPLLRLKKVFDDAL 551

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQN 587
           PS N+V + NLVRL    +G  +D  R++
Sbjct: 552 PSENAVMLYNLVRLYG--SGPTNDCARKH 578


>gi|113867298|ref|YP_725787.1| hypothetical protein H16_A1279 [Ralstonia eutropha H16]
 gi|113526074|emb|CAJ92419.1| highly conserved protein containing a thioredoxin domain [Ralstonia
           eutropha H16]
          Length = 673

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 243/690 (35%), Positives = 341/690 (49%), Gaps = 92/690 (13%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFE+  +A L+ND F+SIKVDR+ERPD+D +Y    Q +  GGGWPL+V
Sbjct: 50  TCHWCHVMAHESFENPRIAGLMNDRFISIKVDRQERPDLDDIYQKVPQMMGQGGGWPLTV 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA-- 138
           FL+P  +P  GGTYFPP+D+YGRPG   +L  + +AW  +R+ L  +    IEQ  +   
Sbjct: 110 FLTPQGEPFYGGTYFPPDDRYGRPGLARVLLSLSEAWTHRREALRDT----IEQFQQGFR 165

Query: 139 -LSASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
            L  +  S +  +E    Q+     A  L+++ D   GG G APKFP      ++L   +
Sbjct: 166 QLDDTVLSREDAEEAAEVQDLPAQTALALARNTDPTHGGLGGAPKFPNASAYDLVLRICQ 225

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           +  +                TL  MA GGIHD +GGGF RYSVDERW VPHFEKMLYD G
Sbjct: 226 RTHEPALLDALER-------TLDGMAAGGIHDQLGGGFARYSVDERWAVPHFEKMLYDNG 278

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           QL  +Y +A+ LT    +  +    + Y+ RDM  P G  ++ EDADS   EG    +EG
Sbjct: 279 QLVTLYANAYRLTGKQAWRRVFEGTIAYIVRDMTHPDGGFYAGEDADS---EG----EEG 331

Query: 316 AFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
            FYVWT+ EV+ +LGE    L    Y +   GN +            G++VL        
Sbjct: 332 RFYVWTAPEVKAVLGESEGALACRAYGVTEGGNFE-----------PGRSVL-------Q 373

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A  L  PLE+    L   R +L   R++R RP  DD ++  WNGL+I     A +   +
Sbjct: 374 RAVTL-TPLEE--ARLEGWRERLLAARAQRVRPGRDDNILAGWNGLMIQGLCAAYQATGN 430

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKAPGFLD 492
            A                ++  A  AASFI+  L   D   +R    +++G  K PGFL+
Sbjct: 431 PA----------------HLAAARRAASFIQDKLTMPDGGVYRY---WKDGTVKVPGFLE 471

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL + L+DLYE     ++L  A EL     + F D   G YF     +P ++ R + 
Sbjct: 472 DYAFLANALIDLYESCFDRRYLDRAAELVALIIDNFWD--DGLYFTPNDGEP-LIHRPRA 528

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
            HDGA PSG S SV + +RL  +   S  D YR  AEH    +                A
Sbjct: 529 PHDGAWPSGISASVFSFLRLHEL---SGEDRYRDLAEHEFQRYRAAASAAPAGFVHFLAA 585

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
           AD     +   ++L G K++     ++ + H +Y L   V+                 + 
Sbjct: 586 ADFAQRGAFG-IILAGDKAAA--AALVESVHRTY-LPARVLAF---------------AE 626

Query: 673 NASMARNNFSAD-KVVALVCQNFSCSPPVT 701
           +  + +     D +  A VC++ +CS PVT
Sbjct: 627 DVPVGQGRLPVDGRPAAYVCRHRACSAPVT 656


>gi|408826725|ref|ZP_11211615.1| hypothetical protein SsomD4_06008 [Streptomyces somaliensis DSM
           40738]
          Length = 651

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 234/702 (33%), Positives = 333/702 (47%), Gaps = 86/702 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP+S
Sbjct: 22  SACHWCHVMAHESFEDEATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMS 81

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++PD +P   GTYFPPE ++G P F+ +L  V  AW  +RD + +     + +LS   
Sbjct: 82  VFMTPDGEPFYFGTYFPPEARHGMPSFRQVLEGVHHAWTSRRDEVDEVAGSIVRELSGRS 141

Query: 140 SASASSNKLPDEL-PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            A       P E  P  AL      L++ YD R GGFG APKFP  + ++ +L H  +  
Sbjct: 142 LALGGDGGAPGEAEPAQALL----ALTREYDERHGGFGGAPKFPPSMVVEFLLRHHAR-- 195

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 196 -TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 250

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++  P G   SA DADS   +G  R  EGA+Y
Sbjct: 251 RVYTHLWRATGSDLARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYY 308

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VWT  ++ ++LGE    +   ++           +++     +G +VL    D+  + ++
Sbjct: 309 VWTPAQLREVLGEEDAAYAARFH----------GVTEEGTFEEGASVLRLPVDAGVAGAE 358

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                      L   RR+L   R +R RP  DDK++ +WNGL +++ A            
Sbjct: 359 R----------LAGIRRRLLAARDERARPGRDDKIVAAWNGLAVAALAETGACF------ 402

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFL 497
                     DR + +E A  AA  + R   DE   RL  + ++G + A  G L+DY  +
Sbjct: 403 ----------DRPDLVERATEAADLLVRVHLDEGG-RLARTSKDGRAGANAGVLEDYGDV 451

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKEDH 554
             G L L        WL +A  L +      LDR   E G  ++T  +   ++ R ++  
Sbjct: 452 AEGFLALAAVTGEGVWLEFAGLLLDG----VLDRFRGEDGELYDTAHDAEQLIRRPQDPT 507

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPL 608
           D A PSG + +   L+   S  A + S+ +R  AE +L V         R     +AV  
Sbjct: 508 DNAAPSGWTAAAGALL---SYAAHTGSEAHRSAAERALGVVRALGPRAPRFVGWGLAV-- 562

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
                 +L  P  + V +VG     D + +  AA         V   +P  ++E    E+
Sbjct: 563 ---TEALLDGP--REVAVVGPAGDADTDALRRAALLGTAPGAVVAVGEPG-SDEFPLLED 616

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                           +  A VC+ F+C  P TDP  L   L
Sbjct: 617 ----------RPLVGGRPAAYVCRRFTCDAPTTDPERLAREL 648


>gi|294102620|ref|YP_003554478.1| hypothetical protein [Aminobacterium colombiense DSM 12261]
 gi|293617600|gb|ADE57754.1| protein of unknown function DUF255 [Aminobacterium colombiense DSM
           12261]
          Length = 595

 Score =  342 bits (877), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 212/578 (36%), Positives = 306/578 (52%), Gaps = 63/578 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    +  +  FL    +TCHWCHVME E F DE VA+LLND  VSIKVDREERPD
Sbjct: 30  GKEAFTKAQEENKPIFLSIGYSTCHWCHVMEKECFSDEEVAQLLNDACVSIKVDREERPD 89

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D V M     + G GGWPL++FL+P+ KP    +Y P E     PG   ++ +VK  W 
Sbjct: 90  IDHVCMAVSLIMNGSGGWPLNLFLTPNGKPFFAASYIPKETSGRIPGLMDMVPRVKWLWL 149

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNK--LPDELPQNALRLCAEQLSKSYDSRFGGFG 176
            +++ + +S     E +  AL    ++ K   PD   +N  +   ++LS+++D  +GGF 
Sbjct: 150 MQKEDVLKSA----ESIMNALEKEMTNQKGTCPD---KNLAKKAFQELSRNFDPLWGGFS 202

Query: 177 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
            APKFP P  +  +L       + GK  +  +  KMV  TL CMA GGI DH+GGGF RY
Sbjct: 203 KAPKFPMPPVLLFLL-------EYGKIFKEEKAIKMVEKTLDCMAMGGIRDHLGGGFARY 255

Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
           S D  W +PHFEKMLYDQ  L   Y  A+ +T    Y  I  +I  Y+ RD+  P G  F
Sbjct: 256 STDREWKIPHFEKMLYDQALLLKAYTAAWEMTGRDIYKKIAFEIAAYVLRDLRSPEGVFF 315

Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSD 355
           +AEDADS   EG     EG FYVWT +E+  ++  E   LF + Y +   GN     ++ 
Sbjct: 316 AAEDADS---EGV----EGRFYVWTEEEIRRLVPSEDRQLFLQAYGIHGEGNV----LAL 364

Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
           P +       L EL      A+   + L+K    L + R  LF+ R++R RPH D K++ 
Sbjct: 365 PAS-------LEEL------AATYNVELQKLDQSLQKSRALLFEARNRRVRPHCDRKILT 411

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTH 474
            WN L+I + A A +I                 + ++++E A +A  F + + +Y E+  
Sbjct: 412 DWNALMIEALAFAGRIF----------------EERQFIEAARNAVDFLLEKAVYQEK-- 453

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
            + HS  +G    PG L+DY+F I  LL+L E      +    + L  + +++F D + G
Sbjct: 454 EVYHSVADGKGHIPGLLNDYSFFIRALLELEEATGEEDYGEKGMGLLRSMNDIFYDPKRG 513

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 572
           GYF  +G D  +  R     DG   SGNSV+++NL+R 
Sbjct: 514 GYFMNSGLDELLFFRPWSGEDGVMVSGNSVAMMNLLRF 551


>gi|257388360|ref|YP_003178133.1| hypothetical protein Hmuk_2314 [Halomicrobium mukohataei DSM 12286]
 gi|257170667|gb|ACV48426.1| protein of unknown function DUF255 [Halomicrobium mukohataei DSM
           12286]
          Length = 715

 Score =  342 bits (876), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 213/694 (30%), Positives = 331/694 (47%), Gaps = 63/694 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF D   A LLN+ FV IKVDREERPD+D +YM+  Q + G GGWPLS
Sbjct: 56  SACHWCHVMEDESFSDPETATLLNEHFVPIKVDREERPDLDAIYMSICQQVTGRGGWPLS 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLS 136
            +L+PD +P   GTYFPPE++ G P F  +L  +  +W   +++ +M  ++      Q +
Sbjct: 116 AWLTPDGEPFYVGTYFPPEERRGMPAFGQLLEDIAGSWSDSEQREEMYNRA-----RQWT 170

Query: 137 EALSASASSNKLPDELPQN-ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
           +A+ +       P ++P + AL+   +   ++ D   GG+G+ PKFP+P  +  ++    
Sbjct: 171 DAIESDVGDVGQPGDVPDDEALQAAVDAAIRAADREHGGWGNGPKFPQPGRLHYLMREVA 230

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           +        +  + + +V  TL  MA GG+ DHVGGGFHRY  D  W VPHFEKMLYD  
Sbjct: 231 R-------SDRDDVRSVVTETLDAMADGGLFDHVGGGFHRYCTDREWVVPHFEKMLYDNA 283

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA-----T 310
            L   YL  + LT D  Y+ + R+   ++ R++    G  FS  DA S    G       
Sbjct: 284 TLPRAYLAGYQLTGDERYAEVARETFAFVERELTHEDGGFFSTLDAQSVPPAGRREDADA 343

Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
             +EGA++VW   EV   +     A L  + + +  +GN            F+GK VL  
Sbjct: 344 EPEEGAYFVWIPDEVRAAVDSETAADLLCDRFGITESGN------------FEGKTVLTV 391

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
                A +   G+        L   R ++F+ R +RPRP  D+KV+  WNGL+I++ A  
Sbjct: 392 DASIEALSESSGLEASDVERTLASAREQVFEAREERPRPARDEKVLAGWNGLMITAIAEG 451

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
           + +L                           A +F+R HL+DE   RL   +++G     
Sbjct: 452 AIVLDDVDPDPA-----------------ADALAFVREHLWDESEQRLARRYKDGDVAID 494

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           G+L+DYAFL  G L L+E     + L +A++L +  +  F D + G  + T     S++ 
Sbjct: 495 GYLEDYAFLARGALTLFEATGEVEHLAFALDLAHAIEREFWDADDGTLYFTPTSGESLVA 554

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R +E  D + PS   V+V  L+ L++ V     D +   A   L     +++   M    
Sbjct: 555 RPQELTDQSTPSSTGVAVQALLSLSAFV---PHDRFETIAAGVLETHANKIEANPMQHAS 611

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +  AAD   +     + LV  +   ++   LA  +    L   ++   P    ++D W +
Sbjct: 612 LVVAADRY-LRGDLELTLVADEVPAEWRTTLAETY----LPDRLLAWRPPGDGDLDAWLD 666

Query: 669 ---HNSNNASMARNNFSADKVVALVCQNFSCSPP 699
               +      A       +     C+ F+CSPP
Sbjct: 667 VLGLDDVPPIWADRTERDGEATVYACRQFTCSPP 700


>gi|398343191|ref|ZP_10527894.1| hypothetical protein LinasL1_09021 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 692

 Score =  342 bits (876), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 248/701 (35%), Positives = 344/701 (49%), Gaps = 75/701 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE  A +LN +FVSIKVDREERPDVD++YM  + A+   GGWPL++
Sbjct: 55  TCHWCHVMEKESFEDEATAAVLNQYFVSIKVDREERPDVDRIYMDALHAMNQQGGWPLNM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+ + KP+ GGTYFPP  KYGR  F  IL  +   W +K++ L      A E+L++ L 
Sbjct: 115 FLTSEGKPITGGTYFPPVAKYGRKSFTDILNILATLWKEKKEELID----ASEELAQYLK 170

Query: 141 ASASSNKLPDELPQNALRLCAEQL--------SKSYDSRFGGFGS--APKFPRPVEIQMM 190
            S  S  L +   Q+AL+L ++ +         + YD  F GF S    KFP  + +  +
Sbjct: 171 ESEESKALSE---QSALQLPSKTVFENAFGMYDRFYDPEFAGFKSNVTNKFPPSMGLSFL 227

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           L   K       +GE  +  +MV  TL  M KGGI+D +GGG  RYS D +W VPHFEKM
Sbjct: 228 LRFYK------STGE-PKALEMVEETLVAMKKGGIYDQIGGGISRYSTDHKWLVPHFEKM 280

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD        ++ F  T  + Y     D+L+Y+ RDM   GG I SAEDADS   EG  
Sbjct: 281 LYDNSLFLEALVECFQTTGHLKYKEAAYDVLEYISRDMRLQGGGIASAEDADS---EG-- 335

Query: 311 RKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
             +EG FY+W   E  ++    AIL +  + +   GN            F+G N+L E +
Sbjct: 336 --EEGLFYLWKRNEFHEVCDSDAILLEAFWNVTEIGN------------FEGSNILHE-S 380

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
             +  A   G+  E+ + I+   ++KL   RS R RP  DDKV++SWN L + +  +A+ 
Sbjct: 381 FRTNFARLHGLEEEELIEIVNRNKKKLLARRSDRIRPLRDDKVLLSWNCLYVKAATKAAM 440

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
                                E + +AE    FI  +L  E   RL   FR G ++   +
Sbjct: 441 AFGD----------------GELLRLAEETFRFIENNLVREDG-RLLRRFREGEARFLAY 483

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
             DYA  I   L L++ G G ++L  AI        LF  R   G F  TG D   LLR 
Sbjct: 484 SGDYAEFILASLWLFQAGKGIRYLTLAIRYAEEAVRLF--RSPAGVFFDTGSDAEDLLRR 541

Query: 551 K-EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
             E +DG EPS NS   +    L+ +  G +S  Y   A+   + F+  L+   M  P M
Sbjct: 542 NVEGYDGVEPSANSSFALAFTILSRL--GVESGRYSDFADAIFSYFKVELETHPMNYPYM 599

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             A  + +  S++  V+  + +  D   +     A + L +TV      D E      E 
Sbjct: 600 LSAYWLKNSDSKELAVV--YSTQEDLFPIWQGIGAMF-LPETVFAW-ATDKE-----AEE 650

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 + +N  S   V A  CQ F C  PV+D  SL  +L
Sbjct: 651 AGEKILLLKNRKSGGSVKAYFCQGFRCDLPVSDWNSLRAIL 691


>gi|390953615|ref|YP_006417373.1| thioredoxin domain-containing protein [Aequorivita sublithincola
           DSM 14238]
 gi|390419601|gb|AFL80358.1| thioredoxin domain-containing protein [Aequorivita sublithincola
           DSM 14238]
          Length = 704

 Score =  341 bits (875), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 223/694 (32%), Positives = 346/694 (49%), Gaps = 77/694 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFED  VA ++N  F+S+KVDREERPDVD+ Y+  VQ + G  GWPL+V 
Sbjct: 78  CHWCHVMEHESFEDSTVAAVMNKNFISVKVDREERPDVDQTYINAVQLMTGSAGWPLNVV 137

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTYF   D      +   L +++  ++++ + L    A+A  +L E + +
Sbjct: 138 TLPDGRPVWGGTYFRKND------WIDALEQIQKVYNEEPEKLM---AYA-NRLEEGIKS 187

Query: 142 S--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 N    +  +       E LS+++D++ GGF  APKF  P  ++ +L  + +  +
Sbjct: 188 MDLVHLNTEDVDFAKYPTSEIVENLSQNFDAKNGGFKGAPKFMMPNNLEFLLRQAVQENN 247

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G        V  TL  MA GG++D +GGGF RYS DE+WHVPHFEKMLYD  QL +
Sbjct: 248 ADLLG-------YVTLTLDKMAYGGLYDQIGGGFARYSTDEKWHVPHFEKMLYDNAQLVS 300

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +A+ +TK   Y  +  + LD++ RDM    G  +S+ DADS +  G  + +EGAFYV
Sbjct: 301 LYSNAYLVTKKPLYKEVVEETLDFIARDMTNDEGGFYSSLDADSKDENG--KLEEGAFYV 358

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           +TS+E++ IL +   +FKE+Y +   G  +           K   VLI          + 
Sbjct: 359 FTSEELQKILKDDFDIFKEYYNVNSYGKWE-----------KNHYVLIRKKTDDEIEKEF 407

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+  E +     + +  L   R+KRP+P LDDK + SWN +++  +  A K         
Sbjct: 408 GITSEAFQQKKEDWKNTLLAYRNKRPKPRLDDKTLTSWNAMMLKGYVDAYKTF------- 460

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                     ++EY++ A   A+FI      ++   L H++++G S   GFL+DYAF I 
Sbjct: 461 ---------GKREYLDAALKNAAFISEKQL-QKNGALFHNYKDGKSSINGFLEDYAFTIE 510

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             +DLY+     KWL  + ++ +     F D E   ++ T+ ED +++ R  E  D   P
Sbjct: 511 AFIDLYQATLDEKWLTLSKKMADYAKTNFFDEEKQMFYFTSKEDAAIVTRNFEYRDNVIP 570

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           + NSV   NL  L+     +  D      E S  +F+    ++           D+LS  
Sbjct: 571 ASNSVMAKNLFVLSKYFEETGFD------EISHQMFKNVSVEIEQYPSGFSNWLDLLSSF 624

Query: 620 SRK--HVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HIDPADTEEMDFWEEHNSNNASM 676
                 VV+VG   S   +          +LNK  + +I  A ++          N+  +
Sbjct: 625 QNDFYEVVIVGKDVSEKIK----------ELNKHYLPNIIIAGSK--------GENSGPL 666

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDP-ISLENL 709
             N ++ D  +  VC N +C  PV D  I++E+L
Sbjct: 667 FENRYTPDATLIYVCVNNACKLPVEDTKIAIESL 700


>gi|448576201|ref|ZP_21642244.1| hypothetical protein C455_04761 [Haloferax larsenii JCM 13917]
 gi|445729881|gb|ELZ81475.1| hypothetical protein C455_04761 [Haloferax larsenii JCM 13917]
          Length = 702

 Score =  341 bits (874), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 224/696 (32%), Positives = 338/696 (48%), Gaps = 73/696 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF D  +A+ LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLS
Sbjct: 53  SACHWCHVMADESFSDPDIAETLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
           V+L+P  KP   GTYFPPE + G PGF+ ++    ++W   RD +   AQ    AI +QL
Sbjct: 113 VWLTPQGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWQTDRDEIENRAQQWTSAIHDQL 172

Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
            +       A  +++ D+  Q ALR                    PKFP+P  I  +L  
Sbjct: 173 EDTPDTPGEAPGSEILDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDALL-- 222

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
            +    TG+     +   + + +L  MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYD
Sbjct: 223 -RGYAITGR----RQALDVAVESLDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYD 277

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q  L + YLD + LT    Y+ +  +  +++RR++    G  F+  DA S         +
Sbjct: 278 QAGLVSRYLDTYRLTGTEAYADVAAETFEFVRRELSHDDGGFFATLDAQSG-------GE 330

Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVWT  EV  +L E  A LF + Y + P GN            F+ K  ++ ++ +
Sbjct: 331 EGTFYVWTPDEVRSLLPELEADLFCDRYGVTPGGN------------FENKTTVLNVSAT 378

Query: 373 -SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
            S  A +  +  ++  + L E R+ LF  RS R RP  D+K++  WNGL+IS+FA+ +  
Sbjct: 379 LSDLAEEYDISEDEVEDKLAEARKALFAARSGRERPARDEKILAGWNGLMISAFAQGAVA 438

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
           L+ ++                  + A  A  F+R HL+D     L     NG  K  G+L
Sbjct: 439 LEDDS----------------LADDARRALDFVREHLWDADAGHLSRRVMNGEVKGDGYL 482

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYAFL  G  DLY+       L +A++L       F D   G  + T     +++ R +
Sbjct: 483 EDYAFLARGAFDLYQATGDVDPLAFALDLARAIHREFYDDAAGTLYFTPESGEALVTRPQ 542

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           E  D + PS   V+    + L      +    + + A+  L     R++   +    +  
Sbjct: 543 EATDQSTPSSLGVATSLFLDLEHFAPDAG---FGEAADTVLETHANRIRGSPLEHVSLAL 599

Query: 612 AADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EE 668
           AA+  +  VP    + +   +   ++   LA+ +    L   V+   PA  + +D W +E
Sbjct: 600 AAEKAASGVP---ELTVAADEMPAEWHETLASRY----LPGLVVAPRPATDDGLDAWLDE 652

Query: 669 HNSNNASMARNNFSAD--KVVALVCQNFSCSPPVTD 702
              + A        AD  +     C+NF+CS P  D
Sbjct: 653 LELDEAPPIWAAREADGGEPTVYACENFTCSAPTHD 688


>gi|345864005|ref|ZP_08816211.1| uncharacterized protein YyaL [endosymbiont of Tevnia jerichonana
           (vent Tica)]
 gi|345124912|gb|EGW54786.1| uncharacterized protein YyaL [endosymbiont of Tevnia jerichonana
           (vent Tica)]
          Length = 799

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 216/628 (34%), Positives = 316/628 (50%), Gaps = 62/628 (9%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    +TCHWCHVME ESFE+E +A+ LN+ F++IKVDRE  PD
Sbjct: 91  GEAAFAKAKRENKPIFLSIGYSTCHWCHVMERESFENESIARFLNEHFIAIKVDRESHPD 150

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D+ YMT V  + G GGWP+S  L+P+ KP  GGTYFPP+       F ++L++++  W+
Sbjct: 151 IDETYMTAVMLMTGSGGWPMSSLLTPEGKPFFGGTYFPPQQ------FASVLQQIQTIWE 204

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           ++ +   Q      E++++A+ A+ S       L   A      Q+ +S+D   GGF  A
Sbjct: 205 ERPEDTRQQA----ERVAKAVEAANSQRGKAKALDSQAADKAVAQMLRSFDELQGGFSQA 260

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP    + ++L       D  +     E  + +  TL  MA+GGI+D  GGGFHRYS 
Sbjct: 261 PKFPHEPWLFLLL-------DQLQRQPHPEALQALEVTLDAMARGGIYDQAGGGFHRYST 313

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W VPHFEKMLY+Q QLA +YL A+ LT    Y  +    LDY+ R+M  P G  +SA
Sbjct: 314 DNEWLVPHFEKMLYNQAQLARIYLLAWRLTGKEQYRRVVTQTLDYVLREMTAPSGGFYSA 373

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPH 357
            DADSA        +EG F+ W   E+ D L    A L  E Y +   GN          
Sbjct: 374 TDADSA-------GEEGLFFTWIPAEIRDALEPRDAGLAIELYAISERGN---------- 416

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
             F+G+N+L         A    M LE     +    + L  +R +R  P  DDK++ +W
Sbjct: 417 --FEGRNILHLPQSLEEYAETKSMNLEALHQRIDHINQVLRQIREQREHPLRDDKIVTAW 474

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NG++I++FA+A+ +L S++                Y + AE AA F+ +H   +   +L 
Sbjct: 475 NGMMITAFAQAADLLDSDS----------------YRQAAERAAEFLWQH-NRKGAGQLW 517

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
               +G S      +DYA+L  GL  LY+     KWL  + EL +     F +++GG Y 
Sbjct: 518 RVHLDGKSSISANQEDYAYLGEGLSYLYDLTGDPKWLSRSRELADAMLARFQEKDGGFYM 577

Query: 538 NTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           +  GED    +    D   D A  SG+SV++  L RL  + +G     Y+  AE  +A F
Sbjct: 578 SEAGEDHFNAMGRPRDGGSDNAIASGSSVALHLLQRLW-LRSGHLD--YKTAAESLIAYF 634

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKH 623
              ++        M  A D L+   R H
Sbjct: 635 AANIERQPNGYTYMLSAVDNLNQGERTH 662


>gi|320101644|ref|YP_004177235.1| N-acylglucosamine 2-epimerase [Isosphaera pallida ATCC 43644]
 gi|319748926|gb|ADV60686.1| N-acylglucosamine 2-epimerase [Isosphaera pallida ATCC 43644]
          Length = 909

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 227/631 (35%), Positives = 311/631 (49%), Gaps = 75/631 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME E F D  +A  LN  FV IK+DREERPDVD+ Y+T ++  +G GGWP+S+
Sbjct: 113 ACHWCHVMERECFRDPAIAARLNRDFVCIKLDREERPDVDQTYLTALRT-FGTGGWPMSI 171

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ KP  GGTYFPPED+ G  GF T+L +V  AW + RD + +        +   L 
Sbjct: 172 FLTPEGKPFYGGTYFPPEDRPGLTGFSTVLDRVARAWREDRDRIERVAGELDAMVGRILV 231

Query: 141 ASASSNKL--PDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQMMLY 192
             A+S+ L  P  L  +    C   L   +D  +GGFG        PKFP P  +  +L 
Sbjct: 232 RRAASSVLGPPPVLSSDLTDACYLILCGEFDPEYGGFGFDRTNPRRPKFPEPSRLLFLLE 291

Query: 193 HSKKLEDTGKS-------------GEASEGQ------KMVLFTLQCMAKGGIHDHVGGGF 233
               L++  +              G A+          M LFTL  +A+GG+ DHVGGG+
Sbjct: 292 RHAALKERPRPVKTPARSLLMLDPGPAAAPLIRRAPLDMALFTLDRIARGGLRDHVGGGY 351

Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
           HRY V   W VPHFEK LYD  QLA V++ AF LT D  +      I D++ R+M  P G
Sbjct: 352 HRYCVSRFWIVPHFEKTLYDNAQLARVFVRAFELTGDPRWRDEAEAIFDFVAREMTLPEG 411

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG---EHAILFKEHYYLKPTGNCDL 350
              SA DA+S + +G      G +Y+WT  +VE  L    E  I+ + +  L+       
Sbjct: 412 GFLSALDAESRDEDG------GEYYLWTRPQVEQALANPEESRIVLQVYGMLR------- 458

Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
               DP+ E  G+ VL+E  + S  A  LG+ L +    L   RR+L  VR +RP P  D
Sbjct: 459 ----DPNFE-GGRYVLLEPRERSEHARALGLELPELTRRLDAARRRLHQVRDQRPAPRKD 513

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
           DK I  WNGL+I++ A A +              V   +R  Y++ A+ AA F       
Sbjct: 514 DKAIAGWNGLMIAALAEAGR--------------VCDHNRDRYLKAAQRAAEFAWTQFRR 559

Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
           EQ  RL  ++R G +K  GF +DYAFL  GLL LY      +WL  A  L       F D
Sbjct: 560 EQ-DRLARTWRQGVAKGEGFAEDYAFLAEGLLRLYRADGDPRWLERARRLTERMRHDFGD 618

Query: 531 REG--GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            +   GG F  +  D  +  R K+  D   PS N+V+   L+ L  +      D   Q  
Sbjct: 619 PDPNRGGLFFASRRDARLPARFKDPLDSVLPSANAVAARVLIELGRL------DDDPQRY 672

Query: 589 EHSLAVFETRLKDMAM---AVPLMCCAADML 616
           + + A+    L D+A      P+M  A + L
Sbjct: 673 DQAEAILREFLPDLARRPGVWPMMMVALEEL 703


>gi|392966241|ref|ZP_10331660.1| protein of unknown function DUF255 [Fibrisoma limi BUZ 3]
 gi|387845305|emb|CCH53706.1| protein of unknown function DUF255 [Fibrisoma limi BUZ 3]
          Length = 677

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 210/561 (37%), Positives = 295/561 (52%), Gaps = 50/561 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE E VA+++N+ FV IKVDREERPDVD +YM  VQA+   GGWPL+
Sbjct: 48  SACHWCHVMERESFEKEPVARVMNENFVCIKVDREERPDVDAIYMEAVQAMGVQGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIEQLSEA 138
           VFL PD KP  G TY PP++      +  +L  ++DA+D+ R  LAQS   FA E     
Sbjct: 108 VFLMPDAKPFYGVTYLPPQN------WVNLLGNIRDAFDEHRADLAQSAEGFATEL---N 158

Query: 139 LSASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSK 195
           LS S      P +       L +   ++    D   GG   APKFP P   Q +L Y+  
Sbjct: 159 LSDSERFGLQPADPLFSAETLDVLYRKVHVKADDEKGGMRRAPKFPMPSIWQFLLRYYDS 218

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
            +  T ++  A    ++V  TL  MA GGI+D +GGGF RYS D  W  PHFEKMLYD G
Sbjct: 219 TVASTTENETA---LRLVTLTLDRMALGGIYDQLGGGFARYSTDADWFAPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           QL  +Y +A+SLTK   Y ++    + + +R+++ P G  +SA DADS   EG     EG
Sbjct: 276 QLLTLYSEAYSLTKSPLYKHVVYQTIAFAQRELLSPEGGFYSALDADS---EGV----EG 328

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FY +T+ E+ D LG+    F E Y L   GN +            G+N+L       + 
Sbjct: 329 KFYTFTTSELRDALGDEFDWFAELYNLSEDGNWE-----------HGRNILHRTESDESF 377

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A ++G         L     +L  +R++R RP LDDK++ SWNGL++   A A ++    
Sbjct: 378 AERMGWSAADLSVRLDATHLRLLKIRNERIRPGLDDKILCSWNGLMLKGLATAYRV---- 433

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                F  P       E++ +A   A F+ + + D +  RL H+++ G ++ PGFL+DYA
Sbjct: 434 -----FGEP-------EFLTLALRNAYFLLQKMRDNRNGRLWHTYKEGRARQPGFLEDYA 481

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +I GLL LY+      WL  A  L     + F D     +F T      ++ R KE  D
Sbjct: 482 TVIDGLLALYQATFTESWLTEADRLTQYVFDSFSDPNDDLFFFTDKNGEELIARRKELFD 541

Query: 556 GAEPSGNSVSVINLVRLASIV 576
              PS NS+   NL  ++ ++
Sbjct: 542 NVIPSSNSIMAGNLYAMSLLL 562


>gi|114319387|ref|YP_741070.1| hypothetical protein Mlg_0225 [Alkalilimnicola ehrlichii MLHE-1]
 gi|114225781|gb|ABI55580.1| protein of unknown function DUF255 [Alkalilimnicola ehrlichii
           MLHE-1]
          Length = 697

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 236/693 (34%), Positives = 344/693 (49%), Gaps = 57/693 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESFED  +A+L+N+ F++IKVDREERPD+D++Y T  Q L    GGWPL
Sbjct: 51  SACHWCHVMAHESFEDPAIARLMNERFINIKVDREERPDLDRIYQTAHQLLTRRPGGWPL 110

Query: 79  SVFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           ++ L+PD + P+  GTYFPP+ + G PGF  +LR+V +A   +   +A         L  
Sbjct: 111 TLVLTPDDQTPVFAGTYFPPDTRGGMPGFADVLRQVDEAIRSQPQAVADQNRALRHALGR 170

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
              A A        L    LR   + L+ S+D   GGFG+APKFP P  I+ +L H    
Sbjct: 171 LAHAPADGGDA--ALGNAPLRAARDALADSFDRVHGGFGAAPKFPHPGGIERLLRHYALT 228

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G   +   M   TL+ MA GGI+D VGGGF RYSVDE W +PHFEKML D   L
Sbjct: 229 LVTG-DGPDRDALHMACHTLRRMALGGIYDQVGGGFARYSVDEYWMIPHFEKMLCDNALL 287

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +Y DA+  T D  Y+ + ++  +++R +M  P G   ++ DADS   EG     EG +
Sbjct: 288 LGLYADAWHATGDGLYARVVQETAEWVRAEMERPEGGYCTSLDADS---EGG----EGRY 340

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           Y+WT  EV ++L E      EH +           + +P N F+G+  L      S SA 
Sbjct: 341 YLWTPDEVRELLDEDEWRLVEHRF----------GLDEPAN-FEGRWHLHVQASFSESAR 389

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +LG P E+ + +    R+KL   R +R RP  DDKV+ +WNGL+I++ ARA ++L     
Sbjct: 390 RLGRPREQVVALWQSARQKLQRARGQRVRPGRDDKVLTAWNGLMIAALARAGRLL----- 444

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      D   +   A  A  F+R  L D+Q  RL  S+R G +     L+DYA+L
Sbjct: 445 -----------DEPAWTASALRALGFLRERLADDQG-RLYASWRAGRAAHQACLEDYAYL 492

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           + G+L+  +       L +A+ L +T  E F D++ GG++ T  +   ++ R +   D +
Sbjct: 493 LEGVLECLQSEWSDDRLGFALHLADTLLERFQDKDEGGFWMTADDHEPLIHRPRPLADDS 552

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            PSGN+V++  L RL  ++   +   Y +     L      +  M  A   +  A +   
Sbjct: 553 LPSGNAVALRALQRLGHLLGEPR---YLEAVARGLRAAAGAIARMPEAHASLLTALEEYL 609

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
            P  + VV+ G           A  +  Y  N+ V  + PAD           +  A+ A
Sbjct: 610 YPP-EIVVIRGAPEVTGPWRTRALKY--YTPNRLVFAL-PADAAPPGVLSGRQTEGAAPA 665

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                     A VC   +C  PV     LE +L
Sbjct: 666 ----------AWVCSGKTCRAPVRSLDELERVL 688


>gi|149369679|ref|ZP_01889531.1| hypothetical protein SCB49_07627 [unidentified eubacterium SCB49]
 gi|149357106|gb|EDM45661.1| hypothetical protein SCB49_07627 [unidentified eubacterium SCB49]
          Length = 703

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 205/554 (37%), Positives = 294/554 (53%), Gaps = 49/554 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFED  VA  +N+ F+S+KVDREERPD+D++Y+  VQ + G  GWPL+V 
Sbjct: 80  CHWCHVMEHESFEDSLVAATMNENFISVKVDREERPDLDQIYINAVQLMTGSAGWPLNVV 139

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTYF  ED      + T+L+K++    +  + L +       QL E +  
Sbjct: 140 TLPDGRPVWGGTYFKKED------WITVLQKIQKINTENPEKLNEIAG----QLEEGIKN 189

Query: 142 --SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
               + N    +L    L         S+D RFGG+  APKF  P   + +L ++ + +D
Sbjct: 190 LDLVALNTEDVDLKNYNLDEVIHTWKSSFDHRFGGYKRAPKFMMPSNYEYLLRYAVQDKD 249

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   E Q  VLFTL  MA GGI+D +GGGF RYSVDE+WHVPHFEKMLYD  QL +
Sbjct: 250 -------QELQDYVLFTLDQMAYGGIYDAIGGGFSRYSVDEKWHVPHFEKMLYDNAQLVS 302

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +A+ LTK   Y  I  + L ++  +M    G  +S+ DADS   +G    +EGAFYV
Sbjct: 303 LYSNAYKLTKKPLYKEIITETLAFIFEEMTTEEGAFYSSLDADSLTEDGTL--EEGAFYV 360

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           +T++E++  LG    LF  +Y +   G  +            GK VLI   D ++ A  L
Sbjct: 361 YTAQELKSQLGTDFDLFAAYYNVNNFGKWE-----------DGKYVLIRDEDDASIAKDL 409

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+  E     +   +  L   R  R +P LDDK + SWNGL++  +         +A +A
Sbjct: 410 GISTEALQRKVANWKAILKAYRGFRSKPRLDDKTLTSWNGLMLKGYV--------DAYTA 461

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
           + N        KEY++ A   A FI+     E    L H+++ G S   G+L+DYA +IS
Sbjct: 462 LGN--------KEYLDAALKNAVFIKDKQLKEDG-SLYHNYKEGRSTINGYLEDYASVIS 512

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           G + LYE  +  +WL  A +L +     F D E G ++ T+ EDP ++ R  E  D    
Sbjct: 513 GFISLYEVTADVQWLDLAKKLTDYTFTKFYDTESGMFYFTSSEDPKLVARSVEYRDNVIA 572

Query: 560 SGNSVSVINLVRLA 573
           S N++   N+  L 
Sbjct: 573 SSNAIMAQNIFVLG 586


>gi|408680345|ref|YP_006880172.1| Thymidylate kinase [Streptomyces venezuelae ATCC 10712]
 gi|328884674|emb|CCA57913.1| Thymidylate kinase [Streptomyces venezuelae ATCC 10712]
          Length = 676

 Score =  340 bits (872), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 233/702 (33%), Positives = 342/702 (48%), Gaps = 87/702 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+ +A L+N+ FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 51  SSCHWCHVMAHESFEDDAIAGLVNEHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD  P   GTYFPPE ++G P F  +L  VKDAW  +RD + +     ++ L+  +
Sbjct: 111 VFLTPDAAPFYFGTYFPPEPRHGMPSFPEVLEGVKDAWADRRDEVGEVAERIVKDLAGRS 170

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L+         +EL Q  L      L++ YD+  GGFG APKFP  + ++ +L H  +  
Sbjct: 171 LAYGGEGVPGEEELAQALL-----GLTREYDATRGGFGGAPKFPPSMTLEFLLRHHAR-- 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L 
Sbjct: 224 -TGAEG----ALQMAADTCEAMARGGIYDQLGGGFARYAVDRAWVVPHFEKMLYDNALLC 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y   +  T       +  +  D++ R++  P G   SA DADS   +G  R  EGA+Y
Sbjct: 279 RAYAHLWKATGSDLARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYY 336

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT  ++ ++LG E A L   HY +   G             F+  + +++L   +  A 
Sbjct: 337 VWTPAQLTEVLGAEDAALAAAHYGVTEAGT------------FEHGSSVLQLPQQAGPAE 384

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                     + +     +L   R +R RP  DDKV+ +WNGL I++ A    +      
Sbjct: 385 A---------DRIASIAARLLAAREERERPGRDDKVVAAWNGLAIAALAETGALF----- 430

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
                      DR + +E A  AA  + R   DE   RL  + ++G +    G L+DYA 
Sbjct: 431 -----------DRPDLVERATEAADLLVRVHMDESA-RLTRTSKDGRAGTNAGVLEDYAD 478

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKED 553
           +  G L L        WL +A  L +    + LDR   EGG  ++T  +  +++ R ++ 
Sbjct: 479 VAEGFLALAAVTGEGAWLEFAGFLLD----IVLDRFTAEGGALYDTAHDAEALIRRPQDP 534

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL----- 608
            D A PSG + +   L+   S  A + SD +R  AE +L V    +K +    P      
Sbjct: 535 TDNATPSGWTAAAGALL---SYAAHTGSDAHRAAAEGALGV----VKALGPRAPRFIGWG 587

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +  +  +L  P  + + +VG      F+ +   A  +      +    P D+EE      
Sbjct: 588 LAVSEALLDGP--REIAVVGAPGDEVFQELRRTALRATAPGAVLASGAP-DSEEFPL--- 641

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                  +      A    A VC++F+C  PVTDP  L   L
Sbjct: 642 -------LGDRPLVAGGAAAYVCRHFTCDAPVTDPEELRRKL 676


>gi|120434573|ref|YP_860266.1| hypothetical protein GFO_0204 [Gramella forsetii KT0803]
 gi|117576723|emb|CAL65192.1| protein containing DUF255 [Gramella forsetii KT0803]
          Length = 682

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 203/580 (35%), Positives = 300/580 (51%), Gaps = 52/580 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFEDE VA+L+N  ++ IKVDREERPDVD+VYM  VQ + G GGWP+++ 
Sbjct: 57  CHWCHVMEHESFEDEAVAELMNVNYICIKVDREERPDVDQVYMNAVQIMTGMGGWPMNIV 116

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTYF  E       +   L+++   ++ + + L +      E+L + L  
Sbjct: 117 ALPDGRPVWGGTYFRKEQ------WMEALQQISHLFNSQPEKLLEYA----EKLEQGLKQ 166

Query: 142 SASSNKLPDE-LPQNALRL-CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 + ++  P     +   E+  +S+D + GG+  +PKF  P   + +L ++ +  D
Sbjct: 167 IQIIEPVKEQNKPHKDFFIPIIEKWKRSFDPKNGGYQRSPKFMMPNNYEFLLRYAFQNSD 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   E +   L TL  ++ GG+ D + GGF RYSVDE+WHVPHFEKMLYD  QL  
Sbjct: 227 -------KELKSHCLLTLNRISWGGVFDPIEGGFSRYSVDEKWHVPHFEKMLYDNAQLVQ 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y   + +TK+ +Y  + +  L ++  +M    G  +SA DADSA   G  +K+EGA+YV
Sbjct: 280 LYSKTYKITKNNWYKEVVKQTLQFISAEMTDESGAFYSALDADSANENG--KKEEGAYYV 337

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT + ++ ILG    +F E+Y +   G  +               VLI        +  L
Sbjct: 338 WTKENLKSILGNEFEIFSEYYNINNYGKWEADNY-----------VLIRTKSLDQLSQDL 386

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +P E     + +C  KL   +SKR +P LDDK + SWN L+IS +  A K  ++     
Sbjct: 387 DIPREDLQQRIAQCNLKLKKAKSKREKPGLDDKSLTSWNALMISGYTEAYKAFRN----- 441

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       EY+E AE  A+FI  +   E   RL HS++NG S   G+L+DYAF IS
Sbjct: 442 -----------GEYLEAAEKNAAFILENQLQENG-RLYHSYKNGKSTINGYLEDYAFSIS 489

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             LDLYE     ++L  A  L +  D+ F D   G YF T+ +D  ++ +  E  D   P
Sbjct: 490 AFLDLYECTFEQEYLGRARNLIDVTDKDFTDSVSGLYFFTSDKDRELVTKTIEISDNVIP 549

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           + NS    N+ R   +    K   Y   AE  L +   ++
Sbjct: 550 ASNSEMAKNIFRFGKLTGDMK---YVGKAEKMLQIVMDKI 586


>gi|300024782|ref|YP_003757393.1| hypothetical protein Hden_3279 [Hyphomicrobium denitrificans ATCC
           51888]
 gi|299526603|gb|ADJ25072.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
           ATCC 51888]
          Length = 678

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 227/694 (32%), Positives = 340/694 (48%), Gaps = 78/694 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED G A+++N++F++IKVDREERPD+D +YM  +  L   GGWPL++
Sbjct: 50  ACHWCHVMAHESFEDPGTAEVMNEFFINIKVDREERPDIDAIYMGALHQLGEQGGWPLTM 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL  D KP  GGTYFP E +YGRP F T+L ++ +A+  +RD +  +     E L  AL 
Sbjct: 110 FLDSDAKPFWGGTYFPREARYGRPAFVTVLLRIAEAYANQRDDVRNN----TEALLAALK 165

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            +   N  P + P+ A    A  +S++ D  +GG   APKFP+   I  +L+        
Sbjct: 166 TAPGDNA-PRQ-PRPATEDVAAAISRAVDREYGGLSGAPKFPQ-WSIFWLLWR------V 216

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G   + ++ +  V+ TL+ + +GGI+DH+GGGF RYSVDE W VPHFEKMLYD   L ++
Sbjct: 217 GIRDDNADAKNGVITTLRHICQGGIYDHLGGGFSRYSVDEYWLVPHFEKMLYDNALLIDL 276

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
             + +  T+D  +     + + ++ R+MIG  G   ++ DADS   EG    +EG FYVW
Sbjct: 277 MTEVWRETQDPLFKTRVAETIAWIEREMIGEAGGFAASLDADS---EG----EEGKFYVW 329

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
            + E+ED+LG E A  F   Y + P GN            F+G  +L  L         L
Sbjct: 330 NADEIEDVLGAEDAAFFSRVYGVVPGGN------------FEGHTILNRLG-------SL 370

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
               E+    L   R KL + R+ R RP  DDK++  WNGL I++ +RA+ +L+  A   
Sbjct: 371 AFLSEEDEARLTSLRAKLLERRASRIRPGWDDKILADWNGLAIAAISRAAIVLEQPA--- 427

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                        ++ +AE A S I   L      RL H++R+G +KAP    DYA +  
Sbjct: 428 -------------WLALAERAFSAITTKLA-ASDGRLFHAYRSGLAKAPATASDYANMTW 473

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             + L+      ++L  A +     D+ + D + GGYF    +   V++R+K   D A P
Sbjct: 474 AAIRLFTATGSERYLDQAQQWTRILDKHYWDEDRGGYFTAADDTLDVVVRLKSATDDAAP 533

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA--ADMLS 617
           + N++ + NL+ LA++   +  D   +    + A             P+  CA  A  L 
Sbjct: 534 NANAIQLSNLIALAALTGDAAYDDRARRLSQAFA-------SAVAHTPISHCALLAAELD 586

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
                 V +       D            +L +  I   P   E +   E  +  ++   
Sbjct: 587 ADRVVQVAIQAPPGPCDLRG---------ELQRLSI---PGALEFVGLSEAQSGQSSLFG 634

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
             +    K  A VC    CS P+ +P  L   LL
Sbjct: 635 GKSMIDGKSTAYVCVGPVCSAPIQEPEKLRQALL 668


>gi|386826330|ref|ZP_10113437.1| thioredoxin domain-containing protein [Beggiatoa alba B18LD]
 gi|386427214|gb|EIJ41042.1| thioredoxin domain-containing protein [Beggiatoa alba B18LD]
          Length = 700

 Score =  339 bits (870), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 222/697 (31%), Positives = 340/697 (48%), Gaps = 64/697 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESFED   A+++N+ F++IKVDREERPD+DK+Y    Q L    GGWPL
Sbjct: 56  SACHWCHVMAHESFEDPETAQVMNELFINIKVDREERPDLDKIYQMAHQILTRRAGGWPL 115

Query: 79  SVFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA---QSGAFAIEQ 134
           ++FL+PD   P  GGTYFP E ++  P FK IL +V + + + R  +    Q  A AIE 
Sbjct: 116 TMFLTPDAHYPFFGGTYFPKEPRFNLPAFKNILYRVAEFYRQNRHGIVEQCQQLAQAIEY 175

Query: 135 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
                +   S   +  EL    L    +Q+ +S+DS +GGF  APKFP    ++ + +H 
Sbjct: 176 HDTPRTEGVSITTISPEL----LNTARQQIEQSFDSEWGGFSKAPKFPHLTNVERLFHHY 231

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
                     E  +G ++ + TL  MA GGI+D VGGGF RYSVD+ W +PHFEKMLYD 
Sbjct: 232 HITAHQENPDE--DGLQIAMHTLTRMALGGIYDQVGGGFCRYSVDDYWMIPHFEKMLYDN 289

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
                +Y +A+ L K   Y  + +   D++ R+M    G  +S  DADS   EG     E
Sbjct: 290 APFLTIYSEAWQLAKIPLYKQVAQATADWVLREMQLSEGGFYSTLDADS---EGV----E 342

Query: 315 GAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           G FYVWT +E++ +L  E    F   + L    N + +              L   +D  
Sbjct: 343 GKFYVWTPEEIKGLLSPELYAPFAYQFGLNRPANFEETHWH-----------LFGWHDRE 391

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
           A A K  + LE+    L +    LF  R +R  P  D+K++ +WNG++I + A A +I K
Sbjct: 392 AVAVKFDLSLEEVNARLDKALAILFQAREQRVHPQRDEKILTAWNGMMIKALATAGRIFK 451

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                           R +Y+  AE + +FIR  L+  +  +L  ++++G +    +LDD
Sbjct: 452 ----------------RTDYIHAAEQSLNFIRSTLW--KNGKLLATYKDGKAHLNAYLDD 493

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAFLI G+L L +         + +EL +     F D+E GG+F T      ++ R+K  
Sbjct: 494 YAFLIEGILTLLQCRWNNSDYAFMLELVDVLLHEFEDKEKGGFFFTGNHHEQLIARLKPL 553

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D A PSGN V+ + L RL  ++    +D Y + A  ++ +    ++ +A A   +  A 
Sbjct: 554 ADEAIPSGNGVAAVVLGRLGHLLG---NDEYLRAAARTVNIALPAIEQIAYAHNTLLLAV 610

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
           +    P +  ++    K   +++   A     Y   +    I    +E +          
Sbjct: 611 EDYLFPPQLIIIRADAKHLAEWQ---AVCQHDYAPQRLCFAIPNHLSEPL---------- 657

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +  N     + VA +C  + CS P+    +LE  L
Sbjct: 658 TGVLANCKPQGEAVAYICHGYQCSAPIHSLTALEEAL 694


>gi|357391644|ref|YP_004906485.1| hypothetical protein KSE_47490 [Kitasatospora setae KM-6054]
 gi|311898121|dbj|BAJ30529.1| hypothetical protein KSE_47490 [Kitasatospora setae KM-6054]
          Length = 687

 Score =  339 bits (870), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 243/699 (34%), Positives = 338/699 (48%), Gaps = 79/699 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDEG A  LN+ FV++KVDREERPDVD VYM  VQA  G GGWP++V
Sbjct: 49  ACHWCHVMAHESFEDEGTAGFLNERFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P   GTYFPPE ++G P F+ +L  V  AW  +R  + +        L+E  S
Sbjct: 109 FLTPEKEPFYFGTYFPPEPRHGMPSFRQVLEGVDKAWTGRRAEVGEVAGRISRDLAERAS 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             A  + +     +  L     +L+KSYD R GGFG APKFP  + ++ +L H  +   T
Sbjct: 169 VYAVGSGVAGVPGEGELGAAVAELAKSYDERRGGFGGAPKFPPSMVLEFLLRHHAR---T 225

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G +       +M   T + MA+GGIHD +GGGF RY+VD  W VPHFEKM YD   L  V
Sbjct: 226 GSAA----ALRMAGRTCEAMARGGIHDQLGGGFARYAVDATWTVPHFEKMCYDNALLLRV 281

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL  +  T +     +     D+L R++  P G   SA DADS + E   R  EGA+Y W
Sbjct: 282 YLHLWRATGEERARRVALSTADFLLRELRTPEGGFASALDADSLD-EATGRTAEGAYYAW 340

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           T +++E +LG   A    E + +   G  +            G +VL  L D        
Sbjct: 341 TPEQLERVLGAADAGYAAELFGVTANGTFE-----------HGSSVLQLLADPEDR---- 385

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
               ++Y ++    R KLF+ RS RP P  DDKV+ +WNGL I++ A A  +L+      
Sbjct: 386 ----DRYESV----RAKLFEARSHRPAPARDDKVVAAWNGLAIAALAEAGALLE------ 431

Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFL 497
                     R E +E AE AA   I  HL  +   RL  + R+G + A  G L+DYA  
Sbjct: 432 ----------RPELVEAAERAADLLIAVHLTPDG--RLLRTSRDGRAGANAGVLEDYADT 479

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L LY     + WL  A EL +     F D   G  ++T  +   ++ R ++  D A
Sbjct: 480 AEGFLALYAVTGESSWLQLAGELLDLVLRHFTDEASGALYDTADDAEQLIRRPQDPTDNA 539

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMCC 611
            PSG + +   L+  A+    + SD +R  AE +L +  T      R     +AV     
Sbjct: 540 TPSGWTAAAGALLTYAAY---TGSDRHRTAAERALGIVSTLGTRAPRFTGWGLAV----- 591

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           A  +L  P  + V +VG         +  AA  +      V   +P DTE          
Sbjct: 592 AEALLDGP--REVAVVGAPDDPARAALHLAALRATAPGAVVAVGEPGDTE---------- 639

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
               +A       +  A VC++F+C  P  D   L + L
Sbjct: 640 -VPLLADRPLLDGRPAAYVCRHFACERPTADAADLADRL 677


>gi|312143535|ref|YP_003994981.1| glutamate--cysteine ligase [Halanaerobium hydrogeniformans]
 gi|311904186|gb|ADQ14627.1| putative glutamate--cysteine ligase/putative amino acid ligase
           [Halanaerobium hydrogeniformans]
          Length = 647

 Score =  339 bits (870), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 193/579 (33%), Positives = 305/579 (52%), Gaps = 68/579 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFEDE VA++LN +F+SIKVDREERP++D +YM   Q + G GGWPLS+
Sbjct: 51  TCHWCHVMEKESFEDEEVAQMLNQFFISIKVDREERPEIDSLYMDVCQTMTGSGGWPLSI 110

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++ D KP    TY P E+KYGR G  TIL ++   W ++R  L Q+    +  LS+   
Sbjct: 111 FMTADKKPFYAATYIPKENKYGRKGLLTILPEIHYLWTEERKKLLQASENIVSHLSKINQ 170

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              +      EL  N      E +  +YD ++GGFGS+PKFP    +  +L++ KK   T
Sbjct: 171 NQKA------ELASNIFEKTVEAIESNYDHQYGGFGSSPKFPMYQYLLFLLHYWKK---T 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+    S    ++  TLQ M  GGI+D +  GFHRYS D  W +PHFEKMLYDQ  +  +
Sbjct: 222 GEDKYLS----ILETTLQQMRAGGIYDQLAFGFHRYSTDREWKMPHFEKMLYDQALMIYI 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y  A+  T    Y+ + ++I+ +L  +M+   G  F+A DADS         +EG +Y+W
Sbjct: 278 YTAAYQATAKEIYADVVKEIVSFLESEMLAKEGAFFTAIDADSG-------GEEGKYYLW 330

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
              E++ IL E                   +R++   +    KN+ + L +         
Sbjct: 331 EKSELKSILNE----------------AQFNRLNKIFDIQANKNINLSLKN--------- 365

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             ++ Y N L E + KL   R +R  P  D K++  WNGL+I++ A+A  +LK       
Sbjct: 366 --VQDY-NQLAELKDKLLKHRKERIHPSKDKKILTDWNGLLIAALAKAGFVLK------- 415

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   DR  Y+++A+    FI  ++   +  RL HS+  G       L+DY+FL+ G
Sbjct: 416 -------EDR--YLKLADDVEKFIHNNMKTNKG-RLAHSYYEGEKSKIDNLNDYSFLLWG 465

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           L++LY+     ++L+ A +      E F D++   ++ +  ++  + ++    +D + PS
Sbjct: 466 LIELYQATLKDEYLIKAEKTAKIMKEYFWDQKEEAFYFSAKDNEDLFIKQINANDHSLPS 525

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
            NS++  N ++LA +        Y+++A+  +A F  ++
Sbjct: 526 ANSIAAFNFLKLAHLKDNLA---YQKDAQKIIAAFSDQI 561


>gi|395774413|ref|ZP_10454928.1| hypothetical protein Saci8_31786 [Streptomyces acidiscabies 84-104]
          Length = 682

 Score =  339 bits (870), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 235/704 (33%), Positives = 338/704 (48%), Gaps = 90/704 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDQHTADYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-A 138
           VFL+PD +P   GTYFPPE ++G P F+ +L  V+ AW  +RD +A+     +  L E  
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGSPSFRQVLEGVRQAWTGRRDEVAEVAGKIVRDLGERE 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           LS   +     +EL    L      L++ YD + GGFG APKFP  + I+ +L H  +  
Sbjct: 168 LSFGDAQPPGEEELAAALL-----GLTREYDPQRGGFGGAPKFPPSMVIEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGSEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       I  +  D++ R++  P G   SA DADS   +G  +  EGA+Y
Sbjct: 276 RVYAHLWRSTGSELARRIALETADFMVRELRTPEGGFASALDADS--DDGTGKHVEGAYY 333

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
           VWT  E+ D LGE A L   ++ +   G  +           +G +VL +   +    A 
Sbjct: 334 VWTMAELRDTLGEDADLAAHYFGVTEDGTFE-----------EGASVLQLPQTEGVFDAD 382

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K           +     +L   R++RP P  DDK++ +WNGL I++ A           
Sbjct: 383 K-----------IASIHARLLAKRAERPAPGRDDKIVAAWNGLAIAALAETGAYF----- 426

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      DR + +E A +AA  + R   D+  H  + S    P    G L+DY  +
Sbjct: 427 -----------DRPDLIEAALTAADLVVRIHLDDHAHLSRTSKDGQPGANAGVLEDYGDV 475

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L L    +   WL +A  L +     F D E G  ++T  +   ++ R ++  D A
Sbjct: 476 AEGFLALAAVTAEGVWLDFAGLLLDHVLARFTDPESGALYDTASDAEQLIRRPQDPMDNA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCA 612
            PSG + +      L S  A + ++ +R  AE +L V    +K +   VP      +  A
Sbjct: 536 TPSGWTAAASA---LLSYAAHTGAEPHRTAAEKALGV----VKALGPRVPRFIGWGLSVA 588

Query: 613 ADMLSVPSRKHVVLVGHK------SSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
             +L  P  + V +V  +       ++  + +LA A  +      V+     D++E    
Sbjct: 589 EALLDGP--REVAVVARELTDPAGKNLHRQALLATAPGA------VVAYGVTDSDEFPL- 639

Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                    +A    S  +  A VC+NF+C  P TDP  L   L
Sbjct: 640 ---------IADRPLSGSEATAYVCRNFTCDLPTTDPDRLRTAL 674


>gi|154150757|ref|YP_001404375.1| hypothetical protein Mboo_1214 [Methanoregula boonei 6A8]
 gi|153999309|gb|ABS55732.1| protein of unknown function DUF255 [Methanoregula boonei 6A8]
          Length = 723

 Score =  339 bits (869), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 231/714 (32%), Positives = 343/714 (48%), Gaps = 62/714 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  R  FL    + CHWCHVM  ESFE+  VA +LN  FV IKVDREERPD
Sbjct: 54  GGEAFSRAKREDRPLFLSIGYSACHWCHVMARESFENNEVAGILNKHFVCIKVDREERPD 113

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD VYM   Q L G GGWPL++ ++P+ KP   GTYFP   + G PG   IL  + + W+
Sbjct: 114 VDSVYMGICQQLTGQGGWPLTIIMTPEKKPFFAGTYFPKTGRAGMPGLTDILITIANLWE 173

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
            +RD L    A A + LS+A     S +  PD   ++ L     +L+  +DS  GGFG A
Sbjct: 174 TRRDELY---AAAEQILSDAHLLHKSPSGDPD---RHLLDKGFRELAAQFDSANGGFGRA 227

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  I  +L + +       +GE +    M   TL  + +GGI DHVGGG HRY+ 
Sbjct: 228 PKFPAPHNILFLLRYWQ------MTGE-NRALDMAEQTLDAIRQGGIWDHVGGGMHRYAT 280

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D RW VPHFEKML DQ  L     +A++ T  + Y  I  + + Y+ R++  PGG  ++A
Sbjct: 281 DARWLVPHFEKMLSDQAMLVLASTEAYAATGKIRYRTIAEECIAYVLRELRDPGGGFYTA 340

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
           EDADS          EGA+Y+WT +E+  ILG  A      + L P           P +
Sbjct: 341 EDADSP-------AGEGAYYLWTEEEIARILGLDAAFASILFSLTPL----------PGS 383

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           E K  +++            LG+  ++ ++      R+L   R KRP+P  D K++   N
Sbjct: 384 E-KHASIISAAGPDPVLLKNLGITEQELISRRAGILRRLAHEREKRPKPARDTKILTDTN 442

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
            L  ++ ARA ++L + +                Y + A     F+ +++ + +   L H
Sbjct: 443 ALFCTALARAGRVLGNPS----------------YTDAAACTLRFLLQNMRNGEGRILHH 486

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
           S   G    PGF DDYA L++  ++LY+  S    +  A+ +       + D+EGGG+F 
Sbjct: 487 S-GGGEHAVPGFADDYAHLVAAHIELYKATSDIACIKEAVTINALLLTHYRDKEGGGFFT 545

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
           T      + ++ KE +DGA PS N+ +  NL  L  +     +D + + A          
Sbjct: 546 TADTAVDLPVQKKEWYDGAVPSANTTAFENLTALYRLTG---NDVFNEAALECARFITGA 602

Query: 599 LKDMAMAVP--LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
                 AV   L   A   L+  + + +V+ G  ++   + +LA A   Y L   +I + 
Sbjct: 603 ASRAPHAVTGFLAALACSPLT-GNTQDLVIAGDPANAGTQTLLAVARRQY-LPGLLILLR 660

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           P         +E ++    +        K  A +C   +C PPV+DP  L N L
Sbjct: 661 PPGKAG----DEVDTVFPVVQGKVPHEGKATAYLCTGLACLPPVSDPQELVNQL 710


>gi|110638981|ref|YP_679190.1| hypothetical protein CHU_2595 [Cytophaga hutchinsonii ATCC 33406]
 gi|110281662|gb|ABG59848.1| conserved hypothetical protein; thioredoxin domain [Cytophaga
           hutchinsonii ATCC 33406]
          Length = 681

 Score =  339 bits (869), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 201/557 (36%), Positives = 294/557 (52%), Gaps = 49/557 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME E FE E VA ++ND F++IK+DREERPD+D++YM  V A+   GGWPL+
Sbjct: 56  SACHWCHVMEHECFEKEEVAAVMNDLFINIKIDREERPDLDQIYMDAVSAMGLRGGWPLN 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD KP  GGTYFP +       +  +L ++ +A+   R+ + +S     E L+++ 
Sbjct: 116 VFLTPDAKPFYGGTYFPQDH------WLNLLGQISNAYLNHREDILKSAESFTESLNQSD 169

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                     +   ++ L L  +++S+ +D+  GG   APKFP P    + LY  +    
Sbjct: 170 VFKYGLVDDAETFHKDELDLAYDRISQQFDTDMGGMNKAPKFPMP---SIYLYLLRDYAL 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+ G      + V  TL  MA GGI+D +GGGF RYSVD  W  PHFEKMLYD GQL +
Sbjct: 227 TGRQGSL----QHVELTLDKMAMGGIYDTIGGGFARYSVDGAWFAPHFEKMLYDNGQLLS 282

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +A+++TK   Y  +  +   +L+R+M+ P G  +SA DADS   EG     EG FY 
Sbjct: 283 LYSEAYTVTKKPLYKEVIEETYTWLKREMLSPEGGFYSALDADS---EGV----EGKFYC 335

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W  +E+  ++ E   LF  +Y +   GN +            G N+L +     A A+  
Sbjct: 336 WQYEELAQLIQEDFALFCAYYAITENGNWE-----------HGMNILYKRMSDEAFAAAH 384

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +  E     +   +  LF  R  R  P LDDK++ SWNG+++     A +IL    ++A
Sbjct: 385 SISAEALRESVSRWKNILFSERDPREHPGLDDKILASWNGIMLKGLCDAYRIL---GDAA 441

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
           + N  ++              A FI   LYD +T  L HS++N  +  PGFL+DY  +I 
Sbjct: 442 ILNTALMN-------------AEFILTKLYDGKT--LFHSYKNKKATIPGFLEDYTHVID 486

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           G L LYE     +WL  AI L N   + F D + G +F T+     ++ R KE  D   P
Sbjct: 487 GYLALYEVSLDEQWLRQAITLVNHVIDHFYDDDEGLFFYTSRTSEKLIARKKEIFDNVIP 546

Query: 560 SGNSVSVINLVRLASIV 576
           + NS    NL  L  ++
Sbjct: 547 ASNSSLARNLYHLGKLL 563


>gi|431797737|ref|YP_007224641.1| thioredoxin domain-containing protein [Echinicola vietnamensis DSM
           17526]
 gi|430788502|gb|AGA78631.1| thioredoxin domain protein [Echinicola vietnamensis DSM 17526]
          Length = 678

 Score =  338 bits (868), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 207/557 (37%), Positives = 296/557 (53%), Gaps = 55/557 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE  AK++N  FV IK+DREERPD+D +YM  VQ++   GGWPL+
Sbjct: 51  SACHWCHVMEHESFEDEATAKIMNAHFVCIKIDREERPDLDNIYMDAVQSMGLQGGWPLN 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS--GAFAIEQLSE 137
           VFL P+ KP  GGTYFP       P +K +L+ + +A+    D LA+S  G     +L E
Sbjct: 111 VFLMPNQKPFYGGTYFP------NPNWKGLLQNIAEAYATHHDELAKSAEGFGNSIKLKE 164

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
                 + +  P  L    L   A++++   D ++GGF  +PKFP P     +L ++   
Sbjct: 165 REKYRLADD--PSRLTAEDLTHMAQKIASQMDPQWGGFNRSPKFPMPAVWDFLLRYA--- 219

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                 G+AS  +K VLFTL  +  GGI+DH+ GGF RYSVD  W  PHFEKMLYD GQL
Sbjct: 220 ---ALKGDASLIEK-VLFTLTKIGMGGIYDHLRGGFARYSVDSEWFAPHFEKMLYDNGQL 275

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++Y  AF L+ D  +     + +++L+ +M+   G  ++A DADS   EG    +EG F
Sbjct: 276 LSLYAKAFQLSGDALFKEKINETVNWLQAEMLQEEGGFYAALDADS---EG----EEGKF 328

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           Y WT  E+E +L +    F E + +   GN +           KG N+L + +     A 
Sbjct: 329 YTWTHDELESMLDDEDAWFYECFNISEKGNWE-----------KGVNILFQTHTYEEIAH 377

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K G+  E+    L E + +L  +R+ R  P LDDKVI  WNGL IS  A+A     +   
Sbjct: 378 KHGLEEEQLAQNLNEVKERLLKIRNLRTPPGLDDKVIAGWNGLTISGLAQAYWATAN--- 434

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRH-LYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                 P+  S       +A    +FI  H L  EQ +R   S++NG +  P FL+DYA 
Sbjct: 435 ------PLAKS-------LAIQNGTFILDHMLKGEQLYR---SYKNGEAYTPAFLEDYAA 478

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +I G + LY+  S  +WL+ A  L     E F D + G ++    +  +++   KE  D 
Sbjct: 479 IIQGFIHLYQLTSEPRWLLVAKRLTAFVLEHFFDEDDGLFYFNNPDSETLIANKKEIFDN 538

Query: 557 AEPSGNSVSVINLVRLA 573
             PS N++   NL +L 
Sbjct: 539 VIPSSNALMATNLHQLG 555


>gi|255531347|ref|YP_003091719.1| hypothetical protein Phep_1443 [Pedobacter heparinus DSM 2366]
 gi|255344331|gb|ACU03657.1| protein of unknown function DUF255 [Pedobacter heparinus DSM 2366]
          Length = 670

 Score =  338 bits (867), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 209/585 (35%), Positives = 300/585 (51%), Gaps = 60/585 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE+  VA+++N  FV IKVDREERPD+D++YM  +Q + G GGWPL+
Sbjct: 51  SACHWCHVMERESFENHEVAEVMNRHFVCIKVDREERPDIDQIYMLAIQLMTGSGGWPLN 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
               PD +P+ GGTYF   D      +  +L  V   W  + D   ++ A+A ++L++ +
Sbjct: 111 CICLPDQRPIYGGTYFRKAD------WVNVLESVAAMWANEPD---KAIAYA-DRLTDGI 160

Query: 140 SASASSNKLP----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
             +     +P    DE  +  L    E   + +D   GG+  APKFP P   Q ML +S 
Sbjct: 161 QNA--EKIIPQIKVDEYTKAHLTAITEPWKRYFDMAEGGYNRAPKFPLPNNWQFMLRYSH 218

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
            ++D      A       L TL+ MA GGI+DHV GGF RYSVD  WHVPHFEKMLYD G
Sbjct: 219 LMQDDATHVSA-------LLTLEKMAMGGIYDHVAGGFSRYSVDGDWHVPHFEKMLYDNG 271

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           QL ++Y +A+  ++ + +  +  + +++L R+M+ P G  ++A DADS   EG     EG
Sbjct: 272 QLISLYAEAYQYSRSLLFKEVAEESIEWLEREMMSPEGLFYAALDADS---EGV----EG 324

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FYVW   + E +LG+ A L  +++ +   GN           E +  N+L+        
Sbjct: 325 KFYVWDKPDFEAVLGDDADLLSDYFNVTDEGNW----------EEEQTNILLRKFTEEEY 374

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A   G+ + + L  +   + KL   RSKR RP LDDK + +WN + I   A +++I    
Sbjct: 375 AEVKGISVVELLQKIKTAKIKLLQERSKRIRPGLDDKCLTAWNAMAIKGLAESAEIF--- 431

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        D   Y E+A+ AASFI  H+ +     L  +F+N  +  PGFLDDYA
Sbjct: 432 -------------DHPHYYEMAKKAASFILAHV-NTADGGLYRNFKNDKASIPGFLDDYA 477

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F I  L+ LYE      WL  A  L +     F D      F T+    +++ R  E  D
Sbjct: 478 FFIEALIALYEADFDENWLKEAKRLCDYVLLNFEDEHSPMLFYTSAAGETLIARKHEIMD 537

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
              P+ NSV   NL +L  +      D Y   AE  LA    ++K
Sbjct: 538 NVVPASNSVMAQNLHKLGLLF---DEDVYSIKAEEMLAAVLPQIK 579


>gi|116754985|ref|YP_844103.1| hypothetical protein Mthe_1697 [Methanosaeta thermophila PT]
 gi|116666436|gb|ABK15463.1| protein of unknown function DUF255 [Methanosaeta thermophila PT]
          Length = 669

 Score =  338 bits (867), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 234/694 (33%), Positives = 344/694 (49%), Gaps = 91/694 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFEDE +A++LN  FV +KVDREERPD+D +YM   Q + G GGWPL+
Sbjct: 51  STCHWCHVMARESFEDERIAEMLNRAFVCVKVDREERPDIDAIYMEACQIITGRGGWPLT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + +SPD  P    TY P + + G  G + ++  V++ W  +R  L   G   +  + +A 
Sbjct: 111 IIMSPDGIPFFAATYIPKDGRLGMMGLRELIPLVEELWRNRRSELTSLGFKVLNAMRKAD 170

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +   +SN     L +  L     +LS  +D   GGFG APKFP     Q +L+  +    
Sbjct: 171 THLQASNADESTLSRAYL-----ELSGIFDWTSGGFGRAPKFPLA---QNLLFLLRYWHR 222

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+     +  +MV  TL+ M  GGI+D +  GFHRYS D  W VPHFEKMLYDQ  ++ 
Sbjct: 223 TGE----MKALEMVELTLREMRCGGIYDQLAYGFHRYSTDSSWGVPHFEKMLYDQALMSV 278

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VYL+A+  T    Y+ +  +IL ++  D+  P G   SA DA+S          EG +Y+
Sbjct: 279 VYLEAYQATGKRDYAIVADEILGFVAEDLRSPDGAFCSALDAESDNI-------EGGYYL 331

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASK 378
           WT  ++ D LG+      E + L+P G  D            GKNVL I L    +    
Sbjct: 332 WTMDQLRDALGDDLKKALEVFVLEPIGGSD------------GKNVLRISLKGELSEFKH 379

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
              P+          RRKL D RS R +P  D+KV+  WNGL+I++F+R +++L  E   
Sbjct: 380 TSEPI----------RRKLLDARSLRRKPFRDEKVLADWNGLMIAAFSRGAQVLGDE--- 426

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         ++ +A  AA F+   ++ +    L HS++         LDDYAFLI
Sbjct: 427 -------------RWLRIASEAADFVLSSMHRDGM--LMHSYKGSRVS---ILDDYAFLI 468

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GL++LY+ G   ++L  A  L +     F D +GG Y+ T  E   ++L+ KE  DGA 
Sbjct: 469 FGLIELYQAGFDGRYLERAEILCDEMVSHFSDPDGGFYY-TMKEQSDIILQRKEIRDGAI 527

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEH--SLAVFETRLKDMAMAVPLMCCAADML 616
           PSG S++ ++++ L  I+        R + E   S+++    +  +   V L+  A D+ 
Sbjct: 528 PSGYSMATMDMLLLGKILG-------RPDLEEIASMSLRHISMASLPAQVGLL-IALDLA 579

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             PS + + +VG   +     ML A  + Y   K V+  D                 AS 
Sbjct: 580 LGPSHE-IAIVGDADNT--RTMLRALWSVYAPRKVVVSGD------------RPPEWASS 624

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            R      K  A VC  ++CS P TD  S+  LL
Sbjct: 625 LRP--VDKKATAYVCSRYTCSFPATDIRSMIELL 656


>gi|313667030|gb|ADR72969.1| DUF255 family protein [Streptomyces sp. OH-4156]
          Length = 673

 Score =  338 bits (866), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 234/703 (33%), Positives = 344/703 (48%), Gaps = 89/703 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A L+N+ FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDDATAALVNENFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD  P   GTYFPPE ++G P F  +L  VK AW  +RD + +     ++ L+   
Sbjct: 108 VFLTPDAAPFYFGTYFPPEPRHGMPSFPEVLEGVKGAWSDRRDEVGEVAERIVKDLA-GR 166

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           S +   + +P  +EL Q  L      L++ YD+  GGFG APKFP  + ++ +L H  + 
Sbjct: 167 SLAYGGDGVPGEEELAQALL-----GLTREYDATHGGFGGAPKFPPSMTLEFLLRHHAR- 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L
Sbjct: 221 --TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYAVDRAWVVPHFEKMLYDNALL 274

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y   +  T       +  +  D+L R++  P G   SA DADS   +G  R  EGA+
Sbjct: 275 CRAYAHLWKATGSDLARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGAY 332

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVWT  ++ ++LG E A L   HY +   G             F+  + +++L   + +A
Sbjct: 333 YVWTPAQLTEVLGAEDAALAAAHYGVTEDGT------------FEHGSSVLQLPREAGTA 380

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
                        +     +L   R +R RP  DDKV+ +WNGL I++ A    +     
Sbjct: 381 DA---------GRIASIAARLLAAREERERPGRDDKVVAAWNGLAIAALAETGALF---- 427

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
                       DR + +E A  AA  + R   DE   RL  + ++G +    G L+DYA
Sbjct: 428 ------------DRPDLVERATEAADLLVRVHMDESA-RLTRTSKDGRAGTNDGVLEDYA 474

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKE 552
            +  G L L        WL +A  L +    L +DR   EGG  ++T  +  +++ R ++
Sbjct: 475 DVAEGFLALAAVTGEGAWLDFAGFLLD----LVIDRFTAEGGALYDTAHDAEALIRRPQD 530

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---- 608
             D A PSG + +   L+   S  A + SD +R  AE +L V    +K +    P     
Sbjct: 531 PTDNATPSGWTAAAGALL---SYAAHTGSDAHRAAAEGALGV----VKALGPRAPRFIGW 583

Query: 609 -MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
            +  +  +L  P  + + +VG      F+ +   A  +      V+     D+EE     
Sbjct: 584 GLAVSEALLDGP--REIAVVGAPGDEAFQELRRTALLAT-APGAVLAFGAPDSEEFPLLR 640

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +    +   A          A VC++F+C  PVTDP +L   L
Sbjct: 641 DRPLVSGGPA----------AYVCRHFTCDAPVTDPDALRRKL 673


>gi|374585294|ref|ZP_09658386.1| hypothetical protein Lepil_1460 [Leptonema illini DSM 21528]
 gi|373874155|gb|EHQ06149.1| hypothetical protein Lepil_1460 [Leptonema illini DSM 21528]
          Length = 685

 Score =  337 bits (865), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 230/701 (32%), Positives = 349/701 (49%), Gaps = 81/701 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED+  A LLN+ +V+IKVDREE PDVD +YM  + A+   GGWPL++
Sbjct: 51  TCHWCHVMERESFEDQSTADLLNEHYVAIKVDREELPDVDSIYMKALHAMGQPGGWPLNL 110

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P+ GGTYFPP+  +GRP FK +L  +   W   R  L ++ +   E L+E   
Sbjct: 111 FLTPDRRPITGGTYFPPQPAHGRPSFKQMLGTLAQMWKNDRPRLLEAASSITEFLNE--- 167

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF-GSAP-KFPRPVEIQMMLYHSKKLE 198
            +A ++ LPD  P    R   E + +++D + GGF G+ P KFP  + + ++L    +L 
Sbjct: 168 QNALASDLPD--PSIFARFIGE-MEQAFDVQRGGFYGNGPNKFPPSMALMLLL----RLH 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           +  + G +S    MV  TL+ M++GGI+D +GGG  RYS D  W VPHFEKMLYD     
Sbjct: 221 ERDRQGSSSV-LVMVEKTLEAMSRGGIYDQLGGGLCRYSTDPAWLVPHFEKMLYDNALFL 279

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
               +A+ +T + FY  +  D++ YLRRD++ P G  + AEDADS   EG     EG FY
Sbjct: 280 QALTEAYRITGNDFYRRMAYDVIAYLRRDLMSPEGAFYCAEDADS---EGV----EGKFY 332

Query: 319 VWTSKEVEDILGEHAI------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           VW++ E  + L    +      L   ++ +   GN            F+GKN+L      
Sbjct: 333 VWSAAEFRETLRSSGLSDDEIRLLSLYWNVTEAGN------------FEGKNILHLTGSD 380

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
              AS+  + L     +  + R+ LF VR +R RP  DDK++ SWN L+IS+ +RAS + 
Sbjct: 381 EDFASQHSLTLTSLNEMTQKARQALFAVRERRIRPLRDDKILTSWNALMISALSRASIVF 440

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
              + + M                A + A F+  HL   Q  +L   +R+G ++    L 
Sbjct: 441 GDASLADM----------------AVACADFVESHLM--QDGQLMRRYRDGEARFKATLT 482

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIE-LQNTQDELFLDREGGGYFNTTGEDPS--VLLR 549
           D+A L   L+DL+     + ++  A+E  +      F D    G    T ED S  + LR
Sbjct: 483 DHALLGCALIDLFRVTGKSVYMRRALERAEAIMSSFFAD----GRLYETAEDDSDDLFLR 538

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
             + +DG  PSG S ++   V L+    G  +  Y + A+  L  F       A A P M
Sbjct: 539 PIDSYDGVMPSGPSAALRLFVTLSRY--GESARIYEETAKVILRQFSPEWAQAARAYPAM 596

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             A    S  +R+ + + G    +     L  +     L+   ++    D+         
Sbjct: 597 VSAFLTFSDEARE-IAITGEADFIGQALKLIGSR----LDGDAVYAFSVDS--------- 642

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +S  + +A  + S   +   +CQ+F+C  P +    L+  L
Sbjct: 643 DSPVSLIAGKDRSRSAIY--LCQDFACQTPFSSVQQLDQAL 681


>gi|398893990|ref|ZP_10646420.1| thioredoxin domain-containing protein [Pseudomonas sp. GM55]
 gi|398183122|gb|EJM70617.1| thioredoxin domain-containing protein [Pseudomonas sp. GM55]
          Length = 662

 Score =  337 bits (864), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 237/691 (34%), Positives = 337/691 (48%), Gaps = 88/691 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE+  +A+L+N+ F++IKVDR+ERPD+D +Y   VQ +  GGGWPL+V
Sbjct: 49  ACHWCHVMAHESFENPEIARLMNERFINIKVDRQERPDLDDIYQKIVQMMGQGGGWPLTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P  +P  GGTYFPP++ YGR GF  +LR + +AW   R  L Q+ A  + Q   A+ 
Sbjct: 109 FLTPRREPFFGGTYFPPQESYGRAGFPQLLRGLSEAWQNNRAALEQNVAQFL-QGYRAMD 167

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV--EIQMMLYHSKKLE 198
                   P E  Q A    A   +++ D   GG G+APKFP     ++ + LY      
Sbjct: 168 TQMLEGDTPLEQDQPA--AAARLFARNTDPVHGGLGNAPKFPNVACHDLVLRLYQRLHEP 225

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D  +S E          TL  +A GG++DH+GGGF RY VDE W VPHFEKMLYD GQL 
Sbjct: 226 DLLRSLE---------LTLDQVAAGGLYDHLGGGFARYCVDEHWAVPHFEKMLYDNGQLV 276

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            +Y DA+  T +  +  +  + +DY+ RDM  P G  +++EDADS   EG    +EG FY
Sbjct: 277 KLYADAWRATGEPAWRRVFEETIDYILRDMTHPEGGFYASEDADS---EG----EEGKFY 329

Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT  +V+ +LG+  A L  + Y +  +GN +            G  VL         A+
Sbjct: 330 VWTPAQVQAVLGDPDAALACQAYGVTASGNFE-----------HGTTVL-------HRAA 371

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
            L    E  L  L   R KL   R++R RP  D+ ++ SWN L+I     A +       
Sbjct: 372 TLDTAQEAQLAGL---RDKLLVARAQRIRPGRDENILTSWNALMIQGLCAAYQ------- 421

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                     +    +++ A  AA FI   L       L  ++R   +K PGFL+DYAFL
Sbjct: 422 ---------ATGTATHLDAARRAADFILDRLSTPDGG-LYRAWREDTAKVPGFLEDYAFL 471

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR--EGGGYFNTTGEDPSVLLRVKEDHD 555
            + LLDLYE      +L  A  L     EL L++  E G YF     +P ++ R +   D
Sbjct: 472 ANALLDLYECEFDQLYLERATRLV----ELILEKFWEDGLYFTPKDGEP-LVHRPRAPQD 526

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A PSG S SV   +RL  +   +  + YR+ AE  L ++             +  A D 
Sbjct: 527 NAWPSGTSTSVFAFLRLFEL---TGRELYRERAEQVLTMYRAAAAQNPFGFAHLLAAQDF 583

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           +       +V+ G +S+    + L A+     L   V+    A  E++            
Sbjct: 584 VQR-GPISIVIAGERSAA---SALVASLQRRYLPARVL----AFAEDVPI---------- 625

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISL 706
            A  +    +  A VC+N +C  PVT    L
Sbjct: 626 GAGRHMLKGQTSAYVCRNRTCENPVTSAAEL 656


>gi|326800931|ref|YP_004318750.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326551695|gb|ADZ80080.1| protein of unknown function DUF255 [Sphingobacterium sp. 21]
          Length = 672

 Score =  337 bits (864), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 220/699 (31%), Positives = 349/699 (49%), Gaps = 81/699 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE++ VA+++N  ++SIKVDREERPD+D++YMT VQ +   GGWPL+
Sbjct: 48  SACHWCHVMERESFENKEVAQVMNRHYISIKVDREERPDIDQIYMTAVQLMTNSGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
               PD +P+ GGTYF P D      +  +L +V+  W  + +   +      E+L++ +
Sbjct: 108 CICLPDGRPVYGGTYFRPAD------WVNVLNQVQALWANEPETAIEYA----EKLAQGI 157

Query: 140 SASAS--SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           + S +   +K+P++  ++ L+   +   +++D   GG+  APKFP P      L +    
Sbjct: 158 TESETFKISKIPEKYSEDDLKEIVKPWQQTFDPIDGGYKRAPKFPLPNNWLFFLRY---- 213

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
              G     ++  +   FTLQ +A GG++D VGGGF RY+VD +WH+PHFEKMLYD  QL
Sbjct: 214 ---GHLANDADILEHTHFTLQHIAAGGLYDQVGGGFARYAVDGQWHIPHFEKMLYDNAQL 270

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++Y +A+    +  Y  +  + L ++ R+M    G  +SA DADS   EG     EG +
Sbjct: 271 ISLYAEAYLQKPEPLYKRVVEETLQWVDREMTSAEGAFYSALDADS---EGV----EGKY 323

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           Y +   E++++LG+ A LF  ++ +   GN    +           NVL    D+   A 
Sbjct: 324 YTFQQDEIDNLLGKDADLFISYFSITAAGNWPEEKT----------NVLKTRLDADKLAE 373

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           + G   E++   L + ++K+   R +R RP LD+K++ SWN +++ ++  A +       
Sbjct: 374 QAGYSKEEWETYLKDIKKKIRHYREQRIRPGLDNKILTSWNAMMLKAYIDAYRTF----- 428

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ---THRLQHSFRNGPSKAPGFLDDY 494
                      ++KEY+ VAE  A FI R L  E+    H+ Q  F+        FLDDY
Sbjct: 429 -----------NKKEYLTVAERNAHFILRKLITEEGTLLHQPQTPFKT----ITAFLDDY 473

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           AF+I   + LYE      WL  A  L +     F DR+ G ++ T+     ++ R  E  
Sbjct: 474 AFVIEAFIALYEVTFNKAWLDQAKSLADYTLAQFYDRQAGAFYYTSDLTEVLITRKFEIM 533

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D   PS NSV    L +L  I   S    Y++ A   LA    +++    A      A  
Sbjct: 534 DNVIPSSNSVMAHQLNKLGVIFEDST---YKEIAAQLLANVFPQIRTYGSAYS--NWAIR 588

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           +L      H + +    S D    +A     Y  NK ++       EE          N 
Sbjct: 589 LLEEVYGFHEIAITGPQSNDLR--IAIDQKIYSPNKVIL----GGVEE----------NL 632

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
            + RN  + ++ +  VC+N +CS PV +   +ENL+L++
Sbjct: 633 PLLRNRVT-ERSLIYVCKNNTCSLPVDNLKDVENLILKQ 670


>gi|302652658|ref|XP_003018175.1| hypothetical protein TRV_07811 [Trichophyton verrucosum HKI 0517]
 gi|291181788|gb|EFE37530.1| hypothetical protein TRV_07811 [Trichophyton verrucosum HKI 0517]
          Length = 511

 Score =  337 bits (863), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 197/514 (38%), Positives = 292/514 (56%), Gaps = 32/514 (6%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESF    VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+VFL+PDL+
Sbjct: 1   MEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLE 60

Query: 88  PLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           P+ GGTY+P  +    P        GF  +L K++D W+ ++    +S      QL E  
Sbjct: 61  PVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEITRQLREFA 120

Query: 140 S-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
                 +  + ++  ++L  + L       +  YD+  GGF  +PKFP PV +  +L  S
Sbjct: 121 EEGIHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVNLSFLLRLS 180

Query: 195 KKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           +  E   D     E ++  +M + T+  +A+GGI D +G GF RYSV   W +PHFEKML
Sbjct: 181 RYPEEVMDIVGREECAKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWSLPHFEKML 240

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGAT 310
           YDQ QL +V++D F  + +        D++ Y+    ++ P G  +S+EDADS  +   T
Sbjct: 241 YDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSTPILSPMGCFYSSEDADSQPSPEDT 300

Query: 311 RKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            K+EGA+YVWT KE++ ILG+  A +   H+ + P GN  ++R++DPH+EF  +NVL   
Sbjct: 301 EKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFMNRNVLRIA 358

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARA 428
              +  A + G+  E+ + IL   R KL + R +KR RP LDDK+IV+WNGLVI + ++ 
Sbjct: 359 TTPAQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGLVIGALSKC 418

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-NGPSKA 487
           + +L+           +     K    +A +A  FI+ +L+D ++ +L   +R +     
Sbjct: 419 AILLED----------IDAEKSKHCRLMAGNAVKFIKENLFDAESGQLWRIYRADSRGDT 468

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
           PGF DDYA+LISGLL LYE       L +A +LQ
Sbjct: 469 PGFADDYAYLISGLLQLYEATFDDAHLQFADKLQ 502


>gi|374852688|dbj|BAL55616.1| hypothetical conserved protein [uncultured gamma proteobacterium]
          Length = 723

 Score =  337 bits (863), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 215/566 (37%), Positives = 306/566 (54%), Gaps = 63/566 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    +  +  FL    ++CHWCHVME ESFEDE +A +LN  FV +K+DRE+RPD
Sbjct: 34  GEEAFAKARREAKPIFLSSGYSSCHWCHVMERESFEDEEIAAILNRDFVPVKLDREQRPD 93

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD VYM  VQ L G GGWPLS FL+PD +P  GGTYFPP+       FK +L++V +AW 
Sbjct: 94  VDAVYMHAVQLLTGHGGWPLSAFLTPDGRPFFGGTYFPPQ------AFKRLLQQVAEAWR 147

Query: 119 KKR-DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
            +R ++ AQ+     E+L +AL    S++  P E+    +     ++   +D R GGFG+
Sbjct: 148 SRRAEIEAQA-----ERLKQALLELESTH--PGEIGPETVEAAIAEILAPFDPRHGGFGA 200

Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
           APKFP    + +++       D    G+  +  ++V  TL  MA+GG+ D +G GFHRY 
Sbjct: 201 APKFPNEPWLALLI-------DELWRGDDPKVLEVVRKTLDAMARGGLCDQIGDGFHRYC 253

Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
           VD  + +PHFEKMLY+Q QL  +Y  A +LTKD  ++Y  R   D++ R++  P G  ++
Sbjct: 254 VDAAFQIPHFEKMLYNQAQLGRLYARAAALTKDALFAYAARCTFDFVLRELTAPEGGFYA 313

Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDP 356
           A DADS   EG    +EG FY+WT +E+   L  + A L  E + +  +GN         
Sbjct: 314 AIDADS---EG----EEGKFYLWTPEEIRAALPKDDAELAIELFGVSASGN--------- 357

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
              F+GKNVL      +  A   GM  E+ L  L   R++L+ VR +R  P  DDK++ +
Sbjct: 358 ---FEGKNVLHLPRPLAEIAQAKGMTEEELLACLDRIRQRLYQVRRRRVPPLRDDKIVTA 414

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNG++I++ A A++         +F+ P       +Y+  A  AA F+ RH    Q  RL
Sbjct: 415 WNGMMIAALAEAAR---------LFHEP-------KYLLAARRAAEFLSRHHL--QGERL 456

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
             + RNG     G  +DYAFL  G L LY+  +   WL  A  L       F D   G  
Sbjct: 457 LRASRNGRPAGEGLQEDYAFLAEGFLALYDVSADPVWLQEAEALTAAMLAQFWDEARGAC 516

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGN 562
           F     D  + +R K+  DGA PSGN
Sbjct: 517 FMNRA-DERLAVRPKDLFDGAYPSGN 541


>gi|114326678|ref|YP_743835.1| thymidylate kinase [Granulibacter bethesdensis CGDNIH1]
 gi|114314852|gb|ABI60912.1| thymidylate kinase [Granulibacter bethesdensis CGDNIH1]
          Length = 679

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 232/691 (33%), Positives = 330/691 (47%), Gaps = 96/691 (13%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+  A  +N+ F+ IKVDREERPD+D +YM+ + A+   GGWPL++F
Sbjct: 62  CHWCHVMAHESFEDQATADEMNNAFICIKVDREERPDIDHIYMSALHAMGQQGGWPLTMF 121

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+P+ +P  GGTYFPPE ++GRP F+ +L  ++DAW  +R  + Q+    + QL+ A++ 
Sbjct: 122 LTPEGQPFWGGTYFPPEPRFGRPSFRQVLAAIRDAWATRRSAIEQN----LGQLTRAMNR 177

Query: 142 SASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            + +   P  D L  NA+      L ++ D   GGF  APKFP      +  +  ++   
Sbjct: 178 LSETAAGPEVDVLLLNAVDAA---LLRNLDPEKGGFTGAPKFP---NAPVFRFFWQEFHR 231

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG+     E    V   L  MA+GGI+DH+GGGF RYS D  W VPHFEKM YD GQ+  
Sbjct: 232 TGR----PELSDAVHAVLSHMARGGIYDHLGGGFARYSTDAEWLVPHFEKMAYDNGQILE 287

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGP---GGEIFSA-EDADSAETEGATRKKEG 315
           +    ++      Y+    + + +L RDM  P   GG  F+A EDADS   EG    +EG
Sbjct: 288 LLSLGYAQNPTPLYARCIEETVGWLIRDMSVPVEGGGTAFAASEDADS---EG----EEG 340

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FY+W   E++ +LGE A  FK+ + +   GN            ++G  +L  L  S   
Sbjct: 341 RFYIWHEDEIDALLGEAATGFKQAFDVTREGN------------WEGHTILRRLTISP-- 386

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                   E       + RR LF  R  RPRP  DDKV+  WNGLVI    RA+  L   
Sbjct: 387 --------EADAESWAQERRILFQSRENRPRPGRDDKVLADWNGLVIVGLVRAAIAL--- 435

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        DR +++  AESA   +R  L  E   R+ H++R G   A G LDD A
Sbjct: 436 -------------DRADWLSAAESAYEAVRAALGSEDG-RIAHAWRLGRITAAGLLDDQA 481

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +I   L LYE     ++L  A+ L  +    F    G  Y      D   L R     D
Sbjct: 482 SMIRAALSLYEATGQERYLSDAVTLAQSARSFFSSETGAFYTTAHDADDVPLTRPCTASD 541

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A PSGN +    L RL  +    +   + + A   +  F  R + +A + P +  AAD+
Sbjct: 542 NAVPSGNGMMADALARLYHLTGEQR---WYEAASGLIRAFTGRPQSLA-SSPYLLMAADL 597

Query: 616 LSVPSRKHVVLV-GHKSSVDFENM----LAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
           L   +R  +V + G       ++M    LA    S  + +  +H  P             
Sbjct: 598 L---TRGTLVSIHGQADDPHLQSMVREVLALGDPSVLVCRKPLHAAPDR----------- 643

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVT 701
                  + +  A     LVC+   CS P+T
Sbjct: 644 -------QTDHVAQTFFVLVCRQTLCSAPLT 667


>gi|408529633|emb|CCK27807.1| hypothetical protein BN159_3428 [Streptomyces davawensis JCM 4913]
          Length = 682

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 230/701 (32%), Positives = 334/701 (47%), Gaps = 86/701 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFEDE  A  LN+ FV++KVDREERPDVD VYM  VQA  G GGWP++V
Sbjct: 55  SCHWCHVMAHESFEDEATAAYLNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTV 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P   GTYFPP  ++G P F+ +L  V+ AW  +RD +A+     +  L+E   
Sbjct: 115 FLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVQQAWTGRRDEVAEVAGKIVRDLAEREI 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           +   S    +E    AL      L++ YD++ GGFG APKFP  + I+ +L H  +   T
Sbjct: 175 SYGDSQAPGEEELAGALL----GLTREYDAQRGGFGGAPKFPPSMVIEFLLRHHAR---T 227

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  V
Sbjct: 228 GSEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRV 283

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y   +  T       +  +  D++ R++    G   SA DADS   +G  +  EGA+YVW
Sbjct: 284 YAHLWRSTGSELARRVALETADFMVRELRTNEGGFASALDADS--DDGTGKHVEGAYYVW 341

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T ++  ++LG+ A    +++ +   G  +                          AS L 
Sbjct: 342 TPQQFREVLGDDAERAAQYFGVTEEGTFE------------------------EGASVLQ 377

Query: 381 MPLEKYLNI---LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +P  + L +   +   R +L   R++RP P  DDKV+ +WNGL I++ A           
Sbjct: 378 LPQHEGLFVAEKVASVRERLLAARAERPAPGRDDKVVAAWNGLAIAALAETGAYF----- 432

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                      DR + +E A  AA  + R   DE     + S         G L+DYA +
Sbjct: 433 -----------DRPDLVEAAVCAADLLVRLHLDEHVQIARTSKDGQVGANAGVLEDYADV 481

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L L        WL +A  L +     F+D   G  ++T  +   ++ R ++  D A
Sbjct: 482 AEGFLALASVTGEGVWLEFAGFLLDHVLARFVDERSGALYDTAVDAERLIRRPQDPTDNA 541

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCA 612
            PSG + +   L+   S  A + ++ +R  AE +L V    +K +   VP      +  A
Sbjct: 542 APSGWTAAAGALL---SYAAQTGAEPHRAAAERALGV----VKALGPRVPRFIGWGLAAA 594

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN---KTVIHIDPADTEEMDFWEEH 669
              L  P  K V +VG   ++D +    A H +  L      V+     D++E+      
Sbjct: 595 EAWLDGP--KEVAVVG--PALD-DPATRALHRTALLGIAPGAVVAAGTPDSDELPL---- 645

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +A       +  A VC+NF+C  P TDP  L   L
Sbjct: 646 ------LAGRPLVGGEPAAYVCRNFTCDAPTTDPERLRAAL 680


>gi|409122619|ref|ZP_11222014.1| thioredoxin domain-containing protein [Gillisia sp. CBA3202]
          Length = 620

 Score =  336 bits (861), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 197/560 (35%), Positives = 305/560 (54%), Gaps = 63/560 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFEDE VA+++N  + +IKVDREERPDVD VYM+ VQ + G GGWP+++ 
Sbjct: 55  CHWCHVMEHESFEDEDVAEIMNTHYYNIKVDREERPDVDMVYMSAVQIMTGSGGWPMNIV 114

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS--EAL 139
             PD +P+ GGTYF  ED      +K  L ++   + +  + L +      E L   + +
Sbjct: 115 ALPDGRPVWGGTYFRKED------WKNSLLQIAKLYKENPEKLYEYADKLNEGLKNIQLI 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM-----LYHS 194
           ++S S N +        L L +E+L K++D ++GG    PKF  P   + +     LY+ 
Sbjct: 169 ASSKSENDID-------LNLISEKLEKNFDWQYGGTKQTPKFVIPSNFEFLLKYSQLYNH 221

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
           K ++D             V  +L  ++ GGI+DH+ GGF RYSVDE+WH+PHFEKMLYD 
Sbjct: 222 KNIKD------------FVKLSLTKISFGGIYDHIEGGFSRYSVDEKWHIPHFEKMLYDN 269

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
            Q+ ++Y  A+++TK  +Y  +    L+++  ++    G  +S+ DADS +  G  R  E
Sbjct: 270 AQMVSLYSKAYAVTKIGWYREVVEQTLEFIENNLKTKEGSFYSSLDADSIDKNGKLR--E 327

Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           GAFY W   E++++L +   LFKE+Y +   G  +        NE+    VLI   D ++
Sbjct: 328 GAFYTWEVDELKELLKDEFSLFKEYYNVNSYGKWE-------DNEY----VLIRTEDEAS 376

Query: 375 SASKLGMPLEKYLNILGECRRKL-FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
             +K  +   ++  I       L  + R+KR +P LDDK + SWN L++S +  A KI  
Sbjct: 377 FLNKNQLDSMEFKAIKAHWLEVLSSEERNKREKPRLDDKQLTSWNALMLSGYVDAYKI-- 434

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                         +  K+Y+  A   A+FI+ HLY  + + L  SF+NG S   G+L+D
Sbjct: 435 --------------TQNKDYLATALQNATFIQEHLYKSEGN-LHRSFKNGISSINGYLED 479

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAF I   + LYE     +WL ++ +L +   ++F + E G ++ T+ +D  ++ R  E 
Sbjct: 480 YAFTIEAFIKLYEITLDFEWLHFSKKLMDYSIQIFYEPETGLFYFTSKQDKPLITRNYEL 539

Query: 554 HDGAEPSGNSVSVINLVRLA 573
            D   P+ NSV   NL +L+
Sbjct: 540 SDNVIPASNSVMAQNLFKLS 559


>gi|29829838|ref|NP_824472.1| hypothetical protein SAV_3296 [Streptomyces avermitilis MA-4680]
 gi|29606947|dbj|BAC71007.1| hypothetical protein SAV_3296 [Streptomyces avermitilis MA-4680]
          Length = 675

 Score =  336 bits (861), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 235/700 (33%), Positives = 339/700 (48%), Gaps = 82/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE  A  LN+ FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 47  SSCHWCHVMAHESFEDETTAAYLNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPPE ++G P F+ +L  V+ AW  +RD +A+     +  L+   
Sbjct: 107 VFLTPDAEPFYFGTYFPPEPRHGMPSFRQVLEGVRSAWTDRRDEVAEVAGKIVRDLAGRE 166

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +S   SS    +EL Q  L      L++ YD+R GGFG APKFP  + ++ +L H  +  
Sbjct: 167 ISYGDSSTPGEEELAQALL-----GLTRDYDARRGGFGGAPKFPPSMVVEFLLRHHAR-- 219

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 220 -TGSEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 274

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS   +G+ R  EGA+Y
Sbjct: 275 RVYAHLWRATGSELARRVALETADFMVRELRTGEGGFASALDADS--DDGSGRHVEGAYY 332

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
           VWT +++E  LG E A L    + +   G  +           +G +VL +   D    A
Sbjct: 333 VWTPEQLEQALGREDAELAARCFGVTRDGTFE-----------EGASVLQLPQQDVVFDA 381

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            +           +   R +L   R++RP P  DDKV+ +WNGL I++ A          
Sbjct: 382 ER-----------IASVRARLLGRRAERPAPGRDDKVVAAWNGLAIAALAETGAYF---- 426

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
                       DR + +E A  AA  + R   DE   RL  + ++G + A  G L+DY 
Sbjct: 427 ------------DRPDLVEAAIGAADLLVRLHLDEHA-RLARTSKDGRAGAHAGVLEDYG 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +     F D E G  ++T  +   ++ R ++  D
Sbjct: 474 DVAEGFLALASVTGEGVWLEFAGFLLDHVLAQFTDPESGALYDTAADAEKLIRRPQDPTD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A PSG S +   L+   S  A + ++ +R  AE +L V    +K +    P      + 
Sbjct: 534 NATPSGWSAAAGALL---SYAAHTGAEPHRTAAERALGV----VKALGPRAPRFVGWGLA 586

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A  +L  P  + V +VG               A+  L++T + +  A    +      +
Sbjct: 587 VAEALLDGP--REVSVVGPADD----------PATGTLHRTAL-LGTAPGAVVAVGTPGS 633

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                +A          A VC+NF+C  P+TD   L   L
Sbjct: 634 DEFPLLADRPLVGGGPAAYVCRNFTCDAPITDADRLRTAL 673


>gi|189424638|ref|YP_001951815.1| hypothetical protein Glov_1579 [Geobacter lovleyi SZ]
 gi|189420897|gb|ACD95295.1| protein of unknown function DUF255 [Geobacter lovleyi SZ]
          Length = 610

 Score =  335 bits (860), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 209/560 (37%), Positives = 294/560 (52%), Gaps = 66/560 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFED+ VA +LN  FV +KVDREERPD+D+  M   Q+L   GGWPL+ 
Sbjct: 72  TCHWCHVMAHESFEDDEVADILNHAFVPVKVDREERPDLDEFCMAACQSLTNSGGWPLNC 131

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL PD  P    TY P E K G PGF  +L  +   W  K++ + ++    +E L + ++
Sbjct: 132 FLKPDGTPFYALTYLPKEPKRGMPGFLELLENIARVWQHKQEAVERNARSLMEALGQ-MA 190

Query: 141 ASASSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           A+      PD  EL  +A+      L K +D R+ GFG APKFP P  +  +L    ++E
Sbjct: 191 AAPVQTTAPDLKELADSAV----ATLRKIHDPRYHGFGKAPKFPMPPYLLFLLGRDNRIE 246

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                      Q++ L TLQ M +GGI D +GGG HRYS D+ W VPHFEKMLYDQ  +A
Sbjct: 247 -----------QELALNTLQAMRQGGIWDQLGGGIHRYSTDQHWLVPHFEKMLYDQALVA 295

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
              L A++LTK+  Y  +  ++L+++  ++  P G  +   DADS   EG    +EGA Y
Sbjct: 296 YTALKAYALTKENRYLEMADNLLEFVLAELTAPEGGFYCGLDADS---EG----REGACY 348

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VW  +E+E ILG+ A  F ++Y +   GN           E  G+NVL +   ++   + 
Sbjct: 349 VWKKQELEQILGDQAAFFCQYYGVTEQGNF----------EEPGENVLFQALPAAEEPAA 398

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           +               +KL  VR+ R +P  D KV+  WNGL+I++ AR + +       
Sbjct: 399 IKA-----------AGQKLLQVRAMRQQPLRDLKVLSGWNGLMIAALARGAAL------- 440

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                    ++ + ++E A  AA+FI   L      RL  S+   PS   GFL+DYAFL 
Sbjct: 441 ---------TNNRRWLEAARRAATFISSAL-TRADGRLLRSWCGTPSTIAGFLEDYAFLG 490

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGA 557
            G L+L++ G     L  A +L   +D L L R       T G D   L L + ++HDG 
Sbjct: 491 WGYLELFKAGGDAADLATAEQL--CRDALHLFRTEDERLVTAGNDQEQLPLALSDNHDGV 548

Query: 558 EPSGNSVSVINLVRLASIVA 577
            PSG +  V+NLV LA   A
Sbjct: 549 IPSGPAALVMNLVALAKCTA 568


>gi|75674298|ref|YP_316719.1| hypothetical protein Nwi_0099 [Nitrobacter winogradskyi Nb-255]
 gi|74419168|gb|ABA03367.1| Protein of unknown function DUF255 [Nitrobacter winogradskyi
           Nb-255]
          Length = 676

 Score =  335 bits (858), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 228/691 (32%), Positives = 335/691 (48%), Gaps = 74/691 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED+ VA ++N+ FV IKVDREERPD+D++YM+ +  L   GGWPL++
Sbjct: 59  ACHWCHVMAHESFEDDDVAAVMNELFVCIKVDREERPDIDQIYMSALHHLGEQGGWPLTM 118

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FLSPD  P  GGTYFP    +GRP F  +L+ V   +  + D +A+     I +LSE   
Sbjct: 119 FLSPDGSPFWGGTYFPKLPDFGRPAFTDVLQSVARVFRDQPDQIARHRDTLIARLSE--- 175

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              ++ K P  L    L   A  + +S D   GG   APKFP+   ++++     +  D 
Sbjct: 176 --RATTKSPANLGVAELNNAAVAIMRSTDPVNGGLRGAPKFPQCSVLELLWRAGARTRDD 233

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                 +        TL  M++GGI+DH+GGG+ RYSVD+RW VPHFEKMLYD  Q+ ++
Sbjct: 234 RFFAATT-------LTLTRMSQGGIYDHIGGGYARYSVDDRWLVPHFEKMLYDNAQILDL 286

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
               ++ +K+  Y     + +D+LRR+M+   G   S+ DADS   EG    +EG FYVW
Sbjct: 287 LALDYARSKNPLYRERAIETVDWLRREMLTAEGGFASSLDADS---EG----EEGRFYVW 339

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           +  E++D+LG          Y   T N +  R + P N  K  +V    ND SA    L 
Sbjct: 340 SLSEIDDVLGAADAADFAARY-DITANGNFERRNIP-NRLKSIDV---ANDDSAHMRAL- 393

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
                        R+KL   R  R RP LDDK++  WNGL+I++    + +         
Sbjct: 394 -------------RKKLLVRRESRVRPGLDDKILADWNGLMIAALVHGACVF-------- 432

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   D+ +++ +A +A  FIR  +   +  RL HS+R G    P    DYA +   
Sbjct: 433 --------DKPDWLRIARAAYDFIRTMM--TRDGRLGHSWREGRLLIPALASDYATMARA 482

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            L L+E      +L  A+  Q+T D  + D   GGY+ T  +   +++R     D A P+
Sbjct: 483 ALALFEATGDGTFLEQALRWQSTLDTHYADAAHGGYYLTADDAEGLIVRPHSSEDDAIPN 542

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
            + V   NLVRLA++   +K   +R   +   A    R  +       +  A D+    +
Sbjct: 543 HDGVIAQNLVRLAALTGDAK---WRDRIDSHFAALLPRATEKGFGQLSLMNALDLRLTGA 599

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
               ++V  + +     + AA    Y     V+H   AD    D            AR  
Sbjct: 600 E---IVVAGEDAQAAALLGAARKLPY-ATSIVLHAPHADALPADH----------PARAK 645

Query: 681 FSA-DKVVALVCQNFSCSPPVTDPISLENLL 710
            SA  +  A +C+  SCS PVT P +L  L+
Sbjct: 646 LSAVAQSAAFICRGQSCSLPVTQPDALNELM 676


>gi|344940058|ref|ZP_08779346.1| hypothetical protein Mettu_0287 [Methylobacter tundripaludum SV96]
 gi|344261250|gb|EGW21521.1| hypothetical protein Mettu_0287 [Methylobacter tundripaludum SV96]
          Length = 754

 Score =  335 bits (858), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 217/602 (36%), Positives = 315/602 (52%), Gaps = 61/602 (10%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    K  +   L    +TC+WCHVME E FE+  +AKL+N+  VSIK+DRE+RPD
Sbjct: 36  GEEAFAKARKENKPILLSIGYSTCYWCHVMEREIFENPEIAKLMNESIVSIKIDREQRPD 95

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD +YMT  Q +   GGWP +VF++PDLKP   GTYFPP        F ++++++   W 
Sbjct: 96  VDDLYMTATQMMTHSGGWPNNVFVTPDLKPFYAGTYFPP------AAFSSLIQQIHYIWM 149

Query: 119 KKRDML---AQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
           + +  L   A+  A AI ++ +    +A S+ LP      AL       S  YD+R GGF
Sbjct: 150 QDQVPLKAQAERLASAIIRIKQQ-ENNAQSSSLPGSRLVEAL---ISHFSDYYDNRLGGF 205

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
             APKFP   +  + L  + +L       E + G      TL+ MA+GGIHDHVGGGFHR
Sbjct: 206 YQAPKFPNE-DALLFLLEAYRLTSNNTCLEMARG------TLEKMAEGGIHDHVGGGFHR 258

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           Y+ D +W +PHFEKMLY+Q  L   Y + ++L+       +   I D+  R M    G  
Sbjct: 259 YATDAQWRIPHFEKMLYNQALLGRAYTELYALSNKPDDRVVAEGIFDFTLRQMTHKDGGF 318

Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRM 353
           +SA DA+       T   EGA+Y WT  E++D L    +A L K HY     G  ++ ++
Sbjct: 319 YSALDAE-------TDAVEGAYYAWTDAELQDALDTDSYAWLMK-HY-----GLAEIPKI 365

Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
              H    G+ VL  +   S SA+  G+  E  +         L + R KR  PHLD+K+
Sbjct: 366 PG-HKHVDGR-VLYLIQPLSESATAEGLSYEDAVKKQQAVMTSLRESRDKRKLPHLDNKI 423

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
           I SWNGL+I +FARA   ++                + EY E +  AA FI  +L  +Q 
Sbjct: 424 ITSWNGLMIDAFARAGLCMR----------------KLEYTEASRRAADFILANL-RKQD 466

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
             L  ++R+G ++   + +DYAF+I GL+ +Y      ++L  A EL     +LF D + 
Sbjct: 467 GSLYRTWRDGQAEISAYFEDYAFMIQGLVSIYRAAKDNRYLQAAKELAAKAKQLFWDEKH 526

Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           GGY+ T G +  +L+R+K   D A PSGN+V    L+ L  I   ++   ++Q AE  L 
Sbjct: 527 GGYYFTDGSE-LLLVRMKNAVDSAIPSGNAVMAQALLDLYEITGDAE---WKQQAEALLI 582

Query: 594 VF 595
            F
Sbjct: 583 AF 584


>gi|339325405|ref|YP_004685098.1| hypothetical protein CNE_1c12630 [Cupriavidus necator N-1]
 gi|338165562|gb|AEI76617.1| hypothetical protein CNE_1c12630 [Cupriavidus necator N-1]
          Length = 666

 Score =  335 bits (858), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 238/701 (33%), Positives = 340/701 (48%), Gaps = 104/701 (14%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESFE+  +A L+N+ F+SIKVDR+ERPD+D +Y    Q +  GGGWPL+V
Sbjct: 49  TCHWCHVMAHESFENPRIAALMNERFISIKVDRQERPDLDDIYQKVPQLMGQGGGWPLTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P  +P  GGTYFPP+D+YGRPG   +L  + +AW  +R  L  +    IEQ  +   
Sbjct: 109 FLTPQGEPFYGGTYFPPDDRYGRPGLPRVLLSLSEAWRHRRQELRDT----IEQFQQGFR 164

Query: 141 A----------SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
                      +  + ++ D   Q AL      L+++ D   GG G APKFP      ++
Sbjct: 165 HLDEGVLSREDAEQAAEVQDLPAQTAL-----ALARNTDPTHGGLGGAPKFPNASAYDLV 219

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           L   ++  +                TL  MA GGIHD +GGGF RYSVDERW VPHFEKM
Sbjct: 220 LRICQRTHEPALLDALER-------TLDGMAAGGIHDQLGGGFSRYSVDERWAVPHFEKM 272

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD GQL  +Y +A+ LT    +  +    + Y+ RDM  P G   + EDADS   EG  
Sbjct: 273 LYDNGQLVTLYANAYRLTGKQAWRRVFEGTIAYILRDMTHPDGGFHAGEDADS---EG-- 327

Query: 311 RKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
             +EG FYVWT+ EV+ +LGE    L    Y +   GN +            G++VL   
Sbjct: 328 --EEGRFYVWTAAEVKAVLGESEGALACRAYGVTEGGNFE-----------PGRSVL--- 371

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
                 A  L  PLE+    L   R +L   R++R RP  DD ++  WNGL+I     A 
Sbjct: 372 ----HRAVTL-TPLEE--ARLEGWRERLLAARARRVRPGRDDNILAGWNGLMIQGLCAAY 424

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKA 487
           +   + A                ++  A  AASF++  L   D   +R    ++NG  K 
Sbjct: 425 QATGNPA----------------HLAAARRAASFVQDKLTMPDGGVYRY---WKNGTVKV 465

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSV 546
           PGFL+DYAFL + L+DLYE     ++L  A EL      L +DR  G G + T  +   +
Sbjct: 466 PGFLEDYAFLANALIDLYESCFDRRYLDRAAELVT----LIIDRFRGDGLYFTPNDGEPL 521

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           + R +  +DGA PSG S SV   +RL  +   +  D YR  AE     +           
Sbjct: 522 IHRPRGPYDGAWPSGISASVFAFLRLHEL---TGEDRYRDLAEQEFQRYRAAATAAPAGF 578

Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
             +  AAD     +   ++L G K++     ++ + H +Y L   V+             
Sbjct: 579 VHLLAAADFAQRGAFG-IILAGDKAAA--AALVESVHRTY-LPARVLAF----------- 623

Query: 667 EEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISL 706
               + +  + +     D +  A VC++ +C+ PVT   +L
Sbjct: 624 ----AEDVPVGQGRLPVDGRPAAYVCRHRTCTAPVTSGQAL 660


>gi|427427562|ref|ZP_18917606.1| Thymidylate kinase [Caenispirillum salinarum AK4]
 gi|425883488|gb|EKV32164.1| Thymidylate kinase [Caenispirillum salinarum AK4]
          Length = 678

 Score =  335 bits (858), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 211/574 (36%), Positives = 290/574 (50%), Gaps = 64/574 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED   A ++ND F++IKVDREERPDVD +YM+ +Q +   GGWPL++
Sbjct: 51  ACHWCHVMAHESFEDAETAAVMNDLFINIKVDREERPDVDAIYMSALQLMGQRGGWPLTM 110

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P  GGTYFP +  +GRPGFK +LR+V DA+ +  + ++ +    ++ L + L+
Sbjct: 111 FLTPDGEPFWGGTYFPKDSAFGRPGFKDVLRQVADAYHQSPEKVSNNTGALVDALRKGLN 170

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              SS   P  L    +   AE L+   D  +GG   APKFP       +    +    T
Sbjct: 171 LPQSSEP-PAALALPVVDQLAESLAGHVDPEWGGLRGAPKFPVVFAFDALW---RSWHRT 226

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+     E    VL TL  + +GGI+DH+GGGF RYS D +W VPHFEKMLYD  QL ++
Sbjct: 227 GR----QELHDAVLLTLDRLCQGGIYDHLGGGFARYSTDAQWLVPHFEKMLYDNAQLIDL 282

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
               +  T+         + +D+L R+MI   G   S+ DAD   TEG    +EG FYVW
Sbjct: 283 MTSVWQETRSPLLQARVEETVDWLEREMIAENGAFASSLDAD---TEG----EEGRFYVW 335

Query: 321 TSKEVEDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL----IELNDSSA 374
           T  E++ +LG    A LFK  Y ++P GN            ++GK VL     ++ D  A
Sbjct: 336 TKDEIDRVLGTDADAALFKRAYDVRPGGN------------WEGKTVLNRNFSDVGDEPA 383

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
             +K           L   R  L   R KR  P  DDKV+  WNGL+I + ARA      
Sbjct: 384 LETK-----------LYRARMLLLRERDKRVMPGRDDKVLADWNGLMIHALARA------ 426

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
               A F  P       E++++A SA   IR  +      RL HSFR G  +    LDDY
Sbjct: 427 ---GAAFGRP-------EWVDLARSAYDGIRDTM-SRPGDRLGHSFRKGRLQDVAMLDDY 475

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +    L L++      ++  A       D  + D   GGYF T  +   ++LR K   
Sbjct: 476 ANMARAALTLHQVTGVADFIDHASRWVAVLDAEYWDDAAGGYFLTAADATDLILRTKSAQ 535

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
           D A PSGN    + L  L  +    +   YR+ A
Sbjct: 536 DNATPSGNGTMAVVLATLWHLTGEER---YRRRA 566


>gi|367469960|ref|ZP_09469682.1| Thymidylate kinase [Patulibacter sp. I11]
 gi|365814937|gb|EHN10113.1| Thymidylate kinase [Patulibacter sp. I11]
          Length = 685

 Score =  335 bits (858), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 231/699 (33%), Positives = 327/699 (46%), Gaps = 71/699 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED   A ++N  FV +KVDREERPDVD + M  VQA+ G GGWPL+
Sbjct: 48  SACHWCHVMAHESFEDPATASVMNAHFVCVKVDREERPDVDAICMEAVQAITGQGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P+ GGTYFPP+ + G P ++ +L  V +AW ++   + +  +   ++LS A 
Sbjct: 108 VFLTPEQQPIHGGTYFPPQPRQGMPSWRMVLDAVAEAWRERSGEIREQLSDVADRLSGAS 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             + +      EL   A+R     L + YDS  GGFG APKFP    +  +L  +     
Sbjct: 168 RLTPADAVPGPELLDAAVR----GLGERYDSVQGGFGGAPKFPPHPSLLFLLQRAADERP 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
              SG A     M   TL+ MA GGI+D +GGGF RY+VD  W VPHFEKMLYD   LA 
Sbjct: 224 GEDSGTAGRAAAMARHTLRSMASGGINDQIGGGFARYAVDGTWTVPHFEKMLYDNALLAR 283

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y++ F L  D          L +L  ++ GP G   SA DADS   EG     EG FYV
Sbjct: 284 AYVEGFRLWGDERLRETAERTLAFLADELRGPEGGFLSALDADS---EGV----EGRFYV 336

Query: 320 WTSKEVEDIL----GEHAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           WT ++V   L     E AI +    EH   +        R   P +E             
Sbjct: 337 WTPEQVRAALSSADAEAAIAWLGVTEHGNFEDGATVLEDRGERPDDE------------- 383

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
                            +   R  L   RS+R RP  DDK +  WNGL I +FA AS +L
Sbjct: 384 ----------------TVARIRAGLLAARSQRIRPGTDDKRVAGWNGLAIHAFAEASAVL 427

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
             E      +   V      ++    +    +RR   D +T     S   G ++    L+
Sbjct: 428 GRE------DLLEVARRAAAFVRRDLTVDGRLRRTWSDRETAGADTSGHGGRARHAAVLE 481

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           D+ FL+   + L+E G   + L WA EL +T    F D E G +F T  +  ++L+R KE
Sbjct: 482 DHGFLLEAAVALFEAGGDPEDLAWARELADTILNRFADPERGAFFATADDAEALLVRRKE 541

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D   PSG + +   L+RLA++   ++   Y   A+  L +  T  + +  AV     A
Sbjct: 542 LDDAPIPSGGASASRGLLRLAALTGEAR---YADAADGWLRLAATVAERIPQAVAYALLA 598

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
            D    P R+ V +VG  ++      +    +   L   V              +  +  
Sbjct: 599 LDERHRPPRE-VAIVGPPAARAALVAVVRERSRPGLVLAV-------------GDGLDDR 644

Query: 673 NASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
             ++ R   + D +  A VC+ FSC  PVT+P +L   L
Sbjct: 645 GVALLRGRPTVDGQATAYVCERFSCRAPVTEPDALRAAL 683


>gi|92115739|ref|YP_575468.1| hypothetical protein Nham_0107 [Nitrobacter hamburgensis X14]
 gi|91798633|gb|ABE61008.1| protein of unknown function DUF255 [Nitrobacter hamburgensis X14]
          Length = 682

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 232/696 (33%), Positives = 338/696 (48%), Gaps = 74/696 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+ VA ++N+ FV IKVDREERPD+D++YM  +  L   GGWPL++F
Sbjct: 60  CHWCHVMAHESFEDDEVAAVMNELFVCIKVDREERPDIDQIYMNALHLLGEQGGWPLTMF 119

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           LSPD  P  GGTYFP    +GRP F  +L+ V   +  K + +  +    I +LSE    
Sbjct: 120 LSPDGSPFWGGTYFPKLPDFGRPAFTDVLQSVARVFHDKPERVTLNRDAVIARLSERAKV 179

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            + +N     L    L   A  +++S D   GG   APKFP+   ++        L   G
Sbjct: 180 GSPAN-----LGVAELNTAAVSIARSTDPVNGGLHGAPKFPQCSVLEF-------LWRAG 227

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               +         TL  M++GGI+DH+GGG+ RYSVD+RW VPHFEKMLYD  Q+ ++ 
Sbjct: 228 ARTGSDRFYAATTLTLTQMSQGGIYDHLGGGYARYSVDDRWLVPHFEKMLYDNAQILDLL 287

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
              ++ +K+  Y     + + +L R+M+   G   S+ DADS   EG    KEG FYVW+
Sbjct: 288 ALDYARSKNPLYRERAIETVAWLLREMLTGEGGFASSLDADS---EG----KEGKFYVWS 340

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
             E+E++LG   A  F   Y +   GN            F+G+N+   L  SS   S  G
Sbjct: 341 LSEIEEVLGATDAADFAARYDITANGN------------FEGRNIPNRLK-SSDLVSDDG 387

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             +          R KL   R+ R RP LDDKV+  WNGL+I++             +  
Sbjct: 388 AHMRT-------LRAKLLARRAGRVRPGLDDKVLADWNGLMIAALVHG---------ACA 431

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
           F  P       +++E A +A  FIR+ +   +  RL HS+R G    P    DYA ++  
Sbjct: 432 FGLP-------DWLETARTAFEFIRKTM--TRGDRLGHSWREGRLLVPALACDYAAMVRA 482

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            L L E    T +L  A+  Q T D  + D E GGY+ T  +   +++R     D A P+
Sbjct: 483 ALALSEATGDTAYLEQALRWQATLDTHYADVEHGGYYLTADDAEGLIVRPHSTIDDAIPN 542

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
            N +   NLVRLA++   SK   +R   +       +R  +       +  A D+    +
Sbjct: 543 YNGLIAQNLVRLAALTGDSK---WRDRIDALFGALLSRAAENGFGHLALLSALDLRLTGA 599

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
              +V+VG  +  +     A A         V+H+   D        EH +   +     
Sbjct: 600 --EIVVVGEGAQAEALLAAARALPHA--TSIVLHVSRGDALP----AEHPARAKAD---- 647

Query: 681 FSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
            S     A VC+N SCS PVT P +L +L++++ S+
Sbjct: 648 -SVQGAAAFVCRNQSCSLPVTTPQALVDLVMQRTSA 682


>gi|266619634|ref|ZP_06112569.1| dTMP kinase [Clostridium hathewayi DSM 13479]
 gi|288868801|gb|EFD01100.1| dTMP kinase [Clostridium hathewayi DSM 13479]
          Length = 622

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 234/685 (34%), Positives = 329/685 (48%), Gaps = 71/685 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE++ +A LLN  +V +KVDREERPDVD VYM+  QA+ G GGWPL++ ++PD K
Sbjct: 1   MEQESFENDRIAALLNREYVCVKVDREERPDVDAVYMSVCQAMNGQGGWPLTIIMTPDCK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFPP  +YGR G + +L  V   W   R+    S       L      + S+  
Sbjct: 61  PFFSGTYFPPYARYGRVGLEELLTAVAGQWKADRETFLDSAGQIEAHLKAQERITMSAEP 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
             D + Q A R    Q   ++D + GGFG APKFP P  +  ++       + G   +  
Sbjct: 121 GVDAVHQ-AFR----QFLGNFDKKNGGFGGAPKFPTPHNLIFLM-------EYGVREKKR 168

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           E   M   TL  M +GGI DH+GGGF RYS DE W VPHFEKMLYD   L   Y++AF L
Sbjct: 169 EALAMAETTLVQMYRGGIFDHIGGGFSRYSTDETWLVPHFEKMLYDNALLVMAYVEAFGL 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T    Y  + R IL Y+  ++    G  +  +DADS   EG     EG +YV+T +E+  
Sbjct: 229 TGRNGYKRVARRILAYVEAELTDEKGGFYCGQDADS---EGL----EGKYYVFTPQEICR 281

Query: 328 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 387
           ILG  A           T  C    +++  N F+GK++   L + +  A         + 
Sbjct: 282 ILGPDA----------GTDFCSCYGITERGN-FEGKSIPNLLKNEAYEAV--------WE 322

Query: 388 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 447
           N      +KL+D R  R R H DDK++VSWNG +I + A+A  +L               
Sbjct: 323 NHESPDLKKLYDYRITRTRLHRDDKILVSWNGWMICACAKAGAVL--------------- 367

Query: 448 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 507
            D   Y+++A  A +FI  +L   +  RL   +R G S   G LDDYA  I  LL+LY  
Sbjct: 368 -DDTNYLDMAVRAETFIHENLV--RDGRLMVRYREGDSAGEGKLDDYACYILALLELYRV 424

Query: 508 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 567
              T +L  A +   T  + F DRE GG++ T  +   +++R KE +DGA PSGNS + +
Sbjct: 425 TFQTDYLTRAAQWAETMVQQFFDRERGGFWMTAEDGEPLIVRTKETYDGAVPSGNSAAAL 484

Query: 568 NLVRLASIVAGSK-SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
            L +LA I   +K  D   Q   +     E      + A+  M      +  PSR+ V  
Sbjct: 485 GLYQLARITGETKWQDVLNQQLHYLAGAMEGYPSGHSFALLTMM----NVLYPSRELVCT 540

Query: 627 VGHKSSVDFENMLA--AAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD 684
           V    S +  ++LA   A+ +  +    + +  AD E      E       +        
Sbjct: 541 VSPDESGEALSILARRLAYLAETVPGLTVVVKTADNE-----TELTKLAPYIGDYPLPEA 595

Query: 685 KVVALVCQNFSCSPPVTDPISLENL 709
             +  +C    C PPV    SLE L
Sbjct: 596 GSLFYLCSGSRCMPPVK---SLEEL 617


>gi|332292243|ref|YP_004430852.1| N-acylglucosamine 2-epimerase [Krokinobacter sp. 4H-3-7-5]
 gi|332170329|gb|AEE19584.1| N-acylglucosamine 2-epimerase [Krokinobacter sp. 4H-3-7-5]
          Length = 679

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 220/684 (32%), Positives = 338/684 (49%), Gaps = 73/684 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVME ESFE+  VA+L+N  F +IKVDREERPDVD VYM  VQ +   GGWPL+ 
Sbjct: 53  SCHWCHVMEHESFENTEVAQLMNAHFKNIKVDREERPDVDNVYMNAVQLMTSRGGWPLNA 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
              PD +P+ GGTYFP E+      + + L ++   +    + L +  A  +EQ  + + 
Sbjct: 113 IALPDGRPVWGGTYFPKEE------WTSALEQIAKLYQTAPEKLIEY-AEKLEQGMQEMD 165

Query: 141 ASASSNKLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           A   ++  PD   E  QNA+     Q S+ +D+R GG   APKF  P     +L ++ + 
Sbjct: 166 AIIPNDSSPDFKLETLQNAI----SQWSRQWDTRQGGLNRAPKFMMPNNYLFLLRYAHQN 221

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +D        E  + V  TL+ +A GGI+DHVGGGF RYSVD +WHVPHFEKMLYD  QL
Sbjct: 222 QD-------QEILEYVNTTLEQIAFGGINDHVGGGFARYSVDTKWHVPHFEKMLYDNAQL 274

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++Y  A++ TK+  Y       L ++ R+M    G  +SA DADS   +G    +EGA+
Sbjct: 275 VSLYALAYTKTKNPLYKQTVYQTLTFIAREMTTEDGAFYSAIDADSLTADGIL--EEGAY 332

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT KE++ ++G+   LFKE+Y +   G  +           K   VLI  +     + 
Sbjct: 333 YVWTEKELQTLVGDDFDLFKEYYNINSYGKWE-----------KDNYVLIRQDTDQDFSK 381

Query: 378 KLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           +  + +E+ ++   +    L   R S + +P LDDK++ SWNGL+I  +  A +    +A
Sbjct: 382 ECDISVEEIISKKNKWHEDLLRFRESNKEKPRLDDKILTSWNGLMIKGYVDAYRAFNEDA 441

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           ++  A   A+F+  +L  E    L  +F+NG S   G+L+DYA 
Sbjct: 442 ----------------FLTAALKNATFLSTNLMREDG-GLNRTFKNGKSTINGYLEDYAA 484

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           ++   + LYE  +  +WL  A EL +   + F + +   +F  + +DPS+  R  E +D 
Sbjct: 485 IVDAFIALYEVTADNQWLNKAKELTDYTFQHFQNPKNDLFFFKSNQDPSLASRNTEFYDN 544

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
             PS NS+   N+  L+     +    YR  A+  L   +  ++    +           
Sbjct: 545 VIPSSNSIMAKNIFTLSHYYGDNT---YRDTAKAMLHNIQPSIEQSPTSFSNWMDGMLNY 601

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           ++P  + +V+VG  + +     L     SY +   +I      ++   F           
Sbjct: 602 TMPFYE-LVIVGKDAEI-----LRKEFNSYYIPNKLIATSTIKSDHDIF----------- 644

Query: 677 ARNNFSADKVVALVCQNFSCSPPV 700
            +  F  DK    VC N +C  PV
Sbjct: 645 -KGRFHKDKTFIYVCVNNTCQLPV 667


>gi|313203107|ref|YP_004041764.1| hypothetical protein Palpr_0623 [Paludibacter propionicigenes WB4]
 gi|312442423|gb|ADQ78779.1| hypothetical protein Palpr_0623 [Paludibacter propionicigenes WB4]
          Length = 680

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 226/697 (32%), Positives = 338/697 (48%), Gaps = 102/697 (14%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME E FEDE VA+ +N+ FV+IKVDREERPD+D++YMT VQ L   GGWPL+  
Sbjct: 57  CHWCHVMERECFEDEEVARYMNEHFVAIKVDREERPDIDQIYMTAVQLLTERGGWPLNCV 116

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF------AIEQL 135
             PD +P+ GGTYFP                 K  W    DML Q   F        E  
Sbjct: 117 ALPDGRPIYGGTYFP-----------------KAQW---LDMLNQVSGFIQLHPDKTENQ 156

Query: 136 SEALSASASSNK------LPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 188
           + AL+    +N+      LP  E   N        +    D+  GG+G+APKFP P  +Q
Sbjct: 157 ARALTEGVQNNEMIYRADLPGLEATVNDQEDIFYHIQAGIDTVNGGYGTAPKFPMPSSLQ 216

Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
            +L H   L     SG  ++  K +  TL  MA GGI+D +GGGF RY+ DE W +PHFE
Sbjct: 217 FLL-HFHHL-----SGN-NDALKALTTTLDRMAFGGIYDQIGGGFARYATDEAWKIPHFE 269

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KMLYD   L +VY  AF   ++  Y  +  + L+++  ++  P G  +S+ DADS   EG
Sbjct: 270 KMLYDNALLVSVYASAFQYNRNPHYEKVLHETLEFVSSELTSPDGGFYSSLDADS---EG 326

Query: 309 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
                EG FYVWT  E++ ILG++A L  +++ +   GN + S           +N+L  
Sbjct: 327 V----EGKFYVWTFDELQTILGKNAGLIMDYFQVTAAGNWEES-----------QNILYR 371

Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
             +    A K  +   +    + + R  L  VR+KR +P LDDK++ SWN L++  +  A
Sbjct: 372 KGNDEEIARKHNLSTVELSESIAQARELLQTVRAKRQKPMLDDKILTSWNALMLKGYCDA 431

Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
            ++                + + EY++ A   A+FI R++     + L  +++NG +  P
Sbjct: 432 YRV----------------TAKAEYLQAALRNANFILRYM-KSADNGLFRNYKNGKASIP 474

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
            FLDDYAF+I   + LY+     +WLV A EL       F D E G ++ T+  +P+++ 
Sbjct: 475 AFLDDYAFIIQAFISLYQNTFDEQWLVEASELTEYTVSHFYDPESGMFYYTSDTEPALIA 534

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R  E  D   PS NS    NL  L        +D Y   +E  L      ++  A+   +
Sbjct: 535 RKMEISDNVIPSSNSEMGKNLFVLGHYF---YNDQYITMSEKML----NNVRQNALQGGI 587

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY---DLNKTVIHIDPADTEEMDF 665
                D          +L+G  +S  +E  +   ++     +LN   +H       + + 
Sbjct: 588 YYANWD----------ILMGWFASAPYEVSVVGKNSDLLRKELNTHYLHNIILSGTKFE- 636

Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                 +N  + +  +SAD+ +  VC+N  C  PV+D
Sbjct: 637 ------SNLPVLKGKWSADETLIYVCRNHVCQAPVSD 667


>gi|282899862|ref|ZP_06307823.1| protein of unknown function DUF255 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281195132|gb|EFA70068.1| protein of unknown function DUF255 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 689

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 232/708 (32%), Positives = 349/708 (49%), Gaps = 115/708 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDLAIAEYMNANFIPIKVDREERPDIDSIYMQSLQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
            FLSP DL P   GTYFP   +YGRPGF  +L+ ++  +D +++   Q  A  +E L   
Sbjct: 108 AFLSPDDLVPFYAGTYFPVAPRYGRPGFLEVLQAIRHYYDHQKEDFRQRKASILEAL--- 164

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPK-----FPRPVEIQMMLYH 193
           LS++   N   D+   +        L + +++  G     PK     FP     Q++L  
Sbjct: 165 LSSTVLQNHDLDQFAHSQFH---RFLKQGWETAIGVI--TPKQMGNSFPMIPYCQLVLQG 219

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
           ++          A++G +M       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD
Sbjct: 220 TR-----FNYPSANDGLQMATQRGLDLALGGIYDHVGGGFHRYTVDATWTVPHFEKMLYD 274

Query: 254 QGQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
            GQ+     + +S   ++  +       + +L R+MI P G  ++A+DADS         
Sbjct: 275 NGQIVEYLANLWSAGVEEPAFKRAVAGTVSWLEREMISPTGYFYAAQDADSFNCSTDMEP 334

Query: 313 KEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           +EGAFYVW+ +E++++L +  +L  KEH+ L   GN            F+GKNVL  L  
Sbjct: 335 EEGAFYVWSYRELQELLSDQELLEVKEHFSLSLEGN------------FEGKNVLQRL-- 380

Query: 372 SSASASKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHLDDKV 413
              SA +L   LE  L  L  CR              R   + ++     R  P  D K+
Sbjct: 381 ---SAGELSSSLELILGRLFLCRYGQTAETLTIFPPARNNHEAKTNPWHGRIPPVTDTKM 437

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQ 472
           IV+WN L+IS  ARAS++ +                +  Y+++A  A  FI  H + D +
Sbjct: 438 IVAWNSLMISGLARASEVFQ----------------QPSYLQLAVQATRFILDHQFVDGR 481

Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDR 531
            HRL +   +G        +DYA  I  LLDL++  SG + WL  AI LQ+  +E  L  
Sbjct: 482 FHRLNY---DGEPTVLAQSEDYALFIKALLDLHQADSGSSNWLEQAITLQDEFNEFLLSV 538

Query: 532 EGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
           E GGYFNT+ ++   +++R +   D A PS N V++ NL++L  +   + + YY   AE 
Sbjct: 539 ELGGYFNTSSDNSQDLIIRERNFVDNATPSANGVAIANLIKLCLL---TDNLYYLDLAES 595

Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
           +L  F T ++    + P +  A D       ++  LV  +SS+D   +LA  +    +  
Sbjct: 596 ALKAFSTIIEKSPQSCPSLLIAIDWY-----RNSTLV--RSSIDNIKILAGKYLPTTIFD 648

Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSP 698
            +  + P +T                          + LVCQ   C P
Sbjct: 649 VISKL-PGNT--------------------------IGLVCQGLKCLP 669


>gi|85714094|ref|ZP_01045083.1| hypothetical protein NB311A_08058 [Nitrobacter sp. Nb-311A]
 gi|85699220|gb|EAQ37088.1| hypothetical protein NB311A_08058 [Nitrobacter sp. Nb-311A]
          Length = 714

 Score =  333 bits (855), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 224/691 (32%), Positives = 335/691 (48%), Gaps = 74/691 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE VA ++N+ FV IKVDREERPD+D++YM  +  L   GGWPL++
Sbjct: 94  ACHWCHVMAHESFEDEDVAAVMNELFVCIKVDREERPDIDQIYMNALHHLGEQGGWPLTM 153

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL PD  P  GGTYFP    +GRP F  +L+ V   + ++ D +A+     I +LSE   
Sbjct: 154 FLFPDGSPFWGGTYFPKLPDFGRPAFTDVLQSVARVFREQPDKIARHRDALIARLSERAR 213

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A   +N    EL  NA  L A+    S D   GG   APKFP+   ++ +     +  D 
Sbjct: 214 ADNPANIGLAEL-DNAAALIAQ----STDPVHGGLRGAPKFPQCSVLEFLWRAGARTHD- 267

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                       V  T+  M++GGI+DH+GGG+ RYSVD++W VPHFEKMLYD  Q+ ++
Sbjct: 268 ------DHFFAAVTLTMTRMSQGGIYDHLGGGYARYSVDDKWLVPHFEKMLYDNAQILDL 321

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
                + +K+  Y     + +D+LRR+M+ P G   S+ DADS   EG    +EG FY+W
Sbjct: 322 LALDHARSKNPLYRERATETVDWLRREMLTPAGGFASSLDADS---EG----EEGRFYIW 374

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           + KE+E++LG   A  F   Y +   GN            F+G+N+   L     ++   
Sbjct: 375 SLKEIEEVLGTTDAADFAARYDITANGN------------FEGRNIPNRLRSIEVASDD- 421

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
                 ++  L   R KL   R  R RP LDDK++  WNGL+I++   A+ +        
Sbjct: 422 ----SAHMRAL---REKLLARRESRVRPGLDDKILADWNGLMIAALVHAACVF------- 467

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                    DR +++++A +   F+R  +   +  RL HS+R G    P    DYA +  
Sbjct: 468 ---------DRPDWLQIARAVYDFVRTTM--TRDGRLGHSWREGRLLVPALASDYAAMGR 516

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             L L+E       LV A+  Q+T D  + D E GGY+ T  +   +++R     D A P
Sbjct: 517 AALALFEATGDNDCLVQALRWQSTLDTHYADVEHGGYYLTAADAEGLIVRPHSSDDDATP 576

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           + + +   NLVRLA++   +K   +R   +           +       +  A D+    
Sbjct: 577 NHDGLIAQNLVRLAALTGDTK---WRARIDGLFTALLPSATEKGFGQLSLMNALDLRLTG 633

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           +   +V+VG  +      +L AA         V+H   A+    D   +  + +   A  
Sbjct: 634 A--EIVVVGEDAQAG--ALLNAARKLPHATSIVLHAPHAEALAADHPAQAKARSVRGA-- 687

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   A VC+   CS PV+ P +L  L+
Sbjct: 688 -------AAFVCRQQRCSLPVSIPKTLIELV 711


>gi|346977780|gb|EGY21232.1| spermatogenesis-associated protein [Verticillium dahliae VdLs.17]
          Length = 801

 Score =  333 bits (855), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 226/675 (33%), Positives = 342/675 (50%), Gaps = 91/675 (13%)

Query: 25  CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
           C +  ++SF     A LLN+ FV + VDREERPD+D +YM YVQA+ G GGWPL++FL+P
Sbjct: 70  CRLTAIDSFSHPECASLLNEAFVPVIVDREERPDLDTIYMNYVQAVNGAGGWPLNLFLTP 129

Query: 85  DLKPLMGGTYFP---------PEDKYGRPGFKTILRKVKDAWDKK--------RDMLAQS 127
           +L+P+ GGTY+P         PE++ G   F  IL+ ++  W ++        +++L++ 
Sbjct: 130 ELEPVFGGTYWPGPGAHTKTGPEEEEGV-DFLAILKNLRKVWQEQEPRCRQEAKEVLSKL 188

Query: 128 GAFAIE---------QLSE--------ALSASASSNKLP----------DELPQNALRLC 160
             FA E         Q+S+        A  ASA S + P           EL  + L   
Sbjct: 189 REFAAEGTLGTRSTVQMSKIGLTSSSTAPVASAVSTENPGAGKTAADVSSELDLDQLEEA 248

Query: 161 AEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTL 217
              ++ ++D  +GGFG APKFP P ++  +L   ++   ++D     E +   +M LFTL
Sbjct: 249 YSHIAGTFDPVYGGFGLAPKFPVPAKLSFLLRLPHYLHPVQDVVGPTECAHATEMALFTL 308

Query: 218 QCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT---KDVFY 273
           + +   G+ DHVGG GF RYS+   W +PHFEK+  D   L  +YLDA+ ++   KD   
Sbjct: 309 RKIRDSGLRDHVGGCGFARYSITPDWSIPHFEKLTSDNALLLGLYLDAWLISNGDKDGEL 368

Query: 274 SYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-E 331
             +  ++ DY     M  PGG   S+E ADS    G T  +EGAF++WT KE + ++G E
Sbjct: 369 YDVVVELADYFSSPPMRLPGGGFASSEAADSYYRRGDTDVREGAFHLWTRKEFDAVIGDE 428

Query: 332 H-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 390
           H A +   ++ +   GN +  +  DP++EF  +N+   L + S    + G+  E+   ++
Sbjct: 429 HEATIAATYWNILEHGNVEPDQ--DPNDEFMNQNIPRVLKEQSEIGKQFGISGEEVARVI 486

Query: 391 GECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFAR--ASKILKSEAESAMFNFPVVG 447
              + KL   R + R RP LDDK+I  WNGLVIS+ AR  A+  +K  A+SA        
Sbjct: 487 ASAKAKLKAHRGRERVRPELDDKIISGWNGLVISALARTGAALAVKDAAKSA-------- 538

Query: 448 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 507
               +Y+  A  +A F+R  L+DE+   L   FR        F +DYA+ I GL+DLYE 
Sbjct: 539 ----QYLGAAIQSAEFVRAQLWDEKEKTLYKVFRGTRGSTKAFAEDYAYFIEGLIDLYEA 594

Query: 508 GSGTKWLVWAIELQNTQDELFLDREG----------------GGYFNTTGEDPSVLLRVK 551
                 + +A ELQ TQ +LF D                   G +F TT +    +LR+K
Sbjct: 595 TGEENCIAFADELQQTQIKLFYDASAPTTSASPNPLPAHSSCGAFFATTEDAKHTILRLK 654

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           +  D A PS N+VSV NL RL   +A   ++ Y   A  +L  FE  +       P +  
Sbjct: 655 DGMDTAFPSNNAVSVSNLFRLGVALA---TETYTALARETLNAFEAEILQYPWLFPGLLS 711

Query: 612 AADMLSVPSRKHVVL 626
                 +  R ++V+
Sbjct: 712 GVVSSRLGGRTYIVV 726


>gi|452207570|ref|YP_007487692.1| YyaL family protein [Natronomonas moolapensis 8.8.11]
 gi|452083670|emb|CCQ36982.1| YyaL family protein [Natronomonas moolapensis 8.8.11]
          Length = 709

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 226/698 (32%), Positives = 333/698 (47%), Gaps = 68/698 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED  +A+ LN+ FV IKVDREERPDVD +YM   Q + G GGWPLSV+
Sbjct: 50  CHWCHVMADESFEDPEIAETLNEAFVPIKVDREERPDVDTLYMNVCQMVRGSGGWPLSVW 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK---KRDMLAQSGAFAIEQLSEA 138
           L+P+ KP   GTYFPPE     P F ++L  + D+W+    +  + +Q+  +A     E 
Sbjct: 110 LTPEGKPFHVGTYFPPEATANMPSFGSVLGDIADSWNDPEGRSRLESQADQWASSTKGEL 169

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKL 197
                 S + P E     L   A    +  D   GG+G   KFP P  I ++L  +    
Sbjct: 170 EGTPDRSGEAPGE---GFLDTAANAAVRGADREAGGWGQGQKFPHPGRIHLLLRAYDATD 226

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            DT +         + L TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++
Sbjct: 227 RDTYR--------DVALETLDAMASGGLYDHVGGGFHRYCVDREWTVPHFEKMLYDNAEI 278

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              +L  + LT +  Y+ I  +   +L R++  P G  +S  DA+S ++ G+  ++EGAF
Sbjct: 279 PRAFLAGYRLTGEERYAEIASETFAFLERELTHPDGGFYSTLDAESEDSTGS--REEGAF 336

Query: 318 YVWTSKEVEDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           YVWT + V + + +   A LF E Y +  +GN +            G  VL E       
Sbjct: 337 YVWTPETVREAVDDPTAAELFCERYGVTDSGNFE-----------NGTTVLTESTPIGEL 385

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+   M  +    +L   R +LF+ R  RPRP  D KV+  WNGL+IS+ A  +  L   
Sbjct: 386 AADAVMDTDSVEALLETARSQLFEARESRPRPPRDGKVLAGWNGLMISALAEGALALN-- 443

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH-----RLQHSFRNGPSKAPG 489
                            Y ++AE+A  F R  L+ DE T      RL   F  G     G
Sbjct: 444 ---------------PTYADLAEAALEFCRDRLWEDEGTQDGDVGRLNRRFERGEVGISG 488

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIEL-QNTQDELFLDREGGGYFNTTGEDPSVLL 548
           +L+DYA+L  G  DLY+     + L +A++L +  +   + + EG  YF  TG +  ++ 
Sbjct: 489 YLEDYAYLGRGAFDLYQATGDVEHLQFALQLGRAIRASFYEESEGTLYFTPTGGE-ELIA 547

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R ++  D + PS   V+V  L  L++    +  D      +  L    + L+   +    
Sbjct: 548 RPQQLADSSTPSSTGVAVQLLAALSAFDPDAGFDAV---VDSVLETHASTLESNPITHTS 604

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +  AA   SV S +  V  G      +   L+  +    L    + + P     +  W +
Sbjct: 605 LTLAAIDRSVGSPELTVAAGELPPA-WREALSGTY----LPGRTLSVRPPTESGLSAWLD 659

Query: 669 ----HNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                ++      R+     + V   C++F+CSPP  D
Sbjct: 660 AIGLEDAPPIWAGRDAVDGRETV-YACRSFTCSPPTHD 696


>gi|389690661|ref|ZP_10179554.1| thioredoxin domain containing protein [Microvirga sp. WSM3557]
 gi|388588904|gb|EIM29193.1| thioredoxin domain containing protein [Microvirga sp. WSM3557]
          Length = 676

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 241/702 (34%), Positives = 355/702 (50%), Gaps = 87/702 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED  VA ++N+ FV+IKVDREERPDVD VYM+ +  L   GGWPL++
Sbjct: 48  ACHWCHVMAHESFEDADVAAVMNELFVNIKVDREERPDVDHVYMSALHLLGEPGGWPLTM 107

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P  GGTYFP E ++GRPGF  +LR++   +  + + + ++     + L+ +  
Sbjct: 108 FLTPEGEPFWGGTYFPKEPRFGRPGFVGVLREISRLYRSEPERILKNRDAIKQHLARSDR 167

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
               +  L D      L     +L++  D+  GG   APKFP P  ++ +  ++      
Sbjct: 168 GDGGTLGLVD------LDRLGARLAELIDTENGGLQGAPKFPNPPILECLYRYA------ 215

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G++G+  E ++  L TL+ MA GGIHDH+GGGF RYSVDERW VPHFEKMLYD  QL  +
Sbjct: 216 GRTGDG-EAKRRFLLTLERMALGGIHDHLGGGFARYSVDERWLVPHFEKMLYDNAQLLEL 274

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y  A++ T    +      I+ +L R+M  P G   S+ DADS   EG    +EG FYVW
Sbjct: 275 YGLAYAETGRALFRDAAEGIVIWLGREMTTPEGGFASSLDADS---EG----EEGLFYVW 327

Query: 321 TSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           +  E+ ++LGE  A  F + Y +   GN            F+G+N+   L    A     
Sbjct: 328 SLAEIREVLGEEDAAFFGQVYDITEEGN------------FEGRNIPNRLLSGVAP---- 371

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            + +E+ L  L   R KL + RS R RP LDDKV+  WNGL+I++  RAS +L       
Sbjct: 372 -LAIEERLAAL---RAKLLERRSARVRPGLDDKVLADWNGLMIAALVRASPLL------- 420

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                    DR +++ +A+ A  F+   +   +  RL HS+R G    PGF  D+A ++ 
Sbjct: 421 ---------DRPDWIALAQRAYRFVTEAM--TRDGRLGHSWRGGALIVPGFALDHAAMMR 469

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLD---REGGGYFNTTGEDPSVLLRVKEDHDG 556
             L L+E  +   +L    + Q  +D L  D    + G    T      +++R +   D 
Sbjct: 470 AALALFEVTADQAYLR---DAQTWRDRLMSDYRIEDTGALAMTARNADPLVVRPQPTQDD 526

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-MCCAADM 615
           A P+ N V    LVRLA +   ++ D   + A   L    T+L  +A + PL      + 
Sbjct: 527 AVPNANGVCAEALVRLAQL---TEMDGDLRQASEVL----TKLGGIARSSPLGHTSILNA 579

Query: 616 LSVPSRKHVVLV-GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           L +  R   +LV G+ +   FE  L   +    + +          EE+D  + H +   
Sbjct: 580 LDLHLRGLTILVTGNGADALFEAGLKIPYPIRSIRRL------KSDEELD--DNHPAKAL 631

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
           +      S     ALVC    CS PVTD   L+  +LE  S+
Sbjct: 632 AA-----SGAGPRALVCAGMRCSLPVTDADGLKAQVLEMSSA 668


>gi|344340301|ref|ZP_08771227.1| hypothetical protein ThimaDRAFT_2966 [Thiocapsa marina 5811]
 gi|343799959|gb|EGV17907.1| hypothetical protein ThimaDRAFT_2966 [Thiocapsa marina 5811]
          Length = 691

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 246/711 (34%), Positives = 362/711 (50%), Gaps = 91/711 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESFED G A+L+N  FV+IKVDREERPD+DK+Y T  Q L    GGWPL
Sbjct: 58  SACHWCHVMAHESFEDPGTAELMNRLFVNIKVDREERPDLDKIYQTAHQLLAQRPGGWPL 117

Query: 79  SVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           +VFL PD  KP   GTYFP E ++G P FK +++ V+ A+ +++         AIE  +E
Sbjct: 118 TVFLMPDDQKPFFAGTYFPREPRHGLPAFKQLMQGVERAYREQKT--------AIESQNE 169

Query: 138 ALSASAS------SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
           +L A+ +      S+ LP+   ++A+    +QL  S+D   GGFG APKFP P  + ++L
Sbjct: 170 SLMAALAELEPHASDALPE---RSAIDAALQQLDTSFDPEHGGFGDAPKFPHPTNLDLLL 226

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
            H+     TG    ++  +   ++TL+ M +GG+ D +GGGF+RYSVD  W +PHFEKML
Sbjct: 227 RHATDAPQTGAPDRSALAK--AVWTLERMVRGGLTDQLGGGFYRYSVDALWMIPHFEKML 284

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD G L  +  DAF++T+D  +        D++ R+M  P G  +S+ DADS   EG   
Sbjct: 285 YDNGPLLALCCDAFAVTEDPVFRDAAVMTADWVLREMQSPEGGYWSSLDADS---EG--- 338

Query: 312 KKEGAFYVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            +EG FYVW  +E+  +L   E+A  F   Y L    NC+            G+  L   
Sbjct: 339 -EEGKFYVWDREEIRALLAPAEYAP-FAAVYRLDRPANCE------------GRWHLHGY 384

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
               A A  LG+   +   +L   R  L+  R +R RP  D+KV+ +WN L+I   ARA+
Sbjct: 385 RTPEAVAVDLGLEPARVQALLAAARATLYVARERRVRPGRDEKVLTAWNALMIKGLARAA 444

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
           +                  DR +Y+E AE A +FIR  L+ E   RL  ++++G +    
Sbjct: 445 RTF----------------DRPDYLESAEQALAFIRGTLWREG--RLLATYKDGTAHLNA 486

Query: 490 FLDDYAFLISGLLDLYEFGSGTKW----LVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
           +LDDYA L+  LL+L +    T+W    L +A+ L     + F D  GGG++ T  +  +
Sbjct: 487 YLDDYANLLDALLELLQ----TRWSRADLDFALALAEVLLDQFEDPIGGGFWFTGRDHET 542

Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
           ++ R K   D A PSGN V+ + L RL  +V   +   Y   AE +L +    ++ M  A
Sbjct: 543 LIHRTKPLGDEAIPSGNGVAALALERLGHLVGEPR---YLAAAERTLKLAAESIRRMPYA 599

Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
              +  A D    P    V+  G +     +     A   Y   + V+ I PAD   +  
Sbjct: 600 HATLLFALDEWLDPPETLVIRAGDER---LDAWRREAQRGYRPRRFVLGI-PADESHL-- 653

Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
                   A+MA      ++     C    C PP     SL +++  KP+S
Sbjct: 654 ----PGTLAAMA----PGERPRIYRCSGTRCEPPTE---SLADVV--KPTS 691


>gi|289548374|ref|YP_003473362.1| hypothetical protein Thal_0601 [Thermocrinis albus DSM 14484]
 gi|289181991|gb|ADC89235.1| protein of unknown function DUF255 [Thermocrinis albus DSM 14484]
          Length = 655

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 205/589 (34%), Positives = 314/589 (53%), Gaps = 56/589 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  E FE+  +A+++N+ FV+IKVDR+ERPD+D+ Y   V +L G GGWPL+VF
Sbjct: 58  CHWCHVMAKECFENPEIAQIINENFVAIKVDRDERPDIDRRYQEVVVSLTGSGGWPLTVF 117

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD K   GGTYFPPED++GRPGFK++L ++   W + RD + +S     E L    + 
Sbjct: 118 LTPDGKAFFGGTYFPPEDRWGRPGFKSLLLRIAQLWKEDRDRVIRSAEHIFELLR---NY 174

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
           S+SS+K  D + +  L      L  S D ++GG G+APKF      +++LYH      TG
Sbjct: 175 SSSSHK--DNVGEELLNRGIANLLASVDYQYGGIGTAPKFHHARAFELLLYHHFF---TG 229

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
           ++       + V  TL  MA+GGI+DH+GGGF RYS D+RW VPHFEKML D  +L  VY
Sbjct: 230 QTLPV----EAVEITLDSMARGGIYDHLGGGFFRYSTDDRWIVPHFEKMLSDNAELLLVY 285

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
             AF +TK   Y Y+   IL+Y +R     GG  ++++DAD  + +      EG +Y ++
Sbjct: 286 SLAFQVTKKDLYRYVVEGILNYYQRFGFDEGGGFYASQDADIGDLD------EGGYYTFS 339

Query: 322 SKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            +E+  IL E  +     Y+ + P G        DP      KNVL         A+  G
Sbjct: 340 LEELRGILTEEELKVTSLYFDIHPKGEMH----HDP-----SKNVLFIAMSEEEVATATG 390

Query: 381 MPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           +PLE+   +L   RRK+   R S R +P +D  +  +WNGL++ + +   K+        
Sbjct: 391 IPLERVRQLLESARRKMLSYRESTRQQPFIDKTIYTNWNGLMLEALSTCYKV-------- 442

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
            F  P V S        AE  A  + + ++ +   +L H++        G  +DY FL  
Sbjct: 443 -FRIPWVLSS-------AEKTADRLMKEMWKDG--QLMHTY-----GVKGMAEDYIFLAR 487

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAE 558
           GLL L+E     ++L  ++ L +   + F D +G G+F+T  +D  +L +R+K   D   
Sbjct: 488 GLLSLFEVTQKREYLEASVMLAHEAIKKFWDPQGWGFFDTEEKDEGLLRIRLKTLQDTPT 547

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
            S N  +    + L S+   ++   + + AE +L  F   ++++ +  P
Sbjct: 548 QSVNGAAPYLYLVLGSVTPYTE---FLEYAEKNLQAFARMVREIPLISP 593


>gi|256005004|ref|ZP_05429976.1| protein of unknown function DUF255 [Clostridium thermocellum DSM
           2360]
 gi|255991073|gb|EEU01183.1| protein of unknown function DUF255 [Clostridium thermocellum DSM
           2360]
          Length = 482

 Score =  333 bits (853), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 189/467 (40%), Positives = 263/467 (56%), Gaps = 59/467 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFEDE VA++LN  FVSIKVDREERPD+D +YMT  QAL G GGWPL+
Sbjct: 53  STCHWCHVMESESFEDEEVAEILNKNFVSIKVDREERPDIDSIYMTACQALTGHGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTYFP +D+ G PG  +IL+ V + W  ++D LA+  +  +  +SE++
Sbjct: 113 IIMTPDKKPFFAGTYFPKKDRMGMPGLISILKSVHNTWVNEKDSLAKYSSKVVSVISESI 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 +   DE+ ++       Q    +D+ +GGFG+APKFP P  +  +L +  K   
Sbjct: 173 DDDYYYS--VDEITEDIFEDAFSQFKYDFDNIYGGFGNAPKFPMPHNLYFLLRYWHK--- 227

Query: 200 TGKSGEASEGQKMVLF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                 A E   +V+   TL  M  GGI+DH+G GF RYS DE+W VPHFEKMLYD   L
Sbjct: 228 ------AKEEYALVMVEKTLDSMYSGGIYDHIGFGFCRYSTDEKWLVPHFEKMLYDNALL 281

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A  YL+ +  TK+  Y+ I ++I  Y+ RDM  P G  +SAEDADS   EG    +EG F
Sbjct: 282 AIAYLETYQATKNKKYADIAKEIFTYVLRDMTSPEGGFYSAEDADS---EG----EEGKF 334

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W+  E++++LGE     F ++Y +   GN            F+G N+   +N +    
Sbjct: 335 YIWSPTEIKEVLGESDGEKFCKYYNITEEGN------------FEGLNIPNLINSTIPDE 382

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            K  + L         CR+KLFD R KR  PH DDK++ +WNGL+I++ A   ++L  E 
Sbjct: 383 DKEFVEL---------CRKKLFDHREKRVHPHKDDKILTAWNGLMIAALAIGGRVLGIE- 432

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
                          +Y   AE A+ FI   L      RL   +R+G
Sbjct: 433 ---------------KYTLAAEKASEFIFSKLV-RPDGRLLARYRDG 463


>gi|160935413|ref|ZP_02082795.1| hypothetical protein CLOBOL_00308 [Clostridium bolteae ATCC
           BAA-613]
 gi|158441771|gb|EDP19471.1| hypothetical protein CLOBOL_00308 [Clostridium bolteae ATCC
           BAA-613]
          Length = 642

 Score =  332 bits (852), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 206/557 (36%), Positives = 289/557 (51%), Gaps = 49/557 (8%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+E +A++LN  +V +KVDREERPDVD VYM+  QA+ G GGWPL++ ++PD +
Sbjct: 1   MERESFENEVIAEILNREYVCVKVDREERPDVDSVYMSVCQAMNGQGGWPLTIIMTPDCR 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAFAIEQLSEALSASASSN 146
           P   GTYFPP  +YGRPG + +L      W  KK  +L Q+G     Q+ + L +   + 
Sbjct: 61  PFFSGTYFPPRARYGRPGLEELLTAAAGQWKVKKEKLLDQAG-----QIEKYLKSQERTE 115

Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
           +   E    A+     QL+  +DS+ GGFGSAPKFP P  +  ++       + G   + 
Sbjct: 116 RQA-EPELGAVHQAFRQLADCFDSKNGGFGSAPKFPAPHNLIFLM-------EYGAREKR 167

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
            E   M   TL  M +GGI DH+GGGF RYS D +W VPHFEKMLYD   L   Y+ A+ 
Sbjct: 168 PEALAMAEKTLVQMYRGGIFDHIGGGFSRYSTDGQWLVPHFEKMLYDNSLLVMAYIKAYG 227

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
            T    Y  +   IL+Y+RR++    G  +  +DADS          EG +YV+T +E+ 
Sbjct: 228 STGRKMYGCVAEKILEYVRRELTDSQGGFYCGQDADSDGV-------EGKYYVFTREEIR 280

Query: 327 DILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 385
           ++LGE A   F   Y +  TG+ +    S P N  +  N      +   +    G     
Sbjct: 281 EVLGEKAGRDFCRQYGI--TGHGNFEGRSIP-NLLENDNYEEICEEPWGNGDHGGNICHG 337

Query: 386 YLNILG-----ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             + +G     ECRR L+  R  R R H DDK++VSWN  +I + A A  +L  E     
Sbjct: 338 SCDTIGGRENEECRR-LYQYRIDRARLHKDDKILVSWNSWMICACAMAGAVLGEE----- 391

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                      +Y+++A  A +FI+ HL  E   RL   +R+G +   G LDDYA     
Sbjct: 392 -----------QYVDMAVRADAFIKSHLVKE--GRLMVRYRDGDAAGEGKLDDYACYSLA 438

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LL+LY       +L  A        E F DRE GG++    +   +++R KE +DGA PS
Sbjct: 439 LLELYRVTFRVDYLKRAAAWAEIMTEQFFDRERGGFYLYAKDGEQLIVRTKETYDGAMPS 498

Query: 561 GNSVSVINLVRLASIVA 577
           GNSV+   L RL  I  
Sbjct: 499 GNSVAAQVLYRLTRITG 515


>gi|402773173|ref|YP_006592710.1| thioredoxin domain-containing protein [Methylocystis sp. SC2]
 gi|401775193|emb|CCJ08059.1| Thioredoxin domain protein [Methylocystis sp. SC2]
          Length = 675

 Score =  332 bits (851), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 216/697 (30%), Positives = 341/697 (48%), Gaps = 79/697 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE+  +A L+N+ F+++KVDREERPDVD +Y   +  +   GGWPL++
Sbjct: 52  ACHWCHVMAHESFENPEIAALMNESFINVKVDREERPDVDYLYQQALMMMGQRGGWPLTM 111

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P  GGTYFPP  + GRPGF  +L+ + + W  + + +  +    + +LS  L+
Sbjct: 112 FLTPEGQPFWGGTYFPPFAQGGRPGFAELLKTIAELWRARANAIEHN----VAELSAGLA 167

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           + + +       P     +CA QL++  D   GGFG+APKFP+   +  +    K     
Sbjct: 168 SLSETTPGEPVSPHLVESICA-QLAQRLDRVDGGFGAAPKFPQTTSLDFLWRAWK----- 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            ++G  S  Q +VL TL  +++GG++DH+GGGF RYS D RW VPHFEKMLYD  QL  +
Sbjct: 222 -RTGRDSLRQAVVL-TLDHISQGGVYDHLGGGFARYSTDNRWLVPHFEKMLYDNAQLIEL 279

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
             + +   +   Y     + ++++ R+M  PGG   S+ DADS   EG    +EG FY W
Sbjct: 280 LTEVWQDERRELYRLRVTETIEWMTREMRAPGGGFASSLDADS---EG----EEGKFYAW 332

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-----IELNDSSAS 375
           +  E+ + LG  A  F+  Y +   GN +            GK+VL     IEL D    
Sbjct: 333 SQTEIREALGARAPFFERAYGVSREGNWE-----------HGKSVLNRLGSIELLDEETE 381

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A+        +L             R++R RP  DDKV+  WNGL I++ A+A+ +    
Sbjct: 382 AALARDRAALFL------------ARARRVRPGCDDKVLADWNGLTIAAIAKAACVF--- 426

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        +R++++++A +A  F++  +  ++  RL HS+R   ++    LDDY 
Sbjct: 427 -------------EREDWLDIAIAAFDFVKSAMTTDEG-RLLHSWRCARARHMAVLDDYG 472

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +    L LYE      +L  A       +  + DR  GGYF    +  +++ RVK   D
Sbjct: 473 AMCRAALALYEAAGAPSYLECARRWVEHVEHHYRDRT-GGYFYAADDADTLIARVKIAED 531

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A PSGN + +  L +L  +   S    YR+ AE     F   +++  +    +    +M
Sbjct: 532 SALPSGNGMMLQALAQLYYLTGES---VYRERAEAIAQDFAGTIRERILGFSSLLNGMEM 588

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L       +V++G   + D   +    +      + +  I PA T        H +   +
Sbjct: 589 LR--EALQIVVIGENDAADTAALKRVIYGVSQPGRVLNVIAPAAT----LPRAHPAFGKT 642

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
           M        +  A VC+   CS P+ +P +L   L E
Sbjct: 643 ML-----GARATAYVCRGMVCSLPIIEPDALAAALRE 674


>gi|345850486|ref|ZP_08803482.1| hypothetical protein SZN_12143 [Streptomyces zinciresistens K42]
 gi|345638083|gb|EGX59594.1| hypothetical protein SZN_12143 [Streptomyces zinciresistens K42]
          Length = 637

 Score =  332 bits (851), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 234/700 (33%), Positives = 332/700 (47%), Gaps = 81/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED+  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP+S
Sbjct: 8   SACHWCHVMAHESFEDDDTAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMS 67

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VF++PD +P   GTYFPP  + G P F+ +L  V+ AW  +RD +A+     +  L+   
Sbjct: 68  VFMTPDGEPFYFGTYFPPAPRQGMPSFRQVLEGVRGAWTDRRDEVAEVAGKIVRDLAGRE 127

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +S          EL Q  L      L++ YD + GGFG APKFP  + I+ +L H  +  
Sbjct: 128 ISYGGPEAPGEQELSQALL-----GLTREYDPQRGGFGGAPKFPPSMVIEFLLRHHAR-- 180

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 181 -TGAEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 235

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS   +G+ R  EGA+Y
Sbjct: 236 RVYAHLWRATGSELARRVALETADFMVRELRTGEGGFASALDADS--DDGSGRHVEGAYY 293

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
           VWT  ++ ++LG E A L   H+ +   G  +            G +VL +   D    A
Sbjct: 294 VWTPAQLREVLGDEDAGLAARHFGVTEEGTFE-----------HGASVLQLPRQDEVFDA 342

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           +++              R +L   R+ RP P  DDKV+ +WNGL +++ A          
Sbjct: 343 ARIA-----------SVRERLLSHRAGRPAPGRDDKVVAAWNGLAVAALAETGAYF---- 387

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
                       DR + +E A  AA  + R  +D+Q  RL  + R+G + A  G L+DYA
Sbjct: 388 ------------DRPDLVEAALGAADLLVRLHFDDQA-RLTRTSRDGQAGANSGVLEDYA 434

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +     F D E G  ++T  +   ++ R ++  D
Sbjct: 435 DVAEGFLALASVTGEGVWLDFAGFLLDHVLTRFSDEESGALYDTAADAERLIRRPQDPTD 494

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A PSG S +   L+  A+  A +    +R  AE +L V +T    +   VP      + 
Sbjct: 495 NAVPSGWSAAAGALLGYAAQTASAP---HRHAAERALGVVKT----LGPRVPRFIGWGLA 547

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A   L  P  + V +VG   + +    L            V+     D+ E     +  
Sbjct: 548 VAEARLDGP--REVAVVGPALTDEATRALHRTALLGTAPGAVVAAGTPDSGEFPLLADRT 605

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
               + A          A VC++F+C  P TDP  L   L
Sbjct: 606 LRQGAPA----------AYVCRDFTCDAPTTDPERLRAAL 635


>gi|402494465|ref|ZP_10841206.1| thioredoxin domain-containing protein [Aquimarina agarilytica ZC1]
          Length = 706

 Score =  332 bits (850), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 215/694 (30%), Positives = 342/694 (49%), Gaps = 69/694 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFED  VA ++N  +++IK+DREERPD+D+VYM+ VQ + G GGWPL+V 
Sbjct: 80  CHWCHVMEHESFEDSTVAAVMNKNYINIKIDREERPDIDQVYMSAVQLMTGRGGWPLNVI 139

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTY+P  +  G       L++++  ++     L +      E +      
Sbjct: 140 ALPDGRPVWGGTYYPKAEWMGA------LQQIQKIYEDDPSKLEEYATKLTEGIQSVSLV 193

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
           + + N L  E   + +    E  +K +D + GG   APKF  P     +L ++ +  +  
Sbjct: 194 TPNPNALKFE--NSTIESAVETWAKKFDYKKGGLDYAPKFMMPNNYHFLLRYAHQTNN-- 249

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                 + +  V+ TL  ++ GG++DHVGGGF RY+ DE+WHVPHFEKMLYD  QL ++Y
Sbjct: 250 -----EKLKDYVITTLNQISYGGVYDHVGGGFARYATDEKWHVPHFEKMLYDNAQLVSLY 304

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
            DA+ LTK+ +Y  +  + LD+++R++    G  +S+ DADS    G  + +EGAFYVW 
Sbjct: 305 SDAYLLTKNEWYKQVVYETLDFVQRELTNAEGVFYSSLDADSVTHSG--KLEEGAFYVWQ 362

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
              +E  LG E   LF ++Y +   G  +       HN +    VLI     +    K  
Sbjct: 363 KPALETALGVEDFKLFADYYNVNAYGIWE-------HNNY----VLIRNESDADFIEKHK 411

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +    +L    + +++L  +RSKR RP LDDK + SWN L++  +A A  +         
Sbjct: 412 LDKGDFLQKQKKWKQRLLSIRSKRERPRLDDKTLTSWNALMLKGYADAYSVF-------- 463

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   +   +++VA + A+FI+         +L H+++ G S   G+L+DYA  I  
Sbjct: 464 --------NDANFLKVALTNAAFIKNKQM-ASNGQLMHNYKEGKSTINGYLEDYAATIDA 514

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            + LY+     +WL  +  + +   + F D   G +F T+ ED +++ R  E  D   P+
Sbjct: 515 FIALYQVTFDQQWLDLSKTMTDYVFDHFYDDASGLFFFTSDEDAALVTRNIESSDNVIPA 574

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV-FETRLKDMAMAVPLMCCAADMLSVP 619
            NS+   NL +L+   +  K   + Q   H++ V  E      +  + LM    +     
Sbjct: 575 SNSMMAKNLYKLSHYFSNKKYLEHSQKMLHNIQVNIEEYPSGYSNWLDLMLNYTEDFY-- 632

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
               VV+VG  +    E    A    Y  NK +        +E         +N  + +N
Sbjct: 633 ---EVVIVGAAA----EEKRVAIQKQYYPNKII----AGSAKE---------SNQPLLQN 672

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
            FS       +C N +C  PVT+  +   LL +K
Sbjct: 673 RFSEKDTHIFICVNNACKYPVTEVEAAFKLLNDK 706


>gi|288941778|ref|YP_003444018.1| hypothetical protein Alvin_2064 [Allochromatium vinosum DSM 180]
 gi|288897150|gb|ADC62986.1| protein of unknown function DUF255 [Allochromatium vinosum DSM 180]
          Length = 688

 Score =  332 bits (850), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 237/687 (34%), Positives = 339/687 (49%), Gaps = 67/687 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
           + CHWCHVM  ESFED   A+ +N  FV+IKVDREERPD+DKVY T  Q L    GGWPL
Sbjct: 59  SACHWCHVMAHESFEDPATAERMNRLFVNIKVDREERPDLDKVYQTAHQLLSQRAGGWPL 118

Query: 79  SVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           +VFL+PD   P   GTYFP E ++G P F  +L  V+ A+ ++       GA   EQ   
Sbjct: 119 TVFLTPDDHTPFFAGTYFPREPRHGLPSFTQLLVGVERAYREQ-------GAAIREQNRS 171

Query: 138 ALSASAS-SNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
            L A A    +   ELP+  L   A  QL+ S+D+  GGFG APKFP   +++++L    
Sbjct: 172 LLEALAGLEPQGGAELPEAGLLEAAFHQLALSFDAEHGGFGRAPKFPHATDLELLLRRQA 231

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           +L   G   +      M  FTL+ M +GG+ D +GGGF RYSVD+ W +PHFEKMLYD G
Sbjct: 232 RLAANGGDPD-PRPLHMAGFTLERMIRGGLTDQLGGGFCRYSVDDEWMIPHFEKMLYDNG 290

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            L  +  DAFS T +  +        D++ R+M  P G  +S  DADS   EG     EG
Sbjct: 291 PLLALCCDAFSATGESIFRDAALATADWVMREMQSPEGGYYSTLDADS---EG----HEG 343

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FYVW    V      HA L    Y L       +  +  P N F+G+  L      + +
Sbjct: 344 TFYVWDRDAV------HARLSAAEYPL----FAAVYGLDRPPN-FEGRWHLHGYRTPTQA 392

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A  LG+ L +   +L   R  LF  R +R  P  D+K++ +WN L+I   ARA+++L   
Sbjct: 393 AESLGLNLPQAEALLASARATLFSAREQRVHPGRDEKILTAWNALMIKGMARAARVL--- 449

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        DR +Y+E AE A +FIR  L+ +   RL  + ++G +    +LDDYA
Sbjct: 450 -------------DRPDYLESAEQALAFIRSTLWHDG--RLLATCKDGVAHLNAYLDDYA 494

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            LI  LL+L +    +  L +A+EL     + F D E GG++ T      ++ R K   D
Sbjct: 495 NLIDALLELLQVRWSSADLAFAVELAEVLLDEFHDAERGGFWFTGRSHEPLIHRAKPLGD 554

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAAD 614
            + P+GN V+ + L RL  ++   +   Y + A+ +L +    ++ M  A   L+    D
Sbjct: 555 DSMPAGNGVAALALQRLGHLIGEVR---YLEAADGTLRLAAESMRRMPHAHASLLMALDD 611

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
            L  P     +LV   +    E     A   Y  ++ V  I P+  + +          A
Sbjct: 612 WLDPPE----MLVIRAADDRLETWQRLAQQGYRPHRLVFAI-PSGIDAL------PGTLA 660

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVT 701
           SM       ++ +   C+   C PPV 
Sbjct: 661 SMR----GGERPLIYRCRGTHCEPPVA 683


>gi|386842157|ref|YP_006247215.1| hypothetical protein SHJG_6075 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|374102458|gb|AEY91342.1| hypothetical protein SHJG_6075 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451795451|gb|AGF65500.1| hypothetical protein SHJGH_5837 [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 677

 Score =  332 bits (850), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 235/703 (33%), Positives = 334/703 (47%), Gaps = 88/703 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED   A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDRATADYLNEHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-A 138
           VFL+PD +P   GTYFPP  ++G P F+ +L  V+ AW  +RD +A      +  L++  
Sbjct: 108 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVQQAWTTRRDEVADVAGKIVRDLAQRE 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +   A+      EL Q  L      L++ YD + GGFG APKFP  + ++ +L H  +  
Sbjct: 168 IVRQAAEAPGEQELAQALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGAEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  D   +L R++    G   SA DADS   +G+ R  EGA+Y
Sbjct: 276 RVYTHLWRATGSDLARRVALDTAQFLLRELRTAEGGFASALDADS--DDGSGRHVEGAYY 333

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
           VW   ++ + LG+ A L  +++ +   G  +            G++VL +   +    A 
Sbjct: 334 VWRPDQLREALGDDAELAAQYFGVTDEGTFE-----------HGQSVLQLPQTEGVFEAE 382

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K           +   + +L   R++RP P  DDKV+ +WNGL I++ A           
Sbjct: 383 K-----------IASVKDRLLAARARRPAPGRDDKVVAAWNGLAIAALAETGACF----- 426

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKAPGFLDDYA 495
                      DR +  E A +AA  + R   DE     R     R GP+   G L+DYA
Sbjct: 427 -----------DRPDLTEAAVAAADLLVRVHLDEHGRLARTSKDGRVGPNA--GVLEDYA 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +     F D E G  ++T  +   ++ R ++  D
Sbjct: 474 DVAEGFLALASVTGEGVWLDFAGLLLDHVLARFTDTETGALYDTASDAEQLIRRPQDPTD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A PSG + +   L+   S  A + S+ +R  AE +L V +T    +   VP      + 
Sbjct: 534 NAAPSGWTAAAGALL---SYAAHTGSEPHRAAAERALGVVKT----LGPRVPRFIGWGLA 586

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFWE 667
            A  +L  P  + V +VG       +   AA H +  L+     V+     D+EE     
Sbjct: 587 VAEALLDGP--REVAVVGPAPD---DERTAALHRTALLSTAPGAVVACGTPDSEEFPL-- 639

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   +A          A VC+ F C  PVTDP +L   L
Sbjct: 640 --------LADRTLVEGAPTAYVCRGFVCDLPVTDPDALRTKL 674


>gi|225871957|ref|YP_002753411.1| hypothetical protein ACP_0267 [Acidobacterium capsulatum ATCC
           51196]
 gi|225793798|gb|ACO33888.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
           51196]
          Length = 702

 Score =  332 bits (850), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 218/694 (31%), Positives = 337/694 (48%), Gaps = 61/694 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM+ ES+E+  +A ++N+ F++IKVDR+ERPDVD  Y   VQA+ G GGWPL+  
Sbjct: 53  CHWCHVMDRESYENPAIAAVINEHFIAIKVDRDERPDVDSRYQAAVQAMAGQGGWPLTAI 112

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+P+ KP  GGTYFPPED+YGRPGF+ +LR + D W  +R    ++    +  +    S 
Sbjct: 113 LTPEGKPFFGGTYFPPEDRYGRPGFERVLRSLADVWQNRRGEALETANSVLGAIEHGESF 172

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
           +  S  L   + +  +    +Q    +D+R+GGFGS PKFP P  + M++       DT 
Sbjct: 173 AGRSGTLSISIVEKLVSSAVQQ----FDARYGGFGSQPKFPHPSAMDMLI-------DTA 221

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                   ++    TL+ MA GG++D + GGFHRYSVDE+W VPHFEKMLYD   L + Y
Sbjct: 222 SRTGNERVREAATVTLRKMAAGGVYDQLAGGFHRYSVDEQWIVPHFEKMLYDNAGLLSNY 281

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           + AF    +  ++ +  DI+ ++   +     G  ++++DAD           +G ++ W
Sbjct: 282 VHAFQSFVEPEFAAVAVDIIRWMDECLSDRERGGFYASQDAD------INLDDDGDYFTW 335

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  E   +L    +     Y+       D+  M D H+  + KNVL      +  A+ L 
Sbjct: 336 TLAEARAVLSNEELAVAASYF-------DIGEMGDMHHNPQ-KNVLHSKRTLAEVAAALS 387

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +  E+    L   + KL   R +RP P +D  +  SWN L IS++ +A+++L        
Sbjct: 388 LSAEEAQKKLDSAKSKLLAARRERPTPFIDTTIYTSWNALAISAYLQAARVLDLPHAR-- 445

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-----GFLDDYA 495
             F ++  DR             I R  + E T  L H       K+P     G LDDYA
Sbjct: 446 -TFALLTLDR-------------ILREAWSE-TSGLSHVVAYADGKSPAAWVAGVLDDYA 490

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP---SVLLRVKE 552
           FL    L+ +E     K+   A ++ +     F D+  G +F+T  +     ++  R K 
Sbjct: 491 FLTDACLEAWESTGDRKYYDAAAQIADAMIARFYDQTSGAFFDTEIQGSKLGALAARRKP 550

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D   P+GN  +   L+RLAS+    +   + + AE +L  F   ++   +       A
Sbjct: 551 LQDTPTPAGNPAAASALLRLASLSGEKR---HAELAEDTLEAFAGVVEHFGLYAGTYGLA 607

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
                +P  + +++ G         + A A A Y +NK+V+  D A        E     
Sbjct: 608 LLRFLLPPAQ-IIVAGDGPRA--RELAAMAVARYAVNKSVVQFDAAQLAV----ENLPPA 660

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
            A    +     + VALVCQ  SC PP+T+P +L
Sbjct: 661 LAETLPHLSGFTEPVALVCQGMSCQPPITEPQAL 694


>gi|336427724|ref|ZP_08607719.1| hypothetical protein HMPREF0994_03725 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336008885|gb|EGN38889.1| hypothetical protein HMPREF0994_03725 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 655

 Score =  331 bits (849), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 198/560 (35%), Positives = 291/560 (51%), Gaps = 59/560 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+  +A LLN  +V IKVDREERPD+D VYM+  QA+ G GGWPL++ ++PD +
Sbjct: 1   MERESFENAAIAGLLNREYVCIKVDREERPDIDSVYMSVCQAMTGQGGWPLTIIMTPDCR 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFPP  +YG  G + +L      W  +++ +  S        +E ++A     +
Sbjct: 61  PFFAGTYFPPTARYGSVGLQELLTAAAAQWKLEKEKILDS--------AEQITAYVKEQE 112

Query: 148 LPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
            P   E  ++ + L   Q + ++D + GGFG APKFP P  +  +L       + G    
Sbjct: 113 QPTAAEPGKDMVHLAFRQFADNFDKKNGGFGGAPKFPTPHNLMFLL-------EYGIREN 165

Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
           + E   M   TL  M +GGI DH+GGGF RYS D+RW VPHFEKMLYD   LA  YL+A+
Sbjct: 166 SREALDMAETTLTQMYRGGIFDHIGGGFSRYSTDDRWLVPHFEKMLYDNALLAIAYLEAY 225

Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
           S T    Y  + + +L Y+ R++    G  +  +DADS          EG +YV+T +E+
Sbjct: 226 SRTGRKLYECVAKKVLRYVERELTDAQGGFYCGQDADSDGV-------EGKYYVFTQEEI 278

Query: 326 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGK---NVLIELNDSSASASKLGM 381
             ILG E    F   Y +   GN            F+GK   N+L   +       + G 
Sbjct: 279 RRILGKEEGEAFCVRYGITANGN------------FEGKSIPNLLGNKDYERICEEQCGC 326

Query: 382 PLEKYLNILG-ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
               +++ +G E  +KL++ R +R   H DDK++VSWNG +I ++A+A  +         
Sbjct: 327 DGGGHMDGIGREAFQKLYEYRIRRTPLHKDDKILVSWNGWMICAYAKAGAVFGD------ 380

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                     K Y+++A  A  F+R++L  +   RL   +R+G +   G LDDY   I  
Sbjct: 381 ----------KRYVDMAVRAEGFVRQNLMKD--GRLLVRYRDGDAAGEGKLDDYTCYILA 428

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LL+LY+    T +L  A        E F D+E GG++    +   + +R KE++DGA PS
Sbjct: 429 LLELYQVTFQTAYLEQAARCAEILLEQFFDQEKGGFYLYAEDGEQLFMRTKENYDGAMPS 488

Query: 561 GNSVSVINLVRLASIVAGSK 580
           GNSV    L +LA I   +K
Sbjct: 489 GNSVGARVLHKLAQITGETK 508


>gi|329935309|ref|ZP_08285275.1| hypothetical protein SGM_6792 [Streptomyces griseoaurantiacus M045]
 gi|329305132|gb|EGG48991.1| hypothetical protein SGM_6792 [Streptomyces griseoaurantiacus M045]
          Length = 675

 Score =  331 bits (849), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 235/700 (33%), Positives = 338/700 (48%), Gaps = 83/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP+S
Sbjct: 48  SACHWCHVMAHESFEDEATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMS 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEA 138
           VFL+P+ +P   GTYFPPE ++G P F+ IL+ V  AW ++R+ +A  +G    +     
Sbjct: 108 VFLTPEAEPFYFGTYFPPEPRHGSPSFRQILQGVHQAWTERREEVADVAGKITRDLAGRE 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L+   +      E+ Q  L      L++ YD+R GGFG APKFP  + ++ +L H  +  
Sbjct: 168 LAHGGAQVPGEQEMAQALL-----GLTREYDARRGGFGGAPKFPPSMVLEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GG++D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGSEG----ALQMAADTCERMARGGLYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  +++ R++    G   SA DADS   +G  R  EGA+Y
Sbjct: 276 RVYAHLWRATGSDLARRVALETAEFMVRELGTAEGGFASALDADS--DDGTGRHVEGAYY 333

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
           VWT +++ ++LGE A L   ++ +   G  +            G++VL +   D    A 
Sbjct: 334 VWTPEQLAEVLGEDAGLAARYFGVTEEGTFE-----------HGQSVLQLPQTDGVFDAE 382

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +           +   R +L   RS RP P  DDKV+ +WNGL I++ A           
Sbjct: 383 R-----------VASVRERLLGARSARPAPGRDDKVVAAWNGLAIAALAETGAYF----- 426

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
                      DR + ++ A  AA  + R   DE   RL  + ++G + A  G L+DYA 
Sbjct: 427 -----------DRPDLVDAAVRAADLLVRLHLDEHG-RLTRTSKDGRAGAHAGVLEDYAD 474

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +  G L L +      WL +A  L       F   E G  F+T  +   ++ R ++  D 
Sbjct: 475 VAEGFLALAQVTGEGVWLEFAGLLLGHVRTRFTGEE-GTLFDTASDAEKLIRRPQDPTDN 533

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMC 610
           A PSG + +   L+   S  A + S+ +R  AE +L V  T      R     +AV    
Sbjct: 534 ATPSGWTAAAGALL---SYAAHTGSEAHRTAAEQALGVVRTLGPRAPRFVGWGLAV---- 586

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A  +L  P  + V +VG   S+D  +  A       L++T + +  A    +    E +
Sbjct: 587 -AEALLDGP--REVAVVG--PSLDDPDTSA-------LHRTAL-LGTAPGAVVAAGAEGS 633

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                +A          A VC+NF C  P +D   L   L
Sbjct: 634 EEFPLLADRPLRRGAPAAYVCRNFVCEAPTSDAEELRAAL 673


>gi|302553816|ref|ZP_07306158.1| spermatogenesis-associated protein 20 [Streptomyces
           viridochromogenes DSM 40736]
 gi|302471434|gb|EFL34527.1| spermatogenesis-associated protein 20 [Streptomyces
           viridochromogenes DSM 40736]
          Length = 677

 Score =  331 bits (849), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 235/701 (33%), Positives = 343/701 (48%), Gaps = 83/701 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A+ LN+ +VS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDQQTAEYLNEHYVSVKVDREERPDVDAVYMEAVQAATGHGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P   GTYFPP  + G P F+ +L  V+ AWD++RD + +     +  L+   
Sbjct: 108 VFLTPEAEPFYFGTYFPPAPRQGMPSFRQVLEGVRQAWDERRDEVTEVAGKIVRDLA-GR 166

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
             S   ++ P   EL Q  L      L++ YD + GGFG APKFP  + ++ +L H  + 
Sbjct: 167 EISYGDDQAPGEQELAQALL-----ALTREYDPQRGGFGGAPKFPPSMALEFLLRHHAR- 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 221 --TGAEG----ALQMARDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALL 274

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY   +  T       +  +  D++ R++    G   SA DADS   +G  +  EGA+
Sbjct: 275 CRVYAHLWRATGSELARRVALETADFMVRELRTTEGGFASALDADS--DDGTGKHVEGAY 332

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSAS 375
           YVWT  ++ ++LGE  A L  +++ +   G  +            G++VL +   DS   
Sbjct: 333 YVWTPGQLREVLGEQDAELAAQYFGVTEEGTFE-----------HGQSVLQLPQQDSLFD 381

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A K           +   R +L   R++RP P  DDKV+ +WNGL I++ A         
Sbjct: 382 AGK-----------IASVRERLLAKRAERPAPGRDDKVVAAWNGLAIAALAET------- 423

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDY 494
              A F+ P              +A   +R HL DEQ  RL  + ++G + A  G L+DY
Sbjct: 424 --GAYFDRP------DLVEAAVAAADLLVRLHL-DEQA-RLTRTSKDGHAGANAGVLEDY 473

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +  G L L        WL +A  L +     F D E G  F+T  +   ++ R ++  
Sbjct: 474 ADVAEGFLALASVTGEGVWLQFAGFLLDHVLVRFTDAESGALFDTAADAERLIRRPQDPT 533

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC---C 611
           D A PSG + +   L+   S  A + S+ +R  A  +L V    +K +   VP       
Sbjct: 534 DNAAPSGWTAAAGALL---SYAAHTGSEPHRTAARKALGV----VKALGPRVPRFIGWGL 586

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY--DLNKTVIHIDPADTEEMDFWEEH 669
           AA   ++   + V +VG   S+D E   A  H +        V+ +    +EE       
Sbjct: 587 AAAEAALDGPREVAIVG--PSLDHEGTRALHHTALLGTAPGAVVAVGTPGSEEFPL---- 640

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +A       +  A VC+NF+C  P T+   L  +L
Sbjct: 641 ------LADRPLVGGEPAAYVCRNFTCDVPTTEVDRLRAVL 675


>gi|332663431|ref|YP_004446219.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332332245|gb|AEE49346.1| protein of unknown function DUF255 [Haliscomenobacter hydrossis DSM
           1100]
          Length = 686

 Score =  331 bits (848), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 223/693 (32%), Positives = 342/693 (49%), Gaps = 74/693 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFE+  VA ++N+ F++IKVDREERPDVD +YM     + G GGWPL+
Sbjct: 47  STCHWCHVMERESFENADVAAIMNENFINIKVDREERPDVDHIYMEACVIMTGSGGWPLN 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            FL+PD +P + GTY+PP   + RP +  +L  V D +  +R  + +  +  I  + +  
Sbjct: 107 CFLTPDGRPFLAGTYYPPLAAFNRPSWPQLLHHVTDVYRNRRKDVEEQASRLIGNIEQTN 166

Query: 140 SASASSN--KLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHS 194
           S   + N  +L    P N + L    + L K++D + GGFG+APKFP  + +Q +L YH 
Sbjct: 167 SYFLAKNEAELSGINPFNPVVLHNVFQTLKKNFDLQDGGFGAAPKFPGSMALQFLLDYHH 226

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
                   +GE  E  +  +F+L  M +GGI+D +GGGF RY+ D  W VPHFEKMLYD 
Sbjct: 227 -------FTGE-KEALEHTVFSLDRMIRGGIYDQLGGGFARYATDRAWLVPHFEKMLYDN 278

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
             L  +  D + +T+   +     + L ++ R+M    G  +SA DADS   EG    +E
Sbjct: 279 ALLVGLLSDTYKVTQQPIFRRAIEETLGWIEREMTSADGGFYSALDADS---EG----EE 331

Query: 315 GAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           G FYVW+++E+  +    E A LF  +Y ++P GN            ++G N+L      
Sbjct: 332 GKFYVWSAEEIAAVCPSVEDAALFSSYYGVEPLGN------------WEGHNILWCPLPL 379

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           +A A + G   E         R +L  VR +R RP LDDK+++SWN L+ S++A+A   L
Sbjct: 380 AAFAVEAGQSPEALEARFAPIRTQLMAVRDERIRPGLDDKILLSWNALMASAYAKAYTAL 439

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR----NGPSKAP 488
            +E                 Y   A     F+      ++   L H+++       ++  
Sbjct: 440 GNET----------------YKVAALRNVDFLLEKFKRDEIGGLYHTYKKVKDQDQAQYA 483

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
            FLDDYA+ I+ L+D+YE    T++L  A +L       FLD     ++ T+ +   V+L
Sbjct: 484 AFLDDYAYFIAALIDVYEISLETRYLRQAADLTEYTLAHFLDDTRNLFYFTSKDQQDVVL 543

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R  E +D A PSGNS  V NL RL  +    +   Y + A   L    + L+    +   
Sbjct: 544 RKIELYDNALPSGNSSMVQNLQRLGLLWGKMQ---YIELAAAMLKEMLSGLERYPSSFAR 600

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
              A   +  P  + V +VG ++    E +      +Y  NK ++    AD         
Sbjct: 601 WANALIYMVYPMHE-VAIVGPEA----EELSRELQKNYIPNKVLMGALEAD--------- 646

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
              +   +     +       VCQN++C  PV+
Sbjct: 647 ---DTFPLLAGRQTQGMTQIFVCQNYTCQLPVS 676


>gi|323693373|ref|ZP_08107588.1| hypothetical protein HMPREF9475_02451 [Clostridium symbiosum
           WAL-14673]
 gi|323502578|gb|EGB18425.1| hypothetical protein HMPREF9475_02451 [Clostridium symbiosum
           WAL-14673]
          Length = 639

 Score =  331 bits (848), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 206/561 (36%), Positives = 290/561 (51%), Gaps = 57/561 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+  +A+LLN  ++ +KVDREERPD+D VYM+  QA+ G GGWPL++ ++PD +
Sbjct: 1   MERESFENREIAQLLNREYICVKVDREERPDIDSVYMSVCQAMNGQGGWPLTIIMTPDGR 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFPP  +YGR G   +L      W +KR+ L  S       L E    + SS  
Sbjct: 61  PFFSGTYFPPRARYGRIGLDGLLAAAAKQWKEKREKLLDSADQIEAFLKEQEQLTVSSEP 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
            P E+ + A R    Q + S+D + GGFG APKFP P  +  ++       + G   +  
Sbjct: 121 GP-EIVRQAYR----QFAGSFDKQNGGFGGAPKFPAPHNLMFLM-------EYGIREDRP 168

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           E   M   TL  M +GGI DH+GGGF RYS DERW VPHFEKMLYD   L   Y+ A++L
Sbjct: 169 EALSMAETTLTQMYRGGIFDHIGGGFSRYSTDERWLVPHFEKMLYDNALLVMAYVKAYAL 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T    Y      +L Y+  ++  P G  +  +DADS          EG +YV+T +E+ +
Sbjct: 229 TGRKLYGCAAEMVLKYIEAELTDPQGGFYCGQDADSDGV-------EGKYYVFTPEEINE 281

Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
           ILG +    F  +Y +   GN            F+GK++   L + +  +     P  + 
Sbjct: 282 ILGTKQGKAFCRNYGITGPGN------------FEGKSIPNLLGNEAYESVCEERPGAEE 329

Query: 387 LNILGECRR-------KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +   + RR       KL+  R KR R H DDK++VSWNG +IS+ A+A  +L       
Sbjct: 330 EDGRSKSRREADEVYEKLYAYRLKRTRLHKDDKILVSWNGWMISACAKAGAVL------- 382

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                      K+Y+++A  A  FIR  L   +  RL   +R+G +   G LDDYA    
Sbjct: 383 ---------GEKKYVDMAVRAEEFIRTALV--RNGRLLVRYRDGEAAGEGKLDDYACYSL 431

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LL+LY     T +L  A    +   E FLDRE GG+F    +   +++R KE +DGA P
Sbjct: 432 ALLELYRVTFRTDYLDRAAGWADKMVEQFLDRERGGFFLNAKDAERLIVRTKETYDGAMP 491

Query: 560 SGNSVSVINLVRLASIVAGSK 580
           SGNS +   L  LA +   +K
Sbjct: 492 SGNSAAARVLQHLAQLTGEAK 512


>gi|387790403|ref|YP_006255468.1| protein containing a thioredoxin domain [Solitalea canadensis DSM
           3403]
 gi|379653236|gb|AFD06292.1| protein containing a thioredoxin domain [Solitalea canadensis DSM
           3403]
          Length = 674

 Score =  331 bits (848), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 194/574 (33%), Positives = 295/574 (51%), Gaps = 73/574 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE VA ++N+ FV IKVDREERPD+D+VYM  VQ + GGGGWPL+
Sbjct: 51  SACHWCHVMEHESFEDEQVASIMNEHFVCIKVDREERPDIDQVYMNAVQLMTGGGGWPLN 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYG-----RPGFKTILRKVKDAWDKKRDMLAQSGA--FAI 132
            F  PD +P  GGTYF  +D        +  F    ++ ++  D+    + QS    F  
Sbjct: 111 CFCLPDQRPFYGGTYFRKQDWMRLLNDLQAFFVNKPKEAEEYADRLHKGIKQSDVVGFVA 170

Query: 133 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
           EQ                E   N L+   +  ++ +D   GG+  APKFP P   Q +L 
Sbjct: 171 EQ---------------KEYSVNTLKEIVDPWTRYFDYSDGGYNRAPKFPLPNNFQFLLR 215

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
           +++  +D   +        +   TL  MA GGI+D +GGGF RYSVD  W VPHFEKMLY
Sbjct: 216 YARLAKDQASN-------VITRLTLDKMAYGGIYDQLGGGFARYSVDSVWLVPHFEKMLY 268

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D GQL ++Y +A+  +  + Y  +  + L+++RR++  P G  +SA DADS   EG    
Sbjct: 269 DNGQLVSLYAEAYQYSGSLLYKNVVAETLEFIRRELTSPEGGFYSALDADS---EGV--- 322

Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
            EG FY WT  E++ IL +   +F  +Y +   GN            ++  N+L    D 
Sbjct: 323 -EGKFYCWTRDELKGILSDDEEIFSTYYNVTEEGN------------WEETNILHRKEDD 369

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
              A+  G+  ++   I+  C+ KL  VR  R RP LDDK++ SWNG+++  +  A ++ 
Sbjct: 370 KVIANAHGLSEDELTVIIDRCKAKLMKVREHRVRPGLDDKILTSWNGIMLKGYIDAYRVF 429

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
           + +                EY++ A + ASF+  +L  +     + +++NG +    FLD
Sbjct: 430 RVD----------------EYLQTALTNASFLLENL-KQADGSWKRNYKNGNATINAFLD 472

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DY  +    ++LY+     +WL  A  + +   E F D++ G ++ T+  D  ++ R  E
Sbjct: 473 DYVLVAEAFIELYQATFDEQWLAEAKAIVDYCIEHFYDQQSGMFYYTSNTDEQLITRKFE 532

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
             D   PS NSV    L+++ +        YY+Q
Sbjct: 533 LMDSVIPSSNSVLARVLLKIGT--------YYQQ 558


>gi|373956291|ref|ZP_09616251.1| protein of unknown function DUF255 [Mucilaginibacter paludis DSM
           18603]
 gi|373892891|gb|EHQ28788.1| protein of unknown function DUF255 [Mucilaginibacter paludis DSM
           18603]
          Length = 718

 Score =  330 bits (847), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 201/560 (35%), Positives = 295/560 (52%), Gaps = 57/560 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE VA+++N+ FV IKVDREERPD+D++YM+ VQ + G GGWPL+
Sbjct: 92  SACHWCHVMENESFEDEQVAEIMNEHFVCIKVDREERPDIDQIYMSAVQLMTGRGGWPLN 151

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
               PD +P+ GGTYF   D      +  +L  + + W++K D   ++  +A+ +L+E +
Sbjct: 152 CVCLPDQRPIYGGTYFRKTD------WMALLFNLANFWEQKPD---EAKEYAV-KLTEGI 201

Query: 140 SASASSNKLPDELPQNA--LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
               +   + +++      L    +   +SYD + GG   APKFP P   Q ++ ++  +
Sbjct: 202 HQYENIGFVNEQMENTPADLEAIVKPWKQSYDFKEGGLNRAPKFPMPNNWQFLMRYAYLM 261

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +D        E   +V  TL+ MAKGGI+DH+GGGF RYSVD  WHVPHFEKMLYD  QL
Sbjct: 262 QD-------EETNVIVRLTLEKMAKGGIYDHIGGGFARYSVDGHWHVPHFEKMLYDNAQL 314

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +Y +AF+   D  Y  +  + + +++R++  P    +SA DADS   EG     EG F
Sbjct: 315 IGLYSEAFTWCGDELYKKVVAETIAFIQRELTSPENGFYSALDADS---EGV----EGKF 367

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           Y +T  EVE ILG+ A LF  +Y +   GN           E +  N+    +D +  A 
Sbjct: 368 YTFTLAEVEAILGDDAGLFAIYYNVTNEGNW----------EEEHTNIFFRRDDDAVLAE 417

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           KLG+P +  ++ +   R ++ + R+KR  P LD K++ SWN L++     A +       
Sbjct: 418 KLGIPADALVDKIAGLRNQVLEARAKRVLPGLDYKILTSWNALMLKGLCDAYRAF----- 472

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE--QTHRLQHSFRNGPSK--APGFLDD 493
                      D   Y+E+A   A FI+ +L ++  Q  R+ ++   G  K  A  FLDD
Sbjct: 473 -----------DEPAYLELALKNAHFIKDNLINKNNQLSRV-YAKPTGDEKLDAIAFLDD 520

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA LI   + LYE      WL  A  L     + F D   G +F T      ++ R  E 
Sbjct: 521 YALLIDAFIALYEVTFDEAWLHQAKALTEHTLDHFYDNATGMFFYTPDYGEQLIARKFEV 580

Query: 554 HDGAEPSGNSVSVINLVRLA 573
            D   PS NSV   N  +L+
Sbjct: 581 MDNVMPSSNSVMARNFKKLS 600


>gi|282897059|ref|ZP_06305061.1| Protein of unknown function DUF255 [Raphidiopsis brookii D9]
 gi|281197711|gb|EFA72605.1| Protein of unknown function DUF255 [Raphidiopsis brookii D9]
          Length = 657

 Score =  330 bits (847), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 228/717 (31%), Positives = 353/717 (49%), Gaps = 108/717 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 16  SSCHWCTVMEGEAFSDLAIAEYMNANFIPIKVDREERPDIDSIYMQSLQMMTGQGGWPLN 75

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
            FLSP DL P   GTYFP   +YGRPGF  +L+ ++  +D +++   Q  A  +E L   
Sbjct: 76  AFLSPDDLVPFYAGTYFPVSPRYGRPGFLEVLQAIRHYYDHQKEDFRQRKASILESL--- 132

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           LS++   N    +   +      +Q  ++             FP     Q++L  ++   
Sbjct: 133 LSSTVLQNHGSGQFAHSQFHRFLKQGWETAIGVITPRQMGNSFPMIPYCQLVLQGTRF-- 190

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                  A++G +M       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+ 
Sbjct: 191 ---NYPSANDGLEMATQRGLDLALGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGQIV 247

Query: 259 NVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
               + +S   +++ +       + +L R+MI P G  ++A+DADS         +EGAF
Sbjct: 248 EYLANLWSAGVEELAFKRAVAGTVSWLEREMISPTGYFYAAQDADSFNYSTDMEPEEGAF 307

Query: 318 YVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW+  E++++L +  +L  KEH+ +   GN            F+GKNVL  L     SA
Sbjct: 308 YVWSYGELQELLSDQELLELKEHFSVSLEGN------------FEGKNVLQRL-----SA 350

Query: 377 SKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHLDDKVIVSWN 418
            +LG  LE  L  L   R              R  ++ ++     R  P  D K+IV+WN
Sbjct: 351 GELGSSLELILGRLFLSRYGQTAETLTIFPPARNNYEAKTNPWHGRIPPVTDTKMIVAWN 410

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR-RHLYDEQTHRLQ 477
            L+IS  ARAS++ +                +  Y+++A  A  FI  R   + + HRL 
Sbjct: 411 SLMISGLARASQVFQ----------------QPSYLKLAVKATRFILDRQFVNGRFHRLN 454

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDREGGGY 536
           +   +G        +DYA  I  LLDL++  SG + WL  AI LQ+  +E  L  E GGY
Sbjct: 455 Y---DGEPTVLAQSEDYALFIKALLDLHQADSGSSSWLEQAIALQDEFNEFLLSVELGGY 511

Query: 537 FNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           FNT+ ++   +++R +   D A PS N V++ NL++L+ +   + + YY   AE +L  F
Sbjct: 512 FNTSSDNSQDLIIRERNFVDNATPSANGVAIANLIKLSLL---TDNLYYLDLAESALKAF 568

Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
            T ++    + P +  A+D       ++  LV  +S++D   +LA+ +    +   +  +
Sbjct: 569 STMIEKSPQSCPSLLIASDWY-----RNSTLV--RSNIDNIKILASQYLPTTVFDVISKL 621

Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
            P +T                          + LVCQ   C P    P+  + LL +
Sbjct: 622 -PTNT--------------------------IGLVCQGLKCLPA---PVDFDELLAQ 648


>gi|330465851|ref|YP_004403594.1| n-acylglucosamine 2-epimerase [Verrucosispora maris AB-18-032]
 gi|328808822|gb|AEB42994.1| n-acylglucosamine 2-epimerase [Verrucosispora maris AB-18-032]
          Length = 679

 Score =  330 bits (846), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 224/701 (31%), Positives = 340/701 (48%), Gaps = 78/701 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE+EGV +LLN+ FVSIKVDREERPDVD VYMT  QA+ G GGWP++
Sbjct: 47  SACHWCHVMAHESFENEGVGRLLNEGFVSIKVDREERPDVDAVYMTATQAMTGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF +PD  P   GTYFP      R  F  +L  V  AW ++RD + + GA  +E +  A 
Sbjct: 107 VFATPDGTPFYCGTYFP------RQNFVRLLESVGTAWREQRDAVLRQGAAVVEAVGGAQ 160

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +    +  L  +L    L   A QL+  YD   GGFG APKFP  + +  +L H ++   
Sbjct: 161 AVGGPTAPLTADL----LDAAATQLAGEYDETNGGFGGAPKFPPHLNLLFLLRHHQR--- 213

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG    + +  +MV  T + MA+GGIHD + GGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 214 TG----SPQSLEMVRHTCEAMARGGIHDQLAGGFARYSVDGHWTVPHFEKMLYDNALLLR 269

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   + LT D     + RDI  +L  ++  PG    SA DAD+   EG T       YV
Sbjct: 270 VYTQLWRLTGDALALRVARDIARFLADELHRPGQGFASALDADTEGVEGLT-------YV 322

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT  ++ ++LG+    +            DL  +++      G +VL    D   +   +
Sbjct: 323 WTPAQLVEVLGDEDGRWA----------ADLFAVTESGTFEHGTSVLKLARDVDDADPAV 372

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK------ 433
               E++ +++    R+L   R  RP+P  DDKV+ +WNGL +++ A   ++++      
Sbjct: 373 ---RERWQDVV----RRLLAARDTRPQPARDDKVVAAWNGLAVTALAEFVRLVETSGRIG 425

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLD 492
           +E E+ +     + +D      + ++A    R H+ D    RL+ + R+G    P G L+
Sbjct: 426 TEGEANLLEGVTIVADGA----MRDTAEYLARVHMVD---GRLRRASRDGRVGEPAGVLE 478

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DY  +      +++     +WL WA +L +T    F    GG +++T  +   ++ R  +
Sbjct: 479 DYGCVAEAFCAMHQVTGEGRWLEWAGQLLDTALAHFA-APGGAFYDTADDAEQLVARPAD 537

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D A PSG S     LV  +++   +   +YR+ AE +L+     +   A         
Sbjct: 538 PTDNATPSGRSAIAAALVAYSAL---TGQTHYREVAEAALSTVAPIVGRHARFTGYAATV 594

Query: 613 AD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
            + +LS P    VV          + ++AAAH        ++   P             +
Sbjct: 595 GEALLSGPYEIAVVTADPAG----DPLVAAAHRHAPPGAVIVAGQP-----------DQA 639

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
               +A       +  A VC+ F C  PV    ++E+L+ +
Sbjct: 640 GVPLLADRPLLDGESAAYVCRGFVCQRPVD---TVEDLVAQ 677


>gi|110635801|ref|YP_676009.1| hypothetical protein Meso_3473 [Chelativorans sp. BNC1]
 gi|110286785|gb|ABG64844.1| protein of unknown function DUF255 [Chelativorans sp. BNC1]
          Length = 676

 Score =  330 bits (846), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 223/611 (36%), Positives = 304/611 (49%), Gaps = 79/611 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  E FED  VA+L+N  FV+IKVDREERPD+D++YMT + A+   GGWPL++
Sbjct: 53  ACHWCHVMAHECFEDNEVAELMNSLFVNIKVDREERPDIDQIYMTALSAMGEQGGWPLTM 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ KP  GGTYFP   +YGRPGF  +L+ V  AW  K D L +S       +   L+
Sbjct: 113 FLTPEAKPFWGGTYFPKRSRYGRPGFIDVLKAVHSAWQTKEDELLRSADTLSIHVRTHLA 172

Query: 141 A--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
                +SN++P       LR  AE++   +D + GG   APKFP    + ++  +   LE
Sbjct: 173 PMQGTTSNEVP-------LRALAEKIRAVFDPQLGGLRGAPKFPNAPFLDLLWLN--WLE 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           +  +S      +  VL TL+ M  GGI+DHVGGG  RYSVD +W VPHFEKMLYD  QL 
Sbjct: 224 NGAESD-----RDTVLLTLRSMLAGGIYDHVGGGLARYSVDAQWLVPHFEKMLYDNAQLI 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            +   A+  T D  +     D + +L R+M   GG   S+ DADS   EG    +EG FY
Sbjct: 279 RLCSYAYGGTHDRLFRVRIEDTVKWLLREMTVEGGGFASSLDADS---EG----EEGKFY 331

Query: 319 VWTSKEVEDILG--EHAILFKEHYYLKPT---GNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           +WT  E+ED+LG  +   L   +    P    GN  L R   P            L+DSS
Sbjct: 332 LWTRAEIEDVLGVGDARELLAIYDLANPEEWEGNPILHRRRHPE----------VLDDSS 381

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
                     E+ L  L +   +L   R  R RP  DDKV+V WNGL I++ A A +   
Sbjct: 382 ----------EQRLRTLLD---RLMAAREARTRPGRDDKVLVDWNGLAIAAIAVAGRQFA 428

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                           R E++E A  A  F+   L   +  RL HS R      P    D
Sbjct: 429 ----------------RPEWIEAAARAFRFV---LESMEEGRLPHSIRGEKRLFPALSSD 469

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA +IS  + LY       ++  A +  +  D  +LD  G GYF T  +     +R++ D
Sbjct: 470 YAAMISAAIALYGATHDDSYVDQARQWLDKLDAWYLDDAGSGYFLTASDSADTPMRIRGD 529

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE---TRLKDMAMAVPLMC 610
            D   PS  +  V  LV LA+ V+GS   Y     +H + V E    R ++ A     + 
Sbjct: 530 MDDPIPSATAQIVTALVHLAA-VSGSHELY-----QHGVRVSEAALARAQNQAYGQLGII 583

Query: 611 CAADMLSVPSR 621
           CAA +   P +
Sbjct: 584 CAAALAQRPMK 594


>gi|323484029|ref|ZP_08089400.1| hypothetical protein HMPREF9474_01149 [Clostridium symbiosum
           WAL-14163]
 gi|323402646|gb|EGA94973.1| hypothetical protein HMPREF9474_01149 [Clostridium symbiosum
           WAL-14163]
          Length = 639

 Score =  330 bits (846), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 206/561 (36%), Positives = 289/561 (51%), Gaps = 57/561 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+  +A+LLN  ++ +KVDREERPD+D VYM+  QA+ G GGWPL++ ++PD +
Sbjct: 1   MERESFENREIAQLLNREYICVKVDREERPDIDSVYMSVCQAMNGQGGWPLTIIMTPDGR 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFPP  +YGR G   +L      W +KR+ L  S       L E    + SS  
Sbjct: 61  PFFSGTYFPPRARYGRIGLDGLLAAAAKQWKEKREKLLDSADQIEAFLKEQEQLTVSSEP 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
            P E+ + A R    Q + S+D + GGFG APKFP P  +  ++       + G   +  
Sbjct: 121 GP-EIVRQAYR----QFAGSFDKQNGGFGGAPKFPAPHNLMFLM-------EYGIREDRP 168

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           E   M   TL  M +GGI DH+GGGF RYS DERW VPHFEKMLYD   L   Y+ A+ L
Sbjct: 169 EAVSMAETTLTQMYRGGIFDHIGGGFSRYSTDERWLVPHFEKMLYDNALLVMAYVKAYGL 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T    Y      +L Y+  ++  P G  +  +DADS          EG +YV+T +E+ +
Sbjct: 229 TGRKLYGCAAEMVLKYIEAELTDPQGGFYCGQDADSDGV-------EGKYYVFTPEEINE 281

Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
           ILG +    F  +Y +   GN            F+GK++   L + +  +     P  + 
Sbjct: 282 ILGTKQGKAFCRNYGITGPGN------------FEGKSIPNLLGNEAYESICEERPGAEE 329

Query: 387 LNILGECRR-------KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +   + RR       KL+  R KR R H DDK++VSWNG +IS+ A+A  +L       
Sbjct: 330 EDGRSKSRREADEVYEKLYAYRLKRTRLHKDDKILVSWNGWMISACAKAGAVL------- 382

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                      K+Y+++A  A  FIR  L   +  RL   +R+G +   G LDDYA    
Sbjct: 383 ---------GEKKYVDMAVRAEEFIRTALV--RNGRLLVRYRDGEAAGEGKLDDYACYSL 431

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LL+LY     T +L  A    +   E FLDRE GG+F    +   +++R KE +DGA P
Sbjct: 432 ALLELYRVTFRTDYLDRAAGWADKMVEQFLDRERGGFFLNAKDAERLIVRTKETYDGAMP 491

Query: 560 SGNSVSVINLVRLASIVAGSK 580
           SGNS +   L  LA +   +K
Sbjct: 492 SGNSAAARVLQHLAQLTGEAK 512


>gi|149279373|ref|ZP_01885504.1| hypothetical protein PBAL39_13682 [Pedobacter sp. BAL39]
 gi|149229899|gb|EDM35287.1| hypothetical protein PBAL39_13682 [Pedobacter sp. BAL39]
          Length = 674

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 202/581 (34%), Positives = 292/581 (50%), Gaps = 52/581 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE+  VA ++N  +V IKVDREERPD+D++YM  +Q + G GGWPL+
Sbjct: 51  SACHWCHVMERESFENHEVAAVMNQHYVCIKVDREERPDIDQIYMLAIQLMTGSGGWPLN 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
               PD +P+ GGTYF  +D      + +IL  V   W  + D   Q      + +  A 
Sbjct: 111 CICLPDQRPVYGGTYFKKDD------WTSILENVAALWLHEPDKALQYADRLTDGIRNAE 164

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
               +  K P       LR   +   +  D   GG+  APKFP P   Q +L +S    D
Sbjct: 165 KIIPNEKKEPYNYTH--LREITDPWKRELDMTDGGYNRAPKFPMPNNWQFLLRYSLLTGD 222

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                         L +L+ MA GGI+D +GGGF RYSVD RWHVPHFEKMLYD  Q+  
Sbjct: 223 NAT-------HVATLLSLEKMALGGIYDQIGGGFARYSVDGRWHVPHFEKMLYDNAQMIA 275

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +A+  T+   ++ +  + + ++ R+M  P G  ++A DADS   EG     EG FYV
Sbjct: 276 LYAEAYQYTQLPLFNSVVAETIGWMAREMRSPEGLFYAALDADS---EGV----EGKFYV 328

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W  +E E +     +L K +Y +  +GN           E +  N+L+        A++ 
Sbjct: 329 WDEEEFEVVTQGDHLLMKAYYQVTSSGNW----------EEEETNILMRRFADEDFAAQQ 378

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+ LE+    +   R KL + RSKR  P LDDK +++WN + I   A  + +        
Sbjct: 379 GITLEELDLKVSAAREKLLEHRSKRVTPALDDKCLLAWNAMAIKGLASCASVF------- 431

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                     R++Y E+A +AA FI + +  EQ  RL  +F+NG +   GFLDDYAF I 
Sbjct: 432 ---------GRQDYYEMARTAADFILQPM-QEQDGRLYRNFKNGKATISGFLDDYAFFID 481

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            L+ LY++    +WL+ A +   T    F D +   +F T     S++ R  E  D   P
Sbjct: 482 ALIALYQYDFDEQWLLEARKYAETVLGQFADPDSPMFFYTPSGAESLIARKHELMDNVIP 541

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
           + NSV   NL  L  +      D Y + A   LA  + ++K
Sbjct: 542 ASNSVMAQNLHLLGLLF---DDDSYTERASAMLAAIQPQIK 579


>gi|355621830|ref|ZP_09046381.1| hypothetical protein HMPREF1020_00460 [Clostridium sp. 7_3_54FAA]
 gi|354823297|gb|EHF07630.1| hypothetical protein HMPREF1020_00460 [Clostridium sp. 7_3_54FAA]
          Length = 639

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 206/561 (36%), Positives = 288/561 (51%), Gaps = 57/561 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+  +A+LLN  ++ +KVDREERPD+D VYM+  QA+ G GGWPL++ ++PD +
Sbjct: 1   MERESFENREIAQLLNREYICVKVDREERPDIDSVYMSVCQAMNGQGGWPLTIIMTPDGR 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTYFPP  +YGR G   +L      W +KR+ L  S       L E    + SS  
Sbjct: 61  PFFSGTYFPPRARYGRIGLDGLLAAAAKQWKEKREKLLDSADQIEAFLKEQEQLTVSSEP 120

Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
            P E+   A R    Q + S+D + GGFG APKFP P  +  ++       + G   +  
Sbjct: 121 GP-EIVSQAYR----QFAGSFDKQNGGFGGAPKFPAPHNLMFLM-------EYGIREDRP 168

Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
           E   M   TL  M +GGI DH+GGGF RYS DERW VPHFEKMLYD   L   Y+ A+ L
Sbjct: 169 EALSMAETTLTQMYRGGIFDHIGGGFSRYSTDERWLVPHFEKMLYDNALLVMAYVKAYGL 228

Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           T    Y      +L Y+  ++  P G  +  +DADS          EG +YV+T +E+ +
Sbjct: 229 TGRKLYGCAAEMVLKYIEAELTDPQGGFYCGQDADSDGV-------EGKYYVFTPEEINE 281

Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
           ILG +    F  +Y +   GN            F+GK++   L + +  +     P  + 
Sbjct: 282 ILGTKQGKAFCRNYGITGPGN------------FEGKSIPNLLGNEAYESVCEERPGAEE 329

Query: 387 LNILGECRR-------KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +   + RR       KL+  R KR R H DDK++VSWNG +IS+ A+A  +L       
Sbjct: 330 EDGRSKSRREADEVYEKLYAYRLKRTRLHKDDKILVSWNGWMISACAKAGAVL------- 382

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                      K+Y+++A  A  FIR  L   +  RL   +R+G +   G LDDYA    
Sbjct: 383 ---------GEKKYVDMAVRAEEFIRTALV--RNGRLLVRYRDGEAAGEGKLDDYACYSL 431

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LL+LY     T +L  A    +   E FLDRE GG+F    +   +++R KE +DGA P
Sbjct: 432 ALLELYRVTFRTDYLDRAAGWADKMVEQFLDRERGGFFLNAKDAERLIVRTKETYDGAMP 491

Query: 560 SGNSVSVINLVRLASIVAGSK 580
           SGNS +   L  LA +   +K
Sbjct: 492 SGNSAAARVLQHLAQLTGEAK 512


>gi|440749562|ref|ZP_20928808.1| Thymidylate kinase [Mariniradius saccharolyticus AK6]
 gi|436481848|gb|ELP37994.1| Thymidylate kinase [Mariniradius saccharolyticus AK6]
          Length = 674

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 210/612 (34%), Positives = 311/612 (50%), Gaps = 59/612 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE  A L+N  FV IK+DREERPD+D +YM  +QA+   GGWPL+
Sbjct: 47  SACHWCHVMERESFEDEETADLMNAHFVCIKIDREERPDLDNIYMEALQAMGVQGGWPLN 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL P+ KP  GGTYFP +       +K +L  + +A+      L +S       +  + 
Sbjct: 107 VFLMPNQKPFYGGTYFPNKQ------WKNLLGSIANAYKNHHGQLLESAEGFGRSIGRSE 160

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                       L +  + L  ++L+  +D  +GG    PKFP P     +L       D
Sbjct: 161 LEKYGLKAAETGLEKADIELVLDKLTAQFDLEWGGMNRKPKFPMPAVWLFVL-------D 213

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G+  E  + V FTL+ +  GGI+DH+ GG+ RYSVD  W  PHFEKMLYD GQL +
Sbjct: 214 AALLGKDQELLEKVFFTLKKIGMGGIYDHLRGGWARYSVDGEWFAPHFEKMLYDNGQLLD 273

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  A+ ++ D F+     + +D++  +M+   G  F+A+DADS   EG     EG FY 
Sbjct: 274 LYAKAYQVSGDEFFKEKVLETVDWIEAEMLLSEGGFFAAQDADS---EGV----EGKFYT 326

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W  +E+E ILGE    FK+ Y LK  GN +            G N+L +    +  A+++
Sbjct: 327 WKYEELEAILGEDLSWFKKLYNLKYQGNWE-----------DGVNILFQTEPYADLAAEI 375

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+  + Y   L + + KL  VR++R  P LDDKV+  WNGL I+  A+            
Sbjct: 376 GLSEKAYRERLQQIKTKLLTVRNRRIYPGLDDKVLSGWNGLAIAGLAQV----------- 424

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
              F   GS++   + +A+    F+   ++  Q   L  S+++G +  P FL+DYA +I 
Sbjct: 425 ---FLATGSEKA--LSLAKRNGKFLWEKMFKGQV--LYRSYKDGQAYTPAFLEDYAAVIR 477

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           G + LY+    T+WL+ A EL +   E + D   G +F    +   ++   KE  D   P
Sbjct: 478 GYISLYQASFETEWLLKAKELTDLVLEQYYDEGDGFFFFNNPKAEKLIANKKELFDNVIP 537

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC--AADML- 616
           + NSV   NL  L         + Y+  AEH LA     +K + +  P   C  A+ ML 
Sbjct: 538 ASNSVMARNLQDLGLYFY---QEEYQAIAEHMLA----SVKRLILTEPGFLCNWASLMLH 590

Query: 617 SVPSRKHVVLVG 628
           ++  +  V +VG
Sbjct: 591 TLVPKAEVAVVG 602


>gi|336172537|ref|YP_004579675.1| hypothetical protein [Lacinutrix sp. 5H-3-7-4]
 gi|334727109|gb|AEH01247.1| hypothetical protein Lacal_1399 [Lacinutrix sp. 5H-3-7-4]
          Length = 679

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 215/686 (31%), Positives = 333/686 (48%), Gaps = 76/686 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+E VA ++N  F++IK+DREERPD+D+VYM  VQ + G GGWP++V 
Sbjct: 54  CHWCHVMEHESFENEDVAIVMNSNFINIKIDREERPDIDQVYMNAVQLMTGSGGWPMNVV 113

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTYF  E       +   L ++ D + K  D L +       +L++ + A
Sbjct: 114 ALPDGRPVWGGTYFKKEQ------WVNALNQISDLYKKNPDKLYEYAT----KLAKGIKA 163

Query: 142 S--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 N    +     L+      S  +D+  GG G  PKF  P   Q +L        
Sbjct: 164 MDLIKPNTNEPKFDTTFLKEIIADWSVYFDTNKGGIGKEPKFMMPNNYQFLL-------- 215

Query: 200 TGKSGEASEGQKMVLF---TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             + G   + +K++ F   TL  MA GGI+D +GGGF RYSVD++WHVPHFEKMLYD  Q
Sbjct: 216 --RYGYQKQDKKILDFVNTTLTKMAYGGIYDQIGGGFSRYSVDDKWHVPHFEKMLYDNAQ 273

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L ++Y +AF+LTK+  Y  +  + L++++R++ G  G  +S+ DADS   +     +EGA
Sbjct: 274 LVSLYAEAFALTKNELYENVVIETLEFIKRELTGTNGIFYSSLDADSLTEDNVL--EEGA 331

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           +YVW  +E++ +L +   LF  +Y +   G  +       H  +    VLI   +     
Sbjct: 332 YYVWKKEELQTLLKDDFKLFSTYYNVNNYGYWE-------HKNY----VLIRDKNDLKFT 380

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           ++  + LEK        +  L   R KR  P LDDK + SWN L++  +  A ++L+ E 
Sbjct: 381 NQENITLEKLKEKKKRWKSILLKEREKRNLPRLDDKTLTSWNALMLKGYVDAYRVLQDE- 439

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y++ A   A FI  +   E    L H+++NG S   GFL+DYA 
Sbjct: 440 ---------------NYLDCAIKNAEFILNNQLKEDG-SLYHNYKNGASSINGFLEDYAT 483

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
            I   L LY+  S  KWL  A  L +   + F D E   +F T+ +D  ++++  E  D 
Sbjct: 484 TIDAFLALYQVTSTIKWLDNAKALTDYCFDTFFDTESQLFFFTSNQDKKLIVQTIEYRDN 543

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
             P+ NS+    L  L+       ++YY + +++ L   +  +     A           
Sbjct: 544 VIPASNSIMANCLYMLSHFY---NNNYYLKTSKNMLNNIKPEIHQYGSAFSNWMSLMLNF 600

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           + P  + V + G K+++  +          DLNK  +        E +       NN  +
Sbjct: 601 TEPFYE-VAITGDKANIKVK----------DLNKEYLPNKIVACSERN-------NNLPL 642

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTD 702
             N +  +K +  VC N +C  PV +
Sbjct: 643 LHNRYVENKTLIYVCVNNTCKLPVIN 668


>gi|345006662|ref|YP_004809515.1| hypothetical protein [halophilic archaeon DL31]
 gi|344322288|gb|AEN07142.1| hypothetical protein Halar_3548 [halophilic archaeon DL31]
          Length = 727

 Score =  329 bits (844), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 227/708 (32%), Positives = 341/708 (48%), Gaps = 67/708 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED  VA+ +N+ FV +KVDREERPD+D+VY T  Q + GGGGWPLS
Sbjct: 50  SACHWCHVMAEESFEDPAVAETINENFVPVKVDREERPDLDRVYQTVCQLVTGGGGWPLS 109

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGR--PGFKTILRKVKDAW---DKKRDM---LAQSGAFA 131
            +L+P+ KP   GTYFPPE    R  PGF+ + R++ D+W   +++++M     Q  A A
Sbjct: 110 AWLTPEGKPFYIGTYFPPEPHPQRNAPGFQDLCRQIADSWSDPEQRQEMENRAEQWTAAA 169

Query: 132 IEQLSEALSASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGS-APKFPRPVEIQ 188
            ++L  A +   + ++   E   +   L   A  + +  D   GGFGS  PKFP P  ++
Sbjct: 170 RDRLEPASTGRNTESETATETLSSTELLDDAAAAVVRGADRTNGGFGSGGPKFPHPGRVE 229

Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
           ++L    ++   G  GE      +    L  M  GG++DH+GGGFHRY VD  W VPHFE
Sbjct: 230 LLL----RVAALGDDGEP---LSVARNALNAMGSGGLYDHLGGGFHRYCVDAEWTVPHFE 282

Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
           KM YD G +   +L  +        + + R+ L+++ R++  P G  +S  DA S ET  
Sbjct: 283 KMAYDNGTIPAAFLAGYRAMGRERDAEVVRETLEFVSRELRHPDGGFYSTLDARS-ETPA 341

Query: 309 A-------TRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGN----CDLSRMSDP 356
           +         ++EGAFYVWT  E+  ++ E  A LF   Y +   GN      +   + P
Sbjct: 342 SRLEDDEEPEREEGAFYVWTPAEIRAVVDEPAATLFCRRYGVISGGNFEGGTSVLNETVP 401

Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
             E  G     E ++ +A  S+     E    +L    ++LF+ R +RPRP  D+KV+  
Sbjct: 402 IAELVGA----EFDEGTAPDSE-----EAVEELLQTATQELFEARGERPRPLRDEKVLAG 452

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNGL+IS+FA A  +L                   +Y E A++A SF+R HL+D    RL
Sbjct: 453 WNGLLISTFAEAGLVLDD-----------------QYTEDAQAALSFVREHLWDADARRL 495

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              F++G     G+L+DYAFL  G  + Y+     + L +A+EL     + F D + G  
Sbjct: 496 SRRFKDGDVAVSGYLEDYAFLGRGAFETYQATGNVEPLSFALELAEVIADAFYDADDGTL 555

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           + T  +   ++ R +E  D + PS    +V  L+ L S          R     +LA   
Sbjct: 556 YFTANDAEELVARPQELTDQSTPSSVGAAVSLLLELDSFTDRDLGAVARD----TLATHR 611

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
            R++   +    +  AAD       +  V  G       E       + Y     +    
Sbjct: 612 DRIEASPVEHVSLVLAADAADRGPLELTVAAGELP----EEWRETLRSRYLPGAVLARRP 667

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSA--DKVVALVCQNFSCSPPVTD 702
           P      ++ +E     A     N  A   +     C++F+CSPP TD
Sbjct: 668 PTKAGLKEWLDELGLEEAPPIWANREAREGEPTVYACRSFTCSPPETD 715


>gi|418471574|ref|ZP_13041379.1| hypothetical protein SMCF_4347 [Streptomyces coelicoflavus ZG0656]
 gi|371547815|gb|EHN76170.1| hypothetical protein SMCF_4347 [Streptomyces coelicoflavus ZG0656]
          Length = 680

 Score =  329 bits (844), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 240/709 (33%), Positives = 341/709 (48%), Gaps = 88/709 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED   A+ LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SACHWCHVMAHESFEDGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPPE ++G P F+ +L+ V+ AW ++RD +++     +  L+   
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGMPSFRQVLQGVQQAWAERRDEVSEVAGKIVRDLAGRE 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +S   +     ++L Q  L      L++ YD++ GGFG APKFP  + I+ +L H  +  
Sbjct: 168 ISYGDAEAPGEEQLGQALL-----GLTREYDAQRGGFGGAPKFPPSMAIEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GG++D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGAEG----ALQMAADTCERMARGGLYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS   +G  +  EGA+Y
Sbjct: 276 RVYAHLWRATGSDLARRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAYY 333

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT  ++ ++LG E A L  +++ +   G  +       H                  AS
Sbjct: 334 VWTPAQLTEVLGAEDAELAAQYFGVTEEGTFE-------HG-----------------AS 369

Query: 378 KLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            L +P ++ +     +   R +L   R  RP P  DDKV+ +WNGL I++ A        
Sbjct: 370 VLQLPQQEGVFDAARIASVRERLLAARDGRPAPGRDDKVVAAWNGLAIAALAET------ 423

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDD 493
               A F  P              +A   +R HL DEQ  R+  + ++G P    G L+D
Sbjct: 424 ---GAYFERP------DLVEAAVAAADLLVRLHL-DEQV-RITRTSKDGRPGANAGVLED 472

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA    G L L        WL +A  L +     F D  G G    T  D   L+R  +D
Sbjct: 473 YADAAEGFLALASVTGEGVWLDFAGFLLDHVLTRFTD--GSGSLYDTAADAEQLIRRPQD 530

Query: 554 -HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---- 608
             D A PSG S +   L+  A   A + S+ +R  AEH+L V    +K +   VP     
Sbjct: 531 PTDNATPSGWSAAAGALLTYA---AHTGSEPHRTAAEHALGV----VKALGPRVPRFIGW 583

Query: 609 -MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
            +  A  +L  P  + V +VG            A  A+  L++T + +  A    + F  
Sbjct: 584 GLAAAEALLDGP--REVAVVGPAP---------ADPAARGLHRTAL-LGTAPGAVVAFGT 631

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
           E +     +A          A VC+NF+C  P TDP  L   L   P+ 
Sbjct: 632 EGSDEFPLLADRPLVGGAAAAYVCRNFTCDAPTTDPERLRAALGAAPTG 680


>gi|455649958|gb|EMF28748.1| hypothetical protein H114_12956 [Streptomyces gancidicus BKS 13-15]
          Length = 679

 Score =  329 bits (844), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 232/700 (33%), Positives = 336/700 (48%), Gaps = 81/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A  +N  FVSIKVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDQATADEMNAHFVSIKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEA 138
           VFL+PD +P   GTYFPP  ++G P F+ +L  V  AW ++RD + + +G    +     
Sbjct: 108 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVAQAWAERRDEVGEVAGKITRDLAGRE 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           LS          EL Q  L      L++ YD++ GGFG APKFP  + ++ +L H  +  
Sbjct: 168 LSVGGDEVPGEQELAQALL-----GLTREYDAQRGGFGGAPKFPPSMVLEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGAEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++  P G   SA DADS   +G  R  EGA+Y
Sbjct: 276 RVYTHLWRTTGSELARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYY 333

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
           VWT  ++ ++LG+        Y+           +++     +G +VL +   D  A A+
Sbjct: 334 VWTPAQLREVLGDADAEPAARYF----------GVTEEGTFEEGASVLQLPQRDEVADAA 383

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +           +   R +L   R +RP P  DDKV+ +WNGL I++ A           
Sbjct: 384 R-----------IDGIRERLLAARDRRPAPGRDDKVVAAWNGLAIAALAETGACFG---- 428

Query: 438 SAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
                       R + +E A +A    +R HL D    R+  + ++G   A  G L+DYA
Sbjct: 429 ------------RPDLVEAAVAAGDLLVRVHLDDHA--RIARTSKDGQVGANAGVLEDYA 474

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +     FLD E G  ++T  +   ++ R ++  D
Sbjct: 475 DVAEGFLALASVTGEGVWLDFAGLLVDHILARFLDAESGALYDTASDAERLIRRPQDPTD 534

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A PSG + +      L    A + S+ +R  AE +L V    +K +   VP      + 
Sbjct: 535 NAAPSGWTAAAGA---LLGYAAHTGSEPHRTAAERALGV----VKALGPRVPRFIGWGLA 587

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A  +L  P  + V +VG           A   A+ +L++T + +  A    +    E +
Sbjct: 588 VAEAVLDGP--REVAVVGRG---------ADDPATAELHRTAL-LGTAPGAVVAVGTEGS 635

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                +A          A VC+NF+C  P TDP  L   L
Sbjct: 636 DEFPLLADRPLVDGAPAAYVCRNFTCDAPTTDPDRLRTAL 675


>gi|322435300|ref|YP_004217512.1| hypothetical protein AciX9_1682 [Granulicella tundricola MP5ACTX9]
 gi|321163027|gb|ADW68732.1| hypothetical protein AciX9_1682 [Granulicella tundricola MP5ACTX9]
          Length = 702

 Score =  329 bits (843), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 224/702 (31%), Positives = 345/702 (49%), Gaps = 60/702 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM+ ES+E+   A+L+N+ F++IKVDR+ERPDVD  Y   V A+ G GGWPL+ F
Sbjct: 48  CHWCHVMDRESYENAETARLINEHFIAIKVDRDERPDVDARYQAAVAAISGQGGWPLTAF 107

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEAL 139
           L+P  +P  GGTYFPP D++GRPG + +L  + +A+  KR+ +  +    I  +  +E+ 
Sbjct: 108 LTPQGQPYFGGTYFPPLDQHGRPGLRRVLMTMAEAFQNKREEVMDTAGSVIAAIEHNESF 167

Query: 140 SASASS--NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
             SAS+   +L D+L  +AL        + +D R GGFGS PKFP    + +++  + ++
Sbjct: 168 DGSASNPGTELVDKLIASAL--------QQFDRRNGGFGSQPKFPNSGALDLLIDAASRV 219

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
               + G A+  +    FTL+ M+KGGI+DH+ GGFHRYSVDERW VPHFEKM YD  +L
Sbjct: 220 --GSQDGIAAAARATAAFTLEKMSKGGIYDHLAGGFHRYSVDERWVVPHFEKMSYDNSEL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG-GEIFSAEDADSAETEGATRKKEGA 316
              Y+ A+    +   + I R+I+ ++   M     G  ++++DAD      A    +G 
Sbjct: 278 LKNYVHAYQTFVEPECARIAREIIRWVEEVMSDRELGGFYASQDAD------ANLDDDGD 331

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           ++ WT  E    L +  +     +Y       D+  + D H+  + KN L         A
Sbjct: 332 YFTWTLAEARAALTKKELAVTAPFY-------DIGELGDMHHNPQ-KNTLHVDQPLETVA 383

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G+ L++   +L     KL+  R  RP P++D  +  +WN ++IS+   A+++L   A
Sbjct: 384 KAAGVSLDQASALLQTSLPKLYAARKTRPTPYIDKTLYTAWNAMMISAHLEAARVL---A 440

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
           + A   F +   DR   +  A    S      Y E +              PG LDDYAF
Sbjct: 441 DPATRLFALKTLDR--VLSTAWHEGSLDHVIAYGESSEPTD--------PIPGILDDYAF 490

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL------LRV 550
                LD +E      +   A+ L +     F D E GG+F+T    P  L       R 
Sbjct: 491 TGHAALDAWEATGHISYFNSALALADAAITKFYDEEKGGFFDTETPAPGELRLGALSTRR 550

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           K   D   P+GN V+      L  + A +  + ++Q A+ +L  F   ++   +      
Sbjct: 551 KPLQDSPTPAGNPVAAAL---LLRLEALTGREDFKQMAKATLECFAAVVEHFGLYAATFG 607

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A   L +P  + VV+VG  S  D   +  AA   Y +NKTV+ + P+    +       
Sbjct: 608 LALQRLLLPPIQ-VVIVGEDSVAD--RLERAALGRYAVNKTVVRLTPSQLTTLP------ 658

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
            + A    +  +     A VC  F+C PPV  P +L  +LLE
Sbjct: 659 PSLAQTLPHFLTTLGSYAAVCTGFTCRPPVNTPEALAEILLE 700


>gi|441179453|ref|ZP_20970097.1| hypothetical protein SRIM_39324 [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
 gi|440614431|gb|ELQ77705.1| hypothetical protein SRIM_39324 [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
          Length = 641

 Score =  328 bits (842), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 228/712 (32%), Positives = 343/712 (48%), Gaps = 103/712 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE VA ++N+ FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 10  SACHWCHVMAHESFEDEAVAAVINEHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMT 69

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---- 135
           VFL+PD +P   GTYFPP  ++G P F  IL+ V+ AW ++RD + +     +  L    
Sbjct: 70  VFLTPDAEPFYFGTYFPPAPRHGMPSFPQILQGVRGAWAERRDEVGEVAGRIVADLSARS 129

Query: 136 -SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
            SE L+        P++L    L      L++ +D+  GGFG APKFP  + ++ +L H 
Sbjct: 130 VSETLAKGGQVPPGPEDLASALL-----ALTRDFDAVHGGFGGAPKFPPSMALEFLLRHH 184

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            +        E+    +MV  T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD 
Sbjct: 185 ART-------ESEAALQMVQATAEAMARGGIYDQLGGGFARYAVDATWTVPHFEKMLYDN 237

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
             L   Y   + +T       +  +  D++ R++    G   SA DADS   +G+ +  E
Sbjct: 238 ALLCRTYAHLWRVTGSDLARRVAVETADFMVRELRTEEGGFASALDADS--DDGSGKHVE 295

Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           GA+YVWT +++  +LGE       HY+    G  +          F+    +++L D+  
Sbjct: 296 GAYYVWTPEQLRAVLGEKDAAVAAHYF----GVTE-------EGTFEEGASVLQLPDTDD 344

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
                    E+  +I    + +L   R  RPRP  DDKV+ +WNGL I++ A        
Sbjct: 345 LVDA-----ERIASI----KERLRAARDSRPRPGRDDKVVAAWNGLAIAALAETGAYF-- 393

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDD 493
                         DR + ++ A  AA  + R   D Q  RL  + R+G + A  G L+D
Sbjct: 394 --------------DRPDLVQAATDAADLLVRVHMDWQA-RLHRTSRDGVAGANSGVLED 438

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-------EGGGYFNTTGEDPSV 546
           YA +  G L L        W+ +A         LFLD        E G  ++T  +   +
Sbjct: 439 YADVAEGFLALASVTGEGVWVDFA--------GLFLDTVIVHFTAEDGTLYDTADDAEQL 490

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           + R ++  D A PSG + +   L+  A++   + S  +R+ AE +L V    +K ++   
Sbjct: 491 IRRPQDPTDNATPSGWTAAAGALLSYAAL---TGSGPHREAAERALGV----VKALSGRA 543

Query: 607 PL-----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPA 658
           P      +  A   L  P  + V +VG     D +    A H +  L      V+ +   
Sbjct: 544 PRFIGWGLAVAEAALDGP--REVAVVGP----DGDPATRALHRAALLGTAPGAVVALGAP 597

Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            ++E+   ++    +   A          A VC++F+C  P TDP  L   L
Sbjct: 598 GSDEVPLLKDRPLVDGRPA----------AYVCRHFTCERPTTDPEELGEKL 639


>gi|320107222|ref|YP_004182812.1| N-acylglucosamine 2-epimerase [Terriglobus saanensis SP1PR4]
 gi|319925743|gb|ADV82818.1| N-acylglucosamine 2-epimerase [Terriglobus saanensis SP1PR4]
          Length = 714

 Score =  328 bits (841), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 227/695 (32%), Positives = 345/695 (49%), Gaps = 66/695 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM+ ES+E+   A L+N +F++IKVDR+ERPDVD  Y   V A+ G GGWPL+ F
Sbjct: 62  CHWCHVMDRESYENADTADLINRYFIAIKVDRDERPDVDTRYQAAVSAISGQGGWPLTAF 121

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+P+ KP  GGTYFPPED++GRP F+ +L+ + DA+  +R  +  S    ++ +    S 
Sbjct: 122 LTPEGKPFFGGTYFPPEDRFGRPSFQRVLQTMADAFQDRRSEVEDSADSVMQAIEFNESF 181

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
           S  S+ L  +L    +   AE + K +D ++GGFGS PKFP P  + +       L D  
Sbjct: 182 SGRSSDLGPDL----VNKLAESMLKQFDPQYGGFGSQPKFPHPGALDL-------LTDIA 230

Query: 202 KSGE--ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             G   A +   +V  TL  MA GG+ D +GGGFHRYSVDERW VPHFEKM YD  +L  
Sbjct: 231 SRGGPLAEQASNVVRVTLDKMALGGMRDQIGGGFHRYSVDERWVVPHFEKMAYDNAELLK 290

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFY 318
            Y+ AF       Y+ + R+IL ++   +     G  +S++DAD       T   +G ++
Sbjct: 291 SYVRAFRTFLVPEYAEVAREILRWMDGTLSDRERGGFYSSQDAD------LTLDDDGDYF 344

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
            WT  E   +L    +   E YY       D+  + D H++   +NVL      +  + +
Sbjct: 345 TWTRDEAAAVLSPEELAVAEIYY-------DIGEIGDMHHD-PSRNVLHVRYTLAEVSRR 396

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           +G+  E+  ++L   R KL   RS+R  P +D  +   WNGL I+++  A + L ++ E+
Sbjct: 397 IGITEEEVQSLLLSLRGKLASARSERAAPFVDRTMYTGWNGLCIAAYLEAGRALHNQ-ET 455

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT---HRLQHSFRNGPSKA-PGFLDDY 494
             F    +  DR             + +  ++E+T   H + ++  + P++A  G L+DY
Sbjct: 456 VQFGLRSL--DR-------------LLQEAWNEETGLGHVISYADGHVPAQAVAGVLEDY 500

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL----LRV 550
           AF     +  +E    ++WL  A  L       F D  GGG+F+T       L     R 
Sbjct: 501 AFAGLACVAAWEVTGESRWLRHAEALAARMIRDFADAVGGGFFDTARGSGVALGALSARR 560

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           K   D   P+GNS + + L++LA      K    +  A  +L  F   ++   +      
Sbjct: 561 KPLQDSPTPAGNSAAALFLLQLADWTMDEK---LQAKAADTLETFAGIVEHFGLYAATFG 617

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM--DFWEE 668
            A   L +P  + VV+    SS   E   AAA A Y   K+V+ +  +  E++     E 
Sbjct: 618 LALQRLLLPEIQIVVVGEDDSSAVLE---AAALAGYSATKSVLRLKRSQLEDLRGPMAET 674

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
                A M  N+F      A+VC +  C PP +DP
Sbjct: 675 LPHLPAEMFENSF------AMVCGDGRCQPPTSDP 703


>gi|386360498|ref|YP_006058743.1| thioredoxin domain-containing protein [Thermus thermophilus JL-18]
 gi|383509525|gb|AFH38957.1| thioredoxin domain-containing protein [Thermus thermophilus JL-18]
          Length = 639

 Score =  328 bits (841), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 219/592 (36%), Positives = 303/592 (51%), Gaps = 83/592 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESF+DE VA+LLN  FV +KVDREERPDVD  YM  + +L G GGWP+S
Sbjct: 47  HTCHWCHVMHRESFQDEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMS 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+P+ KP  GGTYFP ED+ G PGFK +L  V +AW  KR+ + +      E+L+ AL
Sbjct: 107 LFLTPEGKPFFGGTYFPKEDRMGLPGFKRVLVAVAEAWTGKREAVLEEA----ERLTRAL 162

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             S +    P  LP+ A     + L +++D  +GGF  APKFP+   +  +L  + + E+
Sbjct: 163 WKSLTPP--PGPLPEGAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                      +++  TL+ MA GG++D VGGGFHRYSVD  W +PHFEKMLYD   LA 
Sbjct: 221 --------RAARLLRPTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLAR 272

Query: 260 VYLDAFSLTKDVFYSYICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           VYL A+ L  +  +  + R+ LD+L    RR+     G   +A D   AE+EG    +EG
Sbjct: 273 VYLGAYKLFGEDLFLRVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEG 320

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            +Y WT  E+ + LGE   L + ++ L      DL            ++VL    ++   
Sbjct: 321 RYYTWTEAELREALGEDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEVR 366

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
            + LG   E +       R KL   R +R  P LDDKV+  W+ L + + A A ++   E
Sbjct: 367 EA-LG---EGFFAWREGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEE 422

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
           A                Y+E A+  A F+  H+Y  +   L+H++R G      +L D A
Sbjct: 423 A----------------YLEAAKRGARFLLAHMY--RGGLLRHTWR-GSLGEEAYLSDQA 463

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           F     L+LY       +L WA         LF  REG          PS+ L  KE  +
Sbjct: 464 FAALAFLELYAATGEWPYLDWAQRFAEAGWRLF--REG----------PSLPLPAKEVEE 511

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
           GA PSG S     LVRL ++  G     YR+ AE  LA     L     A+P
Sbjct: 512 GALPSGESALAEALVRLGAVFGGD----YRERAEEVLAEKARWLARYPHALP 559


>gi|390957418|ref|YP_006421175.1| thioredoxin domain-containing protein [Terriglobus roseus DSM
           18391]
 gi|390412336|gb|AFL87840.1| thioredoxin domain protein [Terriglobus roseus DSM 18391]
          Length = 710

 Score =  328 bits (841), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 219/700 (31%), Positives = 338/700 (48%), Gaps = 68/700 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM+ ES+E+   A L+N++FV++KVDR+ERPDVD  Y   V A+ G GGWPL+ F
Sbjct: 54  CHWCHVMDRESYENAETAALINEYFVAVKVDRDERPDVDTRYQAAVAAISGQGGWPLTAF 113

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL------ 135
           L+PD +P  GGTYFPPE++YGRP F+ +L  +  ++  K   + +S +  +E +      
Sbjct: 114 LTPDGRPYFGGTYFPPEERYGRPSFRRVLMTMAGSFYDKHHEVEESASSVMEAIEYSETF 173

Query: 136 ---SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
              +  L AS +S  L D+L   AL        K +D   GGFGS PKFP P  ++M+L 
Sbjct: 174 TGDATDLDASGASLALLDKLIDGAL--------KQFDPIHGGFGSQPKFPHPAALEMLLD 225

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
            + +         A +  +  L +L+ MA+GGI D + GGFHRYSVDERW VPHFEKM Y
Sbjct: 226 AASR-----PGPNAPQCAEAALVSLKKMARGGIFDQLAGGFHRYSVDERWVVPHFEKMAY 280

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATR 311
           D  +L   Y+ AF    D   +   R  + ++   +     G  + ++DAD       + 
Sbjct: 281 DNSELLRAYVHAFQTFVDPECADAARATMQWMDEWLSDRERGGFYGSQDAD------LSL 334

Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
             +G ++ W+  E   +L E      E YY       D+  + D H++   +NVL     
Sbjct: 335 DDDGGYFTWSRDEAAAVLTEDEAKLAELYY-------DIGAVGDMHHD-PARNVLFRPMT 386

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
              +A + G+  E    +L   R KL   R +RP P +D  +   WN + IS++ RA ++
Sbjct: 387 LEQAAQQAGVDAEIAPMMLKVMRSKLLAARLQRPTPFVDKTIYTGWNAMCISAYVRAGRV 446

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGF 490
           L+     A   F     DR   ++VA    +           H + +S    P +   G 
Sbjct: 447 LQVPGAVA---FACKSLDR--VLDVALVEGTL---------KHVVAYSDPAAPHTDVAGV 492

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL--- 547
           LDDY FL    LD++E      +   A  L  T    F D +GGG+F+   +    +   
Sbjct: 493 LDDYVFLGHACLDVWEATGEIVYFEAARVLATTLLRKFYDGKGGGFFDMASDSTETIGAL 552

Query: 548 -LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
             R K   D   P+GN      L+RL ++   +  + YR+ A+ +L  F   ++ + +  
Sbjct: 553 STRRKPVQDAPTPAGNPAGAALLLRLHAL---TGDETYRETAQETLETFAVIVEHLGLYG 609

Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
           P    A   L+ P+ + V++ G   +   E +   A A + +NK+V+ I  A    +   
Sbjct: 610 PTFGLALGRLARPAVQVVIVGGGAKAAQLEMV---ALARFAVNKSVVRIARAQLGAL--- 663

Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
                  A    +   +D+ +ALVC   +C PP+ D   L
Sbjct: 664 ---PPALAETLPHLPDSDEAIALVCSGMTCQPPIRDAAEL 700


>gi|452943278|ref|YP_007499443.1| thymidylate kinase [Hydrogenobaculum sp. HO]
 gi|452881696|gb|AGG14400.1| thymidylate kinase [Hydrogenobaculum sp. HO]
          Length = 634

 Score =  328 bits (840), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 214/612 (34%), Positives = 306/612 (50%), Gaps = 85/612 (13%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
              +F    K  +  FL    ++CHWCHVME ESFEDE VA  LN +FVSIKVD+EERPD
Sbjct: 29  SEEAFDKAIKENKPVFLSIGYSSCHWCHVMEKESFEDEEVASFLNKYFVSIKVDKEERPD 88

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +YM Y   L   GGWPLS FL+P  +P   GTYFP      +  F  +L+++KD WD
Sbjct: 89  IDSLYMEYCVLLNNSGGWPLSAFLTPTKEPFFAGTYFP------KASFLKLLQQIKDLWD 142

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K    + +     +EQL + +++         EL ++ +      L+  YD  FGGF  A
Sbjct: 143 KDSKNIIEKSKRLVEQLKQFMNSFEKR-----ELNESFIDKALFGLANRYDEEFGGFSEA 197

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP    + ++L   K+             Q M L TL  M +GGI DHVGGGFHRYS 
Sbjct: 198 PKFPSLHNVLLLLKSQKQ-----------PFQDMALSTLLNMRRGGIWDHVGGGFHRYST 246

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W +PHFEKMLYDQ      Y +A+ LTK+  +       +++++ ++    G  +++
Sbjct: 247 DRYWLLPHFEKMLYDQAMAILAYSEAYRLTKNEIFKDTVYKTINFVKENLY-ENGFFYTS 305

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
            DAD   TEG    +EG FY+WT +E++DIL E A  F E + +K  GN     + +   
Sbjct: 306 MDAD---TEG----EEGGFYLWTYQEIKDILKEKADKFIEFFNIKKEGNF----LDEAKR 354

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
            + GKNVL         A +  +  E+ L IL          R KR +P +DDK+++  N
Sbjct: 355 VYTGKNVLY--------AKEPSLAFEEELKILKA-------FREKRKKPLIDDKILLDQN 399

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
            ++  +   A  +                 D K+++++A        ++L +   H LQH
Sbjct: 400 AMMDFALIEAYLVF----------------DDKDFLDMA-------TKNLNNISKHPLQH 436

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
           +  +     P  LDDYA+LI   L LY+       L  AI L     E   D+  GG++ 
Sbjct: 437 ALNHNKLIEP-MLDDYAYLIKAYLSLYKATFSKDALEKAISLTEETIEKLWDKNAGGFYL 495

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
           + G+D  VL+  K  +DGA PSGNSV  +NLV L  I   +K D Y    E+   +  + 
Sbjct: 496 SVGKD--VLIPQKTLYDGAIPSGNSVMGLNLVELFFI---TKEDTY----ENRYQILSSI 546

Query: 599 LKDMAMAVPLMC 610
             DM    P  C
Sbjct: 547 YSDMLSRNPTAC 558


>gi|407781159|ref|ZP_11128379.1| hypothetical protein P24_03046 [Oceanibaculum indicum P24]
 gi|407208585|gb|EKE78503.1| hypothetical protein P24_03046 [Oceanibaculum indicum P24]
          Length = 680

 Score =  328 bits (840), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 224/705 (31%), Positives = 335/705 (47%), Gaps = 86/705 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED+  A L+N  FV++KVDREERPD+D +Y + +  L   GGWPL++
Sbjct: 50  ACHWCHVMAHESFEDDETAALMNRLFVNVKVDREERPDIDHIYQSALAILGEQGGWPLTM 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD  P  GGTYFP E +YGRPGFK +L+ + DA  +  D ++++ +   + L +   
Sbjct: 110 FLTPDGDPFWGGTYFPKEARYGRPGFKAVLQAIADAHAEGSDKVSRNASALRQALRQLAE 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            +A  N  P  L +      AE+L +  D   GG G APKFP+P  + ++  H       
Sbjct: 170 PAAGENIEPALLDR-----IAERLHREIDPIHGGIGGAPKFPQPGMLMLLWRHWL----- 219

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            +SG   + +  VL TL+ M +GGI+DH+GGGF RYS D +W  PHFEKMLYD  QL  +
Sbjct: 220 -RSGN-QDSRDYVLLTLERMCQGGIYDHLGGGFARYSTDAQWLAPHFEKMLYDNAQLIEM 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
              A   T    +     + + ++ R+MI   G   S+ DADS   EG    +EG FYVW
Sbjct: 278 LTHAALETGRPLFRQRLEETIGWVLREMITDEGGFASSLDADS---EG----EEGKFYVW 330

Query: 321 TSKEVEDIL----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
              E++ +L    GE    FK  Y + P GN +   +         +N   +L + +A +
Sbjct: 331 REAEIDQLLAHLPGEALESFKRAYDVTPEGNWEGVTILH-------RNRRPDLGNGAAES 383

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
                        L + R+ LF+ R +R RP  DDKV+  WNGL+I + A+AS       
Sbjct: 384 Q------------LAQVRQLLFEHREQRERPGWDDKVLADWNGLMIRALAQAS------- 424

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
               F F        +++  A  A  ++   +  +   RL+HS R    + P  L+DYA 
Sbjct: 425 ----FAFA-----HADWLRAAIRAFDYVVEKMTLDG--RLRHSRRGDILRHPATLEDYAN 473

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           + S  L L++     ++L  AI   +  D  + D EGGGYF T  +   V+LR K   D 
Sbjct: 474 MASAALALFQITRHQRFLGQAIAWVDVLDRHYWDHEGGGYFTTADDTNDVVLRAKNAQDN 533

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A P+GN   +  L  L  +   +  D YR  A+  +  F   +      +       D+ 
Sbjct: 534 AVPAGNGTMLQVLTTLYHL---TGDDSYRGKADLLIPRFAGEIGRNFFPLATFLNGCDIA 590

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
             P +  + L G  ++  +  +L A                AD               ++
Sbjct: 591 QRPLQ--ITLTGDPTTPTYVGLLRAI---------------ADVSAPGLILHQLGQKGAL 633

Query: 677 ARNNFSADKV------VALVCQNFSCSPPVTDPISLENLLLEKPS 715
             N+ ++  +       A +C    CS P+ +P +L   LL   S
Sbjct: 634 PSNHPASTALEGTLQSAAYLCVGQRCSLPLREPKALSEALLAARS 678


>gi|94969411|ref|YP_591459.1| hypothetical protein Acid345_2384 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94551461|gb|ABF41385.1| protein of unknown function DUF255 [Candidatus Koribacter
           versatilis Ellin345]
          Length = 705

 Score =  328 bits (840), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 216/699 (30%), Positives = 340/699 (48%), Gaps = 62/699 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM+ ES++D  VA +LN  F++IKVDR+ERPDVD  Y T V A+ G GGWPL+ F
Sbjct: 53  CHWCHVMDRESYDDPEVADILNREFIAIKVDRDERPDVDSRYQTAVAAITGQGGWPLTAF 112

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+ + KP  GGTYFPP D +GRPGFK IL  + DA+  +RD + +     +  L  A   
Sbjct: 113 LTTEGKPFYGGTYFPPRDAHGRPGFKKILLAIADAYKNRRDDVLREADGMMTALHHAEGL 172

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDT 200
           +        +     + +  +    S+D + GGFGSAPKFP    ++++L ++++    T
Sbjct: 173 AGHGG----DFNPRVITMMVQSALNSFDPKNGGFGSAPKFPHASIVEVLLDWYAR----T 224

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+ G A+  +     TL+ MA+GG++D + GGFHRYSVDE W VPHFEKM YD  +L   
Sbjct: 225 GEDGAANVART----TLEKMAQGGVYDQIAGGFHRYSVDENWIVPHFEKMSYDNSELLRN 280

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYV 319
           Y+ A  L  D  ++   +DI+ ++   +     G  ++++DAD         + +G ++ 
Sbjct: 281 YVHAAQLFPDAAFAETAKDIIRWVDSTLTDREHGGFYASQDAD------INLEDDGDYFT 334

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT  E +  L          +Y       D++ + + H+    KNVL    +    A +L
Sbjct: 335 WTVDEAKAALTAQEFEVAALHY-------DINEVGEMHHN-SAKNVLWIRAEVEEIAMRL 386

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +  ++   +L   ++K+   R +RP P++D  V V+WN + +S++  A ++L  +    
Sbjct: 387 SLKPDQIRMLLNSAKQKMLVARLQRPTPYIDKTVYVNWNAMFVSAYLAAGRVLGMKDAH- 445

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSK-APGFLDDYAF 496
             +F +   DR             I     D+Q   H + +S  N   + + G LDDY F
Sbjct: 446 --HFALRTLDR-------------ILGQWNDKQQLPHVIAYSDPNAVLRESRGLLDDYVF 490

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-----RVK 551
                LD YE      +   A ++ +T    F D   GG+F+       V L     R K
Sbjct: 491 TALACLDAYEATGDLTYFRCAQQIADTAIAKFGDATSGGFFDAEPTTEQVALGALSVRRK 550

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
              D   P+GN  + I ++RL +    ++   YR  AE +L  F   ++   +       
Sbjct: 551 AFQDSPTPAGNPAAAILMLRLHAYTNDTR---YRDKAEDTLETFAGAVEQFGIYAGTYGR 607

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           AA   S P  + V++    S+ D E    AA  ++  N +VI +  AD   +        
Sbjct: 608 AAIWFSKPHTQVVIIGTDASAADLER---AAFQTFAENLSVIRLAQADAHLLPPALAETI 664

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            N     +     + VA+VC NF+C PP+T    L + L
Sbjct: 665 PNVPGVNDG----RAVAVVCSNFACQPPITSAQDLTDTL 699


>gi|389645929|ref|XP_003720596.1| spermatogenesis-associated protein 20 [Magnaporthe oryzae 70-15]
 gi|351637988|gb|EHA45853.1| spermatogenesis-associated protein 20 [Magnaporthe oryzae 70-15]
          Length = 865

 Score =  327 bits (839), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 224/669 (33%), Positives = 336/669 (50%), Gaps = 133/669 (19%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CH+C +   ESF ++ VA LLN  F+ I VDREERPD+D +YM Y+QA+   GGWPL+VF
Sbjct: 96  CHYCRLTTQESFRNKNVAALLNSSFIPILVDREERPDIDSIYMNYIQAVNSAGGWPLNVF 155

Query: 82  LSPDLKPLMGGTYFPP---------EDKYGRPGFKTILRKVKDAWDKK--------RDML 124
           L+P+L+P+ GGTY+P          ED      F  IL+K++  W ++        +D++
Sbjct: 156 LTPELEPVFGGTYWPGPGRSTSSAVEDGEEPLDFLGILKKLQKVWTEQEAKCRKEAQDIV 215

Query: 125 AQSGAFA-----------------------------------IEQLSEALSASASSNKLP 149
            Q   FA                                    E   + ++ASAS+  L 
Sbjct: 216 LQLREFAAEGTMGVGNTEKVPSVATTGATVNISTGVAAPTTSTETPKKTVTASASATDLD 275

Query: 150 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLED-TGKSGE 205
            +L Q  L      +S+S+D   GGF  +PKFP P ++  +L   +   ++ D  G   E
Sbjct: 276 VDLDQ--LEEAYANISRSFDRVNGGFNLSPKFPTPPKLSFLLRLAHLPPEVGDIVGGPEE 333

Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
            +    M L TL+ +  GG+ DH+G GFHRYSV   W VPHFEKM+ D   L  VYLDA+
Sbjct: 334 IARATHMALATLRALRDGGLRDHIGAGFHRYSVTADWSVPHFEKMIADNALLLGVYLDAW 393

Query: 266 ---------SLTKDVFYSYICRDILDYLRRDMIGPGGEIFS-----------AEDADSAE 305
                    + T +  ++ +  ++ DYL      PG E  S           +E +DS +
Sbjct: 394 LGQAAKEGRAPTLEDEFADVVLELGDYLG----NPGSEFGSSSTCQDSLLPTSEASDSYQ 449

Query: 306 TEGATRKKEGAFYVWTSKEVEDIL----------GEH-----AILFKEHYYLKPTGNCDL 350
            +     +EGAFY+WT +E +  +          G+H     A +   ++ +K  GN  +
Sbjct: 450 RKSDKHMREGAFYLWTRREFDATVSNTEDGDLTNGKHDGDFYARVAAAYWNVKEHGN--I 507

Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHL 409
               DPH+EF  +NVL  +   +  ++  G+ +++   IL E RRKL   R S R RP +
Sbjct: 508 PEEQDPHDEFINQNVLRVVKTPAELSTSFGIAVDEVNQILAEARRKLRARRDSDRVRPEV 567

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR---KEYMEVAESAASFIRR 466
           D+K +V++N + +S+ ARA  +L S            G D+     +M  A+ AA  ++ 
Sbjct: 568 DEKQVVAYNAMAMSALARAGVVLWS-----------TGLDKHRGSAWMMCAKQAAIEMKG 616

Query: 467 HLYDEQTHRL-QHSFRNGPSKAPGFLDDYAFLISGLLDLYE-FGSGTKWLVWAIELQNTQ 524
            LYD++T +L +H FRN  S      +DYAFLI  LLDLY+  G  + +L WA +LQ+ Q
Sbjct: 617 RLYDQETGKLSRHWFRNKKSSTDALAEDYAFLIEALLDLYDATGDESAYLDWAKQLQDKQ 676

Query: 525 DELFLDREG-----------------GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 567
            E+F DR                   GG+++T  E P V+LR+K+  D ++PS N+VS  
Sbjct: 677 IEMFYDRVAPSSQNLDSDAAKTKSGSGGFYSTAEEAPDVILRLKDGMDTSQPSTNAVSAS 736

Query: 568 NLVRLASIV 576
           NL RLA I+
Sbjct: 737 NLFRLALIL 745


>gi|167772692|ref|ZP_02444745.1| hypothetical protein ANACOL_04074 [Anaerotruncus colihominis DSM
           17241]
 gi|167665170|gb|EDS09300.1| hypothetical protein ANACOL_04074 [Anaerotruncus colihominis DSM
           17241]
          Length = 614

 Score =  327 bits (839), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 234/698 (33%), Positives = 336/698 (48%), Gaps = 102/698 (14%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFED   A +LN  F+SIKVDREERPD+D VYM   QA+ G GGWPL++ ++P+ K
Sbjct: 1   MERESFEDAQAADVLNSGFISIKVDREERPDIDAVYMAVCQAMTGSGGWPLTILMTPEQK 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
           P   GTY P   +YG+PG   +L++V   W  +R+ L Q+G        E  +  A    
Sbjct: 61  PFWAGTYLPKYSRYGQPGLIDLLKRVSLLWRTEREQLLQAG-------DEIAAYIAQRGP 113

Query: 148 LPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
              + PQ A L   A QL  ++D   GGFG APKFP P  +  ++ +++         ++
Sbjct: 114 GGAQAPQPALLHTAAGQLRAAFDPADGGFGDAPKFPSPHNLLFLMNYARW-------EKS 166

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
           ++ + M   TL  MA+GG+ D VGGGF RYS D RW  PHFEKMLYD   LA  YLDAFS
Sbjct: 167 ADARSMAERTLTQMARGGLFDQVGGGFSRYSTDRRWLAPHFEKMLYDNALLAYAYLDAFS 226

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
                F+    R  LDY+ R++  P G  +  +DADS         +EGA+Y+ T + VE
Sbjct: 227 QDGRPFWETTARRTLDYVLRELTSPEGAFYCGQDADSG-------GEEGAYYLLTPQSVE 279

Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 385
             LG + A  F   Y +  +GN            F+G+++   L +++      G     
Sbjct: 280 QALGAQDAARFCRWYGITESGN------------FEGRSIANLLENTAYEQEPEG----- 322

Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
                G  R +L D R  R   H DDKV+ +WN L+I++ ++A + L             
Sbjct: 323 ----FGRLRERLLDFRRSRAALHRDDKVLTAWNALMIAALSKAYRTL------------- 365

Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
            G  R  Y++ A  AA+F+  +L      RL   +R+G +   G LDDYAF    LL+LY
Sbjct: 366 -GDAR--YLDAARRAAAFLHANLTGPDG-RLWLRWRDGEAANMGQLDDYAFYAWALLELY 421

Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
                   L  A+ +  T    F D + GG+F T  +   ++ R KE +DGA PSGN+ +
Sbjct: 422 AADFDAAHLEEAVSMMQTLQVHFWDGQEGGFFLTADDAERLITRPKEIYDGAMPSGNAAA 481

Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC----AADMLSVPSR 621
            + L RL  +   +    ++  A+  LA   ++    A+  P   C    A      PSR
Sbjct: 482 GLVLERLWKL---TGDPVWQTRADGQLAFLASK----ALPYPAGHCFSLLAMGEALYPSR 534

Query: 622 KHVVLVGHKSSVDFENMLAAAHAS--YDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR- 678
           +   LV   S    + +LA A     + L KT                   SN A + R 
Sbjct: 535 E---LVCATSGTVPDGLLALAERRRLHTLIKT------------------PSNAALLERL 573

Query: 679 NNFSA------DKVVALVCQNFSCSPPVTDPISLENLL 710
             F+A      D  +  +CQN +C+ P     +L  LL
Sbjct: 574 APFTAAYPIPEDGALFYLCQNGACAAPAGSVQALVRLL 611


>gi|300113281|ref|YP_003759856.1| hypothetical protein Nwat_0572 [Nitrosococcus watsonii C-113]
 gi|299539218|gb|ADJ27535.1| protein of unknown function DUF255 [Nitrosococcus watsonii C-113]
          Length = 694

 Score =  327 bits (839), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 236/699 (33%), Positives = 367/699 (52%), Gaps = 70/699 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESFE+   A ++N+ F++IKVDREERPD+D++Y    Q L G  GGWPL
Sbjct: 53  SACHWCHVMAHESFENPETAAVMNEHFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPL 112

Query: 79  SVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           ++FL P    P  GGTYFPPE+++G PGFK +L++V + +  +R+++ QS    +    E
Sbjct: 113 TMFLEPVKQAPFFGGTYFPPEERHGLPGFKDLLQRVAEYFHTRREVI-QSQNERLLDAFE 171

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            L   +S+ ++ + L +  L+   +QL++++DSR+GGF  APKFP P  I+  L  +   
Sbjct: 172 KLDGRSSAAEV-EGLNRAPLQAAHQQLAQAFDSRYGGFRGAPKFPNPSIIERCLRDAHGE 230

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             T    E  +   M   TL+ MA+GGI+D +GGGF RYSVDE+W +PHFEKMLYD GQL
Sbjct: 231 HIT--EDEKQQALTMARLTLEQMAQGGIYDQLGGGFCRYSVDEKWRIPHFEKMLYDNGQL 288

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +Y DA+ L  +  +  I  +   ++ R+M  P G  +S+ DADS   EG     EG F
Sbjct: 289 LVLYRDAYRLWGNGIFRRILEETGHWVVREMQSPEGGYYSSLDADS---EG----HEGKF 341

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT ++V  +L +        Y+           +  P N F+G   L       A A 
Sbjct: 342 YVWTREQVRALLDDEKYTLAVRYF----------SLDQPAN-FEGHWHLYAAMTPEALAE 390

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           ++ +P       L   ++KLF  R  R RP  DDK++ +WN L+I   A A + L     
Sbjct: 391 EMKVPAPGLQEQLTAAKQKLFAAREARIRPGRDDKILTAWNSLMIKGMAAAGQALAQ--- 447

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                 PV       ++  AE A  F+R HL+  Q  RL  S+++G ++  G+LDDYAFL
Sbjct: 448 ------PV-------FIASAEKAVDFVRAHLW--QKGRLLVSYKDGRAQHQGYLDDYAFL 492

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  LL+L +       L +A++L       F D+  GG++ T  +  +++ R     D A
Sbjct: 493 LDALLELLQVRWRDGDLAFAVDLAEAVLGHFEDKAQGGFYFTADDHETLIHRPVPLMDNA 552

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCAADML 616
            P+GN +   +L+RL  ++   +   Y + AE++L A +E+  +       L+    + L
Sbjct: 553 TPAGNGILAWSLLRLGHLLGEMR---YLKAAENTLKAAWESLQQTPHAHCSLLKALEEWL 609

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM-----DFWEEHNS 671
           + P  + V+L G  S  + E+  A A A+Y   +  + I P + + +     ++W +  +
Sbjct: 610 TPP--QIVILRG--SGEELESWRAVAAAAYAPRRVTLAI-PLEAQYLPGILGEYWPQEAA 664

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                         V A VC   +CS P+T   +L+  L
Sbjct: 665 --------------VTAYVCSGHTCSAPLTQREALKEHL 689


>gi|344344146|ref|ZP_08775011.1| hypothetical protein MarpuDRAFT_1824 [Marichromatium purpuratum
           984]
 gi|343804430|gb|EGV22331.1| hypothetical protein MarpuDRAFT_1824 [Marichromatium purpuratum
           984]
          Length = 683

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 220/613 (35%), Positives = 321/613 (52%), Gaps = 58/613 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESF D  VA L+N  FV+IKVDREERPD+D +Y    Q L G GGGWPL
Sbjct: 58  SACHWCHVMAHESFADPEVATLMNRAFVNIKVDREERPDLDGLYQRAHQLLNGRGGGWPL 117

Query: 79  SVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           +VFLSP DL+P   GTYFPP  ++G P F  +L  V+ A+ ++ D + Q G    E L E
Sbjct: 118 TVFLSPHDLRPFFAGTYFPPTPRHGLPAFTQLLAGVERAYREQHDKILQQG----ENLIE 173

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           A  A            +N +     QL+ S+D R GGFG APKFP   E+ ++L  + + 
Sbjct: 174 AF-AGLEPEPGERPPERNLIGAALNQLAVSFDPRHGGFGGAPKFPHAPELALLLRCAARG 232

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +  G+  +A E  +M   +L+ M + G++D +GGGF RY+VD +W +PHFEKMLYD   L
Sbjct: 233 DRPGE--DAPEPLEMARVSLERMIRSGLNDQLGGGFCRYAVDAQWMIPHFEKMLYDNAAL 290

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             +  D  + T +  +        D++ R+M  P G  +S+ DADS   EG    +EG F
Sbjct: 291 LALCCDLHACTGEQLFRSAAESTADWVLREMQSPEGGYYSSLDADS---EG----EEGRF 343

Query: 318 YVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y+W  ++V  +L E     F   Y L    N            F+G+  L      +A A
Sbjct: 344 YLWEREQVRALLPEAEYRPFAAVYGLDRPPN------------FEGRWHLHGHLTPAAVA 391

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           +  G+ LE+  ++LG  R  LF  R +R RP  DDKV+ +WN L+I + ARA+++L    
Sbjct: 392 AAQGLTLEQVQSLLGAARATLFAERERRVRPGRDDKVLGAWNALMIGAMARAARVL---- 447

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       +R +Y+E AE A   +R  L+ +   RL  S R+G      +LDD+A 
Sbjct: 448 ------------ERDDYLESAEQALGCVRERLWRDG--RLLASCRDGRVAFDAYLDDHAL 493

Query: 497 LISGLLDLYEFGSGTKW----LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           L++ +L+L +    T+W    L +AIEL  T    F D E GG++ T  +   ++ R K 
Sbjct: 494 LLATVLELLQ----TRWSSADLAFAIELAETLLARFHDPEAGGFWFTAHDHERLIHRTKP 549

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
             D   P+GN V+ + L RL  +V   +   Y    E +L +  T ++ +  A   + CA
Sbjct: 550 LADETLPAGNGVAALALQRLGHLVGEPR---YLAAVESTLRLAATAMRRLPHAHATLLCA 606

Query: 613 ADMLSVPSRKHVV 625
            D    P  + V+
Sbjct: 607 LDEWLDPPEQLVI 619


>gi|297202044|ref|ZP_06919441.1| transmembrane protein [Streptomyces sviceus ATCC 29083]
 gi|297148022|gb|EDY58354.2| transmembrane protein [Streptomyces sviceus ATCC 29083]
          Length = 570

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 210/576 (36%), Positives = 297/576 (51%), Gaps = 59/576 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A LLN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 51  SSCHWCHVMAQESFEDQATADLLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPP  + G P F+ +L  V+ AW  +RD +A+     +  L+   
Sbjct: 111 VFLTPDAEPFYFGTYFPPSPRQGMPSFRQVLEGVRAAWTDRRDEVAEVAGKIVRDLA-GR 169

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             S   ++ P E    A  L    L++ YD++ GGFG APKFP  + ++ +L H  +   
Sbjct: 170 EISYGDSQAPGEEQLAAALLG---LTREYDAQRGGFGGAPKFPPSMVVEFLLRHHAR--- 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG  G      +M   T + MA+GGIHD +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 224 TGAEG----ALQMAQDTCERMARGGIHDQLGGGFARYSVDRDWIVPHFEKMLYDNALLCR 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       +  D  D++ R++    G   SA DADS   +G  R  EGA+YV
Sbjct: 280 VYAHLWRATGSDLARRVALDTADFMVRELRTAEGGFASALDADS--DDGTGRHVEGAYYV 337

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
           WT +++ ++LGE  A L  +++ +   G  +            G++VL +   D+   A 
Sbjct: 338 WTPEQLREVLGEQDAELAAQYFGVTEEGTFE-----------HGQSVLQLPQQDTVFDAE 386

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K           +   RR+L D R++RP P  DDKV+ +WNGL I++ A           
Sbjct: 387 K-----------VESIRRRLLDARAQRPAPGRDDKVVAAWNGLAIAALAETGAYF----- 430

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
                      DR + ++ A  AA  + R   DEQ  RL  + ++G   A  G L+DYA 
Sbjct: 431 -----------DRPDLVDAALGAADLLVRLHLDEQA-RLSRTSKDGQVGANAGVLEDYAD 478

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +  G L L        WL +A  L +     F   E G  F+T  +   ++   +   D 
Sbjct: 479 VAEGFLALASVTGEGVWLDFAGFLLDHVLTRFTGPE-GALFDTAADAERLIPPPQNPTDN 537

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
           A PSG + +    +   S  A + S+ +R+ AE +L
Sbjct: 538 AVPSGWTAAAPAPL---SYAAQTGSENHREGAEKAL 570


>gi|292493652|ref|YP_003529091.1| hypothetical protein Nhal_3684 [Nitrosococcus halophilus Nc4]
 gi|291582247|gb|ADE16704.1| protein of unknown function DUF255 [Nitrosococcus halophilus Nc4]
          Length = 694

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 234/691 (33%), Positives = 349/691 (50%), Gaps = 70/691 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESFE   +A  +N+ F++IKVDREERPD+D++Y    Q L G  GGWPL
Sbjct: 53  SACHWCHVMAHESFESPEIAAAMNEHFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPL 112

Query: 79  SVFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           ++FL P+ + P  GGTYFPPE ++G PGFK +L ++ + +   R+ +    +  +    E
Sbjct: 113 TMFLEPENQVPFFGGTYFPPEGRHGLPGFKDLLERIAEFFHAHREEIQSQNSRLLAAFEE 172

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
             + +++    P+ L    L+   +QL++S+D R+GGF  APKFP P  I+  L   + +
Sbjct: 173 LDTRTSAVE--PEMLGPAPLKAAQQQLAQSFDPRYGGFKGAPKFPNPSSIERCL---RDV 227

Query: 198 EDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
                S EA +    +   TL+ MA+GGI+D +GGGF RY+VD +W +PHFEKMLYD GQ
Sbjct: 228 RGEHLSAEARQKALDLARLTLEQMAQGGIYDQLGGGFCRYAVDSQWRIPHFEKMLYDNGQ 287

Query: 257 LANVYLDAFSLTKDVFYSYICRDILD----YLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           L  +Y DA+ L    + S  CR +L+    +  R+M  P G  +S+ DADS   EG    
Sbjct: 288 LLALYADAYEL----WGSERCRRVLEETGHWAIREMQSPEGGYYSSLDADS---EG---- 336

Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           +EG FYVWT ++V+ +L E        Y+           +  P N F+G   L      
Sbjct: 337 REGKFYVWTREQVQALLEEDEYPLVARYF----------GLDQPAN-FEGHWHLYGAITP 385

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
            A A +L +        L   ++KLF  R +R RP  DDK++ SWNGL+I   A A + L
Sbjct: 386 EALAQELNLSPRILEETLATAKQKLFAAREERIRPGRDDKILTSWNGLMIKGMAAAGQAL 445

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
              A                ++  AE A  F+R HL+ E   RL  S+++G  + PG+LD
Sbjct: 446 AEPA----------------FIASAERALDFVRGHLWREG--RLLVSYKDGRVQHPGYLD 487

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYAFL+  LL L +       L +A+EL       F D   GG++ T  +  +++ R   
Sbjct: 488 DYAFLLDALLALLQARWREGDLAFAVELAEAALAHFEDPAQGGFYFTADDHETLIHRPVP 547

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCC 611
             D A P+GN V   +L RL  ++   +   Y + AE +L      ++    A   L+  
Sbjct: 548 LMDNATPAGNGVLAWSLQRLGHLLGEMR---YLKAAERTLKASWASIQHTPHAHCSLLKT 604

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
             + L  P  + V+L G + ++   +  A A   Y   +  + I        D W     
Sbjct: 605 LEEWLYPP--QMVILRGPEENLG--SWRAIATGEYAPRRVSLAIPKGAR---DLW----- 652

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               +       D+V A VC   +CSPP+T 
Sbjct: 653 --GQLEEYRPEGDRVTAYVCSGHTCSPPLTQ 681


>gi|313675015|ref|YP_004053011.1| hypothetical protein Ftrac_0901 [Marivirga tractuosa DSM 4126]
 gi|312941713|gb|ADR20903.1| hypothetical protein Ftrac_0901 [Marivirga tractuosa DSM 4126]
          Length = 675

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 229/690 (33%), Positives = 336/690 (48%), Gaps = 87/690 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVME ESFEDE VAK++N+ ++ IK+DREERPD+D++YM  +Q +   GGWPL+V
Sbjct: 51  ACHWCHVMEHESFEDEEVAKVMNENYICIKLDREERPDIDQIYMDAIQTMGLHGGWPLNV 110

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL P+ KP  GGTYFP      +  +  IL KV  A+   R+ L +S      + ++AL+
Sbjct: 111 FLIPNQKPFYGGTYFP------KNKWLEILDKVAIAFQSSRNQLEESA----NKFAQALN 160

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSY-------DSRFGGFGSAPKFPRPVEIQMML-- 191
           A+         L  NA    ++ LS++Y       D   GG   APKFP PV  Q ++  
Sbjct: 161 AADGEKLSLGAL--NAENFNSKILSEAYQKLGSFLDWDNGGTLGAPKFPMPVIWQFLMKY 218

Query: 192 -YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
            +HS+            E +K + FTL  +A GGI+D +GGGF RYSVD  W  PHFEKM
Sbjct: 219 AFHSQN----------PEAKKALEFTLTSLADGGIYDQIGGGFARYSVDAEWFAPHFEKM 268

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           LYD GQL ++Y DAF  TK+ ++  I  D + +  R+++ P    +SA DADS   EG  
Sbjct: 269 LYDNGQLISLYADAFRFTKNPYFKEIFEDSIRFSAREIMDPYCRFYSALDADS---EG-- 323

Query: 311 RKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
             +EG FY WT  E+E ILG+ A    + Y     GN +            G+N+L   +
Sbjct: 324 --EEGKFYTWTYTELEQILGDKAEPILKFYNATEKGNWE-----------NGRNILFRHS 370

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
                     +  EK+   L E +  L D R  R RP +DDK++  WN L +     A K
Sbjct: 371 SIEDFCKAEKIDQEKFKAQLIEAKDSLLDAREDRVRPAMDDKILTGWNALQMKGICDAYK 430

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
             +                 K+Y  +A+    F+   ++D   ++L  SF+N   K   +
Sbjct: 431 AYQD----------------KKYKAIAQDNFVFLSEFVWD--GNQLFRSFKNEQPKIKAY 472

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYA  I   + L+E  S +K L +A +L N   + F D +   +F T      ++ R 
Sbjct: 473 LEDYALAIQASISLFEISSDSKALDFAEKLTNYAIQNFYDEKEKLFFYTDKSSEKLIARK 532

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           KE  D   P+ NSV + NL  L  I+ G+ S  + + +E  L   +  L      +    
Sbjct: 533 KEIFDNVIPASNSVMIENLHWLG-ILKGNSS--FTEISEQMLKQIQHLLPREPKFLANYA 589

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A  + +  S   +V+VG K++      L     S+ L  T I   P ++++   W+   
Sbjct: 590 SAYALKAFRSY-DIVIVGTKAT-----ELQKELWSHYLPNTFIMAIPEESKDQLVWKGKE 643

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPV 700
             N           K    VC+N +C  PV
Sbjct: 644 IINT----------KTTIYVCENNACQQPV 663


>gi|456389199|gb|EMF54639.1| hypothetical protein SBD_4307 [Streptomyces bottropensis ATCC
           25435]
          Length = 686

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 230/698 (32%), Positives = 331/698 (47%), Gaps = 76/698 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED   A+ LN  FV+IKVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 52  SSCHWCHVMAHESFEDGETAEYLNAHFVNIKVDREERPDVDAVYMEAVQAATGQGGWPMT 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPP  ++G P F+ +L  V+ AW  +RD +A+     +  L+   
Sbjct: 112 VFLTPDGEPFYFGTYFPPAPRHGMPSFRQVLEGVRAAWADRRDEVAEVAGKIVRDLAGRE 171

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L  +A      DEL Q  L      L++ YD+  GGFG APKFP  + I+ +L H+ +  
Sbjct: 172 LKFAAVDVPGEDELAQALL-----GLTREYDAARGGFGRAPKFPPSMVIEFLLRHAAR-- 224

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 225 -TGSEG----ALQMARDTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 279

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS +  G  +  EGA+Y
Sbjct: 280 RVYAHLWRATGSELARRVALETADFMVRELRTNEGGFASALDADSDDGTGTGKHVEGAYY 339

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VWT +++ ++LGE       H++                        + E       AS 
Sbjct: 340 VWTPEQLTEVLGEEDARLAAHHF-----------------------GVTEEGTFEEGASV 376

Query: 379 LGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           L +P  + +   + +   R +L   R +RP P  DDKV+ +WNGL +++ A         
Sbjct: 377 LQLPQREGVFDADKIESIRERLLAARVRRPAPGRDDKVVAAWNGLAVAALAET------- 429

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDY 494
              A F+ P              +A   +R HL DE+  RL  + ++G   A  G L+DY
Sbjct: 430 --GAYFDRP------DLVDAAIAAADLLVRLHL-DERA-RLARTSKDGRVGANAGVLEDY 479

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +  G L L        WL +A  L +     F+D E G  ++T  +   ++ R ++  
Sbjct: 480 ADVAEGFLALASVTGEGVWLEFAGFLLDHVLVRFVDEESGALYDTASDAEKLIRRPQDPT 539

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D A PSG S +      L    A + S+ +R  AE +L V +         +      A+
Sbjct: 540 DNATPSGWSAAAGA---LLGYAAHTGSEPHRTAAERALGVVKALGPRAPRFIGWGLATAE 596

Query: 615 MLSVPSRKHVVL--VGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
            L    R+  VL   GH  + +         A       V+ + P D++E+         
Sbjct: 597 ALLDGPREVAVLGPQGHPGTRELHRTALLGTAP----GAVVAVGPPDSDELPL------- 645

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
              +A       +  A VC+NF+C  P TD   L   L
Sbjct: 646 ---LADRPLVGGEPTAYVCRNFTCDAPTTDVDRLRTAL 680


>gi|46198930|ref|YP_004597.1| hypothetical protein TTC0622 [Thermus thermophilus HB27]
 gi|46196554|gb|AAS80970.1| hypothetical conserved protein [Thermus thermophilus HB27]
          Length = 642

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 219/591 (37%), Positives = 300/591 (50%), Gaps = 83/591 (14%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESF+DE VA+LLN  FV +KVDREERPDVD  YM  + +L G GGWP+S+
Sbjct: 49  SCHWCHVMHRESFQDEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSL 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ KP  GGTYFP ED+ G PGFK +L  V +AW  KR+ + +      E+L+ AL 
Sbjct: 109 FLTPEGKPFFGGTYFPKEDRMGLPGFKRVLVAVAEAWAGKREAILEEA----ERLTRALW 164

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            S +    P  LP+ A     + L +++D  +GGF  APKFP+   +  +L  + + E+ 
Sbjct: 165 KSLTPP--PGPLPEGAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE- 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     +++  TL+ MA GG++D VGGGFHRYSVD  W +PHFEKMLYD   LA V
Sbjct: 222 -------RAARLLRPTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARV 274

Query: 261 YLDAFSLTKDVFYSYICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           YL A+ L  +  +  + R+ LD+L    RR+     G   +A D   AE+EG    +EG 
Sbjct: 275 YLGAYKLFGEDLFLRVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEGR 322

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           +Y W   E+ + LGE   L + ++ L      DL            ++VL    ++ A  
Sbjct: 323 YYTWAEVELREALGEDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEAR- 367

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
             LG   E +       R KL   R +R  P LDDKV+  W+ L + + A A ++   E 
Sbjct: 368 KVLG---EGFFAWREGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEE- 423

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y+E A   A F+  H+Y E    L+H++R G      +L D AF
Sbjct: 424 ---------------RYLEAARRGARFLLAHMYREGL--LRHTWR-GSLGEEAYLSDQAF 465

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
                L+LY       +L WA  L      LF  REG          PS+ L  KE  +G
Sbjct: 466 AALAFLELYAATGEWPYLDWAQRLAEAGWRLF--REG----------PSLPLPAKEVEEG 513

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
           A PSG S     LVRL ++  G     YR+ AE  LA     L     A+P
Sbjct: 514 ALPSGESALAEALVRLGAVFGGD----YRERAEEVLAEKARWLARYPHALP 560


>gi|296445985|ref|ZP_06887935.1| protein of unknown function DUF255 [Methylosinus trichosporium
           OB3b]
 gi|296256503|gb|EFH03580.1| protein of unknown function DUF255 [Methylosinus trichosporium
           OB3b]
          Length = 679

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 226/698 (32%), Positives = 340/698 (48%), Gaps = 76/698 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE++ +A L+N  F+++KVDREERPD+D +Y   +Q L   GGWPL++F
Sbjct: 52  CHWCHVMAAESFENDRIAALMNANFINVKVDREERPDIDHLYQQALQMLGRRGGWPLTMF 111

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS-GAFA--IEQLSEA 138
           L+PD +P  GGTYFPPE ++G PGF  IL+ V + W +K  ++ ++ GA A  +++L+E+
Sbjct: 112 LTPDGEPFWGGTYFPPEPRHGMPGFADILQAVAELWREKPAVVTRNVGAIANGLDRLAES 171

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
             A   S  L        L    E+L +  D   GG   APKFP+P  ++ +    K   
Sbjct: 172 APAEPISPVL--------LETITERLEELIDREHGGIRGAPKFPQPPSLEFLWRAWK--- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
              ++G AS  ++ VL TL  + +GGI+DH+GGGF RYS DERW  PHFEKMLYD GQL 
Sbjct: 221 ---RTGRASL-REAVLTTLDHICQGGIYDHIGGGFARYSTDERWLAPHFEKMLYDNGQLV 276

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            +    +   +   Y+    + +D+  R+M  P G   S+ DADS         +EG FY
Sbjct: 277 ELLTLVWQDERKPLYAARVEETIDWALREMRLPEGVFASSLDADS-------EHEEGKFY 329

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VW++ E++  LGE A  F+  Y +   GN +         E    N L+E+   SA A  
Sbjct: 330 VWSAAEIDAALGERAGAFRAAYDVTEAGNWE---------EKNIPNRLLEMALGSAEAEA 380

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                   L  L E R           RP  DDK +  WNGL+I++ A A++        
Sbjct: 381 ALAADRAALLALRETRV----------RPGRDDKALADWNGLMIAALAAAAQA------- 423

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                      R +++ VA +A  FI   +      RL HS+R G +K    LDDYA L 
Sbjct: 424 ---------FARPDWLAVATAAFDFIATSMTTADG-RLLHSYRAGRAKHMAVLDDYADLC 473

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              L L+E      +L    E     +  + D   GGYF T  +  +++ R K   D   
Sbjct: 474 RAALTLHEATGDDAYLTRCREWAEIVETHYRD-PAGGYFFTADDAEALIRRAKIAEDAPL 532

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN      L RL  +   +    YR+ AE +L  F   ++   +    +   A++L  
Sbjct: 533 PSGNGAMTQVLARLYHLTGETA---YRERAEATLTAFAGTVRRGLLGYSTLLSGAEILR- 588

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
                +V++G +++ D   +L   H +    ++++   P      D    H +   +   
Sbjct: 589 -DGLQIVIIGARAAEDTAALLRVLHETSLPGRSLLVAAPGAALPPD----HPAAGKTQVD 643

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
                 +  A +C+  +CS P+ +P SL   L  +P +
Sbjct: 644 G-----RAAAYMCRGTTCSLPIVEPASLALALRGEPQT 676


>gi|381190578|ref|ZP_09898097.1| hypothetical protein RLTM_06066 [Thermus sp. RL]
 gi|384431187|ref|YP_005640547.1| tmk1; thymidylate kinase [Thermus thermophilus SG0.5JP17-16]
 gi|333966655|gb|AEG33420.1| tmk1; thymidylate kinase [Thermus thermophilus SG0.5JP17-16]
 gi|380451573|gb|EIA39178.1| hypothetical protein RLTM_06066 [Thermus sp. RL]
          Length = 642

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 218/591 (36%), Positives = 302/591 (51%), Gaps = 83/591 (14%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESF+DE VA+LLN  FV +KVDREERPDVD  YM  + +L G GGWP+S+
Sbjct: 49  SCHWCHVMHRESFQDEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSL 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ KP  GGTYFP ED+ G PGFK +L  V +AW  KR+ + +      E+L+ AL 
Sbjct: 109 FLTPEGKPFFGGTYFPKEDRMGLPGFKRVLVAVAEAWAGKREAVLEEA----ERLTRALW 164

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            S +    P  LP+ A     + L +++D  +GGF  APKFP+   +  +L  + + E+ 
Sbjct: 165 KSLTPP--PGPLPEGAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE- 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     +++  TL+ MA GG++D VGGGFHRYSVD  W +PHFEKMLYD   LA V
Sbjct: 222 -------RAARLLRPTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARV 274

Query: 261 YLDAFSLTKDVFYSYICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           YL A+ L  +  +  + R+ LD+L    RR+     G   +A D   AE+EG    +EG 
Sbjct: 275 YLGAYKLFGEDLFLRVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEGR 322

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           +Y WT  E+ + LGE   L + ++ L      DL            ++VL    ++    
Sbjct: 323 YYTWTEAELREALGEDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEVRE 368

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           + LG   E +       R KL   R +R  P LDDKV+  W+ L + + A A ++   EA
Sbjct: 369 A-LG---EGFFAWREGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEEA 424

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                           Y+E A+  A F+  H+Y  +   L+H++R G      +L D AF
Sbjct: 425 ----------------YLEAAKRGARFLLAHMY--RGGLLRHTWR-GSLGEEAYLSDQAF 465

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
                L+LY       +L WA         LF  REG          PS+ L  KE  +G
Sbjct: 466 AALAFLELYAATGEWPYLDWAQRFAEAGWRLF--REG----------PSLPLPAKEVEEG 513

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
           A PSG S     LVRL ++  G     YR+ AE  LA     L     A+P
Sbjct: 514 ALPSGESALAEALVRLGAVFGGD----YRERAEEVLAEKARWLARYPHALP 560


>gi|289769445|ref|ZP_06528823.1| conserved hypothetical protein [Streptomyces lividans TK24]
 gi|289699644|gb|EFD67073.1| conserved hypothetical protein [Streptomyces lividans TK24]
          Length = 680

 Score =  326 bits (836), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 234/701 (33%), Positives = 337/701 (48%), Gaps = 72/701 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED   A+ LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SACHWCHVMAHESFEDGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPPE ++G P F+ +L+ V+ AW ++RD + +     +  L+   
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGMPSFRQVLQGVRQAWAERRDEVDEVAGKIVRDLAGRE 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +S   +     ++L Q  L      L++ YD R GGFG APKFP  + I+ +L H  +  
Sbjct: 168 ISYGDAEAPGEEQLGQALL-----GLTREYDERRGGFGGAPKFPPSMVIEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGAEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS   +G  +  EGA Y
Sbjct: 276 RVYAHLWRATGSDLARRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAHY 333

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
           VWT  ++ ++LG E A L  +++ +   G  +            G +VL +   +S   A
Sbjct: 334 VWTPAQLTEVLGAEDAELAAQYFGVTQEGTFE-----------HGASVLQLPQQESVFDA 382

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           ++           +   R +L   R  RP P  DDKV+ +WNGL I++ A          
Sbjct: 383 AR-----------IASVRERLLAARDGRPAPGRDDKVVAAWNGLAIAALAET-------- 423

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
             A F  P              +A   +R HL DEQ  RL  + ++G + A  G L+DYA
Sbjct: 424 -GAYFERP------DLVEAAVAAADLLVRLHL-DEQV-RLTRTSKDGRAGANAGVLEDYA 474

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +     F D E G  ++T  +   ++ R ++  D
Sbjct: 475 DVAEGFLALASVTGEGVWLDFAGFLLDHVLTRFTD-ESGSLYDTAADAERLIRRPQDPTD 533

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A PSG S +   L+   S  A + S  +R  AE +L V +     +   +     AA+ 
Sbjct: 534 NATPSGWSAAAGALL---SYAAHTGSAPHRAAAERALGVVKALGPRVPRFIGWGLAAAEA 590

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L    R+  V+    +            A+  L++T + +  A    + F  E +     
Sbjct: 591 LLDGPREVAVVAPDPAD----------PAARGLHRTAL-LGTAPGAVVAFGTEGSDEFPL 639

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
           +A          A VC+NF+C  P TDP  L   L   P+ 
Sbjct: 640 LADRPLVGGAPAAYVCRNFTCDAPTTDPDRLRTALGVAPTG 680


>gi|375097065|ref|ZP_09743330.1| thioredoxin domain containing protein [Saccharomonospora marina
           XMU15]
 gi|374657798|gb|EHR52631.1| thioredoxin domain containing protein [Saccharomonospora marina
           XMU15]
          Length = 673

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 226/693 (32%), Positives = 327/693 (47%), Gaps = 78/693 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+  A  +N  FV+IKVDREERPD+D VYMT  QA+ G GGWP++ F
Sbjct: 49  CHWCHVMAHESFEDDETAAFMNAHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD KP   GTY+PP  ++G P F+ +L  V  AW ++ D L Q     +  + E  + 
Sbjct: 109 LTPDGKPFHCGTYYPPTPRHGMPSFRQVLTAVARAWSERADELRQGATKIVSHIQEQTAP 168

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            A        + + A+      L    D   GGFG APKFP  + ++ +L H    E TG
Sbjct: 169 LAQR-----PVDEEAIATAVSTLRGQIDPGHGGFGGAPKFPPAMVMEFLLRH---YERTG 220

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               ++E   +V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y
Sbjct: 221 ----SAEALSVVELTAEGMARGGIYDQLAGGFARYSVDAAWVVPHFEKMLYDNALLLRCY 276

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T     + +  +  ++L RD+    G   ++ DAD   TEG     EG  YVWT
Sbjct: 277 AHLARRTSSALATRVAAETAEFLLRDLRTQEGGFAASLDAD---TEGV----EGLTYVWT 329

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
             ++ ++LG     +    +          R+++      G + L    D   +A     
Sbjct: 330 PAQLVEVLGPEDGSWAAEVF----------RVTEEGTFEHGASTLQLPRDPDETA----- 374

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
              ++L +       L + R+ RP+P  DDKV+ +WNGL I++ A A   L         
Sbjct: 375 ---RWLRV----STALLEARNGRPQPSRDDKVVTAWNGLAITALAEAGVAL--------- 418

Query: 442 NFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLIS 499
                  +R +++E A SAA   + RHL D    RL+ S R G   +A G L+DYA L  
Sbjct: 419 -------ERPDWVEAAVSAAELLLDRHLVDA---RLRRSSRGGVVGEAAGVLEDYACLAE 468

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAE 558
           GLL +++    + WL  A  L +T  ELF D E  G F+ T  D   L+ R  +  D A 
Sbjct: 469 GLLAVHQASGESVWLTQATLLLDTALELFSDDELPGAFHDTAADAEALVHRPSDPTDNAT 528

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA-MAVPLMCCAADMLS 617
           PSG S     L+  +++    ++  YRQ  E +L    T +      A   +  A  +L+
Sbjct: 529 PSGASALAGALLTASALAGPDRAGEYRQACERALDRAGTIVAQAPRFAGHWLSVAEALLA 588

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
            P +  V +VG  ++   + ++ AA   +     V+   P +   +            +A
Sbjct: 589 GPVQ--VAVVGPDAAARSDLLVEAAREVH--GGGVVLAGPPEAGGVPL----------LA 634

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                     A VC  + C  PVT P  L   L
Sbjct: 635 DRPLVDGNAAAYVCHGYVCERPVTTPQRLAAAL 667


>gi|288932323|ref|YP_003436383.1| hypothetical protein Ferp_1971 [Ferroglobus placidus DSM 10642]
 gi|288894571|gb|ADC66108.1| protein of unknown function DUF255 [Ferroglobus placidus DSM 10642]
          Length = 628

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 205/611 (33%), Positives = 314/611 (51%), Gaps = 73/611 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  + FE+E +AK++N+ FV++KVDR+ERPD+D+ Y  +V A  G GGWPL+VF
Sbjct: 50  CHWCHVMAKKCFENEDIAKIINENFVAVKVDRDERPDIDRRYQEFVFATTGTGGWPLTVF 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD +P  GGTYFPPED +G  GFKT+L K+ + W+K R+ L +S    +E L +    
Sbjct: 110 LTPDGEPFFGGTYFPPEDGFGMIGFKTLLLKISEMWEKDRESLLKSAKQIVESLKKFSER 169

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLED 199
             SSN     L +  ++   + +    D   GG G APKF      +++L  Y+  K ED
Sbjct: 170 DFSSN-FDFTLIEKGIKAVLDNM----DYVNGGIGRAPKFHHAKAFELLLTHYYFTKDED 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             K+ E          TL  MAKGG++D + GGF RYS D+RWHVPHFEKMLYD  +L  
Sbjct: 225 LIKAVE---------LTLDAMAKGGVYDQLIGGFFRYSTDDRWHVPHFEKMLYDNAELLK 275

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  A+ +TK   Y  + + I+DY R+  +   G  ++++DAD  E E      EG +Y+
Sbjct: 276 LYTIAYQITKKELYRKVAKGIVDYYRKFGVDERGGFYASQDADIGELE------EGGYYI 329

Query: 320 WTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           ++ +E++++L +        Y+ L+                 +GKNVL    D +  +  
Sbjct: 330 FSLEEIKEVLNDEEFRIASLYFGLR-----------------EGKNVLHVSLDENEISEI 372

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           LG+P+ +   I+   + KL +VR +R  P +D  +  +WNGL+I +     K        
Sbjct: 373 LGIPVRRVKEIIESAKEKLLEVRERRETPFIDKTIYTNWNGLMIEAMCDYYK-------- 424

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
             FN P         +EVAE +     R L       L H+         GF +DY F  
Sbjct: 425 -SFNDPWA-------VEVAEKSGE---RLLKFWDGDVLLHT-----DDVEGFSEDYIFFA 468

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGA 557
            GL+ L+E     K+L  A+E+     +LF D + GG+F+       +L L+VK+  D  
Sbjct: 469 KGLIALFEITQKGKYLNAAVEITKRAVDLFWDHKRGGFFDRKSSGNGLLSLKVKDIQDSP 528

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCA 612
           + S N ++ + L  L+S+     ++ +   A+ SL  F   L+   +  P     L    
Sbjct: 529 QQSVNGIAPLLLTTLSSVTG---TEEFGALAKKSLRAFAGILEKYPLISPSYMISLYAYI 585

Query: 613 ADMLSVPSRKH 623
             +  V +R+H
Sbjct: 586 RGIYLVKTRRH 596


>gi|354612894|ref|ZP_09030833.1| thioredoxin domain protein [Saccharomonospora paurometabolica YIM
           90007]
 gi|353222771|gb|EHB87069.1| thioredoxin domain protein [Saccharomonospora paurometabolica YIM
           90007]
          Length = 667

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 232/698 (33%), Positives = 336/698 (48%), Gaps = 91/698 (13%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF D   A  +N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ F
Sbjct: 49  CHWCHVMAHESFSDADTAAYMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD +P   GTY+PP  K+G P F  +L  V  AW ++RD L +     +  ++E    
Sbjct: 109 LTPDGEPFHCGTYYPPVSKHGLPSFVQVLTAVTQAWTERRDELVEGAGRIVTHIAE--QT 166

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
              S    DE    AL     +L +  D   GGFG+APKFP  + ++ +L H ++   TG
Sbjct: 167 GPLSEHPVDE---QALSSAVAKLRQEADPANGGFGTAPKFPPSMVLEFLLRHHER---TG 220

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               ++E   +V  T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L   Y
Sbjct: 221 ----SAEALSLVELTAERMARGGIYDQLGGGFARYSVDVAWVVPHFEKMLYDNALLLRAY 276

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T     + +  +  ++L RD+    G   ++ DAD+   EG T       YVWT
Sbjct: 277 AHLARRTGSAIATRVAGETAEFLLRDLRTAEGGFAASLDADTDGVEGLT-------YVWT 329

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            +++ ++LG E      E + +   G  +           KG + L   +D    A    
Sbjct: 330 PEQLVEVLGPEDGAWAAELFGVTEEGTFE-----------KGASTLRLPHDPDDPA---- 374

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
               ++L +       LF  R  RP+P  DDKVI +WNGL I++ A A   L+       
Sbjct: 375 ----RWLRV----STALFQARGTRPQPARDDKVIAAWNGLAITALAEAGTALR------- 419

Query: 441 FNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLI 498
                    R E+++ A SA ++ + RHL D    RL+ S RNG    A G L+D+  L 
Sbjct: 420 ---------RPEWVDAAVSAGAYLLDRHLVD---GRLRRSSRNGEVGAANGVLEDHGCLA 467

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGA 557
            GLL L++    + WL+ A  L +   E F   +  G F+ T +D   L+ R  +  D A
Sbjct: 468 DGLLALHQATGESVWLLEATRLLDIARERFAVADTPGAFHDTADDAEALVHRPSDPTDNA 527

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCA 612
            PSG S     L+  +++V   K+  YR  AE ++    +R   +   VP      +  A
Sbjct: 528 SPSGASTVAGALLTASALVGPEKASDYRAAAEQAV----SRAGALVAQVPRFAGHWLSVA 583

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
             M + P +  V +VG  +    E +  AAH  +     V+   P ++E +    +    
Sbjct: 584 EAMAAGPVQ--VAVVGPDAEARSELLSTAAHDVH--GGGVVLGGPPESEGVPLLADRPLV 639

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           + S A          A VC  + C  PVT   + E LL
Sbjct: 640 DGSAA----------AYVCHGYVCDRPVT---TTEELL 664


>gi|82701479|ref|YP_411045.1| hypothetical protein Nmul_A0345 [Nitrosospira multiformis ATCC
           25196]
 gi|82409544|gb|ABB73653.1| Protein of unknown function DUF255 [Nitrosospira multiformis ATCC
           25196]
          Length = 700

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 228/705 (32%), Positives = 340/705 (48%), Gaps = 81/705 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  E FED  VA+++N +F++IKVDREERPD+D++Y T +  L    GGWPL
Sbjct: 48  SACHWCHVMAHECFEDAEVAEVMNRYFINIKVDREERPDIDQIYQTALYMLTQRSGGWPL 107

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           ++FL+PD KP  GGTYFP   ++  PGF  +L +V + +  +R  + +  A  ++  +  
Sbjct: 108 TLFLTPDQKPFFGGTYFPKTPRHSLPGFLDLLPRVAETYRVRRPEIERQSASLLKSFANM 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L + A    +  E P   L     +L   +DS  GGFG  PKF    E+   L   ++  
Sbjct: 168 LPSKAPEAPVFSERP---LEQALAELKNRFDSENGGFGEPPKFLHLTELDFCL---RRYF 221

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
             G S    E   M   TL+ MA+GGI+D VGGGF+RYS D++W +PHFEKMLYD G L 
Sbjct: 222 TAGNS----EALHMATLTLEKMAEGGIYDQVGGGFYRYSTDKQWQIPHFEKMLYDNGPLL 277

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIG--------PGGEIFSAEDADSAETEGAT 310
           ++Y DA+  + +  ++ I  +   ++ R+M           G   +S  DADS       
Sbjct: 278 HLYADAWIASGNPLFARIVEETATWVMREMQPEYEENEKRTGAGYWSTLDADSENV---- 333

Query: 311 RKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
              EG FYVW   E   IL     +    +Y        LS+ ++  N +    V   L 
Sbjct: 334 ---EGKFYVWDRSEASHILSRREYVVAASHY-------GLSQPANFGNRYWHLAVAQSLP 383

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
           +    A   G+   +    L   R+KL   R  R RP  D+K++ SWNGL+I   ARA +
Sbjct: 384 E---IAENFGVTYAEARQWLESGRKKLLAQRQCRVRPGRDEKILTSWNGLMIKGMARAGR 440

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +                  R +++  A  A  FIR  L+  +  RL  ++++G ++   +
Sbjct: 441 VF----------------GRDDWVRSAICAVDFIRSTLW--KNGRLLATWKDGNARLNAY 482

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           LDDYAFL+ GLL+L +       L +AI L     + F D+E GG+F T+ +  +++ R 
Sbjct: 483 LDDYAFLLDGLLELMQTTFRPVDLDFAIALAEVLLDQFEDKEAGGFFFTSHDHENLIHRP 542

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
           K  +D A PSGN V+   L R+  ++   +   Y Q AE +L +F   L    +  P  C
Sbjct: 543 KPGYDNATPSGNGVAAHTLQRMGYLLGEFR---YLQAAERALRLFYPAL----LRHPDSC 595

Query: 611 C----AADMLSVPSRKHVVLVGHKSSVDFENML-AAAHASYDLNKTVIHIDPADTEEMDF 665
           C    A +    P    ++    +    +EN L      +  L   V  + PA       
Sbjct: 596 CSLLLALEQWLTPPPVVILRGKAEPMAKWENALRQRVPIALVLALPVERVTPA------- 648

Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                +   S+A+   S   V A VC    C P VTD   L+ LL
Sbjct: 649 -----ALPPSLAKPVPSGMGVNAWVCHGVKCLPEVTD---LQELL 685


>gi|299133196|ref|ZP_07026391.1| protein of unknown function DUF255 [Afipia sp. 1NLS2]
 gi|298593333|gb|EFI53533.1| protein of unknown function DUF255 [Afipia sp. 1NLS2]
          Length = 683

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 238/715 (33%), Positives = 349/715 (48%), Gaps = 110/715 (15%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFEDE  A ++N+ FV IKVDREERPD+D++YM  +  L   GGWPL++F
Sbjct: 56  CHWCHVMAHESFEDETTAAVMNELFVPIKVDREERPDIDQIYMNALHLLGEQGGWPLTMF 115

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P+ GGTYFP   +YGR  F  +LR++   +  + D +A + A   + LS+  SA
Sbjct: 116 LTPDGAPVWGGTYFPKTAQYGRAAFVEVLRELARIFRDEPDKIAANKAAIEKSLSQRSSA 175

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            A+S  L      N L   A  ++++ D   GG   APKFP+             LE   
Sbjct: 176 DAASIGL------NELDNAAGSIARATDPTNGGLRGAPKFPQ----------CSMLEFLW 219

Query: 202 KSGEASEGQKMVLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           ++G  +  ++  + T   L  M++GGI+DH+GGG+ RYSVD RW VPHFEKMLYD  Q+ 
Sbjct: 220 RAGARTGDERYFITTNLALTQMSQGGIYDHLGGGYARYSVDARWLVPHFEKMLYDNAQIL 279

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
           ++     +   +  Y     + + +L+R+M+   G   S+ DADS   EG    +EG FY
Sbjct: 280 DMLALEHARAPNELYRQRAEETVGWLKREMLTKEGGFASSLDADS---EG----EEGKFY 332

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VW+  ++  +LG + A  F   Y +   GN            F+G N+L  L+D S +A+
Sbjct: 333 VWSQADIAHLLGPDDATFFAAKYGVSAEGN------------FEGHNILNRLDDGSETAT 380

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +           L   R  LF  R KR  P LDDKV+  WNGL I++             
Sbjct: 381 E--------AEQLAALRAILFRAREKRVHPGLDDKVLADWNGLTIAA---------LAHA 423

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
           +  FN       R +++ +A +A  F+   +   +  RL HS+R G    P    D+A +
Sbjct: 424 ANAFN-------RPDWLTLATTAFGFVTTTM--SRRDRLGHSWRAGKLLQPALASDHAAM 474

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I   L LYE      +L  AI  Q   D  + D + GGYF T+ +   ++LR     D A
Sbjct: 475 IRAALALYEATGDHLFLDQAILWQADLDTHYGDPQHGGYFLTSDDAEGLILRPHSTVDDA 534

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-- 615
            P+   ++  NL RLA +    +  + RQ              DM     L   AA+M  
Sbjct: 535 IPNHVGLTAQNLARLAVLTGDER--WRRQ-------------LDMLFKHMLPVAAANMFG 579

Query: 616 -LSVPSRKHVVLVGHKSSVD-----FENMLAAAHASYDLNKTVIHI-DPADTEEMDFWEE 668
            LS+ +   + L G +  V       E +L AA A       V+ + DP           
Sbjct: 580 HLSLLNALDLYLAGSEIVVTGQGEGVEALLKAARALPHATTIVLRVPDP----------- 628

Query: 669 HNSNNASMARNNFSADKV-----VALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
                A +  ++ +ADKV      A VC+  +CS PVT+P +L  L+L + +S+A
Sbjct: 629 -----AKLPPHHPAADKVAPGGGAAFVCRGQTCSLPVTEPDALTALVLREDASSA 678


>gi|452958537|gb|EME63890.1| hypothetical protein H074_04714 [Amycolatopsis decaplanina DSM
           44594]
          Length = 688

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 240/716 (33%), Positives = 333/716 (46%), Gaps = 97/716 (13%)

Query: 10  TKTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 65
            K R    L++     CHWCHVM  ESFEDE  A L+N  FV+IKVDREERPD+D VYM 
Sbjct: 54  AKRRNVPILLSVGYAACHWCHVMAHESFEDEATATLMNANFVNIKVDREERPDIDSVYMA 113

Query: 66  YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA 125
             QA+ G GGWP++ FL+P+ +P   GTY+PP  + G P F  +L  V +AWD++   L 
Sbjct: 114 ATQAMTGQGGWPMTCFLTPEGEPFHCGTYYPPSPRPGMPSFSQLLVAVAEAWDERPGELR 173

Query: 126 QSGAFAIEQLSEALSASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRP 184
                 I  L+E       S  LP+ +   A L      L K YD+  GGFG APKFP  
Sbjct: 174 SGARQIIAHLTE------KSGPLPESVVDGAVLESAVASLRKEYDAENGGFGGAPKFPPT 227

Query: 185 VEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 244
           + +  +L H ++   TG       G  MV  T + MA GG++D + GGF RYSVD RW V
Sbjct: 228 MALNFLLRHHER---TGS------GLSMVEHTAEAMALGGLNDQLAGGFARYSVDARWEV 278

Query: 245 PHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA 304
           PHFEKMLYD G L   Y     +T   +      +  ++L RD+    G   ++ DAD+ 
Sbjct: 279 PHFEKMLYDNGLLLRFYARFHGVTGYEYARRTVEETAEFLLRDLGTAEGGFAASLDADTD 338

Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGN----CDLSRMSDPHNE 359
             EG T       YVWT  ++ ++LGE       E + +   GN        R+ +PH E
Sbjct: 339 GVEGLT-------YVWTPAQLAEVLGEEDGAWAAELFQVAEPGNFEHGASTLRLREPHPE 391

Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 419
                                   E+Y  +    RR L   R +RP+P  DDKVI +WNG
Sbjct: 392 DA----------------------ERYERV----RRALLAARGQRPQPARDDKVIAAWNG 425

Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR-RHLYDEQTHRLQH 478
           L I +FA A   L                 R ++++ A  AA+F+  +H  D    RL+ 
Sbjct: 426 LAIGAFANAGSRLG----------------RPQWIDAATRAAAFLMDKHFVD---GRLRR 466

Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
           + R+G      G L+DYA L  GLL+L++     +WL  AI L +     F   +  G +
Sbjct: 467 TSRDGVVGTTAGVLEDYACLAEGLLELHQSTGEPRWLADAITLLDLALAHFGVPDSPGAY 526

Query: 538 NTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVRLASIVAG-SKSDYYRQNAEHSLAVF 595
             T +D  VL+ R  +  D A PSG S ++ N +  AS++AG  +   YR+ AE +LA  
Sbjct: 527 YDTADDAEVLVQRPSDPTDNASPSGAS-ALANALLTASVLAGHDQVGRYREAAEQALARA 585

Query: 596 ETRLKDMA-MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
                     A   +  A    + P +  VV     S  D   +LAAA AS      V+ 
Sbjct: 586 GRLAAHAPRFAGHWLTVAEAAAAGPVQVAVVGPDAASRAD---LLAAAVASSPDGAVVVS 642

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             P D + +            +A          A VC+ + C  PV     L + L
Sbjct: 643 GTP-DADGVPL----------LADRPLVEGAAAAYVCRGYVCERPVATAEELRSQL 687


>gi|291009338|ref|ZP_06567311.1| hypothetical protein SeryN2_32865 [Saccharopolyspora erythraea NRRL
           2338]
          Length = 683

 Score =  326 bits (835), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 229/693 (33%), Positives = 321/693 (46%), Gaps = 89/693 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A ++N+ FV+IKVDREERPDVD VYM   QA+ G GGWP++ 
Sbjct: 51  ACHWCHVMAHESFEDEATAAVMNENFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTC 110

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P   GTY+P    +G P F+ +L  V  AW ++   + Q+    +EQL     
Sbjct: 111 FLTPDAEPFHCGTYYPSAPLHGMPSFRQLLDAVASAWRERGGEVRQAATRVVEQL----- 165

Query: 141 ASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            SA    LP+  L    +     +L    D    GFG APKFP  + ++ +L H ++   
Sbjct: 166 -SAQRTALPESFLDDEVIATAVSRLHAESDPDHAGFGGAPKFPPSMVLEFLLRHQERQSA 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            G    A E   M   T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 225 PGSGHTALE---MAEATCEAMARGGIYDQLAGGFARYSVDSAWVVPHFEKMLYDNALLLR 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY       +      + R+   +L RD+  P G   ++ DAD   TEG     EG  YV
Sbjct: 282 VYAHLARRRESPLAERVARETAAFLLRDLRTPEGGFAASLDAD---TEGV----EGLTYV 334

Query: 320 WTSKEVEDILGE-----HAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           WT +++ ++LGE      A LF   +   + + T    L R  DP +  + + V      
Sbjct: 335 WTPEQLAEVLGEADGAWAAELFEVTESGTFEQGTSTLQLKR--DPDDPARWRRV------ 386

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
                                 R  L++ RS+RP+P  DDKV+ SWNG+ I++   AS  
Sbjct: 387 ----------------------RDALYEARSRRPQPGKDDKVVTSWNGMAITALVEASTA 424

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPG 489
           L                   E++  AE AA   + RHL D+   RL+ S R+G    A G
Sbjct: 425 LGE----------------PEWLAAAEQAAKLLVERHLVDQ---RLRRSSRDGVVGAAAG 465

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLL 548
            L+DY  L  GLL L++     +WL  A  L +T  E F D +  G YF+T  +   ++ 
Sbjct: 466 VLEDYGCLADGLLSLHQATGEPRWLDVACSLLDTALEQFADSDNPGAYFDTAADSEELVR 525

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R  +  D A PSG S     L+  +++  GS +  YR  AE +L+      +  A     
Sbjct: 526 RPSDPTDNASPSGASSLTSALLTASALAGGSAAQRYRHAAEQALSRAGLLAERAARFAGH 585

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
               A+ L+      V + G +   D   +L AA         V+  +P  T        
Sbjct: 586 WLSTAEALA-HGPLQVAVAGPEDDGDRAALLEAAWRHSPGGAVVLAGEPEAT-------- 636

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
                  +A          A VC+ + C  PVT
Sbjct: 637 ---GVPLLADRPLVGGSAAAYVCRGYLCDRPVT 666


>gi|134097521|ref|YP_001103182.1| hypothetical protein SACE_0923 [Saccharopolyspora erythraea NRRL
           2338]
 gi|133910144|emb|CAM00257.1| protein of unknown function DUF255 [Saccharopolyspora erythraea
           NRRL 2338]
          Length = 681

 Score =  325 bits (834), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 229/693 (33%), Positives = 321/693 (46%), Gaps = 89/693 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A ++N+ FV+IKVDREERPDVD VYM   QA+ G GGWP++ 
Sbjct: 49  ACHWCHVMAHESFEDEATAAVMNENFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTC 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P   GTY+P    +G P F+ +L  V  AW ++   + Q+    +EQL     
Sbjct: 109 FLTPDAEPFHCGTYYPSAPLHGMPSFRQLLDAVASAWRERGGEVRQAATRVVEQL----- 163

Query: 141 ASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            SA    LP+  L    +     +L    D    GFG APKFP  + ++ +L H ++   
Sbjct: 164 -SAQRTALPESFLDDEVIATAVSRLHAESDPDHAGFGGAPKFPPSMVLEFLLRHQERQSA 222

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            G    A E   M   T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 223 PGSGHTALE---MAEATCEAMARGGIYDQLAGGFARYSVDSAWVVPHFEKMLYDNALLLR 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY       +      + R+   +L RD+  P G   ++ DAD   TEG     EG  YV
Sbjct: 280 VYAHLARRRESPLAERVARETAAFLLRDLRTPEGGFAASLDAD---TEGV----EGLTYV 332

Query: 320 WTSKEVEDILGE-----HAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           WT +++ ++LGE      A LF   +   + + T    L R  DP +  + + V      
Sbjct: 333 WTPEQLAEVLGEADGAWAAELFEVTESGTFEQGTSTLQLKR--DPDDPARWRRV------ 384

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
                                 R  L++ RS+RP+P  DDKV+ SWNG+ I++   AS  
Sbjct: 385 ----------------------RDALYEARSRRPQPGKDDKVVTSWNGMAITALVEASTA 422

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPG 489
           L                   E++  AE AA   + RHL D+   RL+ S R+G    A G
Sbjct: 423 LGE----------------PEWLAAAEQAAKLLVERHLVDQ---RLRRSSRDGVVGAAAG 463

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLL 548
            L+DY  L  GLL L++     +WL  A  L +T  E F D +  G YF+T  +   ++ 
Sbjct: 464 VLEDYGCLADGLLSLHQATGEPRWLDVACSLLDTALEQFADSDNPGAYFDTAADSEELVR 523

Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
           R  +  D A PSG S     L+  +++  GS +  YR  AE +L+      +  A     
Sbjct: 524 RPSDPTDNASPSGASSLTSALLTASALAGGSAAQRYRHAAEQALSRAGLLAERAARFAGH 583

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
               A+ L+      V + G +   D   +L AA         V+  +P  T        
Sbjct: 584 WLSTAEALA-HGPLQVAVAGPEDDGDRAALLEAAWRHSPGGAVVLAGEPEAT-------- 634

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
                  +A          A VC+ + C  PVT
Sbjct: 635 ---GVPLLADRPLVGGSAAAYVCRGYLCDRPVT 664


>gi|383785408|ref|YP_005469978.1| hypothetical protein LFE_2175 [Leptospirillum ferrooxidans C2-3]
 gi|383084321|dbj|BAM07848.1| hypothetical protein LFE_2175 [Leptospirillum ferrooxidans C2-3]
          Length = 694

 Score =  325 bits (834), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 235/700 (33%), Positives = 350/700 (50%), Gaps = 77/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPL 78
           + CHWCHVM  ESFED   A ++N+ F++IKVDREERPD+D +Y M +       GGWPL
Sbjct: 48  SACHWCHVMAHESFEDPETASVMNESFINIKVDREERPDLDHIYQMAHTVITKRNGGWPL 107

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           ++FL+PD  P  GGTYFP   ++G PGF ++L +++  +D+ ++ L+ +     E LS +
Sbjct: 108 TMFLTPDQVPFAGGTYFPKSPRFGLPGFISVLHQIRQFYDENKEALSGTKHPVTELLSRS 167

Query: 139 LSASASSNKLPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
            +    +N  P  L   P+  LR   + L   +DS  GGF  APKFP P++I      + 
Sbjct: 168 DALGEGANPDPSSLTIEPEARLR---DSLRARFDSEDGGFTPAPKFPHPMDI------AA 218

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
            L +  + GE  +   M   TL+ MA GGI+D +GGGF RYSVD  W +PHFEKMLYD  
Sbjct: 219 CLREYEREGEVFD-LWMARHTLERMASGGIYDQIGGGFSRYSVDGTWTIPHFEKMLYDNA 277

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
            L  VY +   L++D   + +C  I+ +L R+M    G   +A DADS   EG    +EG
Sbjct: 278 LLLCVYAEGAHLSEDAGLASVCDGIVTWLFREMRDSSGAFHAALDADS---EG----EEG 330

Query: 316 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKG--KNVLIELNDS 372
            +YVWT +EV  IL  E   +    Y L  T N +        +EF    KN+       
Sbjct: 331 KYYVWTREEVSRILTPEEYQVVSLTYGLSETPNFE--------HEFWHFRKNLPF----- 377

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
           S  AS+L +    + ++L   + KL  VRS+R  P  DDKV+  WNGL+     RA +IL
Sbjct: 378 SEVASRLSLTEGPFHSLLSSAKEKLLSVRSQRIPPGKDDKVLTGWNGLLARGLIRAGRIL 437

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
                           DR E++   +     +R  L+      L      G S+   +LD
Sbjct: 438 ----------------DRPEWIMEGQKILDILRETLW--TGDHLLAVRTKGESRLNAYLD 479

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYA+++  L++          L WA+ L +     F D   GG+  T+ +   ++ R K 
Sbjct: 480 DYAYVLDALVESLATVYRPSDLAWALSLADVLVSKFWDDAAGGFHFTSHDHEQLIHRPKS 539

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
            HD A PSG++V+   L RLA +    + D+  +    +LA++   + +  M    M  A
Sbjct: 540 GHDAAIPSGSAVTCRALNRLAHL--SGRMDWL-EKVGRTLALYSKPMLEQPMGYASMIMA 596

Query: 613 -ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM-DFWEEHN 670
             + LS P    +VLV  KSS+++     +A A   L+  +I +   D+  + DF ++  
Sbjct: 597 LGEYLSPPV---IVLVRGKSSLEWS---LSARAKSPLDTLIIDLGERDSLSLPDFLQKPP 650

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +   S         +  A VC    C  PVTD   L++LL
Sbjct: 651 ATGVSF--------ETQADVCGGGVCLSPVTD---LKDLL 679


>gi|124002212|ref|ZP_01687066.1| thymidylate kinase [Microscilla marina ATCC 23134]
 gi|123992678|gb|EAY32023.1| thymidylate kinase [Microscilla marina ATCC 23134]
          Length = 681

 Score =  325 bits (834), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 221/690 (32%), Positives = 333/690 (48%), Gaps = 85/690 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFED+ VA ++N +F+ IKVDREERPDVD +YM  VQA+   GGWPL+
Sbjct: 55  SACHWCHVMERESFEDDEVAAIMNRYFICIKVDREERPDVDAIYMDAVQAMGQRGGWPLN 114

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
             L+P+ KP    TY P E       +  +L+ V + +  KRD L QS     E   EA+
Sbjct: 115 ALLTPEAKPFYALTYLPKE------SWVQLLQNVAEVYQTKRDELEQSA----EAYREAI 164

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRF-------GGFGSAPKFPRPVEIQMMLY 192
           + S +      +L  N +R   E L K + S +       GG   APKFP P   Q +L+
Sbjct: 165 ATSEAKKY---DLKPNDIRYAREDLDKMFQSVYNDVDHTRGGTNRAPKFPMPSIWQFLLH 221

Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
           +        +  +  E  + V  TL  MAKGGI+D +GGGF RYSVD  W  PHFEKMLY
Sbjct: 222 YY-------QITKKEEALRTVEVTLNEMAKGGIYDQIGGGFARYSVDADWFAPHFEKMLY 274

Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           D GQL ++Y DA+++T++  Y  +    +D++ R++    G  FSA DADS   EG    
Sbjct: 275 DNGQLLSLYADAYNVTQNPLYQQVVMQTVDFVARELTSEEGGFFSALDADS---EGV--- 328

Query: 313 KEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
            EG FYVW     ++++G E A +  ++Y +    N            ++  N+L     
Sbjct: 329 -EGKFYVWEKTAFDEVIGVEDAAIAADYYQVTSQAN------------WEEGNILHRSIG 375

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
             A A K  + +E     + +   +L   RSKR RP LDDK++ SWNGL++     A ++
Sbjct: 376 DLAFAEKHQIDVESLKQKVTQWNERLLTARSKRIRPGLDDKILTSWNGLMLKGLVDAYRV 435

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
                            D  + + +A + A FI   L  E  ++L HS++NG +    +L
Sbjct: 436 F----------------DSPKLLNLALANAQFIAEKLTTE-NYQLYHSYKNGKASINAYL 478

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           +DYA ++   + LY+     +WL  A  L +     F D+E G +F T      ++ R K
Sbjct: 479 EDYAAVVDAYIALYQATFDEQWLTKAKSLTDYALANFYDKEEGLFFFTDVNAEKLIARKK 538

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           E  D   P+ NS+   NL  L   +   +SD Y+Q A   L   +  + +   +      
Sbjct: 539 ELFDNVIPASNSMMAKNLYWLG--LYYEQSD-YQQKASQMLGQMQKIIVENPESAANWAT 595

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HIDPADTEEMDFWEEHN 670
                + P+ + V +VG ++    +   A+    Y  NK +   + P D+  +   +   
Sbjct: 596 LYTYFAQPTAE-VAIVGEQA----QEYRASLDKYYYPNKILAGTLQPQDS--LGLLQNRG 648

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPV 700
           + N           +    VC N +C  PV
Sbjct: 649 TING----------QTTVYVCYNKTCQLPV 668


>gi|225418720|ref|ZP_03761909.1| hypothetical protein CLOSTASPAR_05944, partial [Clostridium
           asparagiforme DSM 15981]
 gi|225041746|gb|EEG51992.1| hypothetical protein CLOSTASPAR_05944 [Clostridium asparagiforme
           DSM 15981]
          Length = 506

 Score =  325 bits (832), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 197/519 (37%), Positives = 264/519 (50%), Gaps = 64/519 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVM  ESFED  VAK LN  +V +KVDREERP++D VYM+  QA+ G GGWPL+
Sbjct: 48  STCHWCHVMAHESFEDREVAKRLNADYVPVKVDREERPEIDMVYMSVCQAMTGQGGWPLT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           + ++PD KP   GTY P   +    G   +L  V + W   R  L       +  L  A 
Sbjct: 108 IIMTPDKKPFFAGTYLPKTSRRNMTGLLELLSAVSEIWKSDRKRLLNMSDQILAVLRRAP 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            AS+ ++      P+   R   E+L  ++D  +GGFG APKFP P  +  ++ +      
Sbjct: 168 DASSPAD------PETLARRGYEELRAAFDRTYGGFGRAPKFPAPHNLLFLMRY------ 215

Query: 200 TGKSGEASEGQKMVLF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
                 A E Q + +   TL  MA+GGIHDH+GGGF RYS D+ W VPHFEKMLYD   L
Sbjct: 216 ---RAWADEPQALAMAEKTLSSMARGGIHDHLGGGFSRYSTDQMWLVPHFEKMLYDNALL 272

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
           A  YL+ + LT + FY    R ILDY+RR++ GP G  +  +DADS          EG +
Sbjct: 273 ALAYLEGYRLTGNRFYQRTARQILDYVRRELTGPEGGFYCGQDADSQGV-------EGKY 325

Query: 318 YVWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           YV++ +E+  +LG       F   Y +   GN            F+G N+   +++    
Sbjct: 326 YVFSEEEIGRVLGSRKDQEKFCRRYGITKEGN------------FEGANIPNLIHNPDYE 373

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
              L M           CRR L++ R KR   H DDK++ SWN L+I + ARA  +L   
Sbjct: 374 QRDLEMD--------ALCRR-LYEYRLKRLPLHRDDKILASWNALMIIACARAGFLL--- 421

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        D   Y+E+A  A  F+ + L+DE   RL   +R G S  PG LDDYA
Sbjct: 422 -------------DDPGYLEMAGRAQMFVEQKLFDENG-RLLVRYRQGESAFPGNLDDYA 467

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
           F    LL LYE      +L  A+       ELF D E G
Sbjct: 468 FYCLALLTLYEVTLDASYLELAVNRAEQMVELFWDEERG 506


>gi|30248134|ref|NP_840204.1| hypothetical protein NE0103 [Nitrosomonas europaea ATCC 19718]
 gi|30180019|emb|CAD84014.1| putative similar to unknown proteins [Nitrosomonas europaea ATCC
           19718]
          Length = 689

 Score =  325 bits (832), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 222/685 (32%), Positives = 340/685 (49%), Gaps = 66/685 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
           + CHWCHVM  ESFED  VA  +N+ FV+IKVDREERPD+D++Y +    L +  GGWPL
Sbjct: 48  SACHWCHVMAHESFEDAQVATAMNEHFVNIKVDREERPDIDQIYQSAHYTLNHRSGGWPL 107

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           ++FL+P+ KP  GGTYFP E +Y  PGF  +L KV + +  ++  + +  A  ++ L+++
Sbjct: 108 TMFLTPEQKPFFGGTYFPKEARYSMPGFLELLPKVAELYRTRKTDIEKQNAVLLKLLAQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L A  +       L +  +    EQL++ +D   GGFG APKF  P E+Q  L       
Sbjct: 168 LPAPDTR---ASALSRQPIDRAWEQLNRLFDETDGGFGDAPKFLHPAELQFCLRRYVTDN 224

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           DT           +V  TL+ MA+GG++D +GGGF RYS D  W +PHFEKMLYD   + 
Sbjct: 225 DT-------RALHVVTHTLEKMAQGGLYDQLGGGFCRYSTDHSWQIPHFEKMLYDNALML 277

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDM---IGPGGEIFSAEDADSAETEGATRKKEG 315
            +Y + + +T +  +  +  +   ++ R+M   I   G  FS+ DADS         +EG
Sbjct: 278 PLYAETWLVTGNPLFKQVVEETAAWVIREMQSGIDGEGGYFSSLDADS-------EHEEG 330

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            FYVW  + V  IL          YY       D S   + H+        IE       
Sbjct: 331 KFYVWDRQAVSAILTPEEYRVTAAYY-----GLDRSPNFENHHWHLAVTESIE-----TV 380

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A++  +  E    ++   RRKL + R +R RP  D+K++ SWN L+I    RA +I    
Sbjct: 381 AARHQISQEAVQQLIDSARRKLLNEREQRIRPGRDEKILTSWNALMIKGMTRAGQIF--- 437

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
                        +R+E++  A  A  FIR  L+  Q  RL  +F++  +    +LDD+A
Sbjct: 438 -------------EREEWISSAVRALDFIRSRLW--QNDRLLATFKDDKAHLNAYLDDHA 482

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
           FL+  LL L +       L +AI L +     F D+  GG+F T+ +  +++ R K  HD
Sbjct: 483 FLLDSLLTLLQADFRQTDLDFAITLADVLLTRFEDKTSGGFFFTSHDHETLIHRPKTGHD 542

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GA P+GN ++   L RL  ++   +   Y + AE +L VF + L   A +   +    + 
Sbjct: 543 GAIPAGNGIAATTLQRLGHLLNEQR---YLEAAERTLNVFSSGLSLHASSHCSLLITLEE 599

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
              P+ K V+L G++  +    +   A   Y L+K VI + P +  E+           S
Sbjct: 600 FLEPT-KTVILHGNRPEL---QIWLKALLPYSLDKIVIAL-PLELSELP---------DS 645

Query: 676 MARNNFSADKVVALVCQNFSCSPPV 700
           +   +    K+ A VC+   C P +
Sbjct: 646 LKMRSTPDGKISARVCEGRRCLPEI 670


>gi|344203206|ref|YP_004788349.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343955128|gb|AEM70927.1| hypothetical protein Murru_1888 [Muricauda ruestringensis DSM
           13258]
          Length = 699

 Score =  325 bits (832), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 210/686 (30%), Positives = 329/686 (47%), Gaps = 78/686 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME E FED  VA+++N  FV+IK+DREERPDVD++YM  +Q + G GGWPL++ 
Sbjct: 77  CHWCHVMEKECFEDAEVAEVMNKNFVNIKIDREERPDVDQIYMDAIQMISGQGGWPLNIV 136

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P  G TY P ++      +   L ++ + + K +  + Q  A     L+  L A
Sbjct: 137 ALPDGRPFWGATYVPKDN------WIKSLEQLAELYKKDKPRVTQYAA----DLANGLHA 186

Query: 142 S--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                ++K  D    + L +  +  ++ +D+  GG   APKF  P     +L+++  +  
Sbjct: 187 INLVENDKDSDLYSLDQLDVAIQNWTQYFDTFLGGHKRAPKFMMPNNWDFLLHYATAV-- 244

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                +  E  + V  TL  MA GG++DHVGGGF RY+VD +WHVPHFEKMLYD GQL +
Sbjct: 245 -----DKPEIMEFVDTTLTRMAYGGVYDHVGGGFSRYAVDTKWHVPHFEKMLYDNGQLTS 299

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  A++ TK+  Y  +  + +++++ + +   G  +S+ DADS +        EGA+YV
Sbjct: 300 LYAKAYAATKNELYKNVVEETINFVQEEFLDRSGGFYSSLDADSLDENAELV--EGAYYV 357

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT KE+  +LG+   LF+E++ +   G  +           +   VLI        A K 
Sbjct: 358 WTKKELSGLLGDDFELFQEYFNINSYGYWE-----------EENYVLIRDKSDEEVADKF 406

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            + + +    + E   KL   R KRP+P LDDK++ SWNGL++     A + L  E    
Sbjct: 407 NITIPELKTTITESLAKLKGEREKRPKPRLDDKILTSWNGLMLKGLVDAYRYLGEE---- 462

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       +Y+ +A   A FI R +  +    L  + + G S   GFL+DYA +I 
Sbjct: 463 ------------DYLNLALKNAEFIEREMI-KSDGSLYRNHKEGKSTINGFLEDYATVID 509

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
               LYE     KWL  A  L     + F D   G +F T+ ED S++ R  E  D    
Sbjct: 510 AYFSLYEATFDEKWLDLAKNLLEYSKKHFWDETSGMFFYTSDEDQSLIRRTIEVDDNVIS 569

Query: 560 SGNSVSVINLVRLASIVA----GSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           S NS+  INL +   +      G+ S+   +N +     F+ R +  A  + L+     +
Sbjct: 570 SSNSIMAINLYKFHKLYPEESYGNMSEQMLKNVQKD---FDRRAQGFANWLHLV-----L 621

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
                   + ++G     D++N+       Y  N  ++                   N  
Sbjct: 622 FQNQDFYEIAILGE----DYKNLGQQISKEYVPNSILVG-------------SQKEGNLE 664

Query: 676 MARNNFSADKVVALVCQNFSCSPPVT 701
           + +N  + +K +  VC   +C  PVT
Sbjct: 665 LLKNRGNPNKTLVYVCIEGACKLPVT 690


>gi|21223348|ref|NP_629127.1| hypothetical protein SCO4975 [Streptomyces coelicolor A3(2)]
 gi|20520976|emb|CAD30960.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
          Length = 686

 Score =  325 bits (832), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 234/701 (33%), Positives = 337/701 (48%), Gaps = 72/701 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED   A+ LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 54  SACHWCHVMAHESFEDGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 113

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPPE ++G P F+ +L+ V+ AW ++RD + +     +  L+   
Sbjct: 114 VFLTPDAEPFYFGTYFPPEPRHGMPSFRQVLQGVQQAWAERRDEVDEVAGKIVRDLAGRE 173

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +S   +     ++L Q  L      L++ YD R GGFG APKFP  + I+ +L H  +  
Sbjct: 174 ISYGDAEAPGEEQLGQALL-----GLTREYDERRGGFGGAPKFPPSMVIEFLLRHHAR-- 226

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 227 -TGAEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 281

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS   +G  +  EGA Y
Sbjct: 282 RVYAHLWRATGSDLARRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAHY 339

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
           VWT  ++ ++LG E A L  +++ +   G  +            G +VL +   +S   A
Sbjct: 340 VWTPAQLTEVLGAEDAELAAQYFGVTQEGTFE-----------HGASVLQLPQQESVFDA 388

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           ++           +   R +L   R  RP P  DDKV+ +WNGL +++ A          
Sbjct: 389 AR-----------IASVRERLLAARDGRPAPGRDDKVVAAWNGLAVAALAET-------- 429

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
             A F  P              +A   +R HL DEQ  RL  + ++G + A  G L+DYA
Sbjct: 430 -GAYFERP------DLVEAAVAAADLLVRLHL-DEQV-RLTRTSKDGRAGANAGVLEDYA 480

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +     F D E G  ++T  +   ++ R ++  D
Sbjct: 481 DVAEGFLALASVTGEGVWLDFAGFLLDHVLTRFTD-ESGSLYDTAADAERLIRRPQDPTD 539

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A PSG S +   L+   S  A + S  +R  AE +L V +     +   +     AA+ 
Sbjct: 540 NATPSGWSAAAGALL---SYAAHTGSAPHRAAAERALGVVKALGPRVPRFIGWGLAAAEA 596

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
           L    R+  V+              A  A+  L++T + +  A    + F  E +     
Sbjct: 597 LLDGPREVAVVAPDP----------ADPAARGLHRTAL-LGTAPGAVVAFGTEGSDEFPL 645

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
           +A          A VC+NF+C  P TDP  L   L   P+ 
Sbjct: 646 LADRPLVGGAPAAYVCRNFTCDAPTTDPDRLRTALGVAPTG 686


>gi|113474681|ref|YP_720742.1| hypothetical protein Tery_0863 [Trichodesmium erythraeum IMS101]
 gi|110165729|gb|ABG50269.1| protein of unknown function DUF255 [Trichodesmium erythraeum
           IMS101]
          Length = 693

 Score =  325 bits (832), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 218/632 (34%), Positives = 324/632 (51%), Gaps = 93/632 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F DE +A+ LN+ F+ IKVDREERPDVD +YM  +Q L G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDEKIAQYLNEKFLPIKVDREERPDVDSIYMQALQMLTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P +GGTYFP E +YGRPGF  +L+K++  +D +++ L       +E L ++
Sbjct: 108 IFLTPDDLIPFVGGTYFPIEPRYGRPGFLEVLQKIRSFYDLEKNKLDTLKVEMLEGLRKS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +    + + L +E+ Q  L +  + +   Y        S   FP     Q  L   KKL 
Sbjct: 168 VLLPEAED-LKEEILQQGLEVITKIIGDRY--------SQQSFPMIPYAQAAL-QGKKLN 217

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ-- 256
              ++       K+ L     +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ  
Sbjct: 218 FKSQNN----SNKVCLERGLNLALGGIYDHVAGGFHRYTVDPNWTVPHFEKMLYDNGQIV 273

Query: 257 --LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
             LAN++   +   K  F   I   + ++L+R+M  P G  ++A+DADS  T      +E
Sbjct: 274 EYLANLWSAGYH--KPAFKRGIIGTV-NWLKREMTAPTGFFYAAQDADSFTTPDEVEPEE 330

Query: 315 GAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFY+W+ KE+E++L +  +    + ++++P GN            F+GK VL       
Sbjct: 331 GAFYIWSYKELENLLTKEELSELSKQFFIEPNGN------------FEGKIVL-----QR 373

Query: 374 ASASKLGMPLEKYLNILGECRRKL--FDVRSKRPRPH----------------LDDKVIV 415
             A +L   +E  L+ L + R  +  F++ +  P  +                 D K+IV
Sbjct: 374 KQAEELSKTVENSLSKLFKLRYGVQPFNIETFPPATNNKEAKNNNWPGKIPAVTDTKMIV 433

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTH 474
           +WN L+IS  AR + +  S                 EY+E+A +AA F I     D + H
Sbjct: 434 AWNSLMISGLARTATVFNS----------------LEYLELAMNAAHFIITNQQIDGRFH 477

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE----------FGSGTK-WLVWAIELQNT 523
           RL +    G        +DYA  I  LLDL +            + T  WL  AI+LQ+ 
Sbjct: 478 RLNYE---GKPAVTAQSEDYALFIKALLDLQQASISLETLSKLNTNTNFWLETAIKLQDE 534

Query: 524 QDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD 582
            DE    +E  GY+NT+ E    ++LR +   D A P+ N +++ NLVRL+ +   ++  
Sbjct: 535 FDEFLWSQETAGYYNTSYEVTGELILRERNYIDNATPAANGIAIANLVRLSLL---TEEL 591

Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           YY   AE +L  F + +K    A P +  A D
Sbjct: 592 YYLDRAESALTAFSSIMKKSPQACPSLFVALD 623


>gi|298293757|ref|YP_003695696.1| hypothetical protein Snov_3807 [Starkeya novella DSM 506]
 gi|296930268|gb|ADH91077.1| protein of unknown function DUF255 [Starkeya novella DSM 506]
          Length = 672

 Score =  324 bits (831), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 238/690 (34%), Positives = 340/690 (49%), Gaps = 89/690 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A ++N+ FV+IKVDREERP+VD++YM+ +Q L   GGWP+++
Sbjct: 49  ACHWCHVMAHESFEDEATAAVMNELFVNIKVDREERPEVDQIYMSALQQLGVQGGWPMTM 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL  +  P  GGTYFP E +YG+P F  +L+ + +A+      +A +    + +L +  +
Sbjct: 109 FLDAEGAPFWGGTYFPKEARYGQPAFTDVLKTMANAYGSGDPRIASNREALLARLRQKAA 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                   P+EL   A R+         DS+ GG   +PKFP    ++++    +  E T
Sbjct: 169 PVGKVTIGPNELDDVAGRILG-----IMDSQHGGLQGSPKFPNTPFLELLW---RAWERT 220

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+       +   L  L  M++GGI+DHVGGG+ RYSVDERW VPHFEKMLYD  Q+  +
Sbjct: 221 GR----QRLRDAALHALDGMSEGGIYDHVGGGYARYSVDERWLVPHFEKMLYDNAQILEL 276

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
              A+S T    +     + + +L+R+M+   G   ++ DADS   EG     EG +YVW
Sbjct: 277 LGLAYSETLADLFRARAEETVGWLQREMLTTSGAFAASLDADS---EG----HEGRYYVW 329

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           T K+V D LG E A  F  HY + P GN +   +S P       N L E+  S A   +L
Sbjct: 330 TLKQVLDALGAEDAEFFARHYDIAPFGNWE--GVSIP-------NRLKEMERSPADEMRL 380

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            M            R KL  VR  R  P  DDKV+  WNGL+I++ A  +          
Sbjct: 381 AM-----------LRDKLLKVRETRVPPGRDDKVLADWNGLMIAALANVA---------- 419

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
               P  G  R E++E+A  A  FI   +  E   RL HS+R G    PG   DYA +I 
Sbjct: 420 ----PRFG--RPEWVELAARAFRFIAESMAREG--RLGHSWREGRLVFPGLSSDYAAMIG 471

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             L L++      +   A+  Q  Q E     E GGY+ T  +   ++LR     D A  
Sbjct: 472 AALALHQATGEASYFDHAVAWQ-AQLEAHHAAEDGGYYLTADDAEGLILRPDAAADDAVT 530

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD--MAMAVPLMCCAADMLS 617
           + N++   NLVRLA++   +  D YR+ A+        RL D  +  A P +   A +L+
Sbjct: 531 NPNALIARNLVRLAAV---TGDDGYRERAD--------RLFDGLLPRAAPSLYSHAGLLN 579

Query: 618 VPSRK----HVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI-DPADTEEMDFWEEHNSN 672
               +     +V+VG     D   +L AA     ++  +  + DPA   E         N
Sbjct: 580 ALDTRLRAPEIVVVGSGEVAD--ALLDAARRLPRVDLMIERVSDPASLPE---------N 628

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTD 702
           + + A+   S D   A VC    CS PVTD
Sbjct: 629 HPARAKAE-SIDGAAAFVCAGSVCSLPVTD 657


>gi|206603590|gb|EDZ40070.1| Protein of unknown function [Leptospirillum sp. Group II '5-way
           CG']
          Length = 689

 Score =  324 bits (831), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 220/697 (31%), Positives = 338/697 (48%), Gaps = 64/697 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLSV 80
           CHWCHVM  ESFE   +AK++N++FV+IKVDREERPD+D++Y M +       GGWPL++
Sbjct: 50  CHWCHVMAHESFERPDIAKVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLTM 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P   P  GGTYFP + ++G PGF  +L +++D +   R+ L +     ++ L +   
Sbjct: 110 FLTPSQVPFAGGTYFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTNP 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            + S+    D  P  AL      L   +D  FGGFG APKFP  +++  +    ++    
Sbjct: 170 VADSTGFELDLSPSEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFHRK 223

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G S  A     M   TL  M +GGI DHVGGGF RYSVDERW +PHFEKMLYD   L   
Sbjct: 224 GDSTAA----HMATLTLSAMKRGGIWDHVGGGFARYSVDERWLIPHFEKMLYDNALLLEA 279

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
                S++++  YS    +++ +L R+M    G  +S+ DADS   EG    +EG FYV+
Sbjct: 280 LALGASVSRNPVYSRTAEELVGWLFREMRSEHGVYYSSLDADS---EG----EEGRFYVF 332

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
            ++EV  IL  E   +  +HY L           S+P N       L E       + + 
Sbjct: 333 QAEEVRSILSDEEYRVVSKHYGL-----------SEPPNFESHAWHLYEARSIGELSKEF 381

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +P     + +   R+KLF  RS R RP LDDK++ SWN L+              A++ 
Sbjct: 382 HLPESDIESRIDSARQKLFTYRSLRVRPGLDDKILASWNALM--------------AKAL 427

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
           +F+  ++G  ++E+M        ++ R+++      L   +       P +LDDYAFL+ 
Sbjct: 428 LFSGRILG--KQEWMTAGRKTIDYMHRNMWKNGV--LMAVYSKKEPFLPAYLDDYAFLLL 483

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            +L+        + L +A  + +     F D E GG++ T     +++ R K  HDGA P
Sbjct: 484 AVLESIRIDFRPEDLSFATAIADVLLTEFYDPESGGFYFTGKNHEALIHRPKNGHDGALP 543

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SGN+ +V  L+ L ++        Y   A+ +L ++  ++K+       M  A +  S  
Sbjct: 544 SGNAAAVQGLLWLGTLTGHLP---YTSAADQTLRLYFAQMKEQPAGYTTMISALETYS-- 598

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
             + V+L+    + D++N +       D    VI +  A    +   E          R 
Sbjct: 599 DSQPVILLAGPQAEDWKNTI---RQGLDPEAFVIDLTSAVRNSLPLPEG--------MRK 647

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
           +F  +K    VC+   C P      SL+  L   P S
Sbjct: 648 HFPENKTTGWVCRGTMCLPSADSLESLQEQLRLWPLS 684


>gi|429201724|ref|ZP_19193171.1| hypothetical protein STRIP9103_06317 [Streptomyces ipomoeae 91-03]
 gi|428662694|gb|EKX62103.1| hypothetical protein STRIP9103_06317 [Streptomyces ipomoeae 91-03]
          Length = 687

 Score =  324 bits (831), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 225/692 (32%), Positives = 338/692 (48%), Gaps = 82/692 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED   A  LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 52  SSCHWCHVMAHESFEDRETADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 111

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPP  ++G P F+ +L  V+ AW  +RD + +     +  L+   
Sbjct: 112 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVRAAWADRRDEVTEVAGKIVRDLAGRE 171

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L  +A      ++L +  L      L++ YD+  GGFG APKFP  + I+ +L H  +  
Sbjct: 172 LQFAAVEVPGEEDLARALL-----GLTREYDAVHGGFGGAPKFPPSMVIEFLLRHYAR-- 224

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 225 -TGSEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 279

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS   +G  +  EGA+Y
Sbjct: 280 RVYAHLWRATGSELARRVALETADFMVRELGTGEGGFASALDADS--DDGTGKHVEGAYY 337

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
           VWT  ++ ++LG+  A L  + + +   G  +            G++VL +  ++    A
Sbjct: 338 VWTPAQLREVLGDQDADLAAQFFGVTEEGTFE-----------HGQSVLRLPQHEGVFDA 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
            K           +   + +L   R++RP P  DDKV+ +WNGL +++ A          
Sbjct: 387 EK-----------IASIKDRLNRARAQRPAPGRDDKVVAAWNGLAVAALAETGAYF---- 431

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
                       DR + +E A +AA  + R   DE+  +L  + ++G   A  G L+DYA
Sbjct: 432 ------------DRPDLVEAAIAAADLLVRLHLDEKA-QLARTSKDGRVGANAGVLEDYA 478

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +     F+D E G  ++T  +   ++ R ++  D
Sbjct: 479 DVAEGFLALASVTGEGVWLEFAGFLLDHVLVRFVDEESGALYDTAADAEKLIRRPQDPTD 538

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A PSG S +   L+   S  A + S+ +R  AE +L +    +K +   VP      + 
Sbjct: 539 NATPSGWSAAAGALL---SYTAHTGSEPHRAAAERALGI----VKALGPRVPRFIGWGLA 591

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A  +L  P  + V +VG +       +  AA         V+ +  A+++E+       
Sbjct: 592 TAEALLDGP--REVAVVGPEGHPGTRALHRAALLG-TAPGAVVAVGTAESDELPL----- 643

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                +A       +  A VC+NF+C  P TD
Sbjct: 644 -----LADRPLVGGEPAAYVCRNFTCDAPTTD 670


>gi|357055989|ref|ZP_09117045.1| hypothetical protein HMPREF9467_04017 [Clostridium clostridioforme
           2_1_49FAA]
 gi|355381481|gb|EHG28604.1| hypothetical protein HMPREF9467_04017 [Clostridium clostridioforme
           2_1_49FAA]
          Length = 646

 Score =  324 bits (831), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 205/561 (36%), Positives = 286/561 (50%), Gaps = 51/561 (9%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           ME ESFE+E +A++LN  +V +KVDREERPDVD VYM+  QA+ G GGWPL++ ++PD +
Sbjct: 1   MERESFENEVIAEILNREYVCVKVDREERPDVDSVYMSVCQAMNGQGGWPLTIIMTPDCR 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-MLAQSGAFAIEQLSEALSASASSN 146
           P   GTYFPP  +YGRPG + +L    D W  K+D +L Q+G     Q+ + L +   + 
Sbjct: 61  PFFSGTYFPPRARYGRPGLEELLTAAADQWKAKKDKLLEQAG-----QIEKYLRSQEQTG 115

Query: 147 KLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
           +  + EL   A+     Q + S+D + GGFGSAPKFP P  +  ++       + G   +
Sbjct: 116 RWAEPELA--AVHQAFRQFADSFDRKNGGFGSAPKFPTPHSLIFLM-------EYGARQK 166

Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
             E   M   TL  M +GGI DH+GGGF RYS D +W VPHFEKMLYD   L   Y+ A+
Sbjct: 167 RPEALAMAETTLVQMYRGGIFDHIGGGFSRYSTDGQWLVPHFEKMLYDNSLLVMAYIKAY 226

Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
             T    Y  +   +L+Y+RR++    G  +  +DADS          EG +YV+T +E+
Sbjct: 227 GRTGRKMYGCVAEKVLEYVRRELTDSQGGFYCGQDADSDGV-------EGKYYVFTQEEI 279

Query: 326 EDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
             +LGE A   F   Y +   GN      S P N  + +N      +        G    
Sbjct: 280 RAVLGEKAGRDFCRQYGITRHGN--FEGRSIP-NLLENENYEEICEEPWGGDDHGGNVCH 336

Query: 385 KYLNILG-----ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
              N  G     +C +KL+  R  R R H DDK++VSWNG +I + A A  +L       
Sbjct: 337 GVRNSFGGRKNEDC-KKLYQYRLDRARLHKDDKILVSWNGWMICACAMAGAVLGE----- 390

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                      K Y+++A  A +FI   L   +  RL    R+G +   G LDDYA    
Sbjct: 391 -----------KRYVDMAVRAEAFINSRLV--KNGRLMVRCRDGDAAGEGKLDDYACYSL 437

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            LL+LY       +L  A        E F DRE GG++    +   +++R KE +DGA P
Sbjct: 438 ALLELYRVTFQADYLKRAAAWAEIMTEQFFDRERGGFYLYAEDGEQLIVRTKETYDGAMP 497

Query: 560 SGNSVSVINLVRLASIVAGSK 580
           SGNSV+   L RL  I    K
Sbjct: 498 SGNSVAAQVLHRLTQITGEVK 518


>gi|374987022|ref|YP_004962517.1| hypothetical protein SBI_04265 [Streptomyces bingchenggensis BCW-1]
 gi|297157674|gb|ADI07386.1| hypothetical protein SBI_04265 [Streptomyces bingchenggensis BCW-1]
          Length = 677

 Score =  324 bits (830), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 228/703 (32%), Positives = 333/703 (47%), Gaps = 86/703 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE  A  LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMARESFEDEATADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P   GTYFPP  ++G P F+ +L  V+ AW  +RD +       +  L+E  
Sbjct: 108 VFLTPEAEPFYFGTYFPPAPRHGMPSFQQVLEGVQAAWADRRDEVKDVAERIVRDLAERG 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            AS +        P++ L      L++ +D+  GGFG APKFP  + ++ +L H  +   
Sbjct: 168 GASLAYGAAQPPGPED-LHTALMTLTREFDAVHGGFGGAPKFPPSMVLEFLLRHHAR--- 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG         ++V  T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L  
Sbjct: 224 TGSQA----ALQIVQATCEAMARGGIYDQLGGGFARYAVDATWTVPHFEKMLYDNALLCR 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       +  +  ++L R++    G   SA DADS + +G     EGA+YV
Sbjct: 280 VYAHLWRATGSDLARRVAVETAEFLVRELRTEQGGFASALDADSDDGKGG--HAEGAYYV 337

Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT +++ + LGE  A L  E++ +   G             F+  + ++ L D  A A  
Sbjct: 338 WTPEQLSEALGEKDAELAAEYFGVTEEGT------------FEQSSSVLRLPDREALADA 385

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                      +   R +L   R +RPRP  DDKV+ +WNGL +++ A            
Sbjct: 386 ---------ERIASVRERLLAARGQRPRPGRDDKVVAAWNGLAVAALAETGAYF------ 430

Query: 439 AMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
                     DR + +E A +AA   +R HL D    RL  +  +G + A  G L+DYA 
Sbjct: 431 ----------DRPDLVEAATAAADLLVRVHLDDRG--RLARTSLDGTAGAHAGVLEDYAD 478

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +  G L L        W+  A  L +T    F   +G  Y   T +D   L+R  +D  D
Sbjct: 479 VAEGFLALSSVTGEGAWVGLAGLLLDTVQRHFAAEDGMLY--DTADDAEALIRRPQDPTD 536

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A PSG + +   L+  A++   +  D  R+ AE +L V +     +   VP      + 
Sbjct: 537 NAAPSGWTAAAGALLSYAAV---TGEDRPREAAERALGVVQA----LGARVPRFIGWGLA 589

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFWE 667
            A  +L  P  + V +VG     D +    A H +  L      V+ +    + E+    
Sbjct: 590 VAEALLDGP--REVAVVGP----DGDPATRALHRAALLGTAPGAVVAVGEPGSREVPL-- 641

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   +        +  A VC+ F+C  P  D  +L   L
Sbjct: 642 --------LLDRPLLEGRPAAYVCRRFTCDAPTADVGTLAGKL 676


>gi|443624623|ref|ZP_21109091.1| putative Spermatogenesis-associated protein 20 [Streptomyces
           viridochromogenes Tue57]
 gi|443341889|gb|ELS56063.1| putative Spermatogenesis-associated protein 20 [Streptomyces
           viridochromogenes Tue57]
          Length = 680

 Score =  324 bits (830), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 227/699 (32%), Positives = 325/699 (46%), Gaps = 79/699 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A  LN  FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 51  SSCHWCHVMAHESFEDQETADYLNAHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPP  ++G P F+ +L  V  AW  +RD +A+     +  L+   
Sbjct: 111 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVHSAWADRRDEVAEVAGKIVRDLAGRE 170

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +S   +      EL Q  L      L++ YD + GGFG APKFP  + I+ +L H  +  
Sbjct: 171 ISFGGTEAPGEQELAQALL-----GLTREYDPQRGGFGGAPKFPPSMVIEFLLRHHAR-- 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 224 -TGSEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALLC 278

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y   +  T       +  +  D++ R++    G   SA DADS   +G  R  EGA+Y
Sbjct: 279 RGYAHLWRATGSELARRVALETADFMVRELRTNEGGFSSALDADS--DDGTGRHVEGAYY 336

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VWT +++ + LG+        Y+                        + E       +S 
Sbjct: 337 VWTPRQLRETLGDDDAELAARYF-----------------------GVTEEGTFEHGSSV 373

Query: 379 LGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           L +P +  L   + +   R++L D RS+RP P  DDK++ +WNGL I++ A         
Sbjct: 374 LQLPQQDELFDADRVASIRQRLLDRRSERPAPGRDDKIVAAWNGLAIAALAET------- 426

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDY 494
              A F+ P              +A   +R HL D    RL  + ++G   A  G L+DY
Sbjct: 427 --GAYFDRP------DLVDAALAAADLLVRLHLDD--AARLARTSKDGQVGANAGVLEDY 476

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
             +  G L L        WL +A  L +     F D E G  ++T  +   ++ R ++  
Sbjct: 477 GDVAEGFLALASVTGEGVWLDFAGFLLDHVLARFTDEESGALYDTAADAEQLIRRPQDPT 536

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC---C 611
           D A PSG S +   L+   S  A + S  +R  AE +L V    +K +   VP       
Sbjct: 537 DNAAPSGWSAAAGALL---SYAAQTGSAPHRAAAEKALGV----VKALGPRVPRFVGWGL 589

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           A    ++   + V +VG          L            V+ +   D++E+        
Sbjct: 590 AVAEANLDGPREVAIVGPSLDEQATRTLHRTALLATAPGAVVAVGTPDSDELPL------ 643

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
               +A       +  A VC+NF+C  P TDP  L   L
Sbjct: 644 ----LADRPLVGGEPAAYVCRNFTCDAPTTDPERLRTAL 678


>gi|312194562|ref|YP_004014623.1| N-acylglucosamine 2-epimerase [Frankia sp. EuI1c]
 gi|311225898|gb|ADP78753.1| N-acylglucosamine 2-epimerase [Frankia sp. EuI1c]
          Length = 686

 Score =  324 bits (830), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 216/623 (34%), Positives = 312/623 (50%), Gaps = 71/623 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFEDE  A  +N+ FV+IKVDREERPDVD VYM    AL G GGWP++V
Sbjct: 49  SCHWCHVMAHESFEDEATAAFMNEHFVNIKVDREERPDVDAVYMDVTVALTGHGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P  +P   GTYFPP+ + G P F  +L+ + +AW  +RD +  SGA    +L+EA +
Sbjct: 109 FLTPAGEPFFAGTYFPPQGRPGMPAFSQVLQALSEAWVTRRDEIESSGADIARKLAEA-A 167

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            S    +    L  + L    +QL+  +D R GGFG+APKFP  +  +++L H       
Sbjct: 168 ESPVGGRAGTRLDADLLDRAVDQLAGRFDPRNGGFGAAPKFPPSMVAELLLRHH------ 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            +SG+A     +V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD  QL  V
Sbjct: 222 ARSGDA-RALDLVALTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLLRV 280

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS-----------AET--- 306
           YL  +  T     + + R+  ++L  D+    G   SA DAD+           AE+   
Sbjct: 281 YLHLWRATGSGLAARVVRETAEFLLADLRTAEGGFASALDADAVPPAAPDGPGGAESGPG 340

Query: 307 -EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
            E  +   EGA YVWT  ++  +L  + A    E + + P G             F+  +
Sbjct: 341 DEHGSHPVEGASYVWTPAQLAAVLAPDDAAWAAELFAVTPEGT------------FEHGS 388

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
            +++L    A  ++           L   R +L   R+ RP+P  DDKV+ SWN      
Sbjct: 389 SVLQLPADPADPAR-----------LARVRDELAAARALRPQPARDDKVVASWN------ 431

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG 483
                 I       A+F  P        ++E AE AAS +R  HL D +  R     + G
Sbjct: 432 ---GLAIAALAEAGALFEVPA-------WIEAAERAASLLRDVHLVDGRLRRTSRHGKVG 481

Query: 484 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 543
           P+   G LDDY  +  GLL LY+      WL  A EL +     F   + GG+++T  + 
Sbjct: 482 PNA--GVLDDYGNVAEGLLALYQVTGELAWLELARELLDVARARFRAPD-GGFYDTADDA 538

Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA-VFETRLKDM 602
            ++L R +E  D   PSG S     L+  A++   + S  +R++AE ++  +     +D 
Sbjct: 539 ETLLRRPREISDSPTPSGQSAFAGALLTYAAL---TGSADHREDAEATVGLLAALLARDA 595

Query: 603 AMAVPLMCCAADMLSVPSRKHVV 625
           + A      A  +L+ P+   VV
Sbjct: 596 SFAGYAGAVAEALLAGPAEVAVV 618


>gi|402848267|ref|ZP_10896531.1| Thymidylate kinase [Rhodovulum sp. PH10]
 gi|402501421|gb|EJW13069.1| Thymidylate kinase [Rhodovulum sp. PH10]
          Length = 710

 Score =  323 bits (829), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 231/702 (32%), Positives = 344/702 (49%), Gaps = 70/702 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED   A ++N+ FV IKVDREERPD+D++YM  +  L   GGWPL++
Sbjct: 57  ACHWCHVMAHESFEDPATAAVMNELFVPIKVDREERPDIDQIYMAALHHLGDQGGWPLTM 116

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P  +P+ GGTYFP   ++G+P F  +LR+V   + ++ + + Q+    + +L+    
Sbjct: 117 FLTPSGEPVWGGTYFPRVSRFGKPAFVDVLREVSRLFREEPEKIEQNRRALMGRLAHRAQ 176

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED- 199
           A+        EL +      A Q++ + D   GG   APKFP+P  ++  ++ + + ED 
Sbjct: 177 AAGRPVIGLAELDR-----MAAQIAGAIDLVNGGLRGAPKFPQPTMLE-TIWRAGEREDA 230

Query: 200 -TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG +   +    +V  TL+ M +GGI DH+GGGF RYSVD+RW VPHFEKMLYD  QL 
Sbjct: 231 RTGFAHPTNLFYDLVALTLERMCEGGIFDHLGGGFARYSVDDRWLVPHFEKMLYDNAQLL 290

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            +   A + T    +     + + +L R+M  P G   ++ DADS   EG    +EG FY
Sbjct: 291 ELLALAHARTGHELFRQRAEETVGWLLREMTTPEGAFCASLDADS---EG----EEGKFY 343

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND-SSASA 376
           VWT +E+  +LG E A  F  HY ++P GN            F+GK +L  L     A+ 
Sbjct: 344 VWTLEEIVGVLGPEDAARFAAHYDVEPAGN------------FEGKTILDRLPGLDQAAQ 391

Query: 377 SKLGMP--LEKYLNI-----LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
           ++ G+P  L KY +      L   R++LFD RS R RP  DDK++  WNGL I++ A A 
Sbjct: 392 ARTGLPFALHKYADARIEADLAAMRQRLFDARSTRVRPGTDDKILADWNGLTIAALANAG 451

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +L                D    +++A  A +F+   +   +  RL HS+R+G    PG
Sbjct: 452 TLL----------------DVPASIDLARRAFAFVATEM--TRHGRLGHSWRDGRLLFPG 493

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
              DYA +I   L L+E     ++L  A+  Q   D    D E G Y+ +  +   +++R
Sbjct: 494 LASDYAAMIRAALALHEATGEKEFLDRAVAWQEAFDHHHQDVETGTYYLSADDAEGLVVR 553

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
                D A P+ N ++  NLVRLA +   +  D +R+ A+  L     R  D       +
Sbjct: 554 PSATTDDAIPNPNGLAAQNLVRLAVL---TGDDRWRERADALLEGLLPRAADNLFGHLSV 610

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             A D+        + +VG    +     L  A         ++   P+         E 
Sbjct: 611 MNALDLRL--RGLEIAIVGEGPHI---AALTGAAQHIPFGSRILFRAPS--------PEA 657

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
              N        +A +  A VC    CS PVT P  L   +L
Sbjct: 658 LPENHPARAQAAAAPEGAAFVCAGERCSLPVTTPEGLREAIL 699


>gi|218437933|ref|YP_002376262.1| hypothetical protein PCC7424_0938 [Cyanothece sp. PCC 7424]
 gi|218170661|gb|ACK69394.1| protein of unknown function DUF255 [Cyanothece sp. PCC 7424]
          Length = 687

 Score =  323 bits (829), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 241/731 (32%), Positives = 343/731 (46%), Gaps = 126/731 (17%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDGAIAEYMNANFLPIKVDREERPDLDSIYMQALQMMIGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-SE 137
           +FL+P DL P  GGTYFP E +Y RPGF  +L+ V+  +D +++ L       +E L + 
Sbjct: 108 IFLTPDDLVPFYGGTYFPVEPRYNRPGFLQVLQSVRHFYDTEKEKLKSFKQEILEVLHNS 167

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-K 196
            +   + +N    EL    L+   + ++KS     G FG  P FP      ++L  S+ K
Sbjct: 168 TILPLSDTNLQAHELFYRGLKTNTQVITKS----VGDFGR-PSFPMIPYASLILQGSRFK 222

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            E      +A+E +   L      A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ
Sbjct: 223 FESDYDGKQAAEARGADL------ALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQ 276

Query: 257 LANVYLDAFSLTKDVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           +     + +S      Y    R I     +L+R+M  P G  ++A+DAD+         +
Sbjct: 277 IIEYLANLWSSGSQ--YPSFQRAIAGTAQWLKREMTAPEGYFYAAQDADNFVHSEDAEPE 334

Query: 314 EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGAFYVW   ++E +L E  +   K  + + P GN            F+G NVL      
Sbjct: 335 EGAFYVWRYSDLEKLLSEDELEALKTAFTITPEGN------------FEGSNVL------ 376

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVR-------------------------SKRPRP 407
               ++ G   E +  IL     KLF VR                           R  P
Sbjct: 377 --QRTQEGTFTEDFEEILD----KLFGVRYGASSQDIEHFPPARNNQEAKTGNWQGRIPP 430

Query: 408 HLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH 467
             D K+IV+WN L+IS  ARA  + +          P+       Y E+A  AA FI ++
Sbjct: 431 VTDTKMIVAWNSLMISGLARAYGVFRE---------PL-------YWELATGAAEFICQN 474

Query: 468 LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE-FGSGTKWLVWAIELQNTQDE 526
            +  Q  RL      G +      +DYAFLI  LLDL   F S T+WL  AIE+Q   D 
Sbjct: 475 QW--QNGRLHRLNYEGQATVLAQSEDYAFLIKALLDLQTAFPSKTEWLNKAIEIQEEFDN 532

Query: 527 LFLDREGGGYFNT-TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
           LF   E GGY+N  T     +L+R +   D A PS N +++ NL+RL  +   +++  Y 
Sbjct: 533 LFCSVEMGGYYNNATDNSEDLLVRERSYLDNATPSANGIAITNLIRLGRL---TENLSYF 589

Query: 586 QNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHAS 645
           + AE +L  F + L     A P +  A D       +H + V   S +            
Sbjct: 590 EQAERALQAFSSILSQSPQACPSLFTALDWY-----RHGISVRATSQI------------ 632

Query: 646 YDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPIS 705
             L + +    P     +D         A +      +D+ V LVCQ  SC  P T    
Sbjct: 633 --LERLIFQYFPTAVYRVD---------AEL------SDQTVGLVCQGLSCLEPATTLEK 675

Query: 706 LENLLLEKPSS 716
           L+  + +  SS
Sbjct: 676 LQTQMKQATSS 686


>gi|354611184|ref|ZP_09029140.1| hypothetical protein HalDL1DRAFT_1849 [Halobacterium sp. DL1]
 gi|353196004|gb|EHB61506.1| hypothetical protein HalDL1DRAFT_1849 [Halobacterium sp. DL1]
          Length = 724

 Score =  323 bits (828), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 228/709 (32%), Positives = 333/709 (46%), Gaps = 56/709 (7%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESF D+GVA  LN+ FV +KVDREERPDVD +YM   Q + GGGGWPLS
Sbjct: 53  SACHWCHVMEEESFSDDGVAAALNENFVPVKVDREERPDVDSLYMKVCQVVRGGGGWPLS 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            FL+PD KP   GTYFP E K  +PGF  +L  V D+W  +R  L       +      L
Sbjct: 113 AFLTPDRKPFFVGTYFPKEPKRNQPGFTQLLDDVADSWQTERGDLEDRAEQWLSAAKGEL 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                +  L D+ P   L   A  L+++ D   GGFG APKFP+   +  +L      +D
Sbjct: 173 EDLPDATDLGDDSP---LDEAANALARTADRDNGGFGRAPKFPQAGRVDALLRAHDASDD 229

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             + G+      +V   L  MA GG++DH+GGGFHRY  D  W VPHFEKMLYDQ  L  
Sbjct: 230 GKQYGD------IVREALDAMAGGGLYDHLGGGFHRYCTDADWTVPHFEKMLYDQATLVR 283

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK-EGAFY 318
            Y+D +    +  Y+    + L ++ R++  P G  ++  DA S   +    ++ EGAFY
Sbjct: 284 TYVDGYRSFGEERYADEVGETLAFVDRELGHPDGGFYATLDARSPPIDDPEGERVEGAFY 343

Query: 319 VWTSKEVEDILGEHA-------------ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
           VWT ++VE+ + ++A              LF+  Y +   GN +            G+ V
Sbjct: 344 VWTPEQVENAVADYADEAPADVDPGDLVDLFRARYGVDEAGNFE-----------HGQTV 392

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
           L         A + G   ++   +L     +L   R  RPRP  DDKV+  WNGL+  ++
Sbjct: 393 LTVSASREELADEFGYQEDEVAELLAAAETRLRAARDDRPRPARDDKVLAGWNGLMARAY 452

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
           A A            F+     +D   Y E A  A   +R  L+D +  RL     +G  
Sbjct: 453 AEA---------GLAFDGAEARADEDSYAERAAEAIDHVRSELWDGE--RLARRVIDGDV 501

Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
              G+ +DYA+L +G L  YE       L +A++L +   +   D E G  + T      
Sbjct: 502 AGIGYAEDYAYLAAGALATYEATGDHAHLGFALDLADALLDACYDAETGALYQTPASVQD 561

Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
           V +R +    G  PS   V+   L+ L +    ++   Y   AE  L  +  R++    A
Sbjct: 562 VDVRSQAVDGGPTPSPVGVAAETLLALDAFDPDAE---YANAAEAMLERYGERVQRSPAA 618

Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
            P +  AADML V   + V +      V++   +  A+    L   ++   P    E+D 
Sbjct: 619 HPTLVLAADML-VTGHREVTVAADSLPVEWRRTVGTAY----LPDRLLSRRPRSAVELDE 673

Query: 666 WEEH--NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
           W      ++   +     S +   A VC+  +CSPP++    +E  L E
Sbjct: 674 WLAALGLADAPPIWAGRQSHEAATAYVCRR-ACSPPLSTAEEIEEWLAE 721


>gi|294631112|ref|ZP_06709672.1| conserved hypothetical protein [Streptomyces sp. e14]
 gi|292834445|gb|EFF92794.1| conserved hypothetical protein [Streptomyces sp. e14]
          Length = 676

 Score =  323 bits (828), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 231/702 (32%), Positives = 326/702 (46%), Gaps = 85/702 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED+  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 47  SACHWCHVMAHESFEDQATAGYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPP  ++G P F+ +L  V+ AW  +RD + +     +  L++  
Sbjct: 107 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVRQAWATRRDEVTEVAGKIVRDLAQ-R 165

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
                  +LP  +EL Q  L      L++ YD + GGFG APKFP  + ++ +L H  + 
Sbjct: 166 EIGYGGVQLPGEEELAQALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR- 219

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 220 --TGSEG----ALQMARDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALL 273

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY   +  T       +  +  D++ R++    G   SA DADS   +G  R  EGA+
Sbjct: 274 CRVYAHLWRATGSELARRVALETADFMVRELRTGEGGFASALDADS--DDGTGRHVEGAY 331

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT +++ D LGE        Y+                        + E       +S
Sbjct: 332 YVWTPEQLRDALGEEDAQLAAQYF-----------------------GVTEEGTFEHGSS 368

Query: 378 KLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            L +P ++ +     +   RR L + R+ RP P  DDK++ +WNGL I++ A        
Sbjct: 369 VLQLPQQEGVFDAERIESVRRLLLERRAGRPAPGRDDKIVAAWNGLAIAALAETGAYF-- 426

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDD 493
                         DR + +E A  AA  + R   DE    L  + R+G   A  G L+D
Sbjct: 427 --------------DRPDLVEAALGAADLLVRLHMDEHAG-LARTSRDGQVGANAGVLED 471

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA +  G L L        WL +A  L       F D + G  ++T  +   ++ R ++ 
Sbjct: 472 YADVAEGFLALASVTGEGVWLDFAGLLLGHVLTRFTDPDSGALYDTAADAEQLIRRPQDP 531

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL----- 608
            D A PSG S +      L    A + S+ +R  AE +L V    +K +   VP      
Sbjct: 532 TDNATPSGWSAAAGA---LLGYAAHTGSEAHRTAAEKALGV----VKALGPRVPRFIGWG 584

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +  A   L  P    VV     S  D         A   L++T + +  A    + +  E
Sbjct: 585 LAVAEAALDGPREVAVVA---PSLAD--------EAGRVLHRTAL-LGTAPGAVVAYGTE 632

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                  +A          A VC++F+C  P TDP  L   L
Sbjct: 633 GGEEFPLLADRPLVGGAPAAYVCRDFTCDAPTTDPERLRAAL 674


>gi|307154410|ref|YP_003889794.1| hypothetical protein Cyan7822_4611 [Cyanothece sp. PCC 7822]
 gi|306984638|gb|ADN16519.1| protein of unknown function DUF255 [Cyanothece sp. PCC 7822]
          Length = 685

 Score =  323 bits (827), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 240/730 (32%), Positives = 344/730 (47%), Gaps = 123/730 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDAAIAEYMNTHFLPIKVDREERPDLDSIYMQALQMMIGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP E +Y RPGF  +L+ V+  +D ++D L       +E L  A
Sbjct: 108 IFLTPDDLVPFYGGTYFPVEPRYNRPGFLQVLQSVRHFYDNEKDKLKSFKKEILEVLQSA 167

Query: 139 -LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-- 195
            +     +N + ++L    +      ++ S +     FG  P FP      + L  S+  
Sbjct: 168 TVLPLGDANLVSNDLFYRGIETNTAVITNSAND----FGR-PSFPMIPYANLTLQGSRFE 222

Query: 196 -KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            + ++ GK      G+ + L        GGI+DH+GGGFHRY+VD  W VPHFEKMLYD 
Sbjct: 223 FQSQNDGKQAAIQRGEDLAL--------GGIYDHIGGGFHRYTVDSTWTVPHFEKMLYDN 274

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           GQ+     + +S   +V    + R I   + +L+R+M  P G  ++A+DADS  T     
Sbjct: 275 GQIVEYLANLWS--SEVQKPSLARAIAGTVQWLKREMTAPEGYFYAAQDADSFTTPEDVE 332

Query: 312 KKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
            +EGAFYVW+  +++ +L    +   K  + + P GN            F+GKNVL    
Sbjct: 333 PEEGAFYVWSYSDIQQLLSTDELEALKTAFTVTPEGN------------FEGKNVL---- 376

Query: 371 DSSASASKLGMPLEKYLNILGECR--------------RKLFDVRS----KRPRPHLDDK 412
              AS  K     E  L+ L   R              R   + +S     R  P  D K
Sbjct: 377 -QRASEGKFAEDFEAVLDKLFAVRYGASSSTLDRFPPARNNAEAKSGNWPGRIPPVTDTK 435

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DE 471
           +IV+WN L+IS  ARA  + +          P+       Y E+A  A  FI  H + + 
Sbjct: 436 MIVAWNSLMISGLARAYGVFRE---------PL-------YWELAVGATEFIFTHQWKNG 479

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLD 530
           + HRL +    G +      +DYAFLI  LLDL       T+WL  AI +Q   D LF  
Sbjct: 480 RLHRLNYE---GETGVLAQSEDYAFLIKALLDLQTASPAETEWLNKAISVQQEFDNLFWS 536

Query: 531 REGGGYFNTTGEDPSVLLRVKEDH--DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
            E GGY+N + ++   L+ VKE    D A PS N V+V NL+RLA +    +   Y   A
Sbjct: 537 VEMGGYYNNSTDNSQDLI-VKERSYIDNATPSANGVAVTNLIRLARLTENLE---YLSQA 592

Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 648
           E +L  F + LK    A P +  A D       ++ + V  K  +              L
Sbjct: 593 EQTLQAFSSILKQSPQACPSLFTALDWY-----RYSISVRSKPDI--------------L 633

Query: 649 NKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLEN 708
            + +    P     +D    H             AD+V  LVCQ  SC  P     SLE 
Sbjct: 634 ERLIFQYFPTAVYRVD----HQ-----------LADQVEGLVCQGLSCLEPAR---SLEK 675

Query: 709 LLLEKPSSTA 718
           L  +   +T+
Sbjct: 676 LQQQIKQATS 685


>gi|345008957|ref|YP_004811311.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
 gi|344035306|gb|AEM81031.1| hypothetical protein Strvi_1280 [Streptomyces violaceusniger Tu
           4113]
          Length = 678

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 218/689 (31%), Positives = 322/689 (46%), Gaps = 74/689 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A  LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDKATADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P   GTYFPP  + G   F+ +L  V  AW  +R+ +       +E L++  
Sbjct: 108 VFLTPEAQPFYFGTYFPPRPRPGMASFRQVLEGVSAAWTDRREEVVDVAGRIVEDLAQRT 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             +  S+  P    +  L      L++ +D+  GGFG APKFP  + ++ +L H  +   
Sbjct: 168 GIALGSDA-PAPPGEEDLHAALMGLTREFDATRGGFGGAPKFPPSMALEFLLRHHAR--- 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG  G      +MV  T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 224 TGSEG----ALQMVSATCEAMARGGIYDQLGGGFARYSVDAGWTVPHFEKMLYDNALLCR 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       +  +  D++ R++    G   SA DADS   +G  R  EGA+YV
Sbjct: 280 VYAHLWRATGSDLARRVALETADFMVRELRTAQGGFASALDADS--DDGTGRHVEGAYYV 337

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT + + ++LGE    F   Y+                  F+    +++L D    A   
Sbjct: 338 WTPERLREVLGEADAEFAAGYF-----------GVTQEGTFEQGASVLQLPDGKRPADA- 385

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
                     +   R +L   R +R RP  DDK++ +WNGL +++ A             
Sbjct: 386 --------GRVASVRERLLAARERRARPGRDDKIVAAWNGLAVAALAETGAYF------- 430

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLI 498
                    DR + ++VA  AA  + R L+ +Q  RL  +  +G +    G L+DYA + 
Sbjct: 431 ---------DRPDLVDVATEAAELLMR-LHMDQRGRLARTSLDGTAGGHAGVLEDYADVA 480

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            G L L        W+ +A  L +T    F   E G  F+T  +  +++ R ++  D A 
Sbjct: 481 EGFLALSAVTGDGAWVDFAGLLLDTVLTRFT-AEDGTLFDTADDAEALIRRPQDPTDNAA 539

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCAA 613
           PSG + +   L+  A+I   S+   +R+ AE +LAV    ++ +   VP      +  A 
Sbjct: 540 PSGWTAAAGALLSYAAITGSSR---HRETAERALAV----VRALGPRVPRFIGWGLAVAE 592

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
             L  P  + V +VG         +  AA  +      V   +P   E            
Sbjct: 593 ARLDGP--REVAVVGPGDDPATRALHRAALLATAPGAVVAVGEPGSGE-----------V 639

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
             +        +  A VC+ F+C  P  D
Sbjct: 640 PLLQDRPLLEGRPAAYVCRGFTCDAPTAD 668


>gi|440700552|ref|ZP_20882794.1| hypothetical protein STRTUCAR8_07071 [Streptomyces turgidiscabies
           Car8]
 gi|440276815|gb|ELP65027.1| hypothetical protein STRTUCAR8_07071 [Streptomyces turgidiscabies
           Car8]
          Length = 677

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 233/701 (33%), Positives = 336/701 (47%), Gaps = 83/701 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDQATADYLNENFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPPE + G P F+ +L  V+ AW  +RD +A+     +  L+   
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRSGMPSFREVLEGVRSAWTDRRDEVAEVAQKIVRDLA-GR 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                + + P E  Q    L    L++ YD++ GGFG APKFP  + ++ +L H  +   
Sbjct: 167 EIGYGATEAPTEEDQARALLG---LTREYDAQRGGFGGAPKFPPSMVLEFLLRHGAR--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 221 TGSEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCR 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       +  +  D+L R++    G   SA DADS   +G  +  EGA+YV
Sbjct: 277 VYAHLWRATGSELARRVALETADFLVRELRTAEGGFASALDADS--DDGTGKHVEGAYYV 334

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
           WT  ++ ++LG E A L  +++ +   G  +           +G +VL +  ++    A 
Sbjct: 335 WTPAQLTEVLGAEDAELAAQYFGVTADGTFE-----------EGASVLQLPQHEGVFDAE 383

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           K+              + +L   R +RP P  DDKV+ +WNGL I++ A           
Sbjct: 384 KVDY-----------VKARLLAARGERPAPGRDDKVVAAWNGLAIAALAET--------- 423

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
            A F  P              +A   +R HL D++ H L  + ++G   A  G L+DYA 
Sbjct: 424 GAYFERP------DLVDAALAAADLLVRVHL-DDRAH-LARTSKDGQVGANAGVLEDYAD 475

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +  G L L        WL +A  L +     F+D E G  F+T  +   ++ R ++  D 
Sbjct: 476 VAEGFLALASVTGEGVWLEFAGFLLDHVLVRFVDEESGALFDTASDAEQLIRRPQDPTDN 535

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCC 611
           A PSG + +   L+  A+         +R  AE +L V    +K +   VP      +  
Sbjct: 536 AVPSGWTAAAGALLGYAAQTGAVP---HRAAAERALGV----VKALGPRVPRFIGWGLAV 588

Query: 612 AADMLSVPSRKHVV--LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
           A  +L  P    VV   +G  ++V        A A       V+ +   D+EE+      
Sbjct: 589 AEALLDGPREVAVVGPSLGDPATVALHRTALLATAP----GAVVAVGSVDSEELPL---- 640

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +A          A VC+NF+C  P TDP  L   L
Sbjct: 641 ------LAGRPLVGGAAAAYVCRNFTCDAPTTDPERLRIAL 675


>gi|284989523|ref|YP_003408077.1| hypothetical protein Gobs_0945 [Geodermatophilus obscurus DSM
           43160]
 gi|284062768|gb|ADB73706.1| protein of unknown function DUF255 [Geodermatophilus obscurus DSM
           43160]
          Length = 665

 Score =  322 bits (825), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 229/691 (33%), Positives = 318/691 (46%), Gaps = 78/691 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A  +N  FV +KVDREERPDVD VYM   QAL G GGWP++V
Sbjct: 49  ACHWCHVMAHESFEDEATAGQMNADFVCVKVDREERPDVDSVYMAATQALTGHGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F +PD +P   GTYFPP   +G P F+ +L  V DAW  +R+ L  +G    E +S  L 
Sbjct: 109 FTTPDGRPFYCGTYFPPRPAHGMPSFRQLLSAVSDAWRSRREDLETAGTRIAEGISSRLD 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                   P  L    L      L+  YD R+GGFG APKFP  + ++ +L H+ +  D 
Sbjct: 169 LGP-----PAPLAAEVLDHAVAALAGEYDERWGGFGGAPKFPPSMVLEFLLRHAARTGD- 222

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     +M   TL  MA+GGIHD + GGF RYSVD RW VPHFEKMLYD   L  +
Sbjct: 223 ------DRALRMARGTLGAMARGGIHDQLAGGFARYSVDARWVVPHFEKMLYDNALLLRL 276

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL  +  T D +   +      +L RD+  P G   SA DAD+   EG T       YVW
Sbjct: 277 YLHLWRATGDEWARRVADATAAFLVRDLDTPEGGFASALDADAEGVEGLT-------YVW 329

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  E+ ++LGE    +    +           ++D      G + L  L D    A    
Sbjct: 330 TPAELVEVLGEDDGRWAAAVF----------EVTDAGTFEHGTSTLQLLRDPGDPAR--- 376

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
                    L   R +L   R++RP+P  DDKV+ +WNGL I++ A    +  S +    
Sbjct: 377 ---------LASVRERLGAARARRPQPARDDKVVTAWNGLAIAALAEHGVLTGSPS---- 423

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLIS 499
            +        +   +V          H  D    RL+ + RNG + AP G L+DY  L  
Sbjct: 424 -SVDAARRAAELLADV----------HWGD---GRLRRASRNGVAGAPSGVLEDYGDLAE 469

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           GLL L++     +WL  A +L +     F+D +  G+ +T  +  +++ R  +  DG  P
Sbjct: 470 GLLALHQATGEGRWLELAGDLLDVVAGQFIDAD--GWHDTAADAEALVHRPFDPADGPTP 527

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           SG +      V  A++    +     + A  SLA    R          M     +L+ P
Sbjct: 528 SGLAAVAGAAVTYAALAGAPRHRELGEAAVGSLARLAERAPQAVGWA--MAVGEALLAGP 585

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
                V V   +  D + ++AAA AS      V+  +P D   +            +A  
Sbjct: 586 LE---VAVSGPAGPDRDALVAAARASTSPGAVVVVGEP-DAPGVPL----------LAGR 631

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                +  A VC+ F C+ PVTD  +L   L
Sbjct: 632 PLVGGRPAAYVCRGFVCAAPVTDVSALGAAL 662


>gi|269125325|ref|YP_003298695.1| hypothetical protein Tcur_1071 [Thermomonospora curvata DSM 43183]
 gi|268310283|gb|ACY96657.1| protein of unknown function DUF255 [Thermomonospora curvata DSM
           43183]
          Length = 662

 Score =  322 bits (824), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 235/691 (34%), Positives = 323/691 (46%), Gaps = 90/691 (13%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFEDE  A+L+ND FV+IKVDREERPDVD VYM   QA+ G GGWP++VF
Sbjct: 49  CHWCHVMAHESFEDEATARLMNDLFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTVF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
            +PD +P   GTYFP      R  F+ +L  V  AW ++R+ + + G   +E L+    A
Sbjct: 109 ATPDGEPFYCGTYFP------RQQFRALLMAVARAWREEREDVLKQGRKVVEALTARGPA 162

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
              +     E    A+R     L+ SYD+ +GGFG APKFP  + ++ +L H  + +D  
Sbjct: 163 PGETEPPSPERLSAAVR----SLAASYDTAYGGFGGAPKFPPSMVLEFLLRHYARTQD-- 216

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                ++   M   TL+ MA+GGI+D +GGGF RYSVDE W VPHFEKMLYD   LA VY
Sbjct: 217 -----AQALAMATGTLEAMARGGIYDQLGGGFARYSVDEAWVVPHFEKMLYDNALLARVY 271

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
              + LT       I  +  +++ RD+  P G + SA DADS   EG    +EG +YVWT
Sbjct: 272 AHWWRLTGSPLAKRIALETCEWMLRDLRTPQGGLASALDADS---EG----QEGKYYVWT 324

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGN--CDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
            +++  +LGE              GN   +L  +++      G +VL    D        
Sbjct: 325 PEQLRRVLGEA------------DGNAAAELLGVTESGTFEHGTSVLRLPGDPGDQ---- 368

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
                         R +L   R++R  P  DDKV+ +WNGL I++ A    +L       
Sbjct: 369 --------EWWSRVRARLLAARAERVPPARDDKVVTAWNGLAIAALAECGALLG------ 414

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
                     R + +  AE  A  +R  HL D    RL  + R+G P    G L+DYA  
Sbjct: 415 ----------RPDLVGAAEEIARLLREVHLRD---GRLTRTSRDGVPGANAGVLEDYADF 461

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDG 556
             GLL L+        +  A  L  T    F D  GG  F  T +D   L R  +D  D 
Sbjct: 462 AEGLLALHAVTGDPAHVRLAGTLLETVLTHFPDDRGG--FYDTADDAERLFRRPQDPTDN 519

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSG   +   L+  A++   S+   +RQ A  +LA         A         A+ L
Sbjct: 520 ATPSGQFAAAGALLSYAALTGSSR---HRQAAASALAAATLLAGRHARFAGWGLAVAEAL 576

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
            V     + +VG  +      +  AA AS           PA    +       + +  +
Sbjct: 577 -VSGPLEIAIVGDPADARTRALHGAALAS-----------PAPGAVITVGTGEAAGDVPL 624

Query: 677 ARNNFSADKV-VALVCQNFSCSPPVTDPISL 706
            R     D    A VC+NF+C  PVT P  L
Sbjct: 625 LRGRTPVDGAPAAYVCRNFTCRLPVTTPADL 655


>gi|383649966|ref|ZP_09960372.1| hypothetical protein SchaN1_31668 [Streptomyces chartreusis NRRL
           12338]
          Length = 677

 Score =  322 bits (824), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 229/700 (32%), Positives = 331/700 (47%), Gaps = 81/700 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A+ LN  +VS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDQQTAEYLNAHYVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPP  + G P F+ +L+ V  AW+++RD + +     +  L+   
Sbjct: 108 VFLTPDAEPFYFGTYFPPAPRQGMPSFRQVLQGVHQAWEERRDEVTEVAGKIVRDLAGRE 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +S   +      EL Q  L      L++ YD + GGFG APKFP  + ++ +L H  +  
Sbjct: 168 ISYGDAQTPGEQELAQALL-----ALTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGAEG----ALQMAQDTCERMARGGIYDQIGGGFARYSVDRDWIVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS   +G  +  EGA+Y
Sbjct: 276 RVYAHLWRATGSEPARRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAYY 333

Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT  ++ ++LGE  A L   ++ +   G  +  R                        S
Sbjct: 334 VWTPAQLREVLGEQDAELAARYFGVTEEGTFEHGR------------------------S 369

Query: 378 KLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            L +P +  L   + +   R +L   RS RP P  DDKV+ +WNGL I++ A        
Sbjct: 370 VLQLPQQDGLFDADRIASIRERLLAARSGRPAPGRDDKVVAAWNGLAIAALAET------ 423

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDD 493
               A F+ P              +A   +R HL DEQ  RL  + ++G + A  G L+D
Sbjct: 424 ---GAYFDRP------DLVEAALAAADLLVRLHL-DEQA-RLTRTSKDGHAGANAGVLED 472

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA +  G L L        WL +A  L +     F D E G  F+T  +   ++ R ++ 
Sbjct: 473 YADVAEGFLALASVTGEGVWLEFAGFLLDHVLARFTDEESGALFDTAADAERLIRRPQDP 532

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC--- 610
            D A PSG + +   L+   S  A + S  +R  AE +L V    +K +   VP      
Sbjct: 533 TDNAAPSGWTAAAGALL---SYAAHTGSQPHRTAAEKALGV----VKALGPRVPRFIGWG 585

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            AA   ++   + V +VG     +    L            V+ +    ++E        
Sbjct: 586 LAAAEAALDGPREVAVVGPSLEHEGTRTLHRTALLGTAPGAVVAVGAPGSDEFPL----- 640

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                +A       +  A VC+NF+C  P T+   L   L
Sbjct: 641 -----LADRPLVGGEPAAYVCRNFTCDAPTTEADRLRATL 675


>gi|118579500|ref|YP_900750.1| hypothetical protein Ppro_1067 [Pelobacter propionicus DSM 2379]
 gi|118502210|gb|ABK98692.1| protein of unknown function DUF255 [Pelobacter propionicus DSM
           2379]
          Length = 687

 Score =  322 bits (824), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 234/691 (33%), Positives = 320/691 (46%), Gaps = 87/691 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  + FED+ VA LLN  FV IKVDREERPD+D  YMT  Q L G GGWPL++
Sbjct: 76  TCHWCHVMAHDGFEDDQVADLLNRHFVCIKVDREERPDIDDFYMTASQVLTGSGGWPLNI 135

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++PD +P    TY P      R  F  +L  +   W +    + ++ +  +E +     
Sbjct: 136 FMTPDRRPFFAMTYLP------RQRFMELLAGIVTLWQQHPGEVEKNCSAIMEGIERLSR 189

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            +     +  EL   A     EQLS  +D  +GGFG APKFP P+ +         L   
Sbjct: 190 GNDHECPVLAELDSLAF----EQLSAIHDRTWGGFGPAPKFPLPLSLGW-------LAGQ 238

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G +G   E  +M   TL  + +GGI D +GGG HRYSVDERW VPHFEKMLYDQ  LA  
Sbjct: 239 GMNGN-QEALEMAQKTLGMIRQGGIWDQLGGGVHRYSVDERWLVPHFEKMLYDQALLAMA 297

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
            LD      D  +  +  DI  ++ R++    G  FSA DADS         +EGA+Y+W
Sbjct: 298 CLDVCLAGNDPAFLTMAEDIFRFVGRELTSTEGAFFSALDADSG-------GEEGAYYLW 350

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  ++E+ILG    LF   + +   GN            F+G+N+L    D     +  G
Sbjct: 351 TRDDIEEILGRDGELFCRFFDVGEKGN------------FQGQNILHMPVDLETFCT--G 396

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
              E+   IL +CR +L + R +R  P  D+K+I SWNGL+I++ AR   +         
Sbjct: 397 EDPERTGEILDDCRERLLEYREERSYPLRDEKIITSWNGLMIAALARGGAL--------- 447

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                     +EY+E A  AA FI ++L   Q  RL  S+  GPS  P FL+DYAFL  G
Sbjct: 448 -------GGEQEYIESASRAARFILKNLR-RQDGRLLRSYLAGPSSTPAFLEDYAFLCCG 499

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEP 559
           L++L+E    + W   A+ L +    LF D      F T G D   +  +   D DG  P
Sbjct: 500 LIELFEATLDSFWQEQALLLADEMLRLFRD-PVRCVFVTVGLDAEQMAGQSPRDSDGVLP 558

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
           S  S +    +RL    A  + D          A  +   ++    +  +   A +   P
Sbjct: 559 SPFSRAAHCFIRLG--YACDRDDLLDHAHLLLGAPLDDAAENPLSHLGALQALAMLEQEP 616

Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
           +  H    G + S    ++LA+   S+ L   VI     D E                  
Sbjct: 617 TIIH--FRGQRDSRRIASLLASTR-SFPLPNLVIRFTETDHEGE---------------- 657

Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   ALVC   SC  P  D  SLE  L
Sbjct: 658 --------ALVCAQGSCHGPFPDESSLERQL 680


>gi|322702606|gb|EFY94241.1| hypothetical protein MAA_10309 [Metarhizium anisopliae ARSEF 23]
          Length = 738

 Score =  322 bits (824), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 213/670 (31%), Positives = 333/670 (49%), Gaps = 71/670 (10%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+C +M  ESF +   A +LN+ FV + +DREERPDVD +YM YVQA+   GG
Sbjct: 78  HIGYKACHFCRLMTQESFSNPECAAILNESFVPVIIDREERPDVDTIYMNYVQAVSNVGG 137

Query: 76  WPLSVFLSPDLKPLMGGTYFP---------PEDKYGRPGFKTILRKVKDAWDKKR----- 121
           WPL+VF++P+L+P+ GGTY+P          E +   P   TI RKV+D W  +      
Sbjct: 138 WPLNVFVTPNLEPVFGGTYWPGPGTSRRVTTESEDESPDCLTIFRKVRDIWHDQETRCRK 197

Query: 122 ---DMLAQSGAFAIEQL-----------------------SEALSASASSNKLPDELPQN 155
              ++LAQ   FA E                         +  + A     ++  EL  +
Sbjct: 198 EASEVLAQLREFAAEGTLGTRGLTGTHPIATPSWNIPSNPTTPIRARDKDAQVSSELDLD 257

Query: 156 ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKSGEASEGQKM 212
            L      ++ ++D  +GGFG APKF  P ++  +L+ +     ++D     E     +M
Sbjct: 258 QLEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLAFLLHLNTFPSAVQDVVGEAECKHATEM 317

Query: 213 VLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--- 268
            + TL+ +  G +HDH+G  GF R SV   W +P+FEK++ D   L  +Y+DA+ +    
Sbjct: 318 AVDTLRKIRDGALHDHIGATGFARCSVTPDWSIPNFEKLVVDNALLLALYVDAWRIAGGK 377

Query: 269 KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
            D  +  I  ++ DYL    I  P G + ++E ADS    G    +EGA+Y+WT +E + 
Sbjct: 378 ADSEFYDIVLELADYLSSPPIALPSGGLATSEAADSFMRRGDREMREGAYYLWTRREFDS 437

Query: 328 ILG------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
           ++       + + +   H+ ++  GN D     DP+++F   N+L  +      + +  +
Sbjct: 438 VVDASGHDKQISQVAAAHWDVQEGGNVDEDH--DPNDDFINHNILRVVKTQDELSRQFNI 495

Query: 382 PLEKYLNILGECRRKL-FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             +     +   R++L      +R RP LDDKVI +WNGL IS+ A+AS  LK       
Sbjct: 496 SPDTVRQHIQAARKELKARRERERVRPELDDKVITAWNGLAISALAQASSALK------- 548

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
              PV  +   +Y+  AESAA+FI+  L+DE +  L   +R G  +  GF DDY +LI G
Sbjct: 549 ---PVDSARSDKYLHAAESAAAFIKASLWDESSKLLYRIYREG-RETKGFADDYTYLIHG 604

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LLDL+   S    L +A  LQ TQ+ LF D + G +F+TT   P  +LR+K+  D + PS
Sbjct: 605 LLDLFAATSDEGHLAFADALQKTQNSLFHDSDSGAFFSTTASSPQAILRLKDGMDTSLPS 664

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
            N+V+  NL RL +++     + Y   A  ++  FE  +       P +        +  
Sbjct: 665 VNAVAASNLFRLGALL---DDERYSALARGTVNAFEAEMLQHPWLFPGLLSGVVTARLGP 721

Query: 621 RKHVVLVGHK 630
           R+ V  V +K
Sbjct: 722 RESVSDVKYK 731


>gi|375012491|ref|YP_004989479.1| thioredoxin domain-containing protein [Owenweeksia hongkongensis
           DSM 17368]
 gi|359348415|gb|AEV32834.1| thioredoxin domain-containing protein [Owenweeksia hongkongensis
           DSM 17368]
          Length = 675

 Score =  321 bits (823), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 227/706 (32%), Positives = 337/706 (47%), Gaps = 107/706 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME +SFED   A L+N+ F+SIKVDREERPDVD+VYMT VQ + G GGWPL+
Sbjct: 64  SACHWCHVMEHQSFEDSAAAALMNEHFISIKVDREERPDVDQVYMTAVQLMTGRGGWPLN 123

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V   PD +P+ GGTYFP      + G+   L+ + + +    + + +      E+L+E +
Sbjct: 124 VITLPDGRPIWGGTYFP------KDGWMQSLQSIVEVYHDDPEKVLEYA----EKLTEGV 173

Query: 140 SAS--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
             S   S N+ P +  +  + L  +  SK++D + GG   APKFP PV  + +L      
Sbjct: 174 VQSELVSPNETPGDYSKEEIDLLFKNWSKNFDKKEGGSAGAPKFPMPVGYEFLL------ 227

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            + G      E  + +  TL+ MA GGI+D VGGGF RYSVD+ W VPHFEKMLYD GQL
Sbjct: 228 -EYGSLTGNEEAMQQLNLTLRKMAFGGIYDQVGGGFSRYSVDDEWKVPHFEKMLYDNGQL 286

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++Y  A+  TK+  Y  I    +++L RDM+GP GE +SA DADS   EG    +EG +
Sbjct: 287 VSLYSRAYQKTKNPLYKSIVIQTIEWLERDMLGPDGEFYSALDADS---EG----EEGKY 339

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVW   E+++I+G+       +Y+       DL +      +++G+ VL+  +DS  + S
Sbjct: 340 YVWPEVELKEIIGDSDWEDFTNYF-------DLKK-----GKWEGRIVLMRSDDSENTDS 387

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                 E          ++L  VR  R  P LDDK + SWN L+I+    A K       
Sbjct: 388 AKVKAWE----------QELLKVRENRVPPGLDDKSLTSWNALMITGLVDAYKAFGD--- 434

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                          Y+++A+    ++ ++    +   L HS++ G S   G ++DY F 
Sbjct: 435 -------------SHYLDLAKKNGEWLLKNQV-RKDESLFHSYKKGKSSIDGLIEDYTFA 480

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           + G LDLYE     K+L  A          F D   G +F  +     ++ +  E HD  
Sbjct: 481 VQGFLDLYEATFDVKYLEQANAWMKYAKANFEDEGTGLFFTRSKNAKQLIAKSMEVHDNV 540

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            P+ NSV   NL  L          Y+    E  LA  E  L  M               
Sbjct: 541 IPAANSVMAHNLFHL----------YHLTGNESYLAQSEKMLAQM--------------- 575

Query: 618 VPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN---- 672
                 V LV +  S  ++  +L   +  Y   +  I  + AD + M++ ++   N    
Sbjct: 576 ----DKVRLVTYPESFSNWARLL--LNFKYPFYEVAIVGNEADEKYMEWQKQFVPNVLIQ 629

Query: 673 ------NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
                 +  +  N F     +  VC+N  C  PV +     +LLL+
Sbjct: 630 GSWKESDLPLLENRFVKGSTMIYVCENRVCQLPVEEVSKALDLLLK 675


>gi|358457848|ref|ZP_09168063.1| N-acylglucosamine 2-epimerase [Frankia sp. CN3]
 gi|357078866|gb|EHI88310.1| N-acylglucosamine 2-epimerase [Frankia sp. CN3]
          Length = 673

 Score =  321 bits (822), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 220/615 (35%), Positives = 308/615 (50%), Gaps = 62/615 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFED+  A  +N+ FV+IKVDREERPDVD VYM    AL G GGWP++V
Sbjct: 49  SCHWCHVMAHESFEDDTTAAYMNEHFVNIKVDREERPDVDSVYMDVTMALTGHGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P  +P   GTYFPP  + G   F+ +L  V  AWD +R+ +  SGA    +L+EA  
Sbjct: 109 FLTPTGEPFFAGTYFPPTPRPGMGSFRQVLSAVSSAWDTRREEIESSGADIARKLAEAAE 168

Query: 141 ASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           A  +  + P   L    L    +QL+  +D R GGFG APKFP  +  +++L H  +   
Sbjct: 169 APVAGGRGPAIRLDGELLDTAVDQLAARFDPRHGGFGGAPKFPPSMVAELLLRHHAR--- 225

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG   E S G  MV  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD  QL  
Sbjct: 226 TGN--ERSLG--MVALTCERMARGGIYDQLTGGFARYSVDATWTVPHFEKMLYDNAQLLR 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS-----AETEGATRKK- 313
           VYL  +  T D   + + R+   +L  D+  P G   SA DAD+     ++T+G   +  
Sbjct: 282 VYLHLWRTTGDALAARVVRETAAFLLTDLRTPQGGFASALDADAVPPSDSDTDGHPHQPV 341

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGA YVWT  ++ D LG + A      + +  TG  +            G +VL    D 
Sbjct: 342 EGASYVWTPGQLADALGPDDAAWAANLFEVTATGTFE-----------HGSSVLALPADP 390

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
             +            +     R  L   R+ RP+P  DDKV+ SWN            + 
Sbjct: 391 DDA------------DRFARVRATLAATRAARPQPARDDKVVASWN---------GLAVA 429

Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFL 491
                 A+F  P       E++  AE AA  +R  HL D +  R     R GP+   G L
Sbjct: 430 ALAEAGALFEEP-------EWVTAAERAAVLLRDVHLVDGRLRRTSRDGRVGPNV--GVL 480

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           DDY  +  G L L++     +WL  A +L +     F   + GG+++T  + P++L R +
Sbjct: 481 DDYGNVADGFLALHQVTGAVEWLELAGQLLDVARARFRAAD-GGFYDTADDAPTLLRRPR 539

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMC 610
           E  D A PSG S     L+  A++   + S  +R++AE ++ +    L +D   A     
Sbjct: 540 EVSDSATPSGQSAFAGALLTYAAL---TGSAGHREDAEATIGLLAPLLARDARFAGHAGT 596

Query: 611 CAADMLSVPSRKHVV 625
            A  +L+ P    VV
Sbjct: 597 VAEALLAGPPEVAVV 611


>gi|302530109|ref|ZP_07282451.1| transcriptional regulator [Streptomyces sp. AA4]
 gi|302439004|gb|EFL10820.1| transcriptional regulator [Streptomyces sp. AA4]
          Length = 663

 Score =  321 bits (822), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 224/695 (32%), Positives = 329/695 (47%), Gaps = 103/695 (14%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE EG A L+N  FV+IKVDREERPD+D VYM   QA+ G GGWP++ 
Sbjct: 49  ACHWCHVMAHESFEHEGTAALMNAHFVNIKVDREERPDIDAVYMAATQAMTGQGGWPMTC 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P   GTY+PP  + G P F  +L  V +AW+++ D L +     +  L+E   
Sbjct: 109 FLTPEGEPFHCGTYYPPAPRPGIPSFTQLLLAVAEAWEERPDDLREGAKQIVGHLAE--- 165

Query: 141 ASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
               S  L +  +  +AL     +L++  D   GGFG APKFP  + ++ +L H ++   
Sbjct: 166 ---QSGPLKEAAVDADALAEAVTKLAQEADPVHGGFGGAPKFPPSMVLEFLLRHHER--- 219

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG    +++   +     + MA+GGIHD +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 220 TG----SAQAYALAESAAEAMARGGIHDQLGGGFARYSVDAEWIVPHFEKMLYDNALLLR 275

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY    +         +   I+ +L  D++ P G   ++ DAD+   EG T       YV
Sbjct: 276 VYAH-LARRGSASARRVAEGIVRFLEHDLLTPQGGFAASLDADTEGVEGLT-------YV 327

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELNDSSA 374
           WT  ++ ++LGE      E + +   G  +     L   +DP +  + + V         
Sbjct: 328 WTPAQLNEVLGEDGPWAAELFSVTEEGTFEEGASTLQLRADPDDFARFERV--------- 378

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
                              R+ L + R+ RP+P  DDKV+ +WNGL IS+ A A   L  
Sbjct: 379 -------------------RQALLEARAARPQPGRDDKVVAAWNGLAISALAEAGVAL-- 417

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKAP-GFLD 492
                         +R +++E+A +AAS  +  HL D    RL+ S R+G   AP G L+
Sbjct: 418 --------------ERPQWIELARNAASLLLDLHLVD---GRLRRSSRDGAVGAPVGVLE 460

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVK 551
           DYA L  GLL L++     +WL  A  L +     F      G ++ T +D  VL+ R  
Sbjct: 461 DYACLADGLLALHQATGEPRWLTEATRLLDVALTHFASDSAPGAYHDTADDAEVLVQRPS 520

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
           +  D A PSG S     L+  +++    ++  YR  AE +L     R+  +A  VP    
Sbjct: 521 DPTDNASPSGASALAGALLTASALAGSDQAARYRDAAELAL----RRVGLLAARVPRF-- 574

Query: 612 AADMLSVPSRK-----HVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
           A   LSV          V +VG + +     ++ AA         V+  +P D   +   
Sbjct: 575 AGHWLSVAEAAQSGPVQVAVVGGERA----QLVTAAAQHIHGGGIVLGGEP-DAPGVPL- 628

Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
                    +A       +  A VC+ + C  PVT
Sbjct: 629 ---------LADRPLVGGEAAAYVCRGYVCERPVT 654


>gi|289209063|ref|YP_003461129.1| hypothetical protein TK90_1902 [Thioalkalivibrio sp. K90mix]
 gi|288944694|gb|ADC72393.1| protein of unknown function DUF255 [Thioalkalivibrio sp. K90mix]
          Length = 677

 Score =  321 bits (822), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 231/696 (33%), Positives = 343/696 (49%), Gaps = 73/696 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
           + CHWCHVM  ESFED   A+++N  F++IKVDREERPD+D++Y      L    GGWPL
Sbjct: 47  SACHWCHVMAHESFEDPATAEVMNRRFINIKVDREERPDLDRIYQNAHMLLSQRPGGWPL 106

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +VFL+PD  P   GTYFP   ++G P F  ++ +V D   +  D + +      E L +A
Sbjct: 107 TVFLTPDQVPFFAGTYFPSTPRHGLPSFVDLMNRVADFLAEHPDEIQRQN----ESLQQA 162

Query: 139 LSA--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           L+     +   +P       L     +L++++D +FGGFG APKFP P  ++ + +H+ +
Sbjct: 163 LARIYRPAGGAIP---AIGVLDKARAELAQTFDDQFGGFGDAPKFPHPASLEWLAWHAAR 219

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             D       +E ++M+  TL  MA GGI D VGGGF RYSVD RW +PHFEKMLYD G 
Sbjct: 220 HND-------AEAERMLERTLAAMAAGGIFDQVGGGFCRYSVDARWMIPHFEKMLYDNGP 272

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L  +Y +  +   D     +    + +L R+M  P G  +S+ DADS   EG    +EG 
Sbjct: 273 LLGLYAERAAAGDDR-ARRVAEQTVAWLEREMRDPSGAFYSSLDADS---EG----EEGR 324

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           FYVW  + VE +L E   +     +           ++ P N F+G+  L E+   +  A
Sbjct: 325 FYVWDPEMVEGLLPEDEWVVASRVW----------GLNGPAN-FEGRWHLHEVAPIATVA 373

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
             LG+   +    LG  R +L   R +R RPH DDK++ +WN L+I+  ARA++ L    
Sbjct: 374 DALGIDESEAETRLGRARERLLAAREQRVRPHRDDKILGAWNALMINGLARAARAL---- 429

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAP-GFLDDY 494
                       +R +++ +A +A   +R  L+ +   RL  SFR G  S+ P  +LDD+
Sbjct: 430 ------------ERHDWLGLARAAMRAVRERLWHDG--RLFASFREGATSELPRAYLDDH 475

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A L+   L L E       L WA  L       F D E GG+F T  +  +++ R K   
Sbjct: 476 ALLLEATLALLEVEWDGDLLGWATTLAEALLADFEDTEHGGFFYTARDHEALIQRPKVYA 535

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D A  +GN ++   L +L  ++A  +   Y + AE +LA     ++   +    +  A D
Sbjct: 536 DDAMAAGNGIAAQALQKLGYLLAEPR---YLEAAERTLANAGPMIEQAPLGHMSLLVALD 592

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           M   P    VVL G    +        AH   D    V  I PA  +++           
Sbjct: 593 MHQQPP-PLVVLRGAADELAPWQQRLRAH---DAPMWVFAI-PAQADDL---------PP 638

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           ++A        V A +C+   C  PVTDP +LE +L
Sbjct: 639 ALAEKAAPETGVRAYLCRGLHCEVPVTDPAALEGVL 674


>gi|195952439|ref|YP_002120729.1| hypothetical protein HY04AAS1_0059 [Hydrogenobaculum sp. Y04AAS1]
 gi|195932051|gb|ACG56751.1| protein of unknown function DUF255 [Hydrogenobaculum sp. Y04AAS1]
          Length = 634

 Score =  320 bits (821), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 214/612 (34%), Positives = 305/612 (49%), Gaps = 85/612 (13%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
              +F    K  +  FL    ++CHWCHVME ESFEDE VA  LN  FVSIKVD+EERPD
Sbjct: 29  SEEAFDKAIKENKPVFLSIGYSSCHWCHVMEKESFEDEEVASFLNKCFVSIKVDKEERPD 88

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +Y+ Y   L   GGWPLSVFL+P  +P   GTYFP      +  F  +L ++KD WD
Sbjct: 89  IDSLYIEYCVLLNNSGGWPLSVFLTPTKEPFFAGTYFP------KASFLKLLNQIKDLWD 142

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           K    + +     +EQL + +++         EL ++ +      L+  YD  FGGF  A
Sbjct: 143 KDSKNIIEKSKRMVEQLKQFMNSFEKR-----ELNESFIDKALFGLANRYDEEFGGFSEA 197

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP    + ++L   K+             Q M L TL  M +GGI DHVGGGFHRYS 
Sbjct: 198 PKFPSLHNVLLLLKSQKQ-----------PFQDMALSTLLNMRRGGIWDHVGGGFHRYST 246

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D  W +PHFEKMLYDQ      Y +A+ LTK+  +       +++++ ++    G  +++
Sbjct: 247 DRYWLLPHFEKMLYDQAMAILAYSEAYRLTKNEIFKDTVYKTINFVKENLY-ENGFFYTS 305

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
            DAD   TEG    +EG FY+WT +E++DIL E    F E + +K  GN     + +   
Sbjct: 306 MDAD---TEG----EEGGFYLWTYQEIKDILKEKTDKFIEFFNIKKEGNF----LDEAKR 354

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
            + GKNVL         A +  M  E  L +L     K F  R KR +P +DDK+++  N
Sbjct: 355 VYTGKNVLY--------AKEPTMLFENELQVL-----KAF--REKRKKPLIDDKILLDQN 399

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
            ++  +   A  + +                 K+++++A        ++L +   H LQH
Sbjct: 400 AMMDWALIEAYLVFED----------------KDFLDMA-------TKNLNNISKHPLQH 436

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
           +  +     P  LDDYA+LI   L LY+       L  AI L     E   D+  GG++ 
Sbjct: 437 ALNHNKLIEP-MLDDYAYLIKAYLSLYKATFSKDALEKAISLTEEAIEKLWDKNAGGFYL 495

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
           + G+D  VL+  K  +DGA PSGNSV  +NLV L  I   +K D Y    E+   +  + 
Sbjct: 496 SVGKD--VLIPQKTLYDGAIPSGNSVMGLNLVELFFI---TKEDTY----ENRYQILSSI 546

Query: 599 LKDMAMAVPLMC 610
             DM    P  C
Sbjct: 547 YSDMLSRNPTAC 558


>gi|302542885|ref|ZP_07295227.1| conserved hypothetical protein [Streptomyces hygroscopicus ATCC
           53653]
 gi|302460503|gb|EFL23596.1| conserved hypothetical protein [Streptomyces himastatinicus ATCC
           53653]
          Length = 678

 Score =  320 bits (821), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 235/703 (33%), Positives = 332/703 (47%), Gaps = 86/703 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED   A+ LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDAETAEYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPP  + G P F+ +L  V+ AW  +RD +       +E L+   
Sbjct: 108 VFLTPDAQPFYFGTYFPPRPRPGMPSFRQVLEGVRAAWADRRDEVRDVAGKIVEDLAGRT 167

Query: 140 SASASSNKLPDELPQNALRLCAE--QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
             +  S       P  A  L A    L++ +D+  GGFG APKFP  + ++ +L H  + 
Sbjct: 168 GIALGSGA---PQPPGAEDLAAGLMGLTREFDAVRGGFGGAPKFPPSMALEFLLRHHAR- 223

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +MV  T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L
Sbjct: 224 --TGSEG----ALQMVQATCEAMARGGIYDQLGGGFARYAVDAEWIVPHFEKMLYDNALL 277

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY   +  T       +  +  D+L R+M    G   SA DADS   +G  R  EGA+
Sbjct: 278 CRVYAHLWRATGSDLARRVALETADFLVREMRTEQGGFASALDADS--DDGTGRHVEGAY 335

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT +++ + LGE        Y+           +++     KG +VL +L D +  A 
Sbjct: 336 YVWTPEQLREALGEADAEQAAAYF----------GVTEEGTFEKGASVL-QLPDGARPAD 384

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                       L   R +L   R +R RP  DDK++ +WNGL I++ A           
Sbjct: 385 A---------AQLASVRERLLAARERRERPGRDDKIVAAWNGLAIAALAETGAYF----- 430

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
                      DR + +E A  AA  + R L+ +   RL  +   G   A  G L+DYA 
Sbjct: 431 -----------DRPDLVEAATEAADLLVR-LHMDNGGRLARTSLGGAVGAHAGVLEDYAD 478

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
           +  G L L        W+ +A  L +T    F   +G  Y   T +D   L+R  +D  D
Sbjct: 479 VAEGFLALSAVSGEGVWVDFAGLLLDTVLHHFAAEDGTLY--DTADDAEALIRRPQDPTD 536

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A PSG + +   L+  A++   S S  +R+ AE +L V    ++ +A  VP      + 
Sbjct: 537 NAVPSGWTAAAGALLSYAAV---SGSGRHREAAERALGV----VRALAGRVPRFIGWGLA 589

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFWE 667
            A   L  P  + V +VG     D +    A H +  L      VI +    ++E+   E
Sbjct: 590 VAEARLDGP--REVAVVGP----DDDPATRALHRAALLGTAPGAVIAVGAPGSDEVPLLE 643

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                            +  A VC++F+C  P  D  +L   L
Sbjct: 644 G----------RVLLEGRPAAYVCRHFTCDAPTADVAALTAKL 676


>gi|357411497|ref|YP_004923233.1| hypothetical protein Sfla_2286 [Streptomyces flavogriseus ATCC
           33331]
 gi|320008866|gb|ADW03716.1| hypothetical protein Sfla_2286 [Streptomyces flavogriseus ATCC
           33331]
          Length = 675

 Score =  320 bits (821), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 225/687 (32%), Positives = 324/687 (47%), Gaps = 75/687 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFED  VA  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++V
Sbjct: 49  SCHWCHVMAHESFEDPSVADYLNAHFVPVKVDREERPDVDAVYMEAVQAATGQGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+ + +P   GTYFPPE ++G P F+ +L  V  AW  +R+ +A+     +  L+   S
Sbjct: 109 FLTAEAEPFYFGTYFPPESRHGMPSFQQVLEGVAAAWTDRREEVAEVAGRIVRDLA-GRS 167

Query: 141 ASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            +A+   LP E  L Q  LRL     ++ YD R GGFG APKFP  + I+ +L H  +  
Sbjct: 168 LAAAEGGLPGEPELAQALLRL-----TRDYDERHGGFGGAPKFPPSMVIEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   +   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGAEG----ALQMAADSCAAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS + +G  R  EGAFY
Sbjct: 276 RVYAHLWRATGSDLARRVALETADFMVRELRTAEGGFASALDADSEDAQG--RHVEGAFY 333

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VWT  ++ ++LGE    F   Y+           +++     +G +VL  +    A  + 
Sbjct: 334 VWTPAQLREVLGEDDAAFAAEYF----------GVTEEGTFEEGSSVLRLVPAGEAEPAD 383

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                E+   + G    +L   R  RPRP  DDKV+ +WNGL I++ A            
Sbjct: 384 D----ERIAGVRG----RLLAARELRPRPERDDKVVAAWNGLAIAALAETGAYF------ 429

Query: 439 AMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAF 496
                     DR + +E A  AA   +R H+ D    RL  + ++G      G L+DY  
Sbjct: 430 ----------DRPDLVERATEAADLLVRVHMGD--VARLCRTSKDGRAGDNSGVLEDYGD 477

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +  G L L        WL +A  L +   + F   E G  F+T  +   ++ R ++  D 
Sbjct: 478 VAEGFLALASVTGEGAWLEFAGFLLDIVLQHFTG-EKGQLFDTADDAEQLIRRPQDPTDN 536

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A P+G + +   L+   S  A + S+ +R  AE +L V           +      A+ L
Sbjct: 537 ATPAGWTAAAGALL---SYAAHTGSEAHRAAAEGALGVVGALGPKAPRFIGWGLAVAEAL 593

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHIDPADTEEMDFWEEHNSNNAS 675
               R+  V               A   + +L++T ++   P     +    +  S    
Sbjct: 594 LDGPREVAV---------------AGPVAGELHRTALLGRAPGAVVAVGVGPDAGSEFPL 638

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
           +     +     A VC++F C  P TD
Sbjct: 639 LVDRPLAGGAPTAYVCRHFVCDAPTTD 665


>gi|409096974|ref|ZP_11216998.1| hypothetical protein PagrP_00615 [Pedobacter agri PB92]
          Length = 686

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 204/586 (34%), Positives = 292/586 (49%), Gaps = 56/586 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+  VA+++N  FV IKVDREERPD+D++YM  +Q + G GGWPL+  
Sbjct: 69  CHWCHVMERESFENFEVAEVMNKHFVCIKVDREERPDIDQIYMYAIQLMTGSGGWPLNCI 128

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEAL 139
             PD +P+ GGTYF   D      +  IL  V   W  + +   Q        +  SE +
Sbjct: 129 CLPDQRPIYGGTYFRKND------WVNILENVAALWSNEPEKAIQYAERLTSGIRDSEKI 182

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             S +     DE     L    E   + +D  FGG+  APKFP P     +L +   L+D
Sbjct: 183 IPSVTKEDYTDE----HLTEIIEPWKRHFDISFGGYNRAPKFPLPNNWVFLLRYG-YLKD 237

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                 A      V  TL+ M++GGI+D +GGGF RYSVD++WHVPHFEKMLYD  QL +
Sbjct: 238 DESVFTA------VCHTLEEMSRGGIYDQIGGGFARYSVDDKWHVPHFEKMLYDNAQLIS 291

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y +A+  TK   +     + ++++  +M  P G  +SA DADS   EG     EG FYV
Sbjct: 292 LYAEAYQCTKFNSFKQTAVESINWVFNEMTSPEGLFYSALDADS---EGI----EGKFYV 344

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W   E  D+LG+ A L  E++ +   GN           E +  N+L ++       SK 
Sbjct: 345 WDKTEFYDLLGDDAQLLGEYFNITEEGNW----------EEEQTNILRKILSDDDILSKH 394

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +  E     +   + KL ++R++R RP LDDK + +WNG++I + A A+ +L  +    
Sbjct: 395 NIDAETLYTKVESAKAKLLNIRNQRIRPGLDDKCLTAWNGMMIKALADAATVLSHDL--- 451

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                        Y + A +AA FI  +L    +  L  + +NG +    FLDDYAFLI 
Sbjct: 452 -------------YYQKAAAAARFILVNL-KTASGGLYRNCKNGKASITAFLDDYAFLIE 497

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
            L+ LYE+     WL  A    +   E F D E   +F T+    S++ R  E  D   P
Sbjct: 498 ALIALYEYDFDENWLNEAKSFTDYVLENFSDSESPMFFYTSATGESLIARKHEVMDNVIP 557

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
           + NS    NL +L  +      + Y   A   LA  + ++K    A
Sbjct: 558 ASNSTMAQNLTKLGLLF---DLEGYNNKAAEMLAAVQPKIKTYGSA 600


>gi|398782996|ref|ZP_10546612.1| hypothetical protein SU9_09379 [Streptomyces auratus AGR0001]
 gi|396996281|gb|EJJ07275.1| hypothetical protein SU9_09379 [Streptomyces auratus AGR0001]
          Length = 623

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 224/683 (32%), Positives = 323/683 (47%), Gaps = 70/683 (10%)

Query: 28  MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
           M  ESFED   A LLND FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +
Sbjct: 1   MAHESFEDPATAALLNDHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAE 60

Query: 88  PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEALSASASSN 146
           P   GTYFPPE ++G P F  IL  V+ AW  +RD + + +G    +    +LSAS  ++
Sbjct: 61  PFYFGTYFPPEPRHGMPSFAQILEGVRSAWADRRDEVGEVAGRIVADLAGRSLSASLPAD 120

Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
           + P    +  L      L++ +D+  GGFG APKFP P+ ++ +L H  +    G     
Sbjct: 121 RRPPRAEE--LHTALMGLTREFDAAHGGFGGAPKFPPPMVLEFLLRHHARTASAGA---- 174

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
               +MV  T   MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L   Y   + 
Sbjct: 175 ---LEMVQATCAAMARGGIYDQLGGGFARYAVDATWTVPHFEKMLYDNALLCRTYAHLWR 231

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
            T          +  D++ R++    G   SA DADS   +G  R  EGA+YVWT  ++ 
Sbjct: 232 STGSEEARRTAVETADFMVRELRTDQGGFASALDADS--DDGTGRHVEGAYYVWTPGQLR 289

Query: 327 DILGEHAILF-KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 385
            +LGE    F   H+ +   G  +           +G +VL +L D+           E+
Sbjct: 290 AVLGEEDAEFAAAHFGVTEEGTFE-----------EGASVL-QLPDTEGLVDA-----ER 332

Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
              +    R++L   R +RPRP  DDKV+  WNGL I++ A                   
Sbjct: 333 VARV----RQRLLAAREERPRPGRDDKVVACWNGLAIAALAETGAYF------------- 375

Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDL 504
              DR + ++ A  AA  + R   D Q  RL  + R+G P    G L+DYA +  G L L
Sbjct: 376 ---DRPDLIQAATDAADLLVRVHMDAQV-RLHRTSRDGTPGANSGVLEDYADVAEGFLTL 431

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
                   W+ +A  L +T   L    E G  ++T  +  +++ R ++  D A PSG + 
Sbjct: 432 ASVTGEGVWVEFAGFLLDTV-LLQFTTEDGALYDTAADAEALIRRPQDPTDNATPSGWTA 490

Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
           +   L+  A++   + S  +R  AE +L +  T L   A        A    ++   + V
Sbjct: 491 AAGALLSYAAL---TGSGRHRDAAERALGIV-TALAGRAPRFIGWGLAVAEAALDGPREV 546

Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD 684
            +VG         +  AA         V    P             ++   + +N    D
Sbjct: 547 AVVGPPGDPATAALHHAALLGTAPGAVVAMGAP------------GADEVPLLQNRPLVD 594

Query: 685 -KVVALVCQNFSCSPPVTDPISL 706
            K  A VC++F+C  P TDP  L
Sbjct: 595 GKPAAYVCRHFTCERPTTDPAEL 617


>gi|340619141|ref|YP_004737594.1| hypothetical protein zobellia_3176 [Zobellia galactanivorans]
 gi|339733938|emb|CAZ97315.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 703

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 224/687 (32%), Positives = 340/687 (49%), Gaps = 86/687 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVME E+FE+E VAK++N+ F++IKVDREERPDVD+VYMT +Q + G GGWPL+
Sbjct: 84  SSCHWCHVMEDETFENEEVAKIMNENFINIKVDREERPDVDQVYMTALQLISGSGGWPLN 143

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V   P+ KPL GGTY      + R  +  +L K+ +        L ++     E+ S+ +
Sbjct: 144 VITLPNGKPLYGGTY------HTREQWMQVLTKISE--------LYKNDPKKAEEYSDMV 189

Query: 140 SASASSNKLP------DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
           +A  +   L       + + + AL+      S ++D   GG     KF  P  +  +L +
Sbjct: 190 AAGIAEANLVEPAKGFESITKEALKTSVANWSPNWDLEEGGEKGVQKFMIPSNLSFLLDY 249

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
           +    D        + ++ V  TL  MA GG++D +GGGF+RYS D  W VPHFEKMLYD
Sbjct: 250 AVLTGD-------DKAKRHVRNTLDKMALGGVYDQIGGGFYRYSTDAFWKVPHFEKMLYD 302

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
             Q+ ++Y  A++L KD  Y  +  + +D+L R+M    G   +A DADS   EG    +
Sbjct: 303 NAQVLSLYSKAYTLFKDDAYKNVVWETIDFLDREMKDTNGGYHAALDADS---EG----E 355

Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           EG FYVW  +E++ +LGE   LF  +Y +      +            GK VL    D +
Sbjct: 356 EGKFYVWKEEELKSVLGEGFELFSAYYNINKEAVWE-----------DGKYVLHRKVDDA 404

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
               +  +   K   I  E  +KL   R+KR  P  DDK+I SWN L+++ F  A K   
Sbjct: 405 EFVKEHDIEQGKLNFIKSEWNKKLLAERNKRVFPRSDDKIITSWNALLVNGFVDAYKAF- 463

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                           +K ++E AES  SFIR + Y  Q  +L H+F+ G  +  GF++D
Sbjct: 464 ---------------GQKRFLEKAESVFSFIRSNAY--QNGKLVHTFKKGSKRKEGFIED 506

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YAF+I   L+LY     T++L +A EL    +  F D   G Y    G D  ++ R+ + 
Sbjct: 507 YAFMIDASLELYGLTLNTEYLDFAKELNAKAEAGFADEASGMYHYNEGND--LIARIIKT 564

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            DG  PS N+V   NL RL  +             +++    E   + ++  VP +  +A
Sbjct: 565 DDGVLPSPNAVMAHNLFRLGHL-------------DYNTGYTEKAKRMLSAMVPALTESA 611

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
                 S+ + +L+ H     FE  +    A   L K +  I   +T  +    E   +N
Sbjct: 612 PSY---SKWNALLLNHTYPY-FEIAVVGKDAEV-LIKALNEIHLPNTLVVGSKVE---SN 663

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPV 700
           A + ++ + AD     VC+N +C  PV
Sbjct: 664 APLFKDRYVADGTFIYVCRNTTCKLPV 690


>gi|420252291|ref|ZP_14755426.1| thioredoxin domain protein [Burkholderia sp. BT03]
 gi|398055929|gb|EJL47977.1| thioredoxin domain protein [Burkholderia sp. BT03]
          Length = 664

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 247/706 (34%), Positives = 344/706 (48%), Gaps = 111/706 (15%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE+  +A L+N+ +VSIKVDR+ERPD+D++Y    Q +  GGGWPL+V
Sbjct: 49  ACHWCHVMAHESFENPRIASLMNERYVSIKVDRQERPDIDEIYQQVSQMMGQGGGWPLTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P  +P  GGTYFPP+D+YGRP F  +L  + +AW  + D L  +    I Q+ +   
Sbjct: 109 FLTPQGEPFFGGTYFPPDDRYGRPAFARVLIALSEAWRHRHDELRDT----IVQIQQGFR 164

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
               + + P    ++     A  L++  D   GG G APKFP P    +ML   ++    
Sbjct: 165 QLDQAQQGPTAAVEDLPAQTARALTRDTDPAHGGLGGAPKFPNPSCYDLMLRVYER---- 220

Query: 201 GKSGEASEGQKMVLF-----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
                    ++  LF     TL  MA GGI+D VGGGF RYSVD  W VPHFEKMLYD G
Sbjct: 221 --------SREPTLFDALERTLDHMAAGGIYDQVGGGFARYSVDAHWAVPHFEKMLYDNG 272

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           QL  +Y DA+ LT    +  I  + L Y+ RDM  P G  +++EDADS   EG    +EG
Sbjct: 273 QLVKLYADAYRLTGKRTWRRIFEETLAYILRDMTHPEGGFYASEDADS---EG----QEG 325

Query: 316 AFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---IELND 371
            FY W   E++ +LGE    L    Y +   GN +            G  VL   +EL+ 
Sbjct: 326 KFYCWMPAEIKAVLGESEGALACRAYGVTERGNFE-----------HGATVLHRAVELD- 373

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
                      LE+    L   R +L   R++R RP  DD ++  WNGL+I+    A   
Sbjct: 374 ----------ALEE--TQLAGWRERLLAARARRVRPARDDNILTGWNGLMIAGLCAA--- 418

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKAPG 489
                      F   G    EY+  A+ AA+FI   L   D    R+   +++G +K PG
Sbjct: 419 -----------FQATGV--PEYLSAAKRAANFIGNELTLADGGVFRV---WKDGVAKVPG 462

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR--EGGGYFNTTGEDPSVL 547
           FL+DYAFL + LLDLYE     ++L  AIEL      L LD+  E G YF     +P ++
Sbjct: 463 FLEDYAFLCNALLDLYESCFDRRYLDRAIELAT----LILDKFWEDGLYFTPCDGEP-LV 517

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
            R +  +D A PSG S S    VRL ++   +  D Y   AEH    +ET    +  A  
Sbjct: 518 HRPRAPYDSASPSGISSSAFAFVRLHAL---TGRDLYLDRAEHEFRRYETAAGSVPSAFA 574

Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
            +  A D +     + +V  G K S     +    H +Y L   V+              
Sbjct: 575 HLIAARDFVQRGPLE-IVFAGEKYSAAV--LATGVHRAY-LPARVLAF------------ 618

Query: 668 EHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
              + +  + R     D +  A VC+N +C+ P+T+     N LLE
Sbjct: 619 ---AEHVPIGRECHPVDGRAAAYVCRNRTCAAPMTE----GNALLE 657


>gi|342883561|gb|EGU84024.1| hypothetical protein FOXB_05444 [Fusarium oxysporum Fo5176]
          Length = 870

 Score =  319 bits (818), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 210/661 (31%), Positives = 335/661 (50%), Gaps = 100/661 (15%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+C +M +E+F +   A +LN+ F+ + VDREERPD+D +YM YVQA+   GG
Sbjct: 208 HIGYKACHFCRLMSIETFSNPDSASVLNESFIPVIVDREERPDLDAIYMNYVQAVSNVGG 267

Query: 76  WPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFK--------------TILRKVKDAW---- 117
           WPL+VFL+P+L+P+ GGTY+     +G  G +              TI +KV+D W    
Sbjct: 268 WPLNVFLTPNLEPVFGGTYW-----FGPAGRRHLSDDSTEEVLDSLTIFKKVRDIWIDQE 322

Query: 118 ----DKKRDMLAQSGAFAIEQL----------------------SEALSASASSNKLPDE 151
                +  +++ Q   FA E                        S A +A   S  + +E
Sbjct: 323 ARCRKEATEVVGQLKEFAAEGTLGTRSISAPSALGPAGWGAPAPSHASTAKEKSTAVSEE 382

Query: 152 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK---LEDTGKSGEASE 208
           L  + L      ++ ++D  FGGFG APKF  P ++  +L   K    ++D     E   
Sbjct: 383 LDLDQLEEAYTHIAGTFDPVFGGFGLAPKFLTPPKLAFLLGLLKSPGAVQDVVGEAECKH 442

Query: 209 GQKMVLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
             ++ L T++ +  G +HDH+GG GF R SV   W +P+FEK++ D  QL ++Y+DA+ +
Sbjct: 443 ATEIALDTMRHIRDGALHDHIGGTGFSRCSVTADWSIPNFEKLVTDNAQLLSLYIDAWKV 502

Query: 268 T----KDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 322
           +    KD F   +  ++ +YL    ++ P G   S+E ADS   +G   K+EGA+YVWT 
Sbjct: 503 SGGGEKDEFLDVVL-ELAEYLTSSPIVLPEGGFASSEAADSYYRQGDKEKREGAYYVWTR 561

Query: 323 KEVEDILGE----HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           +E + +L E     + +   ++ +   GN +    SDP+++F  +N+L   +     +++
Sbjct: 562 REFDSVLDEIDSHMSPILASYWNVNQDGNVE--EESDPNDDFIDQNILRVKSTIEQLSTQ 619

Query: 379 LGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
              P+EK    + + RR L   R + R RP LDDK++V WNGLVIS+ ++A+  LK+   
Sbjct: 620 FSTPVEKIKEYIEQGRRALRKRREQERVRPDLDDKIVVGWNGLVISALSKAASSLKT--- 676

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                  +      +   +AE AA+ IR+ L+D    R+ +   +G      F DDYA++
Sbjct: 677 -------LRPEQSSKCRAIAEQAAACIRKKLWD-GNERILYRIWSGGRGNTAFADDYAYM 728

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQN-------------------TQDELFLDREGGGYFN 538
           I GLLDL E     ++L +A  LQ                    TQ  LF D + G +F+
Sbjct: 729 IQGLLDLLELTGNQEYLEFADILQRESSQFPSHLTHPADHAITETQTSLFYDAD-GAFFS 787

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
           T    P  +LR+K+  D + PS N+VSV NL RLA++++   +D     A  ++  FE  
Sbjct: 788 TQANSPYTILRLKDGMDTSLPSTNAVSVANLFRLANLLS---NDDLAAKARQTINAFEVE 844

Query: 599 L 599
           +
Sbjct: 845 V 845


>gi|23100033|ref|NP_693499.1| hypothetical protein OB2578 [Oceanobacillus iheyensis HTE831]
 gi|22778264|dbj|BAC14534.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
          Length = 691

 Score =  319 bits (818), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 223/709 (31%), Positives = 342/709 (48%), Gaps = 78/709 (11%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G ++F    K ++  FL    ++C WCH M  ESF D+ VA LLN ++VSIKVDREERPD
Sbjct: 32  GEKAFNKARKEQKPIFLSIGYSSCTWCHNMNRESFMDQEVAALLNQYYVSIKVDREERPD 91

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           +D +YM   Q + G GGWPL++ ++ D  P   GTYFP    YG PG   IL  +   + 
Sbjct: 92  IDGLYMKACQMMTGHGGWPLTIIMTDDQVPFFAGTYFPKHQNYGLPGLMDILPTIAKKYA 151

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           +    +A+     ++++ +AL  + S         ++++R   +QL++ +D  +GGF   
Sbjct: 152 EDPQQIAE----YMKKVEDALQDTLSKKSNESLTSEDSVR-TYQQLNELFDYPYGGFYKE 206

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP P  +  ++++  K  D           KMV  TL+ + +    DHVG G  RY+ 
Sbjct: 207 PKFPSPHNLSFLIHYYYKTGD-------KNALKMVDMTLKSIFQSSTWDHVGFGVFRYAT 259

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           D +W  PHFEKMLYDQ  L +V +D F +TKD FY     +I+ +++R+M    G  +++
Sbjct: 260 DRKWMFPHFEKMLYDQAFLLDVSVDMFLITKDPFYQLKVNEIIQFVKREMTAENGCFYAS 319

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
             ADS         +EGA+Y+W+ +E+  ILGE    LF E Y + P G           
Sbjct: 320 LSADS-------NGEEGAYYLWSLEEIYSILGEDEGDLFAEAYGIVPVG----------- 361

Query: 358 NEFKGKNVLIELNDSSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
              +GKN+      S  S AS  G+ +EK    L +   KL   R  R  P  DDK++ S
Sbjct: 362 -VHQGKNLPYRSGISLESLASTYGIQVEKVKTTLTKSVDKLQKARLLRTAPATDDKILTS 420

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
           WNG +I++ A+A  + + E                 ++  A +    +   L  +  +R 
Sbjct: 421 WNGYMIAALAKAGSVFQEE----------------NWINHAINTMKNLSDILIKD--NRW 462

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
             ++R G +   GFLDDYA ++ G ++L++       L  A  + N   +LF D   GG+
Sbjct: 463 FANYRQGKTNTKGFLDDYAAILWGYIELHQATMEIDHLKKAKTIANDMIKLFWDSNDGGF 522

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           F    +   ++ R KE +D   PSGNS++ I L RLA++  G  S  Y    +  +  F 
Sbjct: 523 FFVANDAEQLISREKEIYDSPIPSGNSLASIQLSRLANLT-GEMS--YYSYVDTMMYTFY 579

Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
             L+D              L     K V+++G  +   F ++       Y  N   IHI 
Sbjct: 580 RELQDEPSGASFFMRNL-FLQQDQTKQVIIIGENTEAFFNHI----RKRYLPN---IHII 631

Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKV----VALVCQNFSCSPPVT 701
            A TE        +S+ A++  N  +  KV       VC NF C+ P T
Sbjct: 632 SA-TE--------SSSLATLLPNGENYKKVNGQTTYYVCSNFHCNRPTT 671


>gi|428781674|ref|YP_007173460.1| thioredoxin domain-containing protein [Dactylococcopsis salina PCC
           8305]
 gi|428695953|gb|AFZ52103.1| thioredoxin domain protein [Dactylococcopsis salina PCC 8305]
          Length = 678

 Score =  319 bits (818), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 220/619 (35%), Positives = 319/619 (51%), Gaps = 76/619 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ LN+ F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDSTIAQYLNENFIPIKVDREERPDLDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P D  P  GGTYFP E +YGRPGF  IL+ ++  +D++++ L    +F  E ++  
Sbjct: 108 IFLTPHDRVPFYGGTYFPLEPRYGRPGFLQILQAIRRFYDQEKEKL---NSFKGEVMT-L 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG---GFGSAPKFPRPVEIQMMLYHSK 195
           L  SA+       LP +   L  E L K  ++  G     G+ P FP     Q+    ++
Sbjct: 164 LQRSAT-------LPSSETPLNRELLIKGLETAVGITSSRGTPPSFPMIPHAQLARRKTQ 216

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             +++    EA   Q+ +  TL     GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 217 FSDESRYDAEAITTQRGMDLTL-----GGIYDHVGGGFHRYTVDGTWTVPHFEKMLYDNG 271

Query: 256 QLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q+     + +S  + +  F S I   +  +L+R+M  P G  ++++DADS  T      +
Sbjct: 272 QIMEYLANLWSSGVKEPAFASAIAHAV-QWLQREMTAPEGYFYASQDADSFTTSEEAEPE 330

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI----- 367
           EGAFYVW+ +E+E +L  E     +  + +   GN            F+G NVL      
Sbjct: 331 EGAFYVWSYQELESLLTPEELNALQSEFTVTSEGN------------FEGNNVLQRQTGG 378

Query: 368 ELNDSSASASK---------LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
           EL+  S +A K         L  P+  +         K      + P P  D K+I +WN
Sbjct: 379 ELSSPSETALKKLFNARYGNLSSPVTPFPPATNNTEAKQTAWEGRIP-PVTDTKMITAWN 437

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQ 477
            L+IS  ARA              + V G   K Y E A  AA+FI  + +   + +RL 
Sbjct: 438 SLMISGLARA--------------YAVFG--EKTYWECAVKAANFIGENQWVAGRFYRLN 481

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLY-EFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
           +   +G +      +DYA  I  LLDLY      T+WL  A +LQ T DE     E GGY
Sbjct: 482 Y---DGKATVSAQSEDYALFIKALLDLYCCHPEQTQWLDQATQLQATFDEYLWSSETGGY 538

Query: 537 FNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           FNT  ++ S +++R +   D A P+ N V+V NLVRL  +    K+DY   +AE +L  F
Sbjct: 539 FNTAKDNSSDLIIRERTYIDNATPAANGVAVANLVRLFELT--EKTDYV-ASAEKTLQAF 595

Query: 596 ETRLKDMAMAVPLMCCAAD 614
            + ++    A P +    D
Sbjct: 596 SSIMEQSPQACPGLFSGLD 614


>gi|23014746|ref|ZP_00054548.1| COG1331: Highly conserved protein containing a thioredoxin domain
           [Magnetospirillum magnetotacticum MS-1]
          Length = 671

 Score =  319 bits (817), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 230/696 (33%), Positives = 334/696 (47%), Gaps = 75/696 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDEG+A L+ND F++IKVDREERPD+D +Y   +  +   GGWPL+
Sbjct: 49  SACHWCHVMAHESFEDEGIAGLMNDLFINIKVDREERPDLDALYQNALGLIGQHGGWPLT 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+PD +P  GGTYFP + +YGR  F  +L  +  ++ K  D +  +    + ++ E+L
Sbjct: 109 MFLTPDAEPFWGGTYFPAQARYGRAAFPDVLEGISHSFHKDPDKIGHN----VARIRESL 164

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
              A S   P  L    + L A Q  +  D   GG   APKFP+P   +  L+HS     
Sbjct: 165 EQMARSPG-PLSLDMEVVDLGAAQCLRLIDFEDGGTVGAPKFPQPGLFR-FLWHSYL--- 219

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             ++G +S  +  V  TL  + +GGI+DH+GGGF RYS DE W VPHFEKMLYD  QL +
Sbjct: 220 --RTGNSSL-KDAVTVTLDHICQGGIYDHLGGGFMRYSTDETWLVPHFEKMLYDNAQLVS 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +    +  T    Y     + + +L RDM+  GG   +A DADS   EG    +EG FY 
Sbjct: 277 LLTKVWKQTGSPLYRARIFETVGWLLRDMMAEGGAFAAALDADS---EG----EEGLFYT 329

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WTS+E+  +L  E A  F   Y ++  GN            ++G+N+L   N        
Sbjct: 330 WTSEELSALLDIETATRFGHLYGVQAHGN------------WEGRNIL-HRNHPRGGGDD 376

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                    + L E +  L   R KR  P  DDKV+  WN ++I++ A A+         
Sbjct: 377 ---------HDLAEAKMVLLAERDKRIWPGRDDKVLADWNAMMITALAEAALTF------ 421

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     DR +++  AE A   I   +      R  HS   G ++    LDDYA+ I
Sbjct: 422 ----------DRPDWLAAAEHAFQVITTRMVRPDG-RPAHSLCRGRAETNAVLDDYAWAI 470

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              L LYE  +G ++L  AI           D +GGGYF +  +   V++R K   D A 
Sbjct: 471 FAALTLYETTTGPEYLDQAIAWAEQVHAHHWDGQGGGYFLSADDATDVVIRTKPAFDSAV 530

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN V    L RL  +V G +   +R+ A+   AV +     M   +P M    D  ++
Sbjct: 531 PSGNGVMAEVLARL-WLVTGEER--WRERAQ---AVIDAFGAAMPEQIPHMTSLLDAFAI 584

Query: 619 PSRK-HVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
            +    VV+VG         +L A  A+     +++ +   +   +     H ++  S+ 
Sbjct: 585 LAEPLQVVIVGPLDDPGGLALLRAFAATSLPPASLLRVQDGNALPVG----HPAHGKSLV 640

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
                     A +C+  +C  PVTD   L   L EK
Sbjct: 641 DGC-----AAAYICRGSTCRAPVTDSDRLMAQLCEK 671


>gi|227537485|ref|ZP_03967534.1| possible thioredoxin [Sphingobacterium spiritivorum ATCC 33300]
 gi|227242622|gb|EEI92637.1| possible thioredoxin [Sphingobacterium spiritivorum ATCC 33300]
          Length = 672

 Score =  319 bits (817), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 190/567 (33%), Positives = 282/567 (49%), Gaps = 57/567 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE++ +A+ +N ++V +K+DREERPD+D++YMT VQ +   GGWPL+
Sbjct: 48  SACHWCHVMERESFENDAIAQTMNKFYVPVKIDREERPDIDQIYMTAVQLMTNAGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
               PD +P+ GGTYF P D      ++ IL ++   W+++  +  +        + +  
Sbjct: 108 CICLPDGRPIYGGTYFKPHD------WQNILLQIAQMWEEQPQVAIEYATKLTNGIQQ-- 159

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S     N +PD+   + L          +D++ GG+  APKFP P     +L        
Sbjct: 160 SERLPINPIPDQYDSSDLSAIITPWVALFDTKDGGYNRAPKFPLPNNWIFLL-------- 211

Query: 200 TGKSGEASEGQKM---VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
             + G  +  +K+   V FTLQ MA GGI+D +GGGF RYSVD  WH+PHFEKMLYD GQ
Sbjct: 212 --RYGVLAGDEKIIDHVHFTLQKMASGGIYDQIGGGFARYSVDPYWHIPHFEKMLYDNGQ 269

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L +++ +A+      FY  I ++ + +  R+M+ P    + A DADS   EG     EG 
Sbjct: 270 LLSLFSEAYQQRPSPFYKRIVQETIQWANREMLAPNNGFYCALDADS---EGV----EGK 322

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           +Y ++  E+EDILGE A LF  ++ +   GN             +  N+ I   D+   A
Sbjct: 323 YYSFSKSEIEDILGEDAPLFISYFNITEEGNW----------AEESTNIPILDPDADQMA 372

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G   E++   L E + KL+  R  R RP LD K + +WN L++     A +I     
Sbjct: 373 LDAGYSAEEWETCLAEAKEKLYSYRETRIRPGLDHKQLATWNALMLKGLTDAYRIF---- 428

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       D   Y++ A   A FI   L  +   R+ H  ++   +  GFLDDYAF
Sbjct: 429 ------------DNSSYLDTAIKNAHFIIDELI-KSDGRILHQPKDANREIFGFLDDYAF 475

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
                + LYE     KWL  A +L +   ELF D     ++ T      ++ R  E  D 
Sbjct: 476 TTEAFIALYEATFDEKWLDLARQLADKALELFYDSNQKTFYYTADSSGELIARKSEIMDN 535

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDY 583
             P+  S  V+ L +L  +    K DY
Sbjct: 536 VIPASTSTIVLQLKKLGLLF--DKEDY 560


>gi|381211526|ref|ZP_09918597.1| hypothetical protein LGrbi_16484 [Lentibacillus sp. Grbi]
          Length = 582

 Score =  318 bits (816), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 217/637 (34%), Positives = 317/637 (49%), Gaps = 76/637 (11%)

Query: 72  GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 131
           G GGWPLS+F++PD  P   GTYFP   KYG PG   +L ++ + + ++ D + +     
Sbjct: 4   GQGGWPLSIFMTPDKVPFYAGTYFPRVSKYGMPGIMDVLTQLYERYKQEPDHIDEVTKSV 63

Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
            + L + ++A  S N+L  E+     +    QL K +D  +GGFGSAPKFP P   Q +L
Sbjct: 64  TDALEKTVTAK-SENRLTQEMTDKVFK----QLGKRFDFTYGGFGSAPKFPTP---QNLL 115

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           Y  +    TG +       KM   TLQ MAKGGI+DHVG GF RYS DE+W VPHFEKML
Sbjct: 116 YLLRYYHFTGNTA----ALKMTESTLQAMAKGGIYDHVGFGFARYSTDEKWLVPHFEKML 171

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD   L   Y + + +TK+  Y  I   I+ ++ R+M    G   SA DADS   EG   
Sbjct: 172 YDNALLLMAYTECYQITKNPLYKTISEQIITFVVREMHCSEGGFNSAIDADS---EGI-- 226

Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
             EG +YVW   E+ +ILGE    ++   Y + P GN            F+GKN+   LN
Sbjct: 227 --EGKYYVWDYDEIFNILGEELGDIYAAVYGITPDGN------------FEGKNIPNLLN 272

Query: 371 -DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
            DS A A    M + +  + L E R +L   R KR  PH+DDK++ SWN ++I++ A+A 
Sbjct: 273 TDSEAIAKANDMSVSELHHRLDEAREQLLSAREKRVYPHVDDKILTSWNSMMIAALAKAG 332

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
           K                     +Y + AE++ +FI ++L   Q  R+   +R+G  K  G
Sbjct: 333 KAFA----------------EPKYTKAAENSMNFIEQNLI--QNGRVMARYRDGEVKYNG 374

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
           +LDDYAFL+    +LYE     K+L  A  L N   +LF D + GG+F    +   +L R
Sbjct: 375 YLDDYAFLLWAYTELYETTFSLKYLKQARTLANDMIDLFWDNDQGGFFFNGHDSEELLSR 434

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
            K  +DGA PSGN V+ + LV++  +     +DY  +  E     +E  ++     V  +
Sbjct: 435 EKAVYDGALPSGNGVAGVMLVKMGYLTG--DTDYLDKLEEMYHTFYEDIIQVPVAGVHFI 492

Query: 610 CCAADMLSVPSRKHVVLVGHKS--SVDFENMLAAAHASYDLNKTVIHIDPADT--EEMDF 665
                ML     K VV++G  +  +VD +            + T++  + AD   E   F
Sbjct: 493 QSL--MLMENPTKEVVVLGESNPFTVDLQQTFLP-------DVTLLAGNNADKLGEVAPF 543

Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
             E+   + ++             VC+NF+C  P TD
Sbjct: 544 VSEYRQLDNAL----------TIYVCENFACHQPTTD 570


>gi|284037137|ref|YP_003387067.1| hypothetical protein Slin_2247 [Spirosoma linguale DSM 74]
 gi|283816430|gb|ADB38268.1| protein of unknown function DUF255 [Spirosoma linguale DSM 74]
          Length = 700

 Score =  318 bits (816), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 207/575 (36%), Positives = 299/575 (52%), Gaps = 60/575 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE E VA+++N  FV IKVDREERPDVD +YM  VQA+   GGWPL+
Sbjct: 48  SACHWCHVMERESFEKEAVAQVMNKHFVCIKVDREERPDVDAIYMDAVQAMGVQGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIE-QLSE 137
           VFL PD KP  G TY P ++      +  +L  + +A+++ R  LAQS   FA E  LS+
Sbjct: 108 VFLMPDAKPFYGVTYLPQKN------WVNLLESIDNAFNEHRADLAQSAEGFARELNLSD 161

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           A     + N  P   P+  L +   +++   D   GG   APKFP P   + +L +    
Sbjct: 162 AERYGLTQND-PLFAPET-LAVLYRKVAVKADDEKGGMRRAPKFPMPSVWRFLLRYYAVA 219

Query: 198 EDTGKSGEAS----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
             + +  EA+    +   +V  TL  MA GGI+D +GGGF RYS D  W  PHFEKMLYD
Sbjct: 220 SSSRQIAEAADTSDQALNLVRITLDRMALGGIYDQLGGGFARYSTDADWFAPHFEKMLYD 279

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
            GQL  +Y +A+SLTK   Y ++    + + +R+++ P G  +SA DADS   EG     
Sbjct: 280 NGQLLTLYSEAYSLTKSKLYKHVVYQTIAFAQRELLSPEGGFYSALDADS---EGV---- 332

Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           EG FY +T+ E+++ILG     F + Y +   GN +            G+N+L  +    
Sbjct: 333 EGKFYTFTTPELKEILGADFDWFADLYSISENGNWE-----------HGRNILHRIEADD 381

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
             A+++G  +      L     +L  VR++R RP LDDK++ SWNGL++     A ++  
Sbjct: 382 EFAARMGWSVADLNVRLDATHTRLLRVRNERIRPGLDDKILCSWNGLMLKGLVTAYRV-- 439

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-----NGPSKAP 488
                  F  P       E++ +A   A F+ + + D +  RL H+++      G ++  
Sbjct: 440 -------FGEP-------EFLTLALRLAYFLLKKMRDSRNGRLWHTYKVSEGGTGRARQA 485

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ----NTQDELFLDREGGG---YFNTTG 541
           GFLDDYA +I GLL LY+      WL  A +L         +L +D   G     F T  
Sbjct: 486 GFLDDYAAVIDGLLALYQATFTRNWLTEADQLMQYVLTNFADLSVDELTGPEPLLFFTDK 545

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
               ++ R KE  D   PS NS+   NL  L+ ++
Sbjct: 546 NSEELIARRKELFDNVIPSSNSMMAENLYVLSLLL 580


>gi|55980955|ref|YP_144252.1| hypothetical protein TTHA0986 [Thermus thermophilus HB8]
 gi|55772368|dbj|BAD70809.1| conserved hypothetical protein [Thermus thermophilus HB8]
          Length = 642

 Score =  318 bits (816), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 219/587 (37%), Positives = 300/587 (51%), Gaps = 75/587 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESF+DE VA+LLN  FV +KVDREERPDVD  YM  + +L G GGWP+S+
Sbjct: 49  SCHWCHVMHRESFQDEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSL 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ KP  GGTYFP ED+ G PGFK +L  V +AW  KR+ + +      E+L+ AL 
Sbjct: 109 FLTPEGKPFFGGTYFPKEDRMGLPGFKRVLVAVAEAWAGKREAILEEA----ERLTRALW 164

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            S S       LP+ A     + L +++D  +GGF  APKFP+   +  +L  + + E+ 
Sbjct: 165 KSLSPPP--GPLPEGAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE- 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     +++  TL+ MA GG++D VGGGFHRYSVD  W +PHFEKMLYD   LA V
Sbjct: 222 -------RAARLLRPTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARV 274

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           YL A+ L  +  +  + R+ LD+L       GG   +A D   AE+EG    +EG +Y W
Sbjct: 275 YLGAYKLFGEDLFLRVARETLDWLLSMQRREGG-FHTALD---AESEG----EEGRYYTW 326

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  E+ + LGE   L + ++ L      DL            ++VL    ++ A  + LG
Sbjct: 327 TEAELREALGEDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEARKA-LG 371

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
              E +       R KL   R +R  P LDDKV+  W+ L + + A A ++   E     
Sbjct: 372 ---EGFFAWREGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEE----- 423

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                       Y+E A+  A F+  H+Y E    L+H++R G      +L D AF    
Sbjct: 424 -----------RYLEAAKRGARFLLAHMYREGL--LRHTWR-GSLGEEAYLSDQAFAALA 469

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            L+LY       +L WA  L      LF  REG          PS+ L  KE  +GA PS
Sbjct: 470 FLELYAATGEWPYLDWAQRLAEAGWRLF--REG----------PSLPLPAKEVEEGALPS 517

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
           G S     LVRL ++  G     YR+ AE  LA     L     A+P
Sbjct: 518 GESALAEALVRLGAVFGGD----YRERAEEVLAEKARWLARYPHALP 560


>gi|271969730|ref|YP_003343926.1| hypothetical protein [Streptosporangium roseum DSM 43021]
 gi|270512905|gb|ACZ91183.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021]
          Length = 682

 Score =  318 bits (816), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 232/713 (32%), Positives = 328/713 (46%), Gaps = 109/713 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDEG A L+N+ FV++KVDREERPDVD VYM   QA+ G GGWP++
Sbjct: 47  SACHWCHVMAHESFEDEGTAALMNEHFVNVKVDREERPDVDAVYMAATQAMTGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF +P   P   GTYFP      RP F+ +L  V +AW+  R+ + +  +  +E L+E  
Sbjct: 107 VFATPGGHPFYTGTYFP------RPQFQRLLAGVSNAWNGDREAVLEQSSKIVEALNERS 160

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +  +     PD L +       + LS+S+D   GGFG APKFP  + ++ +L +    E 
Sbjct: 161 ALPSGPLPTPDTLAR-----AVQSLSRSFDQVRGGFGGAPKFPPSMALEFLLRYGAAAEP 215

Query: 200 -TGKSGEASEGQK-----------------MVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
            TG  G   E ++                 M   TL+ MA+GGI+D +GGGF RYSVD  
Sbjct: 216 RTGAEGGEPEDRREPGAGAGAGAGAPTATAMAGRTLEAMARGGIYDQLGGGFARYSVDAD 275

Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
           W VPHFEKMLYD   L  VY   + LT       +  +  D+L  +M  P G   SA DA
Sbjct: 276 WVVPHFEKMLYDNALLLRVYAHWWRLTGSALGRRVALETADWLLAEMRTPEGGFASALDA 335

Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH----AILFKEHYYLKPTGNCDLSRMSDPH 357
           DS   EG     EG FY WT +E+ ++LGE     A+   E       G   L  +SDP 
Sbjct: 336 DS---EGV----EGKFYAWTPEEIHEVLGEEDGAWAVALYEVTGTFEHGTSVLQLLSDP- 387

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
                       +D+  SA                 R +L   R+ R RP  DDKV+ +W
Sbjct: 388 ------------DDAERSA---------------RVRAELLAARAHRVRPGRDDKVVAAW 420

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL I++ A    +                 DR + +E A +AA  +     D    RL 
Sbjct: 421 NGLAIAALAETGALF----------------DRPDLVEAARAAAVLLDGSHMDGD--RLL 462

Query: 478 HSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
            + R+G + A  G L+DYA L  GLL LY      +W   A  L  T  + F D   GG+
Sbjct: 463 RTSRDGRAGANAGVLEDYADLAEGLLTLYGVTGEVRWFHRAGALLETVLDRFADGS-GGF 521

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF- 595
           F+T  +   +  R ++  D A PSG   +   L+  A++   ++     + A  ++ V  
Sbjct: 522 FDTADDAERLFQRPQDPTDNATPSGQFAAAGALLSYAALTGSARHREAAEAALGTVTVLA 581

Query: 596 --ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
               R     +AV     A   +S P    +V       +D         A+  L++T +
Sbjct: 582 DKHARFAGWGLAV-----AQAAVSGPVEAAIV-----GPLD-------DPATSALHRTAL 624

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
            + PA    +   E  ++    +           A VC+ F+C  PVT P  L
Sbjct: 625 -LSPAPGLVVALGEPGSAEVPLLEGRGLLDGAPAAYVCRGFTCRMPVTTPAGL 676


>gi|85817359|gb|EAQ38539.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
          Length = 705

 Score =  318 bits (816), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 212/685 (30%), Positives = 337/685 (49%), Gaps = 75/685 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVME ESFED  VA+ +N+ F++IKVDREERPDVD VYM  VQ + G GGWPL+ 
Sbjct: 79  SCHWCHVMEHESFEDTLVAQFMNENFINIKVDREERPDVDNVYMNAVQLMTGRGGWPLNA 138

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
              PD +P+ GGTYF  ED      +   L +V D +    + L +        L++   
Sbjct: 139 VALPDGRPVWGGTYFSKED------WLNALGQVADIYTSDPNKLVEYADKLGTGLAQMDL 192

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            + + NK       + L+   E+ S+ +D+R GG   APKF  P   + +L ++ +  D 
Sbjct: 193 VTPNPNK--PSFVIDTLQTSIEKWSRQWDTRQGGLNRAPKFMMPNNYEFLLRYAHQNND- 249

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                  E  + V  TL+ +A GG++D VGGGF RYSVD +WH+PHFEKMLYD  QL ++
Sbjct: 250 ------DEILEYVNTTLEQIAFGGVNDQVGGGFARYSVDTKWHIPHFEKMLYDNAQLVSL 303

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y +A+  TK+  Y     + L++++R+M    G  +SA DADS   +G    +EGA+YVW
Sbjct: 304 YSNAYLKTKNPLYKETVYETLEFIKREMTTSQGGFYSALDADSLTPDGEL--EEGAYYVW 361

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T +E+++++G+   LF  +Y +      D  +  + H       VLI  +  +    +  
Sbjct: 362 TEEELKNLVGDDFKLFSAYYNIN-----DYGKWENDH------YVLIRQDLDTDFVKEHQ 410

Query: 381 MPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           + LE+      + R  L   R SK+ +P LDDK++ SWNGL+   +  A ++        
Sbjct: 411 ISLEELTTKKSKWREDLLRFRESKKEKPRLDDKILTSWNGLMTKGYVDAYRVF------- 463

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                    D KE+++ A   A+F+  +L   +   L  ++++G S    +L+DYA  I 
Sbjct: 464 ---------DEKEFLDAALKNANFVVDNLL-RKDGGLNRTYKDGKSTINAYLEDYAATID 513

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             + L+E     +WL  A  L +     F + E   ++ T+ EDP++  R  E +D   P
Sbjct: 514 AFIALFEVTMDEQWLEKAKSLTDYTFTHFQNAENKLFYFTSNEDPTLSSRNTEFYDNVIP 573

Query: 560 SGNSVSVINLVRLASIVAGSKSDYY--RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
           S NS+   N+  L        S YY  +   + + A+      +   +        D++ 
Sbjct: 574 SSNSIMAKNIFTL--------SHYYLDKTYTDTAAAMLNNMQPNFTQSPTSFSNWMDLML 625

Query: 618 VPSRKH--VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
             ++ +  +V+VG     D +N+LA     Y  NK +     A  +E             
Sbjct: 626 NYTKPYYELVVVGP----DAQNILAELEQEYLPNKLIAATTTASKQE------------- 668

Query: 676 MARNNFSADKVVALVCQNFSCSPPV 700
           +    +   + +  VC N +C  PV
Sbjct: 669 IFEGRYLEGETLIYVCVNNACKLPV 693


>gi|347535413|ref|YP_004842838.1| hypothetical protein FBFL15_0482 [Flavobacterium branchiophilum
           FL-15]
 gi|345528571|emb|CCB68601.1| Protein of unknown function YyaL [Flavobacterium branchiophilum
           FL-15]
          Length = 674

 Score =  318 bits (814), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 216/694 (31%), Positives = 332/694 (47%), Gaps = 74/694 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+  VA+++N  FV+IK+DREERPD+D +YM  +Q + G GGWPL++ 
Sbjct: 50  CHWCHVMEHESFENLEVAQVMNSHFVNIKIDREERPDLDALYMKALQIMTGQGGWPLNMV 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTYF  ED      + T L+++++ ++ + + +        E+L + +  
Sbjct: 110 CLPDGRPVWGGTYFRKED------WTTALKQIQEVFENQPERMLDYA----EKLQKGIDT 159

Query: 142 SASSNKLPDEL--PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                +  D+L   +  L     +  +S+D  FGG   APKF  P    ++L ++ + +D
Sbjct: 160 IGFKPQFHDDLVFSKKTLEDLISKWKRSFDLDFGGMARAPKFMMPNNYVLLLRYADQNQD 219

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   E    V  TL  MA GG+ D +GGGF RYSVD +WHVPHFEKMLYD  QL  
Sbjct: 220 -------EELLDFVHLTLTKMAYGGLFDVLGGGFSRYSVDMKWHVPHFEKMLYDNAQLLF 272

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  AF  T D  Y  +    + ++ ++         +A DADS  ++     +EGAFY+
Sbjct: 273 LYAQAFQKTGDPLYQEVVEKTIQFIEKEWFTDNKSFCAAYDADSINSQNVL--EEGAFYI 330

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT  E+  +LG+  +LF + + +   G+ +            G  VLI+    +  A K 
Sbjct: 331 WTQDELIALLGDDYVLFSKIFNINEFGHWE-----------HGHYVLIQNQTLAYWAEKE 379

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            + L    N   E  +KL+  R +RP+P LD+KVI SWN L I     A K   +     
Sbjct: 380 SIDLAVLKNKKQEWEQKLYQKRQQRPKPRLDNKVITSWNALTIKGLVEAYKTFGT----- 434

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                      K+Y+++A   A FI   L+    H L H ++NG  K  GFL+DYAF+I 
Sbjct: 435 -----------KKYLQMALQNAQFIAHTLWSPDGH-LWHIYQNGTCKINGFLEDYAFVIE 482

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             + +YE      WL+ A  L +   + F D     +   + +DP ++ +  E  D   P
Sbjct: 483 AFIHIYEVTFDEDWLLKAKTLTDYTFDYFFDTSKQMFRFNSRKDPELIAQHFEIEDNVIP 542

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-LMCCAADMLSV 618
           S NSV   NL    + ++ +  + Y Q   H++ +  T   D   A    +    D L  
Sbjct: 543 SSNSVMAHNL----NYLSLAFDNLYYQKTAHNMLLQATANVDYPSAFSNWLWLQMDNLYF 598

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
            S   +VL    + V+     +  H  Y     +      D  ++ + ++  SN      
Sbjct: 599 TSE--MVLNSENAVVE----ASEIHRHYHPENRI--FGCFDHSKIPYLKDKTSN------ 644

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
                 K +   C+N  C  PVTD   L+  L+E
Sbjct: 645 ------KSMYYFCKNKECHLPVTDFQLLKKKLME 672


>gi|182436351|ref|YP_001824070.1| hypothetical protein SGR_2558 [Streptomyces griseus subsp. griseus
           NBRC 13350]
 gi|178464867|dbj|BAG19387.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
           NBRC 13350]
          Length = 672

 Score =  318 bits (814), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 208/582 (35%), Positives = 298/582 (51%), Gaps = 61/582 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA  LN  FV +KVDREERPD+D VYM  VQA  G GGWP++
Sbjct: 47  SSCHWCHVMAHESFEDETVATYLNAHFVPVKVDREERPDIDAVYMEAVQAATGHGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPPE ++G P F+ +L  V  AW  +R+ +A+     +  L+   
Sbjct: 107 VFLTPDAEPFYFGTYFPPEARHGSPSFQQVLEGVVAAWTDRREEVAEVAERIVADLA-GR 165

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           S     + +P   E+ Q  L      L++ YD + GGFG APKFP  + ++ +L H  + 
Sbjct: 166 SLVHGGDGVPGESEIAQALL-----GLTREYDEQHGGFGGAPKFPPSMVVEFLLRHYAR- 219

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 220 --TGSEG----ALQMAADTCSAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALL 273

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY   +  T       I  +  D++ R++    G   SA DADS + +G  R  EGA+
Sbjct: 274 CRVYAHLWRTTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--RHVEGAY 331

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT  ++ ++LGE    F   Y+           +++     +G +VL    D+     
Sbjct: 332 YVWTPAQLREVLGEDDAAFAAAYF----------GVTEKGTFEEGASVLRLPGDTG---- 377

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
               P++     + + R +L   R +RPRP LDDKV+ +WNGL I++ A           
Sbjct: 378 ----PVDA--ARVADVRGRLLAAREERPRPGLDDKVVAAWNGLAIAALAETGAYF----- 426

Query: 438 SAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPS-KAPGFLDDYA 495
                      DR + +E A  AA   +R HL   +  RL  + ++G +    G L+DY 
Sbjct: 427 -----------DRPDLVERATEAADLLVRVHL--GEVARLARTSKDGQAGDNAGVLEDYG 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +   E F   EGG  ++T  +   ++ R ++  D
Sbjct: 474 DVAEGFLTLAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTD 532

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            A PSG + +   L+   S  A + S+ +R  AE +L V + 
Sbjct: 533 SATPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGVVKA 571


>gi|209883527|ref|YP_002287384.1| thioredoxin domain-containing protein [Oligotropha carboxidovorans
           OM5]
 gi|337739402|ref|YP_004631130.1| hypothetical protein OCA5_c01570 [Oligotropha carboxidovorans OM5]
 gi|386028421|ref|YP_005949196.1| hypothetical protein OCA4_c01570 [Oligotropha carboxidovorans OM4]
 gi|209871723|gb|ACI91519.1| highly conserved protein contAining a thioredoxin domain
           [Oligotropha carboxidovorans OM5]
 gi|336093489|gb|AEI01315.1| hypothetical protein OCA4_c01570 [Oligotropha carboxidovorans OM4]
 gi|336097066|gb|AEI04889.1| hypothetical protein OCA5_c01570 [Oligotropha carboxidovorans OM5]
          Length = 684

 Score =  318 bits (814), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 221/694 (31%), Positives = 334/694 (48%), Gaps = 83/694 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED   A+++N+ FV IKVDREERPD+D++YM  +  L   GGWP+++F
Sbjct: 56  CHWCHVMAHESFEDAATAEVMNELFVCIKVDREERPDIDQIYMRALHLLGQQGGWPMTMF 115

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           LSPD  P+ GGTYFP   +YGRP F  I+R+    +  + D +A +       L+E    
Sbjct: 116 LSPDGAPIWGGTYFPNTPQYGRPSFVGIMREFIRIYRDEPDKIAANKTAIERSLAERSPT 175

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
             +S  L      N L   A  +++S D   GG   APKFP+             LE   
Sbjct: 176 DTASIGL------NELDNVAGSIARSTDPDNGGLRGAPKFPQ----------CSMLEFLW 219

Query: 202 KSGEASEGQKMVLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           ++G  +   +  + T   L  M++GGI+DH+GGG+ RY+VD++W VPHFEKMLYD  Q+ 
Sbjct: 220 RAGARTGDDRFFITTNLALTRMSQGGIYDHLGGGYARYTVDDKWLVPHFEKMLYDNAQIL 279

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
           ++     +   +  Y     + + +L+R+M+   G   S+ DADS   EG    +EG FY
Sbjct: 280 DLLALEHARAPNALYHQRAEETVGWLKREMLTREGGFASSLDADS---EG----EEGRFY 332

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           +W+  E+E++LG + A  F   Y +   GN            F+G+N+L  L D S +A+
Sbjct: 333 IWSQSEIEELLGKDDATFFAAKYGVTADGN------------FEGRNILNRLGDDSDTAT 380

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           +           L   R  LF  R KR RP LDDKV+  WNGL I++   A++       
Sbjct: 381 E--------AEQLAAMRAILFRAREKRVRPGLDDKVLADWNGLTIAALVHAAQAFA---- 428

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                       R +++ +A +A  FI   +   +  RL HS+R G    P    D A +
Sbjct: 429 ------------RPDWLTLAATAFGFITTTM--SRHGRLGHSWRAGKLLQPALASDNAAM 474

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           I   L L+E      +L  A+  Q   D  + D   GGYF T+ +   ++LR     D A
Sbjct: 475 IRAALALHEATGDHLFLDQAVLWQADLDTHYGDPRHGGYFLTSDDAEGLILRPHSSVDDA 534

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            P+   ++  NL RLA +   +  D +R+  +   +       +       +  A D+  
Sbjct: 535 TPNHIGLTAQNLARLAVL---TGDDRWRKQLDTLFSRMLAVAGENVFGHLSLLNALDLYL 591

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI-DPADTEEMDFWEEHNSNNASM 676
             +   +V+ G       E +L AA A       V+H+ DPA          H +N+  +
Sbjct: 592 AGAE--IVVTGEGEEA--EALLKAARALPHATTIVLHVPDPAKLP-----AHHPANDKVV 642

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                     VA VC+  +CS PV++  +L  L+
Sbjct: 643 -----PGGGAVAFVCRGQTCSLPVSETDALAALV 671


>gi|399928052|ref|ZP_10785410.1| hypothetical protein MinjM_13607 [Myroides injenensis M09-0166]
          Length = 665

 Score =  318 bits (814), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 223/683 (32%), Positives = 326/683 (47%), Gaps = 75/683 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFED  VA L+N+ F+SIK+DREE PD+D  YM  VQ +   GGWPL+V
Sbjct: 48  TCHWCHVMEHESFEDNKVATLMNNHFISIKIDREEFPDIDAFYMKAVQIMTKQGGWPLNV 107

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
              PD +P+ GGTYFP      +  +   L ++ + +  K + +     FA EQL E +S
Sbjct: 108 VCLPDGRPIWGGTYFP------KQTWLDSLTQLNELYQTKPETVID---FA-EQLHEGIS 157

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              SS  + +   +  L +  E+ SKS+D   GG+G APKF  P     +LY    L+  
Sbjct: 158 L-LSSGPIENSETRFNLEVLIEKWSKSFDWENGGYGRAPKFMMPSN---LLY----LQKL 209

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G      +  + +  TL  MA GG+ D V GGF RYSVD RWH+PHFEKMLYD  QL  V
Sbjct: 210 GVYSHTKDILEYIDLTLTKMAWGGLFDTVEGGFSRYSVDMRWHIPHFEKMLYDNAQLLTV 269

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y DA+  TK+  Y  +    + Y+  +     G  +SA DADS   +   + KEGA+YVW
Sbjct: 270 YADAYKRTKNNLYKEVIAKTITYIENNWANKEGGYYSALDADSLNHDN--QLKEGAYYVW 327

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T KE++DI+ +   +FK+ + +   G  +           +   VLI+  D  + A++  
Sbjct: 328 TEKELQDIINKEYDIFKQVFNINDNGYWE-----------ENNYVLIQTQDLHSIANQNN 376

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +     + +  E    L   R  R  P LDDK + SWN + I+    +   L        
Sbjct: 377 IEYSHLVTLKKEWEELLLQARKNRKAPRLDDKTLTSWNAMYINGLLNSYTAL-------- 428

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                   + KEY+ +A     FI   L+DE    L H+++NG      +LDDYA+ IS 
Sbjct: 429 --------NNKEYLVLAIKTFDFITAKLWDEDK-GLYHTYKNGQKTIKAYLDDYAYYISA 479

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            ++LYE      +L  A    +   + F D +   +F +      ++  + E  D   PS
Sbjct: 480 AIELYEHTGEDNYLTIAKNCTDYVFDHFYDDKTKFFFYSQDIQEYIIKNI-ETEDNVIPS 538

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
            N++  +NL +LA +       +YR  + + L + +T++ D   A      A    S P+
Sbjct: 539 SNAIMCLNLQKLAVLYDNL---HYRNTSINMLEIIKTQI-DYPSAYSHWLLADLYQSHPA 594

Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
              + LVG            A   S  L K VI      T    F  E  S    + + N
Sbjct: 595 E--ITLVGK----------GALKTSLLLRKKVI------THTFVFPVEQESKIPYLNKEN 636

Query: 681 FSADK-VVALVCQNFSCSPPVTD 702
              DK ++  +C N +C  P  D
Sbjct: 637 ---DKHLLVYLCANSTCYKPEED 656


>gi|295838670|ref|ZP_06825603.1| conserved hypothetical protein [Streptomyces sp. SPB74]
 gi|197699107|gb|EDY46040.1| conserved hypothetical protein [Streptomyces sp. SPB74]
          Length = 683

 Score =  317 bits (813), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 229/693 (33%), Positives = 323/693 (46%), Gaps = 77/693 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED G A  +N+ FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 47  SACHWCHVMARESFEDVGTAAYVNEHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P  +P   GTYFPP   +G P F+ +L  V+ AW  +R  + +  A     L    
Sbjct: 107 VFLTPGGEPFYFGTYFPPRPLHGTPAFRQVLEGVRAAWADRRAEVDEVAARVTADL---- 162

Query: 140 SASASSNKLPD-ELPQNALRLCAE--QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             +     LPD   P  A  L A    L++ YDSR GGFG APKFP  + ++ +L H  +
Sbjct: 163 --TGRGLGLPDGAAPPGADALGAALLGLTRDYDSRHGGFGGAPKFPPVMVLEFLLRHHAR 220

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
              TG  G      +M   T + MA+GGI+D +GGGF RY+VD  W VPHFEKML D   
Sbjct: 221 ---TGAEG----ALQMAADTAEHMARGGIYDQLGGGFARYAVDREWTVPHFEKMLSDNAL 273

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L   Y   +  T       +  +  D+L R++  P G   SA DADS   +G  R  EGA
Sbjct: 274 LCRFYAHLWRATGSALARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGA 331

Query: 317 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            YVWT +++ ++LGE  A L   HY + P G             F+  + ++ L  +   
Sbjct: 332 SYVWTPEQLREVLGEADAALAAAHYGVTPEGT------------FEHGSSVLRLPRTDGF 379

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
            S    P++     L   RR L   R +RP P  DDKV+ +WNGLVI++ A         
Sbjct: 380 DSP---PVDA--ARLDRIRRALLAAREERPAPGRDDKVVAAWNGLVIAALAE-------- 426

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
              A F        R + +  A  AA  + R   D + H  + S    P    G L+DYA
Sbjct: 427 -TGAYFG-------RPDLVAAATGAADLLVRVHLDTRGHLTRTSRDGRPGGNAGVLEDYA 478

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        W  +A  L +     F D + G  ++T  +  +++ R ++  D
Sbjct: 479 DVAEGFLTLASVTGEGVWTDFAGLLLDQVLARFRD-DTGALYDTAADAEALIHRPQDPTD 537

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A PSG + +   L+  A++   + S  +R  AE +L+V    +  +A   P      + 
Sbjct: 538 NATPSGWNAAAGALLTYAAL---TGSTAHRAAAEQALSV----VAALAPRAPRFVGHGLA 590

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A  +L+ P    V +VG         +  AA  +      V    P+   E        
Sbjct: 591 VAEALLAGP--YEVAVVGAPEDPRTRALHCAALLATSPGAVVAAGPPSAEPEFPL----- 643

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
                +A          A +C+ F C  P TDP
Sbjct: 644 -----LADRPLVEGAPAAYLCRGFVCDRPETDP 671


>gi|404497256|ref|YP_006721362.1| thioredoxin domain-containing protein YyaL [Geobacter
           metallireducens GS-15]
 gi|418065852|ref|ZP_12703222.1| protein of unknown function DUF255 [Geobacter metallireducens RCH3]
 gi|78194859|gb|ABB32626.1| thioredoxin domain protein YyaL [Geobacter metallireducens GS-15]
 gi|373561650|gb|EHP87881.1| protein of unknown function DUF255 [Geobacter metallireducens RCH3]
          Length = 706

 Score =  317 bits (813), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 225/692 (32%), Positives = 328/692 (47%), Gaps = 81/692 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVM  ESF D  VA +LN  FV+IKVDREERPD+D  YM   Q + G GGWPL+V
Sbjct: 79  TCHWCHVMAHESFGDHEVAAVLNRDFVAIKVDREERPDIDDTYMRVAQLMNGSGGWPLTV 138

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
            ++PD +P    TY P   + G PG   IL ++ + W  +R+++ Q+    ++ L     
Sbjct: 139 CMTPDREPFFVATYIPKHSRGGMPGLVEILGRIAEVWKTRRELVHQNCTAILDSLRNLSV 198

Query: 141 ASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           A       P E+P    LR    QL+  +D    GFG APKFP P+ +  +L + ++  D
Sbjct: 199 AK------PGEIPGAEPLRAARSQLAGMFDPVNAGFGQAPKFPMPLNLSFLLRYGRRFGD 252

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            G +        MV+ TL+ + +GGI D +G G HRYSVD RW VPHFEKMLYDQ  +A 
Sbjct: 253 PGAT-------VMVVATLEALRRGGIFDQLGFGLHRYSVDSRWLVPHFEKMLYDQALVAM 305

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
             ++AF  T       +   + D++ R++  P G  +SA DAD   TEG    +EG +Y+
Sbjct: 306 AAVEAFQATGQESLREMAEQLCDFVLRELAAPEGGFYSALDAD---TEG----EEGRYYL 358

Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  +V  +LGE    LF   + +   GN            F+G N+L         A +
Sbjct: 359 WTPAQVRSVLGETEGELFCRLFDVTGKGN------------FEGANILNLPVLLHEFAQR 406

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            GM  E     +   R  L   R+KR RP  D+K++ +WNGL+I++ AR           
Sbjct: 407 EGMSPENLEEKVEGWRLLLLAERAKRERPFRDEKIVTAWNGLMIAALARL---------- 456

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
               F   G +R  ++  AE+A   I R L      RL  S   G  + P FL+DYA L+
Sbjct: 457 ----FLAGGGER--FLVAAEAALVRILRDLR-RADGRLLRSIHRGEGEVPAFLEDYAALL 509

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL L++     ++   A  L      LF   E  G ++T  +  +VL+R + D+DG  
Sbjct: 510 HGLLALHDATLDPRYREEACSLARDMLRLF-SGEDRGLYDTGNDAETVLMRSRVDYDGVM 568

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN ++   LVRL  +   +  + + +  E  +  F        +A      A D+L  
Sbjct: 569 PSGNGLAATGLVRLGRM---ADEERFVEAGEEIIRAFMAGAGRQPVAHLQTLMALDLLRG 625

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P  +  +  G +  V  + MLA     + +   V+  +P                     
Sbjct: 626 PQVEVAISGGSRGKV--QGMLAEIGKRF-IPGFVLRGEPD-------------------- 662

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +  A VC   +C  PV  P +L  +L
Sbjct: 663 ---QGRRATAQVCAAGACHIPVESPAALGGIL 691


>gi|429859406|gb|ELA34188.1| duf255 domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 811

 Score =  317 bits (812), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 213/652 (32%), Positives = 314/652 (48%), Gaps = 85/652 (13%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+  +   E F     AK+LN+ FV + +DREERP++D +YM YVQA+ G GG
Sbjct: 76  HIGFKPCHYSRLTSTECFTHSECAKILNESFVPVIIDREERPELDTIYMNYVQAVSGNGG 135

Query: 76  WPLSVFLSPDLKPLMGGTYFP-PEDKYGRPG------FKTILRKVKDAWDKKR------- 121
           WPL++FL+P+L+P+ GGTY+P PE   G  G      F  IL+K++  W ++        
Sbjct: 136 WPLNLFLTPELEPVFGGTYYPAPEPNNGSSGDDERLDFLAILKKLQKVWKEQEARCRQEA 195

Query: 122 --------DMLAQSGAFAIEQLSEALSASASSN------------------KLPDELPQN 155
                   D  A+    A   +   ++ S S+                    +  EL   
Sbjct: 196 KEVVVKLHDFAAEGTLGATSTVEPGVAGSQSATLARSETGLEHPGTGRTAAVVSSELDLE 255

Query: 156 ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL---EDTGKSGEASEGQKM 212
            L      ++ ++D  +GGFG APKFP P ++  +L   + L   +D     E +   +M
Sbjct: 256 HLEEAYTHIAGTFDPVYGGFGLAPKFPTPPKLSFLLRLPRYLAPVQDVVGETECAHAAEM 315

Query: 213 VLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--- 268
            LFTL+ +   G+ DHVGG GF RYSV   W VP FEK++     L  +YLDA+ +    
Sbjct: 316 ALFTLRKIRDSGLRDHVGGHGFARYSVTADWSVPRFEKLVVHNALLLGLYLDAWLIATGG 375

Query: 269 --KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
                FY  +  +++DYL    I  P G   S+E ADS    G    +EGA+ +WT +E 
Sbjct: 376 EKNGEFYDVVV-ELVDYLTSAPISLPDGGFVSSEAADSYR-RGDRHLREGAYSLWTRREF 433

Query: 326 EDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 383
           + ++G+   A L   ++ +   GN +  +  DP++EF  +N+L  + D +    + G+ +
Sbjct: 434 DSVIGDDHEAALAASYWNVLEDGNIEPDQ--DPNDEFVNENILRVVKDKAEIGRQAGITI 491

Query: 384 EKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 442
           +    +L   ++KL   R K R RP  D K++   NGLVI + AR    L          
Sbjct: 492 DDVERVLASAKQKLKAHREKERTRPEADTKIVAGRNGLVIGALARTGSALA--------- 542

Query: 443 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 502
            P+         E A  AA+FIR  L+DE    L   +  G     G  DDYA LI GL+
Sbjct: 543 -PIDADRSNACFEAASKAAAFIRAQLWDENERILYRIYNEGRGDTKGLADDYAHLIEGLI 601

Query: 503 DLYEFGSGTKWLVWAIELQNTQDELFLD--------------REGGGYFNTTGED-PSVL 547
           DLYE     KW  +A ELQ  Q ++F D              R   G F TT E+ P  +
Sbjct: 602 DLYEATGEEKWAEFADELQKVQIDMFYDSTSVPATTPTSPTARSSCGAFYTTPENAPHTI 661

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           LR+K+  D A PS N+VSV NL RL  +++    + Y   A  S+  FE  +
Sbjct: 662 LRLKDGMDTALPSTNAVSVSNLFRLGIMLS---DEAYTALARESINAFEAEI 710


>gi|326776975|ref|ZP_08236240.1| hypothetical protein SACT1_2812 [Streptomyces griseus XylebKG-1]
 gi|326657308|gb|EGE42154.1| hypothetical protein SACT1_2812 [Streptomyces griseus XylebKG-1]
          Length = 672

 Score =  317 bits (812), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 208/582 (35%), Positives = 297/582 (51%), Gaps = 61/582 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE VA  LN  FV +KVDREERPD+D VYM  VQA  G GGWP++
Sbjct: 47  SSCHWCHVMAHESFEDETVATYLNAHFVPVKVDREERPDIDAVYMEAVQAATGHGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPPE ++G P F+ +L  V  AW  +R+ +A+     +  L    
Sbjct: 107 VFLTPDAEPFYFGTYFPPEARHGSPSFQQVLEGVVAAWTDRREEVAEVAERIVADLG-GR 165

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           S     + +P   E+ Q  L      L++ YD + GGFG APKFP  + ++ +L H  + 
Sbjct: 166 SLVHGGDGVPGESEIAQALL-----GLTREYDEQHGGFGGAPKFPPSMVVEFLLRHYAR- 219

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 220 --TGSEG----ALQMAADTCSAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALL 273

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY   +  T       I  +  D++ R++    G   SA DADS + +G  R  EGA+
Sbjct: 274 CRVYAHLWRTTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--RHVEGAY 331

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT  ++ ++LGE    F   Y+           +++     +G +VL    D+     
Sbjct: 332 YVWTPAQLREVLGEDDAAFAAAYF----------GVTEKGTFEEGASVLRLPGDTG---- 377

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
               P++     + + R +L   R +RPRP LDDKV+ +WNGL I++ A           
Sbjct: 378 ----PVDA--ARVADVRGRLLAAREERPRPGLDDKVVAAWNGLAIAALAETGAYF----- 426

Query: 438 SAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPS-KAPGFLDDYA 495
                      DR + +E A  AA   +R HL   +  RL  + ++G +    G L+DY 
Sbjct: 427 -----------DRPDLVERATEAADLLVRVHL--GEVARLARTSKDGQAGDNAGVLEDYG 473

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G L L        WL +A  L +   E F   EGG  ++T  +   ++ R ++  D
Sbjct: 474 DVAEGFLTLAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTD 532

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            A PSG + +   L+   S  A + S+ +R  AE +L V + 
Sbjct: 533 SATPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGVVKA 571


>gi|302519353|ref|ZP_07271695.1| transmembrane protein [Streptomyces sp. SPB78]
 gi|302428248|gb|EFL00064.1| transmembrane protein [Streptomyces sp. SPB78]
          Length = 578

 Score =  317 bits (812), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 213/581 (36%), Positives = 296/581 (50%), Gaps = 60/581 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED   A  +N  FV +KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 47  SSCHWCHVMARESFEDAETAAYMNAHFVCVKVDREERPDVDAVYMEAVQAATGHGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+P  +P   GTYFPP   +G P F+ +L  V+ AW  +R+ +A   A     L+  A
Sbjct: 107 VFLTPGGEPFYFGTYFPPRPLHGTPAFRQVLEGVRAAWADRREEVADVAARVTADLTGRA 166

Query: 139 LSASA-SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           L   A +S   PD L    L      L++ YDSR GGFG APKFP  + ++ +L H  + 
Sbjct: 167 LGLPADASPPGPDALGAALL-----GLTRDYDSRHGGFGGAPKFPPVMVLEFLLRHHAR- 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T + MA+GGI+D +GGGF RY+VD  W VPHFEKML D   L
Sbjct: 221 --TGAEG----ALQMAADTAEHMARGGIYDQLGGGFARYAVDREWIVPHFEKMLSDNALL 274

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y   +  T       +  +  D+L R++  P G   SA DADS   +G  R  EGA 
Sbjct: 275 CRFYAHLWRATGSALARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGAS 332

Query: 318 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVWT +++ ++LGE  A L   HY + P G             F+  + ++ L  +  S 
Sbjct: 333 YVWTPEQLREVLGEDDAALAAAHYGVTPEGT------------FEHGSSVLRLPRTDGSD 380

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           S    P++     L   RR L   R +RP P  DDKV+ +WNGL I++ A          
Sbjct: 381 SP---PVDA--ARLDRIRRALLAARDERPAPGRDDKVVAAWNGLAIAALAETGAYF---- 431

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTH-RLQHSFRNGPSKA-PGFLDD 493
                       DR + +E A  AA   +R HL    TH RL  + R+G +    G L+D
Sbjct: 432 ------------DRPDLVEAALGAADLLVRVHL---DTHGRLSRTSRDGRTGTNTGVLED 476

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA +  G L L        W  +A  L +   + F D + G  ++T  +  +++ R ++ 
Sbjct: 477 YADVAEGFLTLASVTGEGVWTDFAGLLLDHVLDRFRD-DSGALYDTAADAETLIHRPQDP 535

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
            D A PSG + +   L+  A++ AGS    +R  +E  L+V
Sbjct: 536 TDNATPSGWNAAAGALLTYAAL-AGSTP--HRAASEQGLSV 573


>gi|441511562|ref|ZP_20993411.1| hypothetical protein GOAMI_01_00780 [Gordonia amicalis NBRC 100051]
 gi|441453542|dbj|GAC51372.1| hypothetical protein GOAMI_01_00780 [Gordonia amicalis NBRC 100051]
          Length = 674

 Score =  317 bits (812), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 201/575 (34%), Positives = 284/575 (49%), Gaps = 65/575 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFEDE  A  +N  FV IKVDREERPD+D +YM    A+ G GGWP++ F
Sbjct: 60  CHWCHVMAHESFEDETTAAQMNRDFVCIKVDREERPDIDAIYMAATVAMTGQGGWPMTCF 119

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P   GTY+PP  +   P F+ +L  V +AW ++R  L  + A   E +    S 
Sbjct: 120 LTPDSDPFYTGTYYPPRPRGQMPSFRQVLTAVTEAWTQRRADLDDTAAKVREHIVVNTSP 179

Query: 142 -SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             A +  + D L  + +R   ++     D   GGFG APKFP    +  ++ H+++  DT
Sbjct: 180 LPAGTVPVDDRLLAHGVRTVLDE----EDREHGGFGGAPKFPPSALLDALIRHTERTGDT 235

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                A         T+  M +GGI+D +GGGF RYSVD  W VPHFEKMLYD  QL   
Sbjct: 236 AAIEAAGR-------TMHAMGRGGIYDQLGGGFARYSVDAGWVVPHFEKMLYDNAQLLRA 288

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y      T D     +  + + +LRRD+  PGG   S+ DAD+   EG+T       YVW
Sbjct: 289 YAHLARRTGDALAHRVVEETVTFLRRDLRVPGG-FASSLDADAGGVEGST-------YVW 340

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  E+ ++LG  A       +                       V+ E        S L 
Sbjct: 341 TPDELAEVLGPEAGRRAAELF-----------------------VVTEQGTFEHGRSTLQ 377

Query: 381 MPLE-KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           +P + +  + LG  R  LFD R++R +P  DDKV+ +WN + I++ A A   L    E+ 
Sbjct: 378 LPADPEDRDRLGTVRAALFDARARRVQPTRDDKVVTAWNAMTITALAEAGAGL---GETG 434

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
             +  V  +D              +R HL      RL+ S   G   A G LDD+A L +
Sbjct: 435 FVDDAVRCAD------------ELLRGHLVG---GRLRRSSLGGAVGADGGLDDHAALST 479

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGAE 558
            LL L++    T+WL   + L +T  ELF D E  G +F+ TGE   ++ R ++  DGA 
Sbjct: 480 ALLTLFQVTGETRWLGAGLGLLDTAIELFADPEAPGAWFDATGE--GLIARPRDPIDGAT 537

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           PSG S+    L+  + +    ++  Y +  EHSL+
Sbjct: 538 PSGASLMAEALLTASMLADPERAVGYAELLEHSLS 572


>gi|126659475|ref|ZP_01730608.1| hypothetical protein CY0110_07109 [Cyanothece sp. CCY0110]
 gi|126619209|gb|EAZ89945.1| hypothetical protein CY0110_07109 [Cyanothece sp. CCY0110]
          Length = 686

 Score =  317 bits (811), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 243/719 (33%), Positives = 346/719 (48%), Gaps = 110/719 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A  LND F+ IKVDREERPD+D +YM+ +Q +   GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIATYLNDNFLPIKVDREERPDLDSIYMSSLQMMGIQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP E +YGRPGF  +L+ ++  +D +++ L     F  E L + 
Sbjct: 108 IFLTPGDLVPFYGGTYFPVEPRYGRPGFLQVLQSIRHFYDVEKEKL---NGFKQEIL-KG 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQL-SKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKK 196
           L  SA+       LP + + +   QL  +  D        +A  F RP    M+ Y +  
Sbjct: 164 LQQSAT-------LPMSEIDVNNAQLIYRGVDVNTKIIQVTAEDFGRPC-FPMIPYSNLA 215

Query: 197 LEDTG-KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           LE T    GE  E QK+V+   Q +A GGI DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 216 LEGTRFLFGEPEERQKLVIQRGQDLALGGIFDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 Q----LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           Q    LAN++ +     ++  +       + +L+R+M  P G  ++A+DADS  T+    
Sbjct: 276 QIMEYLANLWSNG---QQEPAFERAIALTVQWLQREMTSPEGYFYAAQDADSFATKEDKE 332

Query: 312 KKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
            +EG FYVW  +++E +L    +    E + + P GN            F+GKNVL   N
Sbjct: 333 PEEGTFYVWKYEQLEQLLNTKKLEELTEVFTITPEGN------------FEGKNVLQRRN 380

Query: 371 DSSASASKLGMPLEK-YLNILGECRRKL---FDVRSKRPRPHL----------DDKVIVS 416
            S  S S + + L+K +    G  R  L      ++ +    +          D K+IV+
Sbjct: 381 GSKFSDS-IEIILDKLFQERYGTSRNNLETFLPAKNNQEAQEINWPGRIPAVTDTKMIVA 439

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHR 475
           WN L+IS  ARA  I K          P+       Y ++  +A  FI  +   + + HR
Sbjct: 440 WNSLMISGLARAYAIFKQ---------PL-------YWQLGCNATQFILNKQWLNGRLHR 483

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDREGG 534
           + +    G        +DY FLI  LLDL+   +  T+WL  AIE+Q   DE F   E G
Sbjct: 484 INYE---GNPSILAQSEDYGFLIKALLDLHAANAQETQWLDKAIEIQQEFDEFFWSLEMG 540

Query: 535 GYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           GY+N   ++ + +L+R +   D A PS N +++ NLVRLA +        Y   AE  L 
Sbjct: 541 GYYNNAADNSNDLLVRERSYIDNATPSANGIAISNLVRLARLTDNLD---YLDKAEQGLQ 597

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            F   L +   A P +  A D           LV    ++              L K + 
Sbjct: 598 AFSHILSESPRACPSLLTALDWYHFG-----CLVRTNETL--------------LPKLMT 638

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
              P     +D              NN   D  V LVCQ  SC  P T    L N ++E
Sbjct: 639 QYFPTTAYCLD--------------NNL-PDNAVGLVCQGLSCLEPATTEEQLLNQIIE 682


>gi|302536490|ref|ZP_07288832.1| conserved hypothetical protein [Streptomyces sp. C]
 gi|302445385|gb|EFL17201.1| conserved hypothetical protein [Streptomyces sp. C]
          Length = 687

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 223/700 (31%), Positives = 329/700 (47%), Gaps = 74/700 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED+  A  +N+ FV+IKVDREERPD+D VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAGESFEDDLAAAYMNEHFVNIKVDREERPDIDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+PD +P   GTYFPPE ++G P F  +L  V+ AW  +R+ +++     +  L+   
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGMPSFMQVLEGVRTAWAGRREEVSEVAQRIVRDLAGRQ 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L    +    P+EL +  L      L++ YD+  GGFG APKFP  + ++ +L H  +  
Sbjct: 168 LDYGRAGLPGPEELGRALL-----GLTREYDAARGGFGGAPKFPPSMVLEFLLRHHAR-- 220

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 221 -TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 275

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS E   + +  EGA+Y
Sbjct: 276 RVYAHLWRATGSDLARRVALETADFMVRELRTEQGGFASALDADS-EDPSSGKHVEGAYY 334

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
            WT  E+ ++LGE        Y+    G  +          F+    +++L         
Sbjct: 335 AWTPAELAEVLGEEDGAVAAAYF----GVTE-------EGTFEHGRSVLQLPQ------- 376

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G P+ +   +    R +L   R +RP P  DDKV+ +WNGL +++ A            
Sbjct: 377 -GGPVVEAGKV-ASIRERLLAARGRRPAPGRDDKVVAAWNGLAVAALAECGAFF------ 428

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKA-PGFLDDYA 495
                     +R + +E A  AA  + R  +D      RL  + R+G      G L+DY 
Sbjct: 429 ----------ERPDLVERAIEAADLLVRVHFDSTAGMARLARTSRDGRVGVNAGVLEDYG 478

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
            +  G L L        WL +A  L +     F    G G    T  D   L+R  +D  
Sbjct: 479 DVAEGFLALASVTGEGVWLEFAGFLVDLVMARFT--AGDGSLYDTAHDAEQLIRRPQDPT 536

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           D A PSG + +   L+   S  A + S  +R+ AE +L V           +      A+
Sbjct: 537 DTAAPSGWTAAAGALL---SYAAHTGSAPHREAAERALGVVHALGPRAPRFIGHGLAVAE 593

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHIDPADTEEMDFWEEHNSNN 673
            L V   + V +VGH              A+  L++T ++   P     +    + + + 
Sbjct: 594 AL-VDGPREVAVVGHPED----------PATVALHRTALLATAPGAVVAVGLPRKADGSG 642

Query: 674 AS---MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                +A      D   A VC++F C+ P T+P+SL   L
Sbjct: 643 GEFPLLAERTLVRDLPTAYVCRHFVCARPTTEPVSLAEQL 682


>gi|255033843|ref|YP_003084464.1| hypothetical protein Dfer_0027 [Dyadobacter fermentans DSM 18053]
 gi|254946599|gb|ACT91299.1| protein of unknown function DUF255 [Dyadobacter fermentans DSM
           18053]
          Length = 671

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 224/685 (32%), Positives = 326/685 (47%), Gaps = 75/685 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME E FE E +A+++N +FV IKVDREERPDVD VYM  VQA+   GGWPL+
Sbjct: 47  SACHWCHVMERECFEKEPIAEVMNAYFVCIKVDREERPDVDAVYMDAVQAMGVRGGWPLN 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL PD KP  G TY PP++      +  +L+ +  A+    D LA S    ++ +  + 
Sbjct: 107 VFLLPDSKPFYGVTYLPPQN------WVQLLKSINQAFTNHFDELADSAEGFVQNMIASE 160

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S      +       + L +  EQ+ + +D++ GG   APKF  P   + +L    +  D
Sbjct: 161 SQKYGLVEGTVHFNADDLDVMFEQIQRHFDTQKGGMDRAPKFMMPSIYKFLL----RYFD 216

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             ++ EA      V  +L  +A GGI+DHVGGG+ RYSVDE W +PHFEKMLYD  QL +
Sbjct: 217 VSQNPEA---LAQVELSLNRIALGGIYDHVGGGWARYSVDEDWFIPHFEKMLYDNAQLLS 273

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY +A+SLT++  Y+      + +L  +M    G  FSA DADS   EG     EG FY+
Sbjct: 274 VYAEAYSLTQNPLYASRIEQTIQWLSAEMRSADGGFFSALDADS---EGI----EGKFYI 326

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT +E++ +LGE    F + Y +   GN +            G N L        +A   
Sbjct: 327 WTQQELQSVLGEDFDWFSKLYNISAQGNWE-----------HGYNHLHLTEPVEHAAKTA 375

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+  + +         KL + R +R RP LDDK++ SWNGL+I       + L  E    
Sbjct: 376 GILTDDFAGRYENAVTKLAEKRRERVRPGLDDKILASWNGLLIKGLTDCYRALGHE---- 431

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       E  E+A     FI   +      +L HSF+NG +   GFL+DYA +I 
Sbjct: 432 ------------EIRELAIGTGHFIAGKM--TTGSKLNHSFKNGVATVTGFLEDYAAVIE 477

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           G L LY+      WL  A +L       F D+  G +  T     +++ R KE  D   P
Sbjct: 478 GYLGLYQITFEEDWLQKAQQLTEYALSNFYDQSEGFFHFTDAYGEALIARKKELFDNVIP 537

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV---PLMCCAADML 616
           + NS+   NL  L  ++   + DY   + +    + +  L D+        L C  A   
Sbjct: 538 ASNSIMAQNLYTLGKML--DRDDYIEISDKMLSKMTKLLLADVQWVTNWAALYCQRA--- 592

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
            VP+ +  ++ G     D + M       +  NK V+    + T  +            +
Sbjct: 593 -VPTAEIAIVGG-----DADAMRKDLDRFFIPNKIVMGTSTSSTLPL-----------LL 635

Query: 677 ARNNFSADKVVALVCQNFSCSPPVT 701
            R + +A K    VC + +C  PVT
Sbjct: 636 NRTDINA-KTAIYVCYDKTCQLPVT 659


>gi|402820063|ref|ZP_10869630.1| hypothetical protein IMCC14465_08640 [alpha proteobacterium
           IMCC14465]
 gi|402510806|gb|EJW21068.1| hypothetical protein IMCC14465_08640 [alpha proteobacterium
           IMCC14465]
          Length = 751

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 225/724 (31%), Positives = 346/724 (47%), Gaps = 100/724 (13%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE+E +A ++ND FV+IKVDREERPD+D +YM+ +  +   GGWPL++F
Sbjct: 61  CHWCHVMAHESFENEDIASVMNDLFVNIKVDREERPDIDDIYMSALHMMGEQGGWPLTMF 120

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILR-----------KVKDAWDKKRDMLAQSGAF 130
           L PD +P  GGTYFPP  K+GRPGF  I R           KV++  DK    L      
Sbjct: 121 LLPDGRPFWGGTYFPPIAKFGRPGFPDICREIARICTEETDKVQENADKLTQALQNKNNA 180

Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
           A +  ++  +    S  LP  LP++     +E L++  D  +GG   APKFP+P+  +++
Sbjct: 181 AFKAANQKTALEQLSPNLPLGLPEDLASEASENLARQIDLTYGGMQGAPKFPQPLIYELL 240

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
                  +D  ++G     ++ VL TL  +  GGI DH+ GGF RYSVDE W VPHFEKM
Sbjct: 241 ------WQDWLRNGR-DVSREAVLITLSGLCHGGIFDHIRGGFSRYSVDEEWLVPHFEKM 293

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-------GPGGEIFSAED--- 300
           +YD G + ++  + +  T+D   +      +D+L  DM+         G    S +D   
Sbjct: 294 IYDNGLILDLMGNVWKSTRDPMLTDRISKTVDWLLDDMLTNATNNSTDGAAALSKDDTPK 353

Query: 301 ---ADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPH 357
              A +A  +  +  +EG +YVWT  E+  +LGE+   F   Y +   GN        P 
Sbjct: 354 PPAAFAASLDADSEGEEGKYYVWTVAELTSLLGENFPDFARTYRVTDAGNF-------PE 406

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLE----KYLNILGECRRKLFDVRSKRPRPHLDDKV 413
               G NV I LN    S    G   E    + LNIL +        ++ R RP  DDK+
Sbjct: 407 GGGAGDNVNI-LNRLPPSLHNEGFDEEARHAQSLNILAQA-------QALRTRPERDDKI 458

Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ- 472
           +  WNGLVI++ AR S + ++                K+++E AE A   + + +  E+ 
Sbjct: 459 LADWNGLVIAALARLSPVFQN----------------KKWLETAERAYRDVMQTMSYEEG 502

Query: 473 -THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
              +L H+ R          +DY+ +    L L+       +L  A  L  T ++ + D 
Sbjct: 503 GCLKLAHAARGESKLNISMAEDYSNMADAALALFSATGTASYLASAEALTKTLEQFYTD- 561

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           + GG++ T+ +  +++ R    +DGA P+ N  ++I + R  ++  G +   YR + E  
Sbjct: 562 DVGGFYMTSSQAETLITRPHTSYDGATPNANG-TMIGVYRRLAVFTGKQD--YRDSLE-- 616

Query: 592 LAVFETRLKDMAMAVPLMC-CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA------ 644
            A+ +T         P M     +  +   +   V+VG  S  DF+ +L  AHA      
Sbjct: 617 -ALIKTHAIAAIKHYPQMPRYLTETENTRHQASCVIVGDPSDNDFKLLLETAHAHPCPGL 675

Query: 645 -------SYDLNKTV-IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSC 696
                    DL   + IH  PA+           + NA+  +  F+ D+  A VC + +C
Sbjct: 676 IVHPVGLGQDLPTHIPIHETPANP----------TKNATDDKMPFAFDQPTAYVCTHNTC 725

Query: 697 SPPV 700
            PP 
Sbjct: 726 LPPA 729


>gi|322697732|gb|EFY89508.1| DUF255 domain protein [Metarhizium acridum CQMa 102]
          Length = 724

 Score =  316 bits (809), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 207/641 (32%), Positives = 322/641 (50%), Gaps = 74/641 (11%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+C +M  ESF +   A +LN+ FV + +DREERPD+D +YM YVQA+   GG
Sbjct: 63  HIGYKACHFCRLMTQESFSNPECAAILNESFVPVIIDREERPDIDTIYMNYVQAVSNVGG 122

Query: 76  WPLSVFLSPDLKPLMGGTYFP---------PEDKYGRPGFKTILRKVKDAWDKKR----- 121
           WPL+VF++P+L+P+ GGTY+P          E +   P   TI +KV+D W  +      
Sbjct: 123 WPLNVFVTPNLEPVFGGTYWPGPGTSRRVAAESEDESPDCLTIFKKVRDIWHDQETRCRK 182

Query: 122 ---DMLAQSGAFAIEQL------------------------SEALSASASSNKLPDELPQ 154
              ++LAQ   FA E                          +  + A     ++  EL  
Sbjct: 183 EASEVLAQLREFAAEGTLGTRGLTGTHPIATPSWNIPSNPENTPIRARDKDAQVSSELDL 242

Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY---HSKKLEDTGKSGEASEGQK 211
           + L      ++ ++D  +GGFG APKF  P ++  +L+       ++D     E      
Sbjct: 243 DQLEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLAFLLHLNTFPSAVQDVVGEAECRHATV 302

Query: 212 MVLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL--- 267
           M + TL+ +  G +HDH+G  GF R SV   W +P+FEK++ D   L  +YLDA+ +   
Sbjct: 303 MAVDTLRKIRDGALHDHIGATGFARCSVTPDWSIPNFEKLVVDNALLLVLYLDAWGIAGG 362

Query: 268 -TKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
                FY  +  ++ DYL    I  P G + ++E ADS    G    +EGA+Y+WT +E 
Sbjct: 363 KADSEFYDTVL-ELADYLSSPPIALPSGGLATSEAADSFMRRGDREMREGAYYLWTRREF 421

Query: 326 EDILG------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           + ++       + + +   H+ ++  GN D     DP+++F   N+L  +      + + 
Sbjct: 422 DSVVDASGQDKQISQVAAAHWDVQEGGNVDEDH--DPNDDFINHNILRVVKTPDELSRQF 479

Query: 380 GMPLEKYLNILGECRRKL-FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            +  +     +   R++L      +R RP LDDKVI +WNGL IS+ A+AS  LK     
Sbjct: 480 NISTDTVRQHIQAARKELKARRERERVRPELDDKVITAWNGLAISALAQASSALK----- 534

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                PV  +  ++Y+  AESAA FI+  L+DE +  L   +R G  +  GF DDY +LI
Sbjct: 535 -----PVDPARSEKYLHAAESAAGFIKASLWDESSKLLYRIYREG-RETKGFADDYTYLI 588

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLLDL+   S    L +A  LQ TQ+ LF D + G +F+TT   P  +LR+K+  D + 
Sbjct: 589 HGLLDLFAATSDESHLAFADALQKTQNSLFHDSDSGAFFSTTASSPQAILRLKDGMDTSL 648

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           PS N+V+  NL RL +++     + Y   A  ++  FE  +
Sbjct: 649 PSINAVAASNLFRLGALL---DDEPYSTLARGTVNAFEAEM 686


>gi|428319651|ref|YP_007117533.1| hypothetical protein Osc7112_4848 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428243331|gb|AFZ09117.1| hypothetical protein Osc7112_4848 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 695

 Score =  316 bits (809), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 217/628 (34%), Positives = 319/628 (50%), Gaps = 82/628 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIAQYMNSHFIPIKVDREERPDIDSIYMQTLQMMTGQGGWPLN 107

Query: 80  VFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD + P  GGTYFP E +YGRPGF  +L+ ++  +D ++  +    A  +  L ++
Sbjct: 108 VFLTPDERVPFYGGTYFPVEPRYGRPGFLEVLQAIRRFYDTEKGKVEAFKAEILSNLQQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            + S  + +L  EL Q  L +    ++        G    P FP      M+ Y    L 
Sbjct: 168 AALSGVTAELNRELFQKGLEINTGIVA--------GHNPGPSFP------MIPYAELALR 213

Query: 199 DTGKSGEASEGQKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            T  + E+    K V       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 214 GTRFNFESKYDSKQVCTQRGLDLALGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGQI 273

Query: 258 ANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
                + +S  + +  F + I   + ++L+R+MI P G  ++A+DADS  T      +EG
Sbjct: 274 VEYLANLWSAGIQEPAFETAIAGTV-EWLKREMIAPTGYFYAAQDADSFNTSEEVEPEEG 332

Query: 316 AFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----EL 369
           AFYVWT  E+E +L  E     K  + +  +GN            F+GKNVL       L
Sbjct: 333 AFYVWTYAELEQLLTAEELAEIKAQFTVSRSGN------------FEGKNVLQRRHPGRL 380

Query: 370 NDSSASA-SKL------GMP-LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
           +D+  +A +KL      G P   K        +    D    R     D K+I +WN L+
Sbjct: 381 SDTVETALAKLFAVRYGGNPNTVKTFPPARNNQEAKNDSWPGRIPAVTDTKMIAAWNSLM 440

Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
           IS  ARA+ +  +                 EY+E+A  AA+FI  + + E   R Q    
Sbjct: 441 ISGLARAAAVFGN----------------LEYLELAVKAANFILDNQWTE--GRFQRLNY 482

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYE----FGSGTK---------WLVWAIELQNTQDELF 528
           +G S      +DYA  +  LLDL++     G+G +         WL  A+++Q   DE  
Sbjct: 483 DGQSAVTAQSEDYALFVKALLDLHQASLTLGNGEEAKQLPNSQFWLEKALQVQEEFDEFL 542

Query: 529 LDREGGGYFNTTGEDPS--VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
              E GGY+N T +D S  +L+R +   D A P+ N +++ +LVRLA  + G   +Y  +
Sbjct: 543 WSVELGGYYN-TAQDASGDLLVRERSYIDNATPAANGIAIASLVRLA--LLGPNLEYLDR 599

Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAAD 614
            AE  L  F + ++D   A P +  A D
Sbjct: 600 -AEQGLQAFSSIVQDSPQACPSLLSAID 626


>gi|336120019|ref|YP_004574797.1| hypothetical protein MLP_43800 [Microlunatus phosphovorus NM-1]
 gi|334687809|dbj|BAK37394.1| hypothetical protein MLP_43800 [Microlunatus phosphovorus NM-1]
          Length = 669

 Score =  315 bits (808), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 225/692 (32%), Positives = 319/692 (46%), Gaps = 78/692 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A  LN+ FVS+KVDREERPDVD V+M   QAL G GGWP++V
Sbjct: 49  ACHWCHVMAHESFEDETTAAYLNEHFVSVKVDREERPDVDAVFMAATQALAGQGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P   GTYFPP  + G P F  +L  +  AW  +RD +  S A    +L     
Sbjct: 109 FLTPDRRPFYAGTYFPPRARQGMPAFADVLAAIASAWRDRRDEVLSSVAHISGELERR-- 166

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
               + KLP E+ +  L +    L + +D   GGFG APKFP  + ++ +L    +L D 
Sbjct: 167 ---HAPKLPGEVTRAGLDVARANLQREFDEVRGGFGGAPKFPPSMVLEGLL----RLGD- 218

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                  E   MV  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  V
Sbjct: 219 ------DESMAMVDVTCEAMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLGV 272

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y   +  T++     +  + +++L  ++  P G   ++ DADS + +G     EGA+Y W
Sbjct: 273 YTHWWRRTQNPIGERVVAETVEWLVAELRTPQGGFAASLDADSLDEQG--HSAEGAYYAW 330

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
               +  +LGE    +    +           ++D      G++ L  L D         
Sbjct: 331 DPVGLTAVLGEDDGRWAAEVF----------GVTDQGTFEHGRSTLRLLGDPD------- 373

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
            P+      L   R +L   R +RPRP  DDKV+ +WNG +I+S   A+ +         
Sbjct: 374 -PVR-----LASARERLRTTREQRPRPGRDDKVVAAWNGWLIASLVEAAGVFG------- 420

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLI 498
                    R +++ +A  AA  I R H  D    RL+ + R+G    A G L+DYA + 
Sbjct: 421 ---------RPDWLALAREAAELIWRVHWVD---GRLRRTSRDGEVGSAAGVLEDYAAMT 468

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              + L    +   WL  A  L       F D  G G+F+T     S+ LR ++  D A 
Sbjct: 469 MAAVRLGCAEADATWLTRAEALAEVILAEFGD--GDGFFDTASGAESLYLRPQDPTDNAT 526

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSG S +V  L  LA      +SD   +    +        +    A  L+  AA  L  
Sbjct: 527 PSGLSATVHALALLAETT--GRSDLAERAERAAATAGGLVDRAPRFAGWLLAYAASRLVS 584

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
           P    V +VG  S    + +   A+       +VI +   D   ++           +A 
Sbjct: 585 PP-VQVAIVGDASDTGTQELARTAYRCAPAG-SVIMVGVPDEPGLEL----------LAD 632

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +  A VC+ F C  PVTD   L + L
Sbjct: 633 RPLLDGRPTAYVCRGFVCRLPVTDSQELADQL 664


>gi|88604224|ref|YP_504402.1| hypothetical protein Mhun_2996 [Methanospirillum hungatei JF-1]
 gi|88189686|gb|ABD42683.1| protein of unknown function DUF255 [Methanospirillum hungatei JF-1]
          Length = 700

 Score =  315 bits (808), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 225/688 (32%), Positives = 310/688 (45%), Gaps = 77/688 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME   FEDE VA LLN  FVS+KVDREERPD+D+VYM   QA+ G GGWPL VF
Sbjct: 53  CHWCHVMETVCFEDEVVASLLNTHFVSVKVDREERPDIDQVYMAVCQAMTGSGGWPLHVF 112

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD +P    T+ P       PG   +L  +   W  +R+ ++       +Q+  A+  
Sbjct: 113 LTPDKRPFYAATFIPKMSSPNMPGMLDLLPYLASVWRDEREKVSDLS----DQIMSAIQE 168

Query: 142 SASSNKL--PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 L  PDEL   A R    +L+  YD ++GGF  APKFP    +  +L ++   +D
Sbjct: 169 QTRRGTLHDPDELIHTAAR----RLTALYDKKYGGFSPAPKFPSVPVLLFLLRYAVIHQD 224

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                       M+  TL  MA GG+ DH+ GGFHRY+ D  W +PHFEKMLYDQ   A 
Sbjct: 225 RSI-------LDMITTTLNRMAWGGMRDHLDGGFHRYATDTAWKLPHFEKMLYDQAMCAI 277

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y + + +TK   Y  + R +L+Y+   +    G   S+EDADS          EGA+Y+
Sbjct: 278 IYTEIWQVTKQDRYRRLARSVLEYMTTVLSDAPGGFSSSEDADSP-------GGEGAYYL 330

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W+  E+E I GE A L    + +   GN     +S  H    G NVL    D     S  
Sbjct: 331 WSYDEIEKIFGEEARLVCTMFGITREGN-----VSGMHGMKPGDNVLFPERDPLEILSAA 385

Query: 380 GM--PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           G+  P + Y +IL      L + R +R RP LDDKV+  WN L I + A A  +   E+ 
Sbjct: 386 GVRDPEKTYASILN----TLTNARKERERPPLDDKVLTDWNALAIQALAFAGMVFHDESL 441

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                              A SAA F+  ++       L H +RNG     G   DY  L
Sbjct: 442 CTR----------------AISAAEFLFSNMVRPDGSVL-HRWRNGQGGIEGTAGDYVHL 484

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
               + LY+    + WL  AI L+ +  + F D   GGYF    E   + +R+KE  DG 
Sbjct: 485 AWACVTLYQTTGNSLWLRRAISLEKSASDRFYDSVHGGYFQVPSET-DLPVRMKEMTDGP 543

Query: 558 EPSGNSVSVINLVRLASIVA----GSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
             S N  + + L  L +I      G KS   RQ  E+       R  D  M         
Sbjct: 544 TFSTNGAAYLLLCALFTITGDELYGQKS---RQIEEYQ------RSLDPRMITGCCTFLC 594

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
            ++    R   VL     S   + + +   +SY      IHI            E + + 
Sbjct: 595 GLIEKNLRGTAVLCNTSGSTGDDEIWSLLWSSYLPGMIRIHI-----------RERSDSY 643

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVT 701
                 +   D     +C +  C PP+T
Sbjct: 644 FLPLYVHCQGDTPALHICSHQQCYPPIT 671


>gi|453051421|gb|EME98928.1| hypothetical protein H340_19073 [Streptomyces mobaraensis NBRC
           13819 = DSM 40847]
          Length = 680

 Score =  315 bits (808), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 230/696 (33%), Positives = 333/696 (47%), Gaps = 79/696 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE  A  LN+ FVS+KVDREERPD+D VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAGESFEDEETAAYLNEHFVSVKVDREERPDIDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPP  ++G P F+ +L  V  AW  +R+ + +     ++ L+   
Sbjct: 108 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVAAAWRDRREEVGEVAGRIVQDLARRP 167

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             +A   + P     + L +    L++ +D+  GGFG APKFP  + ++ +L H  +   
Sbjct: 168 LTAAVGGQPP---AADELHMALMALTREFDAVRGGFGGAPKFPPSMVLEFLLRHHVR--- 221

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG +        MV  T + MA+GGIHD +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 222 TGSAA----ALDMVTATCEAMARGGIHDQLGGGFARYSVDNGWVVPHFEKMLYDNALLCR 277

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       +  D  D+L R+M    G   SA DADS + +G  R +EGA+YV
Sbjct: 278 VYAHLWRATGSGLARRVALDTADFLVREMRTDQGGFASALDADSDDGQG--RHREGAYYV 335

Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT ++  ++LGE  A L  +++ +   G  +           +G +VL +L DS      
Sbjct: 336 WTPEQFREVLGEADAELAADYFGVTEEGTFE-----------EGASVL-QLPDS------ 377

Query: 379 LGMPLEKYLNI--LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
                E+ ++   +   R +L   R++RPRP  DDKV+  WNGL I++ A          
Sbjct: 378 -----ERLVDAERIASVRERLLAARARRPRPGRDDKVVAGWNGLAIAALAETGAYF---- 428

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                       DR + ++ A  AA  + R   D      + S         G L+DYA 
Sbjct: 429 ------------DRPDLVQAATDAADLLVRTHMDWNARLFRTSLDGVAGGHAGVLEDYAD 476

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +  G L L        W+ +A  L +T    F D E G  F+T  +  +++ R ++  D 
Sbjct: 477 VAEGFLALSAVTGEGVWVDFAGLLLDTVLIRFRDEE-GALFDTADDAETLIRRPQDPTDN 535

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMC 610
           A PSG S +   L+  A++   + S  +R+ AE +L V         R     +AV    
Sbjct: 536 ATPSGWSAAAGALLTYAAL---TGSAPHREAAERALGVVRALGPKAPRFIGWGLAV---- 588

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A  +L  P    V +VG         +   A  S      V   +PA     +      
Sbjct: 589 -AEALLDGP--YEVAVVGPHDDPATRELHRTALLSQRPGLAVALGEPASATAAEV----- 640

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
                +A     A +  A VC+ F+C  P +DP  L
Sbjct: 641 ---PLLADRPLLAGRPAAYVCRGFTCDAPTSDPEEL 673


>gi|381163013|ref|ZP_09872243.1| thioredoxin domain-containing protein [Saccharomonospora azurea
           NA-128]
 gi|379254918|gb|EHY88844.1| thioredoxin domain-containing protein [Saccharomonospora azurea
           NA-128]
          Length = 667

 Score =  315 bits (808), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 226/702 (32%), Positives = 322/702 (45%), Gaps = 96/702 (13%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF DE VA L+N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ F
Sbjct: 49  CHWCHVMAHESFSDEDVAALMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD KP   GTY+PP   +G P F+ +L  V  AW ++RD L +     ++ + E    
Sbjct: 109 LTPDGKPFHCGTYYPPVPAHGMPSFRQLLDAVAQAWRERRDELVEGAGRIVDHIVE---- 164

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
             +    P  +    +     +L    D   GGFG APKFP  + ++ +L H    E TG
Sbjct: 165 -QTKPLGPHPVTAETVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG 220

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               + E   +V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y
Sbjct: 221 ----SVEALSIVDMTAEGMARGGIYDQLAGGFSRYSVDAGWVVPHFEKMLYDNALLLRFY 276

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T       +  +  ++L RD+  P G   S+ DAD   TEG     EG  YVWT
Sbjct: 277 AHLARRTGSALAHRVAGETAEFLLRDLRTPQGAFASSLDAD---TEGV----EGLTYVWT 329

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
            +++ D+LG     +    +                       V +E       AS L +
Sbjct: 330 PQQLVDVLGPDDGAWAAATF----------------------GVTVE-GTFERGASTLRL 366

Query: 382 PLE-----KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           P +     +++ +       L + R+ RP+P  DDKVI +WNGL I++ A A   L+   
Sbjct: 367 PRDPDDPSRWMRVTA----TLLEARNARPQPARDDKVIAAWNGLAITALAEAGVALQ--- 419

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
                        R E++E A +A +F+   H+ D    R   S R+G   +A G L+DY
Sbjct: 420 -------------RPEWVEAAVAAGAFVLDAHVSDGTVLR---SSRDGVVGEAAGVLEDY 463

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKED 553
           A L  GLL L++     +WLV A  L +T    F      G F+ T  D   L+ R  + 
Sbjct: 464 ACLADGLLSLHQATGEPRWLVEATALLDTAMRRFGVEGAPGAFHDTASDAEELVHRPSDP 523

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----L 608
            D A PSG S     L+  +++     +  YR   E ++    +R   +   VP      
Sbjct: 524 TDNASPSGASALADALLTASALAGPEHAGTYRAACEEAV----SRAGALIAQVPRFAGHW 579

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +  A  ML+ P +  V +VG  +    E ++ AA   +     +              E 
Sbjct: 580 LSVAEAMLAGPVQ--VAVVGEDAQARHELVVEAATRVHGGGVVLGG------------EP 625

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                  +A          A VC+ + C  PVT P  L + L
Sbjct: 626 EAEGVPLLADRPLVDGSPAAYVCRGYVCDRPVTTPEDLAHAL 667


>gi|428777664|ref|YP_007169451.1| hypothetical protein PCC7418_3117 [Halothece sp. PCC 7418]
 gi|428691943|gb|AFZ45237.1| hypothetical protein PCC7418_3117 [Halothece sp. PCC 7418]
          Length = 677

 Score =  315 bits (807), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 232/698 (33%), Positives = 344/698 (49%), Gaps = 94/698 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ LND FV IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDSAIAQYLNDNFVPIKVDREERPDLDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+PD + P  GGTYFP E ++GRPGF  IL+ ++  +D++++ L     F  E +   
Sbjct: 108 IFLTPDDRVPFYGGTYFPIEPRFGRPGFLDILKAIRRFYDQEKEKL---NTFKSEVMG-L 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG---GFGSAPKFPRPVEIQMMLYHSK 195
           L  SA+       LP+    L ++ L+K  ++  G     G+ P FP      M+ Y   
Sbjct: 164 LQQSAT-------LPETQTNLNSDLLTKGIETGVGITSHRGTPPSFP------MIPYAQL 210

Query: 196 KLEDTGKSGEASEGQKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            L  T  + E+    K V       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD 
Sbjct: 211 ALRGTRFNYESRYDAKDVAQQRGYDLALGGIYDHVGGGFHRYTVDGTWTVPHFEKMLYDN 270

Query: 255 GQLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           GQ+     + +S  + +  F S I + + ++L+R+M  P G  ++++DADS  T  A   
Sbjct: 271 GQIVEYLANLWSSGVEEPAFKSAIAQTV-EWLQREMTAPEGYFYASQDADSFTTSEADEP 329

Query: 313 KEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCD----LSRMSDPHNEFKGKNVLI 367
           +EGAFYVW+ +E+E +L  E     +  + +   GN +    L R +  +   + KN L 
Sbjct: 330 EEGAFYVWSDRELETLLTAEELQALQSEFTVTAEGNFEGSNVLQRQNGGNLSNEAKNALK 389

Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
           +L ++    S +        N   E +   ++ R     P  D K+I +WN L+IS  AR
Sbjct: 390 KLFNARYGNSSIATFPPATNN--SEAKTTAWEGRIP---PVTDTKMITAWNSLMISGLAR 444

Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSK 486
           A              + V G   K Y + A  A +FI  + + E + HRL +   NG + 
Sbjct: 445 A--------------YAVFG--EKTYWDCAVKATNFIWENQWVEGRFHRLNY---NGKAT 485

Query: 487 APGFLDDYAFLISGLLDLYE-FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
                +DYA  I  LLDL+       +WL  A++LQ   DE     E GGYFNT  ++ +
Sbjct: 486 VSAQSEDYALFIKALLDLHACHPEQPQWLDQAVQLQAEFDEYLWSVETGGYFNTANDNSN 545

Query: 546 -VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 604
            +++R +   D A P+ N V+V NLV+L  I    ++DY   +AE +L  F + ++    
Sbjct: 546 DLIVRERTYIDNATPAANGVAVANLVQLFEIT--EQTDYL-ASAEKTLNAFSSIMEKSPQ 602

Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
           A P +    D        H  LV   S       L A    Y L      ++ +      
Sbjct: 603 ACPGLFSGLDWY-----LHGTLVRSTSE-----QLQALMNQY-LPTCTYRVETS------ 645

Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
                              D  +ALVC+  +C  P TD
Sbjct: 646 -----------------LPDSAIALVCKGLTCLEPATD 666


>gi|418461665|ref|ZP_13032732.1| thioredoxin domain-containing protein [Saccharomonospora azurea
           SZMC 14600]
 gi|359738246|gb|EHK87140.1| thioredoxin domain-containing protein [Saccharomonospora azurea
           SZMC 14600]
          Length = 667

 Score =  315 bits (807), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 226/702 (32%), Positives = 322/702 (45%), Gaps = 96/702 (13%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF DE VA L+N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ F
Sbjct: 49  CHWCHVMAHESFSDEDVAALMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD KP   GTY+PP   +G P F+ +L  V  AW ++RD L +     ++ + E    
Sbjct: 109 LTPDGKPFHCGTYYPPVPAHGMPSFRQLLDAVAQAWRERRDELVEGAGRIVDHIVE---- 164

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
             +    P  +    +     +L    D   GGFG APKFP  + ++ +L H    E TG
Sbjct: 165 -QTKPLGPHPVTAETVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG 220

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               + E   +V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y
Sbjct: 221 ----SVEALSIVDMTAEGMARGGIYDQLAGGFSRYSVDAGWVVPHFEKMLYDNALLLRFY 276

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T       +  +  ++L RD+  P G   S+ DAD   TEG     EG  YVWT
Sbjct: 277 AHLARRTGSALAHRVAGETAEFLLRDLRTPQGAFASSLDAD---TEGV----EGLTYVWT 329

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
            +++ D+LG     +    +                       V +E       AS L +
Sbjct: 330 PQQLVDVLGPDDGAWAAATF----------------------GVTVE-GTFERGASTLRL 366

Query: 382 PLE-----KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           P +     +++ +       L + R+ RP+P  DDKVI +WNGL I++ A A   L+   
Sbjct: 367 PRDPDDPSRWMRVTA----TLLEARNARPQPARDDKVIAAWNGLAITALAEAGVALQ--- 419

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
                        R E++E A +A +F+   H+ D    R   S R+G   +A G L+DY
Sbjct: 420 -------------RPEWVEAAVAAGAFVLDAHVSDGTVLR---SSRDGVVGEAAGVLEDY 463

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKED 553
           A L  GLL L++     +WLV A  L +T    F      G F+ T  D   L+ R  + 
Sbjct: 464 ACLADGLLSLHQATGEPRWLVEATALLDTAMRRFGVEGAPGAFHDTASDAEELVHRPSDP 523

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----L 608
            D A PSG S     L+  +++     +  YR   E ++    +R   +   VP      
Sbjct: 524 TDNASPSGASALAGALLTASALAGPEHAGTYRAACEEAV----SRAGALIAQVPRFAGHW 579

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
           +  A  ML+ P +  V +VG  +    E ++ AA   +     +              E 
Sbjct: 580 LSVAEAMLAGPVQ--VAVVGEDAQARHELVVEAATRVHGGGVVLGG------------EP 625

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                  +A          A VC+ + C  PVT P  L + L
Sbjct: 626 EAEGVPLLADRPLVDGSPAAYVCRGYVCDRPVTTPEDLAHAL 667


>gi|88813137|ref|ZP_01128378.1| hypothetical protein NB231_12691 [Nitrococcus mobilis Nb-231]
 gi|88789621|gb|EAR20747.1| hypothetical protein NB231_12691 [Nitrococcus mobilis Nb-231]
          Length = 689

 Score =  315 bits (807), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 196/551 (35%), Positives = 297/551 (53%), Gaps = 56/551 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
           + CHWCHVM  ESFEDE +A+ +N+ F++IKVDREERPD+D++Y T  Q L    GGWPL
Sbjct: 54  SACHWCHVMAHESFEDETIARAMNEHFINIKVDREERPDLDRIYQTAHQLLNNRPGGWPL 113

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA---QSGAFAIEQL 135
           +VFL+P+  P   GTYFPP+  YG PGF  IL ++  A+ ++ + +    Q+   A+ +L
Sbjct: 114 TVFLTPEQMPFFCGTYFPPKSHYGLPGFHEILLQIAQAYRQQHEAIKKQNQAVLDALNRL 173

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQ-LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
           SE     A +       P+ AL   A   L++ +DS FGGFG APKFP+P  I+ +L H 
Sbjct: 174 SEPPPNRAGA-------PKAALFDNARSALAREFDSTFGGFGPAPKFPQPSSIERLLRHY 226

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            +           +  +M   TL+ MA GGI+D +GGGF RYSVD  W +PHFEKMLYD 
Sbjct: 227 AR--TAANDVPDYDALRMAQLTLRKMALGGIYDQIGGGFARYSVDNYWIIPHFEKMLYDN 284

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           GQL  +Y DA+  T +  +  +  +  ++  R+M  P G  +++ DADS   EG     E
Sbjct: 285 GQLLALYADAWRATGEELFQRVANETAEWALREMRHPDGAFYASLDADS---EGG----E 337

Query: 315 GAFYVWTSKEVEDILGE---HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           GAFY+WT +E+ ++L E     +L +          C L+   +    F+G+  L     
Sbjct: 338 GAFYLWTPEEIRNVLREDEAEVVLAR----------CGLNNQPN----FEGRWHLYVRLT 383

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
            +  A+    P ++ + +    R +L + R +RPRP  D+KV+ SWN L++S  ARA + 
Sbjct: 384 FTDLANNQHRPRQELIALWRSARERLREAREQRPRPPRDEKVLTSWNALMVSGLARAGRR 443

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
             + A +A                  +    F+  +L+  +  RL   +++G +  P +L
Sbjct: 444 FGNTALTA----------------AGDQTLHFLHSNLW--RNGRLLTVWKDGQADLPAYL 485

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           DD+A+L++ LL+  E      WL WA  + +     F D+  GG+F T  +   ++ R +
Sbjct: 486 DDHAYLLAALLEQLEARWEPHWLQWARAIADLLLARFEDKTHGGFFFTADDHEPLVQRPR 545

Query: 552 EDHDGAEPSGN 562
              D A PSGN
Sbjct: 546 PLGDDACPSGN 556


>gi|340975510|gb|EGS22625.1| hypothetical protein CTHT_0010970 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 785

 Score =  315 bits (806), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 215/656 (32%), Positives = 325/656 (49%), Gaps = 102/656 (15%)

Query: 23  HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
           H+CH+   +SF +  VA+ LN  F+ I +DREERPD+D ++  Y +A+   GGWPL++FL
Sbjct: 84  HFCHLTTQDSFSNPAVAEFLNQSFIPILIDREERPDLDTIFQNYSEAVNATGGWPLNLFL 143

Query: 83  SPDLKPLMGGTYF-----------------------PPEDKYGRPGFKTILRKVKDAWDK 119
           +PDL P+ GGTY+                       P ED YG   F  I +K+   W  
Sbjct: 144 TPDLYPIFGGTYWPGPGTEHSTLGSDRASESAIAGEPGEDSYG--DFLAIAKKIHGFWVT 201

Query: 120 KRDM--------------LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLS 165
           + +                AQ G F+    S + +++A+ N    +L  + L     +++
Sbjct: 202 QEERCRREAFEMLHKLQDFAQEGTFSTPVGSGSAASAAADNS---DLDLDQLDEALTRIA 258

Query: 166 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAK 222
           K +D  + GFG+ PKFP P  +  +L  +K   ++ D     E   G  M L TL+ +  
Sbjct: 259 KMFDPVYHGFGT-PKFPNPARLSFLLRLAKFPTEVSDVIGEREVENGTAMALKTLRRIRD 317

Query: 223 GGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF----------SLTKDVF 272
           GG+HDH+G GF R+SV + W +PHFEKM+ +   L  V+LDA+          SL  +  
Sbjct: 318 GGLHDHLGAGFMRFSVTKNWGLPHFEKMVCENALLLGVFLDAWLGYTAGPKGPSLQDE-- 375

Query: 273 YSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 330
           ++ +  ++ DYL   +I  P G   ++E ADS    G    +EGA+Y+WT +E + ++G 
Sbjct: 376 FADVVVEVADYLTGPIIRTPQGGFVTSEAADSYYRRGDKHMREGAYYLWTRREFDQVVGG 435

Query: 331 ------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
                 +HA+     Y+     + ++ + +DP +EF  +NVL    D    + + GMP  
Sbjct: 436 SGTSSDDHALAVAAAYW-NVLEDGNVPQENDPFDEFINQNVLCVNRDVVELSRQFGMPQA 494

Query: 385 KYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
           +   ++ + R KL   R K R RP  D+KV+VS NG+VIS+ AR +  LK          
Sbjct: 495 EIRRVVDDARAKLRAHREKERVRPERDEKVVVSTNGMVISALARTAAALKG--------- 545

Query: 444 PVVGSDR-KEYMEVAESAASFIRRHLYDEQT---HRLQHSFRNGPSKAPGFLDDYAFLIS 499
             V  +R   Y++ AE AASFI+  L+DE+    + L+  +   PS    F DDYAFLI 
Sbjct: 546 --VDDERAARYLKAAEQAASFIKEKLWDEKQTAGNPLRRFWYQRPSDTKAFADDYAFLIE 603

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLD----------------REGGGYFNTTGED 543
           GLLDLY      KW  WA +LQ+ Q  LF D                  GG Y N     
Sbjct: 604 GLLDLYTTTLDKKWADWAKQLQDAQIRLFYDPIVPATTGAQPSPRQAYSGGFYSNELAAI 663

Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
              +LR+K   D ++PS N+V+  NL RL ++ A   S  Y   A  ++  FE  +
Sbjct: 664 SPTILRLKSGMDKSQPSTNAVAAANLFRLGALFA---SKEYTSLARETVNAFEAEV 716


>gi|291447326|ref|ZP_06586716.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
           15998]
 gi|291350273|gb|EFE77177.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
           15998]
          Length = 679

 Score =  314 bits (805), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 206/580 (35%), Positives = 292/580 (50%), Gaps = 59/580 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFED+  A  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++V
Sbjct: 55  SCHWCHVMAHESFEDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTV 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEAL 139
           FL+PD +P   GTYFPPE ++G P F+ +L  V  AW  +RD +A+ +G    +    +L
Sbjct: 115 FLTPDAEPFYFGTYFPPEPRHGSPSFQQVLEGVTAAWTDRRDEVAEVAGRIVADLAGRSL 174

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                      E+ Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   
Sbjct: 175 VHGGDGVPGESEVAQALL-----GLTREYDEQHGGFGGAPKFPPAMVVEFLLRHYAR--- 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 227 TGAEG----ALQMAADTCTAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCR 282

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       I  +  D++ R++    G   SA DADS + +G  +  EGA+YV
Sbjct: 283 VYAHLWRTTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--KHVEGAYYV 340

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT  ++ ++LGE    F   Y+           +++     +G +VL    D+       
Sbjct: 341 WTPAQLREVLGEDDGAFAAAYF----------GVTEDGTFEEGASVLRLPGDAG------ 384

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
             P++    + G  R +L   R +RPRP  DDKV+ +WNGL I++ A             
Sbjct: 385 --PVDA-ARVAG-VRARLLAARDERPRPGRDDKVVAAWNGLAIAALAETGAYF------- 433

Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
                    DR + +E A  AA   +R HL   +  RL  + ++G      G L+DY  +
Sbjct: 434 ---------DRPDLVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDV 482

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L L        WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A
Sbjct: 483 AEGFLALAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSA 541

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            PSG + +   L+   S  A + S+ +R  AE +L V + 
Sbjct: 542 TPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGVVKA 578


>gi|239990319|ref|ZP_04710983.1| hypothetical protein SrosN1_23633 [Streptomyces roseosporus NRRL
           11379]
          Length = 673

 Score =  314 bits (805), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 206/580 (35%), Positives = 292/580 (50%), Gaps = 59/580 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFED+  A  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++V
Sbjct: 49  SCHWCHVMAHESFEDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEAL 139
           FL+PD +P   GTYFPPE ++G P F+ +L  V  AW  +RD +A+ +G    +    +L
Sbjct: 109 FLTPDAEPFYFGTYFPPEPRHGSPSFQQVLEGVTAAWTDRRDEVAEVAGRIVADLAGRSL 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                      E+ Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   
Sbjct: 169 VHGGDGVPGESEVAQALL-----GLTREYDEQHGGFGGAPKFPPAMVVEFLLRHYAR--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 221 TGAEG----ALQMAADTCTAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCR 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       I  +  D++ R++    G   SA DADS + +G  +  EGA+YV
Sbjct: 277 VYAHLWRTTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--KHVEGAYYV 334

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT  ++ ++LGE    F   Y+           +++     +G +VL    D+       
Sbjct: 335 WTPAQLREVLGEDDGAFAAAYF----------GVTEDGTFEEGASVLRLPGDAG------ 378

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
             P++    + G  R +L   R +RPRP  DDKV+ +WNGL I++ A             
Sbjct: 379 --PVDA-ARVAG-VRARLLAARDERPRPGRDDKVVAAWNGLAIAALAETGAYF------- 427

Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
                    DR + +E A  AA   +R HL   +  RL  + ++G      G L+DY  +
Sbjct: 428 ---------DRPDLVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDV 476

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L L        WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A
Sbjct: 477 AEGFLALAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
            PSG + +   L+   S  A + S+ +R  AE +L V + 
Sbjct: 536 TPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGVVKA 572


>gi|428201584|ref|YP_007080173.1| thioredoxin domain-containing protein [Pleurocapsa sp. PCC 7327]
 gi|427979016|gb|AFY76616.1| thioredoxin domain protein [Pleurocapsa sp. PCC 7327]
          Length = 685

 Score =  314 bits (805), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 238/713 (33%), Positives = 336/713 (47%), Gaps = 110/713 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEREAFSDSAIAEYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL P DL P  GGTYFP E +YGRPGF  +L+ ++  +D +++ L      A++Q  E 
Sbjct: 108 IFLIPGDLVPFYGGTYFPLEPRYGRPGFLQVLQSIRRFYDVEKEKLD-----ALKQ--EI 160

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKL 197
           L     S  LP     +   L  E L +  ++  G     A  F RP    M+ Y S  L
Sbjct: 161 LGGLKQSTILPISTSDS---LSKELLYRGVETNTGVISIGASDFGRP-SFPMIPYASLAL 216

Query: 198 EDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
           + +    E+  +G+++     + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ
Sbjct: 217 QGSRFQFESRYDGRQLSARRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQ 276

Query: 257 LANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           +     + +S   K+  +       + +L+R+M  P G  ++A+DADS  +  A+  +EG
Sbjct: 277 ILEYLSNLWSAGMKEPAFERAIAGTVAWLKREMTTPEGYFYAAQDADSFTSTEASEPEEG 336

Query: 316 AFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           AFYVW   E+E IL    +   K  + +   GN            F+G NVL        
Sbjct: 337 AFYVWRYDELEKILTADELEELKAAFTITEKGN------------FEGSNVL-----QRK 379

Query: 375 SASKLGMPLEKYLNILGECR--RKLFDVRSKRPRPH----------------LDDKVIVS 416
            + KL   LE  L+ L E R   K  ++ +  P  +                 D K+I +
Sbjct: 380 ESGKLSDSLEAILDKLFEVRYGAKSTEIETFVPARNNQEAKTGNWKGRIPAVTDTKMIAA 439

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHR 475
           WN L IS  ARA          A+F  P        Y E+A  AA FI  + + E + HR
Sbjct: 440 WNSLTISGLARA---------YAVFGEP-------SYWELATRAAKFILEYQWIEGRFHR 483

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGG 534
           L +    G +      +DYAF I  LLDL     + T WL  A+E+Q   DE F   E G
Sbjct: 484 LNY---EGQATVLAQSEDYAFFIKALLDLQAASPTETFWLEKAVEVQQEFDEFFWSLEMG 540

Query: 535 GYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           GYFNT  +D   +L+R +   D A P+ N V++ NL+R+A +    +   Y   AE  L 
Sbjct: 541 GYFNTAADDSGDLLVRSRSYIDNATPAANGVAIANLIRIALLTENLE---YLDRAEQGLQ 597

Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
            F   L+    A P +  A D        H  LV  K     E  L      Y    TV+
Sbjct: 598 AFSAVLQQSPQACPSLFAALDWY-----LHATLVRTK-----EEQLKTLIPQY--FPTVV 645

Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
           +   +D  E                      K V ++C+  SC  P      L
Sbjct: 646 YRIESDLPE----------------------KAVGIICRGLSCLEPAQSQAQL 676


>gi|375102437|ref|ZP_09748700.1| thioredoxin domain containing protein [Saccharomonospora cyanea
           NA-134]
 gi|374663169|gb|EHR63047.1| thioredoxin domain containing protein [Saccharomonospora cyanea
           NA-134]
          Length = 670

 Score =  314 bits (805), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 219/698 (31%), Positives = 325/698 (46%), Gaps = 85/698 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF D+ VA  +N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ F
Sbjct: 49  CHWCHVMAHESFADDDVAAFMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD +P   GTY+PP   +G P FK +L  V  AW ++RD L +     ++ ++E    
Sbjct: 109 LTPDAEPFHCGTYYPPVPAHGIPAFKQLLTAVDQAWRERRDELVEGAGRIVDHIAE---- 164

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
             +    P  +  + +     +L    D   GGFG APKFP  + ++ +L H    E TG
Sbjct: 165 -QTGPLSPHPVTGDTVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG 220

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               + E   +V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y
Sbjct: 221 ----SVEALSIVDMTAEGMARGGIYDQLAGGFARYSVDSGWVVPHFEKMLYDNALLLRFY 276

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T       +  +  ++L RD+  P G   ++ DAD+   EG T       YVWT
Sbjct: 277 AHLARRTDSPLAHRVAGETAEFLLRDLRTPQGAFAASLDADTEGVEGLT-------YVWT 329

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            +++ ++LG +      E + +   G             F+     ++L      AS   
Sbjct: 330 PQQLVEVLGPDDGAWAAETFGVTEEGT------------FEHGASTLQLRRDPDDAS--- 374

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
               +++ +       L   R+ RP+P  DDKVI +WNGL I++ A A   L+       
Sbjct: 375 ----RWMRVT----SALLQARNARPQPARDDKVIAAWNGLAITALAEAGVALQ------- 419

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLI 498
                    R E++E A +A +F+   H   +    L+ + R+G    A G L+DY  L 
Sbjct: 420 ---------RPEWVEAAVAAGAFVLDVHAGGDTAGGLRRTSRDGVVGTAAGVLEDYGCLA 470

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGA 557
            GLL L++    + WLV A  L +T    F      G F+ T  D   L+ R  +  D A
Sbjct: 471 DGLLALHQATGESVWLVEATTLLDTALRRFGVEGAPGAFHDTAADAEALVHRPSDPTDNA 530

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCA 612
            PSG S     L+  +++    ++  YR   E +L    +R   +   VP      +  A
Sbjct: 531 SPSGASALAGALLPASALAGPERAGTYRAACEEAL----SRAGALVAQVPRFAGHWLSVA 586

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
             +LS P +  V +VG  ++   E ++ AA   +     +     AD   +         
Sbjct: 587 EALLSGPVQ--VAVVGTDAADRAELVVEAARRVHGGGVVLGGSPEADGVPL--------- 635

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
              +A    +     A VC+ + C  PVT P +L   L
Sbjct: 636 ---LADRPLADGAPAAYVCRGYVCDRPVTTPEALARSL 670


>gi|428772641|ref|YP_007164429.1| hypothetical protein Cyast_0808 [Cyanobacterium stanieri PCC 7202]
 gi|428686920|gb|AFZ46780.1| protein of unknown function DUF255 [Cyanobacterium stanieri PCC
           7202]
          Length = 686

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 211/611 (34%), Positives = 314/611 (51%), Gaps = 72/611 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWC VME E+F D  +A  LN  F++IKVDREERPD+D +YM  +Q + G GGWPL++
Sbjct: 49  SCHWCTVMEGEAFSDGAIADYLNQNFIAIKVDREERPDIDSIYMQGLQMMTGQGGWPLNI 108

Query: 81  FLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           FL+P DL P  GGTYFP E +YGRPGF  IL  + + + ++ D L       +  L   +
Sbjct: 109 FLTPHDLVPFYGGTYFPLEPRYGRPGFLQILESIHNFYHQQTDKLNALKEEIVSILENNI 168

Query: 140 SASAS-SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           + + S  N L  +L    L   ++ L +   + +GG    P+FP      MM Y +  L 
Sbjct: 169 NLNPSIENHLNTKLLIQGLEKNSQILGR---NEYGG----PRFP------MMPYSNTTLT 215

Query: 199 --DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
              T     A +  ++ +     +  GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G 
Sbjct: 216 AIHTLPPETAQKAHQLGIQRGIDLVNGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGL 275

Query: 257 LANVYLDAFSLTK-DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           +     + +S  K +  Y   C   L +L R+M+ P G  +SA+DAD+         +EG
Sbjct: 276 IMEFLANLWSSGKENPQYHIACEGTLQWLEREMVAPEGYFYSAQDADNFGNIQDEEPEEG 335

Query: 316 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
            FYVW   +++ IL  E  I  +E + +   GN            F+GKNVL +  D  A
Sbjct: 336 EFYVWHYLDLQQILSHEELIALQEVFTISNEGN------------FEGKNVLQKHPD-KA 382

Query: 375 SASKLGMPLEKYLNI-LGECRRKLFDVRSKRPR-------------PHLDDKVIVSWNGL 420
               +   L+K   +  G+   +L      R               P  D K+IV+WN L
Sbjct: 383 ITPMVKNALDKLFTMRYGQTPERLTTFPPARNNHEAKSLEWLGRIPPVTDTKMIVAWNSL 442

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ-THRLQHS 479
           +IS  ARA  + K+E                +Y+E+AESA  FI ++ ++ Q  +RL + 
Sbjct: 443 MISGLARAYGVFKNE----------------KYLELAESAVKFILKNQWENQRLYRLNYG 486

Query: 480 FRNGPSKAPGFLDDYAFLISGLLDLYE--FGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
            +          +DYAFL+  LLDL +    +G  WL  AI++Q   D+   D++ GGY+
Sbjct: 487 NK---VSVLAQSEDYAFLVKALLDLQQNSLNAGNYWLEKAIKVQQEFDDYCYDQKNGGYY 543

Query: 538 NTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           N   ++ S +L++ K   D A PS N V+V NL+RL  +      DY+ + AE +L +F 
Sbjct: 544 NNAYDNSSDLLIKEKGYIDNATPSPNGVAVANLLRLGLMT--DNLDYFEK-AEQTLKIFA 600

Query: 597 TRLKDMAMAVP 607
            ++ +  ++ P
Sbjct: 601 DKMVNSPVSCP 611


>gi|350269357|ref|YP_004880665.1| hypothetical protein OBV_09610 [Oscillibacter valericigenes
           Sjm18-20]
 gi|348594199|dbj|BAK98159.1| hypothetical protein OBV_09610 [Oscillibacter valericigenes
           Sjm18-20]
          Length = 642

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 201/578 (34%), Positives = 291/578 (50%), Gaps = 81/578 (14%)

Query: 3   RRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDV 59
           + +F   T+  +  FL    ++CHWCHVM  ESFEDE VA +LN  FVS+KVDREERPD+
Sbjct: 47  QEAFKKATRENKPVFLSIGYSSCHWCHVMAKESFEDETVAGVLNKSFVSVKVDREERPDI 106

Query: 60  DKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK 119
           D +YM   Q   GGGGWP SVF++PD KP   GTYFP      +  F  +L  +++ W +
Sbjct: 107 DNIYMRVCQTFTGGGGWPTSVFMTPDQKPFFAGTYFP------KAPFLDLLEVIREKWAE 160

Query: 120 KRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAP 179
            +  L   G     Q++E L+ S  S + P   P   ++     L +++D+ FGGFG AP
Sbjct: 161 DKQALLNQG----NQITETLTHSTHSPQTPQTAP---IKAAVSALKETFDNEFGGFGRAP 213

Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
           KFP P  + ++L  +  + +                TL  M KGGI D +G GF RYS D
Sbjct: 214 KFPTPHILYLLLKTAPDMAEK---------------TLIQMYKGGIFDQIGFGFSRYSTD 258

Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
             W VPHFEKMLYD   LA  YL AF  T    Y  +    L Y+ RD+  P G  FSA+
Sbjct: 259 RFWLVPHFEKMLYDNALLATAYLMAFEQTGRELYRTVAEKTLLYMERDLGSPEGGFFSAQ 318

Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHN 358
           DADS         +EG +YV+  +E+  +LGE     F  ++ +   GN           
Sbjct: 319 DADS-------DGEEGKYYVFKPEELTALLGEAEGRRFNAYFGITQNGN----------- 360

Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
            F+G ++   +N+SS   S     ++K+L        K+++ R  R     D KV+ SWN
Sbjct: 361 -FEGYSIPNLINNSSMDDS-----VDKFL-------PKVYEYRKSRTSLRTDQKVLTSWN 407

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
            L +++ A A +I+                 ++ Y++ A     F+ R + D  T  +  
Sbjct: 408 ALALAACANAYRII----------------GKRAYLDTALKTFGFMEREVTDGDT--VFC 449

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
              +G     GFLDDYAF I  L+ L++      +L+ A +LQ      + D + GG+F 
Sbjct: 450 GVTDGVRGGVGFLDDYAFYIYALICLHQATQDPAFLIRAQDLQIKAISEYFDDQNGGFFF 509

Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
           +   +  ++   KE +DGA PSGNSV   NL RL ++ 
Sbjct: 510 SGKSNEKLIFNPKETYDGAIPSGNSVMAYNLARLYALT 547


>gi|411002310|ref|ZP_11378639.1| hypothetical protein SgloC_05852 [Streptomyces globisporus C-1027]
          Length = 673

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 226/691 (32%), Positives = 326/691 (47%), Gaps = 85/691 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFED+  A  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++V
Sbjct: 49  SCHWCHVMAHESFEDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEAL 139
           FL+PD +P   GTYFPPE ++G P F+ +L  V  AW  +R+ +A+ +G    +    +L
Sbjct: 109 FLTPDAEPFYFGTYFPPEPRHGSPSFQQVLEGVTTAWTDRREEVAEVAGRIVADLAGRSL 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                      E+ Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   
Sbjct: 169 VHGGDGVPGESEVAQALL-----GLTREYDEQHGGFGGAPKFPPAMAVEFLLRHYAR--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 221 TGAEG----ALQMAADTCAAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCR 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       I     D++ R++    G   SA DADS + EG  R  EGAFYV
Sbjct: 277 VYAHLWRATGSDEARRIALKTADFMVRELRTAEGGFASALDADSEDAEG--RHVEGAFYV 334

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT +++ ++LGE    F   Y+           +++     +G +VL    D+       
Sbjct: 335 WTPEQLREVLGEDDAAFAAAYF----------GVTEEGTFEEGASVLRLPGDTG------ 378

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
             P++    + G  R +L   R +RP P  DDKV+ +WNGL I++ A             
Sbjct: 379 --PVDA-ARVAG-VRARLLAARDERPHPGRDDKVVAAWNGLAIAALAETGAYF------- 427

Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
                    DR + +E A  AA   +R HL   +  RL  + ++G      G L+DY  +
Sbjct: 428 ---------DRPDLVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDV 476

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L L        WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A
Sbjct: 477 AEGFLALAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCA 612
            PSG + +   L+   S  A + S+ +R  AE +L V    +K +   VP      +  A
Sbjct: 536 TPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGV----VKALGPRVPRFVGWGLAVA 588

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHIDPADTEEMDFWEEHNS 671
             +L  P  + V + G                  +L++T ++   P          +  +
Sbjct: 589 EALLDGP--REVAVAGPVGG--------------ELHRTALLGRAPGAVVAAGEGPDAGA 632

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
               +        +  A VC++F C  P TD
Sbjct: 633 EFPLLVDRPLVGGEPTAYVCRHFVCDAPTTD 663


>gi|144899665|emb|CAM76529.1| Protein of unknown function DUF255 [Magnetospirillum
           gryphiswaldense MSR-1]
          Length = 650

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 222/695 (31%), Positives = 321/695 (46%), Gaps = 104/695 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFE+  +A L+N  FV++K+DREERPD+D +Y   +Q +   GGWPL+
Sbjct: 53  SACHWCHVMAHESFENPEIAALMNRLFVNVKIDREERPDLDAIYQQALQHMGQHGGWPLT 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +F +PD KP  GGTYFPP  +YGRPGF  +L+ + D W + RD +  +    +  L EAL
Sbjct: 113 MFCTPDGKPFWGGTYFPPAPRYGRPGFPEVLQAIHDLWQRDRDRVDHN----VAALVEAL 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +     +  P  L    L   A+ +    D   GG G APKFP+P     +   +K+   
Sbjct: 169 AHDGGGDASP--LTLEMLDRGAKAILSHVDMEHGGLGGAPKFPQPGLFDYLWRSAKR--- 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG SG      + V  TL  + +GGI DH+GGGF RYS D+ W  PHFEKMLYD GQL +
Sbjct: 224 TGNSGL----HQAVTLTLDRICQGGITDHLGGGFMRYSTDDVWLAPHFEKMLYDNGQLID 279

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +    +  T++  +     + + ++ R+M+    E  +   A  A++EG     EG FY 
Sbjct: 280 LLTLVWQDTQNPLFQTRIEECITWVSREML---AEGAAFAAALDADSEG----HEGRFYT 332

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           W ++E+ D+LG E A +F + Y +   GN            ++G N+   LN S      
Sbjct: 333 WKAQEIIDLLGPETARIFAQAYDVSIQGN------------WEGVNI---LNRSKPQG-- 375

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                 ++   L + R  L   R+ R RP  DDKV+  WNG++I+  ARA  +       
Sbjct: 376 -----HEHEEQLAQARTILLAARANRIRPGRDDKVLADWNGMMIAGLARAGFVFI----- 425

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFI--RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                      R +++++AE A + I  +  L D+   RL HS     +   GF DD A 
Sbjct: 426 -----------RPDWLDMAERAFAVITDKMTLADD---RLAHSLCQEQASHVGFADDLAH 471

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +    L LY+      +L WA       D    D+  GGYF        V++R K   D 
Sbjct: 472 MARAALALYQATGKADYLTWAETWVAAADRHHWDKAKGGYFQVAHSASDVIVRTKTVMDA 531

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PS N   V  L  LA I   +    Y   A+  + VF  +  D               
Sbjct: 532 AVPSANGTMVQVLAILAQI---TDKPAYADRAQAVVTVFMDQFND--------------- 573

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN-KTVIHIDPADTEEMDFWEEHNSNNAS 675
                             F NM +A    +DL    V+   P +  EM     H +    
Sbjct: 574 -----------------HFANM-SALLTGFDLAVDPVLVTLPRNNAEMIDVVRHAALPNL 615

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           + R     D+V+A +C+N  CS P   P  L  +L
Sbjct: 616 IIR---WTDEVMATLCRNSVCSAPTGSPADLARML 647


>gi|400597948|gb|EJP65672.1| DUF255 domain protein [Beauveria bassiana ARSEF 2860]
          Length = 731

 Score =  313 bits (803), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 202/615 (32%), Positives = 321/615 (52%), Gaps = 70/615 (11%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+C +M  ESF +   A +LND F+ + +DRE RPD+D +YM YVQA+   GG
Sbjct: 75  HIGYKACHYCRLMSTESFANTECAAVLNDAFIPVLIDRESRPDLDTIYMNYVQAVSSVGG 134

Query: 76  WPLSVFLSPDLKPLMGGTYFPPEDKYGRP---------GFKTILRKVKDAWDKKR----- 121
           WPL++F++P+L+P+ GGTY+P  +   R           F TI++KV+D W ++      
Sbjct: 135 WPLNLFVTPELEPIFGGTYWPGPNAAPRAHDENAEDALDFLTIVKKVRDIWKEQEARCRK 194

Query: 122 ---DMLAQSGAFAIE------QLSEALSASASSNKLP--DELPQNALR-------LCAEQ 163
              ++LAQ   FA E       +++A + + S    P   E  Q A++       L  +Q
Sbjct: 195 EATEVLAQLREFAAEGTLGTRAIAQAQTIAPSGWAAPAHSEQTQEAVKNVSVSSELDLDQ 254

Query: 164 LSKSY-------DSRFGGFGSAPKFPRPVEIQMMLY---HSKKLEDTGKSGEASEGQKMV 213
           + ++Y       D  +GGFG APKF  P ++Q ++        ++D     E +    M 
Sbjct: 255 VEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLQFLIGLRDSPSAVQDIVGEAECTHALDMA 314

Query: 214 LFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF----SLT 268
           + TL+ +  G +HDHVG  GF R SV   W +P+FEK++ D  QL ++YL A+       
Sbjct: 315 VDTLRKIRDGALHDHVGNTGFARCSVTPDWTIPNFEKLVVDNAQLLSLYLTAWRRAGGQA 374

Query: 269 KDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
              FY+ I  ++  YL    ++   G + S+E ADS   +G    KEGAFY+WT +E + 
Sbjct: 375 TSEFYN-IVLELATYLTSTPILRSDGLLASSEAADSYARKGDGEMKEGAFYLWTKREFDS 433

Query: 328 IL-----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 382
           ++     G   ++   H+ +   GN D     DP+ +F  +N+L  +  S   + +L +P
Sbjct: 434 VIEAAEKGASPVV-AAHWGILEDGNID--EQHDPNEDFMNQNILRVVKTSEELSKQLNIP 490

Query: 383 LEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
           +EK    +   +++L   R S+R RP +DDK +  WNGL +S+ A+ S+ +K+ +     
Sbjct: 491 VEKVEQTIRTSQKELKARRESERVRPEVDDKAVTGWNGLALSALAKTSRAVKTTS----- 545

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
             P + +   +   VA   ASFI++ L+D Q  ++ +    G     GF DDYA++I GL
Sbjct: 546 --PELSA---KCATVASGIASFIQKQLWDAQA-KILYRVWTGERDTEGFADDYAYVIQGL 599

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           LDL++       + +A  LQ  Q   F D   GG+F T     S +LR+K+  D + PS 
Sbjct: 600 LDLFDTNGDESLIEFADALQKAQSSYFYD-PAGGFFTTKAGSSSAILRLKDGMDTSLPST 658

Query: 562 NSVSVINLVRLASIV 576
           N+VSV NL RL  ++
Sbjct: 659 NAVSVANLYRLGHLL 673


>gi|311746315|ref|ZP_07720100.1| dTMP kinase [Algoriphagus sp. PR1]
 gi|126576550|gb|EAZ80828.1| dTMP kinase [Algoriphagus sp. PR1]
          Length = 678

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 204/562 (36%), Positives = 281/562 (50%), Gaps = 59/562 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFED+  A L+N+ FV IK+DREERPD+D +YM  VQA+   GGWPL+
Sbjct: 51  SACHWCHVMERESFEDKLTADLMNESFVCIKIDREERPDIDNIYMDAVQAMGLQGGWPLN 110

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS----GAFAIEQL 135
           VFL P+ KP  GGTYFP +       +K +L  + DA+    D LA+S    G       
Sbjct: 111 VFLMPNQKPFYGGTYFPNQQ------WKNLLANIADAFANHEDKLAESAEGFGRSIARNE 164

Query: 136 SEALSASASSNKL-PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
           +E     +   +L PDEL +  L     QLS   DS +GG    PKFP P     +L   
Sbjct: 165 TEKYGIRSGKIELDPDELAEAVL-----QLSSQIDSEWGGMNRIPKFPMPAIWNFIL--- 216

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
               D     ++   +  VLFTL+ M  GGI+D + GGF RYSVD  W  PHFEKMLYD 
Sbjct: 217 ----DYALLSKSQNLEDKVLFTLKKMGMGGIYDQLKGGFARYSVDGEWFAPHFEKMLYDN 272

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           GQL  +Y  A+  + D F+    ++   +L  +M+   G   +A+DADS   EG     E
Sbjct: 273 GQLLELYAKAYQTSHDDFFLEKIQETYTWLLDEMLQEEGGFHAAQDADS---EGV----E 325

Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           G FY WT +E+  I+ E    F E Y LKP GN +            G N+L +    S 
Sbjct: 326 GKFYTWTYEELSSIIPEEMPWFAELYNLKPQGNWE-----------DGINILFQTKSYSE 374

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
            A+   +  E     L E +  L  +R++R  P  DDKV+  WN L+IS   +A      
Sbjct: 375 VAAAHNLSEEVLNQKLKEVKATLLSIRNQRIYPGKDDKVLCGWNALMISGLVQAY----- 429

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                        SD+K ++++A S   FI + +  ++  RL  S++NG +  P FL+DY
Sbjct: 430 ----------FATSDQK-FLDLALSNRDFISKKVTVDR--RLYRSYKNGVAYTPAFLEDY 476

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A LI   + L+E  S    L  A  L     + F D   G +F        ++   KE  
Sbjct: 477 AALIKADIMLFEATSEASHLKSAERLTKIVLDEFYDENDGFFFFNNPSSEKLIANKKELF 536

Query: 555 DGAEPSGNSVSVINLVRLASIV 576
           D   PS NS+   NL +L+ + 
Sbjct: 537 DNVIPSSNSLMARNLHQLSILT 558


>gi|410479889|ref|YP_006767526.1| thioredoxin [Leptospirillum ferriphilum ML-04]
 gi|406775141|gb|AFS54566.1| conserved hypothetical protein containing a thioredoxin domain
           [Leptospirillum ferriphilum ML-04]
          Length = 699

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 216/698 (30%), Positives = 335/698 (47%), Gaps = 64/698 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLS 79
            CHWCHVM  ESFE   +A ++N++FV+IKVDREERPD+D++Y M +       GGWPL+
Sbjct: 59  ACHWCHVMAHESFERPDIASVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLT 118

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+P   P  GGTYFP + ++G PGF  +L +++D +   R+ L +     ++ L +  
Sbjct: 119 MFLTPSQVPFAGGTYFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTN 178

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             + S     D  P  AL      L   +D  FGGFG APKFP  +++  +    ++ + 
Sbjct: 179 PVADSREFELDLSPSEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFQR 232

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            G S  A     M   TL  M +GGI D VGGGF RYSVDERW +PHFEKMLYD   L  
Sbjct: 233 KGDSTAA----HMATLTLSSMKRGGIWDQVGGGFARYSVDERWLIPHFEKMLYDNALLLE 288

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
                 S++K+  YS    +++ +L R+M    G  +S+ DADS   EG    +EG FYV
Sbjct: 289 ALSLGASVSKNPVYSRTAEELVGWLFREMRSDDGVYYSSLDADS---EG----EEGRFYV 341

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV-LIELNDSSASASK 378
           + ++EV  IL +        YY           +S P N F+G    L E       + +
Sbjct: 342 FQAEEVRSILSDEEYRVVSKYY----------GLSGPPN-FEGHAWNLYEARSIGELSKE 390

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             +        +   R+KLF  RS R RP LDDKV+ SWN L+              A++
Sbjct: 391 FHLSESDIERRIESARQKLFAYRSTRVRPGLDDKVLASWNALM--------------AKA 436

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
            +F+  ++G  ++E++        ++ R ++  +   L   +       P +LDDYAFL+
Sbjct: 437 LLFSGRILG--KQEWISAGRKTIDYMHRKMW--KNGLLMAVYSKKEPFLPAYLDDYAFLL 492

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
             +L+        + L +A  + +     F D E GG++ T     +++ R K  HDGA 
Sbjct: 493 LAVLESMRIDFRPEDLSFATTIADVLLAEFYDPESGGFYFTGKNHEALIHRPKNGHDGAL 552

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN+ +V  L+ L ++        Y   A+ +L ++  ++K+       M  A +  S 
Sbjct: 553 PSGNAAAVQGLLWLGTLTGHLP---YTSAADKTLRLYFAQMKEQPAGYTTMISALETYS- 608

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
              + VV +    + D+++ ++      D    V+ +  A  + +   E          R
Sbjct: 609 -DSQPVVFLAGPQAGDWKDKISCG---VDTEAFVLDLTNAVRDSLPLPEG--------MR 656

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
            +F  +K    VC+   C P      SL+  L   P S
Sbjct: 657 KHFPENKTTGWVCRGTMCLPSADSLESLQEQLRLWPLS 694


>gi|408794723|ref|ZP_11206328.1| PF03190 family protein [Leptospira meyeri serovar Hardjo str. Went
           5]
 gi|408461958|gb|EKJ85688.1| PF03190 family protein [Leptospira meyeri serovar Hardjo str. Went
           5]
          Length = 689

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 218/692 (31%), Positives = 341/692 (49%), Gaps = 81/692 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESFED+  A++LN  FV IK+DREERPD+DK+YM  + A+   GGWPL+
Sbjct: 54  STCHWCHVMERESFEDDSTAEVLNRDFVCIKLDREERPDIDKIYMDALHAMGTQGGWPLN 113

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+P  +P++GGTYFPPE++YG+  FK +LR V DAW  +R+ L  + A  + Q     
Sbjct: 114 MFLTPTKEPILGGTYFPPENRYGKRSFKEVLRLVSDAWKNQREELI-TAATDLTQYLRDN 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF--GSAPKFPRPVEIQMM--LYHSK 195
               +  K+P    +  +    E+  + YD  F GF   S  KFP  + +  +   Y  K
Sbjct: 173 ETRPNEGKVP---AKEIIEKNFERYVQVYDKEFFGFKTNSVNKFPPSMALSFLTEFYLLK 229

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           K              +M   T   M  GGI+D VGGG  RY+ D  W VPHFEKMLYD  
Sbjct: 230 K---------DPRALEMAFNTAYAMKSGGIYDQVGGGICRYATDHEWLVPHFEKMLYDN- 279

Query: 256 QLANVYLDAFSL----TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
              ++Y++A +L    T++ F+  + R+I+ Y+RRDM    G I SAEDADS   EG   
Sbjct: 280 ---SLYVEALALLYKATEEPFFLEVIREIVTYIRRDMTLGSGGIASAEDADS---EG--- 330

Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
            +EG FY+W   E   I+ E  I      +   T   +    +  H  +KGKN  ++   
Sbjct: 331 -EEGKFYIWNHSEFNQIVPEEEI----QGFWNVTEEGNFEHQNILHVYWKGKNPFVD--- 382

Query: 372 SSASASKLGMPLE-KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
                   G+  + +++N + + + KL   RS+R RP  DDKV+ SWN L I +   A +
Sbjct: 383 --------GIQFKPEFINKIEKTKEKLLAHRSQRIRPLRDDKVLTSWNCLWIRALLSAYE 434

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           +                S   EY+  A+    FI + L  +    L+  FR G +K  G 
Sbjct: 435 V----------------SGDTEYLNDAKKIYRFITKQLVGDDGSILRR-FREGEAKYFGT 477

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIEL-QNTQDELFLDREG--GGYFNTTGEDPSVL 547
           L DY   I   + L++     +    A E+ + + D +F + E   G ++ +   +  ++
Sbjct: 478 LPDYTEFIWVSMKLFQLDEDIE----AYEIGKKSLDYVFANFESKVGPFYESYHGNEDLI 533

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
           +R  E +DG EPSGNS ++++L  L   +   K D  ++ A    A F   L   +++ P
Sbjct: 534 VRTIEGYDGVEPSGNS-TILHLFYLLFSIGYKKVD-LQKKANSIFAYFLPELTQNSLSYP 591

Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
            M  A      PS++ +V+     + + + +        D N   + ++ ++ + +    
Sbjct: 592 SMISAFQKFQYPSKEVLVVYKGYDAAEIKEIRKKLSELKDPNLVWLVLEESNAKAL---- 647

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPP 699
              +    +     +   ++  VC+NFSC  P
Sbjct: 648 ---APELELLTGRSAGSGILYYVCRNFSCELP 676


>gi|434397636|ref|YP_007131640.1| protein of unknown function DUF255 [Stanieria cyanosphaera PCC
           7437]
 gi|428268733|gb|AFZ34674.1| protein of unknown function DUF255 [Stanieria cyanosphaera PCC
           7437]
          Length = 684

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 207/616 (33%), Positives = 304/616 (49%), Gaps = 67/616 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A+ LN  FV+IKVDREERPD+D +YM  VQ + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIAEYLNVNFVAIKVDREERPDLDSIYMQAVQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP + +Y RPGF  +L+ V   + + +  L     F  E LS  
Sbjct: 108 IFLTPGDLVPFYGGTYFPLQPRYNRPGFLDVLQAVLRFYQEDKAKLEH---FKTEILSHL 164

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
             ++    + PD L +  L    E  +        G  S P  P            ++  
Sbjct: 165 QQSTVLPLETPDSLTKQLLFAGIETNTGVISPNDLGRPSFPMIPYATLALQGSRFKQEFR 224

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
              +      G+ +VL        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+ 
Sbjct: 225 YNPQELSWQRGKDLVL--------GGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIL 276

Query: 259 NVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
               + +S   ++   +    + +++L+R+M  P G  ++A+DADS     A   +EG+F
Sbjct: 277 EYLANLWSAGCQEPEIALAVTETVNWLKREMTAPNGYFYAAQDADSFVDVDAVEPEEGSF 336

Query: 318 YVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW  +E+ D L  E     +  + +   GN            F+GKNVL      + S 
Sbjct: 337 YVWNYQELADNLTAEELTELQTEFTVSVEGN------------FEGKNVLQRRQSGNLSD 384

Query: 377 SKLGMPLEKYLNI-LGECRRKLFDVRSKRPR-------------PHLDDKVIVSWNGLVI 422
           S L   LEK   I  G+ +  L      R               P  D K+IV+WN +VI
Sbjct: 385 S-LTNTLEKLFTIRYGQAKESLAIFTPARNNHEAKTTPWQGRIPPVTDTKMIVAWNSIVI 443

Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFR 481
           S  AR   +  ++                 Y+++A +A +FI +H + DE+ HRL +   
Sbjct: 444 SGLARVYAVFGNQL----------------YLDLAVTATNFILQHQWLDERFHRLNY--- 484

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
           +G ++ P   +DYA  I  LLDL       ++WL  A+ +Q   D+L    E GGY+N++
Sbjct: 485 DGLAQVPAQSEDYALFIKALLDLQAATPEKSQWLEQAVRIQTEFDQLLWSNEMGGYYNSS 544

Query: 541 GEDPSVLLRVKEDH--DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
             D +  L ++E    D A P+ N V+V NLVRL+ +    +   Y   AE +L  F + 
Sbjct: 545 NTDANQELLIQERSYIDNATPAANGVAVTNLVRLSLLTDNLE---YLDRAEQALQAFSSV 601

Query: 599 LKDMAMAVPLMCCAAD 614
           +     A P +  A D
Sbjct: 602 MTRSPQACPTLFVALD 617


>gi|408671866|ref|YP_006871614.1| protein of unknown function DUF255 [Emticicia oligotrophica DSM
           17448]
 gi|387853490|gb|AFK01587.1| protein of unknown function DUF255 [Emticicia oligotrophica DSM
           17448]
          Length = 679

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 220/700 (31%), Positives = 337/700 (48%), Gaps = 101/700 (14%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE+E +A+++N   V IKVDREERPDVD +YM  +QA+   GGWPL+VF
Sbjct: 50  CHWCHVMERESFENEQIAQIMNQHLVCIKVDREERPDVDAIYMDALQAMGLRGGWPLNVF 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIEQLSEALS 140
           L PD KP  GGTYFPP +      +  ++  + +A+   R+ L +S   F    L +   
Sbjct: 110 LMPDAKPFYGGTYFPPRN------WANLVESIANAFKNDREKLQKSAEGFTQNMLVKESD 163

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
               S +      +  L     +L + +D   GG   +PKFP P   + ++ +     D 
Sbjct: 164 KYRMSVEDTLSFSEEELTTIFNRLHQDFDFEKGGMNRSPKFPMPSIWKFLIRYYSITND- 222

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     + ++ TL  +A GGI+D +GGG+ RYS DE W VPHFEKMLYD GQL ++
Sbjct: 223 ------KRAYQHLIHTLNRVALGGIYDTIGGGWTRYSTDEDWKVPHFEKMLYDNGQLISL 276

Query: 261 YLDAFSLTK-----DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           Y +A++LTK     D FY+    + +++L R+M+   G  +SA DADS   EG    +EG
Sbjct: 277 YAEAYALTKSEGNPDNFYAAKVTETIEWLEREMMSKEGGFYSALDADS---EG----EEG 329

Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSA 374
            FY+W  +E+   LGE A  F E +     GN +            G NV+ +E  D   
Sbjct: 330 KFYIWKKEEIIAALGEDAGPFIETFDFTEAGNWE-----------HGNNVVHLEERDFME 378

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
           +    G PL        E ++KLFD R+KR RP LDDK++ SWNGL++     A + L  
Sbjct: 379 N----GWPL------TAEIKQKLFDFRAKRVRPGLDDKILCSWNGLMLKGLVDAYRYL-- 426

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-------DEQTHRLQHSFRNGPSKA 487
                         D ++++++A   A FI+  +          +   L H+++NG +  
Sbjct: 427 --------------DNQKFLDLALKNAHFIKDCMSIKVMNEDGSEARGLWHNYKNGKANI 472

Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
             +L+DYA +I   L LY+      WL  A  L       F D E   ++ T  +   ++
Sbjct: 473 VAYLEDYASVIDAYLALYQVTFDEVWLHEAEMLAIYTVANFYDDEDEFFYFTDSQGEELI 532

Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
            R KE  D   P+ NS+   NL  L  I+   ++D+ + +   +L +   ++K + +  P
Sbjct: 533 ARKKEIFDNVIPASNSIMATNLYNLGLILG--RNDFIQIS---NLMI--GKMKRIVLTDP 585

Query: 608 ----LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM 663
                  C A   + P+ + V +VG                  ++ K    ID       
Sbjct: 586 QWVTQWACLATQHTKPTAE-VAMVGK-----------------EITKIRKQIDEVLILNK 627

Query: 664 DFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
            F    N++N  + +N  + D +    VC + +C  P T+
Sbjct: 628 VFVGTTNTSNLPLLQNRVTKDAQTTIFVCFDKTCQLPTTE 667


>gi|333026825|ref|ZP_08454889.1| hypothetical protein STTU_4329 [Streptomyces sp. Tu6071]
 gi|332746677|gb|EGJ77118.1| hypothetical protein STTU_4329 [Streptomyces sp. Tu6071]
          Length = 639

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 230/698 (32%), Positives = 330/698 (47%), Gaps = 83/698 (11%)

Query: 18  LINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 77
           ++   +WCHVM  ESFED   A  +N  FV +KVDREERPDVD VYM  VQA  G GGWP
Sbjct: 1   MLLIIYWCHVMARESFEDAETAAYMNAHFVCVKVDREERPDVDAVYMEAVQAATGHGGWP 60

Query: 78  LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
           ++VFL+P  +P   GTYFPP   +G P F+ +L  V+ AW  +R+ +A   A     L+ 
Sbjct: 61  MTVFLTPGGEPFYFGTYFPPRPLHGTPAFRQVLEGVRAAWADRREEVADVAARVTADLTG 120

Query: 138 ---ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
               L A AS    PD L    L      L++ YDSR GGFG APKFP  + ++ +L H 
Sbjct: 121 RGLGLPADASPPG-PDALGAALL-----GLTRDYDSRHGGFGGAPKFPPVMVLEFLLRHH 174

Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            +   TG  G      +M   T + MA+GGI+D +GGGF RY+VD  W VPHFEKML D 
Sbjct: 175 AR---TGAEG----ALQMAADTAEHMARGGIYDQLGGGFARYAVDREWIVPHFEKMLSDN 227

Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
             L   Y   +  T       +  +  D+L R++  P G   SA DADS   +G  R  E
Sbjct: 228 ALLCRFYAHLWRATGSALARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVE 285

Query: 315 GAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GA YVWT +++ ++LGE  A L   HY + P G             F+  + ++ L  + 
Sbjct: 286 GASYVWTPEQLREVLGEDDAALAAAHYGVTPEGT------------FEHGSSVLRLPRTD 333

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
              S    P++     L   RR L   R +RP P  DDKV+ +WNGL I++ A       
Sbjct: 334 GFDSP---PVDA--ARLDRIRRALLAARDERPAPGRDDKVVAAWNGLAIAALAETGAYF- 387

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTH-RLQHSFRNGPSKA-PGF 490
                          DR + +E A  AA   +R HL    TH RL  + R+G + +  G 
Sbjct: 388 ---------------DRPDLVEAALGAADLLVRVHL---DTHGRLSRTSRDGRTGSNTGV 429

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DYA +  G L L        W  +A  L +   + F D + G  ++T  +  +++ R 
Sbjct: 430 LEDYADVAEGFLTLASVTGEGVWTDFAGLLLDHVLDRFRD-DSGALYDTAADAETLIHRP 488

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-- 608
           ++  D A PSG + +   L+  A++   + S  +R  AE +L+V    ++ +A   P   
Sbjct: 489 QDPTDNATPSGWNAAAGALLTYAAL---TGSTPHRAAAEQALSV----VRALAPRAPRFV 541

Query: 609 ---MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
              +  A  +L+ P    V +VG         +   A  +      V    P+   E   
Sbjct: 542 GHGLAVAEALLAGP--YEVAVVGAPEDPRTRALHRTALLATSPGTVVAAGPPSPDPEFPL 599

Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
             +    + + A          A +C+ F C  P TDP
Sbjct: 600 LADRPLVDGTPA----------AYLCRGFVCDRPETDP 627


>gi|11499326|ref|NP_070565.1| hypothetical protein AF1737 [Archaeoglobus fulgidus DSM 4304]
 gi|2648814|gb|AAB89512.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 642

 Score =  313 bits (801), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 195/575 (33%), Positives = 294/575 (51%), Gaps = 64/575 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE+E +A+++N  FV+IKVDR+ERPD+DK Y  +V A  G GGWPL+VF
Sbjct: 50  CHWCHVMAKESFENEEIAEMINRNFVAIKVDRDERPDIDKRYQEFVMATTGSGGWPLTVF 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD KP  GGTYFPPED+Y  PGFKT+LRK+ + W   R+ L +S     E+L+EA+  
Sbjct: 110 LTPDGKPFFGGTYFPPEDRYHLPGFKTVLRKIAEMWRHDRERLLKSA----EELTEAVRR 165

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            A  +    ++ +  L    E +    D   GGFGSAPKF     ++++L H     D  
Sbjct: 166 YAEGS-FKGDVDEKLLDKGIEAVLDQTDYVNGGFGSAPKFHHAKAVELLLTHHFFTGD-- 222

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                 E  K    TL  MA+GGI+DH+ GGF RYS D +W  PH+EKMLYD  +L  +Y
Sbjct: 223 -----EEVLKAAEITLDAMARGGIYDHLLGGFFRYSTDAKWVTPHYEKMLYDNAELLYLY 277

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
             A++LT    Y  I   I++Y R+      G  ++++DAD  E +      EG +Y+++
Sbjct: 278 SIAYALTGKRLYQKIADGIVEYYRKFGCSNEGGFYASQDADIGELD------EGGYYLFS 331

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK-LG 380
            +E+++IL E        YY                 + +G+  L  +  +    SK LG
Sbjct: 332 DRELKEILDEREFRIATLYY-----------------DIQGERKLPRIFLTEEEISKILG 374

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           + +E+    +   RRK+ + R +R  P++D  +   WNGL+I +     K+         
Sbjct: 375 VSVEEVERAVNSARRKMLEFREQREMPYIDTTIYAGWNGLMIEALCMHHKVFGDNWS--- 431

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                        +E+AE  A+ + +  +D +   L H+         G  +DY F   G
Sbjct: 432 -------------LEMAEKTANRLLKEFWDGR--ELLHT-----HNVEGLSEDYIFFARG 471

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           LL L+E     ++L    E+ ++  E F D E GG+F++  E   + +R+K  HD    S
Sbjct: 472 LLALFEVTQRHEYLEKCFEIVDSAVEKFWDGEDGGFFDS--ERAVLGIRLKNFHDSPTQS 529

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
            N  +   L+ L++I    +   Y + A   L  F
Sbjct: 530 VNGSAPQLLLALSAITGERR---YEELAVEGLRTF 561


>gi|374369685|ref|ZP_09627707.1| hypothetical protein OR16_29084 [Cupriavidus basilensis OR16]
 gi|373098764|gb|EHP39863.1| hypothetical protein OR16_29084 [Cupriavidus basilensis OR16]
          Length = 683

 Score =  313 bits (801), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 235/690 (34%), Positives = 340/690 (49%), Gaps = 96/690 (13%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE+  +A L+N  F+SIKVDR+ERPD+D +Y      +  GGGWPL+V
Sbjct: 49  ACHWCHVMAHESFENPRIAGLMNARFISIKVDRQERPDIDDIYQKVPLMMGQGGGWPLTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKK----RDMLAQ-SGAFAIEQL 135
           FL+P  +P  GGTYFPP+D+YGRPGF  +L  + +AW  +    RDM+ Q    F    L
Sbjct: 109 FLTPQGEPFFGGTYFPPDDRYGRPGFVRVLLSLSEAWTHRRGELRDMIEQFRLGFRQLDL 168

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
            +    +A    LP +         A  L++  D   GG G APKFP      ++L   +
Sbjct: 169 VDLGREAAEVEDLPAQ--------TARALAQDTDPTHGGLGGAPKFPNASGYDLVL---R 217

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             + TG+    +  ++    TL  MA GGIHD +GGGF RYSVDERW VPHFEKMLYD G
Sbjct: 218 ICQRTGEPVLLAALER----TLDGMAAGGIHDQLGGGFARYSVDERWAVPHFEKMLYDNG 273

Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           QL  +Y DA+ LT    +  +  + + Y+ RDM  P G  ++ EDADS   EG    +EG
Sbjct: 274 QLVTLYADAYRLTGKPAWRRVFEEAIAYIVRDMTHPDGCFYAGEDADS---EG----EEG 326

Query: 316 AFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
            FYVWT  EV  +LG  E A+             C    ++D  N  +G +VL    + +
Sbjct: 327 RFYVWTPAEVRAVLGASEGAL------------ACRAYGVTDGGNFARGTSVL----NRA 370

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
           A+      P ++    L + R +LF  R++R RP  DD ++  WNGL+I     A +   
Sbjct: 371 ATLD----PFDE--ARLEDWRGRLFAARARRARPARDDNILTGWNGLMIQGLCAAYQATG 424

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                     P + + R+    + E         + D   +R   ++++G +K PGFL+D
Sbjct: 425 CP--------PHLAAARRAASAIQEKLT------MPDGGVYR---AWKDGTAKVPGFLED 467

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD--REGGGYFNTTGEDPSVLLRVK 551
           YA L + L+DLYE     ++L  A+EL      L LD  R+ G YF     +P ++ R +
Sbjct: 468 YALLANALIDLYESCFDKRYLDRAVELV----ALILDKFRDDGLYFTPRDGEP-LVHRPR 522

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
             HD A PSG S SV   +RL ++   +  D YR  AE     +             +  
Sbjct: 523 APHDSAWPSGISTSVFAFLRLHAL---TGRDVYRDLAEDEFRRYRAAAAAAPAGFVHLLA 579

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           A D  +      ++L G K++     ++ + H +Y L   V+    A  E++        
Sbjct: 580 ARD-FAQRGPFEIILAGDKAAA--AGLVQSVHRAY-LPARVL----AFAEDVPIGHGRRP 631

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVT 701
                A          A VC++ +C+ PVT
Sbjct: 632 VKGRPA----------AYVCRHRTCAAPVT 651


>gi|414164591|ref|ZP_11420838.1| hypothetical protein HMPREF9697_02739 [Afipia felis ATCC 53690]
 gi|410882371|gb|EKS30211.1| hypothetical protein HMPREF9697_02739 [Afipia felis ATCC 53690]
          Length = 684

 Score =  313 bits (801), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 224/710 (31%), Positives = 345/710 (48%), Gaps = 97/710 (13%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A ++N+ FV+IKVDREERPD+D++YM  +  L   GGWPL++
Sbjct: 55  ACHWCHVMAHESFEDEATAAVMNEQFVAIKVDREERPDIDQIYMNALHLLGQQGGWPLTM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD  P+ GGTYFP + +YGR  F  ++++    +  + D +A +       L+E  S
Sbjct: 115 FLTPDGAPIWGGTYFPKQAQYGRASFIDVMQQFMRIYRDEPDKIAANKEAIARSLNERHS 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A  +S  L      N L   A  ++++ D   GG   APKFP+             LE  
Sbjct: 175 ADTASIGL------NELDNAAGSIARATDPDNGGLRGAPKFPQ----------CSMLEFL 218

Query: 201 GKSGEASEGQKMVLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            ++G  +  ++  + T   L  M++GGI+DH+GGG+ RYSVDERW VPHFEKMLYD  Q+
Sbjct: 219 WRAGARTGDERYFITTNLALTRMSQGGIYDHLGGGYARYSVDERWLVPHFEKMLYDNAQI 278

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++     +   +  Y     + + +L+R+M+   G   S+ DADS   EG    +EG F
Sbjct: 279 LDMLALEHARAPNELYLQRAEETVGWLKREMLTKEGGFSSSLDADS---EG----EEGRF 331

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW+  ++  +LG + A  F   Y +   GN            F+G N+L  L+D S +A
Sbjct: 332 YVWSQSDIAQLLGPDDATFFAAKYGVSAEGN------------FEGHNILNRLDDGSDTA 379

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           ++           L   R  LF  R KR  P LDDKV+  WNGL+I++            
Sbjct: 380 TE--------AEQLAALRAILFRAREKRVHPGLDDKVLADWNGLMIAA---------LAH 422

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
            +  FN       R +++ +A +   F+   +   +  RL HS+R G    P    D A 
Sbjct: 423 AAGAFN-------RPDWLTLACTVFGFVTTTM--SRHDRLGHSWRAGKLLQPALASDNAA 473

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           +I   L L+E      +L  AI  Q   D  + D + GGYF T  +   ++LR     D 
Sbjct: 474 MIRAALALHEATGDHLFLDQAILWQADLDTHYGDPQHGGYFLTADDAEGLILRPHSSVDD 533

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA-VFETRLKDMAMAVPLMCCAADM 615
           A P+   ++  NL RLA +    +   +R+  +   A +     ++M   + L+  A D+
Sbjct: 534 AIPNHIGLTAQNLARLAVLTGDER---WRRQLDMLFAHMLSAAARNMFGHLSLL-NALDL 589

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI-DPADTEEMDFWEEHNSNNA 674
               +   +V+ G     D   +L  A A    N  V+H+ DP                A
Sbjct: 590 YLAGAE--IVITGQGEEAD--ALLKTARALPHANTIVLHVPDP----------------A 629

Query: 675 SMARNNFSADKV------VALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
            +  ++ +ADK+       A +C+  +CS P+T+P +L   +L   +S +
Sbjct: 630 KLPPHHPAADKIAPGGEAAAFICRGQTCSLPMTEPHALAAFVLRGEASAS 679


>gi|407975443|ref|ZP_11156348.1| hypothetical protein NA8A_14074 [Nitratireductor indicus C115]
 gi|407429071|gb|EKF41750.1| hypothetical protein NA8A_14074 [Nitratireductor indicus C115]
          Length = 673

 Score =  313 bits (801), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 206/570 (36%), Positives = 295/570 (51%), Gaps = 68/570 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE++ VA ++N  FV+IKVDREERP++D++YM  + A    GGWPL++
Sbjct: 54  ACHWCHVMAHESFENDQVADVMNRLFVNIKVDREERPEIDQIYMAALSATGEQGGWPLTM 113

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEAL 139
           FLSPD KP  GGTYFPP+ +YGRPGF  +L  V  AW +K RD+   SG  + E+L + +
Sbjct: 114 FLSPDGKPFWGGTYFPPQQRYGRPGFIEVLNAVHTAWLEKNRDL---SG--SAERLHDHV 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A  S        PQ+A+   AE++    D   GG   APKFP    IQ++      L+ 
Sbjct: 169 KARLSPPSAEGFDPQSAVTDLAERIHGMIDQDMGGLRGAPKFPNMPFIQILWL--SWLQT 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
             +S   S     V+ +L+ M  GGI+DHVGGG  RYS D  W VPHFEKMLYD  QL  
Sbjct: 227 GNQSHRDS-----VITSLKRMLSGGIYDHVGGGLARYSTDANWLVPHFEKMLYDNAQLLR 281

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +    F  T+D  +     +++++L RDM   GG   S+ DADS   EGA    EG  Y+
Sbjct: 282 LLSWVFGETEDELFRIRIEEVINFLLRDMRVNGGAFASSLDADS---EGA----EGKAYL 334

Query: 320 WTSKEVEDILGEHAILFKEHYYL-KPT---GNCDLSRMSDPHNEFKGKNVLIEL-NDSSA 374
           W+  ++E +LG     F   + L KP    G+  L R++  H EF+G +    L ND +A
Sbjct: 335 WSRLQIEAVLGSRTEAFLSTFELTKPDDWHGDPVLHRLA--HPEFQGTDTENALRNDLNA 392

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
                                 L   R+ R +P  DDKV+V WNGL I++ A  ++  + 
Sbjct: 393 ----------------------LLSTRAGRIQPGRDDKVLVDWNGLAIAAIANCARQFQ- 429

Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
                          R+++++ A++A  F+   +   ++ RL HS R G    P    DY
Sbjct: 430 ---------------RQDWLDAAKAAFHFVCESM---ESRRLPHSIRLGKRLFPALSSDY 471

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +IS    LY+      +L  A E   T      D E  G++ T+ +   V LR++ D 
Sbjct: 472 AAMISAATALYQATRKRGFLDQASEWFETLKSWNADEENAGFYLTSSDASDVPLRIRGDV 531

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
           D A PS  ++ +  +  LA++    K + Y
Sbjct: 532 DEAMPSATALIIEAMCGLAALSGDDKVEEY 561


>gi|345001747|ref|YP_004804601.1| hypothetical protein SACTE_4222 [Streptomyces sp. SirexAA-E]
 gi|344317373|gb|AEN12061.1| protein of unknown function DUF255 [Streptomyces sp. SirexAA-E]
          Length = 673

 Score =  312 bits (800), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 225/693 (32%), Positives = 323/693 (46%), Gaps = 71/693 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED  +A  LN+ FV +KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SACHWCHVMAHESFEDAALAAYLNEHFVPVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+ D +P   GTYFPPE ++G P F+ +L  V  AW  +R  +A+     +  L+   
Sbjct: 108 VFLTADAEPFYFGTYFPPEPRHGMPSFRQVLEGVTAAWTGRRGEVAEVAGRIVTDLA-GR 166

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           S +   + +P E P+ A  L A  LS+ YD + GGFG APKFP  + ++ +L H  +   
Sbjct: 167 SLAHGGDGVPGE-PELAQALLA--LSREYDEKHGGFGGAPKFPPSMAVEFLLRHHAR--- 220

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  
Sbjct: 221 TGAEG----ALEMAADTCAAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCR 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +  T       +  +  D++ R++    G   SA DADS +  G  R  EGA+YV
Sbjct: 277 VYAHLWRATGSDLARRVALETADFMVRELRTTEGGFASALDADSEDARG--RHVEGAYYV 334

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT +++ ++LGE    F   Y+           +S+     +G +VL          ++ 
Sbjct: 335 WTPEQLREVLGEDDAAFAAAYF----------GVSEEGTFEEGSSVL--------RLART 376

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G P E    +  + R +L   R  R RP  DDK++ +WNGL +++ A             
Sbjct: 377 G-PDEDPARV-ADVRARLLAARGDRVRPERDDKIVAAWNGLAVAALAETGAYF------- 427

Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
                    DR + +E A  AA   +R H+ D  T RL  + ++G      G L+DY  +
Sbjct: 428 ---------DRPDLIERATEAADLLVRVHMGD--TARLCRTSKDGRAGDNAGVLEDYGDV 476

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L L        WL +A  L +   E F   E G  ++T  +   ++ R ++  D A
Sbjct: 477 AEGFLALASVTGEGAWLDFAGFLLDIVLERFTG-ENGQLYDTADDAEQLIRRPQDPTDSA 535

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
            P+G + +   L+   S  A + S+ +R  AE +L V +         +      A+ L 
Sbjct: 536 TPAGWTAAAGALL---SYAAHTGSEAHRTAAEGALGVVKALGPKAPRFIGWGLAVAEALL 592

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
              R+  V       +    +L  A  +      V    P    E             + 
Sbjct: 593 DGPREVAVAGPVGGELHRTALLGRAPGAVVAAGEV----PGGAAEFPL----------LV 638

Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                     A VC++F C  P TD   LE  L
Sbjct: 639 DRPLVDGAPTAYVCRHFVCEAPTTDAEELERGL 671


>gi|383830441|ref|ZP_09985530.1| thioredoxin domain containing protein [Saccharomonospora
           xinjiangensis XJ-54]
 gi|383463094|gb|EID55184.1| thioredoxin domain containing protein [Saccharomonospora
           xinjiangensis XJ-54]
          Length = 667

 Score =  312 bits (800), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 221/697 (31%), Positives = 324/697 (46%), Gaps = 86/697 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF D+ VA  +N+ FV+IKVDREERPD+D VYM   QA+ G GGWP++ F
Sbjct: 49  CHWCHVMAHESFSDDDVAAFMNEHFVNIKVDREERPDIDAVYMAATQAMTGQGGWPMTCF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+P+ KP   GTY+PP   +G P F+ +L  V  AW ++R  L +     +E ++E  + 
Sbjct: 109 LTPEGKPFHCGTYYPPVPAHGMPSFRQVLEAVDQAWRERRAELVEGAGRIVEHIAE-RTT 167

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
             S++ + ++   +A+      L    D   GGFG APKFP  + ++ +L H    E TG
Sbjct: 168 PLSTHPVDEDTVTSAV----ATLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG 220

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               +++   +V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y
Sbjct: 221 ----SAQALSIVDLTAEGMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLRFY 276

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T       +  +  ++L RD+  P G   S+ DAD+   EG T       YVWT
Sbjct: 277 AHLARRTGSALAHRVAGETAEFLLRDLRTPEGGFASSLDADTDGVEGLT-------YVWT 329

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            +++ D+LG +  +   E + +   G  +           +G + L    D    A    
Sbjct: 330 PQQLVDVLGRDDGVWAAETFGVTREGTFE-----------RGASTLQLRRDPDDPA---- 374

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
               +++ +       L + R+ RP+P  DDKVI +WNGL I++ A A   L+       
Sbjct: 375 ----RWMRVTS----ALVEARNARPQPARDDKVIAAWNGLAITALAEAGLALR------- 419

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLIS 499
                    R E++E A +A +F+           L  S R+G    A G L+DY  L  
Sbjct: 420 ---------RPEWVEAAVAAGAFVLD--VHASGDGLLRSSRDGVAGAAAGVLEDYGCLAD 468

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAE 558
           GLL L++    + WLV A  L +T    F      G F+ T ED   L+ R  +  D A 
Sbjct: 469 GLLALHQATGESGWLVEATSLIDTALRRFGVEGAPGAFHDTAEDAETLVHRPSDPTDNAS 528

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAA 613
           PSG S     L+  +++    ++  YR   E +L     R   +    P      +  A 
Sbjct: 529 PSGASALAGALLTASALAGPDRAGAYRAACEEAL----RRAGALVAQAPRFAGHWLSVAE 584

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
            MLS P +  V +VG  +    + +  AA   +     +     AD   +          
Sbjct: 585 AMLSGPVQ--VAVVGSDAQERADLLTEAARNVHGGGVVLGGSPEADGVPL---------- 632

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             +A  +       A VC  + C  PVTD  SL  LL
Sbjct: 633 --LADRSLVDGAAAAYVCHGYVCDRPVTDTESLARLL 667


>gi|359774323|ref|ZP_09277696.1| hypothetical protein GOEFS_115_01140 [Gordonia effusa NBRC 100432]
 gi|359308634|dbj|GAB20474.1| hypothetical protein GOEFS_115_01140 [Gordonia effusa NBRC 100432]
          Length = 654

 Score =  312 bits (799), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 196/583 (33%), Positives = 293/583 (50%), Gaps = 79/583 (13%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  E FE+E +A  +N  FV IKVDREERPD+D +YM    A+ G GGWP++ F
Sbjct: 49  CHWCHVMAHECFENEQIAAQMNAEFVCIKVDREERPDIDAIYMNATVAMTGQGGWPMTCF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+P  +P   GTYFPP  + G+PGF  ++  + D W  +RD + + G    ++L+  L  
Sbjct: 109 LTPAGEPFYCGTYFPPSPRNGQPGFTELMSAITDTWINRRDEVTRVG----KELTGHL-- 162

Query: 142 SASSNKLPDE--LPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           SA+S  LPD   +  +AL + A  +L    D   GGFG APKFP   +++ +L H ++  
Sbjct: 163 SAASGGLPDAQFVLDDALAIHASNELVAQEDRAHGGFGGAPKFPPSAQLEALLRHYERTG 222

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D        E   +V  T Q MA+GGI+D +GGGF RY+VD  W +PHFEKMLYD  QL 
Sbjct: 223 D-------REALGVVERTAQAMARGGIYDQLGGGFSRYAVDIAWAIPHFEKMLYDNAQLL 275

Query: 259 NVYLDAFSLTKD--VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
            VY     +  D     + +  + +D+L  D+   GG   S+ DAD+   EGAT      
Sbjct: 276 RVYAHLACVASDASAMAARVTAETVDFLATDLRVEGG-FASSLDADTDGVEGAT------ 328

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELND 371
            YVWT +E +++LG  +    E + +  TG  +     L    DP N             
Sbjct: 329 -YVWTRREFDELLGSDSDWAAELFTVTETGTFEHGTSTLQLPVDPDN------------- 374

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
                      ++++  ++   R      R KRP+P  D KV+ +WNG+ I+    A   
Sbjct: 375 -----------VQRFAAVVDRLRA----AREKRPQPGRDGKVVTAWNGMTITGLVEAGTA 419

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAE-SAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           L                +R E++++A   A   + RH+ + +  R   S        PG 
Sbjct: 420 L----------------NRPEWVDLAAWCADELLSRHIVEGELRRT--SLDGVVGTTPGM 461

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLR 549
           LDD+A L++GLL L+   +  +WL  AI L +    LF D +  G +F+       ++ R
Sbjct: 462 LDDHAALVTGLLGLFAATAQERWLDAAIALLDKAIGLFGDPDAQGSWFDAPAGATGLITR 521

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
            ++  DGA PSG S+    L+  + + A  K+  Y + A+ +L
Sbjct: 522 PRDPADGATPSGGSLMAEALLTASMLAAPEKAGSYLELADATL 564


>gi|158426331|ref|YP_001527623.1| highly protein [Azorhizobium caulinodans ORS 571]
 gi|158333220|dbj|BAF90705.1| highly conserved protein [Azorhizobium caulinodans ORS 571]
          Length = 657

 Score =  312 bits (799), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 216/616 (35%), Positives = 313/616 (50%), Gaps = 65/616 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED   A L+N  FV+IKVDREERPDVD++YM  +  L   GGWPL++
Sbjct: 50  ACHWCHVMAHESFEDAETADLMNALFVNIKVDREERPDVDQIYMNALHELGEQGGWPLTM 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+ D  P  GGTYFP    YGRPGFK +L +V  A+ +  + +A +    + +L+ A  
Sbjct: 110 FLNADGAPFWGGTYFPKTASYGRPGFKDVLWQVSQAYRETPEKVAHNTDAILSRLAAAAK 169

Query: 141 -ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
            A   +  L D      L   A+Q++  +D   GG   APKFP+   ++++     +  D
Sbjct: 170 PAGGVALTLAD------LDKAAQQIAGLFDRAHGGLRGAPKFPQAGLLELLWRAGDRTGD 223

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
                   + + +V FTL  M +GGI+DHVGGGF RYSVDERW VPHFEKMLYD  QL  
Sbjct: 224 -------PQLKAVVAFTLNRMCEGGIYDHVGGGFSRYSVDERWLVPHFEKMLYDNAQLLE 276

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +   A+  T D  +    R+ + +L+R+M+   G   ++ DADS   EG     EG FYV
Sbjct: 277 LLALAYQETGDELFLLRARETVSWLKREMVTADGAFAASLDADS---EG----HEGKFYV 329

Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT+ E+  +LG E A  F   Y +   GN            ++G+ +L     +  S   
Sbjct: 330 WTADEIVAVLGKEDAAEFAAFYDVTDEGN------------WEGQTIL-----NRTSFGD 372

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
           + M  E  L  + E   KL   R++R RP LDDKV+  WNGL+I++ ARA  +       
Sbjct: 373 VSMVEEARLRPMKE---KLLAARAQRVRPGLDDKVLADWNGLMIAALARAGAL------- 422

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                     D  E++++A +A   + R +  +   RL HS+R G    PG   D A + 
Sbjct: 423 ---------LDEPEWVDLAATAFDAVVRLMVKDG--RLGHSYREGRLVLPGLASDLAAMA 471

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              + L+E       L  A +  N  +  +LD + G YF T  + P++++R     D A 
Sbjct: 472 RAGIALHEAAGDEAPLAHAEDFLNRLEADYLDPQSGAYFLTAADAPALVMRPLSSLDEAL 531

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           P+ NSV+   L+RLA++   +  D  R  A+  +           +A P +  A D  + 
Sbjct: 532 PNYNSVAADALIRLAAL---TGQDGLRARADRLIGALTGAAAQNPLAHPSLLNALD--TR 586

Query: 619 PSRKHVVLVGHKSSVD 634
                +V VG +S  D
Sbjct: 587 LRLAEIVAVGARSVRD 602


>gi|325104043|ref|YP_004273697.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324972891|gb|ADY51875.1| protein of unknown function DUF255 [Pedobacter saltans DSM 12145]
          Length = 669

 Score =  311 bits (798), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 193/557 (34%), Positives = 282/557 (50%), Gaps = 54/557 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFEDE VA+++N+ FV IKVDREERPD+D++YM  VQ + G GGWPL+
Sbjct: 48  SACHWCHVMEHESFEDEEVAQIMNEHFVCIKVDREERPDIDQIYMNAVQLMTGRGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            F  PD +P+ GGTYF  ED      +K IL  +   +  K   L ++  +A+ +L + +
Sbjct: 108 CFCLPDQRPIYGGTYFQKED------WKNILHNLAGFYANK---LQEAEEYAV-RLMDGI 157

Query: 140 SASA--SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           + S   S  K   E  Q  +    +     +D   GG   APKFP P     ++  +  +
Sbjct: 158 NQSERLSFVKEEKEYTQEHIENIVKPWKMHFDFSEGGQNRAPKFPMPDNWAFLMKVAHLM 217

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           +D            +   TL  MA GGI+D +GGGF RYSVD  WH+PHFEKMLYD GQL
Sbjct: 218 KDDA-------AFVITRLTLDKMAAGGIYDQLGGGFARYSVDHEWHIPHFEKMLYDNGQL 270

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++Y DA+   K+  Y  +  +  D+++R+M  P    +SA DADS   EG     EG F
Sbjct: 271 MSLYADAYKYYKNERYKEVVYETYDWIKREMTSPEYGFYSALDADS---EGV----EGKF 323

Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           Y W  +E+E IL  E A +F  +Y +   GN +   +          N L    +    A
Sbjct: 324 YTWDKQEIEKILDKEQAAIFNAYYAVTDEGNWEEEEI----------NHLWIRKEKQHIA 373

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
               + +E+   I+   + +L + R+KR  P LDDK++ SWN L++     A K    + 
Sbjct: 374 EAFHISIERLDEIIQHSKTQLLEYRNKRIHPGLDDKILTSWNALMLKGLCDAYKAFADQ- 432

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                          +++ +A   A F+  +L  E    L  +++NG +    FLDDYA 
Sbjct: 433 ---------------QFLTLALDNAKFLLNNLCREDG-MLYRNYKNGKATIEAFLDDYAL 476

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L    + LYE      W+  A  L +   + F D + G +F T+    +++ R  E  D 
Sbjct: 477 LAQAFISLYEVTFDEAWIFKAKSLCDYVIKHFSDAQSGMFFYTSDASEALVARKYEIMDN 536

Query: 557 AEPSGNSVSVINLVRLA 573
             PS NSV   NL +L+
Sbjct: 537 VIPSSNSVMAWNLRKLS 553


>gi|385681202|ref|ZP_10055130.1| highly conserved protein containing a thioredoxin domain-containing
           protein [Amycolatopsis sp. ATCC 39116]
          Length = 675

 Score =  311 bits (798), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 224/696 (32%), Positives = 328/696 (47%), Gaps = 83/696 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED   A+L+N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ 
Sbjct: 49  ACHWCHVMAHESFEDAETARLMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTC 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P   GTY+PPE + G P F+ +L  V  AW ++RD L +     +E L+  L 
Sbjct: 109 FLTPDGEPFHCGTYYPPEPRPGMPSFQHLLVAVAQAWQERRDELREGAGKIVEHLAGQLG 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
                   P  +    L     +L+   D   GGFG APKFP  + ++ +L H ++   T
Sbjct: 169 PLP-----PAPVDAGVLDAALLKLTGEADRARGGFGGAPKFPPSMVLEFLLRHHER---T 220

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G    ++E   +V    + MA+GGIHD + GGF RYSVD  W VPHFEKMLYD   L  V
Sbjct: 221 G----SAEALSLVESCAEAMARGGIHDQLAGGFARYSVDASWVVPHFEKMLYDNALLLRV 276

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y      T     + + R   ++L   +    G   ++ DAD       T  +EG  YVW
Sbjct: 277 YAHLARRTGSALAAEVARMTGEFLLARLRTEQGGFAASLDAD-------TLGEEGLTYVW 329

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           T  ++ ++LG +      E + +  +G             F+    +++L D        
Sbjct: 330 TPAQLREVLGDDDGAWAAELFSVTESGT------------FEHGASVLQLRDPDDR---- 373

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
               E++  +    R  L   R +RP+P  DDKVI +WNGL I++   A   L       
Sbjct: 374 ----ERFERV----RSALLAARDERPQPGRDDKVIAAWNGLAITALCEAGVAL------- 418

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPS-KAPGFLDDYAFL 497
                    D   ++  A+ AAS +   HL D   +RL+ S R+G +  A G L+DY  L
Sbjct: 419 ---------DEPHWVTAAQEAASAVLGIHLRD---NRLRRSSRDGTAGDAAGVLEDYGCL 466

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDG 556
             GLL L++     +WL  A+ L +T    F   +  G ++ T +D  VL+ R  +  D 
Sbjct: 467 AEGLLALHQATGDPRWLTEAVNLLDTALANFAVADTPGAYHDTADDAEVLVHRPSDPTDN 526

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL--KDMAMAVPLMCCAAD 614
           A PSG S ++ N +  AS++ G       + A         +L  K    A   +  A  
Sbjct: 527 ASPSGAS-ALTNALVTASVLVGPDRSARYRAAAEEAVHRTGQLIAKAPRFAGHWLTAAEA 585

Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           +L+ P +  V + G  S+    ++L A  A       V+     D E +           
Sbjct: 586 LLAGPVQ--VAIAGPDSTE--RDLLRAVAARRAHGGAVVLAGEPDAEGVPL--------- 632

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            +A     A +  A VC+ + C  PVT P  L + L
Sbjct: 633 -LADRPLVAGQAAAYVCRGYVCDRPVTSPDDLVSAL 667


>gi|288818675|ref|YP_003433023.1| hypothetical protein HTH_1371 [Hydrogenobacter thermophilus TK-6]
 gi|384129427|ref|YP_005512040.1| hypothetical protein [Hydrogenobacter thermophilus TK-6]
 gi|288788075|dbj|BAI69822.1| conserved hypothetical protein [Hydrogenobacter thermophilus TK-6]
 gi|308752264|gb|ADO45747.1| protein of unknown function DUF255 [Hydrogenobacter thermophilus
           TK-6]
          Length = 648

 Score =  311 bits (798), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 198/585 (33%), Positives = 306/585 (52%), Gaps = 53/585 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED  +AK++N+ FV+IKVDR+ERPD+D+ Y   V AL G GGWPL+ F
Sbjct: 52  CHWCHVMAKESFEDPEIAKIINENFVAIKVDRDERPDIDRRYQETVIALTGSGGWPLTAF 111

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD K   GGTYFPPED++GRPG K++L ++   W ++++ + +S      +L      
Sbjct: 112 LTPDGKLFFGGTYFPPEDRWGRPGLKSLLLRISQLWREEKERILKSADHIFLELQ----- 166

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
           + SS    D + +  L+     L  S D   GG GSAPKF      +++LYH    ++  
Sbjct: 167 NYSSMTFKDFVDEELLKRGIGALLSSVDYEKGGIGSAPKFHHAKAFELLLYHYYFTKE-- 224

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                   ++ ++ +L  MAKGGI+DH+ GGF RYS D+ W++PHFEKMLYD  +L  +Y
Sbjct: 225 -----EIVKRAIISSLDAMAKGGIYDHLLGGFFRYSTDDTWNIPHFEKMLYDNAELLRLY 279

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
             A+ + ++  Y Y+ + I++Y +       G  ++++DAD    +      EG  Y +T
Sbjct: 280 SLAYQVFENPLYEYVAKGIVNYYKLYGSDQEGGFYASQDADIGVLD------EGGHYTFT 333

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
           S E+  +L    +   + Y+    G     RM  PH++   KNVL    D+   +  L +
Sbjct: 334 SDELRLLLDPEELKVVKLYF----GIDTRGRM--PHHQH--KNVLFINMDAQQVSKVLDI 385

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
           P EK   +L   + K+   R+ R  P++D  +   WNGL+I +     K+ + E    M 
Sbjct: 386 PKEKVEELLKSAKEKMLSYRNSREIPYIDKTIYTGWNGLMIDALCVYYKVFQDEWSLLM- 444

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
                          AE  A+ + +  Y + +  L H+  +G S   G+ +DY +L  GL
Sbjct: 445 ---------------AEKTANRLIKERYRDGS--LDHT--DGVS---GYSEDYIYLSQGL 482

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPS 560
           L L+E      +L  A EL +   ELF D +G G+F+T  +   +LL + K   D    S
Sbjct: 483 LSLFEITQNRTYLDMAKELLDKAIELFWDDQGWGFFDTHQKGEGLLLIKHKPIQDTPIQS 542

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
            N  S   L+ + +I   +K   Y + AE +L  F   +++M MA
Sbjct: 543 VNGTSPYLLLLMEAITGDTK---YGEYAEKNLMAFSRFMREMPMA 584


>gi|424867573|ref|ZP_18291355.1| hypothetical protein C75L2_00200010 [Leptospirillum sp. Group II
           'C75']
 gi|124516649|gb|EAY58157.1| protein of unknown function [Leptospirillum rubarum]
 gi|387221885|gb|EIJ76392.1| hypothetical protein C75L2_00200010 [Leptospirillum sp. Group II
           'C75']
          Length = 689

 Score =  311 bits (798), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 216/698 (30%), Positives = 335/698 (47%), Gaps = 64/698 (9%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLS 79
            CHWCHVM  ESFE   +A ++N++FV+IKVDREERPD+D++Y M +       GGWPL+
Sbjct: 49  ACHWCHVMAHESFERPDIASVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLT 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FL+P   P  GGTYFP + ++G PGF  +L +++D +   R+ L +     ++ L +  
Sbjct: 109 MFLTPSQVPFAGGTYFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTN 168

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
             + S     D  P  AL      L   +D  FGGFG APKFP  +++  +    ++ + 
Sbjct: 169 PVADSREFELDLSPSEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFQR 222

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
            G S  A     M   TL  M +GGI D VGGGF RYSVDERW +PHFEKMLYD   L  
Sbjct: 223 KGDSTAA----HMATVTLSSMKRGGIWDQVGGGFARYSVDERWLIPHFEKMLYDNALLLE 278

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
                 S++K+  YS    +++ +L R+M    G  +S+ DADS   EG    +EG FYV
Sbjct: 279 ALALGASVSKNPVYSRTAEELVGWLFREMRSDDGVYYSSLDADS---EG----EEGRFYV 331

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV-LIELNDSSASASK 378
           + ++EV  IL +        YY           +S P N F+G    L E       + +
Sbjct: 332 FQAEEVRSILSDEEYRVVSKYY----------GLSGPPN-FEGHAWNLYEARSIGELSKE 380

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
             +        +   R+KLF  RS R RP LDDKV+ SWN L+              A++
Sbjct: 381 FHLSESDIERRIESARQKLFAYRSTRVRPGLDDKVLASWNALM--------------AKA 426

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
            +F+  ++G  ++E++        ++ R ++  +   L   +       P +LDDYAFL+
Sbjct: 427 LLFSGRILG--KQEWISAGRKTIDYMHRKMW--KNGLLMAVYSKKEPFLPAYLDDYAFLL 482

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
             +L+        + L +A  + +     F D E GG++ T     +++ R K  HDGA 
Sbjct: 483 LAVLESMRIDFRPEDLSFATTIADVLLAEFYDPESGGFYFTGKNHEALIHRPKNGHDGAL 542

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSGN+ +V  L+ L ++        Y   A+ +L ++  ++K+       M  A +  S 
Sbjct: 543 PSGNAAAVQGLLWLGTLTGHLP---YTSAADKTLRLYFAQMKEQPAGYTTMISALETYS- 598

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
              + VV +    + D+++ ++      D    V+ +  A  + +   E          R
Sbjct: 599 -DSQPVVFLAGPQAGDWKDKISCG---VDTEAFVLDLTNAVRDSLPLPEG--------MR 646

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
            +F  +K    VC+   C P      SL+  L   P S
Sbjct: 647 KHFPENKTTGWVCRGTMCLPSADSLESLQEQLRLWPLS 684


>gi|302894519|ref|XP_003046140.1| hypothetical protein NECHADRAFT_33848 [Nectria haematococca mpVI
           77-13-4]
 gi|256727067|gb|EEU40427.1| hypothetical protein NECHADRAFT_33848 [Nectria haematococca mpVI
           77-13-4]
          Length = 712

 Score =  311 bits (797), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 212/669 (31%), Positives = 326/669 (48%), Gaps = 91/669 (13%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+C +M +ESF +   A +LN++FV + VDREERPD+D +YM YVQA+   GG
Sbjct: 75  HIGYKACHFCRLMLLESFSNPDCASVLNEFFVPVIVDREERPDLDTIYMNYVQAVSNAGG 134

Query: 76  WPLSVFLSPDLKPLMGGTYFPPEDKYGRP-----------GFKTILRKVKDAWDKKR--- 121
           WPL++FL+P+L+P+ GGTY+P     GR             F TI++KV+D W  +    
Sbjct: 135 WPLNLFLTPNLEPVFGGTYWP--GPAGRRHTTDDSADEVLDFLTIVKKVRDIWSDQESRC 192

Query: 122 -----DMLAQSGAFAIEQLSEALSASASSNKLP----------------------DELPQ 154
                ++L Q   FA E      + SA+S   P                      +EL  
Sbjct: 193 RKEATEVLGQLREFAAEGTLGTRNISATSALAPSGWGAPAPSHTSAPKDKDTSVSEELDL 252

Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKF---PRPVEIQMMLYHSKKLEDTGKSGEASEGQK 211
           + L      ++ ++D  +GGFG APKF   P+   +  +L   ++++D     E     +
Sbjct: 253 DQLEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLGFLLGLLNFPREVQDVVGEAECKHATE 312

Query: 212 MVLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT-- 268
           M L TL+ +  G +HDHVGG GF R SV   W +P+FEK++ D  QL ++YLDA+  T  
Sbjct: 313 MALDTLRHIRDGALHDHVGGTGFSRCSVTPDWSIPNFEKLVVDNAQLLSLYLDAWKSTGG 372

Query: 269 -KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
            K   +  I  ++ +YL    I  P G   S+E ADS    G    +EGA+YVWT +E +
Sbjct: 373 DKPTEFFDIVIELAEYLSSAPIALPEGGFASSEAADSHYRRGDREMREGAYYVWTRREFD 432

Query: 327 DILGE----HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 382
            +L E     + +   H+ +   GN D     DP+++F  +N+L         + +  +P
Sbjct: 433 SVLDEVNKHMSPVLAAHWAVNEDGNVD--EHHDPNDDFINQNILRIERSVQQLSVQFSIP 490

Query: 383 LEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
            +K    + E +  L   R K R RP LDDKV+  WNGLVIS+ A+ +  LK        
Sbjct: 491 EDKVRQYVQEGKVALKQRRDKERVRPDLDDKVVAGWNGLVISALAKTALALKG------- 543

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
              +      +Y+ VAE A  FI+  L+D    ++ +   +G  +   F DDYA+L  GL
Sbjct: 544 ---LRPEQSSKYLAVAEKAVKFIQEKLWDSD-RKVLYRIWSGERETQAFADDYAYLTQGL 599

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           LDL++      +LV+A  LQ +                    P  +LR+K+  D + PS 
Sbjct: 600 LDLFDATGNEAYLVFADTLQPSS-------------------PHTILRLKDGMDTSVPST 640

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
           N++SV NL R+A ++A    D    NA  ++  FE  +       P +        + S+
Sbjct: 641 NAISVSNLFRIADLLA---DDKLAVNARQTINAFEAEMLQHPWLFPGLLAGVVTARLGSQ 697

Query: 622 KHVVLVGHK 630
           +  V V ++
Sbjct: 698 RRNVNVNYQ 706


>gi|13473777|ref|NP_105345.1| hypothetical protein mlr4484 [Mesorhizobium loti MAFF303099]
 gi|14024528|dbj|BAB51131.1| mlr4484 [Mesorhizobium loti MAFF303099]
          Length = 671

 Score =  311 bits (797), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 197/556 (35%), Positives = 283/556 (50%), Gaps = 56/556 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE++GVA ++N  FV+IKVDREERPD+D++YM  + ++   GGWPL++
Sbjct: 53  ACHWCHVMAHESFENDGVAAVMNRLFVNIKVDREERPDIDQIYMAALSSMGEQGGWPLTM 112

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD KP  GGTYFP E +YGRPGF  ++  V  AW +KRD L QS     + L+  + 
Sbjct: 113 FLTPDGKPFWGGTYFPREARYGRPGFIQVMEAVDKAWREKRDSLHQSA----DGLTSHVE 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A  S       L + AL   A ++    D   GG   APKFP      + L+ S      
Sbjct: 169 ARLSGTHARQSLDRGALTDLAGRIDGMVDRDLGGLRGAPKFPN-APFMLTLWLSWL---- 223

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            + G A+  +  VL +L+ M  GGI+DH+GGG  RYS D  W VPHFEKMLYD  +L   
Sbjct: 224 -RDGNAAH-RDDVLVSLERMLAGGIYDHIGGGLSRYSTDAEWLVPHFEKMLYDNAELIRF 281

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
              AFS + +  +     + +D+L R+M   GG   ++ DADS         +EG FY W
Sbjct: 282 CNWAFSASGNDLFRIRIEETVDWLLREMRVEGGAFAASLDADS-------DGEEGLFYTW 334

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
             +E++ +LG+ + LF +++ L           S PH  ++GK V+ +     A      
Sbjct: 335 NRQEIKTVLGDDSALFFKYFTL-----------SAPHG-WEGKPVIHQTRTQQAQGVA-- 380

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
              EK + +    + +L  VR +R RP LD K +  WNGL+I++ A A + L        
Sbjct: 381 -DREKLIPL----KARLLAVREERVRPGLDAKTLTDWNGLMIAALAEAGRSLG------- 428

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                    R E++E A+ A + I     D    RL HS        P    DYA + + 
Sbjct: 429 ---------RPEWIEAADKAFAHISGASRD---GRLPHSMLGTRKLFPALSSDYAAMANA 476

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            + L+E      ++  A +     D  + D  G GY+ T  +   V +R++ D D A  S
Sbjct: 477 GISLFEASGDWSYIDQAKQFIEQLDHWYPDPAGTGYYLTASDSTDVPIRIRGDVDEAISS 536

Query: 561 GNSVSVINLVRLASIV 576
             S  +  LVRLAS+ 
Sbjct: 537 ATSQIIAALVRLASVT 552


>gi|338213486|ref|YP_004657541.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336307307|gb|AEI50409.1| protein of unknown function DUF255 [Runella slithyformis DSM 19594]
          Length = 700

 Score =  311 bits (797), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 198/580 (34%), Positives = 290/580 (50%), Gaps = 71/580 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE E VA ++N  FV IKVDREERPDVD +YM  + A+   GGWPL+
Sbjct: 48  SACHWCHVMERESFEKEQVAAVMNADFVCIKVDREERPDVDAIYMDAIHAMGARGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---- 135
           VFL PD KP  G TY P ++      +  +L  VK+A+    + L +S     + +    
Sbjct: 108 VFLLPDAKPFYGVTYLPAQN------WVQLLGSVKNAFVNHHEELVKSAEGFTDNMLIKE 161

Query: 136 -------------SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFP 182
                         EA  A AS     D+L +       E++   +D+  GG   APKFP
Sbjct: 162 TDKYNLHATSPQGDEADRAEASPAPTLDDLHE-----MFEKIKGHFDTEKGGMDRAPKFP 216

Query: 183 RPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
            P   + +L +    ++        E  + +  +L  +A GGI+DHVGGG+ RYSVD+ W
Sbjct: 217 MPSIYKFLLRYYALTQN-------PEALRHIELSLNRIALGGIYDHVGGGWARYSVDDEW 269

Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
            +PHFEKMLYD GQL ++Y +A++LTK+  Y     + +D+L R+M    G  +SA DAD
Sbjct: 270 FIPHFEKMLYDNGQLLSIYSEAYTLTKNELYKSRVYETIDWLEREMTSTEGGFYSALDAD 329

Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKG 362
           S   EG     EG FYVWT  E+  +LG+    F + Y ++ +GN +       +N    
Sbjct: 330 S---EGV----EGKFYVWTQAELRSVLGDDFEWFSKLYNIRASGNWEHG-----YNHLHL 377

Query: 363 KNVLIELNDSSASASKLGMPLEKYLNILGE-------CRRKLFDVRSKRPRPHLDDKVIV 415
             +         S  ++G PL   +  L E         +KLF  R  R RP LDDK++ 
Sbjct: 378 TTISFVPETVEKSQWRVGPPLNYLMKGLFEKNSTYQAALQKLFVARESRIRPGLDDKILA 437

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
           SWNGL++     A +    E                ++  +A  +A F++  +     H+
Sbjct: 438 SWNGLMLKGLTDAYRAFGEE----------------KFKTLALQSAHFLKDKM-TAPNHQ 480

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
           L HS++NG +   GFL+DYA ++ G L LY+     +WL  A++L     E   D E   
Sbjct: 481 LWHSYKNGKASIVGFLEDYAAVVDGYLGLYQATFEEQWLDEALKLTAYAIENLYDPEEEL 540

Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASI 575
           ++ T      ++ R KE  D   P+ NS+   NL  L ++
Sbjct: 541 FYFTDANAEELIARKKEIFDNVIPASNSLMAHNLFTLGTL 580


>gi|291437584|ref|ZP_06576974.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
 gi|291340479|gb|EFE67435.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
          Length = 677

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 230/701 (32%), Positives = 333/701 (47%), Gaps = 83/701 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED   A  LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SSCHWCHVMAHESFEDRTTADYLNGHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+PD +P   GTYFPPE ++G P F  +L+ +  AW ++RD +          L+   
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGMPSFLQVLQGIHQAWQERRDEVTDVAGKITRDLA-GR 166

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
             S    K+P   EL Q  L      L++ YD + GGFG APKFP  + ++ +L H  + 
Sbjct: 167 EISYGDAKVPGEQELAQALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR- 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 221 --TGAEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALL 274

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY   +  T       +  +  D++ R++  P G   SA DADS   +G  R  EGA+
Sbjct: 275 CRVYAHLWRATGSELARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAY 332

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSAS 375
           YVWT  ++ ++LGE  A L   ++ +   G  +           +G +VL +   D    
Sbjct: 333 YVWTPAQLREVLGEEDADLAARYFGVTEEGTFE-----------EGASVLQLPQRDEVFD 381

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
           A++           +   R +L   R+ RP P  DDKV+ +WNGL +++ A         
Sbjct: 382 AAR-----------VDGVRERLLAARAARPAPGRDDKVVAAWNGLAVAALAETGAYF--- 427

Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDY 494
                        DR + +E A +A   + R  +DE   R+  + ++G   A  G L+DY
Sbjct: 428 -------------DRPDLVEAAVAAGDLLVRLHFDEHA-RIARTSKDGHVGANAGVLEDY 473

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +  G L L        WL +A  L +     F D + G  ++T  +   ++ R ++  
Sbjct: 474 ADVAEGFLALASVTGEGVWLEFAGLLLDHVLARFTDPDSGALYDTAADAERLIRRPQDPT 533

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
           D A PSG S +   L+   S  A + S+ +R  AE +L V    +K +   VP      +
Sbjct: 534 DNAVPSGWSAAAGALL---SYAAHTGSEPHRTAAERALGV----VKALGPRVPRFIGWGL 586

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             A  +L  P  + + +VG          L            V+ +    ++E       
Sbjct: 587 AVAEAVLDGP--REIAVVGPAPDDPATRTLHRTALLGTAPGAVVAVGTPGSDEFPL---- 640

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +A      D+  A VC++F+C  P TDP  L   L
Sbjct: 641 ------LADRPLVRDEPAAYVCRDFTCDAPTTDPDRLRAAL 675


>gi|238062793|ref|ZP_04607502.1| hypothetical protein MCAG_03759 [Micromonospora sp. ATCC 39149]
 gi|237884604|gb|EEP73432.1| hypothetical protein MCAG_03759 [Micromonospora sp. ATCC 39149]
          Length = 703

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 223/707 (31%), Positives = 336/707 (47%), Gaps = 75/707 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED GV KLLND FV+IKVDREERPDVD VYMT  QA+ G GGWP++VF
Sbjct: 49  CHWCHVMAHESFEDAGVGKLLNDGFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTVF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
            +PD  P   GTYFP      +P F  +L  V  AW ++R+ + + G+  +E +  A + 
Sbjct: 109 ATPDGTPFFCGTYFP------KPNFVRLLESVGTAWREQREAVLRQGSAVVEAIGGAQAV 162

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
              +           L   A +L++ YD   GGFG APKFP  + +  +L H ++   TG
Sbjct: 163 GGPTAP----FTAELLDAAAARLAREYDRDNGGFGGAPKFPPHLNLLFLLRHHQR---TG 215

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               ++E  ++   T + MA+GGIHD + GGF RYSVD  W VPHFEKMLYD   L  VY
Sbjct: 216 ----SAESLEIARHTAEAMARGGIHDQLAGGFARYSVDAHWTVPHFEKMLYDNALLLRVY 271

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
              + LT D     + RD   +L  ++  PG    SA DAD+   EG T       Y WT
Sbjct: 272 THLWRLTGDPLARRVARDTARFLADELHRPGEGFASALDADTEGVEGLT-------YAWT 324

Query: 322 SKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
             ++ ++LGE       + + + P+G       S P      +   +E      S  +L 
Sbjct: 325 PAQLVEVLGESDGRWAADLFAVTPSGTFAPHSASAPQGGTPDRRKGVE---HGTSVLRLA 381

Query: 381 MPLE--------KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
             ++        ++ +++G    +L   R  RP+P  DDKV+ +WNGL I++ A   +++
Sbjct: 382 RDVDDADPAIRGRWRDVVG----RLLAARDTRPQPARDDKVVAAWNGLAITALAEFVRLV 437

Query: 433 KS------EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
           ++      +A++ +     + +D     + AE  A+    HL D +  R+      G  +
Sbjct: 438 EAVGTGDEQADANLLEGVTIVAD-GALRDAAEHLAAV---HLVDGRLRRVSRDRVVG--E 491

Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
             G L+DY  +      +++     +WL  A +L +T    F    GGG+++T  +   +
Sbjct: 492 PAGVLEDYGCVAEAFCAMHQLTGEGRWLELAGDLLDTALARFA-APGGGFYDTADDAERL 550

Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
           + R  +  D A PSG S  V  LV  A++   S    YR+ AE +LA     +   A   
Sbjct: 551 VTRPADPTDNATPSGRSAIVAALVTYAAL---SGQPRYREVAEAALATVAPIVARHARFT 607

Query: 607 PLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
                A + +LS P    VV          + ++AAA+        ++   P        
Sbjct: 608 GYAATAGEALLSGPYEIAVV----TDDPAGDPLVAAAYRHAPPGAVLVAGRP-------- 655

Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
                     +A       +  A VC+ F C  PVT   ++E+LL +
Sbjct: 656 ---DQPGVPLLADRPMLDGRPTAYVCRGFVCQRPVT---TVEDLLAQ 696


>gi|154245776|ref|YP_001416734.1| hypothetical protein Xaut_1832 [Xanthobacter autotrophicus Py2]
 gi|154159861|gb|ABS67077.1| protein of unknown function DUF255 [Xanthobacter autotrophicus Py2]
          Length = 669

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 205/573 (35%), Positives = 290/573 (50%), Gaps = 61/573 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE+  VA L+N  FV+IKVDREERPDVD++YM+ +Q L   GGWPL++
Sbjct: 50  ACHWCHVMAHESFENADVAGLMNALFVNIKVDREERPDVDQIYMSALQQLGQSGGWPLTM 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL P+ KP  GGTYFPP   YGRPGF  +L++V   + + +D + ++ A  + +L +A +
Sbjct: 110 FLDPEGKPFWGGTYFPPAASYGRPGFTDVLQQVSTVFTQNKDKVEKNTATILARLKKAAT 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             A +    ++L   A RL A      +D   GG   APKFP+   ++ +     + +D 
Sbjct: 170 PVAGAAIGREDLNDAAARLPA-----MFDPVHGGLKGAPKFPQSGLLEFLWRVGTRRKDD 224

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                    + +V  TL  M +GGI+DH+GGGF RYSVDE W VPHFEKMLYD   L  +
Sbjct: 225 AL-------KAIVALTLNRMCEGGIYDHLGGGFARYSVDEIWFVPHFEKMLYDNALLLEL 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
              A+S T D  +    R+ + +L+R+M+ P G   ++ DAD   TEG     EG FYVW
Sbjct: 278 LALAYSDTGDALFLTRARETVGWLKREMLTPEGAFAASLDAD---TEG----HEGRFYVW 330

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           +  E+  +LG E A  F   Y +   GN ++             N+L        SA   
Sbjct: 331 SEAEITAVLGAEDAAFFNRLYDVSRAGNWEVG------------NILNRTEAGVVSAEDE 378

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
                     L   R KL   R KR RP  DDKV+  WNGL+I++ ARA   L       
Sbjct: 379 AR--------LAPLREKLLLAREKRVRPGRDDKVLADWNGLMIAALARAGGFL------- 423

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       E++ +A+ A   +  H+  E   RL HS+       PG   D A +  
Sbjct: 424 ---------GEAEWVALAQRAFDAVVSHMVVEG--RLAHSWCGTKIVLPGLASDLAAMAR 472

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             + L+E     + L  A       +    D E G YF T  +  S++LR    HD A P
Sbjct: 473 AGIALHEATGAPEPLAQAAHFLEVLETHHRDPETGAYFLTAYDGDSLILRPLATHDEAVP 532

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
           + N+V+   L+RLA++   + +D +R  A+  L
Sbjct: 533 NANAVAADALIRLAAL---TGNDAFRTRADRVL 562


>gi|284033485|ref|YP_003383416.1| hypothetical protein Kfla_5611 [Kribbella flavida DSM 17836]
 gi|283812778|gb|ADB34617.1| protein of unknown function DUF255 [Kribbella flavida DSM 17836]
          Length = 670

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 218/696 (31%), Positives = 327/696 (46%), Gaps = 83/696 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED+  A  LN+ FV +KVDREERPDVD +YM    A+ G GGWP+S
Sbjct: 49  SACHWCHVMAHESFEDDATAAYLNEHFVCVKVDREERPDVDAIYMEATVAMTGHGGWPMS 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P  +P   GTYFP + ++G   F+ +L  + DAW  KR+ +   GA  ++QL    
Sbjct: 109 VFLTPAGEPFFCGTYFPLDPRHGMASFRQVLESLVDAWRTKREQIDGIGASVVQQL---- 164

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
              A    + + +    L      L   +D   GGFG APKFP  + +  +L H ++   
Sbjct: 165 --GARQPAVGEAVDAAVLDRAVALLQGDFDPVDGGFGQAPKFPPSMVLDFLLRHHRR--- 219

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG    + E   MV  T + MA+GG++D + GGF RYSVD++W VPHFEKMLYD   L +
Sbjct: 220 TG----SEEALAMVTHTCERMARGGMYDQLAGGFARYSVDKQWIVPHFEKMLYDNALLLD 275

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   +++T       +  +  D+L  ++  P G   SA DAD   TEG    +EG +YV
Sbjct: 276 VYTHWWTVTGSPLAERVALETADFLLAELRTPEGGFASALDAD---TEG----EEGRYYV 328

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           W+  E+ ++LGE A    E         CD++        F+    +++L          
Sbjct: 329 WSPTELRELLGEDADWVIEL--------CDVT------GTFEHGTSVLQLRSDPDD---- 370

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
              L+++  I    R  L D R++R  P  DDKV+ +WNGL I++  RA  +L       
Sbjct: 371 ---LDRWNRI----RSVLRDARARRTYPGRDDKVVAAWNGLAITALTRAGLVL------- 416

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLI 498
                    DR EY+E A  AA  + R ++ + + RL  + R+G    A G L+DYA   
Sbjct: 417 ---------DRPEYVEAAVKAAELV-RDVHVDGSGRLHRTSRDGAVGTAHGVLEDYAAYA 466

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              L L        WL  A  L +   + F+    G +F+T  +  ++  R ++  D A 
Sbjct: 467 QACLTLLAATRDDSWLTLAQRLLDRVLQQFV--ADGTFFDTAADAETLAWRPQDATDNAS 524

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           P+G S++      LAS+   ++  Y +   +   A      +    A   +  A  + S 
Sbjct: 525 PAGVSLAAEAFSTLASVTGEAR--YEQAADQALAASAAIAARAPRFAGRALAVAETLQSG 582

Query: 619 PSRKHVVLVGHKSSVDFE----NMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
           P    V+     ++ D +     ++  A AS      V+   P             S+  
Sbjct: 583 PLEIAVIGAEDVAAGDGQEQVTQLVRTALASAPWGTAVVQGKP------------GSDVP 630

Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            +A       +  A VCQ F+C  P+  P  L   L
Sbjct: 631 LLAGRGLVDGRAAAYVCQKFTCRLPIVLPEDLRGEL 666


>gi|305665308|ref|YP_003861595.1| hypothetical protein FB2170_03390 [Maribacter sp. HTCC2170]
 gi|88710063|gb|EAR02295.1| hypothetical protein FB2170_03390 [Maribacter sp. HTCC2170]
          Length = 703

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 197/595 (33%), Positives = 317/595 (53%), Gaps = 78/595 (13%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVME E+FEDE VA+++N+ F+S+KVDREERPDVD+VYMT VQ + G  GWPL+V
Sbjct: 85  SCHWCHVMEEETFEDEKVAEIMNNDFISVKVDREERPDVDQVYMTAVQLMSGNAGWPLNV 144

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---KKRDMLAQSGAFAIEQLSE 137
            + P+ KPL GGTY      +    +  +L K+ + +     K +  A   +  I+ ++ 
Sbjct: 145 IVLPNGKPLYGGTY------HTNAQWSQVLEKINNLYKDDPTKANEYADMVSKGIQDVNL 198

Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
              +  +S     E+  + L+    Q   ++D   GG     KF  P  +  +L      
Sbjct: 199 IEPSEENS-----EISLDILKEGVTQWKPNWDLERGGNMGPEKFMLPGSLDFLL------ 247

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            D  +       +  +  TL  MAKGGI+DH+ GGF+RYS D  W++PHFEKMLYD  QL
Sbjct: 248 -DYAELSNDESVRSYIKTTLDQMAKGGIYDHIAGGFYRYSTDPNWNIPHFEKMLYDNAQL 306

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
            ++Y  A+++ KD  Y  I  + + +L+++M    G  F+A DADS   EG    +EG +
Sbjct: 307 ISLYSKAYTIFKDPVYKQIVLETVAFLQKEMKNTTGGYFAALDADS---EG----EEGKY 359

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASA 376
           YVWT++E+   +  +  LF ++Y             ++   + +G  +++  N +    A
Sbjct: 360 YVWTNEELRSTINNNQELFSKYY------------STEISTKMEGDKIVLRKNQNDEVFA 407

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           S+  + +EK   +  E ++KL +VR+ R +P +DDK+IVSWN L+I+ +  A        
Sbjct: 408 SENEISIEKLQELNKEWKKKLVEVRADRVKPRIDDKIIVSWNALLINGYVDA-------- 459

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                 F   G  R  ++  AES  + I  + Y +  ++L HSF+ G ++  GFL+DY+F
Sbjct: 460 ------FKAFGETR--FLVEAESIFTTIHENAYSD--NQLVHSFKKGSNRTEGFLEDYSF 509

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY-FNTTGEDPSVLLRVKEDHD 555
           L +  L+LY       +L +A +L  T  + F D +   Y FN++    S++ ++ ++ D
Sbjct: 510 LANASLNLYSASMNPDYLNFAQQLIKTTQKRFKDDDSDFYKFNSSN---SLIAKIIKNDD 566

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV-PLM 609
           G  PS N+V   NL+ L  I      +Y +  A HS        K+M +++ PL+
Sbjct: 567 GVIPSPNAVMAHNLLTLGHI------EYNKDYAAHS--------KNMLISIQPLL 607


>gi|386383690|ref|ZP_10069151.1| hypothetical protein STSU_12230 [Streptomyces tsukubaensis
           NRRL18488]
 gi|385668865|gb|EIF92147.1| hypothetical protein STSU_12230 [Streptomyces tsukubaensis
           NRRL18488]
          Length = 672

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 238/704 (33%), Positives = 333/704 (47%), Gaps = 90/704 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFEDE  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 47  SSCHWCHVMAHESFEDEATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+ D +P   GTYFPPE ++G   F+ +L  V  AW  +R+ + +  A     L+   
Sbjct: 107 VFLNADGEPFYFGTYFPPEPRHGMASFRQVLEGVTAAWRDRREEVGEVAAKITRDLA-GR 165

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +A+     LP  DEL Q  L      L++ YD R+GGF  APKFP  + ++ +L H  + 
Sbjct: 166 AAAHGGEGLPGEDELSQALL-----GLTRDYDERYGGFAGAPKFPPSMVLEFLLRHYAR- 219

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G       M   T + MA+GG++D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 220 --TGARG----ALDMAAGTCEAMARGGLYDQLGGGFARYSVDREWIVPHFEKMLYDNALL 273

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY   +          I  +  D+L R++    G   SA DADS +  G     EGAF
Sbjct: 274 CRVYAHLWRADGSPLARRIALETADFLVRELRTAEGGFASALDADSHDPAG--EHGEGAF 331

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           YVWT  ++ + LGE                 D  R ++ +        + E       AS
Sbjct: 332 YVWTPAQLTEALGE----------------ADGRRAAEIYG-------VTEEGTFERGAS 368

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
            L +P E    +    R +LF+ R +RPRP  DDKV+ +WNGL I++ A           
Sbjct: 369 VLRLPGEDDPAL----RARLFEARERRPRPERDDKVVAAWNGLAIAALAETGAFF----- 419

Query: 438 SAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYA 495
                      DR + +E A  AA   +R HL D    RL  + ++G     PG L+DYA
Sbjct: 420 -----------DRPDLVERATEAADLLVRVHLGDGA--RLTRTSKDGVAGHNPGVLEDYA 466

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  G + L        WL +A  L +   +LF   E G  F+T  +   ++ R ++  D
Sbjct: 467 DVAEGFIALAGVTGEGVWLDFAGVLLDLVIDLFTG-ENGTLFDTAHDAERLIRRPQDPTD 525

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
            A P+G + +   L+   S  A + S+ +R  AE +L V    +K +   VP      + 
Sbjct: 526 NATPAGWTAAAGALL---SYAAHTGSEPHRAAAERALGV----VKALGPRVPRFAGWGLA 578

Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
            A  +L  P  + + +VG         +   A  +      V   +P D +E    +   
Sbjct: 579 VAEALLDGP--REIAVVGLDGDPAARALHRTALIATAPGAVVASGEP-DGDEFPLLKGRP 635

Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
             N   A          A VC+ F+C  P TDP  L + L   P
Sbjct: 636 LVNGEAA----------AYVCRGFTCRTPTTDPAELASELAGAP 669


>gi|444721531|gb|ELW62264.1| Spermatogenesis-associated protein 20 [Tupaia chinensis]
          Length = 857

 Score =  310 bits (795), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 210/575 (36%), Positives = 289/575 (50%), Gaps = 81/575 (14%)

Query: 178 APKFPRPVEIQMMLYHSKKLED--------TGKSGEASEGQKMVLFTLQCMAKGGIHDHV 229
           AP  P P  + +ML  S  +             + + S  Q+M L TL+ MA GGI DHV
Sbjct: 320 APHHPDPPPLSLMLSVSTVILSFLFSYWLGHRLTQDGSRAQQMALHTLKMMANGGIRDHV 379

Query: 230 GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS---------------LTKDVFYS 274
           G          +WHVPHFEKMLYDQ QLA  Y  AF                ++ D FYS
Sbjct: 380 G----------QWHVPHFEKMLYDQAQLAVAYSQAFQAAPVTSIYSLLSAPQISGDEFYS 429

Query: 275 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV-----EDIL 329
            + + IL Y+ R +    G  +SAEDADS    G  R KEGAFYVWT KEV     E +L
Sbjct: 430 DVAKGILQYVSRSLSHRSGGFYSAEDADSPPERG-LRPKEGAFYVWTVKEVLQQLPEPVL 488

Query: 330 G-----EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
           G         L  +HY L   GN  +S   DP  E +G+NVL        +A++ G+ ++
Sbjct: 489 GATEPLTSGQLLMKHYGLTEPGN--ISPNQDPKGELQGQNVLTVRYSLELTAARFGLDVD 546

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
               +L     KLF  R  RP+PHLD K++ +WNGL++S +A    +L            
Sbjct: 547 AVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAVL------------ 594

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAF 496
             G DR   +  A + A F++RH++D  + RL  +   G       S  P  GFL+DYAF
Sbjct: 595 --GVDR--LITYATNGAKFLKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAF 650

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHD 555
           ++ GLLDLYE    + WL WA+ LQ+TQD+LF D +GGGYF +  E  + L LR+K+D D
Sbjct: 651 VVRGLLDLYEASQESAWLEWALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQD 710

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
           GAEPS NSVS  NL+RL     G K   +       L  F  R++ + +A+P M  A   
Sbjct: 711 GAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA 767

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
               + K +V+ G   + D + +L   H+ Y  NK +I    AD +   F        ++
Sbjct: 768 -HQQTLKQIVICGDPQAKDTKALLQCVHSIYVPNKVLIL---ADGDPSSFLSRQLPFLST 823

Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           + R     D+  A VC+N +CS P+T+P  L  LL
Sbjct: 824 LRRLE---DRATAYVCENQACSMPITEPSELRKLL 855



 Score =  215 bits (547), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 108/237 (45%), Positives = 153/237 (64%), Gaps = 17/237 (7%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G+ +F    K  +  FL     TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 83  GQEAFDKARKENKPIFLSVGCATCHWCHMMEEESFQNEEIGRLLSEEFVSVKVDREERPD 142

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VDKVYMT+VQA   GGGWP++V+L+PDL+P +GGTYFPPED   R GF+T+L +++D W 
Sbjct: 143 VDKVYMTFVQATSSGGGWPMNVWLTPDLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 202

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
           + ++ L ++     E+++ AL A +  +    +LP +A  +   C +QL + YD  +GGF
Sbjct: 203 QNKNTLLENS----ERVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 258

Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
             APKFP PV +  +   +   +L   G     S  Q+M L TL+ MA GGI DHVG
Sbjct: 259 AEAPKFPTPVILSFLFSYWLGHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVG 310


>gi|300770884|ref|ZP_07080761.1| thymidylate kinase [Sphingobacterium spiritivorum ATCC 33861]
 gi|300762157|gb|EFK58976.1| thymidylate kinase [Sphingobacterium spiritivorum ATCC 33861]
          Length = 672

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 192/572 (33%), Positives = 284/572 (49%), Gaps = 67/572 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE++ +A+ +N ++VS+K+DREERPD+D++YMT VQ +   GGWPL+
Sbjct: 48  SACHWCHVMERESFENDAIAQTMNKFYVSVKIDREERPDIDQIYMTAVQLMTNAGGWPLN 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIE---QLS 136
               PD +P+ GGTYF P D      ++ IL ++   W+       Q    AIE   +L+
Sbjct: 108 CICLPDGRPIYGGTYFKPHD------WQNILLQIAQMWE-------QQPLVAIEYATKLT 154

Query: 137 EALSASA--SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
           + +  S     N +PD+     L          +D++ GG+  APKFP P     +L   
Sbjct: 155 DGIQQSERLPINPIPDQYNTADLSAIITPWVALFDTKDGGYNRAPKFPLPNNWLFLL--- 211

Query: 195 KKLEDTGKSGEASEGQKM---VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
                  + G  +  +K+   V FTLQ MA GGI+D +GGGF RYSVD  WH+PHFEKML
Sbjct: 212 -------RYGVLAGDEKIIDHVHFTLQKMACGGIYDQIGGGFARYSVDPYWHIPHFEKML 264

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD GQL +++ +A+      FY  + ++ + +  R+M+      + A DADS   EG   
Sbjct: 265 YDNGQLLSLFSEAYQQRPLPFYKRVVQETIHWANREMLAANNGFYCALDADS---EGV-- 319

Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
             EG +Y ++  E+E ILGE A LF  ++ +   GN             +  N+ I   D
Sbjct: 320 --EGKYYSFSKSEIEKILGEDAPLFISYFNITAEGNWTE----------ESTNIPILDPD 367

Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
           +   A + G   E++   L E + KL+  R  R RP LD K + +WN L++     A ++
Sbjct: 368 ADLMALEAGYSAEEWETCLAEAKEKLYRYRETRIRPGLDHKQLATWNALMLKGLTDAYRV 427

Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
                            D   Y++ A   A FI   L  +   R+ H  ++   +  GFL
Sbjct: 428 F----------------DNSSYLDTAIKNAHFIIDELI-KSDGRILHQPKDANREIFGFL 470

Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
           DDYAF     + LYE     KWL  A +L +   ELF D     ++ T      ++ R  
Sbjct: 471 DDYAFTTEAFIALYEATFDEKWLDLARQLADKALELFYDSHQKTFYYTADSSGELIARKS 530

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
           E  D   P+  S  V+ L +L  +    K DY
Sbjct: 531 EIMDNVIPASTSAIVLQLKKLGLLF--DKEDY 560


>gi|374599798|ref|ZP_09672800.1| hypothetical protein Myrod_2291 [Myroides odoratus DSM 2801]
 gi|423324955|ref|ZP_17302796.1| hypothetical protein HMPREF9716_02153 [Myroides odoratimimus CIP
           103059]
 gi|373911268|gb|EHQ43117.1| hypothetical protein Myrod_2291 [Myroides odoratus DSM 2801]
 gi|404606964|gb|EKB06498.1| hypothetical protein HMPREF9716_02153 [Myroides odoratimimus CIP
           103059]
          Length = 665

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 202/594 (34%), Positives = 296/594 (49%), Gaps = 79/594 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           +TCHWCHVME ESF +  VA+++N  F+SIKVDREE PDVD  YM  VQ +   GGWPL+
Sbjct: 47  STCHWCHVMEEESFTNPAVAEVMNQDFISIKVDREEHPDVDAYYMKAVQLMTKQGGWPLN 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ--------SGAFA 131
           V   PD +P+ GGTYFP                 K  W      LAQ        +  FA
Sbjct: 107 VVCLPDGRPIWGGTYFP-----------------KQTWVNALTQLAQLHQNKPEATLEFA 149

Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
             +L E +     +  + +E  +  L +  E+  +S+D  +GG+  APKF  P     +L
Sbjct: 150 T-KLQEGVYIMGLA-PVANEESRFNLDIVLEKWKQSFDLEYGGYQRAPKFMMPTN---LL 204

Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
           Y    L+  G      +    +  TL  MA GGI D + GGF RYSVD +WH+PHFEKML
Sbjct: 205 Y----LQKVGDLTRDKDLLHYIDLTLTQMAWGGIFDVLEGGFSRYSVDFKWHIPHFEKML 260

Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           YD  QL +VY DA+  T +  Y  +    + +++R+ +   G I+SA DADS   +G + 
Sbjct: 261 YDNAQLLSVYSDAYKRTANPLYLEVITKTIQFIQRNWLSDWGGIYSALDADSVNDKGIS- 319

Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
            +EGA+YVWT   +  ILG+   LF + + +   G  +           +G  VLI+ N 
Sbjct: 320 -QEGAYYVWTEATLRRILGDDFSLFAQIFNVNAYGYWE-----------EGHFVLIQ-NQ 366

Query: 372 SSASASKLGMPLEKYLNILGECRRK------LFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
             AS +         L++     RK      L + R  RP+PHLDDK+I SWN ++I+  
Sbjct: 367 PLASIATANQ-----LDVFDLQERKKKWEQLLLEERDHRPKPHLDDKIICSWNAMLITGL 421

Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
             A                   ++   Y++ AES   +I+ +L DE+   L HS  N  +
Sbjct: 422 LDAYS----------------ATNETSYLQQAESIYHYIQTYLLDEE-RGLFHSSHNQNA 464

Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
              G+LDDYAF I  L+ L+E  +   +L  A  L +   +LFLD +   ++       +
Sbjct: 465 HTLGYLDDYAFYIQALIRLFEHTANQDYLWQAKRLMDLTLDLFLDEKSKFFYFNQASQAN 524

Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
            +LR  E  D   PS N+V  ++L++L       +  +Y Q A+H + V ++ L
Sbjct: 525 HILRSIETEDNVIPSANAVLCMSLLQLG---VAFEHAHYTQLAQHMIEVMQSNL 575


>gi|359145694|ref|ZP_09179393.1| hypothetical protein StrS4_07994 [Streptomyces sp. S4]
          Length = 675

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 235/705 (33%), Positives = 336/705 (47%), Gaps = 93/705 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE  A ++N  FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SACHWCHVMAHESFEDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P   GTYFPPE ++G PGF+ +L  V+ AW ++R  + +     +  L E  
Sbjct: 108 VFLTPEGEPFYFGTYFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERR 167

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            A     +LP  +E  Q  L      L++ YD   GGFG APKFP  + ++ +L H  + 
Sbjct: 168 LALGEP-RLPGAEEAAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR- 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 221 --TGAEG----ALQMAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALL 274

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY+  +  T       +  +  +++ RD+  P G   SA DADSA+  G  R  EGA+
Sbjct: 275 CRVYVHLWRATGSEQARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAY 332

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVWT  ++ ++LGE    +   H+ +   G             F+    ++ L     + 
Sbjct: 333 YVWTPAQLVEVLGEEDGRVAAAHFGVTEEGT------------FEEGASVLRLPQEDGAV 380

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G         +   R +L++ R +RP P  DDKV+ +WNGL I++ A A        
Sbjct: 381 QDAGR--------IASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF---- 428

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
                       +R + ++ A +AA   +R HL D    RL  + R+G  S   G L+DY
Sbjct: 429 ------------ERPDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDY 474

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +  G L L        WL +A  L +   + F D E G  ++T  +   ++ R ++  
Sbjct: 475 ADVAEGFLALASVTGEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPT 533

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
           D A PSG + +      L    A + S+ +R  AE +L V    +  +   VP      +
Sbjct: 534 DNATPSGWTAAAGA---LLGYAAQTGSEPHRTAAERALGV----VAALGPKVPRFIGNGL 586

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFW 666
                +L  P  + V +VG  S    +   A  H +  L+     V+   PAD E     
Sbjct: 587 AVTEALLDGP--REVAVVGDPS----DPRTAVLHRTALLSTAPGAVVAAGPADGE----- 635

Query: 667 EEHNSNNASMARNNFSADKV-VALVCQNFSCSPPVTDPISLENLL 710
                    +      AD    A VC+ F C  P TDP  L   L
Sbjct: 636 -------LPLLAGRVPADGAPTAYVCRGFVCDAPTTDPALLAAQL 673


>gi|357028650|ref|ZP_09090680.1| hypothetical protein MEA186_27750 [Mesorhizobium amorphae
           CCNWGS0123]
 gi|355537917|gb|EHH07167.1| hypothetical protein MEA186_27750 [Mesorhizobium amorphae
           CCNWGS0123]
          Length = 672

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 227/694 (32%), Positives = 333/694 (47%), Gaps = 83/694 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE++ VA ++N  FV+IKVDREERPD+D++YM  + A+   GGWPL++F
Sbjct: 54  CHWCHVMAHESFENDTVAAVMNRLFVNIKVDREERPDIDQIYMAALHAMGEQGGWPLTMF 113

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD KP  GGTYFP + +YGRPGF  ++  V  AW +KR+ LAQS A  +    E   A
Sbjct: 114 LTPDGKPFWGGTYFPRDARYGRPGFIQVMEAVDKAWREKRESLAQS-ADGLTSHVETRLA 172

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            A +  + D   ++ L   A ++    D   GG   APKFP        L+ S   + T 
Sbjct: 173 GAHTKAVLD---RDTLGDLAGRIDGMIDRELGGLRGAPKFPN-APFMHTLWLSWLRDGTA 228

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
              +A      VL +L+ M  GGI+DHVGGG  RYS D  W VPHFEKMLYD  QL  + 
Sbjct: 229 SHRDA------VLLSLEMMLAGGIYDHVGGGLSRYSTDAEWLVPHFEKMLYDNAQLIRMC 282

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
             A++ T    +     D +++L R+M   GG   ++ DADS         +EG FY W+
Sbjct: 283 NWAYAATGSDLFRLRIEDTVEWLLREMRVDGGAFAASLDADS-------DGEEGLFYTWS 335

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
             ++  +LG+ + LF  ++ L           S PH  ++GK ++ +    + +   LG+
Sbjct: 336 RDDINSVLGDDSALFFNYFIL-----------STPHG-WEGKPIIHQ----TQAQQSLGI 379

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
                L  L   + KL   R +R RP  D K +  WNGL+I++ A A + L         
Sbjct: 380 ADRDQLAPL---KAKLLAAREQRIRPGRDGKALTDWNGLMIAALAEAGRTLT-------- 428

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
                   R ++++ A  A S I    ++    RL HS        P    DYA + +  
Sbjct: 429 --------RSDWIDAAAQAFSHIAGASHE---GRLPHSMLGAKKLFPALSSDYAAMTNAA 477

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           + L+E      ++  A       D    D E  GY+ T  +   V +R++ D D A PS 
Sbjct: 478 ISLFEATGDPNYVEQARHFVAQLDLWHRDSESTGYYLTASDSGDVPIRIRGDVDEAIPSA 537

Query: 562 NSVSVINLVRLASIVA----GSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
           +S  +  LVRL+S       G K+      AEH++    T  +    A  +  CA   L+
Sbjct: 538 SSQIIEALVRLSSATGDLDLGEKA---WTTAEHAMG--RTAQQAYGQAGIVNACA---LA 589

Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
           +   K VV+     S +  +++  A+ + D  +  I +    TE         +N  ++ 
Sbjct: 590 LEPLKLVVV----DSPENPSLVPVANRNPDPRRVDIVVQ-VGTE---------ANRPTLP 635

Query: 678 RNNF-SADKVVALVCQNFSCSPPVTDPISLENLL 710
                  DK  A +C    C P VTDP  LE LL
Sbjct: 636 GGVLPPTDKPGAWLCTGQVCLPVVTDPEELEELL 669


>gi|117929090|ref|YP_873641.1| hypothetical protein Acel_1883 [Acidothermus cellulolyticus 11B]
 gi|117649553|gb|ABK53655.1| protein of unknown function DUF255 [Acidothermus cellulolyticus
           11B]
          Length = 658

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 236/702 (33%), Positives = 328/702 (46%), Gaps = 104/702 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHVM  ESFED   A  +N+ FV +KVDREERPD+D VYM   QA+ G GGWPL+
Sbjct: 48  SSCHWCHVMAHESFEDPATAAFMNEHFVCVKVDREERPDIDAVYMEATQAMTGRGGWPLT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
            FL+PD +P   GTYFP E + G P F+ +L  V  AW  +   L  +    +  L +  
Sbjct: 108 CFLTPDGEPFFTGTYFPKEPRAGMPAFRQVLEAVWTAWQSRSADLVAAARRVVAVLQQ-- 165

Query: 140 SASASSNKLPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
                 ++L D+L     + L     +L + YD   GGFGSAPKFP    ++ +L +   
Sbjct: 166 -----GSRLTDDLGAIDADLLDAAVGELRRQYDPVHGGFGSAPKFPSATTLEFLLRY--- 217

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
               G  G      +MV  T + MA+GGI+D + GGFHRYSVD  W VPHFEKMLYD  Q
Sbjct: 218 ----GSLG----AMEMVAVTCEHMARGGIYDQLAGGFHRYSVDAAWTVPHFEKMLYDNAQ 269

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L  VYL  +  T+      I  ++ ++L RD+  P G   +A DAD+   EG T      
Sbjct: 270 LLGVYLHWWRRTQHQLARRIVEEVAEFLLRDLCTPAGGFAAALDADAGGVEGGT------ 323

Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
            YVWT  E+ D LG + A    E + +   GN +            G++VL    D+   
Sbjct: 324 -YVWTLAELRDALGSDDAAYAAELFGVTEHGNTE-----------DGRSVLQLAVDAP-- 369

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
                  LE++  I    R++L  VRS+R +P  DDK+I SWNGL ++S A A  +L   
Sbjct: 370 ------DLERWRRI----RQRLLAVRSRRAQPARDDKIIASWNGLAVASLAEAGFLL--- 416

Query: 436 AESAMFNFPVVGSDRKEYMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDD 493
                        DR   ++ A  SA   I  HL D    RL  S R+G  +   G LDD
Sbjct: 417 -------------DRDALVDAAVRSAEYLIDVHLRD---GRLCRSSRDGERNPVDGALDD 460

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRV 550
           YA +  GLL L +  S  ++L    EL     E  L     E GG+++T  +   ++ R 
Sbjct: 461 YANVAQGLLTLAQIRSEARYL----ELAGALLEAILTHFRAEDGGFYDTADDAERLVRRP 516

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL--AVFETRLKDMAMAVPL 608
           +   D A PSGNS +   L+  A++   + S  +R     +L   V   R    A+   L
Sbjct: 517 RTFTDDATPSGNSAAAHALLTYAAL---TGSQRHRDAVPGALRPTVRLARRYPHAVGYGL 573

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
              AA  L  P+   + +VG  S                L +T   +D           +
Sbjct: 574 ATIAA-WLDGPA--EIAVVGDGS----------------LWRTAWLVDRPGAVRAARAAD 614

Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                  +        + +A VC+NF C  PV     L  LL
Sbjct: 615 GPPWAPLLEGRTAPPGQSLAYVCRNFECQRPVASEAELRALL 656


>gi|72160855|ref|YP_288512.1| hypothetical protein Tfu_0451 [Thermobifida fusca YX]
 gi|71914587|gb|AAZ54489.1| conserved hypothetical protein [Thermobifida fusca YX]
          Length = 665

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 229/690 (33%), Positives = 326/690 (47%), Gaps = 96/690 (13%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF DE  A+++N  FV++KVDREERPDVD VYM   QA+ G GGWP++VF
Sbjct: 50  CHWCHVMARESFADEQTAQIMNANFVNVKVDREERPDVDAVYMEATQAMTGHGGWPMTVF 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
            +PD +P   GTYFP E       F+ +L  +  AW   R  +   G    ++++EALSA
Sbjct: 110 ATPDGEPFYCGTYFPREH------FQRLLLGISHAWRTDRTGVVGQG----KRVAEALSA 159

Query: 142 SASSNKLPDELPQNA--LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                 LP   P +A  L     +L+  YD+  GG+G+APKFP    ++ +L H  ++ D
Sbjct: 160 ---PRTLPSGPPPSAQVLEQAVARLAAEYDTVNGGYGTAPKFPPSPVMEFLLRHHARVSD 216

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               G  +E  +MV  T + MA+GGI+D + GGF RY+VD  W VPHFEKMLYD   L  
Sbjct: 217 ----GAETEALRMVRHTAEAMARGGIYDQLAGGFARYAVDATWTVPHFEKMLYDNALLLR 272

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
            Y   +  T D     +  +  D++  ++    G   SA DADS   EG    +EG +YV
Sbjct: 273 CYTHLWRQTGDELARRVAVETADWMVAELRTAEGGFASALDADS---EG----EEGRYYV 325

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT  ++ D+LGE    +            +L  +++     +G +VL    D        
Sbjct: 326 WTPAQLRDVLGEEDGAWA----------AELFGVTEQGTFERGTSVLQLRADPDDR---- 371

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
               E+Y  +    R +L   R+ R  P  DDKV+  WNGL I+  A A  +L       
Sbjct: 372 ----ERYAYV----RDRLRKARANRVPPARDDKVVTGWNGLAIAGLAEAGALL------- 416

Query: 440 MFNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
                    DR + +E A  AA   + RH  D    RL    R+G P  + G L+DYA L
Sbjct: 417 ---------DRPDLVERAREAARLVVERHYAD---GRLVRVSRDGVPGTSAGVLEDYANL 464

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             GLL L+      +W+    EL  T    F D   GG+++T  +  ++  R +E  D A
Sbjct: 465 AEGLLALHAVTGEIRWVGVCGELLETVLTRFTDGS-GGFYDTADDAEALFNRPREFTDDA 523

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMCC 611
            PSG S +   L+  A++   + S  +R+ AE +L V  T      R     MAV     
Sbjct: 524 TPSGWSAAAGALLSYAAL---TGSFRHREAAEAALGVVSTLAEKTPRFAGWGMAV----- 575

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           A  +L+ P    + +VG K     E +   A  +      V   D  +   +   E    
Sbjct: 576 AEALLAGPV--EIAVVGPKGDPVAEELHRTALLATTPGTVVSRGDGVNDGGIGLLEGRTL 633

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVT 701
            +   A          A VC+NF+C  P T
Sbjct: 634 VDGRPA----------AYVCRNFTCRLPAT 653


>gi|372222108|ref|ZP_09500529.1| hypothetical protein MzeaS_07308 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 701

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 185/555 (33%), Positives = 296/555 (53%), Gaps = 47/555 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME E FE+E VAKL+N+ F++IK+DREERPDVD++YM  +Q + G GGWPL++ 
Sbjct: 78  CHWCHVMEEECFENEEVAKLMNENFINIKIDREERPDVDQIYMDAIQMMTGNGGWPLNIV 137

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS- 140
             PD +P  G TY P ++      +   L+ + D +    + + Q  A  +EQ  +A++ 
Sbjct: 138 ALPDGRPFWGATYLPKDN------WTKSLKSLIDLYHNDPEKV-QEYAGKLEQGIQAINL 190

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
               ++K+     +  L L  +  S S+D+  GG+  APKF  P  ++ +L+++      
Sbjct: 191 VENKTSKI--HFTKEELDLAVQNWSTSFDTYLGGYKRAPKFMMPNNLEYLLHYA------ 242

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
             + +     + V  TL  MA GGI D + GGF RY+VD +WHVPHFEKMLYD GQL ++
Sbjct: 243 -TANKNDTILEYVNTTLTRMAYGGIFDPIDGGFSRYAVDVKWHVPHFEKMLYDNGQLISL 301

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y  A+++TK+  Y       + +   +++   G  +S+ DADS    G  + +EGA+YVW
Sbjct: 302 YSKAYAVTKNSLYKETVEKSVGFATLELLDTNGGFYSSLDADSKNNSG--KLEEGAYYVW 359

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T KE++ ILG  + +FK +Y +   G  +           + K VLI     +  A  LG
Sbjct: 360 TEKELDSILGSESSVFKTYYNINSYGYWE-----------EDKYVLIRDASDNELADSLG 408

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +        + +  ++L  VR +R +P LDDK++ SWNGL++     A + L+++     
Sbjct: 409 IATTNLTQQIAKNLKQLKKVRGQREKPRLDDKILTSWNGLMLKGLTDAYRYLQND----- 463

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                      +Y+++A   A+F+ + +  +    +  + +NG S   GFLDDYA LI G
Sbjct: 464 -----------KYLQLALKNANFLEQEIIQDD-FSVYRNHKNGKSSINGFLDDYATLIDG 511

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            + LYE     +WL  A  L +     F D+E   ++ T+  D  ++ R  E +D    +
Sbjct: 512 FIGLYEVTFDDRWLTLAKNLTDYAITHFKDQESNMFYYTSDLDDKLIRRSIETNDNVISA 571

Query: 561 GNSVSVINLVRLASI 575
            NS+   NL +L  +
Sbjct: 572 SNSIMANNLYKLHKV 586


>gi|383775980|ref|YP_005460546.1| hypothetical protein AMIS_8100 [Actinoplanes missouriensis 431]
 gi|381369212|dbj|BAL86030.1| hypothetical protein AMIS_8100 [Actinoplanes missouriensis 431]
          Length = 688

 Score =  309 bits (792), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 237/720 (32%), Positives = 349/720 (48%), Gaps = 82/720 (11%)

Query: 11  KTRRTHFLIN----TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTY 66
           K R    LI+    +CHWCHVM  ESFED  +A  +N+ FVS+KVDREERPDVD VYMT 
Sbjct: 34  KRRDVPLLISVGYSSCHWCHVMAHESFEDAAIAAQMNEGFVSVKVDREERPDVDAVYMTA 93

Query: 67  VQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ 126
            QA+ G GGWP++VF +PD  P   GTYF P D++GR     +L  V  AW  +RD + +
Sbjct: 94  TQAMTGQGGWPMTVFATPDGDPFFCGTYF-PRDQFGR-----LLASVTTAWRDQRDDVLK 147

Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
            GA  +E +  A        + P  +  + L   A+ L+K  D  +GGFG APKFP  + 
Sbjct: 148 QGAAVVEAVGGAQMIGGP--RAP--ISGDLLAAAAQGLAKEQDQTYGGFGGAPKFPPHMN 203

Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           +  +L H ++   TG    +++  ++V    + MA+GGI+D + GGF RY+VDE W VPH
Sbjct: 204 LLFLLRHHER---TG----SADALEIVRHACERMARGGIYDQLAGGFARYAVDETWTVPH 256

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           FEKMLYD   L  VY   + LT D+F   I  +   +L RD+    G + SA DAD++  
Sbjct: 257 FEKMLYDNALLLRVYTQLWRLTGDLFARRIADETAAFLLRDLGTAQGGLASALDADTSGV 316

Query: 307 EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFK---- 361
           EG T       Y WT  E+ + LG E      + + +   G    +  S P +       
Sbjct: 317 EGLT-------YAWTPAELAEALGAEDGAWAADLFRVTEPGTFAHNSASAPIDGAADRMK 369

Query: 362 ----GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
               GK+VL+   D   +   +   +E++ ++    R++L   R+ RP+P  DDKV+ SW
Sbjct: 370 GVEHGKSVLVLARDIDEADPAI---VERWRDV----RQRLLTARNGRPQPARDDKVVASW 422

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL I++ A    +L   A S           R   + +AE  A    RHL D    RL+
Sbjct: 423 NGLAITALAE-HGVLTGSAGS-----------RDAAVALAEVLAD---RHLVD---GRLR 464

Query: 478 HSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
              R+G +  P G L+DY  +    L +++  +  +WL  A EL +     F   + GG+
Sbjct: 465 RVSRDGVAGEPAGVLEDYGSVAEAFLAVHQVTASPRWLTLAGELLDVALARFGSGD-GGF 523

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           ++T  +   +L R  +  D A PSG SV    LV  A++   S S  +R+ A+ +LA   
Sbjct: 524 YDTADDAEKLLTRPADPTDNATPSGLSVVCAALVSYAAL---SGSTAHREAADAALATVG 580

Query: 597 TRLKDMA-MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
             +      A      A   L+ P   + + +        + ++ AAH S     TVI +
Sbjct: 581 PLIGGHPRFAGYAAAVAEAALTGP---YEIAIATTDRTAADPLVEAAHWSAP-GGTVIVV 636

Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
              D   +            +A          A VC+ F C  PVT P  L + L + P+
Sbjct: 637 GEPDRPGVPL----------LADRPLIGGASTAYVCRGFVCDRPVTTPGDLADRLGQSPT 686


>gi|443288943|ref|ZP_21028037.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
 gi|385888344|emb|CCH16111.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
          Length = 680

 Score =  309 bits (792), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 209/586 (35%), Positives = 285/586 (48%), Gaps = 60/586 (10%)

Query: 10  TKTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 65
            K R    LI+     CHWCHVM  ESFE+E VA LLND FVSIKVDREERPDVD VYMT
Sbjct: 33  AKRRDVPVLISVGYAACHWCHVMAHESFENEQVAALLNDNFVSIKVDREERPDVDAVYMT 92

Query: 66  YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA 125
             QA+ G GGWP++VF +PD  P   GTYFP      R  F  +L+ V  AW  +R  + 
Sbjct: 93  ATQAMTGQGGWPMTVFATPDGTPFFCGTYFP------RANFVRLLQSVTTAWADQRAEVL 146

Query: 126 QSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV 185
           + GA  +E +  A +    +  L   L    L   A  L+  YD+  GGFG APKFP  +
Sbjct: 147 RQGAAVVEAIGGAQAVGGPTAPLDGPL----LDAAAGNLASGYDATNGGFGGAPKFPPHM 202

Query: 186 EIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
            +  +L H ++  D           ++V  T + MA+GGI+D + GGF RYSVD  W VP
Sbjct: 203 NLLFLLRHHQRTGD-------PRSLEIVRHTAEAMARGGIYDQLAGGFARYSVDAHWTVP 255

Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
           HFEKMLYD   L  VY   + LT D     + RD   +L  ++  PG    SA DAD+  
Sbjct: 256 HFEKMLYDNALLLRVYAQLWRLTGDPLARRVARDTARFLADELHRPGEGFASALDADTEG 315

Query: 306 TEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
            EG T       Y WT  ++ + LGE    F            DL  ++D      G +V
Sbjct: 316 VEGLT-------YAWTPAQLVEALGEDDGRFA----------ADLFTVTDEGTFEHGMSV 358

Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
           L    D    A ++     ++  ++G+    L   R  RP+P  DDKV+ +WNGL I++ 
Sbjct: 359 LRLARDVDDVAPEV---RARWQRVVGQ----LLAARDTRPQPARDDKVVAAWNGLAITAI 411

Query: 426 AR----ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
           A     A+     E E A     V         + AE  A+    H+ D    RL+   R
Sbjct: 412 AEFLQVAALYASPEDEDANLMEGVTIVADGAMRDAAEHLATV---HVVD---GRLRRVSR 465

Query: 482 NGPSKAP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
           +G   AP G L+DY  +      L++     +WL  A +L +   E F    GG Y++T 
Sbjct: 466 DGRVGAPAGVLEDYGCVAEAFCALHQLTGEGRWLTVAGQLLDAALEHFA-APGGAYYDTA 524

Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
            +   ++ R  +  D A PSG S  V  LV  A++   ++   YR+
Sbjct: 525 DDAEQLVARPADPTDNATPSGRSALVAGLVSYAALTGETR---YRE 567


>gi|427707072|ref|YP_007049449.1| hypothetical protein Nos7107_1658 [Nostoc sp. PCC 7107]
 gi|427359577|gb|AFY42299.1| hypothetical protein Nos7107_1658 [Nostoc sp. PCC 7107]
          Length = 685

 Score =  309 bits (792), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 211/623 (33%), Positives = 315/623 (50%), Gaps = 82/623 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDGAIADYMNTNFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
            FLSP DL P   GTYFP + +YGRPGF  +L+ ++  +D +++ L Q  A  ++ L   
Sbjct: 108 TFLSPEDLVPFYAGTYFPVDPRYGRPGFLQVLQALRRYYDTEKEDLRQRKAVILDSL--- 164

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHSK 195
           L+++   N  P E+ ++ L      L K +++  G   S      FP      M+ Y   
Sbjct: 165 LTSAVLQNSDPQEVQEHEL------LGKGWETSTGIITSNQYGNSFP------MIPYSEL 212

Query: 196 KLEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            L  T  +  +  +G+++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD 
Sbjct: 213 ALRGTRFNLPSRYDGKQICTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDN 272

Query: 255 GQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           GQ+     + +S   ++  ++      + +L+R+MI P G  ++A+DADS     A   +
Sbjct: 273 GQIVEYLANLWSAGIQEPAFARAIAGTVQWLQREMIAPEGYFYAAQDADSFTNSDAVEPE 332

Query: 314 EGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGAFYVW+  ++E +L  E     ++ + +   GN            F+  NVL   N  
Sbjct: 333 EGAFYVWSYSDLEQLLTSEELTQLQQEFTVSSQGN------------FESLNVLQRRN-- 378

Query: 373 SASASKLGMPLEKYLNILGECRR-------KLFDV--RSKRPRPH---------LDDKVI 414
                +L   +E+ L  L   R        K+F     ++  + H          D K+I
Sbjct: 379 ---VGQLSAEIERILAKLFTARYGDKAESLKIFPPARNNQEAKTHNWPGRIPSVTDTKMI 435

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQT 473
           V+WN L+IS  ARA  +         F  P+       Y+E+A  AA+FI  H + D + 
Sbjct: 436 VAWNSLMISGLARAGGV---------FQEPL-------YLELAAQAANFILEHQFVDGRF 479

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDRE 532
           HRL +    G +      +DYAF I  LLDL        +WL  AI +Q   DE     E
Sbjct: 480 HRLNY---QGEATVLAQSEDYAFFIKALLDLQACSPDDQQWLENAIAIQAEFDEFLWSVE 536

Query: 533 GGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
            GGYFNT+ +    +++R +   D A PS N V++ NLVRL+ +   + + +Y   AE  
Sbjct: 537 LGGYFNTSSDASQDLIIRERSYTDNATPSANGVAIANLVRLSLL---TDNLHYLDLAEQG 593

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F + +     A P +  A D
Sbjct: 594 LKAFRSVMSSHPQACPSLFTALD 616


>gi|220935906|ref|YP_002514805.1| hypothetical protein Tgr7_2744 [Thioalkalivibrio sulfidophilus
           HL-EbGr7]
 gi|219997216|gb|ACL73818.1| conserved hypothetical protein [Thioalkalivibrio sulfidophilus
           HL-EbGr7]
          Length = 676

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 228/686 (33%), Positives = 339/686 (49%), Gaps = 70/686 (10%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT-YVQALYGGGGWPL 78
           + CHWCHVM  ESFED   A+++N  +V+IKVDREERPD+DK+Y T +       GGWPL
Sbjct: 52  SACHWCHVMAHESFEDPATAQVMNRLYVNIKVDREERPDLDKIYQTAHFMLSQRSGGWPL 111

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           ++FL+PD  P  GGTYFP   ++G P F+ +L ++   + ++RD + +  A     L  A
Sbjct: 112 TMFLTPDQVPFFGGTYFPDAPRHGLPAFRDLLERIAGFYHERRDEIERQNA----SLQGA 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L+   S     D L    L      +++ +D R GGFG+ PKFP P  ++ +L H  +  
Sbjct: 168 LTGLFSPRGH-DPLNSAVLDTVRSAIAQQFDERDGGFGTPPKFPHPSTLERLLRHHAQTH 226

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D          + M  FTL+ MA+GG++D + GGF RYS D +W +PHFEKMLYD G L 
Sbjct: 227 D-------ERARYMACFTLEKMARGGLNDQLAGGFCRYSTDGQWMIPHFEKMLYDNGPLL 279

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            +Y  A++ T D +++ +      +  + M  P G  +SA DADS   EG    +EG +Y
Sbjct: 280 ALYAQAYAATGDAYFADVAGRTAAWAVQTMQSPEGGFYSALDADS---EG----EEGRYY 332

Query: 319 VWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VW  +EV  ++ E    +F   Y L    N            F+G+  L         A 
Sbjct: 333 VWQPEEVRKLVPEEVYPVFARVYGLDRGPN------------FEGRWHLHSFVTPEQLAK 380

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           + G        ++   R  L   R KR  P LDDK++ SWN L+I   A A++ L     
Sbjct: 381 ESGTDEATIEAMIEAARAPLLAARDKRVPPGLDDKILTSWNALMIRGLAVAARHLG---- 436

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
                       R E+++ A  A  FIR  L+  +  RL  +++NG ++   +LDD+A+L
Sbjct: 437 ------------RSEWVDAASRALDFIRAQLW--RDGRLLATYKNGSARLSAYLDDHAYL 482

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +  LL+L +    T+ LV+A E+       F D E GG+F T  +  +++ R K   D A
Sbjct: 483 LDALLELLQVRWRTEDLVFAREIAEILLAHFEDSEHGGFFFTADDHEALIQRPKTFADEA 542

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAADML 616
            PSGN V+ + L RL  ++   +   Y + AE ++ +  T +    MA   L+    + L
Sbjct: 543 MPSGNGVAALALNRLGHLLGEPR---YVEAAERTVRLATTLMDQAPMAHASLISAFEEQL 599

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
            +P  K V+L G    +  E   A     Y   + V  I PAD  ++    E  +  A  
Sbjct: 600 YLP--KLVILRGEAQRI--ETWRAELERDYAPRRLVFAI-PADASDL---PEALATKAPK 651

Query: 677 ARNNFSADKVVALVCQNFSCSPPVTD 702
                   + VA VC    CS PVTD
Sbjct: 652 G-------EAVAYVCTGTRCSAPVTD 670


>gi|443327996|ref|ZP_21056601.1| thioredoxin domain containing protein [Xenococcus sp. PCC 7305]
 gi|442792405|gb|ELS01887.1| thioredoxin domain containing protein [Xenococcus sp. PCC 7305]
          Length = 682

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 213/623 (34%), Positives = 304/623 (48%), Gaps = 79/623 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN+ FV IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDNAIADYLNNNFVPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP   +Y RP F  IL+ V+  +D + + L       +  L  +
Sbjct: 108 IFLTPGDLVPFYGGTYFPVTPRYNRPSFIDILKSVRRFYDVETEKLEGFKTEILFNLQRS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            S   + + L  EL    L      LS     R       P FP      M+ Y +  L+
Sbjct: 168 TSLETTEDALTSELLDQGLETNTAVLSSGDPGR-------PNFP------MIPYATAALQ 214

Query: 199 DTGKS-GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            +  +     +  K+ L   Q +  GGI DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 215 GSRLNFNNRYDADKLCLQRGQDLVLGGICDHVAGGFHRYTVDHTWTVPHFEKMLYDNGQI 274

Query: 258 ANVYLDAFSLTKDVF-YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                + +S  +           I+++L+R+M+ P G  ++++DAD+  T  A   +EG 
Sbjct: 275 LEYLANLWSCQRHFLTIEDAIAGIVNWLKREMLAPQGYFYASQDADNFATAEAAEPEEGL 334

Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FYVW+  E+E++L  E     +  + + P GN            F+G NVL   N    S
Sbjct: 335 FYVWSYNELENLLSAEELAELQAEFSITPQGN------------FEGSNVLQRFNHEELS 382

Query: 376 ASKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHLDDKVIVSW 417
            S     LE+ L  L   R              +   + ++K    R  P  D K+I +W
Sbjct: 383 PS-----LEQTLQKLFAARYGEKQTGIDTFPVAKNNREAKTKPWPGRIPPVTDTKMITAW 437

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRL 476
           N L+IS  ARA+ +L                    Y ++AE+ A+FI +  + E + HRL
Sbjct: 438 NSLIISGLARAASVLGI----------------TNYQQLAENTANFILQQQWLEGRLHRL 481

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGG 535
            +   +G +      +DYA  I  LLDL++      +WL  AI LQ   D LF    GGG
Sbjct: 482 NY---DGQATVLAQSEDYALFIKALLDLHQSSPQNPQWLDSAIALQAEFDRLFWSEMGGG 538

Query: 536 YFNTTGED--PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           Y+N  G D   ++L+R +   D A P+ N V++ NLVRL  +    +   YR  AE  L 
Sbjct: 539 YYN-NGSDVGDNLLIRERSYMDNATPAANGVAMANLVRLFLLTDNLE---YRDRAEQGLQ 594

Query: 594 VFETRLKDMAMAVPLMCCAADML 616
            F   +K    A P +  A D L
Sbjct: 595 AFAGIMKSSPQACPSLFVALDWL 617


>gi|378728836|gb|EHY55295.1| hypothetical protein HMPREF1120_03437 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 842

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 215/630 (34%), Positives = 295/630 (46%), Gaps = 106/630 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           N CHWCHVME ESF    VA  LN  F+ IKVDRE RPD+D +YM YV A  G GGWPL+
Sbjct: 57  NACHWCHVMERESFSSPEVASFLNKHFIPIKVDRECRPDLDDIYMNYVTATTGSGGWPLN 116

Query: 80  VFLSPDLKPLMGGTYFPPEDKY-----------GRPGFKTILRKVKDAWDKKR------- 121
           VFL+PDL+P+ GGTY+P                  P F  ILRK+++ W  +R       
Sbjct: 117 VFLTPDLRPVFGGTYWPGPSSTTNLHRKASHDEAAPSFLDILRKMQEVWSTQRERCRRSS 176

Query: 122 -DMLAQSGAFAIEQLSEALSAS-----ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
            D+  Q  AFA E +    + S     +S ++ P+ L  + L          YDS  GGF
Sbjct: 177 TDITTQLRAFAAEGIHSQSNGSVRDGGSSGSEEPEPLELDLLDDALNHFIARYDSTNGGF 236

Query: 176 GSAP---KFPRPVEIQMMLYHSKKLEDT-------------GKSGEAS--EGQKMVLFTL 217
            ++    KFP P  +  +L     +                G  GE S  +   M L TL
Sbjct: 237 SASTNGQKFPTPSNLAFLLRIGAAIAQPSTHTRFGFFSPVLGILGEDSCLKAASMALHTL 296

Query: 218 QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYIC 277
           + M++ G+ D +G GFHRYSV   W++PHFEKM+ D  QL   Y DA++L +D       
Sbjct: 297 KAMSRSGLRDQLGYGFHRYSVTPDWNLPHFEKMMCDNAQLLGCYCDAWALGRDPEILGTI 356

Query: 278 RDILDYL---RRDMIGPGGEIFSAEDADS--------AETEGA-TRKKEGAFYVWTSKEV 325
            ++++Y       ++ PGG  +++EDADS          TE A   KKEGAFYVWT KE+
Sbjct: 357 YNLVEYFTNPESPIVRPGGGWYASEDADSRPSRTGNGGGTETAHNEKKEGAFYVWTYKEL 416

Query: 326 EDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
           E +LGE  A +   H+ +KP GN  +    D H+EF  +NVL      S  A + G+  +
Sbjct: 417 ESLLGEQDAPIIARHFGVKPHGN--VPAQHDIHDEFLSQNVLHVDATPSTLAKEFGIAED 474

Query: 385 KYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
           + + I+   R KL + R ++R  P +D  VI SWNGL I+S  RA+  L +         
Sbjct: 475 EVVRIIKRGRTKLLEHRKAEREPPQVDTNVIASWNGLAIASLTRAANTLAT--------- 525

Query: 444 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG-------------- 489
            V         E AE AA+F+   +YD  T RL           P               
Sbjct: 526 -VDKHRAARCQEAAERAATFVHCAMYDPTTGRLARIANATDKSRPRSRSKSASHASNNDN 584

Query: 490 -------------FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
                        F+DDYA++    L LY+      +L WA++LQ   D  F D   G  
Sbjct: 585 DNSNGGGGGSNIVFVDDYAYMTQAALMLYDLTLSQPYLDWAVQLQEYLDTHFADVTEGSS 644

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 566
            +  G D            GA  +G S+S 
Sbjct: 645 TSGAGTD-----------KGASANGASIST 663


>gi|328541699|ref|YP_004301808.1| Thioredoxin domain protein [Polymorphum gilvum SL003B-26A1]
 gi|326411451|gb|ADZ68514.1| Thioredoxin domain protein [Polymorphum gilvum SL003B-26A1]
          Length = 670

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 227/701 (32%), Positives = 331/701 (47%), Gaps = 95/701 (13%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFED   A+++N  FV+IKVDREERPD+D++YM  + AL   GGWPL++
Sbjct: 50  ACHWCHVMAHESFEDPATAEVMNRLFVNIKVDREERPDIDQIYMNALHALGEQGGWPLTM 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD +P  GGTYFP E ++GRP F  IL  V   +  +R  + ++    ++ L +   
Sbjct: 110 FLTPDGEPFWGGTYFPKEARWGRPAFVDILEAVAATYRSERSRIDRNRTGLMQVLKQRAQ 169

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            +A        L    L L  ++L   +D   GG   APKFP+   + ++     +   T
Sbjct: 170 PAAP-------LDSAILVLAGDRLLSLFDPEHGGIRGAPKFPQASILDLVWRAGLR---T 219

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G        ++  L TL+ ++ GGI+DH+ GG  RYSVDERW VPHFEKMLYD  Q    
Sbjct: 220 GNPA----ARETFLHTLRQISNGGIYDHLKGGIARYSVDERWLVPHFEKMLYDNAQYLQH 275

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
            L A+  T +  +     + + +L  +M  P G   S+ DADS   EG    +EG FYVW
Sbjct: 276 LLTAWLATGEDLFRCRIDETVGWLLDEMRLPEGGFASSLDADS---EG----EEGRFYVW 328

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T+ EV ++LG  A  F   Y +   GN            ++G  +L  L  ++AS     
Sbjct: 329 TAAEVAEVLGADAAFFARFYDISAAGN------------WEGVTILNRLTGTAAS----- 371

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
            P E+  N L   R KL   R+ R RP LDDKV+  WNGL+I++ ARA +I+        
Sbjct: 372 -PEEE--NRLAALRAKLLSRRASRVRPALDDKVLADWNGLLIAALARAGRIVS------- 421

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                    R+ ++  AE A  FI   +      RL H++R G    PGF  D+A ++  
Sbjct: 422 ---------RESWIAAAEQAFRFIAESM--TGGGRLGHAWRAGRLVFPGFASDHAAMMQA 470

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLD-------REGGGYFNTTGEDPSVLLRVKED 553
            + L E         W  +      E F D         GGG++ T  +   ++LR    
Sbjct: 471 AIALAEARP------WDAQHYLRIAEGFADALVRHYAAPGGGFYMTADDATDLILRPLSS 524

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D A P+ NSV+     RL  +    +   +R  A+     F   +     A   + CA 
Sbjct: 525 ADEAVPNANSVAADAFARLYLLTGDRR---HRDVADAVFHAFAGDVPKNLFATASLLCAF 581

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA----DTEEMDFWEEH 669
           D   +  R  VV+  + S  D  N++ +      L++ V   DPA     TE  D   + 
Sbjct: 582 DT-RINGRLAVVVAPNGS--DPSNLVDS------LDRAV---DPALTRLVTESTDGLPKD 629

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
           +  +   A +     +  A VC+  +CS P      L+  L
Sbjct: 630 HPAHGKPALDG----RPAAYVCREGACSLPAATTTELQRTL 666


>gi|291451582|ref|ZP_06590972.1| conserved hypothetical protein [Streptomyces albus J1074]
 gi|291354531|gb|EFE81433.1| conserved hypothetical protein [Streptomyces albus J1074]
          Length = 675

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 234/705 (33%), Positives = 335/705 (47%), Gaps = 93/705 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE  A ++N  FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SACHWCHVMAHESFEDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P   GTYFPPE ++G PGF+ +L  V+ AW ++R  + +     +  L E  
Sbjct: 108 VFLTPEGEPFYFGTYFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERR 167

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            A     +LP  +E  Q  L      L++ YD   GGFG APKFP  + ++ +L H  + 
Sbjct: 168 LALGEP-RLPGAEEAAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR- 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 221 --TGAEG----ALQMAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALL 274

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY+  +  T       +  +  +++ RD+  P G   SA DADSA+  G  R  EGA+
Sbjct: 275 CRVYVHLWRATGSEQARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAY 332

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVWT  ++ ++LGE    +   H+ +   G             F+    ++ L     + 
Sbjct: 333 YVWTPAQLVEVLGEEDGRIAAAHFGVTEEGT------------FEEGASVLRLPQEDGAV 380

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G         +   R +L++ R +RP P  DDKV+ +WNGL I++ A A        
Sbjct: 381 QDAGR--------IASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF---- 428

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
                       +R + ++ A +AA   +R HL D    RL  + R+G  S   G L+DY
Sbjct: 429 ------------ERPDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDY 474

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +  G L L        WL +A  L +   + F D E G  ++T  +   ++ R ++  
Sbjct: 475 ADVAEGFLALASVTGEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPT 533

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
           D A PSG + +      L    A + S+ +R  AE +L V    +  +   VP      +
Sbjct: 534 DNATPSGWTAAAGA---LLGYAAQTGSEPHRTAAERALGV----VAALGPKVPRFIGNGL 586

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFW 666
                +L  P  + V +VG       +   A  H +  L+     V+   PAD E     
Sbjct: 587 AVTEALLDGP--REVAVVGDPD----DPRTAVLHRTALLSTAPGAVVAAGPADGE----- 635

Query: 667 EEHNSNNASMARNNFSADKV-VALVCQNFSCSPPVTDPISLENLL 710
                    +      AD    A VC+ F C  P TDP  L   L
Sbjct: 636 -------LPLLAGRVPADGAPTAYVCRGFVCDAPTTDPALLAAQL 673


>gi|422304439|ref|ZP_16391784.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9806]
 gi|389790409|emb|CCI13705.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9806]
          Length = 692

 Score =  308 bits (790), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 214/623 (34%), Positives = 309/623 (49%), Gaps = 80/623 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
           L  SA   +    L   +L     + + +           P FP      + L  S+   
Sbjct: 164 LRQSAILPRAETNLAAPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYANLALQGSRFGD 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             ED+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 224 DFEDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +E
Sbjct: 276 QIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDREPEE 335

Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+  E+ D L    + L + ++ +   GN            F+G+NVL       
Sbjct: 336 GAFYVWSHLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378

Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
               KLG  +E  L+ L     G  + +L      R                  D K+IV
Sbjct: 379 RQGGKLGKDIENMLDKLFIRRYGSSQSQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
           +WN L+IS  ARA          A+F  P+       Y ++A  AA FI +H + D +  
Sbjct: 439 AWNSLMISGLARA---------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQ 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
           RL +    G +      +D+A+ I  LLDL       T WL  AIELQ   D  F   + 
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTANPQETGWLEAAIELQGEFDRWFWAEDE 539

Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           GGYFN T  D S+ L V+E    D A PS N +++ NL+RL+ +    +   Y   AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKA 595

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F T L+    A P +  A D
Sbjct: 596 LQSFSTILEQSPTACPSLFVALD 618


>gi|218246233|ref|YP_002371604.1| hypothetical protein PCC8801_1388 [Cyanothece sp. PCC 8801]
 gi|218166711|gb|ACK65448.1| protein of unknown function DUF255 [Cyanothece sp. PCC 8801]
          Length = 688

 Score =  308 bits (790), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 220/618 (35%), Positives = 309/618 (50%), Gaps = 70/618 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A  LND F+ IK+DREERPD+D +YM  VQ +   GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIAAYLNDNFLPIKLDREERPDLDSLYMQAVQMMGIQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP E +YGRPGF  +L+ ++  +D ++D L    +F      E 
Sbjct: 108 IFLTPDDLVPFYGGTYFPIEPRYGRPGFLQVLQSIRRFYDTEKDKL---NSFK----HEI 160

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPK-FPRPVEIQMMLYHSKKL 197
           L     S  LP     NA  L  E   +   +        P+ F RP    M+ Y +  L
Sbjct: 161 LDTLQKSAILP---VTNAELLNNELFYRGITANTEVIIVNPQDFNRPC-FPMIPYANLAL 216

Query: 198 EDTGKSGEASEGQKMVLFTL-QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
           + +  + ++ E Q  V +   + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ
Sbjct: 217 QGSRFAFQSQENQATVTYQRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQ 276

Query: 257 LANVYLDAFSL--TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           +     + +S    +  F   I R + ++L+R+M  P G  ++A+DAD+  T      +E
Sbjct: 277 IVEYLANLWSQGHQEPAFKRAIARTV-EWLQREMTAPQGYFYAAQDADNFTTPDEKEPEE 335

Query: 315 GAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW  +E+ED L  E   L +  + L   GN            F+G NVL       
Sbjct: 336 GAFYVWKYQELEDCLTSEELKLLEATFSLTAEGN------------FEGSNVLQRRMGGE 383

Query: 374 ASASKLGMPLEKYLNI-LGECRRKLF-------------DVRSKRPRPHLDDKVIVSWNG 419
            S + L + L+K   I  G  R+ L                   R  P  D K+IV+WN 
Sbjct: 384 FSEA-LEVILDKLFMIRYGSSRKTLTTFPPAKNNQEAKNQTWPGRIPPVTDTKMIVAWNS 442

Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQH 478
           L+IS  ARA  +         F  P+       Y E+A +A  FI +  + + + +RL +
Sbjct: 443 LMISGLARAYGV---------FGDPL-------YWELAINATEFILQEQWVNNRLYRLNY 486

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYF 537
               G        +DYAF I  LLDL +     + WL  A E+Q   DE F   EGGGY+
Sbjct: 487 E---GQPSVLAQAEDYAFFIKALLDLQKANPWERQWLEKAKEVQEEFDEFFWSIEGGGYY 543

Query: 538 NTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           N   ++   +L+R +   D A PS N V++ NLVRL+ +        Y   AE  L  F 
Sbjct: 544 NNASDNSGDLLIRERSYIDNATPSANGVALSNLVRLSRLTDDLD---YLHRAEQGLQTFS 600

Query: 597 TRLKDMAMAVPLMCCAAD 614
           + L     A P +  A D
Sbjct: 601 SVLSQSPKACPSLFVALD 618


>gi|428224685|ref|YP_007108782.1| hypothetical protein GEI7407_1235 [Geitlerinema sp. PCC 7407]
 gi|427984586|gb|AFY65730.1| hypothetical protein GEI7407_1235 [Geitlerinema sp. PCC 7407]
          Length = 682

 Score =  308 bits (789), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 229/715 (32%), Positives = 338/715 (47%), Gaps = 116/715 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F +  +A  +ND+FV IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSNGAIAAYMNDFFVPIKVDREERPDLDSIYMQSLQLMVGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+P DL P  GGTYFP + +YGRPGF  +L+ ++  +D ++D ++      +E L EA
Sbjct: 108 VFLAPDDLVPFYGGTYFPVDPRYGRPGFLQVLQAIRRHFDTEKDKVSAVKQEILEHLQEA 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG---GFGSAPKFPRPVEIQMMLYHSK 195
            S            P     L  + L+KS +   G     G  P FP      M+ Y   
Sbjct: 168 GSLE----------PGQGSDLTHDLLAKSLEYSTGILSARGPGPSFP------MIPYGEA 211

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
               T  S E  +   +     + +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD G
Sbjct: 212 AQRATRLSLERYDAGTICQQRGEHLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNG 271

Query: 256 Q----LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           Q    LAN +  A  +T+  F   I   +  +L+R+M    G  ++A+DAD+  +  A  
Sbjct: 272 QILEYLANEW--ARGVTEPAFERAIAGTV-TWLKREMTDAQGYFYAAQDADNFTSPEALE 328

Query: 312 KKEGAFYVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
            +EG FYVW   E+  +L   E A L +E + + P+GN            F+G+NVL   
Sbjct: 329 PEEGDFYVWRYDELAALLTPAELAAL-QEEFTVTPSGN------------FEGRNVLQRS 375

Query: 370 NDSSAS-----------ASKLGMPLEKYLNILGECRRKLFDVRS--KRPRPHLDDKVIVS 416
            + S S           A + G P             ++   ++   R  P  D K+I +
Sbjct: 376 REGSLSEVAEAALAKLFAVRYGAPPVAVPTFPPAPSAQVAKTQTWPGRIPPVTDTKMIAA 435

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHR 475
           WN L+IS  ARA+ + +                R+EY ++A  AA F+  H + E + HR
Sbjct: 436 WNSLMISGLARAAAVWQ----------------REEYYQLAAGAARFLLAHQWVEGRFHR 479

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGG 534
           L +   +G +      +DYA  I  L+DL +   G + W+  A+++Q   D L    EGG
Sbjct: 480 LNY---DGEASVLAQSEDYALFIKALIDLDQARPGAEDWIEQAVKVQREFDALLGAEEGG 536

Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
            Y         +++R +   D A P+ NS+++ NLVRLA +   ++   Y   AE +L  
Sbjct: 537 YYNAARDRSQDLVIRERSYADNATPAPNSIAIANLVRLALL---TEDLSYLDRAEKALQS 593

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           F   +     A P M  A D+     R H+++   +++ D    LAA +    + K    
Sbjct: 594 FSAPMARSPQACPSMFGALDLY----RNHLLI---RATPDVLQTLAARYCPTAVYKVADE 646

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
           +                            +  V LVCQ  SC  P     SLE L
Sbjct: 647 L---------------------------PEGAVGLVCQGLSCQEPAR---SLEQL 671


>gi|411116326|ref|ZP_11388814.1| thioredoxin domain-containing protein [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410713817|gb|EKQ71317.1| thioredoxin domain-containing protein [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 698

 Score =  308 bits (789), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 235/721 (32%), Positives = 344/721 (47%), Gaps = 122/721 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +AK +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 60  SSCHWCTVMEGEAFSDQEIAKFMNTNFLPIKVDREERPDLDSIYMQALQMMTGQGGWPLN 119

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP E +YGRP F  +L  V+  +D+++  L    A       E 
Sbjct: 120 IFLTPDDLVPFYGGTYFPVEPRYGRPSFLQVLEGVRRFYDQEKTKLQSVKA-------EI 172

Query: 139 LSASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           LS   SS  LP  + LP++      E  +    S+  G    P FP       M+ ++  
Sbjct: 173 LSNLQSSTLLPAVEALPRDVFLHGLEYNTGVISSKSVG----PSFP-------MIPYADV 221

Query: 197 LEDTGKSGEASEGQKMVLFTLQC--MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            +   +    S    + + T +   +A GGI DHVGGGFHRY+VD  W VPHFEKMLYD 
Sbjct: 222 AQRAMRFLAKSRYNALEVSTQRGIDLALGGIFDHVGGGFHRYTVDPTWTVPHFEKMLYDN 281

Query: 255 GQLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
           GQ+     + +S  + +  F   I   + ++L+R+M  P G  ++A+DADS  +  AT  
Sbjct: 282 GQIMEYLANQWSADVQEPAFKRAIALTV-EWLQREMTAPEGYFYAAQDADSFTSPDATEP 340

Query: 313 KEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           +EGAFYVW   E+  +L E  +   +    +   GN            F+G NVL +   
Sbjct: 341 EEGAFYVWGYDELTTLLTEKELREMQTQLTITEKGN------------FEGVNVL-QRRH 387

Query: 372 SSASASKLGMPLEKYLNI---LGECRRKLF-DVRSKRPR----------PHLDDKVIVSW 417
           S   +  +   L+K   I   +G  R K F   R+ R            P  D K+IV+W
Sbjct: 388 SGQLSEAIETALDKLFQIRYGIGTDRIKPFPPARNNREAQEMPWAGRIPPVTDTKMIVAW 447

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRL 476
           N L+IS  ARA+ + ++ +                ++E+A +A  FI  R   + + HR+
Sbjct: 448 NSLMISGLARAAAVFQNCS----------------WLELAVNATQFILERQWVENRLHRV 491

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYE-------FGSGTKWLVWAIELQNTQDELFL 529
            +   NG        +DYA  I  LLDL++         + + +L  A+ +Q   DE   
Sbjct: 492 NY---NGQPSVLAQSEDYALFIKALLDLHQAYQSLDSVAALSSFLDAAVRVQAELDEFLW 548

Query: 530 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
             E GGYFN T   P +L+R +   D A P+ N V+V NLVRLA +   ++   Y   AE
Sbjct: 549 SVELGGYFN-TDRTPDLLVRERSYMDNATPAANGVAVANLVRLALL---TEDLSYLDRAE 604

Query: 590 HSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN 649
            +L  F + ++    A P +    D        H  LV  +++ D   +LAA +    + 
Sbjct: 605 QTLKAFGSVMERSPQACPSLFVGMDWF-----LHQTLV--RATPDAIALLAAQYQPTVMY 657

Query: 650 KTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
           KT + + PA                            V LVCQ  SC  P     S+E L
Sbjct: 658 KTEVDL-PAGA--------------------------VGLVCQGLSCKEPAR---SMEQL 687

Query: 710 L 710
           L
Sbjct: 688 L 688


>gi|358396472|gb|EHK45853.1| hypothetical protein TRIATDRAFT_241655 [Trichoderma atroviride IMI
           206040]
          Length = 726

 Score =  308 bits (789), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 202/638 (31%), Positives = 319/638 (50%), Gaps = 71/638 (11%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+  +M +ESF +   A +LN  F+ I VDRE RPD+D +YM YVQA+   GG
Sbjct: 67  HIGYRACHFSRLMALESFMNPDCAAVLNHSFIPIIVDREVRPDIDTIYMNYVQAVSNSGG 126

Query: 76  WPLSVFLSPDLKPLMGGTYFP--------PEDKYGRP-GFKTILRKVKDAWDKKR----- 121
           WPL++FL+P+L+P+ GGTY+P         ED    P  F  I++KV++ W  ++     
Sbjct: 127 WPLNLFLTPELEPVFGGTYWPGPSVARRAAEDHGDEPLDFLVIVKKVRNIWKDQQARCRK 186

Query: 122 ---DMLAQSGAFAIE--------------QLSEALSASASSNK----------LPDELPQ 154
              +++ Q   FA E              Q++ A  A+  SN+          +  EL  
Sbjct: 187 EATEVIGQLREFAAEGTLGKRSIAAPQQQQIAPAGWAAPVSNQPVAKVSDSTDVSSELDI 246

Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEASEGQK 211
           + L      ++ ++D  +GGFG APKF  P ++  +L        ++D     E      
Sbjct: 247 DQLEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLAFLLNLVNFPAPVQDVVGEAECKHALD 306

Query: 212 MVLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT-- 268
           M L TL+ +  G +HDH+G  GF R SV   W +P+FEK++ D  +L  +YL+A+  +  
Sbjct: 307 MALDTLRKIRDGALHDHIGATGFARCSVTPDWSIPNFEKLVVDNAELLQLYLEAWRKSGA 366

Query: 269 -KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
            +D  +  +  ++ DYL    I  P G   S+E ADS    G   K+EGA+Y+WT +E  
Sbjct: 367 REDSEFYNVVIELADYLTSPPIALPDGGFASSEAADSYAKRGDAEKREGAYYLWTRREFA 426

Query: 327 DILG---EHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 382
            ++    +H     E Y+ ++  GN D     DP+++F  +N+L         + +  +P
Sbjct: 427 SVVNADDKHISAIAEAYWDVQEDGNVDEDH--DPNDDFINQNILRIRKTPEELSKQFNVP 484

Query: 383 LEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
           +      +   R  L   R K RP P +DDK++  WNGLV+S+  R +  LK        
Sbjct: 485 VATVKRDIETAREALKKRREKERPHPDVDDKIVAGWNGLVVSALIRTAAFLKE------- 537

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
              +     ++Y+  A+ + SFI+  L+DE+   L   + +G     GF DDYA+L  GL
Sbjct: 538 ---LQPERSRKYLGAAKKSISFIKEKLWDEKNKILYRIWSDG-RHTEGFADDYAYLTHGL 593

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           LDL++      +L +A  LQ +Q+  F D   G +++TT   P  +LR+K+  D + PS 
Sbjct: 594 LDLFDATGDESYLEFADNLQKSQNAFFYD-SAGAFYSTTPSSPHTILRLKDGMDTSLPST 652

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           N VSV NL RL  ++A  K   +   A  ++  FE  +
Sbjct: 653 NGVSVSNLFRLGELLADEK---FTGLARETINAFEAEM 687


>gi|427728058|ref|YP_007074295.1| hypothetical protein Nos7524_0793 [Nostoc sp. PCC 7524]
 gi|427363977|gb|AFY46698.1| highly conserved protein containing a thioredoxin domain [Nostoc
           sp. PCC 7524]
          Length = 688

 Score =  308 bits (789), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 227/713 (31%), Positives = 341/713 (47%), Gaps = 124/713 (17%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQALAEYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
           VFL+P DL P   GTYFP E +Y RPGF  +L+ ++  +D +++ L Q  A  +E L  S
Sbjct: 108 VFLTPEDLVPFYAGTYFPLEPRYNRPGFLQVLQALRRYYDTEKEELRQRKAVILESLLTS 167

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG-----GFGSAPKFPRPVEIQMML 191
             L   A+      EL           L + +++  G      +G++  FP      M+ 
Sbjct: 168 AVLQGDATQEAEAQEL-----------LGRGWETSTGIITPNQYGNS--FP------MIP 208

Query: 192 YHSKKLEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           Y    L  T  +  +  + Q++       +A GGI+DHV GGFHRY+VD  W VPHFEKM
Sbjct: 209 YAELALRGTRFNFPSRYDAQQVCTQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKM 268

Query: 251 LYDQGQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
           LYD GQ+     + +S   ++  ++      +++L+R+M  P G  ++A+DADS      
Sbjct: 269 LYDNGQIVEFLANLWSAGIQEPAFTRAVAGTIEWLQREMTAPEGYFYAAQDADSFTNPAE 328

Query: 310 TRKKEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
           T  +EGAFYVW+  E+ ++L    +   ++ + + P GN            F+GKNVL  
Sbjct: 329 TEPEEGAFYVWSYTELAELLSPTELAELQQQFTVTPNGN------------FEGKNVLQR 376

Query: 369 LNDSSASASKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHLD 410
            N       +L + LE  L+ L   R              R   + ++     R     D
Sbjct: 377 RN-----PGQLSITLETALDKLFTARYGAAPDALETFPPARDNQEAKTSNWPGRIPSVTD 431

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH-LY 469
            K+IV+WN L+IS  ARA         +A+F  P+ G       ++A  AA FI +H L 
Sbjct: 432 TKMIVAWNSLMISGLARA---------AAVFQEPIYG-------DIAARAAKFILQHQLV 475

Query: 470 DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELF 528
           + + HRL +    G        +DYAF I  LLDL       + WL  AI LQ   +E  
Sbjct: 476 NGRFHRLNY---QGQPTVLAQSEDYAFFIKALLDLQACSPEQRFWLENAIALQTEFNEFL 532

Query: 529 LDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
              E GGYFNT  +    +++R +   D A PS N V++ NLVRL  +   +   +Y   
Sbjct: 533 WSVELGGYFNTASDASQELIVRERSYADNATPSANGVAIANLVRLTLL---TDDLHYLDL 589

Query: 588 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 647
           AE  L  F + ++    A P +  A D       ++  L+  +S+ +  N+L   +    
Sbjct: 590 AEQGLKAFNSVMQQAPQACPSLFTALDWY-----RNCTLI--RSTTEQINVLIPKY---- 638

Query: 648 LNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
           L   V+++                       +N   D  V LVCQ   C P V
Sbjct: 639 LPNVVLNV----------------------VSNLPTDS-VGLVCQGLKCLPSV 668


>gi|17228732|ref|NP_485280.1| hypothetical protein all1237 [Nostoc sp. PCC 7120]
 gi|17130584|dbj|BAB73194.1| all1237 [Nostoc sp. PCC 7120]
          Length = 685

 Score =  308 bits (788), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 216/650 (33%), Positives = 312/650 (48%), Gaps = 87/650 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIADYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
           VFLSP DL P   GTYFP E KY RPGF  IL  ++  +D +++ L Q  A  +E L  S
Sbjct: 108 VFLSPEDLVPFYAGTYFPIEPKYNRPGFLQILEALRRYYDTEKEDLRQRKALIVESLLTS 167

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             L   A+      EL +         +++   + +G       FP      M+ Y    
Sbjct: 168 AVLKGEATQEAEESELLKRGWETNTSVITR---NEYGN-----SFP------MIPYAELA 213

Query: 197 LEDTGKS-GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           L  T  +     +GQ++       +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD G
Sbjct: 214 LRGTRFNFASRYDGQQVSTQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNG 273

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   K+  ++      + +L+R+M  P G  ++A+DADS  T      +E
Sbjct: 274 QIVEYLANLWSAGVKEPAFARAVTGTVVWLQREMTAPAGYFYAAQDADSFTTPTDVEPEE 333

Query: 315 GAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+  E+E ++    +   ++ + + P GN            F+GKNVL       
Sbjct: 334 GAFYVWSYAELEQLVTPTELTELQQQFTVSPQGN------------FEGKNVL-----QR 376

Query: 374 ASASKLGMPLEKYLNILGECRR-KLFDVRSKRPRPH-----------------LDDKVIV 415
               +LG  +E  L  L   R     D     P                     D K+IV
Sbjct: 377 RQPGELGATIETALGKLFAARYGSAADTLETFPPAQDNQEAKTTHWPGRIPSVTDTKMIV 436

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTH 474
           +WN L+IS  ARA+ +         F  P+ G       E+A  AA+FI      D + H
Sbjct: 437 AWNSLMISGLARAAGV---------FQQPLAG-------ELAAKAANFILENQFVDGRFH 480

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREG 533
           RL +    G +      +DYA  I  LLDL+      + WL  AI LQ+  DE     E 
Sbjct: 481 RLNY---RGEAAVLAQSEDYALFIKALLDLHTAEPENRFWLEKAIALQHQFDEFLWSIEL 537

Query: 534 GGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
           GGYFNT  +    +++R +   D A PS N V++ NLVRL+ +   +   +Y   AE  L
Sbjct: 538 GGYFNTASDASQDLIIRERSYMDNATPSANGVAIANLVRLSLL---TDDLHYLDLAEQGL 594

Query: 593 AVFETRLKDMAMAVPLMCCAAD-------MLSVPSRKHVVLVGHKSSVDF 635
             F++ +     A P +  A D       + S   + H ++  +  +V F
Sbjct: 595 KAFKSVMSSAPQACPSLFTALDWYRNSTLIRSTNEQIHTLIPSYLPTVAF 644


>gi|343087024|ref|YP_004776319.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342355558|gb|AEL28088.1| protein of unknown function DUF255 [Cyclobacterium marinum DSM 745]
          Length = 682

 Score =  308 bits (788), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 215/618 (34%), Positives = 297/618 (48%), Gaps = 61/618 (9%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVME ESFE + VAKL+N  F+ IK+DREERPD+D +YM  VQ +   GGWPL+
Sbjct: 56  SACHWCHVMEGESFEAKDVAKLMNAHFICIKIDREERPDLDNIYMEAVQVMGLQGGWPLN 115

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL P+ KP  GGTYF  E       +  +L  V  A+ ++ D L +S     + +  ++
Sbjct: 116 VFLLPNQKPFYGGTYFSKEQ------WIQVLSGVAQAFSQQYDDLVKSAEGFGQSIERSV 169

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
                  K   +     +R  A+ L    D  +GG    PKFP PV I   L     L+D
Sbjct: 170 IEKYGLKKGKSKFFPETIRQIAKDLIGKIDPVWGGMKRVPKFPMPV-IWSFLLDMAILDD 228

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
               GE       V FTL+ MA GGI+DH+GGGF RYSVD  W  PHFEKMLYD GQL +
Sbjct: 229 HEDLGEK------VCFTLEKMAMGGIYDHLGGGFCRYSVDGEWFAPHFEKMLYDNGQLLS 282

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           +Y  A+  + +  +     + + +L  DM GP    +SA DADS         +EG FY 
Sbjct: 283 LYSKAYQYSANALFREKITETISWLLNDMCGPEMGFYSALDADS-------DGEEGRFYT 335

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           WT  E++D+LG+    F + Y +K  GN +            GKN+L +           
Sbjct: 336 WTFSELKDLLGDDLNWFCQLYGIKEQGNWE-----------AGKNILYQTLPYVEVGENF 384

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G   E  L+ L E + KL + R  R RP LDDK+I  WNG VI     A   L  E    
Sbjct: 385 GFTQEALLSKLREVKLKLKEKRESRTRPGLDDKIISGWNGWVIKGLCDAYLALGEE---- 440

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                       E    A    +FI  H+  E  + L  S++ G +  P FL+DYA +I 
Sbjct: 441 ------------EIRNTAVRTGNFIWHHMVIE--NELYRSYKGGQAYTPAFLEDYAAVIQ 486

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             + LY+    + WL  A  L       F D E   ++    +   ++   KE  D   P
Sbjct: 487 SFISLYKISFDSFWLRRAELLAQRVLRNFHDEEDEMFYFNDPKIEKLIANKKELFDNVIP 546

Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP--LMCCAADML- 616
           S NSV   NL +L   +    +D Y   A+  L +    + DM +  P  L   A+  L 
Sbjct: 547 SSNSVMARNLHQLGLYLY---NDTYLAQAKSMLQL----VSDMLIKEPDFLANWASFYLE 599

Query: 617 -SVPSRKHVVLVGHKSSV 633
            SVP+ + +V+ G ++S 
Sbjct: 600 QSVPTAE-IVIAGKEAST 616


>gi|75906768|ref|YP_321064.1| hypothetical protein Ava_0545 [Anabaena variabilis ATCC 29413]
 gi|75700493|gb|ABA20169.1| Protein of unknown function DUF255 [Anabaena variabilis ATCC 29413]
          Length = 711

 Score =  308 bits (788), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 211/617 (34%), Positives = 307/617 (49%), Gaps = 70/617 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 74  SSCHWCTVMEGEAFSDQAIAEYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 133

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
           VFLSP DL P   GTYFP E KY RPGF  +L  ++  +D +++ L Q  A  +E L  S
Sbjct: 134 VFLSPEDLVPFYAGTYFPLEPKYNRPGFLQVLEALRRYYDTEKEDLRQRKALIVESLLTS 193

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             L   A+      EL ++        +++   + +G       FP     ++ L  ++ 
Sbjct: 194 AVLKGEATQEAEESELLRSGWETNTGVITR---NEYGN-----SFPMIPYAELALRGTRF 245

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
              +   GE    Q+ +      +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ
Sbjct: 246 NFASRYEGEQISTQRGL-----DLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQ 300

Query: 257 LANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
           +     + +S   ++  ++      + +L+R+M  P G  ++A+DADS  T   T  +EG
Sbjct: 301 IVEYLANLWSAGVQEPSFARAVTGTVAWLQREMTAPAGYFYAAQDADSFTTPTDTEPEEG 360

Query: 316 AFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
           AFYVW+  E+E +L    +   ++ + + P GN            F+GKNVL   +    
Sbjct: 361 AFYVWSYAELEQLLTPTELTELQQQFTVSPQGN------------FEGKNVLQRRHQWEL 408

Query: 375 SA---SKLGM-----------PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
           SA   + LG             LE +         K      + P    D K+IV+WN L
Sbjct: 409 SATIETALGKLFVARYGSAADTLETFPPAQDNQEAKTTHWPGRIPSV-TDTKMIVAWNSL 467

Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHS 479
           +IS  ARA         +A+F  P+ G       E+A  AA+FI      D + +RL + 
Sbjct: 468 MISGLARA---------AAVFQQPLAG-------ELAAKAANFILENQFVDGRFYRLNY- 510

Query: 480 FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFN 538
              G +      +DYA  I  LLDL+      + WL  AI LQ   DE     E GGYFN
Sbjct: 511 --RGEAAVLAQSEDYALFIKALLDLHAATPENRFWLEKAIALQQQFDEFLWSIELGGYFN 568

Query: 539 TTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
           T  +    +++R +   D A PS N V++ NLVRL+ +   +   +Y   AE  L  F+T
Sbjct: 569 TASDASQDLIIRERSYMDNATPSANGVAIANLVRLSLL---TDDLHYLDLAEAGLKAFKT 625

Query: 598 RLKDMAMAVPLMCCAAD 614
            +     A P +  A D
Sbjct: 626 VMSSAPQACPSLFTALD 642


>gi|312138733|ref|YP_004006069.1| hypothetical protein REQ_12910 [Rhodococcus equi 103S]
 gi|311888072|emb|CBH47384.1| conserved hypothetical protein [Rhodococcus equi 103S]
          Length = 674

 Score =  308 bits (788), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 198/575 (34%), Positives = 288/575 (50%), Gaps = 63/575 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+  A ++N+ FV IKVDREERPD+D VYM    A+ G GGWP++ F
Sbjct: 57  CHWCHVMAHESFEDDATAAVMNEHFVCIKVDREERPDLDAVYMNATVAMTGQGGWPMTCF 116

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P   GTY+P E + G P F  +L  V D W  +R  +  + A  + +L  + S 
Sbjct: 117 LTPDGAPFYCGTYYPREPRGGMPSFVQLLHAVTDTWRSRRGDVDDAAASVVAELRRS-SG 175

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
           +  +   P ++P   L      + +  D   GGFG APKFP  + ++ +L   ++     
Sbjct: 176 ALPAGGAPIDVPL--LSGAVANVLRDEDRDHGGFGGAPKFPPSMLLEGLLRSYERT---- 229

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               A    + V  T + MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD   L   Y
Sbjct: 230 ---SAGPTLRAVERTAEAMARGGIYDQLGGGFARYSVDTQWVVPHFEKMLYDNALLVRFY 286

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T       +  + +D+L RD+    G   SA DAD       T  +EG  Y WT
Sbjct: 287 AHLARRTGSALARRVTEETVDFLLRDLRTAAGAFASALDAD-------TDGEEGLTYAWT 339

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            +++ D++G +      E + +  TG  +           +G +VL    D         
Sbjct: 340 PQQIADVVGDDDGRWAAETFAVTDTGTFE-----------RGTSVLQLPAD--------- 379

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
            PL+   + L + R +L   R++RP+P  DDKV+ +WNGL I++ A A   L        
Sbjct: 380 -PLDA--DRLADVRSRLLAARTRRPQPARDDKVVTAWNGLAITALAEAGAALG------- 429

Query: 441 FNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLI 498
                    R +++E AE  A  +   HL D    RL+ +   G    P G L+DY  L 
Sbjct: 430 ---------RADWVEAAEECAHMVLSTHLVD---GRLRRASLGGTVGEPAGILEDYGALA 477

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +GL  L++     +WL  A  L +T  + F D  E G +F+T  +  +++ R ++  DGA
Sbjct: 478 AGLSTLHQVTGAAEWLEAATGLLDTAIDHFADPDEPGSWFDTADDAETLVARPRDPLDGA 537

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
            PSG SV+   L+  +S+VA  +S  Y   A  SL
Sbjct: 538 TPSGASVTTEALLTASSLVAADRSARYAVAAADSL 572


>gi|407778219|ref|ZP_11125484.1| hypothetical protein NA2_09603 [Nitratireductor pacificus pht-3B]
 gi|407299900|gb|EKF19027.1| hypothetical protein NA2_09603 [Nitratireductor pacificus pht-3B]
          Length = 668

 Score =  308 bits (788), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 216/697 (30%), Positives = 330/697 (47%), Gaps = 87/697 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFE++ VA ++N  F++IKVDREERP++D++YM  + A    GGWPL++F
Sbjct: 53  CHWCHVMAHESFENDAVAAVMNRLFINIKVDREERPEIDQIYMAALAATGEQGGWPLTMF 112

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P  GGTYFPPE ++GRPGF  +L+ +  AW +KR  L +S       +  +L+ 
Sbjct: 113 LTPDGSPFWGGTYFPPEPRFGRPGFVQVLQAIDAAWREKRHELTKSAGNLKAHVQASLAP 172

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
                  PD +    LR  A ++    D   GG   APKFP    ++++     +  D  
Sbjct: 173 PPGEPPEPDAM----LRDLAARVHGMIDPALGGLRGAPKFPNAPFMKILWLDGIQHGDRT 228

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
           +        + V  +L+ M  GGI+DHVGGG  RY+VD+RW VPHFEKMLYD  QL  + 
Sbjct: 229 RI-------EAVADSLRHMLSGGIYDHVGGGLARYAVDDRWVVPHFEKMLYDNAQLLQLL 281

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
              ++ T D  +     + +D+L R+M   GG   S+ DAD       T  +EG  YVW+
Sbjct: 282 CWVYARTHDQLFRIRIEETVDWLLREMRVDGGGFASSLDAD-------TDGEEGKTYVWS 334

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
            +E+ ++LG  A  F + + L+        + +D H +     +L  LN  +A+      
Sbjct: 335 RQELGEVLGSEAGAFLDVFTLE--------KPADWHRD----PILHRLNHPAATDPASET 382

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
            +   L+       +L   R  RP+P  DDK++V WNG+ I++ A A ++L         
Sbjct: 383 RMRTLLD-------RLLVARQARPQPGRDDKLLVDWNGMTITALATAGRLL--------- 426

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
                  DR ++ + A +A  F+   +   +  RL HS R      P    DYA +IS  
Sbjct: 427 -------DRPDWTQAARTAFRFVCESM---ENGRLPHSIRGDKQLFPALSSDYAAMISAA 476

Query: 502 LDLYEFGSGTKWLV----WAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             LY   S    L     WA +LQ        D+ G G++ +  +   V +R++ D D A
Sbjct: 477 TALYGATSDDALLQQARKWAGQLQRWHQ----DKAGSGFYMSASDSGDVPMRIRGDVDEA 532

Query: 558 EPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
            PS  S  +  L  LA++    + +    + A  +L     +    A  V      A  +
Sbjct: 533 IPSATSQVIEALAALATLTGDEEMTGLLHETARTALGRAARQPYGQAGTV-----HAASV 587

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
           +V +RK +V+V    SV F                V + +P D    D          ++
Sbjct: 588 AVSARK-LVMVEPAGSVVF--------------IPVANRNP-DPRRFDSVVSTGGEKVTL 631

Query: 677 ARN-NFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
             +      +  A +C   +C PP T+P +LE  L E
Sbjct: 632 PGDVVVDTTRPAAYLCIGQTCLPPFTEPSALEEALRE 668


>gi|257057143|ref|YP_003134975.1| highly conserved protein containing a thioredoxin domain-containing
           protein [Saccharomonospora viridis DSM 43017]
 gi|256587015|gb|ACU98148.1| highly conserved protein containing a thioredoxin domain protein
           [Saccharomonospora viridis DSM 43017]
          Length = 667

 Score =  308 bits (788), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 216/694 (31%), Positives = 322/694 (46%), Gaps = 88/694 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESF D  VA  +N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ F
Sbjct: 49  CHWCHVMAHESFADADVAAFMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD KP   GTY+PP    G P FK +L  V  AWD++RD L +     ++ ++E    
Sbjct: 109 LTPDGKPFHCGTYYPPVPTQGMPSFKQVLTAVAQAWDERRDELVEGAGRIVDHIAE---- 164

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
             +    P  +  + +     +L    D   GGFG APKFP  + ++ +L H ++     
Sbjct: 165 -QTRPLSPQPVTADTIASAVAKLRTEVDPENGGFGGAPKFPPSMVLEFLLRHYERT---- 219

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
              ++ E   +V  T + MA+GG++D + GGF RYSVD  W VPHFEKMLYD   L   Y
Sbjct: 220 ---DSMEVLSIVDMTAEGMARGGVYDQLAGGFARYSVDAEWVVPHFEKMLYDNALLLRCY 276

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T       +  +  ++L RD+  P G   S+ DAD+   EG T       YVWT
Sbjct: 277 AHLARRTGSPLAHRVAGETAEFLLRDLRTPQGGFASSLDADAEGVEGLT-------YVWT 329

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
            +++ D+LG +      E + +   G  +           +G + L    D    A    
Sbjct: 330 REQLVDVLGPDDGAWAAETFGVTEEGTFE-----------RGASTLRLPQDPDDPA---- 374

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
               +++ +       L D R++RP+P  DDKVI +WNGL I++ A A   L+       
Sbjct: 375 ----RWMRVTS----TLLDARNERPQPARDDKVIAAWNGLAITALAEAGVALQ------- 419

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLI 498
                    R +++E A +A SF+   H  D+    L+ S R+G   +A   L+DY    
Sbjct: 420 ---------RPDWIEAAVAAGSFVLDVHKTDDG---LRRSSRDGVVGEADAVLEDYGCFA 467

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELF-LDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
            GLL L++     +WL  AI L +     F ++   G Y +T  +   ++ R  +  D A
Sbjct: 468 DGLLALHQATGEPRWLEEAIALLDIALRRFGVEGMPGAYHDTAVDAEELVHRPSDPTDNA 527

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCA 612
            PSG S     L+  +++    ++  YR   E +LA    R   +   VP      +  A
Sbjct: 528 SPSGASALAGALLTASALAGPERASAYRAACEEALA----RAGALIAQVPRFAGHWLSVA 583

Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
             ML+ P +  VV    +     E  +  A  +      V+   P D E +         
Sbjct: 584 EAMLAGPVQVAVVGTDARQR---ERFVVEAAQNIHGGGVVLGGVP-DAEGVPL------- 632

Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
              +        +  A VC+ + C  PVT P +L
Sbjct: 633 ---LTDRPLVDGRPAAYVCRGYVCDRPVTTPEAL 663


>gi|384567356|ref|ZP_10014460.1| thioredoxin domain-containing protein [Saccharomonospora glauca
           K62]
 gi|384523210|gb|EIF00406.1| thioredoxin domain-containing protein [Saccharomonospora glauca
           K62]
          Length = 670

 Score =  308 bits (788), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 226/701 (32%), Positives = 328/701 (46%), Gaps = 89/701 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESF D+ VA  +ND FV+IKVDREERPD+D VYMT  QA+ G GGWP++ 
Sbjct: 48  ACHWCHVMAHESFSDDEVAAFMNDHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTC 107

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD KP   GTY+PP   +G P FK +L  V  AW ++RD L +     ++ + E   
Sbjct: 108 FLTPDGKPFHCGTYYPPVPAHGMPSFKQVLVAVDQAWRERRDELVEGAGRVVDHIVE--- 164

Query: 141 ASASSNKLPDELPQNALRLCA--EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
                 K     P  A  + A   +L +  D   GGFG APKFP  + ++ +L H    E
Sbjct: 165 ----QTKPLSLRPVTAETVAAAVSKLRREADPGNGGFGGAPKFPPSMVLEFLLRH---YE 217

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG    + E   +V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 218 RTG----SVEALSVVDATAEGMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLL 273

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
             Y      T       +  +  ++L RD+  P G   S+ DAD+   EG T       Y
Sbjct: 274 RFYAHLARRTGSALAYRVAGETAEFLLRDLRTPQGAFASSLDADTEGVEGLT-------Y 326

Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           VWT +++ D+LG E      + + +   G  +           +G + L    D    A 
Sbjct: 327 VWTPQQLVDVLGPEDGAWAAKLFGVTEEGTFE-----------RGASTLQLRRDPDDPA- 374

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                  +++ +     R     R+ RP+P  DDKVI +WNGL I++ A A   L+    
Sbjct: 375 -------RWMRVTSALSR----ARAARPQPARDDKVIAAWNGLAITALAEAGVALR---- 419

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYA 495
                       R E++E A +AA+F+   H+  +    L+ S R+G    A   L+DY 
Sbjct: 420 ------------RPEWVEAAVAAAAFVLDVHVGGDGAEGLRRSSRDGVVGDAAAVLEDYG 467

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELF-LDREGGGYFNTTGEDPSVLLRVKEDH 554
            L  GLL L++      WL  A  L +T    F +D   G + +T  +  +++ R  +  
Sbjct: 468 CLADGLLALHQATGEPVWLTEATALLDTALRRFGVDGAPGAFHDTAADAEALVHRPSDPT 527

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LM 609
           D A PSG S     L+  +++    ++  YR   E +L    +R   +   VP      +
Sbjct: 528 DNASPSGASALAGALLTASALAGPERAGAYRAACEEAL----SRAGVLVEQVPRFAGHWL 583

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
             A  +LS P +  VV  G K   D   ++A A         V+  +P + E +    + 
Sbjct: 584 SVAEALLSGPVQVAVVGAGAK---DRAELVAEAARGVHGGGVVLGGEP-EAEGVPLLADR 639

Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
              + + A          A VC+ + C  PVT P +L   L
Sbjct: 640 PLVDGAPA----------AYVCRGYVCDRPVTTPEALARSL 670


>gi|421744678|ref|ZP_16182637.1| thioredoxin domain-containing protein [Streptomyces sp. SM8]
 gi|406686908|gb|EKC90970.1| thioredoxin domain-containing protein [Streptomyces sp. SM8]
          Length = 675

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 233/704 (33%), Positives = 335/704 (47%), Gaps = 91/704 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFEDE  A ++N  FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 48  SACHWCHVMAHESFEDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VFL+P+ +P   GTYFPPE ++G PGF+ +L  V+ AW ++R  + +     +  L E  
Sbjct: 108 VFLTPEGEPFYFGTYFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERR 167

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
            A     +LP  +E  Q  L      L++ YD   GGFG APKFP  + ++ +L H  + 
Sbjct: 168 LALGEP-RLPGAEEAAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR- 220

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 221 --TGAEG----ALQMAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALL 274

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY+  +  T       +  +  +++ RD+  P G   SA DADSA+  G  R  EGA+
Sbjct: 275 CRVYVHLWRATGSEQARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAY 332

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVWT  ++ ++LGE    +   H+ +   G             F+    ++ L     + 
Sbjct: 333 YVWTPAQLVEVLGEEDGRIAAAHFGVTEEGT------------FEEGASVLRLPQEDGAV 380

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
              G         +   R +L++ R +RP P  DDKV+ +WNGL I++ A A        
Sbjct: 381 QDAGR--------IASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF---- 428

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
                       +R + ++ A +AA   +R HL D    RL  + R+G  S   G L+DY
Sbjct: 429 ------------ERPDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDY 474

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
           A +  G L L        WL +A  L +   + F D E G  ++T  +   ++ R ++  
Sbjct: 475 ADVAEGFLALASVTGEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPT 533

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
           D A PSG + +      L    A + S+ +R  AE +L V    +  +   VP      +
Sbjct: 534 DNATPSGWTAAAGA---LLGYAAQTGSEPHRTAAERALGV----VAALGPKVPRFIGNGL 586

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFW 666
                +L  P  + V +VG       +   A  H +  L+     V+   PAD E     
Sbjct: 587 AVTEALLDGP--REVAVVGDPD----DPRTAVLHRTALLSTAPGAVVAAGPADGE----- 635

Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                    +A    +     A VC+ F C  P TDP  L   L
Sbjct: 636 ------LPLLAGRVPAEGAPTAYVCRGFVCDAPTTDPALLAAQL 673


>gi|346321450|gb|EGX91049.1| DUF255 domain protein [Cordyceps militaris CM01]
          Length = 735

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 199/617 (32%), Positives = 316/617 (51%), Gaps = 72/617 (11%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+C +M +ESF +   A +LND F+ + +DRE RPD+D +YM YVQA+   GG
Sbjct: 77  HIGYKACHYCRLMSIESFANAECAAVLNDAFIPVLIDRESRPDLDTIYMNYVQAVSSVGG 136

Query: 76  WPLSVFLSPDLKPLMGGTYFPPEDKYGRP---------GFKTILRKVKDAWDKKR----- 121
           WPL++F++P+L+P+ GGTY+P  +   R           F TI++KV+D+W ++      
Sbjct: 137 WPLNLFVTPELEPVFGGTYWPGPNAARRAHDESTEDALDFLTIIKKVRDSWKEQESRCRK 196

Query: 122 ---DMLAQSGAFAIEQLSEALSASASSNKLP----------------------DELPQNA 156
              ++LAQ   FA E        + + N +P                       EL  + 
Sbjct: 197 EATEVLAQLREFAAEGTLGTRPVTQTQNFVPSGWAAPISSESSQGMDKTASVSSELDLDQ 256

Query: 157 LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHS--KKLEDTGKSGEASEGQKMV 213
           L      ++ ++D  +GGFG APKF  P ++Q +L  H+    ++D     E +    M 
Sbjct: 257 LEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLQFLLELHTSPSAVQDIVGEAECAHATDMA 316

Query: 214 LFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF----SLT 268
           L TL+ +  G +HDHVG  GF R SV   W +P+FEK++ D  QL ++YL A+       
Sbjct: 317 LDTLRKIRDGALHDHVGATGFARCSVTPDWTIPNFEKLVVDNAQLLSLYLTAWHRAGGQA 376

Query: 269 KDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
              FY  I  ++++YL    ++   G + S+E ADS    G    KEGAFY+WT +E + 
Sbjct: 377 TSEFYD-IVLELVEYLTSTPILRSDGLLASSEAADSYVRNGDRGMKEGAFYLWTKREFDS 435

Query: 328 IL-----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 382
           ++     G   ++   H+ +   GN D     DP+++F  +N+L  +  S   +    + 
Sbjct: 436 VIEAAEKGASPVV-AAHWGVLEDGNVD--EQHDPNDDFMKQNILRVVKTSEELSKLFSVS 492

Query: 383 LEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
           +E+    +   R +L   R  +R RP +DDK +  WNGL +S+ A+        AE+ + 
Sbjct: 493 VERIEQSIHTARNELKRRREGERVRPEVDDKAVTGWNGLALSALAKT-------AEALVT 545

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
             P + +   +   VA   ASFI++HL+D Q+ ++ +    G      F +DYA++I GL
Sbjct: 546 VNPEISA---KCNTVASGIASFIQKHLWDTQS-KILYRIWTGDRDTEAFAEDYAYVIQGL 601

Query: 502 LDLYEFGSGTKWLVWAIELQNT--QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
           LDL++       + +A +LQ T  Q   F D   GG+F TT E    +LR+K+  D + P
Sbjct: 602 LDLFDTNGDESLIAFADQLQRTEAQASYFYD-AAGGFFTTTAESTFAILRLKDGMDTSLP 660

Query: 560 SGNSVSVINLVRLASIV 576
           S N+VSV NL RL  ++
Sbjct: 661 STNAVSVSNLYRLGQLL 677


>gi|376005318|ref|ZP_09782832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375326245|emb|CCE18585.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 686

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 207/631 (32%), Positives = 312/631 (49%), Gaps = 97/631 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERP++D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+P D  P  GGTYFP E +YGRPGF  +L+ + + +   ++ L       + QL ++
Sbjct: 108 VFLTPGDRIPFYGGTYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +         P EL ++ L+   E  +     + +GG    P+FP  +    M +   +L
Sbjct: 168 MILP------PSELTEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRL 216

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             + K     +G+   L   + +  GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 217 ISSPK----VDGKAACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272

Query: 258 ANVYLDAFS-LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                D +S   K   Y       +++L+R+M  P G  ++A+DADS  T      +EGA
Sbjct: 273 LEFLADLWSDGEKQPAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGA 332

Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FYVWT++E+E  L        +  + +  +GN            F+GK VL   N     
Sbjct: 333 FYVWTNQELETFLSPAEFGELQAQFTVTKSGN------------FEGKTVLQRWN----- 375

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL-------------------------D 410
             +L   +E  L        KLF VR   P   +                         D
Sbjct: 376 CDELDPLIETALT-------KLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTD 428

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY- 469
            K+IV+WN L+IS  A+A+++L                D  EY+E+A  AA F+  H + 
Sbjct: 429 TKMIVAWNALMISGLAKAARVL----------------DNSEYLELATKAAKFVLEHQWV 472

Query: 470 DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-----WLVWAIELQNTQ 524
           D++ HR+ +   +G        +DYA LI  L+DL++           WL  A+++QN  
Sbjct: 473 DDRFHRVNY---DGKVAVLSQSEDYALLIKALIDLHQASLQQPELADFWLTNAVQVQNEF 529

Query: 525 DELFLDREGGGYFNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
           D+     E GGYFNT  +D  ++L+R +   D A P+ N V++ NLVRL  +   ++   
Sbjct: 530 DQYLWSVELGGYFNTALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLN 586

Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           Y   A  +L  F + ++    A P +  A D
Sbjct: 587 YLDRALQALEAFASVMRQSPQACPSLFVAFD 617


>gi|257059286|ref|YP_003137174.1| hypothetical protein Cyan8802_1422 [Cyanothece sp. PCC 8802]
 gi|256589452|gb|ACV00339.1| protein of unknown function DUF255 [Cyanothece sp. PCC 8802]
          Length = 688

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 233/710 (32%), Positives = 333/710 (46%), Gaps = 114/710 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A  LND F+ IK+DREERPD+D +YM  VQ +   GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIAAYLNDNFLPIKLDREERPDLDSLYMQAVQMMGIQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP E +YGRPGF  +L+ ++  +D ++D L    +F      E 
Sbjct: 108 IFLTPDDLVPFYGGTYFPIEPRYGRPGFLQVLQSIRRFYDTEKDKL---NSFK----HEI 160

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPK-FPRPVEIQMMLYHSKKL 197
           L     S  LP     NA  L  E   +   +        P+ F RP    M+ Y +  L
Sbjct: 161 LDTLQKSAILP---VTNAELLNNELFYRGITANTEVIIVNPQDFNRPC-FPMIPYANLAL 216

Query: 198 EDTGKSGEASEGQKMVLFTL-QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
           + +  + ++ E Q  V +   + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ
Sbjct: 217 QGSRFAFQSQENQATVTYQRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQ 276

Query: 257 ----LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
               LAN++   +   +  F   I R + ++L+R+M  P G  ++A+DAD+  T      
Sbjct: 277 IVEYLANLWSQGYQ--EPAFKRAIARTV-EWLQREMTAPQGYFYAAQDADNFTTPDEKEP 333

Query: 313 KEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
           +EGAFYVW  +E+E+ L  E   L +  + L   GN            F+G NVL     
Sbjct: 334 EEGAFYVWKFQELEEYLNSEEFKLLEATFSLTAEGN------------FEGSNVLQRRMG 381

Query: 372 SSASASKLGMPLEKYLNILGECRRKLF-------------DVRSKRPRPHLDDKVIVSWN 418
              S +   +  + ++   G  R+ L                   R  P  D K+IV+WN
Sbjct: 382 GEFSEALEAILDKLFMIRYGSSRKTLTTFPPAKNNQEAKNQTWPGRIPPVTDTKMIVAWN 441

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQ 477
            L+IS  ARA  +         F  P+       Y E+A +A  FI +  + + + +RL 
Sbjct: 442 SLMISGLARAYGV---------FGDPL-------YWELAINATEFILQEQWVNNRLYRLN 485

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGY 536
           +    G        +DYAF I  LLDL       + WL  A E+Q   DE F   EGGGY
Sbjct: 486 YE---GQPSVLAQAEDYAFFIKALLDLQRANPWERQWLEKAKEVQEEFDEFFWSIEGGGY 542

Query: 537 FNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           +N   ++   +L+R +   D A PS N V++ NLVRL+ +        Y   AE  L  F
Sbjct: 543 YNNASDNSGDLLIRERSYIDNATPSANGVALSNLVRLSRLTDDLD---YLHRAEQGLQTF 599

Query: 596 ETRLKDMAMAVPLMCCAADML----SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 651
            + L     A P +  A D      SV + K +                       L + 
Sbjct: 600 SSVLSQSPKACPSLFVALDWYRFGNSVQTTKEI-----------------------LKQF 636

Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
           +    P    ++    +H  +N+            V LVCQ  SC  P T
Sbjct: 637 ITQYFPVTVYQLT---DHLPDNS------------VGLVCQGLSCLEPAT 671


>gi|258511893|ref|YP_003185327.1| hypothetical protein Aaci_1926 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius DSM 446]
 gi|257478619|gb|ACV58938.1| protein of unknown function DUF255 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius DSM 446]
          Length = 626

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 209/602 (34%), Positives = 293/602 (48%), Gaps = 54/602 (8%)

Query: 27  VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 86
           +M  ESFEDE VA +LN  +V+IKVDREERPD+D +YMTY QAL G GGWPL++ ++PD 
Sbjct: 1   MMAHESFEDETVAAILNAHYVAIKVDREERPDIDHIYMTYCQALQGEGGWPLTIIMTPDG 60

Query: 87  KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 146
            P   GTYFP   +YGRPG   IL+++   W   R  L ++     E++       A   
Sbjct: 61  HPFFAGTYFPKTPRYGRPGLIQILQEIARLWQTDRARLERASRSMAERMQPLFEGQAGEA 120

Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
           +      + A     E L  ++D+ +GGFG APKFP    +Q +L ++ +L  + ++   
Sbjct: 121 R-----GREAADRAYEALEATFDTEYGGFGPAPKFPTFHRVQFLLRYA-RLRPSERAA-- 172

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
                M L TL+ + +GGI DHVGGG  RYS D  W VPHFEKMLYD       Y DA++
Sbjct: 173 ----AMALSTLRAIQRGGIVDHVGGGMARYSTDPFWRVPHFEKMLYDNALALAAYADAYA 228

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
             KD  +    R  + +  R+M  P G  +SA DADS+         EG FY W  ++V 
Sbjct: 229 HAKDPAFLRFVRQTVAFFEREMRSPEGLYYSAVDADSS-------GGEGRFYFWRPEDVI 281

Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLE 384
             LG E   L+   Y +   GN            F+G NV   ++ D +A A+  GM  E
Sbjct: 282 AALGPEDGELYNAFYDITEAGN------------FEGANVPNYIDQDPAAFAASRGMTEE 329

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
           +    L     KL  VR  R RP +DDK + +WN L+    ARA    K  A        
Sbjct: 330 ELWQKLDALNEKLRAVRDARERPAIDDKCLTAWNALMAYGLARAGLACKETA-------- 381

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
                   +++ A    + I R L      RL   +R+G +    + DD+A+L++  L+L
Sbjct: 382 --------WVDRAREVVAAIERILMRADDGRLLARYRDGEAGIFAYADDHAYLVAAYLEL 433

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV-KEDHDGAEPSGNS 563
           Y       +L  A   Q  QD LF D+  GGY    G D   L+ V K  +DGA PS NS
Sbjct: 434 YRATLDRAYLDRARHWQAVQDALFWDKAQGGY-TFYGRDAESLIAVPKPVYDGAMPSANS 492

Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
            S  NL  L ++   ++   Y    +  +  F   +    M    +  AA M  V S + 
Sbjct: 493 QSAHNLWILHALTGDAE---YADRLDGLVRAFGGDIASAPMDCLWLVTAAMMSEVGSTEI 549

Query: 624 VV 625
           V+
Sbjct: 550 VI 551


>gi|256389916|ref|YP_003111480.1| hypothetical protein Caci_0704 [Catenulispora acidiphila DSM 44928]
 gi|256356142|gb|ACU69639.1| protein of unknown function DUF255 [Catenulispora acidiphila DSM
           44928]
          Length = 710

 Score =  307 bits (786), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 197/577 (34%), Positives = 294/577 (50%), Gaps = 61/577 (10%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFEDE  A L+N+ +V +KVDREERPDVD VYM   QA+ GGGGWP++VF
Sbjct: 49  CHWCHVMAHESFEDEATAALMNEKYVCVKVDREERPDVDAVYMAATQAMTGGGGWPMTVF 108

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
            +P+ KP   GTY+PP  ++G P F+ +L  V  AW   R+ + ++G   + +L+     
Sbjct: 109 ATPEGKPFQAGTYYPPVARHGLPSFRQLLVAVDRAWGDIREDVLRAGDGLVAELAHHARV 168

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
            A +  +PD     AL      L + +D   GGFG APKFP  + ++ +L H  +  D  
Sbjct: 169 VAGAEGVPD---AGALATAVGVLRREFDGVRGGFGGAPKFPPSMTLEQLLRHHARTGD-- 223

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                ++   MV  T + MA+GG++D +GGGF RY+VD+ W VPHFEKMLYD   L   Y
Sbjct: 224 -----ADALAMVRQTCEAMARGGMYDQLGGGFARYAVDDAWVVPHFEKMLYDNALLLRAY 278

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMI--GPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           L  +  T D     +  +  D++ R++   G GG   S+ DAD       T   EG FY 
Sbjct: 279 LHLWRATGDALALRVVNETADWMLRELWLDGAGG-FASSLDAD-------TDGVEGKFYA 330

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIELNDSSASASK 378
           W ++++ D +GE     KE                     F+ G +VL  L D       
Sbjct: 331 WDAEQIADAVGE-----KEAGDAGDAAWAAAVFNVTAQGTFEHGLSVLQLLQDPD----- 380

Query: 379 LGMPLEKYLNILGECRRKLFDV-RSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
               L+++  I    R  LF+  R +R  P  DDK + +WNGL +++ A A  +      
Sbjct: 381 ---DLDRFQRI----RDSLFEARRDQRTAPGRDDKAVAAWNGLAVAALAEAGAL------ 427

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA--PGFLDDYA 495
                     + R+E +  A   A  + R  +D +T RL  + R+G + A  PG L+DYA
Sbjct: 428 ----------TGRQELVSAARQTAEMLERIHWDGKTMRLTRTSRDGVAGAQNPGVLEDYA 477

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
            +  GLL LY     T+W  +A  L +   + F D + G +++T  +  +++ R  +  D
Sbjct: 478 DVAEGLLALYAVTGETRWFAFAGRLLDVVLDNFRD-DSGLFYDTADDAEALIFRPADPTD 536

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
            A P G S +   L+  A++   + S  +R+ AE +L
Sbjct: 537 NATPGGTSAAAGALLTYAAL---TGSGRHREAAEQAL 570


>gi|119488064|ref|ZP_01621508.1| hypothetical protein L8106_11722 [Lyngbya sp. PCC 8106]
 gi|119455353|gb|EAW36492.1| hypothetical protein L8106_11722 [Lyngbya sp. PCC 8106]
          Length = 688

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 210/638 (32%), Positives = 321/638 (50%), Gaps = 109/638 (17%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  VA+ +N+ F+SIKVDREERP++D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDGAVAQYMNEHFISIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FLSP DL P +GGTYFP + +YG+PGF  +LR+V+  ++ ++  L        +++  A
Sbjct: 108 IFLSPDDLVPFVGGTYFPVQPRYGQPGFLEVLRRVRGFYNTEKTRLQNLK----QEIRNA 163

Query: 139 LSASA--SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
           L  S   S+++L + L Q  L      +++   +  GG    P+FP      M+ Y    
Sbjct: 164 LVQSTVLSASQLNEGLLQQGLTTNTAVITR---NDLGG----PRFP------MIPYADTA 210

Query: 197 LEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           L D     E+  + Q+        +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD G
Sbjct: 211 LHDVRFDFESPYDSQQACTQRGTDLASGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNG 270

Query: 256 QLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q+     + +S  +TK  F   I   +  +L+R+M  P G  ++++DAD+  T      +
Sbjct: 271 QIVEYLANLWSAGITKPAFERSISGTV-SWLKREMTAPKGHFYASQDADNFTTPEDVEPE 329

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EG FYVW  +++E+I+  E     +  + +  +GN            F+GKNVL   N  
Sbjct: 330 EGEFYVWNWQDLEEIVSPEEFGELQAQFSITKSGN------------FEGKNVLQRWN-- 375

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVR-------------------------SKRPRP 407
                 L  P+E  L        KLF VR                         S R  P
Sbjct: 376 ---CDALSQPIESAL-------AKLFAVRYGAKPQDLETFPPATNNQEAKSKNWSGRIPP 425

Query: 408 HLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH 467
             D K+IV+WN L+IS  ARA+ + +                + EY+++A +AA FI  +
Sbjct: 426 VTDTKMIVAWNSLMISGLARAATVFQ----------------QPEYLKIATTAAQFILEN 469

Query: 468 LY-DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE-------FGSGTKWLVWAIE 519
            + D + HR+ +   +G        +DYA  I  L+DL++       F     W   A++
Sbjct: 470 QWVDGRLHRVNY---DGNPDVLAQSEDYALFIKALIDLHQASLIESSFQLPEYWFEKAVK 526

Query: 520 LQNTQDELFLDREGGGYFNT---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
           +Q   D+     E GGY+N    TG++  +L+R +   D A P+ N V++ NLVRL   +
Sbjct: 527 VQQEFDQFLWSVELGGYYNIGTDTGQE--LLMRERSYTDNATPAANGVAMANLVRL--FL 582

Query: 577 AGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
              + DY  + AE  +  F + ++    A P +  A D
Sbjct: 583 LTEQLDYLDK-AEQGIQAFSSIMEKSPQACPSLFVALD 619


>gi|409198348|ref|ZP_11227011.1| thioredoxin domain-containing protein [Marinilabilia salmonicolor
           JCM 21150]
          Length = 675

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 210/697 (30%), Positives = 323/697 (46%), Gaps = 81/697 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  E FEDE  A+L+N+ F+ IKVDREERPDVD  ++T VQ +   GGWPL+
Sbjct: 53  SACHWCHVMAHECFEDEETARLMNEHFICIKVDREERPDVDNFFITAVQLMGAQGGWPLN 112

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V   PD +P  GGTYFP +       +K IL K+   +   R+ L          + +  
Sbjct: 113 VVTLPDGQPFWGGTYFPKDQ------WKEILIKINKLFHSDREKLTHHAHQLTTGIQQTS 166

Query: 140 SASASSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML----YH 193
             S+  +++PD  E+   AL    E+ S  +D + GG    PKFP PV ++ +L    +H
Sbjct: 167 MISSEQSEVPDLSEVINEAL----ERWSAQWDLQLGGSLGKPKFPMPVNLEFLLHLHFHH 222

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
            +K+               +  TLQ MA+GGI+D  GGGF RYSVDE W VPHFEKMLYD
Sbjct: 223 PQKM-----------FSDFLNTTLQQMARGGIYDQAGGGFARYSVDEFWKVPHFEKMLYD 271

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
             QL  +Y  A++ +    Y  + ++ + ++   ++ P G  FSA DADS   EG    +
Sbjct: 272 NAQLIELYSHAYAHSGIKEYRDVVKETIAFVENKLMHPSGAFFSALDADS---EG----E 324

Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           EG +YVWT +E+ +I G    LF +++ +   G+ +            G  +L+      
Sbjct: 325 EGKYYVWTEEELLNIFGRDFPLFADYFNVNENGHWE-----------NGNYILLRTGSDE 373

Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
             A K  M LE+    +   ++ L + R KR RP LDDK I SWN L+      A K + 
Sbjct: 374 EFAHKHKMTLEEVEKRVSVWKKDLVNRRKKRIRPGLDDKTITSWNALMTKGLVEAHKAVS 433

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
                              + ++A     FI   L  +    L  ++++G +   GF++D
Sbjct: 434 D----------------SHFRKLALKNGEFICHSLISKDG-SLFRTWKDGRASVTGFMED 476

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
           YA +IS  + LYE     KW+  +  L +  ++ F D+  G +         +     + 
Sbjct: 477 YASVISAFIGLYEITGDEKWIEQSSRLADYAEKAFYDKATGQFHYMEKNQTELPANHFDT 536

Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
            D   PS NS+    L +LA++       +YR+ AE  L     + K+            
Sbjct: 537 QDNVIPSANSMMGHALFKLAALTG---DQHYRETAEKMLNQMLLQFKNYPWGFAHWGSLM 593

Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
            M+  PS + VV+ G K+    + +       Y  N     + P    E++         
Sbjct: 594 LMIHKPSFE-VVVAGSKTVQALQRL----QKQYRPNVIWAPLKPESPGELN--------- 639

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             + +N  S +++   VC   +C  PV      ++LL
Sbjct: 640 --ITKNRKSDEEITIYVCAQGACQLPVHSVEEAQHLL 674


>gi|390440171|ref|ZP_10228522.1| Six-hairpin glycosidase-like [Microcystis sp. T1-4]
 gi|389836455|emb|CCI32648.1| Six-hairpin glycosidase-like [Microcystis sp. T1-4]
          Length = 692

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 214/624 (34%), Positives = 312/624 (50%), Gaps = 82/624 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-- 195
           L  SA   +    L + +L     E  +K        +G  P FP      + L  S+  
Sbjct: 164 LRQSAILPRAETNLAEPSLLATGIETNTKVIRVNPNNYGR-PSFPMIPYSHLALQGSRFG 222

Query: 196 -KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
              +D+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD 
Sbjct: 223 DDFDDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDN 274

Query: 255 GQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           GQ+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +
Sbjct: 275 GQIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPE 334

Query: 314 EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGAFYVW+ + + D L    + L + ++ +   GN            F+G+NVL      
Sbjct: 335 EGAFYVWSDRSLRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----Q 377

Query: 373 SASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVI 414
                KLG  +E  L+ L     G  + +L      R                  D K+I
Sbjct: 378 RRQGGKLGKEIENMLDKLFIRRYGSSQSQLALFPPARDNQEAKTVSWPGRIPAVTDTKMI 437

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQT 473
           V+WN L+IS  ARA          A+F  P+       Y ++A  AA FI +H + D + 
Sbjct: 438 VAWNSLMISGLARA---------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRF 481

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDRE 532
            RL +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   +
Sbjct: 482 QRLNY---QGQASVLAQSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAED 538

Query: 533 GGGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
            GGYFN T  D S+ L V+E    D A PS N +++ NL+RL+ +    +   Y   AE 
Sbjct: 539 EGGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEK 594

Query: 591 SLAVFETRLKDMAMAVPLMCCAAD 614
           +L  F T L+    A P +  A D
Sbjct: 595 ALQSFTTILEQSPTACPSLFVALD 618


>gi|291569597|dbj|BAI91869.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 686

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 205/620 (33%), Positives = 315/620 (50%), Gaps = 75/620 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERP++D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+P D  P  GGTYFP E +YGRPGF  +L+ + + +   ++ L       + QL ++
Sbjct: 108 VFLTPGDRIPFYGGTYFPIEPRYGRPGFLDLLKAIHNFYHTDKNKLETVTEEILTQLRQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +         P EL ++ L+   E  +     + +GG    P+FP      M    S+ +
Sbjct: 168 VILP------PSELTEDLLKQGLETNTGVVGRNNYGG----PRFPMIPYADMAWRGSRLI 217

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             +   G+A+  Q+      + +  GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 218 SSSKVDGKAACLQRG-----KDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272

Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                D +S   K   +       +++L+R+M  P G  ++A+DADS  T      +EGA
Sbjct: 273 LEFLADLWSEGEKQPAFQRSINGTVEWLKREMTAPQGYFYAAQDADSFVTSQDKEPEEGA 332

Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV---------- 365
           FYVWT++E+E  L  E     +  + +  +GN            F+GK V          
Sbjct: 333 FYVWTNQELETFLTSEEFGELQAQFTVTKSGN------------FEGKTVLQRWNCDELD 380

Query: 366 -LIELNDSSASASKLGMPLEKYLNI-LGECRR--KLFDVRSKRPRPHLDDKVIVSWNGLV 421
            LIE   +   A + G P E+     + E  +  K  D   + P    D K+IV+WN L+
Sbjct: 381 PLIETALAKLFAVRYGAPPEEVKTFPVAENNQGAKQRDWPGRIP-AVTDTKMIVAWNALM 439

Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSF 480
           IS  A+A+++                 D  EY+E+A +AA FI +H + D++ HR+ +  
Sbjct: 440 ISGLAKAARVF----------------DNSEYLELATTAAKFILKHQWVDDRFHRVNY-- 481

Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-----WLVWAIELQNTQDELFLDREGGG 535
            +G        +DYA  +  L+DL++           WL  A+ +Q+  DE     E GG
Sbjct: 482 -DGQVAVLSQAEDYALFVKALIDLHQASLQQPELAEFWLTNAVNVQSELDEYLWSMELGG 540

Query: 536 YFNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           YFNT  +D  ++L+R +   D A P+ N V++ NLVRL  +   ++   Y   A  +L  
Sbjct: 541 YFNTALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRAGQALEA 597

Query: 595 FETRLKDMAMAVPLMCCAAD 614
           F + ++    A P +  A D
Sbjct: 598 FASIMRQSPQACPSLFVAFD 617


>gi|354566297|ref|ZP_08985470.1| hypothetical protein FJSC11DRAFT_1676 [Fischerella sp. JSC-11]
 gi|353546805|gb|EHC16253.1| hypothetical protein FJSC11DRAFT_1676 [Fischerella sp. JSC-11]
          Length = 691

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 236/724 (32%), Positives = 338/724 (46%), Gaps = 123/724 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D G+A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDPGIAEYMNANFIPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
            FLSP DL P   GTYFP E +YGRPGF  +L+ ++  +D ++  L    A  +E L  S
Sbjct: 108 AFLSPDDLVPFYAGTYFPVEPRYGRPGFLQVLQAIRHYYDTEKQDLRDRKAVILESLLTS 167

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             L    ++     EL           ++    +++G       FP     ++ L    +
Sbjct: 168 AVLQQQGTTATQDKELLHKGRETSTGIITP---NQYGN-----SFPMIPYAELAL-RGTR 218

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
            E T +     +G+++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ
Sbjct: 219 FEVTSE----YDGKQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQ 274

Query: 257 LANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS------AETEG 308
           +     + +S  + +  F   I   +  +L+R+M  P G  ++A+DADS         +G
Sbjct: 275 IVEYLANLWSAGIEEPAFKRAIAGTV-QWLKREMTAPEGYFYAAQDADSFTPPYQGGDKG 333

Query: 309 ATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
            +  +EGAFYVWT  E+E +L  E  I  ++ + +   GN            F+ KNVL 
Sbjct: 334 GSEPEEGAFYVWTFSELEQLLTAEELIELQQQFTVTANGN------------FESKNVLQ 381

Query: 368 ELNDSSASASKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHL 409
                  SA+     +E  L  L   R              R   + +S+    R     
Sbjct: 382 RRRSGELSAT-----VETALKKLFVARYGATPESLETFPPARNNQEAKSRHWPGRIPAVT 436

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           D K+IV+WN L+IS  ARA          A+F  PV       Y+E+A +AA FI  H +
Sbjct: 437 DTKMIVAWNSLMISGLARA---------YAVFREPV-------YLELATTAADFIVNHQF 480

Query: 470 -DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDEL 527
            D + HRL  ++ N P+      +DYAF I  LLDL        KWL  AI LQ   DE 
Sbjct: 481 VDGRFHRL--NYENQPT-VLAQSEDYAFFIKALLDLQTCSPEQNKWLERAIALQEEFDEY 537

Query: 528 FLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
               E GGY+NT+ +    +++R +   D A PS N V++ NLVRLA     + + +Y  
Sbjct: 538 LWSVELGGYYNTSSDASQDLIVRERSYVDNATPSANGVAIANLVRLALF---TDNLHYLD 594

Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 646
            AE  L  F + +     A P +  A D                               +
Sbjct: 595 LAEQGLNAFRSVMNSTPQACPSLFTALD-------------------------------W 623

Query: 647 DLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
             N T+I      TE++         +   A  +   D  V LVCQ   C P  +   SL
Sbjct: 624 YRNSTLIR---TTTEQLHSLMSQYLPSVVFAIASKLPDNSVGLVCQGLKCLPAAS---SL 677

Query: 707 ENLL 710
           E +L
Sbjct: 678 EQML 681


>gi|310797732|gb|EFQ32625.1| hypothetical protein GLRG_07639 [Glomerella graminicola M1.001]
          Length = 811

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 205/650 (31%), Positives = 319/650 (49%), Gaps = 81/650 (12%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H     CH+  +   E F     A +LN+ F+ + +DREERP++D +YM YVQA+ G GG
Sbjct: 78  HIGFKACHYSRLTSTECFTHRECAAILNESFIPVIIDREERPELDTIYMNYVQAVSGSGG 137

Query: 76  WPLSVFLSPDLKPLMGGTYFPP-------EDKYGRPGFKTILRKVKDAWDKKRDMLAQSG 128
           WPL++FL+P+L+P+ GGTY+P         D   R  F  ILRK++  W ++     Q  
Sbjct: 138 WPLNLFLTPELEPVFGGTYYPAPGPNNGGSDDEDRLDFLAILRKLQKVWREQEGRCRQEA 197

Query: 129 AFAIEQL--------------------SEALSASASSNKL------------PDELPQNA 156
              + +L                    S+ ++   S   L              EL  + 
Sbjct: 198 KEVVVKLHDFAAEGTLGTATVQPGVAGSQTIAIGRSETGLEHPGTGRTAAAVSSELDLDL 257

Query: 157 LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL---EDTGKSGEASEGQKMV 213
           L      ++ ++D  +GGFG APKFP P ++  +L   + L   +D     E +   +M 
Sbjct: 258 LEEAYSHIAGTFDPVYGGFGLAPKFPTPPKLSFLLRLPRYLAPVQDVVGESECAHATEMA 317

Query: 214 LFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT---- 268
           LFTL+ +    + DHVGG GF RYSV   W VP FEK++     L  +YLDA+ +     
Sbjct: 318 LFTLRKIRDSSLRDHVGGCGFARYSVTADWSVPRFEKLIAHNALLLGLYLDAWLIATGGE 377

Query: 269 KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
           K   +  +  +++DYL    I  P G   S+E ADS    G    +EGA+ +WT +E + 
Sbjct: 378 KGTEFYDVVVELVDYLSSPPISLPEGGFVSSEAADSYYRRGDRHMREGAYNLWTRREFDT 437

Query: 328 ILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 385
           ++G+   A L   ++ +   GN +  +  DP++EF  +N+L  + D S    + G+ +++
Sbjct: 438 VIGDDHEAALAASYWNVLEHGNVEPDQ--DPNDEFMNENILRVVKDVSEIGRQAGITVDE 495

Query: 386 YLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
              ++   ++KL   R K R RP +D K++   NGLVIS+  RA   L +          
Sbjct: 496 VKRVISSAKQKLKVHREKERVRPEVDAKIVAGRNGLVISALTRAGLALAT---------- 545

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
           V  +  +  +  A  AA FIR +L+DE+   L   +  G  +A G  +DYA+LI GL+ L
Sbjct: 546 VDAAKSQAAIASAGRAAEFIRANLWDEKERILYRIWNEGRGEAKGLAEDYAYLIEGLIGL 605

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLD--------------REGGGYFNTTGED-PSVLLR 549
           YE  +  +W+ +A ELQ  Q + F D              R   G F  T E+ P  +LR
Sbjct: 606 YEATADERWIEFADELQKVQIDTFYDSPSVGTSVLESPASRSSCGAFYITAENAPHTILR 665

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           +K+  D A PS N+VSV NL RL ++++    + Y   A  S+  FE  +
Sbjct: 666 LKDGMDTALPSTNAVSVSNLFRLGTMLS---DEAYTALARESINAFEAEI 712


>gi|433772248|ref|YP_007302715.1| thioredoxin domain protein [Mesorhizobium australicum WSM2073]
 gi|433664263|gb|AGB43339.1| thioredoxin domain protein [Mesorhizobium australicum WSM2073]
          Length = 675

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 200/557 (35%), Positives = 281/557 (50%), Gaps = 58/557 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFE++ VA ++N  FV+IKVDREERPD+D++YM  + ++   GGWPL++
Sbjct: 56  ACHWCHVMAHESFENDDVAAVMNRLFVNIKVDREERPDIDQIYMAALSSMGEQGGWPLTM 115

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+PD KP  GGTYFP E +YGRPGF  ++  V  AW +KR  L QS       +   LS
Sbjct: 116 FLTPDGKPFWGGTYFPREPRYGRPGFIQVMEAVDKAWREKRTSLHQSADGLTSHVEARLS 175

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A+ S   L  ++    L   A ++S   D   GG   APKFP    +Q +      L D 
Sbjct: 176 ATHSKALLDRDM----LSDLAGRVSGMIDRDRGGLAGAPKFPNAPFMQTLWL--SWLRD- 228

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
              G A+  +  VL +L+ M  GGI+DH+GGG  RYS D  W VPHFEKMLYD  QL   
Sbjct: 229 ---GNAAH-RDDVLVSLEHMLSGGIYDHIGGGLSRYSTDAEWLVPHFEKMLYDNAQLIRF 284

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
              A + T +  +     D + +L R+M   GG   ++ DADS         +EG FY W
Sbjct: 285 CNWALAATGNDLFRVRIEDTVGWLLREMRVEGGAFAASLDADS-------DGEEGLFYTW 337

Query: 321 TSKEVEDILGEHAILFKEHYYL-KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           +  E+E +LG+ + LF +++ L  P G             ++GK VL +    + S    
Sbjct: 338 SRGEIESVLGDDSTLFFKYFSLSSPPG-------------WEGKPVLHQ----TLSQQAF 380

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
           G+   + L  L   + +L  VR +R RP LD K +  WNGL+I++ A A + L       
Sbjct: 381 GVADRERLVPL---KTRLLTVREQRVRPGLDAKTLTDWNGLMIAALAEAGRSLA------ 431

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
                     R +++E A  A + I +   D    RL HS        P    DYA + +
Sbjct: 432 ----------RPDWIEAAAKAFAHIGKAGRD---GRLPHSMLGVRKLFPALSSDYAAMTN 478

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             + L+E      ++  A +     D    D EG GY+ T  +   V +R++ D D A P
Sbjct: 479 AAISLFEATEDWSYVEQASQFLGQLDHWHADVEGTGYYLTASDSTDVPIRIRGDVDEAIP 538

Query: 560 SGNSVSVINLVRLASIV 576
           S  S  +   VRLASI 
Sbjct: 539 SATSQIIEAQVRLASIT 555


>gi|295132488|ref|YP_003583164.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
 gi|294980503|gb|ADF50968.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
          Length = 678

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 192/584 (32%), Positives = 290/584 (49%), Gaps = 48/584 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFED  VA ++N  ++SIKVDREERPD+D+VYM  VQ + G GGWP+++ 
Sbjct: 53  CHWCHVMEHESFEDPEVADIMNAHYISIKVDREERPDIDQVYMQAVQLMTGSGGWPMNIV 112

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
             PD +P+ GGTYF  E       +K+ L +++  + K+   L        E L +    
Sbjct: 113 ALPDGRPVWGGTYFRKEQ------WKSALLQIQQIYKKESTQLTNYANKLKEGLQQLNLI 166

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
              +N    E  Q  L    E      D + GG  +APKF  P  +  +L ++ + +D  
Sbjct: 167 DIGNNSY--EFSQKRLGEFIEIWKPYLDMKLGGTKNAPKFMMPTNLDFLLRYAYQFKD-- 222

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                 + Q+ VL +L  ++ GG  DH+GGGF RYSVD+RWHVPHFEKMLYD  QL ++Y
Sbjct: 223 -----KKLQEYVLHSLDKISFGGTFDHIGGGFARYSVDDRWHVPHFEKMLYDNAQLLSLY 277

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
             A+ LT+D +Y  + +    ++  ++    G  +SA DADS   +G   ++EGAFY W 
Sbjct: 278 SKAYKLTQDHWYKEVIKKTARFIETELTDSTGAFYSALDADSENAKG--NQEEGAFYTWK 335

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
            +E+E++L     LF  ++ +   G  +            G  +L +         K  +
Sbjct: 336 KEELEELLASEFDLFSAYFNINARGYWE-----------NGNYILYKTEKDDDFTKKHNI 384

Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
            LE+         + L + R KR +P LDDK + SWN L ++ FA A             
Sbjct: 385 SLEELYQKKSNWTKILSEARKKRKKPGLDDKTLTSWNALSLNGFAEA------------- 431

Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
                 + +  Y+ +A   A FI ++  +   + L HS++N  SK   +L+DYAF I   
Sbjct: 432 ---YTATGKNHYLNIALKNAEFIIQNQLNPD-YSLFHSYKNKQSKINAYLEDYAFTIEAF 487

Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
           L LYE     KW+  +  L     E F ++E   +  T+ +D +++    E  D   P+ 
Sbjct: 488 LKLYEVTFDKKWIDISSHLTKYCFENFYNQENTLFNFTSKKDDALISTPIELTDNVIPAS 547

Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
           NSV   NL RL  +   S+   Y + +E  L V   ++    M 
Sbjct: 548 NSVMANNLFRLGRLTGTSR---YLEVSEKMLQVISGKIGSYPMG 588


>gi|86606925|ref|YP_475688.1| hypothetical protein CYA_2291 [Synechococcus sp. JA-3-3Ab]
 gi|86555467|gb|ABD00425.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
          Length = 701

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 207/625 (33%), Positives = 301/625 (48%), Gaps = 74/625 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDPEIAAFLNAHFLPIKVDREERPDLDSIYMQALQLMSGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+P DL P   GTYFP E ++GRPGF T+L+++   + +++D +       +  L+  
Sbjct: 108 VFLTPDDLVPFYAGTYFPVEPRFGRPGFLTVLQRILQFYRQEKDKIEDMKGQILAALT-T 166

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           LS     + +P +L ++ +      L+ +        G+  +FP     Q++L  ++   
Sbjct: 167 LSDLVPEDHIPPDLLRSGIPKIQPLLANA--------GAVQQFPMMPYAQLVLRSARFDP 218

Query: 199 DTGKSGEAS-------EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
             G  G  +        G  +VL        GGI DHV GGFHRY+VD  W VPHFEKML
Sbjct: 219 PEGIPGSPTALERAKERGMALVL--------GGIFDHVAGGFHRYTVDPTWTVPHFEKML 270

Query: 252 YDQGQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
           YD GQ+     + ++   +D       R  ++++ R+M  P G  ++A+DADS       
Sbjct: 271 YDNGQILEFLSELWAHGIQDAAIERAVRLTVEWVAREMTAPAGYFYAAQDADSFARREDA 330

Query: 311 RKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
             +EG FYVW  +E++D+L E      ++ ++L P GN        P      +    EL
Sbjct: 331 EPEEGEFYVWRWQELQDLLDEETFRALQQAFFLLPGGNFP----DRPGCIVLQRRQGGEL 386

Query: 370 NDSSASASKLGMPLEKYLNILGECRRKL-----FDVRSKRPR-------PHLDDKVIVSW 417
                +A    +   +Y    G   R+       D +S R +       P  D K+IVSW
Sbjct: 387 PPEVETALTTHLFRARY----GSTERRTPFPLAVDAQSARRQSWPGRIPPVTDTKMIVSW 442

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
           NGL+IS  ARA ++   E                +Y+ +A  AA FI       QT  L 
Sbjct: 443 NGLMISGLARAYQVFGEE----------------DYLRLALRAAQFILSQQRHPQTGSLL 486

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEF-------GSGTKWLVWAIELQNTQDELFLD 530
               +G ++ P   +DYA LI  LLDL++         S   WL  AI LQ   D    D
Sbjct: 487 RLNYDGTAQVPAQSEDYALLIKALLDLHQACLPRTGDPSSQYWLEAAIRLQQEMDTRLWD 546

Query: 531 REGGGYFNTTGED-PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
              GGYF +  +  P +L+R KE  D A P+ N V+V NLVRLA+I        Y + AE
Sbjct: 547 EARGGYFVSDAQSTPELLVREKEFQDNATPAANGVAVANLVRLAAITGDLD---YLERAE 603

Query: 590 HSLAVFETRLKDMAMAVPLMCCAAD 614
            +L  F   +       P +    D
Sbjct: 604 QALKTFAHIMSTQPRVCPSLFVGLD 628


>gi|297192427|ref|ZP_06909825.1| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
           25486]
 gi|297151361|gb|EDY61872.2| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
           25486]
          Length = 678

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 229/710 (32%), Positives = 329/710 (46%), Gaps = 102/710 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWCHV+  ESFED   A  +N+ FV+IKVDREERPDVD VYM  VQA  G GGWP+S
Sbjct: 54  SSCHWCHVLAHESFEDAETAAYMNEHFVNIKVDREERPDVDAVYMEAVQAATGQGGWPMS 113

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           V+++ D +P   GTYFPP  ++G P F+ +L  V DAW  +RD + +        L+ A 
Sbjct: 114 VWMTADGEPFYFGTYFPPAPRHGMPSFRQVLEGVSDAWTGRRDEVGEVAQRIASDLA-AR 172

Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           S     + +P  +EL Q  L      L++ YD R GGFG APKFP  + ++ +L H  + 
Sbjct: 173 SLVVGGDGVPGEEELAQALL-----GLTRDYDERHGGFGGAPKFPPSMVLEFLLRHHAR- 226

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L
Sbjct: 227 --TGAEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALL 280

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
             VY   +  T       +  +  D+L R++    G   SA DADS   +G     EGAF
Sbjct: 281 CRVYAHLWRATGSDLARRVALETADFLVRELRTSEGGFASALDADSDTADGG--HAEGAF 338

Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVWT  ++ ++LGE       E + +   G             F+  + ++ L    A A
Sbjct: 339 YVWTPAQLREVLGEEDGARAAELFAVTEEGT------------FEEGSSVLRLPHGEADA 386

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
                          + R++L   R +RPRP  DDKV+ +WNGL I++ A          
Sbjct: 387 ---------------DLRQRLLAAREERPRPGRDDKVVAAWNGLAIAALAETGAFFG--- 428

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHL-YDEQTHRLQHSFRNGPSKA-PGFLDD 493
                        R + +E A  AA   +R H+ ++    RL  + ++G   A  G L+D
Sbjct: 429 -------------RPDLVERATEAADLLVRVHMDFEAGGVRLHRTSKDGRLGANAGVLED 475

Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRV 550
           YA +  G L L   G    WL +A  L +    + +DR   EG   ++T   D   L+R 
Sbjct: 476 YADVAEGFLALAAVGGEGSWLEFAGFLLD----MVMDRFTGEGCALYDTA-HDAEPLIRR 530

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAG---SKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
            +D     P+ N+         A+++     + S+ +R  AE +L V    +K +    P
Sbjct: 531 PQD-----PTDNAAPSGWSAAAAALLLYSAHTGSEAHRTAAEGALGV----VKGLGPRAP 581

Query: 608 L-----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE 662
                 +  A  +L  P  + V +VG         +   A         V   +P D++E
Sbjct: 582 RFIGWGLAAAEALLDGP--REVAVVGRPGDPATRELHLTALMGTAPGAAVAVGEP-DSDE 638

Query: 663 MDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
                +    N S A          A VC+ F C  P TD   L   L +
Sbjct: 639 FPLLRDRPLVNGSSA----------AYVCRGFVCDSPTTDATELARKLTD 678


>gi|425470696|ref|ZP_18849556.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9701]
 gi|389883513|emb|CCI36064.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9701]
          Length = 692

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 211/623 (33%), Positives = 315/623 (50%), Gaps = 80/623 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTDEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
           L  SA   +    L + +L     + + +           P FP      + L  S+   
Sbjct: 164 LRQSAILPRAETNLAEPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             ED+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 224 DFEDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +E
Sbjct: 276 QIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335

Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+ + + D L    + L + ++ +   GN            F+G+NVL       
Sbjct: 336 GAFYVWSDRSLRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378

Query: 374 ASASKLGMPLEKYLNIL-------GECRRKLF-DVRSKRPRPHL----------DDKVIV 415
               +LG  +E  L+ L        + +  LF   R  +   ++          D K+IV
Sbjct: 379 RQGGELGKEIENILDKLFIRRYGSSQAQLALFPPARDNQEAKNVSWPGRIPAVTDTKMIV 438

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
           +WN L+IS  ARA          A+F+ P+       Y ++A  AA FI +H + D +  
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMATVAAEFILQHQWLDGRFQ 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
           RL +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   + 
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAEDE 539

Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           GGYFN T  D S+ L V+E    D A PS N +++ NL+RL+ +    +   Y   AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKA 595

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F T L++   A P +  A D
Sbjct: 596 LQSFSTILEESPTACPSLFVALD 618


>gi|334119055|ref|ZP_08493142.1| hypothetical protein MicvaDRAFT_2721 [Microcoleus vaginatus FGP-2]
 gi|333458526|gb|EGK87143.1| hypothetical protein MicvaDRAFT_2721 [Microcoleus vaginatus FGP-2]
          Length = 695

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 213/642 (33%), Positives = 313/642 (48%), Gaps = 110/642 (17%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ +KVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIAEYMNSHFIPVKVDREERPDIDSIYMQTLQMMTGQGGWPLN 107

Query: 80  VFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD + P  GGTYFP E +YGRPGF  +L+ ++  +D ++  +    A  +  L + 
Sbjct: 108 VFLTPDERVPFYGGTYFPVEPRYGRPGFLEVLQAIRRFYDTEKGKVEAFKAEILGNLQQT 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            + S  + +L  E+ Q  L L    ++        G    P FP      M+ Y    L 
Sbjct: 168 AALSGVTAELNREIFQKGLELNTGIVA--------GHNPGPSFP------MIPYAELALR 213

Query: 199 DTGKSGEASEGQKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ- 256
            T  + E+    K V       +A GGI+D VGGGFHRY+VD  W VPHFEKMLYD GQ 
Sbjct: 214 GTRFNFESKYDSKQVCTQRGLDLALGGIYDQVGGGFHRYTVDPTWTVPHFEKMLYDNGQI 273

Query: 257 ---LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
              LAN++     + +  F + I   + ++L+R+M  P G  ++A+DADS  T      +
Sbjct: 274 VEYLANLW--GAGIQEPAFETAIAGTV-EWLKREMTAPTGYFYAAQDADSFNTSEEVEPE 330

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGAFYVWT  E+E +L  E     K H+ +  +GN            F+GKNVL   +  
Sbjct: 331 EGAFYVWTYAELEQLLTPEELAEIKAHFTVSRSGN------------FEGKNVLQRRHPG 378

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVR-------------------------SKRPRP 407
             S            + +     KLF VR                           R   
Sbjct: 379 KLS------------DTVKTALAKLFQVRYGGNPDSVKTFPPARNNQEAKNESWPGRIPA 426

Query: 408 HLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH 467
             D K+I +WN LVIS  ARA+ +  +                 EY+E+A  AA+FI  +
Sbjct: 427 VTDTKMIAAWNSLVISGLARAAAVFGN----------------WEYLELAVKAANFILDN 470

Query: 468 LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE----FGSGTK---------WL 514
            + +   R Q    +G S      +DYA  +  LLDL++     G+G +         WL
Sbjct: 471 QWTD--GRFQRLNYDGHSAVTAQSEDYALFVKALLDLHQASLTLGNGEEAKQLPNSQFWL 528

Query: 515 VWAIELQNTQDELFLDREGGGYFNTTGEDPS--VLLRVKEDHDGAEPSGNSVSVINLVRL 572
             A+++Q   DE     E GGY+N T +D S  +L+R +   D A P+ N +++ +LVRL
Sbjct: 529 NKAVQVQEEFDEFLWSVELGGYYN-TAKDASGDLLVRERSYIDNATPAANGIAIASLVRL 587

Query: 573 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           A +  G   +Y  + A+  L  F + ++D   A P +  A D
Sbjct: 588 ALL--GPNLEYLDR-AQQGLQAFSSIVQDAPQACPSLLSAID 626


>gi|423065340|ref|ZP_17054130.1| hypothetical protein SPLC1_S240900 [Arthrospira platensis C1]
 gi|406713250|gb|EKD08422.1| hypothetical protein SPLC1_S240900 [Arthrospira platensis C1]
          Length = 686

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 205/619 (33%), Positives = 312/619 (50%), Gaps = 73/619 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERP++D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+P D  P  GGTYFP E +YGRPGF  +L+ + + +   ++ L       + QL ++
Sbjct: 108 VFLTPGDRIPFYGGTYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +         P EL ++ L+   E  +     + +GG    P+FP  +    M +   +L
Sbjct: 168 MILP------PSELTEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRL 216

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             + K     +G+   L   + +  GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 217 ISSPK----VDGKAACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272

Query: 258 ANVYLDAFS-LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                D +S   K   Y       +++L+R+M  P G  ++A+DADS  T      +EGA
Sbjct: 273 LEFLADLWSDGEKQPAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGA 332

Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV---------- 365
           FYVWT++E+E  L        +  + +  +GN            F+GK V          
Sbjct: 333 FYVWTNQELETFLSPAEFGELQAQFTVTKSGN------------FEGKTVLQRWNCDELE 380

Query: 366 -LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPR-PHL-DDKVIVSWNGLVI 422
            LIE   +   A + G P  +          +    R    R P + D K+IV+WN L+I
Sbjct: 381 PLIETALAKLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTDTKMIVAWNALMI 440

Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFR 481
           S  A+A+++L                D  EY+E+A  AA F+  H + D++ HR+ +   
Sbjct: 441 SGLAKAARVL----------------DNSEYLELATKAAKFVLEHQWVDDRFHRVNY--- 481

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFG-----SGTKWLVWAIELQNTQDELFLDREGGGY 536
           +G        +DYA LI  L+DL++           WL  A+++QN  D+     E GGY
Sbjct: 482 DGKVAVLSQSEDYALLIKALIDLHQASLQHPELADFWLTNAVKVQNEFDQYLWSVELGGY 541

Query: 537 FNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           FNT  +D  ++L+R +   D A P+ N V++ NLVRL  +   ++   Y   A  +L  F
Sbjct: 542 FNTALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRALQALEAF 598

Query: 596 ETRLKDMAMAVPLMCCAAD 614
            + ++    A P +  A D
Sbjct: 599 ASVMRQSPQACPSLFVAFD 617


>gi|254409993|ref|ZP_05023773.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183029|gb|EDX78013.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 695

 Score =  306 bits (783), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 223/714 (31%), Positives = 337/714 (47%), Gaps = 121/714 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDPAIAQYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P D  P  GGTYFP E +YGRPGF  +L+ ++  +D ++  L       +  L ++
Sbjct: 108 IFLTPEDRVPFYGGTYFPVEPRYGRPGFLQVLQAIRRFYDVEKTKLQNFKDEILGHLQQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           +   AS      +L    LR   ++  +  DS  G +G  P FP      + L   +  E
Sbjct: 168 VLLPASG-----QLTAELLRQGMDKTIRIVDS--GSYG--PSFPMIPYADLALRGIRFQE 218

Query: 199 DTGKSG-EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            T     +AS  + + L      AKGGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 219 MTEVDAYQASRSRGLDL------AKGGIYDHVAGGFHRYTVDATWTVPHFEKMLYDNGQI 272

Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                + +S+  K+  +       + +L R+M    G  ++A+DADS     A   +EGA
Sbjct: 273 VEYLANLWSVGIKEAAFERAISGTVQWLTREMTASSGYFYAAQDADSFTEPSAAEPEEGA 332

Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----ELN 370
           FYVW+  E++ +L  E     +E + + P GN            F+G+NVL      +L+
Sbjct: 333 FYVWSYAELQQLLTAEELAELQEQFTVTPEGN------------FEGQNVLQRRYSDQLS 380

Query: 371 DSSASA------SKLGMP---LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
           D+  +A      ++ G P   LE +         K  +   + P    D K+IV+WN L+
Sbjct: 381 DTLETALAKLFTARYGSPPDSLETFPPAQNNQEAKTKNWSGRIP-AVTDTKMIVAWNSLM 439

Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSF 480
           IS  ARA  + +                + EY+E+A +AA FI  + + D++ HRL +  
Sbjct: 440 ISGLARAYGVFR----------------KPEYLELATTAAKFILENQWVDQRFHRLNY-- 481

Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGT-------------KWLVWAIELQNTQDEL 527
             G +      +DYA  I  LLDL++   G               WL  AI++Q+  DE 
Sbjct: 482 -EGEASILAQSEDYALFIKALLDLHQASLGLATAQESSQSPIPDSWLEEAIKVQDEFDEY 540

Query: 528 FLDREGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
               E  GY+N   +    +L+R +   D A P+ N V++ NLVRL  +   +++  Y  
Sbjct: 541 LWSVELAGYYNAANDSSGDLLIRERSYTDNATPAANGVAIANLVRLTLL---TENLAYLD 597

Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAADML--SVPSRKHVVLVGHKSSVDFENMLAAAHA 644
            AE +L  F + +   + + P +  A D    S   R +V  +    +  F         
Sbjct: 598 RAEVALNAFSSVMNQSSQSCPSLFTALDWFRNSTLIRTNVAQILSLMTQYFP-------- 649

Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSP 698
                 T+  I+P+  E                         V LVCQ  SC P
Sbjct: 650 -----ATMYRIEPSLPE-----------------------NAVGLVCQGLSCKP 675


>gi|425465473|ref|ZP_18844782.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9809]
 gi|389832278|emb|CCI24243.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9809]
          Length = 692

 Score =  305 bits (782), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 208/622 (33%), Positives = 311/622 (50%), Gaps = 78/622 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
           L  SA   +    L   +L     + + +           P FP      + L  S+   
Sbjct: 164 LRQSAILPRAETNLAAPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYANLALQGSRFGD 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             +D+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 224 DFDDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +E
Sbjct: 276 QIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335

Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+  E+ D L    + L + ++ +   GN            F+G+NVL       
Sbjct: 336 GAFYVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378

Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
               +LG  +E  L+ L     G  + +L      R                  D K+IV
Sbjct: 379 RQGGELGKEIEDMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
           +WN L+IS  ARA          A+F+ P+       Y ++A  AA FI +H + D +  
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMATVAAEFILKHQWLDGRFQ 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
           RL +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   + 
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAGDE 539

Query: 534 GGYFNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
           GGYFNT  +    ++LR +   D A PS N +++ NL+RL+ +    +   Y   AE +L
Sbjct: 540 GGYFNTASDHSLDLILRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKAL 596

Query: 593 AVFETRLKDMAMAVPLMCCAAD 614
             F T L++   A P +  A D
Sbjct: 597 QSFSTILEESPTACPSLFVALD 618


>gi|425459385|ref|ZP_18838871.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9808]
 gi|389822926|emb|CCI29290.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9808]
          Length = 692

 Score =  305 bits (782), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 209/623 (33%), Positives = 311/623 (49%), Gaps = 80/623 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++  A  +  L ++
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSKFTAEMLGALRQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
                +   L D    + L    E  +         +G  P FP      + L  S+   
Sbjct: 168 AILPRAETNLADP---SLLATGIETNTAVIQVNPNNYGR-PSFPMIPYSHLALQGSRFGD 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             ED+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 224 DFEDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +E
Sbjct: 276 QIVEYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335

Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+ + + D L    + L + ++ +   GN            F+G+NVL       
Sbjct: 336 GAFYVWSDRSLRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378

Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
               +LG  +E  L+ L     G  + +L      R                  D K+IV
Sbjct: 379 RQGGELGKEIENLLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
           +WN L+IS  ARA          A+F+ P+       Y +++  AA FI +H + D +  
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMSTQAAEFILQHQWLDGRFQ 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
           RL +    G +      +D+A+ I  LLDL       T+WL  AI+LQ   D  F   + 
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETRWLEAAIDLQGEFDRWFWAGDE 539

Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           GGYFN T  D S+ L V+E    D A PS N +++ NLVRL+ +    +   Y   AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKA 595

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F T L+    A P +  A D
Sbjct: 596 LQSFSTILEQSPTACPSLFVALD 618


>gi|300789899|ref|YP_003770190.1| hypothetical protein AMED_8085 [Amycolatopsis mediterranei U32]
 gi|384153415|ref|YP_005536231.1| hypothetical protein RAM_41535 [Amycolatopsis mediterranei S699]
 gi|399541779|ref|YP_006554441.1| hypothetical protein AMES_7963 [Amycolatopsis mediterranei S699]
 gi|299799413|gb|ADJ49788.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
 gi|340531569|gb|AEK46774.1| hypothetical protein RAM_41535 [Amycolatopsis mediterranei S699]
 gi|398322549|gb|AFO81496.1| hypothetical protein AMES_7963 [Amycolatopsis mediterranei S699]
          Length = 879

 Score =  305 bits (781), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 226/709 (31%), Positives = 326/709 (45%), Gaps = 94/709 (13%)

Query: 10  TKTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 65
            K R    L++     CHWCHVM  ESFED G A L+N  FV+IKVDREERPD+D VYM 
Sbjct: 257 AKRRNVPILLSVGYAACHWCHVMAHESFEDAGTAALMNANFVTIKVDREERPDIDAVYMA 316

Query: 66  YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA 125
             QA+ G GGWP++ FL+PD +P   GTY+PP  + G P F+ +L  V  +W ++ D L 
Sbjct: 317 ATQAMTGQGGWPMTCFLTPDGEPFHCGTYYPPSPRPGMPSFRQLLVAVVQSWQERPDELV 376

Query: 126 QSGAFAIEQLSEALSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRP 184
                 +  L+E       +  L + +   A+   A  +L +  D   GGFG APKFP  
Sbjct: 377 DGAKQIVAHLAE------QTGPLKESVVDEAVLAGAVGKLQQEADRVNGGFGRAPKFPPS 430

Query: 185 VEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 244
           + ++ +L H    E TG +   S    +V  T + MA+GG++D + GGF RYSVD  W V
Sbjct: 431 MVLEFLLRHH---ERTGSAVALS----LVDSTAEAMARGGLYDQLAGGFARYSVDAEWIV 483

Query: 245 PHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA 304
           PHFEKMLYD   L   Y   +  T       +     ++L   +  P G   S+ DAD+ 
Sbjct: 484 PHFEKMLYDNALLLRFYAHLWRRTGSATALRVATGTAEFLFESLRTPEGGFASSLDADTE 543

Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
             EG T       YVWT  ++ +++G+ +    E + +   G  +           +G +
Sbjct: 544 GVEGLT-------YVWTPAQLREVVGDDSA--AELFGVTKEGTFE-----------EGAS 583

Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
            L    D       L  P+          R KL + R+KRP+P  DDKVI SWNGL I++
Sbjct: 584 TLRLFGD-------LPEPM----------RVKLLEARAKRPQPGRDDKVIASWNGLAITA 626

Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG 483
            A A   L                DR +++E A  AA  + R H+ D    RL+ S R+G
Sbjct: 627 LAEAGVAL----------------DRPQWIEWAREAAELLLRVHVVD---GRLRRSSRDG 667

Query: 484 -PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTG 541
              ++ G L+DYA +  G L L++     KWL  A  L +     F   +  G YF+T  
Sbjct: 668 VVGESAGVLEDYACVADGFLALHQATGAAKWLTEATRLLDLALAHFASPDVPGAYFDTAD 727

Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
           +  +++ R  +  D A PSG S     L+  +++   + S  YR+ AE +L    +R   
Sbjct: 728 DAETLVQRPADPGDNASPSGASALAGALLTASALAGHADSGRYREAAERAL----SRAGV 783

Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
           +A  VP    A   LSV   +    V    +     +L AA         V+  +P D  
Sbjct: 784 LAGRVPRF--AGHWLSVAEARQAGPVQVAVAGASPELLRAAARGIHGGGVVLAGEP-DAP 840

Query: 662 EMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            +            +A          A VC+ + C  PVT    L   L
Sbjct: 841 GVPL----------LADRPLVDGAPAAYVCRGYVCDRPVTSAAELTARL 879


>gi|425435449|ref|ZP_18815900.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9432]
 gi|389679973|emb|CCH91261.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9432]
          Length = 692

 Score =  305 bits (781), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 212/623 (34%), Positives = 313/623 (50%), Gaps = 80/623 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
           L  SA   +    L   +L     + + +           P FP      + L  S+   
Sbjct: 164 LRQSAILPRAETNLADPSLLATGIETNTAVIQVNPNNYGRPSFPMIPYSHLALQGSRFGD 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             ED+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 224 DFEDSLQQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +E
Sbjct: 276 QIVEYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335

Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+ + + D L    + L + ++ +   GN            F+G+NVL       
Sbjct: 336 GAFYVWSDRSLRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378

Query: 374 ASASKLGMPLEKYLNIL-------GECRRKLF-DVRSKRPRPHL----------DDKVIV 415
               +LG  +E  L+ L        + +  LF   R  +   ++          D K+IV
Sbjct: 379 RQGGELGKEIENILDKLFIRRYGSSQAQLALFPPARDNQEAKNVSWPGRIPAVTDTKMIV 438

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
           +WN L+IS  ARA          A+F+ P+       Y ++A  AA FI +H + D +  
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMATQAAEFILQHQWLDGRFQ 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
           RL +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   + 
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAGDE 539

Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           GGYFN T  D S+ L V+E    D A PS N +++ NLVRL+ +    +   Y   AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKA 595

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F T L+    A P +  A D
Sbjct: 596 LQSFSTILEQSPTACPSLFVALD 618


>gi|217978724|ref|YP_002362871.1| hypothetical protein Msil_2586 [Methylocella silvestris BL2]
 gi|217504100|gb|ACK51509.1| protein of unknown function DUF255 [Methylocella silvestris BL2]
          Length = 691

 Score =  305 bits (781), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 226/705 (32%), Positives = 340/705 (48%), Gaps = 88/705 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE  A ++N+ FV+IKVDREERPD+D +YM  + A    GGWPL++
Sbjct: 55  ACHWCHVMAHESFEDEATAAVMNELFVNIKVDREERPDIDHIYMQALHAFGERGGWPLTM 114

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P  +P  GGTYFP  ++YGRP F T+LR V  A+ ++   +A +       L++A +
Sbjct: 115 FLTPKGEPFWGGTYFPKTEQYGRPAFVTVLRTVAHAFHEEPHRIAANVGAVRRNLTKAPT 174

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           AS     L        +   A QL  + D+  GG   APKFP    I  ML+ +      
Sbjct: 175 ASGGDFSLAQ------MDDIAAQLVTAIDTVDGGLKGAPKFPN-TPILEMLWRAG----- 222

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
            ++G A+  Q M L  L+ M++GGI+DH+GGG+ RYS D+RW VPHFEKMLYD  Q+   
Sbjct: 223 ARTGTAAYRQAMRL-ALEKMSEGGIYDHLGGGYARYSTDDRWLVPHFEKMLYDNAQILEC 281

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
               +   KD  +    R+ + +L R+M  PGG   ++ DADS   EG     EG FYVW
Sbjct: 282 LALCYDAFKDDLFLQRARETVAWLEREMTNPGGAFSASLDADS---EGI----EGKFYVW 334

Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
           T  E+ + LG + A  F + Y     GN       D H    G  +L  L  +  +A + 
Sbjct: 335 TFDELVEPLGADEARFFGKFYNAARIGN-----WVDAHYP-NGVTILNRLESARPTAEEE 388

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
                     L   R++LFD R  R  P LDDK++  WNGL+I++   A+ +        
Sbjct: 389 AR--------LAPLRQRLFDRREARVHPGLDDKIMADWNGLMIAALVNAATL-------- 432

Query: 440 MFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQT--HRLQHSFRNGPSKAPGFLDDYAF 496
                   +    ++ +A  A +FI    LY ++    RL HSFR G    PG   DY+ 
Sbjct: 433 --------TGEHRWIALAARAYNFIVATMLYRDEAGLTRLAHSFRAGVLVKPGLALDYST 484

Query: 497 LISGLLDLY------EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           ++   L LY      EF +   +L  A     T +   +D +         +   V++++
Sbjct: 485 MMRAALALYEVRNLKEFAATRDYLSDARAFAQTLEACHIDPDSRLITMAAKDAADVIVKL 544

Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV--PL 608
               D A P+ + V +  L+RLA  V+G +    R +A          +K M  ++   +
Sbjct: 545 APTADDAIPNAHPVYLGALIRLAG-VSGDQGALDRADA---------LIKAMGPSIRGNI 594

Query: 609 MCCAADMLSVPSR---KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
           +  A  + ++  R   + +V  G   +  +E  L A      +++ V+ +D  D      
Sbjct: 595 VGHAGTLNAIDLRLRVREIVTAGPARAPLYEAALGAPF----IDRIVMDLDRPD------ 644

Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
             E  + + + A+    A +  A VC   +CS P  D  +L  LL
Sbjct: 645 --EIPAAHPARAQAEL-AGEAAAFVCAGGACSLPARDVDALRQLL 686


>gi|320589398|gb|EFX01859.1| duf255 domain containing protein [Grosmannia clavigera kw1407]
          Length = 836

 Score =  305 bits (780), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 208/634 (32%), Positives = 310/634 (48%), Gaps = 71/634 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CH+CH    +SF    VA++LN  F+ I VDREERPD+D +Y  Y+Q +    GWP++VF
Sbjct: 97  CHYCHTTTQDSFSSPAVAEILNTSFIPIVVDREERPDIDAIYWNYLQLVNSSAGWPINVF 156

Query: 82  LSPDLKPLMGGTYFPPEDKYGRP-------------GFKTILRKVKDAW--------DKK 120
           L+P+L+P+ GGTY+P     G               GF  IL+K++ +W        ++ 
Sbjct: 157 LTPELEPVFGGTYWPGPGSEGSVRDGQEDGGEDEMIGFLGILKKLRQSWTDREAQCREEA 216

Query: 121 RDMLAQSGAFAIEQ-------LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG 173
           R+ + Q   FA E        L   ++  A       +L  + L     QL K++D   G
Sbjct: 217 RETVVQLRKFAAEGTLGPRGLLRPTVAEGAPYLSRDLDLDIDQLDDAYTQLKKTFDPVNG 276

Query: 174 GFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
           GFG  PKF  P +   +L        ++      EA    +M LFTL+ +   G+HDH+ 
Sbjct: 277 GFGVVPKFVTPAKYSFLLKLGSFPNVVQGIIGDAEAKNAVQMALFTLRKLQDSGLHDHLR 336

Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF----------SLTKDVFYSYICRDI 280
           GGF R S    W +PHFEK++ D   L ++YLDA+          +   D  ++ +   +
Sbjct: 337 GGFSRASHTINWTLPHFEKLVPDNALLLSLYLDAWLYGLRTSGTGAKGTDAEFADVVYAL 396

Query: 281 LDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH------- 332
            DYL    I   GG   S+E ADS    G    +EGA+YVWT +E + ++G         
Sbjct: 397 ADYLSSSPIRLEGGGFASSEAADSYYRRGDNHTREGAYYVWTRREFDAVVGGQRSENDLD 456

Query: 333 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 392
                 ++ +   GN D  R  DP++EF  +NVL    D+S  A + G+     L ++  
Sbjct: 457 TRAAAAYWNVLEHGNVD--REDDPNDEFINQNVLYVNKDASEVARQFGISRSDVLRVVKT 514

Query: 393 CRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 451
            ++KL   R K R RP  D KV V+ NG+VI++ AR   +L        F+ P  G   +
Sbjct: 515 SKKKLAAHREKERVRPAADRKVTVANNGVVIAALARVGAVLVHGG----FD-PANG---E 566

Query: 452 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSG 510
           +Y+  A SAA FI+ +L+D Q   L  ++  G      GF +DYA LI GLL+LYE    
Sbjct: 567 KYISAARSAARFIKANLWDVQDKCLFRTYSYGQKGTNCGFAEDYAVLIEGLLELYEATGE 626

Query: 511 TKWLVWAIELQNTQDELFLD----------REGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
            +WL WA +LQ  Q E F D             GG++ T+  +P  +LR+K+  D   P+
Sbjct: 627 LEWLQWADQLQQRQIEQFYDGVDMPPTSSHSASGGFYRTSEHEPFNILRIKDGMDTTLPA 686

Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
            N V+  NL RL S++   +  +  +   HS  V
Sbjct: 687 TNGVAASNLFRLGSLLGDEEYSHLARETIHSFEV 720


>gi|423133250|ref|ZP_17120897.1| hypothetical protein HMPREF9715_00672 [Myroides odoratimimus CIP
           101113]
 gi|371649306|gb|EHO14787.1| hypothetical protein HMPREF9715_00672 [Myroides odoratimimus CIP
           101113]
          Length = 667

 Score =  305 bits (780), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 192/560 (34%), Positives = 289/560 (51%), Gaps = 50/560 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ VA L+N+ F+SIKVDREE P +D  YM  +Q +   GGWPL+V
Sbjct: 48  TCHWCHVMEKESFENQEVADLMNEHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNV 107

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
              PD +P+ GGTYF       R  +   L ++   + +KRD +     FA  QL E +S
Sbjct: 108 VCLPDGRPIWGGTYFK------RQNWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGIS 157

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              S   +  E  +    L  E   KS+D  +GG+   PKF  P     +LY  KK    
Sbjct: 158 I-LSQAPIAQEDSRFNTELVLENWKKSFDWEYGGYTRTPKFMMPTN---LLYLQKK---- 209

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G      +  + +  TL  MA GG+ D V GGF RYSVD +WH+PHFEKMLYD  QL +V
Sbjct: 210 GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSV 269

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y D +  T +  Y  +    +D++  +     G  +SA DADS ++    + +EGAFYVW
Sbjct: 270 YADGYKRTHNKLYKEVIDKTIDFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYVW 327

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T +E+++++ +   LF   + +   G+ + S+            VLI+  +    A++  
Sbjct: 328 TIEELKELVQQDFPLFSTVFNINSFGHWENSQY-----------VLIQTRELIDIANENN 376

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +PLE   N   +    L   R+ RP+P LDDK + SWN + I+    A    ++ A    
Sbjct: 377 IPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA---- 432

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                       Y+E A++   FI  +L+ E+   L+ ++++G +K   FLDDYAF I G
Sbjct: 433 ------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQG 479

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG-GYFNTTGEDPSVLLRVKEDHDGAEP 559
           L+ L+E     +++  A  L +   + FLD E    YFN   ++ ++   + E  D   P
Sbjct: 480 LIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFNKHNQEDTITPAI-ETEDNVIP 538

Query: 560 SGNSVSVINLVRLASIVAGS 579
           S N++  +NL +L  +   S
Sbjct: 539 SSNAIMAMNLYKLGLLYENS 558


>gi|384135742|ref|YP_005518456.1| hypothetical protein TC41_2025 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
 gi|339289827|gb|AEJ43937.1| protein of unknown function DUF255 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
          Length = 626

 Score =  305 bits (780), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 209/602 (34%), Positives = 289/602 (48%), Gaps = 54/602 (8%)

Query: 27  VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 86
           +M  ESFEDE VA +LN+ +V+IKVDREERPD+D +YMTY QAL G GGWPL++ ++PD 
Sbjct: 1   MMAHESFEDEKVAAILNEHYVAIKVDREERPDIDHIYMTYCQALQGEGGWPLTIIMTPDG 60

Query: 87  KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 146
            P   GTYFP   +YG PG   IL+++   W   R  L ++     E++       A   
Sbjct: 61  YPFFAGTYFPKTPRYGPPGLIQILQEIARLWQTDRARLERASRSMAERMQPLFEGQAGEA 120

Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
           +  D   Q       + L  ++D  +GGFG APKFP    +Q +L +++   +       
Sbjct: 121 RGRDAADQ-----AYQALEAAFDHEYGGFGPAPKFPTFHRVQFLLRYARLRPN------- 168

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
                M L TL+ + +GGI DHVGGG  RYS D  W VPHFEKMLYD       Y DA+ 
Sbjct: 169 ERAAAMALSTLRAIQRGGIVDHVGGGMARYSTDPFWRVPHFEKMLYDNALALAAYADAYV 228

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
             KD  +    R  + +  R+M  P G  +SA DADSA         EG FY+W  ++V 
Sbjct: 229 HAKDPAFLRFVRQTVAFFDREMQSPEGLYYSAVDADSA-------GGEGRFYLWRPEDVI 281

Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLE 384
             LG E   LF   Y +   GN            F+G NV   ++ D +A A+  GM  E
Sbjct: 282 AALGPEDGELFNAFYDITEAGN------------FEGANVPNYIDQDPAAFAASRGMTEE 329

Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
           +    L +   KL  VR  R RP +DDK + +WN L+    ARA       A        
Sbjct: 330 ELWQKLDDLNAKLRAVRDGRERPAIDDKCLTAWNALMAYGLARAGLAFGEMA-------- 381

Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
                   ++  A    + I R L      RL   +R+G +    + DD+A+L++  L+L
Sbjct: 382 --------WVNRATEVVAAIERILVRPDDGRLLARYRDGEAGIFAYADDHAYLVAAYLEL 433

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV-KEDHDGAEPSGNS 563
           Y       +L  A   Q  QD LF D+  GGY    G D   L+ V K  +DGA PS NS
Sbjct: 434 YRATLDRAYLDRARHWQAVQDALFWDKAQGGY-TFYGRDAESLIAVPKPVYDGAMPSANS 492

Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
            S  NL  L ++   ++   Y    +  L  F   ++   M    +  AA M  V S + 
Sbjct: 493 QSAHNLWMLHALTGDAE---YADRLDALLRAFGGDIRSAPMDCLWLVTAAMMSEVGSTEI 549

Query: 624 VV 625
           V+
Sbjct: 550 VI 551


>gi|427718285|ref|YP_007066279.1| hypothetical protein Cal7507_3032 [Calothrix sp. PCC 7507]
 gi|427350721|gb|AFY33445.1| hypothetical protein Cal7507_3032 [Calothrix sp. PCC 7507]
          Length = 690

 Score =  305 bits (780), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 214/628 (34%), Positives = 310/628 (49%), Gaps = 87/628 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDLAIAQYMNTNFLPIKVDREERPDLDSIYMQALQMMNGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
           VFLSP DL P   GTYFP E +YGRPGF  +L+ ++  +D + + L Q  A  +E L  S
Sbjct: 108 VFLSPEDLVPFYAGTYFPLEPRYGRPGFLQVLQAIRRYYDTETEDLRQRKAVIVESLLTS 167

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             L   ++ +   +EL +     C   ++               FP      M+ Y    
Sbjct: 168 AVLQDGSTQDIQENELLRQGWETCTGVITPHQQGN--------SFP------MIPYAELA 213

Query: 197 LEDTGKS-GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           L  T  +     +G+++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 214 LRGTRFNFASHYDGKQICQQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNG 273

Query: 256 QLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q+     + +S  + +  F   I + + ++L+R+M  P G  ++A+DADS     A   +
Sbjct: 274 QIVEYLANLWSAGVQEPAFARAIAKTV-EWLQREMTAPAGYFYAAQDADSFINPTAVEPE 332

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGAFYVWT  E+  +L  E     ++ + + P GN            F+ KNVL  L+  
Sbjct: 333 EGAFYVWTYSELAKLLTPEELTELQQQFTVTPHGN------------FESKNVLQRLH-- 378

Query: 373 SASASKLGMPLEKYLNILGECRRKL-------FDVRSK-----------RPRPHLDDKVI 414
              + +L   LEK L  L + R  +       F   S            R     D K+I
Sbjct: 379 ---SGELSKTLEKALGKLFKARYGITPESLDTFPPASNNQEAKTNNWPGRIPSVTDTKMI 435

Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQT 473
           V+WN L+IS  ARAS +         F  P+       Y+++A  AA+FI      D + 
Sbjct: 436 VAWNSLMISGLARASGV---------FQQPL-------YLQIAARAANFIWDNQFVDGRF 479

Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG------SGTKWLVWAIELQNTQDEL 527
           HRL +    G        +DYA  I  LLDL++        S + WL  AI LQ+  D  
Sbjct: 480 HRLNYV---GQPNVLAQSEDYALFIKALLDLHQATLLIGNESASFWLEKAIALQDEFDAY 536

Query: 528 FLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
               E GGY+N + +    +++R +   D A PS N V++ NLVRL  +   + + +Y  
Sbjct: 537 LWSVELGGYYNASIDASQDLIVRERSYADNATPSANGVAIANLVRLTLL---TDNLHYLD 593

Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAAD 614
            AE  L  F+T +     A P +  A D
Sbjct: 594 LAEQGLKAFKTVMSRSPQACPSLFTALD 621


>gi|164422571|ref|XP_957963.2| hypothetical protein NCU09980 [Neurospora crassa OR74A]
 gi|157069724|gb|EAA28727.2| hypothetical protein NCU09980 [Neurospora crassa OR74A]
          Length = 827

 Score =  305 bits (780), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 209/660 (31%), Positives = 326/660 (49%), Gaps = 97/660 (14%)

Query: 23  HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
           H   + +  SF +  VA  LN  F+ + +DR+ERPD+D +Y  Y +A+   GGWPL++FL
Sbjct: 126 HIGFLADHHSFSNNAVAAFLNSSFIPVIIDRDERPDLDTIYQNYSEAVNATGGWPLNLFL 185

Query: 83  SPDLKPLMGGTYFP------------------------PEDKYGRPG-------FKTILR 111
           +PDL P+ GGTY+P                        PE      G       F  I +
Sbjct: 186 TPDLYPIFGGTYWPGPGTEHSLAAARGGASGVGGVAATPEASSINGGGEESYNDFLAIAK 245

Query: 112 KVKDAWDKKRDM--------------LAQSGAFA---IEQLSEALSASASSNKLPDELPQ 154
           K+   W ++ +                AQ G F+    E +      +A+  +   +L  
Sbjct: 246 KIHKFWVEQEERCRREAFEMLHKLQDFAQEGTFSGTPAEPVPVVAPVAAADVEAGADLDL 305

Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKSGEASEGQK 211
           + L    +++ K +D    GFG+ PKFP P  +  +L  +   +++ D     E      
Sbjct: 306 DQLDEALDRIFKMFDPVDCGFGT-PKFPNPARLSFLLRLAQFPREVRDVVGDKEVENAAS 364

Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF------ 265
           M   TL+ +  GG+ DHVG GF R+SV   W +PHFEKM+ +   L  VYLDA+      
Sbjct: 365 MARSTLRRIRDGGLRDHVGAGFMRFSVTSDWSMPHFEKMVGENALLLGVYLDAWLGRVQS 424

Query: 266 -----SLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYV 319
                 L+ +  ++ +  D+ DYL   +I   GG   ++E ADS   +G    +EGA+Y+
Sbjct: 425 SAAETRLSLEDEFADVVIDLADYLTSPLIQFSGGGFVTSEAADSFYRKGDRHMREGAYYL 484

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL--NDSSASAS 377
           WT +E +D++G          Y     + ++ R  DPH+EF  +NVL  +   D+ A + 
Sbjct: 485 WTRREFDDVVGPAGSAEVAAAYWNVLEDGNIPRDQDPHDEFINQNVLCSVWGKDTQALSK 544

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           + G+P+     I+ + R +L   R + RPRP  D+KV+V  NG+VIS+ AR + +++   
Sbjct: 545 QFGIPVNDVKKIIAKARERLRAHREQERPRPARDEKVVVGVNGMVISALARTAAVVRE-- 602

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE---QTHRLQHSFR-NGPSKAPGFLD 492
                   +  +  ++Y+E A+ AA+FI+ +L+ +   Q+ ++   F  N PS    F D
Sbjct: 603 --------LDKTKSQKYLEAAQQAAAFIKENLWVQDGTQSRKVLKRFWFNQPSDTRAFAD 654

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE------------GGGYFNTT 540
           DYAFLI GLLDLYE     KWLVWA ELQ+ Q ELF D               GG+++T 
Sbjct: 655 DYAFLIEGLLDLYEATLEVKWLVWAKELQDVQSELFYDTPVVGSTPSLRHSYTGGFYSTE 714

Query: 541 GEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
               S  +LR+K   D ++PS N+VS  NL RL +I+   +  + RQ  E ++  FE  +
Sbjct: 715 EATLSHTILRLKSGMDKSQPSTNAVSASNLFRLGTIL--DEKPFIRQAIE-TINAFEAEI 771


>gi|425446506|ref|ZP_18826509.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9443]
 gi|389733246|emb|CCI02963.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9443]
          Length = 689

 Score =  304 bits (779), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 214/621 (34%), Positives = 313/621 (50%), Gaps = 76/621 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           L  SA   +    L   +L     E+ +         +G  P FP      + L  S+  
Sbjct: 164 LRQSAILPRAETNLAAPSLLATGIEKNTAVIRVNPNNYGR-PSFPMIPYSHLALQGSRFG 222

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
           ED   S   +  Q+      + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 223 EDFDDSLRQAAYQRG-----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQI 277

Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +EGA
Sbjct: 278 VEYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGA 337

Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FYVW+  E+ D L    + + + ++ +   GN            F+G+NVL         
Sbjct: 338 FYVWSDLELRDYLSTEELGVLQANFTVTAEGN------------FEGRNVL-----QRRQ 380

Query: 376 ASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSW 417
             +LG  +E  L+ L     G  + +L      R                  D K+IV+W
Sbjct: 381 GGELGEEIENMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAW 440

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRL 476
           N L+IS  ARA          A+F  P+       Y ++A  AA FI +H + D +  RL
Sbjct: 441 NSLMISGLARA---------FAVFGEPL-------YWQMAAQAAEFILKHQWLDGRFQRL 484

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGG 535
            +    G +      +D+A+ I  LLDL       T+WL  AI+LQ   D  F   + GG
Sbjct: 485 NY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETRWLEAAIDLQGEFDRWFWAEDEGG 541

Query: 536 YFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
           YFNT   D S+ L V+E    D A PS N +++ NL+RL+ +    +   Y   AE +L 
Sbjct: 542 YFNTAS-DHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQ 597

Query: 594 VFETRLKDMAMAVPLMCCAAD 614
            F T L+    A P +  A D
Sbjct: 598 SFSTILEQSPTACPSLFVALD 618


>gi|428211294|ref|YP_007084438.1| thioredoxin domain-containing protein [Oscillatoria acuminata PCC
           6304]
 gi|427999675|gb|AFY80518.1| thioredoxin domain protein [Oscillatoria acuminata PCC 6304]
          Length = 691

 Score =  304 bits (779), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 209/622 (33%), Positives = 310/622 (49%), Gaps = 80/622 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F  E +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSSEAIASYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP E +YGRPGF  +L+ ++  +D ++  LA      +  L +A
Sbjct: 108 IFLTPDDLIPFYGGTYFPVEPRYGRPGFLELLQAIRRYYDLEKGKLAAFKEEIMGHLQQA 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            +   + + LP+EL    L      ++      +G     P FP      MM Y    L+
Sbjct: 168 ATLPGTED-LPEELLWKGLETSVTVIAH---REYG-----PSFP------MMPYAQVVLQ 212

Query: 199 DTGKSGEASEGQKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ- 256
            T    E+   ++  +      +A GGI+D V GGFHRY+VD  W VPHFEKMLYD GQ 
Sbjct: 213 STRFDRESEYDERSAIAQRGIDLASGGIYDAVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272

Query: 257 ---LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
              LAN++ +     ++  + +     + +L+R+M  P G  ++A+DADS  T      +
Sbjct: 273 VEFLANLWSEGI---QEPGFEWAVAGTIQWLKREMTAPEGYFYAAQDADSFITPEDKEPE 329

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGAFYVWT +E+E +L  E      + ++L P GN            F+GK VL   N  
Sbjct: 330 EGAFYVWTYQELERLLTVEEFTALNQEFFLSPEGN------------FEGKIVLKRTNLQ 377

Query: 373 SASAS-----------KLGMPLEKYLNILGECRR---KLFDVRSKRPRPHLDDKVIVSWN 418
           + S +           + G   E        C     K  +   + P P  D K+IV+WN
Sbjct: 378 ALSPTVETALAKLFKVRYGALPEAVKTFPPACNNHEAKTHNWPGRIP-PVTDPKMIVAWN 436

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQ 477
            L+IS  ARA+ +  +                 EY  +A +AA+FI  H + E + HRL 
Sbjct: 437 SLMISGLARAAVVFGN----------------GEYATLATTAANFILDHQWVEGRFHRLN 480

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEF----GSGTKWLVWAIELQNTQDELFLDREG 533
           +   +G +      +DYA  I  LLDL +      S + WL  AI++Q   DE     E 
Sbjct: 481 Y---DGQAAVLAQSEDYALFIKALLDLEQMEQVHPSNSNWLEKAIQVQEEFDEFLWSVEL 537

Query: 534 GGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
           GGYFNT  +  S +++R +   D A P+ N V++ +L+RL+     ++   Y   A ++L
Sbjct: 538 GGYFNTAKDSSSDLIVRERSYTDNATPAANGVAIASLIRLSMF---TEDLSYLDRAFNAL 594

Query: 593 AVFETRLKDMAMAVPLMCCAAD 614
             F   +     A P +  A D
Sbjct: 595 KSFGAIMDRAPSACPSLFAALD 616


>gi|209523771|ref|ZP_03272324.1| protein of unknown function DUF255 [Arthrospira maxima CS-328]
 gi|209495803|gb|EDZ96105.1| protein of unknown function DUF255 [Arthrospira maxima CS-328]
          Length = 686

 Score =  304 bits (779), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 204/619 (32%), Positives = 311/619 (50%), Gaps = 73/619 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERP++D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+P D  P  GGTYFP E +YGRPGF  +L+ + + +   ++ L       + QL ++
Sbjct: 108 VFLTPGDRIPFYGGTYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
           +         P EL ++ L+   E  +     + +GG    P+FP  +    M +   +L
Sbjct: 168 MILP------PSELTEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRL 216

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
             + K     +G+   L   + +  GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 217 ISSPK----VDGKAACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272

Query: 258 ANVYLDAFS-LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                D +S   K   Y       +++L+R+M  P G  ++A+DADS  T      +EGA
Sbjct: 273 LEFLADLWSDGEKQPAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGA 332

Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV---------- 365
           FYVWT++E+E  L        +  + +  +GN            F+GK V          
Sbjct: 333 FYVWTNQELETFLSPAEFGELQAQFTVTKSGN------------FEGKTVLQRWNCDELE 380

Query: 366 -LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPR-PHL-DDKVIVSWNGLVI 422
            LIE   +   A + G P  +          +    R    R P + D K+IV+WN L+I
Sbjct: 381 PLIETALAKLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTDTKMIVAWNALMI 440

Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFR 481
           S  A+A+++L                D  EY+E+A  AA F+  H + D++ HR+ +   
Sbjct: 441 SGLAKAARVL----------------DNSEYLELATKAAKFVLEHQWVDDRFHRVNY--- 481

Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFG-----SGTKWLVWAIELQNTQDELFLDREGGGY 536
           +G        +DYA  I  L+DL++           WL  A+++QN  D+     E GGY
Sbjct: 482 DGKVAVLSQSEDYALFIKALIDLHQASLQHPELADFWLTNAVKVQNEFDQYLWSVELGGY 541

Query: 537 FNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
           FNT  +D  ++L+R +   D A P+ N V++ NLVRL  +   ++   Y   A  +L  F
Sbjct: 542 FNTALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRALQALEAF 598

Query: 596 ETRLKDMAMAVPLMCCAAD 614
            + ++    A P +  A D
Sbjct: 599 ASVMRQSPQACPSLFVAFD 617


>gi|186686249|ref|YP_001869445.1| hypothetical protein Npun_R6218 [Nostoc punctiforme PCC 73102]
 gi|186468701|gb|ACC84502.1| protein of unknown function DUF255 [Nostoc punctiforme PCC 73102]
          Length = 685

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 208/618 (33%), Positives = 308/618 (49%), Gaps = 72/618 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  +N  ++ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDSAIADYMNANYLPIKVDREERPDLDSIYMQALQMMSGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FLSP DL P   GTYFP + +YGRPGF  +L+ ++  +D ++  L Q  A  IE L   
Sbjct: 108 IFLSPEDLVPFYAGTYFPVDPRYGRPGFLQVLQALRRYYDTEKAELQQRKALIIESL--- 164

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG---SAPKFPRPVEIQMMLYHSK 195
           L+++   +   DEL    L      L + +++  G      S   FP      M+ Y   
Sbjct: 165 LTSAVLQDGTTDELEDREL------LRQGWETSTGVITPGQSGNSFP------MIPYTEL 212

Query: 196 KLEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
            L  T  + E+  +G+++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD 
Sbjct: 213 ALRGTRFNFESRYDGKQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDN 272

Query: 255 GQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           GQ+     + +S   ++  +       + +L+R+M  P G  ++++DADS     A   +
Sbjct: 273 GQIVEYIANLWSAGVQEPAFERAVAVTVQWLKREMTAPEGYFYASQDADSFTEPTAVEPE 332

Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGAFYVW+  EV+ +L  E     ++ + + P GN            F+G+NVL   N  
Sbjct: 333 EGAFYVWSYSEVQQLLTPEELTELQQQFTVTPNGN------------FEGRNVLQRRNSG 380

Query: 373 SASAS-----------KLGMPLEKYLNILGECRRKLFDVRSKRPR-PHL-DDKVIVSWNG 419
             SA+           + G+  E        C  +     +   R P + D K+IV+WN 
Sbjct: 381 KLSATLETSLSKLFTARYGVSSELLETFPPACNNQEAKTTNWPGRIPSVTDTKMIVAWNS 440

Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQH 478
           L+IS  A+A+ +         F  P+       Y+E+A  AA+FI      D +  RL +
Sbjct: 441 LMISGLAKAAGV---------FQQPL-------YLELAARAANFILENQFVDGRFQRLNY 484

Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYF 537
               G        +DYAF +  LLDL       K WL  AI +Q+   E     E GGYF
Sbjct: 485 ---QGEPTVLAQSEDYAFFVKALLDLQASNPEHKQWLENAIAIQDEFTEFLWSVELGGYF 541

Query: 538 NTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
           NT+ +    +++R +   D A PS N +++ NLVRLA +        Y   AE  L  F+
Sbjct: 542 NTSSDSSQDLIVRERSYADNATPSANGIAIANLVRLALLTDNLD---YLDLAELGLKAFK 598

Query: 597 TRLKDMAMAVPLMCCAAD 614
           + +     A P +  A D
Sbjct: 599 SVMHRAPQACPSLFTALD 616


>gi|166365023|ref|YP_001657296.1| six-hairpin glycosidase-like [Microcystis aeruginosa NIES-843]
 gi|166087396|dbj|BAG02104.1| six-hairpin glycosidase-like [Microcystis aeruginosa NIES-843]
          Length = 692

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 211/623 (33%), Positives = 311/623 (49%), Gaps = 80/623 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
           L  SA   +    L   +L     + + +           P FP      + L  S+   
Sbjct: 164 LRQSAILPRSETNLAAPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             +D+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 224 DFDDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +E
Sbjct: 276 QIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335

Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+  E+ D L    + L + ++ +   GN            F+G+NVL       
Sbjct: 336 GAFYVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378

Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
               +LG  +E  L+ L     G  + +L      R                  D K+IV
Sbjct: 379 RQGGELGKEIENMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
           +WN L+IS  ARA          A+F  P+       Y ++A  AA FI +H + D +  
Sbjct: 439 AWNSLMISGLARA---------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQ 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
           RL +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   + 
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDE 539

Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           GGYFN T  D S+ L V+E    D A PS N +++ NL+RL+ +    +   Y   AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKA 595

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F T L++   A P +  A D
Sbjct: 596 LQSFSTILEESPTACPSLFVALD 618


>gi|336464974|gb|EGO53214.1| hypothetical protein NEUTE1DRAFT_126582 [Neurospora tetrasperma
           FGSC 2508]
          Length = 827

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 209/660 (31%), Positives = 325/660 (49%), Gaps = 97/660 (14%)

Query: 23  HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
           H   + +  SF +  VA  LN  F+ + +DR+ERPD+D +Y  Y +A+   GGWPL++FL
Sbjct: 126 HIGFLADHHSFSNNAVAAFLNSSFIPVIIDRDERPDLDTIYQNYSEAVNATGGWPLNLFL 185

Query: 83  SPDLKPLMGGTYFP------------------------PEDKYGRPG-------FKTILR 111
           +PDL P+ GGTY+P                        PE      G       F  I +
Sbjct: 186 TPDLYPIFGGTYWPGPGTEHSLAAARGGASGVVGGAATPEASSINGGGEESYNDFLAIAK 245

Query: 112 KVKDAWDKKRDM--------------LAQSGAFA---IEQLSEALSASASSNKLPDELPQ 154
           KV   W ++ +                AQ G F+    E +      +A+  +   +L  
Sbjct: 246 KVHKFWVEQEERCRREAFEMLHKLQDFAQEGTFSGTPAEPVPVVAPVAAADVEAGADLDL 305

Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKSGEASEGQK 211
           + L    +++ K +D    GFG+ PKFP P  +  +L  +   +++ D     E      
Sbjct: 306 DQLDEALDRIFKMFDPVDCGFGT-PKFPNPARLSFLLRLAQFPREVRDVVGDKEVENAAS 364

Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF------ 265
           M   TL+ +  GG+ DHVG GF R+SV   W +PHFEKM+ +   L  VYLDA+      
Sbjct: 365 MARSTLRRIRDGGLRDHVGAGFMRFSVTSDWSMPHFEKMVGENALLLGVYLDAWLGRVQS 424

Query: 266 -----SLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYV 319
                 L+ +  ++ +  D+ DYL   +I   GG   ++E ADS   +G    +EGA+Y+
Sbjct: 425 SAAETRLSLEDEFANVVIDLADYLTSPLIQSSGGGFITSEAADSFYRKGDRHMREGAYYL 484

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL--NDSSASAS 377
           WT +E +D++G          Y     + ++ R  DPH+EF  +NVL  +   D  A + 
Sbjct: 485 WTRREFDDVVGPAGSAEVAAAYWNVLEDGNIPRDQDPHDEFINQNVLCSVWGKDIQALSK 544

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           + G+P+     ++ + R +L   R + RPRP  D+KV+V  NG+VIS+ AR + +++   
Sbjct: 545 QFGIPVNDVKKMIAKARERLRAHREQERPRPARDEKVVVGVNGMVISALARTAAVVRD-- 602

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE---QTHRLQHSFR-NGPSKAPGFLD 492
                   +  +  ++Y+E A+ AA+FI+ +L+ +   Q+ ++   F  N PS    F D
Sbjct: 603 --------LDKTKSQKYLEAAQRAATFIKENLWVQDGTQSRKVLKRFWFNQPSDTRAFAD 654

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG------------GGYFNTT 540
           DYAFLI GLLDLYE     KWLVWA ELQ+ Q ELF D               GG+++T 
Sbjct: 655 DYAFLIEGLLDLYEATLEVKWLVWAKELQDVQSELFYDTPAVGSTPSLRHSYTGGFYSTE 714

Query: 541 GEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
               S  +LR+K   D ++PS N+VS  NL RL +I+   +  + RQ  E ++  FE  +
Sbjct: 715 EATLSHTILRLKSGMDKSQPSTNAVSASNLFRLGTIL--DEKPFIRQAIE-TINAFEAEI 771


>gi|350297081|gb|EGZ78058.1| hypothetical protein NEUTE2DRAFT_101642 [Neurospora tetrasperma
           FGSC 2509]
          Length = 827

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 208/660 (31%), Positives = 324/660 (49%), Gaps = 97/660 (14%)

Query: 23  HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
           H   + +  SF +  VA  LN  F+ + +DR+ERPD+D +Y  Y +A+   GGWPL++FL
Sbjct: 126 HIGFLADHHSFANNAVAAFLNSSFIPVIIDRDERPDLDTIYQNYSEAVNATGGWPLNLFL 185

Query: 83  SPDLKPLMGGTYFP------------------------PEDKYGRPG-------FKTILR 111
           +PDL P+ GGTY+P                        PE      G       F  I +
Sbjct: 186 TPDLYPIFGGTYWPGPGTEHSLAAARGGASGVGGGAATPEVSSINGGGEESYNDFLAIAK 245

Query: 112 KVKDAWDKKRDM--------------LAQSGAFA---IEQLSEALSASASSNKLPDELPQ 154
           K+   W ++ +                AQ G F+    E +      +A+  +   +L  
Sbjct: 246 KIHKFWVEQEERCRREAFEMLHKLQDFAQEGTFSGTPAEPVPVVAPVAAADVEAGADLDL 305

Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKSGEASEGQK 211
           + L    +++ K +D    GFG+ PKFP P  +  +L  +   +++ D     E      
Sbjct: 306 DQLDEALDRIFKMFDPVDCGFGT-PKFPNPARLSFLLRLAQFPREVRDVVGDKEVENAAS 364

Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF------ 265
           M   TL+ +  GG+ DHVG GF R+SV   W +PHFEKM+ +   L  VYLDA+      
Sbjct: 365 MARSTLRRIRDGGLRDHVGAGFMRFSVTSDWSMPHFEKMVGENALLLGVYLDAWLGRVQS 424

Query: 266 -----SLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYV 319
                 L+ +  ++ +  D+ DYL   +I   GG   ++E ADS   +G    +EGA+Y+
Sbjct: 425 SAAETRLSLEDEFADVVIDLADYLTSPLIQSSGGGFITSEAADSFYRKGDRHMREGAYYL 484

Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL--NDSSASAS 377
           WT +E +D++G          Y     + ++ R  DPH+EF  +NVL  +   D  A + 
Sbjct: 485 WTRREFDDVVGPAGSAEVAAAYWNVLEDGNIPRDQDPHDEFINQNVLCSVWGKDIQALSK 544

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
           + G+P+     ++ + R +L   R + RPRP  D+KV+V  NG+VIS+ AR + +++   
Sbjct: 545 QFGIPVNDVKKMIAKARERLRAHREQERPRPARDEKVVVGVNGMVISALARTAAVVRD-- 602

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR----LQHSFRNGPSKAPGFLD 492
                   +  +  ++Y+E A+ AA+FI+ +L+ +   R    L+  + N PS    F D
Sbjct: 603 --------LDKTKSQKYLEAAQHAATFIKENLWVQDGTRSRKVLKRFWFNQPSDTRAFAD 654

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG------------GGYFNTT 540
           DYAFLI GLLDLYE     KWLVWA ELQ+ Q ELF D               GG+++T 
Sbjct: 655 DYAFLIEGLLDLYEATLEVKWLVWAKELQDVQSELFYDTPAVGSTPSLRHSYTGGFYSTE 714

Query: 541 GEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
               S  +LR+K   D ++PS N+VS  NL RL +I+   +  + RQ  E ++  FE  +
Sbjct: 715 EATLSHTILRLKSGMDKSQPSTNAVSASNLFRLGTIL--DEKPFIRQAIE-TINAFEAEI 771


>gi|300864691|ref|ZP_07109547.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300337297|emb|CBN54695.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 694

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 213/633 (33%), Positives = 312/633 (49%), Gaps = 93/633 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F +  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMENEAFSNAAIAEYMNAHFIPIKVDREERPDLDSIYMQALQMMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL P D  P  GGTYFP   +YGRPGF  +L  ++  +D ++  L    AF  E L+  
Sbjct: 108 IFLDPIDRIPFYGGTYFPVYPRYGRPGFLEVLHAIRRFYDLEKGKLQ---AFKEEILAHF 164

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
             ++A S    ++L    LR   E  +    +R  G    P FP      MM Y    L 
Sbjct: 165 QQSAALSGT--EKLSGKLLRRGLETSTAIISAREYG----PSFP------MMPYSESALR 212

Query: 199 DTGKSGEA-SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
               + E  S+ Q++       +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 213 GMRFNLEGKSDSQQVCTQRGLDLALGGIYDHVAGGFHRYTVDGTWTVPHFEKMLYDNGQI 272

Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                + +S   ++  +       +++L+R+MI P G  ++A+DAD+      T  +EGA
Sbjct: 273 VEYLANLWSAGVREPAFERAVAGTVEWLQREMIAPAGYFYAAQDADNFTNIEETEPEEGA 332

Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FYVW+  E+E++L  +     +E + +  TGN            F+ KNVL         
Sbjct: 333 FYVWSYSELENLLEADEFRELQEQFTVTQTGN------------FEAKNVL-----QRRH 375

Query: 376 ASKLGMPLEKYLNILGECR-------------------RKLFDVRSKRPRPHLDDKVIVS 416
             KL   LE  L  L + R                    K +D   + P    D K+IV+
Sbjct: 376 PGKLSSTLETALAKLFKVRYGAVPESVKVFPPARNNQEAKSYDWPGRIP-AVTDTKMIVA 434

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHR 475
           WN L+IS  ARA+ +                  + EY+E+A  AA+FI  + + D + HR
Sbjct: 435 WNSLMISGLARATAVFH----------------KSEYLELAAKAANFILDNQWIDGRFHR 478

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG---TK----------WLVWAIELQN 522
           L +   +G S      +DYA  +  LLDL++   G   TK          WL  A+++Q 
Sbjct: 479 LNY---DGKSAVMAQSEDYALFLKALLDLHQVSEGWLETKPDSFNLKPEVWLEKAVKIQE 535

Query: 523 TQDELFLDREGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 581
             DE     E GGY+NT  +  + +L+R +   D A P+ N V++ NLVRL  +    + 
Sbjct: 536 EFDEFLWSIEVGGYYNTASDASADLLVRERSYTDNATPAANGVAIANLVRLTLLTEDLQ- 594

Query: 582 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
             Y   AE  L  F + ++D   A P +  A D
Sbjct: 595 --YLDRAEQGLQAFSSVMQDSPQACPSLFAALD 625


>gi|428770863|ref|YP_007162653.1| hypothetical protein Cyan10605_2528 [Cyanobacterium aponinum PCC
           10605]
 gi|428685142|gb|AFZ54609.1| protein of unknown function DUF255 [Cyanobacterium aponinum PCC
           10605]
          Length = 676

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 228/712 (32%), Positives = 343/712 (48%), Gaps = 115/712 (16%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWC VME E+F D  +A  LND F+SIKVDREERPD+D +YMT +Q + G GGWPL++
Sbjct: 49  SCHWCTVMEGEAFSDGAIASYLNDNFISIKVDREERPDIDSIYMTALQMMTGQGGWPLNI 108

Query: 81  FLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-SEA 138
           FLSP DL P  GGTYFP E +YGRPGF  IL+ ++D +  K D         ++ L + +
Sbjct: 109 FLSPDDLVPFYGGTYFPIEPRYGRPGFLQILQALRDFYHDKSDKFISLKNEIVKGLETNS 168

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
                S N+L  EL Q  +   ++ ++++       +GS P+FP      MM Y +  L+
Sbjct: 169 NIIFTSENQLTPELLQQGIANNSKVIARN------DYGS-PRFP------MMPYSNITLQ 215

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ-- 256
              K     +   + +     +  GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G   
Sbjct: 216 GGVKDKNYRD---LAIRRALDLVNGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGLIM 272

Query: 257 --LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
             LAN++ +   +++       C  I D+L+R+M    G  ++A+DAD+         +E
Sbjct: 273 EFLANLWANGVEISE---IKRACEGIKDWLKREMTSEKGYFYAAQDADNFADIHHIEPEE 329

Query: 315 GAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           G FYVW+ +++++IL  E    F + + +   GN            F+ KNVL +  D S
Sbjct: 330 GEFYVWSYQQLKEILSAEEFNAFIDTFIISEDGN------------FESKNVLQKREDKS 377

Query: 374 ASASKLGMPLEKYLNI-LGECRRKL--------------FDVRSKRPRPHLDDKVIVSWN 418
            +   +   L+K   +  GE R  L              F    + P P  D K+I++WN
Sbjct: 378 IN-EIINNALDKLFKVRYGEERNSLEKFSPAKNNQEAKTFQWLGRIP-PVTDTKMILAWN 435

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQ 477
            L+IS  A A  + +  +                Y+++AE A  FI  H ++  + HRL 
Sbjct: 436 SLMISGLATAYGVFQDVS----------------YLDLAEKATEFILNHQWENGRLHRLN 479

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK--WLVWAIELQNTQDELFLDREGGG 535
           +    G        +DY+  I  LLDL +        +L  AI++Q   ++   D+E GG
Sbjct: 480 YE---GNVAVFAQSEDYSLFIKALLDLAQNHPTNTGFYLDQAIKIQAEFNQFCQDKEQGG 536

Query: 536 YFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           Y+N   ++ S +L+R K   D A PS N +++ NLVRL       K   Y   AE +L +
Sbjct: 537 YYNNAHDNSSDLLIREKSYIDNATPSPNGIAIANLVRLHLFTDEEK---YLDEAEKTLKL 593

Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
           F   +   + + P +  A +        ++     K++ D +  L   +    L  TVI 
Sbjct: 594 FSDIMNKASTSCPSLFTALNW-------YLNRTSVKTTKDTKLQLIQKY----LPNTVIR 642

Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
            D          EE  SN+             +A+VC+  SC  P T    L
Sbjct: 643 TD----------EELPSNS-------------IAIVCRGVSCFEPATTITQL 671


>gi|145593487|ref|YP_001157784.1| hypothetical protein Strop_0929 [Salinispora tropica CNB-440]
 gi|145302824|gb|ABP53406.1| protein of unknown function DUF255 [Salinispora tropica CNB-440]
          Length = 699

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 191/553 (34%), Positives = 271/553 (49%), Gaps = 44/553 (7%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESF DE VA LLN+ FV+IKVDREERPDVD VYMT  QA+ G GGWP++V
Sbjct: 48  ACHWCHVMAHESFADEQVAALLNEGFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTV 107

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F +PD  P   GTYFP      +P F  +L+ V  AW  +R  + Q GA  +E +  A +
Sbjct: 108 FAAPDGTPFFCGTYFP------KPNFLRLLQSVTTAWQDQRSAVLQQGAAVVEAIGGAQA 161

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
               S  L  +L    L   A++L + YD   GGFG APKFP  + +  +L   ++  D 
Sbjct: 162 VGGPSAPLTVDL----LDAAADRLGEEYDEANGGFGGAPKFPPHLNLLFLLRRYQRTGD- 216

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                     ++V  T + MA+GG+HD + GGF RY VD +W VPHFEKMLYD   L  V
Sbjct: 217 ------QRSLEIVRHTAEAMARGGLHDQLAGGFARYCVDGQWAVPHFEKMLYDNALLLRV 270

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y   + LT D     + RD   +L  ++  PG    SA DAD+   EG T       YVW
Sbjct: 271 YTHLWRLTGDPMARRVARDTARFLADELHRPGEGFASALDADADGVEGLT-------YVW 323

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  ++ + LGE    +    +            + P  E +      E    SAS  +L 
Sbjct: 324 TPAQLVEALGEEDGRWAADLFAVTEQGSFTPHAASPPGEARSG---AEAAAQSASVLRLA 380

Query: 381 MPLEKYLNIL----GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
             ++     +     E   +L  VR  RP+P  DDKV+ +WNGL I++ A   ++    A
Sbjct: 381 RDVDDATPEVQARWQEIAHRLLVVRDARPQPARDDKVVAAWNGLAITAIAEFQQVAAGYA 440

Query: 437 ESAMFNFPVVGSDRKEYMEVA------ESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
           E A    P   ++  E + +       ++A    R HL   +  R     R G  +A G 
Sbjct: 441 EDA----PGPDANLMEGVTIVADGAMRDAAEHLARVHLVAGRLRRTSRDGRVG--EAAGV 494

Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
           L+DY  +      +++     +WL+ A +L +   E F   + G +++T  +   ++ R 
Sbjct: 495 LEDYGCVAEAFCAMHQLTGEGRWLILAGQLLDVALERFAAPQ-GSFYDTADDAERLVSRP 553

Query: 551 KEDHDGAEPSGNS 563
            +  D A PSG S
Sbjct: 554 ADPTDNATPSGRS 566


>gi|423129587|ref|ZP_17117262.1| hypothetical protein HMPREF9714_00662 [Myroides odoratimimus CCUG
           12901]
 gi|371648637|gb|EHO14125.1| hypothetical protein HMPREF9714_00662 [Myroides odoratimimus CCUG
           12901]
          Length = 706

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 187/559 (33%), Positives = 286/559 (51%), Gaps = 48/559 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ VA L+N  F+SIKVDREE P +D  YM  +Q +   GGWPL+V
Sbjct: 87  TCHWCHVMEKESFENQEVADLMNQHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNV 146

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
              PD +P+ GGTYF       R  +   L ++   + +KRD +         QL E +S
Sbjct: 147 VCLPDGRPIWGGTYFK------RQNWIDSLSQLHHLYKEKRDTVLDFAT----QLQEGIS 196

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             + +    +E   N   L  E   KS+D  +GG+  APKF  P     +LY  KK    
Sbjct: 197 ILSQAPIAQEESRFNT-DLVLENWKKSFDWEYGGYTRAPKFMMPTN---LLYLQKK---- 248

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G      +  + +  TL  MA GG+ D V GGF RYSVD +WH+PHFEKMLYD  QL +V
Sbjct: 249 GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSV 308

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y D +  T +  Y  +    ++++  +     G  +SA DADS ++    + +EGAFY+W
Sbjct: 309 YADGYKRTHNKLYKEVIDKTINFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYIW 366

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T +E+++++ +   LF   + +   G+ +       +N++    VLI+  +    A++  
Sbjct: 367 TIEELKELVQQDFPLFSTVFNINSFGHWE-------NNQY----VLIQTRELIDIANENN 415

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +PLE   N   +    L   R+ RP+P LDDK + SWN + I+    A    ++ A    
Sbjct: 416 IPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA---- 471

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                       Y+E A++   FI  +L+ E+   L+ ++++G +K   FLDDYAF I G
Sbjct: 472 ------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQG 518

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           L+ L+E     +++  A  L +   + FLD E   ++ +       +    E  D   PS
Sbjct: 519 LIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETEDNVIPS 578

Query: 561 GNSVSVINLVRLASIVAGS 579
            N++  INL +L  +   S
Sbjct: 579 SNAIMAINLYKLGLLYENS 597


>gi|386845926|ref|YP_006263939.1| Spermatogenesis-associated protein 20 [Actinoplanes sp. SE50/110]
 gi|359833430|gb|AEV81871.1| Spermatogenesis-associated protein 20 [Actinoplanes sp. SE50/110]
          Length = 663

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 197/561 (35%), Positives = 283/561 (50%), Gaps = 63/561 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+ VA  LN  FV+IKVDREERPDVD VYMT  QA+ G GGWP++VF
Sbjct: 50  CHWCHVMAHESFEDDAVAAQLNADFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTVF 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
            +PD  P   GTYFP +       F  +L  V  AW  +RD + + GA  ++ +  A + 
Sbjct: 110 ATPDGDPFYCGTYFPKQQ------FTRLLTSVTAAWRDERDGVLKQGAAVVQAVGGAQAV 163

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
                 +  E+   A    A++    +D  +GGFG APKFP  + +  +L H   LE TG
Sbjct: 164 GGPVAAVTAEMLAAAAAGLAQE----HDQTYGGFGGAPKFPPHMNLLFLLRH---LERTG 216

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               ++E  ++V  T + MA+GGI+D + GGF RY+VDE W VPHFEKMLYD   L  VY
Sbjct: 217 ----SAEALELVRHTAERMARGGIYDQLAGGFARYAVDEHWTVPHFEKMLYDNALLLRVY 272

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
              + LT DV    +  +  ++L RD+  P G + SA DAD+   EG T       Y WT
Sbjct: 273 TQLWRLTGDVPARRVADETAEFLLRDLATPAGGLASALDADTDGVEGLT-------YAWT 325

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIELNDSSASASKLG 380
             E+ ++LG     +            DL R++ P   F+ G++VL+   D  A+   L 
Sbjct: 326 PAELTEVLGPDDGAWA----------ADLFRVT-PDGTFEHGRSVLVLARDIDAADPAL- 373

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
             ++++ ++    R +L D R KRP+P  DDKV+ SWNGL I++ A    +  S A    
Sbjct: 374 --VDRWRDV----RARLLDARGKRPQPARDDKVVASWNGLAITALAEHGALTGSTASREA 427

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLIS 499
                              A     RHL D    RL+   R+G    P G L+DY  +  
Sbjct: 428 AV---------------ALAGVLADRHLID---GRLRRVSRDGVVGDPAGVLEDYGCVAE 469

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
             L +++  +  +W   A  L +     F     GG+++T  +   ++ R  +  D A P
Sbjct: 470 AFLAVHQITADPRWSRLAGRLLDVALARF-GTGSGGFYDTADDAEKLVTRPADPTDNATP 528

Query: 560 SGNSVSVINLVRLASIVAGSK 580
           SG +     LV  A++   ++
Sbjct: 529 SGLAAVCAALVTYAALTGETR 549


>gi|443651764|ref|ZP_21130697.1| hypothetical protein C789_1237 [Microcystis aeruginosa DIANCHI905]
 gi|159027460|emb|CAO89425.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443334405|gb|ELS48917.1| hypothetical protein C789_1237 [Microcystis aeruginosa DIANCHI905]
          Length = 692

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 212/623 (34%), Positives = 309/623 (49%), Gaps = 80/623 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  ++++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYEEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
           L  SA   +    L   +L     + + +           P FP      + L  S+   
Sbjct: 164 LRQSAILPRAETNLADPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             ED+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 224 DFEDSLRQAAHQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +E
Sbjct: 276 QIVEYLANLWSAGDQEAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335

Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+  E+ D L    + L + ++ +   GN            F+G+NVL       
Sbjct: 336 GAFYVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378

Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
               +LG  +E  L+ L     G  + +L      R                  D K+IV
Sbjct: 379 RQGGELGKEIENILDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
           +WN L+IS  ARA          A+F  P+       Y ++A  AA FI +H + D +  
Sbjct: 439 AWNSLMISGLARA---------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQ 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
           RL +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   + 
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAEDE 539

Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           GGYFN T  D S+ L V+E    D A PS N +++ NLVRL+ +    +   Y   AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKA 595

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F T L+    A P +  A D
Sbjct: 596 LQSFSTILEQSPTACPSLFVALD 618


>gi|159036527|ref|YP_001535780.1| hypothetical protein Sare_0871 [Salinispora arenicola CNS-205]
 gi|157915362|gb|ABV96789.1| protein of unknown function DUF255 [Salinispora arenicola CNS-205]
          Length = 699

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 192/555 (34%), Positives = 276/555 (49%), Gaps = 46/555 (8%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESF DE V  LLN+ FV+IKVDREERPDVD VYMT  QA+ G GGWP++
Sbjct: 47  SACHWCHVMAHESFADEQVGALLNENFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF +PD  P   GTYFP      +P F  +L+ V  AW  +R  + + GA  +E +  A 
Sbjct: 107 VFATPDGTPFFCGTYFP------KPNFLRLLQSVAAAWRDQRAAVLRQGAAVVEAIGGAQ 160

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +    S  L  EL    L   A++L++ YD   GGFG APKFP  + +  +L   ++ + 
Sbjct: 161 AVGGPSAPLTAEL----LDAAADRLAEEYDETNGGFGGAPKFPPHLNLLFLL---RQYQR 213

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
           TG    A    +++  T + MA+GG+HD + GGF RYSVD RW VPHFEKMLYD   L  
Sbjct: 214 TG----AQRSLEIIRHTCEAMARGGLHDQLAGGFARYSVDGRWAVPHFEKMLYDNALLLR 269

Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
           VY   + LT D     + RD   +L  ++  PG    SA DAD+   EG T       YV
Sbjct: 270 VYTHLWRLTGDQLARRVARDTARFLADELHRPGEGFASALDADTDGVEGLT-------YV 322

Query: 320 WTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           WT  ++ + LGE    +    + +   G+      + P       +      D   S  +
Sbjct: 323 WTPAQLVEALGEEDGRWAADLFDVTEEGSFTPHAAAPPGEALTAADA----TDQPTSVLR 378

Query: 379 LGMPLE----KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
           L   ++    +      E   +L  VR  RP+P  DDKV+ +WNGL I++ A   ++   
Sbjct: 379 LARDVDDAAPEVRTRWQEVAHRLLVVRDARPQPARDDKVVAAWNGLAITAIAEFQQVAAG 438

Query: 435 EAESAMFNFPVVGSDRKEYMEVA------ESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
            AE A    P   ++  E + +       ++A    + HL D +  R     R G  +A 
Sbjct: 439 YAEDA----PGQDANLMEGVTIVADGAMRDAAEHLAQVHLVDGRLRRTSRDGRVG--EAA 492

Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
           G L+DY  +      +++     +WLV A  L +   E F   + G +++T  +   ++ 
Sbjct: 493 GVLEDYGCVAEAFCAMHQVTGEGRWLVLAGRLLDVALERFAAPD-GSFYDTADDAERLVS 551

Query: 549 RVKEDHDGAEPSGNS 563
           R  +  D A PSG S
Sbjct: 552 RPADPTDNATPSGRS 566


>gi|373108743|ref|ZP_09523024.1| hypothetical protein HMPREF9712_00617 [Myroides odoratimimus CCUG
           10230]
 gi|371645988|gb|EHO11505.1| hypothetical protein HMPREF9712_00617 [Myroides odoratimimus CCUG
           10230]
          Length = 681

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 189/559 (33%), Positives = 288/559 (51%), Gaps = 48/559 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ VA L+N  F+SIKVDREE P +D  YM  +Q +   GGWPL+V
Sbjct: 62  TCHWCHVMEKESFENQEVADLMNQHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNV 121

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
              PD +P+ GGTYF       R  +   L ++   + +KRD +     FA  QL E +S
Sbjct: 122 VCLPDGRPIWGGTYFK------RQNWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGIS 171

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
             + +    +E   N   L  E   KS+D  +GG+  APKF  P     +LY  KK    
Sbjct: 172 ILSQAPIAQEESRFNT-DLVLENWKKSFDWEYGGYTRAPKFMMPTN---LLYLQKK---- 223

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G      +  + +  TL  MA GG+ D V GGF RYSVD +WH+PHFEKMLYD  QL +V
Sbjct: 224 GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSV 283

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y D +  T +  Y  +    ++++  +     G  +SA DADS ++    + +EGAFY+W
Sbjct: 284 YADGYKRTHNKLYKEVIDKTINFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYIW 341

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T +E+++++ +   LF   + +   G+ +       +N++    VLI+  +    A++  
Sbjct: 342 TIEELKELVQQDFPLFSTVFNINSFGHWE-------NNQY----VLIQTRELIDIANENN 390

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +PLE   N   +    L   R+ RP+P LDDK + SWN + I+    A    ++ A    
Sbjct: 391 IPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA---- 446

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                       Y+E A++   FI  +L+ E+   L+ ++++G +K   FLDDYAF I G
Sbjct: 447 ------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQG 493

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           L+ L+E     +++  A  L +   + FLD E   ++ +       +    E  D   PS
Sbjct: 494 LIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETEDNVIPS 553

Query: 561 GNSVSVINLVRLASIVAGS 579
            N++  INL +L  +   S
Sbjct: 554 SNAIMAINLYKLGLLYENS 572


>gi|150026141|ref|YP_001296967.1| hypothetical protein FP2103 [Flavobacterium psychrophilum JIP02/86]
 gi|149772682|emb|CAL44165.1| Protein of unknown function YyaL [Flavobacterium psychrophilum
           JIP02/86]
          Length = 686

 Score =  303 bits (775), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 191/570 (33%), Positives = 284/570 (49%), Gaps = 54/570 (9%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVME ESFE++ VA ++N  F+SIKVDREERPDVD +YM  VQ +   GGWPL+V 
Sbjct: 63  CHWCHVMEHESFENQEVASVMNLNFISIKVDREERPDVDAIYMKAVQMMTNRGGWPLNVV 122

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLSEA 138
             PD +P+ GGTYF  E+      +   L+++ + +    +K    AQ     I+ L   
Sbjct: 123 CLPDGRPIWGGTYFQKEE------WTNTLQQLHELYVSNPQKIIKYAQKLHQGIQVLGTI 176

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
              +A      ++   N ++   E+ SKS+D  +GG+  APKF  P            L+
Sbjct: 177 QHHTAQ-----EQNHTNNIKPLVEKWSKSFDWEYGGYARAPKFMMPNNYLF-------LQ 224

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
             G   ++ E    V  TL  MA GGI D + GGF RYSVD RWH+PHFEKMLYD GQL 
Sbjct: 225 RYGYQTKSQELLNFVDLTLTKMAHGGIFDTIAGGFSRYSVDIRWHIPHFEKMLYDNGQLV 284

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
           ++Y  A+  T++  Y  +    L ++ R+ +      ++A DADS         +EGAFY
Sbjct: 285 SLYAQAYKRTQNPLYKEVIEKTLTFVEREFLNSDNGFYAALDADSLNQNNEL--EEGAFY 342

Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           VWT  E+++IL     +F   Y +   G  +     D H       VLI+   S + ASK
Sbjct: 343 VWTKTELQEILKNDFEIFSHLYNVNDFGFWE----HDNH-------VLIQNQPSKSIASK 391

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
            G+   +  N      + LF  R KRP+P LDDK + SWN +++  +  A   L ++   
Sbjct: 392 FGLTENELQNKRKNWEQLLFTKREKRPKPRLDDKSLTSWNAIMLKGYTDAYNALGNQ--- 448

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                        +Y+ +AE  A FI    +  +   L  S++   S   GFL+DYAF I
Sbjct: 449 -------------KYLAIAEKNAQFITTKQWSAEGF-LYRSYKKNKSTIEGFLEDYAFTI 494

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              + LY+     K+L  A +L +   + F + +   +   + +   ++ +  E  D   
Sbjct: 495 DAFISLYQATLNEKYLQQAKQLTDYCFDNFYNEKQHFFAFNSRKSAQLIAQHFETEDNVM 554

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
           P+ NSV   NL  L  + +   ++YY + A
Sbjct: 555 PASNSVMANNLYVLGLLFS---NNYYEKIA 581


>gi|367034245|ref|XP_003666405.1| hypothetical protein MYCTH_2311055 [Myceliophthora thermophila ATCC
           42464]
 gi|347013677|gb|AEO61160.1| hypothetical protein MYCTH_2311055 [Myceliophthora thermophila ATCC
           42464]
          Length = 827

 Score =  303 bits (775), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 218/681 (32%), Positives = 333/681 (48%), Gaps = 114/681 (16%)

Query: 16  HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
           H      H+CH+   +SF +  VA LLN+ F+ I VDREERPD+D +Y  Y +A+   GG
Sbjct: 73  HIGFQADHFCHLTTQDSFSNPSVAALLNNSFIPILVDREERPDLDTIYQNYSEAVNATGG 132

Query: 76  WPLSVFLSPDLKPLMGGTYFP-PEDKY--------------------------GRPG--- 105
           WPL++FL+PDL P+ GGTY+P P  ++                          G  G   
Sbjct: 133 WPLNLFLTPDLYPIFGGTYWPGPGTEHSSAAASAAGGGGGGGGGGSGTGAISRGSAGEES 192

Query: 106 ---FKTILRKVKDAWDKKRDM--------------LAQSGAF---AIEQLSEALSASASS 145
              F  I +K+   W ++ +                AQ G F   A   +S    ASA +
Sbjct: 193 YSDFLGIAKKIHKFWVEQEERCRREAFEMLHKLQDFAQEGTFGAGATLPVSATPVASAGA 252

Query: 146 NKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KL 197
              P      +L  + L     +++K +D    GFG+ PKFP P  +  +L  ++   ++
Sbjct: 253 GPAPVSVDPGDLDLDQLDEALARITKMFDPVDYGFGT-PKFPNPARLSFLLRLAQFPGEV 311

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            D     E     +M L TL+ +  G + DHVG GF R+SV   W +PHFEKM+ +   L
Sbjct: 312 RDVIGDEEVENAVRMALGTLRRIRDGALRDHVGAGFMRFSVTSNWSMPHFEKMVGENALL 371

Query: 258 ANVYLDAF-SLTKDVF--------YSYICRDILDYLRRDMIGPG-GEIFSAEDADSAETE 307
             V+LDA+  L +D          ++ +  ++ DYL   ++    G   S+E ADS   +
Sbjct: 372 LGVFLDAWLGLPRDAGKGPALDDEFADVVLELADYLTSPIVRVAEGGFVSSEAADSFYRK 431

Query: 308 GATRKKEGAFYVWTSKEVEDILG-----EHAILFKEHYY-LKPTGNCDLSRMSDPHNEFK 361
           G    +EGAFY WT +E + ++G     +HA      Y+ ++  GN  +++  DP +EF 
Sbjct: 432 GDRHMREGAFYTWTRREFDQVVGGGSSDDHASTVAAAYWDVQEDGN--VAQEQDPFDEFI 489

Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGL 420
            +N+L     ++  + +LG+P  +  +++   R KL   R K RPRP  D+K++VS NG+
Sbjct: 490 NQNILSVKASAAELSKQLGIPPSEIKHLVSVAREKLRAHREKERPRPPRDEKIVVSTNGM 549

Query: 421 VISSFARASKILKS-EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR---L 476
           VIS+ +R +  L+S E E A         DR  Y++ A  AA+FI+ +L+D    +   L
Sbjct: 550 VISALSRTAAALRSLEGERA---------DR--YLQAARDAAAFIKENLWDGANSKGNPL 598

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD------ 530
              F   PS+   F DDYAFLI GLLDLY      +W+ WA +LQ+ Q  LF D      
Sbjct: 599 HRFFWERPSQVLAFADDYAFLIDGLLDLYNATLEQEWVDWARQLQDAQTNLFYDAPLTGP 658

Query: 531 -----------REGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
                         GG+++T  E  S  +LR+K   D ++PS N+VS  NL RL +++  
Sbjct: 659 VSTDTAPSPRHAHSGGFYSTESETLSPTILRLKSGMDKSQPSTNAVSASNLFRLGTLLG- 717

Query: 579 SKSDYYRQNAEHSLAVFETRL 599
              D Y   A  ++  FE  +
Sbjct: 718 --VDAYLIQARETVNAFEAEI 736


>gi|254381981|ref|ZP_04997344.1| conserved hypothetical protein [Streptomyces sp. Mg1]
 gi|194340889|gb|EDX21855.1| conserved hypothetical protein [Streptomyces sp. Mg1]
          Length = 686

 Score =  303 bits (775), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 224/703 (31%), Positives = 324/703 (46%), Gaps = 80/703 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED   A  +N+ FV++KVDREERPDVD VYM  VQA  G GGWP++
Sbjct: 47  SACHWCHVMAHESFEDGATAAYMNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
           VFL+ D +P   GTYFPPE ++G P F  +L  V  AW  + + + +     +  L+   
Sbjct: 107 VFLTADAEPFYFGTYFPPEPRHGMPSFPQVLEGVHTAWTGRPEEVTEVARRIVGDLAGRR 166

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
                ++   P+EL    L      L++ YD+  GGFG APKFP  + ++ +L H  +  
Sbjct: 167 PDYGKAAVPGPEELAGALL-----GLTREYDAAHGGFGGAPKFPPSMVLEFLLRHHAR-- 219

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
            TG  G      +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L 
Sbjct: 220 -TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 274

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            VY   +  T       +  +  D++ R++    G   SA DADS E E   +  EGA+Y
Sbjct: 275 RVYAHLWRATGSELARRVALETADFMVRELRTREGGFASALDADSEEPE-TGKHVEGAYY 333

Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
            WT  ++ ++LGE    L    + +   G  +            G +VL    D  A   
Sbjct: 334 AWTPDQLREVLGEADGELAAGCFGVTEEGTFE-----------HGTSVLRLPQDGPA--- 379

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
              +  E++ +I    R +L   R  RP P  DDKV+ +WNGL I++ A           
Sbjct: 380 ---VDAERFASI----RARLLAARGGRPAPGRDDKVVAAWNGLAIAALAECGAYF----- 427

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKA-PGFLDDY 494
                      +R + +E A  AA  + R  +D      RL  + ++G + A  G L+DY
Sbjct: 428 -----------ERPDLIERATEAADLLVRVHFDAAAGGPRLARTSKDGRAGANAGVLEDY 476

Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
             +  G L L        WL +A  L +   +LF   E G  ++T  +   ++ R ++  
Sbjct: 477 GDVAEGFLALAAVTGEGVWLEFAGFLVDLVLDLFT-AEDGSLYDTAHDAERLIRRPQDPT 535

Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
           D A PSG + +   L+   S  A + S  +R  AE +L V       +   VP      +
Sbjct: 536 DSAAPSGWTAAAGALL---SYAAHTGSQAHRTAAERALGVVHA----LGPRVPRFIGHGL 588

Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP--ADTEEMDFWE 667
             A  +L  P  + V +VG      +  +   A         V    P  AD    +F  
Sbjct: 589 AVAEALLDGP--REVAVVGDPDDPQWAALHRTALLGTAPGAVVAAGPPRAADGSGGEF-- 644

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                   +A          A VC++F C+ P TDP+ L   L
Sbjct: 645 ------PLLAERAPVRGLPAAYVCRHFVCARPTTDPVELAEQL 681


>gi|334338370|ref|YP_004543522.1| hypothetical protein Isova_2944 [Isoptericola variabilis 225]
 gi|334108738|gb|AEG45628.1| protein of unknown function DUF255 [Isoptericola variabilis 225]
          Length = 658

 Score =  303 bits (775), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 229/692 (33%), Positives = 326/692 (47%), Gaps = 88/692 (12%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+ VA  L D FV+IKVDREERPDVD VYM    AL G GGWP++ F
Sbjct: 50  CHWCHVMAHESFEDDDVAAALADRFVAIKVDREERPDVDAVYMGATTALTGQGGWPMTCF 109

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD +P   GTY+P      R  F  +L  V +AW ++RD + + GA     L+EA+ A
Sbjct: 110 LTPDGEPFFAGTYYP------REHFLQVLDAVWEAWTERRDAVERQGA----ALTEAI-A 158

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
             S+   PD L + AL      +++  D   GGFG APKFP  + ++ +L H  +  D  
Sbjct: 159 RTSARLTPDVLDEAALERSVRLVARDADPEHGGFGGAPKFPPSMTLEHLLRHHARTGD-- 216

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
                    ++V  T + MA+GGI+D + GGF RY+VD  W VPHFEKMLYD  QL  VY
Sbjct: 217 -----PSALELVERTCEAMARGGIYDQLAGGFARYAVDAAWVVPHFEKMLYDNAQLLRVY 271

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
           L  +  T       + R+  ++LR D+  P G   SA DAD+   EG T       YVWT
Sbjct: 272 LHWYRATGSPLAERVVRETAEFLRADLRTPEGGFASALDADTDGVEGLT-------YVWT 324

Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDS-SASASKL 379
           ++++ D+LG                         P +  +   VL + L  +     S L
Sbjct: 325 AEQLADVLG-------------------------PADGARAAEVLSVTLEGTFEHGTSTL 359

Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
            +  +         R +L + R+ RP+P  DDKV+ +WNGL I++ A A ++L       
Sbjct: 360 QLREDPDPEWWTGVRARLAEARAGRPQPARDDKVVTAWNGLAIAALAEAGELL------- 412

Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLI 498
               P    D ++  ++       +R H+ D    RL+ + R G    APG   D+  L 
Sbjct: 413 --GVPGYVDDARDCADL------LLRLHVVD---GRLRRASRGGVVGTAPGVAADHGDLA 461

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
            GLL L++    T+WL  A EL     E F D   GG+++   +   ++ R K+  DG E
Sbjct: 462 EGLLALHQATGETRWLDAAGELLEVALERFGD-GAGGFYDVADDAERLVSRPKDPTDGPE 520

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           PSG S     L   A++   S+   +R+ AE ++A   T  K +          A+ L+ 
Sbjct: 521 PSGQSSLAGALATYAALTGSSR---HREAAEAAVAAAGTLAKQVPRFAGWTLAVAEALAA 577

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
                V +VG           AA  +S      V+ +   DT  +            +A 
Sbjct: 578 -GPLQVAVVGPDDGARLALERAARASSS--PGLVLAVGEPDTPGVPL----------LAD 624

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
                 +  A VC+ F C  PVT    LE  L
Sbjct: 625 RPLVDGRPAAYVCRGFVCDRPVTTVEELERAL 656


>gi|423328847|ref|ZP_17306654.1| hypothetical protein HMPREF9711_02228 [Myroides odoratimimus CCUG
           3837]
 gi|404604409|gb|EKB04043.1| hypothetical protein HMPREF9711_02228 [Myroides odoratimimus CCUG
           3837]
          Length = 667

 Score =  303 bits (775), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 188/559 (33%), Positives = 286/559 (51%), Gaps = 48/559 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           TCHWCHVME ESFE++ VA ++N  F+SIKVDREE P +D  YM  +Q +   GGWPL+V
Sbjct: 48  TCHWCHVMEKESFENQEVADIMNQHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNV 107

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
              PD +P+ GGTYF  E       +   L ++   + +KRD +     FA  QL E +S
Sbjct: 108 VCLPDGRPIWGGTYFKKE------AWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGIS 157

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              S   +  E  +    L  E   KS+D  +GG+   PKF  P     +LY  KK    
Sbjct: 158 I-LSQAPIAQEDSRFNTELVLENWKKSFDWEYGGYTRTPKFMMPTN---LLYLQKK---- 209

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G      +  + +  TL  MA GG+ D V GGF RYSVD +WH+PHFEKMLYD  QL +V
Sbjct: 210 GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSV 269

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y D +  T +  Y  +    +D++  +     G  +SA DADS ++    + +EGAFY+W
Sbjct: 270 YADGYKRTHNKLYKEVIDKTIDFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYIW 327

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T +E+++++ +   LF   + +   G+ +       +N++    VLI+  +    A++  
Sbjct: 328 TIEELKELVQQDFPLFSTVFNINSFGHWE-------NNQY----VLIQTRELIDIANENN 376

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
           +PLE   N   +    L   R+ RP+P LDDK + SWN + I+    A    ++ A    
Sbjct: 377 IPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA---- 432

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
                       Y+E A++   FI  +L+ E+   L+ ++++G +K   FLDDYAF I G
Sbjct: 433 ------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQG 479

Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
           L+ L+E     +++  A  L +   + FLD E   ++ +       +    E  D   PS
Sbjct: 480 LIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETEDNVIPS 539

Query: 561 GNSVSVINLVRLASIVAGS 579
            N++  INL +L  +   S
Sbjct: 540 SNAIMAINLYKLGLLYENS 558


>gi|425450832|ref|ZP_18830655.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 7941]
 gi|389768138|emb|CCI06653.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 7941]
          Length = 692

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 210/620 (33%), Positives = 311/620 (50%), Gaps = 74/620 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  + ++++ L++  A  +  L ++
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYGEEKEKLSKFTAEMLGALRQS 167

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
                +   L D    + L    E  +         +G  P FP      + L  S+  +
Sbjct: 168 AILPRAETNLADP---SLLATGIETNTAVIQVNPNNYGR-PSFPMIPYSHLALQGSRFGD 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D   S + +  Q+      + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+ 
Sbjct: 224 DFDDSLQQAAYQRG-----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIV 278

Query: 259 NVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
               + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +EGAF
Sbjct: 279 EYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKARDREPEEGAF 338

Query: 318 YVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW+  E+ D L    + L + ++ +   GN            F+G+NVL          
Sbjct: 339 YVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQG 381

Query: 377 SKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWN 418
            +LG  +E  L+ L     G  + +L      R                  D K+IV+WN
Sbjct: 382 GELGKEIENILDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWN 441

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQ 477
            L+IS  ARA          A+F+ P+       Y ++A  AA FI +H + D +  RL 
Sbjct: 442 SLMISGLARA---------FAVFSEPL-------YWQMATQAAEFILQHQWLDGRFQRLN 485

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGY 536
           +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGY
Sbjct: 486 Y---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWSEDEGGY 542

Query: 537 FNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           FN T  D S+ L V+E    D A PS N +++ NLVRL+ +    +   Y   AE +L  
Sbjct: 543 FN-TASDHSLDLIVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKALQS 598

Query: 595 FETRLKDMAMAVPLMCCAAD 614
           F T L+    A P +  A D
Sbjct: 599 FSTILEQSPTACPSLFVALD 618


>gi|294814700|ref|ZP_06773343.1| DUF255 domain-containing protein [Streptomyces clavuligerus ATCC
           27064]
 gi|326443082|ref|ZP_08217816.1| hypothetical protein SclaA2_18553 [Streptomyces clavuligerus ATCC
           27064]
 gi|294327299|gb|EFG08942.1| DUF255 domain-containing protein [Streptomyces clavuligerus ATCC
           27064]
          Length = 675

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 226/695 (32%), Positives = 327/695 (47%), Gaps = 81/695 (11%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFED   A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++V
Sbjct: 49  SCHWCHVMAHESFEDGATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F++ + +P   GTYFPPE ++G P F+ +L  V  AW  +RD + +  A     L+   S
Sbjct: 109 FMTAEGEPFYFGTYFPPEPRHGMPSFRQVLEGVTAAWTGRRDEVDEVAARIRRDLA-GRS 167

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
            +   + +P    Q    +    LS+ YD R GGFG APKFP  + ++ +L H  +   T
Sbjct: 168 LAHGGDGVPGAEEQARALIG---LSREYDERHGGFGGAPKFPPSMVLEFLLRHHAR---T 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G         +M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  V
Sbjct: 222 GSEA----ALQMAAETAEAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRV 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
           Y   + LT       +  +  D++ R++    G   SA DADS   +G   + EGAFYVW
Sbjct: 278 YARLWRLTGAPLARRVALETADFMVRELRTAEGGFASALDADSTGADGV--RAEGAFYVW 335

Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           T  ++ ++LGE                 +L  ++D      G +VL    D         
Sbjct: 336 TPAQLTEVLGEE----------DGRRAAELYGVTDEGTFEHGTSVLRLPGDDPGPG---- 381

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
                        R++L   R  R RP  DDKV+ +WNGL I++ A              
Sbjct: 382 ------------IRQRLLASRELRERPERDDKVVAAWNGLAIAALAETGAYF-------- 421

Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLIS 499
                   DR + +E A  AA  + R L+ + + RL  + R+G   +  G L+DY  +  
Sbjct: 422 --------DRPDLVERATEAADLLVR-LHLDGSARLTRTSRDGRAGRNAGVLEDYGDVAE 472

Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKEDHDG 556
           G L L        WL +A  L +    + LDR   E G  ++T  +   ++ R ++  D 
Sbjct: 473 GFLALASVTGEGVWLEFAGLLLD----IVLDRFTGENGTLYDTAHDAEQLIRRPQDPTDN 528

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSG + +   L+   S  A + S+ +R  AE +L V +         +     AA+ L
Sbjct: 529 AAPSGWTAAAGALL---SYAAHTGSEAHRTAAERALGVVKALGPRAPRFIGWGLAAAEAL 585

Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
            +   + V +VG     D E+      A+ +L++T +                  +   +
Sbjct: 586 -LDGPREVAVVG-----DPED-----PAARELHRTALLAPAPGAVVAA--GAPGGDEFPL 632

Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
            R+    D +  A VC+ F C  PVT P +L   L
Sbjct: 633 LRDRDLVDGRAAAYVCRGFVCRRPVTGPSALAEEL 667


>gi|425439757|ref|ZP_18820072.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9717]
 gi|389719932|emb|CCH96294.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9717]
          Length = 692

 Score =  302 bits (774), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 211/623 (33%), Positives = 310/623 (49%), Gaps = 80/623 (12%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDRAIADYLNHYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
           L  SA   +    L    L     + + +           P FP      + L  S+   
Sbjct: 164 LRQSAILPRAETNLAAPYLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             +D+ +      G+ + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 224 DFDDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           Q+     + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +E
Sbjct: 276 QIVEYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335

Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW+  E+ D L    + L + ++ +   GN            F+G+NVL       
Sbjct: 336 GAFYVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378

Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
               +LG  +E  L+ L     G  + +L      R                  D K+IV
Sbjct: 379 RQGGELGEEIENMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
           +WN L+IS  ARA          A+F+ P+       Y ++A  AA FI +H + D +  
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMATQAAEFILKHQWLDGRFQ 482

Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
           RL +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   + 
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDE 539

Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           GGYFN T  D S+ L V+E    D A PS N +++ NL+RL+ +    +   Y   AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKA 595

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F T L++   A P +  A D
Sbjct: 596 LQSFSTILEESPTACPSLFVALD 618


>gi|380805071|gb|AFE74411.1| spermatogenesis-associated protein 20 precursor, partial [Macaca
           mulatta]
          Length = 397

 Score =  302 bits (774), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 165/420 (39%), Positives = 238/420 (56%), Gaps = 43/420 (10%)

Query: 79  SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ A
Sbjct: 1   NVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTA 56

Query: 139 LSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH-- 193
           L A +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +  +  
Sbjct: 57  LLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWL 116

Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
           S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYD
Sbjct: 117 SHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYD 171

Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q QLA  Y  AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R K
Sbjct: 172 QAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPK 230

Query: 314 EGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
           EGA+YVWT KEV+ +L E  +          L  +HY L   GN   S+  DP  E +G+
Sbjct: 231 EGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGNISPSQ--DPKGELQGQ 288

Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
           NVL        +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S
Sbjct: 289 NVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVS 348

Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
            +A    +L              G DR   +  A + A F++RH++D  + RL  +   G
Sbjct: 349 GYAVTGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTG 392


>gi|408395590|gb|EKJ74769.1| hypothetical protein FPSE_05104 [Fusarium pseudograminearum CS3096]
          Length = 717

 Score =  302 bits (774), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 196/607 (32%), Positives = 310/607 (51%), Gaps = 67/607 (11%)

Query: 23  HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
           H C +M +E+F +   A +LN+ FV + VDREERPD++ VYM Y QA++  GGWPL+VFL
Sbjct: 84  HSCRLMSIETFSNPESAAVLNESFVPVIVDREERPDIEAVYMNYAQAVHKVGGWPLNVFL 143

Query: 83  SPDLKPLMGGTYFP-PEDKYGRPGFK--------TILRKVKDAWDKKR--------DMLA 125
           +P+L+P+ GGTY+  P  +    G          TIL K++D W+ +         +++A
Sbjct: 144 TPNLEPVFGGTYWVGPAGRRRHNGDSTDEVLDSLTILNKMRDTWNDQEARCRKEATEIVA 203

Query: 126 QSGAFAIEQLSEALSASASSNKLP-----------------------DELPQNALRLCAE 162
           Q   FA E      S +A S   P                        EL  + L +   
Sbjct: 204 QLKEFAAEGTLGTRSITAPSALGPLAGWGAPAPSNPSTTENRTMIVSQELDLDQLEVAYR 263

Query: 163 QLSKSYDSRFGGFGSAPKFPRPVEIQM---MLYHSKKLEDTGKSGEASEGQKMVLFTLQC 219
            ++ ++D   GGFG APK+  P ++     +L     ++D     E     K+ L+TL+ 
Sbjct: 264 NIAGTFDPVHGGFGLAPKYMIPPKLTFLLGLLTAPGPVQDVVGYDECRHATKIALYTLRQ 323

Query: 220 MAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT---KDVFYSY 275
           +  G +HDH+G  GF   SV   W +P+FEK++ D  QL ++Y+DA+  +   +   +  
Sbjct: 324 IRDGALHDHIGATGFSHCSVTADWSIPNFEKLVIDNAQLLSLYIDAWKASGGGEQGEFLD 383

Query: 276 ICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE--- 331
           +  ++++YL    +  P G   S+E ADS   +G   K+EGA+YVWT +E + +L +   
Sbjct: 384 VVLELIEYLTTSPVTLPEGGFASSEAADSYYRQGDNEKREGAYYVWTWREFKSVLDDIDH 443

Query: 332 -HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 390
             + +   ++ +   GN  +   +DP+++F  +N+L         +S    P+EK    +
Sbjct: 444 HMSPILAAYWNVNKDGN--VKETNDPNDDFMNQNILCVKTTVEQLSSHFSTPVEKIREYI 501

Query: 391 GECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 449
            + +  L   R + R RP LDDK++  WNGLVIS+ ++A+  L++          +    
Sbjct: 502 EKGKAALRKKREQERVRPELDDKIVAGWNGLVISALSKAASALRT----------LKPEQ 551

Query: 450 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 509
                  AE AA+ I+  L+D     L  ++  G      F DDYA+LI GLLDL+    
Sbjct: 552 SSRCKSAAERAAACIKERLWDADEKVLYRTW-CGERGHTAFADDYAYLIQGLLDLFGLTE 610

Query: 510 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 569
             ++L +A  LQ TQ  LF D + G +F T    P V+LR+KE  D + PS N+VSV NL
Sbjct: 611 NHQYLEFAETLQQTQISLFFD-DDGAFFTTKAHSPHVILRLKEGMDTSLPSTNAVSVANL 669

Query: 570 VRLASIV 576
            RLAS++
Sbjct: 670 FRLASLL 676


>gi|374310263|ref|YP_005056693.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358752273|gb|AEU35663.1| hypothetical protein AciX8_1320 [Granulicella mallensis MP5ACTX8]
          Length = 704

 Score =  302 bits (773), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 214/697 (30%), Positives = 341/697 (48%), Gaps = 61/697 (8%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM+ ES+E+   A+++N+ F+++KVDR+ERPDVD  Y   +  + G GGWPL+ F
Sbjct: 54  CHWCHVMDRESYENAATAEVINEHFIAVKVDRDERPDVDTRYQAAISTISGQGGWPLTAF 113

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA---FAIEQLSEA 138
           L+P+ KP  GGTYFPP+D+YGRP F+ +L  + D +  +RD + +S      AIE+ +E+
Sbjct: 114 LTPEGKPYFGGTYFPPDDRYGRPSFQRVLLTMADVFQNRRDEVEESAGGVMLAIEE-NES 172

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            S  A +   P  L    + L   Q    +D + GGFGS PKFP    I ++      ++
Sbjct: 173 FSVPAGNPGAP--LLDKLVALTVSQ----FDQKNGGFGSQPKFPNSGAIDLL------ID 220

Query: 199 DTGKSGE-ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
              + GE A + + +   TLQ MA GGIHD + GGFHRYSVDERW VPHFEKM YD  +L
Sbjct: 221 AASRGGELAEQARHVATVTLQKMAAGGIHDQLAGGFHRYSVDERWIVPHFEKMAYDNSEL 280

Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
              Y+ AF    +  ++ + +DIL ++   +       F A     ++    +   +G +
Sbjct: 281 LKNYVHAFQSFGEPEFARVAKDILRWMDEWLSDREQGGFYA-----SQDADDSLDDDGDY 335

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           + WT  E + +L        E Y+       +L  + D H+  + KNVL       A A 
Sbjct: 336 FTWTRAEAKAVLTAEEFAVAELYF-------NLRDVGDMHHNPQ-KNVLHLGEPVEAIAR 387

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
           KL   L++    L     KL+  R +R  P++D  +   WNG+ ++++  A+++L     
Sbjct: 388 KLNRALDEVNETLAAATGKLYAARLQRKTPYVDKTIYTGWNGMCLAAYFEAARVLDL--- 444

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
             + +F +   DR   + VA      +         H + +      ++  G L+DY FL
Sbjct: 445 PEVRSFALRSLDR--VLNVAWDPVEGL--------AHVVAYGEGGSAARVAGVLEDYGFL 494

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT----GEDP--SVLLRVK 551
            + +LD +E     ++   A  + +     F D  GGG+F+T        P  ++  R K
Sbjct: 495 ANAVLDAWESTGELRYFTAAQAIADVMLVRFYDAAGGGFFDTERMEGAPQPIGALSTRRK 554

Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
              D   P+GNSV+V  L+RLA++   + SD Y + A+ +L  F   ++   +       
Sbjct: 555 PLQDAPTPAGNSVAVTLLLRLAALT--NHSD-YGERAQETLEAFAGVVEHFGLYAASYGL 611

Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
           A    +V S   + +VG  +        A   A + +NK+VI +D +   E+        
Sbjct: 612 ALRR-AVESSVQICVVGDDARARELEAAAV--AGFAVNKSVIRLDRSRFHELPAALAETL 668

Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLEN 708
            N      +F      A+VC+  +C PP+     L N
Sbjct: 669 PNLPQVEGSF------AVVCKGNTCLPPIQSVEELRN 699


>gi|434393621|ref|YP_007128568.1| hypothetical protein Glo7428_2913 [Gloeocapsa sp. PCC 7428]
 gi|428265462|gb|AFZ31408.1| hypothetical protein Glo7428_2913 [Gloeocapsa sp. PCC 7428]
          Length = 687

 Score =  302 bits (773), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 221/665 (33%), Positives = 320/665 (48%), Gaps = 119/665 (17%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDLAIADYMNAHFLPIKVDREERPDLDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAF--AIEQL 135
           +F++PD L P  GGTYFP E +YGRPGF  +L+ ++  +D +K+D+LA+  A   AI+Q 
Sbjct: 108 IFIAPDDLVPFYGGTYFPVEPRYGRPGFLQVLQAIRRYYDTEKQDLLARKAAILEAIQQ- 166

Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG-----GFGSAPKFPRPVEIQMM 190
               SA     +  DE          + L K  ++  G      +G+  +FP     ++ 
Sbjct: 167 ----SAVLPKTQQSDE----------DLLKKGIETNTGVITPHDYGT--QFPMIPYAELA 210

Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
           L  ++      +       Q+  L     +A GGI+DHV GGFHRY+VD  W VPHFEKM
Sbjct: 211 LRGTRFNYSAWRYDIPQVCQQRGL----DLALGGIYDHVAGGFHRYTVDPTWTVPHFEKM 266

Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETE 307
           LYD GQ+     + +S    V    I R I   + +L+R+M  P G  ++A+DADS  + 
Sbjct: 267 LYDNGQIVEYLANLWS--NGVQEPAIERAIALTVQWLKREMTAPEGYFYAAQDADSFTSP 324

Query: 308 GATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
                +EGAFYVW+  E++ IL  E     ++ + +   GN            F+G+ VL
Sbjct: 325 YEAEPEEGAFYVWSYSELQQILSSEELSALEQQFTITSQGN------------FEGQIVL 372

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVR------------------------- 401
              +  S S            +I  +   KLF VR                         
Sbjct: 373 QRRHPGSLS------------DITEQALSKLFTVRYGATPESLDVFPPARNNQEAKTQNW 420

Query: 402 SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAA 461
           S R     D K+IV+WN L+IS  ARA  + K                + EY+E+A S+A
Sbjct: 421 SGRIPAVTDTKMIVAWNSLMISGLARAYAVFK----------------KSEYLEIALSSA 464

Query: 462 SFIRRH-LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF----GSGTKWLVW 516
            FI  H   D + HRL +    G +      +DYA  I  LLDLY+      +   WL  
Sbjct: 465 RFILNHQQVDGRFHRLNY---EGQTSVIAQSEDYALFIKALLDLYQVTLKDANSQHWLEQ 521

Query: 517 AIELQNTQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASI 575
           AI LQ   DE     E GGY+NT  +    +++R +   D A P+ N V++ NLVRLA +
Sbjct: 522 AIALQAEFDEYLWSIELGGYYNTASDASRDLIVRERSYADNATPAANGVAIANLVRLALL 581

Query: 576 VAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF 635
              ++   Y   AE +L  F + +     A P +  A D       ++  LV   +S   
Sbjct: 582 ---TEKLSYLDRAEQALQAFTSVMDSAPQACPSLFTALDWY-----RNCTLV-RTTSTTL 632

Query: 636 ENMLA 640
           E +LA
Sbjct: 633 ETVLA 637


>gi|288917991|ref|ZP_06412350.1| protein of unknown function DUF255 [Frankia sp. EUN1f]
 gi|288350646|gb|EFC84864.1| protein of unknown function DUF255 [Frankia sp. EUN1f]
          Length = 669

 Score =  301 bits (772), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 197/609 (32%), Positives = 292/609 (47%), Gaps = 54/609 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFED  +A  +N+ FV+IKVDREERPDVD VYM    AL G GGWP++V
Sbjct: 49  SCHWCHVMAHESFEDAQIAAYMNEHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE--A 138
           FL+P  +P   GTYFPP  + G+  F  +L  V DAW ++R+ + ++GA    +L+E  A
Sbjct: 109 FLTPAAEPFFAGTYFPPRPRQGQTSFPQLLTAVSDAWTQRREEIEEAGADIARRLAEVVA 168

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L    +  +   +L  + L      L+  +D+R GGFG  PKFP  +  +++L H  +  
Sbjct: 169 LPGGTAGGEGGPQLGADLLDGAVAGLAGRFDARHGGFGPKPKFPPSMVAELLLRHWARTG 228

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D           +MV  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD  QL 
Sbjct: 229 D-------DRALEMVRVTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLL 281

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAF 317
            VYL  +  T       + R+ +++L  D+  P G   SA DAD+    +     +EGA 
Sbjct: 282 RVYLHLWRATGSALAERVVRETVEFLLTDLRTPEGGFASALDADAVPAGQPNAHPEEGAS 341

Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
           Y WT  ++ D+LG     +             +  +++      G +VL+   D    A 
Sbjct: 342 YSWTPAQLADVLGPEDGAWA----------AGVLGVTEAGTFEHGTSVLMLPADPDDPAR 391

Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
                           R  L   RS RP+P  DDK++ +WN            I      
Sbjct: 392 ------------FARVRSALAAARSSRPQPARDDKIVAAWN---------GLAIAALAEA 430

Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
            A+   P   +      E+          HL+D +  R     R GP+   G L+DY  +
Sbjct: 431 GALLAEPAWIAAATRAAELLRDV------HLHDGRLWRTSRDGRRGPNA--GVLEDYGCV 482

Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
             G L L++  +  +WL  A EL +     F   + GG+F+T  +  ++L R +E  D A
Sbjct: 483 ADGYLALHQVTADPRWLTLAGELLDVVRARFAAPD-GGFFDTADDAEALLRRPRESSDSA 541

Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMCCAADML 616
            PSG +     ++  A++   ++   +R  A  ++ +    L KD   A      A  +L
Sbjct: 542 TPSGQAAVAGAMLTFAALTGSAE---HRDAAVATVGLLMPLLAKDARYAGWAGAVAEAVL 598

Query: 617 SVPSRKHVV 625
           + P+   VV
Sbjct: 599 AGPAEVAVV 607


>gi|440682478|ref|YP_007157273.1| hypothetical protein Anacy_2941 [Anabaena cylindrica PCC 7122]
 gi|428679597|gb|AFZ58363.1| hypothetical protein Anacy_2941 [Anabaena cylindrica PCC 7122]
          Length = 693

 Score =  301 bits (772), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 212/629 (33%), Positives = 320/629 (50%), Gaps = 86/629 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDLEIAQYMNTNFLPIKVDREERPDLDSIYMQTLQFMSGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+  DL P   GTYFP + +YGRPGF  +L  ++  +D +++ L Q  A  +    EA
Sbjct: 108 VFLAADDLVPFYAGTYFPVDPRYGRPGFLQVLEALRRYYDTEKEELRQRKALIV----EA 163

Query: 139 LSASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHS 194
           L  SA   K+ + E+  N L      L K +++  G   S      FP      M+ Y  
Sbjct: 164 LLTSAVMQKVTNQEVADNQL------LQKGWETCTGIITSKQVGNSFP------MIPYAE 211

Query: 195 KKLEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
             L  T  + +   +GQ++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD
Sbjct: 212 FALRGTRFNYQFQYDGQQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYD 271

Query: 254 QGQLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
            GQ+     + +S  + +  F   +   +  +L+R+M   GG  ++A+DADS     A  
Sbjct: 272 NGQIIEYLANLWSGGIQEPAFERAVAGTV-KWLQREMTAQGGYFYAAQDADSFINSTAIE 330

Query: 312 KKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---- 366
            +EGAFYVW+ +E++ +L  E     ++ + +   GN            F+G+ VL    
Sbjct: 331 PEEGAFYVWSYRELQQLLTTEELNELQQQFAVTANGN------------FEGQIVLQRSH 378

Query: 367 -------IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPR--PHL-DDKVIVS 416
                  +E+  S    ++ G   E   N     R      ++  P   P + D K+IV+
Sbjct: 379 PGELSQTLEIALSKLFTARYGATPESLSN-FPPARDNQEAKKTNWPGRIPAVTDTKMIVA 437

Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHR 475
           WN L+IS  ARA+++ +                +  Y+E+A  AA FI  H + D + HR
Sbjct: 438 WNSLMISGLARAAEVFQ----------------QPNYLELAAQAARFILDHQFVDGRFHR 481

Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG---------TKWLVWAIELQNTQDE 526
           L +    G +      +DYAF I  LLDL++   G         + WL  A+ LQ+  DE
Sbjct: 482 LNYE---GEATVLAQSEDYAFFIKALLDLHQATLGQLDHVSSQNSDWLEKAVSLQDEFDE 538

Query: 527 LFLDREGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
                E GGYFNT+ ++   +++R +   D A PS N +++ NLVRLA +   + + +Y 
Sbjct: 539 FLWSIELGGYFNTSSDNSQDLIVRERSYIDNATPSANGIAIANLVRLALL---TDNLHYL 595

Query: 586 QNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
             AE  L  F+  + +   A P +  A D
Sbjct: 596 DLAEQGLTAFKGVMSNSPQACPSLFTALD 624


>gi|171683203|ref|XP_001906544.1| hypothetical protein [Podospora anserina S mat+]
 gi|170941561|emb|CAP67213.1| unnamed protein product [Podospora anserina S mat+]
          Length = 753

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 215/632 (34%), Positives = 311/632 (49%), Gaps = 71/632 (11%)

Query: 23  HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
           H CH+   ++F +  VA  LN+ FV I VDREERPD+D +Y  Y  A+    GWPL +F 
Sbjct: 84  HLCHITTRDTFHNPTVAAFLNEHFVPIIVDREERPDLDAIYQNYSVAVNSISGWPLHLFF 143

Query: 83  SPDLKPLMGGTYFPPEDKYGRPG----FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE- 137
           +PDL+P     Y P     G  G      TIL+     W +K     +  A  +E L + 
Sbjct: 144 TPDLEPFFANAYLPAPGTVGEDGEACDLLTILQSNHRLWVEKEQKCREEAAKELEGLEKF 203

Query: 138 --------ALSASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSA--PKFPRPVE 186
                   A + +A++    D E+  + + L   +++K +D   GGFG    PKFP P  
Sbjct: 204 VQEGALPLARAPNATATYDSDIEVDLDHVELAVSRIAKLFDPVHGGFGQPGEPKFPNPAR 263

Query: 187 IQMMLYHSKKLEDT-----GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
           +  +L   ++  DT     G   +     KM L TL  M   G+ DH+G GF R S    
Sbjct: 264 LSFLL-RLRECPDTVRDVIGGDEDVERATKMALQTLSKMKNSGLRDHIGEGFMRMSSTSD 322

Query: 242 WHVPHFEKMLYDQGQLANVYLDAF-------SLTKDVFYSYICRDILDYLRRDMIGP-GG 293
           W++PHFEKM+ D   L  VYLDA+        LT    ++ +   + DYL    I    G
Sbjct: 323 WNMPHFEKMVGDNALLLGVYLDAWLGNRKGTQLTNQDEFADVVLGLADYLISPAIQQENG 382

Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSR 352
              S+E A S   +G      G FY+WT +E +++LG  A      Y+ ++  GN    R
Sbjct: 383 GFISSEAAYSYYRKGEQHMTNGTFYLWTHREFDEVLGPEASNIAAAYWNVQEDGNVPQER 442

Query: 353 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDD 411
             DP +EF  +N+L   N     +++ G+P+E+   I+   ++KL   R K R RP  D 
Sbjct: 443 --DPSDEFLNQNILSAGNGVHELSTQHGLPVEEIHRIIASSKKKLLAHRDKERVRPPRDT 500

Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-- 469
           K+I   NG+VIS+ +R+    ++ AE+      V  S   EY++ AE AA FI  +L+  
Sbjct: 501 KIIAGVNGMVISALSRS----QAAAEA------VGHSKSAEYIKRAEKAAQFIFDNLWLN 550

Query: 470 DEQT-------HRLQHSF-RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
           D  T       H++ H +  NGPS+   F DDYAFLI GLLDLYE     +WL WA +LQ
Sbjct: 551 DINTEGPNGGQHKVLHRYWNNGPSETLAFADDYAFLIEGLLDLYEATLSKRWLNWAQDLQ 610

Query: 522 NTQDELFLDRE-------------GGGYFNTTGED-PSVLLRVKEDHDGAEPSGNSVSVI 567
           + Q+ LF D                GG+++T  +   S + R+K   D   PS N+VS  
Sbjct: 611 DAQNRLFYDSPSAVNGTPSRRAAGSGGFYSTELQTISSNIPRLKSAMDILIPSVNAVSAS 670

Query: 568 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
           NL RL SI A S+   Y+Q A  ++  F+  L
Sbjct: 671 NLYRLGSIFAESR---YKQIALETIKAFDPEL 699


>gi|374850591|dbj|BAL53576.1| hypothetical conserved protein [uncultured Bacteroidetes bacterium]
          Length = 676

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 194/569 (34%), Positives = 286/569 (50%), Gaps = 51/569 (8%)

Query: 2   GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
           G  +F    + ++  FL    + CHWCHVME ESF D  VA LL  W++ IKVDREERPD
Sbjct: 27  GEEAFARARREQKLVFLSIGYSACHWCHVMEEESFADPEVAALLERWYIPIKVDREERPD 86

Query: 59  VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
           VD +YM+  QA+ G GGWPL+V L+P+ + +  GTYFP      R G   +L ++   W 
Sbjct: 87  VDALYMSICQAMTGQGGWPLTVILTPEREVIFAGTYFPKRSTPYRIGLIELLERIAALWQ 146

Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
           +   ML  S    +E+++  L ++ S +     +    +    EQL K +D R+GGFG+ 
Sbjct: 147 QDGQMLRSSAHALMERIAPHLRSAHSGH-----ITAGTITAALEQLDKLFDRRYGGFGTR 201

Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
           PKFP    +  +L    +         ++    +   TL+ M  GGI DHVG GFHRYS 
Sbjct: 202 PKFPMAAALWFLLIAGPR--------TSTRALDIATATLEAMRWGGIWDHVGFGFHRYST 253

Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
           DERW +PHFEKMLYDQ  L  VY +A  +TK   +     +I  YL R ++   G   ++
Sbjct: 254 DERWFLPHFEKMLYDQALLLLVYAEAARITKRRLFEITAMEIAAYLDRTLLLEHGAFAAS 313

Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPH 357
           EDAD+ +        EGAFY W  +++  ++  H     +  ++L P GN        P 
Sbjct: 314 EDADTPD-------GEGAFYQWRYEDLRRLIPSHEFERMRAIFHLSPEGNAHDEATGQP- 365

Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
               G+N+L     +     + G  LE++L      R++L  VR+ R RP  D+KV+  W
Sbjct: 366 ---TGRNILSAGTRTEDVLERFGGTLEEFLAWWEPLRQRLETVRNSRARPARDEKVLCDW 422

Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRL 476
           N LV+++ ARA ++L+          P +       +E A    S++ R H++ + T  L
Sbjct: 423 NALVVAALARAGRLLRQ---------PTL-------IERARRTWSYLERVHVHADGT--L 464

Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
            H   +G     GFLDDYAF     L+LY       +L     L ++  E F+D  G G 
Sbjct: 465 AHCSYSGEPAIDGFLDDYAFAAWAALELYHATGANDFLEHVEHLLHSITERFVD--GDGI 522

Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
             T     + +L + E  DGA  SG  ++
Sbjct: 523 VRTAAS--ADVLPLTEPSDGATVSGIGIT 549


>gi|158312686|ref|YP_001505194.1| hypothetical protein Franean1_0830 [Frankia sp. EAN1pec]
 gi|158108091|gb|ABW10288.1| protein of unknown function DUF255 [Frankia sp. EAN1pec]
          Length = 669

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 219/637 (34%), Positives = 303/637 (47%), Gaps = 69/637 (10%)

Query: 2   GRRSFCGGTKTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERP 57
           G  +F   T TR    L++     CHWCHVM  ESFED  +A  +N  FV+IKVDREERP
Sbjct: 27  GPEAFAEAT-TRGVPVLLSVGYAACHWCHVMAHESFEDPEIAAYMNQHFVNIKVDREERP 85

Query: 58  DVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
           DVD VYM    AL G GGWP++VFL+P  +P   GTYFPP    G   F  ++  + DAW
Sbjct: 86  DVDSVYMDVTVALTGHGGWPMTVFLTPAAEPFFAGTYFPPRPMRGSASFPQVMAAIVDAW 145

Query: 118 DKKRDMLAQSGAFAIEQLSE--ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
             +R  + QSGA    QL+E  A   +AS      ++  + L      L+  +DS  GGF
Sbjct: 146 TARRAEVEQSGADIARQLAEAVAPGGAASGGGATTQITADLLDRAVAGLADRFDSVHGGF 205

Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
           G APKFP  +  +M+L    +  D    G       MV  T + MA+GG++D +GGGF R
Sbjct: 206 GGAPKFPPSMVAEMLLRSWARTGDGRALG-------MVRETCERMARGGMYDQLGGGFAR 258

Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
           YSVDE W VPHFEKMLYD  QL  VYL  +  T       + R+   +L  D+  P G  
Sbjct: 259 YSVDESWTVPHFEKMLYDNAQLLRVYLHLWRATGLPLAERVVRETAAFLLADLRTPEGGF 318

Query: 296 FSAEDADS--AETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSR 352
            SA DAD+  A + G    +EGA Y WT  ++ D+LG +   L      +   G+ +   
Sbjct: 319 ASALDADAVPAGSPGG-HPEEGASYSWTPAQLVDVLGPDDGALAARVLGVTAEGSFE--- 374

Query: 353 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 412
                    G +VL+   D    A                 R  L   R+ RP+P  DDK
Sbjct: 375 --------HGTSVLMLPADPEDPARFA------------RVRAALAAARATRPQPARDDK 414

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDE 471
           ++ +WNGLVI + A A  +L                    ++  AE AA  +R  HL++ 
Sbjct: 415 IVAAWNGLVIGALAEAGALLGE----------------PSWVGAAERAAELLRDVHLHEG 458

Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
           +  R     R GP+   G L+DY  +  G L L++      WL  A EL +     F   
Sbjct: 459 RLWRTSRDGRRGPNA--GVLEDYGCVAEGFLTLHQVTGAAGWLALAGELLDVVRARFAAP 516

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEH 590
           + GGYF+T  +  ++L R ++  D A PSG +     L+  A++   +   D  R   E 
Sbjct: 517 D-GGYFDTADDAEALLRRPRDASDSATPSGQAAVAGALLTYAALTGSADHRDSARATVEQ 575

Query: 591 SLAVF--ETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
              +   + R    A AV     A  +L+ P+   VV
Sbjct: 576 LTPLLSRDARFAGWAGAV-----AEALLAGPAEVAVV 607


>gi|427733870|ref|YP_007053414.1| thioredoxin domain-containing protein [Rivularia sp. PCC 7116]
 gi|427368911|gb|AFY52867.1| thioredoxin domain protein [Rivularia sp. PCC 7116]
          Length = 691

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 210/631 (33%), Positives = 309/631 (48%), Gaps = 93/631 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  VA+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDLEVAEYMNANFIPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
            FLSP DL P   GTYFPPE++Y RPGF  +L+ ++  +D ++  L +  A  +E L  S
Sbjct: 108 AFLSPDDLVPFYAGTYFPPEERYNRPGFLQVLKAIRHYYDTEKQDLQKRKAVILESLLTS 167

Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
             L   A++    ++L Q    +    ++ +             FP     QM L  S+ 
Sbjct: 168 AVLQTEATAETQDNQLLQKGWEIFTGIIAPNEQGN--------SFPTIPYAQMALQGSRF 219

Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
              +    +    Q+ +      +A GGI DHV GGFHRY+VD  W VPHFEKMLYD GQ
Sbjct: 220 NFTSRYDCKQICTQRGL-----DLALGGIFDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQ 274

Query: 257 LANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
           +     + +S  + +  F + I + +  +L+R+M  P G  ++A+DADS  T+     +E
Sbjct: 275 IVEYLANLWSAGVKEPAFETAIAKTV-KWLQREMTAPNGYFYAAQDADSFITQEDVEPEE 333

Query: 315 GAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
           GAFYVW   ++E +L    +   ++++ + P GN            F+ +NVL + N   
Sbjct: 334 GAFYVWGFSDLEQLLTRAELTELQQNFTVTPNGN------------FENQNVLQKRN--- 378

Query: 374 ASASKLGMPLEKYLNILGECRR-------KLF-----DVRSK------RPRPHLDDKVIV 415
             + +L   LE  L  L   R        K F     + ++K      R  P  D K+IV
Sbjct: 379 --SDRLSNTLEATLEKLFTARYGDDSSTIKTFAPARNNAQAKSHNWQGRIPPVTDTKMIV 436

Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTH 474
           +WN ++IS  ARA  +                  + EY+E+A  AA F+      D + +
Sbjct: 437 AWNAIMISGLARAYAVFS----------------QLEYLEMATQAAKFVLENQFVDGRFY 480

Query: 475 RLQHSFRNGPSKAPGFL---DDYAFLISGLLDLYE------FGSGTKWLVWAIELQNTQD 525
           RL +  +      PG L   +DYA  I  LLDL++       G    WL  A+ LQ   +
Sbjct: 481 RLNYEGK------PGVLAQSEDYALFIKALLDLHQACFKADTGKPAFWLEKAVSLQEEFN 534

Query: 526 ELFLDREGGGYFNTTGEDPSVLLRVKEDH--DGAEPSGNSVSVINLVRLASIVAGSKSDY 583
           +     E  GYFN T  D S  L V+E +  D A PS N +++ NLVRL  +    +   
Sbjct: 535 DYLWSVELHGYFN-TASDASKELIVRERNYIDSATPSANGIALCNLVRLTLVTDNLQ--- 590

Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           Y   AE +L  F   + D   A P +  A D
Sbjct: 591 YLNLAEQALTAFRGVMNDATQACPSLFVALD 621


>gi|425456902|ref|ZP_18836608.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9807]
 gi|389801878|emb|CCI18996.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9807]
          Length = 692

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 210/620 (33%), Positives = 311/620 (50%), Gaps = 74/620 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107

Query: 80  VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           VFL+PD L P  GGTYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L  SA   +    L   +L     + + +           P FP      + L  S+  +
Sbjct: 164 LRQSAILPRSETNLAAPSLLTTGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D   S + +  Q+      + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+ 
Sbjct: 224 DFDDSLQQAAYQRG-----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIV 278

Query: 259 NVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
               + +S   ++  +    +  +++L+R+M  P G  ++A+DADS E       +EGAF
Sbjct: 279 EYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAF 338

Query: 318 YVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW+  E+ D L    + L + ++ +   GN            F+G+NVL          
Sbjct: 339 YVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQG 381

Query: 377 SKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWN 418
            +LG  +E  L+ L     G  + +L      R                  D K+IV+WN
Sbjct: 382 GELGKEIENMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWN 441

Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQ 477
            L+IS  ARA          A+F  P+       Y ++A  A  FI ++ + D +  RL 
Sbjct: 442 SLMISGLARA---------FAVFGEPL-------YWQMATVATEFILKYQWLDGRFQRLN 485

Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGY 536
           +    G +      +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGY
Sbjct: 486 Y---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDEGGY 542

Query: 537 FNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
           FN T  D S+ L V+E    D A PS N +++ NL+RL+ +    +   Y   AE +L  
Sbjct: 543 FN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQS 598

Query: 595 FETRLKDMAMAVPLMCCAAD 614
           F T L+    A P +  A D
Sbjct: 599 FSTILEQSPTACPSLFVALD 618


>gi|359457589|ref|ZP_09246152.1| hypothetical protein ACCM5_02608 [Acaryochloris sp. CCMEE 5410]
          Length = 695

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 211/633 (33%), Positives = 302/633 (47%), Gaps = 99/633 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F +  +AK +N  ++ IKVDREERPD+D +YM  VQA+ G GGWPL+
Sbjct: 57  SSCHWCTVMEGEAFSNSEIAKYMNAQYIPIKVDREERPDIDSIYMQAVQAMTGQGGWPLN 116

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FLSP DL P  GGTYFP E KYGRPGF  +L  ++  +D +++ L        E+LS  
Sbjct: 117 MFLSPGDLVPFYGGTYFPEEPKYGRPGFLQVLEAIRSFYDTEKEKLDTQK----EKLSGH 172

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L +S   N + D  P+   +  A+  +   +   G     P FP      MM Y +  L 
Sbjct: 173 LQSSTVLNPIGDLQPELLSKGIAKNTTVLINKMPG-----PSFP------MMPYATIALH 221

Query: 199 DTG-KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            +   + E  + Q+        +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 222 GSRFSTSEQEQAQQACRQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 281

Query: 258 ANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
                + +S  + +  F   I   +  +L+R+M    G  ++A+DAD+  T      +EG
Sbjct: 282 VEYLANLWSTGVEEPAFKRAIAVTVA-WLQREMTAEAGYFYAAQDADNFVTTADIEPEEG 340

Query: 316 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
            FY WT  E+  +L  E      E + L   GN +            G  VL        
Sbjct: 341 RFYTWTDSELTHLLTPEEYAAMAEIFNLSVQGNFE-----------DGLTVLQRQQPGVI 389

Query: 375 SASKLGMPLEKYLNILGECRRKLFDVR-SKRPR------------------------PHL 409
           S +            + E  +KLF VR   RP                         P  
Sbjct: 390 SET------------VEEALQKLFQVRYGDRPESLKTFPPATHNQVAKTHPWPGRIPPVT 437

Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
           D K+IV+WN L+IS  ARA+ + +                + +Y+ +A  AASFI    +
Sbjct: 438 DTKMIVAWNSLMISGLARAAAVFQ----------------QPDYLALATKAASFILDQQW 481

Query: 470 DE-QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE------FGSGTKWLVWAIELQN 522
            E + HR+ +   +G        +DYA LI   LDL++       G  ++WL  A   Q 
Sbjct: 482 SEGRLHRVNY---DGEIAVIAQSEDYALLIKAFLDLHQACQSLAVGQASRWLEAAQTTQA 538

Query: 523 TQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 581
             DE     EGGGYFNT  E    +L+R +   D A P+ N V++ NL+RL+      ++
Sbjct: 539 EFDEHLWAVEGGGYFNTGSEISEELLIRERSWLDNATPAANGVAIANLIRLSLFC--DRT 596

Query: 582 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           +Y  Q AE +L  F   +     A P +  A D
Sbjct: 597 EYLSQ-AEQALQTFGQVMDSSTQACPSLFVALD 628


>gi|172036954|ref|YP_001803455.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
           ATCC 51142]
 gi|354554754|ref|ZP_08974058.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
           ATCC 51472]
 gi|171698408|gb|ACB51389.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
           ATCC 51142]
 gi|353553563|gb|EHC22955.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
           ATCC 51472]
          Length = 686

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 234/715 (32%), Positives = 334/715 (46%), Gaps = 102/715 (14%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  LND F+ IKVDREERPD+D +YM+ +Q +   GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFCDLAIATYLNDNFLPIKVDREERPDLDSIYMSSLQMMGIQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP E +YGRPGF  +L+ ++  +D +++ L     F  +++   
Sbjct: 108 IFLTPGDLVPFYGGTYFPVEPRYGRPGFLQVLQSIRRFYDVEKEKL---NGFK-QEIVNT 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQL-SKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKK 196
           L  SA        LP+  + +   QL  +  D        +A  F RP    M+ Y +  
Sbjct: 164 LQQSAI-------LPKTDINVNNAQLIYRGVDVNTKIIQVTAEDFGRPC-FPMIPYSNLA 215

Query: 197 LEDTG-KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
           L+ T    GE  E   +V+   Q +A GGI D VGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 216 LQGTRFLFGEPEERHILVIQRGQDLALGGIFDQVGGGFHRYTVDSTWTVPHFEKMLYDNG 275

Query: 256 QLANVYLDAFSLTKD--VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
           Q+     + +S  +    F   I   +  +L+R+M  P G  ++A+DADS  T+     +
Sbjct: 276 QIVEYLANLWSSGQQEPAFERAIALTV-QWLQREMTAPDGYFYAAQDADSFATKEDKEPE 334

Query: 314 EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
           EGAFYVW  +++E +L    +    + + + P GN            F+GKNVL   N  
Sbjct: 335 EGAFYVWEYEQLEQLLTSTELEALTDVFTITPEGN------------FEGKNVLQRRNKE 382

Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSK-------------RPRPHLDDKVIVSWNG 419
             S S   +  + +    G  R  L   ++              R  P  D K+IV+WNG
Sbjct: 383 KLSDSIETILDKLFKERYGTSRNNLDTFQAAKNNQDAKTIHWPGRIPPVTDTKMIVAWNG 442

Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 479
           L+IS  ARA  + K          P+       Y ++A +A  FI    +     R Q  
Sbjct: 443 LMISGLARAYAVFKQ---------PL-------YWQLACNATQFILEKQW--VNGRFQRI 484

Query: 480 FRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFN 538
              G        +DYAF I  LLDL       T+WL  A+E+Q   DE F   + GGY+N
Sbjct: 485 NYQGNPSILAQSEDYAFFIKALLDLQAANPQDTQWLDKAMEIQQEFDEYFWSVDTGGYYN 544

Query: 539 TTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
              ++ + LL R +   D A PS N +++ NLVRLA +        Y   AE +L  F  
Sbjct: 545 NADDNNNDLLVRERSYIDNATPSANGIAISNLVRLARLTDNLD---YLDKAEQALQAFSY 601

Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
            L++   A P +  A D           LV    +V              L   +    P
Sbjct: 602 VLRESPRACPSLLTALDWYHFG-----CLVRTNETV--------------LPTLITRYLP 642

Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
                +D   ++  NNA            + LVCQ  SC  P T    L + ++E
Sbjct: 643 TTAYRLD---DNLPNNA------------IGLVCQGLSCLEPATTQEQLLSQIIE 682


>gi|325676575|ref|ZP_08156253.1| thymidylate kinase [Rhodococcus equi ATCC 33707]
 gi|325552753|gb|EGD22437.1| thymidylate kinase [Rhodococcus equi ATCC 33707]
          Length = 674

 Score =  299 bits (766), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 192/559 (34%), Positives = 282/559 (50%), Gaps = 63/559 (11%)

Query: 22  CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
           CHWCHVM  ESFED+  A ++N+ FV IKVDREERPD+D VYM    A+ G GGWP++ F
Sbjct: 57  CHWCHVMAHESFEDDATAAVMNEHFVCIKVDREERPDLDAVYMNATVAMTGQGGWPMTCF 116

Query: 82  LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
           L+PD  P   GTY+P E + G P F  +L  V D W  +R  +  + A  + +L  + S 
Sbjct: 117 LTPDGAPFYCGTYYPREPRGGMPSFVQLLHAVTDTWRSRRGDVDDAAASVVAELRRS-SG 175

Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
           +  +   P ++P   L      + +  D   GGFG APKFP  + ++ +L   ++     
Sbjct: 176 ALPAGGAPIDVPL--LSGAVANVLRDEDRDHGGFGGAPKFPPSMLLEGLLRSYERT---- 229

Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
               A    + V  T + MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD   L   Y
Sbjct: 230 ---SAGPTLRAVERTAEAMARGGIYDQLGGGFARYSVDTQWVVPHFEKMLYDNALLVRFY 286

Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
                 T       +  + +D+L RD+    G   SA DAD       T  +EG  Y WT
Sbjct: 287 AHLARRTGSALARRVTEETVDFLLRDLRTAAGAFASALDAD-------TDGEEGLTYAWT 339

Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
           ++++ D++G +      E + +  TG  +           +G +VL    D         
Sbjct: 340 AQQIADVVGDDDGRWAAETFAVTDTGTFE-----------RGTSVLQLPAD--------- 379

Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
            PL+   + L + R +L   R++RP+P  DDKV+ +WNGL I++ A A   L        
Sbjct: 380 -PLDA--DRLADIRSRLLAARTRRPQPARDDKVVTAWNGLAITALAEAGAALG------- 429

Query: 441 FNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLI 498
                    R +++E AE  A  +   HL D    RL+ +   G    P G L+DY  L 
Sbjct: 430 ---------RADWVEAAEECAHMVLSTHLVD---GRLRRASLGGTVGEPAGILEDYGALA 477

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLLRVKEDHDGA 557
           +GL  L++     +WL  A  L +T  + F D  E G +F+T  +  +++ R ++  DGA
Sbjct: 478 TGLSTLHQVTGVAEWLEVATGLLDTAIDHFADPDEPGSWFDTADDAETLVARPRDPLDGA 537

Query: 558 EPSGNSVSVINLVRLASIV 576
            PSG SV+   L+  +S+V
Sbjct: 538 TPSGASVTTEALLTASSLV 556


>gi|290957891|ref|YP_003489073.1| hypothetical protein SCAB_34251 [Streptomyces scabiei 87.22]
 gi|260647417|emb|CBG70522.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
          Length = 691

 Score =  299 bits (765), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 220/707 (31%), Positives = 326/707 (46%), Gaps = 83/707 (11%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           + CHWCHVM  ESFED+G A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP+S
Sbjct: 54  SACHWCHVMAKESFEDKGTAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMS 113

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           VF++P  +P   GTYFPP  + G P F+ +L  V  AW  +R  +A         L+E  
Sbjct: 114 VFMTPAAEPFYFGTYFPPGPRQGMPSFRQVLEGVHHAWSSRRQEVADVAVKITRDLAE-R 172

Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
           +  A S+ LP    Q    L   QL++  DS  G F  + KFP  + ++ +L H  +   
Sbjct: 173 ALGAGSDGLPTGETQAQALL---QLTRDVDSTSGWFKGSTKFPPSMVVEFLLRHHAR--- 226

Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV---DERWHVPHFEKMLYDQGQ 256
           TG        ++M       MA+  ++D VGGGFHRY +    +   VPHFEKMLYD   
Sbjct: 227 TGSVA----AREMAEGLCGAMARSSLYDQVGGGFHRYVLLAHADGPLVPHFEKMLYDNAL 282

Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
           L  VY   +  T       +  +  D++ R++    G   SA DADS +  G+ +  EGA
Sbjct: 283 LCRVYAHLWRATGSEPARRVALETADFMVRELRTNEGGFASALDADSDDGTGSGKHVEGA 342

Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           +YVWT +++ ++LGE        Y+                        + E       A
Sbjct: 343 YYVWTPEQLTEVLGEEDAALAVRYF-----------------------GVTEEGTFEEGA 379

Query: 377 SKLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
           S L +P ++ +     +   R +L   RS+RP P  DDKV+ +WNGL +++ A       
Sbjct: 380 SVLQLPQQEGVFDAERIESVRERLLAARSRRPAPGRDDKVVAAWNGLAVAALAETGAYF- 438

Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLD 492
                          DR + ++ A +AA  + R   DE+  RL  + R+G + A  G L+
Sbjct: 439 ---------------DRPDLVDAAITAADLLVRLHLDERA-RLTRTSRDGQAGANAGVLE 482

Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
           DYA +  G L L        WL +A  L +     F+D   G  ++T  +   ++ R ++
Sbjct: 483 DYADVAEGFLALASVTGEGVWLEFAGFLLDHVLARFVDEGSGALYDTASDAEKLIRRPQD 542

Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---- 608
             D A PSG S +      L    A + S+ +R+ AE +L V    +K +    P     
Sbjct: 543 PTDNATPSGWSAAAGA---LLGYAAQTGSEPHRRAAERALGV----VKALGPRAPRFIGW 595

Query: 609 -MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
            +  A  +L  P  + V +VG +       +  AA         V+ +  AD +E+    
Sbjct: 596 GLATAEALLDGP--REVAVVGPEGHPGRRELHRAALLG-TAPGAVVAVGVADGDELPL-- 650

Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
                   +A       +  A VC++F+C  P TD   L  +L   P
Sbjct: 651 --------LAGRPLVGGEPAAYVCRHFTCDAPTTDADRLREVLGAAP 689


>gi|297626872|ref|YP_003688635.1| thioredoxin [Propionibacterium freudenreichii subsp. shermanii
           CIRM-BIA1]
 gi|296922637|emb|CBL57214.1| Conserved protein containing thioredoxin domain [Propionibacterium
           freudenreichii subsp. shermanii CIRM-BIA1]
          Length = 894

 Score =  298 bits (764), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 222/707 (31%), Positives = 328/707 (46%), Gaps = 85/707 (12%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESF D  VA+ +ND FV+I VDREERPDVD+V+M   QAL G GGWP++V
Sbjct: 49  SCHWCHVMAQESFRDPQVAQFVNDNFVAIAVDREERPDVDQVFMNATQALTGQGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           F +PD +P   GTYFP + + G+P F  + + +  AW ++RD + +SGA    QL++  S
Sbjct: 109 FCTPDGEPFFAGTYFPSQARVGQPSFLQVCQTLARAWAERRDEVVESGAHIASQLADQAS 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A+  +     E P  A  L A  L+   D   GGFG+APKFP+P  +  ++         
Sbjct: 169 AADPAGDQTGE-PPAADELLARALAL-VDPDNGGFGTAPKFPQPASLDALMV-------- 218

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ----GQ 256
             +GE  +    V  +L+ + +GGIHD VGGGFHRY+VD  W VPHFEKML D     G 
Sbjct: 219 --TGEPHQ-IGAVQLSLEHIVRGGIHDIVGGGFHRYAVDAAWAVPHFEKMLDDNALLLGT 275

Query: 257 LANVYLDAFSLTKDV--FYSYICRDILDYLRRDM---IGPGGEIFSAEDADSAETEGATR 311
           L   +      T D+   +    R I+ +L R+M      G    S +DADS + +G  +
Sbjct: 276 LTRAWRRTGPETGDLREHFELAIRGIVGWLSREMAITTDAGTAFASGQDADSLDADG--Q 333

Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEH-YYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
           + EGAFY+WT  +VE +      LF +  ++L P G                      + 
Sbjct: 334 RVEGAFYLWTPHQVEAVFNRRDALFAQAVFHLTPKGT---------------------MP 372

Query: 371 DSSASASKLGMP-LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
           D S++    G P  ++   ILGE R    +VR++RP P  DDKV+  WNGL+  S   A+
Sbjct: 373 DHSSTLRLHGDPDPDRLKRILGELR----EVRARRPAPARDDKVVAGWNGLLADSLTSAA 428

Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
            +         F  P       E++ +A S   ++    + +  H  + S       AP 
Sbjct: 429 MV---------FGEP-------EWLTMARSVLDYLWSVHHFDTDHAARSSLAGVAGPAPA 472

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
            L+DYA    G   L      T+ L  A+ +     ELF   + GG+F+    D ++  R
Sbjct: 473 VLEDYAGFALGAARLAGATGDTELLDRAVTVLGRGVELF-GADDGGFFDAQ-HDEALFTR 530

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEHSLAVFETRLKDMAMAVPL 608
            ++  D   PS  S+ V  L  +A +      +D  R+       V E         +  
Sbjct: 531 ARQLADEGGPSATSIMVTALQVVAGLTGNRDWADRARRAEPGLWQVLEQTPLASGWGLTQ 590

Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD--FW 666
           +   A   +   R  V +V  +S      +LA A        TV  +   D       F 
Sbjct: 591 LAIDAQATAGMGRAQVAIVDPESRP--MGLLARAVWRLAPEGTVAALGTPDAPGFGELFA 648

Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
           + H+ + A             A +C++ +C  PVTD   L + L  +
Sbjct: 649 QRHDIDGAP-----------TAYICRDETCFDPVTDFTRLRDPLWRR 684


>gi|158334352|ref|YP_001515524.1| hypothetical protein AM1_1172 [Acaryochloris marina MBIC11017]
 gi|158304593|gb|ABW26210.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 686

 Score =  298 bits (764), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 208/632 (32%), Positives = 303/632 (47%), Gaps = 97/632 (15%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F +  +AK +N  ++ IKVDREERPD+D +YM  VQA+ G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSNSEIAKYMNAQYIPIKVDREERPDIDSIYMQAVQAMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FLSP DL P  GGTYFP E +YGRPGF  +L  ++  +D +++ L        E+LS  
Sbjct: 108 MFLSPGDLVPFYGGTYFPEEPRYGRPGFLQVLEAIRSFYDTEKEKLDTQK----EKLSGH 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-KL 197
           L +S   N + D  P+    L ++ ++K+           P FP      + L+ S+   
Sbjct: 164 LQSSTVLNPIGDLQPE----LLSKGIAKNTTVLINKM-PGPSFPMMPYAAIALHGSRFST 218

Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
            D  K+ +A   + + L      A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+
Sbjct: 219 PDQEKAQQACRQRGLDL------ALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272

Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
                + +S   K+  +       + +L+R+M    G  ++A+DAD+  T      +EG 
Sbjct: 273 VEYLANLWSAGVKEPAFERAIAGTVAWLQREMTAEAGYFYAAQDADNFVTTADIEPEEGR 332

Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
           FY WT  E+  +L  E      E + L   GN +            G  VL        S
Sbjct: 333 FYTWTDSELTHLLTTEEYAAMAEIFNLSAQGNFE-----------DGLTVLQRQQPGVIS 381

Query: 376 ASKLGMPLEKYLNILGECRRKLFDVR-SKRPR------------------------PHLD 410
            +            + E  RKLF VR  +RP                         P  D
Sbjct: 382 ET------------VEEALRKLFQVRYGERPESLTTFPPATNNQVAKTHPWPGRIPPVTD 429

Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
            K+IV+WN L+IS  ARA+ + +                + +Y+ +A  AA FI    + 
Sbjct: 430 TKMIVAWNSLMISGLARAAAVFQ----------------QPDYLALATKAARFILDQQWS 473

Query: 471 E-QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE------FGSGTKWLVWAIELQNT 523
           E + HR+ +   +G        +DYA LI   LDL++          ++WL  A   Q  
Sbjct: 474 EGRLHRVNY---DGEIAVIAQSEDYALLIKAFLDLHQASQSLAVDQASRWLEAAQTTQAE 530

Query: 524 QDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD 582
            DE     EGGGYFNT  E    +L+R +   D A P+ N V++ NL+RL+ +    +++
Sbjct: 531 FDEHLWAVEGGGYFNTGSEMSEELLIRERSWLDNATPAANGVAIANLIRLSLVC--DRTE 588

Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
           Y  Q AE +L  F   +     A P +  A D
Sbjct: 589 YLSQ-AEQALQTFGQVMGSSTQACPSLFVALD 619


>gi|154251723|ref|YP_001412547.1| hypothetical protein Plav_1270 [Parvibaculum lavamentivorans DS-1]
 gi|154155673|gb|ABS62890.1| protein of unknown function DUF255 [Parvibaculum lavamentivorans
           DS-1]
          Length = 676

 Score =  298 bits (764), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 227/689 (32%), Positives = 339/689 (49%), Gaps = 74/689 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
            CHWCHVM  ESFEDE VA ++N+ FV+IKVDREERPD+D +YM+ +  L   GGWPL++
Sbjct: 50  ACHWCHVMAHESFEDESVAAVMNEHFVNIKVDREERPDIDAIYMSALHLLGQQGGWPLTM 109

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P+ +P  GGTYFP E  YGRPGF  +L +V   + ++   + ++    ++ L E  S
Sbjct: 110 FLTPEGEPFWGGTYFPKEPNYGRPGFVQVLEEVARIFREEPAKVYKNRTALVKALEEQ-S 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
           A+A   +   ++P     + AE+L +  D   GG   APKFP+ V +  +L+ +     T
Sbjct: 169 ATARPGEPTPQVPI----VVAEKLREIMDPVHGGIRGAPKFPQ-VPLLTLLWRAHL--RT 221

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
           G+   A+     V   L  M++GGI+DH+GGG+ RYSVDE W  PHFEKMLYD   L ++
Sbjct: 222 GREDLAAP----VSRALDHMSEGGIYDHLGGGYARYSVDEFWLAPHFEKMLYDNALLIDL 277

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
               +  T+   Y    R+ +++L R+M+  GG   ++ DADS   EG     EG FYVW
Sbjct: 278 LTLVWQETRKPLYERRIRETVEWLAREMVTEGGGFAASLDADS---EGV----EGKFYVW 330

Query: 321 TSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
           +  E++++L  GE A LFK+ Y +   GN            ++  N+L  L  + A  + 
Sbjct: 331 SEAEIDNLLTPGE-AELFKQVYNVSGEGN------------WEETNILNRLARADAPFTA 377

Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
                 +    L   + +LF  R  R  P  DDKV+  WNGL+I++ ARA          
Sbjct: 378 ------EEEAALEPLKARLFLERDLRVHPGFDDKVLADWNGLMIAALARAGAAFGEAG-- 429

Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
                         + E+A +A  F+   +   +  RL H++R G  +     DD A + 
Sbjct: 430 --------------WTEMAAAAFRFVMTEM--RKDGRLHHAWRAGKLQHIAMADDLANMA 473

Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
              L LYE     ++L  A  L       + D   GGYF T  + P++++R +   D A 
Sbjct: 474 DAALALYEATGEAEYLQAAESLAAELGAHYRDETNGGYFFTADDAPALIVRRRTVADDAT 533

Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
           P+ N      L RLA +    K DY  + A+  +  F   L+      PL    A + + 
Sbjct: 534 PAANGTMPGVLARLALMT--GKQDYLAR-ADELIRAFAGELQQNIF--PLGSYIASLDTR 588

Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
                +VL+G K+       LA A     L   V+ +  AD   +   E H ++  +   
Sbjct: 589 LKPVQIVLIGSKAET---AELARAAFGTSLPARVL-MRVADGSALP--EGHPAHGKTALD 642

Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLE 707
                 K  A VC   +CS PVT+  +LE
Sbjct: 643 G-----KPTAYVCAGETCSLPVTEAAALE 666


>gi|37521713|ref|NP_925090.1| hypothetical protein gll2144 [Gloeobacter violaceus PCC 7421]
 gi|35212711|dbj|BAC90085.1| gll2144 [Gloeobacter violaceus PCC 7421]
          Length = 650

 Score =  298 bits (763), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 226/697 (32%), Positives = 315/697 (45%), Gaps = 114/697 (16%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D  +A  +N  FV+IKVDREERPD+D +YM  +Q +   GGWPL+
Sbjct: 53  SSCHWCTVMENEAFSDPEIAGFMNAHFVAIKVDREERPDIDAIYMQALQLMNQQGGWPLN 112

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP +D+YGRPGF  +L  + D +  +R+ L        E++  A
Sbjct: 113 IFLTPGDLVPFYGGTYFPVQDRYGRPGFLRVLEAIHDYYRGQRERLGDHK----ERMLGA 168

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
           L A+     L  ELP + LR     L     +     G  P FP        L   + LE
Sbjct: 169 LEAATRLQPL-SELPPDPLRRAVPPLR----ALLARDGMGPSFPMIPHAGFALRMGRFLE 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
                     G+         +A GGI DHVGGGFHRY+VD  W VPHFEKMLYD GQ+ 
Sbjct: 224 VELAQSACERGED--------LATGGIFDHVGGGFHRYTVDGTWTVPHFEKMLYDNGQIV 275

Query: 259 NVYLDAFSLTKDV-FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
               D ++    +  +         +L R+M    G  ++A+DADS   EG    +EG F
Sbjct: 276 EFLSDLWASGLHIPAFERAVEFTHRWLLREMTDGRGYFYAAQDADS---EG----EEGKF 328

Query: 318 YVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
           YVW++ E+++IL GE     +  ++L   GN            F+G+  +++      S 
Sbjct: 329 YVWSASELQEILSGEELAALESAFFLSAEGN------------FEGRTTVLQRR----SG 372

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
             L   +E  L        KLF VRS+R     D K+IVSWN L+I+   RA+ +     
Sbjct: 373 DVLAPVVETALT-------KLFGVRSRRVPAATDTKLIVSWNALMIAGLNRAADVF---- 421

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGFLDDYA 495
                        R EY E A  AA FI  H     + +RL +   +G    P   +DYA
Sbjct: 422 ------------GRPEYRETAVGAARFILEHQRAPGEFYRLNY---DGEPAIPAHAEDYA 466

Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
             I  L+DLY      +WL  A  LQ   DE   D E GGYF+     P +L+R K+  D
Sbjct: 467 CFIKALIDLYVSTQQGEWLEAARALQQQMDERLWDLEMGGYFSAPS-GPDLLIREKDFQD 525

Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
            A P+ N ++  NLVRL  +   +    Y + AE  L  F   L ++  A P +    D 
Sbjct: 526 SATPAANGLAAANLVRLFLL---TDEPAYLEAAEALLRQFARILAEVPRAGPSLLAGYD- 581

Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM--DFWEEHNSNN 673
                                         +  N+ ++  DP    E+   +W       
Sbjct: 582 ------------------------------WYRNQVLVQSDPERIAELLRGYW------- 604

Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
            +           VALVC+   C  P+     LE  L
Sbjct: 605 PTAVFKAVDVKPAVALVCEGLRCLEPIESEAQLEAQL 641


>gi|86742579|ref|YP_482979.1| hypothetical protein Francci3_3900 [Frankia sp. CcI3]
 gi|86569441|gb|ABD13250.1| protein of unknown function DUF255 [Frankia sp. CcI3]
          Length = 673

 Score =  298 bits (763), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 212/617 (34%), Positives = 297/617 (48%), Gaps = 66/617 (10%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
           +CHWCHVM  ESFED   A+ +ND FV+IKVDREERPDVD VYM    AL G GGWP++V
Sbjct: 49  SCHWCHVMAHESFEDAATAEYMNDHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTV 108

Query: 81  FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
           FL+P  +P   GTYFPP  + G   F+ +L  V +AW  +RD + +SGA    +L+EA +
Sbjct: 109 FLTPTAEPFFAGTYFPPRPRPGMGSFRQVLTAVTEAWRTRRDEIEESGADIARRLAEAAT 168

Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
              +S  L  E+    L      LS  +D+R GGFG APKFP  +  +M+L HS +  D 
Sbjct: 169 RGPASG-LAAEITPALLDTAVAGLSARFDARHGGFGGAPKFPPSMVAEMLLRHSARTGD- 226

Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
                 +   +MV  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  V
Sbjct: 227 ------ARSLEMVAVTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNALLLRV 280

Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD---------SAETEGATR 311
           YL  +  T       + R+   +L  D+  P G   SA DAD         SA   GA  
Sbjct: 281 YLHLWRATGSALAERVVRETAAFLLADLRTPQGGFASALDADAVPADAVPASAAPAGA-H 339

Query: 312 KKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
            +EGA Y WT  +   +LG E        + +   G+ +           +G +VL    
Sbjct: 340 PEEGASYAWTPAQFVAVLGPEDGRWAAGVFGVTEQGSFE-----------RGTSVLRLPA 388

Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
           D          P +                   +P    DDKV+ +WNGL I++ A A  
Sbjct: 389 D----------PDDPARFAAVRAALAAARATRPQP--ARDDKVVAAWNGLAIAALAEAGA 436

Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPG 489
           +                 D  +++  AE AA  +R  HL + +  R     R G +   G
Sbjct: 437 LF----------------DEPDWVRAAEQAAVLLRDVHLVNGRLRRTSRDGRVGVNA--G 478

Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
            L+DY  +  GLL L++     +WL  A  L +   + F   + GG+F+T  +   +L R
Sbjct: 479 VLEDYGDVAEGLLTLHQVTGDPEWLALAGTLLDIVRDRFAASD-GGFFDTADDAEVLLRR 537

Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPL 608
            ++D D A PSG +     LV  A++   + S  +R  AE ++A     L +D   A   
Sbjct: 538 PRDDSDSATPSGQAAVAGALVSYAAL---TGSTEHRSAAETTVARVAPLLARDARFAGWA 594

Query: 609 MCCAADMLSVPSRKHVV 625
              A  +L+ P+   VV
Sbjct: 595 GAVAEALLAGPAEVAVV 611


>gi|121604944|ref|YP_982273.1| hypothetical protein Pnap_2043 [Polaromonas naphthalenivorans CJ2]
 gi|120593913|gb|ABM37352.1| protein of unknown function DUF255 [Polaromonas naphthalenivorans
           CJ2]
          Length = 610

 Score =  297 bits (761), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 198/602 (32%), Positives = 309/602 (51%), Gaps = 52/602 (8%)

Query: 21  TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLS 79
            CHWCHVM  ESF D  +A L+N+ FV+IKVDREERPD+D VY    Q L   GGGWPL+
Sbjct: 49  ACHWCHVMAAESFSDPAIAALMNEGFVNIKVDREERPDLDAVYQMAHQLLRRTGGGWPLT 108

Query: 80  VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
           +FLSP   P   GTYFP     G+  F+ +L  V   W ++R  LA+      +Q   A 
Sbjct: 109 IFLSPQGVPFYSGTYFPSAAPEGQATFQAVLGSVSAVWREQRPALARQ-----DQALLAA 163

Query: 140 SASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
            A+++  +    +P  A+R  A +QL+ ++D   GGFG+APKFP P ++  +L  +++  
Sbjct: 164 LAASAPRRDDAAVPGAAVRAQALQQLATAFDPAQGGFGAAPKFPHPSDLAFLLRRAREEG 223

Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
           D       ++ ++M L TL+ MA+GG++D +GGGF RYSVD +W +PHFEKML D G L 
Sbjct: 224 D-------AQAREMALLTLRKMAEGGLYDQIGGGFFRYSVDAQWRIPHFEKMLCDNGVLL 276

Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
            +Y DA +LT +  +  +  D   +  R+M    G   ++  AD A+       +EG FY
Sbjct: 277 ALYADALALTGEPLFRRVVEDTASWALREMQSSAGGFHASLAADDAQ------GREGRFY 330

Query: 319 VWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-A 376
           VW S+ +   L  +   +   H+ L          +  P   F+G++  + +  ++   A
Sbjct: 331 VWESEPLRLALSPNEWDVCAAHWGL----------VDGPG--FEGRHWHLRVARAAGPLA 378

Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
             L  P  +   ++   R KL   R KR RP  D K++  W  L+++  ARAS + +   
Sbjct: 379 VTLRRPEAQVEELIASARPKLLAERDKRERPARDAKLLTGWTALMMTGLARASAVCQ--- 435

Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
                        R E++  A SA  F++   + +      H     P +A  FLDD+AF
Sbjct: 436 -------------RPEWLLAARSALRFVQAGRWQDDGRTSGHLLAL-PGQA-AFLDDHAF 480

Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
           L+  +L L++       L +A  +       F DR+ GG+F T  + P+++ R+K   D 
Sbjct: 481 LLEAVLALHDADPQPGDLPFAQAIAKAMLAQFEDRDAGGFFFTRHDAPALIHRLKTGLDA 540

Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
           A PSGN  + + L+ L+  +   ++  YR  AE  + VF   + +   + P +  AA++L
Sbjct: 541 ATPSGNGTAALALLALSGKLDAPQAAAYRLAAERCVRVFAATVLNDPASFPRLLQAAELL 600

Query: 617 SV 618
             
Sbjct: 601 QA 602


>gi|302865439|ref|YP_003834076.1| N-acylglucosamine 2-epimerase [Micromonospora aurantiaca ATCC
           27029]
 gi|302568298|gb|ADL44500.1| N-acylglucosamine 2-epimerase [Micromonospora aurantiaca ATCC
           27029]
          Length = 678

 Score =  297 bits (761), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 221/699 (31%), Positives = 321/699 (45%), Gaps = 72/699 (10%)

Query: 11  KTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTY 66
           K R    LI+     CHWCHVM  ESFE+E VA+L+ND FV +KVDREERPDVD VYMT 
Sbjct: 34  KRRDVPVLISVGYAACHWCHVMAHESFENEAVARLMNDDFVCVKVDREERPDVDAVYMTA 93

Query: 67  VQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ 126
            QA+ G GGWP++VF +PD  P   GTYFP      R  F  +L  V  AW  +R+ + +
Sbjct: 94  TQAMTGQGGWPMTVFATPDGTPFFCGTYFP------RANFIRLLGSVATAWRDQREAVLR 147

Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
            G   +E +  A +    +  L  EL    L   A +L+  YD   GGFG APKFP  + 
Sbjct: 148 QGTAVVEAIGGAQAVGGVTAPLTAEL----LDAAASRLAGEYDETNGGFGGAPKFPPHMN 203

Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           +  +L H ++   TG    ++   ++V  T + MA+GG++D + GGF RYSVD  W VPH
Sbjct: 204 LLFLLRHHQR---TG----SARSLEIVRHTCEAMARGGLNDQLAGGFARYSVDGHWTVPH 256

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           FEKMLYD   L  VY   + LT D     + RD   +L  ++   G    SA DAD+   
Sbjct: 257 FEKMLYDNALLLRVYTQLWRLTGDRLARRVARDTARFLADELHRAGEGFASALDADTEGV 316

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
           EG T       YVWT  ++ ++LGE    F            DL  ++       G +VL
Sbjct: 317 EGLT-------YVWTPDQLVEVLGEDDGRFA----------ADLFEVTADGTFEHGTSVL 359

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
               D   +  ++     ++ +++G    +L   R  RP+P  DDKV+ +WNGL I++ A
Sbjct: 360 RLARDVDDADPEV---RARWQDVVG----RLLAARDTRPQPARDDKVVAAWNGLAITAIA 412

Query: 427 R----ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 482
                AS ++  + E A     V+        + AE  A+    HL D +  R+      
Sbjct: 413 EFQQVASLLVSPDDEDANLMDGVLIVSDGAMRDAAEHLATV---HLVDGRLRRVSRDKVV 469

Query: 483 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 542
           G  +  G L+DY  +      +++     +WL  A EL +     F   + G +++T  +
Sbjct: 470 G--QPAGVLEDYGCVAEAFCAMHQLTGEGRWLTLAGELLDVALARFAGPD-GAFYDTADD 526

Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 602
              ++ R  +  D A PSG S  V  LV  A++   ++   YR+ AE +L      +   
Sbjct: 527 AERLVTRPADPTDNATPSGRSAIVAALVAYAALTGETR---YREAAEKTLTTVAPIVDRH 583

Query: 603 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE 662
           A          + L     +  V  G       + ++AAA         V+   P     
Sbjct: 584 ARFTGYAATVGEALLSGPYEIAVATGDPEG---DPLVAAARRHAPPGAVVVAGAP----- 635

Query: 663 MDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
                        +A   F   K  A VC+ F C  PVT
Sbjct: 636 ------DQPGVPLLAGRPFVDGKPAAYVCRGFVCQRPVT 668


>gi|315501987|ref|YP_004080874.1| n-acylglucosamine 2-epimerase [Micromonospora sp. L5]
 gi|315408606|gb|ADU06723.1| N-acylglucosamine 2-epimerase [Micromonospora sp. L5]
          Length = 678

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 221/699 (31%), Positives = 321/699 (45%), Gaps = 72/699 (10%)

Query: 11  KTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTY 66
           K R    LI+     CHWCHVM  ESFE+E VA+L+ND FV +KVDREERPDVD VYMT 
Sbjct: 34  KRRDVPVLISVGYAACHWCHVMAHESFENEAVARLMNDDFVCVKVDREERPDVDAVYMTA 93

Query: 67  VQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ 126
            QA+ G GGWP++VF +PD  P   GTYFP      R  F  +L  V  AW  +R+ + +
Sbjct: 94  TQAMTGQGGWPMTVFATPDGTPFFCGTYFP------RANFIRLLGSVATAWRDQREAVLR 147

Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
            G   +E +  A +    +  L  EL    L   A +L+  YD   GGFG APKFP  + 
Sbjct: 148 QGTAVVEAIGGAQAVGGVTAPLTAEL----LDAAASRLAGEYDETNGGFGGAPKFPPHMN 203

Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
           +  +L H ++   TG    ++   ++V  T + MA+GG++D + GGF RYSVD  W VPH
Sbjct: 204 LLFLLRHHQR---TG----SARSLEIVRHTCEAMARGGLNDQLAGGFARYSVDGHWTVPH 256

Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
           FEKMLYD   L  VY   + LT D     + RD   +L  ++   G    SA DAD+   
Sbjct: 257 FEKMLYDNALLLRVYTQLWRLTGDRLARRVARDTARFLADELHRAGEGFASALDADTEGV 316

Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
           EG T       YVWT  ++ ++LGE    F            DL  ++       G +VL
Sbjct: 317 EGLT-------YVWTPGQLVEVLGEDDGRFA----------ADLFEVTADGTFEHGTSVL 359

Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
               D   +  ++     ++ +++G    +L   R  RP+P  DDKV+ +WNGL I++ A
Sbjct: 360 RLARDVDDADPEV---RARWQDVVG----RLLAARDTRPQPARDDKVVAAWNGLAITAIA 412

Query: 427 R----ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 482
                AS ++  + E A     V+        + AE  A+    HL D +  R+      
Sbjct: 413 EFQQVASLLVSPDDEDANLMDGVLIVSDGAMRDAAEHLATV---HLVDGRLRRVSRDKVV 469

Query: 483 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 542
           G  +  G L+DY  +      +++     +WL  A EL +     F   + G +++T  +
Sbjct: 470 G--QPAGVLEDYGCVAEAFCAMHQLTGEGRWLTLAGELLDVALARFAGPD-GAFYDTADD 526

Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 602
              ++ R  +  D A PSG S  V  LV  A++   ++   YR+ AE +L      +   
Sbjct: 527 AERLVTRPADPTDNATPSGRSAIVAALVAYAALTGETR---YREAAEKTLTTVAPIVDRH 583

Query: 603 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE 662
           A          + L     +  V  G       + ++AAA         V+   P     
Sbjct: 584 ARFTGYAATVGEALLSGPYEIAVATGDPEG---DPLVAAARRHAPPGAVVVAGAP----- 635

Query: 663 MDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
                        +A   F   K  A VC+ F C  PVT
Sbjct: 636 ------DQPGVPLLAGRPFVDGKPAAYVCRGFVCQRPVT 668


>gi|427723011|ref|YP_007070288.1| hypothetical protein Lepto7376_1084 [Leptolyngbya sp. PCC 7376]
 gi|427354731|gb|AFY37454.1| hypothetical protein Lepto7376_1084 [Leptolyngbya sp. PCC 7376]
          Length = 681

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 210/623 (33%), Positives = 303/623 (48%), Gaps = 86/623 (13%)

Query: 20  NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
           ++CHWC VME E+F D+ +A  LN  F+ IKVDREERPD+D +YM  +Q + G GGWPL+
Sbjct: 48  SSCHWCTVMEGEAFSDQAIADYLNANFLPIKVDREERPDIDSIYMQALQLMTGQGGWPLN 107

Query: 80  VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
           +FL+P DL P  GGTYFP   +Y RPGF  +L  ++  +D + + L +      E++   
Sbjct: 108 IFLTPDDLIPFYGGTYFPVSPRYNRPGFLDVLSSIRHFYDDEPERLKEIK----EEIFTI 163

Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHSK 195
           L  S +       LP   L L    L KS ++  G  G     P FP      + L  S+
Sbjct: 164 LDRSVT-------LPTTELSLDQTLLEKSIEACTGVVGRVSHGPSFPMIPYAAIALQGSR 216

Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
             E+T   G A   ++ +      +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G
Sbjct: 217 FTENTKHDGSAITKKRGL-----DLALGGIYDHVGGGFHRYTVDPNWTVPHFEKMLYDNG 271

Query: 256 Q----LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
           Q    LAN++ +    T +  +       +++L R+M  P G  ++A+DADS    G   
Sbjct: 272 QITEFLANLWANG---TTEPSFKTALEGTVEWLSREMTAPQGYFYAAQDADSFLDAGHVE 328

Query: 312 KKEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
            +EG FYVW   E++    + A    +E+++++P GN            F+GK VL    
Sbjct: 329 PEEGTFYVWDFDELQTQFSDTAFQELQENFFIEPDGN------------FEGKIVL---- 372

Query: 371 DSSASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDK 412
               +++++   L+  LN L     G  R+ L      R                  D K
Sbjct: 373 -KRRASTEIPESLQATLNQLFAERYGGDRQSLETFPPARDNAEAKNTDWAGRIPAVTDTK 431

Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
           +IV+WN L+IS  AR   +L  E                +  ++A +  +FI    + E 
Sbjct: 432 LIVAWNALMISGLARIYGVLSLE----------------KAWDLAVNCVNFILETQWQE- 474

Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDR 531
            H  + +F   P       +DYAFLI  LLDL     + T WL  AI LQ+  D  F   
Sbjct: 475 GHLYRLNFGEEPDGVAQ-SEDYAFLIKALLDLQANNPTETHWLDKAITLQSEFDAKFWSA 533

Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
           E  GYFN T E   +L++ +   D A PS N ++V NL+RL  +   ++   Y   AE +
Sbjct: 534 ETKGYFNNT-EAKELLIKERSYQDNATPSANGIAVTNLIRLFLL---TEDLAYLDKAEQA 589

Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
           L  F   L   +   P +  A D
Sbjct: 590 LQTFAVVLDKSSQQAPSLIAALD 612


>gi|365866818|ref|ZP_09406418.1| hypothetical protein SPW_6722 [Streptomyces sp. W007]
 gi|364003721|gb|EHM24861.1| hypothetical protein SPW_6722 [Streptomyces sp. W007]
          Length = 619

 Score =  296 bits (758), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 202/573 (35%), Positives = 289/573 (50%), Gaps = 57/573 (9%)

Query: 27  VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 86
           +M  ESFEDE VA  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++VFL+ D 
Sbjct: 1   MMAHESFEDETVAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTADA 60

Query: 87  KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 146
           +P   GTYFPPE ++G P F+ +L  V  AW  +R+ +A+     +  L+   S     +
Sbjct: 61  EPFYFGTYFPPEARHGSPSFQQVLEGVVAAWTDRREEVAEVAGRIVADLA-GRSLVHGGD 119

Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
            +P E  + A  L    L++ YD + GGFG APKFP  + ++ +L H  +   TG  G  
Sbjct: 120 GVPGE-QETAQALLG--LTREYDEQHGGFGGAPKFPPSMAVEFLLRHYAR---TGSEG-- 171

Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
               +M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   + 
Sbjct: 172 --ALQMAADTCSAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWR 229

Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
            T       I  +  D++ R++    G   SA DADS + +G  R  EGAFYVWT  ++ 
Sbjct: 230 TTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--RHVEGAFYVWTPGQLR 287

Query: 327 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
           ++LGE    F   Y+           +++     +G +VL    D+         P++  
Sbjct: 288 EVLGEDDAAFAAAYF----------GVTEEGTFEEGASVLRLPGDTG--------PVDA- 328

Query: 387 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 446
              + + R +L   R++RPRP  DDKV+ +WNGL I++ A                    
Sbjct: 329 -ARVADVRARLLAARAERPRPGRDDKVVAAWNGLAIAALAETGAYF-------------- 373

Query: 447 GSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDL 504
             DR + +E A  AA   +R HL   +  RL  + ++G      G L+DY  +  G L L
Sbjct: 374 --DRPDLVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDVAEGFLAL 429

Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
                   WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A PSG + 
Sbjct: 430 AAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSATPSGWTA 488

Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
           +   L+   S  A + S+ +R  AE +L V + 
Sbjct: 489 AAGALL---SYAAYTGSEAHRTAAEGALGVVKA 518


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.404 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,821,563,226
Number of Sequences: 23463169
Number of extensions: 524598392
Number of successful extensions: 1077751
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1480
Number of HSP's successfully gapped in prelim test: 102
Number of HSP's that attempted gapping in prelim test: 1066747
Number of HSP's gapped (non-prelim): 2237
length of query: 718
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 568
effective length of database: 8,839,720,017
effective search space: 5020960969656
effective search space used: 5020960969656
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)