BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 005023
(718 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359479833|ref|XP_002267103.2| PREDICTED: spermatogenesis-associated protein 20-like [Vitis
vinifera]
Length = 819
Score = 1225 bits (3169), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 582/698 (83%), Positives = 633/698 (90%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFE+EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS
Sbjct: 122 STCHWCHVMEVESFENEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 181
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAW+ KRD+L +SGAFAIEQLSEAL
Sbjct: 182 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWENKRDVLVKSGAFAIEQLSEAL 241
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SA+ASSNKL D +PQ AL LCAEQL+ +YD +GGFGSAPKFPRPVEIQ+MLYH KKLE+
Sbjct: 242 SATASSNKLADGIPQQALHLCAEQLAGNYDPEYGGFGSAPKFPRPVEIQLMLYHYKKLEE 301
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+GKSGEA+E KMV F+LQCMA+GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 302 SGKSGEANEVLKMVAFSLQCMARGGVHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 361
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YLD FS+TKDVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE+E A RKKEGAFY+
Sbjct: 362 AYLDVFSITKDVFYSCVSRDILDYLRRDMIGPEGEIFSAEDADSAESEDAARKKEGAFYI 421
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WTSKEVED++GEHA LFK+HYY+KP+GNCDLSRMSDPHNEFKGKNVLIE N +SA ASKL
Sbjct: 422 WTSKEVEDVIGEHASLFKDHYYIKPSGNCDLSRMSDPHNEFKGKNVLIERNCASAMASKL 481
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
GMP+EKYL+ILG CRRKLFDVR RPRPHLDDKVIVSWNGL ISSFARASKILKSEAE
Sbjct: 482 GMPVEKYLDILGTCRRKLFDVRLNRPRPHLDDKVIVSWNGLAISSFARASKILKSEAEGT 541
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
F FPVVG D KEYMEVAE AASFIR+ LYDEQT RL+HSFRNGPSKAPGFLDDYAFLIS
Sbjct: 542 KFRFPVVGCDPKEYMEVAEKAASFIRKWLYDEQTRRLRHSFRNGPSKAPGFLDDYAFLIS 601
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLD+YEFG T WLVWAIELQ+TQDELFLD+EGGGYFNT GEDPSVLLRVKEDHDGAEP
Sbjct: 602 GLLDIYEFGGNTNWLVWAIELQDTQDELFLDKEGGGYFNTPGEDPSVLLRVKEDHDGAEP 661
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVSVINLVRL S+VAGS + +R+NAEH LAVFETRLKDMAMAVPLMCC ADM SVP
Sbjct: 662 SGNSVSVINLVRLTSMVAGSWFERHRRNAEHLLAVFETRLKDMAMAVPLMCCGADMFSVP 721
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VVLVGHKSSV+FE+MLAAAHA YD N+TVIHIDP +TE+M+FWE NSN A MA+N
Sbjct: 722 SRKQVVLVGHKSSVEFEDMLAAAHAQYDPNRTVIHIDPTETEQMEFWEAMNSNIALMAKN 781
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
NF+ DKVVALVCQNF+CS PVTD SL+ LL KPSS
Sbjct: 782 NFAPDKVVALVCQNFTCSSPVTDSTSLKALLCLKPSSA 819
>gi|296086616|emb|CBI32251.3| unnamed protein product [Vitis vinifera]
Length = 754
Score = 1224 bits (3167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 582/698 (83%), Positives = 633/698 (90%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFE+EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS
Sbjct: 57 STCHWCHVMEVESFENEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAW+ KRD+L +SGAFAIEQLSEAL
Sbjct: 117 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWENKRDVLVKSGAFAIEQLSEAL 176
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SA+ASSNKL D +PQ AL LCAEQL+ +YD +GGFGSAPKFPRPVEIQ+MLYH KKLE+
Sbjct: 177 SATASSNKLADGIPQQALHLCAEQLAGNYDPEYGGFGSAPKFPRPVEIQLMLYHYKKLEE 236
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+GKSGEA+E KMV F+LQCMA+GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 237 SGKSGEANEVLKMVAFSLQCMARGGVHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 296
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YLD FS+TKDVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE+E A RKKEGAFY+
Sbjct: 297 AYLDVFSITKDVFYSCVSRDILDYLRRDMIGPEGEIFSAEDADSAESEDAARKKEGAFYI 356
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WTSKEVED++GEHA LFK+HYY+KP+GNCDLSRMSDPHNEFKGKNVLIE N +SA ASKL
Sbjct: 357 WTSKEVEDVIGEHASLFKDHYYIKPSGNCDLSRMSDPHNEFKGKNVLIERNCASAMASKL 416
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
GMP+EKYL+ILG CRRKLFDVR RPRPHLDDKVIVSWNGL ISSFARASKILKSEAE
Sbjct: 417 GMPVEKYLDILGTCRRKLFDVRLNRPRPHLDDKVIVSWNGLAISSFARASKILKSEAEGT 476
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
F FPVVG D KEYMEVAE AASFIR+ LYDEQT RL+HSFRNGPSKAPGFLDDYAFLIS
Sbjct: 477 KFRFPVVGCDPKEYMEVAEKAASFIRKWLYDEQTRRLRHSFRNGPSKAPGFLDDYAFLIS 536
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLD+YEFG T WLVWAIELQ+TQDELFLD+EGGGYFNT GEDPSVLLRVKEDHDGAEP
Sbjct: 537 GLLDIYEFGGNTNWLVWAIELQDTQDELFLDKEGGGYFNTPGEDPSVLLRVKEDHDGAEP 596
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVSVINLVRL S+VAGS + +R+NAEH LAVFETRLKDMAMAVPLMCC ADM SVP
Sbjct: 597 SGNSVSVINLVRLTSMVAGSWFERHRRNAEHLLAVFETRLKDMAMAVPLMCCGADMFSVP 656
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VVLVGHKSSV+FE+MLAAAHA YD N+TVIHIDP +TE+M+FWE NSN A MA+N
Sbjct: 657 SRKQVVLVGHKSSVEFEDMLAAAHAQYDPNRTVIHIDPTETEQMEFWEAMNSNIALMAKN 716
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
NF+ DKVVALVCQNF+CS PVTD SL+ LL KPSS
Sbjct: 717 NFAPDKVVALVCQNFTCSSPVTDSTSLKALLCLKPSSA 754
>gi|255559290|ref|XP_002520665.1| conserved hypothetical protein [Ricinus communis]
gi|223540050|gb|EEF41627.1| conserved hypothetical protein [Ricinus communis]
Length = 874
Score = 1204 bits (3115), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/698 (82%), Positives = 637/698 (91%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYMT+VQALYGGGGWPLS
Sbjct: 62 STCHWCHVMEVESFEDESVAKLLNDWFVSIKVDREERPDVDKVYMTFVQALYGGGGWPLS 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPPED YGRPGFKT+LRKVKDAWDKKRD+L +SGAFAIEQLSEAL
Sbjct: 122 VFLSPDLKPLMGGTYFPPEDNYGRPGFKTLLRKVKDAWDKKRDVLIKSGAFAIEQLSEAL 181
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SASAS+NKLPD LPQNALR CAEQLS+SYD+RFGGFGSAPKFPRPVEIQ+MLYH+KKLED
Sbjct: 182 SASASTNKLPDGLPQNALRSCAEQLSQSYDARFGGFGSAPKFPRPVEIQLMLYHAKKLED 241
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ K +A EG KMV +LQCMAKGGIHDH+GGGFHRYSVDERWHVPHFEKMLYDQGQLAN
Sbjct: 242 SEKVDDAKEGFKMVFSSLQCMAKGGIHDHIGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 301
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+YLDAFS+T DVFYS++ RDILDYLRRDMIG GEIFSAEDADSAE EGA +K+EGAFYV
Sbjct: 302 IYLDAFSITNDVFYSFVSRDILDYLRRDMIGQKGEIFSAEDADSAEHEGAKKKREGAFYV 361
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT KE++DILGEHA LFK+HYY+KP GNCDLSRMSDPH EFKGKNVLIELND SA ASK
Sbjct: 362 WTDKEIDDILGEHATLFKDHYYIKPLGNCDLSRMSDPHKEFKGKNVLIELNDPSALASKH 421
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+P+EKY +ILGE +R LFDVR++RPRPHLDDKVIVSWNGL IS+FARASKILK E+E
Sbjct: 422 GLPIEKYQDILGESKRMLFDVRARRPRPHLDDKVIVSWNGLAISAFARASKILKRESEGT 481
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+NFPVVG D +EY+EVAE+AA+FIR+HLY+EQT RLQHSFRNGPSKAPGFLDDYAFLIS
Sbjct: 482 RYNFPVVGCDPREYIEVAENAATFIRKHLYEEQTRRLQHSFRNGPSKAPGFLDDYAFLIS 541
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLDLYEFG G WLVWA ELQNTQDELFLD+EGGGYFNT GEDPSVLLRVKEDHDGAEP
Sbjct: 542 GLLDLYEFGGGIYWLVWATELQNTQDELFLDKEGGGYFNTPGEDPSVLLRVKEDHDGAEP 601
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVS INL+RLAS+V GSKS+ YR NAEH LAVFETRLKDMAMAVPLMCCAADM+SVP
Sbjct: 602 SGNSVSAINLIRLASMVTGSKSECYRHNAEHLLAVFETRLKDMAMAVPLMCCAADMISVP 661
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VVLVGHK S + ++MLAAAH SYD NKTVIHIDP + EEM+FW ++NSN A MA+N
Sbjct: 662 SRKQVVLVGHKPSSELDDMLAAAHESYDPNKTVIHIDPTNNEEMEFWADNNSNIALMAKN 721
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
NF+ADKVVA+VCQNF+CSPPVTDP SL+ LL +KP++
Sbjct: 722 NFTADKVVAVVCQNFTCSPPVTDPKSLKALLSKKPAAV 759
>gi|449436537|ref|XP_004136049.1| PREDICTED: spermatogenesis-associated protein 20-like [Cucumis
sativus]
Length = 855
Score = 1191 bits (3082), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 561/717 (78%), Positives = 626/717 (87%), Gaps = 3/717 (0%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K FL +TCHWCHVMEVESFE++ VAKLLNDWFVSIKVDREERPD
Sbjct: 139 GEEAFAEAQKRNVPIFLSIGYSTCHWCHVMEVESFENKEVAKLLNDWFVSIKVDREERPD 198
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMTYVQALY GGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAWD
Sbjct: 199 VDKVYMTYVQALYSGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWD 258
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
KRD+L +SG FAIEQLSEAL+ +ASSNKLP+ELPQNAL LCAEQLS+SYD FGGFGSA
Sbjct: 259 NKRDVLVKSGTFAIEQLSEALATTASSNKLPEELPQNALHLCAEQLSQSYDPNFGGFGSA 318
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFPRPVE Q+MLY++K+LE++GKS EA E MV+F LQCMA+GGIHDHVGGGFHRYSV
Sbjct: 319 PKFPRPVEAQLMLYYAKRLEESGKSDEAEEILNMVIFGLQCMARGGIHDHVGGGFHRYSV 378
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DE WHVPHFEKMLYDQGQ+ NVYLDAFS+TKDVFYS++ RD+LDYLRRDMIG GEI+SA
Sbjct: 379 DECWHVPHFEKMLYDQGQITNVYLDAFSITKDVFYSWVSRDVLDYLRRDMIGTQGEIYSA 438
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADSAE+EGATRKKEGAFYVWT KE++DILGEHA FKEHYY+KP+GNCDLSRMSDPH+
Sbjct: 439 EDADSAESEGATRKKEGAFYVWTRKEIDDILGEHADFFKEHYYIKPSGNCDLSRMSDPHD 498
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
EFKGKNVLIE+ S AS MP+EKYL ILGECR+KLF+VR +RP+PHLDDKVIVSWN
Sbjct: 499 EFKGKNVLIEMKSVSEMASNHSMPVEKYLEILGECRQKLFEVRERRPKPHLDDKVIVSWN 558
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL ISSFARASKIL++E E F FPVVG D KEY +VAE AA FI+ LYDEQTHRLQH
Sbjct: 559 GLTISSFARASKILRNEKEGTRFYFPVVGCDPKEYFDVAEKAALFIKTKLYDEQTHRLQH 618
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
SFRNGPSKAPGFLDDYAFLI GLLDLYE+G G WLVWAIELQ TQDELFLDREGGGY+N
Sbjct: 619 SFRNGPSKAPGFLDDYAFLIGGLLDLYEYGGGLNWLVWAIELQATQDELFLDREGGGYYN 678
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
TTGED SV+LRVKEDHDGAEPSGNSVS INLVRL+S+V+GS+S+YYRQNAEH LAVFE R
Sbjct: 679 TTGEDKSVILRVKEDHDGAEPSGNSVSAINLVRLSSLVSGSRSNYYRQNAEHLLAVFEKR 738
Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
LK+MA+AVPL+CCAA M S+PSRK VVLVGHK+S FE LAAAHASYD N+TVIH+DP
Sbjct: 739 LKEMAVAVPLLCCAAGMFSIPSRKQVVLVGHKNSTQFETFLAAAHASYDPNRTVIHVDPT 798
Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
D E+ FWEE+N + A MA+NNF+ADKVVALVCQNF+C P+TDP SLE +L EKPS
Sbjct: 799 DDTELQFWEENNRSIAVMAKNNFAADKVVALVCQNFTCKAPITDPGSLEAMLAEKPS 855
>gi|449498445|ref|XP_004160539.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
20-like [Cucumis sativus]
Length = 855
Score = 1183 bits (3060), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 557/717 (77%), Positives = 622/717 (86%), Gaps = 3/717 (0%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K FL +TCHWCHVMEVESFE++ VAKLLNDWFVSIKVDREERPD
Sbjct: 139 GEEAFAEAQKRNVPIFLSIGYSTCHWCHVMEVESFENKEVAKLLNDWFVSIKVDREERPD 198
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMTYVQALY GGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAWD
Sbjct: 199 VDKVYMTYVQALYSGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWD 258
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
KRD+L +SG FAIEQLSEAL+ +ASSNKLP+ELPQNAL LCAEQLS+SYD FGGFGSA
Sbjct: 259 NKRDVLVKSGTFAIEQLSEALATTASSNKLPEELPQNALHLCAEQLSQSYDPNFGGFGSA 318
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFPRPVE Q+MLY++K+LE++GKS EA E MV+F LQCMA+GGIHDHVGGGFHRYSV
Sbjct: 319 PKFPRPVEAQLMLYYAKRLEESGKSDEAEEILNMVIFGLQCMARGGIHDHVGGGFHRYSV 378
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DE WHVPHFEKMLYDQG + NVYLDAFS+TKD YS++ RD+LDYLRRDMIG GEI+SA
Sbjct: 379 DECWHVPHFEKMLYDQGXITNVYLDAFSITKDXLYSWVSRDVLDYLRRDMIGTQGEIYSA 438
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADSAE+EGATR KEGAFYVWT KE++DILGEHA FKEHYY+KP+GNCDLSRMSDPH+
Sbjct: 439 EDADSAESEGATRXKEGAFYVWTRKEIDDILGEHADFFKEHYYIKPSGNCDLSRMSDPHD 498
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
EFKGKNVLIE+ S AS MP+EKYL ILGECR+KLF+VR +RP+PHLDDKVIVSWN
Sbjct: 499 EFKGKNVLIEMKSVSEMASNHSMPVEKYLEILGECRQKLFEVRERRPKPHLDDKVIVSWN 558
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL ISSFARASKIL++E E F FPVVG D KEY +VAE AA FI+ LYDEQTHRLQH
Sbjct: 559 GLTISSFARASKILRNEKEGTRFYFPVVGCDPKEYFDVAEKAALFIKTKLYDEQTHRLQH 618
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
SFRNGPSKAPGFLDDYAFLI GLLDLYE+G G WLVWAIELQ TQDELFLDREGGGY+N
Sbjct: 619 SFRNGPSKAPGFLDDYAFLIGGLLDLYEYGGGLNWLVWAIELQATQDELFLDREGGGYYN 678
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
TTGED SV+LRVKEDHDGAEPSGNSVS INLVRL+S+V+GS+S+YYRQNAEH LAVFE R
Sbjct: 679 TTGEDKSVILRVKEDHDGAEPSGNSVSAINLVRLSSLVSGSRSNYYRQNAEHLLAVFEKR 738
Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
LK+MA+AVPL+CCAA M S+PSRK VVLVGHK+S FE LAAAHASYD N+TVIH+DP
Sbjct: 739 LKEMAVAVPLLCCAAGMFSIPSRKQVVLVGHKNSTQFETFLAAAHASYDPNRTVIHVDPT 798
Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
D E+ FWEE+N + A MA+NNF+ADKVVALVCQNF+C P+TDP SLE +L EKPS
Sbjct: 799 DDTELQFWEENNRSIAVMAKNNFAADKVVALVCQNFTCKAPITDPGSLEAMLAEKPS 855
>gi|356570951|ref|XP_003553646.1| PREDICTED: spermatogenesis-associated protein 20-like [Glycine max]
Length = 755
Score = 1165 bits (3014), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 552/688 (80%), Positives = 608/688 (88%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYM+YVQALYGGGGWPLS
Sbjct: 56 STCHWCHVMEVESFEDEAVAKLLNDWFVSIKVDREERPDVDKVYMSYVQALYGGGGWPLS 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPP+DKYGRPGFKTILRK+K+AWD KRDML + G++AIEQLSEA+
Sbjct: 116 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRKLKEAWDSKRDMLIKRGSYAIEQLSEAM 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SAS+ S+KLPD +P +ALRLC+EQLS SYDS+FGGFGSAPKFPRPVEI +MLYHSKKLED
Sbjct: 176 SASSDSDKLPDGVPADALRLCSEQLSGSYDSKFGGFGSAPKFPRPVEINLMLYHSKKLED 235
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TGK A+ QKMV F+LQCMAKGG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 236 TGKLDGANRIQKMVFFSLQCMAKGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 295
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RKKEGAFY+
Sbjct: 296 VYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARKKEGAFYI 355
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT KEV DILGEHA LF+EHYY+K +GNC+LS MSDPH+EFKGKNVLIE + S ASK
Sbjct: 356 WTGKEVADILGEHAALFEEHYYIKQSGNCNLSGMSDPHDEFKGKNVLIERKEPSELASKY 415
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
GM +E Y ILGECR KLF+VRS+RP+PHLDDKVIVSWNGL ISSFARASKILK E E
Sbjct: 416 GMSIETYQEILGECRHKLFEVRSRRPKPHLDDKVIVSWNGLAISSFARASKILKGEVEGT 475
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
F FPVVG++ K Y+ +AE AA FI + LY+ +THRL HSFR+ PSKAP FLDDYAFLIS
Sbjct: 476 KFYFPVVGTEAKGYLRIAEKAAFFIWKQLYNVETHRLHHSFRHSPSKAPAFLDDYAFLIS 535
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLDLYEFG G WL+WAIELQ TQD LFLDR GGGYFN TGED SVLLRVKEDHDGAEP
Sbjct: 536 GLLDLYEFGGGINWLLWAIELQETQDALFLDRTGGGYFNNTGEDSSVLLRVKEDHDGAEP 595
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVS INL+RLAS+VAGSK+++Y+QNAEH LAVFE RLKDMAMAVPLMCCAADML VP
Sbjct: 596 SGNSVSAINLIRLASMVAGSKAEHYKQNAEHLLAVFERRLKDMAMAVPLMCCAADMLHVP 655
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VV+VG ++S DFENMLAAAHA YD N+TVIHIDP + EEM FWE +NSN A MA+N
Sbjct: 656 SRKQVVVVGERTSGDFENMLAAAHALYDPNRTVIHIDPNNKEEMGFWEVNNSNVALMAKN 715
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLE 707
NF+ DKVVALVCQNF+CSPPVTD SLE
Sbjct: 716 NFAVDKVVALVCQNFTCSPPVTDHSSLE 743
>gi|115432144|gb|ABI97349.1| cold-induced thioredoxin domain-containing protein [Ammopiptanthus
mongolicus]
Length = 839
Score = 1157 bits (2993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 567/712 (79%), Positives = 621/712 (87%), Gaps = 3/712 (0%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F ++ FL +TCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPD
Sbjct: 119 GEEAFSEASRRDVPIFLSIGYSTCHWCHVMEVESFEDEEVAKLLNDWFVSIKVDREERPD 178
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKTILRKVK+AWD
Sbjct: 179 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRKVKEAWD 238
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
KRDML +SGAF IEQLSEALSAS+ S+KLPD +P AL LC+EQLS SYDS+FGGFGSA
Sbjct: 239 SKRDMLIKSGAFTIEQLSEALSASSVSDKLPDGVPDEALNLCSEQLSGSYDSKFGGFGSA 298
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFPRPVE +MLYHS+KLEDTGK G A+E QKMV F LQCMAKGGIHDH+GGGFHRYSV
Sbjct: 299 PKFPRPVEFNLMLYHSRKLEDTGKLGAANESQKMVFFNLQCMAKGGIHDHIGGGFHRYSV 358
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DE WHVPHFEKMLYDQGQLANVYLDAFS+TKD FYS I +DILDYLRRDMIGP GEIFSA
Sbjct: 359 DECWHVPHFEKMLYDQGQLANVYLDAFSITKDTFYSCISQDILDYLRRDMIGPEGEIFSA 418
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADSAE EGATRKKEGAFY+WTSKEVEDILG+HA LFKEHYY+K +GNCDLSRMSDPH+
Sbjct: 419 EDADSAEIEGATRKKEGAFYIWTSKEVEDILGDHAALFKEHYYIKQSGNCDLSRMSDPHD 478
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
EFKGKNVLIE D+S ASK GM +E Y ILGECRRKLF+VRS+R RPHLDDKVIVSWN
Sbjct: 479 EFKGKNVLIERKDTSEMASKYGMSVETYQEILGECRRKLFEVRSRRSRPHLDDKVIVSWN 538
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL ISSFARASKILK EAE FNFPVVG++ KEY+ +AE AA FIR+ LYD +THRL H
Sbjct: 539 GLAISSFARASKILKREAEGTKFNFPVVGTEPKEYLVIAEKAAFFIRKQLYDVETHRLHH 598
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
SFRN PSKAPGFLDDYAFLISGLLDLYEFG G WL+WA ELQ TQD LFLDR+GGGYFN
Sbjct: 599 SFRNSPSKAPGFLDDYAFLISGLLDLYEFGGGINWLLWAFELQETQDALFLDRDGGGYFN 658
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
GEDPSVLLRVKEDHDGAEPSGNSVS INL+RLAS+VAGSK+ Y++NAEH LAVFE R
Sbjct: 659 NAGEDPSVLLRVKEDHDGAEPSGNSVSAINLIRLASMVAGSKAADYKRNAEHLLAVFEKR 718
Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
LKDMAMAVPLMCCAADML VPSRK VV+VG +S +FE+MLAAAHASYD N+TV+HIDP
Sbjct: 719 LKDMAMAVPLMCCAADMLRVPSRKQVVVVGERSFEEFESMLAAAHASYDPNRTVVHIDPN 778
Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
EEM+FWE +NSN A MA+NN+ +KVVALVCQNF+CSPPVTD ++LE LL
Sbjct: 779 YKEEMEFWEVNNSNIALMAKNNYRVNKVVALVCQNFTCSPPVTDHLALEALL 830
>gi|356505532|ref|XP_003521544.1| PREDICTED: spermatogenesis-associated protein 20-like [Glycine max]
Length = 809
Score = 1157 bits (2992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 560/698 (80%), Positives = 622/698 (89%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYM+YVQALYGGGGWPLS
Sbjct: 110 STCHWCHVMEVESFEDEAVAKLLNDWFVSIKVDREERPDVDKVYMSYVQALYGGGGWPLS 169
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPP+DKYGRPGFKTILRKVK+AWD KRDML +SG++AIEQLSEA+
Sbjct: 170 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRKVKEAWDSKRDMLIKSGSYAIEQLSEAM 229
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SAS+ S+KLPD +P +ALRLC+EQLS SYDS+FGGFGSAPKFPRPVEI +MLYHSKKLED
Sbjct: 230 SASSDSDKLPDGVPADALRLCSEQLSGSYDSKFGGFGSAPKFPRPVEINLMLYHSKKLED 289
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TGK G A+ Q+MV F+LQCMAKGGIHDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 290 TGKLGVANGSQQMVFFSLQCMAKGGIHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 349
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RKKEGAFY+
Sbjct: 350 VYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARKKEGAFYI 409
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WTSKEVED+LGEHA LF+EHYY+K GNCDLS MSDPH+EFKGKNVLIE + S ASK
Sbjct: 410 WTSKEVEDLLGEHAALFEEHYYIKQLGNCDLSGMSDPHDEFKGKNVLIERKEPSELASKY 469
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
GM +E Y ILGECR KLF+VRS+RP+PHLDDKVIVSWNGL ISSFARASKILK EAE
Sbjct: 470 GMSVETYQEILGECRHKLFEVRSRRPKPHLDDKVIVSWNGLAISSFARASKILKGEAEGT 529
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
F FPV+G++ KEYM +AE AASFIR+ LY+ +THRL HSFR+ PSKAP FLDDYAFLIS
Sbjct: 530 KFYFPVIGTEPKEYMGIAEKAASFIRKQLYNVETHRLHHSFRHSPSKAPAFLDDYAFLIS 589
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLDLYEFG G WL+WAIELQ TQD LFLD+ GGGYFN TGED SVLLRVKEDHDGAEP
Sbjct: 590 GLLDLYEFGGGISWLLWAIELQETQDALFLDKTGGGYFNNTGEDASVLLRVKEDHDGAEP 649
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVS INL+RLAS+VAGSK+++Y++NAEH LAVFE RLKDMAMAVPLMCCAADML V
Sbjct: 650 SGNSVSAINLIRLASMVAGSKAEHYKRNAEHLLAVFEKRLKDMAMAVPLMCCAADMLRVL 709
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VV+VG ++S DFENMLAAAHA YD N+TVIHIDP + +EM+FWE +NSN A MA+N
Sbjct: 710 SRKQVVVVGERTSEDFENMLAAAHAVYDPNRTVIHIDPNNKDEMEFWEVNNSNVALMAKN 769
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
NF+ +KVVALVCQNF+CSP VTD SL+ LL +KPSS+
Sbjct: 770 NFAVNKVVALVCQNFTCSPSVTDHSSLKALLSKKPSSS 807
>gi|224132400|ref|XP_002321330.1| predicted protein [Populus trichocarpa]
gi|222862103|gb|EEE99645.1| predicted protein [Populus trichocarpa]
Length = 756
Score = 1154 bits (2985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 567/683 (83%), Positives = 618/683 (90%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM+VESFEDE VA+LLND FVS+KVDREERPDVDKVYMT+VQALYGGGGWPLS
Sbjct: 61 STCHWCHVMKVESFEDEEVAELLNDSFVSVKVDREERPDVDKVYMTFVQALYGGGGWPLS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF+SPDLKPLMGGTYFPP+DKYGRPGFKTILRKVKDAW KRD L +SGAFAIEQLSEAL
Sbjct: 121 VFISPDLKPLMGGTYFPPDDKYGRPGFKTILRKVKDAWFSKRDTLVKSGAFAIEQLSEAL 180
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SASASS KLPDEL QNAL LCAEQLS+SYDSR+GGFGSAPKFPRPVEIQ+MLYHSKKL+D
Sbjct: 181 SASASSKKLPDELSQNALHLCAEQLSQSYDSRYGGFGSAPKFPRPVEIQLMLYHSKKLDD 240
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G E+ +G +MV FTLQCMA+GGIHDH+GGGFHRYSVDERWHVPHFEKMLYDQGQL N
Sbjct: 241 AGNYSESKKGLQMVFFTLQCMARGGIHDHIGGGFHRYSVDERWHVPHFEKMLYDQGQLVN 300
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYLDAFS+T DVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE E A +KKEGAFY+
Sbjct: 301 VYLDAFSITNDVFYSSLSRDILDYLRRDMIGPEGEIFSAEDADSAEREDAKKKKEGAFYI 360
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WTS+E++D+LGEHA LFK+HYY+KP GNCDLSRMSDP +EFKGKNVLIEL D+SA A K
Sbjct: 361 WTSQEIDDLLGEHATLFKDHYYVKPLGNCDLSRMSDPQDEFKGKNVLIELTDTSAPAKKY 420
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+PLEKYL+ILGECR+KLFD RS+ PRPHLDDKVIVSWNGL ISS ARASKIL EAE
Sbjct: 421 GLPLEKYLDILGECRQKLFDARSRGPRPHLDDKVIVSWNGLAISSLARASKILMGEAEGT 480
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+NFPVVG D KEYM AE AASFIRRHLY+EQ HRL+HSFRNGPSKAPGFLDDYAFLIS
Sbjct: 481 KYNFPVVGCDPKEYMTAAEKAASFIRRHLYNEQAHRLEHSFRNGPSKAPGFLDDYAFLIS 540
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLDLYE G G WLVWA ELQN QDELFLDREGGGYFNT GEDPSVLLRVKEDHDGAEP
Sbjct: 541 GLLDLYEVGGGIHWLVWATELQNKQDELFLDREGGGYFNTPGEDPSVLLRVKEDHDGAEP 600
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVS INL+RLAS++ GSKS+YYRQNAEH LAVFE+RLKDMAMAVPLMCCAADM+SVP
Sbjct: 601 SGNSVSAINLIRLASMMTGSKSEYYRQNAEHLLAVFESRLKDMAMAVPLMCCAADMISVP 660
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
S K VVLVGHKSS++F+ MLAAAHASYD N+TVIHIDP D EEM+ WE++NSN A MARN
Sbjct: 661 SHKQVVLVGHKSSLEFDKMLAAAHASYDPNRTVIHIDPTDNEEMEIWEDNNSNIALMARN 720
Query: 680 NFSADKVVALVCQNFSCSPPVTD 702
NF+ADKVVALVCQNF+CSPPVTD
Sbjct: 721 NFAADKVVALVCQNFTCSPPVTD 743
>gi|357511183|ref|XP_003625880.1| Spermatogenesis-associated protein [Medicago truncatula]
gi|355500895|gb|AES82098.1| Spermatogenesis-associated protein [Medicago truncatula]
Length = 809
Score = 1145 bits (2962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 561/708 (79%), Positives = 621/708 (87%), Gaps = 11/708 (1%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFEDEG+AKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPL+
Sbjct: 102 STCHWCHVMEVESFEDEGIAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLT 161
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVK+AW+ KRDML +SG FAIEQLSEAL
Sbjct: 162 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKEAWENKRDMLVKSGTFAIEQLSEAL 221
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S+S++S+KLPD + ++ALRLC+EQLS++YDS +GGFGSAPKFPRPVEI +MLY SKKLED
Sbjct: 222 SSSSNSDKLPDGVSEDALRLCSEQLSENYDSEYGGFGSAPKFPRPVEINLMLYKSKKLED 281
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH-----------VPHFE 248
TGK A++ QKMV FTLQCMAKGG+HDHVGGGFHRYSVDE WH VPHFE
Sbjct: 282 TGKLDGANKSQKMVFFTLQCMAKGGVHDHVGGGFHRYSVDECWHDIYSLSSYTHAVPHFE 341
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYDQGQLANVYLDAFS+TKD FYS + RDILDYLRRDMIGP GEIFSAEDADSAE EG
Sbjct: 342 KMLYDQGQLANVYLDAFSITKDTFYSSLSRDILDYLRRDMIGPEGEIFSAEDADSAENEG 401
Query: 309 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
TRKKEGAFYVWTSKEVED+LGEHA LF+EHYY+K GNCDLS MSDPHNEFKGKNVLIE
Sbjct: 402 DTRKKEGAFYVWTSKEVEDLLGEHAALFEEHYYIKQMGNCDLSEMSDPHNEFKGKNVLIE 461
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
DSS ASK GM +E Y ILGECRRKLF+VR KRP+PHLDDKVIVSWNGLVISSFARA
Sbjct: 462 RKDSSEMASKYGMSIETYQEILGECRRKLFEVRLKRPKPHLDDKVIVSWNGLVISSFARA 521
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
SKILK EAE FNFPVVG++ KEY+ +A+ AASFI+ LY+ +THRLQHSFRN PSKAP
Sbjct: 522 SKILKGEAEGIKFNFPVVGTEPKEYLRIADKAASFIKNQLYNTETHRLQHSFRNSPSKAP 581
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
GFLDDYAFLISGLLDLYEFG WL+WAIELQ TQD LFLD++GGGYFN TGED SVLL
Sbjct: 582 GFLDDYAFLISGLLDLYEFGGEINWLLWAIELQETQDTLFLDKDGGGYFNNTGEDSSVLL 641
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
RVKEDHDGAEPSGNSVS +NL+RLAS+V+GSK+++Y++NAEH LAVFE RLKD AMAVPL
Sbjct: 642 RVKEDHDGAEPSGNSVSALNLIRLASLVSGSKAEHYKRNAEHLLAVFEKRLKDTAMAVPL 701
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
MCCAADML VPSRK VVLVG ++S +FE+ML AAHA YD N+TVIHIDP + EEMDFWE
Sbjct: 702 MCCAADMLRVPSRKQVVLVGERTSEEFESMLGAAHALYDPNRTVIHIDPNNKEEMDFWEV 761
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+NSN A MA+NN+S KVVALVCQNF+CS PVTD SLE LL +KPSS
Sbjct: 762 NNSNIALMAKNNYSGSKVVALVCQNFTCSAPVTDHSSLEALLSQKPSS 809
>gi|147817761|emb|CAN68939.1| hypothetical protein VITISV_028994 [Vitis vinifera]
Length = 1575
Score = 1122 bits (2903), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 541/680 (79%), Positives = 589/680 (86%), Gaps = 21/680 (3%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CHVMEVESFE+EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP
Sbjct: 88 CHVMEVESFENEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 147
Query: 85 DLKPLMGGTYFPPEDKYGRPGFKTILR------------------KVKDAWDKKRDMLAQ 126
DLKPLMGGTYFPP+DKYGRPGFKT+LR KVKDAW+ KRD+L +
Sbjct: 148 DLKPLMGGTYFPPDDKYGRPGFKTVLRMSIFVFVLAILLYLYSFRKVKDAWENKRDVLVK 207
Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
SGAFAIEQLSEALSA+ASSNKL D +PQ AL LCAEQL+ +YD +GGFGSAPKFPRPVE
Sbjct: 208 SGAFAIEQLSEALSATASSNKLADGIPQQALHLCAEQLAGNYDPEYGGFGSAPKFPRPVE 267
Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
IQ+MLYH KKLE++GKSGEA+E KMV F+LQCMA+GG+HDH+GGGFHRYSVDE WHVPH
Sbjct: 268 IQLMLYHYKKLEESGKSGEANEVLKMVAFSLQCMARGGVHDHIGGGFHRYSVDECWHVPH 327
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
FEKMLYDQGQLAN YLD FS+TKDVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE+
Sbjct: 328 FEKMLYDQGQLANAYLDVFSITKDVFYSCVSRDILDYLRRDMIGPEGEIFSAEDADSAES 387
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
E A RKKEGAFY+WTSKEVED++GEHA LFK+HYY+KP+GNCDLSRMSDPHNEFKGKNVL
Sbjct: 388 EDAARKKEGAFYIWTSKEVEDVIGEHASLFKDHYYIKPSGNCDLSRMSDPHNEFKGKNVL 447
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
IE N +SA ASKLGMP+EKYL+ILG CRRKLFDVR RPRPHLDDKVIVSWNGL ISSFA
Sbjct: 448 IERNCASAMASKLGMPVEKYLDILGTCRRKLFDVRLNRPRPHLDDKVIVSWNGLAISSFA 507
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
RASKILKSEAE F FPVVG D KEYMEVAE AASFIR+ LYDEQT RL+HSFRNGPSK
Sbjct: 508 RASKILKSEAEGTKFRFPVVGCDPKEYMEVAEKAASFIRKWLYDEQTRRLRHSFRNGPSK 567
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
APGFLDDYAFLISGLLD+YEFG T WLVWAIELQ+TQ GEDPSV
Sbjct: 568 APGFLDDYAFLISGLLDIYEFGGNTNWLVWAIELQDTQAWTLYPVPSP---ILGGEDPSV 624
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
LLRVKEDHDGAEPSGNSVSVINLVRL S+VAGS + +R+NAEH LAVFETRLKDMAMAV
Sbjct: 625 LLRVKEDHDGAEPSGNSVSVINLVRLTSMVAGSWFERHRRNAEHLLAVFETRLKDMAMAV 684
Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
PLMCC ADM SVPSRK VVLVGHKSSV+FE+MLAAAHA YD N+TVIHIDP +TE+M+FW
Sbjct: 685 PLMCCGADMFSVPSRKQVVLVGHKSSVEFEDMLAAAHAQYDPNRTVIHIDPTETEQMEFW 744
Query: 667 EEHNSNNASMARNNFSADKV 686
E NSN A MA+NNF+ DK+
Sbjct: 745 EAMNSNIALMAKNNFAPDKL 764
>gi|30679394|ref|NP_192229.3| uncharacterized protein [Arabidopsis thaliana]
gi|332656888|gb|AEE82288.1| uncharacterized protein [Arabidopsis thaliana]
Length = 818
Score = 1099 bits (2842), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/691 (75%), Positives = 596/691 (86%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFEDE VAKLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLS
Sbjct: 126 STCHWCHVMEVESFEDEEVAKLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLS 185
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+AL
Sbjct: 186 VFLSPDLKPLMGGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKAL 245
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SAS ++KL D + + A+ CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL++
Sbjct: 246 SASTGADKLSDGISREAVSTCAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKE 305
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+GK+ EA E + MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 306 SGKTSEADEEKSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 365
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYLD FS+TKDV YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+
Sbjct: 366 VYLDGFSITKDVMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYI 425
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WTS E++++LGE+A LFKEHYY+K +GNCDLS SDPHNEF GKNVLIE N++SA ASK
Sbjct: 426 WTSDEIDEVLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKF 485
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ +EKY ILGECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES
Sbjct: 486 SLSVEKYQEILGECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPEST 545
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ FPVV S ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLIS
Sbjct: 546 KYYFPVVNSQPEDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLIS 605
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLDLYE G G +WL WAI+LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEP
Sbjct: 606 GLLDLYENGGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEP 665
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVS INLVRLASIVAG K++ Y A LAVFE RL+++A+AVPLMCC+ADM+SVP
Sbjct: 666 SGNSVSAINLVRLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVP 725
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VVLVG KSS + NML+AAH+ YD NKTVIHIDP+ ++E++FWEEHNSN A MA+
Sbjct: 726 SRKQVVLVGSKSSPELTNMLSAAHSVYDPNKTVIHIDPSSSDEIEFWEEHNSNVAEMAKK 785
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N +++KVVALVCQ+F+CSPPV D SL LL
Sbjct: 786 NRNSEKVVALVCQHFTCSPPVFDSSSLTRLL 816
>gi|17064908|gb|AAL32608.1| predicted protein of unknown function [Arabidopsis thaliana]
gi|34098807|gb|AAQ56786.1| At4g03200 [Arabidopsis thaliana]
Length = 756
Score = 1099 bits (2842), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/691 (75%), Positives = 596/691 (86%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFEDE VAKLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLS
Sbjct: 64 STCHWCHVMEVESFEDEEVAKLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLS 123
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+AL
Sbjct: 124 VFLSPDLKPLMGGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKAL 183
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SAS ++KL D + + A+ CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL++
Sbjct: 184 SASTGADKLSDGISREAVSTCAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKE 243
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+GK+ EA E + MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 244 SGKTSEADEEKSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 303
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYLD FS+TKDV YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+
Sbjct: 304 VYLDGFSITKDVMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYI 363
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WTS E++++LGE+A LFKEHYY+K +GNCDLS SDPHNEF GKNVLIE N++SA ASK
Sbjct: 364 WTSDEIDEVLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKF 423
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ +EKY ILGECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES
Sbjct: 424 SLSVEKYQEILGECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPEST 483
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ FPVV S ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLIS
Sbjct: 484 KYYFPVVNSQPEDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLIS 543
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLDLYE G G +WL WAI+LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEP
Sbjct: 544 GLLDLYENGGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEP 603
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVS INLVRLASIVAG K++ Y A LAVFE RL+++A+AVPLMCC+ADM+SVP
Sbjct: 604 SGNSVSAINLVRLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVP 663
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VVLVG KSS + NML+AAH+ YD NKTVIHIDP+ ++E++FWEEHNSN A MA+
Sbjct: 664 SRKQVVLVGSKSSPELTNMLSAAHSVYDPNKTVIHIDPSSSDEIEFWEEHNSNVAEMAKK 723
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N +++KVVALVCQ+F+CSPPV D SL LL
Sbjct: 724 NRNSEKVVALVCQHFTCSPPVFDSSSLTRLL 754
>gi|297813987|ref|XP_002874877.1| hypothetical protein ARALYDRAFT_911883 [Arabidopsis lyrata subsp.
lyrata]
gi|297320714|gb|EFH51136.1| hypothetical protein ARALYDRAFT_911883 [Arabidopsis lyrata subsp.
lyrata]
Length = 812
Score = 1092 bits (2824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 516/691 (74%), Positives = 594/691 (85%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFEDE VAKLLND FVSIKVDREERPDVDKVYM++VQALYGGGGWPLS
Sbjct: 120 STCHWCHVMEVESFEDEEVAKLLNDSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLS 179
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPP D YGRPGFKT+L+KVKDAWD KRD L +SG +AIE+L++AL
Sbjct: 180 VFLSPDLKPLMGGTYFPPNDNYGRPGFKTLLKKVKDAWDSKRDTLVKSGTYAIEELTKAL 239
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SASA ++KL D + + A+ +CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLY+ KKL++
Sbjct: 240 SASAGADKLSDGISREAVSICAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYYFKKLKE 299
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+GK+ EA E Q MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN
Sbjct: 300 SGKTSEADEEQSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLAN 359
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYLD F +TKDV YSY+ +DILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+
Sbjct: 360 VYLDGFIITKDVIYSYVAKDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYI 419
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W+S E++++LGE+A LFKEHYY+K +GNCDLS SDPHNEF GKNVLIE N+ SA ASK
Sbjct: 420 WSSDEIDEVLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNEMSAMASKF 479
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ +EKY ILGECR+KLFDVR RP+PHLDDK+IVSWNGLVISSFARASK+LK+E ES
Sbjct: 480 SLSVEKYQEILGECRKKLFDVRLNRPKPHLDDKIIVSWNGLVISSFARASKMLKAEPEST 539
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ FPVV S +EY+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLI+
Sbjct: 540 KYCFPVVNSQPEEYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLIA 599
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLDLYE G G +WL WAI+LQ TQDEL+LDREGG YFNT G+D SVLLRVKEDHDGAEP
Sbjct: 600 GLLDLYENGGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDSSVLLRVKEDHDGAEP 659
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVS INLVRLASIV G K+D Y A LAVFE RL++MA+AVPLMCCAADM+SVP
Sbjct: 660 SGNSVSAINLVRLASIVTGEKADSYLNTAHRLLAVFELRLREMAVAVPLMCCAADMISVP 719
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VVLVG KSS + NML+AAH+ YD NKTVIHIDP++++EM+FWEE+NSN A MA+
Sbjct: 720 SRKQVVLVGSKSSPELNNMLSAAHSVYDPNKTVIHIDPSNSDEMEFWEEYNSNVAEMAKK 779
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N +++KVVALVCQ+F+CSPPV D SL LL
Sbjct: 780 NRNSEKVVALVCQHFTCSPPVFDSSSLTRLL 810
>gi|319428654|gb|ADV56678.1| hypothetical protein [Phaseolus vulgaris]
Length = 804
Score = 1085 bits (2805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 545/732 (74%), Positives = 603/732 (82%), Gaps = 48/732 (6%)
Query: 26 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 85
H+ VESFED VAKLLNDWFVSIKVDREERPDVDK ALYGGGGWPLSVFLSPD
Sbjct: 78 HLSLVESFEDAAVAKLLNDWFVSIKVDREERPDVDK-------ALYGGGGWPLSVFLSPD 130
Query: 86 LKPLMGGTYFPPEDKYGRPGFKTILR-------------KVKDAWDKKRDMLAQSGAFAI 132
LKPLMGGTYFPP+DKYGRPGFKTILR KVK AWD KRDML +SGAFAI
Sbjct: 131 LKPLMGGTYFPPDDKYGRPGFKTILRFLFVYSSVPAFSRKVKQAWDSKRDMLIKSGAFAI 190
Query: 133 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
EQLSEA+S S++S+KLPD +P +ALRLC+EQLS YDS+FGGFGSAPKFPRPVEI +MLY
Sbjct: 191 EQLSEAMSISSTSDKLPDGVPADALRLCSEQLSGGYDSKFGGFGSAPKFPRPVEINLMLY 250
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
HSKKLE+TGK A+ QKMVLF+LQCMAKGGIHDH+GGGFHRYSVDE WHVPHFEKMLY
Sbjct: 251 HSKKLEETGKLDGANGSQKMVLFSLQCMAKGGIHDHIGGGFHRYSVDECWHVPHFEKMLY 310
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
DQGQLANVYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RK
Sbjct: 311 DQGQLANVYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARK 370
Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
KEGAFY+W SKEV+DILGEHA LF+EHYY+K +GNCDLS MSDPHNEFK KNVLIE +
Sbjct: 371 KEGAFYIWASKEVQDILGEHAALFEEHYYIKQSGNCDLSGMSDPHNEFKEKNVLIERKEL 430
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
S ASK GM +E Y ILGECRRKLF+ RS+RP+PHLDDKVIVSWNGL +SSFARASKIL
Sbjct: 431 SELASKYGMSVETYQEILGECRRKLFEARSRRPKPHLDDKVIVSWNGLAVSSFARASKIL 490
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
KSEAE F FPVVG++ KEYM +AE AA FIR+ LYD +T RL HSFR PSKAPGFLD
Sbjct: 491 KSEAEGTKFYFPVVGTEPKEYMRIAEKAAFFIRKELYDVETRRLYHSFRRSPSKAPGFLD 550
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFLISGLLDLYEFG G WL+WAIELQ TQD LFLD+ GGGYFN TGEDPSVLLRVKE
Sbjct: 551 DYAFLISGLLDLYEFGGGVSWLLWAIELQETQDSLFLDKAGGGYFNNTGEDPSVLLRVKE 610
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-------------------- 592
DHDGAEPSGNSVS INL+RLAS+V+GSK++ YR+NAEH L
Sbjct: 611 DHDGAEPSGNSVSAINLIRLASMVSGSKAENYRRNAEHLLVCKLLSLFPLKAFSSHICAN 670
Query: 593 --------AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
AVFE RLKDMAMAVPLMCCAADML VPSRK VV+VG ++S +FENML AAHA
Sbjct: 671 NGGMGLFEAVFEKRLKDMAMAVPLMCCAADMLRVPSRKQVVVVGGRTSEEFENMLTAAHA 730
Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPI 704
YD N+TVIHIDP++ EEM+FWE +NSN + MA+NN++ +KVVALVCQNF+CSPP+TD
Sbjct: 731 LYDPNRTVIHIDPSNKEEMEFWEVNNSNVSLMAKNNYAVNKVVALVCQNFTCSPPLTDRS 790
Query: 705 SLENLLLEKPSS 716
SLE LL +KPSS
Sbjct: 791 SLEALLSKKPSS 802
>gi|319428671|gb|ADV56694.1| hypothetical protein [Phaseolus vulgaris]
Length = 804
Score = 1084 bits (2803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 544/732 (74%), Positives = 603/732 (82%), Gaps = 48/732 (6%)
Query: 26 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 85
H+ VESFED VAKLLNDWFVSIKVDREERPDVDK ALYGGGGWPLSVFLSPD
Sbjct: 78 HLSLVESFEDAAVAKLLNDWFVSIKVDREERPDVDK-------ALYGGGGWPLSVFLSPD 130
Query: 86 LKPLMGGTYFPPEDKYGRPGFKTILR-------------KVKDAWDKKRDMLAQSGAFAI 132
LKPLMGGTYFPP+DKYGRPGFKTILR KVK AWD KRDML +SGAFAI
Sbjct: 131 LKPLMGGTYFPPDDKYGRPGFKTILRFLFVYSSVPAFSRKVKQAWDSKRDMLIKSGAFAI 190
Query: 133 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
EQLSEA+S S++S+KLPD +P +ALRLC+EQLS YDS+FGGFGSAPKFPRPVEI +MLY
Sbjct: 191 EQLSEAMSISSTSDKLPDGVPADALRLCSEQLSGGYDSKFGGFGSAPKFPRPVEINLMLY 250
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
HSKKLE+TGK A+ QKMVLF+LQCMAKGGIHDH+GGGFHRYSVDE WHVPHFEKMLY
Sbjct: 251 HSKKLEETGKLDGANGSQKMVLFSLQCMAKGGIHDHIGGGFHRYSVDECWHVPHFEKMLY 310
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
DQGQLANVYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RK
Sbjct: 311 DQGQLANVYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARK 370
Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
KEGAFY+W SKEV+DILGEHA LF+EHYY+K +GNCDLS MSDPHNEFK KNVLIE +
Sbjct: 371 KEGAFYIWASKEVQDILGEHAALFEEHYYIKQSGNCDLSGMSDPHNEFKEKNVLIERKEL 430
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
S ASK GM +E Y ILGECRRKLF+ RS+RP+PHLDDKVIVSWNGL +SSFARASKIL
Sbjct: 431 SELASKYGMSVETYQEILGECRRKLFEARSRRPKPHLDDKVIVSWNGLAVSSFARASKIL 490
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
KSEAE F FPVVG++ KEYM +AE AA FIR+ LYD +T RL HSFR PSKAPGFLD
Sbjct: 491 KSEAEGTKFYFPVVGTEPKEYMRIAEKAAFFIRKELYDVETRRLYHSFRRSPSKAPGFLD 550
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFLISGLLDLYEFG G WL+WAIELQ TQD LFLD+ GGGYFN TGEDPSVLLRVKE
Sbjct: 551 DYAFLISGLLDLYEFGGGISWLLWAIELQETQDSLFLDKAGGGYFNNTGEDPSVLLRVKE 610
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-------------------- 592
DHDGAEPSGNSVS INL+RLAS+V+GSK++ Y++NAEH L
Sbjct: 611 DHDGAEPSGNSVSAINLIRLASMVSGSKAENYKRNAEHLLVCKLLVLFLLKAFSSHICAN 670
Query: 593 --------AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
AVFE RLKDMAMAVPLMCCAADML VPSRK VV+VG ++S +FENML AAHA
Sbjct: 671 NGGMGLFEAVFEKRLKDMAMAVPLMCCAADMLRVPSRKQVVVVGGRTSEEFENMLTAAHA 730
Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPI 704
YD N+TVIHIDP++ EEM+FWE +NSN + MA+NN++ +KVVALVCQNF+CSPP+TD
Sbjct: 731 LYDPNRTVIHIDPSNKEEMEFWEVNNSNVSLMAKNNYAVNKVVALVCQNFTCSPPLTDRS 790
Query: 705 SLENLLLEKPSS 716
SLE LL +KPSS
Sbjct: 791 SLEALLSKKPSS 802
>gi|186511491|ref|NP_001118924.1| uncharacterized protein [Arabidopsis thaliana]
gi|332656889|gb|AEE82289.1| uncharacterized protein [Arabidopsis thaliana]
Length = 685
Score = 1080 bits (2793), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 512/683 (74%), Positives = 588/683 (86%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
MEVESFEDE VAKLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLSVFLSPDLK
Sbjct: 1 MEVESFEDEEVAKLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLSVFLSPDLK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
PLMGGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+ALSAS ++K
Sbjct: 61 PLMGGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKALSASTGADK 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
L D + + A+ CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL+++GK+ EA
Sbjct: 121 LSDGISREAVSTCAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKESGKTSEAD 180
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
E + MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD FS+
Sbjct: 181 EEKSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFSI 240
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
TKDV YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+WTS E+++
Sbjct: 241 TKDVMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYIWTSDEIDE 300
Query: 328 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 387
+LGE+A LFKEHYY+K +GNCDLS SDPHNEF GKNVLIE N++SA ASK + +EKY
Sbjct: 301 VLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKFSLSVEKYQ 360
Query: 388 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 447
ILGECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES + FPVV
Sbjct: 361 EILGECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPESTKYYFPVVN 420
Query: 448 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 507
S ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLISGLLDLYE
Sbjct: 421 SQPEDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLISGLLDLYEN 480
Query: 508 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 567
G G +WL WAI+LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEPSGNSVS I
Sbjct: 481 GGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEPSGNSVSAI 540
Query: 568 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
NLVRLASIVAG K++ Y A LAVFE RL+++A+AVPLMCC+ADM+SVPSRK VVLV
Sbjct: 541 NLVRLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVPSRKQVVLV 600
Query: 628 GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVV 687
G KSS + NML+AAH+ YD NKTVIHIDP+ ++E++FWEEHNSN A MA+ N +++KVV
Sbjct: 601 GSKSSPELTNMLSAAHSVYDPNKTVIHIDPSSSDEIEFWEEHNSNVAEMAKKNRNSEKVV 660
Query: 688 ALVCQNFSCSPPVTDPISLENLL 710
ALVCQ+F+CSPPV D SL LL
Sbjct: 661 ALVCQHFTCSPPVFDSSSLTRLL 683
>gi|242059825|ref|XP_002459058.1| hypothetical protein SORBIDRAFT_03g045190 [Sorghum bicolor]
gi|241931033|gb|EES04178.1| hypothetical protein SORBIDRAFT_03g045190 [Sorghum bicolor]
Length = 821
Score = 1004 bits (2596), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/691 (70%), Positives = 571/691 (82%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFE+E VAKLLNDWFVSIKVDREERPDVDKVYMTYV AL+GGGGWPLS
Sbjct: 121 STCHWCHVMEVESFENEEVAKLLNDWFVSIKVDREERPDVDKVYMTYVSALHGGGGWPLS 180
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVK+AW+ KR+ L +SG IEQL +AL
Sbjct: 181 VFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKEAWETKREALERSGNLVIEQLRDAL 240
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S ASS +P++L ++ C EQL+ YD +FGGFGSAPKFPRPVE +MLY +K +
Sbjct: 241 STKASSQDVPNDLAAVSVDQCVEQLASRYDPKFGGFGSAPKFPRPVEDYIMLYKFRKHME 300
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
GK EA +KMV TL CMA+GG+HDHVGGGFHRYSVDE WH+PHFEKMLYDQGQ+ N
Sbjct: 301 AGKESEALNIKKMVTHTLDCMARGGVHDHVGGGFHRYSVDECWHIPHFEKMLYDQGQIVN 360
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYLD F +T D +YS + RDILDYLRRDMIG GEIFSAEDADSAE EGA RKKEGAFYV
Sbjct: 361 VYLDTFLITGDEYYSIVARDILDYLRRDMIGKEGEIFSAEDADSAEYEGAPRKKEGAFYV 420
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WTSKE+ED LGE+A LFK HYY+K +GNCDLS MSDPHNEF KNVLIE +S+ ASK
Sbjct: 421 WTSKEIEDTLGENAELFKNHYYVKSSGNCDLSPMSDPHNEFSCKNVLIERKPASSMASKC 480
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G L++Y ILG+CR+KLF VRSKRPRPHLDDKVIVSWNGL IS+FARAS+ILKS
Sbjct: 481 GKSLDEYSQILGDCRQKLFHVRSKRPRPHLDDKVIVSWNGLAISAFARASQILKSGPSGT 540
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+FNFPV G + EY+EVAE+AA+FI+ LYD + RL HS+RNGPSKAPGFLDDYAFLIS
Sbjct: 541 LFNFPVTGCNPVEYLEVAENAANFIKEKLYDASSKRLHHSYRNGPSKAPGFLDDYAFLIS 600
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLDLYEFG T+WL+WA++LQ TQD+LFLD++GGGYFNT GEDPSVLLRVKED+DGAEP
Sbjct: 601 GLLDLYEFGGKTEWLLWAVQLQVTQDDLFLDKQGGGYFNTPGEDPSVLLRVKEDYDGAEP 660
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSV+ INL+RL+SI SKS Y+ + EH LAVFETRL+ +++A+PLMCCAADMLSVP
Sbjct: 661 SGNSVAAINLIRLSSIFDVSKSTGYKSSVEHLLAVFETRLRQLSIALPLMCCAADMLSVP 720
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VVLVG K S +F++M+AA + YD N+TVI IDP +TEEM+FW+ +N++ A MAR+
Sbjct: 721 SRKQVVLVGQKGSEEFQDMVAATFSLYDPNRTVIQIDPRNTEEMEFWDCNNADIAQMARS 780
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + VA VCQ+F CSPPVT P +L LL
Sbjct: 781 SPLGEPAVAHVCQDFKCSPPVTSPGALRELL 811
>gi|357131648|ref|XP_003567448.1| PREDICTED: spermatogenesis-associated protein 20-like [Brachypodium
distachyon]
Length = 814
Score = 994 bits (2570), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 481/691 (69%), Positives = 568/691 (82%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVMEVESFE+E VAK+LNDWFVSIKVDREERPDVDKVYMTYV ALYGGGGWPLS
Sbjct: 113 STCHWCHVMEVESFENEEVAKILNDWFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLS 172
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSP+LKPLMGGTYFPP+DKYGRPGFKT+LR+VK+AW+ KRD L Q+G IEQL +AL
Sbjct: 173 VFLSPNLKPLMGGTYFPPDDKYGRPGFKTVLRRVKEAWETKRDALEQAGNVVIEQLRDAL 232
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SA A+S +P+++ + C E+L+ +YD +FGGFGSAPKFPRPVE +MLY +K +
Sbjct: 233 SAKATSQDVPNDVAVVYVDTCVEKLASNYDPKFGGFGSAPKFPRPVEDCIMLYKFRKHME 292
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ E KMV TLQCMA+GG+HDHVGGGFHRYSVDE WHVPHFEKMLYDQGQ+AN
Sbjct: 293 ARRESEGQNILKMVTHTLQCMARGGVHDHVGGGFHRYSVDECWHVPHFEKMLYDQGQIAN 352
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYLD F +T D YS + RDILDYLRRDMIG GEIFSAEDADS+E EGA RKKEG+FYV
Sbjct: 353 VYLDTFLITGDECYSSVARDILDYLRRDMIGEEGEIFSAEDADSSEYEGAPRKKEGSFYV 412
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WTSKE+ED LGE A LFK HYY+K +GNCDLS MSDPHNEF GKNVLIE S ASK
Sbjct: 413 WTSKEIEDTLGEDAELFKNHYYVKSSGNCDLSGMSDPHNEFSGKNVLIERKPGSLVASKS 472
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G +++Y ILG+CR+KLFDVRSKRPRPHLDDKVIVSWNGL IS+FARAS+ILKS +
Sbjct: 473 GKSVDEYSQILGDCRQKLFDVRSKRPRPHLDDKVIVSWNGLAISAFARASQILKSGSIGT 532
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
F FPV G EY++VAE AA+FI++ LYD + RL HS+RNGP+KAPGFLDDYAFLI+
Sbjct: 533 RFYFPVTGCHPIEYLQVAEKAATFIKQKLYDASSKRLHHSYRNGPAKAPGFLDDYAFLIN 592
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLLD+YE+G T+WL+WA++LQ QD+LFLDR+GGGYFNT GEDPSVLLRVKED+DGAEP
Sbjct: 593 GLLDIYEYGGKTEWLLWAVQLQVIQDQLFLDRQGGGYFNTPGEDPSVLLRVKEDYDGAEP 652
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNS++ INL+RL+SI +KS+ Y++N EH LAVFETRL+++ +A+PLMCCAADMLSVP
Sbjct: 653 SGNSMAAINLIRLSSIFDAAKSEGYKRNVEHLLAVFETRLRELGIALPLMCCAADMLSVP 712
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
SRK VVLVG K S +F++M+AA +SYD N+TVI IDP +TEEM FWE +N+N A MAR+
Sbjct: 713 SRKQVVLVGDKGSTEFQDMVAATFSSYDPNRTVIQIDPRNTEEMGFWESNNANIAQMARS 772
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ VVA VCQ+F CSPPVT P +L LL
Sbjct: 773 SPPEKLVVAHVCQDFKCSPPVTSPGALRELL 803
>gi|222619828|gb|EEE55960.1| hypothetical protein OsJ_04681 [Oryza sativa Japonica Group]
Length = 791
Score = 958 bits (2476), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/710 (66%), Positives = 563/710 (79%), Gaps = 24/710 (3%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CHVMEVESFE++ +AK+LND FVSIKVDREERPDVDKVYMTYV ALYGGGGWPLSVFLSP
Sbjct: 69 CHVMEVESFENDEIAKILNDGFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLSVFLSP 128
Query: 85 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 144
+LKPLMGGTYFPP+DKYGR GFKTILRKVK+AW+ KRD L ++G I+QL +ALSA AS
Sbjct: 129 NLKPLMGGTYFPPDDKYGRTGFKTILRKVKEAWETKRDALEKTGNVVIKQLRDALSAKAS 188
Query: 145 SNKLPDELPQNALRLCAE------------------------QLSKSYDSRFGGFGSAPK 180
S +P++L ++ C E QL+ SYD +FGG+GSAPK
Sbjct: 189 SQDMPNDLAVVSVDNCVEKTRFKNRDKNNIRSSIADSQLISMQLAGSYDPKFGGYGSAPK 248
Query: 181 FPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
FPRPVE +MLY +K ++G+ E+ KM+ TLQCMA+GG+HDHVGGGFHRYSVDE
Sbjct: 249 FPRPVENCVMLYKFRKHLESGQVSESQNIMKMITHTLQCMARGGVHDHVGGGFHRYSVDE 308
Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAED 300
WHVPHFEKMLYDQGQ+ANVYLD F +T D +YS + RDILDYLRRDMIG GEI+SAED
Sbjct: 309 CWHVPHFEKMLYDQGQIANVYLDTFLITGDEYYSSVARDILDYLRRDMIGEEGEIYSAED 368
Query: 301 ADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEF 360
ADSAE +GA RK+EGAFYVWT+KE+ED LGE++ LFK HYY+K +GNCDLSRMSDPH+EF
Sbjct: 369 ADSAEYDGAPRKREGAFYVWTNKEIEDTLGENSELFKNHYYVKSSGNCDLSRMSDPHDEF 428
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
KGKNVLIE +S ASK G +++Y ILG+CR KLFDVRSKRPRPHLDDKVIVSWNGL
Sbjct: 429 KGKNVLIERKQASLMASKCGKSVDEYAQILGDCRHKLFDVRSKRPRPHLDDKVIVSWNGL 488
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
IS+FARAS+ILKSE F FP+ G + +EY+ VAE AA FI+ LYD ++RL HS+
Sbjct: 489 AISAFARASQILKSEPTGTRFCFPITGCNPEEYLGVAEKAARFIKEKLYDSSSNRLNHSY 548
Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
RNGP+KAPGFLDDYAFLI+GLLDLYE+G +WL+WA LQ QDELFLD++GGGYFNT
Sbjct: 549 RNGPAKAPGFLDDYAFLINGLLDLYEYGGKIEWLMWAAHLQVIQDELFLDKQGGGYFNTP 608
Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
GEDPSVLLRVKED+DGAEPSGNSV+ INL+RL+SI +KSD Y+ N EH LAVF+TRL+
Sbjct: 609 GEDPSVLLRVKEDYDGAEPSGNSVAAINLIRLSSIFDAAKSDGYKCNVEHLLAVFQTRLR 668
Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
++ +A+PLMCCAADMLSVPSRK VVLVG+K S +F +M+AAA ++YD N+TVI IDP +T
Sbjct: 669 ELGIALPLMCCAADMLSVPSRKQVVLVGNKESTEFRDMVAAAFSTYDPNRTVIQIDPRNT 728
Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
EEM FWE +N+ A MAR++ VA VCQ+F CSPPVT +L LL
Sbjct: 729 EEMGFWESNNAIIAQMARSSPPEKPAVAHVCQDFKCSPPVTSADALRVLL 778
>gi|218189686|gb|EEC72113.1| hypothetical protein OsI_05096 [Oryza sativa Indica Group]
Length = 806
Score = 947 bits (2448), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/725 (65%), Positives = 563/725 (77%), Gaps = 39/725 (5%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CHVMEVESFE++ +AK+LND FVSIKVDREERPDVDKVYMTYV ALYGGGGWPLSVFLSP
Sbjct: 69 CHVMEVESFENDEIAKILNDGFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLSVFLSP 128
Query: 85 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 144
+LKPLMGGTYFPP+DKYGRPGFKTILRKVK+AW+ K D L ++G I+QL +ALSA AS
Sbjct: 129 NLKPLMGGTYFPPDDKYGRPGFKTILRKVKEAWETKCDALEKTGNVVIKQLRDALSAKAS 188
Query: 145 SNKLPDELPQNALRLCAE------------------------QLSKSYDSRFGGFGSAPK 180
S +P++L ++ C E QL+ SYD +FGG+GSAPK
Sbjct: 189 SQDIPNDLAVVSVDNCVEKTRFKNRDKNNIRSSIADSQLISMQLAGSYDPKFGGYGSAPK 248
Query: 181 FPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
FPRPVE +MLY +K ++G+ E+ KM+ TLQCMA+GG+HDHVGGGFHRYSVDE
Sbjct: 249 FPRPVENCVMLYKFRKHLESGQVSESQNIMKMITHTLQCMARGGVHDHVGGGFHRYSVDE 308
Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAED 300
WHVPHFEKMLYDQGQ+ANVYLD F +T D +YS + RDILDYLRRDMIG GEI+SAED
Sbjct: 309 CWHVPHFEKMLYDQGQIANVYLDTFLITGDEYYSSVARDILDYLRRDMIGEEGEIYSAED 368
Query: 301 ADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEF 360
ADSAE +GA RK+EGAFYVWT+KE+ED LGE++ LFK HYY+K +GNCDLSRMSDPH+EF
Sbjct: 369 ADSAEYDGAPRKREGAFYVWTNKEIEDTLGENSELFKNHYYVKSSGNCDLSRMSDPHDEF 428
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
KGKNVLIE +S ASK G +++Y ILG+CR KLFDVRSKRPRPHLDDKVIVSWNGL
Sbjct: 429 KGKNVLIERKQASLMASKCGKSVDEYAQILGDCRHKLFDVRSKRPRPHLDDKVIVSWNGL 488
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSD---------------RKEYMEVAESAASFIR 465
IS+FARAS+ILKSE F FP+ G + +EY+ VAE AA FI+
Sbjct: 489 AISAFARASQILKSEPTGTRFCFPITGCNFSLVKQSLGCACPYMPEEYLGVAEKAARFIK 548
Query: 466 RHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQD 525
LYD ++RL HS+RNGP+KAPGFLDDYAFLI+GLLDLYE+G +WL+WA LQ QD
Sbjct: 549 EKLYDSSSNRLNHSYRNGPAKAPGFLDDYAFLINGLLDLYEYGGKIEWLMWAAHLQVIQD 608
Query: 526 ELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
ELFLD++GGGYFNT GEDPSVLLRVKED+DGAEPSGNSV+ INL+RL+SI +KSD Y+
Sbjct: 609 ELFLDKQGGGYFNTPGEDPSVLLRVKEDYDGAEPSGNSVAAINLIRLSSIFDAAKSDGYK 668
Query: 586 QNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHAS 645
N EH LAVF+TRL+++ +A+PLMCCAADMLSVPSRK VVLVG+K S +F +M+AAA ++
Sbjct: 669 CNVEHLLAVFQTRLRELGIALPLMCCAADMLSVPSRKQVVLVGNKESTEFRDMVAAAFST 728
Query: 646 YDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPIS 705
YD N+TVI IDP +TEEM FWE +N+ A MAR++ VA VCQ+F CSPPVT +
Sbjct: 729 YDPNRTVIQIDPRNTEEMGFWESNNAIIAQMARSSPPEKPAVAHVCQDFKCSPPVTSADA 788
Query: 706 LENLL 710
L LL
Sbjct: 789 LRVLL 793
>gi|168008753|ref|XP_001757071.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691942|gb|EDQ78302.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 772
Score = 891 bits (2303), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/714 (58%), Positives = 540/714 (75%), Gaps = 7/714 (0%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVMEVESFE+E +AKL N+WFV+IKVDREERPD
Sbjct: 42 GEEAFAKAREEDKPIFLSVGYSTCHWCHVMEVESFENEEIAKLQNEWFVNIKVDREERPD 101
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMTYVQA GGGGWP+SVFL+P+LKP++GGTYFPP+DKYGRPGFKT+L++V++ W+
Sbjct: 102 VDKVYMTYVQASQGGGGWPMSVFLTPELKPIVGGTYFPPDDKYGRPGFKTVLKRVREVWE 161
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGS 177
K+D+L +SG ++QL+EA +A A S +L + +P A+ LCA QLSK +DS+ GGFG
Sbjct: 162 SKKDVLRESGKQVVQQLAEATAAVAPSTELTESSVPAQAVTLCANQLSKGFDSKLGGFGG 221
Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
APKFPRPVE+ +M+ + K+LE GK A++ +M LF+LQCMA GG+HDHVGGGFHRYS
Sbjct: 222 APKFPRPVEVALMMRNYKRLEQQGKEQYATKALEMALFSLQCMANGGMHDHVGGGFHRYS 281
Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
VDE WHVPHFEKMLYD QL NVYLDAF+++KD+ YSY+ RD+LDYL RDM P G I+S
Sbjct: 282 VDEYWHVPHFEKMLYDNAQLVNVYLDAFAVSKDLTYSYVARDVLDYLIRDMTHPEGGIYS 341
Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDP 356
AEDADSAET +T+KKEG FY+WT +E+E++LG E A +F +YY+K GNCDLSRMSDP
Sbjct: 342 AEDADSAETTSSTKKKEGLFYIWTLQEIEEVLGKEQAQMFIAYYYVKAEGNCDLSRMSDP 401
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
H EF GKNVLI+ ++ A+K G E LG+CR KL RS+RP PHLDDKVIV+
Sbjct: 402 HGEFGGKNVLIKRSNVDI-ATKFGKMPEDVSQYLGQCRAKLHAYRSQRPHPHLDDKVIVA 460
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL IS+FARAS+IL +E + FPV G KEY+ VAE AA FI+ LY+E+T RL
Sbjct: 461 WNGLAISAFARASRILLNEPSGVRYEFPVTGCHPKEYLVVAERAAHFIKSKLYNEKTKRL 520
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
S+RNGPSKAPGFLDDYAFLI+GLLDL+E G KWL WA+ELQ++QDE FLD+EGG Y
Sbjct: 521 TRSYRNGPSKAPGFLDDYAFLIAGLLDLFECGGDYKWLQWALELQSSQDEQFLDKEGGAY 580
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
+ T DPS+L R+KED+DGAEPSGNSV+ INL+RL+S+V G ++ AEH LAV+E
Sbjct: 581 YITPEGDPSILFRMKEDYDGAEPSGNSVAAINLLRLSSLVTGDLAESVHTTAEHLLAVYE 640
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
R+K++AMAVPL+CCA D SV +++ +++ G ++S D + ++ A HA +D ++ VI ID
Sbjct: 641 QRVKEVAMAVPLLCCAFDSFSVAAKRQIIIAGVRNSPDTDALMTACHAPFDPDRNVILID 700
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ EE DFW+ NS +MAR + +A VCQNF+C P D ++LE LL
Sbjct: 701 ESNPEERDFWQSVNSTALAMARKAQDG-RALAYVCQNFTCQAPTGDHVALEQLL 753
>gi|4262148|gb|AAD14448.1| predicted protein of unknown function [Arabidopsis thaliana]
gi|7270190|emb|CAB77805.1| predicted protein of unknown function [Arabidopsis thaliana]
Length = 794
Score = 872 bits (2253), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/660 (65%), Positives = 499/660 (75%), Gaps = 73/660 (11%)
Query: 51 VDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTIL 110
VDREERPDVDK ALYGGGGWPLSVFLSPDLKPLMGGTYFPP D YGRPGFKT+L
Sbjct: 206 VDREERPDVDK-------ALYGGGGWPLSVFLSPDLKPLMGGTYFPPNDNYGRPGFKTLL 258
Query: 111 RKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDS 170
+KVKDAW+ KRD L +SG +AIE+LS+ALSAS ++KL D + + AL+
Sbjct: 259 KKVKDAWNSKRDTLVKSGTYAIEELSKALSASTGADKLSDGISREALK------------ 306
Query: 171 RFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
++GK+ EA E + MVLF+LQ MA GG+HDH+G
Sbjct: 307 ----------------------------ESGKTSEADEEKSMVLFSLQGMANGGMHDHIG 338
Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
GGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD FS+TKDV YSY+ RDILDYLRRDMI
Sbjct: 339 GGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFSITKDVMYSYVARDILDYLRRDMIA 398
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDL 350
P G IFSAEDADS E EGA RKKEGAFY+WTS E++++LGE+A LFKEHYY+K +GNCDL
Sbjct: 399 PEGGIFSAEDADSFEFEGAKRKKEGAFYIWTSDEIDEVLGENADLFKEHYYVKKSGNCDL 458
Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
S SDPHNEF GKNVLIE N++SA ASK + +EKY ILGECRRKLFDVR KRP+PHLD
Sbjct: 459 SSRSDPHNEFAGKNVLIERNETSAMASKFSLSVEKYQEILGECRRKLFDVRLKRPKPHLD 518
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
DK+IVSWNGLVISSFARASKILK+E ES + FPVV S ++Y+EVAE AA FIR +LYD
Sbjct: 519 DKIIVSWNGLVISSFARASKILKAEPESTKYYFPVVNSQPEDYIEVAEKAALFIRGNLYD 578
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
EQ+ RLQHS+R GPSKAP FLDDYAFLISGLLDLYE G G +WL WAI+LQ TQ
Sbjct: 579 EQSRRLQHSYRQGPSKAPAFLDDYAFLISGLLDLYENGGGIEWLKWAIKLQETQ------ 632
Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
+DHDGAEPSGNSVS INLVRLASIVAG K++ Y A
Sbjct: 633 --------------------AKDHDGAEPSGNSVSAINLVRLASIVAGEKAESYLNTAHR 672
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
LAVFE RL+++A+AVPLMCC+ADM+SVPSRK VVLVG KSS + NML+AAH+ YD NK
Sbjct: 673 LLAVFELRLRELAVAVPLMCCSADMISVPSRKQVVLVGSKSSPELTNMLSAAHSVYDPNK 732
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
TVIHIDP+ ++E++FWEEHNSN A MA+ N +++KVVALVCQ+F+CSPPV D SL LL
Sbjct: 733 TVIHIDPSSSDEIEFWEEHNSNVAEMAKKNRNSEKVVALVCQHFTCSPPVFDSSSLTRLL 792
>gi|302824870|ref|XP_002994074.1| hypothetical protein SELMODRAFT_163314 [Selaginella moellendorffii]
gi|300138080|gb|EFJ04861.1| hypothetical protein SELMODRAFT_163314 [Selaginella moellendorffii]
Length = 769
Score = 868 bits (2243), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/721 (56%), Positives = 536/721 (74%), Gaps = 4/721 (0%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVMEVESFE E VAKLLNDWFVSIKVDREERPD
Sbjct: 47 GEEAFAKAKAEDKPIFLSVGYSTCHWCHVMEVESFESEEVAKLLNDWFVSIKVDREERPD 106
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YMT+VQA GGGGWP+SVFL+P+LKP++GGTYFPPED YGRPGFKT+LR+VK+ WD
Sbjct: 107 VDKIYMTFVQASQGGGGWPMSVFLTPELKPIVGGTYFPPEDNYGRPGFKTVLRRVKENWD 166
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++ +L +G I+QL+EA++A A+S ++ + + A++LCA QL K +D++ GGFGSA
Sbjct: 167 SRKAVLRNAGDNVIQQLAEAMAACATSLQVSGGVAEQAVQLCASQLMKGFDAKLGGFGSA 226
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFPRPVE+ +ML + K+L+ GK+ + + +M F LQCMA+GG+HDHVGGGFHRYSV
Sbjct: 227 PKFPRPVELNLMLRYYKRLDQAGKASLSKKALEMASFNLQCMARGGMHDHVGGGFHRYSV 286
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+ WHVPHFEKMLYDQ QLAN YLD + +T+D ++ + RDILDYL RDM P G IFSA
Sbjct: 287 DDYWHVPHFEKMLYDQAQLANAYLDVYLVTRDTMHACVARDILDYLNRDMTHPEGGIFSA 346
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS E G+++KKEGAFYVWT+KE+ED+LG + A +F HYY++ GNC+LSRMSDPH
Sbjct: 347 EDADSLEPSGSSKKKEGAFYVWTAKEIEDVLGKDRAQIFAAHYYVREQGNCNLSRMSDPH 406
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
NEF GKNVLIE + + +K G +E+ ++LG+CR L RSKRPRPHLDDKVIV+W
Sbjct: 407 NEFLGKNVLIERQSLADTVAKFGKTVEETADLLGQCRELLHAHRSKRPRPHLDDKVIVAW 466
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL IS+++RAS+ L++E E FP +G D K+Y+ VAE A F++ +Y+ RLQ
Sbjct: 467 NGLAISAYSRASRFLRAEPEGLKHYFPDMGCDPKDYLIVAERIAKFVKDKIYNASAKRLQ 526
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
S+R PS+APGFLDDYAFLI+GLLDLYE TKWL W ELQ QD LFLD+EGGGYF
Sbjct: 527 RSYRKSPSQAPGFLDDYAFLIAGLLDLYEASGDTKWLAWVFELQEVQDHLFLDKEGGGYF 586
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+T D S+L R+KED+DGAEPSGNSV+ INL+RLASI G + + + A+H LAVFE
Sbjct: 587 STAEGDSSILFRMKEDYDGAEPSGNSVAAINLLRLASICHGEEGKLFLERAQHLLAVFEG 646
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
++K++AMAVPLMCCA D+L+VPS++ +++ G K+S +F+ ++ +H +D + T+I IDP
Sbjct: 647 KVKELAMAVPLMCCAYDVLAVPSKRQILVAGAKTSGEFDALVTTSHLFFDPDSTIIQIDP 706
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
+++FW+ N +MA+ K VA VCQ+F C PV+D +LE LL + S
Sbjct: 707 ELPSDVEFWQAKNPMLLAMAQGKAPKSKAVAFVCQDFKCYAPVSDAAALERLLNKNKSKV 766
Query: 718 A 718
A
Sbjct: 767 A 767
>gi|326515716|dbj|BAK07104.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 532
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/521 (68%), Positives = 419/521 (80%)
Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
MLY +K + G+ EA KMV TLQCMA+GG+HDHVGGGFHRYSVDE WHVPHFEK
Sbjct: 1 MLYKFRKHMEAGQKSEAENIMKMVTHTLQCMARGGVHDHVGGGFHRYSVDECWHVPHFEK 60
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
MLYDQGQ+AN YLD + +T D +YS + RDILDYLRRDMIG GEIFSAEDADSAE EG
Sbjct: 61 MLYDQGQIANAYLDTYVITGDEYYSSVARDILDYLRRDMIGEDGEIFSAEDADSAEYEGD 120
Query: 310 TRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
RKKEG+FYVWTS+E+ED LGE+A LFK HYY+K +GNCDLS MSDPHNEF GKNVLIE
Sbjct: 121 ARKKEGSFYVWTSQEIEDTLGENAELFKNHYYVKSSGNCDLSGMSDPHNEFSGKNVLIER 180
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
S ASK G +++Y ILGECR+KLFDVRSKRPRPHLDDKVIVSWNGL IS+FARAS
Sbjct: 181 KPGSLMASKYGKSVDEYYGILGECRQKLFDVRSKRPRPHLDDKVIVSWNGLAISAFARAS 240
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+ILKS F FPV G D EY++VAE AA+FI+ LYD + RL HS+RNGP+KAPG
Sbjct: 241 QILKSGPPGTKFYFPVTGCDPVEYLQVAEKAANFIKEKLYDAGSKRLHHSYRNGPAKAPG 300
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
FLDDYAFLI+GLLDL+E+G +WL+WAIELQ QDELFLD++GGGYFNT GEDPSVLLR
Sbjct: 301 FLDDYAFLINGLLDLFEYGGKMEWLLWAIELQVIQDELFLDKQGGGYFNTPGEDPSVLLR 360
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
VKED+DGAEPSGNS++ IN+VRL+SI+ +KS+ Y++N EH LAVFETRLK++ +A+PLM
Sbjct: 361 VKEDYDGAEPSGNSMAAINMVRLSSILDAAKSEGYKRNVEHLLAVFETRLKELGIALPLM 420
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
CCAADML+VPSRK VVLVG K+S +F++M+ AA SYD N+TVI ID + EEM FWE +
Sbjct: 421 CCAADMLTVPSRKQVVLVGDKASPEFQDMVVAAFLSYDPNRTVIQIDASKMEEMAFWESN 480
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N+N A MAR++ S VA VCQ F CSPPVT P +L LL
Sbjct: 481 NANIAQMARSSPSGKPAVAHVCQEFKCSPPVTSPGALRELL 521
>gi|384252567|gb|EIE26043.1| hypothetical protein COCSUDRAFT_52662 [Coccomyxa subellipsoidea
C-169]
Length = 796
Score = 688 bits (1776), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/713 (49%), Positives = 465/713 (65%), Gaps = 18/713 (2%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE E +AKL+ND FV+IKVD+EER DVD+VYMTYVQA GGGGWP+SV
Sbjct: 72 TCHWCHVMERESFESEAIAKLMNDSFVNIKVDKEERSDVDRVYMTYVQATSGGGGWPMSV 131
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PDL+P +GGTY+PP+D YGRPGF T+L+++ D W +++ + + A + QL+EA+
Sbjct: 132 FLTPDLQPFLGGTYYPPQDAYGRPGFSTVLKRIADVWRSRKNEVIEQSADTMRQLNEAIQ 191
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY-HSKKLED 199
+LP+ + C L+ +D GGFG+APKFPRP EI ++L H + +D
Sbjct: 192 PQGGKAELPEGAAGRFIESCYSMLASRFDPTLGGFGAAPKFPRPAEINLLLVEHLRASQD 251
Query: 200 -------TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
SG + M TLQ MA GG++DHVGGGFHRYSVDE WHVPHFEKMLY
Sbjct: 252 REASSATASSSGRRRDALGMAETTLQRMAAGGMYDHVGGGFHRYSVDEHWHVPHFEKMLY 311
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D GQLA YLDA+ T DV Y+ + R ILDYL RDM P G +SAEDADS + G +K
Sbjct: 312 DNGQLAQTYLDAYRATGDVRYARVARGILDYLHRDMTHPEGGFYSAEDADSLDASG--KK 369
Query: 313 KEGAFYVWTSKEVEDILG---EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
EGAFYVW++ E++++LG E +FK+HYY+K +GN DLS SD H EF G N LIE
Sbjct: 370 SEGAFYVWSADEIDEVLGTDSERGRVFKQHYYVKASGNTDLSPRSDQHGEFTGLNCLIER 429
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
A+A+K G+ +E+ L + R+ L + RS+RPRPHLDDKV+ +WNGL I +FA AS
Sbjct: 430 ESVKATATKFGLSVEETEGTLAKARQLLHERRSQRPRPHLDDKVVTAWNGLAIGAFANAS 489
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
++L +E + FPV G K+Y+ A AA F+R ++D RL+ SF GPS G
Sbjct: 490 RVLANEPQPPTPLFPVEGRPAKDYLTDAIRAAEFVRDKVWDADARRLRRSFCRGPSDVGG 549
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
F DDYAFL+SGLLDL+ +WL +A++LQ QDELF D GGYF+TTGEDPS+LLR
Sbjct: 550 FADDYAFLVSGLLDLHAASGDAQWLQFALQLQAAQDELFWDDAAGGYFSTTGEDPSILLR 609
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+KED+DGAEP+ +S++ NL+RLA++ S+ R A + A F RL +M++A+P M
Sbjct: 610 MKEDYDGAEPAPSSIAAANLLRLAALTDPDASEPLRARASAAAAAFRERLAEMSLAMPQM 669
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
CCA +L + V++ G + D E +L AA A + +K VI IDP+D ++FW H
Sbjct: 670 CCALHLLDSGHLRQVIIAGRLGAADTEALLDAAQAIFAPDKAVIFIDPSDEASVEFWRGH 729
Query: 670 NSNNASMARN-NFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE---KPSST 717
N +M AD A VCQNF+C P TDP L+ L E PS+T
Sbjct: 730 NPQALAMVEGAGLQADSSATAFVCQNFTCKAPTTDPQKLKAALGEARSAPSTT 782
>gi|302838582|ref|XP_002950849.1| hypothetical protein VOLCADRAFT_81232 [Volvox carteri f.
nagariensis]
gi|300263966|gb|EFJ48164.1| hypothetical protein VOLCADRAFT_81232 [Volvox carteri f.
nagariensis]
Length = 890
Score = 611 bits (1575), Expect = e-172, Method: Compositional matrix adjust.
Identities = 322/742 (43%), Positives = 433/742 (58%), Gaps = 55/742 (7%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE E VA+LLN F+SIKVDREERPDVD+VYMTYVQA+ G GGWP+SV
Sbjct: 76 TCHWCHVMERESFESEEVAELLNRDFISIKVDREERPDVDRVYMTYVQAVSGSGGWPMSV 135
Query: 81 FLSPDLKPLMGGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
+L+P L+P GGTY+PP+D++ PGF T+L ++ W R L A
Sbjct: 136 WLTPSLEPFYGGTYYPPKDRFVGGQLALPGFSTVLLRIGSLWRTNRQDLKSKVEAAAAPA 195
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+A+ + LP L A+ C L++ YD+ +GGFG APKFPRP EI ++L +
Sbjct: 196 GPTEAAANAGAALPPSLAAAAVDACGHDLARRYDAEYGGFGGAPKFPRPSEINLLLRAAV 255
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ + G A + M L +L MA GG++D +GGGFHRYSVDE WHVPHFEKMLYD
Sbjct: 256 RQMEQGDQLAAQRRRSMALHSLTAMASGGMYDQLGGGFHRYSVDELWHVPHFEKMLYDNP 315
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE---------- 305
QLA YL AF LT D Y+ + R +LDYL RDM PGG ++SAEDADS +
Sbjct: 316 QLALSYLAAFQLTADKQYALVARGVLDYLLRDMTSPGGGLYSAEDADSEDPHSYMTSTTT 375
Query: 306 --------TEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDP 356
E + +KEGAFY+W EV +LG E F Y + GNC+ S SDP
Sbjct: 376 AAAAAPAAMEAGSERKEGAFYIWDHSEVVSVLGPELGPFFCLVYGIDEEGNCNRSSRSDP 435
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPL----EKYLNILGECRRKLFDVRSKRPRPHLDDK 412
H EF+GKNV + +A++LG+P + L R L R+ RPRP LDDK
Sbjct: 436 HGEFEGKNVPYIATQPAVAAARLGLPYGDDAAEAARRLSAAREALHAARASRPRPSLDDK 495
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
++ +WNG+ I +FA AS++L SE + FP G Y++ A A+F+R HL+D
Sbjct: 496 IVTAWNGMGIGAFAVASRVLASEQQVERL-FPSEGRAPAAYLDAAVRVAAFVREHLWDPA 554
Query: 473 ----THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
RL+ S+ GPS GF DDY+ L+SGLLDLYE G G +WL WA++LQ QD+LF
Sbjct: 555 AGGGVGRLRRSYCKGPSAVAGFADDYSALVSGLLDLYECGGGREWLEWALQLQAVQDQLF 614
Query: 529 LDREGGGYFNT-----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV------- 576
D + GGYF+T DPS+ +R+K+D+DGAEP+ +SV+ NL+RLA ++
Sbjct: 615 WDPQSGGYFSTPDPASADADPSIRIRIKDDYDGAEPTASSVAASNLLRLADMIQERPLYD 674
Query: 577 --AGSKSDY---YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 631
A + + + Y + A +LA F R+ +AVP MCCAA S + V++ G
Sbjct: 675 TTASTTTGHAMPYDEAARRTLAAFSARITQAPLAVPQMCCAAHTFSKRPLRQVIVAGTAG 734
Query: 632 SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVC 691
+ D +L A H+ Y +K V+ +DP+D +M FW +HN M V +C
Sbjct: 735 ATDTGALLDAVHSPYCPDKVVLVMDPSDPRDMAFWRKHNPPAYDMV-----TQPAVVFIC 789
Query: 692 QNFSCSPPVTDPISLENLLLEK 713
QNF+C P TDP + LL ++
Sbjct: 790 QNFTCQAPTTDPARVRQLLAQR 811
>gi|260801315|ref|XP_002595541.1| hypothetical protein BRAFLDRAFT_56926 [Branchiostoma floridae]
gi|229280788|gb|EEN51553.1| hypothetical protein BRAFLDRAFT_56926 [Branchiostoma floridae]
Length = 741
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 328/731 (44%), Positives = 438/731 (59%), Gaps = 56/731 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVME ESFE E V K++N+ FV++KVDREERPD
Sbjct: 43 GEDAFKKAKKENKPIFLSVGYSTCHWCHVMERESFESEEVGKIMNEHFVNVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM+++QA GGGGWP+SV+L+PDLKP+ GGTYFPP+D GRPGF TIL ++ + W
Sbjct: 103 VDKVYMSFIQATSGGGGWPMSVWLTPDLKPIAGGTYFPPKDHMGRPGFSTILTRISEQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSE-ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
+D L Q G I+ L E ++SA S+ LP Q +++ C +QL SYD FGGFG
Sbjct: 163 NNKDKLIQQGNMVIDALKELSVSAVDSTATLPG---QESVKKCLDQLDNSYDEEFGGFGH 219
Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
APKFP+PV + ++ T EA M L TL+ MAKGG++DH+G GFHRYS
Sbjct: 220 APKFPQPVNFNFLFRVWSSMKGT---PEAQRALDMALETLRFMAKGGMYDHIGQGFHRYS 276
Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
D WHVPHFEKMLYDQGQLA Y DA+ +TKD ++ I RDIL Y+ RD+ G +S
Sbjct: 277 TDRTWHVPHFEKMLYDQGQLAVAYCDAYQITKDPIFADIARDILLYVSRDLSDRQGGFYS 336
Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNC 348
AEDADS G KKEGAF VW + E+ ++LGE A LF +HY + +GN
Sbjct: 337 AEDADSLPNPGHKTKKEGAFCVWEADEIRNLLGEKLPHYDDMTFADLFAKHYNINRSGNV 396
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
+ DPH E GKNVLI +A G+ + +LG+CR LF VR KRP PH
Sbjct: 397 AFDQ--DPHGELAGKNVLIVRGSVENTAKAFGLEAAQVEEVLGKCRDILFKVRRKRPPPH 454
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
DDK+I +WNGL+IS FARA+++L EA +Y++ A AA F+R+ +
Sbjct: 455 RDDKMITAWNGLMISGFARAAQVL-GEA---------------QYLDRAVKAAKFVRKKM 498
Query: 469 YDEQTHRLQHSFRNGP---------SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 519
YD+ T +L S + P + GF DDYAFLI GLLDLYE +W+ WA +
Sbjct: 499 YDDSTGKLLRSCYHDPEMDRVTQIANPIDGFADDYAFLIRGLLDLYEASYNEEWVEWAAQ 558
Query: 520 LQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
LQ QDELF D EG YF +G DPSVL+R+KED DGAEPS NSVS NL+RLAS
Sbjct: 559 LQRKQDELFWDSEGLAYFTVSGADPSVLIRMKEDQDGAEPSANSVSAGNLLRLASF---H 615
Query: 580 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 639
+ +R + + F RL + +A+P M A + + K +++ G+ D + +L
Sbjct: 616 DDEGWRNKSVQLMTAFGARLAAIPLALPEMVSAL-IFYQQTPKQIIIAGNPRDRDTKALL 674
Query: 640 AAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPP 699
H+S++ NK +I AD +E + E +++ + + K A VC+N++CS P
Sbjct: 675 QCVHSSFNPNKILI---IADGKEHGYLYEKLKVLSTLKKVD---GKATAYVCENYACSLP 728
Query: 700 VTDPISLENLL 710
V + L+ LL
Sbjct: 729 VNTVLELDELL 739
>gi|390355802|ref|XP_003728630.1| PREDICTED: spermatogenesis-associated protein 20
[Strongylocentrotus purpuratus]
Length = 671
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 316/702 (45%), Positives = 421/702 (59%), Gaps = 52/702 (7%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+ + KL+N+ +VSIKVDREERPDVD+VYMT++QA GGGGWP+SV+L+PDLK
Sbjct: 1 MERESFENVDIGKLMNEHYVSIKVDREERPDVDRVYMTFIQATAGGGGWPMSVWLTPDLK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
PLMGGTYFPP D++GRPGF TIL+ + W + R+ L Q IE L A+ ++S+
Sbjct: 61 PLMGGTYFPPHDRFGRPGFPTILQSIARQWGENREALEQQSTKIIEALQAAVKVKSTSD- 119
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGE 205
P L + C +QL+ S+D+++GGFG APKFP+PV + LY S G+S
Sbjct: 120 -PSPLGTEVMEKCFKQLTDSFDNQYGGFGGAPKFPQPVNFNFLFRLYSSPP----GESEI 174
Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
G KM L TL+ MAKGGIHDHV GFHRYS D WHVPHFEKMLYDQGQLA YLDA+
Sbjct: 175 GERGLKMCLHTLKMMAKGGIHDHVSQGFHRYSTDRFWHVPHFEKMLYDQGQLAVAYLDAY 234
Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
+TK+ ++ + RDIL+Y+ RD+ G +SAEDADS T KKEGAF VWT EV
Sbjct: 235 QITKEAVFADVARDILEYVGRDLSDKAGGFYSAEDADSLPAADETHKKEGAFCVWTDTEV 294
Query: 326 EDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
L + A +F +HY +K GN D + DPH E K +NVLI ++A
Sbjct: 295 RTHLSDMVEGSDSVTLADVFCKHYDIKTGGNVDFEQ--DPHGELKDQNVLIARGSVDSTA 352
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
S LG+ L RR L +VR +RPRPHLDDK++ +WNGL+IS F+RA ++L++
Sbjct: 353 SMLGLTEGTVEAALETARRTLHEVRLERPRPHLDDKMLTAWNGLMISGFSRAGQVLQA-- 410
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH-RLQHSFRNG-------PSKAP 488
E+ + AE A +FIR+HLYD T L+ ++RN P
Sbjct: 411 --------------PEFTQRAEQAVTFIRQHLYDPSTGCLLRSAYRNKEGDIAQIPIPIQ 456
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
GF+DDY FLI GLLDLYE +W+ WA +LQ DEL D E GGYF+TT +D S+LL
Sbjct: 457 GFVDDYCFLIRGLLDLYEANYDEQWIEWASQLQEKLDELLWDTENGGYFSTTDKDSSILL 516
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R+KED DGAEPS NSV+ +NL+RL+ + ++ D Y++ A +VF RL+ + +A+P
Sbjct: 517 RLKEDQDGAEPSANSVACMNLLRLSHYL--NRPD-YQEKASKLFSVFGERLQKIPIALPE 573
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
M A + + K +++ G + D +L H Y NK +I D T F
Sbjct: 574 MASAL-LFQESTAKQIIICGDPQAEDTRLLLQCVHTHYLPNKVLILTDEGQTS--GFLSS 630
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ R + K A VC+N+ C PV L +LL
Sbjct: 631 RLDILKTLQRID---GKATAYVCENYQCQLPVNSVDDLSDLL 669
>gi|270011341|gb|EFA07789.1| hypothetical protein TcasGA2_TC005347 [Tribolium castaneum]
Length = 804
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 316/728 (43%), Positives = 426/728 (58%), Gaps = 52/728 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCHVME ESFEDE VAK++N F+++KVDREERPD
Sbjct: 100 GQEAFDRAKKENKLIFLSVGYSTCHWCHVMEKESFEDEEVAKIMNQHFINVKVDREERPD 159
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YM ++QA GGGGWP+SVFL+P L+PL GGTYFPPEDKYGRPGFKT+L+ + + W
Sbjct: 160 VDKLYMAFIQASVGGGGWPMSVFLTPTLEPLAGGTYFPPEDKYGRPGFKTVLKSIAEQWR 219
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K+ +A SG +++E L + S+ + + ++ + C QLS SY+ FGGF +
Sbjct: 220 TKQSAIANSGKYSLEVLRKVSEREISAKQDINVPGEDVWKKCLLQLSHSYEDDFGGFSAQ 279
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP+P + + + + S + M L TL+ MA GGIHDHV GF RYSV
Sbjct: 280 PKFPQPCNLNFLFHMYSR---DKHSEQGFRCLHMCLNTLRKMAYGGIHDHVNCGFARYSV 336
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+RWHVPHFEKMLYDQ QLA Y DAF +TKD F++ + RDIL Y+ RD+ P G + A
Sbjct: 337 DDRWHVPHFEKMLYDQAQLAVSYADAFVVTKDDFFAEVLRDILLYVSRDLSHPLGGFYGA 396
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-------HAILFKEHYYLKPTGNCDLS 351
EDADS EGA+ K+EGAF VW +E+ +LGE H LF HY +K GN + +
Sbjct: 397 EDADSYPYEGASHKREGAFCVWEFEEISKLLGETKTDDISHRDLFIYHYNVKEDGNVNPA 456
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
+ DPH+E + KN+L+ ++ K +E IL C L+ R KRP+PH+D
Sbjct: 457 Q--DPHHELEKKNILVCFGSFEDTSRKFKTSVETVKEILKSCHEILYKERQKRPKPHVDT 514
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL+IS FA+A +LK + EY+ A AA+FI++ LY+E
Sbjct: 515 KIVTSWNGLMISGFAKAGFVLKDQ----------------EYINRAILAATFIKKFLYNE 558
Query: 472 QTHRLQHSFRNG--------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
Q L G P+ GFLDDYAFLI GLLDLYE WL WA LQ
Sbjct: 559 QDKTLLRCCYKGDNAKIVQTPTPVNGFLDDYAFLIRGLLDLYEASLDADWLSWAEVLQEQ 618
Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
QD LF D +G GYF + D S+L+R KED DGAEP GNS++V NL+RLA+ + ++D
Sbjct: 619 QDRLFWDTKGSGYFTSPANDSSILIRGKEDQDGAEPCGNSIAVHNLIRLAAYL--DRAD- 675
Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
R A +L VF RLK + +A+P M A + S V + G + + ++
Sbjct: 676 LRAKAGRTLTVFADRLKSIPVALPEMTSAL-LFYHNSPTQVFIAGPTEDNNTQALIDVVR 734
Query: 644 ASYDLNKTVIHID-PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + + + D P + H S+AR K A VC+NF+CS PVT+
Sbjct: 735 SRFIPGRILAVTDGPGGL----LYRRHE----SLARLRPIQGKPAAYVCRNFACSLPVTE 786
Query: 703 PISLENLL 710
P L + L
Sbjct: 787 PEELASNL 794
>gi|348502030|ref|XP_003438572.1| PREDICTED: spermatogenesis-associated protein 20 [Oreochromis
niloticus]
Length = 748
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 311/717 (43%), Positives = 428/717 (59%), Gaps = 52/717 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE + K+L++ FV IK+DREERPDVDKVYMT+VQA GGGGWP+S
Sbjct: 62 STCHWCHVMERESFEDEEIGKILSENFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMS 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+L+P +GGTYFPP D+ GRPGFKT+L ++ D W R L SG IE L +
Sbjct: 122 VWLTPELRPFIGGTYFPPRDRGGRPGFKTVLTRIIDQWQNNRPALESSGERIIEALKKGT 181
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ +A++ + P P A R C +QL+ S++ +GGF APKFP PV + ++ +
Sbjct: 182 TITANAGQSPPLAPDVANR-CFQQLAHSFEEEYGGFRDAPKFPSPVNLMFLISYWTVNRS 240
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T E E +M L TL+ MA GGIHDH+ GFHRYS D WHVPHFEKMLYDQ QLA
Sbjct: 241 T---SEGVEALQMALHTLRMMALGGIHDHIAQGFHRYSTDSSWHVPHFEKMLYDQAQLAV 297
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y+ A ++ + F++ + +D+L Y+ RD+ G +SAEDADS G K+EGAF V
Sbjct: 298 AYITASQVSGEQFFAEVAKDVLLYVSRDLSDKSGGFYSAEDADSVPALGGPEKREGAFCV 357
Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
WT+ EV ++L A +F HY +K GN ++ DPH E +G+NVLI
Sbjct: 358 WTASEVRELLPDVVEGAAGNATLADIFMHHYGVKEQGN--VAPEQDPHGELQGQNVLIVR 415
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
+A++ G+ +EK +L R K+ +VR RPRPHLD K++ SWNGL++S++AR
Sbjct: 416 YSVELTAARFGITVEKVNELLASARAKMAEVRKSRPRPHLDTKMLASWNGLMLSAYARVG 475
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------ 483
+L K+ +E A A F++ HL+D + + S G
Sbjct: 476 AVLGD----------------KDLVERAVKAGGFLKEHLWDAKRQTILRSCYRGDQMEVQ 519
Query: 484 ---PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
PS + GFLDDYAF+I GLLDLYE T+WL WA ELQ QD LF D +GGGYF +
Sbjct: 520 QISPSIS-GFLDDYAFIICGLLDLYEATLQTEWLQWAEELQLRQDVLFWDDQGGGYFCSD 578
Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
D +VLL++KED DGAEPS NSVS NL+RL+ + + Q ++ L F RL
Sbjct: 579 PTDSTVLLQLKEDQDGAEPSANSVSAFNLLRLSHYTGRQE---WLQKSQQLLTAFSDRLT 635
Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
+ +A+P M A M + K +V+ G + + D ++LAA ++ + L V+ + +T
Sbjct: 636 TVPIALPEMVRAL-MAQHYTLKQIVICGQRDAPDTTSLLAAVNSLF-LPYKVLMLADGNT 693
Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
E F + +SM++ A A VCQ+F+CS PVTDP L LLL+ + T
Sbjct: 694 E--SFLCQRLPVLSSMSQLRGVA---TAYVCQDFTCSLPVTDPQELRRLLLDGTTDT 745
>gi|363740931|ref|XP_420103.3| PREDICTED: spermatogenesis-associated protein 20 [Gallus gallus]
Length = 737
Score = 570 bits (1469), Expect = e-159, Method: Compositional matrix adjust.
Identities = 316/730 (43%), Positives = 430/730 (58%), Gaps = 53/730 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + + FL +TCHWCHVME ESF+++ + ++++ FV IKVDREERPD
Sbjct: 38 GQEAFDKAKRENKLIFLSVGYSTCHWCHVMEEESFKNQEIGEIMSKNFVCIKVDREERPD 97
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGGWP+SV+L+PDL+P +GGTYFPPED GF+T+L ++ + W
Sbjct: 98 VDKVYMTFVQATSGGGGWPMSVWLTPDLRPFVGGTYFPPEDSAHHVGFRTVLLRIAEQWR 157
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ ++ L QS +E L +LS + ++ Q L C +QLS SYD +GGF
Sbjct: 158 QNQEALLQSSQRILEAL-RSLSRVGTQDQQAAPPAQEVLTTCFQQLSGSYDEEYGGFSQC 216
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP PV + + + T E + +M L TL+ MA GGIHDH+G GFHRYS
Sbjct: 217 PKFPTPVNLNFLFTYWALHRTT---PEGARALQMSLHTLKMMAHGGIHDHIGQGFHRYST 273
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D WHVPHFEKMLYDQGQLA VY AF ++ D F++ + DIL Y RD+ P G +SA
Sbjct: 274 DRHWHVPHFEKMLYDQGQLAVVYSRAFQISGDEFFADVAADILLYASRDLGSPAGGFYSA 333
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDIL-------GEHAIL---FKEHYYLKPTGNC 348
EDADS T ++ K+EGAF VW ++EV +L E L F HY +K GN
Sbjct: 334 EDADSYPTATSSEKREGAFCVWAAEEVRALLPDPVEGAAEGTTLGDVFMHHYGVKEDGN- 392
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
+S DPH E +GKNVLI + +A+ G+ + +L E RR+L R++RPRPH
Sbjct: 393 -VSPRKDPHKELQGKNVLIAHSSPELTAAHFGLEPGQLSAVLQEGRRRLQAARAQRPRPH 451
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
LD K++ SWNGL+IS FA+A +L ++EY+ A AA F+RRHL
Sbjct: 452 LDTKMLASWNGLMISGFAQAGAVLA----------------KQEYVSRAAQAAGFVRRHL 495
Query: 469 YDEQTHRLQHSFRNG------PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 520
++ + RL S G S AP GFL+DY F+I GL DLYE WL WA++L
Sbjct: 496 WEPGSGRLLRSCYRGEADVVEQSAAPIHGFLEDYVFVIQGLFDLYEASLDQSWLEWALQL 555
Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
Q+TQD+LF D +G YF++ DPS+LLR+K+D DGAEP+ NSV+V NL+R AS S
Sbjct: 556 QHTQDKLFWDPKGFAYFSSEAGDPSLLLRLKDDQDGAEPAANSVTVTNLLRAASY---SG 612
Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 640
+ + A LA F RL+ + +A+P M A + + K VV+ G D + ML+
Sbjct: 613 HMEWVEKAGQILAAFSERLQKIPLALPEMARATAVFH-HTLKQVVICGDPQGEDTKEMLS 671
Query: 641 AAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
H+++ NK +I AD + F +S+ R K A VC NF+CS PV
Sbjct: 672 CVHSTFIPNKVLIL---ADGDGAGFLYRQLPFLSSLERKE---GKATAYVCSNFTCSLPV 725
Query: 701 TDPISLENLL 710
T P +L+ LL
Sbjct: 726 TSPRALQELL 735
>gi|189240570|ref|XP_973977.2| PREDICTED: similar to predicted protein [Tribolium castaneum]
Length = 754
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 316/746 (42%), Positives = 427/746 (57%), Gaps = 70/746 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCHVME ESFEDE VAK++N F+++KVDREERPD
Sbjct: 32 GQEAFDRAKKENKLIFLSVGYSTCHWCHVMEKESFEDEEVAKIMNQHFINVKVDREERPD 91
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YM ++QA GGGGWP+SVFL+P L+PL GGTYFPPEDKYGRPGFKT+L+ + + W
Sbjct: 92 VDKLYMAFIQASVGGGGWPMSVFLTPTLEPLAGGTYFPPEDKYGRPGFKTVLKSIAEQWR 151
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K+ +A SG +++E L + S+ + + ++ + C QLS SY+ FGGF +
Sbjct: 152 TKQSAIANSGKYSLEVLRKVSEREISAKQDINVPGEDVWKKCLLQLSHSYEDDFGGFSAQ 211
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP+P + + + + S + M L TL+ MA GGIHDHV GF RYSV
Sbjct: 212 PKFPQPCNLNFLFHMYSR---DKHSEQGFRCLHMCLNTLRKMAYGGIHDHVNCGFARYSV 268
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+RWHVPHFEKMLYDQ QLA Y DAF +TKD F++ + RDIL Y+ RD+ P G + A
Sbjct: 269 DDRWHVPHFEKMLYDQAQLAVSYADAFVVTKDDFFAEVLRDILLYVSRDLSHPLGGFYGA 328
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-------HAILFKEHYYLKPTGNCDLS 351
EDADS EGA+ K+EGAF VW +E+ +LGE H LF HY +K GN + +
Sbjct: 329 EDADSYPYEGASHKREGAFCVWEFEEISKLLGETKTDDISHRDLFIYHYNVKEDGNVNPA 388
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
+ DPH+E + KN+L+ ++ K +E IL C L+ R KRP+PH+D
Sbjct: 389 Q--DPHHELEKKNILVCFGSFEDTSRKFKTSVETVKEILKSCHEILYKERQKRPKPHVDT 446
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL+IS FA+A +LK + EY+ A AA+FI++ LY+E
Sbjct: 447 KIVTSWNGLMISGFAKAGFVLKDQ----------------EYINRAILAATFIKKFLYNE 490
Query: 472 QTHRL--------------------------QHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
Q L +S P+ GFLDDYAFLI GLLDLY
Sbjct: 491 QDKTLLRCCYKGDNAKIVQTVANLLSKSQPTLNSINRRPTPVNGFLDDYAFLIRGLLDLY 550
Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
E WL WA LQ QD LF D +G GYF + D S+L+R KED DGAEP GNS++
Sbjct: 551 EASLDADWLSWAEVLQEQQDRLFWDTKGSGYFTSPANDSSILIRGKEDQDGAEPCGNSIA 610
Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
V NL+RLA+ + ++D R A +L VF RLK + +A+P M A + S V
Sbjct: 611 VHNLIRLAAYL--DRAD-LRAKAGRTLTVFADRLKSIPVALPEMTSAL-LFYHNSPTQVF 666
Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHID-PADTEEMDFWEEHNSNNASMARNNFSAD 684
+ G + + ++ + + + + D P + H S+AR
Sbjct: 667 IAGPTEDNNTQALIDVVRSRFIPGRILAVTDGPGGL----LYRRHE----SLARLRPIQG 718
Query: 685 KVVALVCQNFSCSPPVTDPISLENLL 710
K A VC+NF+CS PVT+P L + L
Sbjct: 719 KPAAYVCRNFACSLPVTEPEELASNL 744
>gi|410895871|ref|XP_003961423.1| PREDICTED: spermatogenesis-associated protein 20-like [Takifugu
rubripes]
Length = 748
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 306/714 (42%), Positives = 421/714 (58%), Gaps = 50/714 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE + K+L+D FV IK+DREERPDVDKVYMT++QA G GGWP+S
Sbjct: 62 STCHWCHVMERESFEDEEIGKILSDNFVCIKLDREERPDVDKVYMTFIQATSGSGGWPMS 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL+P +GGTYFPP D RPG KT+L ++ D W R L +G +E L +
Sbjct: 122 VWLTPDLRPFIGGTYFPPRDHGRRPGLKTVLMRIIDQWTNNRSALESNGNKILEALKKGT 181
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ +A + P P + + C +QL+ SY+ +GGF +PKFP PV + ++ +
Sbjct: 182 AIAADAGTSPPFAP-DVTKRCFQQLANSYEEEYGGFRDSPKFPSPVNLMFLMSYWCMNRS 240
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T E E +M L TL+ MA GGIHDHV GFHRYS D WHVPHFEKMLYDQ QLA
Sbjct: 241 T---SEGVEALQMALHTLRMMALGGIHDHVSQGFHRYSTDSSWHVPHFEKMLYDQAQLAV 297
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y+ A ++ + FY+ + +DIL Y+ RD+ G +SAEDADS G T K+EGAF +
Sbjct: 298 AYITASQVSGEQFYADVAKDILCYVSRDLSDKSGGFYSAEDADSLPHCGGTEKREGAFCI 357
Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
WT+ EV ++L A +F HY +K GN +S DPH E +G+NVLI
Sbjct: 358 WTASEVRELLPDVVEGTAGSATQADIFMHHYGVKEQGN--VSPEQDPHGELQGQNVLIVR 415
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
+A+ G+ +E+ N+L R K+ ++R RPRPHLD K++ SWNGL++S++AR
Sbjct: 416 YSLELTAAHFGVSIEEVTNLLASARAKMAEIRKSRPRPHLDTKMLASWNGLMLSAYARVG 475
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------ 483
+L +A +E A AA+F++ H++D + L S G
Sbjct: 476 AVLGDKA----------------LLERAVQAANFLQEHMWDPEQQTLLRSCYLGDDMELQ 519
Query: 484 --PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
GFLDDYAF+I GLLDL+E T+WL WA ELQ QD+LF D EGGGYF +
Sbjct: 520 QISPPISGFLDDYAFIICGLLDLHEATLQTEWLRWAEELQLRQDKLFWDDEGGGYFCSDP 579
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
D +VLLR+KED DGAEPS NSVS NL+RL+ + + Q +E LA F RL
Sbjct: 580 SDFTVLLRLKEDQDGAEPSANSVSAFNLLRLSEYTGKQE---WLQKSERLLAAFTDRLTK 636
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
+ +A+P M A M + K +V+ G + S D +LA ++ + +K ++ ID E
Sbjct: 637 VPIALPEMVRAL-MAQHYTLKKIVICGKRDSPDTVTLLATVNSLFLPHKVLMLID--GDE 693
Query: 662 EMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
+ + H + + ++ + A +C NF+CS PVTDP L LLL++ S
Sbjct: 694 DSSLQQRHPALYSITQQDGVA----TAYICHNFTCSLPVTDPQELRRLLLDETS 743
>gi|317419139|emb|CBN81176.1| Spermatogenesis-associated protein 20 [Dicentrarchus labrax]
Length = 748
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 311/717 (43%), Positives = 427/717 (59%), Gaps = 52/717 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE + K+L+D FV IK+DREERPDVDKVYMT+VQA GGGGWP+S
Sbjct: 62 STCHWCHVMERESFEDEEIGKILSDNFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMS 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+L+P +GGTYFPP D RPG KT+L ++ + W R L SG +E L +
Sbjct: 122 VWLTPELRPFIGGTYFPPRDHARRPGLKTVLTRIMEQWQNNRPALESSGERILEALKKGT 181
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ +A+ + P P A R C +QL+ SY+ +GGF APKFP PV + ++ +
Sbjct: 182 AVAANPGESPPLAPDVANR-CFQQLAHSYEEEYGGFRDAPKFPTPVNLMFLMSYWSVNRS 240
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T E E +M L TL+ MA GGIHDHV GFHRYS D WHVPHFEKMLYDQ QLA
Sbjct: 241 T---SEGVEALQMALHTLRMMALGGIHDHVAQGFHRYSTDSSWHVPHFEKMLYDQAQLAV 297
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y+ A ++ + ++ + +DIL Y+ RD+ G +SAEDADS G K+EGAF V
Sbjct: 298 AYITASQVSGEQLFADVAKDILLYVTRDLSDKSGGFYSAEDADSVPASGGPEKREGAFCV 357
Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
WT+ EV ++L A +F HY +K GN ++ DPH E +G+NVLI
Sbjct: 358 WTATEVRELLPDVVEGATGSATQADIFMHHYGVKVQGN--VAPEQDPHGELQGQNVLIVR 415
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
+A+ G+ +EK +L R K+ +VR RP PHLD K++ SWNGL++S++AR
Sbjct: 416 YSVELTAAHFGISVEKVNELLASARGKMAEVRKSRPCPHLDTKMLGSWNGLMLSAYARVG 475
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD-EQTHRLQHSFRNGPSKA- 487
+L +A +E A A +F++ HL+D EQ L+ +R +
Sbjct: 476 AVLGDKA----------------LLERAAQAGNFLKEHLWDAEQQTILRSCYRGDEMEVQ 519
Query: 488 ------PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
GFLDDYAF+I GLLDLYE T+WL WA ELQ QDELFLD +GGGYF++
Sbjct: 520 QISPPISGFLDDYAFIICGLLDLYEATLQTEWLQWAEELQLRQDELFLDDQGGGYFSSDP 579
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
D +VLL++KED DGAEPSGNSVS NL+RL+ + + Q ++ LA F RL
Sbjct: 580 SDNTVLLQLKEDQDGAEPSGNSVSASNLLRLSHYTGRQE---WLQRSQQLLAAFTDRLTR 636
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID-PADT 660
+ +A+P M M + K +V+ G + + D ++LA ++ + +K ++ D AD+
Sbjct: 637 VPIALPEMVRTL-MAQHYTLKQIVICGQRDAPDTASLLATINSLFLPHKVLMLTDGDADS 695
Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
F + +SM++ + A A VCQ+F+CS PVTDP L LLL+ + T
Sbjct: 696 ----FLCQRLPVLSSMSQQDGVA---TAYVCQDFTCSLPVTDPQELRRLLLDGTTET 745
>gi|326672402|ref|XP_001920588.3| PREDICTED: spermatogenesis-associated protein 20 [Danio rerio]
Length = 818
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 307/712 (43%), Positives = 419/712 (58%), Gaps = 52/712 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE + K+L+D FV IKVDREERPDVDKVYMT+VQA GGGGWP+S
Sbjct: 140 STCHWCHVMERESFEDEEIGKILSDNFVCIKVDREERPDVDKVYMTFVQATSGGGGWPMS 199
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDLKP +GGTYFPP D RPG KT+L ++ + W R+ L SG +E L +
Sbjct: 200 VWLTPDLKPFIGGTYFPPRDSGRRPGLKTVLLRIIEQWQTNRETLESSGERVLEALRKGT 259
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ SAS + P A R C +QL+ S++ +GGF APKFP PV ++ ++
Sbjct: 260 AISASPGETLPPGPDVANR-CYQQLAHSFEEEYGGFREAPKFPSPVNLKFLMSFWAV--- 315
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
S E +E +M L TL+ MA GGIHDHV GFHRYS D WHVPHFEKMLYDQGQLA
Sbjct: 316 NRSSSEGAEALQMALHTLRMMALGGIHDHVAQGFHRYSTDSSWHVPHFEKMLYDQGQLAV 375
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y+ A+ ++ + ++ + RD+L Y+ RD+ G +SAEDADS T +T K+EGAF V
Sbjct: 376 AYITAYQVSGEQLFADVARDVLLYVSRDLSDKSGGFYSAEDADSFPTVESTEKREGAFCV 435
Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
WT+ E+ ++L A +F HY +K GN D ++ DPH E +G+NVLI
Sbjct: 436 WTAGEIRELLPDIVEGATGGATQADIFMHHYGVKEQGNVDPAQ--DPHGELQGQNVLIVR 493
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
+A+ G+ + + +L E R KL +VR RP PHLD K++ SWNGL++S FAR
Sbjct: 494 YSVELTAAHFGISVNRLSELLSEARAKLAEVRRARPPPHLDTKMLASWNGLMLSGFARVG 553
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------ 483
+L +A +E AE AA F++ HL+DE R+ HS G
Sbjct: 554 AVLGDKA----------------LLERAERAACFLQDHLWDEDGQRILHSCYRGNNMEVE 597
Query: 484 --PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
S GFLDDYAF++ GLLDL+E +WL WA ELQ QD+LF D +G GYF +
Sbjct: 598 QVASPITGFLDDYAFVVCGLLDLFEATQKFRWLQWAEELQLRQDQLFWDSQGSGYFCSDP 657
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
DP++LL +K+D DGAEPS NSVS +NL+RL+ + D+ Q +E L F RL
Sbjct: 658 SDPTLLLALKQDQDGAEPSANSVSAMNLLRLSHFTG--RQDWI-QRSEQLLTAFSDRLLK 714
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
+ +A+P M M + K +V+ G + D ++++ ++ + L V+ + +TE
Sbjct: 715 VPIALPDMVRGV-MAHHYTLKQIVICGLPDAEDTASLISCVNSLF-LPHKVLMLADGNTE 772
Query: 662 EMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
+ + + D K A VC+NF C+ PVT P L LL+E
Sbjct: 773 GFLY------DKLPILSTLVPQDGKATAYVCENFVCALPVTCPQELRRLLME 818
>gi|327264961|ref|XP_003217277.1| PREDICTED: spermatogenesis-associated protein 20-like [Anolis
carolinensis]
Length = 739
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 307/731 (41%), Positives = 429/731 (58%), Gaps = 53/731 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCHVME ESF++E +A++LN+ FVSIKVDREERPD
Sbjct: 40 GQEAFDKAKKEDKLIFLSVGYSTCHWCHVMEHESFQNEEIAQILNENFVSIKVDREERPD 99
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP+SV+L+PDLKP +GGTYFPPED + GF+T+L ++ + W
Sbjct: 100 VDKVYMTFVQATSSGGGWPMSVWLTPDLKPFVGGTYFPPEDGIYQVGFRTVLIRILEQWK 159
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ R L ++ + L + ++P L + + C +QLS+SYD +GGF
Sbjct: 160 RNRAALLENSQKILSALLARVDVGVRGEEIPPSL-KEVMSRCFQQLSESYDEEYGGFSET 218
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP PV + + + T E + +M L TL+ MA GGIHDH+ GFHRYS
Sbjct: 219 PKFPTPVNMNFLFSYWALHRST---SEGARALQMALHTLKMMAYGGIHDHIAQGFHRYST 275
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+RWHVPHFEKMLYDQGQLA V+ AF ++ D F++ I DIL Y RD+ G +SA
Sbjct: 276 DQRWHVPHFEKMLYDQGQLAVVFAKAFQISGDEFFADIVADILLYASRDLSDKSGGFYSA 335
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPTGNC 348
EDADS T + +K+EGAF VWT++E+ +L + A +F HY +K GN
Sbjct: 336 EDADSYPTAKSEKKQEGAFCVWTAEEIRHLLPDLIEGSPERKSVADVFMHHYGVKEDGN- 394
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
++ M DPHNE KGKNVLI +A++ G+ LE+ +L + R +L+ R++RPRPH
Sbjct: 395 -VNPMKDPHNELKGKNVLIVQYSLELTAARFGLGLEQLKTMLVKSRDQLYKARAQRPRPH 453
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
LD K++ SWNGL+IS FA++ IL +KEY++ A + A F+R ++
Sbjct: 454 LDTKMLASWNGLMISGFAQSGAIL----------------GKKEYVDRAVNTADFLRNYM 497
Query: 469 YDEQTHRLQHSFRNG------PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 520
++ +L S G S P GFL+DY F+I L DLYE WL WA++L
Sbjct: 498 FNASNGKLLRSCYQGKENSVDKSSVPIHGFLEDYVFVIQALFDLYEASLNPSWLEWAVQL 557
Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
Q+ QDELF D +G YF T DPS+LLR+K+D DGAEPS NSV+V NL+R AS +
Sbjct: 558 QHKQDELFWDPKGFAYFTTEASDPSLLLRMKDDQDGAEPSPNSVAVSNLLRAASYTGHKE 617
Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 640
+ + A L+ F RL + + +P M A + ++K VV+ G D +L
Sbjct: 618 ---WVKKAGQILSAFSERLLKIPVVLPEMARATAAFHL-TQKQVVICGDPKGEDTRELLH 673
Query: 641 AAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
++++ N+ +I AD F + +S+ + N K A +C+NF+CS PV
Sbjct: 674 CYYSTFTPNRVLIF---ADGNTTGFPYQQLGFLSSLEKKN---GKATAYLCENFACSLPV 727
Query: 701 TDPISLENLLL 711
T L LLL
Sbjct: 728 TSSQELRCLLL 738
>gi|156368209|ref|XP_001627588.1| predicted protein [Nematostella vectensis]
gi|156214502|gb|EDO35488.1| predicted protein [Nematostella vectensis]
Length = 735
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 303/734 (41%), Positives = 418/734 (56%), Gaps = 60/734 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K ++ FL +TCHWCHVME ESFEDE +AK+LN+ F+ +KVDREERPD
Sbjct: 37 GDEAFQKAKKEQKPIFLSVGYSTCHWCHVMERESFEDENIAKILNENFIPVKVDREERPD 96
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD+VYMTY+QA+ GGGGWP+S++L+PDLKP + GTYFPP D GRPGF T+L + WD
Sbjct: 97 VDRVYMTYIQAMVGGGGWPMSLWLTPDLKPFVAGTYFPPNDMAGRPGFGTVLGHIIKQWD 156
Query: 119 KKRDMLAQSGAFAIEQLSE-ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
+ Q + + E A + +P+ + + + +SKS+D GGFG
Sbjct: 157 TNKPKFTQQSTIVMNAILEHASEIGLDAKDMPN---KEVIEKLYQGMSKSFDEELGGFGG 213
Query: 178 APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
APKFP+P + YH K + E + L TL+CM KGGIHDHVG GFHRY
Sbjct: 214 APKFPQPATFNFLFKYHLLK----NGTEEGERALHICLKTLECMGKGGIHDHVGQGFHRY 269
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
S D WHVPHFEKMLYDQ Q+A Y + +TKD ++ CRDIL Y+ RD+ G +
Sbjct: 270 STDRFWHVPHFEKMLYDQAQIAAAYAMGYQMTKDEKFAETCRDILLYVMRDLSHKLGGFY 329
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-----------AILFKEHYYLKPT 345
SAEDADS + AT+K EGAFYVW +E++D+L + + LF +HY ++
Sbjct: 330 SAEDADSLPSPNATKKTEGAFYVWEEQELKDLLSDSLPTKGGGSILLSELFNKHYGVQAE 389
Query: 346 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 405
GN + DPH E KNVLI + L + ++ L + R LF+ R KRP
Sbjct: 390 GN--VKPHQDPHKELVKKNVLIVRGSLQDTIKDLDVEEDEAKEQLAKAREILFEERKKRP 447
Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
PHLDDK+I SWNGL+IS FAR+ ++L E Y+ A AA F+R
Sbjct: 448 APHLDDKMITSWNGLMISGFARSGQVLGEEV----------------YILRAIKAAEFVR 491
Query: 466 RHLYDEQTHRLQHSFRNG--------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWA 517
HLYD+ + L S G + G+ DY +LI+GLLDLYE +WL WA
Sbjct: 492 THLYDKSSGELLRSCYRGDKDSIAQIATPIKGYGCDYVYLINGLLDLYEASFDEQWLKWA 551
Query: 518 IELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 577
ELQ+ DELFLD+E GGYF T D S+L+R+K++ DGAEPS NS++V+NL+RL + V
Sbjct: 552 EELQDKADELFLDKEKGGYFEVTEADKSILVRLKDEQDGAEPSANSLAVMNLMRLGNFVD 611
Query: 578 GSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFEN 637
+ YR A+ V+E+RL+ + +A+P + ++ K +++ G + + D +
Sbjct: 612 CQR---YRDQAQRIFMVYESRLRQIPLALPELVSNFITHNL-GMKQIIIAGDRDADDTKL 667
Query: 638 MLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCS 697
++ H+ Y NK ++ D D F S ++ R + K A VCQN++C
Sbjct: 668 LMRCVHSHYIPNKVLLLCDGKDG----FLSTKLSVFKTLQRVD---GKATAYVCQNYTCQ 720
Query: 698 PPVTDPISLENLLL 711
PVT L LL+
Sbjct: 721 LPVTSEEELTKLLV 734
>gi|241111177|ref|XP_002399229.1| spermatogenesis-associated protein, putative [Ixodes scapularis]
gi|215492917|gb|EEC02558.1| spermatogenesis-associated protein, putative [Ixodes scapularis]
Length = 745
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 309/717 (43%), Positives = 416/717 (58%), Gaps = 59/717 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+ +A+L+N+ FV++KVDREERPD+D+VYMTY+QA GGGGWP+S
Sbjct: 65 STCHWCHVMERESFENADIARLMNEHFVNVKVDREERPDLDRVYMTYIQATSGGGGWPMS 124
Query: 80 VFLSPDLKPLMGGTYFPPEDKY-GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
V+L+PDLKP++GGTYFPP+D+Y GRPGFKT+L + + + ++L Q+ EA
Sbjct: 125 VWLTPDLKPIVGGTYFPPDDRYFGRPGFKTLLAAIAEQGSRIVEILRQASDLRSSDEREA 184
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+A+++S C EQLS+SYD GGFG APKFP+ V + +L H+ +
Sbjct: 185 GAAASTSGSEAVPRASTVAATCFEQLSRSYDEAMGGFGKAPKFPQCVNLNFLLRHAVASQ 244
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ G EA+ +M + TL MA+GGIHDHV GFHRYS D WHVPHFEKMLYDQ QLA
Sbjct: 245 EPG---EAARALEMCVNTLNKMARGGIHDHVAKGFHRYSTDGGWHVPHFEKMLYDQAQLA 301
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL+AF T+D + + RD+LDY+ RD+ G +SAEDADS + KKEGAF
Sbjct: 302 RAYLEAFQATRDPHLAQVARDVLDYVERDLSHQSGGFYSAEDADSLPEASSGEKKEGAFC 361
Query: 319 VWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
VW EV +L E A LF ++ ++ GN D M DPH+E KGKNVL+
Sbjct: 362 VWEEAEVRRLLPEPLPGCPGRTVADLFCRYFGVEAGGNVD--PMQDPHDELKGKNVLVVR 419
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
+ A + G+ L ++L + RR L + R +RPRPHLDDK + +WNGL++S FA A+
Sbjct: 420 ESQESLAERFGLELPVLHSLLEDARRVLLEARQRRPRPHLDDKFLAAWNGLMVSGFATAA 479
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS---- 485
K+L DR+ Y A A +F+ +HLYDE L S G
Sbjct: 480 KVL---------------GDRR-YAGRALQAVAFLGQHLYDEDRKSLLRSAYRGEGGHVT 523
Query: 486 ----KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
PG L+DYAF + GLLD YE L+ A ELQ+ QD F D + GGYF ++G
Sbjct: 524 QTARPIPGVLEDYAFTVQGLLDTYEACFEAPCLLRAEELQDAQDARFWDPDQGGYFLSSG 583
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
ED +LLR+K+D DGAEPS NSVS+ NLVRL+ ++ +++D R+ A+ + RL
Sbjct: 584 EDAHLLLRLKDDQDGAEPSPNSVSLSNLVRLSVLL--NRAD-LRERAQRLAEAYARRLSL 640
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
+ +A+P M C L + VV+ G K + +L+ + T I D
Sbjct: 641 LPLALPEMVCGLLRLQA-GPQEVVVAGGKDHPGTQELLSCLRGHFLPFLTTILAD----- 694
Query: 662 EMDFWEEHNSNNASMARNNFSADKVV-----ALVCQNFSCSPPVTDPISLENLLLEK 713
+ N NF A K V A VC+NF CS PVT + LE LL +K
Sbjct: 695 ------QDPENPLRERLPNFDAYKCVDGKPTAYVCRNFVCSKPVTSAVELERLLQQK 745
>gi|321473187|gb|EFX84155.1| hypothetical protein DAPPUDRAFT_47524 [Daphnia pulex]
Length = 661
Score = 545 bits (1404), Expect = e-152, Method: Compositional matrix adjust.
Identities = 290/622 (46%), Positives = 393/622 (63%), Gaps = 55/622 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA+L+N F++IKVDREERPDVDK+YM++VQA+ G GGWP+S
Sbjct: 61 STCHWCHVMEKESFEDENVAELMNSEFINIKVDREERPDVDKMYMSFVQAITGRGGWPMS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKY-GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
V+++P+LKP+ GGTY+PP+D+Y G+PGFKTIL+ + + W + SG E++ A
Sbjct: 121 VWMTPELKPVYGGTYYPPDDRYYGQPGFKTILKSLAEQWKENPGKFKASG----EKIMTA 176
Query: 139 LSASASSNKLPDELPQ--NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L+ S++ + D++P + LC +QL SY+ +FGGF APKFP+PV + ++L
Sbjct: 177 LARSSTLGR-GDQVPSAFDCGHLCFQQLRGSYEPKFGGFSKAPKFPQPVNMNLLLRWHVL 235
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+D S A + M L TL+ MAKGGI DHV GF RYS DE+WHVPHFEKMLYDQ Q
Sbjct: 236 SDDAADSDLALD---MCLHTLRMMAKGGIFDHVRLGFARYSTDEKWHVPHFEKMLYDQAQ 292
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
LA VY DA+ LTKD ++ + DIL Y+ D+ P G +SAEDADS G+ K+EGA
Sbjct: 293 LALVYTDAYLLTKDQDFARVASDILTYVSNDLSDPSGGFYSAEDADSYPETGSDEKREGA 352
Query: 317 FYVWTSKEVEDILGEHAI------------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
F VW+ KE++ +L + H+ ++P+GN D DPH+E KG+N
Sbjct: 353 FCVWSHKEIQSVLASQPAPSQVGPDVTVSDIVCYHFDIRPSGNVD--PYQDPHDELKGQN 410
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
VLI +A+K G+ ++ +L + + R +RPRPHLDDK++ SWNGL+IS+
Sbjct: 411 VLIIRGSDEETAAKFGLSMDVLRELLETALSTMREARQRRPRPHLDDKMLASWNGLMISA 470
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNG 483
ARA +IL R Y+E A AA F+R+HLYD Q+ RL S +R G
Sbjct: 471 LARAGQILG----------------RDTYVERAAKAAEFVRQHLYDGQSGRLLRSCYRGG 514
Query: 484 PSKAP----------GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
+ GFLDDYAF+I GLLDLY KW+ WA ELQ QD+LF D
Sbjct: 515 DGQQDAVSQNAEPIGGFLDDYAFVIRGLLDLYTACQDEKWIQWADELQQKQDQLFWDPSQ 574
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
GGYF++ DPS+L+R+KE+ DGAEPSGNS++V NL RLA VA +SD YR A +L
Sbjct: 575 GGYFSSAAGDPSILIRLKEEQDGAEPSGNSIAVGNLERLA--VAVDRSD-YRDQARRTLC 631
Query: 594 VFETRLKDMAMAVPLMCCAADM 615
+F+ RL + +++P M A +
Sbjct: 632 LFQDRLAKIPVSLPEMVAALQL 653
>gi|193215110|ref|YP_001996309.1| hypothetical protein Ctha_1399 [Chloroherpeton thalassium ATCC
35110]
gi|193088587|gb|ACF13862.1| protein of unknown function DUF255 [Chloroherpeton thalassium ATCC
35110]
Length = 710
Score = 545 bits (1404), Expect = e-152, Method: Compositional matrix adjust.
Identities = 288/703 (40%), Positives = 406/703 (57%), Gaps = 56/703 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E +A++LN+ FVSIKVDREE PD+DKVYMTYVQA G GGWP+S
Sbjct: 54 STCHWCHVMERESFENEEIARILNEHFVSIKVDREEHPDLDKVYMTYVQASTGSGGWPMS 113
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+LKP GGTYFPP D YGRPGF ++L K+ ++W + R+ + Q+ EQL
Sbjct: 114 VWLTPELKPFFGGTYFPPSDSYGRPGFGSMLLKIAESWQQSRERVLQAAGNISEQLQAFS 173
Query: 140 SASASSN-KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--LYHSKK 196
A + K+PDE A + Q +D +GGFG+APKFPRP + + +H K
Sbjct: 174 EMQAEAGAKVPDEA---AFQNTFAQFESVFDKDWGGFGNAPKFPRPAILNFLFTFFHQTK 230
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKM 250
E +M L TL+ MA GG+HDH+ GGGF RYS D WHVPHFEKM
Sbjct: 231 NE---------AALRMALHTLRKMADGGMHDHISVPGKGGGGFARYSTDAYWHVPHFEKM 281
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD QLA+ YLDA+ +T D F++ RDI +Y+ DM P G +SAEDADS +
Sbjct: 282 LYDNAQLASAYLDAYQITSDRFFADTARDIFNYVLCDMTAPEGGFYSAEDADSLAAPESP 341
Query: 311 RKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
K EGAFYVW E++ +LG+ A +F Y + P GN + DPH EFKGKN+LI
Sbjct: 342 EKTEGAFYVWERAEIDALLGDEASQIFSFIYGVHPGGNASV----DPHGEFKGKNILIRR 397
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
S +A + G ++ + R +LFD R +RPRPH DDK++ +WNGL+IS+FA+
Sbjct: 398 ATLSQAAQEFGKSEADIAEVMAKSRERLFDARLQRPRPHRDDKILTAWNGLMISAFAKGY 457
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L D Y+ A+ AA F+ LY+++T L +R+G S G
Sbjct: 458 MVL----------------DEATYLHAAQKAADFVIEKLYNKETGGLLRRYRDGESAIDG 501
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
DDYAF + L+DLYE K+L A++L Q+ LF D + GG+F++T E+ SV+ R
Sbjct: 502 KADDYAFFVQALIDLYEASFQFKYLSLALDLAEKQNALFYDAQNGGFFSSTSENKSVIFR 561
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+K+D DGAEPS NSV+ +NL+RL+ + + + +RQ AE ++ F L + +P M
Sbjct: 562 LKDDQDGAEPSANSVAALNLLRLSQM---ADREDFRQKAEATVNFFGKILSEAGNQMPQM 618
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A L K ++L G S + + A + Y+ K ++H EE
Sbjct: 619 FAALSFLKQKP-KQIILTGAPDSPELRALRKAIDSVYEPVKVLLHAT----------EET 667
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ ++ + + K A +C N++C P ++P + L+E
Sbjct: 668 AGLTSFLSSLSLGSQKPTAYICINYACRLPTSEPAKVREFLVE 710
>gi|357626408|gb|EHJ76509.1| hypothetical protein KGM_19065 [Danaus plexippus]
Length = 813
Score = 543 bits (1399), Expect = e-151, Method: Compositional matrix adjust.
Identities = 306/714 (42%), Positives = 409/714 (57%), Gaps = 55/714 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE E VAK++N+ F++IKVDREERPD+D+VYM +V A GGGGWP+S
Sbjct: 133 STCHWCHVMERESFESEDVAKIMNEHFINIKVDREERPDLDRVYMLFVMATTGGGGWPMS 192
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDL+P+ GGTYFPPED++GRPGFKTIL + W + + ++ ++ L
Sbjct: 193 VFLTPDLRPVTGGTYFPPEDRWGRPGFKTILLSLAKKWKENQTQFLEASINIMDALQNIS 252
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ +N +P E N C + +++ FGGFG+APKFP+ I L+H +
Sbjct: 253 NVKVETNSVPGEATWNK---CVRRYITNFEPHFGGFGTAPKFPQ-ASIFNFLFHFYARDK 308
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++ E + +M L TL ++KGGIHDHV GF RYSVD WHVPHFEKMLYDQ QL
Sbjct: 309 --QNPEGKQCLEMCLHTLTKISKGGIHDHVASGFARYSVDNDWHVPHFEKMLYDQAQLMV 366
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y DA+ TK+ +Y+ + RDI+ Y+ RD+ G +SAEDADS GA +KKEGAF V
Sbjct: 367 AYTDAYLATKEEYYADVVRDIVKYVNRDLRHDLGGYYSAEDADSYPVFGADKKKEGAFCV 426
Query: 320 WTSKEVEDILGEHAI-------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
W E+ ++G+ + +F +++ ++ +GN +S SDPH E KNVLI
Sbjct: 427 WEYDEINSLIGDKKVGNVSYLEIFCDYFNVEESGN--VSPESDPHGELTNKNVLIIYGSE 484
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ASK + ++ +L EC L++ RSKRPRPHLD K++ SWNGL IS A A +
Sbjct: 485 EETASKFEITKDQLKQVLKECIDILYEARSKRPRPHLDTKMLCSWNGLAISGLAHAGQ-- 542
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF----------RN 482
G K ++E A A+FI+ HLYD++ L HS N
Sbjct: 543 --------------GLGEKSFVEDAIKTANFIKEHLYDQENKTLLHSCYKAEDGNITQTN 588
Query: 483 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 542
P K GFLDDYAFLI GLLDLYE WL WA ELQ Q+ELF D + GGYF + E
Sbjct: 589 PPIK--GFLDDYAFLIRGLLDLYEASLDLHWLNWARELQEKQNELFWDSDNGGYFTCSAE 646
Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS----DYYRQNAEHSLAVFETR 598
D SV+LR+KED DGAEPSGNSVS NL RLA+ S + D R A+ L F R
Sbjct: 647 DTSVVLRLKEDQDGAEPSGNSVSCHNLQRLAAYADKSSAEEGGDRERDMAKKVLMAFAKR 706
Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
L D A P M A M S V++ G S ++ A + + + DP
Sbjct: 707 LIDSPTASPEMMSAL-MFFTDSPTQVLISGGCSDPRTLALVRAVRSRLLPGRVLAVADPK 765
Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
D+ ++ ++R + + A VC+ ++CS PVT LE LL E
Sbjct: 766 DSPA-------GMSDILLSRIRSTGEAPTAYVCRRYACSLPVTSVQQLETLLDE 812
>gi|328702149|ref|XP_001952649.2| PREDICTED: spermatogenesis-associated protein 20-like
[Acyrthosiphon pisum]
Length = 784
Score = 543 bits (1399), Expect = e-151, Method: Compositional matrix adjust.
Identities = 318/748 (42%), Positives = 421/748 (56%), Gaps = 78/748 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F ++ FL +TCHWCHVME ESFE++ VA ++N+ +V+IKVDREERPD
Sbjct: 81 GDEAFEKARSEKKLIFLSVGYSTCHWCHVMEHESFENQDVAAVMNEHYVNIKVDREERPD 140
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW- 117
VD++YMT+VQA G GGWP+SVFL+PDLKP+ GGTY+PPED YGRPGFKTIL + W
Sbjct: 141 VDQLYMTFVQAASGQGGWPMSVFLTPDLKPIGGGTYYPPEDAYGRPGFKTILLHMAKRWK 200
Query: 118 ----------DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS 167
K +L + AF I QL LS N P+ + C QL +
Sbjct: 201 SDSKSMLENSSKMMKILNDTTAFDI-QLGTELSNIMKPN------PKTWIT-CYSQLQRI 252
Query: 168 YDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHD 227
YD +GGFG PKFP+P + + + S K+ KS E + +M L TLQ M GGIHD
Sbjct: 253 YDDEWGGFGMPPKFPQPTILDFLFHISHKM---SKSYEGKKSLEMALETLQKMTMGGIHD 309
Query: 228 HVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD 287
H+G GF RYS DE+WHVPHFEKMLYDQ QLA Y AF +TK YS + DIL Y+ RD
Sbjct: 310 HIGQGFARYSTDEKWHVPHFEKMLYDQAQLAVSYTTAFQITKHEQYSDVVHDILQYVSRD 369
Query: 288 MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH---------AILFKE 338
+ G +SAEDADS T +T+K+EGAF WT +EV+ +L + + LF
Sbjct: 370 LSHKLGGFYSAEDADSLPTVDSTKKREGAFCTWTQEEVKTLLDQPLDSNPDIKLSELFCW 429
Query: 339 HYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLF 398
H+ + P GN SDPH E G+NVLIE +A K + +E L + LF
Sbjct: 430 HFSVLPNGNVRPD--SDPHGELLGQNVLIEFRSKENTAKKFQITVENVEKELKIAKSILF 487
Query: 399 DVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAE 458
+ R KRPRPHLD+K+I SWNGL+I+++ARA+ L E EY + A
Sbjct: 488 EARKKRPRPHLDNKIITSWNGLMITAYARAASALNVE----------------EYKQRAI 531
Query: 459 SAASFIRRHLYDEQTHRLQHSFRNG-------PSKAPGFLDDYAFLISGLLDLYEFGSGT 511
AA F++ H ++ L+ + N GFL+DYAFLI GLLDLYE +
Sbjct: 532 KAAEFLKTHAWNNSV-LLRSCYVNDIGDIANIEKPIAGFLNDYAFLIRGLLDLYECTLQS 590
Query: 512 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 571
KWL WA ELQ QDELF D+E GY++++ +DPS++LR K DHDGAEPSGNS+S +NL+R
Sbjct: 591 KWLKWADELQEQQDELFWDKEKFGYYSSSDKDPSIILRFKSDHDGAEPSGNSISALNLLR 650
Query: 572 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 631
L+ + S+ YR + F RL + A+P + A L S V + G
Sbjct: 651 LSILTEKSE---YRSKIDPLFLAFAGRLSGSSSALPALVSAL-TLHCDSITSVYVTGDLD 706
Query: 632 SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVC 691
+ + E +L+A Y N + H D E+ + +A N KV A VC
Sbjct: 707 NPELEALLSAIRQRYMPNLVLAHADENSLSEL-------AKGLGIAENG----KVAAYVC 755
Query: 692 QNFSCSPPVTDPISLENLL---LEKPSS 716
+N +C+ PV L LL +E P+S
Sbjct: 756 KNNTCNLPVHSTEELIALLDGRVESPAS 783
>gi|116626220|ref|YP_828376.1| hypothetical protein Acid_7180 [Candidatus Solibacter usitatus
Ellin6076]
gi|116229382|gb|ABJ88091.1| protein of unknown function DUF255 [Candidatus Solibacter usitatus
Ellin6076]
Length = 704
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 299/694 (43%), Positives = 412/694 (59%), Gaps = 42/694 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E +A LLN +++IKVDREERPDVD++YMT+VQA G GGWP+S
Sbjct: 49 STCHWCHVMERESFENEEIAALLNRDYIAIKVDREERPDVDRIYMTFVQATTGSGGWPMS 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+L+P GGTYFPPE+++G PGF +IL ++ W R + +S IEQL + +
Sbjct: 109 VWLTPELEPFFGGTYFPPENRWGHPGFGSILTQIAGVWRDNRPQVVESARDVIEQLKKHV 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S + Q L +++D+R GGFG+APKFPR V I L L
Sbjct: 169 EVAPSHGGV--AFDQATLDSGFSVFRRTFDTRTGGFGAAPKFPR-VSIHHFL-----LRY 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++G E MVL TL+ MA+GG++D +GGGFHRYSVD+RW VPHFEKMLYDQ Q+A
Sbjct: 221 YARTGN-KEALDMVLLTLREMARGGMNDQLGGGFHRYSVDDRWFVPHFEKMLYDQAQIAI 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAFY 318
YL+AF +T D Y+ R I DY+ RDM GG +SAEDADS T E T K EGAFY
Sbjct: 280 SYLEAFQVTGDAQYADTARAIFDYVLRDMTDSGGGFYSAEDADSIITPEQPTLKGEGAFY 339
Query: 319 VWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+W+ +E+ ++G A F Y ++ GN + +DPH EF GKN+L + + +A
Sbjct: 340 IWSMEEIHALVGAPASDWFCYRYGVREGGNVE----NDPHGEFTGKNILYQQHTLEQTAE 395
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
G P + L R L R+KR RPHLDDK++ SWNGL+IS+FA+ +L+
Sbjct: 396 HFGQPAGEMDATLDNAARILLQARAKRVRPHLDDKILTSWNGLMISAFAKGGAVLEEPRY 455
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ A AA+F+ L D + L +R G + PGFLDDYAF
Sbjct: 456 AEA----------------ARRAAAFVAGRLCDAASGTLLRRYREGDAAIPGFLDDYAFF 499
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ GLLDLYE L AI L Q ELF DRE G +F+T DP ++LRVKED+DGA
Sbjct: 500 VQGLLDLYEAQFDLSHLQLAIRLTEKQLELFEDREAGAFFSTIDGDPELVLRVKEDYDGA 559
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
EPSGNSVSV+NLVRLA I + D +RQ+A +L+ F +RL MAVP + A + ++
Sbjct: 560 EPSGNSVSVMNLVRLAQI---TNRDQFRQSAGRALSAFASRLSVAPMAVPQLLAACEFVT 616
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
R+ ++ G + S + + ML H + N+ V+ +D A+ + +
Sbjct: 617 GQPRE-IIFAGTRDSAELQAMLHELHRRFIPNRVVLLVDSAEARKT------LAGGIPSI 669
Query: 678 RNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
+ AD + A VC++++C PV+DP + L+
Sbjct: 670 ESMLPADGRATAYVCRDYTCQLPVSDPANFAELI 703
>gi|340721576|ref|XP_003399194.1| PREDICTED: spermatogenesis-associated protein 20-like [Bombus
terrestris]
Length = 831
Score = 540 bits (1391), Expect = e-150, Method: Compositional matrix adjust.
Identities = 298/718 (41%), Positives = 417/718 (58%), Gaps = 63/718 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF ++ +A+++N F++IKVD+EERPD+DK+YMT++QA G GGWP+S
Sbjct: 146 STCHWCHVMEKESFTNKEIAEIMNKNFINIKVDKEERPDIDKIYMTFIQATSGHGGWPMS 205
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+ DLKP++GGTYFPPED + + GFKTIL V W++ R L + G+ +E L ++
Sbjct: 206 VFLTADLKPIIGGTYFPPEDTFRQIGFKTILLSVAQKWNQSRSKLTEIGSTNLETLC-SI 264
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHS 194
S +S K+ D ++C +Q ++ +FGGFGS +PKFP+PV + L+H
Sbjct: 265 SKIPNSLKVHDTPSLECSKICIQQFVNGFEPKFGGFGSTYNMQSPKFPQPVNLN-FLFHM 323
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ +S M ++TL+ M+ GGIHDHVG GF RY+ D WHVPHFEKMLYDQ
Sbjct: 324 YARQPNVES--VRPCLHMSVYTLKKMSFGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQ 381
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
GQL Y DA+ +TKD F++ I DI Y+ RD+ G +SAEDADS T A KKE
Sbjct: 382 GQLMKSYADAYLVTKDNFFAEIVDDIATYVIRDLRHKEGGFYSAEDADSYPTHDAHAKKE 441
Query: 315 GAFYVWTSKEVEDILGEHAI---------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
GAFYVW++ E++ IL + +F H+ + +GN + DPH E K KNV
Sbjct: 442 GAFYVWSAVEIKSILNKEVSDETHVKLSDIFCRHFNVNESGN--VKSHQDPHGEIKEKNV 499
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
LI N+ +A +P+E+ L E L+ VRS RPRPHLDDK+I +WNGL+IS
Sbjct: 500 LIAYNEIEETARYFNLPVEETKMYLKEACSMLYKVRSARPRPHLDDKIITAWNGLMISGL 559
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNG- 483
A F + K+Y+E A AA FI+ +L+DE + L HS +R+
Sbjct: 560 A----------------FGGAAVNNKQYIERAADAAKFIKEYLFDETKNILLHSCYRDEK 603
Query: 484 ------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+ PGFLDDYAF+I GLLDLYE +WL +A +LQ+ QD+ F D + GGYF
Sbjct: 604 DTIIQISTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQHLQDQYFWDEKDGGYF 663
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+TT DPS++LR+KE +DGAEPSGNS++ NL+RLA + D ++ A H VF
Sbjct: 664 STTSSDPSIILRLKEAYDGAEPSGNSIAAENLLRLADYLG---CDEFKDKAAHLFRVFRH 720
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENMLAAAHASYDLNKTV 652
L + VP + S R H + +VG + + D + +L + N+ +
Sbjct: 721 LLMQSPVTVP------QLTSALVRYHDDAAQMYVVGKRGAKDTDELLRVIYKRLIPNRIL 774
Query: 653 IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ IDP T + + + N N + VC++ +CS PVT P L LL
Sbjct: 775 LLIDPDKTNSLLLRKNQHLRNMKSVNN-----RATVYVCKHRTCSLPVTSPEQLATLL 827
>gi|345485510|ref|XP_001604421.2| PREDICTED: spermatogenesis-associated protein 20-like [Nasonia
vitripennis]
Length = 797
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 309/729 (42%), Positives = 424/729 (58%), Gaps = 61/729 (8%)
Query: 11 KTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYM 64
K RR LI +TCHWCHVME ESFE+ VAK++N +FV+IKVDREERPD+D+VYM
Sbjct: 98 KARREDKLIFLSVGYSTCHWCHVMEKESFENPEVAKIMNRYFVNIKVDREERPDIDRVYM 157
Query: 65 TYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML 124
T++Q++ G GGWP+SVFL+PDL P+ GGTYFPP DKYG+PGF IL + W + + L
Sbjct: 158 TFIQSISGHGGWPMSVFLTPDLTPITGGTYFPPVDKYGQPGFSRILESIATKWIESKQDL 217
Query: 125 AQSGAFAIEQLSEALSASASSNKLPDE--LPQ-NALRLCAEQLSKSYDSRFGGFGSAPKF 181
+SG+ ++ L +++ + K P+E +P + C +QL ++ FGGF APKF
Sbjct: 218 LKSGSKILQVLKKSVES-----KDPEEASVPSVDCANTCVKQLINGFEPSFGGFSRAPKF 272
Query: 182 PRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
P+PV ++ + + TG++G+ + M + TL MA GGIHDHVG GF RYSVD +
Sbjct: 273 PQPVNFNLLFLMYAR-DPTGETGK--QCLNMCVHTLTKMANGGIHDHVGQGFSRYSVDGK 329
Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
WHVPHFEKMLYDQGQL Y +A+ +KD ++ I DI+ Y+ RD+ P G +SAEDA
Sbjct: 330 WHVPHFEKMLYDQGQLLRSYSEAYLASKDPLFAEIVNDIVTYVARDLRHPEGGFYSAEDA 389
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGE---------HAILFKEHYYLKPTGNCDLSR 352
DS + T KKEGAFYVW ++VE +L + + LF H+ +KP GN + R
Sbjct: 390 DSFPSFEDTEKKEGAFYVWRYEDVESLLDKVISEKEGLTLSDLFCYHFNVKPEGN--VQR 447
Query: 353 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 412
DPH E +NVLI + +A + ++ L + LF+ R+KRPRPHLDDK
Sbjct: 448 QQDPHGELMNQNVLIAFGSIAETAEHFKLSIDSVKAHLEKSISILFEERNKRPRPHLDDK 507
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
++ +WNGLVIS + A+ L D +Y + AE AA FI R+LY++
Sbjct: 508 IVTAWNGLVISGLSHAASAL----------------DNPKYTKFAEDAARFIERYLYNKD 551
Query: 473 THRLQHSFRNGPSKA--------PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
L S G S GF DYAF I GLLDLYE WL +A ELQ+ Q
Sbjct: 552 DKVLLRSCYRGDSDQILQTSVPIKGFQVDYAFAIRGLLDLYEVSFNAHWLEFAEELQDIQ 611
Query: 525 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
D LF D + GGYF+TT +D SV+LR+K+D DGAEPSGNSV+ NLVRLAS + ++D
Sbjct: 612 DSLFWDDKSGGYFSTTTDDRSVILRLKDDQDGAEPSGNSVACGNLVRLASYL--DRTD-L 668
Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
AE L+ + L +A P + A L + S V ++G K + D + +L +
Sbjct: 669 SSKAEKLLSSMQEILIQFPVACPELVTALVTL-IDSTTQVYIIGKKDTDDTKQLLKVLQS 727
Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPI 704
K V+ D + + + + + N M + N + A VC + CS PVTDP
Sbjct: 728 KLVPGKIVMLADGVNQDNVLY--KKNEVIGKMKQQN---GRATAYVCHHHICSLPVTDPK 782
Query: 705 SLENLLLEK 713
LE+LL +K
Sbjct: 783 DLESLLDKK 791
>gi|427788829|gb|JAA59866.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 766
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 311/735 (42%), Positives = 416/735 (56%), Gaps = 76/735 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE++ +AK++ND FV++KVDREERPDVD+VYMTY+QA GGGGWP+S
Sbjct: 65 STCHWCHVMERESFENDDIAKIMNDNFVNVKVDREERPDVDRVYMTYIQATSGGGGWPMS 124
Query: 80 VFLSPDLKPLMGGTYFPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGA--FAI-EQL 135
++L+PDLKP++GGTYFPP+D+ YG+PGFKT+L + + W K R L G F I EQ
Sbjct: 125 IWLTPDLKPVVGGTYFPPDDRYYGQPGFKTLLTSLAEQWRKNRTKLIDQGTRIFQILEQT 184
Query: 136 SE-----------ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP 184
S+ + S ++ K P + C QL +SYD GGFG APKFP+
Sbjct: 185 SDVRVFGGDGVPTSPRGSEANQKCP--FAPDVATTCYRQLERSYDVSMGGFGRAPKFPQC 242
Query: 185 VEIQMMLYHSKKLEDTGKSGEAS----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
V + +L + L EA + +M + TL+ MA+GGIHDH+G GFHRYS D
Sbjct: 243 VNLNFLLRYRAVLLQGDPPPEAKTAVDKALEMTVHTLRMMAQGGIHDHIGKGFHRYSTDG 302
Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAED 300
+WHVPHFEKMLYDQ QL Y +A+ +T D + + RDIL Y+ RD+ P G +SAED
Sbjct: 303 KWHVPHFEKMLYDQAQLTRTYSEAYQVTHDRRLADVARDILCYVERDLSHPSGGFYSAED 362
Query: 301 ADSAETEGATRKKEGAFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLS 351
ADS G K+EGAF VW EV +L E A + +Y ++ +GN D
Sbjct: 363 ADSYPEHGDKEKREGAFCVWEESEVYRLLTEPLPSCPTKTVADIVCRYYDIRKSGNVD-- 420
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
M DPH+E K KNVLI + A+ G+ + +L R LF+ R +RP+PHLDD
Sbjct: 421 PMQDPHDELKRKNVLIVRESKESVAACYGLEVGVLDALLERARETLFEARLRRPKPHLDD 480
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K + SWNGL+IS FA A++ L N PV Y++ A FI++HLY+
Sbjct: 481 KFLTSWNGLMISGFAIAARTL---------NQPV-------YLDRALKCVEFIKKHLYNP 524
Query: 472 QTHRLQHS-FR-------NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
+ L S +R G G L+DYAFLI LLD+YE L+WA ELQ+
Sbjct: 525 KKKTLIRSAYRGEDGSVVQGSQPIDGVLEDYAFLIQALLDVYEASFDVSCLMWAEELQDK 584
Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
QD LF D++ GYF + GEDP+V+LR+K+D DGAEPS NSVS+ NLVRL+ ++ + D
Sbjct: 585 QDRLFWDKKDMGYFLSNGEDPTVVLRLKDDQDGAEPSSNSVSLNNLVRLSVLL---QRDE 641
Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
RQ AE +V+ R+ + +A+P M C L + VV+ G + + +L+
Sbjct: 642 LRQRAEKLASVYGQRMILVPLALPEMVCGLMRLQA-GPQEVVIAGPRDDPGTKELLSCLR 700
Query: 644 ASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA-----DKVVALVCQNFSCSP 698
+ TVI D + N NF K A VCQ+F CS
Sbjct: 701 RHFLPFVTVILAD-----------QDPENPLRKRLTNFDGYTCVNGKPAAYVCQDFQCSK 749
Query: 699 PVTDPISLENLLLEK 713
PVT LE LL K
Sbjct: 750 PVTTAAELEALLTAK 764
>gi|340370640|ref|XP_003383854.1| PREDICTED: spermatogenesis-associated protein 20 [Amphimedon
queenslandica]
Length = 741
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 311/738 (42%), Positives = 431/738 (58%), Gaps = 65/738 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVME ESFE + VAK+LND FVSIKVDREERPD
Sbjct: 36 GEEAFTKSRNENKPIFLSVGYSTCHWCHVMERESFESDTVAKVLNDHFVSIKVDREERPD 95
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA G GGWP+SVFL+P+LKP +GGTYFPPED + P F TIL V + W
Sbjct: 96 VDKVYMTFVQATQGSGGWPMSVFLTPELKPFLGGTYFPPEDSFRSPSFLTILNAVHEQWT 155
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGS 177
K D + Q ++ L A++ S+S N +LP A ++ AE L+ +DS++GGFG
Sbjct: 156 KDHDNIKQKMNPLMKALQAAVAGSSSLNP---QLPGTACIQKAAEMLADRFDSKYGGFGQ 212
Query: 178 APKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
+ KFP+PV + ++L Y + G AS VLFTL+ M+ GG+HDH+G GFHR
Sbjct: 213 SMKFPQPVILDLLLRIYARYPSSEMGDGALAS-----VLFTLEAMSNGGMHDHIGQGFHR 267
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
YS D WHVPHFEKMLYDQ QL YL A+ +TKD + DIL+Y+ RD+ G
Sbjct: 268 YSTDPYWHVPHFEKMLYDQAQLVVTYLSAYQITKDDKFKETAVDILEYVLRDLGDKDGGF 327
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPT 345
+SAEDADS G KKEGAF VWT +E++ IL + A LF + +K
Sbjct: 328 YSAEDADSYRCHGDKEKKEGAFCVWTWEEIQSILLDPLPGGDTDKTLADLFSSRFGVKKG 387
Query: 346 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 405
GN ++ DPH E +NVLI +S+ + +E+ ++L E + +L+ +R++RP
Sbjct: 388 GNVRPNQ--DPHGELINQNVLIIKKSFEELSSEFSLEVEQVKSLLMEAKDRLYKMRAERP 445
Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
+PH DDK++ +WNGL++S+ +RAS++L EY+E A+SAASFIR
Sbjct: 446 KPHRDDKILTAWNGLMVSALSRASQVLGG----------------SEYLERAKSAASFIR 489
Query: 466 RHLYD-EQTHRLQHSFRN-----GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 519
LYD E++ L++++R+ S GF DDYAFLI GL+DLYE WL WA+E
Sbjct: 490 DSLYDKEKSVLLRNAYRDENDVLSVSTVEGFADDYAFLIRGLIDLYEASHDPLWLKWALE 549
Query: 520 LQNTQDELFLD------REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 573
LQ QD LFLD E GGYF+T+G D S+LLR+K+ DGAEPS NSVS NL+RL+
Sbjct: 550 LQEQQDRLFLDIKGEEGEEKGGYFSTSGMDDSILLRMKDGEDGAEPSANSVSAENLLRLS 609
Query: 574 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADMLSVPSRKHVVLVGHKSS 632
S S+ R +E+ F + + + A+ + A L P K V++VG S
Sbjct: 610 SFFDKSE---LRSKSENIFKTFNSSMMEHPPAMAALIGAFISYLQKP--KQVIIVGLISG 664
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
D + +L+ H+ + NKT+I DP+ + + M DK +C+
Sbjct: 665 DDTQALLSCIHSHFIPNKTLILHDPSSPSPLLMESLPLLKDMIMVD-----DKATVYLCE 719
Query: 693 NFSCSPPVTDPISLENLL 710
++ C+ P L++++
Sbjct: 720 DYKCAAPTNSSTVLKDMI 737
>gi|385648253|ref|NP_001245301.1| spermatogenesis-associated protein 20 isoform 2 precursor [Homo
sapiens]
gi|311033529|sp|Q8TB22.3|SPT20_HUMAN RecName: Full=Spermatogenesis-associated protein 20; AltName:
Full=Sperm-specific protein 411; Short=Ssp411; Flags:
Precursor
Length = 786
Score = 534 bits (1375), Expect = e-149, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 425/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD+LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSVPITDPCELRKLL 784
>gi|193787397|dbj|BAG52603.1| unnamed protein product [Homo sapiens]
Length = 742
Score = 533 bits (1374), Expect = e-149, Method: Compositional matrix adjust.
Identities = 303/736 (41%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GEEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ +D L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKDTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSVPITDPCELRKLL 740
>gi|84040225|gb|AAI11030.1| SPATA20 protein [Homo sapiens]
gi|119615009|gb|EAW94603.1| spermatogenesis associated 20, isoform CRA_a [Homo sapiens]
Length = 786
Score = 533 bits (1374), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSVPITDPCELRKLL 784
>gi|385648255|ref|NP_001245302.1| spermatogenesis-associated protein 20 isoform 3 [Homo sapiens]
Length = 742
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 425/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD+LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSVPITDPCELRKLL 740
>gi|31542723|ref|NP_073738.2| spermatogenesis-associated protein 20 isoform 1 precursor [Homo
sapiens]
gi|19263653|gb|AAH25255.1| Spermatogenesis associated 20 [Homo sapiens]
Length = 802
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 425/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD+LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSVPITDPCELRKLL 800
>gi|119615011|gb|EAW94605.1| spermatogenesis associated 20, isoform CRA_c [Homo sapiens]
Length = 742
Score = 533 bits (1372), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSVPITDPCELRKLL 740
>gi|426347561|ref|XP_004041418.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Gorilla
gorilla gorilla]
Length = 786
Score = 533 bits (1372), Expect = e-148, Method: Compositional matrix adjust.
Identities = 299/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ + + G
Sbjct: 318 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + P S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTSPGGTVDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M CA + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSMPITDPCELRKLL 784
>gi|41351283|gb|AAH65526.1| SPATA20 protein [Homo sapiens]
Length = 742
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GEEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSVPITDPCELRKLL 740
>gi|119615010|gb|EAW94604.1| spermatogenesis associated 20, isoform CRA_b [Homo sapiens]
Length = 802
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSVPITDPCELRKLL 800
>gi|158257042|dbj|BAF84494.1| unnamed protein product [Homo sapiens]
Length = 742
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GEEAFDKARKESKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSVPITDPCELRKLL 740
>gi|426347557|ref|XP_004041416.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Gorilla
gorilla gorilla]
Length = 802
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 299/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ + + G
Sbjct: 334 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + P S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTSPGGTVDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M CA + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSMPITDPCELRKLL 800
>gi|426347555|ref|XP_004041415.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Gorilla
gorilla gorilla]
Length = 742
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 299/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ + + G
Sbjct: 274 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + P S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTSPGGTVDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M CA + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSMPITDPCELRKLL 740
>gi|426347559|ref|XP_004041417.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Gorilla
gorilla gorilla]
Length = 786
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 299/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ + + G
Sbjct: 318 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + P S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTSPGGTVDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M CA + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSMPITDPCELRKLL 784
>gi|343958896|dbj|BAK63303.1| SPATA20 protein [Pan troglodytes]
Length = 742
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF+DE + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQDEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM +VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSMPITDPCELRKLL 740
>gi|114669341|ref|XP_001170552.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Pan
troglodytes]
gi|397493180|ref|XP_003817490.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Pan
paniscus]
Length = 786
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 301/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM +VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSMPITDPCELRKLL 784
>gi|410051894|ref|XP_003953187.1| PREDICTED: spermatogenesis-associated protein 20 [Pan troglodytes]
Length = 786
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 301/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM +VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSMPITDPCELRKLL 784
>gi|403279582|ref|XP_003931326.1| PREDICTED: spermatogenesis-associated protein 20 [Saimiri
boliviensis boliviensis]
Length = 742
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNALLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + +DIL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKDILQYVTRSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT+ EV+ +L E + LF +HY L
Sbjct: 334 GFYSAEDADSPPERG-MRPKEGAYYVWTANEVQQLLPEPVLGATEPLTSGQLFMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISSSQDPKGELQGQNVLTVRYSLELTAARFGLDVEGVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTSSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSMPITDPCELRKLL 740
>gi|10437433|dbj|BAB15051.1| unnamed protein product [Homo sapiens]
Length = 786
Score = 530 bits (1365), Expect = e-147, Method: Compositional matrix adjust.
Identities = 301/736 (40%), Positives = 422/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D YS + + IL Y+ R + G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDELYSDVAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
+ RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LERHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSVPITDPCELRKLL 784
>gi|114669347|ref|XP_001170636.1| PREDICTED: spermatogenesis-associated protein 20 isoform 7 [Pan
troglodytes]
gi|397493176|ref|XP_003817488.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Pan
paniscus]
Length = 742
Score = 530 bits (1365), Expect = e-147, Method: Compositional matrix adjust.
Identities = 301/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM +VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSMPITDPCELRKLL 740
>gi|114669339|ref|XP_511882.2| PREDICTED: spermatogenesis-associated protein 20 isoform 8 [Pan
troglodytes]
gi|397493178|ref|XP_003817489.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Pan
paniscus]
gi|410211920|gb|JAA03179.1| spermatogenesis associated 20 [Pan troglodytes]
gi|410266782|gb|JAA21357.1| spermatogenesis associated 20 [Pan troglodytes]
gi|410349593|gb|JAA41400.1| spermatogenesis associated 20 [Pan troglodytes]
Length = 802
Score = 530 bits (1364), Expect = e-147, Method: Compositional matrix adjust.
Identities = 301/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM +VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSMPITDPCELRKLL 800
>gi|449479427|ref|XP_002191427.2| PREDICTED: spermatogenesis-associated protein 20 [Taeniopygia
guttata]
Length = 753
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 290/709 (40%), Positives = 397/709 (55%), Gaps = 64/709 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF+ + + ++N+ FV IKVDREERPDVDKVYMT+VQA GGGGWP+S
Sbjct: 89 STCHWCHVMEEESFKSKEIGDIMNEHFVCIKVDREERPDVDKVYMTFVQATSGGGGWPMS 148
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDLKP GGTYFPPED GF+T+L ++ + W + +D L S +E L
Sbjct: 149 VWLTPDLKPFAGGTYFPPEDGVNHVGFRTVLLRIAEQWKENKDALLGSSQRILEALRHTS 208
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
P + + C +QLS+SYD +GGF PKFP PV + + + +
Sbjct: 209 EIRVQGQASPPP-AKEVMDTCFQQLSRSYDEEYGGFSKCPKFPSPVNLNFLFTYWALHQT 267
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T E + +M L TL+ MA GGIHDH+G GFHRYS+D+ WHVPHFEKMLYDQGQLA
Sbjct: 268 T---PEGARALQMALHTLKMMALGGIHDHIGQGFHRYSIDQHWHVPHFEKMLYDQGQLAA 324
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y AF ++ D F++ + RDIL Y+ RD+ G +SA+DADS T + K+EGAF V
Sbjct: 325 IYSKAFQISGDEFFADVVRDILLYVSRDLSDQAGGFYSAQDADSYPTTTSREKREGAFCV 384
Query: 320 WTSKEVEDILGEH----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
W +KE+ +L + A +F HY +K GN D +R DP+ E KGKNVLI
Sbjct: 385 WAAKELRALLPDPVEGATEGTTLADVFMHHYGVKEAGNVDPAR--DPYQELKGKNVLIVR 442
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
+A+K G+ + +L EC+++L R++RP+PHLD K++ +WNGL+IS FA+A
Sbjct: 443 CAPELTAAKFGLEPGRLSTLLQECQQRLSSARAQRPQPHLDTKMLAAWNGLMISGFAQAG 502
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL--------QHSFR 481
L + Y+ A AA+F+R HL+D + +L +S
Sbjct: 503 AALSEQG----------------YVSRAAQAAAFLRTHLFDPDSGKLLRSCYQGMHNSVE 546
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
G GFL+DY F+I L DLYE WL WA+ LQ+ QD+LF D +G YF+T
Sbjct: 547 QGAVPIQGFLEDYVFVIQALFDLYEVSLEQGWLEWALHLQHMQDKLFWDPKGFAYFSTEA 606
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
DPS+LLR+K+D DGAEP+ NSV+V NL +Q L R+
Sbjct: 607 SDPSLLLRLKDDQDGAEPAPNSVAVTNLRE------------KKQTRSEQL-----RVPM 649
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
+ + VP M + + K VV+ G D + ML + + NK ++ AD +
Sbjct: 650 ITVVVPEMLRTTAVFH-HTLKQVVICGDPQGEDTKEMLHCVRSVFSPNKVLM---VADGD 705
Query: 662 EMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
F AS+ R + K A VC NF+CS PVT L +L
Sbjct: 706 NAGFLYRQLPFLASLERKD---GKATAYVCSNFTCSLPVTSVQELRGML 751
>gi|134085853|ref|NP_001076876.1| spermatogenesis-associated protein 20 [Bos taurus]
gi|133777605|gb|AAI23690.1| SPATA20 protein [Bos taurus]
gi|296476477|tpg|DAA18592.1| TPA: spermatogenesis associated 20 [Bos taurus]
Length = 789
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 420/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP+SV+L+PDL+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMSVWLTPDLQPFVGGTYFPPEDGLTRVGFRTVLMRIRDQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ + L ++ ++++ AL A ++ + +LP +A + C +QL + YD +GGF
Sbjct: 210 QNKSTLLENS----QRVTTALLARSAISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QL Y AF ++ D FYS + + IL Y+ R++ G
Sbjct: 321 HRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSEVAKGILQYVVRNLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGAFYVWT KEV+ +L E + L +HY L
Sbjct: 381 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEPVLGATEPLTSGQLLMKHYGLT 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 440 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S FA +L E + N+ + G A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RVINYAING-------------AKF 541
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 542 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 601
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 602 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 717
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F ++ R D+ A VC+N
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRRLE---DRATAYVCENQ 771
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L +L
Sbjct: 772 ACSMPITEPCELRKVL 787
>gi|440910483|gb|ELR60277.1| Spermatogenesis-associated protein 20 [Bos grunniens mutus]
Length = 789
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 302/736 (41%), Positives = 420/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP+SV+L+PDL+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMSVWLTPDLQPFVGGTYFPPEDGLTRVGFRTVLMRIRDQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ + L ++ ++++ AL A ++ + +LP +A + C +QL + YD +GGF
Sbjct: 210 QNKSTLLENS----QRVTTALLARSAISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QL Y AF ++ D FYS + + IL Y+ R++ G
Sbjct: 321 HRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSEVAKGILQYVVRNLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGAFYVWT KEV+ +L E + L +HY L
Sbjct: 381 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEPVLGATEPLTSGQLLMKHYGLT 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 440 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S FA +L E + N+ + G A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RVINYAING-------------AKF 541
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 542 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 601
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 602 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 717
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F ++ R D+ A VC+N
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRRLE---DRATAYVCENQ 771
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L +L
Sbjct: 772 ACSMPITEPCELRKVL 787
>gi|350406875|ref|XP_003487911.1| PREDICTED: spermatogenesis-associated protein 20-like [Bombus
impatiens]
Length = 831
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 293/721 (40%), Positives = 410/721 (56%), Gaps = 63/721 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF ++ +A+++N F++IKVD+EERPD+D++YMT++QA G GGWP+S
Sbjct: 146 STCHWCHVMEKESFTNKEIAEIMNKNFINIKVDKEERPDIDRIYMTFIQATSGHGGWPMS 205
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+ DLKP++GGTYFPPED + + GFKTIL V W++ R L + G+ +E L ++
Sbjct: 206 VFLTTDLKPIVGGTYFPPEDTFRQTGFKTILLSVAQKWNQSRSKLTEIGSTNLETL-HSI 264
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHS 194
S S K+ D ++C +QL ++ +FGGFGS +PKFP+PV L+H
Sbjct: 265 SKIPDSLKVHDIPSLECSKICIQQLVNEFEPKFGGFGSTYNMQSPKFPQPVNFN-FLFHM 323
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ +S M ++TL+ M+ GGIHDHVG GF RY+ D WHVPHFEKMLYDQ
Sbjct: 324 YARQPNVES--VRPCLYMSVYTLKRMSFGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQ 381
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
GQL Y DA+ +TKD +++ I DI Y+ RD+ G +SAEDADS KKE
Sbjct: 382 GQLMKSYADAYLVTKDNYFAEIVDDIATYVIRDLRHKEGGFYSAEDADSYPMHDTHAKKE 441
Query: 315 GAFYVWTSKEVEDILGEHAI---------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
GAFYVW++ E++ +L + +F H+ + +GN + DPH E KNV
Sbjct: 442 GAFYVWSAMEIKSLLNKEVSDENHVKLSDIFCRHFNVNESGN--VKSHQDPHGEMGQKNV 499
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
LI N+ +A +P+E+ L E L+ VRS RPRPHLDDK+I SWNGL+IS
Sbjct: 500 LIAYNEIEETARYFNLPIEETKMYLKEACSMLYKVRSARPRPHLDDKIITSWNGLMISGL 559
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS------ 479
A F + K+Y+E A AA FI+ +L+DE + L HS
Sbjct: 560 A----------------FGGAAVNNKQYIEHAADAAKFIKEYLFDETKNILLHSCYRDEK 603
Query: 480 --FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+ PGFLDDYAF+I GLLDLYE +WL +A +LQ+ QD+ F D GGYF
Sbjct: 604 GTITQMSTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQHLQDQYFWDETNGGYF 663
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
TT DPS++LR+KE +DGAEPSGNS++ NL+RLA + D ++ A F
Sbjct: 664 LTTSSDPSIILRLKEVYDGAEPSGNSIAAENLLRLADYLG---CDEFKDKAARLFGAFRY 720
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENMLAAAHASYDLNKTV 652
L +AVP + S R H + +VG + + D + +L + N+ +
Sbjct: 721 LLMQRPVAVP------QLTSALVRYHDDAAQIYVVGKRGAKDTDELLRVIYKRLIPNRIL 774
Query: 653 IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ IDP +T + + + N N + VC++ +CS PVT P L LL E
Sbjct: 775 LLIDPDETNSVLLRKNQHLRNMKSLNN-----RTTVYVCKHRTCSLPVTSPEQLATLLDE 829
Query: 713 K 713
+
Sbjct: 830 Q 830
>gi|47211932|emb|CAF92441.1| unnamed protein product [Tetraodon nigroviridis]
Length = 833
Score = 528 bits (1361), Expect = e-147, Method: Compositional matrix adjust.
Identities = 288/668 (43%), Positives = 387/668 (57%), Gaps = 69/668 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE + K+LND FV IK+DREERPDVDKVYMT+VQA GGGGWP+S
Sbjct: 46 STCHWCHVMERESFEDEEIGKILNDNFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMS 105
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL+P +GGTYFPP D GRPG KT+L ++ D W R L +G +E L +
Sbjct: 106 VWLTPDLRPFIGGTYFPPRDHGGRPGLKTVLMRIIDQWRNNRPTLESNGNKILEALRKGT 165
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ ++ + P P A R C +QL+ SY+ +GGF APKFP PV + ++ +
Sbjct: 166 AIASDAGSSPAFAPDVAKR-CFQQLANSYEEEYGGFREAPKFPSPVNLMFLMSYWCVNRS 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T E E +M L TL+ MA GGI+DHV GFHRYS D WHVPHFEKMLYDQ QLA
Sbjct: 225 TS---EGVEALQMALHTLRMMALGGINDHVSQGFHRYSTDSSWHVPHFEKMLYDQAQLAV 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y+ A + + FY+ + +D+L Y+ RD+ G +SAEDADSA G K+EGAF +
Sbjct: 282 AYITASQASGEQFYADVAKDVLRYVSRDLSDKSGGFYSAEDADSAPPSGGAEKREGAFCI 341
Query: 320 WTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
WT+ EV ++L A +F HY +K GN +S DPH E +G+NVLI
Sbjct: 342 WTASEVRELLPDVVKGASASATQADIFMHHYGVKEQGN--VSPEQDPHGELQGQNVLIVR 399
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
+A+ G+ +E+ +L R K+ VR RPRPHLD K++ SWNGL++S++AR
Sbjct: 400 YSLELTAAHFGISVEEVSALLASARAKMAAVRKSRPRPHLDTKMLASWNGLMLSAYARVG 459
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD-EQTHRLQHSF-------- 480
+L K +E A AA+F++ HL+D EQ L+ +
Sbjct: 460 AVLGD----------------KTLLERAAQAANFLQEHLWDPEQQIVLRSCYLGDNMELQ 503
Query: 481 ----------------------RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAI 518
R+ P GFLDDYAF+I GLLDL+E T+WL WA
Sbjct: 504 QMTIKLNLPELSNENNYETVTQRSQPIS--GFLDDYAFIICGLLDLHEATLQTEWLRWAE 561
Query: 519 ELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
ELQ QD+LF D +GGGYF + D +VLL++KED DGAEPS NSVS NL+RL+
Sbjct: 562 ELQLRQDKLFWDEQGGGYFCSDPSDSTVLLQLKEDQDGAEPSANSVSAFNLLRLSHYTGR 621
Query: 579 SKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM 638
+ + Q ++ LA F RL +A+P M A M + K +V+ G + S D +
Sbjct: 622 QE---WLQKSQRLLAAFTDRLTRAPIALPEMVRAL-MAQHYTLKQIVICGQRDSPDTAAL 677
Query: 639 LAAAHASY 646
L+ ++ +
Sbjct: 678 LSTVNSLF 685
>gi|297700798|ref|XP_002827419.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Pongo
abelii]
Length = 786
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDMAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSMPITDPCELRKLL 784
>gi|402899623|ref|XP_003912790.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Papio
anubis]
Length = 786
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 318 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LISYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTGSGGTVEHSSPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSMPITDPCELRKLL 784
>gi|297700802|ref|XP_002827421.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Pongo
abelii]
Length = 742
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDMAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSMPITDPCELRKLL 740
>gi|402899619|ref|XP_003912788.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Papio
anubis]
Length = 742
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LISYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGSGGTVEHSSPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSMPITDPCELRKLL 740
>gi|410298424|gb|JAA27812.1| spermatogenesis associated 20 [Pan troglodytes]
Length = 802
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 301/736 (40%), Positives = 422/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFLI---NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGSPTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM +VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSMPITDPCELRKLL 800
>gi|109114323|ref|XP_001099418.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Macaca
mulatta]
Length = 786
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 318 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 495 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 538
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 539 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 598
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 599 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 658
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 659 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 714
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 715 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 768
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 769 ACSMPITDPCELRKLL 784
>gi|109114325|ref|XP_001099321.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Macaca
mulatta]
Length = 742
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 43 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 103 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 163 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 218
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 219 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 273
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 274 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 333
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 334 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 392
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 393 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 450
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 451 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 494
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 495 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 554
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 555 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 614
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 615 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 670
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 671 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 724
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 725 ACSMPITDPCELRKLL 740
>gi|332246333|ref|XP_003272309.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
20 [Nomascus leucogenys]
Length = 802
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEKESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLAPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L +S ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLESS----QRVTTALLARSEISVGDRQLPPSAATMSNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADS G KEGA+YVWT KE + +L E L +HY L
Sbjct: 394 GFYSAEDADSPPERGMX-PKEGAYYVWTVKEFQQLLPEPVPGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD+K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDNKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLIRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD+LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M CA + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVRCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSMPITDPCELRKLL 800
>gi|355753994|gb|EHH57959.1| hypothetical protein EGM_07713, partial [Macaca fascicularis]
Length = 777
Score = 527 bits (1358), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 78 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 137
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 138 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 197
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 198 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 253
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 254 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 308
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 309 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 368
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 369 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 427
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 428 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 485
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 486 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 529
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 530 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 589
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 590 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 649
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 650 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 705
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 706 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 759
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 760 ACSMPITDPCELRKLL 775
>gi|402899621|ref|XP_003912789.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Papio
anubis]
Length = 802
Score = 527 bits (1358), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LISYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTGSGGTVEHSSPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSMPITDPCELRKLL 800
>gi|297700800|ref|XP_002827420.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Pongo
abelii]
Length = 802
Score = 527 bits (1358), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDMAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSMPITDPCELRKLL 800
>gi|109114321|ref|XP_001099622.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Macaca
mulatta]
gi|355568523|gb|EHH24804.1| hypothetical protein EGK_08527 [Macaca mulatta]
Length = 802
Score = 527 bits (1357), Expect = e-147, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 423/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F +++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DQATAYVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 785 ACSMPITDPCELRKLL 800
>gi|182413448|ref|YP_001818514.1| hypothetical protein Oter_1630 [Opitutus terrae PB90-1]
gi|177840662|gb|ACB74914.1| protein of unknown function DUF255 [Opitutus terrae PB90-1]
Length = 751
Score = 527 bits (1357), Expect = e-146, Method: Compositional matrix adjust.
Identities = 310/746 (41%), Positives = 409/746 (54%), Gaps = 60/746 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F ++ FL TCHWCHVM ESFE+E VA+LLN+ FV+IKVDREERPD
Sbjct: 27 GEAAFAKARAEQKPIFLSIGYATCHWCHVMAHESFENEAVAQLLNESFVAIKVDREERPD 86
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD+VYMTYVQA+ G GGWPLS +L+PDLKP GGTYFPPED+ GR GF ILR + W
Sbjct: 87 VDRVYMTYVQAMTGHGGWPLSAWLTPDLKPFFGGTYFPPEDRQGRAGFAAILRAIAHGWS 146
Query: 119 KKRDMLAQSGAFAIEQLSE--------------ALSASASSNKLPDELPQN-------AL 157
+R+ L G I L E SA A D L A
Sbjct: 147 TEREKLVAEGERVIAALREHQQSKTADVSKSTGGESAGAEIGSGIDALIHQLHERGAPAF 206
Query: 158 RLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA-SEGQKMVLFT 216
+ +++D GGFG APKFPR + L+ + L+ G + EA +E ++ T
Sbjct: 207 ERGFQYFYEAFDPEHGGFGGAPKFPRASNLS-FLFRAAALQ--GVASEAGAEAIRLASAT 263
Query: 217 LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYI 276
LQ MA+GGIHDHVGGGFHRYSVDERW VPHFEKMLYDQ Q+A L+A T D ++++
Sbjct: 264 LQAMARGGIHDHVGGGFHRYSVDERWFVPHFEKMLYDQAQIALNALEAKQATGDERFAWL 323
Query: 277 CRDILDYLRRDMIGPGGEIFSAEDADSAETEG----ATRKKEGAFYVWTSKEVEDILGEH 332
RDIL Y+ RD+ P G +SAEDADSA +K EGAFYVW E+E +LG+
Sbjct: 324 ARDILTYVLRDLAHPDGGFYSAEDADSAAANAEPGHGGKKVEGAFYVWAQSEIEQVLGDE 383
Query: 333 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 392
A L EH+ +KP GN + DPH EF GKNVL + + +A + E L
Sbjct: 384 ARLVCEHFGVKPDGN--VPGQLDPHGEFTGKNVLAQAQPLATTAKAHELTPEMASERLQA 441
Query: 393 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 452
+L VR++RPRP DDK+I +WNGL+IS+ A+A +L+ ++A
Sbjct: 442 ALERLRAVRAQRPRPLRDDKIITAWNGLMISALAKAHVVLELAEDAA----------ETL 491
Query: 453 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 512
Y+ A A F+ R L+D L S+R G S GF +DYAF+I GLLDLYE G +
Sbjct: 492 YLGAATRTAEFVERELFDRDRAILFRSWRGGRSAVEGFAEDYAFMIQGLLDLYEAGFDVR 551
Query: 513 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 572
WL WA LQ T D F D E GGYFN+ +DP ++LR+KED+DGAEP+ +SV+ +NL+RL
Sbjct: 552 WLQWAERLQATMDARFWDAEHGGYFNSASDDPHLVLRLKEDYDGAEPAPSSVAAMNLLRL 611
Query: 573 ASIVAGSKSDY------YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
++ + YR+ ++ F+ + A+P M CA + +P HVVL
Sbjct: 612 GVMIERPGAAAAAGGIDYRERGLRTILAFQEQWSQTPQALPQMLCALERALMPP-AHVVL 670
Query: 627 VGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN--NASMARNNFSAD 684
G F +L ++ AD E W + RN
Sbjct: 671 AGQPGDEAFRALLRVVQGRLGSQHVLL---VADGGEGQRWLSARAPWLTTMTPRNG---- 723
Query: 685 KVVALVCQNFSCSPPVTDPISLENLL 710
+ A VC++F+C PV P +L +LL
Sbjct: 724 QATAYVCEDFTCQAPVESPAALRDLL 749
>gi|350590464|ref|XP_003483066.1| PREDICTED: spermatogenesis-associated protein 20-like [Sus scrofa]
Length = 749
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 300/736 (40%), Positives = 418/736 (56%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 50 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 109
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP+SV+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 110 VDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 169
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ + L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 170 QNKKTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 225
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 226 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 280
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QL Y AF ++ D FYS + + IL Y+ R++ G
Sbjct: 281 HRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSDVAKGILQYVARNLSHRSG 340
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLK 343
+SAEDADS G R KEGAFY+WT KEV+ +L EH L +HY L
Sbjct: 341 GFYSAEDADSPPERG-MRPKEGAFYLWTVKEVQQLLPEHVPGATEPLTSGQLLMKHYGLT 399
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 400 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVQTLLNTGLEKLFQARKH 457
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S FA +L E + N+ + G A F
Sbjct: 458 RPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RLINYAING-------------AKF 501
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DY F++ GLLDLYE + WL
Sbjct: 502 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYTFVVRGLLDLYEASQESAWLE 561
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 562 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 621
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 622 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 677
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F ++ R D+ A VC+N
Sbjct: 678 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLGTLRRLE---DRATAYVCENQ 731
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L LL
Sbjct: 732 ACSMPITEPCELRKLL 747
>gi|344285393|ref|XP_003414446.1| PREDICTED: spermatogenesis-associated protein 20 [Loxodonta
africana]
Length = 789
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 303/738 (41%), Positives = 422/738 (57%), Gaps = 65/738 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP+SV+L+P+L+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ R+ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 210 QNRNTLLENS----QRVTAALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S ++ G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLSHRITQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +W VPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 321 HRYSTDRQWLVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVSRSLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGAFY+WT KE++ +L E + L +HY L
Sbjct: 381 GFYSAEDADSPPERG-MRPKEGAFYLWTVKEIQQLLPEPVLGASEPLTSGQLLTKHYGLT 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF VR
Sbjct: 440 EAGN--ISPNQDPKGELQGQNVLNVRYSLELTAARFGLDVEAVRTLLNLGLEKLFQVRKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RPRPHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 498 RPRPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GMDR--LINCAINGAKF 541
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D T RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 542 LKRHMFDVATGRLMRTCYAGSGGTVEHSDPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 601
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 602 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 717
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F ++ R D+ A VC+N
Sbjct: 718 TKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRRLE---DQATAYVCENQ 771
Query: 695 SCSPPVTDPISLENLLLE 712
+CS P+T+P L LLL+
Sbjct: 772 ACSMPITEPCELRKLLLQ 789
>gi|449283068|gb|EMC89771.1| Spermatogenesis-associated protein 20, partial [Columba livia]
Length = 682
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 286/671 (42%), Positives = 395/671 (58%), Gaps = 53/671 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCHVME ESF+++ + ++++ FV IKVDREERPD
Sbjct: 44 GQEAFDKAKKENKLIFLSVGYSTCHWCHVMEEESFKNKEIGEIMSKNFVCIKVDREERPD 103
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+ A GGGGWP+SV+L+PDLKP GGTYFPPED R GF+T+L ++ + W
Sbjct: 104 VDKVYMTF--ATSGGGGWPMSVWLTPDLKPFAGGTYFPPEDGVHRVGFRTVLLRIAEQWK 161
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ +D L +S +E L + P + + C +QLS SYD +GGF +
Sbjct: 162 ENKDSLLESSRKILEALQHVSEIRVRGQESPPP-SKEVMATCFQQLSNSYDEDYGGFSKS 220
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP PV + L+ L T + E + +M L TL+ MA GGIHDH+ GFHRYS
Sbjct: 221 PKFPSPVNLNF-LFTYWALHRT--TPEGARALQMALHTLKMMAHGGIHDHIDQGFHRYST 277
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+ WHVPHFEKMLYDQGQLA Y AF ++ D F++ + +DIL Y+ RD+ G +SA
Sbjct: 278 DQHWHVPHFEKMLYDQGQLAATYSRAFQISGDQFFADVAQDILLYVSRDLSDQAGGFYSA 337
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPTGNC 348
EDADS T + K+EGAF VW ++E+ +L + +F HY +K TGN
Sbjct: 338 EDADSYPTTASKEKREGAFCVWAAEEIRALLPDPVEGATEGTTLGDVFMHHYGVKETGN- 396
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
+S M DPH E KGKNVLI +A++ G+ L + +L E R++L R++RPRPH
Sbjct: 397 -VSPMQDPHQELKGKNVLIVRCSPEVTAAQFGLELGRLGAVLQEGRQRLSTARAQRPRPH 455
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
LD K++ +WNGL+IS FA+A +L D++EY+ A AA+F+R+HL
Sbjct: 456 LDTKMLAAWNGLMISGFAQAGTVL----------------DKQEYVSRAAQAAAFLRKHL 499
Query: 469 YDEQTHRLQHSFRNG------PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 520
+D + RL S G S P GFL+DY F+I L DLYE WL WA++L
Sbjct: 500 FDPTSGRLLRSCYRGRDNTVEQSAVPIQGFLEDYVFVIQALFDLYEASLEQDWLEWALQL 559
Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
Q+ QD+LF D +G YF++ DPS+LLR+K D DGAEP+ NSV+V NL+R A A +
Sbjct: 560 QHMQDKLFWDSKGFAYFSSEAGDPSLLLRLKGDQDGAEPTANSVTVTNLLRAACYSAHME 619
Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 640
+ + A LA F RL+ +P+M A + + K V++ G D + ML
Sbjct: 620 ---WVEKAGQILAAFSERLQK----IPIMARATAVFH-HTLKQVIICGDPQGEDTKEMLR 671
Query: 641 AAHASYDLNKT 651
H+ + NK
Sbjct: 672 CVHSVFSPNKV 682
>gi|73966409|ref|XP_548202.2| PREDICTED: spermatogenesis-associated protein 20 [Canis lupus
familiaris]
Length = 789
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 295/736 (40%), Positives = 424/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + LLN+ FVS+KVDREERPD
Sbjct: 90 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + ++P +A + C +QL + YD +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISMGDRQVPPSAATMNSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R++ G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARNLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R +EGAFYVWT KEV+++L E + L +HY L
Sbjct: 381 GFYSAEDADSPPERG-MRPREGAFYVWTVKEVQNLLPEPVLGATEPLTSGQLLMKHYGLT 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ ++ +L KLF R
Sbjct: 440 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L E + N+ + G A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGYAVTGAVLGQE---RLINYAING-------------AKF 541
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 542 LKRHMFDVASGRLMRTCYAGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 601
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+R+
Sbjct: 602 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRMHG 661
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKD 717
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I A+ + F +++ R D+ A VC++
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ANGDPSSFLSRQLPFLSTLRRLE---DRATAYVCEDQ 771
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L LL
Sbjct: 772 ACSMPITEPCELRKLL 787
>gi|410349595|gb|JAA41401.1| spermatogenesis associated 20 [Pan troglodytes]
Length = 802
Score = 523 bits (1347), Expect = e-145, Method: Compositional matrix adjust.
Identities = 299/736 (40%), Positives = 421/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 162
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM +VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 163 VDKVYMMFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 222
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 223 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 278
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 279 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 333
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 334 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 393
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 394 GFYSAEDADSPPERG-LRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 453 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 511 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 554
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 555 LKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 614
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 615 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 674
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 675 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 730
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I D + + W S ++ R D+ A VC+N
Sbjct: 731 TKALVQCVHSVYIPNKVLILADGDPSSFLSHWLPFLS---TLRRQE---DQATASVCENQ 784
Query: 695 SCSPPVTDPISLENLL 710
+CS +TD L LL
Sbjct: 785 ACSMLITDTCELRKLL 800
>gi|380028980|ref|XP_003698161.1| PREDICTED: spermatogenesis-associated protein 20 [Apis florea]
Length = 746
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 284/722 (39%), Positives = 415/722 (57%), Gaps = 65/722 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF+++ +A ++N F++IKVD+EERPD+D++YMT+VQA G GGWP+S
Sbjct: 61 STCHWCHVMEKESFKNKEIAIIMNKNFINIKVDKEERPDIDRIYMTFVQATTGHGGWPMS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP+ GGTYFPPED + GFKTIL + W++ + + ++G+ +E L + +
Sbjct: 121 VFLTPDLKPIFGGTYFPPEDTSRQTGFKTILLSIAQKWNQSKTKINEAGSTNLEIL-QNI 179
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHS 194
S ++KL D +C +QL ++ +FGGFGS +PKFP+PV + +
Sbjct: 180 SKIPHTSKLHDIPSLECSEICIQQLENEFEPKFGGFGSIYNMQSPKFPQPVNFNFLFHMY 239
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ + + A M ++TL+ M+ GGIHDHVG GF RY+ D WHVPHFEKMLYDQ
Sbjct: 240 ARQPN---ADLARLCLHMCVYTLKKMSYGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQ 296
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
QL Y DA+ TK+ +++ I DI Y+ RD+ G +SAEDADS T A+ KKE
Sbjct: 297 AQLMKSYADAYLATKNNYFAEIVNDIATYVIRDLRHKEGGFYSAEDADSYPTYDASAKKE 356
Query: 315 GAFYVWTSKEVEDILGEHAIL-----------FKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
GAFY+WT+ E++ +L + +L F H+ +K GN + DPH E +GK
Sbjct: 357 GAFYIWTAIEIKSLLNKELLLSNEKHIKLSDIFCHHFNIKELGN--IKSYQDPHGELEGK 414
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
NVLI N+ +A +P+E+ L E L+ RS RPRPHLDDK+I +WNGL+IS
Sbjct: 415 NVLIMYNEIEETAKHFNLPVEEVKMHLMEACSILYKARSTRPRPHLDDKIITAWNGLMIS 474
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS---- 479
A F + K+Y++ A A FI+R+L+D+ + L HS
Sbjct: 475 GLA----------------FGGTAVNNKQYVKYAVDAIKFIKRYLFDKTKNILLHSCYRD 518
Query: 480 ----FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
+ PGFLDDYAF+I GLLDLYE +WL +A +LQ+ QD+ F D GG
Sbjct: 519 EKNIITQMSTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQDLQDQFFWDETNGG 578
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
YF+TT DPS++LR+KE +DGAEPSGNS++ NL+RLA + S+ ++ A F
Sbjct: 579 YFSTTSNDPSIILRLKEAYDGAEPSGNSIAAENLLRLADYLGRSE---FKDKAVRLFGTF 635
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENMLAAAHASYDLNK 650
L +++P ++S R H + +VG +++ D +++L+ + +
Sbjct: 636 RHLLIKRPVSIP------QLVSALIRYHDDATQIYVVGKRNAKDTDDLLSVIYKRLIPGR 689
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ ID T + F + + N N + +C++ +CS PVT+ L LL
Sbjct: 690 ILFLIDHDKTNSILFRKNEHFRNMKPVNN-----QTTVYICKHCTCSLPVTNSEQLAILL 744
Query: 711 LE 712
E
Sbjct: 745 DE 746
>gi|171910219|ref|ZP_02925689.1| hypothetical protein VspiD_03585 [Verrucomicrobium spinosum DSM
4136]
Length = 723
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 288/693 (41%), Positives = 398/693 (57%), Gaps = 34/693 (4%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E A++LN+ F+SIKVDREERPDVD YMTY QA+ GGGGWPL+
Sbjct: 61 STCHWCHVMERESFENEETAQVLNEHFISIKVDREERPDVDLTYMTYAQAVSGGGGWPLN 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
V+L+P+LKP GTYFPPED+ GR GF+ + K+ + W D + ++ +SGA AI++L E
Sbjct: 121 VWLTPELKPFFAGTYFPPEDRGGRMGFRALCLKIAEVWKDDRAGVMERSGA-AIQKLQEY 179
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + P + ++ + +S ++D GGF APKFPRPV + ++ K L
Sbjct: 180 IEDEQKHHDAPFDA---VMKKAYDDVSNAFDYHEGGFSGAPKFPRPVTLNLLGRLKKHLA 236
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ E++ M TL CMA GGI DHVGGGFHRYSVD WHVPH+EKMLYDQ QL
Sbjct: 237 LKKEESESNWAVAMGKTTLTCMANGGIRDHVGGGFHRYSVDGYWHVPHYEKMLYDQAQLL 296
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y++ T ++ I R+I++Y++RD+ P G +SAEDADS + T K EGAFY
Sbjct: 297 TAYVEGHQHTGLKSFAAIAREIVEYVKRDLRHPEGAFYSAEDADSYTDDTRTTKGEGAFY 356
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VW + E++++LG E +F+ Y + GN SDPH E KG N L +A
Sbjct: 357 VWKAAEIDELLGKEEGSIFRYAYGARRDGNARPE--SDPHEELKGLNTLFRAYSPKKTAE 414
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ +K IL R+ LF+ R KRP PHLDDKV+ +WNGL+IS ARA+ L
Sbjct: 415 YFKLEEDKVAEILERGRKVLFEAREKRPHPHLDDKVLTAWNGLMISGLARAAGAL----- 469
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ ++E+A +A FI HL D+ ++ L+ S+R G S GF DYA L
Sbjct: 470 -----------NEPSFLELATQSAQFIYDHLSDKGSN-LRRSWREGVSTVHGFASDYALL 517
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I GLLDLYE G KWL WA LQ + + D E GGYF+ + P+ +L+VKED+D A
Sbjct: 518 IQGLLDLYEAGFDVKWLQWAAALQEEFETKYGDPEKGGYFSVSKAIPNSVLQVKEDYDSA 577
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
EPS NSV+ +NL RLA ++A + R+ L +F L++ VP M A D S
Sbjct: 578 EPSPNSVAAMNLFRLARMLA---REDLRERGAKVLRLFGKSLEESPFTVPAMVAALD-FS 633
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+VL G K F+ + A + Y + ++H D + + N ++
Sbjct: 634 HYGEVEIVLAGSKDDAGFQTLATAVRSRYLPHAVLLHADGGAGQAF-----LATRNEALG 688
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N + A VC+N C PVT +L+ +L
Sbjct: 689 AMNPVNGQAAAYVCRNRVCQSPVTTVEALKGIL 721
>gi|328781619|ref|XP_393124.4| PREDICTED: spermatogenesis-associated protein 20 [Apis mellifera]
Length = 804
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 283/716 (39%), Positives = 413/716 (57%), Gaps = 53/716 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCH+ME ESF+++ +A ++N F++IKVD+EERPD+D++YMT+VQA G GGWP+S
Sbjct: 120 STCHWCHIMEKESFKNKEIAIIMNKNFINIKVDKEERPDIDRIYMTFVQATTGHGGWPMS 179
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP+ GGTYFPPED + GFKTIL + W++ + + ++G+ +E L + +
Sbjct: 180 VFLTPDLKPIFGGTYFPPEDTSRQTGFKTILLSIAQKWNQSKTKINEAGSTNLEIL-QNI 238
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHS 194
S ++KL D ++C +QL ++ +FGGFGS +PKFP+PV L+H
Sbjct: 239 SKIPHTSKLHDIPSLECSKICIQQLENEFEPKFGGFGSTYNMQSPKFPQPVNFN-FLFHM 297
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ G A M ++TL+ M+ GGIHDHVG GF RY+ D WHVPHFEKMLYDQ
Sbjct: 298 YARQPNGDL--ARLCLHMCVYTLKKMSYGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQ 355
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
QL Y DA+ TK+ +++ I DI Y+ RD+ G +SAEDADS T A+ KKE
Sbjct: 356 AQLMKSYADAYLATKNNYFAEIVNDIATYVIRDLRHKEGGFYSAEDADSYPTYDASAKKE 415
Query: 315 GAFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
GAFYVWT+ E++ +L + + +F H+ +K GN + DPH E +GKNV
Sbjct: 416 GAFYVWTAMEIKSLLNKELSDEKHIKLSDVFCHHFNIKELGN--IKSYQDPHGELEGKNV 473
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
LI N+ +A +P+E+ L E L+ RS RPRPHLDDK+I +WNGL+IS
Sbjct: 474 LIMYNEIEETAKHFNLPVEEMKMHLMEACSILYKARSTRPRPHLDDKIITAWNGLMISGL 533
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS------ 479
A F + K+Y+E A A FI+R+L+D+ + L HS
Sbjct: 534 A----------------FGGTAVNNKQYIEYAVDAIKFIKRYLFDKTKNILLHSCYRDEK 577
Query: 480 --FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+ PGFLDDYAF+I GLLDLYE +WL +A +LQ+ QD+ F D GYF
Sbjct: 578 NIITQMSTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQDLQDQFFWDETNAGYF 637
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+TT D S++LR+KE +DGAEPSGNS++ NL+RLA + S+ + A F
Sbjct: 638 STTSNDLSIILRLKEAYDGAEPSGNSIAAENLLRLADYLGRSE---LKDKAVRLFGTFRH 694
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
L +++P + A + + +VG +++ D +++L+ + + + ID
Sbjct: 695 LLIKRPVSIPQLVSAL-IRYHDDTTQIYVVGKRNAKDTDDLLSVIYKRLIPGRILFLIDH 753
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
T + F + + N + N + +C++ +CS PVT+ L LL E+
Sbjct: 754 DKTNSILFRKNEHFRNMKLVNN-----RTTVYICKHCTCSLPVTNSEQLAILLDEQ 804
>gi|328874248|gb|EGG22614.1| DUF255 family protein [Dictyostelium fasciculatum]
Length = 815
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 299/725 (41%), Positives = 423/725 (58%), Gaps = 58/725 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVME ESFE+ +A+++N+ FV+IKVDREERPD
Sbjct: 129 GTEAFEEAKKQDKLIFLSVGYSTCHWCHVMERESFENPDIARIMNELFVNIKVDREERPD 188
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+DK+YMTY+ ++G GGWP+SV+L+PDL PL GGTYF + +GRPGF +++ + W
Sbjct: 189 IDKLYMTYITEVFGHGGWPMSVWLTPDLAPLTGGTYFSSKASHGRPGFGVRCQQIANIWK 248
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K ++M GA I+ L E S N + L + C ++K +DS +GGF A
Sbjct: 249 KDKEMAISRGASFIDYLKE--SKPKGDNNVA--LSNATITKCTGMITKQFDSVYGGFSDA 304
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFPR +Y+ +L G +SE + + FTL MA GGIHDH+GGGFHRYSV
Sbjct: 305 PKFPR-----CSVYN--ELNVCG----SSEDLEQLDFTLLKMACGGIHDHLGGGFHRYSV 353
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
E W VPHFEKMLYDQGQ+ANVY+DA+ TK+ + + DIL Y++RD+ G +SA
Sbjct: 354 TEDWRVPHFEKMLYDQGQIANVYIDAYLRTKNPLFRQVVYDILHYVQRDLTDSQGGFYSA 413
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE--HAILFKEHYYLKPTGNCDLSRMSDP 356
EDADS E K+EGAFYVWT +E+E +LG + + +KP+GN D S SDP
Sbjct: 414 EDADSLNKE-TNEKQEGAFYVWTLQEIEKLLGSALDTEVVAYMFDVKPSGNVDPS--SDP 470
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS-KRPRPHLDDKVIV 415
H E GKN+L +++ + +ASK EK I+ ++ L++ R+ R RPHLDDK+I
Sbjct: 471 HGELTGKNILHKVHTTEETASKFNHTPEKIEEIVERSKKILYEYRTNNRVRPHLDDKIIT 530
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTH 474
+WNGL+IS+FARA ++ KE++ A+ A FI+ +LY E
Sbjct: 531 AWNGLMISAFARAYQVF----------------GEKEFLVSAQRAVEFIQSGNLYQESNQ 574
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
L ++R+GPS GF DDYAFLI LLDLYE L WA++LQ Q ELF D + G
Sbjct: 575 ILIRNYRHGPSNVEGFSDDYAFLIQALLDLYEASFDESHLRWALQLQKKQIELFWDEKEG 634
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
G+F T G DP++L R KE+HDGAEPS SVS NL+RL++++ D + + A+ ++
Sbjct: 635 GFFTTNGRDPTLLSRQKEEHDGAEPSAQSVSSCNLLRLSNML---HLDEFEERAQKTMEG 691
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG-------HKSSVDFENMLAAAHASYD 647
L+ + +P M CA L P + + +VG H S+ + ++ H
Sbjct: 692 SSIYLEKAPLVMPQMVCALKYLIDPFYQ-ITVVGSLDPSSKHYSTT--QELVNVIHQKPI 748
Query: 648 LNKTVIHID-PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFS-CSPPVTDPIS 705
NK ++ +D AD ++ F + ++S+A+ S D+ VC N C P+ S
Sbjct: 749 PNKVLLFVDIDADMDKSIF--KQVDPDSSVAKYTLSNDQPTVYVCSNEEGCYAPINTIDS 806
Query: 706 LENLL 710
+ N L
Sbjct: 807 INNQL 811
>gi|226533705|ref|NP_001152785.1| spermatogenesis-associated protein 20 [Sus scrofa]
gi|226354712|gb|ACO50965.1| spermatogenesis associated 20 [Sus scrofa]
Length = 789
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 298/736 (40%), Positives = 415/736 (56%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP+SV+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ + L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 210 QNKKTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QL Y AF ++ D FYS + + IL Y+ R++ G
Sbjct: 321 HRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSDVAKGILQYVARNLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLK 343
+SAEDADS G R KEGAFY+WT KEV+ +L EH L +HY L
Sbjct: 381 GFYSAEDADSPPGRG-MRPKEGAFYLWTVKEVQQLLPEHVPGATEPLTSGQLLMKHYGLT 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ E +L KLF R
Sbjct: 440 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDAEAVQTLLNTGLEKLFQARKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S FA +L E + N+ + G A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RLINYAING-------------AKF 541
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DY F++ GLLDLYE + WL
Sbjct: 542 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYTFVVRGLLDLYEASQESAWLE 601
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+ QD LF D GGGYF + E + L LR+K+D DGAEPS N VS NL+RL
Sbjct: 602 WALRLQDMQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANFVSAHNLLRLHG 661
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKD 717
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F ++ R D+ A VC+N
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLGTLRRLE---DRATAYVCENQ 771
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L LL
Sbjct: 772 ACSMPITEPCELRKLL 787
>gi|395826687|ref|XP_003786547.1| PREDICTED: spermatogenesis-associated protein 20 [Otolemur
garnettii]
Length = 752
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 295/738 (39%), Positives = 421/738 (57%), Gaps = 69/738 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ F+S+KVDREERPD
Sbjct: 53 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFISVKVDREERPD 112
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 113 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 172
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 173 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 228
Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + ++ + +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 229 AEAPKFPTPVILNFLFFYWLNHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 283
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D F+S + + IL Y+ R + G
Sbjct: 284 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSHAFQISGDEFFSDVAKGILQYVSRSLTHRFG 343
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+ AEDADS G R KEGAFYVWT KEV+ +L E L +HY L
Sbjct: 344 GFYCAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEPIPGATEPLTSGQLLMKHYGLT 402
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN LS+ DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 403 EAGNIGLSQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 460
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD+K++ +WNGL++S +A +L E + + A S A F
Sbjct: 461 RPKPHLDNKMLAAWNGLMVSGYAVTGAVLGIE----------------KLINCATSGAKF 504
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D T RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 505 LKRHMFDVATGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 564
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 565 WALRLQDTQDRLFWDCQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 624
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
+ L F R++ + +A+P M LS + K +V+ G + +
Sbjct: 625 FTGHRD---WMDKCVCLLTAFSERMRRVPVALPEM---VRTLSAHQQTLKQIVICGDRQA 678
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
D + ++ H+ Y NK +I +D + F +++ R D+ A V +
Sbjct: 679 KDTKALVQCVHSMYIPNKVLIL---SDGDPSSFMSRQLPFLSTLRRLE---DRATAYVYE 732
Query: 693 NFSCSPPVTDPISLENLL 710
N +CS P+T+P L LL
Sbjct: 733 NQACSMPITEPCELRKLL 750
>gi|189500022|ref|YP_001959492.1| hypothetical protein Cphamn1_1072 [Chlorobium phaeobacteroides BS1]
gi|189495463|gb|ACE04011.1| protein of unknown function DUF255 [Chlorobium phaeobacteroides
BS1]
Length = 712
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 287/703 (40%), Positives = 404/703 (57%), Gaps = 56/703 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE++ +A+LLN FV +KVDREERPD+D++YMTYVQA G GGWP+S
Sbjct: 54 STCHWCHVMERESFENDRIAELLNRAFVPVKVDREERPDIDRLYMTYVQATTGSGGWPMS 113
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDLKP GG+YFPPED+YG+PGF ++L ++ AW + R+ + EQL EAL
Sbjct: 114 VWLTPDLKPFFGGSYFPPEDRYGKPGFHSLLLSIERAWKEDRNRFLSAAEGMTEQL-EAL 172
Query: 140 SASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S P+ +P + A+ + +D GGFG+APKFP+P ++ +L +S
Sbjct: 173 SLQK-----PETVPLDEQVFHHAAKTFAGMFDKEDGGFGNAPKFPQPSILEFLLAYSYF- 226
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKML 251
TG E ++MVL +L+ MA GGIHDH+ GGGF RYS D RWHVPHFEKML
Sbjct: 227 --TGN----QEAKEMVLLSLRKMASGGIHDHLGIKNLGGGGFARYSTDVRWHVPHFEKML 280
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD QLA V +A+ +T + Y+ + DIL+Y+ DM G +SAEDADS +
Sbjct: 281 YDNAQLAVVATEAYQITGENLYANLADDILNYVLCDMTDNKGGFYSAEDADSFPNSKSKA 340
Query: 312 KKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
KKEGAFY W+ +E+ L +F Y ++ GN + DPH EF G+N+L N
Sbjct: 341 KKEGAFYTWSIQEITAKLDPLETDIFCFIYGVESDGNA----LDDPHLEFTGRNILFARN 396
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
D A+A++ MP E I + R KLF R+ RPRPHLDDK++ SWNGL+IS+ ++AS
Sbjct: 397 DIEAAAAQFSMPSEIIREITDDAREKLFHSRNDRPRPHLDDKILTSWNGLMISALSKASC 456
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+L+S+ Y++ A AA FI +LY RL +R+G + G
Sbjct: 457 VLRSQ----------------NYLDAALKAAEFILNNLYSTTDGRLLRRYRSGQAGIGGK 500
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
DDY+F I GLLDLYE S ++L A++L Q ELF D + GG+FN +D SV +R+
Sbjct: 501 ADDYSFFIQGLLDLYEASSEHRYLSNAVKLMEKQIELFFDDKSGGFFNAASDDSSVPIRM 560
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
KED+DGAEPS NS++ +L RLA ++ D +R+ A+ ++A F LK+ +P +
Sbjct: 561 KEDYDGAEPSPNSINTFSLYRLADMM---DRDDFREIADKTIAYFSKSLKENGRQLPCLL 617
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A ML + V+L G + + +N+ Y + +IH + E DF
Sbjct: 618 KTA-MLPFYGTRQVILTGERHNETMKNLENTLGEMYLPDMFIIHASGNNAENTDF----- 671
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
+ + + A VC N +C+ P L + K
Sbjct: 672 -----LKKITLKSTGNAAYVCSNQTCNLPAYSAKELRKIFSAK 709
>gi|348562581|ref|XP_003467088.1| PREDICTED: spermatogenesis-associated protein 20-like [Cavia
porcellus]
Length = 789
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 299/740 (40%), Positives = 422/740 (57%), Gaps = 69/740 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME E+F++E +A+LLN+ FVS+KVDREERPD
Sbjct: 90 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEETFQNEEIARLLNEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L S ++++ AL A + + ++P A + C +QL + YD +GGF
Sbjct: 210 QNKNTLLDSS----QRVTTALLARSEISMGDRQMPPTAATMSSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + ++ G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILSFLFSYWLGHRMAQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +W VPHFEKMLYDQGQLA Y AF ++ D FYS + + IL Y+ R + G
Sbjct: 321 HRYSTDRQWQVPHFEKMLYDQGQLAVSYSQAFQISGDEFYSDVAKGILQYVSRSLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADS G R KEGAFYVWT KEV+ +L E L +HY L
Sbjct: 381 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQRLLPEAVPGATEPLTAGQLLIKHYGLT 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
TGN + + D E G+NVL +A++ G+ +E ++L KL R +
Sbjct: 440 ETGNINTCQ--DSKGELHGQNVLTVRYSLELTAARFGLEVEAVRSLLTAGVDKLLQARKQ 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G D+ + A + A F
Sbjct: 498 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GIDK--LVHSATNCAKF 541
Query: 464 IRRHLYDEQTHRLQHSFRNGPSKAP--------GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D T RL+ + G GFL+DYAF++ GLLDLYE + WL
Sbjct: 542 LKRHMFDVATGRLRRTCYAGTGTTVEHRDPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 601
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+ QD LF D +GGGYF + E S+ LRVK+D DGAEPS NSV+ NL+RL
Sbjct: 602 WALRLQDAQDRLFWDSQGGGYFCSEAELGGSLPLRVKDDQDGAEPSANSVAAHNLLRLHG 661
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
D+ + A L F R++ + +A+P M A LS + K +V+ G +++
Sbjct: 662 FTG--HKDWLDKCA-CLLTAFSERMRRVPVALPEMVRA---LSAHQQGLKQIVICGERTA 715
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
D +L HA Y NK +I AD + F +++ R D+ A V +
Sbjct: 716 KDTRALLQCVHALYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DRATAYVYE 769
Query: 693 NFSCSPPVTDPISLENLLLE 712
N +CS P+T+P L+ LLL+
Sbjct: 770 NQACSMPITEPCELQKLLLQ 789
>gi|281208328|gb|EFA82504.1| DUF255 family protein [Polysphondylium pallidum PN500]
Length = 863
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 280/697 (40%), Positives = 404/697 (57%), Gaps = 41/697 (5%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + + FL +TCHWCHVME ESFEDE +AK++ND FV+IKVDREERPD
Sbjct: 140 GQEAFDAAKQQDKLIFLSVGYSTCHWCHVMERESFEDETIAKVMNDLFVNIKVDREERPD 199
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+DK+YMTY+ G GGWP+SV+L+PDL+P+ GGTYFPP KYGR GF I +K+ W
Sbjct: 200 IDKIYMTYITETSGSGGWPMSVWLTPDLRPITGGTYFPPTTKYGRGGFPDICKKISTMWK 259
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
R + +SGA I L E NK + + L+ C ++ K +D FGGF A
Sbjct: 260 DDRKRVLESGASFITYLKE---EKPKGNK-DAAISFDTLKTCHSEIVKRFDPEFGGFSEA 315
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFPR L + E+ + + FTL+ M++GGI+DH+ GGFHRYSV
Sbjct: 316 PKFPRTSIFNF-------LHRVHRRFESDNTLEKLHFTLEKMSRGGIYDHLAGGFHRYSV 368
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
E W VPHFEKMLYDQGQ+ +VYLDA+ ++K+ + + +++Y+ RD+ G +SA
Sbjct: 369 TEDWKVPHFEKMLYDQGQIVSVYLDAYQISKNEHFKDVATGVIEYVLRDLTHVDGGFYSA 428
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDP 356
EDADS + +G K EGAFYVW E++ + E + L F + + P GN +S DP
Sbjct: 429 EDADSLDDKG--EKTEGAFYVWDYSEIKKAVPEESDLEIFNFIFGISPNGN--VSASEDP 484
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
H EF KN++++ + ++KL +P+E+ + + + L +R+KR RPHLDDK+I S
Sbjct: 485 HGEFLDKNIIMQFHTFEECSNKLNIPVEQVKQSIEKSKVSLLKLRAKRARPHLDDKIITS 544
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WN L+IS+ +++ F ++G R Y+E A+ + FI+ +LY+ + L
Sbjct: 545 WNALMISALSKS--------------FQLLGEQR--YLEAAKKSVHFIKTNLYNAEKQTL 588
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
++R GPSK GF DDYAFLI LLDLYE +L WA+ELQ QD+LF D+EG GY
Sbjct: 589 IRNYREGPSKVEGFTDDYAFLIQALLDLYECCFDIAYLEWAVELQAKQDKLFWDKEGHGY 648
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F+++G D S+L R+KE+HDGAEPS SV+ NL+R+ +++ D Y NA L
Sbjct: 649 FSSSGLDSSILSRLKEEHDGAEPSCQSVACNNLIRIGNML---HDDDYTDNALLLLESVS 705
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
L + P M + P+ KSS + ++L H Y NK ++ D
Sbjct: 706 LYLHRAPIVFPQMVVSLANHLEPTYT-FSFAADKSSAELRSLLDTIHTFYMPNKVLLLKD 764
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQN 693
++M F+ E + +A + + DK +C +
Sbjct: 765 TEHPQDMTFFSELD-QHAILLKYTKLYDKPTLYICSD 800
>gi|344252175|gb|EGW08279.1| Spermatogenesis-associated protein 20 [Cricetulus griseus]
Length = 1263
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 296/736 (40%), Positives = 420/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPD
Sbjct: 564 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPD 623
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+++P L+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 624 VDKVYMTFVQATSSGGGWPMNVWMTPSLQPFVGGTYFPPEDGLTRVGFRTVLTRIRDQWK 683
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + ++P +A + C +QL + YD +GGF
Sbjct: 684 QNKNTLLENS----QRVTTALLARSEISVGDRQVPPSAATMNTRCFQQLDEGYDEEYGGF 739
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 740 AEAPKFPTPVILNFLFSYWLSHRLAQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 794
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA VY AF ++ D FYS + + IL Y+ R + G
Sbjct: 795 HRYSTDRQWHIPHFEKMLYDQAQLAVVYSQAFQISGDEFYSDVAKGILQYVTRSLSHRSG 854
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADSA G + KEGAFYVWT +E++ +L E L +HY L
Sbjct: 855 GFYSAEDADSAPERG-MKPKEGAFYVWTVQEIQQLLPEPVGGASEPLTSGQLLMKHYGLS 913
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + ++ DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 914 EAGNINSNQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVSTLLNTGLEKLFQARKH 971
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+ HLD K++ +WNGL++S FA +L G D+ + A + A F
Sbjct: 972 RPKAHLDSKMLAAWNGLMVSGFAVTGAVL--------------GMDK--LVTQATNGAKF 1015
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL+ + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 1016 LKRHMFDVASGRLKRTCYAGTGGSVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 1075
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D GGGYF + E S L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 1076 WALRLQDTQDRLFWDSRGGGYFCSEAELGSDLPLRLKDDQDGAEPSANSVSAHNLLRLHG 1135
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G D
Sbjct: 1136 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQETLKQIVICGDPQGKD 1191
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F +++ R D+ A + +N
Sbjct: 1192 TKALLQCVHSIYLPNKVLIL---ADGDPSSFLSRQLPFLSNLRR---VEDRATAYIFENQ 1245
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L LL
Sbjct: 1246 ACSMPITEPCELRKLL 1261
>gi|426237729|ref|XP_004012810.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
20 [Ovis aries]
Length = 795
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 299/739 (40%), Positives = 412/739 (55%), Gaps = 67/739 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 92 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 151
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP+SV+L+P+L+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 152 VDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLMRIRDQWK 211
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ + L ++ L A SA + ++ P+ + C +QL + YD +GGF A
Sbjct: 212 QNKSTLLENSQRVTTALL-ARSAISMGDRQXSAAPRPS--RCFQQLDEGYDEEYGGFAEA 268
Query: 179 PKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
PKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GFHRY
Sbjct: 269 PKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRY 323
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
S D +WHVPHFEKMLYDQ QL Y AF ++ D FYS + + IL Y+ R++ G +
Sbjct: 324 STDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFYSEVAKGILQYVARNLSHRSGGFY 383
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTG 346
SAEDADS G R KEGAFYVWT KEV+ +L E + L +HY L G
Sbjct: 384 SAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEPVLGATEPLTSGQLLMKHYGLTEAG 442
Query: 347 NCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPR 406
N +S DP E +G+NVL +A++ G+ +E +L KLF R RP+
Sbjct: 443 N--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPK 500
Query: 407 PHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR 466
PHLD K++ +WNGL++S FA +L E + A + A F++R
Sbjct: 501 PHLDSKMLAAWNGLMVSGFAVTGAVLGQE----------------RVVSYAINGAKFLKR 544
Query: 467 HLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAI 518
H++D + RL + G S P GFL+DYAF++ GLLDLYE + WL WA+
Sbjct: 545 HMFDVASGRLMRTCYAGAGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWAL 604
Query: 519 ELQNTQDELFLDREGGGYFNTTGEDPSVL-------LRVKEDHDGAEPSGNSVSVINLVR 571
LQ+TQD LF D GGGYF + E + L LR+++D DGAEPS NSVS NL+R
Sbjct: 605 RLQDTQDRLFWDSRGGGYFCSEAELGAGLPWGGGLPLRLEDDQDGAEPSANSVSAHNLLR 664
Query: 572 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 631
L G K + L F R++ + +A+P M A + K +V+ G
Sbjct: 665 LHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQ 720
Query: 632 SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVC 691
+ D + +L H+ Y NK +I AD + F ++ R D+ A VC
Sbjct: 721 AKDTKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRRIE---DRATAYVC 774
Query: 692 QNFSCSPPVTDPISLENLL 710
+N +CS P+T+P L LL
Sbjct: 775 ENQACSMPITEPCELRKLL 793
>gi|354478455|ref|XP_003501430.1| PREDICTED: spermatogenesis-associated protein 20 [Cricetulus
griseus]
Length = 789
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 296/736 (40%), Positives = 420/736 (57%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPD
Sbjct: 90 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+++P L+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWMTPSLQPFVGGTYFPPEDGLTRVGFRTVLTRIRDQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + ++P +A + C +QL + YD +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISVGDRQVPPSAATMNTRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRLAQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA VY AF ++ D FYS + + IL Y+ R + G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLAVVYSQAFQISGDEFYSDVAKGILQYVTRSLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADSA G + KEGAFYVWT +E++ +L E L +HY L
Sbjct: 381 GFYSAEDADSAPERG-MKPKEGAFYVWTVQEIQQLLPEPVGGASEPLTSGQLLMKHYGLS 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + ++ DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 440 EAGNINSNQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVSTLLNTGLEKLFQARKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+ HLD K++ +WNGL++S FA +L G D+ + A + A F
Sbjct: 498 RPKAHLDSKMLAAWNGLMVSGFAVTGAVL--------------GMDK--LVTQATNGAKF 541
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL+ + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 542 LKRHMFDVASGRLKRTCYAGTGGSVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 601
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D GGGYF + E S L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 602 WALRLQDTQDRLFWDSRGGGYFCSEAELGSDLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G D
Sbjct: 662 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQETLKQIVICGDPQGKD 717
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F +++ R D+ A + +N
Sbjct: 718 TKALLQCVHSIYLPNKVLIL---ADGDPSSFLSRQLPFLSNLRR---VEDRATAYIFENQ 771
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L LL
Sbjct: 772 ACSMPITEPCELRKLL 787
>gi|301620517|ref|XP_002939623.1| PREDICTED: spermatogenesis-associated protein 20-like [Xenopus
(Silurana) tropicalis]
Length = 775
Score = 517 bits (1331), Expect = e-143, Method: Compositional matrix adjust.
Identities = 280/651 (43%), Positives = 385/651 (59%), Gaps = 56/651 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE + ++LN+ F+ +KVDREERPDVDKVYMT++QA GGGWP+S
Sbjct: 124 STCHWCHVMERESFEDEEIGRILNENFICVKVDREERPDVDKVYMTFLQATDSGGGWPMS 183
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL+P +GGTYFPPED R F+T+L ++ + W + R AF E+ L
Sbjct: 184 VWLTPDLRPFVGGTYFPPEDGVRRVSFRTVLLRIVEQWKENR-------AFLCERSERIL 236
Query: 140 SASASSNKL------PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--L 191
S SS+ + P LP +LC +QL + +D +GGFG PKFP PV + L
Sbjct: 237 SVLQSSSDIDGAAEPPPSLPVQ--KLCFQQLERIFDEEYGGFGEFPKFPTPVNFSFLFCL 294
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ K S E ++ M + TL+ M GGIHDH+G GFHRYS D+ WHVPHFEKML
Sbjct: 295 WALSK-----GSPEGTQALHMAVHTLKWMMYGGIHDHIGKGFHRYSTDQTWHVPHFEKML 349
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YDQGQLA Y +AF ++ +S DIL Y+ +++ G +SAEDADS +
Sbjct: 350 YDQGQLAVAYAEAFQISGKEIFSDAAHDILQYVLQNLSDDAGGFYSAEDADSLPNAQSKE 409
Query: 312 KKEGAFYVWTSKEVEDILGE--------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
KKEGAF WT+KE++ +L + +F HY +K GN S+ D H E +G+
Sbjct: 410 KKEGAFATWTAKEIQQLLPDMEEANGNTFGDIFMHHYGMKEEGNVSASQ--DIHGELQGQ 467
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
NVLI + +A+K G+ + + IL CR +L+ R RP P D ++ SWNGL++S
Sbjct: 468 NVLIVRSSLELTAAKFGLDVARVQTILSMCRDRLYKARRLRPPPQRDTNILASWNGLMLS 527
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
AR IL+ E EY+E A+ AASF+ ++YD ++ L SF G
Sbjct: 528 GLARCGVILRDE----------------EYIERAKLAASFLHENMYDLKSGILLRSFYKG 571
Query: 484 PSK----APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 539
PGFLDDYAF++ GLLDLYE +L WA++LQ+ QD+LF D +G GYF +
Sbjct: 572 HQPIADLVPGFLDDYAFMVRGLLDLYEACLDQFYLEWALQLQDRQDQLFWDAKGSGYFCS 631
Query: 540 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
D S+LLR+K+D DGAEPSGNSVSV+NL+RLA ++ + + + LA F RL
Sbjct: 632 DASDSSILLRLKDDQDGAEPSGNSVSVVNLLRLACYTGRTE---FTERSGQILAAFSERL 688
Query: 600 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ ++P M +M+ + K VV+ G K + +L AA + Y NK
Sbjct: 689 LKVPASLPEM-VRGNMIYHQTVKQVVVCGDKEDPNTRELLEAAQSMYVPNK 738
>gi|116487451|gb|AAI25719.1| LOC779596 protein [Xenopus (Silurana) tropicalis]
Length = 770
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 283/672 (42%), Positives = 392/672 (58%), Gaps = 59/672 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + + FL +TCHWCHVME ESFEDE + ++LN+ F+ +KVDREERPD
Sbjct: 97 GQEAFSRAAREMKPIFLSVGYSTCHWCHVMERESFEDEEIGRILNENFICVKVDREERPD 156
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT++QA GGGWP+SV+L+PDL+P +GGTYFPPED R F+T+L ++ + W
Sbjct: 157 VDKVYMTFLQATDSGGGWPMSVWLTPDLRPFVGGTYFPPEDGVRRVSFRTVLLRIVEQWK 216
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKL------PDELPQNALRLCAEQLSKSYDSRF 172
+ R AF E+ LS SS+ + P LP +LC +QL + +D +
Sbjct: 217 ENR-------AFLCERSERILSVLQSSSDIDGAAEPPPSLPVQ--KLCFQQLERIFDEEY 267
Query: 173 GGFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
GGFG PKFP PV + L+ K S E ++ M + TL+ M GGIHDH+G
Sbjct: 268 GGFGEFPKFPTPVNFSFLFCLWALSK-----GSPEGTQALHMAVHTLKWMMYGGIHDHIG 322
Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
GFHRYS D+ WHVPHFEKMLYDQ QLA Y +AF ++ +S DIL Y+ +++
Sbjct: 323 KGFHRYSTDQTWHVPHFEKMLYDQAQLAVAYAEAFQISGKEIFSDAAHDILQYVLQNLSD 382
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE--------HAILFKEHYYL 342
G +SAEDADS + KKEGAF WT+KE++ +L + +F HY +
Sbjct: 383 DAGGFYSAEDADSLPNAQSKEKKEGAFATWTAKEIQQLLPDMEEANGNTFGDIFMHHYGM 442
Query: 343 KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS 402
K GN S+ D H E +G+NVLI + +A+K G+ + + IL CR +L+ R
Sbjct: 443 KEEGNVSASQ--DIHGELQGQNVLIVRSSLELTAAKFGLDVARVQTILSMCRDRLYKARR 500
Query: 403 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 462
RP P D K++ SWNGL++S AR IL+ E Y+E A+ AAS
Sbjct: 501 LRPPPQRDTKILASWNGLMLSGLARCGVILRDEG----------------YIERAKLAAS 544
Query: 463 FIRRHLYDEQTHRLQHSFRNGPSK----APGFLDDYAFLISGLLDLYEFGSGTKWLVWAI 518
F+ ++YD ++ L SF G PGFLDDYAF++ GLLDLYE +L WA+
Sbjct: 545 FLHENMYDLKSGILLRSFYKGHQPIADLVPGFLDDYAFMVRGLLDLYEACLDQFYLEWAL 604
Query: 519 ELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
+LQ+ QD+LF D +G GYF + D S+LLR+K+D DGAEPSGNSVSV+NL+RLA
Sbjct: 605 QLQDRQDQLFWDAKGSGYFCSDASDSSILLRLKDDQDGAEPSGNSVSVVNLLRLACYTGR 664
Query: 579 SKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM 638
++ + + + LA F RL + ++P M +M+ + K VV+ G K + +
Sbjct: 665 TE---FTERSGQILAAFSERLLKVPASLPEM-VRGNMIYHQTVKQVVVCGDKEDPNTREL 720
Query: 639 LAAAHASYDLNK 650
L AA + Y NK
Sbjct: 721 LEAAQSMYVPNK 732
>gi|383859631|ref|XP_003705296.1| PREDICTED: spermatogenesis-associated protein 20 [Megachile
rotundata]
Length = 744
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 292/723 (40%), Positives = 406/723 (56%), Gaps = 68/723 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF ++ +A ++N FV+IKVD ERPD+DK+YM +VQA G GGWP+S
Sbjct: 60 STCHWCHVMEKESFTNKEIADIMNKHFVNIKVDNGERPDIDKIYMAFVQATTGHGGWPMS 119
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP+ GGTYFPPED + + GFKTIL + D W+ + + + G+ + L +
Sbjct: 120 VFLTPDLKPVFGGTYFPPEDTFRQTGFKTILLNIADKWNSLKTKITEVGSANFKTLKDIS 179
Query: 140 SASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGSA-----PKFPRPVEIQMM--L 191
+S K E+P +CA QL+ ++ FGGF S+ PKFP+PV + +
Sbjct: 180 KVPQTSKK--HEVPSLECSNVCALQLASEFEPEFGGFTSSFDMHTPKFPQPVIFNFLFHM 237
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
Y E+ KS M ++TL+ +A GGIHDH+G GF RY+ D +WHVPHFEKML
Sbjct: 238 YARHPNEELAKS-----CLHMCVYTLKKIAFGGIHDHIGQGFSRYATDGKWHVPHFEKML 292
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YDQGQL Y DA+ TKD +++ I DI Y+ RD+ G +SAEDADS T A
Sbjct: 293 YDQGQLMKSYADAYVTTKDNYFAEIVDDIAAYVIRDLRHQEGGFYSAEDADSYATSDAHE 352
Query: 312 KKEGAFYVWTSKEVEDILGEH--------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
K EGAFYVWT+ E++ +L + + +F H+ +K +GN + DP E GK
Sbjct: 353 KLEGAFYVWTAAEIKSLLDKKVSSENIKLSDIFCHHFNVKESGN--VKGYQDPRGELTGK 410
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
NVLI D +A +E+ N L + L++ R RPRPHLDDK+I SWNGL+IS
Sbjct: 411 NVLIVYEDIDDTAKHFNCTVEEIKNYLKDACSILYEARQARPRPHLDDKIITSWNGLMIS 470
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRN 482
A ++ D K+Y+E A AA FI+R+L+DE L HS +RN
Sbjct: 471 GLAYGGAVV----------------DNKQYIEYATDAAKFIKRYLFDEAKDILLHSCYRN 514
Query: 483 GPSKAP-------GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
+K GFLDDYAF+I GLLDLYE G +WL +A LQ+ QD+L D GG
Sbjct: 515 AENKITQINEPIHGFLDDYAFVIKGLLDLYEAGFDEQWLEFAERLQDIQDKLLWDETSGG 574
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
YF TT +DPS+++R+KE HDGAEPSGNS+S NL+RLA + S + F
Sbjct: 575 YFTTTSDDPSIIVRLKEAHDGAEPSGNSISAENLLRLAYYLGRSD---LKDKVVRLFGAF 631
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENMLAAAHASYDLNK 650
L +AVP ++S R H + +VG + + D +++L + +
Sbjct: 632 RHLLTQRPIAVP------QLVSALVRYHDDATQIYVVGKRGAKDTDDLLRVIYKRLIPGR 685
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ ID + + + + N D+ VC+ +CS PV++ LE LL
Sbjct: 686 ILMLIDHDEADSILLGKNERLRNMKPLN-----DQATVYVCKYRTCSLPVSNSKQLEKLL 740
Query: 711 LEK 713
E+
Sbjct: 741 DEQ 743
>gi|324505187|gb|ADY42236.1| Unknown [Ascaris suum]
Length = 775
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 299/740 (40%), Positives = 414/740 (55%), Gaps = 84/740 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F R FL +TCHWCHVM ESFE++ +A +LN+ FVSIKVDREERPD
Sbjct: 81 GDEAFTKAKTLNRLIFLSVGYSTCHWCHVMAHESFENQTIADILNENFVSIKVDREERPD 140
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YMT++QA+ GGGGWP+SVFL+PDL P+ GGTYFPPED+YGRPGF +ILR + + W
Sbjct: 141 VDKLYMTFIQAISGGGGWPMSVFLTPDLNPVTGGTYFPPEDRYGRPGFASILRTIAEKWQ 200
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ D + G FA L+ A+ + +N+ +N C +L+ +D + GFG A
Sbjct: 201 LEGDQIRGQG-FA---LANAIKKAFLTNRETVPADENVALTCYTELADRFDETYKGFGGA 256
Query: 179 PKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
PKFP+P E+ ML Y + K GK KMV TL+ MA+GGIHDH+G GFHRY
Sbjct: 257 PKFPKPAELDFMLSFYANNKSTTEGKL-----ALKMVGETLEAMARGGIHDHIGKGFHRY 311
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYIC-------RDILDYLRRDMI 289
+VD WHVPHFEKMLYDQ QL +VY + YS +C DI DY+ R++
Sbjct: 312 AVDAAWHVPHFEKMLYDQAQLLSVYAN---------YSLVCGQMKEIVEDIADYVYRNLT 362
Query: 290 GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG----------EHAILFKEH 339
P G +SA+DADS + A K+EGAFYVWT +E++D L + A FK++
Sbjct: 363 HPEGGFYSAQDADSLPSHNAKAKREGAFYVWTEQEIDDALKDVTVNGDSSVDVATYFKQY 422
Query: 340 YYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFD 399
+ +K GNC +DPH E K +NVL + SA KLG+ +K I+ + R+ L +
Sbjct: 423 FGVKANGNCPSD--TDPHGELKLQNVLAMKDSHKDSARKLGISEDKLTAIIEKARQVLVE 480
Query: 400 VRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAES 459
R++RP PHLD K++ SWNGL+IS +RAS V + + E A+
Sbjct: 481 ARAQRPEPHLDSKMLTSWNGLMISGLSRAS----------------VAAGKPELAGRAQK 524
Query: 460 AASFIRRHLYDEQTHRLQHSFRN---------GPSKAPGFLDDYAFLISGLLDLYEFGSG 510
FI++++ E L+ ++ + P KA F DDYAFLI GLLDLYE
Sbjct: 525 VVEFIKKYMLSENGELLRTAYTDESGGVVHNSKPVKA--FADDYAFLIEGLLDLYEVTFD 582
Query: 511 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 570
L +A ELQ DE F D + + + DPS++ R EDHDGAEP+ NSV+ +NLV
Sbjct: 583 ENLLKFASELQKQFDERFWDTDNNAGYFLSETDPSIMTRFMEDHDGAEPATNSVAALNLV 642
Query: 571 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 630
RLASI + +R + L RL+ +P M A S P+ VV++G +
Sbjct: 643 RLASIF---DEERFRDRVANILESVSLRLRRYPSVLPKMVTALMRHSRPA-TLVVVIGKR 698
Query: 631 SSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE-EHNSNNASMARNNFSADKVVAL 689
+ ML + N+++I +D D W E N + ++ R S K
Sbjct: 699 DDPLTQQMLDEIKRHFIPNQSLISLDATK----DLWLIEQNDHFGTLLR---STTKPAVF 751
Query: 690 VCQNFSCSPPVTDPISLENL 709
+C++F C+ P+T SL++L
Sbjct: 752 ICEHFKCNQPIT---SLDDL 768
>gi|307166116|gb|EFN60365.1| Spermatogenesis-associated protein 20 [Camponotus floridanus]
Length = 754
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 291/714 (40%), Positives = 410/714 (57%), Gaps = 56/714 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E +A+++N+ FV+IKVDREERPD+D++YMT+VQA G GGWP+S
Sbjct: 66 STCHWCHVMEKESFENEDIARIMNENFVNIKVDREERPDIDRIYMTFVQAKSGHGGWPMS 125
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFLSPDL P+ GGTYFPP+ KYG GFK++L V W +++ + +S A +E+L + +
Sbjct: 126 VFLSPDLMPVTGGTYFPPDGKYGLIGFKSLLLAVAKEWTQQKSNIIKSAANIVERLKDIV 185
Query: 140 SASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYH 193
K D P LC L+ Y+ +FGGF S +PKFP PV L+
Sbjct: 186 ECKQGLKK-DDGFPTAECALLCVHLLANGYEPKFGGFSSRSWMNSPKFPEPVNFN-FLFS 243
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+ L + S + +M L TL MA GGIHDHVG GF RYSVD WHVPHFEKMLYD
Sbjct: 244 TYALSTS--SELRKQCLEMCLHTLTKMAYGGIHDHVGQGFSRYSVDGEWHVPHFEKMLYD 301
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q Q+ Y DA+ +TKD FYS I DI Y+ RD+ G +SAEDADS A+ K+
Sbjct: 302 QAQIIQAYADAYVITKDSFYSDIVDDIATYVVRDLRHKEGGFYSAEDADSLPEPQASAKR 361
Query: 314 EGAFYVWTSKEVEDIL-----GEHAILFKE----HYYLKPTGNCDLSRMSDPHNEFKGKN 364
EGAFYVW KEV+ +L G + F + H+ +K GN + + DPH E GKN
Sbjct: 362 EGAFYVWPYKEVKTLLDKKIPGNDNVRFSDLICYHFNVKKEGN--VRKAQDPHGELTGKN 419
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
V I + +A G+ +E + + E + LF+ RSKRPRPHLDDK++ +WNGL+IS
Sbjct: 420 VFIVYDGIEQTAEHFGISVENTKSYIKEACQILFEERSKRPRPHLDDKIVTAWNGLMISG 479
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG- 483
FARA ++++ +Y+E+A AA F++++L+D+ L S G
Sbjct: 480 FARAGAAVRND----------------KYVELATDAAKFVKQYLFDKNKGVLLRSCYRGE 523
Query: 484 -----PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+ P GF DDYAF++ GLLDLYE +WL +A ELQ+ QD LF D + GGY
Sbjct: 524 DDRIMQTSVPIHGFHDDYAFVVKGLLDLYEANFDAQWLEFAEELQDIQDRLFWDSQDGGY 583
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F+T E+ ++LR+K+ HDGAEPS NS++ NL+RLA+ + S+ + A L+ F
Sbjct: 584 FSTV-ENSQMILRMKDAHDGAEPSSNSIACSNLLRLATYLDRSE---LKDKAGQLLSAFG 639
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
L +M + P + A +L + + + G + D ML + ++ D
Sbjct: 640 KGLTEMPIMFPQLTLA--LLEYHNATQIYIAGRPDAEDTIEMLNVIRERVIPGRVLLLAD 697
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
P + + NA +++ + LVC+ +CS P+T+P L + L
Sbjct: 698 PEQQDNVLL-----RKNAVVSKLKPQKGRATVLVCRRQACSIPITNPSELASQL 746
>gi|307213879|gb|EFN89140.1| Spermatogenesis-associated protein 20 [Harpegnathos saltator]
Length = 755
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 292/719 (40%), Positives = 411/719 (57%), Gaps = 59/719 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E +A ++ND F++IKVDREERPD+D++YMT+VQA G GGWP+S
Sbjct: 66 STCHWCHVMEKESFENEEIAHIMNDNFINIKVDREERPDIDRIYMTFVQAKSGHGGWPMS 125
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+L P+ GGTYFPP+D+YG GFK++L +V W ++++ + +SGA + +L + +
Sbjct: 126 VFLAPNLTPVTGGTYFPPDDRYGLIGFKSLLLEVAKKWAQQKNDIIKSGANIVSRLKDMV 185
Query: 140 SASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMM--L 191
S K D P LC L+ Y+ +FGGFGS APKFP PV + +
Sbjct: 186 ERRQSL-KEGDGFPTVECGFLCVHLLANGYEPKFGGFGSQFRMNAPKFPEPVNFNFLFSV 244
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
Y L + K E +M L TL MA GGIHDHVG GF RYSVD WHVPHFEKML
Sbjct: 245 YALSNLSELRK-----ECLEMCLHTLTKMAYGGIHDHVGQGFSRYSVDGEWHVPHFEKML 299
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YDQ Q+ Y DA+ +TKD FYS I DI Y+ RD+ G +SAEDADS ++
Sbjct: 300 YDQAQIIQAYADAYVITKDSFYSDIVDDIAKYVERDLRHKEGGFYSAEDADSLPESKSSA 359
Query: 312 KKEGAFYVWTSKEVEDIL-----GEHAILFKE----HYYLKPTGNCDLSRMSDPHNEFKG 362
K+EGAFYVWT EV+ +L G + + F + H+ +K GN + + DPH E G
Sbjct: 360 KREGAFYVWTYDEVKSLLNKKVPGRNNVRFFDLICYHFNVKKEGN--VRKAQDPHGELTG 417
Query: 363 KNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 422
KNVLI +A + LE + + LF RSKRPRPHLDDK++ +WNGL+I
Sbjct: 418 KNVLIAYEAVEKTAEHFNISLEDTKTYIKQACLILFKERSKRPRPHLDDKMVTAWNGLMI 477
Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF-- 480
S FARA +++ +Y+E+A AA F+ ++L+D+ L S
Sbjct: 478 SGFARAGAAVRNS----------------KYVELATDAAKFVEQYLFDKNKGTLLRSCYR 521
Query: 481 ----RNGPSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
R + P GF DDYAF++ GLLDLY+ WL A +LQ+TQDELF D + G
Sbjct: 522 EEDDRIIQTSVPIYGFHDDYAFVVKGLLDLYQANFDVHWLELAEQLQDTQDELFWDSQDG 581
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
GYF+T ED ++LR+K+ HDGAEPS NS++ NL+RLA+ + ++ ++ A L
Sbjct: 582 GYFSTV-EDSQMILRMKDAHDGAEPSSNSIACSNLLRLAAFLDRNE---LKEKAAQLLRA 637
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
F L ++ + P M A +L + ++G + D ML +
Sbjct: 638 FGKGLTEIPIMFPQMTLA--LLDYHYTTQIYIIGKSDAEDTNEMLNVVRERLIPGMVLSL 695
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
+D +++ + + N +++ + VC++ +CSPP T P L +LL +K
Sbjct: 696 VDHERSQDNVLFRK----NTIISKMKPQNGRATVFVCRHHTCSPPTTSPRELASLLDDK 750
>gi|242004841|ref|XP_002423285.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212506287|gb|EEB10547.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 774
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 296/738 (40%), Positives = 417/738 (56%), Gaps = 82/738 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVME ESFE+E +AK++N+ FV +KVDREERPD
Sbjct: 90 GNEAFSRAVKENKLIFLSVGYSTCHWCHVMEKESFENEEIAKIMNENFVCVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YM +VQ P+ GGTYFPP D + RPGFK++L + + W
Sbjct: 150 VDKLYMLFVQ-------------------PIFGGTYFPPSDFHERPGFKSVLLILAEQWR 190
Query: 119 KKRDMLAQSGAFAIEQLSEALSA-----SASSNKLPDELPQNALRLCAEQLSKSYDSRFG 173
+ R +++G ++ + ++ S + S+ PD + + C L KSY+ +G
Sbjct: 191 ENRQKFSENGRKIMDYIEQSSSLDNSILNPSAVNPPD---ISCIEKCYNSLFKSYEKNYG 247
Query: 174 GFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
GF APKFP V + + LY + + GK+ A M + TL+ MA GGIHDH+G
Sbjct: 248 GFSEAPKFPHLVNLNFLFHLYAREPKSERGKTALA-----MCIHTLKMMANGGIHDHIGK 302
Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
GF RYSVD +WHVPHFEKMLYDQGQLA Y A+ TK+ F+S + IL Y+ RD+ P
Sbjct: 303 GFSRYSVDNKWHVPHFEKMLYDQGQLAVSYATAYLTTKNQFFSEVLEGILSYVDRDLSHP 362
Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE---------HAILFKEHYYL 342
G +SAEDADS +T KKEGAFYVWT ++++ L + +A +F E++ +
Sbjct: 363 DGGFYSAEDADSLSAPDSTEKKEGAFYVWTYEDIKKHLPQKIPESSELTYADVFCEYFNV 422
Query: 343 KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS 402
K GN + S+ DPHNE K +NVLI + +A A+K + E+ IL E ++ LF++R+
Sbjct: 423 KANGNVNPSK--DPHNELKNQNVLIITDSEAAVAAKFNLSEERVKQILDESKKILFNLRA 480
Query: 403 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 462
KRPRPHLDDK++ SWNGL+IS +A+A ++L + Y++ A AA
Sbjct: 481 KRPRPHLDDKILTSWNGLMISGYAKAGQVLGNS----------------HYVQRAIGAAK 524
Query: 463 FIRRHLYDEQTHRL--------QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 514
FIR+HLY T L ++ + GFLDDYAFLI GLLDLYE W+
Sbjct: 525 FIRQHLYKNDTKTLLRSCYKSSDNTISQIATPINGFLDDYAFLIRGLLDLYEASFDPIWI 584
Query: 515 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA LQ TQD LF D G GYF++ D S+L+R+KEDHDGAEP GNSVSV NL+RL +
Sbjct: 585 EWAESLQETQDTLFWDEGGAGYFSSPSGDSSILVRMKEDHDGAEPCGNSVSVSNLLRLGA 644
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
+ ++ Y+ A LA F +RLK M + +P M A +L +++ G K+ D
Sbjct: 645 YLDKAE---YKDRAGKLLAAFTSRLKKMPVILPEMVSAL-LLYHDGPTQILITGKKTDPD 700
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+L + + N+ + ID D +E +++++ + S A VC +
Sbjct: 701 TAALLNVVQSRFIPNRILALID--DDKESILYKKNDIIRTIKPVHGHS----TAYVCHHH 754
Query: 695 SCSPPVTDPISLENLLLE 712
+CS P+ L LL E
Sbjct: 755 TCSLPINTREELAKLLDE 772
>gi|148683975|gb|EDL15922.1| spermatogenesis associated 20, isoform CRA_a [Mus musculus]
Length = 745
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 295/738 (39%), Positives = 418/738 (56%), Gaps = 69/738 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LLN+ F+ + VDREERPD
Sbjct: 46 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNENFICVMVDREERPD 105
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L ++ D W
Sbjct: 106 VDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 165
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
++ L ++ ++++ AL A + + ++P +A + C +QL + YD +GGF
Sbjct: 166 LNKNTLLENS----QRVTTALLARSEISVGDRQIPASAATMNSRCFQQLDEGYDEEYGGF 221
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 222 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIQDHVGQGF 276
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QL+ VY AF ++ D FY+ + + IL Y+ R + G
Sbjct: 277 HRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQISGDEFYADVAKGILQYVTRTLSHRSG 336
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G + +EGA+YVWT KEV+ +L E + L +HY L
Sbjct: 337 GFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQQLLPEPVVGASEPLTSGQLLMKHYGLS 395
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + S+ DP+ E G+NVL+ +A++ G+ +E +L KLF R
Sbjct: 396 EVGNINSSQ--DPNGELHGQNVLMVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 453
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+ HLD+K++ +WNGL++S FA L E A A S A F
Sbjct: 454 RPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEKLVAQ----------------ATSGAKF 497
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL+ + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 498 LKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 557
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD+LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL S
Sbjct: 558 WALRLQDTQDKLFWDPRGGGYFCSEAELGADLPLRLKDDQDGAEPSANSVSAHNLLRLHS 617
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
G K + L F R++ + +A+P M LS + K +V+ G +
Sbjct: 618 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEM---VRTLSAQQQTLKQIVICGDPQA 671
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
D + +L H+ Y NK +I AD + F +S+ R D+ + +
Sbjct: 672 KDTKALLQCVHSIYVPNKVLIL---ADGDPSSFLSRQLPFLSSLRR---VEDRATVYIFE 725
Query: 693 NFSCSPPVTDPISLENLL 710
N +CS P+TDP L LL
Sbjct: 726 NQACSMPITDPCELRKLL 743
>gi|351713578|gb|EHB16497.1| Spermatogenesis-associated protein 20, partial [Heterocephalus
glaber]
Length = 806
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 292/738 (39%), Positives = 417/738 (56%), Gaps = 69/738 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME E+F++E + +LL++ FVS+KVDREE+PD
Sbjct: 109 GQEAFGKARKENKPIFLSVGYSTCHWCHMMEEETFQNEEIGRLLSEDFVSVKVDREEQPD 168
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 169 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 228
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ + L +S ++++ AL A + + + P A + C +QL + YD +GGF
Sbjct: 229 QNKSTLLESS----QRVTTALLARSEISMGDRQAPPLAATMNSRCFQQLDEGYDEEYGGF 284
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 285 AEAPKFPIPVILSFLFSYWLGHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 339
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +W PHFEKMLYDQ QLA Y AF ++ D FYS I + IL Y+ R + G
Sbjct: 340 HRYSTDRQWQGPHFEKMLYDQAQLAVSYSQAFQISGDEFYSDIAKGILQYVDRSLSHRSG 399
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAED+DSA G + +EGAFY+WT +E++ +L E + L +HY L
Sbjct: 400 GFYSAEDSDSAPERG-MQPREGAFYMWTVRELQCLLPEPVVGASEPLTVGQLLTKHYGLT 458
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN L + DP E +G+NVL +A++ G+ +E +L KLF VR +
Sbjct: 459 EAGNVSLCQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVRGLLTSGLDKLFQVRKQ 516
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L E + A ++A F
Sbjct: 517 RPKPHLDSKMLTAWNGLMVSGYAVTGAVLGIE----------------RLVNRATNSAKF 560
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D T RL+ + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 561 LKRHMFDVATGRLKRTCYAGTGASVEHSTPPRWGFLEDYAFVVRGLLDLYEASQESAWLE 620
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D GGGYF + E P + LRVK+D DGAEPS NSV+ NL+RL
Sbjct: 621 WALRLQDTQDRLFWDSRGGGYFCSEAELGPGLPLRVKDDQDGAEPSANSVAAHNLLRLHG 680
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
++ + L F R++ + +A+P M LS + K +V+ G +
Sbjct: 681 F---TRHKDWLDKCVCLLTAFSERMRRVPVALPEM---VRTLSTHQQGLKQIVICGDAQA 734
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
D + +L H+ Y NK +I AD F +++ R D+ A VC+
Sbjct: 735 KDTKALLQCVHSLYIPNKVLIL---ADGGPSSFLSRQLPFLSTLRRLE---DRATAYVCE 788
Query: 693 NFSCSPPVTDPISLENLL 710
N +CS P+T+P L LL
Sbjct: 789 NQACSMPITEPCELRKLL 806
>gi|46485467|ref|NP_659076.2| spermatogenesis-associated protein 20 [Mus musculus]
gi|81912951|sp|Q80YT5.1|SPT20_MOUSE RecName: Full=Spermatogenesis-associated protein 20; AltName:
Full=Sperm-specific protein 411; Short=Ssp411; AltName:
Full=Transcript increased in spermiogenesis 78 protein
gi|29748049|gb|AAH50788.1| Spermatogenesis associated 20 [Mus musculus]
Length = 790
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 295/738 (39%), Positives = 418/738 (56%), Gaps = 69/738 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LLN+ F+ + VDREERPD
Sbjct: 91 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNENFICVMVDREERPD 150
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L ++ D W
Sbjct: 151 VDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 210
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
++ L ++ ++++ AL A + + ++P +A + C +QL + YD +GGF
Sbjct: 211 LNKNTLLENS----QRVTTALLARSEISVGDRQIPASAATMNSRCFQQLDEGYDEEYGGF 266
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 267 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIQDHVGQGF 321
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QL+ VY AF ++ D FY+ + + IL Y+ R + G
Sbjct: 322 HRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQISGDEFYADVAKGILQYVTRTLSHRSG 381
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G + +EGA+YVWT KEV+ +L E + L +HY L
Sbjct: 382 GFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQQLLPEPVVGASEPLTSGQLLMKHYGLS 440
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + S+ DP+ E G+NVL+ +A++ G+ +E +L KLF R
Sbjct: 441 EVGNINSSQ--DPNGELHGQNVLMVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 498
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+ HLD+K++ +WNGL++S FA L E A A S A F
Sbjct: 499 RPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEKLVAQ----------------ATSGAKF 542
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL+ + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 543 LKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 602
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD+LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL S
Sbjct: 603 WALRLQDTQDKLFWDPRGGGYFCSEAELGADLPLRLKDDQDGAEPSANSVSAHNLLRLHS 662
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
G K + L F R++ + +A+P M LS + K +V+ G +
Sbjct: 663 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEM---VRTLSAQQQTLKQIVICGDPQA 716
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
D + +L H+ Y NK +I AD + F +S+ R D+ + +
Sbjct: 717 KDTKALLQCVHSIYVPNKVLIL---ADGDPSSFLSRQLPFLSSLRR---VEDRATVYIFE 770
Query: 693 NFSCSPPVTDPISLENLL 710
N +CS P+TDP L LL
Sbjct: 771 NQACSMPITDPCELRKLL 788
>gi|194217119|ref|XP_001499729.2| PREDICTED: spermatogenesis-associated protein 20-like [Equus
caballus]
Length = 889
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 296/736 (40%), Positives = 417/736 (56%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPD
Sbjct: 190 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPD 249
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF T+L+++++ W
Sbjct: 250 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFHTVLQRIREQWK 309
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 310 QNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 365
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 366 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 420
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R++ G
Sbjct: 421 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVTRNLSHRSG 480
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADS G R KEGAFYVWT KEV+ +L E L +HY L
Sbjct: 481 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQQLLPEPVPGATEPLTSGQLLMKHYGLT 539
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E G+NVL +A++ G+ ++ +L KLF R
Sbjct: 540 EAGN--ISSNQDPKGELHGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKH 597
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L E + N+ + + A F
Sbjct: 598 RPKPHLDSKMLAAWNGLMVSGYAVTGAVLGLE---RLINYAI-------------NCAKF 641
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 642 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEATQESAWLE 701
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 702 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 761
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G +
Sbjct: 762 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKG 817
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F +++ R D+ A + +
Sbjct: 818 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLSTLRRLE---DRATAYIYGSQ 871
Query: 695 SCSPPVTDPISLENLL 710
CS PVT+P L LL
Sbjct: 872 VCSLPVTEPCELRKLL 887
>gi|148683976|gb|EDL15923.1| spermatogenesis associated 20, isoform CRA_b [Mus musculus]
Length = 796
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 295/738 (39%), Positives = 418/738 (56%), Gaps = 69/738 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LLN+ F+ + VDREERPD
Sbjct: 97 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNENFICVMVDREERPD 156
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L ++ D W
Sbjct: 157 VDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 216
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
++ L ++ ++++ AL A + + ++P +A + C +QL + YD +GGF
Sbjct: 217 LNKNTLLENS----QRVTTALLARSEISVGDRQIPASAATMNSRCFQQLDEGYDEEYGGF 272
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 273 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIQDHVGQGF 327
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QL+ VY AF ++ D FY+ + + IL Y+ R + G
Sbjct: 328 HRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQISGDEFYADVAKGILQYVTRTLSHRSG 387
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G + +EGA+YVWT KEV+ +L E + L +HY L
Sbjct: 388 GFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQQLLPEPVVGASEPLTSGQLLMKHYGLS 446
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + S+ DP+ E G+NVL+ +A++ G+ +E +L KLF R
Sbjct: 447 EVGNINSSQ--DPNGELHGQNVLMVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 504
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+ HLD+K++ +WNGL++S FA L E A A S A F
Sbjct: 505 RPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEKLVAQ----------------ATSGAKF 548
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL+ + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 549 LKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 608
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD+LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL S
Sbjct: 609 WALRLQDTQDKLFWDPRGGGYFCSEAELGADLPLRLKDDQDGAEPSANSVSAHNLLRLHS 668
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSS 632
G K + L F R++ + +A+P M LS + K +V+ G +
Sbjct: 669 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEM---VRTLSAQQQTLKQIVICGDPQA 722
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
D + +L H+ Y NK +I AD + F +S+ R D+ + +
Sbjct: 723 KDTKALLQCVHSIYVPNKVLIL---ADGDPSSFLSRQLPFLSSLRR---VEDRATVYIFE 776
Query: 693 NFSCSPPVTDPISLENLL 710
N +CS P+TDP L LL
Sbjct: 777 NQACSMPITDPCELRKLL 794
>gi|391227735|ref|ZP_10263942.1| thioredoxin domain containing protein [Opitutaceae bacterium TAV1]
gi|391223228|gb|EIQ01648.1| thioredoxin domain containing protein [Opitutaceae bacterium TAV1]
Length = 734
Score = 509 bits (1310), Expect = e-141, Method: Compositional matrix adjust.
Identities = 295/730 (40%), Positives = 402/730 (55%), Gaps = 44/730 (6%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F ++ FL +TCHWCHVM ESFE+E VA +LN FVSIKVDREERPD
Sbjct: 27 GEEAFARARAEQKPIFLSIGYSTCHWCHVMARESFENEAVAAVLNKHFVSIKVDREERPD 86
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW- 117
VDKVYM YVQA+ G GGWPLSV+L+PDLKP GGTYFPPED+ GR G ++L + W
Sbjct: 87 VDKVYMAYVQAMTGHGGWPLSVWLAPDLKPFYGGTYFPPEDRSGRSGLLSVLDVIARGWN 146
Query: 118 --DKKRDMLAQS--------GAFAIEQLSEALSASASSNKLPD--ELPQNALRLCAEQLS 165
D++R +A+S G +A +Q+ + +P E +A C QL
Sbjct: 147 DDDERRKFVAESSRVIDVLAGYYAGKQVR-----PDPATPMPPLYETGGDAFERCYLQLG 201
Query: 166 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGI 225
+S+DS GGFG APKFPR + + + ++G E M TL+ M GGI
Sbjct: 202 ESFDSTHGGFGGAPKFPRASNLDFLFRVAAIQGPETETGR--EAVSMAASTLRHMIAGGI 259
Query: 226 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLR 285
HDHVGGGFHRYSVD+ W VPHFEKMLYDQ Q+A LDA T D Y++ R LDY+
Sbjct: 260 HDHVGGGFHRYSVDDAWFVPHFEKMLYDQAQIAVNLLDAALFTGDERYAWAARATLDYVL 319
Query: 286 RDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKP 344
RD+ P G FSAEDAD+A GAT EGAFYVWT+ E+ L + A L + H + P
Sbjct: 320 RDLTHPDGGFFSAEDADAAPAHGATEHVEGAFYVWTAGELRRALSPDAARLVESHLGINP 379
Query: 345 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 404
++ DPH E +GKN+L ++ + +A+ LG+ L L +R+ R
Sbjct: 380 GPEGNVPPTLDPHGELRGKNILRQVRPLAETAAALGLEPAAAAERLAAALETLQAIRAAR 439
Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
PRPHLDDKVI +WNGL +S+FARA+ + + R Y++ A AA F+
Sbjct: 440 PRPHLDDKVITAWNGLALSAFARAATSPAA----------CLDDRRDRYLDAARRAARFV 489
Query: 465 RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
R L D L ++R + GF +DYA I+GLLDL++ WL A LQ T
Sbjct: 490 ERELCDAGRGVLYRAWRGERGASEGFAEDYACFIAGLLDLHDATFDAHWLRLAERLQQTM 549
Query: 525 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
D F D GGYFN+ DP ++LR+KED+DGAEP+ +S++ NL RL+S++ +
Sbjct: 550 DARFRDEVAGGYFNSPAGDPHIVLRLKEDYDGAEPAPSSIAAANLQRLSSLL---HDETL 606
Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
A ++ + A+P M CA + +L+ P + VV+ G ++ F ++A
Sbjct: 607 HARAVDTVEALRGQWSQTPHALPAMLCALERILAEPVQ--VVIAGDPAAPGFRALVAVVR 664
Query: 644 ASYDLNK-TVIHIDPA--DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
A + +I + PA + D W + R + A VCQ+++C PPV
Sbjct: 665 AQATRRRPALIGLVPAGGSDADADLWLRARAPWLDGMRPA-DGGQAAAYVCQHYTCQPPV 723
Query: 701 TDPISLENLL 710
T P +L LL
Sbjct: 724 TTPEALRQLL 733
>gi|301781214|ref|XP_002926022.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
20-like [Ailuropoda melanoleuca]
Length = 785
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 294/736 (39%), Positives = 413/736 (56%), Gaps = 69/736 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGW L+P+L+P +GGTYFPPED R GF T+L ++++ W
Sbjct: 150 VDKVYMTFVQATSSGGGW----XLTPNLQPFVGGTYFPPEDGLTRVGFHTVLLRIREQWK 205
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ + L ++ ++++ AL A + + ++P +A + C +QL + YD +GGF
Sbjct: 206 QNKTTLLENS----QRVTTALLARSEISMGDRQVPPSAATMNSRCFQQLDEGYDEEYGGF 261
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 262 AEAPKFPTPVILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 316
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA Y AF ++ D FYS + + IL Y+ R++ G
Sbjct: 317 HRYSTDRQWHIPHFEKMLYDQAQLAVAYTQAFQISGDEFYSDVAKGILQYVARNLSHRSG 376
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGAFYVWT EV+ +L E + LF +HY L
Sbjct: 377 GFYSAEDADSPPERG-MRPKEGAFYVWTVNEVQQLLPEPVLGATEPLTSGQLFMKHYGLT 435
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ ++ +L KLF R
Sbjct: 436 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKH 493
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L E + A + A F
Sbjct: 494 RPKPHLDSKMLAAWNGLMVSGYAVTGAVLGLE----------------RLITCAINGAKF 537
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D RL + GP S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 538 LKRHMFDVARGRLMRTCYAGPGGTVEHSNPPSWGFLEDYAFVVRGLLDLYEASQESSWLE 597
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 598 WALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 657
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 658 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKD 713
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I A+ + F +++ R D+ A VC+N
Sbjct: 714 TKALLQCVHSIYIPNKVLIL---ANGDPSSFLSRQLPFLSTLRRLE---DRATAYVCENQ 767
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L LL
Sbjct: 768 ACSMPITEPNELRKLL 783
>gi|126343214|ref|XP_001376429.1| PREDICTED: spermatogenesis-associated protein 20 [Monodelphis
domestica]
Length = 744
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 291/741 (39%), Positives = 421/741 (56%), Gaps = 73/741 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCHVME ESF+++ + ++L++ FVSIKVDREERPD
Sbjct: 45 GQEAFDKAKKENKPIFLSVGYSTCHWCHVMEEESFQNKDIGQILSEDFVSIKVDREERPD 104
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+PDL+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 105 VDKVYMTFVQATSSGGGWPMNVWLTPDLQPFVGGTYFPPEDGVTRVGFRTVLLRIREQWK 164
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ + ML + ++++ +L A + ELP +A + C +QL + YD GGF
Sbjct: 165 QNKAMLMANS----QRVTASLLARSEICMGDRELPPSASAVSNRCFQQLEEVYDEEHGGF 220
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
PKFP PV + + + + ++ G Q+M + TL+ MA GGI DHVG GF
Sbjct: 221 AEVPKFPTPVILSFLFSYWATHRMATDG-----FRAQQMAMHTLKMMANGGIRDHVGQGF 275
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA Y+ AF ++ D F++ I +DIL Y+ +++ G
Sbjct: 276 HRYSTDRQWHIPHFEKMLYDQAQLAVAYIQAFQISGDEFFADIAKDILQYVSQNLSHQSG 335
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLK 343
SAEDADS EG + KEGA+Y+W KE++D+L + LF +HY +
Sbjct: 336 GFCSAEDADSM-PEGEKKPKEGAYYLWKVKEIKDLLPDPVEGSNEPLTLGQLFMKHYGIT 394
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + DPH E +G+NVL +A++ G+ E +L R KL R +
Sbjct: 395 ENGN--IGSTQDPHGELQGQNVLTVRYSMDLTAARYGLEAEAVRTLLDIGREKLIQTRKR 452
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RPRP LD K++ +WNGL++S +A L +E E ++ A A F
Sbjct: 453 RPRPRLDSKMLAAWNGLMVSGYAITGATLGNE----------------EMIKQAIDGAKF 496
Query: 464 IRRHLYDEQTHRLQHSFRNGP--------SKAPGFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RHL+D + RL G S+ GFL+DYAF+I GLLDLYE + WL
Sbjct: 497 LKRHLFDVSSGRLIRGCYAGAGGTVEQSSSQWWGFLEDYAFVIRGLLDLYEASRESAWLE 556
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA++LQ+ QD+LF D +GGGYF E + L LR+K+D DG+EPS NSVS NL+R+
Sbjct: 557 WALKLQDMQDKLFWDTQGGGYFCNEVELRNDLPLRLKDDQDGSEPSANSVSAHNLLRIHG 616
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
+ DY + + L F RL + +A+P M A ++ + K VV+ G + D
Sbjct: 617 YTG--RRDYMEKCVK-LLTAFSDRLWKVPVALPEMVRAL-IIQQQTVKQVVICGSPQTTD 672
Query: 635 FENMLAAAHASYDLNKTVIHI--DPAD--TEEMDFWEEHNSNNASMARNNFSADKVVALV 690
+ ++ H+ Y NK +I DP+ ++ F +AR + + A V
Sbjct: 673 TQALINCVHSVYVPNKVLILTDGDPSSFLARQLPF----------LARFHKLEGRATAYV 722
Query: 691 CQNFSCSPPVTDPISLENLLL 711
C+N + S PVT+P L LLL
Sbjct: 723 CENQAYSMPVTEPAELRKLLL 743
>gi|194336238|ref|YP_002018032.1| hypothetical protein Ppha_1140 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194308715|gb|ACF43415.1| protein of unknown function DUF255 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 737
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 288/712 (40%), Positives = 406/712 (57%), Gaps = 62/712 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+ +AKLLN FV +KVDREE PD+D++YM+YVQA G GGWP+S
Sbjct: 70 STCHWCHVMEDESFENPEIAKLLNAHFVPVKVDREELPDLDRLYMSYVQASTGRGGWPMS 129
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-MLAQSGAFAIEQLSEA 138
V+L+P+L P GG+YFPPE++YG PGFKTIL + W+ +R+ ++++SG+F
Sbjct: 130 VWLTPELNPFYGGSYFPPEERYGMPGFKTILITITRYWENEREKIISESGSFFA------ 183
Query: 139 LSASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
S A S P P + A + C E L +YD FGGFG APKFPRPV + + H+
Sbjct: 184 -SLGAVSRTTPSSQPDAEMAQKKCFEWLEANYDPMFGGFGRAPKFPRPVLLNFLFNHAYH 242
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKM 250
D + +M L TL MA+GGIHDH+ GGGF RYS D+RWHVPHFEKM
Sbjct: 243 TGD-------KKALRMALHTLHKMAEGGIHDHLGIIGKGGGGFARYSTDQRWHVPHFEKM 295
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD QLA L+AF + D FY DI +Y+ DM P G +SAEDAD+ T G+
Sbjct: 296 LYDNAQLAISCLEAFQCSGDNFYKRTAEDIFNYVLCDMRSPQGGFYSAEDADTLLTHGSE 355
Query: 311 RKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
+K+EGA Y+W++ E+ + L E A +F Y ++ GN + DPH EF GKN+L++
Sbjct: 356 QKQEGALYLWSADEIRETLADEELATIFSFTYGIRDEGNAEY----DPHGEFNGKNILMQ 411
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
A G +E+ L + R KL+ RS+RPR LDDK++ +WNGL+IS+ A+
Sbjct: 412 QATDEECADTFGKTVEEIRAALDDARTKLYHARSRRPRAFLDDKILTAWNGLMISALAKG 471
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
++L +E ++ A AA+FI LYD+ RL +R+G +
Sbjct: 472 YQVLHNET----------------FLAAAREAANFILETLYDQANGRLLRRYRDGNAAIA 515
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
G +DYAFL+ GL DLYE S ++L A++L Q+ LF D GGYF+T +D +V L
Sbjct: 516 GKAEDYAFLVQGLTDLYEASSEVRYLQIALQLAEIQNTLFYDNAQGGYFSTAIDDHTVPL 575
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R+KE++DGAEPS NS+S +NL+RLA + D+ R+ AE ++ L + + A+P
Sbjct: 576 RIKEEYDGAEPSANSISTLNLLRLAEMTG--NEDFVRR-AEETIKSCRIMLAENSSALPQ 632
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
M A + + + H+V G S + + Y T+ H A E +
Sbjct: 633 MLVAKN-FAEQRKVHLVFSGPLDSSSMNELRQTVYEQYLPGATMSH---ASKESAHIFPS 688
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL----LEKPSS 716
H A +A+ + +A +C + SC PP +P L +L L +P S
Sbjct: 689 H---AAIIAKEDGNAK---VYICIDKSCQPPTENPERLAAMLDSQFLHRPDS 734
>gi|449543699|gb|EMD34674.1| hypothetical protein CERSUDRAFT_86096 [Ceriporiopsis subvermispora
B]
Length = 737
Score = 506 bits (1303), Expect = e-140, Method: Compositional matrix adjust.
Identities = 296/728 (40%), Positives = 415/728 (57%), Gaps = 51/728 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + + FL + CHWCHV+ ESFEDE AK++N+ +V+IKVDREERPD
Sbjct: 39 GQEAFDAAKRHNKPIFLSVGYSACHWCHVLAHESFEDEVTAKIMNEHYVNIKVDREERPD 98
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD++YMT++QA GGGGWP+SV+L+P+L P GTYFP + F+ +L K+ + W+
Sbjct: 99 VDRLYMTFLQATTGGGGWPMSVWLTPELHPFFAGTYFP------QGQFRQVLLKLAEVWN 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
A+ G IEQL A S A S +P + ++ + +L K YDSR GGFG A
Sbjct: 153 NDPARCAEVGKSVIEQLRNA-SNIAPSASIPS-ISAASISIY-RRLEKRYDSRHGGFGGA 209
Query: 179 PKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
PKFP+P + L Y + + DT +A + + M + T+ + GGI D VGGGF RY
Sbjct: 210 PKFPQPSQTTHFLARYAALNMRDTTTKKDAEQARDMAVETMVKIYNGGIRDVVGGGFSRY 269
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSL-----TKDVFYSYICRDILDYLRRDMIGP 291
SVDERWHVPHFEKMLYD+GQL + ++ L + + DI+ Y+ RD+ P
Sbjct: 270 SVDERWHVPHFEKMLYDEGQLLSSAIELSLLLPCDAPERTTLQLMAADIVTYVARDLRSP 329
Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
G +SAEDADS + +T KKEGAFYVWT+K+++D+LG A FK H+ ++ GNCD S
Sbjct: 330 EGGFYSAEDADSLPSSDSTVKKEGAFYVWTAKQLDDLLGAEAEAFKYHFGVEAKGNCDPS 389
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLD 410
D E KG+NVL + +A K G +E+ +L KL + R K RPRPHLD
Sbjct: 390 H--DIQGELKGQNVLYTAHTPEETAKKFGRSIEETGQLLKGSLAKLKEYRDKERPRPHLD 447
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
DK++ WNGL+IS ++AS++L E + ++ +++AE +A+FIR+ LYD
Sbjct: 448 DKILTCWNGLMISGLSKASEVLDESFELS-----------EKALQLAEDSATFIRQRLYD 496
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
E T L+ S+R GP G DDYAFLI GLLDLYE ++ +WAI LQ QDELF D
Sbjct: 497 ESTGELRRSYREGPGPT-GQADDYAFLIQGLLDLYEASGKEEYALWAIRLQEKQDELFWD 555
Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
EGGGYF ++ DP +L+R+K+ DGAEPS SV+ NL RL S A + Y++ A
Sbjct: 556 SEGGGYF-SSAPDPHILVRMKDPQDGAEPSAQSVAFWNLQRL-SHFAEDRHGAYQEKARG 613
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
L L A+ M A +L+ K + V S + + L A H+ + +
Sbjct: 614 VLETDAQILGQAPYALAAMVSGA-LLAEKGLKQFI-VTKPSYSEAASFLKAVHSRFIPQR 671
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASM--------ARNNFSADKVVALVCQNFSCSPPVTD 702
+IH+DP E NA++ + A + VC+NF+C PV D
Sbjct: 672 VLIHLDPEHPP-----RELAEVNATLRALIEDVDTNKDGDAKRASVRVCENFACGLPVED 726
Query: 703 PISLENLL 710
+E +L
Sbjct: 727 LEEVEKML 734
>gi|149053889|gb|EDM05706.1| spermatogenesis associated 20 [Rattus norvegicus]
Length = 745
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 289/736 (39%), Positives = 417/736 (56%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + LLN+ FVS+ VDREERPD
Sbjct: 46 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNENFVSVMVDREERPD 105
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L ++ D W
Sbjct: 106 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 165
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 166 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 221
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S ++ G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 222 AEAPKFPTPVILNFLFSYWLSHRVTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 276
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QL+ VY AF ++ D F+S + + IL Y+ R++ G
Sbjct: 277 HRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAFQISGDEFFSDVAKGILQYVTRNLSHRSG 336
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADS G + +EGA Y+WT KEV+ +L E L +HY L
Sbjct: 337 GFYSAEDADSPPERG-VKPQEGALYLWTVKEVQQLLPEPVGGASEPLTSGQLLMKHYGLS 395
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + ++ D + E G+NVL +A++ G+ +E +L KLF R
Sbjct: 396 EAGNINPTQ--DVNGEMHGQNVLTVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 453
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+ HLD+K++ +WNGL++S FA A +L E + + A + A F
Sbjct: 454 RPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME----------------KLVTQATNGAKF 497
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL+ + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 498 LKRHMFDVSSGRLKRTCYAGAGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 557
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+ QD+LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 558 WALRLQDIQDKLFWDSHGGGYFCSEAELGTDLPLRLKDDQDGAEPSANSVSAHNLLRLHG 617
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
+ G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 618 LT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDPQAKD 673
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F +++ R D+ + +N
Sbjct: 674 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLSNLRR---VEDRATVYIFENQ 727
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 728 ACSMPITDPCELRKLL 743
>gi|427779347|gb|JAA55125.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 816
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 311/785 (39%), Positives = 417/785 (53%), Gaps = 126/785 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE++ +AK++ND FV++KVDREERPDVD+VYMTY+QA GGGGWP+S
Sbjct: 65 STCHWCHVMERESFENDDIAKIMNDNFVNVKVDREERPDVDRVYMTYIQATSGGGGWPMS 124
Query: 80 VFLSPDLKPLMGGTYFPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGA--FAI-EQL 135
++L+PDLKP++GGTYFPP+D+ YG+PGFKT+L + + W K R L G F I EQ
Sbjct: 125 IWLTPDLKPVVGGTYFPPDDRYYGQPGFKTLLTSLAEQWRKNRTKLIDQGTRIFQILEQT 184
Query: 136 SE-----------ALSASASSNKLPDELPQNALRLCAEQ---------LSKSYDSR-FGG 174
S+ + S ++ K P + C Q L ++ D R FGG
Sbjct: 185 SDVRVFGGDGVPTSPRGSEANQKCP--FAPDVATTCYRQLXGTRIFQILEQTSDVRVFGG 242
Query: 175 ----------------------------------------FGSAPKFPRPVEIQMMLYHS 194
FG APKFP+ V + +L +
Sbjct: 243 DGVPTSPRGSEANQKCPFAPDVATTCYRQLERSYDVSMGGFGRAPKFPQCVNLNFLLRYR 302
Query: 195 KKLEDTGKSGEAS----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
L EA + +M + TL+ MA+GGIHDH+G GFHRYS D +WHVPHFEKM
Sbjct: 303 AVLLQGDPPPEAKTAVDKALEMTVHTLRMMAQGGIHDHIGKGFHRYSTDGKWHVPHFEKM 362
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYDQ QL Y +A+ +T D + + RDIL Y+ RD+ P G +SAEDADS G
Sbjct: 363 LYDQAQLTRTYSEAYQVTHDRRLADVARDILCYVERDLSHPSGGFYSAEDADSYPEHGDK 422
Query: 311 RKKEGAFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
K+EGAF VW EV +L E A + +Y ++ +GN D M DPH+E K
Sbjct: 423 EKREGAFCVWEESEVYRLLTEPLPSCPTKTVADIVCRYYDIRKSGNVD--PMQDPHDELK 480
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
KNVLI + A+ G+ + +L R LF+ R +RP+PHLDDK + SWNGL+
Sbjct: 481 RKNVLIVRESKESVAACYGLEVGVLDALLERARETLFEARLRRPKPHLDDKFLTSWNGLM 540
Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-F 480
IS FA A++ L N PV Y++ A FI++HLY+ + L S +
Sbjct: 541 ISGFAIAARTL---------NQPV-------YLDRALKCVEFIKKHLYNPKKKTLIRSAY 584
Query: 481 R-------NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
R G G L+DYAFLI LLD+YE L+WA ELQ+ QD LF D++
Sbjct: 585 RGEDGSVVQGSQPIDGVLEDYAFLIQALLDVYEASFDVSCLMWAEELQDKQDRLFWDKKD 644
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
GYF + GEDP+V+LR+K+D DGAEPS NSVS+ NLVRL+ ++ + D RQ AE +
Sbjct: 645 MGYFLSNGEDPTVVLRLKDDQDGAEPSSNSVSLNNLVRLSVLL---QRDELRQRAEKLAS 701
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
V+ R+ + +A+P M C L + VV+ G + + +L+ + TVI
Sbjct: 702 VYGQRMILVPLALPEMVCGLMRLQA-GPQEVVIAGPRDDPGTKELLSCLRRHFLPFVTVI 760
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSA-----DKVVALVCQNFSCSPPVTDPISLEN 708
D + N NF K A VCQ+F CS PVT LE
Sbjct: 761 LAD-----------QDPENPLRKRLTNFDGYTCVNGKPAAYVCQDFQCSKPVTTAAELEA 809
Query: 709 LLLEK 713
LL K
Sbjct: 810 LLTAK 814
>gi|373850029|ref|ZP_09592830.1| hypothetical protein Opit5DRAFT_0884 [Opitutaceae bacterium TAV5]
gi|372476194|gb|EHP36203.1| hypothetical protein Opit5DRAFT_0884 [Opitutaceae bacterium TAV5]
Length = 734
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 293/730 (40%), Positives = 402/730 (55%), Gaps = 44/730 (6%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F ++ FL +TCHWCHVM ESFE+E VA +LN+ FVSIKVDREERPD
Sbjct: 27 GEEAFARARAEQKPIFLSIGYSTCHWCHVMARESFENEAVAAVLNEHFVSIKVDREERPD 86
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYM YVQA+ G GGWPLSV+L+PDLKP GGTYFPPED+ GR G ++L + W+
Sbjct: 87 VDKVYMAYVQAMTGHGGWPLSVWLAPDLKPFYGGTYFPPEDRSGRSGLLSVLDVIIQGWN 146
Query: 119 ---KKRDMLAQS--------GAFAIEQLSEALSASASSNKLPD--ELPQNALRLCAEQLS 165
++R +A+S G +A +Q+ + +P E +A C QL
Sbjct: 147 DDGERRKFVAESSRVIDVLAGYYAGKQVR-----PDPATPMPPLYETGGDAFERCYLQLG 201
Query: 166 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGI 225
+S+DS GGFG APKFPR + + + ++G E M TL+ M GGI
Sbjct: 202 ESFDSTHGGFGGAPKFPRASNLDFLFRVAAIQGPETETGR--EAVSMAASTLRHMIAGGI 259
Query: 226 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLR 285
HDHVGGGFHRYSVD+ W VPHFEKMLYDQ Q+A LDA T D Y++ R LDY+
Sbjct: 260 HDHVGGGFHRYSVDDAWFVPHFEKMLYDQAQIAVNLLDAALFTGDERYAWAARATLDYVL 319
Query: 286 RDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKP 344
RD+ P G FSAEDAD+A GAT EGAFYVWT+ E+ L + A L + H + P
Sbjct: 320 RDLTHPDGGFFSAEDADAAPAHGATEHVEGAFYVWTADELRRALSPDAARLVESHLGINP 379
Query: 345 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 404
++ DPH E +GKN+L ++ + +A+ LG+ L L +R+ R
Sbjct: 380 GSEGNVPPALDPHGELRGKNILRQVRPLAETAAALGLEPAAAAERLAAALETLQAIRTAR 439
Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
PRPHLDDKVI +WNGL +S+FARA+ + + R Y++ A AA F+
Sbjct: 440 PRPHLDDKVITAWNGLALSAFARAATSPAA----------CLDDRRDRYLDAARRAARFV 489
Query: 465 RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
R L D L ++R + GF +DYA I+GLLDL++ WL A LQ T
Sbjct: 490 ERELCDAGRGVLYRAWRGERGASEGFAEDYACFIAGLLDLHDATFDAHWLRLAERLQQTM 549
Query: 525 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
D F D GGYFN+ DP ++LR+KED+DGAEP+ +S++ NL RL+S++ +
Sbjct: 550 DARFRDEIAGGYFNSPAGDPHIVLRLKEDYDGAEPAPSSIAASNLQRLSSLL---HDETL 606
Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
A ++ + A+P M CA + +L+ P + VV+ G ++ F ++A
Sbjct: 607 HARAVDTVEALRGQWSQTPHALPAMLCALERILAEPVQ--VVIAGDPAAPGFRALVAVVR 664
Query: 644 ASYDLNK-TVIHIDPA--DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
A + +I + PA + D W + R + A VCQ+++C PV
Sbjct: 665 AQATRRRPALIGLVPAGGSDADADLWLRARAPWLDGMRPA-DGGQAAAYVCQHYTCQSPV 723
Query: 701 TDPISLENLL 710
T P +L LL
Sbjct: 724 TTPEALRQLL 733
>gi|40786501|ref|NP_955434.1| spermatogenesis-associated protein 20 [Rattus norvegicus]
gi|81871190|sp|Q6T393.1|SPT20_RAT RecName: Full=Spermatogenesis-associated protein 20; AltName:
Full=Sperm-specific protein 411; Short=Ssp411
gi|38156445|gb|AAR12892.1| sperm protein SSP411 [Rattus norvegicus]
Length = 789
Score = 503 bits (1296), Expect = e-139, Method: Compositional matrix adjust.
Identities = 288/736 (39%), Positives = 417/736 (56%), Gaps = 65/736 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + LLN+ FVS+ VDREERPD
Sbjct: 90 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNENFVSVMVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L ++ D W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S ++ G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRVTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QL+ VY AF ++ D F+S + + IL Y+ R++ G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAFQISGDEFFSDVAKGILQYVTRNLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADS G + +EGA Y+WT KEV+ +L E L +HY L
Sbjct: 381 GFYSAEDADSPPERG-VKPQEGALYLWTVKEVQQLLPEPVGGASEPLTSGQLLMKHYGLS 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + ++ D + E G+NVL + + ++ G+ +E +L KLF R
Sbjct: 440 EAGNINPTQ--DVNGEMHGQNVLTVRDSLELTGARYGLEVEAVRALLNTGLEKLFQARKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+ HLD+K++ +WNGL++S FA A +L E + + A + A F
Sbjct: 498 RPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME----------------KLVTQATNGAKF 541
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL+ + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 542 LKRHMFDVSSGRLKRTCYAGAGGTVEQSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 601
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+ QD+LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 602 WALRLQDIQDKLFWDSHGGGYFCSEAELGTDLPLRLKDDQDGAEPSANSVSAHNLLRLHG 661
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
+ G K + L F R++ + +A+P M A + K +V+ G + D
Sbjct: 662 LT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDPQAKD 717
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I AD + F +++ R D+ + +N
Sbjct: 718 TKALLQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLSNLRR---VEDRATVYIFENQ 771
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 772 ACSMPITDPCELRKLL 787
>gi|409047490|gb|EKM56969.1| hypothetical protein PHACADRAFT_92450 [Phanerochaete carnosa
HHB-10118-sp]
Length = 717
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 284/700 (40%), Positives = 406/700 (58%), Gaps = 58/700 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHV+ ESFEDE AKL+N+ +V++KVDREERPDVD++YMT++QA GGGGWP+S
Sbjct: 62 SACHWCHVLAHESFEDEVTAKLMNERYVNVKVDREERPDVDRLYMTFLQATSGGGGWPMS 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL P GTYFP + F+ L K+ + W++ R+ L +SG IEQL +
Sbjct: 122 VWLTPDLHPFFAGTYFP------KGQFRQALEKLANFWEEDRERLVESGKGIIEQLKSSS 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE- 198
+AS S ++L + YDS GGFG APKFP P + L L
Sbjct: 176 NASICSQ-------------VYKRLERLYDSVHGGFGGAPKFPSPSQTTHFLARLAALNI 222
Query: 199 -DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
D EA + + M + T+ + GGI D VGGGF RYSVD+ WHVPHFEKMLYD+ QL
Sbjct: 223 GDEKLKSEALKARDMAVQTMVKIYNGGIRDVVGGGFSRYSVDDHWHVPHFEKMLYDEAQL 282
Query: 258 ANVYLDAFSL-----TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
+ L+ L + + DI+ Y+ RD+ G +SAEDADS + +T K
Sbjct: 283 LSSALELAQLLPIDSVECKTLEAMANDIIIYVSRDLRNSEGAFYSAEDADSLPSSDSTIK 342
Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
KEGAFYVWTS +++++LG+++ +FK HY +K GNCD D E KG+NVL +
Sbjct: 343 KEGAFYVWTSAQLDELLGDNSDVFKFHYGVKSNGNCDPKH--DVQGELKGQNVLYTAHTV 400
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKI 431
+A K G+P E+ L +C L R + RPRPHLDDK++ WNGL++S A+AS++
Sbjct: 401 EDTARKFGIPAEQVQVTLDQCLAHLKRYRDENRPRPHLDDKILTCWNGLMLSGLAKASEV 460
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L+ +A +A +++AE +A+FI++ LYDE+T L+ S+R GP G
Sbjct: 461 LEGQAANA--------------LKLAEDSAAFIKKELYDEKTGELRRSYRQGPGPT-GQA 505
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
DDYAFLI GLLDLYE +++ WAI LQ QDELF D EGGGYF + DP +L+R+K
Sbjct: 506 DDYAFLIQGLLDLYEASGKEEYVTWAIRLQEKQDELFHDTEGGGYF-ASAPDPHILVRMK 564
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
+ DGAEPS SV++ NL RLA A + YR+ A+ L L+ A+ M
Sbjct: 565 DAQDGAEPSAVSVTLYNLNRLAHF-AEDRHGEYREKAQSILRSNSQLLEHAPFALATMVS 623
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA----DTEEMDFWE 667
AA + + + ++ G S+ D L A ++ ++ +IH+DP + +++
Sbjct: 624 AA-LTAQRGYRQFIVSGEASNSDTTRFLHAIRHTFVPSRVLIHLDPQRPPRELAKLNGTL 682
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
++++ AR N +C+NF+C P+ DP L+
Sbjct: 683 RALMDDSANARPNVR-------LCENFACGLPIYDPKELK 715
>gi|395536753|ref|XP_003770376.1| PREDICTED: spermatogenesis-associated protein 20 [Sarcophilus
harrisii]
Length = 744
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 285/739 (38%), Positives = 413/739 (55%), Gaps = 69/739 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + FL +TCHWCHVME ESF ++ + ++L++ FVS+KVDREE PD
Sbjct: 45 GQEAFDKAKNENKPIFLSVGYSTCHWCHVMEEESFRNKEIGEILSEDFVSVKVDREEHPD 104
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+PDL+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 105 VDKVYMTFVQATSSGGGWPMNVWLTPDLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 164
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNA---LRLCAEQLSKSYDSRFGGF 175
+ + ML ++ ++++ +L A + ELP A + C +QL + YD GGF
Sbjct: 165 QNKAMLLENS----QRVTASLLARSEITVGDRELPPTASAVSKRCFQQLEEVYDEEHGGF 220
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
APKFP PV + + + T E Q+M + +L+ MA GGI DHVG GFHR
Sbjct: 221 AEAPKFPTPVILSFLFSYWAAHRMT---SEGFRAQQMAMHSLKMMANGGIRDHVGQGFHR 277
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
YS D +WH+PHFEKMLYDQ QLA Y AF ++ D +S + + IL Y+ +++ P G
Sbjct: 278 YSTDRQWHIPHFEKMLYDQAQLAVAYTQAFQVSGDELFSDVAKGILQYVSQNLSHPSGGF 337
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPT 345
+SAEDADS EG + KEGA+Y+WT E++D+L E LF +HY + T
Sbjct: 338 YSAEDADSV-PEGEVKPKEGAYYLWTVNEIKDLLPEPVEGATEPLSLGQLFMKHYGVTET 396
Query: 346 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 405
GN + DP E +G+NVL +A++ G+ E +L R KL +R +R
Sbjct: 397 GN--IGSTQDPQGELQGQNVLTVRYSMDLTAARFGLEAETVRKLLDTGREKLVQIRKRRS 454
Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
RP LD K++ +WNG+++S +A A +L E E + A A F++
Sbjct: 455 RPRLDIKMLAAWNGMMVSGYAIAGAVLGKE----------------ELINQAIDGAKFLK 498
Query: 466 RHLYDEQTHRLQH--------SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWA 517
RHL+D + RL + S+ GFL+DYAF+I GLLDLYE + WL WA
Sbjct: 499 RHLFDVSSGRLFRGCYATIGGTVEQSSSQFWGFLEDYAFVIRGLLDLYEASGESAWLEWA 558
Query: 518 IELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIV 576
+ LQ+ QD+LF D +GGGYF + E L LR+K+D DG+EPS NSVS NL+R+ +
Sbjct: 559 LRLQDMQDKLFWDTQGGGYFCSEAELGGNLPLRLKDDQDGSEPSANSVSAHNLLRIHAYT 618
Query: 577 AGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFE 636
+ D+ + + L F RL+ + +A+P M A + + K +V+ G D +
Sbjct: 619 G--RRDWMDKCVK-LLTAFSDRLRRVPVALPEMVRAL-CIQQQTIKQIVICGSPQGQDTK 674
Query: 637 NMLAAAHASYDLNKTVIHID--PAD--TEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
++ H+ Y NK +I D P+ ++ F + R + A VC+
Sbjct: 675 ALIDCVHSIYVPNKVLILYDGEPSSFLARQLPF----------LVRLQKVDSQATAYVCE 724
Query: 693 NFSCSPPVTDPISLENLLL 711
N + S PVT+P L LLL
Sbjct: 725 NQAYSLPVTEPAELRKLLL 743
>gi|431890790|gb|ELK01669.1| Spermatogenesis-associated protein 20 [Pteropus alecto]
Length = 777
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 293/736 (39%), Positives = 415/736 (56%), Gaps = 77/736 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPD
Sbjct: 90 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRIGFRTVLLRIREQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISTGDRQLPPSAATMNSRCFQQLDEGYDEEY--- 262
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
V + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 ---------VILNFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 308
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQGQLA Y AF ++ D FYS + + IL Y+ R++ G
Sbjct: 309 HRYSTDRQWHVPHFEKMLYDQGQLAVAYSQAFQISGDEFYSDVAKGILQYVSRNLSHRSG 368
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADS G R KEGAFYVWT KEV+ +L E L +HY L
Sbjct: 369 GFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQQLLPESVHGATEPLTSGQLLMKHYGLT 427
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 428 EAGN--ISPNQDPKGELQGQNVLTVRYSLELTAARFGLDVEAIRTLLNTGLEKLFQARKH 485
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L E + N+ A + A F
Sbjct: 486 RPKPHLDSKMLAAWNGLMVSGYAITGAVLGME---RLVNY-------------ATNGAKF 529
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 530 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASLESAWLE 589
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD+LF D GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 590 WALRLQDTQDKLFWDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 649
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + + L F R++ + +A+P M A + + K +V+ G + D
Sbjct: 650 FT-GHKD--WMEKCVCLLTAFSERMRRVPVALPEMVRAL-LAHQQTLKQIVICGDPQAKD 705
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + F ++ R D+ A VC+N
Sbjct: 706 TKALVQCVHSIYIPNKVLIL---ADGDPSSFLSRQLPFLNTLRR---LEDRATAYVCENQ 759
Query: 695 SCSPPVTDPISLENLL 710
+CS PVT+P L LL
Sbjct: 760 ACSMPVTEPSELRKLL 775
>gi|110598780|ref|ZP_01387040.1| Protein of unknown function DUF255 [Chlorobium ferrooxidans DSM
13031]
gi|110339607|gb|EAT58122.1| Protein of unknown function DUF255 [Chlorobium ferrooxidans DSM
13031]
Length = 712
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 276/669 (41%), Positives = 386/669 (57%), Gaps = 56/669 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + R FL +TCHWCHVME ESFE+ +A++LN +FV +KVDREE PD
Sbjct: 33 GEEAFEKAERENRPIFLSVGYSTCHWCHVMERESFENPDIAEVLNRYFVPVKVDREELPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM YVQ+ G GGWP+SV+L+PD P GG+YFPPED+YG GFKTIL + W+
Sbjct: 93 LDRLYMEYVQSTTGRGGWPMSVWLTPDRNPFYGGSYFPPEDRYGMTGFKTILLSIASLWE 152
Query: 119 KKRDML--AQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG 176
+ + A SG F+ Q A++ + LP E A C L ++D +GGF
Sbjct: 153 SDEEKIRDASSGFFSDLQ----AFAASRAAALPPE--DEAQHNCFRWLESTFDPVYGGFS 206
Query: 177 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------G 230
APKFPRPV + + H+ SG S+ ++M LFTL+ MA+GGIHDH+ G
Sbjct: 207 GAPKFPRPVLLNFLFSHAY------YSGN-SKAREMALFTLRRMAEGGIHDHISVTGKGG 259
Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
GGF RYS DERWHVPHFEKMLYD QLA YL+AF + + + + DI +Y+ DM
Sbjct: 260 GGFARYSTDERWHVPHFEKMLYDNAQLAVSYLEAFQCSGEPLFRSVAEDIFNYVLSDMTA 319
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNC 348
P G +SAEDADS E+E T KKEGAFY+W + E+ + +G E A +F Y ++ GN
Sbjct: 320 PEGGFYSAEDADSLESESGTEKKEGAFYLWRADELHEAIGNAEQAAIFSFVYGVRAEGNA 379
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
++DPH EF G+N+L++ +A + G + ++L E RRKL+ RS RPRP
Sbjct: 380 ----LNDPHGEFTGRNILMQQVSVEETAVRFGKTAVEIRDVLDEARRKLYTARSGRPRPF 435
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
LDDK++ SWN L+IS+ ++ ++L SE E + A AA F+ L
Sbjct: 436 LDDKILTSWNALMISALSKGFRVLHSE----------------ECLTAARKAADFLLETL 479
Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
YD ++ RL +R+G + G +DDYAF + L+DLYE +L A+EL Q LF
Sbjct: 480 YDRRSCRLLRRYRDGSAAIAGKVDDYAFFVQALIDLYEASFEIVYLKAALELAEVQKTLF 539
Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
D GGYF++ +D +V +R KE +DGAEPS NSV+ +NL+RL + K ++ Q A
Sbjct: 540 CDALHGGYFSSASDDQTVPVRQKESYDGAEPSANSVTALNLLRLGELTG--KEEFALQ-A 596
Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK---HVVLVGHKSSVDFENMLAAAHAS 645
E + F T L + A+P M A + +RK ++ G + + E + A A
Sbjct: 597 EELFSAFGTTLASQSHALPQMLVALNF----ARKRGCRILFSGDLHATEMERLRAVAGER 652
Query: 646 YDLNKTVIH 654
Y V+H
Sbjct: 653 YLPGTVVMH 661
>gi|395328680|gb|EJF61071.1| hypothetical protein DICSQDRAFT_161788 [Dichomitus squalens
LYAD-421 SS1]
Length = 791
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 284/704 (40%), Positives = 400/704 (56%), Gaps = 63/704 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHV+ ESFEDE AK++N+++V+IKVDREERPDVD++YMT++QA GGGGWP+S
Sbjct: 112 SACHWCHVLAHESFEDEVTAKIMNEYYVNIKVDREERPDVDRLYMTFLQATTGGGGWPMS 171
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL P GTYFPP + F+ +L K+ + W++ + SG IE L ++
Sbjct: 172 VWLTPDLHPFFAGTYFPPGN------FRQVLIKLAEIWERDPERCIASGKQIIEVLQQSS 225
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-----YHS 194
A+ S L + L QL K +D++ GGFG APKFP P + L Y+
Sbjct: 226 KAAPESGVDVKPLAEKILT----QLQKRFDAKEGGFGRAPKFPSPSQTMYPLARIAAYYL 281
Query: 195 KKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
T + E++E + M +FT+ + GGI D VGGGF RYSVDERWHVPHFEKMLYD
Sbjct: 282 NNSSATAQEKESAEKARDMAVFTMTKIYNGGIRDVVGGGFSRYSVDERWHVPHFEKMLYD 341
Query: 254 QGQLANVYLDAFSLTKD-----VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
+ QL + L+ + L + +DI+ Y+ RD+ P G +SAEDADS +
Sbjct: 342 EAQLLSSALELYQLLPSGSHDKTTLELMAKDIVSYVARDLRSPQGGFYSAEDADSLPSHE 401
Query: 309 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
+T KKEGAFYVWT+K+++++L A LFK H+ +K GNCD S D E KG+NVL
Sbjct: 402 STVKKEGAFYVWTAKQLDELLDADAELFKYHFGVKAEGNCDPSH--DIQGELKGQNVLFT 459
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFAR 427
+ +A K G E+ L L + R+K RPRPHLDDK++ WNGL+IS ++
Sbjct: 460 AHTLEETAQKFGKAYEEVQKTLEVNLATLREYRNKHRPRPHLDDKILACWNGLMISGLSK 519
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
++L S +E A K+ +++AE +A+F+R HLYDE++ L S+R GP
Sbjct: 520 TYEVLHSHSEIA-----------KKALQLAEDSATFLRAHLYDEKSGTLWRSYREGPGPT 568
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
G DDYAFLI GLLDLYE + ++L+WA+ LQ QDELF D EGGGYF + D +L
Sbjct: 569 -GQADDYAFLIQGLLDLYEASAKEEYLLWALRLQEKQDELFYDPEGGGYF-ASAPDEHIL 626
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
+R+K+ DGAEPS SV+V NL RLA + S + + +LA LK A+
Sbjct: 627 VRMKDAQDGAEPSAVSVAVSNLQRLAHFAEDNHSAFTEKTTS-TLASNGQFLKQAPHALA 685
Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDF--------ENMLAAAHASYDLNKTVIHIDPAD 659
M AA L G K + F L +++ N+ +IH DP++
Sbjct: 686 YMVSAA------------LTGEKGYMQFIYEGTSQDSPFLKLIRSTFIPNRVLIHFDPSN 733
Query: 660 TEEMDFWEEHNSNNASMA---RNNFSADKVVALVCQNFSCSPPV 700
+HN + S+ + ++C+NF+C P+
Sbjct: 734 PPRG--IAKHNGSVRSLVEELEKKEGEHRENVMICENFTCGLPI 775
>gi|392558461|gb|EIW51649.1| hypothetical protein TRAVEDRAFT_137028 [Trametes versicolor
FP-101664 SS1]
Length = 739
Score = 494 bits (1271), Expect = e-137, Method: Compositional matrix adjust.
Identities = 297/737 (40%), Positives = 411/737 (55%), Gaps = 67/737 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIK-VDREERP 57
G+ +F K + FL + CHWCHV+ ESFEDE AK++N+ +V++K VDREERP
Sbjct: 36 GQEAFDKAKKENKPIFLSVGYSACHWCHVLAHESFEDEITAKMMNEHYVNVKKVDREERP 95
Query: 58 DVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
DVD++YMT++QA GGGGWP+SV+L+PDL P GTYFPP GR F+ IL ++ D W
Sbjct: 96 DVDRLYMTFLQASTGGGGWPMSVWLTPDLHPFFAGTYFPP----GR--FRQILDRLADVW 149
Query: 118 DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL------CAEQLSKSYDSR 171
R+ +S +E L E SSN P PQ+++ L ++L K +D
Sbjct: 150 TYDRERCIESAGKVLETLKE------SSNIAPS--PQDSVELKPLPQEVFQRLQKRFDGV 201
Query: 172 FGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGE----ASEGQKMVLFTLQCMAKGGI 225
GGFG APKFP P + L Y + L D S E A + M ++++ + GGI
Sbjct: 202 NGGFGGAPKFPSPAQTTHFLARYAASHLSDLNASNEDKKNAQAARDMAVYSMIKIYNGGI 261
Query: 226 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL----TKD-VFYSYICRDI 280
D VGGGF RYSVDERWHVPHFEKMLYD+ QL + LD + L ++D + +DI
Sbjct: 262 RDVVGGGFSRYSVDERWHVPHFEKMLYDEAQLLSSSLDLYQLLTTPSRDKKTLELMAKDI 321
Query: 281 LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHY 340
+ Y+ D+ P G +SAEDADS T + KKEGAFYVWTS++++++LG A LF+ H+
Sbjct: 322 VSYVANDLRSPEGGFYSAEDADSLPTHDSIVKKEGAFYVWTSEQLDELLGADAELFEYHF 381
Query: 341 YLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDV 400
++ GNCD D E KG+NVL + S +A K G +E ILG + L D
Sbjct: 382 GVEADGNCDPGH--DIQGELKGQNVLFTAHTSEETADKFGKSVEDTEKILGAGLKTLRDY 439
Query: 401 RSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAES 459
R K RPRPHLDDK++ WNGL+IS AR S++L + + A + +++AE+
Sbjct: 440 RDKHRPRPHLDDKILTCWNGLMISGLARTSEVLGHDKDVA-----------SKALDMAEA 488
Query: 460 AASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 519
+A+FIR HL+DEQ+ +L S+R GP G DDYAFLI G LDLYE + + L+WA+
Sbjct: 489 SAAFIRGHLFDEQSGKLWRSYREGPGPT-GQADDYAFLIQGFLDLYEASANEEHLLWALR 547
Query: 520 LQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
LQ QDELF D E GGYF + D +L+R+K+ DGAEPS SV++ NL RLA +
Sbjct: 548 LQEKQDELFYDPEDGGYF-ASAPDEHILIRMKDAQDGAEPSAVSVTLANLQRLAHLAEDR 606
Query: 580 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 639
+D Y A+ L+ L A+ M A M + K + H + +L
Sbjct: 607 HAD-YNAKAKSILSSNGQLLTRAPFALASMVSGAMM----ADKGYMQFIHTGASSTSPLL 661
Query: 640 AAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM------ARNNFSADKVVALVCQN 693
+++ N+ +IHIDP + E N S+ K +C+N
Sbjct: 662 ELTRSTFIPNRVLIHIDPKNLP-----RELAKVNGSIRSLIEELERTGGETKENVRICEN 716
Query: 694 FSCSPPVTDPISLENLL 710
F+C P+ D L L
Sbjct: 717 FTCGLPIEDVDDLRTRL 733
>gi|223935696|ref|ZP_03627612.1| protein of unknown function DUF255 [bacterium Ellin514]
gi|223895704|gb|EEF62149.1| protein of unknown function DUF255 [bacterium Ellin514]
Length = 701
Score = 493 bits (1269), Expect = e-136, Method: Compositional matrix adjust.
Identities = 289/715 (40%), Positives = 401/715 (56%), Gaps = 72/715 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVME ESFE E + K LN+ FVSIKVDREERPD
Sbjct: 52 GEEAFAKARKENKPIFLSIGYSTCHWCHVMERESFEKEEIGKYLNEHFVSIKVDREERPD 111
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YMT+VQ+ G GGWPL+ FL+PDLKP GGTYFPPE KYGRP F +L+ + W+
Sbjct: 112 VDKIYMTFVQSTSGQGGWPLNCFLTPDLKPFYGGTYFPPESKYGRPSFLDLLKHINQLWE 171
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ + S EQL++ ++A ++N L L Q L A QL + YDSR GGFG A
Sbjct: 172 TRHGDVTNSAVQLHEQLAQ-MTAKETTNGL--ALTQAVLNKAAGQLKEMYDSRNGGFGDA 228
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP+P + +L + G E MVL T MA+GGIHD +GGGF RY+V
Sbjct: 229 PKFPQPSQPAFLLRY-------GVHSNDQEAIAMVLNTCDHMARGGIHDQIGGGFARYAV 281
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D +W VPHFEKMLYD QL N+YLDA+ ++ + Y+ RD++ Y+ RDM G +SA
Sbjct: 282 DAKWLVPHFEKMLYDNAQLVNLYLDAYLVSGETRYADTARDVIGYVLRDMTHAEGGFYSA 341
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDP 356
EDADS EG KEG FY WT E+ +L E + K Y T + SDP
Sbjct: 342 EDADS---EG----KEGKFYCWTRVELAKLLTPEEFNVAVK---YFGITEGGNFVDHSDP 391
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
+NVL ++ + A + PL L ++K+F RSKR RPHLDDK++ S
Sbjct: 392 -EPLPNQNVLSIVDSNLPRADE---PL------LQSAKQKMFAARSKRVRPHLDDKILAS 441
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL++S+ ARA +L KEY+ AE SF++ L+D +T L
Sbjct: 442 WNGLMLSAIARAYAVLGD----------------KEYLTAAEHNLSFLQSKLWDAKTKTL 485
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
H +R+G + YAFL++G++DLYE + L +AI L + F D GG+
Sbjct: 486 YHRWRDGERDTAQLHETYAFLLNGVVDLYEATLDPRHLEFAISLADAMIAKFYDPAEGGF 545
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
+ + G P ++LR+KED+DGAEPSGNSV+ + L++LA+I ++D YR+ AE ++ +F
Sbjct: 546 WQSAGA-PDLILRIKEDYDGAEPSGNSVATLTLLKLAAIT--DRAD-YRKAAEGTMRLFA 601
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HI 655
RL+ AVP M A D S+ K VV+ G+++ + + +L AAH+ Y K V+ ++
Sbjct: 602 DRLQRFPQAVPYMLMAVD-FSLQEPKRVVIAGNRAEPEAQKLLRAAHSVYQPAKVVLGNV 660
Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
P + AR + +C +C P +D ++ LL
Sbjct: 661 GPVE---------------EFARTLPAKQGATVYICTAKACQAPTSDAAKVKQLL 700
>gi|320168532|gb|EFW45431.1| spermatogenesis-associated protein 20 [Capsaspora owczarzaki ATCC
30864]
Length = 832
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 293/783 (37%), Positives = 415/783 (53%), Gaps = 118/783 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME +SF + G+A ++N FV+IKVDREERPDVD+VYM ++ A G GGWP+S
Sbjct: 65 STCHWCHVMEEQSFMNPGIASIMNKNFVNIKVDREERPDVDRVYMAFITATTGHGGWPMS 124
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+L P+ GGTYFPPEDK+G PGF +L K+ W +RD + G ++ L + +
Sbjct: 125 VWLTPELTPIFGGTYFPPEDKWGTPGFPFLLAKIAALWSSRRDEILLKGRGIMQLLEQGI 184
Query: 140 SASASSNKLPDE---------LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
A + +E ++ L L + + +D + GGFG APKFPRPV +Q +
Sbjct: 185 DARLQPTEESNEGAVSDAKQDSARDWLELAFTKFEEEFDPQLGGFGGAPKFPRPVILQFL 244
Query: 191 L------------YHSKKLEDTGKSGEAS------------------------------- 207
L ++ + T AS
Sbjct: 245 LNLYAHFSRVTASLKAQATDATPSPTSASPRLAGAPVAAAAATTLSASPKLKGSRRLSVA 304
Query: 208 -----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 262
+ +M TL M +GG++DH+GGGFHRYSVD+ WHVPHFEKML+DQ QLA Y
Sbjct: 305 ERNCLQTMRMCTTTLDAMHRGGLYDHLGGGFHRYSVDQFWHVPHFEKMLFDQAQLALTYA 364
Query: 263 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 322
F LT+ Y+ +CRD L Y+ RD+ P G FSAEDADS + + K EGA+YVW+
Sbjct: 365 MGFQLTRIPAYAQVCRDTLAYVLRDLAHPLGGFFSAEDADSLPSVTSESKSEGAYYVWSY 424
Query: 323 KEVEDILGE------------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+E+ L + +F + ++P GN + R S+PH E KN L +
Sbjct: 425 EEISTTLSQGDCAAGVASNATDLAVFCYAFGVRPQGN--IRRESNPHGELARKNHLFQEY 482
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
+A +PL N L R +L +R+ RPRPHLDDK+I +WNGL+IS+ A+A
Sbjct: 483 TLQETADHFHLPLADVANRLENARARLHGIRAARPRPHLDDKIIAAWNGLMISALAKAGG 542
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPG 489
++ E +F + A+ AA F+R +Y+ ++ +L S+R+G SK G
Sbjct: 543 VV----EEPLF------------IHAAQKAARFLRGSMYNTESGQLVRSWRDGSASKVGG 586
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLL 548
FL DYAF+I GLLDLYE T WL WA++LQ+ QDELF D GGGYF T+ DPS+L+
Sbjct: 587 FLSDYAFVIQGLLDLYEVDGDTTWLEWALQLQSKQDELFHDPNGGGGYFVTSTHDPSILV 646
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R+K + D AEP+GNS++ INL+RLA++V + R A + + + A+P+
Sbjct: 647 RLKCEEDSAEPAGNSIAAINLLRLANLVNRPE---MRDRAAALITSHQFLFSNAPTALPM 703
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFEN-----------MLAAAHASYDLNKTVIHIDP 657
M A L P+ + VVLV S D AA+ A+ +L V+
Sbjct: 704 MLSALQFLHSPNVQ-VVLVTKNSPTDVPKPKDEPTRPAAAASAASEAATELQSVVLSQCF 762
Query: 658 ADTEEMDFWEEHNSNNAS--MARNNFSA--------DKVVALVCQNFSCSPPVTDPISLE 707
+ + H ++AS RN A ++ A VCQ+F+C PVT L
Sbjct: 763 IPFKSI----VHLQSDASRRFLRNKLPAVDDYQMIDNQPTAYVCQSFACQAPVTSVRELR 818
Query: 708 NLL 710
LL
Sbjct: 819 TLL 821
>gi|189346882|ref|YP_001943411.1| hypothetical protein Clim_1372 [Chlorobium limicola DSM 245]
gi|189341029|gb|ACD90432.1| protein of unknown function DUF255 [Chlorobium limicola DSM 245]
Length = 706
Score = 491 bits (1263), Expect = e-136, Method: Compositional matrix adjust.
Identities = 288/703 (40%), Positives = 393/703 (55%), Gaps = 69/703 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE+E A+LLN F+ +KVDREE PD+D++YMTYVQA G GGWP+SV
Sbjct: 55 TCHWCHVMERESFENEETARLLNGSFIPVKVDREELPDLDRLYMTYVQASTGRGGWPMSV 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
+L+PDLKP GG+YFPPED+YG PGF+T+L + W+ + ++ EQL S
Sbjct: 115 WLTPDLKPFYGGSYFPPEDRYGMPGFRTVLTSIAQLWNTDPARITEASRIFFEQLQS--S 172
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + LP++ A C L+ +YD GGFG APKFPRP + + H+ T
Sbjct: 173 SPMGKSGLPEK--GEAQEACFRWLASAYDPLRGGFGGAPKFPRPALLTFLFSHAFH---T 227
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQ 254
G AS M L TL+ MA+GGIHDHV GGGF RYS DERWH+PHFEKMLYD
Sbjct: 228 GNREAAS----MALHTLKKMAEGGIHDHVHSMGKGGGGFARYSTDERWHLPHFEKMLYDN 283
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
QLA YL+AF ++ + ++ I DI +Y+ DM P G +SAEDADS K+E
Sbjct: 284 AQLAASYLEAFQISGETLFARIAEDIFNYILHDMQSPEGGFYSAEDADSFPDGETQEKRE 343
Query: 315 GAFYVWTSKEVEDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
GAFYVW+ KEV + E LF Y +KP GN DPH EF GKNVL+E +
Sbjct: 344 GAFYVWSWKEVMSLPAEPDKLELFARTYGMKPEGNVS----EDPHGEFGGKNVLMEQSAP 399
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ + L E R+ L++ R +R RP LDDK+I SWNGL+IS+FA+ ++L
Sbjct: 400 EKHE-------KDTVAALDEVRQLLYEKRLQRSRPLLDDKIITSWNGLMISAFAKGYRVL 452
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
E EY+ A +AA FI HLY+E RL +R+G + G +
Sbjct: 453 GHE----------------EYLRAARNAADFILVHLYEENEGRLLRRYRDGDAAITGKAE 496
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAF + GL+DLY+ ++L A L T + LF D GGYF+T +D +V +R+KE
Sbjct: 497 DYAFFVRGLIDLYQACFDNRYLDAADRLCETCNRLFYDHADGGYFSTATDDNTVPVRLKE 556
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
++DGAEP+ +SV ++NL+ LA ++ G+++ Y AE F T L + A+PLM A
Sbjct: 557 EYDGAEPAASSVGILNLLDLA-VMTGNEA--YEGMAEACFRGFGTMLSHNSPALPLMLAA 613
Query: 613 ADMLSVPSRKH---VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
+ +RK VL G+ S + +L ++ Y T++H A
Sbjct: 614 LNN----ARKGGILAVLAGNMQSPRMQELLKTLNSRYLPGLTLMHHASA----------- 658
Query: 670 NSNNASMARNNFSADKVVALV--CQNFSCSPPVTDPISLENLL 710
S S + + + V C +C P T P +L+ LL
Sbjct: 659 GSLKGSEIPADIDPESAIPAVYLCIGHACRLPATTPEALDELL 701
>gi|66826709|ref|XP_646709.1| DUF255 family protein [Dictyostelium discoideum AX4]
gi|60474801|gb|EAL72738.1| DUF255 family protein [Dictyostelium discoideum AX4]
Length = 824
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 277/701 (39%), Positives = 404/701 (57%), Gaps = 62/701 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWC+VME E FE+ +AK++N++ V+IK+DREERPD+DK+YMTY+ + G GGWP+S++
Sbjct: 140 CHWCNVMERECFENVEIAKVMNEYCVNIKIDREERPDIDKIYMTYLTEISGSGGWPMSIW 199
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+P L P+ GGTYF PE KYGRPGF +++K+ W K R+M+ + I+ L E
Sbjct: 200 LTPQLHPITGGTYFAPEAKYGRPGFPDLIKKLDKLWRKDREMVQERADSFIKFLKEEKPM 259
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+N L + + C +Q+ K YD GG+ APKFPR ++L K ED
Sbjct: 260 GNINNALSSQ----TIEKCFQQIMKGYDPIDGGYSDAPKFPRCSIFNLLLMTLK--EDYS 313
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
K + K+V FTL+ MA GG++D VGGGFHRYSV W +PHFEKMLYD QLA+VY
Sbjct: 314 K--QVGSLDKLV-FTLEKMANGGMYDQVGGGFHRYSVTSDWMIPHFEKMLYDNAQLASVY 370
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
LDA+ +TK + + ++IL Y+ + G FSAEDADS E K+EGAFYVW+
Sbjct: 371 LDAYQITKSPLFERVAKEILHYVSTKLTHTLGGFFSAEDADSLNLE-INEKQEGAFYVWS 429
Query: 322 SKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI---ELNDSSASA 376
++++ + + ++ H+ L GN D DPHNEFK KNV+ L +++A
Sbjct: 430 YQDIKKAIQDKDDIEIYSFHHGLIENGNVD--PKDDPHNEFKDKNVITIVKSLKETAAYF 487
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
K +EK LN + + KLF R + +P+P LDDK+IVSWNGL++SSF +A ++ K E
Sbjct: 488 KKTQEEIEKSLN---QSKEKLFKFREQFKPKPQLDDKIIVSWNGLMVSSFCKAYQLFKDE 544
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE--------------QTHRLQHSFR 481
+Y+ A + FI+ HLYD RL +++
Sbjct: 545 ----------------KYLNSAIKSIEFIKTHLYDSVGDDNDYDDEDDKLNNCRLIRNYK 588
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
+GPSK F DDY+FLI LLDLY+ K L WA++LQ QD LF D E GGY++T+G
Sbjct: 589 DGPSKIHAFTDDYSFLIQALLDLYQVTFDYKHLEWAMKLQKQQDNLFYDLENGGYYSTSG 648
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
D S+L R+KE+HDGAEPS S+SV NL++L SI + ++ Y++ A+ +L L+
Sbjct: 649 LDKSILSRMKEEHDGAEPSPQSISVSNLLKLYSI---TYNEAYKEKAKKTLENCSLYLEK 705
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVG----HKSSVDFENMLAAAHASYDLNKTVIHIDP 657
+ P M C+ L + S ++L ++ ++L H++Y NK ++ D
Sbjct: 706 APLVFPQMVCSL-YLYLNSINTIILSTNSNDNQQKQQLLSILDEIHSNYIPNKLILLNDH 764
Query: 658 ADTEEMDFWEEHNSN-NASMARNNFSADKVVALVCQNFSCS 697
++ F+E+ SN N S++ + DK +C C+
Sbjct: 765 SNNSITQFFEKSTSNLNLSLSTPVY--DKTTFSLCNPNGCT 803
>gi|254445309|ref|ZP_05058785.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
gi|198259617|gb|EDY83925.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
Length = 715
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 281/694 (40%), Positives = 403/694 (58%), Gaps = 41/694 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDEG+A +ND FV++K+DREERPDVD++YM+YVQ+ G GGWP+S
Sbjct: 59 STCHWCHVMAHESFEDEGIAGRMNDLFVNVKLDREERPDVDRIYMSYVQSTTGSGGWPMS 118
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDLKP GGTYFPPEDKYGR GF T++ ++ W +R L + G + S+AL
Sbjct: 119 VWLTPDLKPFYGGTYFPPEDKYGRVGFLTLVERIGQLWRDERATLLEYG-----EKSQAL 173
Query: 140 SASASSNKLPDELPQ--NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A ++S L D + + A+ LC EQL YD ++GGFG APKFP P QM+ +
Sbjct: 174 LADSASRNLSDGIGEAAGAIDLCLEQLDTEYDEQWGGFGGAPKFPMPGYFQML------V 227
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ + G A +M+ +L+ MA GGI DHVG GFHRYSVD+ WHVPH+EKMLYDQGQL
Sbjct: 228 DGISRRGNARL-TEMLAGSLEKMADGGIWDHVGSGFHRYSVDKYWHVPHYEKMLYDQGQL 286
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A +Y +A+ LT ++ + + I+ Y+ RD+ G GE+F+AEDADSA + A++ EGAF
Sbjct: 287 AGIYAEAYRLTGRDSFAAVAKGIVRYVARDLQGAAGELFAAEDADSALPDDASKHGEGAF 346
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVW+ E++ +LGE A LF Y +K GN SDPH E KG N L+ +
Sbjct: 347 YVWSKAELDGLLGEDAALFASAYDVKAGGNARPE--SDPHGELKGMNTLMRVASDGELGK 404
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ + + LG C LF+ R RPRPHLDDK +VSWN L+IS A K+ ++ +
Sbjct: 405 RFSLEVSAVRERLGACLGVLFEKRDGRPRPHLDDKALVSWNALMISG---ACKVYQACGD 461
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ + +E+A+ AA F+ ++D R +R G + GF +DYA
Sbjct: 462 A-------------DALELAKKAAVFLFAEMWDAGEGRFARVYRGGCGEQGGFAEDYAAA 508
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
LDLYE W+ A E+ F D + GG+F T D +VL+R+++D+DGA
Sbjct: 509 AGACLDLYEATFDAVWVERAREVLQQLKLRFWDEQRGGFFATEVGDANVLVRLRDDYDGA 568
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
EP+ +S++ + L+RLA+++ K R ++ F + K A+PLM AA
Sbjct: 569 EPAASSLAALALLRLAALLDDEK---LRVLGRETIEAFGEQWKRSPRAMPLMLVAASRF- 624
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ S + +V+VG + + ++A A+ ++ +DPA + E N A
Sbjct: 625 LESDQQIVVVGDLEAAETRELIACANRWRASFSVLVGVDPA----VGLPEVFGGNEKLKA 680
Query: 678 RNNFS-ADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A K + VC+NF+C PV SLE +L
Sbjct: 681 MLEVAEAGKPLVYVCENFACKEPVGSVESLEGIL 714
>gi|451946132|ref|YP_007466727.1| thioredoxin domain-containing protein [Desulfocapsa sulfexigens DSM
10523]
gi|451905480|gb|AGF77074.1| thioredoxin domain-containing protein [Desulfocapsa sulfexigens DSM
10523]
Length = 710
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 279/696 (40%), Positives = 388/696 (55%), Gaps = 49/696 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM +SFED+ +A LN +F+ IKVDREERPDVD++YM QA+ G GGWP+S
Sbjct: 62 STCHWCHVMAHQSFEDQEIADFLNSYFIPIKVDREERPDVDQIYMAATQAMTGSGGWPMS 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL PD +P GTYFPP YGRPGF IL+ +K AW R+ L+ S EQ++ L
Sbjct: 122 LFLFPDTRPFYAGTYFPPRADYGRPGFMEILQAIKTAWLTDRESLSLSA----EQVTSLL 177
Query: 140 SASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
S ++ P+ A L QL +SYD ++GGFG APKFPRPV I +L + K
Sbjct: 178 RKDTSDGRVS---PEKAWLDKGFSQLEESYDPKYGGFGQAPKFPRPVVIDFLLRYYKS-- 232
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ + M L TL+ MA GG++D +GGGFHRYSVD RW VPHFEKMLYDQ QL
Sbjct: 233 -TGRKA----ARDMALVTLEQMAGGGMYDQIGGGFHRYSVDGRWRVPHFEKMLYDQSQLV 287
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL AF LT D Y I ++L+Y+ RDM P G +SAEDADS EGAFY
Sbjct: 288 FAYLSAFQLTGDSAYKEIVVEVLEYVLRDMRHPEGGFYSAEDADSVNPYNLEEHGEGAFY 347
Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+WT +E++ +L E A L K +Y +K GN + DP EF G+N+ + S A
Sbjct: 348 LWTEEEIDTLLTEKQAALIKAYYGVKAKGNA----LHDPQKEFTGRNIFYRDKELSEVAR 403
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
++G+ E+ +IL + RR L R R PHLDDK++ SWNGL+IS+FARA+ +L
Sbjct: 404 EVGLSEEEARDILQDARRSLLSHRQDRTAPHLDDKILTSWNGLMISAFARAAMVLGE--- 460
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
K Y+ A A F+ L + L +R+G ++ LDDY+FL
Sbjct: 461 -------------KRYLAAANQATDFLLDRLTVD--GELVRRWRDGDARYAAGLDDYSFL 505
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ GLLDLY + L A++L +F D +GG F T + +L R++ +DGA
Sbjct: 506 VQGLLDLYLASHDSIRLQAAVDLTEKMIRIFADEKGG--FYDTPQSTQLLTRMRAAYDGA 563
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
EPSGNSV+V+NL+RLA + ++ + A S+ F L A+P+M A D
Sbjct: 564 EPSGNSVAVMNLLRLAGLTGNNE---WVALATESIESFGKTLSTYPPAMPMMLSAMD-FQ 619
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ + +V+ G + D +L+ H+ Y N ++ D ++ F ++
Sbjct: 620 MDKPRQIVIAGTLEADDTRELLSEVHSRYLPNTLLLLADGGKNQQ--FLRGGLPFIGTVK 677
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
+ + + A VC++F+C PV L LL EK
Sbjct: 678 KID---GRATAYVCEDFTCRIPVNTREGLRALLDEK 710
>gi|452825593|gb|EME32589.1| hypothetical protein Gasu_03590 [Galdieria sulphuraria]
Length = 822
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 275/707 (38%), Positives = 395/707 (55%), Gaps = 57/707 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E +A +LN +FVS+KVDREERPDVD VYMT+VQA G GGWP+S
Sbjct: 153 STCHWCHVMEKESFENEQIASILNTYFVSVKVDREERPDVDGVYMTFVQATNGNGGWPMS 212
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+PDL P +G TY PP+ F + L+++ + W ++ + Q G+ + L + L
Sbjct: 213 IFLTPDLVPFVGTTYLPPDR------FASALQQIAEKWRTSKEAIEQEGSRVLNALQQYL 266
Query: 140 SASASSNKLPDELPQNALRLCAEQ----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
A + L N C EQ + +D +GGFG+APKFPRPV + +
Sbjct: 267 DAPRKDDSL------NITTSCLEQGYMEAKEMFDEEYGGFGTAPKFPRPVVYDFLF--TL 318
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
D GK+ A + M L TL MAKGGIHDH+GGGFHRYSVD+ WHVPHFEKMLYDQ
Sbjct: 319 YWFDGGKTERAKDCLNMALQTLSNMAKGGIHDHLGGGFHRYSVDQYWHVPHFEKMLYDQS 378
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG-GEIFSAEDADSAE-------TE 307
QL YLDA+ +TKD + DIL Y+ RDM G FSAEDADS E +
Sbjct: 379 QLLQSYLDAYLITKDESFRDTAIDILSYVLRDMTDKNTGAFFSAEDADSLEPFSTDSSSI 438
Query: 308 GATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
+ KKEGAFY WT E + ILG + L EH+ +KP GN SDP E GKNVL
Sbjct: 439 NSETKKEGAFYTWTDFECKLILGPTTSKLISEHFDIKPEGNARPG--SDPFGELGGKNVL 496
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
+ + +G+ + + E ++KL++ R++R RPHLDDK+I SWN ++I S
Sbjct: 497 YIAKSLTEVSKSMGVSEAEANVAIQEAKQKLWEQRNRRARPHLDDKIITSWNAMMIYSLV 556
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD---EQTHRLQHSFRNG 483
+A +L+ E +Y++ A AA+F++ ++ + ++T + S+R G
Sbjct: 557 KAYIVLEDE----------------QYLQKAMDAATFLKSYMIETTSQETTLIYRSYREG 600
Query: 484 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 543
S GF++DYA I L ++E +WL +AI+LQNTQD F D GGYF+T+ +
Sbjct: 601 RSDVEGFVEDYAHTIRAFLSVFEATGNEEWLKYAIQLQNTQDATFYDEVNGGYFSTSSQA 660
Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 603
++LLR K+D+DG+EPS ++VS NL RL +I +K Y + + ++ F +
Sbjct: 661 KNILLRRKDDYDGSEPSPSAVSGWNLFRLGAITGDTK---YYEKFKSTINAFSIPVNKAP 717
Query: 604 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM 663
VP M +L + + V++V + +++ A + ++ N+ +I + P + +
Sbjct: 718 FGVPAMLINCCLLLKEATRVVLVVDNMKEPRTRDLVNAVVSRFEPNRVLIPLKPDNQRFL 777
Query: 664 DFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S + + D A VC +C PVT L LL
Sbjct: 778 ------SSLSTELKAMKMIEDSPTAYVCFGKTCKNPVTSKEELCALL 818
>gi|390463544|ref|XP_002748471.2| PREDICTED: spermatogenesis-associated protein 20 [Callithrix
jacchus]
Length = 783
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 283/736 (38%), Positives = 406/736 (55%), Gaps = 84/736 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++
Sbjct: 103 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSE-------------- 148
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
T+V A GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 149 -----GTFVSATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 203
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 204 QNKNALLENS----QRVTTALLARSEISVGDRQLPPSAATVNSRCFQQLDEGYDEEYGGF 259
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 260 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 314
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF ++ D FYS + +DIL Y+ R + G
Sbjct: 315 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKDILQYVTRSLSHRSG 374
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + LF +HY L
Sbjct: 375 GFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATELLTSGQLFTKHYGLT 433
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 434 EAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLGVEAVRTLLNTGLEKLFQARKH 491
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+PHLD K++ +WNGL++S +A +L G DR + A + A F
Sbjct: 492 RPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR--LINYATNGAKF 535
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 536 LKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLE 595
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+TQD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 596 WALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 655
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F R++ + +A+P M A + K +V+ G + + D
Sbjct: 656 FT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKD 711
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ ++ H+ Y NK +I AD + + F +++ R D+ A VC+N
Sbjct: 712 TKALVQCVHSVYIPNKVLIL---ADGDPLSFLSRQLPFLSTLRRLE---DQATAYVCENQ 765
Query: 695 SCSPPVTDPISLENLL 710
+CS P+TDP L LL
Sbjct: 766 ACSMPITDPCELRKLL 781
>gi|405953510|gb|EKC21160.1| Spermatogenesis-associated protein 20 [Crassostrea gigas]
Length = 682
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 282/708 (39%), Positives = 386/708 (54%), Gaps = 98/708 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E + ++LN+ FVSIKVDREERPDVD+VYMT++QA GGGGWP+S
Sbjct: 60 STCHWCHVMERESFENEEIGRILNENFVSIKVDREERPDVDRVYMTFIQATVGGGGWPMS 119
Query: 80 VFLSPDLKPLMGGTYFPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
V+L+P+LKPL GGTYFPP+D+ YGRPGFKT+L + + W K +L + + + L E
Sbjct: 120 VWLTPELKPLFGGTYFPPDDRYYGRPGFKTVLTSLAEQWKTKGPVLKEQSSVILRTLQEG 179
Query: 139 LSAS-ASSNKLPDELPQNALRLCAE----QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
SAS A LPD L+ C E QL +S+D GGF PKFP+PV +
Sbjct: 180 TSASEAQGQSLPD------LKDCTEKLYYQLERSFDQEDGGFSKEPKFPQPVNFNFLFRL 233
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
K +D+ S A+ +M FTL MAKGGI DH+
Sbjct: 234 YAKYKDSF-SDMANSSLEMATFTLNKMAKGGIFDHIS----------------------- 269
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
+TK ++ + RDI +Y RD++ P G +SAEDADS T + KK
Sbjct: 270 ------------KITKQDNFAEVVRDIAEYTMRDLLNPCGGFYSAEDADSLPTAESPEKK 317
Query: 314 EGAFYVWTSKEVEDILGEH-------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
EGAF VWT ++++DIL E A +F H+ +K GN D M DPH+E +NVL
Sbjct: 318 EGAFCVWTYQQIQDILKEKVKDNLSLAQIFCYHFNIKEKGNVD--PMQDPHDELLNQNVL 375
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
I + +A K + + ++L +CR L+ R RPRPHLDDK++ +WNGL+IS +
Sbjct: 376 IVKDSVEETAQKFSLNPVEVKDVLEKCRTLLYKERQNRPRPHLDDKIVAAWNGLMISGLS 435
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
+A + L ES +++ A ASF++ H+ S
Sbjct: 436 KAGQAL---GESL-------------FVDQAVKTASFLQSHM---------------SSP 464
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
GF+DDYA++I GLLDLYE +W+ WA ELQ Q+ LF D EGG YF+ +G D S+
Sbjct: 465 IEGFVDDYAYVIRGLLDLYEVCQDEQWVQWAEELQERQNGLFWDSEGGAYFSNSGRDASI 524
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
+LR+K+D DGAEP NSVSV NLVRL +++ Y + A L VF RL + +A+
Sbjct: 525 VLRLKDDQDGAEPCPNSVSVSNLVRLGALLNNQD---YTEKAVTILKVFYERLTKIPIAI 581
Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
P M C +L + K +VLVG +S D + Y NK I D + M
Sbjct: 582 PEMVCGLILLQ-DTPKQIVLVGDPNSDDLTALKNCVAKHYLPNKITITCDGTSDKFMKAK 640
Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
E + S+ + + K A VC+N++C PVT LE +L P
Sbjct: 641 LEFLN---SLTKKD---GKATAYVCENYTCDLPVTSVADLERVLKVNP 682
>gi|170067981|ref|XP_001868692.1| spermatogenesis-associated protein 20 [Culex quinquefasciatus]
gi|167863990|gb|EDS27373.1| spermatogenesis-associated protein 20 [Culex quinquefasciatus]
Length = 763
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 284/717 (39%), Positives = 390/717 (54%), Gaps = 75/717 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE E VA+++N+ FV++KVDREERPD+DK+YMT++ + G GGWP+S
Sbjct: 75 STCHWCHVMEKESFESEEVAEIMNENFVNVKVDREERPDIDKLYMTFILLINGSGGWPMS 134
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL P+ GGTYFPP+D++G PGF TIL K+K W + L ++G I+ + + +
Sbjct: 135 VWLTPDLAPITGGTYFPPKDRWGMPGFTTILLKLKIKWATDGEDLKETGRSIIQAIQKNV 194
Query: 140 SASASSNKLPDELP---QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+K ELP + R +++D +GG PKFP ++ +++H
Sbjct: 195 E---EKHKEEPELPLTVEEKFRQAIMIYRRNFDPVWGGSMGEPKFPEVSKLN-LIFHLHL 250
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
L+ AS+ +VL TL MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQ
Sbjct: 251 LD------PASKLLGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQ 304
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y + + T+ Y + I YL +D+ P G +S EDADS + K EGA
Sbjct: 305 LLMAYANGYKATRKPLYLEVADSIFKYLCKDLRHPAGGFYSGEDADSLPAWDSKDKIEGA 364
Query: 317 FYVWTSKEVEDILGEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
FY WT E++D+ + +F EHY ++PTGN + S SDPH GKN
Sbjct: 365 FYAWTFSEIKDLFNANLEKFGDLGKLNPVEVFTEHYDVQPTGNVEPS--SDPHGHLLGKN 422
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
+LI +A KL E IL L +VR KRPRPHLD K+I +WNGL++S
Sbjct: 423 ILIVYGSLRETALKLDTSEEVVAKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSG 482
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A S++ + +R EY+EVA +FIR +L+D + +L SF
Sbjct: 483 LAELSRVKDA-------------PNRAEYLEVAAKLVAFIRENLFDAKAGKLLRSFYGDD 529
Query: 485 S------KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
S + P GF+DDYAFLI GL+D Y T L WA ELQ QD LF D G Y
Sbjct: 530 SDKAKSLEVPIYGFIDDYAFLIKGLIDYYRASLDTSALRWARELQEIQDRLFWDDTSGAY 589
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH----SL 592
F + +V++R+KEDHDGAEP GNSV+ NL+ L DY+ + A H L
Sbjct: 590 FYSEANSANVVVRLKEDHDGAEPCGNSVAAHNLLLLG--------DYFAEGAFHERARKL 641
Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA-AAHASYDLNKT 651
+ + + +P M AA ++ R ++++G K D N L A Y+
Sbjct: 642 LDYFSNVAPFGYVLPKMMSAA-LMEEHGRDMLIVIGPKG--DQTNALVDAVRNFYNPGLV 698
Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFS--ADKVVALVCQNFSCSPPVTDPISL 706
V+H+DP E + A +NF D A +C + C P+TDP L
Sbjct: 699 VVHLDPTKPSE---------HLAGKKLDNFKMIQDAPTAYICHDKICQLPLTDPDRL 746
>gi|225156854|ref|ZP_03724957.1| protein of unknown function DUF255 [Diplosphaera colitermitum TAV2]
gi|224802800|gb|EEG21050.1| protein of unknown function DUF255 [Diplosphaera colitermitum TAV2]
Length = 758
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 298/730 (40%), Positives = 404/730 (55%), Gaps = 59/730 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFE+E VA +LN+ FVSIKVDREERPDVD++YM YVQA+ G GGWPLS
Sbjct: 48 STCHWCHVMARESFENESVAAVLNEHFVSIKVDREERPDVDRIYMAYVQAMTGRGGWPLS 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLS- 136
+L+PDLKP GGTYFPP D+ GRPGF +L + +AW + +R L A I+ L+
Sbjct: 108 AWLTPDLKPFYGGTYFPPHDQQGRPGFLAVLHAITEAWSDEAERHKLVAESARVIQALTD 167
Query: 137 -----EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
+ S A + L D +A C QL +S+D GGFG APKFPR + L
Sbjct: 168 YHAGKQHASVPAHTRPLHDRA-ADAFEHCFLQLRESFDPAHGGFGGAPKFPRASNLD-FL 225
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ ++ T +S E K+ TL+ M GGIHDHVGGGFHRY+VDE W VPHFEKML
Sbjct: 226 FRVAAIQGT-QSEVGREAVKLATTTLRHMIAGGIHDHVGGGFHRYAVDETWLVPHFEKML 284
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA----ETE 307
YDQ Q+A LDA +T D Y+++ R LDY+ RD+ P G FSAEDADSA + +
Sbjct: 285 YDQAQIAVNLLDAALVTGDERYAWVARSTLDYVLRDLRHPAGGFFSAEDADSAVPHDDGD 344
Query: 308 GATR----KKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMS------DP 356
+ R EGAFYVWT+ E+ IL + A F H+ + + + + + DP
Sbjct: 345 ASPRAHGNHAEGAFYVWTTAELRRILPSDTADRFILHFGVAGSHDANAAEAGNVPPAHDP 404
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
H E GKN+L + +A+ LG+ L VR+ RPRPHLDDK+I +
Sbjct: 405 HGELSGKNILHHTRPIAETAAALGLDPAALAAEFARALETLRAVRAARPRPHLDDKIITA 464
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE-YMEVAESAASFIRRHLYDEQTHR 475
WNGL I++FARA+ + + DR+E Y++ A +AA FI R LYD+
Sbjct: 465 WNGLAITAFARAAASPAACLD-----------DRREFYLDAALTAARFIERELYDDDGGD 513
Query: 476 ------LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 529
L ++R+G + GF +DYAFLI+GLLDL+E WL A LQ T D LF
Sbjct: 514 APARCILWRNWRDGRGASEGFAEDYAFLIAGLLDLHEATLDPHWLRRAARLQETMDHLFW 573
Query: 530 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
D GGYFNT P ++LR+KED+DGAEP+ S++ NL RL+++ + D A
Sbjct: 574 DDAHGGYFNTPAGSPHLVLRLKEDYDGAEPAPGSIAAANLQRLSALF---QDDTLHARAV 630
Query: 590 HSLAVFETRLKDMAMAVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHA-SYD 647
++ + + A+P + A + +L P++ ++L G S DF + A A
Sbjct: 631 RTVESLRGQWETTPHALPALLFALERILEEPAQ--IILAGDPRSHDFRALAAVLRARDKT 688
Query: 648 LNKTVIHIDPADTEEMDFWEEHNSNNA-------SMARNNFSADKVVALVCQNFSCSPPV 700
L + I P + + + NS+ A +A S A VC +C PPV
Sbjct: 689 LRRHTILAAPL-SPALPTTDSPNSDEAWLLERAPWLAGMKPSDGCAAAYVCHGRTCHPPV 747
Query: 701 TDPISLENLL 710
T P +L LL
Sbjct: 748 TTPSALRQLL 757
>gi|21674102|ref|NP_662167.1| hypothetical protein CT1279 [Chlorobium tepidum TLS]
gi|21647257|gb|AAM72509.1| conserved hypothetical protein [Chlorobium tepidum TLS]
Length = 710
Score = 474 bits (1221), Expect = e-131, Method: Compositional matrix adjust.
Identities = 281/719 (39%), Positives = 383/719 (53%), Gaps = 54/719 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F +T R FL +TCHWCHVME ESFE+ A LLN FV +K+DREE PD
Sbjct: 30 GEEAFSRARETGRPIFLSSGYSTCHWCHVMEHESFENAETAALLNRHFVPVKLDREEHPD 89
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD +YM +VQA G GGWP+SV+++PDLKP GG+YFP +++G P F+++L + + W+
Sbjct: 90 VDHLYMMFVQATTGRGGWPMSVWMTPDLKPFFGGSYFPATERWGMPSFRSVLEHLANLWE 149
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
R L S ++QLS + DE+ C L + +D+ +GGFG
Sbjct: 150 HDRPRLLASAGSIMDQLSGLTRPQEGT----DEVTDAHASACLAALERGFDAEWGGFGGE 205
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH------VGGG 232
PKFPRP + + H+ TG M L TL+ MA GGIHDH GGG
Sbjct: 206 PKFPRPAVLSFLFSHAVA---TGN----RHALDMALLTLRKMAAGGIHDHLGVAGLGGGG 258
Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 292
F RYS D WHVPHFEKMLYD QLA YL+A+ + D ++ RDI Y+ DM P
Sbjct: 259 FARYSTDRFWHVPHFEKMLYDNAQLAASYLEAYQASGDELFANTARDIFHYVLCDMTSPE 318
Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
G +SAEDADS + G+ K+EGAFY+WT +E+ +L E A LF Y ++ GN
Sbjct: 319 GAFWSAEDADSLDPYGSGEKREGAFYLWTEQEITGLLDPEEATLFIATYGIRSDGNAPF- 377
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
DPH EF GKN+LI + A +P+E L R+KLF+ R KRPRP LDD
Sbjct: 378 ---DPHGEFTGKNILIRTMSDNELAGTFEIPIETVGKRLNSARKKLFEARKKRPRPGLDD 434
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL++S+ A+ S +L +E AE AA FI L D
Sbjct: 435 KILTSWNGLMLSALAKGSLVLGD----------------TTLLEAAERAARFILDTLCDS 478
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
++ +L +R+G + G DYA LI GLLDLY + WL AI+L Q E F D+
Sbjct: 479 KSGKLLRRYRDGQAAIEGKAADYACLILGLLDLYSASFDSDWLRAAIKLAEAQIERFFDQ 538
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
E G +++T ED SV LR+ ED+D AEPS NSV+ +N +RLA+I D +R A +
Sbjct: 539 EAGVFYSTAVEDHSVPLRMIEDNDNAEPSANSVNALNYLRLAAITG---RDEFRTIALRT 595
Query: 592 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 651
+ F L A+PL+ A ++ S ++ G + + ++A A T
Sbjct: 596 IRHFSGTLDANPSALPLLLV-ARQIATASPVQIIFAGKRGNPALAKLVATAFRHNRPELT 654
Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
VIH D +T E E + A + A +C SC P + + SL+ L
Sbjct: 655 VIHAD--ETCEALLPE-------AAAIGKMHKGEPAAYLCAGGSCQPAIRNAESLDAAL 704
>gi|193212931|ref|YP_001998884.1| hypothetical protein Cpar_1281 [Chlorobaculum parvum NCIB 8327]
gi|193086408|gb|ACF11684.1| protein of unknown function DUF255 [Chlorobaculum parvum NCIB 8327]
Length = 708
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 266/698 (38%), Positives = 396/698 (56%), Gaps = 51/698 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED +A LN FV +K+DREE PD+D+ YM +VQA GWP+S
Sbjct: 51 STCHWCHVMERESFEDPEIAGFLNAHFVPVKLDREEHPDIDRFYMLFVQATTSNAGWPMS 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+++PD KP GG+YFPP +++G P F+++L + W+ R L S ++QL +
Sbjct: 111 VWMTPDRKPFFGGSYFPPAERWGMPSFRSVLETLARMWEHDRPKLLASAGSIMDQLFDIA 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + D +A R C E L++ +D+ +GGFG+APKFP+P + + H+ +
Sbjct: 171 KPQSGPGDVSD---AHAAR-CFEALAQRFDAEWGGFGNAPKFPQPSILGFLFSHAAR--- 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYD 253
TG A M L TL+ MA GG+HD + GGGF RYS D WHVPHFEKMLYD
Sbjct: 224 TGNQTAAD----MALVTLRKMAAGGLHDQLGVTGRGGGGFARYSTDRFWHVPHFEKMLYD 279
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
QLA YL+A+ LT + ++ RDI +Y+ DM P G +SAEDADS + G+ K+
Sbjct: 280 NAQLAASYLEAYQLTGEALFADTARDIFNYVLCDMTSPEGGFWSAEDADSLDPNGSGEKR 339
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT +E+ ++L + A+LF E Y ++P GN + DPH EF G+N+L
Sbjct: 340 EGTFYVWTEEEIGNLLDPDEAVLFMEAYGVRPEGNAPV----DPHGEFIGRNILKRTASD 395
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
++ G+ +++ L E R KLF+ R RPRP LDDK++V+WNG++IS+ A+ + +L
Sbjct: 396 EELTNRFGLSMDEASRRLKEARSKLFESRLTRPRPGLDDKILVAWNGMMISALAKGALVL 455
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+ K+ +E AE AA FI LYD T +L +R+G + G
Sbjct: 456 RD----------------KKLLEAAERAALFILGTLYDSATGKLLRRYRDGEAAIDGKAS 499
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYA +I L+DLY+ ++L AI L TQ E F D++ G +++T +D S LR+ E
Sbjct: 500 DYACMIQALIDLYQASLDPEYLSTAIALAETQIERFFDQKQGVFYSTAFDDESAPLRMIE 559
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D+D AEPS NSVS N +RLA++ D R+ A ++ F + L +A+PLM A
Sbjct: 560 DNDTAEPSPNSVSAFNYLRLAAMTG---RDELREIALRTINFFSSTLDANPVALPLMLAA 616
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
M + +++ G +S + + AA + T++H + E +++ S
Sbjct: 617 RAMADT-APAQLIVSGKRSDPAIQRFVEAASRHFQPELTILHAN----ENVEWLP---SE 668
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++A+++ + A +C C P VT+P L+ LL
Sbjct: 669 AVAIAKDHHG--QPAAWLCAKGQCYPAVTEPEELDTLL 704
>gi|330805805|ref|XP_003290868.1| hypothetical protein DICPUDRAFT_155404 [Dictyostelium purpureum]
gi|325078993|gb|EGC32616.1| hypothetical protein DICPUDRAFT_155404 [Dictyostelium purpureum]
Length = 740
Score = 470 bits (1210), Expect = e-129, Method: Compositional matrix adjust.
Identities = 269/703 (38%), Positives = 400/703 (56%), Gaps = 46/703 (6%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWC VM E FE+ ++K++ND F++IKVDREERPD+DK+YMT++ GGGGWP+S++
Sbjct: 64 CHWCSVMHKECFENPSISKVMNDLFINIKVDREERPDIDKLYMTFLTETTGGGGWPMSIW 123
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+P L+P+ GTYF PE K+GR F + +K+ + W R+ + + G IE L E
Sbjct: 124 LTPSLQPISAGTYFAPEPKFGRAAFPELCKKLNEIWKNDRETVIERGNSFIEYLKEDKPK 183
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
N L +E + C EQ+ K YD GGF APKFPR +L S ++
Sbjct: 184 GNLDNALSEE----TVSKCIEQILKGYDPDDGGFTDAPKFPRCSIFNFLL--SASTQEQL 237
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
KS + S +K+ FTL MA GGI+D +G GFHRYSV W +PHFEKMLYDQGQL VY
Sbjct: 238 KSSKESILEKL-FFTLSKMAYGGIYDQIGFGFHRYSVTPDWKIPHFEKMLYDQGQLVPVY 296
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
LD++ L+K+ + I + L Y++ + G FSAEDADS + K EGAFY+W
Sbjct: 297 LDSYILSKNELFKNISKSTLKYVQNYLTHKDGGFFSAEDADSFNE--SNEKSEGAFYIWN 354
Query: 322 SKEVEDIL---GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
++++ L E ++ Y L GN ++ DPHNEF KN+++ + + +A+
Sbjct: 355 FEDIKKALENDKEAIEIYSFIYGLVENGN--VNPKDDPHNEFIDKNIIMRIKSNQDAANY 412
Query: 379 LGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
++ + L R+KL R +PRP LDDK+IV+WNGL+IS+FARA +I
Sbjct: 413 FKKSTKEIESSLESSRKKLLTYRDTFKPRPPLDDKIIVAWNGLMISAFARAYQI------ 466
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
FP D + Y+E A+ A FI+ +LY++ T L +F++ PS F DDYA L
Sbjct: 467 -----FP----DEESYLESAKRATKFIKDNLYNQATKTLIRNFKDSPSLIHAFADDYASL 517
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHDG 556
I GLLDLY+ ++L WAIELQ QD+LF D + GGYF+T+G+D S+L R+KE+HDG
Sbjct: 518 IQGLLDLYQCTFEIEYLEWAIELQEKQDQLFYDSQLPGGYFSTSGDDKSILHRLKEEHDG 577
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
AE S S+SV NL++L S+ + Y++ A +L L+ + +P M C+ ML
Sbjct: 578 AENSCQSISVSNLLKLYSVTYNQE---YKEKALATLDSCSLYLEKAPIVMPQMMCS--ML 632
Query: 617 SVPSRKHV-----VLVGHK----SSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
+++ +++ K + D + +L ++ + NK + D +D +++ F+
Sbjct: 633 LCKEKENTLNSINIVINSKEYNQTKNDLKQILKQVNSLFIPNKFITVKDISDQKQVQFFN 692
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
E + N ++ DK +C CS + + N+L
Sbjct: 693 EK-TKNLNLINLKPVYDKPSLSLCNPNGCSISSNNLGQITNIL 734
>gi|158296880|ref|XP_317217.4| AGAP008252-PA [Anopheles gambiae str. PEST]
gi|157014924|gb|EAA12337.5| AGAP008252-PA [Anopheles gambiae str. PEST]
Length = 813
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 280/713 (39%), Positives = 380/713 (53%), Gaps = 64/713 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E VAK++N+ F++IKVDREERPD+DK+YM ++ + G GGWP+S
Sbjct: 122 STCHWCHVMEKESFENEEVAKIMNEHFINIKVDREERPDIDKLYMMFILLINGSGGWPMS 181
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL P+ GGTYFPP D++G PGF T+L K+ W +D L +G IE + +
Sbjct: 182 VWLTPDLAPVTGGTYFPPNDRWGMPGFTTVLTKLASKWSTDKDDLVTTGRSVIEAIRRNV 241
Query: 140 S---ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
A + E + + ++YD +GG APKFP ++ +M +H
Sbjct: 242 DHKRADEVEDATNMETLEAKFKQAVNMYQRNYDMVWGGSLGAPKFPEASKLNLM-FHLHV 300
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
E K +VL TL MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQ
Sbjct: 301 QEPKHKV------LGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQ 354
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L ++Y + + LTK Y + I YL +D+ P G +S EDADS T + K EGA
Sbjct: 355 LLSLYANGYRLTKKPSYLAVADAIYRYLCKDLRHPAGGFYSGEDADSLPTAESEEKIEGA 414
Query: 317 FYVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKN 364
FY WT EV+++LG + F E HY +K GN S SDPH GKN
Sbjct: 415 FYAWTYDEVKELLGANGEKFGELGGVDPVAVYAAHYDVKEEGNVKPS--SDPHGHLLGKN 472
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
+LI +A K +E IL L +VR KRPRPHLD K++ +WNGLV+S
Sbjct: 473 ILIVYGSVRETAEKFNTTVEIVERILKTGNELLHEVRDKRPRPHLDTKILCAWNGLVLSG 532
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG- 483
++ + + + R EY+ AE FIR +LYD Q +L S G
Sbjct: 533 LSQLACVKDAPG-------------RSEYLATAEELVKFIRANLYDVQARKLLRSCYGGA 579
Query: 484 ----PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
S+ P GF+DDYAFLI GL+D Y L WA ELQ+ QDELF D + G YF
Sbjct: 580 EESLASERPIYGFIDDYAFLIKGLIDYYVASLDEHALHWAKELQDIQDELFWDTKHGAYF 639
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN--AEHSLAVF 595
+ P+V +R+KEDHDGAEP GNSV+ NL+ L SDY+ + E + +F
Sbjct: 640 YSEANSPNVAVRLKEDHDGAEPCGNSVAAHNLLLL--------SDYFEEERLKEKARTLF 691
Query: 596 E--TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
+ +P M AA +L R +++VG +S + ++ Y ++
Sbjct: 692 DYFAHTAHFGYVLPEMMSAA-LLEEQGRNTLIVVGPESP-EATALVDGVREFYIPGMIIV 749
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
+ D + +N M +N A +C N C PVT+P L
Sbjct: 750 QLK-IDQPAHIVRRRKSLDNFKMVKN-----MPTAYICHNKVCHLPVTEPERL 796
>gi|156058630|ref|XP_001595238.1| hypothetical protein SS1G_03327 [Sclerotinia sclerotiorum 1980]
gi|154701114|gb|EDO00853.1| hypothetical protein SS1G_03327 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 797
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 251/592 (42%), Positives = 356/592 (60%), Gaps = 27/592 (4%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCH+ME ESFE+E VA +LN F+ IK+DREERPD+D++YM +VQA G GGWPL+
Sbjct: 86 SSCHWCHIMERESFENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLN 145
Query: 80 VFLSPDLKPLMGGTYFPPEDKY----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
VFL+P L+P+ GGTY+P K + F IL K+ W ++ Q A ++QL
Sbjct: 146 VFLTPSLEPVFGGTYWPGPSKTKAFEDQVDFLGILDKLSTVWSEQERRCRQDSAQILQQL 205
Query: 136 SEALSASASSNKLPDELPQNALRLCAE---QLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
+ + SN+L D + + L E +KS+D + GGFGSAPKFP P ++ +L
Sbjct: 206 KDFANEGTLSNRLGDAVDNIDIELLEEATQHFAKSFDKKNGGFGSAPKFPTPSKLAFLLR 265
Query: 193 HSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
S+ + D + + + + TL+ MA+GGIHDH+G GF RYSV W +PHFEK
Sbjct: 266 LSQFPQAVLDIVGIPDCENAKNIAITTLRKMARGGIHDHIGNGFARYSVTADWSLPHFEK 325
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
MLYD QL ++YLDAF L++D + + DI DYL + P G +S+EDADS G
Sbjct: 326 MLYDNAQLLHIYLDAFLLSRDPEFLGVAYDIADYLTITLFHPQGGFYSSEDADSYYKAGD 385
Query: 310 TRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
T K+EGA+YVWT +E E+ILG EH + + + GN +++ +DPH+EF +NVL
Sbjct: 386 TEKREGAYYVWTKREFENILGTEHEPILSAFFNVTSHGN--VAQENDPHDEFMDQNVLAI 443
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFAR 427
+ SA A++ GM + + ++ E + KL R + R +P +DDK+IVSWNG+ I + AR
Sbjct: 444 SSTPSALANQFGMKEAEIIKVIKEGKAKLRKRREADRVKPDMDDKIIVSWNGIAIGALAR 503
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
AS ++ F+ PV D Y++ A A FI+ +LYDE++ L +R G
Sbjct: 504 ASAVING------FD-PVKAQD---YLDAALKTAKFIKENLYDEKSKILYRIWREGRGDT 553
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
GF DDYAFL+ GL+DLYE KWL WA ELQ +Q F D GG+F+T P+V+
Sbjct: 554 QGFADDYAFLMEGLIDLYEATFDEKWLQWADELQQSQINFFYDTNKGGFFSTIASAPNVI 613
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
LR+KE D AEPS N S NL RL+SI+ + Y + A ++ FE+ +
Sbjct: 614 LRLKEGMDSAEPSTNGTSSSNLYRLSSIL---NDESYAKKANETVKSFESEM 662
>gi|194334203|ref|YP_002016063.1| hypothetical protein Paes_1395 [Prosthecochloris aestuarii DSM 271]
gi|194312021|gb|ACF46416.1| protein of unknown function DUF255 [Prosthecochloris aestuarii DSM
271]
Length = 720
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 285/703 (40%), Positives = 390/703 (55%), Gaps = 53/703 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE++ +A++LN FV +K+DREERPD+D++YM YVQA G GGWP+S
Sbjct: 56 STCHWCHVMERESFENDEIAQVLNHSFVPVKIDREERPDIDRLYMAYVQASTGSGGWPMS 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+LKP GGTY+PPED++GRPGF ++L + DAW + R L + + L
Sbjct: 116 VWLTPELKPFYGGTYYPPEDRFGRPGFLSLLHSIADAWKEDRKKLEH----VADGIQSQL 171
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ +++ P+ L + L Q+S +D GGF SAPKFPRP + + ++
Sbjct: 172 KSFSTAAPHPESLGEKVLDDAFMQISSHFDPVAGGFSSAPKFPRPSILTFLFNYAYF--- 228
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYD 253
TG+ E M L TL+ MA+GGIHDH+ GGGF RY+ D WHVPHFEKMLYD
Sbjct: 229 TGR----EEASAMALLTLERMARGGIHDHLGVKGKGGGGFARYATDALWHVPHFEKMLYD 284
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
LA +L+AF LTK+ Y+ DI +Y+ DM P G +SAEDADS + K
Sbjct: 285 NALLALSFLEAFQLTKETLYAQTAEDIFNYVLCDMTSPEGAFYSAEDADSFPDRESKTKI 344
Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT E+ ++L +F Y +K GN + DPH F+ KN+L D
Sbjct: 345 EGGFYVWTKTEIAELLDPLEEQIFSFRYGVKQNGNV----LEDPHGTFERKNILSLKADE 400
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+A +P ++ N+ KLF R +RPRP DDK+I SWN L+IS+ A+ S++L
Sbjct: 401 ETTAKHFDLPTDQVANLSRSAIEKLFQARMRRPRPDRDDKIITSWNALMISALAKGSRVL 460
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
++ +Y+ AE AA FI +L++ T L + G S G +
Sbjct: 461 QN----------------TDYLTAAEKAAGFIGDNLFENGTGNLLRRYCKGESGITGQAE 504
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFLI GLLDLYE L A EL Q E F D E GG+FN + ++ SV +R+KE
Sbjct: 505 DYAFLIQGLLDLYEASFDDSLLHKAQELAERQCEHFYDDEHGGFFNASSQEASVPIRLKE 564
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D+DGAEPS NSVSV+N RL ++ G + +Y AE +L F L M +P M
Sbjct: 565 DYDGAEPSANSVSVMNFSRLW-LMTGKQ--HYLDIAEKTLYYFSAILAANGMQLPEMLAG 621
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
L PS V+L G +S F+ + + Y TV+H T+E +
Sbjct: 622 YARLLHPSNT-VILTGSQSDPAFKALKKSVEQLYLPGTTVMHA----TKEKPVSSIPGAE 676
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
AS N+ A +C+ SC PVT P + NLL +PS
Sbjct: 677 TASEENNS-----AAAYICKGGSCRLPVTTPEEVTNLL--RPS 712
>gi|403182450|gb|EAT47160.2| AAEL001725-PA [Aedes aegypti]
Length = 749
Score = 467 bits (1201), Expect = e-128, Method: Compositional matrix adjust.
Identities = 272/712 (38%), Positives = 380/712 (53%), Gaps = 55/712 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E VA ++N+ F++IKVDREERPD+DK+YMT++ + G GGWP+S
Sbjct: 61 STCHWCHVMEKESFENEQVADIMNENFINIKVDREERPDIDKLYMTFILLINGSGGWPMS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL P+ GGTYFPP+D++G PGF TIL K+K+ W + LA +G I+ + +
Sbjct: 121 VWLTPDLAPVTGGTYFPPKDRWGMPGFTTILLKLKNKWITDGEDLASTGKSIIDAIQRNV 180
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
P+ R +++D +GG APKFP ++ ++ + +
Sbjct: 181 EEKHQEEAERVFTPEEKYRQAVTIYKRNFDPVWGGSLGAPKFPEVSKLNLIFHAHLQDPS 240
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T G +VL TL+ MA GGI+DHV GGF RYSVD++WHVPHFEKMLYDQGQL
Sbjct: 241 TKILG-------VVLNTLEKMAAGGIYDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLM 293
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y + + T+ Y + I Y+ +D+ P G +S EDADS T +T K EGAFY
Sbjct: 294 AYANGYKTTRKPLYLEVADSIYRYISKDLQHPAGGFYSGEDADSLPTWESTDKIEGAFYA 353
Query: 320 WTSKEVEDILGEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
WT EV D+L + +F EHY ++ TGN + S SDPH GKN+ I
Sbjct: 354 WTFAEVRDLLKANLDKFGDIGKVDPVEVFTEHYDIQETGNVEPS--SDPHGHLLGKNIPI 411
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
+A K E IL L +VR KRPRPHLD K+I +WNGL++S ++
Sbjct: 412 VYGSVRETADKFETTAEVVGKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSGLSQ 471
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS-- 485
S I + +R Y++ SFIR +LYD Q +L S S
Sbjct: 472 LSCIKDA-------------PNRDNYLKSCSKLVSFIRENLYDVQARKLLRSCYGDESDQ 518
Query: 486 ----KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 539
+ P GF+DDYAFLI GL+D Y T L WA ELQ QDELF D + G YF +
Sbjct: 519 AKSLETPIYGFIDDYAFLIKGLIDYYRASLDTGALSWAKELQEIQDELFWDHKHGAYFYS 578
Query: 540 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
+V++R+KEDHDGAEP GNSVS NL+ L + +R+ A + F + +
Sbjct: 579 EANSANVVVRLKEDHDGAEPCGNSVSAHNLIMLGDYFETAA---FREKANKLFSYF-SNV 634
Query: 600 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
+P M A +L R +V+VG + ++ A Y ++ +DP+
Sbjct: 635 TPFGYVLPEMMSAM-LLQENGRDMLVVVG-PDGPEATALVDAVRDFYMPGLLIVQLDPS- 691
Query: 660 TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+H+ ++ + A +C N C PVT+P L + L+
Sbjct: 692 ------LPDHSLGGKTLKSFKMMNEAPTAYMCHNKVCQLPVTEPEKLADDLV 737
>gi|403418379|emb|CCM05079.1| predicted protein [Fibroporia radiculosa]
Length = 791
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 294/757 (38%), Positives = 398/757 (52%), Gaps = 94/757 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHV+ ESFED+ A L+N+ +++IKVDREERPDVD++YMT++QA GGGGWP+S
Sbjct: 62 SACHWCHVLAHESFEDKVTANLMNEHYINIKVDREERPDVDRLYMTFLQASSGGGGWPMS 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPG-FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
++L+P+L P G P Y PG F+ +L K+ D W+ D SG IE L +A
Sbjct: 122 IWLTPELHPFFAGPSLPVPQTYFPPGRFRQVLYKLADIWESDPDRCRASGKQIIESLRDA 181
Query: 139 LSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMML------ 191
+ + + DELP +L L +L+K +D+R+GGF SAPKFP+P + L
Sbjct: 182 TNVKSGT----DELPVVSLALTVYARLAKRFDTRYGGFSSAPKFPQPSQTTQFLARYAAL 237
Query: 192 -YHSKK-----------------LEDTGKSG-----------------EASEGQKMVLFT 216
HSK E G+ G EA + M T
Sbjct: 238 RMHSKDSGAGEQKNADEVLKHLDAESLGEDGKDSKLSEPSSKPKSKQEEAEHARDMAAET 297
Query: 217 LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL--------- 267
L + KGGIHD V GGF RYSVDERWHVPHFEKMLYDQ QL L+ SL
Sbjct: 298 LVQIYKGGIHDVVEGGFARYSVDERWHVPHFEKMLYDQAQLLTSALELASLLPHSSDGPP 357
Query: 268 ---TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
T+ + + R IL YL R + P G +SAEDADS +T+ KEGAFY WT+ +
Sbjct: 358 LSSTRTTLLA-LARSILIYLPRHLTSPEGGFYSAEDADSLPAADSTKTKEGAFYTWTANQ 416
Query: 325 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
ILGE A + Y +K GNCD M D E KG+NVL + +A K G P+E
Sbjct: 417 FSRILGEDAEVAVWAYGVKEDGNCD--PMHDIQGELKGQNVLFMAHTPEEAAEKFGRPVE 474
Query: 385 KYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
+ L KL R + RPRPHLDDK++ WNGL+IS ARA++ +
Sbjct: 475 EVRCALQHSLDKLRAFRDENRPRPHLDDKILTCWNGLMISGLARATETFE---------- 524
Query: 444 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 503
G + + + +AE +A+F+R LY+E + L S+R G + G DDYAFLI GLLD
Sbjct: 525 ---GEEAVQALTLAERSAAFLRAQLYNEASGELTRSWREG-AGPKGQADDYAFLIQGLLD 580
Query: 504 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 563
LYE ++++WAI LQ QDELF D EG GYF + D +L+R+K+ DGAEPS S
Sbjct: 581 LYEACGKEEYVIWAIRLQEKQDELFFDAEGCGYF-ASAPDEHILIRMKDAQDGAEPSAVS 639
Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
V++ NL+RL S A + Y + A+ LA L A+ M AA M K
Sbjct: 640 VTLSNLLRL-SHFAEDRHKEYDEKAKSILASNAQLLGAAPYALAAMVSAA-MCREKGYKQ 697
Query: 624 VVLVGHKSSVDFEN-MLAAAHASYDLNKTVIHIDPADTEE---------MDFWEEHNSNN 673
++L +S F + L A + N+ +IH+DPA+ + N++
Sbjct: 698 IILT--ESPASFPSPYLKAIRERFVPNRVLIHLDPANPPRKLAKVNGTLRSLLTDINTDR 755
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A + V VCQNF+C P+ D L+ L
Sbjct: 756 SGNADARSAQPNV--RVCQNFTCGLPIRDMAELKAAL 790
>gi|157123455|ref|XP_001653842.1| hypothetical protein AaeL_AAEL001725 [Aedes aegypti]
Length = 752
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 272/715 (38%), Positives = 380/715 (53%), Gaps = 58/715 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E VA ++N+ F++IKVDREERPD+DK+YMT++ + G GGWP+S
Sbjct: 61 STCHWCHVMEKESFENEQVADIMNENFINIKVDREERPDIDKLYMTFILLINGSGGWPMS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL P+ GGTYFPP+D++G PGF TIL K+K+ W + LA +G I+ + +
Sbjct: 121 VWLTPDLAPVTGGTYFPPKDRWGMPGFTTILLKLKNKWITDGEDLASTGKSIIDAIQRNV 180
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
P+ R +++D +GG APKFP ++ ++ + +
Sbjct: 181 EEKHQEEAERVFTPEEKYRQAVTIYKRNFDPVWGGSLGAPKFPEVSKLNLIFHAHLQDPS 240
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T G +VL TL+ MA GGI+DHV GGF RYSVD++WHVPHFEKMLYDQGQL
Sbjct: 241 TKILG-------VVLNTLEKMAAGGIYDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLM 293
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y + + T+ Y + I Y+ +D+ P G +S EDADS T +T K EGAFY
Sbjct: 294 AYANGYKTTRKPLYLEVADSIYRYISKDLQHPAGGFYSGEDADSLPTWESTDKIEGAFYA 353
Query: 320 WTSKEVEDILGEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
WT EV D+L + +F EHY ++ TGN + S SDPH GKN+ I
Sbjct: 354 WTFAEVRDLLKANLDKFGDIGKVDPVEVFTEHYDIQETGNVEPS--SDPHGHLLGKNIPI 411
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
+A K E IL L +VR KRPRPHLD K+I +WNGL++S ++
Sbjct: 412 VYGSVRETADKFETTAEVVGKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSGLSQ 471
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS-- 485
S I + +R Y++ SFIR +LYD Q +L S S
Sbjct: 472 LSCIKDA-------------PNRDNYLKSCSKLVSFIRENLYDVQARKLLRSCYGDESDQ 518
Query: 486 ----KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 539
+ P GF+DDYAFLI GL+D Y T L WA ELQ QDELF D + G YF +
Sbjct: 519 AKSLETPIYGFIDDYAFLIKGLIDYYRASLDTGALSWAKELQEIQDELFWDHKHGAYFYS 578
Query: 540 TGEDPSVLLRVKE---DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
+V++R+KE DHDGAEP GNSVS NL+ L + +R+ A + F
Sbjct: 579 EANSANVVVRLKEGKLDHDGAEPCGNSVSAHNLIMLGDYFETAA---FREKANKLFSYF- 634
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
+ + +P M A +L R +V+VG + ++ A Y ++ +D
Sbjct: 635 SNVTPFGYVLPEMMSAM-LLQENGRDMLVVVG-PDGPEATALVDAVRDFYMPGLLIVQLD 692
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
P+ +H+ ++ + A +C N C PVT+P L + L+
Sbjct: 693 PS-------LPDHSLGGKTLKSFKMMNEAPTAYMCHNKVCQLPVTEPEKLADDLV 740
>gi|119357268|ref|YP_911912.1| hypothetical protein Cpha266_1460 [Chlorobium phaeobacteroides DSM
266]
gi|119354617|gb|ABL65488.1| protein of unknown function DUF255 [Chlorobium phaeobacteroides DSM
266]
Length = 720
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 279/723 (38%), Positives = 385/723 (53%), Gaps = 61/723 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVME ESFED A LLN FV +KVDREE PD
Sbjct: 33 GVEAFAKAKKESKPIFLSVGYSTCHWCHVMERESFEDPRTALLLNTNFVPVKVDREEYPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YMT+VQ+ G GGWP+SV+L+PDL P GG+YFPP D+YG PGF T+L + W
Sbjct: 93 LDRLYMTFVQSTTGRGGWPMSVWLTPDLDPFYGGSYFPPVDRYGMPGFNTLLTSIARLWQ 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS 177
+ A +QL+ SA S K LP ++A C L S+D FGGFG+
Sbjct: 153 TDPQSILDRSALFFQQLN-----SAESVKTEGSLPSKDAANRCFRWLEDSFDRDFGGFGN 207
Query: 178 APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV------G 230
APKFPRPV + + YH TG + M LFTL+ MA+GGIHDH+ G
Sbjct: 208 APKFPRPVLLDFLFNYHYH----TGN----EQALAMALFTLRKMAEGGIHDHLGIPEKGG 259
Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
GGF RYS D WH+PHFEKMLYD QLA ++ AF + D FY+ + DI +Y+ D+
Sbjct: 260 GGFSRYSTDPFWHLPHFEKMLYDNAQLAISFVQAFQCSGDSFYAEVADDIFNYVLTDLAS 319
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI-LGEHAI-LFKEHYYLKPTGNC 348
G +SAEDADS + ++ +EGAFY W+ +EV + +I LF Y ++P GN
Sbjct: 320 SEGAFYSAEDADSLPEQSSSVLEEGAFYRWSHEEVLRLPCSRRSIELFSRLYGIRPEGNV 379
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
++DPHNEF G N+L + + M ++ L E R L + R RPRP
Sbjct: 380 ----LNDPHNEFAGLNILKKESSIEEIGRIFSMREKEVAEALEEVRLALHNARLARPRPF 435
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
LDDK++ SWNGL+IS+ AR ++ K + A A F+ L
Sbjct: 436 LDDKILASWNGLMISALARGYRVFGD----------------KRLLLAANRATEFLLSTL 479
Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
Y+ T +L +RNG + G DDYAF + GLLDLYE + + AI L T LF
Sbjct: 480 YNRHTGKLLRRYRNGSAGIDGKADDYAFFVQGLLDLYEADFDPRHIETAIALTETVILLF 539
Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
D GG+ +T +D S+ R++E++DGAEP+ NSV +NL+RL+ + + Y + A
Sbjct: 540 EDTIKGGFSSTASDDTSLPARMREEYDGAEPAANSVLAMNLLRLSEMTGEER---YNEKA 596
Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 648
E+ F++ L + A+P M A + + +L G +S + + A Y
Sbjct: 597 ENIFKAFDSILDTNSHALPAMLVALNFWE-QKKSLTILNGDPASPVMQELKRAPGRRYLP 655
Query: 649 NKTVIHIDPAD-TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
IH + +D E+ + A + R A VC + +C PV+DPISL
Sbjct: 656 GNVTIHASIRQVVKGLDVLEQIEESPA-IPR---------AYVCLDRACQLPVSDPISLM 705
Query: 708 NLL 710
LL
Sbjct: 706 ALL 708
>gi|189195556|ref|XP_001934116.1| hypothetical protein PTRG_03783 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979995|gb|EDU46621.1| hypothetical protein PTRG_03783 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 748
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 273/698 (39%), Positives = 378/698 (54%), Gaps = 49/698 (7%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE++ VAKLLN+ F+ IK+DREERPDVD++YM YVQA G GGWPL+ F
Sbjct: 69 CHWCHVMERESFENDEVAKLLNENFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNAF 128
Query: 82 LSPDLKPLMGGTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-- 135
++PDL+P+ GGTY+P P GF IL K++D W +R +S QL
Sbjct: 129 ITPDLEPIFGGTYWPGPGSTMAMGEHIGFVGILEKIRDVWRDQRQRCLESAKEITAQLRD 188
Query: 136 -SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
+E + S P+ L + L E K YD GFG APKFP P ++ +L S
Sbjct: 189 FAEDGNISRKDGAAPEGLDLDTLDEAYEHFKKRYDKAHAGFGGAPKFPTPSNLRFLLKLS 248
Query: 195 K---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ + + + + + + M L TL M KGGIHD +G GF RYSV + W +PHFEKML
Sbjct: 249 QYPSAVREVLSAKDCTHAKDMALATLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKML 308
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGAT 310
YDQ QL VYLDA+ +T+ + DI YL M G FS+EDADS
Sbjct: 309 YDQAQLLPVYLDAYLMTRSPEHLSAVHDIATYLTSPPMQAESGGFFSSEDADSLYRPNDK 368
Query: 311 RKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
K+EGAFYVWT KE + ILG+ A + +Y ++ GN ++ D H+E +NVL
Sbjct: 369 EKREGAFYVWTLKEFQQILGDRDAEILARYYNVQDEGN--VAPEHDAHDELINQNVLAVT 426
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARA 428
A + G+ ++ IL E R+KL D R+K RPRP LDDK++VSWNGL I + AR
Sbjct: 427 TTKPDLAQQFGLSEDEVNKILEEGRQKLLDHRNKERPRPGLDDKIVVSWNGLAIGALART 486
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
S L S+ + ++Y+ AE AA+F+R HLY+ + L +R GP AP
Sbjct: 487 SAALSSQDPTR----------SQKYLAAAEKAATFLRAHLYNSTSKTLIRVYREGPGDAP 536
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
GF DDYA+LISGL+DLYE +L WA +LQ TQ +F D++ G+F+T + +++
Sbjct: 537 GFADDYAYLISGLIDLYEATFNDTYLQWADDLQQTQLAMFWDKQHLGFFSTPEDQKDLIM 596
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R+K+ D AEP N VS NL RL +++ + + Y + A + + FE + P
Sbjct: 597 RLKDGMDNAEPGTNGVSAQNLDRLGALL---EHEDYTKKARDTASAFEAEIMQHPFLFPT 653
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVD-----FENMLAAAHASYDLNKTVIHIDPADTEEM 663
M A ++ H V+ G VD + N A L K V
Sbjct: 654 MMDAV-VVGKLGISHSVITGEGKKVDEWLQRYRNRPAGLGTVSKLGKGV----------G 702
Query: 664 DFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
++ + N SM +ADK +VC+N +C +T
Sbjct: 703 EWLKSRNPLVKSM-----NADKEGVMVCENGACREALT 735
>gi|386812871|ref|ZP_10100096.1| conserved hypothetical protein [planctomycete KSU-1]
gi|386405141|dbj|GAB62977.1| conserved hypothetical protein [planctomycete KSU-1]
Length = 704
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 271/715 (37%), Positives = 391/715 (54%), Gaps = 67/715 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFEDE VAK+LN+ FVSIKVDREERPD
Sbjct: 50 GEEAFQKAIRENKPVFLSIGYSTCHWCHVMEYESFEDEEVAKILNENFVSIKVDREERPD 109
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +Y+T QA+ G GGWPL++FL+P+ KP GTYFP ++YG PGF IL+K+ D W
Sbjct: 110 LDNIYITVCQAMTGSGGWPLNLFLTPEKKPFFAGTYFPKTERYGNPGFIAILKKISDLWK 169
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGS 177
++ + S EQ+++ + ++A S P E L + L+ QL ++DS +GGFGS
Sbjct: 170 TNKESVIASS----EQITKVIQSAAIST--PGEILTKETLQHAYAQLRDNFDSIYGGFGS 223
Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
APKFP P +L K+ D ++V TL+ M +GGI+D +GGGFHRYS
Sbjct: 224 APKFPTPHNYTFLLRWWKRSND-------PTALEIVEKTLERMGRGGIYDQLGGGFHRYS 276
Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
DE W VPHFEKMLYDQ A Y + + T VFY+ R I Y+ RDM P G +S
Sbjct: 277 TDEYWLVPHFEKMLYDQALAAIAYTETYQATGKVFYADSVRGIFTYVLRDMTSPEGGFYS 336
Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDP 356
AEDADS EG EG FYVWT E+ ILGE +F ++Y + GN
Sbjct: 337 AEDADS---EGV----EGKFYVWTPDEIIKILGEKEGNIFCDYYDVSKEGN--------- 380
Query: 357 HNEFKGKNVLIELNDSSASASKL-GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
F+ KN+L ++ + SK+ G+ + +L R KLF VR KR PH DDK++
Sbjct: 381 ---FEEKNIL-HVDKPVDTFSKMRGIKPAELEEVLRTAREKLFSVREKRIHPHKDDKILT 436
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
+WNGL+I++ A+ ++ L + +Y + A AA FI L ++
Sbjct: 437 AWNGLMIAALAKGAQAL----------------NEPKYTQAAMRAADFILNTL-RQKDGT 479
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L +R+G + PG+LDDYA+ + GL+DLYE K+L A EL N E F D +GGG
Sbjct: 480 LLRRYRSGEASIPGYLDDYAYFVWGLIDLYEATFEVKYLKIARELNNHMIENFQDEKGGG 539
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
+F + ++ ++ + KE +DGA PSGNSV++ N++RL I ++ + + AE + F
Sbjct: 540 FFFSGKKNEQLITQTKEIYDGATPSGNSVALFNILRLGRITGNTE---FEKIAEQIIRAF 596
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
+K CA D + P+ K +V+ G S D E +L + L + V+ +
Sbjct: 597 GETIKQHPSGYTQFLCALDFVLGPT-KEIVIAGEPGSDDTERILREIGKRF-LPRKVLLL 654
Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
P+ + ++ E + +K A +C N++C+ P D + LL
Sbjct: 655 HPSKDKSIEDIAEF------IKEQKIVDNKATAYICINYACNAPTNDIHKIIQLL 703
>gi|423073704|ref|ZP_17062443.1| hypothetical protein HMPREF0322_01864 [Desulfitobacterium hafniense
DP7]
gi|361855545|gb|EHL07513.1| hypothetical protein HMPREF0322_01864 [Desulfitobacterium hafniense
DP7]
Length = 706
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 280/721 (38%), Positives = 387/721 (53%), Gaps = 65/721 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVME ESFEDE VA+L+N +FV IKVDREERPD
Sbjct: 40 GEEAFAKAKAENKPIFLSIGYSTCHWCHVMERESFEDEEVAQLINRYFVPIKVDREERPD 99
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
VD +YM + QAL G GGWPL++FL+PD KP GTYFP E +YGRPG +L ++ + W
Sbjct: 100 VDHIYMEFCQALTGSGGWPLTLFLTPDERKPFYAGTYFPKESRYGRPGILDLLSQLGELW 159
Query: 118 DKKRDML---AQSGAFAIEQLSEALSASASSNKLPDELP--QNALRLCAEQLSKSYDSRF 172
K + + A S A+ E +S + + D +P + L + L KS+D ++
Sbjct: 160 AKDQPKIRGSADSIYKAVTSREEPSVSSLTPAQQDDFIPWAKEILDTAFQTLQKSFDRQY 219
Query: 173 GGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 232
GGFG APKFP P + +L ++ D G EA + MV TL+ M +GGI DHVG G
Sbjct: 220 GGFGRAPKFPTPHHLTFLLRYA---HDHGDGLEAQQASLMVRTTLERMGQGGIFDHVGFG 276
Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 292
F RYS D RW VPHFEKMLYD LA YL+ + D + R+I Y+ RDM P
Sbjct: 277 FARYSTDRRWLVPHFEKMLYDNALLAIAYLETYQAEHDPYDGQKAREIFAYVLRDMTAPE 336
Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
G +SAEDADS EG EG FYVWT +E+ +ILG E L+ + Y + P GN
Sbjct: 337 GGFYSAEDADS---EGV----EGKFYVWTPQEIHEILGNEEGRLYCQAYGITPEGN---- 385
Query: 352 RMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
F+GK++ L+ D A S L L + R KLF VR +R PH D
Sbjct: 386 --------FEGKSIPNLLDTDWEALESDWQQSLSALKERLEKSREKLFAVRKERIPPHKD 437
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
DK++ SWNGL+I++ A+ +++L A Y E AE A FIR++LY
Sbjct: 438 DKILTSWNGLMIAALAKGTQVLGEPA----------------YAEAAEQAVYFIRKNLYA 481
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
Q RL +R+G S G+LDDYAFLI GL++LY+ + L +A++LQ QDELF D
Sbjct: 482 NQ--RLLARYRDGDSAHLGYLDDYAFLIWGLIELYQASGQKEHLEFALQLQREQDELFWD 539
Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
GYF T + +L+R KE +DGA PSGNS+S +NL+RLA + + + A
Sbjct: 540 GAKSGYFLTGRDAEELLIRPKEIYDGATPSGNSISALNLIRLARLTGDGMLE---ERAYE 596
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ F+ L A SR+ ++L G + ENM +
Sbjct: 597 QINAFKATLAAYPSGYSAFLQAIQFALQESRE-IILAGSLQHPELENMKTMIFKEFRPYT 655
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
T+++ + +E + + +++ ++KV A +CQN++C PV L LL
Sbjct: 656 TLLYEEGTLSELIPWLKDY----------PLDSEKVTAYLCQNYACHKPVYQAEELLALL 705
Query: 711 L 711
+
Sbjct: 706 I 706
>gi|330916342|ref|XP_003297383.1| hypothetical protein PTT_07767 [Pyrenophora teres f. teres 0-1]
gi|311329963|gb|EFQ94518.1| hypothetical protein PTT_07767 [Pyrenophora teres f. teres 0-1]
Length = 747
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 258/627 (41%), Positives = 353/627 (56%), Gaps = 29/627 (4%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME ESFE++ VA LLN+ F+ IK+DREERPDVD++YM YVQA G GGWPL+
Sbjct: 67 ACHWCHVMERESFENDEVANLLNENFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNA 126
Query: 81 FLSPDLKPLMGGTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL- 135
F++PDL+P+ GGTY+P P GF IL K++D W +R +S QL
Sbjct: 127 FITPDLEPIFGGTYWPGPGSTMAMGEHIGFVGILEKIRDVWRDQRQRCLESAKEITAQLR 186
Query: 136 --SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
+E + S P+ L + L E K YD GFG APKFP P ++ +L
Sbjct: 187 DFAEDGNISRKDGAAPEGLDLDTLDEAYEHFKKRYDKAHAGFGGAPKFPTPSNLRFLLKL 246
Query: 194 SK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
S+ + + + + + + M L TL M KGGIHD +G GF RYSV + W +PHFEKM
Sbjct: 247 SQYPSAVREVLGAKDCTHAKDMALATLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKM 306
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGA 309
LYDQ QL VYLDA+ +T+ + DI YL M G FS+EDADS
Sbjct: 307 LYDQAQLLPVYLDAYLMTRSPEHLSAVHDIAAYLTSPPMQAESGGFFSSEDADSLYRPND 366
Query: 310 TRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
K+EGAFYVWT KE + ILG+ A + +Y +K GN ++ D H+E +NVL
Sbjct: 367 KEKREGAFYVWTLKEFQQILGDRDAEILARYYNVKDEGN--VAPEHDAHDELINQNVLAI 424
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFAR 427
A + G+ ++ NIL E R+KL D R+K RPRP LDDK++VSWNGL I + AR
Sbjct: 425 TTTKPDLAQQFGLSEDEVNNILEEGRQKLLDHRNKERPRPGLDDKIVVSWNGLAIGALAR 484
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
S L S+ + ++Y+ AE AASF+R HLY+ + L +R GP A
Sbjct: 485 TSAALSSQDPTR----------SQKYLAAAEKAASFLRAHLYNPTSKTLIRVYREGPGDA 534
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
PGF DDYA+LISGL+DLYE +L WA +LQ TQ +F D++ G+F+T + ++
Sbjct: 535 PGFADDYAYLISGLIDLYEATFNDTYLQWADDLQQTQLAMFWDKQHLGFFSTPEDQKDLI 594
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
+R+K+ D AEP N VS NL RL +++ + + Y + A + + FE + P
Sbjct: 595 MRLKDGMDNAEPGTNGVSAQNLDRLGALL---EHEDYTKKARDTASAFEAEIMQHPFLFP 651
Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVD 634
M A ++ H V+ G V+
Sbjct: 652 TMMDAV-VVGKLGNSHSVITGEGKKVE 677
>gi|431794219|ref|YP_007221124.1| thioredoxin domain-containing protein [Desulfitobacterium
dichloroeliminans LMG P-21439]
gi|430784445|gb|AGA69728.1| thioredoxin domain protein [Desulfitobacterium dichloroeliminans
LMG P-21439]
Length = 698
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 277/720 (38%), Positives = 390/720 (54%), Gaps = 64/720 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F R FL +TCHWCHVME ESFED VA LLN +F++IKVDREERPD
Sbjct: 33 GQEAFAKAKTQNRPIFLSIGYSTCHWCHVMERESFEDHEVADLLNRYFIAIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW- 117
VD +YM + QAL G GGWPL++ ++PD KP GTYFP E +YGRPG +L ++ + W
Sbjct: 93 VDHIYMEFCQALIGSGGWPLTILMTPDQKPFYAGTYFPKESRYGRPGIIDVLHQLGELWR 152
Query: 118 --DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCA--EQLSKSYDSRFG 173
+KK A+S A+ E +AS S++ D P + L A + +S+DS++G
Sbjct: 153 VDEKKVLSSAESIYTAVTTHKELPNASVVSSQEDDFRPWAKVILEAAFQTFQESFDSQYG 212
Query: 174 GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
GF APKFP P + +L ++ D G++ +A + MV TL M +GGI+DH+G GF
Sbjct: 213 GFRQAPKFPTPHNLTFLLRYAY---DHGQAPKAQQATHMVRTTLDAMGQGGIYDHIGFGF 269
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
RYS D+ W VPHFEKMLYD LA YL+++ + R+I Y+ RDM+ P G
Sbjct: 270 ARYSTDQHWLVPHFEKMLYDNALLAIAYLESYQVQHLPRDEQKVREIFAYVLRDMVSPEG 329
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSR 352
+SAEDADS EG EG FYVWT +E+ ++LG A L+ Y + GN
Sbjct: 330 GFYSAEDADS---EGV----EGKFYVWTPQEIHELLGSEAGQLYCRAYDITRDGN----- 377
Query: 353 MSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+GKN+ L+ + +A A + + E+ L E R+ LF R KR PH DD
Sbjct: 378 -------FEGKNIPNLLHTEWTALAEEFNLSREELSLQLEEARKVLFQAREKRIHPHKDD 430
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL+I++ A+ ++IL D Y + AE A SFI +LY +
Sbjct: 431 KILTSWNGLMIAALAKGAQIL----------------DDTTYTDAAEKAVSFIINYLYPK 474
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
Q RL +R+ S G+LDDYAFLI GL++LY L A+ LQ QDELFLD
Sbjct: 475 Q--RLLARYRDRDSAHLGYLDDYAFLIWGLIELYSATGKKDHLGLALSLQKAQDELFLDT 532
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
E GYF T + +L+R KE +DGA PSGNSVS NL+RLA + ++ + A
Sbjct: 533 EQLGYFLTGHDAEELLIRPKEIYDGATPSGNSVSACNLIRLARLTGDI---HWEKRANEQ 589
Query: 592 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 651
L F++ L + + A SR+ +VL G + M Y T
Sbjct: 590 LMAFKSSLSTHSAGYTMFLQALQYALAQSRE-IVLAGPIQHAELSKMKELIFTEYRPYTT 648
Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+++ + +E + + +++ + + + A +CQN+SC PV L +LLL
Sbjct: 649 LLYQEGTLSELIPWLKDYPED----------SKQSTAYICQNYSCLRPVHTAAELPSLLL 698
>gi|195334316|ref|XP_002033829.1| GM21533 [Drosophila sechellia]
gi|194125799|gb|EDW47842.1| GM21533 [Drosophila sechellia]
Length = 808
Score = 454 bits (1167), Expect = e-124, Method: Compositional matrix adjust.
Identities = 267/723 (36%), Positives = 375/723 (51%), Gaps = 75/723 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE A ++N+ FV+IKVDREERPD+DK+YM ++ G GGWP+S
Sbjct: 122 STCHWCHVMEHESFESPETAAIMNENFVNIKVDREERPDIDKIYMQFLLMSKGSGGWPMS 181
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+L PL+ GTYFPP+ +YG P F +L + W+ ++ L +G+ + L +
Sbjct: 182 VWLTPNLAPLVAGTYFPPKSRYGMPSFNAVLNSIARKWETDKESLLTTGSSLLSALKKNQ 241
Query: 140 SASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMM 190
ASA +P+ A E+LS++ +D GGFGS PKFP + +
Sbjct: 242 DASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGFGSEPKFPEVPRLNFL 293
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
+ +D + MV+ TL + KGGIHDH+ GGF RY+ + WH HFEKM
Sbjct: 294 FHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFARYATTQDWHNVHFEKM 346
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYDQGQL + +A+ +T+D Y I YL +D+ P G ++ EDADS T
Sbjct: 347 LYDQGQLMVAFTNAYKVTRDEIYLGYADKIYKYLIKDLRHPLGGFYAGEDADSLPTHEDK 406
Query: 311 RKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHN 358
K EGAFY WT E++ DI + A ++ HY LKP GN + SDPH
Sbjct: 407 VKVEGAFYAWTWDEIQAAFKDQAQRFDDITPDRAFEIYAYHYDLKPPGN--VPTYSDPHG 464
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
GKN+LI + + + +++ +L L +R KRPRPHLD K+I +WN
Sbjct: 465 HLTGKNILIVRGSEEDTCANFKLEADQFKKLLATTNDILHVIRDKRPRPHLDTKIICAWN 524
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GLV+S + ++R++YM+ A+ F+R+ +YD + L
Sbjct: 525 GLVLSGLCKLGN--------------CYSANREQYMQTAKELLDFLRKEMYDPEQKLLIR 570
Query: 479 S----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
S S+ GFLDDYAFLI GLLD Y+ L WA LQ+TQD+LF
Sbjct: 571 SCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDKLF 630
Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
D G YF + + P+V++R+KEDHDGAEPSGNSVS NLV LA D + Q A
Sbjct: 631 WDERNGAYFFSQQDAPNVIVRLKEDHDGAEPSGNSVSAHNLVLLAHYY---DEDAFLQKA 687
Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 648
L F + A+P M A +L + +V V S D + + Y
Sbjct: 688 GKLLNFF-ADVSPFGHALPEMLSA--LLMHENGLDLVAVVGPDSPDTQRFVEICRKFYIP 744
Query: 649 NKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLEN 708
+ ++H+DP++ EE SN + K +CQ +C PVTDP LE+
Sbjct: 745 SMIIVHVDPSNPEEA-------SNQRLQTKFKMVGGKTTVYICQERACRMPVTDPQQLED 797
Query: 709 LLL 711
L+
Sbjct: 798 NLM 800
>gi|451845821|gb|EMD59132.1| hypothetical protein COCSADRAFT_41015 [Cochliobolus sativus ND90Pr]
Length = 799
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 263/630 (41%), Positives = 360/630 (57%), Gaps = 37/630 (5%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE++ VAKLLN+ F+ IK+DREERPDVD++YM YVQA G GGWPL+VF
Sbjct: 120 CHWCHVMERESFENDEVAKLLNEHFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNVF 179
Query: 82 LSPDLKPLMGGTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
++PDL+P+ GGTY+P P GF IL+K++D W +R +S QL +
Sbjct: 180 ITPDLEPIFGGTYWPGPGSTMAMGEHIGFIGILKKIRDVWRDQRQRCLESAKEITAQLRD 239
Query: 138 ALSASASSNKLPDELPQNALRL-----CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
S K D P L L E K YD GFG APKFP P + +L
Sbjct: 240 FAEEGNISRK--DGAPNETLDLELLDEAYEHFKKRYDQVHAGFGGAPKFPTPSNLHFLLK 297
Query: 193 HSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
S+ +++ + + + + M L TL M KGGIHD +G GF RYSV + W +PHFEK
Sbjct: 298 LSQYPNPVKEVLGAKDCTYAKDMALATLSAMNKGGIHDQIGNGFARYSVTKDWSLPHFEK 357
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEG 308
MLYDQ QL VYLDA+ +T+ + DI YL M G +S+EDADS
Sbjct: 358 MLYDQSQLLAVYLDAYLMTRSPEHLGAVHDIATYLTSPPMHAESGGFYSSEDADSLYRPN 417
Query: 309 ATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
K+EGAFYVWT E +DILGE + + +Y +K GN ++ D H+E +NVL
Sbjct: 418 DKEKREGAFYVWTLNEFQDILGERDSEILARYYNVKDEGN--VAPEHDAHDELINQNVLA 475
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFA 426
+ S+ A + G+ +K IL E R+KL + R+K RPRP LDDK++VSWNGL I + A
Sbjct: 476 ITSTSADLAKQFGLSEDKVEKILTEGRQKLLEHRNKERPRPGLDDKIVVSWNGLAIGALA 535
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
R S L S+ + KEY+ AE AA+F+++HLY+ ++ L +R GP
Sbjct: 536 RTSAALASQDPAR----------SKEYLAAAEKAAAFLQKHLYNSESKTLIRVWREGPGD 585
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
APGF DDYA+LISGL++LYE +L WA +LQ TQ ++F D++ G+F+T + +
Sbjct: 586 APGFADDYAYLISGLINLYEATFNDSYLQWADDLQKTQLKMFWDKQHLGFFSTPEDQTDL 645
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
++R+K+ D AEP N VS NL RL +++ S+ Y Q A + + FE +
Sbjct: 646 IMRLKDGMDNAEPGTNGVSAQNLDRLGALLEDSE---YTQRARDTASAFEAEIMQHPFLF 702
Query: 607 PLM--CCAADMLSVPSRKHVVLVGHKSSVD 634
P M A L + +H V+ G VD
Sbjct: 703 PSMMEAVVAGKLGI---RHAVITGDGQKVD 729
>gi|20129985|ref|NP_610953.1| CG8613 [Drosophila melanogaster]
gi|7303195|gb|AAF58258.1| CG8613 [Drosophila melanogaster]
gi|60677913|gb|AAX33463.1| RE10908p [Drosophila melanogaster]
Length = 808
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 267/727 (36%), Positives = 376/727 (51%), Gaps = 83/727 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+ A ++N+ FV+IKVDREERPD+DK+YM ++ G GGWP+S
Sbjct: 122 STCHWCHVMEHESFENPETAAIMNENFVNIKVDREERPDIDKIYMQFLLMSKGSGGWPMS 181
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P L PL+ GTYFPP+ +YG P F T+L+ + W+ ++ L +G+ + L +
Sbjct: 182 VWLTPTLAPLVAGTYFPPKSRYGMPSFNTVLKSIARKWETDKESLLATGSSLLSALQKNQ 241
Query: 140 SASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMM 190
ASA +P+ A E+LS++ +D GGFGS PKFP + +
Sbjct: 242 DASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGFGSEPKFPEVPRLNFL 293
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
+ +D + MV+ TL + KGGIHDH+ GGF RY+ + WH HFEKM
Sbjct: 294 FHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFARYATTQDWHNVHFEKM 346
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYDQGQL + +A+ +T+D Y I YL +D+ P G ++ EDADS T
Sbjct: 347 LYDQGQLMMAFANAYKVTRDEIYLRYADKIHKYLIKDLRHPLGGFYAGEDADSLPTHEDK 406
Query: 311 RKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHN 358
K EGAFY WT E++ DI E A ++ HY LKP GN + SDPH
Sbjct: 407 VKVEGAFYAWTWDEIQAAFKDQAQRFDDITPERAFEIYAYHYGLKPPGN--VPAYSDPHG 464
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
GKN+LI + + + +++ +L L +R KRPRPHLD K+I +WN
Sbjct: 465 HLTGKNILIVRGSEEDTCANFKLEEDRFKKLLATTNDILHVIRDKRPRPHLDTKIICAWN 524
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GLV+S + ++R++YM+ A+ F+R+ +YD + L
Sbjct: 525 GLVLSGLCKLGN--------------CYSANREQYMQTAKELLDFLRKEMYDPEQKLLIR 570
Query: 479 S----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
S S+ GFLDDYAFLI GLLD Y+ L WA LQ+TQD+LF
Sbjct: 571 SCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDKLF 630
Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
D G YF + + P+V++R+KEDHDGAEP GNSVS NLV LA YY +NA
Sbjct: 631 WDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSAHNLVLLAH--------YYDENA 682
Query: 589 ----EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
L F + A+P M A +L + +V V S D + +
Sbjct: 683 YLQKAGKLLNFFADVSPFGHALPEMLSA--LLMHENGLDLVAVVGPDSPDTQRFVEICRK 740
Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPI 704
+ + ++H+DP++ EE SN + K +C +C PVTDP
Sbjct: 741 FFIPSMIIVHVDPSNPEEA-------SNQRLQTKFKMVGGKTTVYICHERACRMPVTDPQ 793
Query: 705 SLENLLL 711
LE+ L+
Sbjct: 794 QLEDNLM 800
>gi|169597471|ref|XP_001792159.1| hypothetical protein SNOG_01521 [Phaeosphaeria nodorum SN15]
gi|160707528|gb|EAT91170.2| hypothetical protein SNOG_01521 [Phaeosphaeria nodorum SN15]
Length = 756
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 276/702 (39%), Positives = 378/702 (53%), Gaps = 48/702 (6%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE++ VA +LN F+ IK+DREERPD+D++YM YVQA GGGGWPL+ F
Sbjct: 68 CHWCHVMERESFENQEVADILNKNFIPIKIDREERPDIDRIYMNYVQATTGGGGWPLNAF 127
Query: 82 LSPDLKPLMGGTYFP-PEDKY---GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
++PDL+P+ GGTY+P PE G PGF IL K++D W +R S QL +
Sbjct: 128 ITPDLEPIFGGTYWPGPESTMAMEGHPGFVGILEKIRDVWQNQRQRCLDSAKEITAQLRD 187
Query: 138 ALSASASSNKLPDE-------LPQNALRLC----AEQLSKSYDSRFGGFGSAPKFPRPVE 186
S K E L +A +C + + YD GFGSAPKFP P
Sbjct: 188 FAEDGNISRKDGAEHDHLDLDLLDDAYEVCEADGPQHFKRRYDQAHAGFGSAPKFPTPSN 247
Query: 187 IQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
+ +L + K+ + + S QKMVL TL M KGGIHD +G GF RYSV + W
Sbjct: 248 LHFLLKLNTYPKQTAQILTAEDISNAQKMVLATLDKMNKGGIHDQIGNGFARYSVTKDWS 307
Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDAD 302
+PHFEKMLYDQ QL VYLDA+ TK DI YL M G FS+EDAD
Sbjct: 308 LPHFEKMLYDQAQLLPVYLDAYLATKRPEMLEAVHDIATYLTTPPMQAESGGFFSSEDAD 367
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
S K+EGAFYVWT KE ++ILG+ A + +Y ++ GN ++ D H+E
Sbjct: 368 SLYRPSDKEKREGAFYVWTLKEFQEILGDRDAEILARYYNVRDEGN--VAPEHDAHDELI 425
Query: 362 GKNVL-IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
+NVL I N + A + + ++ +IL R+KL D R+K RPRP LDDK++VSWNG
Sbjct: 426 NQNVLAINNNTPTDVAKQFALSEDELQSILRSGRQKLLDHRNKERPRPALDDKIVVSWNG 485
Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 479
L I + AR + + ++ S +Y+ AE AA FI++ LY+ + L
Sbjct: 486 LAIGALARTAAAISAQDPSR----------SSQYLAAAEKAAHFIQKELYNPTSKTLTRV 535
Query: 480 FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 539
+R GP APGF DDYA+LISGL+DLYE L WA ELQ TQ +F D++ G+F+T
Sbjct: 536 YREGPGDAPGFADDYAYLISGLIDLYEATFNPSNLQWADELQQTQLSMFWDKQHLGFFST 595
Query: 540 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
+++R+K+ D AEP N VS NL RL +++ ++ Y + A +++ FE +
Sbjct: 596 PENQTDLIMRLKDGMDNAEPGTNGVSARNLDRLGALLEDAE---YVKKARDTVSAFEAEI 652
Query: 600 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
P M A + R HVV+ G E L T+ + D
Sbjct: 653 MQHPFLFPSMLDAVVAGKLGMR-HVVVTGKGEKA--EQWLRRYRERPAGLSTISRV---D 706
Query: 660 TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
T+ D+ ++ N SM A + +VC+N +C +T
Sbjct: 707 TDLGDWLKQRNPLVKSM-----DAGREGVMVCENGACKDGLT 743
>gi|218780669|ref|YP_002431987.1| hypothetical protein Dalk_2829 [Desulfatibacillum alkenivorans
AK-01]
gi|218762053|gb|ACL04519.1| protein of unknown function DUF255 [Desulfatibacillum alkenivorans
AK-01]
Length = 718
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 279/714 (39%), Positives = 374/714 (52%), Gaps = 55/714 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVME ESFED A LLN F+ IKVDREERPD
Sbjct: 54 GDEAFEQAKKEDKPVFLSIGYSTCHWCHVMERESFEDPEAAALLNRHFICIKVDREERPD 113
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYM+ QA+ G GGWP+SVFL+PD +P GTYFP ED GRPG + + + W
Sbjct: 114 IDHVYMSVTQAMTGAGGWPMSVFLTPDKEPFYAGTYFPKEDHMGRPGLMRLATLLGELWK 173
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+R A +Q+ +ALS A K +EL + L L SYD + GGFG
Sbjct: 174 NERSKALN----AAQQVVQALS-QAQPKKGREELGPHTLGKAFAGLKASYDVQQGGFGRG 228
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
KFP P + +L + K+ D +E MV TL M GGI+DHVG G HRY+
Sbjct: 229 NKFPTPHNLTFLLRYWKRTGD-------AEALAMVEKTLTAMRMGGIYDHVGFGIHRYAT 281
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W +PHFEKMLYDQ AN L+A+ T Y+ R+I Y+ RDM P G +SA
Sbjct: 282 DPNWLLPHFEKMLYDQALTANALLEAYQATGKEEYATNAREIFTYVLRDMTSPEGGFYSA 341
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG +EG FYVWT+KE+ +ILG E LF + L GN
Sbjct: 342 EDADS---EG----EEGKFYVWTTKEITEILGKEDGALFISAFNLVKGGNF----FDQAT 390
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
+ G ++ D A+ LGM + + L + R LF R KR P+ DDK++ W
Sbjct: 391 GQKTGDSIPHLQKDPGRLAADLGMEKAELESRLEKIRAALFAEREKRIHPYKDDKILTDW 450
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+I++ A+ +IL E +Y A AA FI L D + H LQ
Sbjct: 451 NGLMIAALAKGGRILGDE----------------KYTLAAVRAADFILDALQDGEGH-LQ 493
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
FR G + PG LDDYAF++ GLL+LYE G KWL A+ L T +LF DR+ GG F
Sbjct: 494 KRFREGEAALPGLLDDYAFMVWGLLELYESTFGVKWLKKAVTLNETMLDLFWDRKNGGLF 553
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+ + +R K+ HDGA+PSGNSV+ +NL+RLA I A + R+ AE L F
Sbjct: 554 MSPVYGEKLFMRGKDLHDGAQPSGNSVAAVNLLRLAGITANEEC---REKAEAILQAFSG 610
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHID 656
+++ + A D + P+ + +V+ G + + D ML + + NK V +
Sbjct: 611 QIEAQPYVYTHLLGALDFIIGPALE-IVICGDQGARDSTVMLDGVNQRFVPNKVLVFRPN 669
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
D +E+D + A + K A VCQ ++C P TDP +L +L
Sbjct: 670 TEDCKELDELAPYTREQACV------QGKATAYVCQGYTCQRPTTDPEALFRIL 717
>gi|333922724|ref|YP_004496304.1| hypothetical protein Desca_0499 [Desulfotomaculum carboxydivorans
CO-1-SRB]
gi|333748285|gb|AEF93392.1| hypothetical protein Desca_0499 [Desulfotomaculum carboxydivorans
CO-1-SRB]
Length = 692
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 267/707 (37%), Positives = 381/707 (53%), Gaps = 65/707 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFE E VA++LN ++V+IKVDREERPD
Sbjct: 33 GEEAFEKAKRENKPVFLSIGYSTCHWCHVMERESFESEDVAEVLNKYYVAIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YMT QAL G GGWPL++ ++PD KP GTYFP YG+PG IL+++ D W
Sbjct: 93 IDQIYMTVCQALTGQGGWPLNIIMTPDQKPFFAGTYFPKNSNYGKPGLIDILQQIADLWA 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K R L + +L+ + + + +L E+ A RL A + +DS +GGFG+
Sbjct: 153 KDRQQLLGISDQLMARLN--MKTATAPGQLSPEVLDKAYRLFA----RHFDSTYGGFGNP 206
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + ++L KK + MV TL M +GGI+DH+G GF RYS
Sbjct: 207 PKFPTPHNLMLLLRCWKKTSQ-------KKALTMVEDTLDAMHRGGIYDHIGFGFSRYST 259
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D RW VPHFEKMLYD LA +L+ + + ++ +S + ++I Y+ RDM P G +SA
Sbjct: 260 DRRWLVPHFEKMLYDNALLAIAFLETYQINRNPRFSRVAKEIFTYVLRDMTAPEGGFYSA 319
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG EG FYVW +EVE +LG+ LF +Y + P GN
Sbjct: 320 EDADS---EGV----EGKFYVWHPQEVEQVLGQIDGQLFCRYYDITPRGN---------- 362
Query: 358 NEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
F+G ++ +N D A +L + LE ++ L +CR+ LF R KR PH DDK++ S
Sbjct: 363 --FEGASIPNLINQDPLKFAQELDITLEDLVDGLEKCRQLLFAQREKRVHPHKDDKILTS 420
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+I++ AR +++L E +Y + AE A FI +L RL
Sbjct: 421 WNGLMIAALARGARVLGDE----------------KYSQAAEKAVDFIYHNL-QRADGRL 463
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+R+G + P +LDDYAFLI GLL+LYE K L A++L ++ +LF DR+ GG+
Sbjct: 464 LARYRDGEAAYPAYLDDYAFLIWGLLELYEATFDIKHLEQAVQLTDSMIDLFWDRQNGGF 523
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F + ++ R KE +DGA PSGNSV+ +NL RLA + ++ Y + A L VF
Sbjct: 524 FFYGKDSEQLISRPKEIYDGAIPSGNSVATVNLFRLARLTGRNR---YEELATKQLQVFA 580
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
L+ + AA + P + +VL G + + M+ + L VI +
Sbjct: 581 GELEHYPIGYSYFMIAAYLNQEPPTE-IVLSGKREDSALKQMIDVVQKEF-LPSAVIAVR 638
Query: 657 PADTEEMDFWEEHNSNNASMARNNFS-ADKVVALVCQNFSCSPPVTD 702
+ ++ A K A VC+NF+C PPVTD
Sbjct: 639 YEGEAAA-----QAEELVPLLKDRLPVAGKATAYVCKNFACQPPVTD 680
>gi|195430492|ref|XP_002063288.1| GK21469 [Drosophila willistoni]
gi|194159373|gb|EDW74274.1| GK21469 [Drosophila willistoni]
Length = 752
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 274/718 (38%), Positives = 372/718 (51%), Gaps = 65/718 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+ A ++N FV+IKVDREERPD+DKVYM ++ G GGWP+S
Sbjct: 66 STCHWCHVMEHESFENPETAAVMNKHFVNIKVDREERPDIDKVYMQFLLLSKGSGGWPMS 125
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL PL GTYFPP ++G P F +L + + W R+ L ++G+ ++ L +
Sbjct: 126 VWLTPDLAPLAAGTYFPPHSRWGMPSFTKVLESIANKWQTDRESLLKAGSTVLKALQKNQ 185
Query: 140 SASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
A+A + + P +A E L+ + YD GGFG PKFP + + +
Sbjct: 186 DAAAVAEAAFE--PGSAEEKLMEALNVHKQRYDQAHGGFGREPKFPEIPRLNFLFHAYLV 243
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+D + MV+ TL + +GGI+DHV GGF RY+ WH HFEKMLYDQGQ
Sbjct: 244 TKDV-------DVLDMVMQTLDHIGRGGINDHVFGGFCRYATTRDWHNVHFEKMLYDQGQ 296
Query: 257 LANVYLDAFSLTK-DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
L Y +A+ LT+ D+F SY + I YL +D+ P G ++ EDADS T T K EG
Sbjct: 297 LMAAYANAYKLTRSDLFLSYADK-IYRYLIKDLRHPAGGFYAGEDADSLPTHQDTVKVEG 355
Query: 316 AFYVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGK 363
AFY WT E+++ A F E HY L+P GN + SDPH GK
Sbjct: 356 AFYAWTWSEIQETFKSQAQCFGEVSPERAFEIYTFHYDLQPKGN--VPPASDPHGHLTGK 413
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
N+LI + S + LE+ IL L VR KRPRPHLD K+I WNGLV+S
Sbjct: 414 NILIVKGSEEDTCSNFNLELEQLQQILETANDILHSVRDKRPRPHLDTKIICGWNGLVLS 473
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS---- 479
++ + ++ R EYM+ A+ F+RR +YD++ LQ S
Sbjct: 474 GLSKLANCGTTK--------------RDEYMQTAKELVDFLRREMYDKERKLLQRSCYGS 519
Query: 480 ------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
+ GFLDDYAFLI GLLD Y+ L WA ELQ +QD+LF D++
Sbjct: 520 GVEDNTLEKNELQIEGFLDDYAFLIKGLLDYYKASLDLSVLSWAKELQESQDKLFWDQQN 579
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
G YF + P+V++R+KEDHDGAEP GNSVS NL L+ S Y + A L
Sbjct: 580 GAYFFSQQNAPNVIVRLKEDHDGAEPCGNSVSARNLTLLSHYYDESS---YLERAGKLLN 636
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
F + A+P M A +L V +VG SS D + + Y ++
Sbjct: 637 FF-ADVSPFGHALPEMLSAL-LLHENGLDLVAVVGPDSS-DTKKFVEICRKFYIPGMIIL 693
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
H+DP + D + N M K +C + C PVTDP+ LE L+
Sbjct: 694 HVDPLHPD--DACNQRVQNKFKMVNG-----KTTVYICHDRVCRMPVTDPVQLEENLM 744
>gi|148656403|ref|YP_001276608.1| hypothetical protein RoseRS_2279 [Roseiflexus sp. RS-1]
gi|148568513|gb|ABQ90658.1| protein of unknown function DUF255 [Roseiflexus sp. RS-1]
Length = 700
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 265/697 (38%), Positives = 377/697 (54%), Gaps = 72/697 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME ESFEDE A L+N F+++KVDREERPD+D +YMT VQA+ G GGWP++V
Sbjct: 57 ACHWCHVMEHESFEDEETAALMNQHFINVKVDREERPDIDAIYMTAVQAMTGSGGWPMTV 116
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD P GTYFPPED++ P F+ +LR V +A+ +R+ L G +E++ EA+S
Sbjct: 117 FLTPDGVPFFAGTYFPPEDRWQMPSFRRVLRSVAEAYASRRNELLARGRELVERMREAIS 176
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
L + A L +++D FGGFG APKFP+P+ ++ +L ++ + T
Sbjct: 177 MHMPGGTLTPAVLDTAF----IGLQQAFDPAFGGFGRAPKFPQPMTLEFLLRYAVR---T 229
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ G +M+ TL+ MA+GG++D +GGGFHRYSVD +W VPHFEKMLYD LA V
Sbjct: 230 GR------GMEMLEMTLRRMAEGGMYDQLGGGFHRYSVDAQWLVPHFEKMLYDNALLARV 283
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL+ F T + Y I + LDY+ R+M P G FS +DADS T AT K EGAF+VW
Sbjct: 284 YLETFQATGNACYRRIAEETLDYMLREMHHPEGGFFSTQDADSLPTPDATHKHEGAFFVW 343
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T E+ + LG AI+F Y + GN F+GKN+L A +G
Sbjct: 344 TPAEIREALGTDAIVFSALYGVTDQGN------------FEGKNILHVRRSPDEVARVMG 391
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
MP+E+ I RR LF+VR +RP P LDDKV+ +WNG+ I +FA +
Sbjct: 392 MPVEQIETIAARGRRILFEVRQRRPMPDLDDKVLTAWNGMAIRAFALGA----------- 440
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
V DR++Y A A F+ +L L+ R + P FL+DYA L G
Sbjct: 441 -----VALDREDYRIAAVRCARFVLTNLRRADGELLRSWRRGVANPTPAFLEDYALLADG 495
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LL LYE WL+ A L ++ E F D GG+++T +++R ++ D A PS
Sbjct: 496 LLALYEATFDPHWLLEARALADSLLERFWDEGLGGFYDTGKNHEQLVIRPRDTGDNATPS 555
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM---------CC 611
G+S +V L+RLA I ++ YR E +L+V E+ VP+M
Sbjct: 556 GSSAAVDVLLRLALIFDEAR---YR---ERALSVLES-------MVPVMQRYPTGFGRYL 602
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
AA ++ + + L+G+ D + + A + N+ ++ P E+
Sbjct: 603 AAAEFALGQPREIALIGNPEDADTQALAAVVLKPFLPNRVIVLARPG--------EDPPR 654
Query: 672 NNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLE 707
+ + D K A VCQN++C PVT+P +LE
Sbjct: 655 IPSPLLNGRGQIDGKATAYVCQNYACQLPVTEPSALE 691
>gi|410980751|ref|XP_003996739.1| PREDICTED: spermatogenesis-associated protein 20 [Felis catus]
Length = 773
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 281/736 (38%), Positives = 389/736 (52%), Gaps = 81/736 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 90 GPEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT++Q W +GG PP + L + W
Sbjct: 150 VDKVYMTFIQVSSVSTYW------------AVGGXXXPPPTPHADLQVCPCLPQ----WK 193
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP + + C +QL +SYD +GGF
Sbjct: 194 QNKNTLLENS----QRVTAALLARSEISMGDRQLPPSGATMNSRCFQQLDESYDEEYGGF 249
Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 250 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 304
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QLA Y AF ++ D FYS + R IL Y+ R++ G
Sbjct: 305 HRYSTDRQWHIPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVARGILQYVARNLSHRSG 364
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
SAEDADS G + KEGAFYVWT KEV+ +L E L +HY L
Sbjct: 365 GFCSAEDADSPPERG-MQPKEGAFYVWTVKEVQQLLSEPVPGATEPLTSGQLLMKHYGLT 423
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN +S DP E G+NVL +A++ G+ +E +L KLF R
Sbjct: 424 EAGN--ISPSQDPKGELHGRNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKH 481
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RPRPHLD K++ SWNGL++S FA +L E + N+ A + A F
Sbjct: 482 RPRPHLDSKMLASWNGLMVSGFAVTGAVLGLE---RLINY-------------ATNGAKF 525
Query: 464 IRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLV 515
++RH++D + RL + G S P GFL+DYAF++ GLLDLYE + WL
Sbjct: 526 LKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLE 585
Query: 516 WAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA+ LQ+ QD LF D +GGGYF + E + L LR+K+D DGAEPS NSVS NL+RL
Sbjct: 586 WALRLQDAQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHG 645
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
G K + L F RL+ + +A+P M A + K +V+ G + D
Sbjct: 646 FT-GHKD--WMDKCVSLLTAFSERLRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKD 701
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
+ +L H+ Y NK +I A+ + F +++ R D+ A VC+N
Sbjct: 702 TKALLQCVHSIYIPNKVLIL---ANGDPSSFLSRQLPFLSTLRRLE---DRATAYVCENQ 755
Query: 695 SCSPPVTDPISLENLL 710
+CS P+T+P L LL
Sbjct: 756 ACSVPITEPCELRKLL 771
>gi|89894906|ref|YP_518393.1| hypothetical protein DSY2160 [Desulfitobacterium hafniense Y51]
gi|89334354|dbj|BAE83949.1| hypothetical protein [Desulfitobacterium hafniense Y51]
Length = 699
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 277/720 (38%), Positives = 385/720 (53%), Gaps = 65/720 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVME ESFEDE VA+L+N +FV IKVDREERPD
Sbjct: 33 GEEAFAKAKAEDKPIFLSIGYSTCHWCHVMERESFEDEEVAQLINRYFVPIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
VD +YM + QAL G GGWPL++FL+PD KP GTYFP E +YGRPG +L ++ + W
Sbjct: 93 VDHIYMEFCQALTGSGGWPLTLFLTPDERKPFYAGTYFPKESRYGRPGILDLLSQLGELW 152
Query: 118 DKKRDMLAQSGAFAIEQLS--EALSASASSNKLPDEL---PQNALRLCAEQLSKSYDSRF 172
K + + S + ++ E S S+ + L D+ + L + L KS+D ++
Sbjct: 153 AKDQPKIRGSADSIYKAVTSREEPSVSSLTPALQDDFIPWAKEILDTAFQTLQKSFDRQY 212
Query: 173 GGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 232
GGFG APKFP P + +L ++ D EA + MV TL+ M +GGI DHVG G
Sbjct: 213 GGFGRAPKFPTPHHLTFLLRYA---HDHSDGLEAQQAALMVRTTLERMGQGGIFDHVGFG 269
Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 292
F RYS D W VPHFEKMLYD LA YL+ + D R+I Y+ RDM P
Sbjct: 270 FARYSTDRHWLVPHFEKMLYDNALLAIAYLENYQAQHDPHDEQKAREIFSYVLRDMTAPE 329
Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
G +SAEDADS EG EG FYVWT +E+ +ILG E L+ + Y + P GN
Sbjct: 330 GGFYSAEDADS---EGV----EGKFYVWTPQEIHEILGSEEGRLYCQAYGVSPEGN---- 378
Query: 352 RMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
F+GK++ L+ D A S+ LE L + R KLF VR +R PH D
Sbjct: 379 --------FEGKSIPNLLDTDWEALGSERQHSLEVLKRRLEKSREKLFAVRKERIPPHKD 430
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
DK++ SWNGL+IS+ A+ +++L A Y E AE A FIR++LY
Sbjct: 431 DKILTSWNGLMISALAKGAQVLGEPA----------------YAEAAEQAVYFIRKNLYA 474
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
Q RL +R+G S G+LDDYAFLI GL++LY+ + L +A++LQ QDELF D
Sbjct: 475 NQ--RLLARYRDGDSAHLGYLDDYAFLIWGLIELYQASGQKEHLEFALQLQREQDELFWD 532
Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
GYF T + +L+R KE +DGA PSGNS+S +NL+RLA + + + A
Sbjct: 533 GAKSGYFLTGRDAEELLIRPKEIYDGATPSGNSISALNLIRLARLTGDGMLE---ERAYE 589
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ F+ L A SR+ ++L G + +NM +
Sbjct: 590 QINAFKATLATYPSGYSAFLQAIQFALQESRE-IILAGSLQHPELKNMKTTIFKKFHPYT 648
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
T+++ + +E + + +++ ++K+ A +CQN++C PV L LL
Sbjct: 649 TLLYEEGTLSELIPWLKDY----------PLDSEKMTAYLCQNYACHKPVHKAEELSALL 698
>gi|194756922|ref|XP_001960719.1| GF13496 [Drosophila ananassae]
gi|190622017|gb|EDV37541.1| GF13496 [Drosophila ananassae]
Length = 797
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 274/737 (37%), Positives = 379/737 (51%), Gaps = 79/737 (10%)
Query: 11 KTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYM 64
K RR + LI +TCHWCHVME ESFE A ++N+ FV+IKVDREERPD+DKVYM
Sbjct: 96 KARRENKLIFLSVGYSTCHWCHVMEHESFESPETAAIMNEHFVNIKVDREERPDIDKVYM 155
Query: 65 TYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML 124
++ G GGWP+SV+L+PDL PL+ GTYFPP+ +YG P F T+L+ + W ++ L
Sbjct: 156 QFLLMSKGSGGWPMSVWLTPDLAPLVAGTYFPPKTRYGMPSFTTVLQNIAKKWQTDKESL 215
Query: 125 AQSGAFAIEQLSEALSASASSNKLPDEL--PQNALRLCAEQLS---KSYDSRFGGFGSAP 179
++G+ L +AL + + +P+ P +A +E ++ + +D GGFGS P
Sbjct: 216 IEAGS----TLVDALKRNQDAEAVPEAAFEPGSAEAKLSEAITVHKQRFDQTHGGFGSEP 271
Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
KFP + + + +D + MVL +L + +GGI+DH+ GGF RY+
Sbjct: 272 KFPEVPRLNFLFHGYLVTKDV-------DVLDMVLQSLDHIGRGGINDHIFGGFARYATT 324
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
WH HFEKMLYDQGQL Y +A+ LT+ + I YL +D+ P G ++ E
Sbjct: 325 RDWHNVHFEKMLYDQGQLMAAYANAYKLTRSETFLGYADKIYKYLVKDLRHPLGGFYAGE 384
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKE------------HYYLKPTGN 347
DADS T T K EGAFY WT +E++ A F+ HY LKP GN
Sbjct: 385 DADSLPTHKDTVKVEGAFYAWTWEEIQSAFKNQAERFEGVSPERAFEIYSFHYGLKPQGN 444
Query: 348 CDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM---PLEKYLNILGECRRKLFDVRSKR 404
+ SDPH GKN+LI A+ S + PLEK L+ + L +R +R
Sbjct: 445 --VPTYSDPHGHLTGKNILIVKGSDEATCSNFNLEAEPLEKLLDTANDI---LHVLRDQR 499
Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
PRPHLD K+I +WNGLV+S ++ + ++ R+EYM+ A+ F+
Sbjct: 500 PRPHLDTKIICAWNGLVLSGLSKLANCGTAK--------------RQEYMQTAKELLEFL 545
Query: 465 RRHLYDEQTHRLQHS----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 514
R+ +YD + L S S+ GFLDDY+FLI GLLD Y+ L
Sbjct: 546 RKEMYDSERKLLLRSCYGVAVGDPRLEKNESEIEGFLDDYSFLIKGLLDYYKASLDLSAL 605
Query: 515 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 574
WA ELQ TQD+LF D G YF + + P+V++R+K+DHDGAEP GNSVS NL L+
Sbjct: 606 NWAKELQETQDKLFWDERNGAYFFSQRDSPNVIVRLKDDHDGAEPCGNSVSARNLTLLSH 665
Query: 575 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 634
D Y Q A L F + A+P M A +L V +VG S D
Sbjct: 666 YY---DEDAYLQRAGKLLNFF-ADVSPFGHALPEMLSAL-LLHENGLDLVAVVGPDSE-D 719
Query: 635 FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
E + Y ++H+DP +E SN + K +C +
Sbjct: 720 TERFVEICRKFYIPGMIILHVDPQHPDEA-------SNQRVQKKFKMVNGKTTVYICHDR 772
Query: 695 SCSPPVTDPISLENLLL 711
C PVTDP LE L+
Sbjct: 773 VCRMPVTDPAQLEQNLM 789
>gi|414153807|ref|ZP_11410129.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
= DSM 18033]
gi|411454828|emb|CCO08033.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
= DSM 18033]
Length = 691
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 266/717 (37%), Positives = 384/717 (53%), Gaps = 69/717 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVME ESFE VA++LN +FVSIKVDREERPD
Sbjct: 34 GEEAFAKAKAEDKPIFLSIGYSTCHWCHVMERESFESADVAEVLNKYFVSIKVDREERPD 93
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD++YM+ QAL G GGWPL+V ++P KP GTYFP E YGRPG IL ++ W+
Sbjct: 94 VDQIYMSVCQALTGSGGWPLTVIMTPQQKPFFAGTYFPKETNYGRPGLIEILTRIAWLWE 153
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+R L G EQL+ L A+ + P +LP + L L+++YD+ +GGFG+A
Sbjct: 154 HERPSLLAMG----EQLTAHLHQEAAVS--PGQLPADILDQAYRLLARNYDASYGGFGTA 207
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L + K + + MV TL M +GGI+DH+G GF RYSV
Sbjct: 208 PKFPTPHNLMFLLRYYYKTKQ-------PQALTMVEETLDAMHRGGIYDHIGFGFARYSV 260
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D +W VPHFEKMLYD LA +L+ + +T ++ + I ++I Y+ RDM P G +SA
Sbjct: 261 DHKWLVPHFEKMLYDNALLALAFLETYQVTGNMRFGRIAKEIFAYVLRDMTSPEGGFYSA 320
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS T EG FY+W +EV DILG+ +F +Y + GN
Sbjct: 321 EDADSEGT-------EGKFYLWQPQEVVDILGQPDGEIFCRYYNITAQGN---------- 363
Query: 358 NEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
F+G N+ LI D A++LG+ L + + +CR LF RSKR P DDK++
Sbjct: 364 --FEGSNIPNLIG-QDPRRFAAELGIELADLVKGMEKCRSLLFKARSKRVHPFKDDKILT 420
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
+WNGL+I++ +R +++ SE Y A A +FI + L R
Sbjct: 421 AWNGLMIAALSRGARVFHSEV----------------YRTAAVKAVNFINQRL-RRPDGR 463
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L FR+G + P +LDDYAFL GLL+LYE T +L A+ L ELFLD++ GG
Sbjct: 464 LLARFRDGEAAFPAYLDDYAFLAWGLLELYEATFDTDYLAEAVRLTEDMIELFLDQQHGG 523
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
+F + ++ R KE +DGA PSGNSV+ +NL+RLA + + +D + + A L F
Sbjct: 524 FFFYGKDSEQLISRPKEIYDGALPSGNSVAAVNLIRLARL---TGNDRFAELAHRQLTGF 580
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-H 654
+++ AA +L P + +VL G + M+ ++ + ++
Sbjct: 581 AQQVEQYPAGYSFFMIAAYLLQEPPLE-IVLTGEAADDSLRRMIQTVQRAFLPHGVIMAR 639
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
+ ADTEE + + R+ + + C+NF+C P+T+ L+ L
Sbjct: 640 YEGADTEE-------PARLLPLTRDKLPVNGQATVYFCENFTCRKPITELSQLQAAL 689
>gi|374856309|dbj|BAL59163.1| hypothetical conserved protein [uncultured candidate division OP1
bacterium]
Length = 683
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 254/690 (36%), Positives = 376/690 (54%), Gaps = 65/690 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME E FE+ +A+ LN+ FVSIKVDREERPD+D++YMT VQ L G GGWPL+
Sbjct: 51 SACHWCHVMERECFENPQIAQYLNEHFVSIKVDREERPDLDEIYMTAVQLLTGQGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP GGTYFPPED++GRPGF T+L+ + + K+R+ + + EQL++ L
Sbjct: 111 VFLTPDLKPFFGGTYFPPEDRWGRPGFLTVLKAITALYQKEREKIVEQA----EQLTQYL 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A + L ++ ++ +S+D GGFG APKFP +E+ ++L + + D
Sbjct: 167 QALQQPRPSSELLTRDLIQRAYLSALQSFDREHGGFGGAPKFPHSLELSLLLRYWHRTRD 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++ +V F+L+ MA+GGI+D +GGGFHRYSVD +W VPHFEKMLYD L
Sbjct: 227 -------ADALHVVEFSLEQMARGGIYDQLGGGFHRYSVDAQWAVPHFEKMLYDNALLVW 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+A+ +T+ Y + + LDY+ R+M G F+++DADS + EGAFY+
Sbjct: 280 TYLEAYQITQKALYRRVVEETLDYVLREMTSSAGGFFASQDADSPD-------GEGAFYL 332
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT +E+E +LG A K Y G + R EF A+K+
Sbjct: 333 WTPEEIEAVLGA-ADGAKACEYFGVAGGASVLRSPYTLEEF---------------AAKM 376
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
M + + L + KLF R +RP+P D+K++ +WNGL+IS+ RA ++L E
Sbjct: 377 KMTISECEGWLARVKEKLFAAREQRPKPARDEKMLTAWNGLMISALVRAYQVLGHE---- 432
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+Y+ A AA F LY + L+HS ++G +K PG+LDDYAFLI
Sbjct: 433 ------------KYLHAAHDAAHFCLNSLYRDGA--LKHSCKDGIAKIPGYLDDYAFLIL 478
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LLDLYE +W+ A L T E F D GGG+F T+ + + +R K +DGA P
Sbjct: 479 ALLDLYESDFDLRWVHAAKTLSATLIEKFWDEHGGGFFFTSSDHEKLPVRPKSFYDGATP 538
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNS + + L+RL + + R AE +L + ++ A+ M A D P
Sbjct: 539 SGNSAATMALLRLVELTGDAA---LRVKAEQTLRLCRDFMEQAPQALSYMLSALDFYLGP 595
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
+ + + +VG + + + + A + NK V+ +P D E + + +
Sbjct: 596 TTQ-IAIVGARGDARTQQFVESIRARFLPNKIVVVSEPGDGE--------RAALIPLVQG 646
Query: 680 NFSADKVVAL-VCQNFSCSPPVTDPISLEN 708
+ A+ +C+N SC P+T+ LE
Sbjct: 647 KGLVNGAPAVYLCKNSSCQAPITEITELER 676
>gi|194883110|ref|XP_001975647.1| GG20445 [Drosophila erecta]
gi|190658834|gb|EDV56047.1| GG20445 [Drosophila erecta]
Length = 805
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 270/739 (36%), Positives = 384/739 (51%), Gaps = 71/739 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFE+ A LN+ FVSIK+DREERPD
Sbjct: 101 GEEAFEKARRENKIIFLSVGYSTCHWCHVMEHESFENPDTAAFLNEHFVSIKLDREERPD 160
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+DK+YM ++ G GGWP++V+L+PDL PL+ GTYFP + +YG F +L+ + W+
Sbjct: 161 IDKIYMKFLLMTKGSGGWPMNVWLTPDLVPLVAGTYFPHKPQYGMHSFIVVLKTIAKKWN 220
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGF 175
++ L +G+ + + E+ SA+ S K +A+ +E ++ + +D +GGF
Sbjct: 221 ADKEFLLTTGSSMLSTILESQSAAEVSFK-----EGSAIDKLSEAINIHKQRFDETYGGF 275
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
GS PKFP I + + +D + MV+ TL + KGGI+DH+ GGF R
Sbjct: 276 GSEPKFPEVPRINFLFHAYLVTKDV-------DVLDMVIETLNQIGKGGINDHIFGGFAR 328
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
Y+ E WH HFEKMLYDQGQL + +A+ +++D + I YL +D+ P G
Sbjct: 329 YATTEDWHNVHFEKMLYDQGQLMGAFANAYKVSRDETFLGYGDKIYKYLVKDLSHPMGGF 388
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLK 343
++ EDADS T K EGAFY WT E++ DI E A ++ HY LK
Sbjct: 389 YAGEDADSLPTHEDKVKVEGAFYAWTWDEIQAAVQDQAQRFDDITAERAFEIYAYHYDLK 448
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
P GN S SDPH GKN+LI + + + +K +L L +R +
Sbjct: 449 PPGNVKAS--SDPHGHLTGKNILIIRGSEEDTCANFKLEADKLKKLLATTNDILHVLREQ 506
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RPRPHLD K+I +WNGLV+S + + ++R++YM+ AE F
Sbjct: 507 RPRPHLDTKIICAWNGLVLSGLCKLAN--------------CYSANREQYMQTAEKLLDF 552
Query: 464 IRRHLYDEQTHRLQHSF-----------RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 512
+R+ +YD + RL S +N P + GFLDDYAFLI GLLD Y+
Sbjct: 553 LRKEMYDPERKRLIRSCYGVAVGDETLEKNEP-QIDGFLDDYAFLIKGLLDYYKATLDVD 611
Query: 513 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 572
L WA ELQ TQD LF D + G YF + + P++++R KEDHDGAEP GNSVS NLV L
Sbjct: 612 VLHWAKELQETQDTLFWDDQNGAYFFSQQDAPNIIMRYKEDHDGAEPCGNSVSAGNLVLL 671
Query: 573 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 632
A S Y Q A L F + A+P M A +L + +V V S
Sbjct: 672 AHYYDESA---YIQKAGKLLNFF-ADVSPFGHALPEMLSA--LLMYENGLDLVAVVGPDS 725
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQ 692
D + + Y + ++H+DP++ EE+ N+ + K +C
Sbjct: 726 PDTQRFVEICRKFYIPSMIIVHVDPSNPEEV-------LNHRLQKKFKMVGGKTTVYICH 778
Query: 693 NFSCSPPVTDPISLENLLL 711
+C PVTDP LE+ L+
Sbjct: 779 ERACRMPVTDPQQLEDNLV 797
>gi|290982332|ref|XP_002673884.1| predicted protein [Naegleria gruberi]
gi|284087471|gb|EFC41140.1| predicted protein [Naegleria gruberi]
Length = 600
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 245/581 (42%), Positives = 338/581 (58%), Gaps = 52/581 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVME ESFE+E +A ++N FV+IKVDREERPD
Sbjct: 38 GEEAFEKARNENKPIFLSIGYSTCHWCHVMEKESFENEEIAAIMNQNFVNIKVDREERPD 97
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKY--GRPGFKTILRKVKDA 116
+D+VYMT+VQ G GGWPLS FL+P LKP+ GGTYFPP++ G F ++L K+ +
Sbjct: 98 IDRVYMTFVQLTTGSGGWPLSCFLTPQLKPIFGGTYFPPKESIYRGNISFPSLLNKIHNM 157
Query: 117 WDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLS-------KSYD 169
W KR+ L G + L +A + + + P + + L+ E ++ S+D
Sbjct: 158 WTNKREALVSQGDKIVSVLKKAFTEKENEEE-PAKSADHILKFAHEYVASTVEDFLSSFD 216
Query: 170 SRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV 229
+ +GGF APKFPRPV I +L + +D + + V FTL MA+GG++DH+
Sbjct: 217 TVYGGFSQAPKFPRPVVIDFLLRSYYEEKDDRRKLDIINS---VTFTLDKMARGGLYDHL 273
Query: 230 GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDM- 288
GGGFHRYSVD WHVPHFEKM+YDQGQLA V+ +A+ T++ +Y I +IL Y+ RDM
Sbjct: 274 GGGFHRYSVDTYWHVPHFEKMMYDQGQLAIVFAEAYKATRNEYYKQILEEILLYIERDMS 333
Query: 289 IGPGGEI---FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG---------EHAILF 336
+G ++ FSAEDADS T + K+EGAFY W ++V DI+ + + +F
Sbjct: 334 LGESSDMIGFFSAEDADSLPTFDSKEKREGAFYAWDYQQVVDIIDNMVPHIGSVKPSDIF 393
Query: 337 KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG-MPLEKYLNILGECRR 395
+ LK GN S SDPH E G NVL + + +P E N++ +C+
Sbjct: 394 SFMFDLKQDGNVRQS--SDPHGELTGLNVLYMDKSLKETQDRFSTIPPESVANVIMDCKD 451
Query: 396 KLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 454
LF R+K +PRPHLDDK+I +WN VIS+F+R++ +L Y+
Sbjct: 452 ILFKERNKMKPRPHLDDKIITAWNAYVISAFSRSALLLSEPG----------------YL 495
Query: 455 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSK---APGFLDDYAFLISGLLDLYEFGSGT 511
++AE AA+FI LYD +T L F+ K GFL DYA +IS L+DLYE
Sbjct: 496 KIAERAANFIYEKLYDRETKVLHRIFKKNSEKERNIAGFLSDYANMISALIDLYEASGSI 555
Query: 512 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
KWL WA ELQ+ QD F D+ GGYF G DP+++ R+KE
Sbjct: 556 KWLNWAFELQDIQDSYFYDQTNGGYFEERGNDPTIIYRLKE 596
>gi|219669354|ref|YP_002459789.1| hypothetical protein Dhaf_3335 [Desulfitobacterium hafniense DCB-2]
gi|219539614|gb|ACL21353.1| protein of unknown function DUF255 [Desulfitobacterium hafniense
DCB-2]
Length = 699
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 276/720 (38%), Positives = 386/720 (53%), Gaps = 65/720 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVME ESFEDE VA+L+N +FV IKVDREERPD
Sbjct: 33 GEEAFAKAKAEDKPIFLSIGYSTCHWCHVMERESFEDEEVAQLINRYFVPIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
VD +YM + QAL G GGWPL++FL+PD KP GTYFP E +YGRPG +L ++ + W
Sbjct: 93 VDHIYMEFCQALTGSGGWPLTLFLTPDERKPFYAGTYFPKESRYGRPGILDLLSQLGELW 152
Query: 118 DKKRDMLAQSGAFAIEQLS--EALSASASSNKLPDEL---PQNALRLCAEQLSKSYDSRF 172
K + + S + ++ E S S+ + L D+ + L + L KS+D ++
Sbjct: 153 AKDQPKIRGSADSIYKAVTSREEPSVSSLTPALQDDFIPWAKEILDTAFQTLQKSFDRQY 212
Query: 173 GGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 232
GGFG APKFP P + +L ++ D EA + MV TL+ M +GGI DHVG G
Sbjct: 213 GGFGRAPKFPTPHHLTFLLRYA---HDHSDGLEAQQAALMVRTTLERMGQGGIFDHVGFG 269
Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 292
F RYS D W VPHFEKMLYD LA YL+ + D R+I Y+ RDM P
Sbjct: 270 FARYSTDRHWLVPHFEKMLYDNALLAIAYLENYQAQHDPHDEQKAREIFSYVLRDMTAPE 329
Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
G +SAEDADS EG EG FYVWT +E+ +ILG E L+ + Y + P GN
Sbjct: 330 GGFYSAEDADS---EGV----EGKFYVWTPQEIHEILGSEEGRLYCQAYGVSPEGN---- 378
Query: 352 RMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
F+GK++ L+ D A S+ LE L + R KLF VR +R PH D
Sbjct: 379 --------FEGKSIPNLLDTDWEALGSERQHSLEVLKRRLEKSREKLFAVRKERIPPHKD 430
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
DK++ SWNGL+I++ A+ +++L A Y E E A FIR++LY
Sbjct: 431 DKLLTSWNGLMIAALAKGAQVLGEPA----------------YAEAVEQAVYFIRKNLYA 474
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
Q RL +R+G S G+LDDYAFLI GL++LY+ + L +A++LQ QDELF D
Sbjct: 475 NQ--RLLARYRDGDSAHLGYLDDYAFLIWGLIELYQASGKKEHLEFALQLQREQDELFWD 532
Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
GYF T + +L+R KE +DGA PSGNS+S +NL+RLA + + + + A
Sbjct: 533 GAKSGYFLTGRDAEELLIRPKEIYDGATPSGNSISALNLIRLARLTGDGELE---KRAYE 589
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ F+ L A SR+ ++L G + +NM A +
Sbjct: 590 QINAFKATLSTYPSGYSAFLQAIQFALQESRE-IILAGPLQHPELKNMKTAIFKKFHPYT 648
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
T+++ + +E + + +++ ++K+ A +CQN++C PV L LL
Sbjct: 649 TLLYEEGTLSELIPWLKDY----------PLDSEKMTAYLCQNYACHKPVHKAEELSALL 698
>gi|156742936|ref|YP_001433065.1| hypothetical protein Rcas_2990 [Roseiflexus castenholzii DSM 13941]
gi|156234264|gb|ABU59047.1| protein of unknown function DUF255 [Roseiflexus castenholzii DSM
13941]
Length = 696
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 266/692 (38%), Positives = 376/692 (54%), Gaps = 58/692 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFEDE A L+N +FV++KVDREERPDVD +YMT VQA+ G GGWP++VF
Sbjct: 58 CHWCHVMEHESFEDEETAALMNRYFVNVKVDREERPDVDSIYMTAVQAMTGSGGWPMTVF 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P GTYFPPED++ P F+ +LR V +A+ +R+ L G +E++ E
Sbjct: 118 LTPDGTPFFAGTYFPPEDRWQMPSFQRVLRSVAEAYATRRNDLLARGRELVERMRE---- 173
Query: 142 SASSNKLP-DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
AS ++P L AL L +++D +GGFG APKFP+P+ ++ +L ++ + T
Sbjct: 174 -ASMMQIPGSTLTPAALDSAFMGLQQAFDPEYGGFGRAPKFPQPMTLEFLLRYAAR---T 229
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ G +M+ TL+ MA+GG++D +GGGFHRYSVD +W VPHFEKMLYD LA V
Sbjct: 230 GR------GMEMLERTLRAMAEGGMYDQIGGGFHRYSVDAQWLVPHFEKMLYDNALLARV 283
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL+ F T + FY I + L Y+ R+M P G FS +DADS T AT K EGAF+VW
Sbjct: 284 YLETFQATGNAFYRRIAEETLTYMLREMQHPDGGFFSTQDADSLPTADATHKHEGAFFVW 343
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T E+ + LG A +F Y + GN F+GKN+L + A +G
Sbjct: 344 TPAEIREALGADATVFSALYGVTDRGN------------FEGKNILHVQRSPAEVARVMG 391
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
M +E+ +I RR LF VR RP+P LDDKV+ +WNG+ + +FA + +L
Sbjct: 392 MSVERVESIAERGRRVLFAVRQHRPKPELDDKVLTAWNGMALRAFALGAIVL-------- 443
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLIS 499
DR+EY A A F+ R L L+ S+R G + P FL+DYA L
Sbjct: 444 --------DREEYRTAAVRCAEFVLRELRRADGELLR-SWRQGVANPTPAFLEDYALLAD 494
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLL LYE +WL+ A L + E F D GG+++T +++R ++ D A P
Sbjct: 495 GLLALYEATFDPRWLLEARALADALLERFWDDGIGGFYDTGSHHEQLVIRPRDTGDNATP 554
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSV 618
SG+S + L+RLA I + YR+ A L+ ++ AA+ LS
Sbjct: 555 SGSSAAADVLLRLALIFDEPR---YRERALTVLSAMAPLMERYPTGFGRYLAAAEFALSQ 611
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P + + L+G + D + A A + N+ V+ P + + + +A
Sbjct: 612 P--REIALIGDPEAADTRALAAIALKPFLPNRVVVLARPGE-------DPPRIPSPLLAG 662
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VCQN++C PVT P L L
Sbjct: 663 RTPIDGRAAAYVCQNYACRLPVTKPADLAAQL 694
>gi|451995214|gb|EMD87683.1| hypothetical protein COCHEDRAFT_21080 [Cochliobolus heterostrophus
C5]
Length = 734
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 266/651 (40%), Positives = 368/651 (56%), Gaps = 40/651 (6%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ + K+ R F+ CHWCHVME ESFE++ VA LLN+ F+ IK+DREERPD
Sbjct: 36 GQEAIGLAKKSNRLIFISIGYAACHWCHVMERESFENDEVANLLNEHFIPIKIDREERPD 95
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKYG---RPGFKTILRKVK 114
VD++YM YVQA G GGWPL+VF++PDL+P+ GGTY+P P GF IL+K++
Sbjct: 96 VDRIYMNYVQATTGSGGWPLNVFITPDLEPIFGGTYWPGPGSTMAMGEHIGFVGILKKIR 155
Query: 115 DAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRF-- 172
D W +R +S QL + S K D P L L E L ++Y++
Sbjct: 156 DVWRDQRQRCLESAKEITAQLRDFAEEGNISRK--DGAPNETLDL--ELLDEAYEASTTF 211
Query: 173 -GGFGSAPKFPRPVEIQMMLYHSKK---LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
FG APKFP P + +L S+ +++ + + + + M L TL M KGGIHD
Sbjct: 212 ASSFGGAPKFPTPSNLHFLLKLSQYPNLVKEVLGAKDCTRAKDMALATLSAMNKGGIHDQ 271
Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-D 287
+G GF RYSV + W +PHFEKMLYDQ QL VYLDA+ +T+ + DI YL
Sbjct: 272 IGNGFARYSVTKDWSLPHFEKMLYDQSQLLAVYLDAYLMTRSPEHLEAVHDIATYLTSPP 331
Query: 288 MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTG 346
M G +S+EDADS K+EGAFYVWT KE +DILGE + + +Y +K G
Sbjct: 332 MHAESGGFYSSEDADSLYRPNDKEKREGAFYVWTLKEFQDILGERDSEILARYYNVKDEG 391
Query: 347 NCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RP 405
N ++ D H+E +NVL + + A + G+ EK IL E R+KL + R+K RP
Sbjct: 392 N--VAPEHDAHDELINQNVLAITSTPADLAKQFGLSEEKVKRILTEGRQKLLEHRNKERP 449
Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
RP LDDK++VSWNGL I + AR S L S+ + KEY+ AE AA+F++
Sbjct: 450 RPGLDDKIVVSWNGLAIGALARTSAALASQDPTR----------SKEYLAAAEKAAAFVQ 499
Query: 466 RHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQD 525
+HLY ++ L +R GP APGF DDYA+LISGL+DLYE +L WA +LQ TQ
Sbjct: 500 KHLYHSESKTLIRVWREGPGDAPGFADDYAYLISGLIDLYEATFNDSYLQWADDLQKTQL 559
Query: 526 ELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
++F D++ G+F+T + +++R+K+ D AEP N VS NL RL +++ S+ Y
Sbjct: 560 KMFWDKQHLGFFSTPEDQTDLIMRLKDGMDNAEPGTNGVSAQNLDRLGALLEDSE---YT 616
Query: 586 QNAEHSLAVFETRLKDMAMAVPLM--CCAADMLSVPSRKHVVLVGHKSSVD 634
Q A + + FE + P M A L + H V+ G+ VD
Sbjct: 617 QRARDTASAFEAEIMQHPFLFPSMMDAVVAGKLGI---THAVITGNGQKVD 664
>gi|308274671|emb|CBX31270.1| Spermatogenesis-associated protein 20 [uncultured Desulfobacterium
sp.]
Length = 633
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 238/572 (41%), Positives = 343/572 (59%), Gaps = 40/572 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF D +AK++ND F+ IKVDREERPD+D++Y++ V AL G GWPL+
Sbjct: 43 STCHWCHVMENESFTDHEIAKIMNDNFICIKVDREERPDLDRIYISAVTALTGSAGWPLN 102
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK---KRDMLAQSGAFAIEQLS 136
VFL+P LKP GGTYFP E +G + +L ++ W +D+++ S E+++
Sbjct: 103 VFLTPKLKPFFGGTYFPAESNFGITSWPDLLNRITSVWKDPVVHKDIISSS-----EKIT 157
Query: 137 EALSASASSNKL---PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
+ + + S +K+ ++ Q+ L + S SYD ++ GFG APKFP P I+ +L +
Sbjct: 158 DIIIKNLSYDKVFSTAEKHKQSHLDDAFKYYSSSYDEKYAGFGKAPKFPSPSIIKFILAY 217
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+ + A M +TL+ MAKGGI+D + GGFHRYS DE+WH+PHFEKMLYD
Sbjct: 218 FSYAKKINEPAVAKRTIDMADYTLKAMAKGGIYDQLRGGFHRYSTDEKWHIPHFEKMLYD 277
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE-------T 306
QL NVYL+A+ +T D F++ I ++ DY+ DM G +SAEDADS +
Sbjct: 278 NAQLVNVYLEAYQITSDKFFAQIAKETCDYILSDMTSSPGGFYSAEDADSYPGQISEKGS 337
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
+ A K EGAFYVW+ KE++ IL E+ A +F + + GN DPH FK KN+
Sbjct: 338 DDAHNKVEGAFYVWSKKELDKILEENTAEIFSYFFGVMEEGNA----AHDPHGYFKKKNI 393
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
L + + +A K M +K I+ + + KL RS R RPHLDDK++ SWNGL+IS+F
Sbjct: 394 LYVKHSINETAKKYNMAPDKVELIINDAKNKLLKARSSRERPHLDDKILTSWNGLMISAF 453
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
A+A K+L GSD+ Y++ A++AA FI +LYD+ T +L +R G
Sbjct: 454 AKAYKVL--------------GSDK--YLQAAKNAAEFIISNLYDKNTGKLFRRWREGER 497
Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE-DP 544
G DYAF I GL+DLYE S KWL A+ L +LF D + G++ T+ + D
Sbjct: 498 AVLGMGSDYAFYICGLIDLYESDSDKKWLETAVMLSEEYIKLFYDEQFAGFYITSPDHDK 557
Query: 545 SVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
++++R K+D D P+ SV++ NL+RL+ I
Sbjct: 558 NLIIRAKDDSDSVIPAHGSVAIQNLLRLSKIT 589
>gi|341899864|gb|EGT55799.1| hypothetical protein CAEBREN_04954 [Caenorhabditis brenneri]
Length = 731
Score = 447 bits (1150), Expect = e-122, Method: Compositional matrix adjust.
Identities = 276/729 (37%), Positives = 383/729 (52%), Gaps = 65/729 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F +T + FL +TCHWCHVME ESFE+E AK+LN+ FV+IKVDREERPD
Sbjct: 45 GEEAFQKAKETNKPIFLSVGYSTCHWCHVMEKESFENENTAKILNENFVAIKVDREERPD 104
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YM +V A G GGWP+SVFL+PDL P+ GGTYFPP+D G GF TIL + W
Sbjct: 105 VDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEWQ 164
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K+ + L GA I+ L + S N+ D ++DSR GGFG A
Sbjct: 165 KEGENLRTRGAQIIKLLQPEMK-SGDVNRSED-----VFESIYSHKKSTFDSRLGGFGRA 218
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP+ + ++ + S E E M+ TL+ MA GGIHDH+G GFHRYSV
Sbjct: 219 PKFPKAPDFDFLIAFAS---SQSNSKEKQESIMMLQKTLESMADGGIHDHIGNGFHRYSV 275
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDILDYLRRDMIGPGGEIF 296
D WH+PHFEKM+YDQ QL Y + LT K + DI +Y+++ GG +
Sbjct: 276 DSEWHIPHFEKMIYDQSQLLASYSEFHRLTEKKHENIKLVINDIFEYMQKISHKDGG-FY 334
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
+AEDADS T +T K EGAF W E++ +LGE I +F +++ ++ GN
Sbjct: 335 AAEDADSLPTHESTEKVEGAFCAWERDEIKQLLGEKKIESASLFDVFVDYFDVEENGN-- 392
Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
+++ SDPH E K KNVL +L A+ G+ +E+ N + E R L+ R+KRP PHL
Sbjct: 393 VAKSSDPHGELKNKNVLRKLLTDEECATNHGITVEQLKNGIDEAREILWIARTKRPSPHL 452
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
D K++ +W GL I+ +A + ++ +Y+E AE A+F+ ++L
Sbjct: 453 DSKMVTAWQGLAITGLVKAYQ----------------ATNEPKYVERAEKCAAFVEKYL- 495
Query: 470 DEQTHRLQHS--------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
E+ L+ S G + F DDYAFLI GLLDLY ++L +I+LQ
Sbjct: 496 -EENGELRRSVYLGDNGEVEQGNQRMKAFSDDYAFLIQGLLDLYTVAGKNEYLERSIKLQ 554
Query: 522 NTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 581
T DE F G GYF + D V +R+ ED DGAEP+ S++ NL+R I+ ++
Sbjct: 555 KTCDEKFWS--GNGYFISEKSDEVVSVRMIEDQDGAEPTATSIASNNLLRFYDIL---EN 609
Query: 582 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 641
+ YR+ A RL + +A+P M A + S VLVG S
Sbjct: 610 EEYRERANQCFRGASERLNKIPIALPKMAVALQRWQLGSTT-FVLVGDPVSELLTEARNQ 668
Query: 642 AHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ N +V+HI E D +S+NA MA+ + +C+ F C PV
Sbjct: 669 LNQKLINNLSVVHI----RSENDVSASGSSHNA-MAQ----GPQPAVYLCKGFVCGLPVR 719
Query: 702 DPISLENLL 710
LE L
Sbjct: 720 KIDKLEQLF 728
>gi|268530908|ref|XP_002630580.1| Hypothetical protein CBG13036 [Caenorhabditis briggsae]
Length = 724
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 281/731 (38%), Positives = 385/731 (52%), Gaps = 63/731 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F ++ + FL +TCHWCHVME ESFE+E AKLLND FV+IKVDREERPD
Sbjct: 36 GEEAFQKARESNKPIFLSVGYSTCHWCHVMEKESFENENTAKLLNDNFVAIKVDREERPD 95
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YM +V A G GGWP+SVFL+PDL P+ GGTYFPP+D G GF TIL + + W
Sbjct: 96 VDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHEEWQ 155
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K+ + L GA I+ L L+ S N+ D R + S+DSR GGFG A
Sbjct: 156 KEGENLKARGAQIIKLLQPKLN-SGDVNRSED-----VFRAIFTRHQSSFDSRLGGFGGA 209
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP+P ++ ++ + + S + E KM+ TL+ MA GGIHDH+G GFHRYSV
Sbjct: 210 PKFPKPSDLDFLICMANT-DPILNSESSKESVKMIQKTLESMADGGIHDHIGNGFHRYSV 268
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF--YSYICRDILDYLRRDMIGPGGEIF 296
D WHVPHFEKMLYDQ QL Y D + LT I DI Y+++ GG +
Sbjct: 269 DAEWHVPHFEKMLYDQSQLLATYSDFYRLTGRKLDNIKTIVDDIFQYMQKISHKDGG-FY 327
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
SAEDADS +T+K EGAF VW +E++ +LGE I +F + YL N +
Sbjct: 328 SAEDADSLPRHDSTKKMEGAFCVWEKEEIKILLGEMKIGSANLVDVFND--YLDVEENGN 385
Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
+SR SDPH E K KNVL +L A + +++ + + ++ L++ R+KRP PHL
Sbjct: 386 VSRSSDPHGELKNKNVLRKLLTDEECAINHDITVDELIEGMQRAKKILWEARTKRPSPHL 445
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
D K++ +W GL I+ +A + ++ +Y+E AE A F++++L
Sbjct: 446 DSKMVTAWQGLAITGLVKAYQ----------------ATNDTKYIERAEKCAEFVQKYL- 488
Query: 470 DEQTHRLQHSFRNGPS--------KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
+ L+ S GP+ + F DDYAF+I LLDLY +L AIELQ
Sbjct: 489 -AENGELKRSVYLGPTGEVEQGNQEMKAFSDDYAFMIQALLDLYTTLGKDDYLKNAIELQ 547
Query: 522 NTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 581
D F G GYF + D V +R+ ED DGAEP+ S++ NL+R I+ +
Sbjct: 548 KICDSKFW--SGNGYFISEQTDEKVSVRMIEDQDGAEPTATSIASNNLLRFYDIL---ED 602
Query: 582 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 641
+ YR+ A RL + +A+P M A + S VLVG S
Sbjct: 603 EEYREKAHQCFRGASERLNKVPIALPKMAVALNRWQKGSIT-FVLVGEPDSELLIETRKR 661
Query: 642 AHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ + N + +HI E D S+ A M A +C+ F CS PV
Sbjct: 662 LNQKFIENFSAVHI----RSENDLGATGASHKA-MTEGPHPA----VYMCKGFVCSLPVR 712
Query: 702 DPISLENLLLE 712
D L+ +L E
Sbjct: 713 DIKGLDKMLNE 723
>gi|195029929|ref|XP_001987824.1| GH19740 [Drosophila grimshawi]
gi|193903824|gb|EDW02691.1| GH19740 [Drosophila grimshawi]
Length = 747
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 272/718 (37%), Positives = 363/718 (50%), Gaps = 65/718 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED A ++N FV+IKVDREERPD+DKVYM ++ G GGWP+S
Sbjct: 61 STCHWCHVMEHESFEDADTAAVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+L PL GTYFPP+ +YG P F +L + W R L +G+ ++ L
Sbjct: 121 VWLTPELAPLAAGTYFPPKARYGMPSFTMVLESIAKKWQTDRAALQNAGSILMDALKANQ 180
Query: 140 SASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+ASA + P +A AE L+ + +D + GGFG PKFP + + +
Sbjct: 181 NASAVGEAAFE--PGSADAKLAEALNVHKQRFDQQHGGFGREPKFPEVSRLNFLFHAYLV 238
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+D + MVL TL + +GGI+DH+ GGF RY+ WH HFEKMLYDQGQ
Sbjct: 239 SKDV-------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQ 291
Query: 257 LANVYLDAFSLTK-DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
L + +A+ LT+ + F Y R I +YL +D+ P G F+ EDADS T T K EG
Sbjct: 292 LMAAFANAYKLTRSEEFLGYADR-IYEYLLKDLRHPAGGFFAGEDADSLPTHKDTVKVEG 350
Query: 316 AFYVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGK 363
AFY WT +EV+D F + HY +KP GN + SDPH GK
Sbjct: 351 AFYAWTWQEVQDAFRAQKTHFNDVSPDRAFDIYSFHYDMKPGGN--VPPDSDPHGHLTGK 408
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
NVLI + S + L++ +L L VR KRPRPHLD K+I SWNGLV+S
Sbjct: 409 NVLIVRGSEEDTCSNFNVELDQLKPLLRTANDILHAVRDKRPRPHLDTKIICSWNGLVLS 468
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL------- 476
A+ + + R Y++ A+ F+R HLYDE+ L
Sbjct: 469 GLAKLANCGTGK--------------RNAYLKTAKELVQFLRTHLYDEEQQVLLRSCYGA 514
Query: 477 ---QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
++ + GFLDDYAFLI GLLD Y+ L WA ELQ TQD+LF D +
Sbjct: 515 GVQDNTLEQNAVRIEGFLDDYAFLIKGLLDYYKASLDMGALRWAKELQGTQDKLFWDEKN 574
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
G YF + + P+V++R+KEDHDGAEP GNSV+ NL L D Y + + L
Sbjct: 575 GAYFYSQQDAPNVIVRLKEDHDGAEPCGNSVTARNLTLLTHYY---DDDAYLKRTDKLLN 631
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
F + A+P M A ML V +VG S D + Y ++
Sbjct: 632 YF-ADVSPFGHALPEMLSAL-MLHEHGLDLVAVVG-PDSPDTARFVEICRKFYVPGMIIV 688
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
H DP +E N + K +C + C PVTDP LE L+
Sbjct: 689 HCDPQHPDEA-------CNQRLQTKFKMVNGKTTVYICHDRVCRMPVTDPAQLEENLM 739
>gi|323703366|ref|ZP_08115015.1| protein of unknown function DUF255 [Desulfotomaculum nigrificans
DSM 574]
gi|323531635|gb|EGB21525.1| protein of unknown function DUF255 [Desulfotomaculum nigrificans
DSM 574]
Length = 692
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 265/707 (37%), Positives = 379/707 (53%), Gaps = 65/707 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFE E VA++LN ++V+IKVDREERPD
Sbjct: 33 GEEAFEKAKRENKPVFLSIGYSTCHWCHVMERESFESEDVAEVLNKYYVAIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YMT QAL G GGWPL++ ++PD KP GTYFP YG+PG IL+++ D W
Sbjct: 93 IDQIYMTVCQALTGQGGWPLNIIMTPDQKPFFAGTYFPKNSNYGKPGLIDILQQIADLWA 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K R L +QL L+ ++ P +L L ++ +DS +GGFG+
Sbjct: 153 KNRQQLLGIS----DQLMARLNMKTATA--PGQLSPEVLDKAYLLFARHFDSTYGGFGNP 206
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + ++L KK + MV TL M +GGI+DH+G GF RYS
Sbjct: 207 PKFPTPHNLMLLLRCWKKTSQ-------KKALTMVEDTLDAMHRGGIYDHIGFGFSRYST 259
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D RW VPHFEKMLYD LA +L+ + + ++ +S + ++I Y+ RDM P G +SA
Sbjct: 260 DRRWLVPHFEKMLYDNALLAIAFLETYQINRNPRFSRVAKEIFTYVLRDMTAPEGGFYSA 319
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG EG FYVW +EVE +LG+ LF +Y + P GN
Sbjct: 320 EDADS---EGV----EGKFYVWHPQEVEQVLGQIDGQLFCRYYDITPRGN---------- 362
Query: 358 NEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
F+G ++ +N D A +L + LE ++ L +CR+ LF R KR PH DDK++ S
Sbjct: 363 --FEGASIPNLINQDPLKFAQELDITLEDLVDGLEKCRQLLFAQREKRVHPHKDDKILTS 420
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+I++ AR +++L E +Y + AE A FI +L RL
Sbjct: 421 WNGLMIAALARGARVLGDE----------------KYSQAAEKAVDFIYHNL-QRADGRL 463
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+R+G + P +LDDYAFLI GLL+LYE K L A++L ++ +LF DR+ GG+
Sbjct: 464 LARYRDGEAAYPAYLDDYAFLIWGLLELYEATFDIKHLEQAVQLTDSMIDLFWDRQNGGF 523
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F + ++ R KE +DGA PSGNSV+ +NL RLA + ++ + Y + A L VF
Sbjct: 524 FFYGKDSEQLISRPKEIYDGAIPSGNSVATVNLFRLARL---TERNRYEELATKQLQVFA 580
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
L+ + AA + P + +VL G + + M+ + L V+ +
Sbjct: 581 GELEHYPIGYSYFMIAAYLNQEPPTE-IVLSGKREDSALKQMIDVVQKEF-LPSAVLAVR 638
Query: 657 PADTEEMDFWEEHNSNNASMARNNFS-ADKVVALVCQNFSCSPPVTD 702
+ ++ A K A VC+NF+C PPVTD
Sbjct: 639 YEGEAAA-----QAEELVPLLKDRLPVAGKATAYVCKNFACQPPVTD 680
>gi|347839355|emb|CCD53927.1| similar to DUF255 domain protein [Botryotinia fuckeliana]
Length = 823
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 241/592 (40%), Positives = 354/592 (59%), Gaps = 26/592 (4%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCH+ME ESFE+E VA +LN F+ IK+DREERPD+D++YM +VQA G GGWPL+
Sbjct: 82 SSCHWCHIMERESFENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLN 141
Query: 80 VFLSPDLKPLMGGTYF----PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
VFL+P L+P+ GGTY+ D + F IL K+ W ++ Q A +++QL
Sbjct: 142 VFLTPSLEPVFGGTYWRGPSKTTDFEDQVDFLGILDKLSTVWSEQESRCRQDSAQSLQQL 201
Query: 136 SEALSASASSNKLP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML- 191
+ + SN+L D + L E + SYD GGFGSAPKFP P +I +L
Sbjct: 202 KDFANEGTLSNRLGEGVDNIDLELLEEVTEHFASSYDKANGGFGSAPKFPTPSKIAFLLR 261
Query: 192 --YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
+ + D + +++ + TL+ MA+GGIHDH+G GF RYS W +PHFEK
Sbjct: 262 LGQFPQAVVDIVGLPDCQNAREIAITTLRKMARGGIHDHIGNGFARYSATADWSLPHFEK 321
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
MLYD QL ++YLD F L++D + + DI +YL + G +S+EDADS G
Sbjct: 322 MLYDNAQLLHLYLDGFLLSRDPEFLGVAYDIANYLTTTLSHSEGGFYSSEDADSYYKNGD 381
Query: 310 TRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+ K+EGA+YVWT +E E+ILG L ++ TG+ ++ + +DPH+EF +NVL
Sbjct: 382 SEKREGAYYVWTKREFENILGSERGLILSAFF-NVTGHGNVGQENDPHDEFMDQNVLAIS 440
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARA 428
+ SA AS+ G+ + + ++ E + +L R + R +P +DDKV+VSWNG+ + + AR
Sbjct: 441 STPSALASQFGIKESEIIKVIKEGKAQLRRRRETDRVKPAMDDKVVVSWNGIAVGALARL 500
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
S ++ F+ PV +EY++ A AA+FI+++LYD++ L +R G
Sbjct: 501 SSVING------FD-PVKA---QEYLDAALKAATFIKKNLYDDKAKILYRIWREGRGDTQ 550
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVL 547
GF DDYAFLI GL+DLYE KWL WA ELQ +Q LF D+ G G +F+TT P+V+
Sbjct: 551 GFADDYAFLIEGLIDLYETTFDEKWLQWADELQQSQINLFYDKNGTGAFFSTTVSAPNVI 610
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
LR+K+ D +EPS N +S NL RL+S+ + Y + A+ ++ FE +
Sbjct: 611 LRLKDAMDSSEPSTNGISSSNLYRLSSMF---NDESYAKKAKETVKSFEAEM 659
>gi|195120756|ref|XP_002004887.1| GI20164 [Drosophila mojavensis]
gi|193909955|gb|EDW08822.1| GI20164 [Drosophila mojavensis]
Length = 747
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 269/716 (37%), Positives = 362/716 (50%), Gaps = 61/716 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED A+++N FV+IKVDREERPD+DKVYM ++ G GGWP+S
Sbjct: 61 STCHWCHVMEHESFEDAATAEVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL+PL GTYFPP+ +YG P F +L + W RD L ++G+ ++ +
Sbjct: 121 VWLTPDLEPLAAGTYFPPKPRYGMPSFTMVLESIAKKWVADRDSLKKAGSTLLQAMQTNQ 180
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
SA S+ + +A A + K +D + GFG PKFP + + + +
Sbjct: 181 SAGTSAEMAFERGSGDAKLAEAVAVHKQRFDQQHAGFGREPKFPEVPRLNFLFHAYLVTK 240
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D + MVL TL + +GGI+DH+ GGF RY+ WH HFEKMLYDQGQL
Sbjct: 241 DV-------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQLM 293
Query: 259 NVYLDAFSLTKDV-FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ LT+ F Y R I +YL +D+ P G ++ EDADS T T K EGAF
Sbjct: 294 AAYANAYKLTRSKEFLGYADR-IYEYLIKDLRHPAGGFYAGEDADSLPTHEDTVKVEGAF 352
Query: 318 YVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKNV 365
Y WT EV+ + FK+ HY LKP+GN +S SDPH GKN+
Sbjct: 353 YAWTWDEVKQAFQKEESCFKDISAARAFEIYSFHYDLKPSGN--VSPSSDPHGHLTGKNI 410
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
LI + S M LEK +L L +R +RPRPHLD K+I WNGLV+S
Sbjct: 411 LIVRGSEEDTCSNFNMELEKLQQLLRTANEILHKIRDQRPRPHLDTKIICGWNGLVLSGL 470
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS------ 479
A+ + ++ R Y+ A+ F+R+HLYDE L S
Sbjct: 471 AKLANCGTAK--------------RDAYLATAKQLMEFVRKHLYDEDEKLLLRSCYGAGV 516
Query: 480 ----FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
++ GFLDDYAFLI GLLD Y+ + L W+ LQ TQD+LF D + G
Sbjct: 517 ADDTLEQNATRIEGFLDDYAFLIKGLLDYYKASLEMEALNWSKTLQETQDKLFWDEDKGA 576
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
YF + P+V++R+KEDHDGAEP GNSV+ NL L+ K Y + A L F
Sbjct: 577 YFFSQQNAPNVIVRLKEDHDGAEPCGNSVAARNLTLLSHYYDDRK---YFERATKLLNYF 633
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
+ A+P M A +L V +VG S D + Y ++H
Sbjct: 634 -ADVSPFGHALPEMLSAL-LLHENGLDLVAVVG-PDSEDTRRFVEIVRKFYVPGMIIVHC 690
Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
DP + N + K +C + C PVTDP LE L+
Sbjct: 691 DPLHPDAA-------CNQRLQQKFKMVNGKTTVYICHDRVCRMPVTDPAQLEENLM 739
>gi|449300572|gb|EMC96584.1| hypothetical protein BAUCODRAFT_33944 [Baudoinia compniacensis UAMH
10762]
Length = 739
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 270/706 (38%), Positives = 382/706 (54%), Gaps = 45/706 (6%)
Query: 11 KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
+T R F+ + CHWCHVM ESF+D +A+LLN+ F+ IK+DREERPD+D+ YM ++
Sbjct: 43 QTNRLLFVSIGYSACHWCHVMAHESFDDPRIAQLLNEHFIPIKIDREERPDIDRQYMDFL 102
Query: 68 QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PED---KYGRPGFKTILRKVKDAWDKKRDM 123
QA GGGGWPL+VF++PDL+P+ GGTY+P P+ + G GF+ IL KV W ++
Sbjct: 103 QATSGGGGWPLNVFVTPDLEPIFGGTYWPGPKSERAQMGGTGFEQILVKVAQMWKEQESK 162
Query: 124 LAQSGAFAIEQLSEALSASASSNKL-------PDELPQNALRLCAEQLSKSYDSRFGGFG 176
L ++G QL E + D L + + +DS++GGFG
Sbjct: 163 LRENGKQITAQLKEFAQEGTLGGRTDGKTSDGDDGLELDLIEEAYNHYKGRFDSKYGGFG 222
Query: 177 SAPKFPRPVEIQMMLY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
SAPKFP PV ++ ++ H +++ E + M + TL+CMAKGGI D VG GF
Sbjct: 223 SAPKFPTPVHLKALVRFGCHPHTVKEIVGDKEVKHARYMAVKTLECMAKGGIKDQVGHGF 282
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIGPG 292
RYSV W +PHFEKMLYD QL +YLDA+ LTK + D+ YL + M
Sbjct: 283 ARYSVTRDWSLPHFEKMLYDNAQLLPLYLDAYLLTKTDLFLETVHDVATYLTTEPMQSSL 342
Query: 293 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLS 351
G I ++EDADS T K+EGAFYVWT E +++L E A + ++ ++P GN D
Sbjct: 343 GGINASEDADSLPTAIDHHKREGAFYVWTLDEFKELLTDEEATVCARYWNVQPNGNVD-- 400
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLD 410
R D E G+N L D+ AS+LGM + ++G R+KL + R K RP P LD
Sbjct: 401 RRYDHQGELVGRNTLCVQYDTPDLASELGMSDSEVKRLIGSGRKKLLEYRDKNRPLPSLD 460
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
DK++ +WNGL I ARAS L S A + + Y+ AE AA+ I++HL+D
Sbjct: 461 DKIVTAWNGLAIGGLARASAALSSMAPDSA----------QAYLAGAERAAACIKQHLFD 510
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
+T L+ +R GP + GF DDYAFLISGLLDLYE +L +A LQ TQ +LF D
Sbjct: 511 AKTGTLRRVYREGPGETQGFADDYAFLISGLLDLYEATFDDSYLSFADTLQQTQVKLFWD 570
Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
+F+T P +L+R K+ D AEPS N VS NL RL+S++ K Y + A+
Sbjct: 571 DNKYAFFSTPANQPDILVRTKDAMDNAEPSTNGVSAQNLFRLSSLLNDEK---YEKMAKR 627
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
++A FE + M + + S K +++VG E L A S N
Sbjct: 628 TVAAFEVEIGQHPGLFSGMMSSI-IASKLGMKGLMVVGEGEVA--EAALKKARESVRPNW 684
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSC 696
TV+ + E + + N + +V+ VC++ +C
Sbjct: 685 TVLRV--GGKAEAKWLRQRNE-----LLQDLDGSRVMVQVCEDGAC 723
>gi|283778260|ref|YP_003369015.1| hypothetical protein Psta_0467 [Pirellula staleyi DSM 6068]
gi|283436713|gb|ADB15155.1| protein of unknown function DUF255 [Pirellula staleyi DSM 6068]
Length = 709
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 270/703 (38%), Positives = 388/703 (55%), Gaps = 75/703 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE + +A LN+ FV IKVDREERPD+D++YM VQ + G GGWP+S
Sbjct: 58 SACHWCHVMEHESFESQEIADYLNEHFVCIKVDREERPDLDQIYMDAVQLMTGRGGWPMS 117
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM-LAQSGAFAIEQLSEA 138
VFL+P+ KP GGTY+PP D+ G PGF ++R V DAW +R+ L+Q+ +L++
Sbjct: 118 VFLTPEGKPFFGGTYWPPTDRQGMPGFSRVIRAVIDAWKNRREQALSQA-----TELTDH 172
Query: 139 LSASASSNKLPDELPQNALR--------LCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L + A+SN P +LP + R A +LS+++DSR+GGFGSAPKFP ++++++
Sbjct: 173 LGSLATSNT-PAQLPLSVSRSMVDGWMETAAARLSRAFDSRYGGFGSAPKFPHSMDLELL 231
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
L ++ + +M L TL+ M+ GGI+DH+GGGF RYSVDERW VPHFEKM
Sbjct: 232 LLEWQR-------SARVDVAEMTLVTLEKMSAGGIYDHLGGGFARYSVDERWLVPHFEKM 284
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD L + A+ T D ++ R+ +YL RDM G I+S EDADS EG
Sbjct: 285 LYDNSLLLRALVRAYQATGDAKFAATMRETCNYLLRDMTDELGGIYSTEDADS---EG-- 339
Query: 311 RKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EG FYVW E+ ++LG E F + Y + P GN F+ ++ L
Sbjct: 340 --EEGKFYVWKPAEIYEVLGPERGSRFCQVYDVAPGGN------------FEHGFSILNL 385
Query: 370 NDSSASASKLG-MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
+ S A S+L MPLE N L E R LFDVR KR P DDK++ SWN L I + A
Sbjct: 386 SRSIADWSRLWEMPLEVLSNELAEDRAILFDVREKRVHPGKDDKILTSWNALAIDALAEV 445
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
+ +L D Y+ A+ AA F+ +HL D RL H++R+G +K
Sbjct: 446 AGVL----------------DEPRYLLAAQRAADFVLQHLRDSDG-RLLHTWRHGRAKLA 488
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
+LDDYA+L+ L+ LYE T+WL A+EL + F D E GG+F T + +++
Sbjct: 489 AYLDDYAYLVHALVSLYEADFHTRWLSAAVELADQMIAHFSDHERGGFFFTADDHEALIT 548
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R K+ HDG+ PSG+S++ + L RL I Y +E ++ + A +
Sbjct: 549 RAKDMHDGSVPSGSSMAALALARLGKITGKQA---YLLASERAILAASGSVTANPTASAV 605
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
M AAD+L P+ + +VL G ++ V + L +A + ++ P D
Sbjct: 606 MIQAADLLVGPTSE-IVLAGPEAEVRETARALRKIYAPRKVVAALMTGLPVDA------- 657
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S A + + S+ ++ +CQNFSC PVT S+ L
Sbjct: 658 --SSPVAPLVQGKESS-QLSLYICQNFSCQAPVTGASSIAAAL 697
>gi|195485941|ref|XP_002091297.1| GE13577 [Drosophila yakuba]
gi|194177398|gb|EDW91009.1| GE13577 [Drosophila yakuba]
Length = 809
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 268/742 (36%), Positives = 377/742 (50%), Gaps = 74/742 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVME ESFE A ++N+ FV+IKVDREERPD
Sbjct: 102 GEEAFEKARSENKIIFLSVGYSTCHWCHVMEHESFESPVTAAIMNEKFVNIKVDREERPD 161
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+DK+YM ++ G GGWP+SV+L+P L PL+ GTYFPP+ +YG P F +L+ + W+
Sbjct: 162 IDKIYMQFLLMSKGSGGWPMSVWLTPTLAPLVAGTYFPPKSRYGMPSFNAVLKSIAKKWE 221
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGF 175
++ L +G+ + L + ASA + +A+ +E ++ + +D GGF
Sbjct: 222 TDKESLLTAGSTLLTALQKNQDASAVAEAAFG--VGSAIEKLSEAINVHKQRFDQTHGGF 279
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
GS PKFP I + + +D ++ MV+ TL + KGGI+DH+ GGF R
Sbjct: 280 GSEPKFPEVPRINFLFHAYLVTKD-------ADVLDMVIETLTQIGKGGINDHIFGGFAR 332
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
Y+ E WH HFEKMLYDQGQL + +A+ +T+D + I YL +D+ P G
Sbjct: 333 YATTEDWHNVHFEKMLYDQGQLMAAFANAYKVTRDETFLGYADKIYKYLLKDLRHPLGGF 392
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLK 343
++ EDADS T K EGAFY WT E++ DI E A ++ HY LK
Sbjct: 393 YAGEDADSLPTHEDNVKVEGAFYAWTWDEIQAAFKDQAQRLDDITPERAFEIYAYHYDLK 452
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
P GN + SDPH GKN+LI S + + +K+ +L L VR +
Sbjct: 453 PPGN--VPAYSDPHGHLTGKNILIVRGSEEDSIANFSLEADKFKKLLATTNDILHVVREQ 510
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RPRPHLD K+I +WNGLV+S + ++R +YM+ A+ F
Sbjct: 511 RPRPHLDTKIICAWNGLVLSGLCKLGN--------------CYSANRDQYMQTAKELLDF 556
Query: 464 IRRHLYDEQTHRLQHS----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 513
+R+ +YD + L S S+ GFLDDYAFLI GLLD Y+
Sbjct: 557 LRKEMYDPEKKLLIRSCYGVAVGDETLEKNESQIDGFLDDYAFLIKGLLDYYKATLDVDV 616
Query: 514 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 573
L WA LQ+TQD+LF D G YF + + P+V++R+KEDHDGAEP GNSVS NLV L
Sbjct: 617 LHWAKALQDTQDKLFWDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSARNLVLLG 676
Query: 574 SIVAGSKSDYYRQNA----EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 629
YY +NA L F + A+P M A +L + +V V
Sbjct: 677 H--------YYDENAYLQKAGKLLNFFADVSPFGHALPEMLSA--LLMHENGLDLVAVVG 726
Query: 630 KSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVAL 689
S D + + Y + ++H+DP++ E SN + K
Sbjct: 727 PDSPDTQRFVEICRKFYIPSMIIVHVDPSNPGEA-------SNQRLQTKFKMVGGKTTVY 779
Query: 690 VCQNFSCSPPVTDPISLENLLL 711
+C +C PVTDP LE+ L+
Sbjct: 780 ICHERACRMPVTDPQQLEDNLM 801
>gi|407917811|gb|EKG11113.1| protein of unknown function DUF255 [Macrophomina phaseolina MS6]
Length = 747
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 262/665 (39%), Positives = 356/665 (53%), Gaps = 35/665 (5%)
Query: 11 KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
KT R F+ CHWCHVME ESFE+ +A +LN F+ +KVDREERPDVD++YM YV
Sbjct: 53 KTNRLLFVSIGYAACHWCHVMERESFENPEIANILNKNFIPVKVDREERPDVDRIYMNYV 112
Query: 68 QALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYG----RPGFKTILRKVKDAWDKKRDM 123
QA G GGWPL+VF++PDL+P+ GGTY+P P F IL ++KD W +R
Sbjct: 113 QATTGSGGWPLNVFITPDLEPIFGGTYWPGPGSTTVLGDHPSFLEILERIKDVWQTQRQK 172
Query: 124 LAQSGAFAIEQL----SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAP 179
+S QL E + + D L L + YD ++ GFG AP
Sbjct: 173 CLESAKEVTAQLREFAQEGTISKGGEGAVGDGLDLELLEEAYTHFANKYDKQYAGFGKAP 232
Query: 180 KFPRPVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
KFP P I +L + + +E E + ++M + TL+ MA+GGIHD +G GF RY
Sbjct: 233 KFPTPTNISFLLRLAQYPEAVEHVVGDRECAHAKEMAVETLRRMARGGIHDQIGNGFARY 292
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYL-RRDMIGPGGEI 295
SV W +PHFEKMLYDQ QL YLDA +T D DI YL + P G
Sbjct: 293 SVTRDWSLPHFEKMLYDQSQLLTAYLDAHIITNDSELLDAAHDIATYLTTHPLQSPDGGF 352
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMS 354
FS+EDADS K+EGAFYVWT KE + ILGE A + +Y ++ GN +S
Sbjct: 353 FSSEDADSLYRPNDKEKREGAFYVWTRKEFKSILGEKDAEVCARYYNVRENGN--VSPEH 410
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKV 413
D H+E +NVL + A A + G+ ++ IL RR+L + R+K RPRP LDDK+
Sbjct: 411 DAHDELINQNVLAISSTPDALAKEFGLSKDEVTKILESGRRRLLEHRNKERPRPGLDDKI 470
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
+V WNGL I + AR S L++ DR Y+ AE A I+ LY
Sbjct: 471 VVGWNGLAIGALARFSAYLQASGSKE--------PDR--YISAAEKAVKLIKTKLYSAAD 520
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
L+ +R GP +AP F DDYAFLISGL+DLYE +L +A +LQ TQ +LF D
Sbjct: 521 GTLKRVYREGPGEAPAFADDYAFLISGLIDLYEATFDDSYLEFADQLQRTQIKLFWDSTS 580
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
G +F+T ++LR+KE D AEPS N +S NL RL +++ + DY ++ A+ +
Sbjct: 581 GAFFSTAEGQADLILRLKEGMDNAEPSTNGISASNLYRLGALL--EEPDYTKR-AKETCE 637
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
FE L P M L + K +V+ G +V E ++ A + + N T+
Sbjct: 638 AFEAELMQHPFLFPSMLNGIVALRL-GMKSIVVSGSGENV--EKAISKARSRVNTNTTIA 694
Query: 654 HIDPA 658
+ P
Sbjct: 695 RLGPG 699
>gi|195583350|ref|XP_002081485.1| GD11041 [Drosophila simulans]
gi|194193494|gb|EDX07070.1| GD11041 [Drosophila simulans]
Length = 808
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 268/723 (37%), Positives = 374/723 (51%), Gaps = 75/723 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE A ++N+ FV+IKVDREERPD+DK+YM ++ G GGWP+S
Sbjct: 122 STCHWCHVMEHESFESPETAAIMNENFVNIKVDREERPDIDKIYMQFLLMSKGSGGWPMS 181
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+L PL+ GTYFPP+ +YG P F +L+ + W+ ++ L +G+ + L +
Sbjct: 182 VWLTPNLAPLVAGTYFPPKSRYGMPSFNAVLKSIARKWETDKESLLSTGSSLLSALQKNQ 241
Query: 140 SASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMM 190
ASA +P+ A E+LS++ +D GGFGS PKFP + +
Sbjct: 242 DASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGFGSEPKFPEVPRLNFL 293
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
+ +D + MV+ TL + KGGIHDH+ GGF RY+ + WH HFEKM
Sbjct: 294 FHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFARYATTQDWHNVHFEKM 346
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYDQGQL + +A+ +T+D Y I YL +D+ P G ++ EDADS T
Sbjct: 347 LYDQGQLIVAFTNAYKVTRDEIYLGYADKIYKYLIKDLRHPLGGFYAGEDADSLPTHEDK 406
Query: 311 RKKEGAFYVWTSKEV-----------EDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHN 358
K EGAFY WT E+ EDI E A ++ HY LKP GN + SDPH
Sbjct: 407 VKVEGAFYAWTWDEIQAAFKDQAQRFEDITPERAFEIYAYHYDLKPPGN--VPTYSDPHG 464
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
GKN+LI + + + +++ +L L +R KRPRPHLD K+I +WN
Sbjct: 465 HLTGKNILIVRGSEEDTCANFKLEADQFKKLLATTNDILHVIRDKRPRPHLDTKIICAWN 524
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GLV+S + ++R++YM+ A+ F+R+ +YD + L
Sbjct: 525 GLVLSGLCKLGN--------------CYSANREQYMQTAKELLDFLRKEMYDPEQKLLIR 570
Query: 479 S----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
S S+ GFLDDYAFLI GLLD Y+ L WA LQ+TQD+LF
Sbjct: 571 SCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDKLF 630
Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
D G YF + + P+V++R+KEDHDGAEP GNSVS NLV LA D + Q A
Sbjct: 631 WDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSAHNLVLLAHYY---DEDAFLQKA 687
Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 648
L F + A+P M A +L + +V V S D E + Y
Sbjct: 688 GKLLNFF-ADVSPFGHALPEMLSA--LLMHENGLDLVAVVGPDSPDTERFVEICRKFYIP 744
Query: 649 NKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLEN 708
+ ++H+DP++ EE SN + K +C +C PVTDP LE+
Sbjct: 745 SMIIVHVDPSNPEEA-------SNQRLQTKFKMVGGKTTVYICHERACRMPVTDPQQLED 797
Query: 709 LLL 711
L+
Sbjct: 798 NLM 800
>gi|341876361|gb|EGT32296.1| hypothetical protein CAEBREN_30752 [Caenorhabditis brenneri]
Length = 745
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 278/743 (37%), Positives = 386/743 (51%), Gaps = 79/743 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F +T + FL +TCHWCHVME ESFE+E AK+LN+ FV+IKVDREERPD
Sbjct: 45 GEEAFQKAKETNKPIFLSVGYSTCHWCHVMEKESFENENTAKILNENFVAIKVDREERPD 104
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YM +V A G GGWP+SVFL+PDL P+ GGTYFPP+D G GF TIL + W
Sbjct: 105 VDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEWQ 164
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K+ + L GA I+ L + S N+ D + ++DSR GGFG A
Sbjct: 165 KEGENLRTRGAQIIKLLQPEIK-SGDVNRSED-----VFKSIYSHKKSTFDSRLGGFGRA 218
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP+ + ++ + S E E M+ TL+ MA GGIHDH+G GFHRYSV
Sbjct: 219 PKFPKAPDFDFLIAFAS---SQSNSEEKQESIMMLQKTLESMADGGIHDHIGNGFHRYSV 275
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS--YICRDILDYLRRDMIGPGGEIF 296
D WH+PHFEKM+YDQ QL Y + SLT+ S + DI +Y+++ GG +
Sbjct: 276 DSEWHIPHFEKMIYDQSQLLASYSEFHSLTEKKHESIKLVINDIFEYMQKISHKDGG-FY 334
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
+AEDADS T +T K EGAF W E++ +LGE I +F +++ ++ GN
Sbjct: 335 AAEDADSLPTHESTEKVEGAFCAWERDEIKQLLGEKKIESASLFDVFVDYFDVEENGN-- 392
Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
+++ SDPH E K KNVL +L A+ G+ +E+ N + E R L+ R+KRP PHL
Sbjct: 393 VAKSSDPHGELKNKNVLRKLLTDEECATNHGITVEQLKNGIDEAREILWIARTKRPSPHL 452
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
D K++ +W GL I+ +A + ++ +Y+E AE A+F+ ++L
Sbjct: 453 DSKMVTAWQGLAITGLVKAYQ----------------ATNEPKYLERAEKCAAFVEKYL- 495
Query: 470 DEQTHRLQHS--------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
E+ L+ S G + F DDYAFLI GLLDLY ++L IELQ
Sbjct: 496 -EENGELRRSVYLGDNGEVEQGNQRMKAFSDDYAFLIQGLLDLYTVAGKNEYLERCIELQ 554
Query: 522 NTQDELFLDREGGGYFNTTGEDPSVLLRVKE--------------DHDGAEPSGNSVSVI 567
T DE F G GYF + D V +R+ E D DGAEP+ S++
Sbjct: 555 KTCDEKFWS--GNGYFISEKSDEEVSVRMIEGKIILSNFYKKNFSDQDGAEPTATSIASN 612
Query: 568 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
NL+R I+ +++ YR+ A RL + +A+P M A + S VLV
Sbjct: 613 NLLRFYDIL---ENEEYREKANQCFRGASERLNKIPIALPKMAVALQRWQLGSTT-FVLV 668
Query: 628 GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVV 687
G +S + N +V+HI D D +S+NA MA+ +
Sbjct: 669 GDPTSELLTEARNQLNQKLINNVSVVHIRSKD----DVSASGSSHNA-MAQ----GPQPA 719
Query: 688 ALVCQNFSCSPPVTDPISLENLL 710
+C+ F C PV LE L
Sbjct: 720 VYLCKGFVCGLPVRKIDKLEQLF 742
>gi|28210673|ref|NP_781617.1| thymidylate kinase [Clostridium tetani E88]
gi|28203111|gb|AAO35554.1| thymidylate kinase [Clostridium tetani E88]
Length = 713
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 264/720 (36%), Positives = 391/720 (54%), Gaps = 89/720 (12%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFEDE VAK+LND F+SIKVDREERPD
Sbjct: 69 GEEAFQKAKEEDKPIFLSIGYSTCHWCHVMERESFEDEEVAKVLNDNFISIKVDREERPD 128
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +YMT+ QA+ G GGWPL++ ++PD KP GTYFP ED+YG G IL+++ + W
Sbjct: 129 IDNIYMTFCQAVTGSGGWPLTIIMTPDKKPFFAGTYFPKEDRYGVRGLMYILKEMSNQWK 188
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
R+++ S ++ +S+ +S S ++L + ++ C E L +SYD GGF A
Sbjct: 189 NNRELILNSSEKLLKDMSQYISVSQR-----EDLNKEVIKECFEVLKESYDPIHGGFYDA 243
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP ++ +L + + +D E +V TL+ M KGGI DH+G GF RYS
Sbjct: 244 PKFPTSHKLMFLLRYYRLYKD-------EEALNIVEKTLKSMYKGGIFDHIGYGFSRYST 296
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D++W VPHFEKMLYD L Y + + +TK+ Y I + Y+ RDM G +SA
Sbjct: 297 DDKWLVPHFEKMLYDNAMLTIAYAEMYQITKEELYKEIIEKTISYVIRDMKDKKGAFYSA 356
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG EG FYVWT +E+EDILG E A LF ++Y + GN
Sbjct: 357 EDADS---EGV----EGKFYVWTLEEIEDILGKEDAKLFSKYYGITDRGN---------- 399
Query: 358 NEFKGKNV--LIELNDSSASASKLGMPLEKY----LNILGECRRKLFDVRSKRPRPHLDD 411
F+G+N+ LIE PLE + L R+ LF R KR PH D
Sbjct: 400 --FEGENIPNLIE------------TPLEDLEPDVKDKLENIRKTLFINREKRIHPHKDT 445
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL+I++ A + ++LK RK+Y+E AE A FI ++L DE
Sbjct: 446 KILTSWNGLMIAALAYSGRVLK----------------RKDYIESAEEAVKFIMKNLIDE 489
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
R+ +R+G G L+DY+FLI L++LY+ T+++ A+++ ELF D
Sbjct: 490 NG-RIYVRYRDGERAHKGHLEDYSFLIWALIELYQSTFKTEYIEKALKINYDMIELFWDE 548
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
E G+F+T + ++L++KE +D A PSGNSV++ N+VRL+ I SK D + + +
Sbjct: 549 ENHGFFHTGKDGEELILKLKESYDSAIPSGNSVAMYNMVRLSRITGDSKLD---EIIQQN 605
Query: 592 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD-LNK 650
L F R+K + + + S + V++ G + F+ M+ + Y +
Sbjct: 606 LNYFSGRIKSTLESHTFFLISYMHYVLESEEIVIVKGEDEDI-FKAMIKVINEKYHPFSM 664
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ + + + E++N N K +C+NF+C P+ ISLE+L+
Sbjct: 665 NIVKDEKVEKLMPELKEKNNIQN-----------KTTVYICKNFACGNPI---ISLEDLI 710
>gi|374302064|ref|YP_005053703.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
gi|332555000|gb|EGJ52044.1| protein of unknown function DUF255 [Desulfovibrio africanus str.
Walvis Bay]
Length = 691
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 278/712 (39%), Positives = 379/712 (53%), Gaps = 55/712 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F T+ + FL +TCHWCHVME ESFED+ VAKLLN+ FV IKVDREERPD
Sbjct: 30 GEEAFRTATEQDKPVFLSIGYSTCHWCHVMERESFEDDEVAKLLNEAFVCIKVDREERPD 89
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYMT Q + G GGWPL+V ++PD KP GTYFP GR G ++ KV+D W
Sbjct: 90 IDNVYMTVCQMMTGHGGWPLTVLMTPDKKPFFSGTYFPKSSLSGRMGLMELVPKVQDLWR 149
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+R+ L QS E L L A +L D + A R QLS+ +D FGGFG A
Sbjct: 150 TRREDLVQSADKVTEAL-RGLERPAVGGELGDSVLFKAER----QLSERFDEAFGGFGGA 204
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P +L + TG + + MV TL M +GGI+DH+G GFHRYS
Sbjct: 205 PKFPTP---HNLLLLLRMFRRTGNARNLA----MVEKTLTTMRRGGIYDHLGYGFHRYST 257
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+RW +PHFEKMLYDQ QL Y++A+ LT+ Y ++I++Y+RRD+ P G +SA
Sbjct: 258 DQRWLLPHFEKMLYDQAQLLMAYVEAYQLTRKPIYKRTAQEIVEYVRRDLQHPDGPFYSA 317
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADS EG +EG FYVW+ KE+ +LG+ A F Y + P GN + + +
Sbjct: 318 EDADS---EG----EEGKFYVWSEKEIRSVLGKKADPFIRAYDILPEGNF----LDEATH 366
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
G NVL A +LGM + L + RR LF VR +R RP DDKV+ WN
Sbjct: 367 RRTGANVLHLQRPLDILAKELGMSELELETTLADQRRLLFHVRERRVRPLRDDKVLTDWN 426
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL+I++ + A+K L D + ++ A +AA FI + + RL H
Sbjct: 427 GLMIAALSMAAKAL----------------DEELFVRAATAAADFILSRM--RKDGRLLH 468
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
FR+G L DYAFLI GL++LYE G ++ L A++L ++ F D + GGY+
Sbjct: 469 RFRDGEVAIEATLTDYAFLIWGLVELYEAGLDSRHLEAALDLTEIMNKQFWDPKDGGYYF 528
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
T +L+R K+ DGA PSGNSV++ L++L+ + S A T
Sbjct: 529 TAESAEQLLVRQKDLFDGAIPSGNSVAMHVLLKLSRLTGRPNLANRAAAVARSAARQAT- 587
Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
+ + + C D PS VV+VG +++ + ML HASY NK ++ +
Sbjct: 588 --EHPVGFTQLLCGVDFSIGPS-AEVVIVGKRNAPETRAMLRKLHASYIPNKVLLLREEG 644
Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
D E + A K A VC+ FSC PVT+P ++ LL
Sbjct: 645 D-------ERMPALAPFTAELVMQDGKATAYVCRGFSCELPVTEPQAMMELL 689
>gi|91201579|emb|CAJ74639.1| conserved hypothetical protein [Candidatus Kuenenia
stuttgartiensis]
Length = 729
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 264/714 (36%), Positives = 382/714 (53%), Gaps = 65/714 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + FL +TCHWCHVME ESFEDE VAK+LN+++V+IKVDREERPD
Sbjct: 75 GKEAFEKAKAESKVIFLSIGYSTCHWCHVMETESFEDEEVAKILNEYYVAIKVDREERPD 134
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYMT QA+ G GGWPL++FL+ + K GTYFP ++ G PG +L ++ + W+
Sbjct: 135 IDNVYMTVCQAMTGSGGWPLTLFLTSEGKSFYAGTYFPKTERLGNPGLIALLTQIANLWN 194
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++ + S + + +L + +AS K PD L+ EQLS +DS +GGFG++
Sbjct: 195 TNKESIIAS-SLQVTKLIDTETASKGEEK-PD---VRTLKTAYEQLSDRFDSLYGGFGTS 249
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P +L K+ + + +MV +L+ MA+GGIHDH+GGGFHRYS
Sbjct: 250 PKFPTPHNFTFLLRWWKRSNN-------AFALEMVEKSLELMARGGIHDHLGGGFHRYST 302
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DE W PHFEKMLYDQ LA Y++ + TK YS I +DI DY+ RDM P G +SA
Sbjct: 303 DEYWLTPHFEKMLYDQALLAISYIETYQATKKDLYSAIAKDIFDYVLRDMTSPEGGFYSA 362
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGN--CDLSRMSDP 356
EDADS EG EG FYVW +E+++ LGE GN CD +SD
Sbjct: 363 EDADS---EGI----EGKFYVWKPEEIKEALGEK------------DGNIFCDFYDVSDI 403
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
N F+ KN+L +A M + L R+KL +R KR +PH D K+I S
Sbjct: 404 GN-FEDKNILHADKPLHIAAKLENMSPDALEKRLANSRKKLLSIREKRIKPHKDTKIITS 462
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+IS+ +R ++ + D +Y VA AA FI L E L
Sbjct: 463 WNGLMISALSRGAQAM----------------DEPKYTNVAMCAADFILNTLLQENKILL 506
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+ + G S GFLDDYAF ++GL+DLYE K+L A+++ + FLD GG+
Sbjct: 507 RR-YCQGESAIAGFLDDYAFFVNGLIDLYEATFQEKYLQAALQINEEMIKNFLDENEGGF 565
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F + + + + K+ +DGA PSGNS++++NL+RL I Y A++ + F
Sbjct: 566 FLSGKSNEKLFTQTKDIYDGATPSGNSIALLNLLRLGRITGNPS---YEALADNLIKTFS 622
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
+ CA D P+ K +++ G + D +++L + + NK V+ +
Sbjct: 623 GTILQYPSGYTQFMCALDFALGPT-KEIIVAGEREGNDTKDILREIRSRFLPNK-VLLLH 680
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
P++ F EE + + +C+N+SC PV+D ++ LL
Sbjct: 681 PSNG---IFIEEIAPYTKELIP---IEGRSTVYMCENYSCKKPVSDKNAVIQLL 728
>gi|333374035|ref|ZP_08465926.1| thymidylate kinase [Desmospora sp. 8437]
gi|332968513|gb|EGK07575.1| thymidylate kinase [Desmospora sp. 8437]
Length = 702
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 275/710 (38%), Positives = 382/710 (53%), Gaps = 65/710 (9%)
Query: 5 SFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDK 61
+F K + FL +TCHWCHVME ESFED VA+LLN +++IKVDREERPDVD
Sbjct: 49 AFAKARKEDKPIFLSIGYSTCHWCHVMERESFEDVEVAQLLNREYIAIKVDREERPDVDN 108
Query: 62 VYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR 121
+YM+ QAL G GGWPL++ ++P+ +P GTYFP + G G IL +V AW ++R
Sbjct: 109 IYMSVCQALTGHGGWPLTIIMTPEKEPFFAGTYFPKQAVQGMQGLMEILGQVARAWREER 168
Query: 122 DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKF 181
+ + +G + L S S + +EL + Q +YD ++GGFG+APKF
Sbjct: 169 EQVLDAGRKITRAVQTQLKVSESGDLGKEELAE-----AYRQFKSTYDPQYGGFGTAPKF 223
Query: 182 PRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
PRP ++ +L + K +SGE MV TL M +GGI+DHVG GF RY+VD
Sbjct: 224 PRPHDLLFLLRYWK------ESGEPF-ALSMVEETLDGMRRGGIYDHVGFGFARYAVDRE 276
Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
W VPHFEKMLYD LA YL+A+ +TK Y+ R+I Y+ R M P G +SAEDA
Sbjct: 277 WLVPHFEKMLYDNALLAYAYLEAYQVTKKDAYAGTAREIFTYVLRGMTSPEGGFYSAEDA 336
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSRMSDPHNEF 360
DS EG +EG FYVW EV+++LGE A LF E Y + P GN + +MS P+
Sbjct: 337 DS---EG----EEGKFYVWNPSEVKEVLGEEAGELFCECYDITPHGNFE-QKMSIPN--- 385
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
+ + L E+ D + G +E+ L R KLF R +R PH DDK++ SWNGL
Sbjct: 386 RIHSSLQEIAD------RRGRDVEELREQLEVSREKLFRAREERVHPHKDDKILTSWNGL 439
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
+I++ A+ +++L E+ Y E AE AASFI L DE+ RL +
Sbjct: 440 MIAALAKGARVLGDES----------------YAEAAEKAASFILERLRDEKG-RLLARY 482
Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
R+G + PG++DDYAFL+ GL++LYE ++L A+EL ELF D E GG + T
Sbjct: 483 RDGEAAIPGYVDDYAFLVWGLIELYEATFRPRYLKSALELTREMLELFGDEEEGGLYFTG 542
Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
+ +L R KE +DGA PSGNSV+ +NL RLA + + R+ A+ + F +
Sbjct: 543 RDAEKLLTRTKEVYDGAVPSGNSVAALNLARLARLTGDTG---LREQADRQIRAFAGSVG 599
Query: 601 DMAMAVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
A A L P K +V+ G D E M+ ++ L + V+ P
Sbjct: 600 QAPTAFSFFLTAVQFFLGTP--KEIVIAGPDGDHDTELMIRRVQQAF-LPEAVLLYKPEG 656
Query: 660 TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
EE +A + A VC+N++C P T +LE L
Sbjct: 657 K-----GEEVTQLVPFLAEQGAIQGRATAYVCENYACMAPAT---TLEEL 698
>gi|332020712|gb|EGI61117.1| Spermatogenesis-associated protein 20 [Acromyrmex echinatior]
Length = 746
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 274/711 (38%), Positives = 386/711 (54%), Gaps = 69/711 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA--LYGGGGWP 77
+TCHWCHVME ESF++E VAK++N+ +V+IKVDREERPD+D + M ++QA L G GGWP
Sbjct: 63 STCHWCHVMEKESFKNEEVAKIMNENYVNIKVDREERPDIDMMCMMFIQASRLRGHGGWP 122
Query: 78 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
L+VFL+PDL P+ GGTYF F L ++ W + RD + +S A ++L E
Sbjct: 123 LNVFLTPDLMPITGGTYF------SCAMFTLYLTRIVKEWTEGRDKMVKSAAIVSDRLKE 176
Query: 138 ALSASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGSA-------PKFPRPVEIQM 189
LS S K D +P + LCA L YD +GGFGS+ PKFP P +
Sbjct: 177 -LSTSRHDIK-DDGVPAIDCAFLCAHVLLNIYDEEYGGFGSSSATNPNSPKFPEPTNLNF 234
Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
+L L + E S L TL+ M+ GG+HDHVG GFHRY+VD RW VPHFEK
Sbjct: 235 LL-SMHVLSTSTMLVEMSLNAS--LNTLRKMSFGGLHDHVGKGFHRYTVDARWKVPHFEK 291
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
MLYDQ QL Y+DA+ +TKD F+S I DI Y+ R + G FSA DADS T A
Sbjct: 292 MLYDQAQLIQCYVDAYIITKDSFFSDIVDDIATYVLRMLTHMEGGFFSAVDADSLPTFDA 351
Query: 310 TRKKEGAFYVWTSKEVEDIL-----GEHAI----LFKEHYYLKPTGNCDLSRMSDPHNEF 360
K+EGAFYVW+ ++ +L G+ + L H+ ++ GN + R DPH E
Sbjct: 352 PAKREGAFYVWSYDNLKALLKKKVPGKDNVTYFDLICRHFSVRKEGN--VERPQDPHGEL 409
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
GKNVL + +A+ + +++ + E L++ RS RP P LDDK++ SWNGL
Sbjct: 410 TGKNVLSMQSGIEDTANHFKLNVKETQKYIKEACTTLYEDRSHRPWPSLDDKMVTSWNGL 469
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS- 479
+IS ARA +K+ K+Y+E A AA+F+ ++L+++ L S
Sbjct: 470 MISGLARAGIAVKN----------------KDYVEAATEAATFVEKYLFNKDKRILLRSC 513
Query: 480 FRNGPSK-------APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
+R K PGF +DYAF + GLLDLYE W+ +A ELQ+ QD LF D E
Sbjct: 514 YRRRDDKIVQRSDPIPGFHEDYAFFVKGLLDLYEATFNPHWVEFAEELQDIQDRLFWDSE 573
Query: 533 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
GGYF E P +L R K+ DG++PSGNS++ NL+RLA + D R AE L
Sbjct: 574 DGGYFAMAEESP-ILTRTKDSDDGSQPSGNSIACSNLLRLAIYL---DRDDLRHKAEKLL 629
Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 652
F +L + A P M A P++ +V G + + ML + + +
Sbjct: 630 CAFGNKLANCPAACPQMMLALIEFHHPTQIYV--AGKADAKETIEMLEIIRSRLIPGRVL 687
Query: 653 IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
I AD+E+ + N + R ++ +C+++SC+ P+++P
Sbjct: 688 IL---ADSEDNVLFRR----NMIVKRMKPQKNRATVFICRDYSCTLPISNP 731
>gi|296415498|ref|XP_002837423.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633295|emb|CAZ81614.1| unnamed protein product [Tuber melanosporum]
Length = 773
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 261/693 (37%), Positives = 376/693 (54%), Gaps = 63/693 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
C + VME ESFE+E +A++LN+ F+ IK+DREERPD+D++YM +VQA G GGWPL+VF
Sbjct: 109 CEYTIVMERESFENEEIARILNENFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLNVF 168
Query: 82 LSPDLKPLMGGTYFPPEDKYG----RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
L+PDL+P+ GGTY+P G + GF +LRK+ + W ++ + S + + QL E
Sbjct: 169 LTPDLQPVFGGTYWPGPSAVGGMKDQLGFLEVLRKIANVWKEQHERCVASASDILNQLKE 228
Query: 138 ALSAS--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---Y 192
+ + D L + L + YD +GGFG+APKFP PV + +L
Sbjct: 229 FTDEGLKGTGGEPGDGLELDLLEEAYQHFMARYDPLYGGFGNAPKFPTPVNLAFLLRLGT 288
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
++D E + MV+ TLQ MAKGGIHDH+G GF RYSV W++PHFEKMLY
Sbjct: 289 FPATVQDIVGEMECENAKSMVIDTLQGMAKGGIHDHIGHGFSRYSVTANWNLPHFEKMLY 348
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATR 311
DQ QL ++Y+DA+ +TK DI +Y+ D + P G +S+EDADS + T
Sbjct: 349 DQAQLLSIYIDAWLVTKSPAMLEAANDIAEYMCLDALKSPDGAFYSSEDADSLYRKADTE 408
Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
K+EGAFYVWT KE + +LGE A + ++ + GN D + +DPH+EF +NVL +
Sbjct: 409 KREGAFYVWTRKEFDVMLGEQDASICARYWNVHRDGNVDPA--NDPHDEFIAQNVLSVAS 466
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARAS 429
+ GM E+ NI+ R+KL R K RPRP+LDDK++ +
Sbjct: 467 TPEKLSKMYGMSAERITNIISSARQKLLQHRLKERPRPNLDDKIVTT------------- 513
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+ Y + AE A SFIR++LYDE+T L+ +R+GP +A G
Sbjct: 514 ---------------------QLYKKNAEEAISFIRKNLYDEKTGILKRVYRDGPGEADG 552
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
F DDYAFLISGLL +YE ++L WA LQ Q + F D E GG+F+T+ ++LR
Sbjct: 553 FADDYAFLISGLLCMYEATFDVEYLQWADALQQKQIDAFWDAENGGFFSTSEGASDLILR 612
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+K+ D EPS N VS NL RL +++ K + Y A+ + + F T L + P +
Sbjct: 613 LKDGLDSQEPSTNGVSANNLFRLGTLLGDPKLEEY---AQQTCSAFSTEL----LQHPFL 665
Query: 610 CCA---ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
+ A + S + VVL G E L + N T++ +DPA + +D+
Sbjct: 666 FSSLMPAIVASNLGMRSVVLAGDPKDPTIEKHLKRLRSKLLTNTTLVQLDPARGDSLDWL 725
Query: 667 EEHNSNNASMARNNFSAD---KVVALVCQNFSC 696
N + + N +A K V VC+ C
Sbjct: 726 LSRNKLHKELL--NVAAKGSGKPVVQVCEGTKC 756
>gi|108805332|ref|YP_645269.1| hypothetical protein Rxyl_2540 [Rubrobacter xylanophilus DSM 9941]
gi|108766575|gb|ABG05457.1| protein of unknown function DUF255 [Rubrobacter xylanophilus DSM
9941]
Length = 685
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 267/696 (38%), Positives = 386/696 (55%), Gaps = 65/696 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME ESFEDE A+++N+ FV+IKVDREERPD+D +YM+ +QA+ GGGWP++
Sbjct: 51 SSCHWCHVMERESFEDEETARIMNEHFVNIKVDREERPDIDSIYMSALQAMTRGGGWPMT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ P GTYFPPE + G P FK +L + DA+ +R+ + +S E L +
Sbjct: 111 VFLTPEGVPFYAGTYFPPEPRGGMPSFKQVLLTLADAYRNRREEVLRSAESVREFLRAST 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A +L +EL A AE L + D RFGGFG APKFP+P+ ++++L H ++ D
Sbjct: 171 TAEMPRGRLREELLDGA----AEALMRQLDRRFGGFGGAPKFPQPMSLEVLLRHHRRTGD 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
E V TL+ MA+GGI+D +GGGFHRY+VD RW VPHFEKMLYD L+
Sbjct: 227 -------REALAGVELTLRSMARGGIYDQLGGGFHRYAVDGRWLVPHFEKMLYDNALLSR 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+YL+A+ T D FY I + LDY+ RDM GP G +SAEDADS EG +EG FYV
Sbjct: 280 LYLEAYQATGDGFYRRIAEETLDYVARDMRGPEGGFYSAEDADS---EG----EEGKFYV 332
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +E+ + LG E A L ++ + GN F+G+NVL + A +
Sbjct: 333 WTPRELREALGSEDASLAAAYWGVTERGN------------FEGRNVLHVPREPEEVARE 380
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+G+ + + E RR+L + R +R RP D+KV+ +WNGL++ SFA +++L+
Sbjct: 381 VGLSPGELGRRVREIRRRLLEARGRRVRPGRDEKVLAAWNGLMLRSFAFTARVLR----- 435
Query: 439 AMFNFPVVGSDRKEYMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
R++Y+ +A E+AA + R L E RL S+R+G ++ G+L+DYA +
Sbjct: 436 -----------REDYLRIACENAAFLLGRLLSPE--GRLLRSYRDGRARIAGYLEDYAMV 482
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
GL+ LYE T+WL AI L + DELF D G +F+ ++ R ++ +D A
Sbjct: 483 ADGLVSLYEATFETRWLREAISLADAMDELFWDESAGAFFDAPAGGEELVTRPRDVYDNA 542
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-L 616
PSG SV+V V L + + D YR+ AE +L L+ M A + A D L
Sbjct: 543 TPSGTSVAVD--VLLRLALLLGRED-YRRRAEAALEGLSGLLEQMPAAFGRLLGALDFHL 599
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
P + V +VG + D ++ A ++ Y N+ VI P E S +
Sbjct: 600 GRP--REVAIVGRPDAPDTRALVDALYSVYLPNR-VIAGGPGG--------EDASLVPLL 648
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ A VC+ + C P T+P L L E
Sbjct: 649 EGRGMVDGRATAYVCEGYVCKSPTTEPGELLRQLRE 684
>gi|298710386|emb|CBJ25450.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 808
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 296/782 (37%), Positives = 407/782 (52%), Gaps = 94/782 (12%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + + FL +TCHWCHVME ESFE + VAK+LN+ FVSIKVDREERPD
Sbjct: 48 GQEAFSRAKEEDKPIFLSVGYSTCHWCHVMERESFESQTVAKVLNENFVSIKVDREERPD 107
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD+ +MT+VQA GGGGWP+SV+L+PDLKP +G TYFP F +IL+ + D W
Sbjct: 108 VDQCFMTFVQATSGGGGWPMSVWLTPDLKPFVGATYFPEMR------FVSILKTLADKWS 161
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL-----PQNALRLCAEQLSKSYDSRFG 173
R+ + + G + L E LS +A+++ P + A+R L K +D G
Sbjct: 162 SDREEVVKQGDHIVRLLQERLSETAAASGDPLAFLALDKSREAVREGVRVLDKGHDDVLG 221
Query: 174 GFGSAP---KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
G+G KFP+P + ++L + +LE G S + MV TL+ MAKGGI+D++
Sbjct: 222 GWGGGRGGMKFPQPSRMNLLL-RAHRLEGEG-SALGARALAMVETTLKAMAKGGIYDYLF 279
Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
GF RYS D RWHVPHFEKMLYDQ QL Y++AF +T D Y+ + R +L Y+ RDM
Sbjct: 280 DGFARYSTDPRWHVPHFEKMLYDQSQLVTAYVEAFQVTGDTAYADVARGVLRYVLRDMTD 339
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAI--------------L 335
GG +SAEDADS EGAT KKEGAF VWT ++ +L GE + L
Sbjct: 340 EGGGFYSAEDADSLPFEGATEKKEGAFCVWTEPDLRRLLDGEEGVALPGEGGQTVPVSSL 399
Query: 336 FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL--EKYLNILGEC 393
F Y ++P GN D + D H E +NVL + +A LG+ E+ +
Sbjct: 400 FCRVYGVRPEGNVDPA--VDAHGELTSQNVLFKSETVRVAAEALGLTCSGEEAEAAMTGA 457
Query: 394 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 453
R L R KRP PHLDDKV+ SWNGL+IS+ ARAS+ F+ + Y
Sbjct: 458 RATLVAARRKRPAPHLDDKVLTSWNGLMISALARASQ---------AFSSSPPSEESLAY 508
Query: 454 MEVAESAASFIRRHLY------DEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYE 506
+ A AA F+R +LY E L S+RNG S GF DDYAFLI GL+DLYE
Sbjct: 509 LGAATKAAEFVRENLYRSGSGDGETAGTLLRSWRNGRASPVEGFADDYAFLIRGLIDLYE 568
Query: 507 F----GSGTKWLVWAIELQNTQDELFL--DREGGGYFN-----TTGEDPS---------- 545
+G +WL WA ELQ DE F GGGY++ + GE
Sbjct: 569 ADPRRDTGWRWLRWARELQAEMDEGFKCPSEAGGGYYSSRALESEGETKGDGETEGGSGS 628
Query: 546 --VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 603
+ R++ D+DGAEP SV+ NL+RL+ G + R+ A LA L +
Sbjct: 629 GVLPYRLRTDYDGAEPGAGSVAADNLLRLSGYFGGEEGKVLREKAAEQLAA-AFALPETP 687
Query: 604 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM 663
A P + A+ + ++ K V++ G + + + +++AA S+ N +I D +++
Sbjct: 688 QAYPEL-TASLVTALLGPKQVIISGDPAGAETQALMSAAQRSFCPNLVLIVEDSTTSDDR 746
Query: 664 DFWEEHNSNNAS-----MARNNFSA----------DKVVALVCQNFSCSPPVTDPISLEN 708
EE + R A + A VC + +CS PV +LE
Sbjct: 747 GKEEEAGDGKTGDEPPPLFREILEAYGGGYSAGEGGQAAAYVCFDNTCSAPVHTVEALEK 806
Query: 709 LL 710
LL
Sbjct: 807 LL 808
>gi|452985594|gb|EME85350.1| hypothetical protein MYCFIDRAFT_60228 [Pseudocercospora fijiensis
CIRAD86]
Length = 784
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 262/663 (39%), Positives = 366/663 (55%), Gaps = 37/663 (5%)
Query: 11 KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
KT R F+ + CHWCHVM ESF+D +++LLN+ F+ +K+DREERPD+D+ YM ++
Sbjct: 94 KTNRLLFVSIGYSACHWCHVMAHESFDDPRISRLLNENFIPVKIDREERPDIDRQYMDFL 153
Query: 68 QALYGGGGWPLSVFLSPDLKPLMGGTYFP---PEDKYGRPGFKTILRKVKDAWDKKRDML 124
QA GGGGWP++VF++PDL+P+ GGTY+P E GF+ IL K+ W ++ +
Sbjct: 154 QATNGGGGWPMNVFVTPDLEPVFGGTYWPGPKSERLQAAGGFEDILIKIATTWKEQEARV 213
Query: 125 AQSGAFAIEQLSE-----ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAP 179
QSG QL E ++ DEL + L + YD + GFG AP
Sbjct: 214 RQSGKEITRQLREFAQEGSIGGKNGRTDDEDELELDLLDDAFQHYKMRYDPKHHGFGGAP 273
Query: 180 KFPRPVEIQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
KFP PV I+ +L Y S E G+ E E + M + TL MAKGGI D +G GF R
Sbjct: 274 KFPTPVHIRPLLRVAAYPSVVREIVGEK-ECVEARAMAVNTLAAMAKGGIKDQIGHGFAR 332
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGE 294
YSV W +PHFEKMLYD QL VYLDA+ LTK + DI YL M P G
Sbjct: 333 YSVTRDWSLPHFEKMLYDNAQLLPVYLDAYLLTKSPLFLETAIDIATYLTSPPMQSPLGG 392
Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRM 353
I SAEDADS+ T K+EGA+YVWT E + +LG+ + + +++ ++P GN D +
Sbjct: 393 ICSAEDADSSPTVSDKEKREGAYYVWTFDEFKQVLGDAQVDICAKYWNVRPEGNID--QR 450
Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDK 412
SD E G+N L D A +LG+P ++ ++ + R+KL R K RPRP LDDK
Sbjct: 451 SDAQGELAGQNTLCVQYDIPDLAKELGLPEDEVKQMILDGRQKLLAHREKTRPRPALDDK 510
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
++ SWNGL I AR S +L+S A + Y+ A A + I+ HL+D
Sbjct: 511 IVTSWNGLAIGGLARTSAVLQSSAPAQA----------TRYLSSAVRAVTCIQEHLFDPA 560
Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
T L+ +R GP + GF DDYAF +SGLLDLYE ++WL +A LQ TQ++LF D
Sbjct: 561 TGTLKRVYREGPGETQGFADDYAFFVSGLLDLYEATFDSRWLEFAETLQKTQNKLFWDDL 620
Query: 533 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
G+F+T + P +L+R K+ D AEPS N VS NL RL S++ ++ Y + +
Sbjct: 621 KYGFFSTPADQPDILIRTKDAMDNAEPSVNGVSAANLFRLGSLLNDAE---YEKMGRRVV 677
Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 652
A FE ++ M + + S K +++VG + E L A + N T+
Sbjct: 678 ACFEVEIEQHPGLFSGMLSSV-VASKLGMKGLMIVGEGDAA--EAALKKARETVRPNYTI 734
Query: 653 IHI 655
+ I
Sbjct: 735 LRI 737
>gi|268316671|ref|YP_003290390.1| hypothetical protein Rmar_1111 [Rhodothermus marinus DSM 4252]
gi|262334205|gb|ACY48002.1| protein of unknown function DUF255 [Rhodothermus marinus DSM 4252]
Length = 699
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 265/691 (38%), Positives = 364/691 (52%), Gaps = 52/691 (7%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF+DE VA+LLND F++IKVDREERPD+D +YMT Q + G GGWPL++
Sbjct: 50 CHWCHVMAHESFQDEEVARLLNDAFINIKVDREERPDIDHLYMTVCQMVTGHGGWPLTII 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
++PD KP TY P +YGRPG I+ ++K+AW + RD + S L + +S
Sbjct: 110 MTPDKKPFFAATYIPKRSRYGRPGLLEIIPRIKEAWQQHRDEIIASAEKLTGTLQKVMSF 169
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
A S + E + A R +L +D + GGFG APKFP P + +L +
Sbjct: 170 EAPSQIIDAEWLEIAYR----RLDDIFDRKHGGFGHAPKFPTPHTLLFLLRYWH------ 219
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+SGEA Q MV TL M GGI+DHVG GFHRY+ DE W VPHFEKMLYDQ L Y
Sbjct: 220 RSGEAHALQ-MVEHTLVQMRLGGIYDHVGFGFHRYATDEAWRVPHFEKMLYDQALLTMAY 278
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
+A+ T + FY R+IL Y+ RD+ P G +S+EDADS EG +EG FYVWT
Sbjct: 279 TEAYQATGNPFYERTAREILTYVLRDLRAPEGAFYSSEDADS---EG----EEGKFYVWT 331
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+E+ ++LG E L E + + P GN + + E GKN+L A A + G
Sbjct: 332 VEELREVLGPELTPLAIELFNVDPEGNYE----EEATGERTGKNILYLSKPPEALARERG 387
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
E+ L E R++LF R++R RP D+K++ WNGL+I++ ARA+++
Sbjct: 388 WTPEELEAKLEEIRQRLFAYRARRVRPGRDEKILTDWNGLMIAALARAAQVF-------- 439
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
D Y+E A SAA F+ R ++ + RL H +R G + PG LDDYAFL G
Sbjct: 440 --------DEVAYVEAARSAADFLLRTMHTPEG-RLWHRYREGEAGIPGMLDDYAFLTWG 490
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LLDLYE T +L A+ L F D G Y +P +++R +E D A PS
Sbjct: 491 LLDLYETTFETSYLETALALTEQMLAHFWDPRGAFYMTPDDGEP-MIVRPRETLDNALPS 549
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
GN+V+++NLVRL + + Y ++A+ + F +K M A D+ P
Sbjct: 550 GNAVALMNLVRLGHMTGRTA---YEEHADAMIRFFSGPVKQQPPIFTGMLIAIDLAFGPI 606
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
+ +VL G ML H Y K ++ P + E A
Sbjct: 607 YE-LVLAGEPDDPTLREMLRTIHRRYLPRKVLLLRRPGEA------GERLVRVAPFVAAQ 659
Query: 681 FSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
D + A VC ++ C PVTDP +L L
Sbjct: 660 LPVDGRATAYVCHDYRCEQPVTDPEALARQL 690
>gi|198457071|ref|XP_001360541.2| GA21208 [Drosophila pseudoobscura pseudoobscura]
gi|198135846|gb|EAL25116.2| GA21208 [Drosophila pseudoobscura pseudoobscura]
Length = 803
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 262/717 (36%), Positives = 365/717 (50%), Gaps = 63/717 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+ A ++N+ FV+IKVDREERPD+DK+YMT++Q GGGGWP+S
Sbjct: 117 STCHWCHVMEHESFENLETAAVMNEHFVNIKVDREERPDIDKIYMTFLQMTKGGGGWPMS 176
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
++L+PDL P+ GTYFPP +YG P FKT+L + W R L +SG+ + L +
Sbjct: 177 IWLTPDLAPITAGTYFPPTGRYGMPSFKTVLLAIAQQWQTNRQTLIESGSSILNALKQNE 236
Query: 140 SASASSNKLPDELPQNALRLCAEQL---SKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
ASA + + P +A AE + + +D GGFG+ PKFP + + +
Sbjct: 237 DASAVAEAAFE--PGSASAKLAEAIGVHKRRFDRTNGGFGTEPKFPEVPRLNFLFHAYLV 294
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+D +VL TL + +GGI+DH+ GGF RY+ WH HFEKMLYDQGQ
Sbjct: 295 SKDVSV-------LDLVLQTLDHIGRGGINDHIFGGFARYATTADWHNVHFEKMLYDQGQ 347
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ LT+ + I Y+ +D+ P G ++ EDADS T K EGA
Sbjct: 348 LMAAYSNAYKLTRSATFLTYADKIYKYIMKDLRHPLGGFYAGEDADSLPDHKDTVKVEGA 407
Query: 317 FYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
FY WT E+E D+L + A ++ HY LKP GN + SDPH GKN
Sbjct: 408 FYAWTWNEIEAAFKDQAKRFDDVLPKRAFEIYAFHYGLKPKGN--VPTHSDPHGHLTGKN 465
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
+LI + S + EK +L L +R +RPRPHLD K+I +WNGL++S
Sbjct: 466 ILIVRGSDEETCSNFDLQPEKLDKLLETANDILHVLRDQRPRPHLDTKIICAWNGLMLSG 525
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----- 479
++ + + R+EY++ A+ F+R+ +YD + L S
Sbjct: 526 LSKLANCGTVK--------------REEYIKAAKELVDFLRKEMYDPEQKLLVRSCYGVA 571
Query: 480 -----FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
S+ GFLDDYAFLI GLLD Y+ L WA ELQ TQD+LF D + G
Sbjct: 572 VGDPTLEKNESQIDGFLDDYAFLIKGLLDYYKASLDLSALRWAKELQETQDKLFWDEQNG 631
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
YF + P+V++R+KE DGAEP GNSVS NL L+ + Y Q A L
Sbjct: 632 AYFFSQQNAPNVIVRLKEGDDGAEPCGNSVSARNLTLLSHYY---DEETYLQRAA-KLMN 687
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
F + A+P M A +L V +VG S D + + + ++H
Sbjct: 688 FFADVAPFGHALPEMLSAL-LLHENGLDLVAVVGPDSE-DTKRFVEICRKFFIPGMIILH 745
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+DP ++ N + K +C + C PVTDP LE L+
Sbjct: 746 VDPLHPDDA-------CNQRVQKKFKMVNGKTTVYICHDRVCRMPVTDPTQLEENLM 795
>gi|391342665|ref|XP_003745636.1| PREDICTED: spermatogenesis-associated protein 20 [Metaseiulus
occidentalis]
Length = 728
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 284/731 (38%), Positives = 387/731 (52%), Gaps = 114/731 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E VAK+LND +VSIKVDREERPD+DK+YMTYVQ G GWPLS
Sbjct: 54 STCHWCHVMERESFENEEVAKILNDRYVSIKVDREERPDIDKIYMTYVQVTSGHSGWPLS 113
Query: 80 VFLSPDLKPLMGGTYFPPED-KYGRPGFKTILRKVKDAW------------DKKRDMLAQ 126
V+L+P+LKP+ GGTYFPPED +YG GFKTIL + D W D+ MLA+
Sbjct: 114 VWLTPELKPIFGGTYFPPEDNQYGLAGFKTILLMLDDKWHSSKNEKIKADSDRITAMLAR 173
Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV- 185
+ L E L A+ S P ++ C+ L K GF P+FP+ V
Sbjct: 174 AS-----NLRENLEAAESFQ------PSQCIKDCSLILQK----HLIGFVKEPRFPQCVN 218
Query: 186 -EIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 244
M L+H + G +V L+ MA GGIHDH+GGGFHRY+VD W V
Sbjct: 219 GNFYMNLFHFQN---------NRMGVDIVERQLKEMATGGIHDHLGGGFHRYTVDAAWQV 269
Query: 245 PHFEKMLYDQGQLANVYLDAFSLTK-----DVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
PHFEKMLYDQ Q+ +Y + F+ + I DY+ RD+ P G +SAE
Sbjct: 270 PHFEKMLYDQAQILALYCSYLRMPGIKPEIASFFGGVATGIADYVMRDLSHPQGGFYSAE 329
Query: 300 DADSAET-EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
DADS E+ + + KKEGAFYVWT E++ IL + A +F E + + GN DPH
Sbjct: 330 DADSLESFDSSDHKKEGAFYVWTMAEIQKILSKKEAKVFCEFFGVDEQGNV------DPH 383
Query: 358 NEFKG----KNVLI---------ELNDSSASAS-KLGMPLEKYLNILGECRRKLFDVR-S 402
++ +G +N L +ND + + G PL++ IL +RKL R
Sbjct: 384 HDAQGELLNQNTLFYRYPDSYDQNINDMAKVIDLEDGDPLDE---ILESAKRKLLQRRLE 440
Query: 403 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 462
RPRPHLD+K++ +WNGL+I++ A+AS +LK R Y E A A
Sbjct: 441 SRPRPHLDNKIVSAWNGLMIAALAKASVVLK----------------RPAYAERALKAVD 484
Query: 463 FIRRHLYDEQTHRLQHS-FRNGPSKA----------PGFLDDYAFLISGLLDLYEFGSGT 511
FIR +L+D + RL S + G A PG L+DYAF+ISGLL LY+
Sbjct: 485 FIRANLFDRENQRLYRSAYTEGEGDAARVEQLEKPIPGVLEDYAFVISGLLQLYDATLDE 544
Query: 512 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 571
+ L++A LQ++Q+ F D GGYF +G +++ +K+DHDGAEPS NSVS+ NL+R
Sbjct: 545 QLLLFAKILQDSQNRQFWDETNGGYFLFSGGGSNIIYVLKDDHDGAEPSANSVSIANLIR 604
Query: 572 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 631
L I + YR A ++ +F RL + +A+P M + L P K ++
Sbjct: 605 LYHIF---DHEPYRTKANKTVKLFAERLSKVPIALPEMVSSLMYLVEPPTKIILSAEDDE 661
Query: 632 SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVC 691
DF+ + + I E+ F +E A N +V A VC
Sbjct: 662 ISDFKRVCDEEARGFS-----IVFAARSVSELGFTKEQYP-----AVNG----EVTAYVC 707
Query: 692 QNFSCSPPVTD 702
++ SC PP+ D
Sbjct: 708 KDLSCLPPIND 718
>gi|195150279|ref|XP_002016082.1| GL10685 [Drosophila persimilis]
gi|194109929|gb|EDW31972.1| GL10685 [Drosophila persimilis]
Length = 803
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 262/717 (36%), Positives = 365/717 (50%), Gaps = 63/717 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+ A ++N+ FV+IKVDREERPD+DK+YMT++Q GGGGWP+S
Sbjct: 117 STCHWCHVMEHESFENLETAAVMNEHFVNIKVDREERPDIDKIYMTFLQMTKGGGGWPMS 176
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
++L+PDL P+ GTYFPP +YG P FKT+L + W R L +SG+ + L +
Sbjct: 177 IWLTPDLAPITAGTYFPPTGRYGMPSFKTVLLAIAQQWQTNRQTLIESGSSILNALKKNE 236
Query: 140 SASASSNKLPDELPQNALRLCAEQL---SKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
ASA + + P +A AE + + +D GGFG+ PKFP + + +
Sbjct: 237 DASAVAEAAFE--PGSASAKLAEAIGVHKRRFDRTNGGFGTEPKFPEVPRLNFLFHAYLV 294
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+D +VL TL + +GGI+DH+ GGF RY+ WH HFEKMLYDQGQ
Sbjct: 295 SKDVSV-------LDLVLQTLDHIGRGGINDHIFGGFARYATTADWHNVHFEKMLYDQGQ 347
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ LT+ + I Y+ +D+ P G ++ EDADS T K EGA
Sbjct: 348 LMAAYSNAYKLTRSATFLTYADKIYKYIMKDLRHPLGGFYAGEDADSLPDHKDTVKVEGA 407
Query: 317 FYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
FY WT E+E D+L + A ++ HY LKP GN + SDPH GKN
Sbjct: 408 FYAWTWNEIEAAFKDQAKRFDDVLPKRAFEIYAFHYGLKPKGN--VPTHSDPHGHLTGKN 465
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
+LI + S + EK +L L +R +RPRPHLD K+I +WNGL++S
Sbjct: 466 ILIVRGSDEETCSNFDLQPEKLDKLLETANDILHVLRDQRPRPHLDTKIICAWNGLMLSG 525
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----- 479
++ + + R+EY++ A+ F+R+ +YD + L S
Sbjct: 526 LSKLANCGTVK--------------REEYIKAAKELVDFLRKEMYDPEQKLLVRSCYGVA 571
Query: 480 -----FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
S+ GFLDDYAFLI GLLD Y+ L WA ELQ TQD+LF D + G
Sbjct: 572 VGDPTLEKNESQIDGFLDDYAFLIKGLLDYYKASLDLSALRWAKELQETQDKLFWDEQNG 631
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
YF + P+V++R+KE DGAEP GNSVS NL L+ + Y Q A L
Sbjct: 632 AYFFSQQNAPNVIVRLKEGDDGAEPCGNSVSARNLTLLSHYY---DEETYLQRAA-KLMN 687
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
F + A+P M A +L V +VG S D + + + ++H
Sbjct: 688 FFADVAPFGHALPEMLSAL-LLHENGLDLVAVVGPDSE-DTKRFVEICRKFFIPGMIILH 745
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+DP ++ N + K +C + C PVTDP LE L+
Sbjct: 746 VDPLHPDDA-------CNQRVQKKFKMVNGKTTVYICHDRVCRMPVTDPTQLEENLM 795
>gi|195382934|ref|XP_002050183.1| GJ22002 [Drosophila virilis]
gi|194144980|gb|EDW61376.1| GJ22002 [Drosophila virilis]
Length = 747
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 261/718 (36%), Positives = 363/718 (50%), Gaps = 65/718 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED A ++N FV+IKVDREERPD+DKVYM ++ G GGWP+S
Sbjct: 61 STCHWCHVMEHESFEDADTAAVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL PL GTYFPP+ +YG P F +L + W R L ++G+ +E +
Sbjct: 121 VWLTPDLAPLAAGTYFPPKARYGMPSFTMVLESIAKKWQTDRTSLKKAGSTLMEAMRANQ 180
Query: 140 SASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+A + + P +A AE L+ + +D GFG PKFP + + +
Sbjct: 181 NAGTDAEAAFE--PGSADAKLAEALAVHKQRFDQEHAGFGREPKFPEVPRLNFLFHAYLV 238
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+D + MVL TL + +GGI+DH+ GGF RY+ WH HFEKMLYDQGQ
Sbjct: 239 SKDV-------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQ 291
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ LT+ + I +YL +D+ P G ++ EDADS T T K EGA
Sbjct: 292 LMAAYANAYKLTRSKEFLRYADRIYEYLIKDLRHPAGGFYAGEDADSLPTHADTVKVEGA 351
Query: 317 FYVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKN 364
FY WT EV+ F + HY +KP GN + SDPH GKN
Sbjct: 352 FYAWTWDEVKQAFEAQQARFNDVSPARVFEIYCFHYGMKPAGN--VPPASDPHGHLTGKN 409
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
+LI + S + + + +L L +R +RPRPHLD K+I WNGLV+S
Sbjct: 410 ILIVRGSEEDTCSNFNLEMAQLSQLLETANDILHKIRDQRPRPHLDTKIICGWNGLVLSG 469
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKE-YMEVAESAASFIRRHLYD-EQTHRLQHSFRN 482
++ + G+D+++ Y+ A+ F+R HLYD EQ L+ +
Sbjct: 470 LSKLAN---------------CGTDKRDAYLATAKQLMDFLRTHLYDGEQKLLLRSCYGA 514
Query: 483 G---------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
G P++ GFLDDYAFL+ GLLD Y+ L WA ELQ TQD+LF D +
Sbjct: 515 GVQDNTLEQNPTRIEGFLDDYAFLVKGLLDYYKASLDMSALHWAKELQVTQDKLFWDEKN 574
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
G YF + P+V++R+KEDHDGAEP GNSV+ NL L+ + Y ++ A+ L
Sbjct: 575 GAYFFSQQNAPNVIVRLKEDHDGAEPCGNSVAARNLTLLSHYF--DEGTYLKRAAK--LL 630
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
+ + A+P M A +L V +VG S D + + Y ++
Sbjct: 631 NYFADVAPFGHALPEMLSAL-LLHENGLDLVAVVG-PDSPDTKRFVEIVRKFYVPGMIIV 688
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
H DP +E N + K +C + C PVTDP LE L+
Sbjct: 689 HCDPQHPDEA-------CNQRLQQKFKMVNGKTTVYICHDRVCRMPVTDPAQLEENLM 739
>gi|384917096|ref|ZP_10017228.1| conserved hypothetical protein [Methylacidiphilum fumariolicum
SolV]
gi|384525484|emb|CCG93101.1| conserved hypothetical protein [Methylacidiphilum fumariolicum
SolV]
Length = 727
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 252/689 (36%), Positives = 366/689 (53%), Gaps = 37/689 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFE+ VA+LLN +++ +KVDREERPD+D+ YM +VQA G GGWP+S
Sbjct: 47 STCHWCHVMAEESFENPTVAELLNAFYIPVKVDREERPDIDQFYMEFVQAFCGQGGWPMS 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDL+P GGTYFP E K+GRPGF +L+K+ + W R L Q G + ++ E++
Sbjct: 107 VWLTPDLEPFFGGTYFPLESKWGRPGFIDLLKKIANLWQSHRSALQQQGQEILNKMRESI 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S P+ L Q A R EQL ++D +GGF PKFPRP + L+ + ++
Sbjct: 167 LCSIEIESQPN-LTQIA-RKTVEQLWGNFDRVYGGFSPPPKFPRP-NLFFFLFRAGSFKE 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ ++ KM LFTLQ M+ GGIHD + GGFHRYSVD +W +PHFEKMLYDQ L +
Sbjct: 224 LPDPLQ-NKAMKMALFTLQKMSCGGIHDILEGGFHRYSVDAQWRLPHFEKMLYDQAHLGS 282
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+AF +T D + + +YL + P G +SAEDADS + G K EGA+Y+
Sbjct: 283 AYLEAFQMTSDFLFKETATALFEYLFSHLYNPAGGFYSAEDADSLNSSG--EKAEGAYYL 340
Query: 320 WTSKEVEDILGEHAILFKEH-----YYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
WT +E+E IL E ++ KE + T +L+ + KN+L SA
Sbjct: 341 WTMEELEKILEE--VVGKERSKVLASFFGATNQGNLAEGLGTEPSMRLKNMLFFSKPLSA 398
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A +L MP+E+ ++L + + L + R KRP+P LDDK+I +WNG IS+ A+A +L
Sbjct: 399 LAEELKMPIEETKDLLLKAKTALKEARLKRPKPFLDDKIITAWNGYAISALAKAYMVLAD 458
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
Y+ A+ A FI HL+D + L +RNG PGF DY
Sbjct: 459 ----------------SRYLNEAKKTADFILEHLWDADSKILYRIYRNGRGSIPGFASDY 502
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A L + LLDL+E KWL+ A Q +E F D Y + E + +++ +E++
Sbjct: 503 ASLAASLLDLFEADQDEKWLLQAKMFQELLEEKFADPYRHQYLSRAVETAATIIQTREEY 562
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DGAEP+ S+S L +L SI K +++ E L+ A+P
Sbjct: 563 DGAEPATLSLSAYALWKLFSITGEEK---WKKRLEELFNSAWPILERFPTALPYFLGVYL 619
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SVP + +++VG K + + + N+ + +DP F +N
Sbjct: 620 EYSVPPIE-IIIVGEKDDLKTRALFNTLSSVLIPNRLFLVLDPRQGVPRTFKSIDFYSNL 678
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDP 703
+ +A +C CS P T+P
Sbjct: 679 LSVYPGYP----IAYICARGQCSLPQTEP 703
>gi|154303146|ref|XP_001551981.1| hypothetical protein BC1G_09593 [Botryotinia fuckeliana B05.10]
Length = 753
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 243/610 (39%), Positives = 355/610 (58%), Gaps = 28/610 (4%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CH+ME ESFE+E VA +LN F+ IK+DREERPD+D++YM +VQA G GGWPL+VFL+P
Sbjct: 17 CHIMERESFENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLNVFLTP 76
Query: 85 DLKPLMGGTYF----PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
L+P+ GGTY+ D + F IL K+ W ++ Q A +++QL + +
Sbjct: 77 SLEPVFGGTYWRGPSKTTDFEDQVDFLGILDKLSTVWSEQESRCRQDSAQSLQQLKDFAN 136
Query: 141 ASASSNKLP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHS 194
SN+L D + L E + SYD GGFGSAPKFP P +I +L
Sbjct: 137 EGTLSNRLGEGVDNIDLELLEEVTEHFASSYDKANGGFGSAPKFPTPSKIAFLLRLGQFP 196
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ + D + +++ + TL+ MA+GGIHDH+G GF RYS W +PHFEKMLYD
Sbjct: 197 QAVVDIVGLPDCQNAREIAITTLRKMARGGIHDHIGNGFARYSATADWSLPHFEKMLYDN 256
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
QL ++YLD F L++D + + DI +YL + G +S+EDADS G + K+E
Sbjct: 257 AQLLHLYLDGFLLSRDPEFLGVAYDIANYLTTTLSHSEGGFYSSEDADSYYKNGDSEKRE 316
Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
GA+YVWT +E E+ILG L ++ TG+ ++ + +DPH+EF +NVL + SA
Sbjct: 317 GAYYVWTKREFENILGSERGLILSAFF-NVTGHGNVGQENDPHDEFMDQNVLAISSTPSA 375
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
AS+ G+ + + ++ E + +L R + R +P +DDKV+VSWNG+ + + AR S ++
Sbjct: 376 LASQFGIKESEIIKVIKEGKAQLRRRRETDRVKPAMDDKVVVSWNGIAVGALARLSSVIN 435
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
F+ PV +EY++ A AA+FI+++LYD++ L +R G GF DD
Sbjct: 436 G------FD-PVKA---QEYLDAALKAATFIKKNLYDDKAKILYRIWREGRGDTQGFADD 485
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKE 552
YAFLI GL+DLYE KWL WA ELQ +Q LF D+ G G +F+TT P+V+LR+K+
Sbjct: 486 YAFLIEGLIDLYETTFDEKWLQWADELQQSQINLFYDKNGTGAFFSTTVSAPNVILRLKD 545
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP--LMC 610
D +EPS N +S NL RL+S+ + Y + A+ ++ FE + P +
Sbjct: 546 AMDSSEPSTNGISSSNLYRLSSMF---NDESYAKKAKETVKSFEAEMLQYPWLFPSFMPA 602
Query: 611 CAADMLSVPS 620
A L V S
Sbjct: 603 IVASHLGVKS 612
>gi|302814858|ref|XP_002989112.1| hypothetical protein SELMODRAFT_1701 [Selaginella moellendorffii]
gi|300143213|gb|EFJ09906.1| hypothetical protein SELMODRAFT_1701 [Selaginella moellendorffii]
Length = 354
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 206/330 (62%), Positives = 260/330 (78%), Gaps = 3/330 (0%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVMEVESFE E VAKLLNDWFVSIKVDREERPD
Sbjct: 25 GEEAFAKAKAEDKPIFLSVGYSTCHWCHVMEVESFESEEVAKLLNDWFVSIKVDREERPD 84
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGGWP+SVFL+P+LKP++GGTYFPPED YGRPGFKT+LR+VK+ WD
Sbjct: 85 VDKVYMTFVQASQGGGGWPMSVFLTPELKPIVGGTYFPPEDNYGRPGFKTVLRRVKENWD 144
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++ +L +G I+QL+EA++A A+S ++ + + A++LCA QL K +D++ GGFGSA
Sbjct: 145 SRKAVLRNAGDNVIQQLAEAMAACATSLQVSGGVAEQAVQLCASQLMKGFDAKLGGFGSA 204
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFPRPVE+ +ML + K+L+ GK+ + + +M F LQCMA+GG+HDHVGGGFHRYSV
Sbjct: 205 PKFPRPVELNLMLRYYKRLDQAGKASLSKKALEMASFNLQCMARGGMHDHVGGGFHRYSV 264
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+ WHVPHFEKMLYDQ QLAN YLD + +T+D ++ + RDILDYL RDM P G IFSA
Sbjct: 265 DDYWHVPHFEKMLYDQAQLANAYLDVYLVTRDTMHACVARDILDYLNRDMTHPEGGIFSA 324
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDI 328
EDADS E G+++KKEGAFYVWT+KEV ++
Sbjct: 325 EDADSLEPSGSSKKKEGAFYVWTAKEVRNL 354
>gi|167629725|ref|YP_001680224.1| thioredoxin [Heliobacterium modesticaldum Ice1]
gi|167592465|gb|ABZ84213.1| conserved hypothetical protein containing a thioredoxin domain
[Heliobacterium modesticaldum Ice1]
Length = 687
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 279/714 (39%), Positives = 372/714 (52%), Gaps = 67/714 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFEDE VA LN+ F+S+KVDREERPD
Sbjct: 34 GEEAFTRAKEQDKPVFLSVGYSTCHWCHVMERESFEDEEVAAYLNEHFISVKVDREERPD 93
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD +YMT QA+ G GGWPL+V ++PD KP GTYFP + G G IL V D W
Sbjct: 94 VDHIYMTVCQAITGHGGWPLTVIMTPDKKPFFAGTYFPKRSRQGLAGLLDILEAVVDQWK 153
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
R L +G + L + A+ S+ L D + LR A L K +D +GGFG A
Sbjct: 154 NDRGKLVAAGDRVTQHLQREVQAN-SAGSLDD---ASILRGYA-WLQKRFDDVYGGFGHA 208
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L K + A E MV TL+ M GGI+DH+G GF RYS
Sbjct: 209 PKFPTPHNLLFLLRCDKLI-------NAKEALPMVEKTLRQMHAGGIYDHLGYGFSRYST 261
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DE+W VPHFEKMLYD QLA YL+A+ +T Y+ + R+I Y+ RDM P G +SA
Sbjct: 262 DEKWLVPHFEKMLYDNAQLAMAYLEAYQVTAKDEYAEVAREIFSYVLRDMHAPEGGFYSA 321
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG EG FY+WT +EV++ILGE LF + Y + GN
Sbjct: 322 EDADS---EGV----EGKFYLWTPQEVKEILGEETGKLFCQWYDITEKGN---------- 364
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
F+G+N+ LN A P+ + IL + KLF R KR P D+K++ +W
Sbjct: 365 --FEGQNI---LNRIDADRRPFTPPM-GWHQILTDAEEKLFVAREKRVHPLKDEKILTAW 418
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+I++ A +IL + Y++ A AA FI L D++ RL
Sbjct: 419 NGLMIAALAMGFRILYD----------------RSYLDAAIGAADFIWEKLRDDKG-RLL 461
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+R+G + G++DDYAF+I L++LY+ + WL A+ LQ Q+ LF D + GGYF
Sbjct: 462 ARYRDGEAAYKGYIDDYAFMIWALIELYQADTNPLWLKRALTLQEDQNRLFWDPDQGGYF 521
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+ +L R KE +DGA PSGNSVS +NL+RLA I ++ Y RQ AE L F
Sbjct: 522 FYGSDSEELLTRPKEIYDGATPSGNSVSALNLLRLARITG--RNAYARQ-AETLLESFSG 578
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
+ A P K VV+V + F L H+ + +TV
Sbjct: 579 NINAQPAGHTFALMALLFARRPG-KEVVVVADRKRETFRQELERLHSPFS-PETVFLYRL 636
Query: 658 ADTEEMDFWEEHNSNNASMARNNF-SADKVVALVCQNFSCSPPVTDPISLENLL 710
AD E D + A N D VC+NF+C PP T+P + +L
Sbjct: 637 ADREYKDL-----AELAPFVENMAPQGDSPTYYVCENFACKPPTTNPREVWEIL 685
>gi|392375956|ref|YP_003207789.1| hypothetical protein DAMO_2917 [Candidatus Methylomirabilis
oxyfera]
gi|258593649|emb|CBE69990.1| conserved protein of unknown function [Candidatus Methylomirabilis
oxyfera]
Length = 1103
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 250/694 (36%), Positives = 372/694 (53%), Gaps = 64/694 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
+ CHWCHVM ESFE E +A+L+N +FV IKVDREERPD+D +YM AL +G GGWP+
Sbjct: 63 SACHWCHVMAHESFESEQIAELMNRYFVCIKVDREERPDLDAIYMAATLALNHGQGGWPM 122
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+VFL+PDL+P GTYFPP D GRPGF TIL +V W ++ D L ++++E
Sbjct: 123 TVFLTPDLQPFFAGTYFPPRDGLGRPGFPTILNRVAQVWREQPDALRTQS----DKITEG 178
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L S S LP + + + + ++D FGGFG+APKFP + ++L H +
Sbjct: 179 LRES-SRPSLPMPVGRAEIAAAVAHFAATFDPTFGGFGAAPKFPAATALSLLLRHHQHTG 237
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D + +MV TL MA+GGI+D +GGGF RYS DERW +PHFEKMLYD LA
Sbjct: 238 D-------AHALQMVRTTLDAMARGGIYDQIGGGFARYSTDERWLIPHFEKMLYDNALLA 290
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL+AF + D Y I ++LDY+ R+M G +SA DADS EG EG FY
Sbjct: 291 RTYLEAFQVAGDPSYRQIATELLDYILREMTALEGGFYSATDADS---EGV----EGKFY 343
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT E+E ILG E A F +Y + PTGN ++G+++ ++ A+
Sbjct: 344 VWTPAEIEAILGQEEARRFCAYYDITPTGN------------WEGRSIPNIRRTAAQVAA 391
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
KLG+ +E+ + + K+++ R KR P LDDK++ +WNGL++S+ A ++L
Sbjct: 392 KLGVSVEELAASIDRTQPKVYEARRKRVPPGLDDKILTAWNGLMVSAMAEGYRVLGE--- 448
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ +++ A AA F+ L RL ++R+G + +L+DYA L
Sbjct: 449 -------------RRHLDAAVRAADFLLSTLLRPDG-RLLRTYRSGVAHLNAYLEDYACL 494
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
GL+DLYE G T++L A+ L F D E G + T+ + +++LR +E DGA
Sbjct: 495 CEGLIDLYEAGGETRYLREAVRLAERMPGDFADEESGAFHTTSRDHETLILRYREGTDGA 554
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V+ L RL+ + + +R+ AE +++ + ++ A D+L
Sbjct: 555 TPSGNAVAASALTRLSFHL---NREEWRRAAEQAISAYGQQIARYPHAFAKSLAVVDLL- 610
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ + L+G+ + E + + N+ + H DP + N +
Sbjct: 611 LEGPVELCLIGNPAEAGCEALRREVGRHFIPNRIIAHHDPT---------KGNPPELPLL 661
Query: 678 RNNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
R D AL +C+NF+C P+TDP + LL
Sbjct: 662 RGKGLVDGRAALYLCRNFTCQAPITDPAQVAELL 695
>gi|374297486|ref|YP_005047677.1| thioredoxin domain-containing protein [Clostridium clariflavum DSM
19732]
gi|359826980|gb|AEV69753.1| thioredoxin domain protein [Clostridium clariflavum DSM 19732]
Length = 680
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 261/696 (37%), Positives = 369/696 (53%), Gaps = 75/696 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED VA++LN +F+SIKVDREERPD+D +YM QAL G GGWPL+
Sbjct: 53 STCHWCHVMERESFEDYEVAEILNKYFISIKVDREERPDIDHIYMNVCQALTGHGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+F++PD KP GTYFP D+ G G +IL V +AW R+ L + + I ++E
Sbjct: 113 IFMTPDKKPFFAGTYFPKNDRMGMSGLMSILESVHNAWTTDREALLKESEYIINAINEHN 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKK 196
++ EL ++ L +L ++D+ FGGFGSAPKFP P + +L Y++K+
Sbjct: 173 ELLEQDHE--GELTEDILDKAYSELKFAFDNIFGGFGSAPKFPTPHNLFFLLRYWYNTKE 230
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
MV TL CM KGGI+DH+G GF RYS D +W VPHFEKMLYD
Sbjct: 231 ----------EYALTMVEKTLACMHKGGIYDHIGFGFSRYSTDRKWLVPHFEKMLYDNAL 280
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L+ YL+A+ TK Y+ I +I Y+ RDM P G +SAEDADS EG EG
Sbjct: 281 LSIAYLEAYQATKKRDYADIAEEIFTYVLRDMTSPEGGFYSAEDADS---EGM----EGK 333
Query: 317 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVW+ EV+ +LGE H + ++Y + P GN F+G N+ +
Sbjct: 334 FYVWSMDEVKKVLGEQHGEKYCKYYDITPHGN------------FEGFNI--------PN 373
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
K +P E+ + ECR+KLF+ R KR PH DDK++ SWNGL+I++ A ++L E
Sbjct: 374 LIKGNIPDEE-RPFIEECRKKLFEYREKRVHPHKDDKILTSWNGLMIAALAIGGRVLGKE 432
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+Y+ AE AA FI L RL +R+G S PG++DDYA
Sbjct: 433 ----------------KYITAAERAAKFISSKLVSNNG-RLLARYRDGESAFPGYVDDYA 475
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F I GL++LYE +L +++L + + F D GG F + ++ R KE +D
Sbjct: 476 FFIWGLIELYETTYKPVYLKQSLKLNDDLIKYFWDENNGGLFYYGSDSEQLITRPKETYD 535
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSVS +N +RLA + S + A F +++ AM A +
Sbjct: 536 GAIPSGNSVSTLNFLRLARLTGRSDLE---DKAYIQFKTFSRNIENFAMGHSFFLTAL-L 591
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ K VV+VG+ ++ ++M+ + + A +E D A
Sbjct: 592 FAKSKSKEVVIVGN-DKLESDSMINIIREEFRPFTLSMFYSDAQSELKDI--------AP 642
Query: 676 MARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
N S + K A +C+N++C P+TD S N +
Sbjct: 643 FIENYRSVEGKTTAYICENYTCHDPITDVSSFRNAI 678
>gi|25147430|ref|NP_495615.2| Protein B0495.5 [Caenorhabditis elegans]
gi|21264548|sp|Q09214.2|YP65_CAEEL RecName: Full=Uncharacterized protein B0495.5
gi|351065503|emb|CCD61473.1| Protein B0495.5 [Caenorhabditis elegans]
Length = 729
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 266/727 (36%), Positives = 376/727 (51%), Gaps = 61/727 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + FL +TCHWCHVME ESFE+E AK+LND FV+IKVDREERPD
Sbjct: 43 GQEAFQKAKDNNKPIFLSVGYSTCHWCHVMEKESFENEATAKILNDNFVAIKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YM +V A G GGWP+SVFL+PDL P+ GGTYFPP+D G GF TIL + W
Sbjct: 103 VDKLYMAFVVASSGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEWK 162
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K+ + L Q GA I +L + +AS N+ + + S+DSR GGFG A
Sbjct: 163 KEGESLKQRGAQII-KLLQPETASGDVNR-----SEEVFKSIYSHKQSSFDSRLGGFGRA 216
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP+ ++ ++ + +S +A + M+ TL+ MA GGIHDH+G GFHRYSV
Sbjct: 217 PKFPKACDLDFLITFAAS---ENESEKAKDSIMMLQKTLESMADGGIHDHIGNGFHRYSV 273
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDILDYLRRDMIGPGGEIF 296
WH+PHFEKMLYDQ QL Y D LT K ++ DI Y+++ GG +
Sbjct: 274 GSEWHIPHFEKMLYDQSQLLATYSDFHKLTERKHDNVKHVINDIYQYMQKISHKDGG-FY 332
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
+AEDADS ++ K EGAF W +E++ +LG+ I + +++ ++ +GN
Sbjct: 333 AAEDADSLPNHNSSNKVEGAFCAWEKEEIKQLLGDKKIGSASLFDVVADYFDVEDSGN-- 390
Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
++R SDPH E K KNVL +L A+ + + + + E + L++ R++RP PHL
Sbjct: 391 VARSSDPHGELKNKNVLRKLLTDEECATNHEISVAELKKGIDEAKEILWNARTQRPSPHL 450
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
D K++ SW GL I+ +A + ++ +Y++ AE A FI + L
Sbjct: 451 DSKMVTSWQGLAITGLVKAYQ----------------ATEETKYLDRAEKCAEFIGKFLD 494
Query: 470 DEQTHR------LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
D R G + F DDYAFLI LLDLY ++L A+ELQ
Sbjct: 495 DNGELRRSVYLGANGEVEQGNQEIRAFSDDYAFLIQALLDLYTTVGKDEYLKKAVELQKI 554
Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
D F + G GYF + D V +R+ ED DGAEP+ S++ NL+RL I+ + +
Sbjct: 555 CDVKFWN--GNGYFISEKTDEDVSVRMIEDQDGAEPTATSIASNNLLRLYDIL---EKEE 609
Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
YR+ A RL + +A+P M A + S VLVG S + +
Sbjct: 610 YREKANQCFRGASERLNTVPIALPKMAVALHRWQIGSTT-FVLVGDPKSELLSETRSRLN 668
Query: 644 ASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
+ N +V+HI EE S + + K +C+ F C PV
Sbjct: 669 QKFLNNLSVVHIQS---------EEDLSASGPSHKAMAEGPKPAVYMCKGFVCDRPVKAI 719
Query: 704 ISLENLL 710
LE L
Sbjct: 720 QELEELF 726
>gi|406859397|gb|EKD12463.1| putative DUF255 domain-containing protein [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 820
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 243/594 (40%), Positives = 343/594 (57%), Gaps = 34/594 (5%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+E +A LLN F+ +K+DRE RPD+D++YM +VQA G GGWPL+VF
Sbjct: 105 CHWCHVMERESFENEEIATLLNTHFIPVKIDREVRPDIDRIYMNFVQATTGSGGWPLNVF 164
Query: 82 LSPDLKPLMGGTYFPP-------EDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
L+PDL+P+ GGTY+P ED+ F IL+K+ W ++ + + +EQ
Sbjct: 165 LTPDLEPVFGGTYWPGHSSGTAFEDQVD---FLGILQKLSSVWREQEERCRRDSKQILEQ 221
Query: 135 LSEALSASASSNKLPDELPQNA-----LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQM 189
L + ++L D + L + S +YDS GGFG APKFP P ++
Sbjct: 222 LKSFAADGTFGSRLGDGEGGDGLDIELLEEAVQHFSSTYDSTNGGFGLAPKFPTPSKLSF 281
Query: 190 ML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
+L + + D + E Q M + TL+ MA+GG+HD VG GF RYSV W +PH
Sbjct: 282 LLRLGQYPSIVVDVVGAPECRNAQSMAVTTLRKMARGGVHDQVGNGFARYSVTADWSLPH 341
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
FEKMLYD QL +VYLDAF L++D + DI YL D+ G +S++DADS
Sbjct: 342 FEKMLYDNAQLLHVYLDAFLLSRDAELLGVVYDISTYLTTDLAHAEGGFYSSQDADSLYR 401
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
G + K+EGAFYVWT +E E++LGE+ + + TG+ ++ +D H+EF +NVL
Sbjct: 402 RGDSEKREGAFYVWTKREFENVLGENEPILSA--FFNVTGHGNVGPENDGHDEFLDQNVL 459
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSF 425
++ SA AS+ GM E+ + I+ + L R K R RP LDDK++ SWNGL + +
Sbjct: 460 AIVSTPSALASQFGMKEEEVVRIIKAGKAALRAHREKERVRPGLDDKIVTSWNGLAVGAL 519
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
AR + K F S+ E + A AA+FI+++LYD + L +R G
Sbjct: 520 ARTGGVFK--------GFDPAKSE--ELLGFAIKAATFIKQNLYDSSSKILYRIWREGRG 569
Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
GF DDYAFL+ GL+DLYE +WL WA ELQ TQ LF D GG+F+T+ P
Sbjct: 570 DTEGFADDYAFLVEGLIDLYEATFDEEWLKWADELQQTQISLFFDVNIGGFFSTSSTAPH 629
Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
++LR+K+ D +EPS N S NL RL+S++ Y + A+ +LA FE+ +
Sbjct: 630 LILRLKDGMDTSEPSTNGTSASNLYRLSSLL---NDLTYAEKAKQTLACFESEM 680
>gi|392411456|ref|YP_006448063.1| thioredoxin domain protein [Desulfomonile tiedjei DSM 6799]
gi|390624592|gb|AFM25799.1| thioredoxin domain protein [Desulfomonile tiedjei DSM 6799]
Length = 692
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 260/699 (37%), Positives = 371/699 (53%), Gaps = 65/699 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE A +N FVSIKVDREERPD+D +YMT Q + G GGWPL+
Sbjct: 49 STCHWCHVMEHESFEDEETAAAMNQSFVSIKVDREERPDLDNIYMTVCQMMTGSGGWPLN 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLS 136
V L+PDLKP GTYFP ++G+ G + ++++ W +R+ + +S A+ Q+
Sbjct: 109 VVLTPDLKPFFAGTYFPKTSRFGKIGMVELSDRIREIWQTRRNDVLESADKVTNALRQMP 168
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+A S S L L +L K +D GGF APKFP P + +L + K+
Sbjct: 169 DASSGSVQGKAL--------LEQAFTELDKRFDPARGGFSPAPKFPTPHNLLFLLRYWKR 220
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
D + KMV TL + GGI+DHVG GFHRYS D W VPHFEKMLYDQ
Sbjct: 221 TGD-------EKALKMVEKTLHALRLGGIYDHVGFGFHRYSTDTEWLVPHFEKMLYDQAL 273
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ T + FY+ ++I+ Y+ RDM P G +SAEDADS EG EG
Sbjct: 274 LTMAYTEAYQATGNEFYADTAKEIVTYVLRDMTSPQGGFYSAEDADS---EGV----EGK 326
Query: 317 FYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVWT +E+ED+LG+ A L+ Y +P GN + + G N+ L
Sbjct: 327 FYVWTLREIEDVLGQKDAALYSAVYNFEPEGNFH----DEASGQATGANIPHLLARFEEI 382
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ M + + L R KLF R +R PH DDK++ WNGL+I++ A+A+++ ++
Sbjct: 383 AATRDMTPHELHDRLRAIREKLFSTRERRVHPHKDDKILTDWNGLMIAALAKAAQVFEN- 441
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+EY E A AA F+ L DEQ RL H FR+G + +DD+A
Sbjct: 442 ---------------REYGEAARKAADFLLSTLRDEQG-RLLHRFRDGEAGLTAHVDDFA 485
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F + GLL+LYE ++L A+EL + + F D E GG++ T + ++L+R KE +D
Sbjct: 486 FFVWGLLELYETVFEPQYLAAALELNDDLLKRFWDDERGGFYFTAMDAENLLVRTKEVYD 545
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSVS++NL+RL + + + + AE F L+ A M +
Sbjct: 546 GAVPSGNSVSLLNLLRLGRMTSNPELE---SKAEQIAKAFAGTLRQFPSAYTQMLVGLEF 602
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
R + V++ + + D ML ++ NK V+ M F + + N
Sbjct: 603 --AEGRTYEVVIANSGTEDVLPMLRIIRRNFLPNKVVL---------MRFRDGKHENLLR 651
Query: 676 MAR--NNFS--ADKVVALVCQNFSCSPPVTDPISLENLL 710
+ R ++F+ +K A VC N+ C P T+P + LL
Sbjct: 652 VVRFDHDFALLENKTTAYVCVNYHCELPTTEPSRVLELL 690
>gi|308480509|ref|XP_003102461.1| hypothetical protein CRE_04116 [Caenorhabditis remanei]
gi|308261193|gb|EFP05146.1| hypothetical protein CRE_04116 [Caenorhabditis remanei]
Length = 746
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 276/743 (37%), Positives = 389/743 (52%), Gaps = 78/743 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F ++ + FL +TCHWCHVME ESFE+E AK+LN+ F++IKVDREERPD
Sbjct: 45 GEEAFKKAKESNKPIFLSVGYSTCHWCHVMEKESFENENTAKILNENFIAIKVDREERPD 104
Query: 59 VDKVYMTYV---------------QALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGR 103
VDK+YM +V QA G GGWP+SVFL+P+L P+ GGTYFPP+D G
Sbjct: 105 VDKLYMAFVVVYLNFCFTSSFSFFQAASGHGGWPMSVFLTPELHPITGGTYFPPDDNRGM 164
Query: 104 PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQ 163
GF TIL ++ W K+ D L + G I +L + +AS NK + +
Sbjct: 165 LGFSTILNMIQTEWKKEGDNLRKRGEQII-KLLQPETASGDVNK-----SEEVFQSIYSH 218
Query: 164 LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKG 223
S+DSR GGFG APKFP+ ++ ++ S KS E++ M+ TL+ MA G
Sbjct: 219 KQSSFDSRLGGFGGAPKFPKASDLDFLIAFSSADSCGDKSKEST---TMLQKTLESMADG 275
Query: 224 GIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDIL 281
GIHDH+G GFHRYSVD WHVPHFEKMLYDQ QL Y D LT K+ ++ DI
Sbjct: 276 GIHDHIGTGFHRYSVDGEWHVPHFEKMLYDQSQLLATYSDFHRLTGKKNENIKFVINDIF 335
Query: 282 DYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHY- 340
+Y+++ GG +SAEDADS + K EGAF VW +E++ +L E I + +
Sbjct: 336 EYMQKISHKEGG-FYSAEDADSLPKNDSKEKMEGAFCVWEKEEIKKLLCERKIGSADLFD 394
Query: 341 ----YLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRK 396
Y N ++ R SDPH E K KNVL +L A+ + +E+ + E ++
Sbjct: 395 VVADYFDVEDNGNVPRSSDPHGELKNKNVLRKLLTDDECAANHSLTVEELKRGIEEAKQI 454
Query: 397 LFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEV 456
L++ R+KRP PHLD K++ +W L IS +A + ++ +Y+E
Sbjct: 455 LWEARTKRPSPHLDSKMVTAWQALAISGLVKAYQ----------------ATEDVKYIER 498
Query: 457 AESAASFIRRHLYDEQTHRLQHS--------FRNGPSKAPGFLDDYAFLISGLLDLYEFG 508
AE A+F+R++L E+ L+ S G F DDYAF+I GLLDLY
Sbjct: 499 AEKCAAFVRKYL--EENGELKRSVYLGVEGNIEQGHQNMKAFSDDYAFMIQGLLDLYTVL 556
Query: 509 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 568
++L AIELQ T D+ F G GYF + D V +R+ ED DGAEP+ S++ N
Sbjct: 557 GKNEYLEKAIELQKTCDQKFWS--GNGYFISEQADEGVSVRMVEDQDGAEPTATSIASNN 614
Query: 569 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 628
L+RL I+ ++D YR+ A RL +A+P M A S VLVG
Sbjct: 615 LLRLHDIL---ENDEYREKANKCFRGASERLNKFPIALPKMAVALHRWQNGSTT-FVLVG 670
Query: 629 HKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVA 688
+FE+ L A LN+ +I + + E+ + + N S A
Sbjct: 671 -----EFESEL-LVEARRRLNEKLIE----NLSVVHIRSENEIGASGPSHNAMSQGPQPA 720
Query: 689 L-VCQNFSCSPPVTDPISLENLL 710
+ +C+ F+C P+ +L+ L
Sbjct: 721 VYMCKGFACGLPIRSIDALDKLF 743
>gi|312385290|gb|EFR29828.1| hypothetical protein AND_00943 [Anopheles darlingi]
Length = 874
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 268/709 (37%), Positives = 372/709 (52%), Gaps = 69/709 (9%)
Query: 30 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 89
V+ F++E VA+++N+ F+++K+DREERPD+DK+YM ++ + G GGWP+SV+L+PDL P+
Sbjct: 186 VDCFQNEEVARIMNENFINVKLDREERPDIDKLYMMFILLINGSGGWPMSVWLTPDLAPI 245
Query: 90 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 149
GGTYFPP D++G PGF T+L K+ W R+ L ++G IE + + S
Sbjct: 246 TGGTYFPPNDRWGMPGFTTVLTKLAAKWASDREDLVRTGRSVIEAIKRNVDQKQGSGNGD 305
Query: 150 DELPQNALRLCAEQL-----------SKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+E A+ E L ++YD +GG APKFP ++ +M +H E
Sbjct: 306 EEDGAAAVAAAGETLEAKFRQAINLYQRNYDPVWGGSLGAPKFPEAAKLNLM-FHLHVQE 364
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
K +VL TL MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQL
Sbjct: 365 PKHKI------LGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLL 418
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++Y + + LT Y + I YL +D+ PGG +S EDADS T + K EGAFY
Sbjct: 419 SLYANGYRLTHKPLYLTVADAIYRYLCKDLRHPGGGFYSGEDADSLPTADSDVKVEGAFY 478
Query: 319 VWTSKEVEDILGEHAI-----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
WT EV++ L A ++ EHY +K TGN + + SDPH GKN+ I
Sbjct: 479 AWTYAEVKETLERGAAKFGDTTVSPIEVYAEHYDIKETGNVEPA--SDPHGHLLGKNIPI 536
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
+A K G E +L L +VR +RPRPHLD K+I +WNGLV+S +
Sbjct: 537 VYGSVRETAEKCGTRPEIVERVLRVANELLHEVREQRPRPHLDTKIICAWNGLVLSGLSH 596
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNG--- 483
+ + + DR +Y+ AE F+R +LYD Q +L S + NG
Sbjct: 597 LACVHDA-------------PDRSKYLATAEELVKFVRANLYDVQARKLLRSCYGNGEET 643
Query: 484 -PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
S+ P GF+DDYAFLI GL+D Y L WA ELQ+ QDELF D + G YF +
Sbjct: 644 LASERPIYGFIDDYAFLIRGLIDYYVASLDEHRLHWAKELQDIQDELFWDPKHGAYFYSE 703
Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
P V +R+KEDHDGAEP GNSV+ NL+ L + + ++ A A F +
Sbjct: 704 ANSPHVAVRLKEDHDGAEPCGNSVAGHNLLLLHDYF---EEERLKERARKLFAYF-SESS 759
Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI---DP 657
+P M AA L KH ++V S + ++ A Y ++ + P
Sbjct: 760 PFGYVLPEMMSAA--LVEEHGKHTLIVVGPESPEATALVDAVRRFYIPGMIIVQLKIDKP 817
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
A E + +N M +N A +C N C PVT+P L
Sbjct: 818 AHIER----RRKSLDNFKMVKN-----MPTAYICHNRVCHLPVTEPERL 857
>gi|225181777|ref|ZP_03735215.1| protein of unknown function DUF255 [Dethiobacter alkaliphilus AHT
1]
gi|225167551|gb|EEG76364.1| protein of unknown function DUF255 [Dethiobacter alkaliphilus AHT
1]
Length = 697
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 262/694 (37%), Positives = 370/694 (53%), Gaps = 55/694 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA+ LN FV IKVDREERPD+D +YM QA+ G GGWPL+
Sbjct: 55 STCHWCHVMERESFEDEEVARELNRVFVCIKVDREERPDIDNIYMAVCQAMTGSGGWPLT 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ +SPD +P GTYFP + +GR G + ++++ W RD + + S
Sbjct: 115 IVMSPDKRPFFAGTYFPKKTSFGRMGVIDLAQRIEMLWKTSRDKINSTAD------SVMT 168
Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
S A S P +LP + AL+ +L +D GGFG APKFP P + +L + K
Sbjct: 169 SLQAMSKVTPGDLPGEEALQGGFAKLEGRFDPDHGGFGYAPKFPSPHNLTFLLRYWK--- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+SG A + +MV TL MA+GG++DH+G GFHRYS D W +PHFEKMLYDQ LA
Sbjct: 226 ---RSGNA-KALEMVEKTLLAMARGGVYDHIGFGFHRYSTDREWLLPHFEKMLYDQALLA 281
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL+A+ T Y+ R+I Y+ RDM P G +SAEDADS EG +EG FY
Sbjct: 282 VTYLEAYQATGKEVYAQTAREIFGYVLRDMTSPQGGFYSAEDADS---EG----EEGKFY 334
Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VW + E+ ILGE A +F Y ++ GN + + G N+ A
Sbjct: 335 VWETNEIVHILGEADAAIFNAAYNIREDGNF----TDETTGKKTGANIPHLRKTYQELAQ 390
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+L + + + L R+KLF VR KR PH DDK++ WNGL+I++ A +IL E
Sbjct: 391 ELSLEPNELKDRLEAMRQKLFAVRKKRIHPHKDDKILTDWNGLMIAALAMGGRILNDE-- 448
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y + A+ AA FI HL ++ RL FR + P LDDYAF
Sbjct: 449 --------------NYNKSAKKAAGFILSHL--KKDGRLLKRFREDEASLPAHLDDYAFF 492
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ GL++LYE T +L A+ L T + F D + G ++ T + VL+R +E +DGA
Sbjct: 493 VWGLIELYETTFDTDFLKEALSLNKTMIKHFWDHDNGSFYFTADDAEDVLVRHRELYDGA 552
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ +N +RL I ++ + Q AE F ++ + M A + ++
Sbjct: 553 VPSGNSVAAMNNLRLGRITGNTELE---QIAEKIARAFTDEIEKVPQGYTQMLSAINFMA 609
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HIDPADTEEMDFWEEHNSNNASM 676
PS + +V+ G + D ++ML +++ NK V+ H +E++ + S+
Sbjct: 610 GPSLE-IVIAGEAQAQDTKDMLQKLCSTFVPNKVVVLHPGGKKAKEIEELAPYTRRQQSI 668
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K A VC+NFSC PVTD + +LL
Sbjct: 669 ------EGKATAYVCRNFSCQAPVTDADKMLSLL 696
>gi|452845430|gb|EME47363.1| hypothetical protein DOTSEDRAFT_41782 [Dothistroma septosporum
NZE10]
Length = 734
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 253/607 (41%), Positives = 341/607 (56%), Gaps = 39/607 (6%)
Query: 11 KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
+T R F+ + CHWCHVM ESF+D +A+LLN++FV IK+DREERPD+D+ YM ++
Sbjct: 48 QTNRLLFVSIGYSACHWCHVMAHESFDDPRIAQLLNEYFVPIKIDREERPDIDRQYMDFL 107
Query: 68 QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKYGRPG---FKTILRKVKDAWDKKRDM 123
QA GGGGWPL+VF++PDL+P+ GGTY+P P + G F+ IL KV W ++ +
Sbjct: 108 QATSGGGGWPLNVFVTPDLEPIFGGTYWPGPRSDRAQMGGTTFEDILLKVSSMWKEQEER 167
Query: 124 LAQSGAFAIEQLSE-----ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
L SG +QL E + D L + L + K YD +FGGFG+A
Sbjct: 168 LRASGKEITKQLREFAQEGHIGGRDGKGDDNDGLELDLLDDAFQHYKKRYDRKFGGFGAA 227
Query: 179 PKFPRPVEIQMMLY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
PKFP PV I+ +L+ + K++ + E+ E + M + +L+ MAKGGI D +G GF R
Sbjct: 228 PKFPTPVHIRPLLHVACYPKEVREIVGEDESIEVRAMAVKSLENMAKGGIKDQIGHGFAR 287
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGE 294
YSV W +PHFEKMLYD QL VYL+A+ LTK + DI YL M G
Sbjct: 288 YSVTRDWSLPHFEKMLYDNAQLLPVYLEAYMLTKSQLFLETTHDIAKYLTSAPMASDLGG 347
Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRM 353
I SAEDADS T K+EGA+YVWT E + IL + + Y+ +K GN D +
Sbjct: 348 ICSAEDADSLPTAIDHHKREGAYYVWTMDEFKKILTDEEVKVCSAYWGVKSEGNID--KQ 405
Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDK 412
D E G+N L ++ + A +L M E L R KL R K RPRP LDDK
Sbjct: 406 HDIQGELVGQNTLCVQHEPAELARELSMSEEDVKRTLANGREKLLAYRQKDRPRPALDDK 465
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
++ SWNGL + ARA A P EY+ AE A + IR L+DE+
Sbjct: 466 IVTSWNGLAVGGLARAG---------AALGVP-------EYIAAAEKAVNCIRAQLFDEK 509
Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
L+ +R GP + GF DDYAFLISGLLDLYE ++WL +A LQ TQ +LF D E
Sbjct: 510 AKTLKRVYREGPGETQGFADDYAFLISGLLDLYESTFDSQWLEFADILQQTQTKLFWDEE 569
Query: 533 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
G+F+T P +L R K+ D AEPS N VS +NL RL S++ + Y + + ++
Sbjct: 570 KFGFFSTPANQPDILFRTKDAMDNAEPSVNGVSAMNLFRLGSLLYDAT---YEKMGKRTV 626
Query: 593 AVFETRL 599
A F+ +
Sbjct: 627 AAFDVEI 633
>gi|396464920|ref|XP_003837068.1| similar to DUF255 domain-containing protein [Leptosphaeria maculans
JN3]
gi|312213626|emb|CBX93628.1| similar to DUF255 domain-containing protein [Leptosphaeria maculans
JN3]
Length = 748
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 252/625 (40%), Positives = 342/625 (54%), Gaps = 28/625 (4%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE++ VAK+LN+ ++ IKVDREERPDVD++YM YVQAL G GGWPL+ F
Sbjct: 68 CHWCHVMERESFENQEVAKILNESYIPIKVDREERPDVDRIYMNYVQALTGRGGWPLNAF 127
Query: 82 LSPDLKPLMGGTYFP---PEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--- 135
L+PDL+P+ GGTYF G F +L K++D W +R S ++L
Sbjct: 128 LTPDLQPIFGGTYFAGPGSTTALGAQPFVAVLEKIRDLWTDQRQRCLDSAREETKKLIDF 187
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
++ + S D L L + YD GFG APKFP P +Q +L S+
Sbjct: 188 AQDGNISRQGGAEHDGLELELLDDALSHFKRKYDPVNAGFGDAPKFPTPSNLQFLLKLSR 247
Query: 196 ---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
+ + + + + + MVL TL M KGGIHD +G GF RYSV + W +PHFEKMLY
Sbjct: 248 YPTAVTELLGADDCTLAKTMVLKTLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKMLY 307
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATR 311
D QL V+LDA+ LTK + DI YL M G FS+EDADS
Sbjct: 308 DHAQLLPVFLDAYLLTKSAAHLSAVHDIATYLTSPPMHAEHGGFFSSEDADSLYRPNDKE 367
Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
K+EGAFYVWT E +DILGE A + +Y ++ GN D H+E +NVL
Sbjct: 368 KREGAFYVWTLTEFQDILGERDAEILARYYNVRDEGNVHPEH--DAHDELINQNVLAIST 425
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARAS 429
S A + G+ E+ IL R+KL R K RPRP LDDK++VSWNGL I + AR +
Sbjct: 426 TPSDLAKQFGLSEEEVHRILTSGRQKLLFHRDKERPRPALDDKIVVSWNGLAIGALARTA 485
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
L S +A Y+ AE AA+F++ +LYD + L +R GP + PG
Sbjct: 486 AALSSSEPTASHT----------YLAAAEKAATFLKENLYDPSSQTLTRVYREGPGETPG 535
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
F DDYA+LISGL+DLY+ +L WA +LQ +Q LF D + G+F+T +++R
Sbjct: 536 FADDYAYLISGLIDLYQTTFNDSYLQWADDLQQSQIRLFWDTKHLGFFSTPAGQSDLIMR 595
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+K+ D AEP N VS NL RL +++ + + Y + A + + FE L P +
Sbjct: 596 LKDGMDNAEPGTNGVSAQNLDRLGALL---EDEAYSKRARETASAFEAELMQHPFLFPSL 652
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVD 634
A + + R H V+ G V+
Sbjct: 653 MDAVVVGRLGIR-HSVITGEGRRVE 676
>gi|78043330|ref|YP_360543.1| hypothetical protein CHY_1723 [Carboxydothermus hydrogenoformans
Z-2901]
gi|77995445|gb|ABB14344.1| conserved hypothetical protein [Carboxydothermus hydrogenoformans
Z-2901]
Length = 686
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 263/696 (37%), Positives = 372/696 (53%), Gaps = 63/696 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA LLN FV+IKVDREERPDVD++YMT QA+ G GGWPL+
Sbjct: 50 STCHWCHVMERESFEDEEVADLLNKHFVAIKVDREERPDVDQIYMTACQAMTGQGGWPLT 109
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++P+ KP GTYFP K+GRPG IL ++ W+ R+ L ++L E +
Sbjct: 110 IIMTPEKKPFFAGTYFPKRSKWGRPGLMEILTEIVKLWETDREQLLTIS----KRLYEFM 165
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S K +L + L + +DS +GGFG APKFP P + +L + K+
Sbjct: 166 QTIPQSKK--GDLTEEVLEKAYREFLGRFDSEYGGFGPAPKFPTPHNLIFLLRYWKR--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ +K TL+ MA+GGI+DHVG GFHRYS D W VPHFEKMLYD LA
Sbjct: 221 TGEEKALFMAEK----TLEAMARGGIYDHVGYGFHRYSTDREWLVPHFEKMLYDNALLAY 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+A+ TK Y+ I R++ Y++R M P +SAEDADS EG EG +YV
Sbjct: 277 TYLEAYQATKKEKYARIAREVFTYVKRKMTSPERGFYSAEDADS---EGV----EGKYYV 329
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
WT EV+ +LG E LF Y + P GN F+GKN+ LI D A
Sbjct: 330 WTPDEVKKVLGPEEGELFCRVYDITPEGN------------FEGKNIPNLIH-TDIELVA 376
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
++G + L R+KL+ R KR P DDK++ SWNGL+I++ A+ +++L+ +
Sbjct: 377 QEIGKSAAELTESLDRMRQKLYHEREKRVLPLKDDKILTSWNGLMIAALAKGARVLQDQ- 435
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
E + +A +AA FI L RL +R G + +LDDYAF
Sbjct: 436 ---------------ELLNMAHNAAEFIFSKL-RRADGRLIARYREGEAAVLAYLDDYAF 479
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI GL++LYE +L A+EL +LF D + GG F T + ++ R KE +DG
Sbjct: 480 LIWGLIELYEASFEVWYLKLAVELTREMLKLFWDEKHGGLFFTGADGEELITRPKEIYDG 539
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV+ +NL+RL+ ++ + + Q A L+ F ++ ++ A A +
Sbjct: 540 ALPSGNSVAALNLLRLSRMLG---EEDFLQKAVEILSTFAGKVSEIPSAHSFYLLAY-LF 595
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT-EEMDFWEEHNSNNAS 675
+ K +V+ G D M+ + +Y N V+ D +E+ H ++ S
Sbjct: 596 YLGPVKEIVVAGEPDGEDTRAMIEKINLAYLPNSVVLFHPIGDAGQEIREIIPHIADKKS 655
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ ++ VC+NFSC PV + LE L+
Sbjct: 656 LI-----GERATVYVCENFSCKAPVVEVEMLEEYLM 686
>gi|168186605|ref|ZP_02621240.1| thymidylate kinase [Clostridium botulinum C str. Eklund]
gi|169295490|gb|EDS77623.1| thymidylate kinase [Clostridium botulinum C str. Eklund]
Length = 693
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 251/690 (36%), Positives = 370/690 (53%), Gaps = 73/690 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVME ESFEDE VAKLLND ++SIKVDREERPDVD +YMT+ QA+ G GGWPL++
Sbjct: 62 SCHWCHVMEKESFEDEEVAKLLNDKYISIKVDREERPDVDNIYMTFCQAVTGSGGWPLTI 121
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP + YGRPG IL ++ D W+ RD + + + + E S
Sbjct: 122 IMAPDQKPFFAGTYFPKKRMYGRPGLIQILNQIADEWENNRDGVINASNELLNTMKEHTS 181
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S E+ +N L+ +++ YD +GGFG APKFP P ++ ++L + K+ +
Sbjct: 182 QDKSG-----EINENVLQDAIKEMKHYYDESYGGFGIAPKFPTPHKLMLLLTYYKEYNN- 235
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
MV TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD LA V
Sbjct: 236 ------KIALHMVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYV 289
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y + +T +FY + I Y+ RDM P G +SAEDADS EG EG FY+W
Sbjct: 290 YTQTYQITGKLFYKEVAEKIFTYVLRDMTSPEGGFYSAEDADS---EGV----EGKFYLW 342
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T EVE+IL E A F Y + GN F+G N+ + +G
Sbjct: 343 TLHEVENILKEDAKEFCNTYDITKGGN------------FEGSNI----------PNLIG 380
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
LE + L R+KLF VR KR P DDK++ +WN L+IS+ A A ++ +++
Sbjct: 381 KDLEN-TDKLENLRKKLFQVREKRVHPFKDDKILTAWNALMISALAYAGRVFENQ----- 434
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
EY++ A+ A +FI +L + RL FR+G + +++DY+FL+
Sbjct: 435 -----------EYIDRAKEAYNFIENNLI-RKDGRLLARFRHGEAAYIAYIEDYSFLVWA 482
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LL+LYE +K+L A++ + +LF D E G+F++ + ++L +K+ +D A PS
Sbjct: 483 LLELYEATFESKFLKEALQFTDEMIKLFWDEESYGFFHSGKDGEKLILNLKDSYDTAIPS 542
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
GNSV+ +NL++L+ I + + A L F +K+ + + PS
Sbjct: 543 GNSVAAMNLIKLSKITGDNS---LGEKAYKMLEGFGGNIKESLQSHSIFLMVYMNYIRPS 599
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
K +++ K F++M+ + + + T + ++ + E + S+
Sbjct: 600 -KQIIIASKKEDKVFKDMIREVNKRF-MPFTTVLLNDGNLENII---------PSIKDER 648
Query: 681 FSADKVVALVCQNFSCSPPVTDPISLENLL 710
+K A VC+NFSC+ PV + LL
Sbjct: 649 KVDNKTTAYVCENFSCNRPVDNIKEFIKLL 678
>gi|134119086|ref|XP_771778.1| hypothetical protein CNBN2230 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50254378|gb|EAL17131.1| hypothetical protein CNBN2230 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 748
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 265/706 (37%), Positives = 389/706 (55%), Gaps = 41/706 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHV+ ESFEDE AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S
Sbjct: 66 SACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMS 125
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+F++P L+P GTYFP RP F +L K+ + W++ R+ + G IE L +
Sbjct: 126 IFMTPKLEPFFAGTYFP------RPNFHQLLNKIHEVWEEDREKCEKMGKGVIEVLKDMS 179
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA------PKFPR-PVEIQMMLY 192
+S L L + QLS D+R+GGF ++ PKFP + ++ +
Sbjct: 180 HTGRTSESLSQLLASSPASKLFSQLSTMNDTRYGGFTNSGSSTRGPKFPSCSITLEPLAR 239
Query: 193 HSKKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ ++ E E ++M + L+ M GGI D VGGG RYSVDE+W VPHFEKML
Sbjct: 240 LASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKML 299
Query: 252 YDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
YDQ QL + LD L +D Y + DIL Y RD+ P G +SAEDADSAE
Sbjct: 300 YDQAQLVSSCLDFARLYPVDHQDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEY 359
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
+GA +K EGAFY+W E++++LG+ A LF + ++P GN D+ + D H E +GKN+L
Sbjct: 360 KGA-KKSEGAFYIWKKTEIDEVLGDDAPLFNSFFGVQPDGNVDI--IHDSHGEMRGKNIL 416
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
+ A + G ++ I+ + KL R +R RP LDDK++ +WNGL++++ +
Sbjct: 417 HQHKTYEEVALEFGKREDQAKGIIIQACEKLRLKREERERPGLDDKILTAWNGLMLTALS 476
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
+AS +L P R + + A +F++ H++D T L S+R G K
Sbjct: 477 KASTLL-----------PPSYGIRSQCLPAALGIVNFVKSHMWDSSTRTLTRSYREG--K 523
Query: 487 AP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
P DDYAFL+ GLL+LYE +++A ELQ QDELF D GGYF + ED
Sbjct: 524 GPQAQTDDYAFLVQGLLNLYEATGDESHVLFAEELQKRQDELFWDDHDGGYF-ASAEDAH 582
Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
VL+R+K+ DGAEPS +VS NL R + +++ S+ + Y AE + + A
Sbjct: 583 VLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLS-SEFENYEARAEATFLSMGPLITQAPRA 641
Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
V L R+ V+++G S + L AA +Y N+ ++ I P + +
Sbjct: 642 VGYAVSGLIDLEKGYRE-VIVIGSASDEVVKKFLEAARKTYFSNQVIVQIQPENLPK-GL 699
Query: 666 WEEHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
E++ A + +K +L VC+ +C PV D +NLL
Sbjct: 700 AEKNEVVKALVNDVESGKEKAASLRVCEGGTCGLPVKDLEGAKNLL 745
>gi|158521543|ref|YP_001529413.1| hypothetical protein Dole_1532 [Desulfococcus oleovorans Hxd3]
gi|158510369|gb|ABW67336.1| protein of unknown function DUF255 [Desulfococcus oleovorans Hxd3]
Length = 641
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 248/610 (40%), Positives = 338/610 (55%), Gaps = 50/610 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESF D A L+N FV +KVDREERPD+D++YMT V A+ G GGWPL+V
Sbjct: 55 TCHWCHVMAHESFSDPDTAALMNAHFVCVKVDREERPDIDRLYMTAVSAITGSGGWPLNV 114
Query: 81 FLSPD-LKPLMGGTYFPPEDKYGRPG------FKTILRKVKDAW---DKKRDMLAQSGAF 130
FL P L P GGTYFPP RPG + +L+++ DAW DK+ +LA + +
Sbjct: 115 FLEPHALAPFFGGTYFPP-----RPGRTLMITWPDLLQQIADAWENPDKRSSLLASADSI 169
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L AL+ + D + + + YDS+ GGFG APKFP P I +
Sbjct: 170 TTF-LESALTGTRHRPAEGDAELTGIYKKALDAFTGMYDSQSGGFGPAPKFPMPAIINFL 228
Query: 191 LY--HSKKLEDTG-KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + D G + + + M + TL MA+GGI+D +GGGFHRYS DERWH+PHF
Sbjct: 229 LACAATDPAADLGLDTRQREKALGMAIHTLSAMARGGIYDQLGGGFHRYSTDERWHLPHF 288
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYIC--RDILDYLRRDMIGPGGEIFSAEDADSAE 305
EKMLYD QL DA++LT++ S +C R DY+ ++M P G +SA+DADS E
Sbjct: 289 EKMLYDNAQLLACLADAYALTEN--NSLLCRARQTADYILKEMTHPEGGFYSAQDADSPE 346
Query: 306 TEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPH-NEFKGK 363
+ GA +K EGAFYVW ++E+E +L A LF H+ ++P GN +S PH EF K
Sbjct: 347 SAGAGKKVEGAFYVWEAREIESLLDAPAAKLFMSHFGVRPEGN-----VSGPHAAEFSHK 401
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
NVL +A G+ ++ ++L R+ L R RP P DDK+I +WNGL+IS
Sbjct: 402 NVLYGTGPVDQAAKTFGLSEQETQDLLQTARQTLLAHRKHRPAPDTDDKIITAWNGLMIS 461
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
A+ ++ + +Y + A AA FI+ HLYD QTH L +R G
Sbjct: 462 GLAKLYRVTR----------------EAQYRDGAVKAARFIQTHLYDPQTHHLARIWRAG 505
Query: 484 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT-TGE 542
++ G +DYAFL GL+DLYE + WL WAI+L F D + GG F T G
Sbjct: 506 EARIDGMAEDYAFLAQGLIDLYEANADAFWLAWAIDLSEEVLASFYDSKNGGIFMTGKGH 565
Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 602
DP +LLR+KED D PS SV+ N RL++ ++D + A ++ L++
Sbjct: 566 DPHLLLRMKEDTDNVMPSAGSVAARNFYRLSAYTG--RND-FSDAARATINALIPLLEEH 622
Query: 603 AMAVPLMCCA 612
A PL+ A
Sbjct: 623 PSAAPLLLTA 632
>gi|253681418|ref|ZP_04862215.1| dTMP kinase [Clostridium botulinum D str. 1873]
gi|253561130|gb|EES90582.1| dTMP kinase [Clostridium botulinum D str. 1873]
Length = 671
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 246/683 (36%), Positives = 368/683 (53%), Gaps = 75/683 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVME ESFEDE VAK+LND ++SIKVDREERPDVD YMT+ QA+ G GGWPL++
Sbjct: 54 SCHWCHVMEKESFEDEEVAKILNDKYISIKVDREERPDVDNTYMTFCQAVTGSGGWPLTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++P+ KP GTYFP + YGRPG IL+++ D W +D + + + + E +S
Sbjct: 114 IMTPEQKPFFAGTYFPKKSMYGRPGIIQILKQISDEWKNNKDNIINTSNKLLNTMKERVS 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+E+ ++ L +++ YD+++GGFG APKFP P ++ ++L + K D
Sbjct: 174 QDKW-----EEINESILHDAIMEMNYYYDNKYGGFGIAPKFPTPHKLMLLLIYYKVYNDK 228
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G MV TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD LA V
Sbjct: 229 SALG-------MVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYV 281
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A+ +T FY + I Y+ RDM P G +SAEDADS EG EG FYVW
Sbjct: 282 YTEAYQVTGKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYVW 334
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+ +E++ ILGE A F Y + GN F+GKN+ + +G
Sbjct: 335 SLEEIQSILGEDAKEFCNTYDITEKGN------------FEGKNI----------PNLIG 372
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
LE ++ L + R KLF VR KR P DDK++ +WN L+I S + A ++
Sbjct: 373 KDLEN-IDKLKDLRNKLFKVREKRVHPFKDDKILTAWNALMIVSLSYAGRVF-------- 423
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+ KEY+ ++ A FI +L + RL FR+G + +L+DY+FL+
Sbjct: 424 --------ENKEYINRSKKAYDFIENNLI-RKDGRLLARFRHGEAAYIAYLEDYSFLVWA 474
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L++LYE + +L A+ + +LF D E G+F++ + ++L +K+ +D A PS
Sbjct: 475 LMELYEATFESNYLKQALNFTDKMIKLFWDEESYGFFHSGRDGEKLILNLKDSYDTAIPS 534
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
GNSV+ +NL++L+ I + + A F +K+ + + + PS
Sbjct: 535 GNSVAAMNLIKLSKITGDNSLG---EKAYKMFQCFGGNIKESLQSHSIFLISYMNYIKPS 591
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
R+ +V+ K F+ M+ + + + T+I ++ + E N ++
Sbjct: 592 RQ-IVIASEKEDRLFKEMIKEVNKRF-MPFTIILLNDGNLE----------NIVPFIKDE 639
Query: 681 FSAD-KVVALVCQNFSCSPPVTD 702
D K A +C+NFSC+ PV +
Sbjct: 640 KKIDNKTTAYICENFSCNKPVYN 662
>gi|58262588|ref|XP_568704.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57230878|gb|AAW47187.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 773
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 265/706 (37%), Positives = 387/706 (54%), Gaps = 41/706 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHV+ ESFEDE AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S
Sbjct: 91 SACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMS 150
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+F++P L+P GTYFP RP F +L K+ + W++ R+ + G IE L +
Sbjct: 151 IFMTPKLEPFFAGTYFP------RPNFHQLLNKIHEVWEEDREKCEKMGKGVIEVLKDMS 204
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF---GSAPKFPRPVEIQMMLYHSKK 196
+S L L + QLS D+R+GGF GS+ + P+ + L +
Sbjct: 205 HTGRTSESLSQLLASSPASKLFSQLSTMNDTRYGGFTNSGSSTRGPKFPSCSITLEPLAR 264
Query: 197 LEDTGKSGEAS-----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
L G + + ++M + L+ M GGI D VGGG RYSVDE+W VPHFEKML
Sbjct: 265 LASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKML 324
Query: 252 YDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
YDQ QL + LD L +D Y + DIL Y RD+ P G +SAEDADSAE
Sbjct: 325 YDQAQLVSSCLDFARLYPVDHQDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEY 384
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
+GA +K EGAFY+W E++++LG+ A LF + ++P GN D+ + D H E +GKN+L
Sbjct: 385 KGA-KKSEGAFYIWKKTEIDEVLGDDAPLFNSFFGVQPDGNVDI--IHDSHGEMRGKNIL 441
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
+ A + G ++ I+ + KL R +R RP LDDK++ +WNGL++++ +
Sbjct: 442 HQHKTYEEVALEFGKREDQAKGIIIQACEKLRLKREERERPGLDDKILTAWNGLMLTALS 501
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
+AS +L P R + + A +F++ H++D T L S+R G K
Sbjct: 502 KASTLL-----------PPSYGIRSQCLPAALGIVNFVKSHMWDSSTRTLTRSYREG--K 548
Query: 487 AP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
P DDYAFL+ GLL+LYE +++A ELQ QDELF D GGYF + ED
Sbjct: 549 GPQAQTDDYAFLVQGLLNLYEATGDESHVLFAEELQKRQDELFWDDHDGGYF-ASAEDAH 607
Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
VL+R+K+ DGAEPS +VS NL R + +++ S+ + Y AE + + A
Sbjct: 608 VLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLS-SEFENYEARAEATFLSMGPLITQAPRA 666
Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
V L R+ V+++G S + L AA +Y N+ ++ I P + +
Sbjct: 667 VGYAVSGLIDLEKGYRE-VIVIGSASDEVVKKFLEAARKTYFSNQVIVQIQPENLPK-GL 724
Query: 666 WEEHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
E++ A + +K +L VC+ +C PV D +NLL
Sbjct: 725 AEKNEVVKALVNDVESGKEKGASLRVCEGGTCGLPVKDLEGAKNLL 770
>gi|85858097|ref|YP_460299.1| thymidylate kinase [Syntrophus aciditrophicus SB]
gi|85721188|gb|ABC76131.1| thymidylate kinase [Syntrophus aciditrophicus SB]
Length = 691
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 265/695 (38%), Positives = 365/695 (52%), Gaps = 63/695 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFE+E VA+LLN+ F+SIKVDREERPD+DK+YM Q L GGGGWPL+
Sbjct: 58 STCHWCHVMAHESFENEEVARLLNESFISIKVDREERPDIDKLYMAVCQLLTGGGGWPLT 117
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD +P GTY P E + G G ++ + + W K+R+ + ++ +++ AL
Sbjct: 118 ILMTPDRRPFYAGTYIPRESRSGMVGMLVLIPGLSEVWRKERNRILETAG----EITTAL 173
Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
P ELP L + L + +D+R+GGF SAPKFP M HS L
Sbjct: 174 QGMDQGG--PGELPLDRVLHEAYDDLRRRFDARYGGFDSAPKFP-------MAQHSFFLL 224
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G+ E S+ +V TLQ M +GGI+D VG GFHRYS D +W +PHFEKMLYDQ LA
Sbjct: 225 RYGRRQENSQALAIVEKTLQSMRRGGIYDAVGFGFHRYSTDAQWRLPHFEKMLYDQALLA 284
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +AF Y R+IL Y+ RDM P G +SAEDAD+A +EGAFY
Sbjct: 285 MAYTEAFQAAGQSLYKKTAREILTYVLRDMTAPEGGFYSAEDADTA-------GEEGAFY 337
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS- 377
+WT++E+ +L Y P G GK ++ + S S
Sbjct: 338 LWTAEELRQVLPTEEAELMIRVYAIPEG---------------GKPSVLHCSSSYPELSV 382
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
L +P E+ L L R+KLF R+KR RP DDK++ WNGL+I++ ARA+ +
Sbjct: 383 DLDLPEERLLERLESARQKLFLQRAKRIRPLRDDKILTDWNGLMIAAMARAAAV------ 436
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
F PV Y++ A A FI +L D + RL H +R G + P LDDYAFL
Sbjct: 437 ---FEEPV-------YLQAAREAVRFILENLRDPRG-RLLHRWREGEAAMPAVLDDYAFL 485
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I GL++ YE L A+ L F D GGYF T + S+L+R KE +DGA
Sbjct: 486 IWGLIEAYEATFDANLLQTALSLDEELTAHFWDNASGGYFYTPDDGESLLVRQKESYDGA 545
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+++NL+RL+ + + + + A + F ++ ++ A A D L+
Sbjct: 546 IPSGNSVAMLNLLRLSRLTGQAGLE---ERAVATAQAFADSIRSLSAAHTSFMVALDYLA 602
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
PS VV+ G D +ML ++ + TV+ I D E M
Sbjct: 603 GPS-AEVVIAGSPEGTDTRDMLRELRRAFLPHVTVLLI--PDEGEKGMLAGVAEFTGGMT 659
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
R + + A VC+NFSC P TDP + LL E
Sbjct: 660 RID---GRATAYVCRNFSCRKPTTDPAEMTTLLRE 691
>gi|398407269|ref|XP_003855100.1| hypothetical protein MYCGRDRAFT_99250 [Zymoseptoria tritici IPO323]
gi|339474984|gb|EGP90076.1| hypothetical protein MYCGRDRAFT_99250 [Zymoseptoria tritici IPO323]
Length = 750
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 256/611 (41%), Positives = 331/611 (54%), Gaps = 38/611 (6%)
Query: 11 KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
KT R F+ + CHWCHVME ESF D +A+LLN+ F+ IK+DREERPD+D+ YM ++
Sbjct: 48 KTNRLLFVSIGYSACHWCHVMEHESFSDSRIAQLLNEHFIPIKIDREERPDIDRQYMDFL 107
Query: 68 QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKYGR-----PGFKTILRKVKDAWDKKR 121
QA GGGGWPL+VF++PDL+P+ GGTY+P P + R F+ +LRKV AW ++
Sbjct: 108 QATSGGGGWPLNVFVTPDLEPIFGGTYWPGPNSERARSRAAGTTFEDVLRKVSTAWKEQE 167
Query: 122 DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL------CAEQLSKSYDSRFGGF 175
+ QL E + + +N E YD++ GGF
Sbjct: 168 QKCRANAKDITRQLREYAQEGMLGGRDGKQTDENDGLELDLLDDAYEHYKGRYDAKCGGF 227
Query: 176 GSAPKFPRPVEIQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
G APKFP PV I+ +L Y E G+ + E ++M + TL+ MAKGGI D +G
Sbjct: 228 GGAPKFPTPVHIKPLLRVANYPHVVREIVGEE-DCQEARRMAVHTLESMAKGGIKDQIGH 286
Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIG 290
GF RYSV W +PHFEKMLYD QL VYLDA+ LTK DI YL M+
Sbjct: 287 GFARYSVTRDWSLPHFEKMLYDNAQLLPVYLDAWILTKSPLLLESVNDIATYLTSPPMVS 346
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCD 349
G IFSAEDADS T K+EGAFYVW E + IL E + Y+ ++ GN D
Sbjct: 347 ELGGIFSAEDADSLPTPQDKHKREGAFYVWMMDEFKSILSEEEVTVCAKYWGVQAQGNVD 406
Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPH 408
R D E G+N L + A +L E+ + R KL R K RPRP
Sbjct: 407 --RRFDLQGELVGQNTLCVQYEIPELAQELSKSEEQITQTIQSGRSKLLAHREKNRPRPA 464
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
LDDK++ SWNGL I AR S L+ + Y+ A A + I+ HL
Sbjct: 465 LDDKIVTSWNGLAIGGLARTSSALRY----------ISPEPAAAYLAAALKATNCIKTHL 514
Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
+D T+ L+ +R GP + PGF DDYAFLISGLLDLYE + WL WA LQ TQ LF
Sbjct: 515 FDPSTNALKRVYREGPGETPGFADDYAFLISGLLDLYEATWDSNWLQWADTLQQTQTRLF 574
Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
D E G+F+T P +L+RVK+ D AEPS N V+ NL RL S++ S+ Y + A
Sbjct: 575 WDEEKYGFFSTAASQPDILIRVKDAMDNAEPSVNGVASYNLFRLGSLLNDSE---YEKMA 631
Query: 589 EHSLAVFETRL 599
+A FE L
Sbjct: 632 RRIVACFEVEL 642
>gi|390559056|ref|ZP_10243426.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
gi|390174366|emb|CCF82718.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
Length = 685
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 255/691 (36%), Positives = 370/691 (53%), Gaps = 66/691 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFE+ +A ++N+ F++IKVDREERPD+D +YM VQ L G GGWP++
Sbjct: 48 SSCHWCHVMAHESFENPDIAAIMNENFINIKVDREERPDLDAIYMAAVQMLSGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD++P GTYFPPED+ PGF IL V DA+ +R+ + ++ ++L+
Sbjct: 108 VFLTPDMRPFYAGTYFPPEDRPPMPGFARILDLVADAYRDRREDIDETAEQISDELNHHF 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A+ S + + + R +L+ +D GGFG+ PKFP + ++ ML +
Sbjct: 168 QAAIESLAISPSILDDGAR----KLALQFDQSNGGFGNEPKFPPSMSLEFML---RTYVR 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG + +MV FTL MA+GGI+D +GGGFHRYSVD W VPHFEKMLYD LA
Sbjct: 221 TG----SKRALEMVTFTLDRMARGGIYDQIGGGFHRYSVDAIWLVPHFEKMLYDNALLAR 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y + T Y I Y+ R+M+ P G +SA+DADS EG +EG FY+
Sbjct: 277 IYTLGYQATGKDLYRRIAEQTFTYVLREMMSPEGGFYSAQDADS---EG----EEGKFYI 329
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +E E +LG A + K ++ + P GN F+GKN+L + A +
Sbjct: 330 WTPQEFETVLGRRDASIAKRYFGIMPDGN------------FEGKNILTAPREPERIAEQ 377
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+ LE+ + + E R KL+ RS R P DDKV+ +WN L++ SFA + +
Sbjct: 378 FGISLEELESTIAEIRGKLYQARSTRVWPGRDDKVLTAWNALMLRSFAEGATVF------ 431
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
R + +EVA A FIR +LY Q L ++ G +K G+L+DYA+LI
Sbjct: 432 ----------GRADLLEVAVRNARFIRDNLY--QDGHLLRTYTAGQAKLNGYLEDYAYLI 479
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
LL LYE W+ WA EL +T + F D E GG+F+T ++ R KE D A
Sbjct: 480 DALLSLYEATFNASWIAWAQELTDTMVKEFWDHENGGFFSTGTSHEELVARPKELFDSAT 539
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR---LKDMAMAVPLMCCAADM 615
PSGNSV+ L+RL+ ++ ++DY E +AV + K+ + A D
Sbjct: 540 PSGNSVAADVLLRLSHLLG--RNDY----RERGMAVLKKHGMLAKEYPHGTARLLLAYD- 592
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
++ S + + LVG S+ +++LA Y +K V P +E +
Sbjct: 593 FALSSPREIALVGDPSAEATQSLLAVVQQPYLPHKVVALRHPGRADEAAIIPLLEGRD-E 651
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISL 706
+ R K A VC+NF+C PVT+P L
Sbjct: 652 IER------KPAAYVCRNFTCERPVTEPAEL 676
>gi|321265830|ref|XP_003197631.1| DUF255 domain protein [Cryptococcus gattii WM276]
gi|317464111|gb|ADV25844.1| DUF255 domain protein, putative [Cryptococcus gattii WM276]
Length = 772
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 266/698 (38%), Positives = 382/698 (54%), Gaps = 41/698 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHV+ ESFEDE AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S
Sbjct: 90 SACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMS 149
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++P L+P GTYFP RP F +L+K+ + W++ R+ + G IE L +
Sbjct: 150 VFMTPKLEPFFAGTYFP------RPNFHQLLKKIHNVWEEDREKCEKMGKGVIEALKDMN 203
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA------PKFPR-PVEIQMMLY 192
+S L L + QLS D R+GGF +A PKFP + ++ +
Sbjct: 204 DTGRTSESLSQLLSTSPASKLFAQLSTMNDPRYGGFTNAGSSTRGPKFPSCSITLEPLAR 263
Query: 193 HSKKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ ++ E E ++M + L+ M GGI D VGGG RYSVDE+W VPHFEKML
Sbjct: 264 LASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKML 323
Query: 252 YDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
YDQ QL + LD L D Y + DIL Y RD+ P G +SAEDADSAE
Sbjct: 324 YDQTQLVSSCLDFARLYPADHPDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEY 383
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
+GA +K EGAFY+W E++++LG+ A LF + ++P GN D+ + D H E + KN+L
Sbjct: 384 KGA-KKSEGAFYIWKKSEIDEVLGDDAPLFNSFFGVEPDGNVDI--IHDSHGEMRDKNIL 440
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
+ A + G ++ +I+ + KL R +R RP LDDK++ +WNGL++++ +
Sbjct: 441 HQHKTYEEVALEFGKKEDEAKDIIVQACEKLRLKREERERPGLDDKILTAWNGLMLTALS 500
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
+AS +L + + P A +F++ H++D T L S+R G K
Sbjct: 501 KASTLLPPSYDISPQCLP-----------AALGIVNFVKSHMWDSSTRTLTRSYREG--K 547
Query: 487 AP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
P DDYAFLI GLL+LYE +++A ELQ QDELF D GGYF T+ EDP
Sbjct: 548 GPQAQTDDYAFLIQGLLNLYEATGDESHVLFAEELQKRQDELFWDDHDGGYF-TSAEDPH 606
Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
VL+R+K+ DGAEPS +VS NL R + +++ D Y AE + + A
Sbjct: 607 VLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLSSEFED-YEARAEATYLSMGPLIAQAPRA 665
Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
V L R+ V++VG + L AA +Y N+ +IHI P + +
Sbjct: 666 VGYAVSGLIDLEKGYRE-VIIVGSTKDDVVKKFLKAARETYFSNQVIIHIQPENLPK-GL 723
Query: 666 WEEHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTD 702
E++ A + +K +L VC+ +C P D
Sbjct: 724 AEKNEVVKALVNDIESGKEKGASLRVCEGGTCGLPAKD 761
>gi|134300686|ref|YP_001114182.1| hypothetical protein Dred_2853 [Desulfotomaculum reducens MI-1]
gi|134053386|gb|ABO51357.1| protein of unknown function DUF255 [Desulfotomaculum reducens MI-1]
Length = 690
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 258/716 (36%), Positives = 386/716 (53%), Gaps = 67/716 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFE E VAK+LN+ FVSIKVDREERPD
Sbjct: 33 GNEAFDMAKRVDKPIFLSIGYSTCHWCHVMERESFESEEVAKILNEHFVSIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM Q+L G GGWPL++ ++PD KP GTYFP + +YGRPG IL V W
Sbjct: 93 IDQIYMNVCQSLTGSGGWPLTIMMTPDQKPFFAGTYFPKQAQYGRPGITEILENVASLWK 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+R L + G ++L + + AS+ P +LP + L +++YD+ +GGFG+A
Sbjct: 153 NERQHLLEVG----DKLVSHMQSEASTA--PGQLPADILDKAYHIFAQNYDATYGGFGTA 206
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L + K+GEA + MV TL M +GGI+DH+G GF RYS
Sbjct: 207 PKFPTPHNLMFLLRYWH------KTGEA-KALSMVEETLDAMHRGGIYDHIGFGFSRYST 259
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D++W VPHFEKMLYD LA + + + +T + + + ++I Y+ RDM P G +SA
Sbjct: 260 DKKWLVPHFEKMLYDNALLALAFTETYQITGNPRFGRVAKEIFTYILRDMTSPEGGFYSA 319
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG EG FYVW +EV +LG+ L+ ++Y + TGN
Sbjct: 320 EDADS---EGV----EGKFYVWRPEEVISLLGQVDGELYCQYYDITSTGN---------- 362
Query: 358 NEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
F+G+++ LI D + L + L + L CR+ LF+ R+KR P+ DDK++
Sbjct: 363 --FEGESIPNLIG-QDPFKFSQDLEITLGDLVEGLEACRKTLFEERAKRIHPYKDDKILT 419
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
+WNGL+I++ AR +++ +S K Y+E A +A FI L R
Sbjct: 420 AWNGLMIAALARGAQVFQS----------------KRYLEAASNAMGFIFDRL-QRNDGR 462
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L +R + P +LDDYAF+I GLL+LY+ + L A+ L + +LF D + GG
Sbjct: 463 LLARYREYEAAYPAYLDDYAFVIWGLLELYQATFEPRHLQNAVYLTDDMIDLFYDDKQGG 522
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
++ + ++ R K+ +DGA PSGNSV+ +NL +LA + S+ Y + A L VF
Sbjct: 523 FYFYGKDSEQLISRPKDIYDGAIPSGNSVATVNLFKLARLTGNSR---YEELANQQLQVF 579
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
L A + P + +V+ G K + M+ ++ N +V+
Sbjct: 580 ADELARYPAGYSFFMMGAYLQQEPPME-IVIAGTKEDPSLQQMINTLRQNFLPNASVLV- 637
Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
D E + W S + ++ + K A VCQN +C P+T+P +L+ ++
Sbjct: 638 -RYDDEFANKW----SPLLPLLKDKTPVNGKAAAYVCQNLACQAPLTEPEALQKMI 688
>gi|374994065|ref|YP_004969564.1| thioredoxin domain-containing protein [Desulfosporosinus orientis
DSM 765]
gi|357212431|gb|AET67049.1| thioredoxin domain-containing protein [Desulfosporosinus orientis
DSM 765]
Length = 702
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 263/730 (36%), Positives = 386/730 (52%), Gaps = 84/730 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFEDE VA LLN WF+SIKVDREERPD
Sbjct: 33 GEEAFTLSKRENKPIFLSIGYSTCHWCHVMERESFEDEAVAALLNRWFISIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD +YM + QAL G GGWPL++ ++P+ KP GTYFP + +G G +L +V W
Sbjct: 93 VDHMYMAFCQALTGSGGWPLTIIMTPEKKPFFAGTYFPKTEHHGYHGLMELLEQVGTLWR 152
Query: 119 KKRDML----------AQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSY 168
+ L QSG ++ S + S +++ ++ + L +++
Sbjct: 153 TSENKLRESADQIVAAVQSGLALPKKASTPIDNSQNTSDSNKAWEKDVIDKAYAALEQNF 212
Query: 169 DSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
D R+GGFG APKFP P + +L ++ ++ S MV TL MA+GG++DH
Sbjct: 213 DPRYGGFGRAPKFPSPHTLTFLLRYA-------ENHPQSNALAMVRKTLNGMARGGMYDH 265
Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDM 288
+G GF RYS DE+W +PHFEKMLYD LA YL++F +T ++ + +DI Y+ RDM
Sbjct: 266 IGFGFARYSTDEKWLIPHFEKMLYDNALLALAYLESFQVTHSPEHAKVAQDIFTYVLRDM 325
Query: 289 IGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGN 347
P G +SAEDAD+ + +EG F+VWT +EVE +L E A + Y + GN
Sbjct: 326 TSPEGGFYSAEDADAED-------QEGKFHVWTPQEVEAVLDMETAQKYCSVYDISAKGN 378
Query: 348 CDLSRMSDPHNEFKGKNV--LIELN----DSSASASKLGMPLEKYLNILGECRRKLFDVR 401
F+GK++ L++ N D +S +++ + + L R+ LF R
Sbjct: 379 ------------FEGKSIPNLLQGNIHKLDQESSLAEVDV-----IKSLESARQALFSAR 421
Query: 402 SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAA 461
KR PH DDK++ SWNGL+I++ A+ +++L + K Y+E E AA
Sbjct: 422 EKRIHPHKDDKILTSWNGLMIAALAKGAQVLGN----------------KTYLEAGEKAA 465
Query: 462 SFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIEL 520
FI HL RL +R G S G+LDDY+F I GLL+LY F SG +L A+ L
Sbjct: 466 DFILTHL-RRVDGRLLARYREGDSAILGYLDDYSFFIWGLLELY-FASGKPLFLQTALLL 523
Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
Q QD LF D + GGYF T + +L R KE +DGA PSGNS++ +NL+R + GSK
Sbjct: 524 QEEQDRLFFDTQRGGYFLTGSDGEKLLFRPKESYDGAIPSGNSITTLNLLRFGQLT-GSK 582
Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 640
Y+++ AE L F T L+ A P+++ ++L G S + M
Sbjct: 583 --YWKEKAEQQLLDFRTVLEAHPSGYTAFLQALQFALHPTQE-LILAGSLDSEELSMMRN 639
Query: 641 AAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
+ + +V++ + + E + + E + ++D+ A +CQNF+C PV
Sbjct: 640 LFFSEFRPYASVLYQEGSLGELVPWIENY----------PLASDQTAAYLCQNFTCQQPV 689
Query: 701 TDPISLENLL 710
+ LL
Sbjct: 690 YEVDQFARLL 699
>gi|322420309|ref|YP_004199532.1| hypothetical protein GM18_2810 [Geobacter sp. M18]
gi|320126696|gb|ADW14256.1| protein of unknown function DUF255 [Geobacter sp. M18]
Length = 742
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 254/685 (37%), Positives = 368/685 (53%), Gaps = 54/685 (7%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+ LN F++IKVDREERPDVD VYMT V A+ GGWPL+V
Sbjct: 99 TCHWCHVMEEESFEDESVAEFLNGNFIAIKVDREERPDVDTVYMTAVHAMGLQGGWPLNV 158
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++PD KP GGTY PP D G GF T+LR++++++D D ++++G E + L+
Sbjct: 159 FVAPDRKPFYGGTYSPPNDYPGGLGFLTLLRRIRESFDSAPDRVSRAGVQLTEAVQTMLA 218
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + P A+RL ++ +D R GG APKFP + ++++L + + D
Sbjct: 219 PAQGEESWQEISPDPAVRLYQDR----FDDRNGGLVGAPKFPSSLPLRLLLRYFLRTGD- 273
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
MV TL+ MA GGI+D GGGFHRY+ D W VPHFEKMLYD L
Sbjct: 274 ------RRSLSMVELTLRSMAAGGIYDQAGGGFHRYATDTSWLVPHFEKMLYDNALLTVS 327
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL+ + T ++ + R+IL YL+RDM P G +SA DADS G ++EG F+ W
Sbjct: 328 YLEGYQATGAAEFAAVAREILRYLQRDMQAPAGGFYSATDADSLSPGG--HREEGVFFTW 385
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
T +E+ LG E L Y + GN F+G+++L + A L
Sbjct: 386 TPEELRGTLGPERGDLMAACYGVTQGGN------------FEGRSILHREKSIAELARAL 433
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ ++ L +CR L+ R+KRP P D+K++ SWNGL IS+FA IL
Sbjct: 434 KLSEQELELTLADCRELLYRARAKRPLPLRDEKILASWNGLAISAFASGGLIL------- 486
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ E ++VA AA F+ +++ RL+HSF+ G +K FLDDYAFLI+
Sbjct: 487 ---------NNAELVQVAVRAAGFMLQNMV--VNGRLRHSFQEGEAKGEAFLDDYAFLIA 535
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GL+DL+E WL A+EL E F DRE GG+F T ++ R K +DG P
Sbjct: 536 GLIDLFEASRDISWLERALELTAAVQEQFEDRESGGFFMTGPHHEELISREKPAYDGVIP 595
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSV ++NL+RL ++ ++ A ++LA F T+L + A+ M A + L
Sbjct: 596 SGNSVMIMNLLRLNTLTGATR---LLDQARNALAAFATQLANSPAALSEMLLAIEYLQQT 652
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
++ V++ E L + N+ ++ + + EE+ + + +
Sbjct: 653 PKEVVIVAPAGKPEAAEPFLEGLRRTLVPNRALVVV--CEGEEL----QRAARLIPLVEG 706
Query: 680 NFS-ADKVVALVCQNFSCSPPVTDP 703
+ D+ VA +C N SC PP +DP
Sbjct: 707 KTAEGDRAVAYLCANRSCRPPTSDP 731
>gi|269926785|ref|YP_003323408.1| hypothetical protein Tter_1680 [Thermobaculum terrenum ATCC
BAA-798]
gi|269790445|gb|ACZ42586.1| protein of unknown function DUF255 [Thermobaculum terrenum ATCC
BAA-798]
Length = 686
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 259/695 (37%), Positives = 377/695 (54%), Gaps = 62/695 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFE+ +AK++ND FV+IKVDREERPD+D +YM VQA+ G GWPL+
Sbjct: 48 SSCHWCHVMAHESFENPEIAKIMNDNFVNIKVDREERPDIDAIYMEAVQAMTGQAGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD KP GGTYFPPED+ G PGFK +L + + + +R + QS + +QL +
Sbjct: 108 VFLTPDGKPFFGGTYFPPEDRVGMPGFKRLLLWLSEVYHTRRQEIEQSASQIAQQLLQIS 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A S+ + E+ ++A + L S+D ++GGFG+APKFP+P+ ++ +L
Sbjct: 168 RAELKSHDISLEILESA----CQSLKSSFDHQYGGFGTAPKFPQPMTVEYLL-------Q 216
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ + E MV TL M+ GGIHDH+GGGFHRYSVD W +PHFEKMLYDQ +A
Sbjct: 217 SFIRAQQKEYLDMVTLTLVRMSLGGIHDHLGGGFHRYSVDRTWLIPHFEKMLYDQALIAR 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL A+ +T + +Y + L Y+ +DM G +SA+DADS EG +EG +Y+
Sbjct: 277 AYLHAWQVTHNSWYLKVVNRTLQYVLKDMTSSQGGFYSAQDADS---EG----EEGKYYL 329
Query: 320 WTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ E++ +L E + L EHY + +GN F+GKN+L A
Sbjct: 330 WSLDEIKRVLNEREVELVCEHYGVTASGN------------FEGKNILHIAKSIEDLARD 377
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
M L + I+ E KL R +R P D KV+ SWN L+ ++ A EA
Sbjct: 378 HNMDLSEVEKIIDEASMKLLHYRDQRTPPAKDTKVVTSWNALMSTTLA--------EAGF 429
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
AM N EY+ ++ A F+ +L + L H++ + K PGFL+DYA L
Sbjct: 430 AMNN--------PEYIAASQRNAQFLLDNLVVDGL--LHHTYSDSKPKVPGFLEDYAALS 479
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+ L+ LYE S KWL A + F E G + +T+ + + L+ + +D A
Sbjct: 480 NSLITLYEITSDGKWLESARRFVQDMIDSFWKEEIGTFSDTSIKHSDIFLQPRNLYDNAT 539
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNS++ + L+RLA I + D YR+ A + + A M C A+ L
Sbjct: 540 PSGNSLACMALLRLAVIF--DRQD-YREIASRVVRGLALVMSKHPTAFGHMLCVANTLLS 596
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
PS + +V++G K SV+ E +L +Y NK +I + TEE E S+ +
Sbjct: 597 PSVE-IVILGDKHSVNTEALLEVIRQTYIPNKILI----STTEE----EASRSDLPLLQG 647
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISL-ENLLLE 712
+K A VC+N++CS PV +P L E L L+
Sbjct: 648 RTLRNNKPTAFVCRNYACSMPVNEPDELREQLTLQ 682
>gi|254442730|ref|ZP_05056206.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
gi|198257038|gb|EDY81346.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
Length = 727
Score = 427 bits (1098), Expect = e-116, Method: Compositional matrix adjust.
Identities = 259/708 (36%), Positives = 371/708 (52%), Gaps = 72/708 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESF DE +A LN+ +V IK+DREERPD+D VYMT+VQ L G GGWPL+
Sbjct: 71 STCHWCHVMNRESFSDEEIAAYLNEHYVCIKIDREERPDIDNVYMTFVQNLTGNGGWPLN 130
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGR-PGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSE 137
V+LSPD KP GGTYFPP D R GF +++++ D W +LA+S + ++ L++
Sbjct: 131 VWLSPDKKPFFGGTYFPPRDDPSRGRGFLPLIQEINDFWIQDPTGVLARSQSI-VDTLNQ 189
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMM 190
+ + ++N +NA L E+LS+S +D + GFG+ KFP P + ++
Sbjct: 190 HSAQTLAANS------ENAASL--ERLSESITAFLFIFDEQNKGFGNDQKFPSPNTLSLL 241
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
L + E + S +++ L TL M GGI DH+GGGFHRY+VD W +PHFEKM
Sbjct: 242 LRAAATPE--LHQEDRSLAKRLALETLDAMLAGGIRDHLGGGFHRYTVDAGWQLPHFEKM 299
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYDQ +A+ +DA+ LT + Y + LDY+ RD+ G ++SAEDA+S + + +
Sbjct: 300 LYDQALIASALVDAYQLTGEARYRQAATETLDYVLRDLRHENGGLYSAEDAESLDPDKSF 359
Query: 311 RKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
K+EGA+Y WT+ + E + E H+ L+P GN P F G N L
Sbjct: 360 AKREGAYYTWTTADFERLFPHEEKRAGLAAHFSLRPAGNAPYGNF--PREIFAGYNTLRI 417
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
D+ +L L L RS R RPHLDDK+I SWNGL IS+ ARA
Sbjct: 418 NPDAKIDPDQLAADLA-----------TLRQDRSTRARPHLDDKIITSWNGLAISALARA 466
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
+ +R +Y A+ AA+F+ +LY ++ +L +R S
Sbjct: 467 GLVF----------------NRPDYTNAAQQAANFLLENLYQPESQQLLRLYRQDASPVA 510
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
F +DYA+LI+GLLDLYE + +WL A ELQ Q++ F D E GGYF D V
Sbjct: 511 AFAEDYAYLIAGLLDLYEADADHRWLQKAHELQLAQNQRFADTENGGYFLFEASDDIVFN 570
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R K+ D A PS NSVS NL RLA + ++Q A ++ F +L +P
Sbjct: 571 RTKQAADTAIPSPNSVSAKNLARLAQFFDDAS---FQQQASQTINAFAPQLDSSGTTLPT 627
Query: 609 MCCAADMLSVPSRK-HVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD-----TEE 662
+ A +L V + +V+ G + + ML + ++T+++ D AD +
Sbjct: 628 LREA--ILFVGKKPLQIVIAGDPQTASAQAMLHEVNQRLLPSRTLLYADQADGQAYLGQH 685
Query: 663 MDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++F + S N K VC+NF C P DP +L L
Sbjct: 686 LEFIQTAKSYNG----------KATVFVCENFVCQMPTEDPQTLAKQL 723
>gi|402218687|gb|EJT98763.1| hypothetical protein DACRYDRAFT_110659 [Dacryopinax sp. DJM-731
SS1]
Length = 705
Score = 427 bits (1098), Expect = e-116, Method: Compositional matrix adjust.
Identities = 254/617 (41%), Positives = 349/617 (56%), Gaps = 59/617 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TC WCHVME ESFE+E VAK++ND V++KVDRE PDVD+VYM YV A+ G GGWP+S
Sbjct: 46 STCRWCHVMERESFENEEVAKMMNDVCVNVKVDREVLPDVDRVYMNYVTAISGRGGWPMS 105
Query: 80 VFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
V+++PD K P GGTYFPP+ + IL +VKD W +RD L G + L E
Sbjct: 106 VWITPDTKIPFFGGTYFPPQ------AMEQILTQVKDKWKNERDKLVPKGNSLSDILQEP 159
Query: 139 LSASASSNKLPDELPQNALRLCAEQ----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
S ++ + L Q L L ++ L + YD GGFG APKFP + +
Sbjct: 160 ASPTSPA------LSQLGLPLLRDRGLAMLGQMYDRTHGGFGGAPKFPTQSRFSFLHLVA 213
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
ED+ + G+KM FTL+ MA GGIHD +G GFHRYSVD WH+PHFE MLYD
Sbjct: 214 YLAEDSN-----NLGRKMSAFTLKKMAMGGIHDQIGLGFHRYSVDAAWHIPHFEIMLYDN 268
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP---GGEIFSAEDADSAETEGATR 311
QLA YL + LT D +Y + +L YL R ++ G SAEDA+S E EG T
Sbjct: 269 AQLAYHYLTYYVLTGDEYYRTVANGVLAYLDRVLLKKTDHGIAYMSAEDAESYEEEGDTI 328
Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
KKEGAFYVWT ++ LGE F +H+ +K GN L DPH E +GKNVL+E
Sbjct: 329 KKEGAFYVWTRAQITAALGEKDGDAFCDHFGVKEEGNVGLEH--DPHKELQGKNVLMEQR 386
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
+ +A+ LG+ E+ I+ R L + R KRP+PHLDDK+I SWNGL++ + A+A+
Sbjct: 387 SAEETATALGISTEEMEGIINRGREVLREERDKRPKPHLDDKIIASWNGLMLKTLAQAAL 446
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
L S G + +++ A F++ + + +L +R + G
Sbjct: 447 RLPS------------GPEPEKFYNQGIEVARFVQNQMIKD--GKLLRCYR---TNVQGV 489
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE-DPSVLLR 549
+DYA +I+GLL LY+ L A+ELQ+ QDELF D + GYF + + D S ++R
Sbjct: 490 CEDYASVINGLLALYQVKLEPWLLRIAVELQDKQDELFWDEKAWGYFASAEDSDASKIMR 549
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASI-------------VAGSKSDYYRQNAEHSLAVFE 596
+K+DHDG EPS NS+S+ NLV L SI ++ S+++ Y+ A+ + F
Sbjct: 550 LKDDHDGPEPSANSLSLHNLVTLDSICHATDPFALGIPNMSESRAERYQMYAQKMVTFFT 609
Query: 597 TRLKDMAMAVPLMCCAA 613
RL ++P M AA
Sbjct: 610 PRLLTQPASMPEMVSAA 626
>gi|453087339|gb|EMF15380.1| hypothetical protein SEPMUDRAFT_147282 [Mycosphaerella populorum
SO2202]
Length = 800
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 250/597 (41%), Positives = 334/597 (55%), Gaps = 32/597 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF+D +A+LLN+ F+ +K+DREERPD+D+ YM ++QA GGGGWPL+
Sbjct: 121 SACHWCHVMAHESFDDPRIAQLLNENFIPVKIDREERPDIDRQYMDFLQATNGGGGWPLN 180
Query: 80 VFLSPD-LKPLMGGTYFPPEDK--YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
VF++P L+P+ GGTY+P ++ R GF+ I+ KV AW ++ QS QL
Sbjct: 181 VFVTPGGLEPIFGGTYWPKRERAQQARTGFEDIILKVSTAWREQEQRCRQSAKDITRQLR 240
Query: 137 EALSASA----SSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
E + N+ D EL + L + YD + GGFG APKFP PV I+ +
Sbjct: 241 EFAQEGSIGGKDVNRTDDDAELELDLLDDAFQHYKMRYDDKHGGFGGAPKFPTPVHIRPL 300
Query: 191 L----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
L Y + E G+ E E + M L TL+ MAKGGI D +G GF RYSV W +PH
Sbjct: 301 LRVASYPATVREIVGEE-ECIEARSMALMTLEKMAKGGIKDQIGHGFARYSVTRDWSLPH 359
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 305
FEKMLYD QL VYLDA+ LTK + I +DI YL M G I SAEDADS
Sbjct: 360 FEKMLYDNAQLLAVYLDAYLLTKSPLFLEIVKDIATYLTSAPMQSELGGIHSAEDADSFP 419
Query: 306 TEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKN 364
T K+EGA+YVWT +E E +L E + Y+ +K GN D R D E +N
Sbjct: 420 TINDKHKREGAYYVWTLEEFEQVLSEEEVKVCAKYWNVKAEGNVD--RRHDAQGELIKQN 477
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVIS 423
L +++ A +L M + + R+ L R + RP P LDDK++ SWNGL I
Sbjct: 478 TLCVSRETAELAEELNMAEDDVKRAIDSGRQALLAYREANRPSPSLDDKIVTSWNGLAIG 537
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
S ARA L+ + P GS Y+ A AA I+ HL+D + L+ +R G
Sbjct: 538 SLARAGAALREVS-------PEAGSS---YVSAARKAALCIQNHLFDAMSGTLRRVYREG 587
Query: 484 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 543
P + GF DDYAF ISGLLDLYE + +L A LQ TQ++LF D E G+F+T
Sbjct: 588 PGETQGFADDYAFFISGLLDLYEATFDSDFLQLADTLQETQNKLFWDPEKYGFFSTPAHQ 647
Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
P +L+R K+ D AEPS N VS NL RL S++ + Y + A ++A FE ++
Sbjct: 648 PDILIRTKDAMDNAEPSVNGVSASNLFRLGSLL---NDEEYSKMARRTVACFEVEIE 701
>gi|410661555|ref|YP_006913926.1| Thymidylate kinase [Dehalobacter sp. CF]
gi|409023911|gb|AFV05941.1| Thymidylate kinase [Dehalobacter sp. CF]
Length = 741
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 271/751 (36%), Positives = 395/751 (52%), Gaps = 82/751 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFED+ VA +LN ++ +KVDREERPD
Sbjct: 33 GEEAFQKAKEENKPVFLSIGYSTCHWCHVMERESFEDKEVAAILNRSYIPVKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YMTY Q + G GGWPL+V ++PD +P GTYFP YGRPG IL +V + W
Sbjct: 93 IDQLYMTYCQVMTGAGGWPLTVLMTPDKQPFFAGTYFPKHSHYGRPGLMDILSQVGELWQ 152
Query: 119 KKRDMLAQSGAFAIEQLSEAL----SASASSNKLPDELP---------------QNALRL 159
++D + Q+ A E ++ +A+++ K LP + L
Sbjct: 153 TEKDKVIQTAAELYETVTRHYRGDKNATSAVPKNKQTLPFTEKEKDSGDIAIWGKTLLGK 212
Query: 160 CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQC 219
E L +DS++GGFGSAPKFP P + +L +S +E+ S+ MV TL
Sbjct: 213 GYELLENKFDSKYGGFGSAPKFPAPHNLGFLLRYS--MEEP-----QSKALAMVEKTLDS 265
Query: 220 MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRD 279
MA GGI DH+G GF RYS D W VPHFEKMLYD LA VYL+A+ TK+ Y + ++
Sbjct: 266 MADGGIFDHIGFGFARYSTDHYWLVPHFEKMLYDNAGLALVYLEAYQRTKNQKYRRVAQN 325
Query: 280 ILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEH 339
I Y+ RDM G +SAEDADS EG +EG +Y+W+ E+ L + ++
Sbjct: 326 IFGYVLRDMTSAEGGFYSAEDADS---EG----EEGKYYLWSKDEIRKTLQDGIESLQKE 378
Query: 340 YYL----KPTGN---------CDLSRMSDPHNEFKGKNVL-----IELNDSSASASKLGM 381
L KP CD ++D N ++GKN+ + + D ++ S G
Sbjct: 379 RELKNGFKPLSKQKEEVADIYCDAYGITDEGN-YEGKNIPSRIFHVGVGDLTSRYSLTGD 437
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
L + L+I C LF R KR RP DDK++VSWNGL+I + A+ ++L +
Sbjct: 438 ELGEMLDI---CNTILFSAREKRVRPAKDDKILVSWNGLMIGALAKGVQVLSGDLSWE-- 492
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
+D+K + AE+AA FIR ++D + RL +R G + PG+LDDYAFL+ GL
Sbjct: 493 ------NDKKSLLLTAENAAGFIRDKMFDSRG-RLLARYREGEAGIPGYLDDYAFLVHGL 545
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
L+LY T++L AI LQ Q++LF D GGY+ T + +LLR KE +DGA PSG
Sbjct: 546 LELYTACGKTEYLEQAIFLQEEQEKLFRDETNGGYYFTGCDAEELLLRPKEIYDGAMPSG 605
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
NS+S NL RL + SK +++ AE + F T ++D A ++
Sbjct: 606 NSMSACNLGRLWRLTGLSK---WQERAEKQINSFRTTVEDYPPGYTAFLQAI-QYTLNQG 661
Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNF 681
+ +VL G ++ E M A + V + D + + + +++ + R+
Sbjct: 662 EELVLSGSSANQTLEKMQTAIFKDFHPYAAVAYNDGSLGQLIPRMDDY-----PVGRD-- 714
Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ VC++F+C PV P L +L E
Sbjct: 715 ----LSVYVCRDFACREPVNTPEELAKILSE 741
>gi|410658568|ref|YP_006910939.1| Thymidylate kinase [Dehalobacter sp. DCA]
gi|409020923|gb|AFV02954.1| Thymidylate kinase [Dehalobacter sp. DCA]
Length = 741
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 271/751 (36%), Positives = 395/751 (52%), Gaps = 82/751 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFED+ VA +LN ++ +KVDREERPD
Sbjct: 33 GEEAFQKAKEENKPVFLSIGYSTCHWCHVMERESFEDKEVAAILNRSYIPVKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YMTY Q + G GGWPL+V ++PD +P GTYFP YGRPG IL +V + W
Sbjct: 93 IDQLYMTYCQVMTGAGGWPLTVLMTPDKQPFFAGTYFPKHSHYGRPGLMDILSQVGELWQ 152
Query: 119 KKRDMLAQSGAFAIEQLSEAL----SASASSNKLPDELP---------------QNALRL 159
++D + Q+ A E ++ +A+++ K LP + L
Sbjct: 153 TEKDKVIQTAAELYETVTRHYRGDKNATSAVPKNKQTLPFTEKEKDSGDIAIWGKTLLGK 212
Query: 160 CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQC 219
E L +DS++GGFGSAPKFP P + +L +S +E+ S+ MV TL
Sbjct: 213 GYELLENKFDSKYGGFGSAPKFPAPHNLGFLLRYS--MEEP-----QSKALAMVEKTLDS 265
Query: 220 MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRD 279
MA GGI DH+G GF RYS D W VPHFEKMLYD LA VYL+A+ TK+ Y + ++
Sbjct: 266 MADGGIFDHIGFGFARYSTDHYWLVPHFEKMLYDNAGLALVYLEAYQRTKNQKYRRVAQN 325
Query: 280 ILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEH 339
I Y+ RDM G +SAEDADS EG +EG +Y+W+ E+ L + ++
Sbjct: 326 IFGYVLRDMTSAEGGFYSAEDADS---EG----EEGKYYLWSKDEIRKTLQDGIESLQKE 378
Query: 340 YYL----KPTGN---------CDLSRMSDPHNEFKGKNVL-----IELNDSSASASKLGM 381
L KP CD ++D N ++GKN+ + + D ++ S G
Sbjct: 379 RELKNGFKPLSKQKEEVADIYCDAYGITDEGN-YEGKNIPSRIFHVGVGDLTSRYSLTGD 437
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
L + L+I C LF R KR RP DDK++VSWNGL+I + A+ ++L +
Sbjct: 438 ELGEMLDI---CNTILFSAREKRVRPAKDDKILVSWNGLMIGALAKGVQVLSGDLSWE-- 492
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
+D+K + AE+AA FIR ++D + RL +R G + PG+LDDYAFL+ GL
Sbjct: 493 ------NDKKSLLLTAENAAGFIRDKMFDSRG-RLLARYREGEAGIPGYLDDYAFLVHGL 545
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
L+LY T++L AI LQ Q++LF D GGY+ T + +LLR KE +DGA PSG
Sbjct: 546 LELYTACGKTEYLEQAIFLQEEQEKLFRDETNGGYYFTGCDAEELLLRPKEIYDGAMPSG 605
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
NS+S NL RL + SK +++ AE + F T ++D A ++
Sbjct: 606 NSMSACNLGRLWRLTGLSK---WQERAEKQINSFRTTVEDYPPGYTAFLQAI-QYALNQG 661
Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNF 681
+ +VL G ++ E M A + V + D + + + +++ + R+
Sbjct: 662 EELVLSGSSANQTLEKMQTAIFKDFHPYAAVAYNDGSLGQLIPRMDDY-----PVGRD-- 714
Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ VC++F+C PV P L +L E
Sbjct: 715 ----LSVYVCRDFACREPVNTPEELAKILSE 741
>gi|322794007|gb|EFZ17245.1| hypothetical protein SINV_09516 [Solenopsis invicta]
Length = 891
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 279/784 (35%), Positives = 387/784 (49%), Gaps = 131/784 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA---------- 69
+TCHWCHVME ESF++E VAK++N+ +V+IKVDREERPD+D + M ++QA
Sbjct: 144 STCHWCHVMEKESFKNEEVAKIMNEHYVNIKVDREERPDIDMMCMMFIQASLYLVSGTTR 203
Query: 70 LYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA 129
L G GGWPLSVFL+PDL P+ GGTYF F L ++ W RD + +S
Sbjct: 204 LRGHGGWPLSVFLTPDLMPITGGTYF------SSSMFTLYLTRIMKEWTDGRDKMIKSAT 257
Query: 130 FAIEQLSEALSASASSNKLP-----------------------DELPQ-NALRLCAEQLS 165
E+L E L+ S K+ D +P ++ LCA L
Sbjct: 258 TIAERLKE-LATSREDIKVSECYLKFLNYFNNVFYLLIFAIQDDGVPAIDSAFLCAHVLM 316
Query: 166 KSYDSRFGGFGSA-------PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQ 218
YDS +GGFGS+ PKFP P + +L T S+ L TL+
Sbjct: 317 NIYDSEYGGFGSSSAINPNSPKFPEPSNLNFLLSMHVLTTSTMLVEMTSDA---CLNTLK 373
Query: 219 CMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICR 278
M+ GGIHDH+G GFHRY+VD RW VPHFEKMLYDQ QL Y DA+ +TKD FYS I
Sbjct: 374 KMSYGGIHDHIGKGFHRYTVDARWKVPHFEKMLYDQAQLIQCYADAYLITKDSFYSDIVD 433
Query: 279 DILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI---- 334
DI Y+ R + G FSAEDADS T A+ K+EGAFYVWT ++ +L + +
Sbjct: 434 DIATYVLRILQHMEGGFFSAEDADSLPTSDASAKREGAFYVWTYDRLKTLLKKEKVPGKD 493
Query: 335 ------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 388
L H+ ++ GN + + DPH E GKNV +AS + +E+
Sbjct: 494 NVTYFDLICRHFSVRKEGNVESPQ--DPHGELTGKNVFSMQAGIEDTASHFKLSVEETQK 551
Query: 389 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 448
L E LF+ R+ RP P LDDK++ +WNGL+IS ARA +K+
Sbjct: 552 HLKEACTILFEDRTHRPWPQLDDKMVTAWNGLMISGLARAGIAVKN-------------- 597
Query: 449 DRKEYMEVAESAASFIRRHLYDEQTHRLQHS----------------------------- 479
K Y+E A AA+F+ ++L+D++ L S
Sbjct: 598 --KTYVEAATEAATFVEKYLFDKKKRILLRSCYRRRDDKIVQRQVLSLHQSVSRCEIYDA 655
Query: 480 -FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
+R+ P PGF +DYAF + GLLDLYE W+ +A ELQ+ QD LF D + GGYF
Sbjct: 656 IYRSTP--IPGFHEDYAFYVKGLLDLYEATFNPHWVEFAEELQDIQDRLFWDLQDGGYFA 713
Query: 539 TTGEDPSVLLRVKE---------DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
E P +L R K+ DGA PS NS++ NL+RLA + D R AE
Sbjct: 714 MAEESP-ILTRTKDFKIPMSFVVADDGALPSSNSIACSNLLRLAIYL---DRDDLRNKAE 769
Query: 590 HSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN 649
L F +L A P M A P++ +V G + + ML +
Sbjct: 770 KLLCAFGNKLVSCPAACPQMMLALIEYHHPTQIYV--TGKTDAKETNEMLEIIRSRLIPG 827
Query: 650 KTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
+ +I D + + F + N + R D+ + +C++++CS P++ P +L +
Sbjct: 828 RVLILADAEQQDNVLF-----NRNMIVKRMKPQKDRAMVFICRDYTCSLPISSPSALISE 882
Query: 710 LLEK 713
L +K
Sbjct: 883 LNKK 886
>gi|83816674|ref|YP_445669.1| hypothetical protein SRU_1548 [Salinibacter ruber DSM 13855]
gi|83758068|gb|ABC46181.1| Protein of unknown function, DUF255 family [Salinibacter ruber DSM
13855]
Length = 701
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 259/695 (37%), Positives = 358/695 (51%), Gaps = 51/695 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED+ VA LLND FV IKVDREERPDVD +YM Q + G GGWPL+
Sbjct: 48 STCHWCHVMERESFEDDDVAALLNDGFVPIKVDREERPDVDSIYMDVCQMMRGQGGWPLT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSE 137
V L+PD KP TY P E ++ + G +L +VK W D + +L + EQ+++
Sbjct: 108 VLLTPDRKPFFAATYLPKEGRFQQTGLMDLLPRVKQLWNSDDRAKLLDDA-----EQVTD 162
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
L D L A QL++ +D GGFGSAPKFP P + +L H +
Sbjct: 163 RLQRIGDDQTDGDAPGPTLLDDAARQLAQQFDRTHGGFGSAPKFPAPHNLLFLLRHWHR- 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG+ ++ V TL M GG+ D VG GFHRYS D++W +PHFEKMLYDQ
Sbjct: 222 --TGEQAALNQ----VTTTLDRMRWGGLFDQVGYGFHRYSTDQQWKLPHFEKMLYDQAMH 275
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ T Y R++L Y+RRD+ P G FSAEDADS EG +EGAF
Sbjct: 276 VLAYTEAYQATGTDRYERTAREVLTYVRRDLQAPDGGFFSAEDADSLNAEGDM--EEGAF 333
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW+ +++ + L A L + Y + P GN R E GKNVL +A+A
Sbjct: 334 YVWSIEDIREHLEPALADLVIDVYNMSPAGNYQEERT----GERTGKNVLHRDQSLAAAA 389
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ GM ++ + L RR L D RS+RPRP LDDKV+ WNGL+ ++ A+A+++
Sbjct: 390 EQRGMEVDVLRDHLETARRVLLDARSERPRPGLDDKVLTDWNGLMTAALAKAARVF---- 445
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
D ++ E A F+ ++D RL H +R G + LDDYAF
Sbjct: 446 ------------DDAQFEEAAVQTGRFVLDTMHDADG-RLLHRYREGEAGIQATLDDYAF 492
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI GLL+LYE WL A+E + F D EGGG++ T + ++++R KE +DG
Sbjct: 493 LIWGLLELYETTFDADWLRAAVEHMEAALDRFWDAEGGGFYMTPEDGEALIVRPKEANDG 552
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV ++NL+RLA ++++ + A S T + ++ L
Sbjct: 553 ALPSGNSVQLMNLLRLARFTG--RTEFEERAAALSRWAGATARRRPTGFTAMLSGLHWAL 610
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
P + VV+ G S D ++ Y + P D + + A
Sbjct: 611 GTP--REVVVAGEPDSDDTNALIDVLRDDYTPTTVTLQRPPGDAD--------ITALAPF 660
Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
+ D + A VC+ F C PVTDP +L L
Sbjct: 661 TESQTPVDGRAAAYVCEAFRCEAPVTDPAALREQL 695
>gi|331269923|ref|YP_004396415.1| thymidylate kinase [Clostridium botulinum BKT015925]
gi|329126473|gb|AEB76418.1| thymidylate kinase [Clostridium botulinum BKT015925]
Length = 671
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 250/711 (35%), Positives = 379/711 (53%), Gaps = 78/711 (10%)
Query: 5 SFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDK 61
+F K + FL ++CHWCHVME ESFEDE VAK+LND ++SIKVDREERPDVD
Sbjct: 35 AFLKAKKEDKPIFLSIGYSSCHWCHVMEKESFEDEEVAKILNDKYISIKVDREERPDVDN 94
Query: 62 VYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR 121
YMT+ Q++ G GGWPL++ ++P+ KP GTYFP + YGRPGF IL+++ D W +
Sbjct: 95 TYMTFCQSVTGSGGWPLTIIMTPEQKPFFAGTYFPKKSMYGRPGFIQILKQISDEWKSNK 154
Query: 122 DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKF 181
+ + + + + E +S S E+ + L+ +++ YD+++GGFG++PKF
Sbjct: 155 NNIINTSNELLNTMEEHISQDKSG-----EINETILQDAVIEMNYYYDNKYGGFGASPKF 209
Query: 182 PRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
P P ++ ++L + K + G MV TL+CM KGGI DH+G GF RYS DE+
Sbjct: 210 PTPHKLMLLLINYKVYNNKNALG-------MVENTLKCMYKGGIFDHIGFGFSRYSTDEK 262
Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
W VPHFEKMLYD LA VY A+ +T FY + I Y+ RDM P G +SAEDA
Sbjct: 263 WLVPHFEKMLYDNALLAYVYTQAYQVTGKSFYKEVAEKIFKYILRDMTSPEGGFYSAEDA 322
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
DS EG EG FYVWT E+E ILGE A F Y + GN F+
Sbjct: 323 DS---EGV----EGKFYVWTLHEIESILGEDAKEFCNIYNITKNGN------------FE 363
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
G N+ + +G L+ ++ L R+KLF+VR KR P DDK++ +WN L+
Sbjct: 364 GSNI----------PNLIGKDLDD-IDKLESLRKKLFEVREKRIHPFKDDKILTAWNALM 412
Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
I + A A ++ ++E +Y+ A+ A +FI +L + RL FR
Sbjct: 413 IVALAYAGRVFENE----------------KYINRAKKAYNFIENNLI-RKDGRLLARFR 455
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
+G + +L+DY+FL+ L++LYE +K+L A+ + +LF D E G+F++
Sbjct: 456 HGEAAYIAYLEDYSFLVWALMELYEATFDSKYLKQALHFTDEMIKLFWDEESYGFFHSGK 515
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
+ ++L +K+ +D A PSGNS++ +NL++L+ I + + A + F + +
Sbjct: 516 DGEKLILNLKDSYDMAIPSGNSIAAMNLIKLSKITGDNT---LAEKAYKMIEGFGGNIIE 572
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
+ + A PS + +V+ K F++M+ + + + T ++ D E
Sbjct: 573 SIQSHSIFLMAYMNYIRPSTQ-IVIASEKQDELFKDMIREVNKRF-MPFTTTLLNDGDLE 630
Query: 662 EMDFWEEHNSNNASMARNNFSA-DKVVALVCQNFSCSPPVTDPISLENLLL 711
N +N +K A VC+NFSC+ PV + LL+
Sbjct: 631 ----------NVIPFIKNEKKIYNKTTAYVCENFSCNRPVDNVEDFIKLLI 671
>gi|365158244|ref|ZP_09354475.1| hypothetical protein HMPREF1015_02341 [Bacillus smithii 7_3_47FAA]
gi|363621167|gb|EHL72387.1| hypothetical protein HMPREF1015_02341 [Bacillus smithii 7_3_47FAA]
Length = 678
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 264/687 (38%), Positives = 371/687 (54%), Gaps = 76/687 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED VA+LLN +FV+IKVDREERPD+D VYMT Q + G GGWPL+
Sbjct: 53 STCHWCHVMERESFEDPEVAELLNQYFVAIKVDREERPDIDSVYMTVCQMMTGQGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD KP GTYFP +YGRPG IL ++ A+ + D +A G+ +E L E
Sbjct: 113 VFLTPDKKPFYAGTYFPKNSQYGRPGMMDILPQLHRAYHQDPDRIADIGSRLVEALKE-- 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
+ K ++ + A+ EQL+ +DS +GGFG APKFP P ++ + YH
Sbjct: 171 ---EAGRKSEGDVTEEAVHKGFEQLAGKFDSLYGGFGEAPKFPSPHQLLFLFRYYHM--- 224
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+GE S KM TL MA GGI+DH+GGGF RYS D W VPHFEKMLYD L
Sbjct: 225 -----TGEES-ALKMAEKTLDSMAAGGIYDHIGGGFSRYSTDGMWLVPHFEKMLYDNALL 278
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ +TK+ Y I +I D++ R+M P G +SA DADS EG +EG F
Sbjct: 279 MYAYTEAYQITKNERYRRIVLEIADFVAREMTHPEGGFYSAIDADS---EG----EEGKF 331
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSAS 375
YVW+ +E+ D+LGE +F E Y++ GN F+GKN+L L D
Sbjct: 332 YVWSKEEIMDVLGEETGTIFSELYHVTDQGN------------FEGKNILHLLQTDLETI 379
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ + +E+ N++ + ++ LF R KR +PH+DDKV+ SWNGL+I++ A+A +
Sbjct: 380 AANHELSIEELENLMSKAKQFLFQAREKRVKPHVDDKVLTSWNGLMIAALAKAGSV---- 435
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
F+ P + S A A +F+ ++++ E+ RL FR G +K G+LDDYA
Sbjct: 436 -----FDDPGLLSQ-------ARKAMAFLEKYVWKEK--RLMARFREGEAKYRGYLDDYA 481
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FL+ G L+L+ L +AIEL+N E F D E GG+F T + +L+R K +D
Sbjct: 482 FLLWGTLELFLAEDDLHMLSFAIELKNALFERFWD-ENGGFFFTDRDGEELLVREKPGYD 540
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV+ L RLA + + + E + F L +++ M AA
Sbjct: 541 GAYPSGNSVAAYQLWRLAKLTGDIE---LMKRVEMCVRSFSKELNAFPVSMLYMLEAAMA 597
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L R+ V+++G S + V+ + D W H
Sbjct: 598 LFAQGRE-VIVIGSNGSE---------------KRAVLWRCREEFLPFDVWSGHRPEWLE 641
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
A D +V +C+N +C P+ D
Sbjct: 642 GAAKQKETDLLV-FICENQACKMPMED 667
>gi|296132106|ref|YP_003639353.1| hypothetical protein TherJR_0579 [Thermincola potens JR]
gi|296030684|gb|ADG81452.1| protein of unknown function DUF255 [Thermincola potens JR]
Length = 673
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 265/695 (38%), Positives = 376/695 (54%), Gaps = 77/695 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA +LN+ +VSIKVDREERPD+D +YM+ QA+ G GGWPL+
Sbjct: 52 STCHWCHVMERESFEDEEVAAILNEHYVSIKVDREERPDIDTIYMSVCQAMTGHGGWPLT 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V ++PD KP GTYFP + G PG IL ++ D W +++ L +SG E+++EA+
Sbjct: 112 VIMTPDKKPFFAGTYFPKKSSRGMPGLTDILIQIADLWRERKKELTESG----EKITEAV 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
++ S+ D + + L +++D +GGFG+APKFP P + +L + K
Sbjct: 168 NSHLFSHTGGD-VSKEMLDKAFAYFEENFDRLYGGFGAAPKFPTPHNLTFLLRYWK---- 222
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+G A E MV TL M +GGI+DH+G GF RYS D +W VPHFEKMLYD LA
Sbjct: 223 MSGNGAALE---MVEKTLDAMYRGGIYDHIGFGFARYSTDRKWLVPHFEKMLYDNALLAI 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+A+ T + Y+ +I Y++RDMI P G +SAEDADS EG +EG FYV
Sbjct: 280 AYLEAYQATGNRKYAKTAEEIFTYVQRDMISPEGGFYSAEDADS---EG----EEGKFYV 332
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
WT +EV+++LG+ F Y + GN F+ K++ LIE
Sbjct: 333 WTPEEVKEVLGDTLGRYFCRDYDITAQGN------------FESKSIPNLIETG------ 374
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
Y+ E R+KLF R +R P DDK++ +WNGL+I++ A ++ L
Sbjct: 375 ---------YVEGYEEARKKLFARREQRVHPFKDDKILTAWNGLMIAAMAYGARAL---- 421
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
K+Y EVA A +FI ++L E RL FR+G + G+LDDYA
Sbjct: 422 ------------GEKKYAEVAAKAVNFINKNLRREDG-RLSARFRDGEAAFLGYLDDYAC 468
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ GL++LYE +L A+EL N +LF D E GG F + +++ R KE +DG
Sbjct: 469 YVWGLIELYEATFEPAYLEQALELNNDMLKLFWDEENGGLFLYGNDAENLITRPKEIYDG 528
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A P+GNSV+ +NL RLA + + + A L F + + M A L
Sbjct: 529 ALPAGNSVAAVNLFRLARLTGDRQ---LAERAREQLKAFGGSVAESPMGHSHFLMAV-WL 584
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ + +VG + + D E MLA ++ + TVI + P E E + +
Sbjct: 585 DLTPPVDITVVGDRKAGDTEKMLATVNSRFMPEATVI-LKPPGPE-----GEKLAQAVAF 638
Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
R+ + + K A VC+N+SC PPVTD LE LL
Sbjct: 639 LRDRQAVNGKATAYVCKNYSCHPPVTDADKLEKLL 673
>gi|87306323|ref|ZP_01088470.1| hypothetical protein DSM3645_08327 [Blastopirellula marina DSM
3645]
gi|87290502|gb|EAQ82389.1| hypothetical protein DSM3645_08327 [Blastopirellula marina DSM
3645]
Length = 688
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/700 (38%), Positives = 380/700 (54%), Gaps = 74/700 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE++ +A LN+ FVSIKVDREERPD+D++YM VQ L G GGWP+S
Sbjct: 48 SACHWCHVMEHESFENQEIADYLNEHFVSIKVDREERPDLDQIYMNAVQMLTGRGGWPMS 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM-LAQSGAFAIEQLSEA 138
VFL+P LKP GGTY+PP + G PGF +L+ V DAW+ +R + L QS FA E+L E
Sbjct: 108 VFLTPQLKPFFGGTYWPPTPRGGMPGFDQVLKAVMDAWENRRAIALEQSEKFA-ERLQEI 166
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A S ++ L +A + L YD R GGFG APKFP ++I++ L +S++
Sbjct: 167 GQAEDSGEQIDLHLLDDAYKY----LESIYDFRHGGFGGAPKFPHTMDIEVCLRYSRR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+S +M + L MA+GGI+DH+GGGF RYSVD RW VPHFEKMLYD LA
Sbjct: 221 -----QPSSRALEMAIHNLDQMARGGIYDHLGGGFARYSVDARWLVPHFEKMLYDNALLA 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY+D + T ++ + R+ DY+ + G S EDADS EG +EG FY
Sbjct: 276 GVYIDGYRATGREDFARVARETCDYVLHYLTDEAGGFQSTEDADS---EG----EEGKFY 328
Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---IELNDSSA 374
VWT +E+ DILGE F E + + +GN F+GKN+L + D A
Sbjct: 329 VWTPQEIVDILGEGEGRRFCEIFDVSESGN------------FEGKNILNLPQSIEDWGA 376
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
+++ + L + L++ R++L VR KR RP DDKV+VSWNGL+I S ARA+ L
Sbjct: 377 ASNLDVVELRRELDV---ARQQLLQVRDKRIRPAKDDKVLVSWNGLMIDSLARAAGALSE 433
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
+Y+ AE AA F+ + D+ + RL HS+R+G +K +LDDY
Sbjct: 434 ----------------PKYLIAAERAADFVFDKMIDD-SGRLLHSYRHGVAKLAAYLDDY 476
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A L + + LYE +WL AIEL N F D GGGY+ T + ++ R K+ +
Sbjct: 477 ANLANACISLYEASFAERWLKRAIELTNLMMRHFGDPVGGGYYFTADDHEKLIARNKDLY 536
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D + PSGNS++ + L+RL++++ ++ A ++ V +K A M A D
Sbjct: 537 DNSVPSGNSMAAVVLLRLSALLGNTE---LLDEAVTTIRVAAPLMKKHPTATGQMLAAVD 593
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
P+R+ VV+ G+ S LA SY N + + E+ + +
Sbjct: 594 RYLGPARE-VVIFGNADSGATHEFLAELRRSYTPNSAIACVSS---------EKALPSGS 643
Query: 675 SMA-----RNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
+A + VC+NF+C PVT ++ +L
Sbjct: 644 PLAPIFAGKGPLPEADGTVYVCENFACQRPVTAAEAIADL 683
>gi|386002945|ref|YP_005921244.1| hypothetical protein Mhar_2269 [Methanosaeta harundinacea 6Ac]
gi|357211001|gb|AET65621.1| hypothetical protein Mhar_2269 [Methanosaeta harundinacea 6Ac]
Length = 698
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 268/723 (37%), Positives = 369/723 (51%), Gaps = 70/723 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVM ESFEDE VA+LLN FV IKVDREERPD
Sbjct: 29 GEEAFTRAEREDKPVFLSIGYSTCHWCHVMAAESFEDEEVARLLNATFVPIKVDREERPD 88
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYM Q + G GGWPL+VFL+PD KP TY P E ++GR G ++ ++ W
Sbjct: 89 LDAVYMAVAQMMTGSGGWPLTVFLTPDKKPFFAATYIPKESRFGRIGILDLIPRIGHLWK 148
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP-----QNALRLCAEQLSKSYDSRFG 173
+R ML LS A +++ + P E+P + ++ + L +D+ G
Sbjct: 149 NERAML----------LSSAEEVASALRRPPPEVPGLRLEEATIKAAYQGLVARFDAANG 198
Query: 174 GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
GFG APKFP P +L H ++ D G G +M TL+ M +GGI DH+GGGF
Sbjct: 199 GFGGAPKFPSPTTFLFLLRHWRRTGDPG-------GVQMTEVTLRAMRRGGIFDHLGGGF 251
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D W +PHFEKMLYDQ ++ L+A T Y+ I R++ DYL RD+ P G
Sbjct: 252 HRYSTDLHWRLPHFEKMLYDQAMISLACLEAHQATGKAEYATIAREVFDYLLRDLAAPEG 311
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSR 352
+SAEDADS EG +EG FY+WT EV +L + A L ++L+ GN
Sbjct: 312 GFYSAEDADS---EG----EEGRFYLWTLPEVRAVLDPDEAELAARIFHLQEEGNF---- 360
Query: 353 MSDPHNEFKGKNVL---IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
+ GKNVL I L D A ++G+P+ L R KLF R R RP
Sbjct: 361 REEATGRLTGKNVLAMKIPLED---HAREMGIPVGDLREWLEAAREKLFAAREGRARPKK 417
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
DDK++ WNGL I++ AR +++L G R E E A+ AA + +
Sbjct: 418 DDKILADWNGLAIAALARGAQVL--------------GDRRLE--EAADRAADLVLHRMR 461
Query: 470 DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 529
DE+ RL H +R G + G LDDYA ++ GLL+LYE G + L A+ L E F
Sbjct: 462 DERG-RLLHRYRGGDAGILGNLDDYANMVWGLLELYEAGFRPERLEAALALARDMVERFR 520
Query: 530 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
DR+GGG+F T + +++R K+ HDGA P+GN+V+ NL+RLA + + +
Sbjct: 521 DRDGGGFFFTPEDGEELIVRRKDGHDGALPAGNAVAAFNLLRLARMTGDPELEVI---GS 577
Query: 590 HSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN 649
L F + + A + A D PS VV+VG S + ML A + +
Sbjct: 578 EGLQAFAAQARGSPSAFLHLLSALDFALGPS-SEVVVVGEAGSPETAEMLKALRSRFLPR 636
Query: 650 KTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
K V+ + + + E A M + A VC C P TDP ++ L
Sbjct: 637 KVVLGRPVGEDQRI---VELAGFTAEM---EALEGRTTAYVCSGRVCRQPTTDPAAVLKL 690
Query: 710 LLE 712
L E
Sbjct: 691 LEE 693
>gi|221632535|ref|YP_002521756.1| hypothetical protein trd_0509 [Thermomicrobium roseum DSM 5159]
gi|221156894|gb|ACM06021.1| Protein of unknown function, DUF255 family [Thermomicrobium roseum
DSM 5159]
Length = 687
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 253/706 (35%), Positives = 370/706 (52%), Gaps = 88/706 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME E FE+ +A+L N+ FV+IKVDREERPD+D++YM +QA+ G GGWPL+
Sbjct: 48 SSCHWCHVMERECFENPEIAQLQNELFVNIKVDREERPDLDELYMNALQAMTGSGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG----AFAIEQL 135
VFL+PD KP GGTYFPPED+ P + +L V A+ ++R + ++ ++ +Q
Sbjct: 108 VFLTPDGKPFYGGTYFPPEDRGQLPAWPRVLLAVAQAYRERRADVERAAEDLVSYLQQQS 167
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
L A+ + DE +N L YD GGFG+APKFP P++++ +L
Sbjct: 168 RPPLQAAPLREQFLDEAARN--------LVPHYDREHGGFGTAPKFPSPLQLEFLL---- 215
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
T + A +MVL TL MA+GGIHD +GGGFHRY+VDE W VPHFEKMLYD
Sbjct: 216 ---RTFRRAGAPRALEMVLQTLTAMARGGIHDQIGGGFHRYTVDEAWLVPHFEKMLYDNA 272
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
LA VY A + + I + L Y++R+M G G F+A+DADS E EG
Sbjct: 273 LLARVYTLAHLASGNRLCRTIAEETLVYIQREMRGDHGAFFAAQDADSEE-------GEG 325
Query: 316 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
AFY+WT +E+ +LG + A L ++ + P GN F+GK++L D
Sbjct: 326 AFYLWTPEEIAAVLGNDDAGLACRYFGVTPRGN------------FEGKSILHVAEDPVT 373
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
AS+ G+ L++ +G R +L++ R +RP P D+KVIV+WN L I +FA A L
Sbjct: 374 IASEFGLSLDELEQRIGSIRARLYEARDQRPHPARDEKVIVAWNALAIRAFAEAGTAL-- 431
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
DR +++ +AE AA+F+R L+D +T L H + G ++ PGFLDDY
Sbjct: 432 --------------DRPDFVALAERAATFLRDQLWDGKT--LYHVWEEGEARFPGFLDDY 475
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A L++ L+ LYE W+ WA +L F+D G +++T + +++R K
Sbjct: 476 ADLVNALVSLYEATFDPFWIAWARQLTEAILAKFIDPVAGDFYDTASDGEQLIVRPKTFI 535
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSK---------SDYYRQNAEHSLAVFETRLK-DMAM 604
D PSGN + L+RL +++ + Y + EH +A + L D A+
Sbjct: 536 DQGTPSGNGATAEALLRLGTLLGEHRFIDQARTLLERYAQLAVEHPIACGQLLLAMDFAL 595
Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
P V ++G + + +L ASY N+ + P D
Sbjct: 596 GQPF--------------EVAIIGDPTQPETRALLRVVQASYLPNRVLALRRPED----- 636
Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
E S +A + A VC+NF+C PVT P L + L
Sbjct: 637 --EIAASIVPLLAERSLVDGHPAAYVCRNFACQRPVTTPQELASQL 680
>gi|302392081|ref|YP_003827901.1| hypothetical protein [Acetohalobium arabaticum DSM 5501]
gi|302204158|gb|ADL12836.1| protein of unknown function DUF255 [Acetohalobium arabaticum DSM
5501]
Length = 686
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 259/691 (37%), Positives = 373/691 (53%), Gaps = 76/691 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA++LN FV+IKVDREERPD+D +YMT Q L G GGWPL+
Sbjct: 55 STCHWCHVMERESFEDEEVAEILNRSFVAIKVDREERPDIDNIYMTVCQTLTGRGGWPLT 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V ++P+ KP GTYFP E G+PG IL +V+ AW KKR L ++ E++ AL
Sbjct: 115 VIMTPEKKPFFAGTYFPKEAGRGQPGLMDILIRVEQAWKKKRQPLLETS----EEILSAL 170
Query: 140 SASASSNKLPDELPQNALRLCAE---QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
++K + L E ++D +GGFG+APKFP P + +L + K
Sbjct: 171 ERVNDTDKNDSASMEEMSGLAKEAFISFVANFDEDYGGFGTAPKFPTPHNLMFLLRYWK- 229
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+GE + +MV TL M +GG++DH+G GF RYS DE+W VPHFEKMLYD
Sbjct: 230 -----STGE-EKALEMVETTLDNMYRGGMYDHLGYGFARYSTDEKWLVPHFEKMLYDNAL 283
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
LA YL+A+ +T Y+ I R+I Y+ RD+ P G +SAEDADS ++EG
Sbjct: 284 LAVTYLEAYQITDKEDYADIAREIFTYVLRDLTSPEGGFYSAEDADS-------EREEGK 336
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LI--ELNDS 372
FYVWT E++ ILG E + C + ++D N F+GK++ LI EL+ S
Sbjct: 337 FYVWTPNEIKKILGNKQ---GEEF-------CQVYNITDEGN-FEGKSIPNLIGTELDKS 385
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
R++LF R KR PH DDK++ SWNGL+I++ A +++L
Sbjct: 386 EVDKK------------FAAERKELFKAREKRVHPHKDDKILTSWNGLMIAALAIGARVL 433
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
E Y + A+ AA FI ++L + RL +RNG + G++D
Sbjct: 434 NDE----------------RYQQAAKEAAEFIWQNLRRDGNGRLLARYRNGEADYYGYVD 477
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAF I GL++LYE T++L A EL N E F D+E GG + + +L R KE
Sbjct: 478 DYAFFIWGLIELYETTFETEYLEKAAELNNDLIEYFWDKEQGGLYFYGYDSEELLTRPKE 537
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
+DGA PSGNSV+ +NL+RLA ++ ++ + + A F +R+ + +A +
Sbjct: 538 IYDGAIPSGNSVATLNLLRLAKLIGDTELE---EKARQQFEYFGSRITNKPIASSYFLLS 594
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
+ + + +V+ G++ E M+ H + L TV ++ T+E + S
Sbjct: 595 W-LFAQNGGREIVIAGNREETVTEEMVQVLHQEF-LPFTVSLLNT--TQE----RKKLSE 646
Query: 673 NASMARNNFSADKV-VALVCQNFSCSPPVTD 702
A + DK A +C+NF+C PV D
Sbjct: 647 LVPFAADQMKVDKRPTAYICENFACQKPVID 677
>gi|294507561|ref|YP_003571619.1| hypothetical protein SRM_01746 [Salinibacter ruber M8]
gi|294343889|emb|CBH24667.1| conserved hypothetical protein [Salinibacter ruber M8]
Length = 701
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 257/691 (37%), Positives = 356/691 (51%), Gaps = 51/691 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED+ VA LLND FV IKVDREERPDVD +YM Q + G GGWPL+
Sbjct: 48 STCHWCHVMERESFEDDDVAALLNDGFVPIKVDREERPDVDSIYMDVCQMMRGQGGWPLT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSE 137
V L+PD KP TY P E ++ + G +L +V+ W D + +L + EQ+++
Sbjct: 108 VLLTPDRKPFFAATYLPKEGRFQQTGLMDLLPRVRQLWNSDDRAKLLDDA-----EQVTD 162
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
L D L A QL++ +D GGFGSAPKFP P + +L H +
Sbjct: 163 RLQRIGDDQTDGDAPGPTLLDDAARQLAQQFDRTHGGFGSAPKFPAPHNLLFLLRHWHR- 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG+ ++ V TL M GG+ D VG GFHRYS D++W +PHFEKMLYDQ
Sbjct: 222 --TGEQAALNQ----VTTTLDRMRWGGLFDQVGYGFHRYSTDQQWKLPHFEKMLYDQAMH 275
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ T Y R++L Y+RRD+ P G FSAEDADS EG +EGAF
Sbjct: 276 VLAYTEAYQATGTDRYERTAREVLTYVRRDLQAPDGGFFSAEDADSLNAEGDM--EEGAF 333
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW+ +++ + L A L + Y + P GN R E GKNVL +A+A
Sbjct: 334 YVWSIEDIREHLEPALADLVIDVYNMSPAGNYQEERT----GERTGKNVLHRDQSLAAAA 389
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ GM + + L RR L D RS+RPRP LDDKV+ WNGL+ ++ A+A+++
Sbjct: 390 EQRGMEADVLRDHLDTARRVLLDARSERPRPGLDDKVLTDWNGLMTAALAKAARVF---- 445
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
D ++ E A F+ ++D RL H +R G + LDDYAF
Sbjct: 446 ------------DEAQFEEAAVQTGRFVLDTMHDADG-RLLHRYREGEAGIQATLDDYAF 492
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI GLL+LYE WL A+E + F D EGGG++ T + ++++R KE +DG
Sbjct: 493 LIWGLLELYETTFDADWLRAAVEHMEAALDRFWDAEGGGFYMTPEDGEALIVRPKEANDG 552
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV ++NL+RLA ++++ + A S T + ++ L
Sbjct: 553 ALPSGNSVQLMNLLRLARFTG--RTEFEERAAALSRWAGATARRRPTGFTAMLSGLHWAL 610
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
P + VV+ G S D ++ Y + P D + + A
Sbjct: 611 GTP--REVVVAGEPDSDDTNALIDVLRDDYTPTTVTLQRPPGDAD--------ITALAPF 660
Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISL 706
+ D + A VC+ F C PVTDP +L
Sbjct: 661 TESQTPVDGRAAAYVCEAFRCEAPVTDPAAL 691
>gi|25326752|pir||A88216 protein B0495.5 [imported] - Caenorhabditis elegans
Length = 722
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 262/727 (36%), Positives = 373/727 (51%), Gaps = 60/727 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + FL +TCHWCHVME ESFE+E AK+LND FV+IKVDREERPD
Sbjct: 35 GQEAFQKAKDNNKPIFLSVGYSTCHWCHVMEKESFENEATAKILNDNFVAIKVDREERPD 94
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDK+YM +V A G GGWP+SVFL+PDL P+ GGTYFPP+D G GF TIL +
Sbjct: 95 VDKLYMAFVVASSGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEVV 154
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+KR ++ I +L + +AS N+ + + S+DSR GGFG A
Sbjct: 155 EKRRREFETTRAQIIKLLQPETASGDVNR-----SEEVFKSIYSHKQSSFDSRLGGFGRA 209
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP+ ++ ++ + +S +A + M+ TL+ MA GGIHDH+G GFHRYSV
Sbjct: 210 PKFPKACDLDFLITFAAS---ENESEKAKDSIMMLQKTLESMADGGIHDHIGNGFHRYSV 266
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDILDYLRRDMIGPGGEIF 296
WH+PHFEKMLYDQ QL Y D LT K ++ DI Y+++ GG +
Sbjct: 267 GSEWHIPHFEKMLYDQSQLLATYSDFHKLTERKHDNVKHVINDIYQYMQKISHKDGG-FY 325
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCD 349
+AEDADS ++ K EGAF W +E++ +LG+ I + +++ ++ +GN
Sbjct: 326 AAEDADSLPNHNSSNKVEGAFCAWEKEEIKQLLGDKKIGSASLFDVVADYFDVEDSGN-- 383
Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
++R SDPH E K KNVL +L A+ + + + + E + L++ R++RP PHL
Sbjct: 384 VARSSDPHGELKNKNVLRKLLTDEECATNHEISVAELKKGIDEAKEILWNARTQRPSPHL 443
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
D K++ SW GL I+ +A + ++ +Y++ AE A FI + L
Sbjct: 444 DSKMVTSWQGLAITGLVKAYQ----------------ATEETKYLDRAEKCAEFIGKFLD 487
Query: 470 DEQTHR------LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
D R G + F DDYAFLI LLDLY ++L A+ELQ
Sbjct: 488 DNGELRRSVYLGANGEVEQGNQEIRAFSDDYAFLIQALLDLYTTVGKDEYLKKAVELQKI 547
Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
D F + G GYF + D V +R+ ED DGAEP+ S++ NL+RL I+ + +
Sbjct: 548 CDVKFWN--GNGYFISEKTDEDVSVRMIEDQDGAEPTATSIASNNLLRLYDIL---EKEE 602
Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
YR+ A RL + +A+P M A + S VLVG S + +
Sbjct: 603 YREKANQCFRGASERLNTVPIALPKMAVALHRWQIGSTT-FVLVGDPKSELLSETRSRLN 661
Query: 644 ASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
+ N +V+HI EE S + + K +C+ F C PV
Sbjct: 662 QKFLNNLSVVHIQS---------EEDLSASGPSHKAMAEGPKPAVYMCKGFVCDRPVKAI 712
Query: 704 ISLENLL 710
LE L
Sbjct: 713 QELEELF 719
>gi|125972813|ref|YP_001036723.1| hypothetical protein Cthe_0291 [Clostridium thermocellum ATCC
27405]
gi|281417012|ref|ZP_06248032.1| protein of unknown function DUF255 [Clostridium thermocellum JW20]
gi|385779271|ref|YP_005688436.1| hypothetical protein Clo1313_1937 [Clostridium thermocellum DSM
1313]
gi|419721660|ref|ZP_14248818.1| hypothetical protein AD2_1363 [Clostridium thermocellum AD2]
gi|419725407|ref|ZP_14252450.1| hypothetical protein YSBL_1257 [Clostridium thermocellum YS]
gi|125713038|gb|ABN51530.1| hypothetical protein Cthe_0291 [Clostridium thermocellum ATCC
27405]
gi|281408414|gb|EFB38672.1| protein of unknown function DUF255 [Clostridium thermocellum JW20]
gi|316940951|gb|ADU74985.1| hypothetical protein Clo1313_1937 [Clostridium thermocellum DSM
1313]
gi|380771156|gb|EIC05033.1| hypothetical protein YSBL_1257 [Clostridium thermocellum YS]
gi|380782356|gb|EIC11996.1| hypothetical protein AD2_1363 [Clostridium thermocellum AD2]
Length = 680
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 257/689 (37%), Positives = 367/689 (53%), Gaps = 77/689 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA++LN FVSIKVDREERPD+D +YMT QAL G GGWPL+
Sbjct: 53 STCHWCHVMESESFEDEEVAEILNKNFVSIKVDREERPDIDSIYMTACQALTGHGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTYFP +D+ G PG +IL+ V + W ++D LA+ + + +SE++
Sbjct: 113 IIMTPDKKPFFAGTYFPKKDRMGMPGLISILKSVHNTWVNEKDSLAKYSSKVVSVISESI 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ DE+ ++ Q +D+ +GGFG+APKFP P + +L + K
Sbjct: 173 DDDYYYS--VDEITEDIFEDAFSQFKYDFDNIYGGFGNAPKFPMPHNLYFLLRYWHK--- 227
Query: 200 TGKSGEASEGQKMVLF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
A E +V+ TL M GGI+DH+G GF RYS DE+W VPHFEKMLYD L
Sbjct: 228 ------AKEEYALVMVEKTLDSMYSGGIYDHIGFGFCRYSTDEKWLVPHFEKMLYDNALL 281
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A YL+ + TK+ Y+ I ++I Y+ RDM P G +SAEDADS EG +EG F
Sbjct: 282 AIAYLETYQATKNKKYADIAKEIFTYVLRDMTSPEGGFYSAEDADS---EG----EEGKF 334
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ E++++LGE F ++Y + GN F+G N+ +N +
Sbjct: 335 YIWSPTEIKEVLGESDGEKFCKYYNITEEGN------------FEGLNIPNLINSTIPDE 382
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
K + L CR+KLFD R KR PH DDK++ +WNGL+I++ A ++L E
Sbjct: 383 DKEFVEL---------CRKKLFDHREKRVHPHKDDKILTAWNGLMIAALAIGGRVLGIE- 432
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+Y AE A+ FI L RL +R+G + +LDDYAF
Sbjct: 433 ---------------KYTLAAEKASEFIFSKLV-RPDGRLLARYRDGEAAFLAYLDDYAF 476
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI L++LYE +L A+EL N + F D + GG F + ++ R KE +DG
Sbjct: 477 LIWALIELYETTYKPMYLKKAMELTNDMIKYFWDNKKGGLFIYGSDSEQLITRPKEIYDG 536
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV+ +N +RL+ + + + + A A+F +++ M A +
Sbjct: 537 AIPSGNSVAALNFLRLSRLTGQQELE---EKAHQMFALFGSKIDSMPQGYAFFLTAM-LF 592
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
S VVLVG D +NML+ + T I + EEH +
Sbjct: 593 SKSKSNEVVLVGSNEK-DTQNMLSILSEDFRPFTTSIL----------YSEEHKDLKELI 641
Query: 677 AR-NNFSA--DKVVALVCQNFSCSPPVTD 702
+N++ +K A VC+NF C P+TD
Sbjct: 642 PFIDNYTTIENKPTAYVCENFVCHEPITD 670
>gi|408381411|ref|ZP_11178960.1| hypothetical protein A994_03123 [Methanobacterium formicicum DSM
3637]
gi|407815878|gb|EKF86441.1| hypothetical protein A994_03123 [Methanobacterium formicicum DSM
3637]
Length = 712
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 260/701 (37%), Positives = 366/701 (52%), Gaps = 58/701 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESF+D + LLN FV +KVDREERPD+D VYMT Q + G GGWPL+
Sbjct: 59 STCHWCHVMARESFQDPEIGDLLNQVFVPVKVDREERPDIDSVYMTVCQMITGSGGWPLT 118
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLS 136
V ++PDLKP GTYFP + G + ++ V+D WD KR L +S +++Q+S
Sbjct: 119 VIMTPDLKPFFAGTYFPKDTGPRGTGLRDLILNVRDLWDNKRGELVKSAEELTHSLQQIS 178
Query: 137 EA-----LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
E + S + EL + L+ + LS ++D ++ GFG+ KFP P + +L
Sbjct: 179 EGPLPQTVKGSQGFPESSQELGEEILKQAYQSLSDNFDEKYTGFGNNQKFPTPHHLLFLL 238
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ K TG+ + MV TL M KGGI+DHVG GFHRY+VD +W VPHFEKML
Sbjct: 239 RYWKH---TGEDMALT----MVERTLDAMKKGGIYDHVGFGFHRYTVDRQWMVPHFEKML 291
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YDQ LA Y +AF T Y ++L+Y+ RDM P G +SAEDADS EG
Sbjct: 292 YDQALLAIAYTEAFQATGKTQYRETAEEVLEYILRDMRSPEGGFYSAEDADS---EG--- 345
Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIEL 369
+EG FY+WT E+ D+LG + LF E Y + GN D K GKN+L
Sbjct: 346 -EEGKFYLWTQDEIMDLLGSNDGALFSEIYSVSEEGN-----FKDEATRVKTGKNILHRT 399
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
+ KLG+ E+ R LF R R PH DDKV+ WNGLVI + A A
Sbjct: 400 QTWDELSKKLGISTEELWWKTETARETLFHARKSRIHPHKDDKVLTDWNGLVIVALALAG 459
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
K R++Y+ A A FI L+ + RL+H +R+G + G
Sbjct: 460 NSFK----------------REDYLMAAGDAVKFIMTKLHHQG--RLKHRWRDGEAAVDG 501
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
LDDYA+LI GLL+LY+ +++L A++L T E FLD + GG++ T+ +L+R
Sbjct: 502 NLDDYAYLIWGLLELYQATFQSEYLEIALKLNQTLLEHFLDHDNGGFYFTSDFTQKILVR 561
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
KE +D A PSGNSV ++NL + + I+ D + H L + + + + M
Sbjct: 562 QKEAYDTALPSGNSVQMMNLEKFSLII----DDMKISESFHGLESYFASMITQSPSAFTM 617
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
+A +L + VV+ G K S D + +L Y L ++ ++ +D +
Sbjct: 618 FLSAIILKIGPSFQVVICGEKDSPDTQVLLNTIQKEY-LPNVILILNSSDDSLI------ 670
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N S+ + A VC N +C PV +P L N+L
Sbjct: 671 NQIVGSLEHKTIVNGQATAYVCGNGTCHAPVNNPDDLINIL 711
>gi|268325595|emb|CBH39183.1| conserved hypothetical protein, DUF255 family [uncultured archaeon]
Length = 685
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 263/708 (37%), Positives = 371/708 (52%), Gaps = 93/708 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFE++ A+LLN F+ IKVDREERPD+D +YM VQ + G GGWPLS
Sbjct: 50 STCHWCHVMARESFENKQTAELLNTNFICIKVDREERPDLDALYMKAVQMMAGTGGWPLS 109
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PDLKP GGTYFPPE +G P F +L+ + D W +KR+ + S EQ++E L
Sbjct: 110 VFMTPDLKPFYGGTYFPPEPIHGLPAFNELLQTITDYWHEKRERILHSS----EQITEHL 165
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--------APKFPRPVEI-QMM 190
S N L +EL + L EQL+ +DS +GGFG+ PKFP P + ++
Sbjct: 166 RRSYQHNLLTEELSVDMLENAFEQLNLQFDSTYGGFGAEVAAWSVKKPKFPLPSYLFFLL 225
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
LYH + E S KMV TL MA+GGI+D + GGFHRYS D RW VPHFEKM
Sbjct: 226 LYHHRTDE--------SYALKMVTKTLYEMARGGIYDQLAGGFHRYSTDNRWLVPHFEKM 277
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD LA VYL A+ +T D F++ I + LD++ R+M G +SA DADS +
Sbjct: 278 LYDNALLAQVYLWAYQVTGDKFFAQIATETLDWVLREMTDSNGGFYSAIDADSEDI---- 333
Query: 311 RKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
EGAFYVW+ E+ +L EH +F +Y + GN + GK+VL
Sbjct: 334 ---EGAFYVWSPSEIISVLSEEHGEVFCRYYGVTQQGNFE-----------GGKSVLHVA 379
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
ND + I+ ++KL + R++R RP DDK+I WN L+IS+FA
Sbjct: 380 NDEVNKDTA---------GIINRSKQKLLEARNRRIRPATDDKIITGWNSLMISAFALGY 430
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
++L+ + +++ A SA FI L E +L +R G + G
Sbjct: 431 QVLRE----------------RRFLDAATSATQFILNKLNKEG--QLFRRYRAGEAAITG 472
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
LDD+AFLI+ LLD+YE KWL A++ + ELF D+ G+F + +
Sbjct: 473 TLDDHAFLIAALLDIYEASFDLKWLREALQRNDRVVELFWDKANAGFFFNRYGETDLPAA 532
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+KE +DG PSGNS++ NL+RLA++ + ++ R A+ F +L+ + M
Sbjct: 533 IKEAYDGPIPSGNSIAAQNLIRLAAL---TDNEELRILAKDLFRTFGAQLEQSPLEHTQM 589
Query: 610 CCAADM-LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
CA D LS P + VV+ K ++ A + + L VI +
Sbjct: 590 LCALDFYLSSPMQ--VVIASQK--IEEVQAFAVEISRHFLPNQVIAFTSS---------- 635
Query: 669 HNSNNASMARNNFSADKV------VALVCQNFSCSPPVTDPISLENLL 710
S+N R DKV +C+N++C P+TD L +L
Sbjct: 636 --SDNELSGRIPLITDKVAVQGKPTVYICENYACKAPITDLYDLRRVL 681
>gi|399888568|ref|ZP_10774445.1| hypothetical protein CarbS_08603 [Clostridium arbusti SL206]
Length = 679
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 253/692 (36%), Positives = 364/692 (52%), Gaps = 70/692 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED VA+LLN +F++IKVDREERPD+D +YM+ QA+ G GGWP+++
Sbjct: 54 TCHWCHVMEKESFEDNEVAELLNKYFIAIKVDREERPDIDNIYMSVCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++ D KP GTY P + +YG G +L K+ W + ++ L +S ++ L + +
Sbjct: 114 IMTSDKKPFFAGTYLPKKTQYGHMGLMELLNKINKLWIEDKNKLVESSNNIVDFLQDQIV 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
E+ + + E L SY+ FGGF S+PKFP P + +L + + D
Sbjct: 174 HKKG------EISEKIVNDAYESLRDSYNPVFGGFSSSPKFPTPHNLNFLLRYYRAKGD- 226
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+MV TL M GGI DH+G GF RYSVD +W VPHFEKMLYD LA +
Sbjct: 227 ------KYALQMVENTLNSMYSGGIFDHIGFGFSRYSVDSKWLVPHFEKMLYDNALLAII 280
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y + + +T Y I IL+Y+ RDM G +SAEDADS EG EG FYVW
Sbjct: 281 YTETYQITHKDRYREIAMKILNYILRDMTSKQGGFYSAEDADS---EGV----EGKFYVW 333
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASK 378
KE++ +LGE A F EHY +K GN F+GKN+ LI +
Sbjct: 334 DKKEIKSVLGEDADFFNEHYNIKSKGN------------FEGKNIPNLIGEDLEELEDES 381
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ L+ + KLF R KR PH DDK++ SWNGL+I++ A A +
Sbjct: 382 IKSKLDG-------LKEKLFSYREKRIHPHKDDKILTSWNGLMIAAMAYAGR-------- 426
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
V G +R Y E A + SFI +L + + RL +R+G + G+LDDYAFL+
Sbjct: 427 ------VFGIER--YKEAASKSISFISHNLVNHKG-RLLCRYRDGEAANLGYLDDYAFLV 477
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL+++YE + +L AIEL + + F D + GG F + ++L+ KE +DGA
Sbjct: 478 FGLIEMYEATFESFYLRKAIELNDEMVKYFWDEQNGGLFFYGKDSEELILKTKEIYDGAI 537
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV+ +N++RL+ I K + Q A F ++ ++ +A + +A + S
Sbjct: 538 PSGNSVAAMNIIRLSRITGDKKLE---QKAGEIFNTFAEKINEVPLAY-VNTISAFLTSK 593
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
S HVV+ G K + + M+ + + +I D +++E+ NN M +
Sbjct: 594 ISETHVVIAGDKDHTNTKAMINEINKKFLPFSEIIFND--ESKEIYKLIPFIKNNV-MVK 650
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N K A VC+N SC P D NL+
Sbjct: 651 N-----KTTAYVCKNNSCLAPTNDLQEFSNLI 677
>gi|405123962|gb|AFR98725.1| cold-induced thioredoxin domain-containing protein [Cryptococcus
neoformans var. grubii H99]
Length = 745
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 267/708 (37%), Positives = 389/708 (54%), Gaps = 42/708 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHV+ ESFEDE AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S
Sbjct: 60 SACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMS 119
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+F++P L+P GTYFP RP F +L K+ + W++ R+ + G IE L +
Sbjct: 120 IFMTPKLEPFFAGTYFP------RPNFHQLLNKIHEVWEEDREKCEKMGKGVIEALKDMS 173
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA------PKFPR-PVEIQMMLY 192
+S L L + QLS D+R+GGF +A PKFP + ++ +
Sbjct: 174 DTGRTSESLSQLLSSSPASKLFAQLSTMNDTRYGGFTNAGSSTRGPKFPSCSITLEPLAR 233
Query: 193 HSKKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ ++ E E ++M + L+ M GGI D VGGG RYSVDE+W VPHFEKML
Sbjct: 234 LASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKML 293
Query: 252 YDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
YDQ QL + LD L +D Y + DIL Y RD+ P G +SAEDADSAE
Sbjct: 294 YDQAQLVSSCLDFARLYPANHQDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEY 353
Query: 307 EGATRK--KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
+GA + EGAFY+W E+++ILG+ A LF + ++P GN ++ + D H E +GKN
Sbjct: 354 KGAKKSVLPEGAFYIWKKTEIDEILGDDAPLFDSFFGVEPDGNVNI--IHDSHGEMRGKN 411
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
+L + A + G ++ +I+ E KL R +R RP LDDK++ +WNGL++++
Sbjct: 412 ILHQHKTYEEVALEFGKREDQAKDIIIEACEKLRLKREERERPGLDDKILTAWNGLMLTA 471
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
++AS +L S + P A +F++ H++D T L S+R G
Sbjct: 472 LSKASTLLPSSYGISSQCLP-----------AALGIVNFVKSHMWDPSTRTLTRSYREG- 519
Query: 485 SKAP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 543
K P DDYAFLI GLL+LYE +++A ELQ QDELF D + GGYF + ED
Sbjct: 520 -KGPQAQTDDYAFLIQGLLNLYEATGDESHVLFAEELQKRQDELFWDDDDGGYF-ASAED 577
Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 603
VL+R+K+ DGAEPS +VS NL R + +++ S+ + Y AE + +
Sbjct: 578 AHVLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLS-SEFENYEARAEATFLSMGPLITQAP 636
Query: 604 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM 663
AV L R+ V+++G + + L AA +Y N+ ++HI P +
Sbjct: 637 RAVGYAVSGLIDLEKGYRE-VIVIGSANDEMIKEFLKAARETYFSNQVIVHIQPEKLPK- 694
Query: 664 DFWEEHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
E++ A + +K +L VC+ +C PV D +NLL
Sbjct: 695 GLAEKNEVVKALINDVESGKEKEASLRVCEGGTCGLPVKDLEGAKNLL 742
>gi|347754417|ref|YP_004861981.1| thioredoxin domain-containing protein [Candidatus
Chloracidobacterium thermophilum B]
gi|347586935|gb|AEP11465.1| Thioredoxin domain containing protein [Candidatus
Chloracidobacterium thermophilum B]
Length = 691
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 255/692 (36%), Positives = 366/692 (52%), Gaps = 58/692 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME E FE+ +A L+N+ FV+IKVDREERPD+D +YM VQ + G GGWPL+
Sbjct: 56 SACHWCHVMEHECFENPSIAALMNELFVNIKVDREERPDLDTLYMNAVQLMTGRGGWPLT 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GGTYFPPED+ PGF ILR V DA+ ++R + QS A +L
Sbjct: 116 VFLTPDGEPFYGGTYFPPEDRGRMPGFPRILRSVADAYRQRRQDVRQSIAEITAELRRIH 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ L E+ +A R +LS +D GGFG APKFP + + +L + +
Sbjct: 176 EPLDGARTLSPEILTDAYR----RLSTRFDHVHGGFGGAPKFPNSMLLSFLLRYWR---- 227
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+GE +MV +L MA GG++DH+GGGFHRYS D++W VPHFEKMLYD LA
Sbjct: 228 --LTGEL-HALEMVELSLDKMASGGMYDHLGGGFHRYSTDDQWLVPHFEKMLYDNALLAR 284
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+A+ T Y I + LDY+ R+M P G ++ +DADS EG +EG F+V
Sbjct: 285 TYLEAWQATGKPRYRQIVEETLDYVVREMTAPTGGFYATQDADS---EG----EEGRFFV 337
Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +E+ +L E A L + ++ + GN E GK VL A
Sbjct: 338 WTPEEINTLLDEADADLVRRYFDVTEEGNF----------EGTGKTVLSTPLPLETVARL 387
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ E ++L +R LF+ R +R +P D+K + +WNGL++ SFARA+ +L
Sbjct: 388 KEVTPEHLEHVLARAKRILFEAREQRVKPARDEKCLAAWNGLMLYSFARAAAVL------ 441
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+R +Y VAE A+F+ +Y + L S ++G +K PG+ +DYA
Sbjct: 442 ----------ERDDYRAVAERNAAFVLGTMYVDGI--LYRSHKDGQNKFPGYQEDYACYA 489
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL LYE K+ A EL F D +GGG+F T ++ RVK+ D A
Sbjct: 490 EGLLALYEATGNVKYFCAARELTEAMLAQFDDPQGGGFFFTGDRHEQLITRVKDVFDNAT 549
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV+V L+RLA + + YR+ AEH L + + M + A D +
Sbjct: 550 PSGNSVAVEVLLRLALLTGEQR---YRERAEHILQTLSSSMAKMPSGFGQLLGALDFY-L 605
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
S + +V+VG + + + ++ ++ V ++P D +H +A+
Sbjct: 606 ASVREIVIVGPPDAAETRELRRVVEEAFRPHRVVALLNPEDG-------DHAQYVPLVAQ 658
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VCQNF+C PVT P +L L
Sbjct: 659 RTMHNGQPTAYVCQNFTCQAPVTTPDALRAQL 690
>gi|118443135|ref|YP_878469.1| thymidylate kinase [Clostridium novyi NT]
gi|118133591|gb|ABK60635.1| thymidylate kinase [Clostridium novyi NT]
Length = 678
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 245/690 (35%), Positives = 375/690 (54%), Gaps = 73/690 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVME ESFEDE VA++LND ++SIKVDREERPDVD +YMT+ QA+ G GGWPL++
Sbjct: 61 SCHWCHVMENESFEDEEVAEILNDNYISIKVDREERPDVDNIYMTFCQAVTGSGGWPLTI 120
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD +P GTYFP + YGRPG IL ++ D W+ ++ + S ++ L E
Sbjct: 121 IMTPDQRPFFAGTYFPKKRMYGRPGLIQILNQIADEWEINKNNIINSSDELLKTLKEH-E 179
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A S ++ +E+ Q+A+ E++ YD +GGFG APKFP P ++ ++L + K+ D
Sbjct: 180 AQDKSGEINEEVLQDAI----EEMKYYYDDVYGGFGIAPKFPTPHKLMLLLTYYKEYNDK 235
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+V TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD LA V
Sbjct: 236 NV-------LHIVEHTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYV 288
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A+ LT FY + I Y+ RDM P G +SAEDADS EG EG FY+W
Sbjct: 289 YTEAYQLTGKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYLW 341
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
E+E+IL E Y K D++R+ + F+G N+ + +G
Sbjct: 342 KLNEIENILKED--------YKKFCNTYDITRVGN----FEGSNI----------PNLIG 379
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+E ++ L R KLF +R KR P DDK++ +WN L+IS+ A ++ ++
Sbjct: 380 KDIEN-IDKLEYIREKLFQIREKRIHPFKDDKILTAWNALMISALAYGGRVFEN------ 432
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
KEY++ A+ A FI+ +L + RL FR G + +L+DY+FL+
Sbjct: 433 ----------KEYIKRAKDAYDFIKNNLI-RKDGRLLARFRYGEAAYIAYLEDYSFLVWA 481
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L++LYE +K+L A+ Q+ +LF D + G+F++ + ++L +K+ +D A PS
Sbjct: 482 LIELYEATFESKFLKEALYFQDEMIKLFWDEKSYGFFHSGKDGEKLILNLKDSYDTAIPS 541
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
GNSV+ +NL++L+ I + + A + F +K+ + + A PS
Sbjct: 542 GNSVAAMNLIKLSKITGYNS---LVEKAYKMIKGFGGNIKESLQSHSVFLMAYMNYIRPS 598
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
R+ +++ +K +M+ + + + T + ++ E++ S+
Sbjct: 599 RQ-IIIASNKEDKVLNDMIREVNKKF-MPFTTVLLNDGTLEDII---------PSIKNEK 647
Query: 681 FSADKVVALVCQNFSCSPPVTDPISLENLL 710
+K A VC+NFSC+ PV + LL
Sbjct: 648 IIDNKTTAYVCENFSCNRPVNNVEDFRKLL 677
>gi|357039905|ref|ZP_09101696.1| hypothetical protein DesgiDRAFT_2812 [Desulfotomaculum gibsoniae
DSM 7213]
gi|355357268|gb|EHG05044.1| hypothetical protein DesgiDRAFT_2812 [Desulfotomaculum gibsoniae
DSM 7213]
Length = 688
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 260/693 (37%), Positives = 367/693 (52%), Gaps = 55/693 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED+ VA LN FVSIKVDREERPD+D++YMT QAL G GGWPL+
Sbjct: 48 STCHWCHVMERESFEDQEVADALNHHFVSIKVDREERPDIDQIYMTVCQALTGQGGWPLT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V ++PD KP GTYFP ++GR G I+ +V D W RD L Q+ EQ+
Sbjct: 108 VIMTPDKKPFFAGTYFPKRSRWGRAGLLDIIEQVADKWTNDRDKLIQASDMITEQVQ--- 164
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
L DE + +Q +S+D ++GGFG APKFP P + ++ + K
Sbjct: 165 --FTPGGYLADEPLADISARGYKQFRQSFDKQYGGFGLAPKFPTPHNLLFLMRYWK---- 218
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++GE + M TLQ + +GGI+DH+G GF RYS DE+W VPHFEKMLYD LA
Sbjct: 219 --QNGEEA-ALNMAKKTLQSIYRGGINDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAL 275
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+L+ + T++ FY+ R I Y+ RDM P G +SAEDADS EG EG FYV
Sbjct: 276 AFLEVYQATQNDFYAGAARQIFTYVLRDMTHPEGGFYSAEDADS---EGV----EGKFYV 328
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ EV +LG E+ ++ + Y + +GN + + N++ L + A K
Sbjct: 329 WSPAEVYQVLGRENGDIYCKVYNITESGNFESKSIP---------NLISALPEE--HARK 377
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG+ L +L E R+KLF+ R++R P DDKV+ +WNGL++++ AR + +L
Sbjct: 378 LGIETRALLQLLEESRQKLFNHRARRVHPFKDDKVLTAWNGLMMAALARGAAVL------ 431
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
G R Y + A A FI RH + RL +R+G S G+LDDYAF+I
Sbjct: 432 --------GDVR--YRDAAVKAEQFI-RHKLQRRDGRLLARYRDGESDLNGYLDDYAFVI 480
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL+LY +L AI+L + +LF D+E GG+F + ++ R KE +DGA
Sbjct: 481 WGLLELYRATFQAVYLSRAIDLTHHVRDLFWDQEQGGFFFYGTDSEQLIARPKEIYDGAM 540
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV NL++LA+I S+ + + AE + +F A +
Sbjct: 541 PSGNSVMAANLLQLAAITGNSELE---ELAERQIDIFAGTAAQHPRGYAYFLTALLFATG 597
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P+ +V+ G + ML A Y +I+ + + A R
Sbjct: 598 PT-SEIVITGQRDDPQVAEMLRLAQRQYAPGAVLIY--RPEGDGDQQDGGQIGKLAPFTR 654
Query: 679 NNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
S D + A VC++ +C PVT+ L +LL
Sbjct: 655 EQKSIDGRATAYVCRDRACREPVTETEVLGSLL 687
>gi|298243436|ref|ZP_06967243.1| protein of unknown function DUF255 [Ktedonobacter racemifer DSM
44963]
gi|297556490|gb|EFH90354.1| protein of unknown function DUF255 [Ktedonobacter racemifer DSM
44963]
Length = 719
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 257/706 (36%), Positives = 374/706 (52%), Gaps = 67/706 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE+ +A L+N FVSIKVDREERPD+D +YM VQA+ GGWP++
Sbjct: 66 SACHWCHVMERESFENPAIAALMNQHFVSIKVDREERPDIDNIYMQAVQAMTQQGGWPMT 125
Query: 80 VFLSPDLKPLMGGTYFPPEDK----YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
VFL+PD +P GGTYFPP+D+ Y PGF+ +L + + ++R+ + + + L
Sbjct: 126 VFLTPDGRPFYGGTYFPPDDRHHGQYVMPGFRRVLLSLAQLYAQEREKIEEQADELAQFL 185
Query: 136 --SEALSASASSNKLPDELPQNALRLCAEQ-LSKSYDSRFGGFGSAPKFPRPVEIQMM-- 190
E + N LPQ L + A Q L+ +D++ GGFG APKFP + ++ +
Sbjct: 186 RQREGMPLRRRENAT-QGLPQLDLLVVASQALANDFDAQHGGFGGAPKFPHSMALEFLLR 244
Query: 191 --LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
L+ SK+ G+ MV +L+ MAKGG++D +GGGFHRYSVD W VPHFE
Sbjct: 245 VYLHRSKQELSLGQLPGNLTELGMVESSLEHMAKGGMYDQLGGGFHRYSVDAEWLVPHFE 304
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYD L+ YL A+ +T FY I + LDY+ R+M+ P G +S +DADS EG
Sbjct: 305 KMLYDNALLSCAYLAAYLVTGKPFYRRIVEETLDYVAREMVSPEGGFYSTQDADS---EG 361
Query: 309 ATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
EG F++W EVE +L A +F +Y + GN F+GKN+L
Sbjct: 362 V----EGKFFLWQPAEVEALLNAPDAAIFMRYYDISARGN------------FEGKNILH 405
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
+ A +L + + + I+ R +LF R R +P D+K++ SWNGL++ SFA
Sbjct: 406 INVEVEQLAKELTLSVPEVEQIVKSGREQLFKARELRVKPGRDEKILTSWNGLMLRSFAE 465
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
A++ L R +Y+E+A + A+F+ R L Q RL ++++G ++
Sbjct: 466 AARHL----------------GRGDYLEIAINNANFLLRSL--RQDGRLLRTYKDGRARL 507
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
G+L+DYAFL GLL LY+ +W A L + LF D + GG+F+T + ++
Sbjct: 508 KGYLEDYAFLADGLLALYQACFDPRWFAEARTLMDQAIALFADEQNGGFFDTGSDHEELV 567
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
R K+ D A PSGNSV+ L+RLA++ S D YR+ AE L L D+ + P
Sbjct: 568 TRPKDIMDNATPSGNSVAADVLLRLAAL---SGEDAYRERAEAYL----QSLADVMVQHP 620
Query: 608 LM---CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
A S+ + + L+G + D + +L + Y N + P D E +
Sbjct: 621 QFFGQALGALDFSLTMAREIALLGSPEAADTQALLNVVNTRYLPNSVLACARPDDKEAI- 679
Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A K A VCQNF+C PVT +L LL
Sbjct: 680 ------RAVPLLAERTMQEGKATAYVCQNFACQAPVTTAEALRQLL 719
>gi|347753644|ref|YP_004861209.1| hypothetical protein Bcoa_3257 [Bacillus coagulans 36D1]
gi|347586162|gb|AEP02429.1| hypothetical protein Bcoa_3257 [Bacillus coagulans 36D1]
Length = 689
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 267/703 (37%), Positives = 380/703 (54%), Gaps = 75/703 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+E VA++LN+ FV+IKVDREERPD+D +YM Q + G GGWPLS
Sbjct: 53 STCHWCHVMERESFENEEVARILNEKFVAIKVDREERPDIDAIYMLVCQMMTGQGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ P GTYFP E +YG PGFK +L + + + D + G Q+ +AL
Sbjct: 113 VFLTPEKVPFYAGTYFPRESRYGMPGFKEVLLYLSQQYTENPDRIKDVGV----QVKQAL 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
AS K L + + + + +D R+GGFG APKFP P + +L ++K E+
Sbjct: 169 EASREKGK-QTALTKETIGRAFQAYKQGFDPRYGGFGKAPKFPMPHSLVFLLMYAKFYEN 227
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
A++ TL +A+GGI+DH+G GF RYSVDE++ VPHFEKMLYD L
Sbjct: 228 RDALAMATK-------TLDGLARGGIYDHIGYGFSRYSVDEKFLVPHFEKMLYDNALLVL 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y DAF +TK+ Y I +I+ Y+ RDM P G +SAEDADS EG KEG FYV
Sbjct: 281 AYTDAFRMTKNAQYKKITEEIITYVLRDMAHPDGGFYSAEDADS---EG----KEGKFYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-AS 377
WT EV+D+LGE LF + Y + GN F+GKN+ ++ S A
Sbjct: 334 WTPAEVKDVLGEQLGTLFCQAYGITGQGN------------FEGKNIPNQITTHLESIAK 381
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K G+ L R+ LF R KR RP DDK++ +WNGL+I++ A+A ++
Sbjct: 382 KEGISPAALAEKLETARQSLFQHREKRVRPFRDDKILTAWNGLMIAALAKAGRV------ 435
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
F+ P Y++ AE A SFIR +L Q R+ +R+G K GF+D+YAFL
Sbjct: 436 ---FHQP-------SYVQAAEKAVSFIRDNLI--QNDRVMVRYRDGEVKNKGFIDEYAFL 483
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ G ++LYE +L A +L +LF D GGG+F + +D +L+R KE +DGA
Sbjct: 484 LWGYMELYESTFAPFYLAEAKKLAGNMIDLFWDGHGGGFFFSGNDDEPLLVRQKESYDGA 543
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ L+RL+ + + + + VF + D A +M A M +
Sbjct: 544 LPSGNSVAACQLLRLSKLTGDFTLE---EKVQQLFQVFSKDIHDEPTAHAMMLQAG-MHA 599
Query: 618 VPSRKHVVLV---GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD----FWEEHN 670
+ K VV+V K VDF N + ++ +V+ + + ++ F E++
Sbjct: 600 QQATKEVVIVMDDETKEVVDFINHI---QKNFYPGISVMVVKRREQAKLSKIASFIEDYA 656
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
N + VC+NFSC+ P D + +LL +K
Sbjct: 657 MING----------QPTIYVCENFSCNQPTNDFQTAMDLLFKK 689
>gi|410671814|ref|YP_006924185.1| hypothetical protein Mpsy_2614 [Methanolobus psychrophilus R15]
gi|409170942|gb|AFV24817.1| hypothetical protein Mpsy_2614 [Methanolobus psychrophilus R15]
Length = 703
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 250/705 (35%), Positives = 375/705 (53%), Gaps = 53/705 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFED VA+L+N+ FV IKVDREERPD
Sbjct: 38 GEEAFNKAKQDDKPIFLSIGYSTCHWCHVMERESFEDPQVAELMNEAFVPIKVDREERPD 97
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +YM+ QAL G GGWPLS+ ++PD KP M TY P E +YG G I+ V + W
Sbjct: 98 IDTIYMSVCQALTGRGGWPLSIIMTPDKKPFMAATYIPRESRYGMAGMLDIVPAVSNMWT 157
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++R+ L + E++ A+S A + L ++ L + L S+D GFG+A
Sbjct: 158 RQREELIANA----EEIVSAISGGARDSTEGPGLDESTLDRTYQLLRSSFDPSSAGFGNA 213
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P ++ +L + K+ K +A E M TL+ M KGGI+DH+G GFHRYS
Sbjct: 214 PKFPTPHHLKFLLRYWKR----SKEDKALE---MAEETLKAMRKGGIYDHIGFGFHRYST 266
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D RW VPHFEKMLYDQ ++ ++ + T++ Y ++ Y+ RDM P G +SA
Sbjct: 267 DSRWLVPHFEKMLYDQALISIALVETYQATQNPEYRENAEEVFSYVLRDMHSPEGGFYSA 326
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS + +EG FY+WT +E+ED+LGE A LFKE ++ P GN L S H
Sbjct: 327 EDADSED-------EEGRFYLWTEQELEDVLGEMDAGLFKEVFHTSPGGNF-LDEASMTH 378
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
G+N+L +A + G +++ L RRKLF+ R R P DDK++ W
Sbjct: 379 T---GRNILHLEESLREAAERRGEDYDRFRQSLESSRRKLFEHREMRVHPSKDDKIMTDW 435
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
N L+I + ++A++ D Y + A A FI + RL
Sbjct: 436 NSLMIVALSKAARAF----------------DEPAYAQEAALTADFILSKMISPNG-RLF 478
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
H +R+G GFLDDYAF I GL++LY+ T++L A+ + F D GG+F
Sbjct: 479 HRYRDGEVAVEGFLDDYAFFIWGLIELYQATFNTEYLRNALRFNDQLILHFRDSIHGGFF 538
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+T + +++R KE +DGA PSGNSV +NL+ L I + + + A + +F
Sbjct: 539 HTADDSEKLIMRSKEIYDGAIPSGNSVCALNLLHLGRITGNTDLE---KKAYEIMQLFSG 595
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
++ M + + CA D + PSR+ +V+ G S + + +++ + + NK ++
Sbjct: 596 QVSKMPVGYTQLMCALDFAAGPSRE-IVVAGDPESEETQGIISDINREFVPNKVILLKPE 654
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
E+ E+ S+ + + +C+N++C+ P TD
Sbjct: 655 GRETEISAIAEYVSDMS------MKDGRTTVHICRNYNCNLPSTD 693
>gi|407473332|ref|YP_006787732.1| thioredoxin domain-containing protein [Clostridium acidurici 9a]
gi|407049840|gb|AFS77885.1| thioredoxin domain-containing protein [Clostridium acidurici 9a]
Length = 682
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 251/697 (36%), Positives = 382/697 (54%), Gaps = 77/697 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED+ VA++LN +F+SIKVDREERPD+D +YM + QA+ G GGWP+++
Sbjct: 54 TCHWCHVMERESFEDDEVAEVLNKYFISIKVDREERPDIDSIYMNFCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP + GTY+P +GR G +L KV + W +D L S +E + +
Sbjct: 114 IMTPDKKPFIAGTYYPKHSMHGRIGIIELLNKVNEKWKSNKDDLINSSEEILEFMKTNIV 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
AS N L E +NA L L S+D +GGFG APKFP P + +L + K
Sbjct: 174 ASEQGN-LDMEDIENAFNL----LKNSFDPEYGGFGKAPKFPTPHNLNFLLRYYK----- 223
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ S ++V TL+ M KGGI DH+G GF RYSVDE+W VPHFEKMLYD LA
Sbjct: 224 -VKGDES-ALEVVEKTLESMYKGGIFDHIGYGFARYSVDEKWLVPHFEKMLYDNALLAVA 281
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y++A+ +TK Y I I +++ R+M G +SA DADS EG EG FY++
Sbjct: 282 YIEAYQITKRDLYKEIAEKIFEFIEREMTSEEGGFYSAIDADS---EGV----EGKFYLF 334
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
E+ + LG E + LF +Y + GN F+GKN+ +
Sbjct: 335 DHSEISEQLGLEDSELFAHYYDITYDGN------------FEGKNI--------PNLIIT 374
Query: 380 GMPLEKYLNILGE----CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
G+P ++L E C +KL+ R+KR PH DDK++ SWNGL+I + A ++ K +
Sbjct: 375 GLPNMDTNSVLQERLRACIKKLYTYRNKRVYPHKDDKILTSWNGLMIGALALGGRVFKDD 434
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+Y+E AE +A+FI +L D + RL +R+G +K +L+DYA
Sbjct: 435 ----------------KYIERAERSANFILENLIDREG-RLLARYRDGETKYKAYLEDYA 477
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+L+ GL++LY+ ++L AI+L +LF D GG F + ++L+ KE +D
Sbjct: 478 YLVHGLIELYQSTFKMEYLEKAIKLNQDMLDLFWDDNEGGLFIYGKDSEQLVLQHKEIYD 537
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM--AVPLMCCAA 613
GA+PSGNSV+ +NL+RL+ I+ + + ++ L F +K+ + + LM C
Sbjct: 538 GAQPSGNSVASLNLIRLSKILEDPSLE---EKSKAILKAFGGNVKNTVIGHSYLLMSC-- 592
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ ++ S + +V++G+K+ D + M+ + ++ TV+ + ++ EE++
Sbjct: 593 -LFNIVSTQEIVILGNKNDSDTQEMIDKVNDNFTPFTTVVLSNNSE-EELNVI------- 643
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ DK A +C+NF+C+ P D LL
Sbjct: 644 PRLKDYKKVEDKTTAYICKNFTCNDPTADVEQFSGLL 680
>gi|345302921|ref|YP_004824823.1| hypothetical protein Rhom172_1056 [Rhodothermus marinus
SG0.5JP17-172]
gi|345112154|gb|AEN72986.1| protein of unknown function DUF255 [Rhodothermus marinus
SG0.5JP17-172]
Length = 699
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 263/690 (38%), Positives = 365/690 (52%), Gaps = 50/690 (7%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF+DE VA+LLND F++IKVDREERPD+D +YMT Q + G GGWPL++
Sbjct: 50 CHWCHVMAHESFQDEEVARLLNDAFINIKVDREERPDIDHLYMTVCQMVTGHGGWPLTII 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
++PD KP TY P +YGRPG I+ ++K+AW + RD + S L + +S
Sbjct: 110 MTPDKKPFFAATYIPKRSRYGRPGLLEIIPRIKEAWQQHRDEIIASAEKLTGTLQKVMSF 169
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
A S + E + A R +L +D + GGFG APKFP P + +L +
Sbjct: 170 EAPSQVIDAEWLEIAYR----RLDDIFDRKHGGFGHAPKFPTPHTLLFLLRYWH------ 219
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+SGEA Q MV TL M GGI+DHVG GFHRY+ DE W VPHFEKMLYDQ L Y
Sbjct: 220 RSGEAHALQ-MVEHTLVQMRPGGIYDHVGFGFHRYATDEAWRVPHFEKMLYDQALLTMAY 278
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
+A+ T + FY R+IL Y+ RD+ P G +S+EDADS EG +EG FYVWT
Sbjct: 279 TEAYQATGNPFYERTAREILTYVLRDLRAPEGAFYSSEDADS---EG----EEGKFYVWT 331
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+E+ + LG E A L E + + P GN + + E GKN+L A A + G
Sbjct: 332 VEELREALGPELAPLAIELFNVNPEGNYE----EEATGERTGKNILYLTRPPKALARERG 387
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
E+ L E R++LF R++R RP D+K++ WNGL+I++ ARA+++
Sbjct: 388 WTPEELEAKLEEIRQRLFAYRAQRVRPGRDEKILTDWNGLMIAALARAAQVF-------- 439
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
D Y+E A +AA F+ R + + RL H +R+G + PG LDDYAFL G
Sbjct: 440 --------DEAAYVEAARAAADFLLRTMRTPEG-RLWHRYRDGEAGIPGMLDDYAFLTWG 490
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LLDLYE +L A+ L + F D G ++ T + S+++R +E D A PS
Sbjct: 491 LLDLYEATFEESYLETALALTDQTLAHFWDPR-GVFYMTPDDGESLIVRPRETLDNALPS 549
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
GN+V+++NLVRL + + Y ++A+ + F +K M A D+ P
Sbjct: 550 GNAVALMNLVRLGHMTGRT---VYEEHADAMIRFFSGPVKQQPPIFTGMLVAIDLAFGPI 606
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
+ +VL G ML H Y K ++ P E +A
Sbjct: 607 YE-LVLAGEPDDPTLREMLRTIHRRYLPRKVLLLRRPGAAG-----ERLVRLAPFVAAQA 660
Query: 681 FSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VC ++ C PVTDP +L L
Sbjct: 661 LLDGRATAYVCHDYRCEQPVTDPEALARQL 690
>gi|91204070|emb|CAJ71723.1| conserved hypothetical protein (thioredoxin) [Candidatus Kuenenia
stuttgartiensis]
Length = 758
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 259/720 (35%), Positives = 377/720 (52%), Gaps = 64/720 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVM ESFED VA+L+N+ F+ IKVDREERPD
Sbjct: 94 GPEAFEKARKENKPIFLSIGYSTCHWCHVMAHESFEDPEVARLMNEVFICIKVDREERPD 153
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +YM Q + G GGWPL++ ++PD KP GTY P+ YGR G ++ ++K+ W+
Sbjct: 154 IDNIYMRVCQMMTGSGGWPLTIVMTPDKKPFYAGTYI-PKKSYGRIGMLDLVPRIKELWN 212
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ + +S L + S S + L + L+ E L++ + + GGF ++
Sbjct: 213 IQHADIQKSANLITASLGQ-FSHDPSEAR----LDASTLKAAYELLARRFSEQHGGFSTS 267
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L + K +GE + +MV+ TL M KGGI+DH+G GFHRYS
Sbjct: 268 PKFPSPQNLLFLLRYWK------STGEGN-ALRMVVKTLHSMRKGGIYDHIGYGFHRYST 320
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W VPHFEKMLYDQ LA Y +A+ T + ++I Y+ RDM P G SA
Sbjct: 321 DPEWLVPHFEKMLYDQAMLAMAYTEAYLATGRKEFGETAKEIFAYVMRDMTDPKGGFCSA 380
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG KEG FYVWT +E+ L E A L + ++ GN
Sbjct: 381 EDADS---EG----KEGKFYVWTEEEIRHALKEDDANLIINVFNIEKAGNFK-------- 425
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE----CRRKLFDVRSKRPRPHLDDKV 413
+E G+N + S +++ + + L+ L E RRKLF VRSKR RPH DDK+
Sbjct: 426 DEIAGRNTGDNILHLKKSLAEIALENKTSLDELKERVETARRKLFAVRSKRIRPHKDDKI 485
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
+ WNGL+I++ A+ ++ D EY+ A+ AA FI + Q
Sbjct: 486 LTDWNGLMIAALAKGAQAF----------------DAPEYLAAAKRAADFILSDM-RRQD 528
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
RL H +R G + P F DDYAF I GLL+LYE +L A++L + + F D +
Sbjct: 529 GRLLHRYRGGQAGIPAFADDYAFFIWGLLELYETNFNVNYLRTALDLNSDMIKHFWDNQN 588
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
GG++ T + +++R KE +DGA PSGNSV+ +NL RLA I A + + + A ++
Sbjct: 589 GGFYFTADDAEDLIVRQKEVYDGAIPSGNSVAALNLFRLARITADPELE---EKANKTML 645
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
F T +K M M P+ + +++ G+ +VD +ML + NK V+
Sbjct: 646 AFSTEVKKMPAGYTQMMIGLSFGIGPAYE-IIIAGNPRAVDTRDMLNTLRRHFIPNKIVL 704
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
+ P D E + + A + D K A +C++++C PVTD + LL E
Sbjct: 705 -LRPTDEETPEI-----TRIAKFTEHQSGIDGKATAYICRDYTCKMPVTDTKEMLKLLKE 758
>gi|392962639|ref|ZP_10328068.1| glycoside hydrolase family 76 [Pelosinus fermentans DSM 17108]
gi|421053373|ref|ZP_15516355.1| glycoside hydrolase family 76 [Pelosinus fermentans B4]
gi|421058355|ref|ZP_15521061.1| glycoside hydrolase family 76 [Pelosinus fermentans B3]
gi|421066419|ref|ZP_15528029.1| glycoside hydrolase family 76 [Pelosinus fermentans A12]
gi|421073618|ref|ZP_15534678.1| hypothetical protein FA11_0867 [Pelosinus fermentans A11]
gi|392442414|gb|EIW20004.1| glycoside hydrolase family 76 [Pelosinus fermentans B4]
gi|392444040|gb|EIW21515.1| hypothetical protein FA11_0867 [Pelosinus fermentans A11]
gi|392451880|gb|EIW28849.1| glycoside hydrolase family 76 [Pelosinus fermentans DSM 17108]
gi|392456062|gb|EIW32823.1| glycoside hydrolase family 76 [Pelosinus fermentans A12]
gi|392460977|gb|EIW37218.1| glycoside hydrolase family 76 [Pelosinus fermentans B3]
Length = 683
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 254/693 (36%), Positives = 366/693 (52%), Gaps = 67/693 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME E FED+ VA LLN F++IKVDREERPDVD +YM+ QAL G GGWPL+
Sbjct: 51 SCCHWCHVMERECFEDQEVADLLNQHFIAIKVDREERPDVDGIYMSVCQALTGQGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++P+ KP GTYFP K GR G +L + W+ R + ++G + L
Sbjct: 111 IIMAPNKKPFFAGTYFPKHRKMGRMGLLELLTTLHQHWENNRSEIIKAGNEIVSILQRPK 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
AS + L Q L +L SYDS+ GGFGSAPKFP P +I +L + + ++
Sbjct: 171 PASEEGQVGEELLKQAYL-----ELENSYDSQCGGFGSAPKFPTPHKITFLLRYWQHFKE 225
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ MV TL M +GGI+DH+G GF RYS D++W VPHFEKMLYD L
Sbjct: 226 -------PKALAMVEKTLMSMWQGGIYDHLGYGFARYSTDQKWLVPHFEKMLYDNALLCT 278
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+A+ T + ++ I +IL Y+ RDM+ G +SAEDADS EG EG FYV
Sbjct: 279 SYLEAYQCTGNGEFARIAEEILTYVMRDMMDKSGGFYSAEDADS---EGV----EGKFYV 331
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASAS 377
+T KEV +ILG E LF + Y + GN + G ++ + D A
Sbjct: 332 FTRKEVLEILGEEEGTLFADFYQISSQGNFE-----------HGTSIPNRIGRDLEEYAR 380
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K+ +E +L + R KL+ VR KR PH DDK++ +WNGL+I++FA+A+K+LK
Sbjct: 381 KVKWTVESLSALLEQGREKLYHVREKRIHPHKDDKILTAWNGLMIAAFAKAAKVLK---- 436
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ +Y VAE A+FI L + RL +R G + ++DDYAFL
Sbjct: 437 ------------QSKYANVAEQGAAFIYEKLM-KADGRLLARYREGEAAHQAYIDDYAFL 483
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ L+++YE ++L A+ L + LF D GG++ + +++R KE +DGA
Sbjct: 484 LMALIEVYEATCNNQYLHRAVTLAKDMEALFGDNTEGGFYFYGNDGEELIVRPKEIYDGA 543
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ + L +L I + + AE L+ F + A A D
Sbjct: 544 IPSGNSVAALALQKLGDI---TDDRGFSDIAERLLSSFAGEVSRYAAGYTYFMMAVDYYV 600
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ K +++ G K + D + ML ++ + L + I F++ H+ N
Sbjct: 601 ADNTK-IIIAGDKEAADTKAMLDVINSCF-LPSSAIR----------FYDRHSQENVEYK 648
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ K A +C+NF+C PP+TD L NLL
Sbjct: 649 EID---HKATAYICRNFACQPPITDAEKLCNLL 678
>gi|306811901|gb|ADN05998.1| YyaL-like conserved hypothetical protein [uncultured Myxococcales
bacterium]
Length = 800
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 249/695 (35%), Positives = 367/695 (52%), Gaps = 57/695 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE +A LN F++IKVDREERPD+D VYM V L G GGWP++
Sbjct: 136 STCHWCHVMERESFEDEEIAAYLNRHFIAIKVDREERPDIDSVYMKAVTILTGRGGWPMT 195
Query: 80 VFLSPDLKPLMGGTYFPPEDKY--GRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLS 136
V ++PD +P GGTYFPP + GR G IL + + ++ +++A++ ++LS
Sbjct: 196 VIMTPDKEPFFGGTYFPPRKGFRGGRAGLIDILADMLGLYRNEPTEVVARA-----QELS 250
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+ + +A+ P + + A+ L + +D GGFG APKFP+P + ++L ++++
Sbjct: 251 QRVEQAAAIKPGPGVPSDKVIVVAAQNLGRMFDPVDGGFGGAPKFPQPSRLSLLLRYARR 310
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
D G + MV TL MA GGI+D VGGGFHRYS D +W VPHFEKMLYD Q
Sbjct: 311 TRDKGATA-------MVATTLDKMAAGGIYDQVGGGFHRYSTDAQWLVPHFEKMLYDNAQ 363
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
LA VYL+A+ T D Y + R+ILDY+ R+M P G +SA DADS G +EG
Sbjct: 364 LAVVYLEAWQHTGDSGYERVAREILDYVAREMTSPEGGFYSATDADSPTPSG--HDEEGW 421
Query: 317 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
F+ WT E+E +LG A +F + + GN F+G+N+L +
Sbjct: 422 FFTWTPDELERLLGAGDAAVFSSAFGVTKPGN------------FEGRNILHRVKSDQEL 469
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
AS+LG+ ++ ++ + L+D R+ RP P D+K+I +WNG++ ++FA+A +L +E
Sbjct: 470 ASELGLAPKRVGEMIRRAQSTLYDARASRPPPIRDEKIIAAWNGMMGAAFAKAGWML-AE 528
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
A Y+EVA A F+ + + L ++R+G + FLDDYA
Sbjct: 529 A---------------RYVEVAARAVQFVLEQMRTKDGA-LVRTYRDGKKGSASFLDDYA 572
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F+++ LDLYE W+ A+ELQ QD +LD + GGY+ T + +L+R K +D
Sbjct: 573 FMVAASLDLYEATGDAAWIERAVELQTDQDLRYLDEQTGGYYLTAADGEVLLVREKPAYD 632
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A PSGNSV+ NL+RL K +R+ AE A ++ PL+ A D
Sbjct: 633 RAVPSGNSVAANNLLRLHDFNGDPK---WRRRAERLFASLAFQVTRSPTGFPLLLVALDR 689
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ V L+ + + + A S+ NK + DTE + S
Sbjct: 690 Y-YDTVLEVALIAPTNREEASLLNARLRKSFVPNKAFTVL--TDTEAT----QQESTIPW 742
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ K A VC+ C P + P + L
Sbjct: 743 LEAKRAMGGKSTAYVCERGRCDLPTSKPQVFQKQL 777
>gi|188996723|ref|YP_001930974.1| hypothetical protein SYO3AOP1_0787 [Sulfurihydrogenibium sp.
YO3AOP1]
gi|188931790|gb|ACD66420.1| protein of unknown function DUF255 [Sulfurihydrogenibium sp.
YO3AOP1]
Length = 686
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 253/687 (36%), Positives = 358/687 (52%), Gaps = 65/687 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME ESFEDE VAK+LN+ FVSIKVDREERPD+D +YM G GGWPL+
Sbjct: 51 SSCHWCHVMEKESFEDEEVAKILNENFVSIKVDREERPDIDSIYMNVCLMFNGSGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTYFP + GR G +L V + W ++ L Q IE L
Sbjct: 111 IIMTPDKKPFFAGTYFPKYSRPGRIGLVDLLTSVAEYWKNNKEDLIQRAEKVIEYLKNDF 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKK 196
+ DE+ ++ + C L +D +GGF PKFP P I +L YH+K+
Sbjct: 171 KGKS------DEISKDIIDACYLDLKSRFDKEYGGFSIKPKFPTPHNILFLLRYYYHTKE 224
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ E KM TL M GG++DHVG GFHRYS D W +PHFEKMLYDQ
Sbjct: 225 M----------EALKMAEKTLINMRLGGMYDHVGFGFHRYSTDREWLLPHFEKMLYDQAM 274
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ LTK+ FY ++ + Y+ RDM G +S+EDADS EG +EG
Sbjct: 275 LTMAYTEAYQLTKNNFYKKTAQETIAYVLRDMTSKEGVFYSSEDADS---EG----EEGK 327
Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY WT E++++L + + L + + +K GN + + G+N+L
Sbjct: 328 FYTWTIDELKEVLNDEELSLVIKVFNVKEEGN----YLEEATGHLTGRNILYLKKPIREL 383
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ L M ++ L E R+KLFD R KR P DDKV+ WNGL+IS+ A+A K
Sbjct: 384 ANDLNMNQDQLETKLEEIRKKLFDAREKRVHPQKDDKVLTDWNGLMISALAKAGK----- 438
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
G + ++ +E A++AA FI ++ T L H +++G K G LDDYA
Sbjct: 439 -----------GFEDRDLIEKAKTAADFILNTMFKNDT--LYHLYKDGEVKVEGLLDDYA 485
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F GL++LYE K+L A++L + E F D E GG+F + V++R KE D
Sbjct: 486 FFSWGLIELYEATGDIKYLKSALKLTDLMIEKFYDFENGGFFLSPKNSKDVIVRPKEAFD 545
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSVS NL RL I K Y A +L F +K + + +
Sbjct: 546 GAIPSGNSVSAYNLYRLYLISGNEK---YYNFAIETLKAFGGEIKRLPSYHSMFNIVLML 602
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ P+ + VVL G + E +L + + NK +I ++ + +++ + S
Sbjct: 603 VFYPTSE-VVLAG-----NCEKVLDKINTEFIPNKAIIFLNRENEKQLKELIPYTS---- 652
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
N +D+ VC+NFSC+ P D
Sbjct: 653 ---NMILSDECDIYVCKNFSCNLPTKD 676
>gi|218887845|ref|YP_002437166.1| hypothetical protein DvMF_2759 [Desulfovibrio vulgaris str.
'Miyazaki F']
gi|218758799|gb|ACL09698.1| protein of unknown function DUF255 [Desulfovibrio vulgaris str.
'Miyazaki F']
Length = 756
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 267/738 (36%), Positives = 382/738 (51%), Gaps = 85/738 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED+ VA+LLND FV +KVDREERPD+D YM Q L G GGWPL+
Sbjct: 50 STCHWCHVMAHESFEDDEVARLLNDAFVCVKVDREERPDIDAAYMAACQMLTGSGGWPLT 109
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---S 136
+ PD +P TY P + GR G ++ +V + W KRD + S +E + +
Sbjct: 110 IIALPDGRPFFAATYLPKHSRPGRIGLMDLVPRVLEVWRHKRDDVLDSADSIVEHVRRHA 169
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
EA+ + +LP L E ++ +D+ GGFG+APKFP P + +L +++
Sbjct: 170 EAMLRPPADGRLPG---AGTLHAACEAMASEFDAVNGGFGTAPKFPSPHNLLFLLRWARR 226
Query: 197 ---------LEDTGK--SGEASEGQK---MVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
L G +GE S G K M TL+ + +GGIHDHVG GFHRYS D RW
Sbjct: 227 NGHAAGQPGLAQAGTVPTGEESGGAKALRMAAQTLRSIRRGGIHDHVGYGFHRYSTDARW 286
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
+PHFEKMLYDQ L Y +A+ T D + + Y+ RD+ P G +SAEDAD
Sbjct: 287 LLPHFEKMLYDQAMLMLAYAEAWLATGDGEFRRTAEETAAYVLRDLASPEGAFYSAEDAD 346
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGN------------CDL 350
S E +GA + EG FY +T ++E+ + ++P G+ DL
Sbjct: 347 S-ELDGA--RGEGLFYTFTLADIEEACAPLDVRPGVRPAVRPDGDGGGGVNPASLSEADL 403
Query: 351 SRMS-----------DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFD 399
+ + + G+NVL A LG+P + L R LFD
Sbjct: 404 TARAFGCTAYGNYEDEATRSRTGRNVLHLPRAPQELARDLGLPPREVEERLEAARAALFD 463
Query: 400 VRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAES 459
+R++RPRPHLDDKV+ WNGL I++ +R ++ D E A +
Sbjct: 464 LRARRPRPHLDDKVLADWNGLAIAAMSRCAQAF----------------DAPHLAEAAAA 507
Query: 460 AASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 519
AA F+ + Q RL H +R+G + PG LDDYAF+I GL++LY +WL A+
Sbjct: 508 AADFVLARMV-TQEGRLLHRWRDGEAAVPGLLDDYAFMIWGLIELYGATGEVRWLRRALR 566
Query: 520 LQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
LQ QD F D EGGGY+ T + ++L+R KE HDGA PSGN+ ++ NL+RLA ++
Sbjct: 567 LQEVQDTFFHDAEGGGYWMTPADGDALLVRRKEGHDGALPSGNAAALFNLLRLALLLGRP 626
Query: 580 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 639
+ Y + A L F T+++ + + C D ++ + V++ G D E ML
Sbjct: 627 E---YGERARGVLRAFATQVRHHPVGSTMFLCGVD-FALSGGRSVIVAGEPDQPDTEAML 682
Query: 640 AAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA------DKVVALVCQN 693
AA +Y TV+H+ D N+ + + A F+A D+ A +C+N
Sbjct: 683 AAVRGTY-APTTVLHLRTTD----------NARDLA-ALVPFTAHLAPLEDRATAWLCEN 730
Query: 694 FSCSPPVTDPISLENLLL 711
++CSPP+TDP L+ LL
Sbjct: 731 YACSPPITDPAELKARLL 748
>gi|366164964|ref|ZP_09464719.1| hypothetical protein AcelC_14944 [Acetivibrio cellulolyticus CD2]
Length = 680
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 259/699 (37%), Positives = 370/699 (52%), Gaps = 81/699 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED+ VA LN F+SIKVDREERPD+D +YM QAL G GGWPL+
Sbjct: 53 STCHWCHVMEKESFEDKEVADALNKNFISIKVDREERPDIDHIYMNVCQALTGHGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+F+SPD KP GTYFP ++ G PG T+L V DAW RD+L +S EQ+ AL
Sbjct: 113 IFMSPDKKPFFAGTYFPKNNRMGMPGLLTVLESVHDAWVSNRDILTRSS----EQILNAL 168
Query: 140 SASASSNKL--PD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
S N + PD EL ++ + +D+ +GGFGSAPKFP P + +L +
Sbjct: 169 S---DRNDILEPDSEEELSEDIFYEAFSEFKYDFDNNYGGFGSAPKFPTPHNLFFLLRYW 225
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+D KMV TL+ M KGGI+DH+G GF RYS D +W +PHFEKMLYD
Sbjct: 226 YNTKD-------EYALKMVEKTLESMHKGGIYDHIGFGFSRYSTDRKWLIPHFEKMLYDN 278
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
LA YL+ + TK Y+ I ++I Y+ RDM G +SAEDADS EG +E
Sbjct: 279 ALLAIAYLEVYQATKKSEYADIAKEIFTYVLRDMTSNEGGFYSAEDADS---EG----EE 331
Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDS 372
G FY+W++ EV+ +LG E Y C L ++ H F+G N+ LI+ N +
Sbjct: 332 GKFYIWSANEVKTVLGNKD---GEKY-------CKLYDIT-AHGNFEGFNIPNLIKGNIA 380
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ ECR+KLF+ R KR P+ DDK++ SWNGL+I++ A ++L
Sbjct: 381 QEDDG-----------FIEECRKKLFEFREKRVHPYKDDKILTSWNGLMIAAMAFGGRVL 429
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
G D+ Y + AE A FI L RL +R+G S P ++D
Sbjct: 430 --------------GVDK--YTKAAEKAVDFIFSKLISSDG-RLLARYRDGDSAFPAYVD 472
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFLI GL++LYE +L +++L + + F D GG F+ + ++ R KE
Sbjct: 473 DYAFLIWGLIELYETTYKPIYLKRSLKLNDDLIKYFWDETNGGLFHYGSDSEQLITRPKE 532
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
+DGA PSGNSV+ +N +RLA + ++ + + A + A F ++ A A
Sbjct: 533 IYDGATPSGNSVATMNFLRLARLTGQAELE---EKAYNQFATFGRSIERFARGHSFFLSA 589
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
+ + K VV+VG++ +++ +M++ + + T+ +D
Sbjct: 590 L-LFAKSKSKEVVIVGNE-NLEESSMVSIIREDFRPFTLSMFYSNKHTDLIDL------- 640
Query: 673 NASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
A N + + K A VC+NF+C P+TD N +
Sbjct: 641 -APFIENYKTVEGKTTAYVCENFACQAPITDNSLFRNAI 678
>gi|219849212|ref|YP_002463645.1| hypothetical protein Cagg_2330 [Chloroflexus aggregans DSM 9485]
gi|219543471|gb|ACL25209.1| protein of unknown function DUF255 [Chloroflexus aggregans DSM
9485]
Length = 693
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 250/692 (36%), Positives = 364/692 (52%), Gaps = 64/692 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESF D +A + N++F++IKVDREERPD+D +YM QAL G GGWPL+V
Sbjct: 55 ACHWCHVMAHESFADPEIAAIQNEYFINIKVDREERPDLDSIYMAAAQALTGRGGWPLNV 114
Query: 81 FLSPDLKPLMGGTYFPPE---DKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQ 134
F PD P GTYFPP+ ++Y P ++ +L + +A+ +RD L AQ I+
Sbjct: 115 FCLPDGTPFFAGTYFPPDAKANRYRMPSWRQVLLSIAEAYRTRRDDLTASAQELLNHIKL 174
Query: 135 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
L++ L +A+ ++ L A +L + +D ++GGFG APKFP+P+ ++ +L
Sbjct: 175 LAQPLPETATVDE-------ALLLEAAAKLEREFDPQYGGFGDAPKFPQPLVLEFLL--- 224
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
T G + M+ TL+ MA GG++D VGGGFHRYSVD RW VPHFEKMLYD
Sbjct: 225 ----RTHLRGHV-QALPMLHQTLEQMAHGGMYDQVGGGFHRYSVDTRWLVPHFEKMLYDN 279
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
LA VY A +T D F + I + YL RD+ P G FS+EDADS GA +E
Sbjct: 280 ALLAEVYHLAALVTGDPFLAQIADETFAYLLRDLRHPEGAFFSSEDADSLPVPGAAHAEE 339
Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
GAFYVWT E+ LG+ A + +Y + GN F+GK++L +SA
Sbjct: 340 GAFYVWTPDELRLALGDDATIVGAYYGVTRQGN------------FEGKSILYVPRSASA 387
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A++LG+P+E+ + R L R +RPRP D+K+I +WN L I + A AS +
Sbjct: 388 VAARLGVPVERVTETVERARPILRTFREQRPRPFRDEKIITAWNALAIRALATASARV-- 445
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
EY+ A A F+ +L RL S+++G GFLDDY
Sbjct: 446 ----------------PEYLSAARQCADFLLANL-RRADGRLLRSWKDGRPGPAGFLDDY 488
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A L LL+L+ G T +L AIEL +LF D + +F+T + P+++ R ++
Sbjct: 489 ALLCDALLELHAAGGETYYLATAIELAEAMLDLFWDAQSWMFFDTGRDQPALVTRPRDLS 548
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D A PSG S + + L+RL ++ + +D + AE L L + M CAAD
Sbjct: 549 DNATPSGTSAATMALLRLYAL---TGNDLFATRAEQVLQQVAPMLIRFPLGFGRMLCAAD 605
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
++ P R+ + ++G + +LA A ++Y + H +P D A
Sbjct: 606 LMIGPIRE-LAIIGPSGHPATQALLAVARSAYRPRLVIAHAEPGDP--------IAEQVA 656
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
+A + A +C+ F+C PVT P +L
Sbjct: 657 LLAGRTLIDGQPTAYLCERFACRLPVTTPEAL 688
>gi|345856701|ref|ZP_08809173.1| hypothetical protein DOT_0529 [Desulfosporosinus sp. OT]
gi|344330213|gb|EGW41519.1| hypothetical protein DOT_0529 [Desulfosporosinus sp. OT]
Length = 652
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 246/696 (35%), Positives = 371/696 (53%), Gaps = 80/696 (11%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE++ VA +LN +F+SIKVDREERPDVD +YM + Q L G GGWPL++ ++PD K
Sbjct: 1 MERESFENDEVAGILNRYFISIKVDREERPDVDHLYMAFCQTLTGSGGWPLTIIMTPDKK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP ++YGRPG + +V W L +S + + + + S+
Sbjct: 61 PFFAGTYFPKTERYGRPGLMELAEQVGTLWKTNEGKLRESSDEIVAAVHSQRTVPSKSSP 120
Query: 148 LPDELPQNA-------------LRLCAEQL--------SKSYDSRFGGFGSAPKFPRPVE 186
LP + + + +EQL ++S+D+R+GGFG APKFP P
Sbjct: 121 LPSAVTNDPSLKDGNGPTSSEDFQTWSEQLIDKAYQVFAQSFDARYGGFGRAPKFPTPHT 180
Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
I +L ++ + S+ +MV TL MA+GGI+DHVG GF RYS DE+W VPH
Sbjct: 181 ISFLLRYA-------QDHPQSKALEMVRKTLDGMAQGGIYDHVGFGFARYSTDEKWLVPH 233
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
FEKMLYD LA+ YL+++ + ++I Y+ RDM P G +SAEDAD+
Sbjct: 234 FEKMLYDNALLASTYLESYQANHQPDDAQKAKEIFTYVLRDMTSPEGGFYSAEDADA--- 290
Query: 307 EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
EG EG F+VWT E+E +LG + A ++ Y + P GN F+GKN+
Sbjct: 291 EGV----EGKFHVWTRAEIETLLGKDTAAMYCAVYDITPEGN------------FEGKNI 334
Query: 366 L-IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
+ L + A + + L IL + R+ LF R KR PH DDK++ +WNGL+I++
Sbjct: 335 PNLLLGNLEKIARNNSLAAAEVLQILEKARQTLFTAREKRIHPHKDDKILTAWNGLMIAA 394
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
FA+ +++L A Y+E AE+AA F+ HL RL +R G
Sbjct: 395 FAKGAQVLGIPA----------------YLEAAENAADFVLTHL-KRNDGRLLARYREGH 437
Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 544
S G+LDDYAF I GLL+LY +L A++LQ Q+ LFLD E GGY+ T +
Sbjct: 438 SAYLGYLDDYAFFIGGLLELYSVSGKPHYLQVALQLQEEQERLFLDEEDGGYYLTGSDGE 497
Query: 545 SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 604
+L R KE +DGA P+GNS++ +NL +LA + + + + AE L VF + L++
Sbjct: 498 ELLFRPKESYDGAIPAGNSITALNLFKLARLTGDER---WERKAEQQLLVFRSVLEEHPS 554
Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
A PS++ ++L G ++ + M +++ +V++ + + E +
Sbjct: 555 GYTAFLQALQFAVHPSQE-LILAGALNATELPEMRQIFFSAFRPYASVLYQEGSLPETVP 613
Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
+ +++ + + + A +CQNF+C PV
Sbjct: 614 WIQDYPIDPS----------HITAYLCQNFTCQRPV 639
>gi|20092523|ref|NP_618598.1| hypothetical protein MA3726 [Methanosarcina acetivorans C2A]
gi|19917793|gb|AAM07078.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length = 697
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 250/713 (35%), Positives = 367/713 (51%), Gaps = 54/713 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVM ESFEDE +A+L+N+ FVSIKVDREERPD
Sbjct: 33 GEEAFEKARKENKPIFLSIGYSTCHWCHVMAHESFEDEEIARLMNEAFVSIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +YMT Q + G GGWPL++ ++P KP GTY P + ++ + G ++ ++K+ WD
Sbjct: 93 IDNIYMTVCQIILGRGGWPLTIIMTPGKKPFFAGTYIPKKSRFNQTGMTELIPRIKEIWD 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++ + + S + + S + + L S+D +GGFG A
Sbjct: 153 QQHEEVLDSAEKITSTIQNMIVESTGEGLG-----EEIIEEAYNDLLNSFDPEYGGFGRA 207
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P +I +L + K+ D E MV TL M GGI+DH+G GFHRYS
Sbjct: 208 PKFPTPHKISFLLRYWKRSGD-------PEALDMVEHTLDNMRSGGIYDHLGSGFHRYST 260
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W +PHFEKMLYDQ A Y++A+ ++ Y ILDY+ RD+ P G +
Sbjct: 261 DNMWLLPHFEKMLYDQALTAIAYIEAYQVSGKDLYKETAEGILDYVLRDLTSPEGGFYCG 320
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
EDAD EG +EG +Y+WT +EV ILG E + L + + LK GN + +
Sbjct: 321 EDAD---VEG----EEGKYYLWTIEEVMSILGPEDSELIIKMFNLKRGGNFE----EEIR 369
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
G N+ ++ + A++L +P+E+ + + R KL R +R RP LDDKV+ W
Sbjct: 370 GRKTGTNLFYMVHSPGSLAAELEIPVEEVESRVKSAREKLLKARYERKRPSLDDKVLTDW 429
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+I++FA+ F V G ++ Y++ AE AA F+ LY + RL
Sbjct: 430 NGLMIAAFAKG--------------FQVFGEEK--YLKAAEKAADFLLETLYGPE-KRLH 472
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
H +R+G + G DDYAFLI GLL+LYE G ++L A+ L E F D E GG++
Sbjct: 473 HRYRDGVAGISGTSDDYAFLIHGLLELYEAGFELRYLKSAVSLNRELLEHFWDPENGGFY 532
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T + ++ R KE D A PSGNS ++NL+RL+ ++A + + A+ F
Sbjct: 533 FTASDSEVLIFRKKEFTDAAIPSGNSFEMLNLLRLSRLIADPGME---ETADRLERAFSK 589
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
+K A D PS + V++ G + S D NML + + NK ++
Sbjct: 590 LIKKTPSGYTQFLSAFDFRLGPSYE-VIISGKRESPDTVNMLEELWSYFTPNKVLVFRPE 648
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ E+ E+ + K A VCQN+ C P T+ + LL
Sbjct: 649 GENPEIADLAEYTKEQLPI------EGKATAYVCQNYECQLPTTETREMLKLL 695
>gi|306811868|gb|ADN05966.1| YyaL-like conserved hypothetical protein [uncultured Myxococcales
bacterium]
Length = 800
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 247/694 (35%), Positives = 358/694 (51%), Gaps = 55/694 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE +A LN F++IKVDREERPD+D VYMT V L G GGWP++
Sbjct: 136 STCHWCHVMERESFEDEEIAAYLNRHFIAIKVDREERPDIDSVYMTAVTILTGRGGWPMT 195
Query: 80 VFLSPDLKPLMGGTYFPPEDKY--GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
V ++P +P GGTYFPP + R G IL + + + + ++LS+
Sbjct: 196 VIMTPHKEPFFGGTYFPPRKGFRGNRAGLIDILTDMLSLYKNEPTQVVARA----QELSQ 251
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ +A+ P + + A+ L + +D GGFG APKFP+P + +++ ++++
Sbjct: 252 RVEQAAAIKPGPGVPSDKMIVVAAQNLGRMFDPVDGGFGGAPKFPQPSRLSLLMRYARRT 311
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
D G + MV TL MA GGI+D VGGGFHRYS D +W VPHFEKMLYD QL
Sbjct: 312 RDEGATA-------MVTTTLDKMAAGGIYDQVGGGFHRYSTDAQWLVPHFEKMLYDNAQL 364
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A VYL+A+ T D Y + R+ILDY+ R+M P G +SA DADS G +EG F
Sbjct: 365 AVVYLEAWQHTGDSAYERVAREILDYVAREMTSPEGGFYSATDADSPTPSG--HDEEGWF 422
Query: 318 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+ WT E+E +LG A + + + GN F+G+N+L +
Sbjct: 423 FTWTPGELERLLGAGDAAVVSSAFGVTERGN------------FEGRNILHRVKADQELG 470
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
S+LG+ ++ I+ R L+D R+ RP P D+K+I +WNG++ ++FA+A +L +EA
Sbjct: 471 SELGLAPKRVGEIIRSARSTLYDARASRPPPIRDEKIIAAWNGMMGAAFAKAGWML-AEA 529
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y+EVA A F+ + E L ++R G + FLDDYAF
Sbjct: 530 ---------------RYVEVAARAVGFVLAQMRAEGGA-LVRTYREGKKGSASFLDDYAF 573
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+++ LDLYE W+ A+ELQ QD +LD + GGY+ T + +L+R K +D
Sbjct: 574 IVAACLDLYEATGDAAWIERAVELQTDQDLRYLDEQTGGYYLTAADGEVLLVREKPAYDR 633
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV+ NL+RL K +R+ AE A ++ PL+ A D
Sbjct: 634 AVPSGNSVAANNLLRLHDFTGDPK---WRRRAERLFAWLAFQVTRSPTGFPLLLVALDRY 690
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ V L+ S + + A S+ NK + A+ + + S +
Sbjct: 691 -YDTVLEVALIAPASREEASVLDAQLRKSFVPNKAFTVLTDAEASQQE------STIPWL 743
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A K A VC+ C P + P + L
Sbjct: 744 EAKRAMAGKSTAYVCERGRCELPTSKPQVFQKQL 777
>gi|315425009|dbj|BAJ46683.1| hypothetical conserved protein [Candidatus Caldiarchaeum
subterraneum]
Length = 692
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 267/700 (38%), Positives = 381/700 (54%), Gaps = 81/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME ESFEDE +A+LLN +FV +KVDREERPD+D+VYM V + G GGWPL+
Sbjct: 61 SSCHWCHVMEKESFEDEKIAELLNTFFVPVKVDREERPDIDEVYMKAVIMMTGHGGWPLT 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP GGTYFPP + G G ILR V + W K + + A EQ L
Sbjct: 121 VFLTPDLKPFFGGTYFPPRRRGGLRGLDEILRGVAELWRKDPKQVME----AAEQNVSLL 176
Query: 140 SASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ ++ K D P + L + A + L+ S+DS +GGFG APKFP PV + + +S LE
Sbjct: 177 KSFYTTEK-SDTTPSHNLVVTAFDILATSFDSLYGGFGGAPKFPMPVYLDFLQVYS-VLE 234
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ +MV TL+ MA+GG+ DH+GGGF RYS D W VPHFEKMLYD LA
Sbjct: 235 ------KEPAAVRMVSTTLENMARGGLRDHLGGGFFRYSTDRVWLVPHFEKMLYDNALLA 288
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY++ + +T D FY I LD+L +M+ PGG +SA DADS E EG +Y
Sbjct: 289 RVYMNHYLITGDSFYREIGASTLDWLVSEMMNPGGGFYSAVDADSPE-------GEGEYY 341
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VW E+E ILG E A + + Y + TGN + GKN+L ++ A+
Sbjct: 342 VWRRGELEQILGPELAKIAAKTYAVTDTGNFE-----------HGKNILTMRKRTAELAA 390
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+LG+ +L E + KL D R KRP P +DDK+I +WNG +S+ +
Sbjct: 391 ELGVDEPTLKQMLEEAKNKLLDARRKRPAPGVDDKIIAAWNGFAVSALCTGYR------- 443
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ K Y++ A FI +++ T L ++NG S GFLDDYA +
Sbjct: 444 ---------ATGEKRYLDAALKTIDFIISNMWLNNT--LHRIYKNGAS-INGFLDDYAAV 491
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
++ LLD++E ++L A+++ N ELF D GG++ T ED + + R+K+ +DGA
Sbjct: 492 VNALLDVFEVSFEPRYLAVAVDVANRMVELFWDNVDGGFYYTV-EDVAGVTRIKDAYDGA 550
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM-AMAVPLMCCAADML 616
PSGN+++ L++L+ + +K Y Q E +L F +RL+ A L+ A
Sbjct: 551 TPSGNTLAAAALLKLSELTGETK---YLQYVEETLKCFASRLEAAPAEHTGLITVLAGFH 607
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ SR VVLV +S + LA + ++ ++V+ + HN N ++
Sbjct: 608 T--SRMEVVLV-TESPQEARPYLAHLYRAFKPFRSVVVV-------------HNGNRDTL 651
Query: 677 AR-NNFSADK-----VVALVCQNFSCSPPVTDPISLENLL 710
+ ADK V A VC+N+SC PVT SLE +
Sbjct: 652 QKYTRLVADKPAKGPVTAYVCENYSCRMPVT---SLEEFV 688
>gi|301061221|ref|ZP_07202007.1| conserved hypothetical protein [delta proteobacterium NaphS2]
gi|300444689|gb|EFK08668.1| conserved hypothetical protein [delta proteobacterium NaphS2]
Length = 694
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 258/699 (36%), Positives = 379/699 (54%), Gaps = 68/699 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFED A++LND +VSIKVDREERPD+DK+YM+ QAL G GGWPLSV
Sbjct: 55 TCHWCHVMAHESFEDPETARILNDHYVSIKVDREERPDLDKIYMSVCQALTGRGGWPLSV 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ P GTYFP G GF +L K+ W + R+ L +G ++++E L
Sbjct: 115 FLTPERIPFFAGTYFPKIGHQGLIGFPELLLKLGKLWKEDRERLLTAG----DEITEHLR 170
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S + L L QLS+S+D R+GGFG APKFP P ++ +L + ++
Sbjct: 171 NSELGGSVEKSLDMEVLNKAGVQLSRSFDPRWGGFGGAPKFPSPHQLTFLLRRHVRSKN- 229
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+ +MV TLQ M +GG+ DH+G GFHRYSVDE+W PHFEKMLYDQ LA
Sbjct: 230 ------ARDLEMVEKTLQSMRRGGLFDHIGYGFHRYSVDEKWFAPHFEKMLYDQALLAMA 283
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A+ +T FY+ + R+I Y+ RDM P G +SAEDADS EG EG FY+W
Sbjct: 284 YTEAYQVTGKSFYARVAREIFTYVLRDMTSPEGGFYSAEDADS---EGV----EGLFYLW 336
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSR----MSDPHNEF-KGKNVLIELNDSSA 374
T KEV++ILG E A LF +++ ++ GN + R M +P + F +G+N
Sbjct: 337 TPKEVQEILGTESADLFCDYFDIRERGNFEEGRSIPHMREPLSTFAEGRN---------- 386
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
M +++ +++L + R KLF R KR P DDK++ SWNGL+I++ + + L
Sbjct: 387 ------MGVKRLVSLLRQGREKLFSARQKRIHPLKDDKILTSWNGLMITALFKGYRALGD 440
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
A Y+ A+++ FI L E L +R G + G+LDDY
Sbjct: 441 AA----------------YVTAAQNSLQFILNTLRKEDGC-LIRRYREGETAHAGYLDDY 483
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
AFL+ L++ YE L A+ L +T +LF D E GG+F T E+ +++ R ++
Sbjct: 484 AFLVWALIEGYESTFNPNHLKTAMVLTHTMLDLFWDSENGGFFFTGRENETLIARSRDAQ 543
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DGA PSGNSV+ + L++L + + + + A + F ++ A M A D
Sbjct: 544 DGAIPSGNSVAALTLLQLGRLTGDTS---FEEKANALMQAFSGQMDAYPSAHTQMLQALD 600
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
+ P+++ VV+ G + + + ML ++ L + V + ++ E E + A
Sbjct: 601 FVIGPTQE-VVIAGTRHDRNTDVMLKVIQQNF-LPRQVALLVSSNEE-----RERVAGLA 653
Query: 675 SMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
+ + K A +C+ +C PVTDP ++E L E
Sbjct: 654 PYVKEMVPVEGKATAYICRRHACQAPVTDPEAMEKALNE 692
>gi|83590501|ref|YP_430510.1| hypothetical protein Moth_1665 [Moorella thermoacetica ATCC 39073]
gi|83573415|gb|ABC19967.1| Protein of unknown function DUF255 [Moorella thermoacetica ATCC
39073]
Length = 752
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 273/752 (36%), Positives = 372/752 (49%), Gaps = 90/752 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVM ESF DE VA LLND F++IKVDREERPD
Sbjct: 32 GEEAFARAKREDKPVFLSIGYSTCHWCHVMARESFNDEEVAALLNDSFIAIKVDREERPD 91
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D+VYM QAL G GGWPL+VFL+P+ +P GTYFP ++YGRPG +L+ +++ W
Sbjct: 92 IDQVYMAACQALTGSGGWPLTVFLTPEKRPFYAGTYFPKHNRYGRPGLVELLKLIREKWA 151
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
R+ L +SGA I+ ++ + + P E L +QL +D +GGF A
Sbjct: 152 THREELEESGAELIQHVAGQFAPTP-----PGEPGAQVLEKGWQQLRAGFDPLYGGFSEA 206
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P ++ +L + K+ ++ G MV TLQ M GGI+DH+G GF RYS
Sbjct: 207 PKFPSPHQLLFLLRYWKRYDEAG-------ALAMVEKTLQAMYCGGIYDHIGFGFARYST 259
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D RW VPHFEKMLYD LA YL+ T YS++ R+I ++ RDM P G +SA
Sbjct: 260 DRRWLVPHFEKMLYDNALLALAYLETRQATGKAVYSHVAREIFTWVLRDMTSPEGGFYSA 319
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
DADS EG +EG FY+WT +V ++LG F Y+ T + S P+
Sbjct: 320 LDADS---EG----EEGRFYLWTPDQVREVLGAKEGEFFCRYF-DITAGGNFEGRSIPNL 371
Query: 359 EFKGKNVLI------ELNDSSASASK------------------LGMPLEKYLNILGEC- 393
+G+ + E ND++ + G P E L G
Sbjct: 372 IGRGEALFAAGTSGNESNDTAGDQRQPREQGGRAGGISGGGGCAKGSPEEDRLPGRGPTT 431
Query: 394 ---------------RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
R KLF R KR PH DDK++ +WNGL+I++ AR + +L
Sbjct: 432 LAGFGPATAARLAAAREKLFAAREKRVHPHRDDKILTAWNGLMIAALARGAWVL------ 485
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
D Y A AA FI HL D + RLQ +R G + P +LDDYAFL
Sbjct: 486 ----------DEPAYAAAAARAARFILTHLRDAEG-RLQARYREGQAAFPAYLDDYAFLT 534
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++LY+ T +L A+ L ELF D EGGGYF T + +R +E +DGA
Sbjct: 535 WGLIELYQATFETGYLREALALTRQMQELFRD-EGGGYFFTPHGAGELPVRPREVYDGAI 593
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV+ +NL+RLA I S+ + + A + + + CA D
Sbjct: 594 PSGNSVAALNLLRLARITGDSRLE---EEAAAQVRALAGTVAEYPRGYSFYLCALDFYLG 650
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P +VL G + + D +L A+Y L V+ + P E EE A
Sbjct: 651 PV-TEIVLAGERETEDTRALLRVLRAAY-LPSAVLVLRPGGREG----EEVTRLIPYTAG 704
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K +C+NF+C PVT LE L
Sbjct: 705 QKPVNGKATLYLCRNFACRAPVTTAGELEQWL 736
>gi|385811559|ref|YP_005847955.1| thioredoxin domain-containing protein [Ignavibacterium album JCM
16511]
gi|383803607|gb|AFH50687.1| Thioredoxin domain protein [Ignavibacterium album JCM 16511]
Length = 692
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 242/684 (35%), Positives = 369/684 (53%), Gaps = 54/684 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VAKL+ND F+SIKVDREERPD+D VYM Q + GGGGWPL+
Sbjct: 51 STCHWCHVMERESFEDEEVAKLMNDTFISIKVDREERPDIDGVYMAVCQMITGGGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTYFP +++GR G ++ K+ D W +R+ + S E++++++
Sbjct: 111 IVMTPDKKPFFAGTYFPKYNRFGRIGMLELITKLNDIWKNRREEVLNSA----EEITKSI 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S K +E+ + L ++ S+ +D +GGFG+APKFP P + +L + ++ ++
Sbjct: 167 N-KISHKKSDEEIDEKILDKAFDEYSRRFDKEYGGFGNAPKFPTPHNLLFLLRYYRRTKN 225
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
K+V TL M KGGI+D +G GF RYS D+ W VPHFEKMLYD L
Sbjct: 226 LS-------ALKIVEKTLTEMRKGGIYDQIGFGFARYSTDKYWLVPHFEKMLYDNALLLM 278
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+ +AF +T + FY +I +Y+ RDM P G FSAEDADS EG +EG FY+
Sbjct: 279 AFSEAFQITGNDFYKTTSEEIAEYVLRDMTHPEGGFFSAEDADS---EG----EEGKFYL 331
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT E+ ++L + A + + ++P GN + G N+L A+
Sbjct: 332 WTEVEIRELLTKDEADFIIKVFNIEPNGNW----YDEARGVRTGNNILHLKKSYKELAND 387
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
L M ++ L R+K+FD R KR PH DDK++ WN L+IS+ ++S IL
Sbjct: 388 LSMSENDFIKNLSSIRKKMFDWRKKRVHPHKDDKILTDWNSLMISALIKSSVIL------ 441
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
D+ ++++ A A F++++L+ ++ +L H FR S G +DDYAF I
Sbjct: 442 ----------DKNKFLQAAMKADKFVKKYLF--RSEKLLHRFRESESAIDGNIDDYAFFI 489
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
LDL+E S ++L+ AI L F D + GGYF T+ + +++R KE +DGA
Sbjct: 490 QAQLDLFEATSEAEFLLTAIRLNEILFHKFWDDKSGGYFFTSEDSEKLIVRQKEIYDGAI 549
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV ++NL+RL + + Y + A+ + F + + M C D LS
Sbjct: 550 PSGNSVQLLNLLRLYELTGNA---VYYEIAQKQVKAFASEVSRMPSVFAQFLCGFDFLSG 606
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
S + V+ K+ D + Y +K +I ID ++ +++ S +
Sbjct: 607 ASVQLVITAKDKNVAD--EIFKKLSREYFPSKVIIRIDNSNCQKL-------SEIIPHLK 657
Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
+ +K C++F C P +
Sbjct: 658 DYKVEEKPTIYFCRDFVCEKPTNN 681
>gi|159897570|ref|YP_001543817.1| hypothetical protein Haur_1041 [Herpetosiphon aurantiacus DSM 785]
gi|159890609|gb|ABX03689.1| protein of unknown function DUF255 [Herpetosiphon aurantiacus DSM
785]
Length = 681
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 251/690 (36%), Positives = 365/690 (52%), Gaps = 64/690 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED A ++N+ FV+IKVDREERPD+D +YM VQA+ GGWP++
Sbjct: 48 SACHWCHVMAHESFEDPATAAVMNELFVNIKVDREERPDIDSLYMAAVQAMTRHGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD P GGTYFPPE ++ P F+ +L V +A+ +R+ + QS E L + L
Sbjct: 108 VFLTPDGAPFYGGTYFPPEPRHNMPSFQQVLHGVAEAYRDRREEVFQSAEQMREHLEDIL 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S K L ++ L + A++ +DSRFGG+G APKFP+ + M+L + ED
Sbjct: 168 SFDLEQVK----LSKSQLNVAAQRQMSQFDSRFGGYGGAPKFPQALIFGMVLRTWLRSED 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ ++ TLQ MA GG++D +GGGF RYSVD +W VPHFEKMLYD L+
Sbjct: 224 QDALNQVTQ-------TLQAMANGGMYDQLGGGFARYSVDAQWLVPHFEKMLYDNALLSQ 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+YL+ + T D FY I + ++Y+ RDM P G ++AEDADS EG +EG FYV
Sbjct: 277 LYLETYQATHDPFYRRIAEESINYILRDMTSPDGGFYAAEDADS---EG----EEGKFYV 329
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ E++ +L E A L + ++ ++P GN F+G +L D S A +
Sbjct: 330 WSLAEIQQLLSPEDAALAQLYWNIQPEGN------------FEGHAILYVPQDPSVVAKE 377
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
L + + R L R+ R RP D+K++ SWNG+++ S A A+ +L
Sbjct: 378 LSISEADLAQRIAVIRATLLAQRNTRIRPGRDEKILASWNGMMLRSLAFAANVL------ 431
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
D +Y A A FI LY Q +L S+++G +K G+L+DYA +
Sbjct: 432 ----------DNADYRAAAIRNAEFITSKLY--QNGQLYRSYKDGQAKFKGYLEDYACVA 479
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
G+L LYE +WL AIEL + E F D + +F+T + ++ R ++ +D A
Sbjct: 480 DGMLALYEATFDLRWLQVAIELAESMTERFWDAQQRSFFDTASDHEQLITRPRDLYDNAT 539
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
P+GNSV+V L+RLA+++ + YRQ AE LA L + A + AAD
Sbjct: 540 PAGNSVAVDVLLRLATLLDRYE---YRQYAETVLANLSGALLQLPGAFGRLLAAADFALA 596
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN--ASM 676
R+ V L+G + F+ +L A + +Y NK V P D H + +
Sbjct: 597 EPRE-VALIGDPADPAFKALLQATYRNYQPNKVVAACKPDD---------HAAQQLIPLL 646
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISL 706
A + A VC +C P DP L
Sbjct: 647 AERPLLNQQATAYVCVRRACKLPTNDPNEL 676
>gi|315426698|dbj|BAJ48323.1| conserved hypothetical protein [Candidatus Caldiarchaeum
subterraneum]
gi|343485462|dbj|BAJ51116.1| conserved hypothetical protein [Candidatus Caldiarchaeum
subterraneum]
Length = 692
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 265/699 (37%), Positives = 377/699 (53%), Gaps = 79/699 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME ESFEDE +A+LLN +FV +KVDREERPD+D+VYM V + G GGWPL+
Sbjct: 61 SSCHWCHVMEKESFEDEKIAELLNTFFVPVKVDREERPDIDEVYMKAVIMMTGHGGWPLT 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP GGTYFPP + G G ILR V + W K + + A EQ L
Sbjct: 121 VFLTPDLKPFFGGTYFPPRRRGGLRGLDEILRGVAELWRKDPKQVME----AAEQNVSLL 176
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ ++ K N + + L+ S+DS +GGFG APKFP PV + + +S LE
Sbjct: 177 KSFYTTEKSVTTPSHNLVVTAFDILATSFDSLYGGFGGAPKFPMPVYLDFLQVYS-VLE- 234
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ S +MV TL+ MA+GG+ DH+GGGF RYS D W VPHFEKMLYD LA
Sbjct: 235 -----KESAAVRMVSTTLENMARGGLRDHLGGGFFRYSTDRVWLVPHFEKMLYDNALLAR 289
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY++ + +T D FY I LD+L +M+ PGG +SA DADS E EGA+YV
Sbjct: 290 VYMNHYLITGDSFYREIGASTLDWLVSEMMNPGGGFYSAVDADSPE-------GEGAYYV 342
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W E+ ILG E A + + Y + TGN + GKN+L ++ A++
Sbjct: 343 WRLGELGQILGPELAKIAAKTYAVTDTGNFE-----------HGKNILTMRKRTAELAAE 391
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG+ +L E + KL D R KRP P +DDK+I +WNG +S+ +
Sbjct: 392 LGVDEPTLKQMLEEAKNKLLDARRKRPAPGVDDKIIAAWNGFAVSALCTGYR-------- 443
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ K Y++ A FI +++ T L ++NG S GFLDDYA ++
Sbjct: 444 --------ATGEKRYLDAALKTIDFIISNMWLNNT--LHRIYKNGAS-INGFLDDYAAVV 492
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+ LLD++E ++L A+++ N ELF D GG++ T ED + + R+K+ +DGA
Sbjct: 493 NALLDVFEVSFEPRYLAVAVDVANRMVELFWDNVDGGFYYTV-EDVAGVTRIKDAYDGAT 551
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM-AMAVPLMCCAADMLS 617
PSGN+++ L++L+ + +K Y Q E +L F +RL+ A L+ A +
Sbjct: 552 PSGNTLAAAALLKLSELTGETK---YLQYVEETLKCFASRLEAAPAEHTGLITVLAGFHT 608
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
SR VVLV +S + LA + + ++V+ + HN N ++
Sbjct: 609 --SRMEVVLV-TESPQEARPYLAHLYREFKPFRSVVVV-------------HNGNRDTLQ 652
Query: 678 R-NNFSADK-----VVALVCQNFSCSPPVTDPISLENLL 710
+ ADK V A VC+N+SC PVT SLE +
Sbjct: 653 KYTRLVADKPAKGPVTAYVCENYSCRMPVT---SLEEFV 688
>gi|328951864|ref|YP_004369198.1| hypothetical protein Desac_0120 [Desulfobacca acetoxidans DSM
11109]
gi|328452188|gb|AEB08017.1| protein of unknown function DUF255 [Desulfobacca acetoxidans DSM
11109]
Length = 693
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 250/696 (35%), Positives = 369/696 (53%), Gaps = 60/696 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM E FED +A+L+N+WF++IKVDREERPD+D +YM VQ + G GGWPL+
Sbjct: 53 STCHWCHVMAHECFEDPEIARLMNEWFINIKVDREERPDLDDIYMHAVQMITGRGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+LKP GGTYFPP D+ G PGF +L+ + D++ K+ + A +EQ L
Sbjct: 113 VFLTPELKPFYGGTYFPPIDRGGLPGFPRLLQALHDSYKNKKSNIHNVIA-TLEQNMRIL 171
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + +S + P AL E +D GGF APKFP ++ H
Sbjct: 172 ALTPASGQAPS---LAALDQLIEHNLADFDEGNGGFRGAPKFPPSQDLGFWACHYH---- 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++G+ Q + L TLQ MA+GG++D + GGFHRYSVD+ W +PHFEKMLYD QLA
Sbjct: 225 --RTGQPKVLQSLSL-TLQKMARGGLYDQLRGGFHRYSVDDVWLIPHFEKMLYDNAQLAR 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+A+ +T DVF + + + LDY+ +M P G ++A+DADS EG EG F+V
Sbjct: 282 RYLEAYQITGDVFLAQVAQQTLDYVLAEMTAPEGVFYAAQDADS---EGV----EGRFFV 334
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +++ ++ G + A L + + GN + G +VL + + A +
Sbjct: 335 WTPEQIAEVAGAQRAPLICAAFGVTQEGNFE-----------HGASVLHRPQNEAQLAEQ 383
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ +++ ++L E RR+L+ R +R RPH D+K+I +WN L+IS+ A S++L
Sbjct: 384 FSLNMDEMRHVLTEARRRLWQGREQRVRPHRDEKIITAWNALMISALAYGSQVL------ 437
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
D + Y A +AA FI + Q RL + + FLDD+AF I
Sbjct: 438 ----------DNRTYRGAAITAAQFILGR--EAQAGRLLRIWAATDRQGSAFLDDFAFFI 485
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+ LLDLYE WL A+ L + F DRE GGYF+T + +L+R K D A
Sbjct: 486 AALLDLYETDFSPAWLAAAVRLSKEVETSFYDREAGGYFSTPVDHEKLLVRPKNFFDLAI 545
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV V NL+RL DY+ + A+ +L +T + + + + A +
Sbjct: 546 PSGNSVMVHNLIRLHRFT--DNPDYFLR-AQETLTRLQTLMMENPRGLSHLAAATEDFLA 602
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P+ + LVG+ + MLA + Y ++ ++ DP E + AR
Sbjct: 603 PTLA-ITLVGNPTEPALAEMLAVVYRHYLPHRRLVVKDPESCEAL-------LEIVPAAR 654
Query: 679 NNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLEK 713
+ D + A VC +C PV L+NLL +
Sbjct: 655 HYDRIDGRPTAFVCHGQTCQAPVFSAGGLDNLLATR 690
>gi|148379048|ref|YP_001253589.1| hypothetical protein CBO1058 [Clostridium botulinum A str. ATCC
3502]
gi|153933571|ref|YP_001383431.1| hypothetical protein CLB_1099 [Clostridium botulinum A str. ATCC
19397]
gi|153935757|ref|YP_001386978.1| hypothetical protein CLC_1111 [Clostridium botulinum A str. Hall]
gi|148288532|emb|CAL82612.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
3502]
gi|152929615|gb|ABS35115.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
19397]
gi|152931671|gb|ABS37170.1| conserved hypothetical protein [Clostridium botulinum A str. Hall]
Length = 680
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 249/693 (35%), Positives = 358/693 (51%), Gaps = 72/693 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 53 TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + A+ L ++DS++GGFG+ PKFP I +L Y+ KK E
Sbjct: 170 --FQDNHRQGELEEYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKKDE 227
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++ TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 228 KV---------LDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I +L+Y+++ M G +SAEDADS EG EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+WT +E+ DILG E L+ + Y + GN F+ KN+ +N
Sbjct: 332 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVD 379
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LEK R KLF+ R KR P+ DDK++ SWN L+I +F++A + LK++
Sbjct: 380 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 430
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF
Sbjct: 431 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 475
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ L++LYE +L +IE+ N+ +LF +E GG++ + +L+R KE +DGA
Sbjct: 476 LWALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V+ + L L I D Y+ + F T +K M L A M +
Sbjct: 536 TPSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYN 591
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ K + L +K DF + + Y V D ++ E N ++
Sbjct: 592 ISPVKEITLAYNKKDEDFYKFINEVNNRYIPFSIVTLNDKSN--------EIEKINKNIK 643
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
DK +CQN++C P+TD ++LL
Sbjct: 644 DKIAIKDKTTVYICQNYACREPITDLEEFKSLL 676
>gi|416351321|ref|ZP_11681110.1| thymidylate kinase [Clostridium botulinum C str. Stockholm]
gi|338196028|gb|EGO88249.1| thymidylate kinase [Clostridium botulinum C str. Stockholm]
Length = 611
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 243/674 (36%), Positives = 360/674 (53%), Gaps = 75/674 (11%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFEDE VAK+LND ++SIKVDREERPDVD YMT+ QA+ G GGWPL++ ++P+ K
Sbjct: 1 MEKESFEDEEVAKILNDKYISIKVDREERPDVDNTYMTFCQAVTGSGGWPLTIIMTPEQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP + YGRPG IL+++ D W +D + + + + E +S S
Sbjct: 61 PFFAGTYFPKKSMYGRPGIIQILKQISDEWKNNKDKIINTSNKLLNTMKERVSQDKS--- 117
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
+E+ + L +++ YD+++GGFG APKFP P ++ ++L + K D G
Sbjct: 118 --EEINGSILHDAIMEMNYYYDNKYGGFGIAPKFPTPHKLMLLLIYYKVYNDKSALG--- 172
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
MV TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD LA VY +A+ +
Sbjct: 173 ----MVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYVYTEAYQV 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T FY + I Y+ RDM P G +SAEDADS EG EG FYVW+ +E++
Sbjct: 229 TGKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYVWSLEEIQS 281
Query: 328 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 387
ILGE A F Y + GN F+GKN+ + +G LE +
Sbjct: 282 ILGEDAKEFCNTYDITEKGN------------FEGKNI----------PNLIGKDLEN-I 318
Query: 388 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 447
+ L E R KLF VR KR P DDK++ +WN L+I S + A ++
Sbjct: 319 DKLEELRNKLFKVREKRVHPFKDDKILTAWNALMIVSLSYAGRVF--------------- 363
Query: 448 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 507
+ KEY+ A+ A FI +L + RL FR+G + +L+DY+FL+ L++LYE
Sbjct: 364 -ENKEYINRAKKAYDFIENNLI-RKDGRLLARFRHGEAAYIAYLEDYSFLVWALMELYEA 421
Query: 508 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 567
+ +L A+ + +LF D E G+F++ + ++L +K+ +D A PSGNSV+ +
Sbjct: 422 TFESNYLKQALNFTDKMIKLFWDEESYGFFHSGRDGEKLILNLKDSYDTAIPSGNSVTAM 481
Query: 568 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
NL++L+ I + + A F +K+ + + + PSR+ +V+
Sbjct: 482 NLIKLSKITGDNSLG---EKAYKMFQGFGGNIKESLQSHSIFLISYMNYIKPSRQ-IVIA 537
Query: 628 GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD-KV 686
K F+ M+ + + + T+I ++ + E N ++ D K
Sbjct: 538 SEKEDRLFKEMIKKVNKRF-MPFTIILLNDGNLE----------NIVPFIKDEKKIDNKT 586
Query: 687 VALVCQNFSCSPPV 700
A +C+NFSC+ PV
Sbjct: 587 TAYICENFSCNKPV 600
>gi|430746011|ref|YP_007205140.1| thioredoxin domain-containing protein [Singulisphaera acidiphila
DSM 18658]
gi|430017731|gb|AGA29445.1| thioredoxin domain protein [Singulisphaera acidiphila DSM 18658]
Length = 701
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 250/683 (36%), Positives = 364/683 (53%), Gaps = 58/683 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE+ A L+N+ F+++KVDREERPDVD++YM VQA+ GGWP+S
Sbjct: 66 SACHWCHVMEHESFENADTAALMNEHFINVKVDREERPDVDQIYMAAVQAMTDHGGWPMS 125
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP GTYFPP D G PGF +L V AW ++RD + S +++
Sbjct: 126 VFLTPDLKPFYCGTYFPPVDGRGMPGFPRVLYSVHRAWAERRDDILISAGDLTDRIRLMG 185
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A+S L L A R L++S+D+ GGFGSAPKFP P++++++L + +
Sbjct: 186 KIPAASGALESVLLDQAAR----GLARSFDTIHGGFGSAPKFPHPMDLKVLLRQHARTRE 241
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ ++V TL MA+GGI+D + GGF RYS DERW PHFEKMLYD L++
Sbjct: 242 -------AHPLQIVRHTLDKMARGGIYDQLLGGFARYSTDERWLAPHFEKMLYDNALLSS 294
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYL+A +T D Y+ + R+ +DY+ M GP GEI+S EDADS EG +EG FYV
Sbjct: 295 VYLEAHQVTGDAEYARVARETMDYILERMTGPEGEIYSTEDADS---EG----EEGKFYV 347
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ EV ILG E A F Y + +GN ++ +N+L +A++
Sbjct: 348 WSLAEVNQILGPERAKEFAAVYDVTESGN------------WEHQNILNLPMSVDQAATR 395
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG + L R +L + R +R P D KV+ SWNGL++++ A S+ILK E
Sbjct: 396 LGRDERELQADLDRDRARLLEARDRRVPPGKDTKVLTSWNGLMLAALAEGSRILKDE--- 452
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
Y++ A AA+F+ + + RL H++++G ++ G+LDDY+ LI
Sbjct: 453 -------------RYLDAATKAAAFLLDRMRTAEG-RLLHAYKDGRARFNGYLDDYSNLI 498
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL LYE +W+ A+EL + F D E GG+F T ++ R K+ D A
Sbjct: 499 DGLTRLYEVSGEPRWIEAALELTAVMIDEFHDAEAGGFFYTGRSHEVLIARQKDFQDNAT 558
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN++ L+RL ++ G +S R +L + L MA+ A D
Sbjct: 559 PSGNAMVATALLRLGALT-GRES--LRTLGRSTLEAVQAYLDRAPMAMGQSLVALDFELA 615
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
R+ V+ G +F ++ A +A + +K V PA E+ E +A
Sbjct: 616 SPREFAVIAG-SDPAEFRRVMEAIYAPFLPHKVVA---PALAEKASALAE---TLPLLAD 668
Query: 679 NNFSADKVVALVCQNFSCSPPVT 701
D+ +C+ F+C PV
Sbjct: 669 RPAQDDRTTTYICERFTCHAPVV 691
>gi|410721128|ref|ZP_11360472.1| N-acylglucosamine 2-epimerase [Methanobacterium sp. Maddingley
MBC34]
gi|410599579|gb|EKQ54125.1| N-acylglucosamine 2-epimerase [Methanobacterium sp. Maddingley
MBC34]
Length = 708
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 264/716 (36%), Positives = 370/716 (51%), Gaps = 59/716 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVM ESF+D + LLN FV +KVDREERPD
Sbjct: 44 GDEAFDKAKKEDKPIFLSIGYSTCHWCHVMARESFQDPEIGDLLNQVFVPVKVDREERPD 103
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYMT Q + G GGWPL++ ++PDLKP GTYFP + G + ++ V D W+
Sbjct: 104 IDSVYMTVCQMITGSGGWPLTIIMTPDLKPFFAGTYFPKDTGPRGTGLRDLILNVHDLWE 163
Query: 119 KKRDMLAQSG---AFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
KR+ L +S +++Q+S S +K ++L L + +++D + GF
Sbjct: 164 NKREDLLKSAEDLTLSLQQISH-----RSPDKSGEQLNDGILNQTYQSQLENFDQEYAGF 218
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
G+ KFP P + +L + K +GE E MV TL M KGGI+DHVG GFHR
Sbjct: 219 GTNQKFPTPHHLLFLLRYWK------HTGE-DEALTMVEKTLDAMRKGGIYDHVGFGFHR 271
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
Y+VD +W VPHFEKMLYDQ L Y +AF T Y ++L+YL RDM P
Sbjct: 272 YTVDRKWVVPHFEKMLYDQALLVIAYTEAFQATGKTKYRETAEEVLEYLLRDMRSPEDGF 331
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMS 354
+SAEDADS EG +EG FY+WT E+ +ILG E LF Y + GN
Sbjct: 332 YSAEDADS---EG----EEGKFYLWTLDEIINILGPEEGELFSRVYSVSENGNFK----D 380
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
+ E GKN+L + KL M E+ R LF R R PH DDK++
Sbjct: 381 EATGEKTGKNILHRSQTWDELSKKLEMSPEELWWKTESARETLFQAREGRVHPHKDDKIL 440
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
WNGLVI + A A K+ R++Y+ A A +FI + Q
Sbjct: 441 TDWNGLVIVALALAGKVFG----------------REDYLLAATEAVNFIMTKI--NQQG 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
RL H +R+G + G LDDYA+LI GLL+LY+ +++L A++L T E F D + G
Sbjct: 483 RLHHRWRDGEAAVDGNLDDYAYLIWGLLELYQATFNSEYLKTALKLNQTILEHFWDHDNG 542
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
G++ T+ P +L+R KE +D A PSGNSV ++NL +L I D + + ++L
Sbjct: 543 GFYFTSDYAPEILVRQKEAYDTALPSGNSVMMMNLEKLYLIT----EDIHIREISNALEK 598
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
+ + + + + + M +A +L + + G K S D + ML A + Y N +I
Sbjct: 599 YFSPMIEQSPSAFTMFLSAIILKRGPSFKIAITGEKDSADTKAMLNALYKKYLPNCMLI- 657
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ +D ++ E + N M NN K A VC N +C PV P L NLL
Sbjct: 658 LRSSDDAMINQIIESSETNIMM--NN----KATAYVCGNGTCHAPVNTPEDLVNLL 707
>gi|387817346|ref|YP_005677690.1| hypothetical protein H04402_01136 [Clostridium botulinum H04402
065]
gi|322805387|emb|CBZ02951.1| hypothetical protein H04402_01136 [Clostridium botulinum H04402
065]
Length = 680
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 249/693 (35%), Positives = 358/693 (51%), Gaps = 72/693 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VAK+LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 53 TCHWCHVMERESFEDEEVAKVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + A+ L ++DS++GGFG+ PKFP I +L Y+ KK
Sbjct: 170 --FQDNHREGELEEYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKK-- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ +V TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I +L+Y+++ M G +SAEDADS EG EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+WT +E+ DILG E L+ + Y + GN F+ KN+ +N
Sbjct: 332 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVD 379
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LEK R KLF+ R KR P+ DDK++ SWN L+I +F++A + LK++
Sbjct: 380 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 430
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF
Sbjct: 431 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 475
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ L++LYE +L +IE+ N+ +LF +E GG++ + +L+R KE +DGA
Sbjct: 476 LWALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V+ + L L I D Y+ + F T +K M L A M +
Sbjct: 536 TPSGNAVAALTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYN 591
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ K + L ++ DF + + Y V D ++ E N ++
Sbjct: 592 ISPVKEITLAYNEKDEDFYKFINEVNNRYIPFSIVTVNDKSN--------EIEKINKNIK 643
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
DK +CQN++C P+TD ++LL
Sbjct: 644 DKIAIKDKSTVYICQNYACREPITDLEEFKSLL 676
>gi|373458119|ref|ZP_09549886.1| hypothetical protein Calab_1940 [Caldithrix abyssi DSM 13497]
gi|371719783|gb|EHO41554.1| hypothetical protein Calab_1940 [Caldithrix abyssi DSM 13497]
Length = 684
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 260/702 (37%), Positives = 369/702 (52%), Gaps = 82/702 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE A+L+N FV+IKVDREERPD+D+ YM +VQ L G GGWPL+
Sbjct: 51 SACHWCHVMEKESFEDEETAQLMNRLFVNIKVDREERPDIDQHYMEFVQTLTGSGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GGTYFPPED+YG+P FK +L V + + K R L ++ ++++ E +
Sbjct: 111 VFLTPDGEPFYGGTYFPPEDRYGKPAFKKLLVMVSEYYHKNRQQLEEN----LDKIREIM 166
Query: 140 SASASSNK---LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+ K +PD A ++L++ YD+ GG G APKFP +Q+ +K
Sbjct: 167 ARQRREIKGRHIPDT---EAWNQAVQRLTQFYDALNGGMGQAPKFP---AVQVFSLFLRK 220
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
G + +M TLQ MA GGI+D +GGGF RY+VDE+W VPHFEKMLYD Q
Sbjct: 221 FAHHGD----KQFLRMAEHTLQRMANGGIYDQLGGGFARYAVDEKWRVPHFEKMLYDNAQ 276
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
LA++Y+DA+ LT++ FY I R+ L+++RR++ P G +S+ DADS EG +EG
Sbjct: 277 LASLYIDAYRLTQNPFYLQIARETLEFVRRELTDPDGGFYSSLDADS---EG----QEGK 329
Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY+W+ E+ ILG E LF + + GN F+G N+L
Sbjct: 330 FYLWSKDEILKILGDETGRLFCARFGVTDGGN------------FEGSNILFVSKSFDEL 377
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A++ E+ ++ + R+K+ R +R RP LD K + SWNGL++S+FA A ++ +
Sbjct: 378 AAEFKKTPEEIEALIRQARKKMLAEREQRIRPGLDYKALTSWNGLMLSAFAAAYQVTLNP 437
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y V + F+RR+LY Q+ RL H + G SK F+DDYA
Sbjct: 438 T----------------YAAVIDKNIDFVRRNLY--QSGRLLHVYSKGQSKIDAFVDDYA 479
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY-FNTTGEDPSVLLRVKEDH 554
+LI GLLD YE +L A+EL ++LF D+ GGY F TG+D + K +
Sbjct: 480 YLIQGLLDAYEALFDEHYLQMAVELTRRANDLFWDKRHGGYFFEATGKDQAK-RHFKSET 538
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D ++PS +V + N +RL Y Q AE + + + + A A D
Sbjct: 539 DASQPSPTAVMLHNQLRLFHFTG---EQLYLQTAEQLMRKYGQKALENPYAFASFLNALD 595
Query: 615 M-LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
LS P +L+ K F+ + Y NK V+ + S+
Sbjct: 596 FYLSQPLE---ILILKKDQQRFDAFQKLIFSRYLPNKVVL-------------VQTASSK 639
Query: 674 ASMARNNFSA-----DKVVALVCQNFSCSPPVTDPISLENLL 710
ASM R K A VC SCS PVT L+ +L
Sbjct: 640 ASMGRPLLQGRESMEGKTTAFVCHGQSCSLPVTTVDGLKQIL 681
>gi|326203005|ref|ZP_08192872.1| glycoside hydrolase family 76 [Clostridium papyrosolvens DSM 2782]
gi|325987082|gb|EGD47911.1| glycoside hydrolase family 76 [Clostridium papyrosolvens DSM 2782]
Length = 672
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 260/693 (37%), Positives = 369/693 (53%), Gaps = 76/693 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA +LN F+ IKVDREERPD+D +YM+ Q L G GGWPL+
Sbjct: 53 STCHWCHVMERESFEDEEVAHILNRDFICIKVDREERPDIDSIYMSVCQTLTGHGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFP ++ G G ++L VK+AWD KR+ L +S IE +S
Sbjct: 113 VFLTPDRQPFYAGTYFPKDNSKGSIGLMSLLDSVKEAWDLKRESLLESAKNIIEHVSHEE 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S+ + + ++ + + ++D ++GGFG++PKFP P + +L +
Sbjct: 173 SSDETI------ISKDIIHEAFKHFKYNFDIKYGGFGTSPKFPSPHTLLFLL----RYWY 222
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T K A E MV TL+ M GGI DH+G GF RYS D++W VPHFEKMLYD LA
Sbjct: 223 TEKEPFALE---MVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALLAI 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+S T + Y R ILDY++RDM G +SAEDADS EG EG FY+
Sbjct: 280 AYGEAYSATGNKNYEETSRQILDYVQRDMSSQLGAFYSAEDADS---EGF----EGKFYI 332
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASAS 377
W+ +EV +LG+ KE+ C+L ++ P F+G N+ LIE S
Sbjct: 333 WSQEEVMKVLGQKD--GKEY--------CNLFDIT-PSGNFEGLNIPNLIETGALSQQQK 381
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
ECR+KLF+ R KR P+ DDKV+ SWNGL+I++ A +I
Sbjct: 382 SFA----------EECRKKLFNHREKRVHPYKDDKVLTSWNGLMIAAMAYCGRIF----- 426
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
G +R Y+E A+ FI + L RL +R+G + P +L+DYAFL
Sbjct: 427 ---------GEER--YIETAKRCVDFIYKKLI-RTDGRLLARYRDGEAMFPAYLEDYAFL 474
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ GLL+LYE T +L A++L + LF + F + ++ R +E +DGA
Sbjct: 475 VWGLLELYEATFTTIYLKRALKLTDAMLNLFGENNSAALFLYGHDSEQLISRPRESYDGA 534
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ +NL+RLA I + Y A+ + F ++K M ++ M S
Sbjct: 535 IPSGNSVAAMNLLRLARITGHHE---YENRAKAIMDFFNNQVKAAPTGHSYM-LSSYMYS 590
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
V +++ ++S + + L + + + T+ +I P TE F ++ S N
Sbjct: 591 VSDNSSEIVITGENSKEMVDTLNRKYLPFAV--TISNISPELTEIAPFVGDYKSQNG--- 645
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K A VC+NFSC PVT P L +L
Sbjct: 646 -------KTAAYVCRNFSCMEPVTQPEKLSEVL 671
>gi|15607089|ref|NP_214471.1| hypothetical protein aq_2146 [Aquifex aeolicus VF5]
gi|2984353|gb|AAC07873.1| hypothetical protein aq_2146 [Aquifex aeolicus VF5]
Length = 692
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 247/704 (35%), Positives = 370/704 (52%), Gaps = 64/704 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFED +A++LN++FV IKVDREERPD
Sbjct: 30 GEEAFKKAKEEDKPIFLSIGYSTCHWCHVMEKESFEDPEIAEILNNYFVPIKVDREERPD 89
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD YM+ QA+ G GGWPL++ ++PD +P GTY P E +GRPG + +L +++ W+
Sbjct: 90 VDAFYMSVCQAMTGTGGWPLTIIMTPDKEPFFAGTYIPKEGMFGRPGLRDLLLTIRELWE 149
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K R + + ++ L EA + + ++ + + +L SYD FGGFGSA
Sbjct: 150 KDRTKILNTAKHLVKALQEASRETQKA-----QIGEETIHRAFSELFSSYDEHFGGFGSA 204
Query: 179 PKFPRPVEIQMM--LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
PKFP P + + Y+ K E + KM+ TL M GGI+DHVG GFHRY
Sbjct: 205 PKFPTPHNLMFLGRYYYRYKRE---------QALKMIEKTLTNMRMGGIYDHVGFGFHRY 255
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
S D W +PHFEKMLYDQ L Y + + L K + +I+D+L+RDM+ P G +
Sbjct: 256 STDREWILPHFEKMLYDQAMLLFAYTEGYQLLKKDLFKQTVYEIVDFLKRDMLSPEGAFY 315
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSD 355
SA DADS EG +EG FY W+ +E++++L E L + + L GN + +
Sbjct: 316 SAWDADS---EG----EEGKFYTWSFEELKEVLDPEELELAVKVFNLSQEGNY----LEE 364
Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
G+NVL A +LG+ ++ L R+KLF+ R KR +P D+K++
Sbjct: 365 ATKVKTGRNVLYIGKSYEELAKELGISEKELKEKLERIRKKLFEAREKRVKPLRDEKILT 424
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
WNGL I++ + A K+ KE++++A+ AA F+ +++ E
Sbjct: 425 DWNGLTIAALSYAGKVF----------------GEKEWIDLAKGAADFVLKNMRTENG-L 467
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L H + G +K GFL+DYA+ I GL++LYE +K+L I+LQ Q + F D+E GG
Sbjct: 468 LLHRYMEGEAKYWGFLEDYAYFIWGLMELYEATLDSKYLEEVIKLQEIQIKHFWDKENGG 527
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
+F T + +R KE +DGA PSGNSVS NL+RL +++ S+ Y + +L F
Sbjct: 528 FFQTPDFFTEIPVRKKEVYDGAIPSGNSVSAYNLIRLGRLISRSE---YEKYGTKTLEAF 584
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
+ + A A D++ V K +V+V S + N+ A Y + ++
Sbjct: 585 SWEIANFPSAHTFSIIALDLI-VNGTKELVIVPTDDS--WRNLKAQLDKEYLPDLLILKK 641
Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPP 699
D E S N + K +C+N++C P
Sbjct: 642 DKVI--------EKLSENLEQMKP--VEGKTTYYLCRNYTCESP 675
>gi|440792869|gb|ELR14077.1| Hypothetical protein ACA1_367000 [Acanthamoeba castellanii str.
Neff]
Length = 865
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 255/689 (37%), Positives = 353/689 (51%), Gaps = 104/689 (15%)
Query: 36 EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYF 95
E +++LLND FVSIKVDREERPDVD++YMTYV A G GGWPLSVFL+PDLKPL+GGTYF
Sbjct: 265 EKISRLLNDNFVSIKVDREERPDVDRLYMTYVTATTGHGGWPLSVFLTPDLKPLVGGTYF 324
Query: 96 PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS-ASASSNKLPDELPQ 154
PP KYGRPGF T++ V W +K+D L L E ++ A + D+ +
Sbjct: 325 PPTSKYGRPGFDTLIHNVDKVWREKQDQLKAEADNTAHALQEYMTVAGKEVEGIDDDSIE 384
Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMV 213
A + L++SYD GGF APKFPR + + + + E + +A++ M
Sbjct: 385 IAYDAALKSLAESYDEEHGGFTRAPKFPRLATLNFLFRVYGHRKEGLELNEKATKAMDMA 444
Query: 214 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 273
L TL MA+GGI+DH+G W VPHFEKMLYDQ QL YL A+ +T + +
Sbjct: 445 LVTLTKMARGGIYDHIGN----------WLVPHFEKMLYDQSQLTMAYLSAYQITDEPVF 494
Query: 274 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 332
+ + D+L+Y+ + P G +SAEDADS + + K EGAFYVW EV LGE
Sbjct: 495 ADVAEDVLEYVTTKITSPEGAFYSAEDADSLVSPDSDEKVEGAFYVWEYDEVIKALGEQD 554
Query: 333 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 392
+F Y + P GN + +D E K KNVL E + +A + G ++ + E
Sbjct: 555 GKIFAHRYGVLPEGN--VPAPADIQGELKHKNVLAEKLTAEETALEFGFKVDYVDKLTME 612
Query: 393 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 452
+ KL R KRPRPHLDDK+I SWNGL+IS++ARAS++L K
Sbjct: 613 SKAKLKHERDKRPRPHLDDKIITSWNGLMISAYARASEVLGD----------------KR 656
Query: 453 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 512
Y E A A FIR LYD+Q +
Sbjct: 657 YAESASKCAQFIRDQLYDDQ---------------------------------------E 677
Query: 513 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 572
++WA + GYFNT +DPS+L RV++D DGAEPS NS+S +NLVRL
Sbjct: 678 AILWARQ--------------RGYFNTVKDDPSLLARVRDDQDGAEPSSNSISAMNLVRL 723
Query: 573 ASIVAGSKSDYYRQNAEHSLA------VFETRL-----KDMAMAVPLMCCAADMLSVPSR 621
+ SD + + AE + + + RL KD + VP M C+ D S +
Sbjct: 724 WHMTG---SDDWYKKAEATFSSCKGPIITPLRLTVCPAKDAPLMVPQMLCSLD-FSRATA 779
Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNF 681
K +V+ G ++ D +L + + N+ +++ D E DF + + M +
Sbjct: 780 KQIVIAGDPNAEDTAALLKEVRSQFIPNRVLLYAD--GREGQDFLSSYRALIKDMKPIDG 837
Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC+NF+C P P L + L
Sbjct: 838 AA---TAYVCENFTCKLPTNKPEKLRDAL 863
>gi|269836164|ref|YP_003318392.1| hypothetical protein Sthe_0131 [Sphaerobacter thermophilus DSM
20745]
gi|269785427|gb|ACZ37570.1| protein of unknown function DUF255 [Sphaerobacter thermophilus DSM
20745]
Length = 685
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 255/686 (37%), Positives = 363/686 (52%), Gaps = 68/686 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME ESFE+ +A L+N F++IKVDREERPD+D VYM Q + G GGWPL++
Sbjct: 49 ACHWCHVMERESFENPDIAALMNQHFINIKVDREERPDLDTVYMAAAQMMTGQGGWPLTI 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL PD KP GTYFPPED+ G PGF +L V +A+ +R L ++ L+E
Sbjct: 109 FLMPDGKPFYAGTYFPPEDRSGMPGFPRVLLAVAEAYRNRRADLERAANDIQGHLTEHFR 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLED 199
S + L L A L++ +D GGFG APKFP P+ ++ +L Y + D
Sbjct: 169 WSLPETAITPAL----LNEAASGLARQFDEANGGFGGAPKFPPPMALEFLLRYRLRTGSD 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T ++V TL+ MA+GGIHD VGGGFHRY+VD W VPHFEKMLYD LA
Sbjct: 225 TAL--------RIVELTLERMARGGIHDQVGGGFHRYAVDATWLVPHFEKMLYDNALLAR 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y + T FY+ D ++Y+ R+M P G +S +DADS EG +EG FYV
Sbjct: 277 LYTLTYQATGHPFYAATALDTIEYVLREMTSPDGGFYSTQDADS---EG----EEGKFYV 329
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +E+E +LG E A + +Y + P GN F+GK++L + A+
Sbjct: 330 WTPEELEAVLGPEQAPIVARYYGVHPGGN------------FEGKSILHVPEAPESVAAA 377
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ +++ + I+G R KL+ R++R P D+K++ WNGL++ + A+A+ L
Sbjct: 378 FDLTIDELVEIIGPAREKLYAARAQRVWPGRDEKILTDWNGLMLRALAQAAIALG----- 432
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
R + + A A+F+ HLY + RL HS+++G +K G+L DYA LI
Sbjct: 433 -----------RSDLRDAAVRNATFLHTHLY--RDGRLLHSYKDGEAKITGYLADYASLI 479
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+GLL LYE +W+ WA +L + F D EGG +F+T+ +D ++ R K+ D A
Sbjct: 480 AGLLALYEATFDVRWIAWARDLTDRAIADFWDNEGGAFFDTSADDAPLVARPKDAFDSAT 539
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---MCCAADM 615
PSGNS+ +L+RL + D YRQ A + V E R +A P A
Sbjct: 540 PSGNSLMAESLLRLGLL---LGEDDYRQRA---MTVLE-RFAALAAKAPTGFGQLLCAAD 592
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L++ + LVG MLA Y L V+ + D + D E
Sbjct: 593 LALAEAHEIALVGDPQVPAMAEMLAVVQQPY-LPHQVVALRHPDQDGED--EVIPLLAGR 649
Query: 676 MARNNFSADKVVALVCQNFSCSPPVT 701
AR+ + A VC+N++C PVT
Sbjct: 650 TARDG----QPTAYVCRNYACRQPVT 671
>gi|406878261|gb|EKD27217.1| hypothetical protein ACD_79C00804G0001 [uncultured bacterium]
Length = 713
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 249/701 (35%), Positives = 370/701 (52%), Gaps = 66/701 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESF + +A +LN F+SIKVDREERPD+D VYM VQ + G GGWPL+V
Sbjct: 55 TCHWCHVMEEESFSGKTIADILNRDFISIKVDREERPDIDSVYMNAVQKMTGSGGWPLNV 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++PD K GGTYF PE K IL ++D W KR+ + + + ++E
Sbjct: 115 FITPDKKIFYGGTYFAPEQ------LKIILSSIEDLWKNKREKILKPSEELMNLMNEETL 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A + ++ D + A Q YDS +GGFG+ PKFP +L + + ++
Sbjct: 169 ARNHTTEVSDVVFNTAFEFLLSQ----YDSMYGGFGTFPKFPSSQTFSFLLRYYYRTKN- 223
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+MV ++ + GGI+D +G G HRYS D++W +PHFEKMLYDQ + V
Sbjct: 224 ------KTALEMVKNSISHILDGGIYDQLGSGIHRYSTDQKWFLPHFEKMLYDQALITKV 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAFYV 319
+L+ + +T++ Y+ RDIL+++ R+M P G +SA DADS E + +K EGAFY+
Sbjct: 278 FLEIYQITREEKYAEAARDILEFVLREMTSPEGVFYSALDADSFNNDENSVKKTEGAFYI 337
Query: 320 WTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W KE+ ILG +F +Y ++ GN +D H EF KNVL N+ + +A
Sbjct: 338 WEKKEIIRILGNKTGEIFCYYYGIQEDGNVS----NDSHGEFIRKNVLAVSNNLTNTAKH 393
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
M ++ N L + LF R KRP+P LDDK++ WN L+IS+FA+ IL
Sbjct: 394 FNMQHKEIENELNRSHQLLFHSREKRPKPFLDDKILTDWNALMISAFAKGGLIL------ 447
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ Y+ + ++A+F+ L E+ L H +R+ + PGFLDDYAF I
Sbjct: 448 ----------NEPRYVNASINSANFVLSRLKTEKG-TLLHRYRDQIAGIPGFLDDYAFFI 496
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT-TGEDPSVLLRVKEDHDGA 557
+ LLDLYE +L A+ L + ELF D+ GG+F T G + + R+KE +DGA
Sbjct: 497 NSLLDLYEATFEGIYLKEALALNDKMLELFEDKVNGGFFLTAVGTETILQNRIKEFYDGA 556
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNS+++INL++L+ I ++ + +Q+++ S+ L A LM A S
Sbjct: 557 YPSGNSIALINLIKLSRI---TQKNILKQSSKKSIDFISEALSKFPTAY-LMSLIALNNS 612
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ +V+V + S + + +Y +IH F HN N +
Sbjct: 613 LEPENEIVIVSNDSKDS-----SVSQINY-----LIHRFYLSGWSFLF---HNMNENDII 659
Query: 678 -------RN-NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
RN +DK VC++ C PP+TD + +L
Sbjct: 660 LSIVPRIRNYALISDKTTIYVCKDNICQPPITDIGRFQEIL 700
>gi|308069056|ref|YP_003870661.1| hypothetical protein PPE_02290 [Paenibacillus polymyxa E681]
gi|305858335|gb|ADM70123.1| Conserved hypothetical protein [Paenibacillus polymyxa E681]
Length = 688
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 257/696 (36%), Positives = 358/696 (51%), Gaps = 66/696 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE VA++LN +VSIKVDREERPDVD +YM+ Q + G GGWPL+
Sbjct: 53 STCHWCHVMGRESFEDEEVAEVLNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEA 138
+ ++PD KP GTY P E K+GR G +L KV W ++ + L + S E +
Sbjct: 113 ILMTPDQKPFFAGTYLPKEQKFGRVGLLELLDKVGTRWKEQPEELVELSEQVLTEHERQD 172
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L A EL + +L + S ++D +GGFG APKFP P + +L +++
Sbjct: 173 LLAGYRG-----ELDEQSLNKAFHEYSHTFDKEYGGFGEAPKFPSPHNLSFLLRYAQH-- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG + +M TL M++GGI+DH+G GF RYSVDE+W VPHFEKMLYD LA
Sbjct: 226 -TGN----QQALEMAEKTLDAMSRGGIYDHIGMGFSRYSVDEKWLVPHFEKMLYDNALLA 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ +T Y I I YL RDM GG +SAEDADS EG +EG FY
Sbjct: 281 IAYTEAWQMTGKELYRRITEQIFTYLARDMTDAGGAFYSAEDADS---EG----EEGRFY 333
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
VW EV +LG E A F + Y + P GN F+G N+ LI++N A
Sbjct: 334 VWDDSEVRAVLGDEDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAY 380
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
K + ++ + E R KLF R +R PH DDK++ SWNGL+I++ A+A +
Sbjct: 381 GIKHDLTEQELEQRVSELRAKLFAAREQRVHPHKDDKILTSWNGLMIAALAKAGQ----- 435
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
G R Y E A A +F+ HL E RL +R+G + PG++DDY
Sbjct: 436 ---------AFGDMR--YTEQARKAETFLWNHLRQENG-RLLARYRDGEAAYPGYVDDYV 483
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F + GL++LY+ +L A+ L +LF D E G F + ++ + KE D
Sbjct: 484 FYVWGLIELYQATFDIVYLQRALTLNQNMIDLFWDEERDGLFFYGSDSEQLIAKPKEIDD 543
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNS++ N VRLA + S+ + Y A F + + A +
Sbjct: 544 GAIPSGNSIAAYNFVRLARLTGESRLENY---AAKQFKAFGGMVAHYPSGHSALLSAL-L 599
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ + K +V+VGH+ + A A + N VI D +E + S
Sbjct: 600 YATGTTKEIVIVGHRDDPQTGQFIRAVRAGFRPNTVVILKDEGQSE--------IAETVS 651
Query: 676 MARN-NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
R+ + K VC++F+C PVT L+ LL
Sbjct: 652 YIRDYDLVEGKPAVYVCEHFTCQAPVTRLEDLKVLL 687
>gi|403068246|ref|ZP_10909578.1| hypothetical protein ONdio_01469 [Oceanobacillus sp. Ndiop]
Length = 685
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 268/720 (37%), Positives = 372/720 (51%), Gaps = 77/720 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + FL +TCHWCHVM ESFED VA+LLN ++SIKVDREERPD
Sbjct: 31 GKEAFERAKLENKPIFLSIGYSTCHWCHVMAHESFEDPEVAELLNAHYISIKVDREERPD 90
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYM Q + G GGWPL++ ++PD P GTYFP E K+G PG L ++ +
Sbjct: 91 IDSVYMKVCQMMTGHGGWPLTIMMTPDKVPFYAGTYFPKESKHGMPGILEALSQLHKKYT 150
Query: 119 KKRDMLAQSGAFAIEQLSEALSASA---SSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
K D +A+ E ++ AL S S N+L E + A R QL+K++D +GGF
Sbjct: 151 KDPDHIAE----VTESVTAALQKSVTEKSENRLTSESTEKAYR----QLAKNFDFSYGGF 202
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
G APKFP+P + +L H +T KMV TLQ MA GGI DH+G GF R
Sbjct: 203 GPAPKFPQPQNLFFLLKHYHFTGNTS-------ALKMVESTLQSMASGGIWDHIGYGFSR 255
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
YS DE+W VPHFEKMLYD L VY + + +TK+ FY I I+ ++ R+M G
Sbjct: 256 YSTDEKWLVPHFEKMLYDNALLLMVYTECYQITKNPFYRQISEQIIAFVSREMTSSDGAF 315
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMS 354
+SA DADS EG EG +YVW ++E+ D+LGE L+ + Y + P GN
Sbjct: 316 YSAIDADS---EGI----EGKYYVWRNEEIYDVLGEELGELYSDIYGITPFGN------- 361
Query: 355 DPHNEFKGKNVLIELNDS-SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
F+GKN+ +N S +A GM L + L R KL R KR PH+DDKV
Sbjct: 362 -----FEGKNIPNLINTSLEKTAKDNGMSLANLHSHLETARSKLLLAREKRTYPHVDDKV 416
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
+ +WNGL++++ A+A K L ++ Y+E A A FI + LY Q
Sbjct: 417 LTAWNGLMVAALAKAGKALANDT----------------YIEKANRAIQFIEKKLY--QG 458
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
+RL FR+G +K ++DDYAFL+ G ++LYE T++L A+ L ELF D
Sbjct: 459 NRLMARFRDGEAKFKAYIDDYAFLLWGYIELYEATYSTEYLQKAMALIEQMTELFWDEAN 518
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
GG++ + ++ + KE +DGA PSGNS + + L R+A + + Y E
Sbjct: 519 GGFYFNGKDSEELISKEKEIYDGAIPSGNSTAALMLTRMAYLTGETA---YLDKTEEMYF 575
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
F A A + + P+ K VV++G + +LA +Y N TV+
Sbjct: 576 TFYEDTHQYASASAFFMQSLFVTENPA-KEVVILGRSDDPARQKLLAKLQEAYIPNVTVL 634
Query: 654 HID--PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPIS-LENLL 710
D A F E+ N D VC+NF+C P TD S L+N+L
Sbjct: 635 AADHPSAFAVVAPFAAEYKQLN----------DSTTIYVCENFTCQQPTTDIDSALKNIL 684
>gi|237755775|ref|ZP_04584378.1| thymidylate kinase [Sulfurihydrogenibium yellowstonense SS-5]
gi|237692063|gb|EEP61068.1| thymidylate kinase [Sulfurihydrogenibium yellowstonense SS-5]
Length = 686
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 250/687 (36%), Positives = 353/687 (51%), Gaps = 65/687 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME ESFEDE VAK+LN+ +VSIKVDREERPD+D +YM G GGWPL+
Sbjct: 51 SSCHWCHVMEKESFEDEEVAKILNENYVSIKVDREERPDIDSIYMNVCLMFNGSGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTYFP + GR G +L V + W ++ L Q IE L +
Sbjct: 111 IIMTPDKKPFFAGTYFPKYSRPGRIGLVDLLTSVAEYWKNNKEDLIQRAEKVIEYLKDDF 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKK 196
+ DE+ ++ + C L +D +GGF PKFP P I +L YH+K+
Sbjct: 171 KG------IYDEISKDIIDACYFDLKSRFDREYGGFSIKPKFPTPHNIMFLLRYYYHTKE 224
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+E KM TL M GG++DH+G GFHRYS D W +PHFEKMLYDQ
Sbjct: 225 ----------TEALKMAEKTLINMRLGGMYDHIGFGFHRYSTDREWLLPHFEKMLYDQAM 274
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ LTK+ FY ++ + Y+ RDM G +S+EDADS EG +EG
Sbjct: 275 LTMAYTEAYQLTKNNFYKKTAQETITYVLRDMTSKEGVFYSSEDADS---EG----EEGK 327
Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY WT E++++L + + L + + +K GN + + G+N+L
Sbjct: 328 FYTWTIDELKEVLNDEELSLVIKVFNVKEEGN----YLEEATGHLTGRNILYLKKPIREL 383
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ L M ++ L E RRKLFD R KR P DDKV+ WNGL+IS+ A+A K
Sbjct: 384 ANDLNMNQDQLEAKLEEIRRKLFDAREKRVHPQKDDKVLTDWNGLMISALAKAGK----- 438
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
G + K+ +E A+ AA FI ++ T L H +++G K G LDDY
Sbjct: 439 -----------GFEDKDLIEKAKVAADFILNTMFKNDT--LYHLYKDGEIKVEGLLDDYT 485
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F GL++L E K+L A++L + E F D E GG+F + V++R KE D
Sbjct: 486 FFSWGLIELCEATGDIKYLKSALKLTDLMIEKFYDFENGGFFLSPKNSKDVIVRPKEAFD 545
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSVS NL RL I K Y A +L F +K + + +
Sbjct: 546 GAIPSGNSVSAYNLYRLYLISGNEK---YYNFAIETLKAFGGEIKRLPSYHSMFNIVLML 602
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ P+ + VVL G + E +L + + NK ++ ++ + E+
Sbjct: 603 VFYPTSE-VVLAG-----NCEKVLDKINTEFIPNKAIVFLNREN-------EKQIKELIP 649
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
N +D+ VC+NFSC+ P D
Sbjct: 650 YTNNMILSDECDIYVCKNFSCNLPTKD 676
>gi|423680595|ref|ZP_17655434.1| hypothetical protein MUY_00405 [Bacillus licheniformis WX-02]
gi|383441701|gb|EID49410.1| hypothetical protein MUY_00405 [Bacillus licheniformis WX-02]
Length = 681
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 260/691 (37%), Positives = 368/691 (53%), Gaps = 75/691 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE VAKLLN+ FVSIKVDREERPDVD +YMT Q + G GGWPL+
Sbjct: 49 STCHWCHVMAHESFEDEEVAKLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLN 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD KP GTYFP ++ RPGF +++++ D + K R+ + E+ + L
Sbjct: 109 VFLTPDQKPFYAGTYFPKTSRFNRPGFVEVVKQLSDTFAKNREHVEDIA----EKAANNL 164
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLE 198
A S+ D L ++ LR +QL S+D+ +GGFGSAPKFP P + +L YH
Sbjct: 165 RIKAKSDA-GDSLGEDILRRTYQQLINSFDAAYGGFGSAPKFPIPHMLTFLLRYHQ---- 219
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
SGE + V+ TL MA GGI+DHVG GF RYS D+ W VPHFEKMLYD L
Sbjct: 220 ---YSGEEN-ALYSVMKTLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNALLL 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ +TK+ Y I I+ ++RR+M G +SA DAD TEG EG +Y
Sbjct: 276 IAYTEAYQITKNERYKQISEQIITFVRREMTDEKGAFYSALDAD---TEGV----EGKYY 328
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN----VLIELNDSS 373
VW+ +EV + LG E L+ Y + GN F+G N + L D
Sbjct: 329 VWSKEEVLETLGDELGELYCAVYNITQEGN------------FEGHNIPNLIYTRLEDIK 376
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
+ + E+ N L E R KLF+ R +R PH+DDKV+ SWN L+I+ A+A+K+
Sbjct: 377 ---DEFALTDEELQNKLEEARTKLFEKRQERTYPHVDDKVLTSWNALMIAGLAKAAKV-- 431
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+N P EY+E+A +AA FI L Q R+ +R+G K GF+DD
Sbjct: 432 -------YNAP-------EYLEMARAAAEFIENKLI--QDGRIMVRYRDGEVKNKGFIDD 475
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAFL+ ++LYE L A +L+ LF D E GG++ T + ++++R KE
Sbjct: 476 YAFLLWAYIELYEASLDLTDLRKAKKLEADMKGLFWDEEHGGFYFTGSDAEALIVRDKEV 535
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
+DGA PSGN V + L RL + G S A A F +
Sbjct: 536 YDGALPSGNGVLAVQLSRLGRLT-GDLS--LHDQAAKMFAAFHGDVSAYPSGHTNFLQGL 592
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE--MDFWEEHNS 671
+P +K +V++G ++ D + +++A ++ N V+ + D + DF E+ +
Sbjct: 593 LSQFMP-QKEIVVLGKRNDPDRQKIVSALQQAFQPNYAVLAAESPDDFKGIADFAAEYKA 651
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ +K +C+NF+C P T+
Sbjct: 652 VD----------NKTTVYICENFACRQPTTN 672
>gi|336113948|ref|YP_004568715.1| hypothetical protein BCO26_1270 [Bacillus coagulans 2-6]
gi|335367378|gb|AEH53329.1| protein of unknown function DUF255 [Bacillus coagulans 2-6]
Length = 629
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 257/698 (36%), Positives = 365/698 (52%), Gaps = 81/698 (11%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+E VA++LN+ FV+IKVDREERPD+D +YM Q + G GGWPLSVFL+P+
Sbjct: 1 MERESFENEEVARILNEKFVAIKVDREERPDIDAIYMLVCQMMTGQGGWPLSVFLTPEKV 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP E +YG PGFK +L + + + D + G Q+ +AL AS +
Sbjct: 61 PFYAGTYFPRESRYGMPGFKEVLHYLSQQYTENPDRIKDVGT----QVKQALEASREKGE 116
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
L + + +++D R+GGFG APKFP P + +L ++K E+ A+
Sbjct: 117 -QTALTKETTGRAFQTYKQAFDPRYGGFGKAPKFPMPHSLVFLLMYAKFYENRDALAMAT 175
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
+ TL +A+GGI+DH+G GF RYSVDE++ VPHFEKMLYD LA Y DAF +
Sbjct: 176 K-------TLDGLARGGIYDHIGYGFSRYSVDEKFLVPHFEKMLYDNALLALAYTDAFRM 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
TK+ Y I +I+ Y+ RDM P G +SAEDADS EG +EG FYVWT KEV+D
Sbjct: 229 TKNARYKKITEEIIKYVLRDMAHPDGGFYSAEDADS---EG----EEGKFYVWTPKEVKD 281
Query: 328 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEK 385
+LGE LF + Y + GN F+GKN+ ++ + A K G
Sbjct: 282 VLGEQLGTLFCQAYGITGQGN------------FEGKNIPNQITTHLETIAKKEGFSPAA 329
Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
L R+ LF R KR RP DDK++ +WNGL+I++ A+A ++ +
Sbjct: 330 LAEKLETARQSLFQHREKRVRPFRDDKILTAWNGLMIAALAKAGRVFYQPS--------- 380
Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
Y++ AE A SFIR +L Q R+ +R+G K GF+D+YAFL+ G ++LY
Sbjct: 381 -------YVQAAEKAVSFIRDNLI--QNGRIMVRYRDGEVKNKGFIDEYAFLLWGYMELY 431
Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
E +L A L +LF D GGG+F + +D +L+R KE +DGA PSGNSV+
Sbjct: 432 ESTFAPFYLAEAKRLAGNMIDLFWDEHGGGFFFSGNDDEPLLVRQKESYDGALPSGNSVA 491
Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
L+RLA + + + + F + D A +M A M + + K VV
Sbjct: 492 ACQLLRLAKLTGDFTLE---EKVQQMFQAFSKVIHDDPNAHAMMMQAV-MYAQQATKEVV 547
Query: 626 LV---GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR-NNF 681
+V + +VDF + HI E+ F +++ F
Sbjct: 548 IVMDDETEKAVDF----------------IRHIQENFHPEISFMAVKRREKKKLSKIAPF 591
Query: 682 SAD------KVVALVCQNFSCSPPVTDPISLENLLLEK 713
D + VC+NFSC+ P D + +LL +K
Sbjct: 592 IEDYAMINGQPTIYVCENFSCNQPTNDFQTARDLLFKK 629
>gi|376259602|ref|YP_005146322.1| thioredoxin domain-containing protein [Clostridium sp. BNL1100]
gi|373943596|gb|AEY64517.1| thioredoxin domain protein [Clostridium sp. BNL1100]
Length = 673
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 265/695 (38%), Positives = 364/695 (52%), Gaps = 80/695 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA +LN F+ IKVDREERPD+D +YM+ QAL G GGWPL+
Sbjct: 54 STCHWCHVMERESFEDEDVAHILNRDFICIKVDREERPDIDSIYMSVCQALTGHGGWPLT 113
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFP ED G G ++L VK+AWD KRD L +S IE +S+
Sbjct: 114 VFLTPDRQPFYAGTYFPKEDSRGFMGLMSLLGSVKEAWDNKRDKLLESAKSIIEHVSQ-- 171
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
K+ DE + ++ + + ++DS++GGFG++PKFP P + +L +
Sbjct: 172 ------EKVSDEAKISKDIIHEAFKHFKYNFDSKYGGFGTSPKFPSPHTLLFLL----RY 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
T K A E MV TL+ M GGI DH+G GF RYS D++W VPHFEKMLYD L
Sbjct: 222 WYTEKEPFALE---MVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALL 278
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A Y +AFS T + Y R ILDY++RDM G +SAEDADS EG EG F
Sbjct: 279 AIAYGEAFSATGNKNYEETARQILDYVQRDMTSQFGAFYSAEDADS---EGV----EGKF 331
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
Y+W+ +E D+LG E Y C L ++ N F+G N+ +N
Sbjct: 332 YIWSREEAIDVLGSKD---AEEY-------CRLFDITSSGN-FEGLNIPNLINS------ 374
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
G E+ + +CR+KLF R KR P+ DDKV+ SWNGL+ ++ A +I
Sbjct: 375 --GTLTEQQKSFAEDCRKKLFSHREKRIHPYKDDKVLTSWNGLMTAAMAYCGRIF----- 427
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
G DR Y+E A+ FI + L RL +R+G + P +L+DYAFL
Sbjct: 428 ---------GEDR--YIESAKRCVDFIYKKLI-RTDGRLLARYRDGEAVFPAYLEDYAFL 475
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ GLL+LYE T +L A++L + LF + G F + ++ R +E +DGA
Sbjct: 476 VWGLLELYEATFTTIYLKRALKLTDAMLNLFGENNSAGLFLYGHDSEQLISRPRESYDGA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ +NL+RLA I + Y A+ + F +++ M C+
Sbjct: 536 IPSGNSVAAMNLLRLARITGHHE---YENRAKAIMDFFSNQVEVAPTGHSYMLCSYMYSV 592
Query: 618 VPSRKHVVLVGH--KSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
VV+ G K VD N A + +I P TE + ++ + N
Sbjct: 593 SDVSSEVVIAGANGKELVDTINRKYLPFAV-----AISNISPELTEIAPYVGDYKAQNG- 646
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K A VC+NFSC P+T+ L +L
Sbjct: 647 ---------KTAAYVCRNFSCMEPITEAEKLAEVL 672
>gi|440784088|ref|ZP_20961509.1| thioredoxin domain-containing protein [Clostridium pasteurianum DSM
525]
gi|440219124|gb|ELP58339.1| thioredoxin domain-containing protein [Clostridium pasteurianum DSM
525]
Length = 679
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 252/714 (35%), Positives = 365/714 (51%), Gaps = 69/714 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVM ESFEDE VA++LN +FV+IKVDREERPD
Sbjct: 32 GEEAFNKADRENKPVFLSVGYSTCHWCHVMNRESFEDEEVAEILNKYFVAIKVDREERPD 91
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +YM+ QA+ G GGWPL++ ++ + KP GTY P +KYG+ G +L KV W
Sbjct: 92 IDNIYMSVCQAITGSGGWPLTIIMTAEKKPFFAGTYLPKIEKYGQIGIIELLDKVNTMWI 151
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+K+D L +S ++ L K+ +++ A L +YD FGGF +
Sbjct: 152 QKKDKLLESSNNIVDFLQN--DTVDKKGKINEDIIDEAYN----SLKNAYDPVFGGFSDS 205
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L + K D E +MV TL M GGI DH+G GF RYSV
Sbjct: 206 PKFPIPHNLSFLLRYYKIKGD-------REALQMVENTLDSMYSGGIFDHIGFGFARYSV 258
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D +W VPHFEKMLYD LA VY + + +T Y I + I DY RDM G +SA
Sbjct: 259 DSKWLVPHFEKMLYDNALLAIVYTETYQITHKNRYKEIVQKIFDYTLRDMTNEDGGFYSA 318
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADS EG EG FY+W E+E+IL E A LF +Y +K GN
Sbjct: 319 EDADS---EGV----EGKFYLWDKSEIENILEEDADLFNSYYNIKSKGN----------- 360
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
F+G+N+ + + N + R KLF+ R KR PH DDK++ +WN
Sbjct: 361 -FEGRNIPNLIGEDLEELENEETK-----NKINRLREKLFNYREKRVHPHKDDKILTAWN 414
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL+I++ A A K+ K EA A+ A+ FI +L D + RL
Sbjct: 415 GLMIAAMAYAGKVFKIEAYKKA----------------AKKASDFILANLIDNRG-RLLC 457
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
+R+G + GFLDDYAF + GL++LYE +L A++L + F D E G+F
Sbjct: 458 RYRDGETGNVGFLDDYAFFVFGLIELYEATFEVHYLKKAVDLNGEMIKYFWDEENSGFFF 517
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
+ ++L+ KE +DGA PSGNSV+ +NL+RL+ I + + + ++F +
Sbjct: 518 YGKDSEELILKTKEIYDGALPSGNSVAAMNLIRLSRITGDVQLE---EKVAEIFSLFSEK 574
Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
+ + + A +VP H+V+ G K V+ + ++ + + L +V+ D +
Sbjct: 575 INKVPLGYINTISAFLTNTVPDI-HIVIAGDKDDVNTKTLIDEINKRFLLFASVVFNDES 633
Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
D E + + N +K A VC+N +C PV D +L+ E
Sbjct: 634 D--------ELSKLIPYIEDNKVVNNKATAYVCKNKACLTPVNDVKEFMDLIEE 679
>gi|94985364|ref|YP_604728.1| hypothetical protein Dgeo_1263 [Deinococcus geothermalis DSM 11300]
gi|94555645|gb|ABF45559.1| protein of unknown function DUF255 [Deinococcus geothermalis DSM
11300]
Length = 678
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 248/697 (35%), Positives = 351/697 (50%), Gaps = 68/697 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED A+ +N FV+IKVDREERPDVD VYMT Q + G GGWP++
Sbjct: 47 STCHWCHVMAHESFEDPSTAEFMNKHFVNIKVDREERPDVDSVYMTATQLMTGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD KP GTYFPPED+YG PGF+ +L V AW + RD L + + L+E +
Sbjct: 107 VFLTPDGKPFYAGTYFPPEDRYGMPGFRRLLASVAQAWAQDRDKLTGNA----QTLTEHI 162
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
++ + +LP + LR + L + YD+ GGFGSAPKFP P + +L
Sbjct: 163 REASRPRRGAGDLPTDFLRRGVDNLRRVYDADLGGFGSAPKFPAPTTLDFLLTQ------ 216
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
EG+ M L TL+ M +GGI+D +GGGFHRYSVDERW VPHFEKMLYD QL
Sbjct: 217 -------PEGRDMALHTLRMMGRGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLTR 269
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
L A+ T D ++ + R+ L YL R+M+ P G FSA+DAD+ EG T +
Sbjct: 270 TLLRAWQFTGDPTFTRLARETLAYLEREMLAPQGGFFSAQDADTQGVEGLT-------FT 322
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASK 378
WT +E+ ++LG L+ G + +DPH E+ +NVL L + A
Sbjct: 323 WTPQEIREVLGAGP---DTDLVLRVYGVTEEGNFADPHRPEYGRRNVLHVLTPPAELARD 379
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG E L RRKL R +RP+P D KV+ SWNGL +++FA A +IL
Sbjct: 380 LGESAEALSARLDAARRKLLTAREQRPQPGTDRKVLTSWNGLALAAFADAGRILGE---- 435
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
Y+E+A A F+R+HL L+H++++G ++ G L+D+A
Sbjct: 436 ------------GHYLEIARRNADFVRQHLRLPDGT-LRHTYKDGEARVEGLLEDHALYG 482
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL+ LY+ G L WA EL F D E G + +T G ++L R + D A
Sbjct: 483 LGLVALYQAGGDLAHLAWARELWGIVRRDFWDGEAGLFRSTGGRAETLLTRQAQGFDAAV 542
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
S N+ + + + ++ +++ + A ++ ++ + A + AA L+
Sbjct: 543 LSDNAAAALLGLWISRYFGDEEAE---RLARATVRTYQADMLAAAGGFGGLWQAAAFLAA 599
Query: 619 PSRKHVVLVGHKSS-VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
P + V L+G + E ++A + I PA EH +
Sbjct: 600 P-QVEVALIGTPAERAPLERVVARFPLPF------AAIAPA---------EHGEGLPVLE 643
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
A VC +C P DP L L P
Sbjct: 644 GRPGGG---TAYVCVGHACDLPTRDPEVLAGQLERLP 677
>gi|293376087|ref|ZP_06622338.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
gi|292645289|gb|EFF63348.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
Length = 672
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 249/693 (35%), Positives = 360/693 (51%), Gaps = 73/693 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA LN+ F+SIKVDREERPD+D VYM+ QAL G GGWPL+
Sbjct: 51 STCHWCHVMEHESFEDEDVATYLNEHFISIKVDREERPDIDTVYMSICQALTGQGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+F++P + GTYFP +YGRPGF +L+ + W+ R + +
Sbjct: 111 IFMTPTQQAFYAGTYFPKTSRYGRPGFLDVLKNIDFNWNHHRAKVTDITKQIESHFKDLE 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ L + QN + QL +SYD RFGGFG+APKFP P ++ +L + ++ +D
Sbjct: 171 GIETEGDSLSMAIIQNGVN----QLKQSYDPRFGGFGTAPKFPTPHKLMFLLRYDEQTKD 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
Q MV TL M KGGI DH+G GF RYS DE W VPHFEKMLYD L
Sbjct: 227 KSV-------QDMVTQTLDHMYKGGIFDHLGYGFSRYSTDEIWLVPHFEKMLYDNALLMI 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y I +Y+ + P G + AEDADS EG +EG FYV
Sbjct: 280 SYTEAYQVTREPRYLSIAMQTAEYVLTQLTSPEGGFYCAEDADS---EG----EEGKFYV 332
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
+T E+ ILG E F E Y + GN F+GKN+L L+
Sbjct: 333 FTPAEIIQILGHEKGHWFNEFYNVTEEGN------------FEGKNILNRLHHKK----- 375
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LE + L CR L R +R H DDK++ SWNGL+I++FA+
Sbjct: 376 ----LELDIKELEACRETLLTYRLERTHLHKDDKILTSWNGLMIAAFAK----------- 420
Query: 439 AMFNFPVVGSDRKE-YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ G +K Y++ A A FI++HL+DE RL +R G S +LDDYAFL
Sbjct: 421 ------LYGQTQKMIYLDAASKAVIFIKQHLFDET--RLLARYREGESHFKAYLDDYAFL 472
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
GL++L++ + ++L AI+L +LF D E GG++ T + +++LR KE +DGA
Sbjct: 473 SYGLIELHQSTAEVEYLELAIQLNKEMLDLFKD-EAGGFYLTGHDAETLMLRPKELYDGA 531
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ NL+RLA + + + AE + ++K M AA
Sbjct: 532 MPSGNSVAAYNLIRLAKLTGDT---LFETEAEKQIQYLAKQVKHYEMNHTFYLIAALFAL 588
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+++ ++ V + + + +L + + N T++ P + ++ S A
Sbjct: 589 SDTKELMITVTKQEQI--KEILKQLNETPHFNTTLLFKTPENQTQL-------SKLAPYT 639
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ D+ +C N +C P + SL+N+L
Sbjct: 640 KDYPIVDQPTYYLCSNGTCQAPTSSLESLKNIL 672
>gi|220931972|ref|YP_002508880.1| putative glutamate--cysteine ligase/putative amino acid ligase
[Halothermothrix orenii H 168]
gi|219993282|gb|ACL69885.1| putative glutamate--cysteine ligase/putative amino acid ligase
[Halothermothrix orenii H 168]
Length = 691
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 252/691 (36%), Positives = 357/691 (51%), Gaps = 75/691 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESF+DE VA+LLN+ F+SIKVDREERPD+D VYM QAL G GGWPL++
Sbjct: 57 TCHWCHVMERESFKDEEVARLLNENFISIKVDREERPDIDAVYMNVCQALTGSGGWPLTI 116
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
L+PD KP GGTY P + GR G +L +V + W K + + ++ + +++
Sbjct: 117 LLTPDKKPFFGGTYIPKNSRGGRMGLIDLLSRVTELWSKNNEKIIKNADKITSSIQRSMT 176
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ L +N L + L +D +GGFG+APKFP P ++ +L++ +
Sbjct: 177 DDSYKGHKETSLGKNTLEKAFDDLKVVFDVEYGGFGTAPKFPIPHQLIFLLHYWYR---- 232
Query: 201 GKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
G M L+ TL M GGI DH+G GFHRYS D +W +PHFEKMLYDQ
Sbjct: 233 -------TGNDMALYMVEKTLTAMRCGGIFDHIGYGFHRYSTDRKWILPHFEKMLYDQAL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ T++ + ++I+DY+RR++ G +SA+D AE+EG EG
Sbjct: 286 LTYSYSEAYLATENKKFLTTIKEIIDYVRRELKSDRGGFYSAQD---AESEGV----EGK 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+Y W+ KE+E+ILG+ A F E Y LK GN + + + GKNVL N
Sbjct: 339 YYTWSVKEIENILGKQADRFIETYSLKSDGNF----IDEATGKKTGKNVLYLRNYKEEVE 394
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ R KLF VR +R P DDK++ WNGL+I+ ARA +
Sbjct: 395 ELK------------KEREKLFKVRQRRRPPFKDDKILTDWNGLMIAGLARAGQ------ 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ EY+ +A AA FI +LY +RL H FR G G L+DYAF
Sbjct: 437 ----------ATGEIEYITMAREAADFIINNLYSSD-NRLYHRFRKGEVSIKGNLNDYAF 485
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
I GLL+LY+ K+L A++L + Q F D + GG++ T ++ +L+R KE +DG
Sbjct: 486 FIWGLLELYQDTFEVKYLKKALKLIDQQLNYFWDNKNGGFYFTPDDEEEILVRQKEIYDG 545
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSVS+ NL R+ + S Y + AE+ L VF ++K+ + + + L
Sbjct: 546 ATPSGNSVSIWNLYRIGHLTGNSD---YEEIAENILRVFSDKIKNDPASYSMALIGLNSL 602
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI----HIDPADTEEMDFWEE-HNS 671
P VV+VG K+ +L + Y N + H TE F E H
Sbjct: 603 LGPGYD-VVVVGDKNKAKTHKILYSLKNEYIPNVNTLFKPAHNGKILTELGPFIENYHMI 661
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
NN VC+++SC P +
Sbjct: 662 NNLP-----------TIYVCKDYSCRRPTNN 681
>gi|398309078|ref|ZP_10512552.1| hypothetical protein BmojR_06022 [Bacillus mojavensis RO-H-1]
Length = 689
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 247/687 (35%), Positives = 358/687 (52%), Gaps = 67/687 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIASLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP KY RPGF +L + + + R+ + A L
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKYNRPGFVDVLEHLSETFANDREHVEDIAENAANHLQTKT 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A S L ++A+ +QL+ +D+ +GGFG APKFP P M++Y +
Sbjct: 173 AAKTSEG-----LSESAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYYHT 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKDICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
W+ +E+ LGE L+ Y + GN F+GKN+ LI A
Sbjct: 334 WSKEEILKTLGEDLGTLYCSVYDITEKGN------------FEGKNIPNLIHTKREQIKA 381
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G+ E+ L + R KL R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 382 DG-GLTEEELSRKLEDARLKLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVFQ--- 437
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+Y+ +AE A +FI ++ + R+ +R+G K GF+DDYAF
Sbjct: 438 -------------EPQYLSLAEDAITFIENNVIIDG--RVMVRYRDGEVKNKGFIDDYAF 482
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ LDLYE +L A +L +LF D E GG++ T + ++++R KE +DG
Sbjct: 483 LLWAYLDLYEASFDLSYLEKAKKLSEDMIDLFWDEEHGGFYFTGHDAEALIVREKEVYDG 542
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV+ + L+RL V G S + AE +VF+ ++ +
Sbjct: 543 AVPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPEIEAYPSGHSFFMQSVLKH 599
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
P +K +V+ G D + + +A ++ N +++ + D + A
Sbjct: 600 MTP-KKEIVIFGRPDDPDRKQITSALQQAFIPNDSILVAEHPD---------QCKDIAPF 649
Query: 677 ARN-NFSADKVVALVCQNFSCSPPVTD 702
A + D+ +C+NF+C P TD
Sbjct: 650 AADYRIIDDQTTVYICENFACQQPTTD 676
>gi|440631885|gb|ELR01804.1| hypothetical protein GMDG_00904 [Geomyces destructans 20631-21]
Length = 918
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 238/603 (39%), Positives = 340/603 (56%), Gaps = 39/603 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE++ VA +LN F+ IK+DREERPD+D++YM +VQA G GGWPL+
Sbjct: 96 SACHWCHVMEKESFENDEVAAILNKDFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLN 155
Query: 80 VFLSPDLKPLMGGTYF-------PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI 132
VF++P L+P+ GGTY+ P + F IL K+ AW ++ A +
Sbjct: 156 VFVTPTLEPVFGGTYWHGPHSNTPQLELEDHVDFLRILGKLSQAWREQESRCRLDSAQIL 215
Query: 133 EQLSEALSASASSNKLPD---ELPQNALRL-----CAEQLSKSYDSRFGGFGSAPKFPRP 184
+QL + +A + P E P L L + L ++D+ GF +APKFP P
Sbjct: 216 QQL-KVFAAEGTLGGAPKTGAEPPAGGLDLDIIDEAYQHLVSTFDTTNSGFSAAPKFPTP 274
Query: 185 VEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
++ +L + + + D + E Q M L TL+ MA+GGIHDH+G GF RYSV
Sbjct: 275 SKLAFLLRLPHFPQPVLDVVGAEEVKSAQFMALSTLRAMARGGIHDHIGHGFSRYSVTAD 334
Query: 242 WHVPHFEKMLYDQGQLANVYLDAF-SLTK-DVFYSYICRDILDYLRRDMIG-PGGEIFSA 298
W +PHFEKMLYD QL ++YLDAF L K D + D+ YL I PGG +S+
Sbjct: 335 WSLPHFEKMLYDNAQLLSLYLDAFLGLPKPDPELLGVVYDLAAYLLSPPIAAPGGGFYSS 394
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPH 357
+DADS +G +EGA+YVWT++E+E +L A + + + P GN S D H
Sbjct: 395 QDADSFYRKGDKETREGAYYVWTARELETLLPAGAYDIVAAFFGVNPDGNVAPSH--DVH 452
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVS 416
+EF +NVL + S AS+ G+ + + + +R L R ++R P+LDDK++ +
Sbjct: 453 DEFINQNVLRIASTPSQLASQFGIAESEVVETIKSAKRTLLAHREAERVVPNLDDKIVCA 512
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNG+ I + AR L+ E ++ M S+R ++ A AA F+RR +YDE L
Sbjct: 513 WNGIAIGALARTGASLR-EVDAQM-------SER--CLDAAIRAARFMRREMYDEDAKTL 562
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+ +R GP + GF DDYAFL+ GLL+LYE +W+ WA ELQ TQ+ FLD G+
Sbjct: 563 RRVWRGGPGETAGFADDYAFLVEGLLELYEATFADEWVRWADELQATQNSHFLDPTASGF 622
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F T P +LR+K+ D +EPS N VS NL RLAS++ D Y A+ ++ FE
Sbjct: 623 FATAAAAPHTILRLKDGMDASEPSTNGVSASNLFRLASLLG---DDKYEALAKETVGAFE 679
Query: 597 TRL 599
+
Sbjct: 680 AEI 682
>gi|298675032|ref|YP_003726782.1| hypothetical protein Metev_1104 [Methanohalobium evestigatum
Z-7303]
gi|298288020|gb|ADI73986.1| protein of unknown function DUF255 [Methanohalobium evestigatum
Z-7303]
Length = 728
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 250/709 (35%), Positives = 368/709 (51%), Gaps = 77/709 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED +A++LND FV IKVDREERPD+D YM QAL G GGWPL++
Sbjct: 59 TCHWCHVMENESFEDPEIAQILNDNFVCIKVDREERPDIDSTYMDVCQALTGRGGWPLTI 118
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK-KRDMLAQSGAFAIEQLSEAL 139
++P+ KP TY P E ++G G +L ++ D W K KR++++++ EQ++ ++
Sbjct: 119 IMTPEKKPFSAATYLPKESRFGLTGLIDLLPRISDMWSKQKRELVSRA-----EQITSSV 173
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + EL L E L ++YD +GGFG+APKFP P + ++ + ++ +
Sbjct: 174 EEVFTKSPKTRELSNQELDSAYESLLENYDPEYGGFGNAPKFPSPHNLMFLMRYWERTSN 233
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++ +MV TL+ M GGI+DH+G GFHRYS D W +PHFEKMLYDQ L+
Sbjct: 234 -------NKALEMVEKTLKNMRIGGIYDHIGFGFHRYSTDRYWMIPHFEKMLYDQALLSM 286
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y++ + T + Y RD+ Y RD+ G +SA DADS EG EG FY
Sbjct: 287 AYIEVYQATGKIEYKNTARDVFTYALRDLTSKEGGFYSAVDADS---EGV----EGKFYT 339
Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIE-------- 368
WT E+ IL + A + + +K GN + + GKN+ LIE
Sbjct: 340 WTYDEIHKILSKSEANIVTNLFNIKKEGNFRDEKTGN----LTGKNIPHLIETPLYIDVE 395
Query: 369 -----------LNDSSASASKLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVI 414
LN++ L K + L RRKLF+ R R P DDK++
Sbjct: 396 PDEELDEFHEKLNEAREKRGAWKRNLLKTIYSQRRLEVARRKLFEARENRVHPAKDDKIL 455
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
WNGL+I++ ++ +++ + KEY A AA FI +++ D +
Sbjct: 456 TDWNGLMIAALSKGAQVF----------------NDKEYANSARKAADFIIKNMSD-SSG 498
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
+L H +R+G S GF+DDYAFL GL++LYE K+L A+E N F D G
Sbjct: 499 QLMHRYRDGDSDIHGFIDDYAFLTWGLIELYETTFEVKYLEKALEFNNYLINHFWDDNNG 558
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
G++ T + ++R KE +DGA PSGNSV+++NL+RL + + + + A S+
Sbjct: 559 GFYFTPDNAETPIVRKKEIYDGASPSGNSVALMNLMRLGRMTGNPELE---KKASDSIKS 615
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
F L +A A D + PS + VV+ G S D +NM+ + + + + V+
Sbjct: 616 FSKSLSRNPIASTHSMQALDFVQGPSSE-VVITGDFQSEDTQNMINSLRTEF-IPRKVVL 673
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
P + D N A R+ S + K A +CQN+SCS P TD
Sbjct: 674 FKPDKVQSPDI-----VNIAGFTRDMDSQEGKATAYICQNYSCSSPKTD 717
>gi|325958772|ref|YP_004290238.1| hypothetical protein Metbo_1019 [Methanobacterium sp. AL-21]
gi|325330204|gb|ADZ09266.1| hypothetical protein Metbo_1019 [Methanobacterium sp. AL-21]
Length = 702
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 271/717 (37%), Positives = 365/717 (50%), Gaps = 61/717 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVM ESFED VA+LLN+ FV++KVDREERPD
Sbjct: 38 GDEAFEKAKKLDKPIFLSIGYSTCHWCHVMAHESFEDLEVAELLNNNFVAVKVDREERPD 97
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD VYM Q + G GGWPL++ ++ D KP GTYFP E +G G K +L V D W
Sbjct: 98 VDSVYMAACQIMTGTGGWPLTIIMTHDKKPFFAGTYFPKESSFGNIGLKDLLLNVMDIWR 157
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+R SG +Q+ AL S N +L L +QLSK +D GGFG
Sbjct: 158 DERKNALDSG----DQIFRALK-EMSVNTKGKQLDSTILEKTYDQLSKVFDVENGGFGDF 212
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
KFP P + +L + K+ TG + MVL TL MA GGI+DHVG GFHRYSV
Sbjct: 213 QKFPTPHSLMFLLRYWKR---TGNKHSLN----MVLKTLDEMAMGGIYDHVGFGFHRYSV 265
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+ W VPHFEKMLYDQ +A +Y + +S T Y + I +Y+ RDM G +SA
Sbjct: 266 DKNWLVPHFEKMLYDQALIAMLYTEVYSATGKFEYKKTAQQIYEYVLRDMTDVEGGFYSA 325
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG EG FY WT +E+ IL + A L E + +K GN +D +
Sbjct: 326 EDADS---EGV----EGKFYYWTYEELYSILDKDSADLITEVFNVKKDGN-----FNDGY 373
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
+ N+L + D A G+ + ++ + +LF VR KR PH DDK++ W
Sbjct: 374 SNESINNILHKKRDYKKIAENKGLNISDLEELVDDILSELFLVREKRVHPHKDDKILTDW 433
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+I+S +RA ++ + E +Y++ AE+ +FI Y Q +RL
Sbjct: 434 NGLMIASLSRAFQVFEEE----------------KYVKAAENCVNFIMNKSY--QQNRLM 475
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
H FR+G S G LDDY F+I GLL++Y +L A++L T E F D E GG++
Sbjct: 476 HMFRDGESAVYGNLDDYTFMIWGLLEIYMATFNVDYLEKAMDLNQTVVEHFWDEENGGFY 535
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFE 596
T ++ VL+R K+ D A PSGNSV +NL+RL S +D+ + + L VF
Sbjct: 536 FTADDEEKVLIREKKTFDSAIPSGNSVEFLNLLRLGSFT----NDHNQMDTARKLETVFS 591
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
+K D PS VV+VG S D ML Y N T+I D
Sbjct: 592 ETVKRSPTGHTQFISGVDFALGPSYS-VVIVGDGDSEDTIEMLRLRQL-YIPNTTIILKD 649
Query: 657 PADTEEMDFW-EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
W ++ NS + + + + K A VC SC P + LL E
Sbjct: 650 SK-------WSDKTNSISEDIDKKSMINGKATAHVCSTGSCKLPTNKKSEMLKLLNE 699
>gi|421839588|ref|ZP_16273125.1| hypothetical protein CFSAN001627_27670 [Clostridium botulinum
CFSAN001627]
gi|409733965|gb|EKN35825.1| hypothetical protein CFSAN001627_27670 [Clostridium botulinum
CFSAN001627]
Length = 680
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 245/684 (35%), Positives = 353/684 (51%), Gaps = 70/684 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 53 TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + + L ++D+++GGFG+ PKFP I +L Y+ KK
Sbjct: 170 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ +V TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I +L+Y+++ M G +SAEDADS EG EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
+WT +E+ DILGE E Y C + ++ N F+ KN+ +N
Sbjct: 332 LWTKEEIMDILGEEE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDN 380
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LEK R KLF+ R KR P+ DDK++ SWN L+I +F++A + LK++
Sbjct: 381 NKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--- 430
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF +
Sbjct: 431 -------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFL 476
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L++LYE +L +IE+ N+ +LF +E GG++ + +L+R KE +DGA
Sbjct: 477 WALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGAT 536
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN+V+ + L L I D Y+ + F T +K M L A M ++
Sbjct: 537 PSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNI 592
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
K + L +K DF + + Y V D ++ E N ++
Sbjct: 593 SPVKEITLAYNKKDEDFYKFINEVNNRYIPFSIVTLNDKSN--------EIEKINKNIKD 644
Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
DK +CQN++C P+TD
Sbjct: 645 KIAIKDKATVYICQNYACREPITD 668
>gi|226948333|ref|YP_002803424.1| hypothetical protein CLM_1215 [Clostridium botulinum A2 str. Kyoto]
gi|226841180|gb|ACO83846.1| conserved hypothetical protein [Clostridium botulinum A2 str.
Kyoto]
Length = 680
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 245/684 (35%), Positives = 354/684 (51%), Gaps = 70/684 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 53 TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + A+ L ++D+++GGFG+ PKFP I +L Y+ KK
Sbjct: 170 --FQDNHREGELEEYIIEEAAKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ +V TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I +L+Y+++ M G +SAEDADS EG EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
+WT +E+ DILGE E Y C + ++ N F+ KN+ +N
Sbjct: 332 LWTKEEIMDILGEEE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDN 380
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LEK R KLF+ R KR P+ DDK++ SWN L+I +F++A + LK++
Sbjct: 381 NKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--- 430
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF +
Sbjct: 431 -------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFL 476
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L++LYE +L +IE+ N+ +LF +E GG++ + +L+R KE +DGA
Sbjct: 477 WALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGAT 536
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN+V+ + L L I D Y+ + F T +K M L A M ++
Sbjct: 537 PSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNI 592
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
K + L ++ DF + + Y V D ++ E N ++
Sbjct: 593 SPVKEITLAYNEKDEDFYKFINELNNRYIPFSIVTLNDKSN--------EIEKINKNIKD 644
Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
DK +CQN++C P+TD
Sbjct: 645 KIAIKDKATVYICQNYACREPITD 668
>gi|423720021|ref|ZP_17694203.1| thioredoxin domain protein [Geobacillus thermoglucosidans
TNO-09.020]
gi|383366783|gb|EID44068.1| thioredoxin domain protein [Geobacillus thermoglucosidans
TNO-09.020]
Length = 637
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 257/694 (37%), Positives = 369/694 (53%), Gaps = 77/694 (11%)
Query: 18 LINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 77
LI+TCHWCHVM ESFEDE VAK+LN+ +VSIKVDREERPD+D VYM Q + G GGWP
Sbjct: 2 LISTCHWCHVMAHESFEDEEVAKILNEKYVSIKVDREERPDIDSVYMRVCQMMTGQGGWP 61
Query: 78 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
LSVFL+P+ KP GTYFP + +YGRPGF +L ++ D + + D + EQ++E
Sbjct: 62 LSVFLTPEGKPFYAGTYFPKQSRYGRPGFIELLTRLYDKYKENPDEIVHVA----EQVTE 117
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP-VEIQMMLYHSKK 196
AL SA ++ + LP A+ QL +D+ +GGFG APKFP P + + +M Y+ K
Sbjct: 118 ALRQSARASG-TERLPFAAIEKAYRQLLNGFDAVYGGFGGAPKFPIPHMLMFLMRYYQWK 176
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+D MV TL MA GGI+DH+G GF RYS D W VPHFEKMLYD
Sbjct: 177 RDD--------RALLMVEKTLNGMANGGIYDHIGYGFARYSTDAMWLVPHFEKMLYDNAL 228
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ LTK Y I I+++++R+M G +SA DADS EG EG
Sbjct: 229 LVIAYTEAYQLTKKERYKEIAEQIIEFVKREMTSQDGAFYSAVDADS---EGV----EGK 281
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSA 374
+YVWT EV ++LG E Y C + ++D N F GKNV LI
Sbjct: 282 YYVWTPDEVVNVLGAE---LGELY-------CRVYDITDEGN-FAGKNVPNLIHARMERL 330
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A + + E+ L E R++L RS R RPH+DDK++ +WN L+I++ A+A+K+
Sbjct: 331 -ARRYRLTEEELRERLEEARKQLLAERSSRVRPHVDDKILTAWNALMIAALAKAAKVY-- 387
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
+R++Y+++A+ A SFI HL+ Q RL +R G K G +DDY
Sbjct: 388 --------------ERRDYLQMAKQALSFIETHLW--QNGRLMVRYRGGEVKHLGIIDDY 431
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A+L+ +++YE +L A LF D + G +F T + ++++R KE +
Sbjct: 432 AYLVWAYVEMYEATLDLAYLQKAKTCAERMISLFWDEKHGAFFMTGNDAEALIIREKEIY 491
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DGA PSGNSV+ + ++RLA + + AE VF +++
Sbjct: 492 DGALPSGNSVAAVQMIRLARLTGDLA---LLEKAETMYKVFRRQVEAYESGHTFFLQGLL 548
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
++ P+ + VVL G + E + ++ N ++ EH ++ A
Sbjct: 549 LIETPAAE-VVLFGKQGDEKREQFILKWQHAFAPNVFLLV------------AEHPADVA 595
Query: 675 SMARNNFSA------DKVVALVCQNFSCSPPVTD 702
+A F+A D+ VC+NF+C P TD
Sbjct: 596 GIA--PFAAEYEPLGDETTVYVCENFACQQPTTD 627
>gi|15896782|ref|NP_350131.1| hypothetical protein CA_C3546 [Clostridium acetobutylicum ATCC 824]
gi|337738753|ref|YP_004638200.1| hypothetical protein SMB_G3587 [Clostridium acetobutylicum DSM
1731]
gi|384460264|ref|YP_005672684.1| hypothetical protein CEA_G3552 [Clostridium acetobutylicum EA 2018]
gi|15026641|gb|AAK81471.1|AE007851_2 Highly conserved protein containing a domain related to cellulase
catalitic domain and a thioredoxin domain [Clostridium
acetobutylicum ATCC 824]
gi|325510953|gb|ADZ22589.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018]
gi|336292984|gb|AEI34118.1| hypothetical protein SMB_G3587 [Clostridium acetobutylicum DSM
1731]
Length = 677
Score = 407 bits (1046), Expect = e-110, Method: Compositional matrix adjust.
Identities = 246/694 (35%), Positives = 358/694 (51%), Gaps = 76/694 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED+ VA++LN FVSIKVDREERPD+D++YM A+ G GGWPL++
Sbjct: 56 TCHWCHVMERESFEDDDVAEVLNRSFVSIKVDREERPDIDEIYMNVCTAITGSGGWPLTI 115
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++P+ KP GTY P ++ G G ++L ++ W + ++ L + G + L++
Sbjct: 116 VMTPEQKPFFAGTYIPKNNRMGMQGLISLLENIEYQWKENQNELVEIGDKIVSSLNKDRK 175
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+A EL + L Q ++D +GGFGS PKFP P + ++ + +D
Sbjct: 176 TTAK------ELSEEVLEEAFSQFKYNFDRTYGGFGSEPKFPTPHNLIFLMRYFYASKD- 228
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
M L TL M +GGI+DH+G GF RYSVD++W VPHFEKMLYD LA
Sbjct: 229 ------KTSLNMALKTLDTMYRGGIYDHIGYGFSRYSVDKKWLVPHFEKMLYDNALLAYA 282
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +AF +TK+ Y I I Y+ RDM G + AEDADS EG EG FYVW
Sbjct: 283 YTEAFKITKNDNYKNIVDQIFTYILRDMTSNEGGFYCAEDADS---EGV----EGKFYVW 335
Query: 321 TSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+ KE+ ++LGE F +++ + TGN F+G+N+L + K+
Sbjct: 336 SKKEINNVLGEDDGKKFSKYFNVTDTGN------------FEGENIL-----NLIETEKI 378
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
E L CR+KLFD R KR P+ DDK++ SWNGL+I++ A + LK+E
Sbjct: 379 EFEDE----FLNSCRKKLFDYREKRIHPYKDDKILTSWNGLMIAALAFGGRSLKNEI--- 431
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
Y+ AE A +FI L D RL +R+G + G+L DY+FLI
Sbjct: 432 -------------YINAAEKAVTFIFTKLID-ANGRLLSRYRHGEASIKGYLTDYSFLIW 477
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GL++LYE ++++ AI+L N + F D + G F + ++ R KE +DGA P
Sbjct: 478 GLIELYEATYKSEYIEKAIKLNNDLIKYFWDDKNKGLFLYGSDSEELISRPKEIYDGAIP 537
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSVS +N +RL+ + + L F ++ M + L
Sbjct: 538 SGNSVSALNFIRLSRLTGSYDLE---DKCTEILQAFSEEIESYPMGYSFSLLSVLFLGKK 594
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYD-LNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
S K + LV + + L + Y+ L+ + +I+ T E N S
Sbjct: 595 S-KEITLVSNSYDNTSKEFLEVINDKYNPLSTFIYYIEGDKTLE----------NVSNFV 643
Query: 679 NNFSA--DKVVALVCQNFSCSPPVTDPISLENLL 710
+++ DK +C+NFSC+ PVT+ L+ LL
Sbjct: 644 SDYQPLNDKPTVYICENFSCNAPVTNISDLKKLL 677
>gi|345560346|gb|EGX43471.1| hypothetical protein AOL_s00215g207 [Arthrobotrys oligospora ATCC
24927]
Length = 758
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 257/698 (36%), Positives = 372/698 (53%), Gaps = 43/698 (6%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESF+D VAK+LND F+ IK+DREERPD+D++YM YVQA G GGWPL+VF
Sbjct: 68 CHWCHVMERESFQDAYVAKILNDNFIPIKIDREERPDIDRIYMNYVQATTGSGGWPLNVF 127
Query: 82 LSPDLKPLMGGTYFPPEDKYGRP------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
L+P+L+P+ GGTY+P + P GF +L K+ W +++D S ++QL
Sbjct: 128 LTPNLEPVFGGTYWPGPNATDGPSMKDQIGFVEVLDKIVKVWKEQQDKCLASAKDILKQL 187
Query: 136 S----EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
E L + + L + L + YD+ GGFG+ PKFP P + +L
Sbjct: 188 KEFSDEGLKEQGGNQDGAEILEIDLLEEAYQHFLSRYDTTHGGFGTEPKFPTPTNLAFLL 247
Query: 192 YHSKKLEDTGKSGEASEGQK---MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
S E ++ M + TL+ M++GGIHDH+G GF RYSV W +PHFE
Sbjct: 248 RLSSLSSVVEDVVGDVECERAKFMAVTTLRHMSRGGIHDHIGNGFERYSVTADWSLPHFE 307
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP----GGEIFSAEDADSA 304
KMLYD QL +VYLDA+ LTKD D DYL GP G +SAEDADS
Sbjct: 308 KMLYDNAQLISVYLDAYLLTKDREMLDAALDAADYL---CSGPLSHKDGGFYSAEDADSY 364
Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
+G T K+EGAFYVW KE +LGE A + +++ ++ GN D +R D H+EF +
Sbjct: 365 ARKGDTEKREGAFYVWDKKEFIKVLGEQDAEVCSKYWGVRTDGNVDPAR--DIHDEFLHQ 422
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH-LDDKVIVSWNGLVI 422
NVL + S LG+ + + R KL + R + LDDK++ WNGL I
Sbjct: 423 NVLQISQTPAQIGSMLGLSETAIVEKIKNGRAKLREYRERERPRPILDDKILTGWNGLAI 482
Query: 423 SSFARASKILK-SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
++ +R + L+ +AE + F Y+ A AA FIR++++D++T L+ +R
Sbjct: 483 AALSRLAAALEIVDAEKSKF-----------YLNQAIRAAEFIRKNVFDQRTLGLKRVWR 531
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
P F DDYA+LI GL+ LYE WL WA LQ Q +LF D GG+F+T
Sbjct: 532 ETPGATKAFADDYAYLIYGLISLYEATFDAGWLRWAHSLQAAQTKLFWDEAQGGFFSTER 591
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
+ P ++LR+K+ D AEPS N +S NL +L S++ + + A + F T L
Sbjct: 592 DAPDLILRLKDGLDSAEPSTNGISAANLYKLGSLLGDASFSFL---ASKTCNAFSTELMQ 648
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD-T 660
M + L++ + V++ G KS A N ++I +DP + +
Sbjct: 649 HPFLFSTMLPSVVALNLGTGT-VIIAGKKSDPTISAYRAKLRTQLFTNTSIIVVDPTEKS 707
Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSP 698
+++ ++ N + ++ +A K + VCQN +C P
Sbjct: 708 DDITWFTGKNEILKDILKS--AATKPIVQVCQNQTCVP 743
>gi|168182912|ref|ZP_02617576.1| dTMP kinase [Clostridium botulinum Bf]
gi|182673930|gb|EDT85891.1| dTMP kinase [Clostridium botulinum Bf]
Length = 682
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 244/693 (35%), Positives = 357/693 (51%), Gaps = 72/693 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 55 TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 115 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 171
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + + L ++D+++GGFG+ PKFP I +L Y+ KK
Sbjct: 172 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTKPKFPTAHYILFLLRYYYFKK-- 227
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++ ++ TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 228 -------DNKVLDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I +L+Y+++ M G +SAEDADS EG EG FY
Sbjct: 281 MTYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 333
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+WT +E+ DILG E L+ + Y + GN F+ KN+ +N
Sbjct: 334 LWTKEEIMDILGEEEGELYCKIYNITSKGN------------FENKNIANLINTDLKIVD 381
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LEK R KLF+ R KR P+ DDK++ SWN L+I +F++A + LK++
Sbjct: 382 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 432
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF
Sbjct: 433 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 477
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ L++LYE +L +IE+ N+ +LF +E GG++ + +L+R KE +DGA
Sbjct: 478 LWALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGA 537
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V+ + L L I D Y+ + F +K M L A M +
Sbjct: 538 TPSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFAANIKSGPM-YHLFSVMAYMYN 593
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
V K + L + DF + + Y +I D ++ E N ++
Sbjct: 594 VLPIKEITLTYREKDEDFYKFINEVNNRYIPFSIIILNDKSN--------EIEKINKNIK 645
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
DK +CQN++C P+TD +++L
Sbjct: 646 DKIAIKDKTTVYICQNYACREPITDLEEFKSVL 678
>gi|116749973|ref|YP_846660.1| hypothetical protein Sfum_2547 [Syntrophobacter fumaroxidans MPOB]
gi|116699037|gb|ABK18225.1| protein of unknown function DUF255 [Syntrophobacter fumaroxidans
MPOB]
Length = 684
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 249/685 (36%), Positives = 361/685 (52%), Gaps = 61/685 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA LLN+ V++KVDREERPD+D++YMT QAL G GGWPLSV
Sbjct: 49 TCHWCHVMERESFEDEEVAALLNEHVVAVKVDREERPDIDQIYMTVCQALLGSGGWPLSV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++P+ G+YFP + G GF ++R++ W R+ L ++G E +
Sbjct: 109 FMTPEKNAFFAGSYFPKHARLGMAGFTDVIRRIVHMWKNDRERLLEAGRQITESIQPRPV 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ S P+ L + R LS+++D+ +GGFGS PKFP P + +L ++
Sbjct: 169 QTVGSLPGPEVLEEAYSR-----LSRAFDATWGGFGSKPKFPTPHHLTFLLRWHRR---- 219
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
S+ +V TL M GGI D VG GFHRYSVDE+W VPHFEKMLYDQ LA
Sbjct: 220 ---NPWSDALAIVEKTLDGMRDGGIFDQVGFGFHRYSVDEKWLVPHFEKMLYDQAMLALA 276
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL+AF +T + + R+I +Y+ RDM P G +SAEDADS EG EG FYVW
Sbjct: 277 YLEAFQVTGRERHGRVAREIFEYVLRDMTDPDGGFYSAEDADS---EGV----EGRFYVW 329
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
T EV +LG E F + + P GN + R S PH L EL DS + +
Sbjct: 330 TPAEVNALLGNEIGETFCRFFDITPEGNFEDGR-SIPH--------LAELADSLSDRDEP 380
Query: 380 GM-PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+ LE ++L + RR LF+ R R P DDK++ SWNGL+I++ ++ S+ L
Sbjct: 381 GIGGLE---DLLEKGRRLLFEARRMRVHPLKDDKILTSWNGLMIAALSKGSRALGD---- 433
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ Y A AA FI + + RL +R G + + DDYAF I
Sbjct: 434 ------------RSYALAASRAADFILDRM-RRDSGRLHRRYRKGEAAIHAYADDYAFFI 480
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++LYE ++L A++LQ+ +LF D GG+F T + ++++R +E +DGA
Sbjct: 481 WGLIELYEAAFDVRYLEEAVKLQDLMIDLFWDDAEGGFFFTPNDGENLIVREREIYDGAV 540
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PS NS + +NL+RL +V + + + A+ L F ++D A A D +
Sbjct: 541 PSSNSAAALNLLRLGRMVGAVR---FEEKADRLLRRFSETVRDYPSAYTQFLHAVDFAAG 597
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV-IHIDPADTEEMDFWEEHNSNNASMA 677
P+R+ VV+ G + M+ + + N V + P + + + +
Sbjct: 598 PTRE-VVIAGSPDNATTAEMMKIVGSGFVPNTVVLLRGTPESGARLAELAPYTAGLVAPG 656
Query: 678 RNNFSADKVVALVCQNFSCSPPVTD 702
N +C+ F+C+ P+T+
Sbjct: 657 GNP------AVYICEKFACTSPITE 675
>gi|347733897|ref|ZP_08866951.1| hypothetical protein DA2_3260 [Desulfovibrio sp. A2]
gi|347517453|gb|EGY24644.1| hypothetical protein DA2_3260 [Desulfovibrio sp. A2]
Length = 781
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 258/734 (35%), Positives = 370/734 (50%), Gaps = 85/734 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED+ VA+LLND FV +KVDREERPD+D YM Q L G GGWPL+
Sbjct: 83 STCHWCHVMAHESFEDDEVARLLNDAFVCVKVDREERPDIDAAYMAACQMLTGTGGWPLT 142
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---S 136
+ PD +P TY P + GR G ++ +V W KR + S +E + +
Sbjct: 143 IIALPDGRPFFAATYLPKHSRPGRIGLMDLVPRVLAVWRDKRGEVLDSAESIVEHVRRHA 202
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
EA+ + +LP L E ++ +D+ GGFGSAPKFP P + +L +++
Sbjct: 203 EAMLRPPADGRLPG---AGTLHAACEAMASEFDAANGGFGSAPKFPSPHNLLFLLRWARR 259
Query: 197 --------------LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
T ++ +M TL+ + +GGIHDHVG GFHRYS D RW
Sbjct: 260 NGYGAGSGASGAAAPGATQDEPGGAKALRMAAQTLRAIRRGGIHDHVGYGFHRYSTDARW 319
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
+PHFEKMLYDQ L Y +A+ T D + + Y+ RD+ G +SAEDAD
Sbjct: 320 LLPHFEKMLYDQAMLMLAYAEAWLATGDGEFRRTAEETAAYVLRDLTSSEGAFYSAEDAD 379
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILG-------------------EHAILFKEHYYLK 343
S E +G + EG FY +T ++E A L +
Sbjct: 380 S-ELDGV--RGEGLFYTFTLADLEAACAPLDVGSGGDGGAEAGEGAISDADLAARAFGCT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + + G+NVL A A +LG+P + L R LFD+R+
Sbjct: 437 AYGNYE----DEATRSRTGRNVLHLPRSPEALARELGLPPREVEERLEAARAALFDLRTT 492
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RPRPHLDDKV+ WNGL I++ +R ++ D E A AA F
Sbjct: 493 RPRPHLDDKVLADWNGLAIAAMSRCAQAF----------------DAPHLAEAAAVAADF 536
Query: 464 IRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
+ + + RL H +R+G + PG LDDYAF+I GL++LY +WL A+ LQ
Sbjct: 537 VLTRMVTPEG-RLLHRWRDGEAAVPGLLDDYAFMIWGLVELYGATGEVRWLRRALRLQEV 595
Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
QD F D EGGGY+ T + ++L+R KE HDGA PSGN+ ++ NL+RL+ ++ +
Sbjct: 596 QDTFFHDPEGGGYWMTPADGDALLVRRKEGHDGALPSGNAAALFNLLRLSLLLGRPE--- 652
Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 643
Y + A L F T+++ + + C D ++ + V++ G D E MLAA
Sbjct: 653 YGERARGVLRAFATQVRHHPIGSTMFLCGVD-FALSGGRSVIVAGEPDQPDTEAMLAAVR 711
Query: 644 ASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA------DKVVALVCQNFSCS 697
+Y TV+H+ +D N+ + + A F+A D+ A +C+N++CS
Sbjct: 712 GTY-APTTVLHLRTSD----------NARDLA-ALVPFTAHLAPVEDRATAWLCENYACS 759
Query: 698 PPVTDPISLENLLL 711
PP+TDP L+ LL
Sbjct: 760 PPITDPAELKARLL 773
>gi|419820995|ref|ZP_14344599.1| hypothetical protein UY9_06334, partial [Bacillus atrophaeus C89]
gi|388474906|gb|EIM11625.1| hypothetical protein UY9_06334, partial [Bacillus atrophaeus C89]
Length = 645
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 251/695 (36%), Positives = 371/695 (53%), Gaps = 84/695 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 11 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 70
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ +E+++E
Sbjct: 71 VFITPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSNTFANDREH--------VEEIAENA 122
Query: 140 SASASSNKLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
S S K P+ L + AL +QL +D+ +GGFG APKFP P M++Y +
Sbjct: 123 S-SHLQIKTPEGNGTLTKEALHRTFQQLMSGFDTVYGGFGQAPKFPMP---HMLMYLLRY 178
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ TG+ K TL MA GGI+DHVG GF RYS D+ W VPHFEKMLYD
Sbjct: 179 HQYTGQENALYNVTK----TLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNAL 234
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ +T+D Y +I I+ +++R+M G +SA DAD TEG EG
Sbjct: 235 LLTAYTEAYQVTQDSRYQHIVEQIITFIQREMTHEDGSFYSALDAD---TEGV----EGK 287
Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSS 373
+YVW+ E+ + LG E L+ Y + +GN F+G N+ LI
Sbjct: 288 YYVWSKDEIIETLGDELGELYCAIYNITSSGN------------FEGHNIPNLIHTKLDK 335
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A + + ++ LGE R+KL R R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 336 VKA-EFDLNEQEINKQLGEARQKLLKKRETRTYPHVDDKVLTSWNALMIAGLAKAAKVFQ 394
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+ EY+ +A++AA+FI + L + R+ +R+G K GF+DD
Sbjct: 395 A----------------PEYLNMAQAAAAFIEKKLIIDG--RVMVRYRDGEVKNKGFIDD 436
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAFL+ ++LYE G +L A +L +LF D++ GG++ T + ++L+R KE
Sbjct: 437 YAFLLWAYIELYEAGYDLAYLQKAKDLSAKMLDLFWDQKHGGFYFTGHDAEALLVREKEV 496
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
+DGA PSGNSV+ + L+RL + G S + AE + F+ ++ +
Sbjct: 497 YDGAVPSGNSVAAVQLLRLGQLT-GELS--LIEKAEKMFSAFKRDVEAYPSGHSFFMQSV 553
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+P +K +V+ G K +++++A ++ N +V+ EH
Sbjct: 554 LTHMMP-KKEIVIFGRKDDSQRQHIISALQQAFQPNFSVL------------VAEHPDQC 600
Query: 674 ASMARNNFSAD------KVVALVCQNFSCSPPVTD 702
+A F+AD K +C+NF+C P TD
Sbjct: 601 KDIA--PFAADYRIIDGKTTVYICENFACQQPTTD 633
>gi|237794355|ref|YP_002861907.1| thymidylate kinase [Clostridium botulinum Ba4 str. 657]
gi|229263126|gb|ACQ54159.1| dTMP kinase [Clostridium botulinum Ba4 str. 657]
Length = 682
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 244/693 (35%), Positives = 357/693 (51%), Gaps = 72/693 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 55 TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 115 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 171
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + + L ++D+++GGFG+ PKFP I +L Y+ KK
Sbjct: 172 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTKPKFPTAHYILFLLRYYYFKK-- 227
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++ ++ TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 228 -------DNKVLDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I +L+Y+++ M G +SAEDADS EG EG FY
Sbjct: 281 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 333
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+WT +E+ DILG E L+ + Y + GN F+ KN+ +N
Sbjct: 334 LWTKEEIMDILGEEEGELYCKIYNITSKGN------------FENKNIANLINTDLKIVD 381
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LEK R KLF+ R KR P+ DDK++ SWN L+I +F++A + LK++
Sbjct: 382 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 432
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF
Sbjct: 433 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 477
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ L++LYE +L +IE+ N+ +LF +E GG++ + +L+R KE +DGA
Sbjct: 478 LWALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGA 537
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V+ + L L I D Y+ + F +K M L A M +
Sbjct: 538 TPSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFAANIKSGPM-YHLFSVMAYMYN 593
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
V K + L + DF + + Y +I D ++ E N ++
Sbjct: 594 VLPIKEITLTYREKDEDFYKFINEVNNRYIPFSIIILNDKSN--------EIEKINKNIK 645
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
DK +CQN++C P+TD +++L
Sbjct: 646 DKIAIKDKTTVYICQNYACREPITDLEEFKSVL 678
>gi|220927673|ref|YP_002504582.1| hypothetical protein Ccel_0215 [Clostridium cellulolyticum H10]
gi|219998001|gb|ACL74602.1| protein of unknown function DUF255 [Clostridium cellulolyticum H10]
Length = 673
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 261/701 (37%), Positives = 371/701 (52%), Gaps = 92/701 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA +LN F+ IKVDREERPD+D +YM+ QAL G GGWPL+
Sbjct: 54 STCHWCHVMERESFEDEEVAHILNRDFICIKVDREERPDIDSIYMSVCQALTGHGGWPLT 113
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFP ED G G ++L VK+AWD KR+ L S I +S+
Sbjct: 114 VFLTPDKQPFYAGTYFPKEDSKGLMGLISLLGSVKEAWDNKREHLLVSAENIINHVSKES 173
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
+ S ++ Q A ++DS++GGFG++PKFP P + +L +++KK
Sbjct: 174 ISKDSKIS--SDIIQEAF----AHFKYNFDSKYGGFGTSPKFPSPHTLLFLLRYWYTKK- 226
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+MV TL+ M GGI DH+G GF RYS D++W VPHFEKMLYD L
Sbjct: 227 --------EPYALEMVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALL 278
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A Y +A+S T + Y R ILDY++RDM G +SAEDADS EG EG F
Sbjct: 279 AIAYGEAYSATGNKNYEETARQILDYVQRDMSSQLGAFYSAEDADS---EGV----EGKF 331
Query: 318 YVWTSKEVEDILG-----EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELN 370
Y+W+ +EV ++LG E+ +F + P+GN F+G N+ LIE
Sbjct: 332 YIWSKEEVINVLGSKDGEEYCRIFD----ISPSGN------------FEGLNIPNLIE-- 373
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
G E+ + +CR+KLF R KR P+ DDK++ +WNGL+ ++ A +
Sbjct: 374 --------TGTLPEQQKSFAEDCRKKLFTHREKRIHPYKDDKILTAWNGLMTAAMAYCGR 425
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+L G D+ Y+E A+ FI + L RL +R G + P +
Sbjct: 426 VL--------------GEDK--YIESAKRCIDFISKKLV-RTDGRLLARYREGEAVFPAY 468
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYAFL+ GLL+LYE T +L A++L + LF + G F + ++ R
Sbjct: 469 LEDYAFLVWGLLELYEATFTTLYLKRALKLTDAMLNLFGENNSTGLFLYGHDSEQLIARP 528
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
+E +DGA PSGNSV+ +NL+RLA I + Y A+ + F T++ M
Sbjct: 529 RESYDGAIPSGNSVAAMNLLRLARITGRHE---YENRAKAIMDFFGTQINAAPTGHSYML 585
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY-DLNKTVIHIDPADTEEMDFWEEH 669
C+ M SV V++ + VD + ++ + Y + +I P TE F ++
Sbjct: 586 CSY-MYSVSDISSEVVI---AGVDGKGLIDTFNNKYLPFAVAISNISPELTEIAPFIGDY 641
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ N K +A VC+NFSC P+T+P L +L
Sbjct: 642 KAQNG----------KTMAYVCRNFSCMEPITEPKKLGEVL 672
>gi|168178477|ref|ZP_02613141.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916]
gi|182670724|gb|EDT82698.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916]
Length = 680
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 244/684 (35%), Positives = 353/684 (51%), Gaps = 70/684 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 53 TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 113 IMTPDKKPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + + L ++D+++GGFG+ PKFP I +L Y+ KK
Sbjct: 170 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ +V TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I +L+Y+++ M G +SAEDADS EG EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
+WT +E+ DILGE E Y C + ++ N F+ KN+ +N
Sbjct: 332 LWTKEEIMDILGEEE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDN 380
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LEK R KLF+ R KR P+ DDK++ SWN L+I +F++A + LK++
Sbjct: 381 NKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--- 430
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF +
Sbjct: 431 -------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFL 476
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L++LYE +L +IE+ N+ +LF +E GG++ + +L+R KE +DGA
Sbjct: 477 WALIELYEASFDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGAT 536
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN+V+ + L L I D Y+ + F T +K M L A M ++
Sbjct: 537 PSGNAVASLTLNLLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNI 592
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
K + L ++ DF + + Y V D ++ E N ++
Sbjct: 593 SPVKEITLAYNEKDEDFYKFINELNNRYIPFSIVTLNDKSN--------EIEKINKNIKD 644
Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
DK +CQN++C P+TD
Sbjct: 645 KIAIKDKATVYICQNYACREPITD 668
>gi|335040507|ref|ZP_08533634.1| hypothetical protein CathTA2_2248 [Caldalkalibacillus thermarum
TA2.A1]
gi|334179587|gb|EGL82225.1| hypothetical protein CathTA2_2248 [Caldalkalibacillus thermarum
TA2.A1]
Length = 715
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 256/705 (36%), Positives = 365/705 (51%), Gaps = 65/705 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFEDE +A +LN+ FVSIKVDREERPD
Sbjct: 56 GEEAFEKARREDKPVFLSIGYSTCHWCHVMERESFEDEEIADILNNHFVSIKVDREERPD 115
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD +YM QAL G GGWPL++ + PD KP TY P E K+GR G K IL+K+ W
Sbjct: 116 VDAIYMAVCQALTGHGGWPLTIVMHPDQKPFFAATYLPKEGKWGRSGLKEILQKIHHLWL 175
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
R L ++G I+ + E S + EL + L Q +++D+ +GGFG A
Sbjct: 176 HDRKKLNEAGTNIIKAIQEMKSRPKGA-----ELTKEILHHAYAQFERTFDADYGGFGQA 230
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P +L + + TG+ + +M +L+ M +GGI+DH+G GF RYSV
Sbjct: 231 PKFPLPHSYLFLL---RYWQMTGE----PKALEMTEKSLRAMHRGGIYDHLGYGFARYSV 283
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DE+W VPHFEKMLYD LA Y +A+ T++ +Y + +I +Y++R M P G +SA
Sbjct: 284 DEKWLVPHFEKMLYDNALLAYSYTEAYQATRNPYYKQVTEEIFEYVQRVMTSPEGGFYSA 343
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG EG FYVWT +E+ ++L E A LF CD+ +++
Sbjct: 344 EDADS---EGV----EGKFYVWTPEEIFEVLEETEAELF-----------CDIYDVTEQG 385
Query: 358 NEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
N F+GKN+L ++ D A + G+ + L R KLF R KR PH DDK++ +
Sbjct: 386 N-FEGKNILHLIDVDLEQKAKQYGLSFAQLEQKLAAARHKLFLHREKRVHPHKDDKILTA 444
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+I++ A+AS R +Y+E+A AA+ I RHL D + RL
Sbjct: 445 WNGLMIAALAKASAAF----------------GRSDYLELARRAANMIERHLTDNEG-RL 487
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+R+G + ++DDYAF I L +LY L A L + E F D++ GG+
Sbjct: 488 LARYRDGEAHYLAYIDDYAFFIWALHELYFASLDASCLQQAKSLLDQALERFWDKQNGGF 547
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F + ++ KE +DGA PSGN V NLVR + S D YR+ AE L F
Sbjct: 548 FFYAKDAERLITNPKEIYDGATPSGNGVMAFNLVRHYLL---SGEDVYRETAEALLQAFG 604
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
++ + A +LS + +V+V K ++ M+ +Y V++
Sbjct: 605 QQINEYPSGHAFSLLALQLLS-GNHAELVIVEGKDRHTYDKMVETVQRAYLPLAVVLYKT 663
Query: 657 PADTEEMD-FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
+ ++ H A + F C NF+C PV
Sbjct: 664 REQNQRLNALAPAHQDKQAVDGQTTFYH-------CVNFACRQPV 701
>gi|451344787|ref|YP_007443418.1| hypothetical protein KSO_000140 [Bacillus amyloliquefaciens IT-45]
gi|449848545|gb|AGF25537.1| hypothetical protein KSO_000140 [Bacillus amyloliquefaciens IT-45]
Length = 689
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 256/700 (36%), Positives = 368/700 (52%), Gaps = 78/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F+++KVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIAGMLNDKFIAVKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R +E ++E
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+++ +
Sbjct: 165 AAHLEVKVHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ +T + Y I I+ +++R+M+ G FSA DAD TEG +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ KE+ ++LG E L+ + Y + GN + + PH F + ++E ++ +
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+L LE E R KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F+ P +++ +AE+A F+ RHL + R+ +R G K GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI G L+LYE G +L A L + ELF D GG+F T + ++L+R KE +DG
Sbjct: 482 LIWGYLELYEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS + + L+RL + + AE +VF+ ++ + +
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAH 598
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
++P +K +VL G K D + + A H PA T EH A +
Sbjct: 599 TMP-QKEIVLFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPDELAGI 645
Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTDPISLENLL 710
+ +F+A K +C+NF+C P TD N+L
Sbjct: 646 S--DFAAGYQMIDGKTTVYICENFACRRPTTDIDEAMNIL 683
>gi|333987397|ref|YP_004520004.1| hypothetical protein MSWAN_1186 [Methanobacterium sp. SWAN-1]
gi|333825541|gb|AEG18203.1| hypothetical protein MSWAN_1186 [Methanobacterium sp. SWAN-1]
Length = 700
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 263/718 (36%), Positives = 371/718 (51%), Gaps = 66/718 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVM ESFED VA+L+N+ FV +KVDREERPD
Sbjct: 38 GDEAFKKAEKEDKPIFLSIGYSTCHWCHVMAHESFEDPEVAELINEVFVPVKVDREERPD 97
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD++YM Q + G GGWPL++ ++PD KP GTYFP E +YG G K ++ V++ W
Sbjct: 98 VDRIYMDVCQIMTGTGGWPLTIIMTPDKKPFFAGTYFPKESRYGSTGLKDLILNVEEIWK 157
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ R + SG EQ+ L SS E+ L + LSK++D +GGFG
Sbjct: 158 ENRKDVLNSG----EQVFRVLK-DVSSTPRGGEIEAKILEKTYDTLSKTFDYEYGGFGDF 212
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
KFP P + +L + K+ TG MV TL M GGI+DH+G GFHRYSV
Sbjct: 213 QKFPTPHNLMFLLRYWKR---TGNKNAVH----MVEKTLDSMYMGGIYDHLGFGFHRYSV 265
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W VPHFEKMLYDQ ++ VY++AF T + Y I I Y+ R+M P G +SA
Sbjct: 266 DPGWVVPHFEKMLYDQALISMVYIEAFQATGNEEYKRIAEQIFKYVFRNMKSPEGGFYSA 325
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
EDAD TEG EG FY+WT KE+ D L + A L + + +K GN + +
Sbjct: 326 EDAD---TEGV----EGKFYLWTKKEIFDALDPDEAELICKIFNVKEAGNFEDETIG--- 375
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
E G N+L + A LG+ + + L R KLF R R P DDK++ W
Sbjct: 376 -EETGANILYLKSSIGELAEGLGISRRELEDKLETSRMKLFQNRETRVHPQKDDKILADW 434
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+I++ A+A++ D +Y + AE AA+FI + E RL
Sbjct: 435 NGLMITALAKAAQAF----------------DDPKYSKAAEDAANFILDKMCKEG--RLF 476
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
H +R+ + PG LDD+ F+I GLL+LYE K+L A++L E F D + GG++
Sbjct: 477 HRYRDNEAAIPGNLDDHTFMIWGLLELYEAVFNVKYLKKALKLNKILIEHFWDEKDGGFY 536
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T + VLL K+ +DGA PSGNSV + NL++LA I + + + E + F T
Sbjct: 537 FTANDSEHVLLWEKQTYDGALPSGNSVGIFNLIKLARITEDPELERRSIDLERA---FST 593
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
+++ + A D PS + VV+VG + D + M+ + + + NK + D
Sbjct: 594 QIRRAPIVHTHFLEAIDFKVGPSYE-VVIVGDPEADDTKKMIQSIRSHFIPNKVFLLKDE 652
Query: 658 -----ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ E ++E NA+ A +C SC P TD + NLL
Sbjct: 653 NVPDISEIAESLKYKEPIKGNAT------------AYICTEGSCKSPSTDVRKVLNLL 698
>gi|424826571|ref|ZP_18251427.1| hypothetical protein IYC_01504 [Clostridium sporogenes PA 3679]
gi|365980601|gb|EHN16625.1| hypothetical protein IYC_01504 [Clostridium sporogenes PA 3679]
Length = 682
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 248/694 (35%), Positives = 360/694 (51%), Gaps = 74/694 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN+ F+SIKVDREERPDVD +YM++ QA G GGWPL++
Sbjct: 56 TCHWCHVMERESFEDEDVAEILNNNFISIKVDREERPDVDNIYMSFCQAYTGSGGWPLTI 115
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG IL+ + W + + + +S +EQ+
Sbjct: 116 LMTPDKKPFFAGTYFPKWGKYNIPGIMDILKSINKLWHEDKSKILESSNRILEQIER--- 172
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N DEL + + A+ L ++DS++GGFG+ PKFP I +L Y+ KK E
Sbjct: 173 --FQDNHGEDELEEYIIEEAAQTLIDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKKDE 230
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++ TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 231 KV---------LDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 281
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ Y + IL+Y+++ M G +SAEDADS EG EG FY
Sbjct: 282 MAYTEAYEATKNPLYKVVTEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKFY 334
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
+WT KE+ DILGE F C L ++ N F+ KN+ LI+ +
Sbjct: 335 LWTKKEIIDILGEEDGAFY----------CKLYDITSRGN-FENKNIANLIQTDLKDVDN 383
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+K + L R KLF+ R KR PH DDK++ SWN L+I +F RA + K++
Sbjct: 384 NK---------DKLERIREKLFEYREKRIHPHKDDKILTSWNALMIIAFCRAGRSFKND- 433
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y+++A+ +A FI ++L DE L R+ GF+DDYAF
Sbjct: 434 ---------------NYIDIAKQSADFIIKNLMDENG-TLYARIRDEERGNEGFIDDYAF 477
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ L++LYE +L +IE+ ++ +LF +E GG++ + +++R KE +DG
Sbjct: 478 FLWALIELYEASFDIYYLEKSIEVADSMIDLFWHKEKGGFYLYSKNSEKLIVRPKEIYDG 537
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGN+V+ + L L I D Y+ + F +K M L A M
Sbjct: 538 AMPSGNAVASLALSLLYYITG---EDKYKNLVDEQFKFFAANIKSGPM-YHLFSVMAYMY 593
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+V K + L ++ F + + Y + ++I ++ E E+ N N
Sbjct: 594 NVSPVKEITLAYNEKDEAFYEFINEFNNRY-IPFSIITLNDKSNE----IEKINKNLKDK 648
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A DK +CQN++C P+TD +++L
Sbjct: 649 AP---IKDKTTVYICQNYACREPITDLEKFKSVL 679
>gi|311070619|ref|YP_003975542.1| hypothetical protein BATR1942_18470 [Bacillus atrophaeus 1942]
gi|310871136|gb|ADP34611.1| hypothetical protein BATR1942_18470 [Bacillus atrophaeus 1942]
Length = 687
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 251/695 (36%), Positives = 371/695 (53%), Gaps = 84/695 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ +E+++E
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSNTFANDREH--------VEEIAENA 164
Query: 140 SASASSNKLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
S S K P+ L + AL +QL +D+ +GGFG APKFP P M++Y +
Sbjct: 165 S-SHLQIKTPEGNGTLTKEALHRTFQQLMSGFDTVYGGFGQAPKFPMP---HMLMYLLRY 220
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ TG+ K TL MA GGI+DHVG GF RYS D+ W VPHFEKMLYD
Sbjct: 221 HQYTGQENALYNVTK----TLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNAL 276
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y +A+ +T+D Y +I I+ +++R+M G +SA DAD TEG EG
Sbjct: 277 LLTAYTEAYQVTQDSRYQHIVEQIITFIQREMTHEDGSFYSALDAD---TEGV----EGK 329
Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSS 373
+YVW+ E+ + LG E L+ Y + +GN F+G N+ LI
Sbjct: 330 YYVWSKDEIIETLGDELGELYCAIYNITSSGN------------FEGHNIPNLIHTKLDK 377
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A + + ++ LGE R+KL R R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 378 VKA-EFDLNEQEINKQLGEARQKLLKKRETRTYPHVDDKVLTSWNALMIAGLAKAAKVFQ 436
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+ EY+ +A++AA+FI + L + R+ +R+G K GF+DD
Sbjct: 437 A----------------PEYLNMAQAAAAFIEKKLIIDG--RVMVRYRDGEVKNKGFIDD 478
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAFL+ ++LYE G +L A +L +LF D++ GG++ T + ++L+R KE
Sbjct: 479 YAFLLWAYIELYEAGYDLAYLQKAKDLSAKMLDLFWDQKHGGFYFTGHDAEALLVREKEV 538
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
+DGA PSGNSV+ + L+RL + G S + AE + F+ ++ +
Sbjct: 539 YDGAVPSGNSVAAVQLLRLGQLT-GELS--LIEKAEKMFSAFKRDVEAYPSGHSFFMQSV 595
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+P +K +V+ G K +++++A ++ N +V+ EH
Sbjct: 596 LTHMMP-KKEIVIFGRKDDSQRQHIISALQQAFQPNFSVL------------VAEHPDQC 642
Query: 674 ASMARNNFSAD------KVVALVCQNFSCSPPVTD 702
+A F+AD K +C+NF+C P TD
Sbjct: 643 KDIA--PFAADYRIIDGKTTVYICENFACQQPTTD 675
>gi|197119298|ref|YP_002139725.1| hypothetical protein Gbem_2926 [Geobacter bemidjiensis Bem]
gi|197088658|gb|ACH39929.1| thioredoxin domain protein YyaL [Geobacter bemidjiensis Bem]
Length = 746
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 252/700 (36%), Positives = 366/700 (52%), Gaps = 56/700 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+ LN F++IKVDREERPDVD +YMT V A+ GGWPL+V
Sbjct: 98 TCHWCHVMEEESFEDEEVARFLNSNFIAIKVDREERPDVDTIYMTAVHAMGMQGGWPLNV 157
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F +PD KP GGTYFPP D G GF ++L+++++ + + D + +G QL+EA+
Sbjct: 158 FATPDRKPFYGGTYFPPRDYAGGIGFLSLLQRIRETYRQAPDRVTHAGV----QLTEAIR 213
Query: 141 ASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + E PQN + L E + +D++ GG APKF L L
Sbjct: 214 GMLAP--MGGEPPQNEISLERVIEAYQERFDAKNGGVVGAPKF------PSSLPLGLLLR 265
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D + G+ + M +TL+ MA GGI+D GGGFHRY+ D W +PHFEKMLYD +LA
Sbjct: 266 DHLRRGDKN-SLFMAQYTLRRMAAGGIYDQAGGGFHRYATDSAWLIPHFEKMLYDNARLA 324
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL+ + T D ++ + R+IL YL+RDM+ P G +SA DADS G ++EG F+
Sbjct: 325 AAYLEGYQATGDPQFAKVAREILRYLQRDMMSPQGAFYSATDADSLTESG--HREEGIFF 382
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
WT +E++ +LG E A + Y + GN F+G+++L A
Sbjct: 383 TWTPEELDAVLGTERARVVAACYGVTSEGN------------FEGRSILHREKSMQHLAE 430
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+L +P E+ +L E R +L+ R +RP P D+K++ SWNGL IS+FAR +L A
Sbjct: 431 ELMLPKEELERLLDEAREELYRARQRRPLPLRDEKILASWNGLAISAFARGGLVLNDPA- 489
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
++ A AA+FI + + ++ RL HS++ G +K GFLDDYAF
Sbjct: 490 ---------------LLDTARRAANFILQSMMSQE--RLCHSYQEGEAKGEGFLDDYAFF 532
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I+GL+DL+E WL A+E+ E F D E GG+F T ++ R K +DG
Sbjct: 533 IAGLIDLFEATGELPWLKRALEVAQQVQEQFEDSETGGFFMTGPRHEELISREKPAYDGV 592
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV ++NL+RL ++ + A+ +L F +L A+ M A D L
Sbjct: 593 IPSGNSVMIMNLLRLNALTG---EQWMLDQAQRALDAFSIQLASAPTALSEMLLALDYLQ 649
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
R+ V++ +L + N+ ++ E D E+ +
Sbjct: 650 DLPREIVIVAPQGKREAAGPLLEKLRGVFLPNRALVVFC-----EGDELEQAGELLPLVR 704
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
+A +C++ SC P +DP L E S
Sbjct: 705 EKKADGGLAMAYLCESRSCRRPTSDPEEFHRQLQETQSKV 744
>gi|435851537|ref|YP_007313123.1| thioredoxin domain protein [Methanomethylovorans hollandica DSM
15978]
gi|433662167|gb|AGB49593.1| thioredoxin domain protein [Methanomethylovorans hollandica DSM
15978]
Length = 717
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 246/687 (35%), Positives = 370/687 (53%), Gaps = 61/687 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED VA+L+N F+ IKVDREERPD+D VYM QA+ G GGWPL+
Sbjct: 65 STCHWCHVMEKESFEDPDVARLMNATFICIKVDREERPDIDSVYMAICQAITGRGGWPLT 124
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++P+ +P TY P + ++G PG ++ + W ++++ + Q+ +L AL
Sbjct: 125 ILMTPNKEPFFAATYIPKKSRFGNPGMLDLIPHIAKVWTQQQEDILQTA----RELKAAL 180
Query: 140 S---ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
S AS+ E+ + L QL ++D + GGFG APKFP P + +L + ++
Sbjct: 181 SPQMVQASAKSTGTEINEKTLHSGYSQLLSAFDWQAGGFGRAPKFPSPHNLTFLLRYWQR 240
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
TGK E +MV TL M GGI+DHVG GFHRYS D +W VPHFEKMLYDQ
Sbjct: 241 ---TGK----LEALQMVTKTLDGMRGGGIYDHVGFGFHRYSTDGQWLVPHFEKMLYDQAM 293
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y + F +T + + +I++Y+ RDM G + AEDADS EG EG
Sbjct: 294 LIMAYTEGFQVTGIEDHRQVAAEIIEYVLRDMCSAEGAFYCAEDADS---EGM----EGK 346
Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNC--DLSRMSDPHNEFKGKNVLIELNDSS 373
FY+W +E+ D+L E A L + Y + GN ++S +S +N+L
Sbjct: 347 FYLWKKEEIYDLLPLEVANLVCKVYDISSEGNYKEEISGIS------TRQNILHLARPMQ 400
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
+A +LG+ L++ L R+ LF R KR P DDKV+ WNGL+I++ +AS+
Sbjct: 401 EAAQELGISLDELKAKLEPARKILFAAREKRVHPSKDDKVLTDWNGLMIAALCKASRAF- 459
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+R EY + A A FI +H+ RL H +R+G + GFL+D
Sbjct: 460 ---------------ERPEYAQAASRTADFILQHM-SSHDGRLLHRYRDGEASISGFLED 503
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAFL+ GL++LY+ K+L A+ L + Q F+D E GG+F+T + ++L R K+
Sbjct: 504 YAFLVWGLIELYQATFEKKYLEHALRLNSLQIRDFMDVE-GGFFHTANDSETLLFRNKDL 562
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
+DGA PSGNSVSV+NL++L+ + + + + A S+ F ++ M MA A
Sbjct: 563 YDGAMPSGNSVSVLNLLKLSRLTGDTDLE---EKASTSMKAFSGQIDAMPMAYSQFLHAL 619
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
D + P+ + VV+ G + M++ A S+ N ++ + E+ + +
Sbjct: 620 DFTAGPAYE-VVIAGDPDDPNTREMISLAGRSFLPNMVLLLQGKNNIGEL---APYTKDM 675
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPV 700
++ RN +CQ +SCS P+
Sbjct: 676 SATDRN------ATVYICQGYSCSMPI 696
>gi|194017545|ref|ZP_03056156.1| YyaL [Bacillus pumilus ATCC 7061]
gi|194010817|gb|EDW20388.1| YyaL [Bacillus pumilus ATCC 7061]
Length = 687
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 253/688 (36%), Positives = 360/688 (52%), Gaps = 71/688 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+ Q + G GGWPL+V
Sbjct: 54 TCHWCHVMAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNV 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++PD KP GTYFP YGRPGF L +++DA+ RD + A L +
Sbjct: 114 FVTPDQKPFYAGTYFPKRSAYGRPGFIEALTQLRDAYHNDRDHIESLAEKATNNLRIKAA 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S L Q A+ QL S+D+ GGFGSAPKFP P M+ + + E T
Sbjct: 174 GQTEST-----LTQEAIHKAYYQLMSSFDTLHGGFGSAPKFPAP---HMLSFLMRYYEWT 225
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ V+ TL MA GGI+DHVG GF RYS DE+W VPHFEKMLYD L
Sbjct: 226 GQEN----ALYAVMKTLDGMANGGIYDHVGSGFSRYSTDEKWLVPHFEKMLYDNALLMEA 281
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A+ LT+ Y + ++ +++RDM+ PGG +SA DADS EG KEG +YVW
Sbjct: 282 YTEAYQLTQQPEYEKLVHRLIHFIKRDMMNPGGSFYSAIDADS---EG----KEGQYYVW 334
Query: 321 TSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+ E+ LGE LF Y++ GN + + + PH + +D AS S
Sbjct: 335 SKDEIMTHLGEDLGALFCAIYHITEEGNFEGANI--PH------TISTSFDDIKASFSID 386
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
L+ L E R L VR +RP P +DDKV+ SWN L+ISS A+A ++ +E
Sbjct: 387 DHALQSKLQ---EARHILQSVRQQRPAPLVDDKVLTSWNALMISSLAKAGRVFGAE---- 439
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
E + +A+ A SF+ HL Q RL +R G K GF++DYA ++
Sbjct: 440 ------------EAIRMAKQAMSFLETHLV--QHDRLMVRYREGDVKHLGFIEDYAHMLK 485
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+ LYE WL A + ELF D+E GG+F + + ++++R KE +DGA P
Sbjct: 486 AYMSLYEATFELAWLEKATAIAKNMFELFWDKEKGGFFFSGSDAEALIVREKEVYDGAMP 545
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCA---ADM 615
SGNS ++ L+ L+ + RQ+ +L +F+ D++ + P A +
Sbjct: 546 SGNSTALKQLLMLSRLTG-------RQDWLDTLEQMFKAFYVDVS-SYPSGHTAFLQGLL 597
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+++ ++++G E +L A L K + D T E E + A
Sbjct: 598 AQYATKREIIILGKNGDPQKEQLLQA------LQKRFMPFDIILTAETG---EELAKLAP 648
Query: 676 MARNNFSAD-KVVALVCQNFSCSPPVTD 702
+N + D K +C+N+SC P+T+
Sbjct: 649 FTKNYKTIDGKTTVYICENYSCRQPITN 676
>gi|435854108|ref|YP_007315427.1| thioredoxin domain protein [Halobacteroides halobius DSM 5150]
gi|433670519|gb|AGB41334.1| thioredoxin domain protein [Halobacteroides halobius DSM 5150]
Length = 681
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 247/701 (35%), Positives = 372/701 (53%), Gaps = 83/701 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF D+ VA +LN+ FVSIKVDREERPD+D +YM+ QA+ G GGWPL+
Sbjct: 53 STCHWCHVMERESFADQEVANVLNENFVSIKVDREERPDIDDIYMSVCQAMTGRGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA- 138
V ++PD +P GTYFP + K GRPG IL ++ W +++ + +S ++ + +
Sbjct: 113 VVMTPDKRPFFAGTYFPKQTKRGRPGLLKILDQITKKWSNQQEKILESSEELVQAIKQQD 172
Query: 139 ---LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+A+ SSN L D+L + A+ L S+D+++GGFGSAPKFP P + +L +
Sbjct: 173 MKKQAANFSSNDL-DKLVKEAV----SSLKSSFDAQYGGFGSAPKFPSPHNLMFLLRY-- 225
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
GK E +V TL M +GGI+DH+G GF RY+ DE+W PHFEKMLYD
Sbjct: 226 -----GKIHNDQEVLSIVEKTLDSMYQGGIYDHIGYGFSRYATDEKWLAPHFEKMLYDNA 280
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
L VYL+ + + + Y+ I +IL Y+ RDM G +SAEDADS EG +EG
Sbjct: 281 LLTIVYLEGYQVLEKEIYAKIAEEILAYINRDMTSSKGAFYSAEDADS---EG----EEG 333
Query: 316 AFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
+Y+W EV++ LG+ F + Y + P GN F GKN+ N
Sbjct: 334 KYYLWQPGEVKEALGDKLGSQFCQTYNIIPEGN------------FAGKNI---PNLIKT 378
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
KL + E + R+KLF R KR RP DDK++ +WNGL+I +FA+A KIL
Sbjct: 379 ERDKLKINHE-----FRKARKKLFLAREKRVRPAKDDKILTAWNGLMIVAFAKAGKIL-- 431
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
D++EY+ A+ AA FI +L + RL +R G + G+++DY
Sbjct: 432 --------------DKEEYLNYAKEAADFIWDNLIRKDDGRLLARYREGEADYLGYVNDY 477
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
AF I GL++LY+ +L A+ L F D+E GG++ + ++ R K
Sbjct: 478 AFYIWGLIELYQANFNANYLERALILNKDLIHFFWDQEDGGFYLYGSDGEKLITRPKRVR 537
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
DGA PSGNS++ +NL++L+ +V+ + SD +Q E+ F +++ A +
Sbjct: 538 DGALPSGNSIATLNLLKLSKLVSNQELSDMAQQQFEY----FYNQVRKAPRAYSAFLISV 593
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM----DFWEEH 669
P K V++V K + M+ ++ V+ D + +++ + +++
Sbjct: 594 LFNQQPG-KEVIIVKAKEETE---MIDIFQQKFNPFSVVVVKDTKNNDKLIELISYIKDY 649
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N + A VC++FSC PVT + L+
Sbjct: 650 QVKNG----------ETTAYVCEDFSCLAPVTSRDKFKELI 680
>gi|302037753|ref|YP_003798075.1| hypothetical protein NIDE2440 [Candidatus Nitrospira defluvii]
gi|300605817|emb|CBK42150.1| conserved protein of unknown function (modular protein) [Candidatus
Nitrospira defluvii]
Length = 1236
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 239/694 (34%), Positives = 358/694 (51%), Gaps = 64/694 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
++CHWCHVME ESFE+E +A+L+N FV IKVDREERPD+D++YM AL GGWP+
Sbjct: 56 SSCHWCHVMERESFENEAIARLMNHHFVCIKVDREERPDLDEIYMQATLALNRNQGGWPM 115
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+VFL+PD KP GTYFPPED++GRPGF T+L+K+ + W+K + A +L +
Sbjct: 116 TVFLTPDQKPFFAGTYFPPEDRWGRPGFPTLLKKIAEYWEKDHAGVVAQAATLTARLQDG 175
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A + P + + L + Q ++ +D++ GGFG APKFP + ++L+ + +
Sbjct: 176 SHAPS-----PTTVGEAELDMAVTQFAEDFDAKLGGFGGAPKFPPATGLSLLLHCYHRTK 230
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D + MV TL MA GGI+D +G GF RYS D+RW VPHFEKMLYD LA
Sbjct: 231 D-------PQTLTMVRTTLDAMAAGGIYDQIGDGFARYSTDDRWLVPHFEKMLYDNALLA 283
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY++AF +T D Y + + LDY+ ++M P G +SA DADS EG EG F+
Sbjct: 284 RVYVEAFQVTADPNYRRVACETLDYILKEMTSPEGGFYSATDADS---EGV----EGKFF 336
Query: 319 VWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT E+ +L E +Y + P GN ++ KNVL ++ A
Sbjct: 337 VWTPDEIRAVLSNEEDVRRICTYYDVTPAGN------------WEHKNVLHTAKPVASVA 384
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+LG+ +E + + L+ R+KR P LDDKVI +WNG++IS+ A A ++
Sbjct: 385 KELGLTVEDLQATIDRVKPLLYAARAKRVPPGLDDKVITAWNGMMISAMAEAGRV----- 439
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F+ P Y AE A F+ L + RL ++R G + +L+DYA+
Sbjct: 440 ----FDMP-------RYRAAAERACEFLLTTL-SKPDGRLLRTYRAGTAHLDAYLEDYAY 487
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
GL+D YE G ++L A+ L F D + GG+F T ++++R +E DG
Sbjct: 488 FAEGLIDTYEAGGHERYLSAAVRLAERILADFSDGQQGGFFTTATGHEALIVRSREGPDG 547
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGN+V+ L RL+ + +RQ A ++ + ++ A D+L
Sbjct: 548 ATPSGNAVAAAALARLSYHFG---REDFRQAAAGAVRAYGRQIARYPRAFAKSLIVVDLL 604
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ + ++G + + AA +Y N+ + + +E + +
Sbjct: 605 T-SGPVEIAVIGAPDDSNTVALRAAVSRTYIPNRVIASRESQQSE---------PTHPLL 654
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K VC+NF+C P+TDP L L
Sbjct: 655 HGKALVGGKSALYVCRNFACRRPITDPADLPTQL 688
>gi|163846817|ref|YP_001634861.1| hypothetical protein Caur_1244 [Chloroflexus aurantiacus J-10-fl]
gi|222524638|ref|YP_002569109.1| hypothetical protein Chy400_1363 [Chloroflexus sp. Y-400-fl]
gi|163668106|gb|ABY34472.1| protein of unknown function DUF255 [Chloroflexus aurantiacus
J-10-fl]
gi|222448517|gb|ACM52783.1| protein of unknown function DUF255 [Chloroflexus sp. Y-400-fl]
Length = 693
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 248/694 (35%), Positives = 367/694 (52%), Gaps = 62/694 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF D VA + N++F++IKVDREERPD+D +YM QAL G GGWPL+VF
Sbjct: 56 CHWCHVMAHESFADPEVAAVQNEYFINIKVDREERPDLDNIYMAAAQALTGRGGWPLNVF 115
Query: 82 LSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
PD P GTYFPP+ K R PG++ +L V +A+ +R + S +E +
Sbjct: 116 CLPDGTPFFAGTYFPPDAKAARYRMPGWRQVLLSVAEAYKTRRADVTASAHELLEHIK-- 173
Query: 139 LSASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+ LP+ LP + L A Q+ + +D ++GGFG APKFP+PV ++ +L
Sbjct: 174 ----LLTRPLPETLPLDEELLMAAAAQIGREFDPQYGGFGDAPKFPQPVVLEFLLR---- 225
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
T G+ + M+ TL+ MA+GG++D VGGGFHRYSVDERW VPHFEKMLYD
Sbjct: 226 ---THLRGDV-QALPMLQQTLEQMARGGMYDQVGGGFHRYSVDERWLVPHFEKMLYDNAL 281
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
LA VY A +T D F + I + Y+ RD+ P G FS+EDADS T GA+ +EGA
Sbjct: 282 LAEVYHLAAQVTGDTFLARIADETFTYMLRDLRHPDGAFFSSEDADSLPTPGASHAEEGA 341
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FYVWT E+ LG+ A+L +Y + GN F+G+++L ++A A
Sbjct: 342 FYVWTPDELRAALGDDAVLVGAYYGVTRQGN------------FEGRSILHVPRPAAAVA 389
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ LG+ +E+ + R L R +RPRP D+KVI +WN + I + A AS + +
Sbjct: 390 AMLGVSVERLEATVARARPILRTFRERRPRPFRDEKVITAWNAMAIRALAVASSRVPA-- 447
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y++ A A F+ +L + RL S+++G FLDDYA
Sbjct: 448 ----------------YLDAARQCADFLLTNLRRDDG-RLLRSWKDGRPGPAAFLDDYAL 490
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L++L+ G T++L AI+L + +LF D + G +F+T + P+++ R ++ D
Sbjct: 491 FCDALIELHAAGGDTRYLATAIDLADAMIDLFWDDQAGMFFDTGRDQPALVTRPRDLSDN 550
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSG+S + + L+RL +I + Y A +L LK + M CAAD+
Sbjct: 551 ATPSGSSAATVALLRLYAITGRER---YETRAMQTLQQTTPLLKRFPLGFGRMLCAADLA 607
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
P R+ + ++G + MLA A ++Y + P D + + +
Sbjct: 608 LGPLRE-LAIIGPPDHPVTQAMLAVARSAYRPRLVIARAMPDDPV--------VTLSPLL 658
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A +C+ F+C PVT P +L+ L
Sbjct: 659 NDRPMVDGQPTAYLCEQFACQMPVTTPEALQAQL 692
>gi|387929306|ref|ZP_10131983.1| hypothetical protein PB1_12859 [Bacillus methanolicus PB1]
gi|387586124|gb|EIJ78448.1| hypothetical protein PB1_12859 [Bacillus methanolicus PB1]
Length = 685
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 236/564 (41%), Positives = 326/564 (57%), Gaps = 53/564 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA+LLN+ FVSIKVDREERPD+D +YM Q + G GGWPLS
Sbjct: 53 STCHWCHVMERESFEDEEVARLLNERFVSIKVDREERPDIDSIYMNICQMMNGHGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP E +YG PGFK ++ ++ D + K RD + + + A E L
Sbjct: 113 VFMTPDQKPFFAGTYFPKESRYGVPGFKEVITQLHDQYMKNRDQIEKIASDAAEALKH-- 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SA SS +LP + L +QL+ S++S +GGFG APKFP P + +L + K
Sbjct: 171 SARESSAELPS---ADVLHKTYQQLAGSFNSFYGGFGDAPKFPIPHNLMFLLKYYKW--- 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TGK KMV TL MA GGI+DH+G GF RYSVD W VPHFEKMLYD L
Sbjct: 225 TGKEM----ALKMVEKTLVSMANGGIYDHIGFGFARYSVDVMWLVPHFEKMLYDNALLLY 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +TK+ Y I I++++ R+M G FSA DADS EG +EG +YV
Sbjct: 281 TYSEAYQVTKNSKYKEIAEQIIEFITREMTNEEGAFFSAIDADS---EG----EEGKYYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
W+ +E+ D+LG+ F Y + GN F+GKN+ LI N +
Sbjct: 334 WSKEEILDVLGDKDGEFFCRVYDITSGGN------------FEGKNIPNLIHTN-IVKTV 380
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
++ G+ LE+ L E R+KLF+ R +R PHLDDK++ SWN L+I+ A+A + ++
Sbjct: 381 AEAGLNLEEGKAKLEESRQKLFEKRQERVYPHLDDKILTSWNALMIAGLAKAGQAFQN-- 438
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
K ++E AE A FI L L +R+G SK +LDD+AF
Sbjct: 439 --------------KNHVEKAEKALRFIEEKLV--VNGELMARYRDGESKFRAYLDDWAF 482
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ LL+LYE ++L A + F D + GG++ T + ++++R K+ +DG
Sbjct: 483 LLWALLELYEATFSMEYLDKARNTAEKMKKHFWDEQDGGFYFTRSDGEALIVREKQVYDG 542
Query: 557 AEPSGNSVSVINLVRLASIVAGSK 580
A PSGNSV+ ++L+RL +K
Sbjct: 543 ALPSGNSVAAVSLLRLGHFTGETK 566
>gi|345020399|ref|ZP_08784012.1| hypothetical protein OTW25_03576 [Ornithinibacillus scapharcae
TW25]
Length = 685
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 256/714 (35%), Positives = 372/714 (52%), Gaps = 78/714 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVM ESFEDE VAKL+ND +++IKVDREERPD
Sbjct: 32 GEEAFEKAKQENKPIFLSIGYSTCHWCHVMAHESFEDEEVAKLINDHYIAIKVDREERPD 91
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD +YM Q + G GGWPL++F++PD P GTYFP E KYGRPG K L ++ +
Sbjct: 92 VDSIYMKVCQMMAGHGGWPLTIFMTPDKIPFYAGTYFPKESKYGRPGIKEALEQLHIKYT 151
Query: 119 KKRDMLAQSGAFAIEQLSEALSASA---SSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
+ +A E + EAL + S+N+L E A +QL + +D +GGF
Sbjct: 152 TDPEHIAD----VTESVREALDNTIREKSNNRLTIETVDQAF----QQLGRGFDFTYGGF 203
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
APKFP+P Q +L+ + +GK+ KMV TLQ MA GGI DH+G GF R
Sbjct: 204 WEAPKFPQP---QNLLFLMRYYHFSGKTA----ALKMVESTLQNMAAGGIWDHIGYGFAR 256
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
YS DE+W VPHFEKMLYD L VY + + +TK FY I I+ +++R+M G
Sbjct: 257 YSTDEKWLVPHFEKMLYDNALLLMVYTECYQITKKPFYKNIAEQIITFIKREMTSKDGAF 316
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMS 354
+SA DADS EG EG +YVW +E+ DILGE ++ Y + P GN
Sbjct: 317 YSAIDADS---EGV----EGKYYVWADEEIYDILGEDLGEIYTTTYGITPFGN------- 362
Query: 355 DPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 412
F+GKN+ LI N S A + + L + + L R L R KR PH+DDK
Sbjct: 363 -----FEGKNIPNLIRANLESV-AEEFDLTLSELTSQLETARLTLLQEREKRVYPHVDDK 416
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
V+ SWN ++I+ A+AS++ +++ +Y+ +A+ A SF+ ++ +
Sbjct: 417 VLTSWNAMMIAGLAKASRVFQNQ----------------DYVTLAKRALSFLEENIVVDG 460
Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
L +R G +K +LDDYA+LI ++LY+ +L A N ELF D
Sbjct: 461 D--LMARYREGETKYHAYLDDYAYLIWAYIELYQLEFDLTYLSKAKAQLNIMIELFWDPH 518
Query: 533 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
GG+F + + ++ KE +DGA PSGNSV+ + L ++AS+ + DY + E
Sbjct: 519 HGGFFFSGKNNEKLISNDKEIYDGATPSGNSVAALMLGQMASLTG--EVDYLDKINEMYS 576
Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN-KT 651
+E +K + V + +L+ K VV++GH +V + L Y N
Sbjct: 577 TFYEDMMKQPSAGVFFLQSL--LLTENPTKEVVVLGHDENV--QEFLNHVQDKYAPNIAL 632
Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPIS 705
++ + P E+ + + N M N + VC+NF+C P D I+
Sbjct: 633 LVAVTPGQLIEVAPF----AANYKMVNN-----QTTIYVCENFACQQPTNDIIA 677
>gi|421729533|ref|ZP_16168663.1| hypothetical protein WYY_00569 [Bacillus amyloliquefaciens subsp.
plantarum M27]
gi|407076503|gb|EKE49486.1| hypothetical protein WYY_00569 [Bacillus amyloliquefaciens subsp.
plantarum M27]
Length = 689
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 254/692 (36%), Positives = 364/692 (52%), Gaps = 78/692 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R +E ++E
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+++ +
Sbjct: 165 AAHLEVKIHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ +T + Y I I+ +++R+M+ G FSA DAD TEG +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ KE+ ++LG E L+ + Y + GN + + PH F + ++E ++ +
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+L LE E R KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F+ P +++ +AE+A F+ RHL + R+ +R G K GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI G L+LYE G +L A L ELF D GG+F T + ++L+R KE +DG
Sbjct: 482 LIWGYLELYEAGFHPSYLQKAKTLCTNMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS + + L+RL + + AE +VF+ ++ + +
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAH 598
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
++P +K +V+ G K D + + A H PA T EH A +
Sbjct: 599 TMP-QKEIVVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPDELAGI 645
Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTD 702
+ +F+A K +C+NF+C P TD
Sbjct: 646 S--DFAAGYQMIDGKTTVYICENFACRRPTTD 675
>gi|300855044|ref|YP_003780028.1| hypothetical protein CLJU_c18640 [Clostridium ljungdahlii DSM
13528]
gi|300435159|gb|ADK14926.1| conserved protein containing a thioredoxin domain [Clostridium
ljungdahlii DSM 13528]
Length = 675
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 248/702 (35%), Positives = 359/702 (51%), Gaps = 92/702 (13%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME SFED VA++LND F+SIKVDREERPD+D +YM Q++ G GGWPL++
Sbjct: 54 TCHWCHVMEKGSFEDTEVAEMLNDSFISIKVDREERPDIDSIYMNVCQSITGSGGWPLTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP ++ G G +IL +K AW R L + ++ L
Sbjct: 114 IMTPDQKPFFAGTYFPKNNRDGLMGLMSILDYIKKAWKNNRSELLNAS-------TQILD 166
Query: 141 ASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ +SN+ +E + ++ + +D +GGFG PKFP + +L + K +D
Sbjct: 167 SLKNSNETSNETINEDIFQKTFLNFKYDFDPTYGGFGDFPKFPSAHNLLFLLRYFYKTKD 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
S +MV TL CM KGGI+DH+G GF RYSVD +W VPHFEKMLYD L
Sbjct: 227 -------SSALEMVEKTLDCMRKGGIYDHIGFGFSRYSVDRKWLVPHFEKMLYDNALLII 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y++ F T + Y +IL Y+ RDM G +SAEDADS EG +EG FYV
Sbjct: 280 AYIETFQATGNKKYCKTAEEILSYVLRDMTSNEGGFYSAEDADS---EG----EEGKFYV 332
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ +E++DIL E + F ++ + GN F+GKN+L +N S
Sbjct: 333 WSEEEIKDILQEEDSGKFCSYFNVTKGGN------------FEGKNILNLINSS------ 374
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+P E + + CR KLF R KR P+ DDK++ SWNGL+I + + A+++L
Sbjct: 375 --IP-EDDMQFIENCREKLFAEREKRIHPYKDDKILTSWNGLMIGAMSIAARVL------ 425
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ +Y + A+ A FI ++L + RL +R+G + G+LDDY+FLI
Sbjct: 426 ----------NNSKYTKAAKKAVDFIYKNLV-KSDGRLLARYRDGEASFLGYLDDYSFLI 474
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++LYE T +L A+EL +LF D+E GG+F + ++ R KE +D A
Sbjct: 475 WGLIELYETTYSTDYLKKALELNEDLLKLFWDKENGGFFLYGNDGEKLITRPKEIYDSAI 534
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV+ +NL+RL+ + + + A+ F + A +
Sbjct: 535 PSGNSVATLNLLRLSHLTSSYD---FEDKAKQLFDAFSREINSFPRACSFSLISLLFSKS 591
Query: 619 PSRKHVVLVG----------HKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
P R+ +V G H + F N + +LNK + I P +D
Sbjct: 592 PIRQIIVSAGSNIEEGKQVVHMINEKF-NPFTISILYCNLNKDLSTISPIIKNYIDI--- 647
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
NN K +C+NF+C P+TD L +L
Sbjct: 648 ---NN-----------KTTTYICENFTCKKPITDINLLRKIL 675
>gi|444911449|ref|ZP_21231624.1| Thymidylate kinase [Cystobacter fuscus DSM 2262]
gi|444718207|gb|ELW59023.1| Thymidylate kinase [Cystobacter fuscus DSM 2262]
Length = 683
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 246/702 (35%), Positives = 369/702 (52%), Gaps = 78/702 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE +A+L+N+ F+++KVDREERPDVD++Y VQ + GGGWPL+
Sbjct: 48 SACHWCHVMAHESFEDEAIARLMNEGFINVKVDREERPDVDQLYQGVVQLMGQGGGWPLT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSE- 137
VFL+PDL P GGTYFPP+D+YGRPGF +LR + +AW R ++L+Q+ F E L E
Sbjct: 108 VFLTPDLVPFFGGTYFPPKDRYGRPGFPKVLRALSEAWATNRGELLSQAREFR-EGLGEL 166
Query: 138 ---ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
L A+ ++ K P+++ L L + D GGFG APKFP P+ + ++L
Sbjct: 167 ALHGLDAAPAALK-PEDIVSMGLSLL-----ERMDGVNGGFGGAPKFPNPMNVALVLRAW 220
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
++ + G+ ++ VL TL+ MA+GG++D +GGGFHRYSVDERW VPHFEKMLYD
Sbjct: 221 RR--EPGQDAL----KQAVLLTLEKMARGGVYDQLGGGFHRYSVDERWAVPHFEKMLYDN 274
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
QL ++Y +A + + + + +Y+RR+M G ++ +DAD TEG +E
Sbjct: 275 AQLLHLYAEAQQVEPRPLWRKVVEETAEYVRREMTDARGGFYATQDAD---TEG----EE 327
Query: 315 GAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
G F+VW ++V ++L E A L H+ + GN + G+ VL
Sbjct: 328 GRFFVWLPEQVREVLPPELAELALRHFRVTALGNFE-----------HGRTVLESAVSVE 376
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
+ A +L P+E+ + L E RR+LF+ R +R +P DDK++ WNGL+I A A ++
Sbjct: 377 SLAEELQRPVEEVASGLSEARRRLFEARERRVKPGRDDKILAGWNGLMIRGLAFAGRVF- 435
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
DR +++E A AA F+ L+D Q RL S++ G ++ PGF++D
Sbjct: 436 ---------------DRADWVESARKAADFVLAELWDGQ--RLSRSYQEGQARIPGFVED 478
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
Y L +GL LY+ ++L A L T + LF D E G Y +++
Sbjct: 479 YGDLAAGLTALYQATFEPRYLEAAEALVRTAETLFWDEERGAYLTAPRTQGDLVVATYAT 538
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D A PSG S V LA++ + + Y + E ++ +L+ M + AA
Sbjct: 539 FDNAFPSGASTLTEAQVALAALTSNKQ---YLELPERYVSRMGEQLRKNPMGYGHLALAA 595
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
D L V V G + +V E +LA + Y W+ +
Sbjct: 596 DAL-VDGAPSVTFAGTREAV--EPLLAVSRTVYAPTFGFT------------WKAPEAPV 640
Query: 674 ASMARNNF-----SADKVVALVCQNFSCSPPVTDPISLENLL 710
R F + A +C+NF+C PP+T+ +L L
Sbjct: 641 PPSMRETFLGREPVGGRAAAYLCRNFACEPPLTEAGALAKRL 682
>gi|224368664|ref|YP_002602826.1| hypothetical protein HRM2_15540 [Desulfobacterium autotrophicum
HRM2]
gi|223691380|gb|ACN14663.1| conserved hypothetical protein [Desulfobacterium autotrophicum
HRM2]
Length = 766
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 249/710 (35%), Positives = 388/710 (54%), Gaps = 57/710 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K R FL TCHWCHVME ESFE+E +A+ LN+ ++ +KVDREERPD
Sbjct: 88 GDEAFETARKLNRPVFLSVGYATCHWCHVMEEESFENEEIARYLNENYLCVKVDREERPD 147
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPE--DKYGRPGFKTILRKVKDA 116
+D +YM+ VQAL G GGWP++V+L+ D KP GGTYFPP D+ GF T+L K+ +
Sbjct: 148 IDSIYMSAVQALTGRGGWPMNVWLTCDRKPFYGGTYFPPRDGDRGADIGFLTLLEKLIQS 207
Query: 117 WDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG 176
+ + + +G + + +S + E QNA+ +SYDSRFGG
Sbjct: 208 FHAQDGRVENAGRQITAAIQQMMSPKPGTRLPGKETIQNAVSF----YRQSYDSRFGGLS 263
Query: 177 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
+PKFP + ++++L H++ + K + + +M+ +L MA GG++DHVGGGFHRY
Sbjct: 264 GSPKFPSSLPVRLLLRHNRNTFE--KVKQDTNILEMIDHSLAQMAGGGMYDHVGGGFHRY 321
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
S DE W VPHFEKMLYD LA VYL+A+ T + + + +IL Y+ +DM G +
Sbjct: 322 STDEHWLVPHFEKMLYDNALLAVVYLEAWQATDNADFKRVVNEILSYVIQDMTSADGAFY 381
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSD 355
SA DADS G +EG ++ WT +E++ ILG E++ + K +Y + T N
Sbjct: 382 SATDADSITPRG--HMEEGWYFTWTPEELDAILGKENSKIIKRYYSVGVTPN-------- 431
Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
F+ +++L + +AS L + EK I+ R L+ R+KRP P D+KV+
Sbjct: 432 ----FEKRHILHTTKSRAETASALNITEEKLAKIIETSRELLYLERNKRPAPLRDEKVLT 487
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
+WN L+IS+FARA L + Y++ A AA FI +LY + +R
Sbjct: 488 AWNALMISAFARAGFTLNNTV----------------YIDQAVRAARFIMENLYID--NR 529
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L S+++G ++ +L+DYAF I+ L+DLYE +WL A+EL + + DR+ G
Sbjct: 530 LFRSYKDGKARHNAYLEDYAFFIAALIDLYEATHDIEWLKKALELDDVLKTFYEDRKNGA 589
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YRQNAEHSLAV 594
+F T+ + +++ R K +D A PSGN+++++NL+RL S +DY Y+Q AE +L
Sbjct: 590 FFMTSSDHEALISREKPYYDNATPSGNAIAILNLLRLHSFT----TDYRYKQRAEKALKF 645
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
F RL A+ M A D + K ++++ D + L + + ++
Sbjct: 646 FSERLNTAPSALSEMLLAIDYY-FDNPKEIIVIAPTEKPDAGDCLLETFRNLFIPNRILM 704
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDP 703
+ AD ++ ++ +A+ + + K A VC+N +C P +DP
Sbjct: 705 V--ADEKQA----ADHAKIIPLAQGKKAINGKATAYVCENGTCKLPTSDP 748
>gi|153003852|ref|YP_001378177.1| hypothetical protein Anae109_0984 [Anaeromyxobacter sp. Fw109-5]
gi|152027425|gb|ABS25193.1| protein of unknown function DUF255 [Anaeromyxobacter sp. Fw109-5]
Length = 725
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 261/716 (36%), Positives = 366/716 (51%), Gaps = 79/716 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F +T R FL +TCHWCHVME ESFEDE +A++LN+ +V IKVDREERPD
Sbjct: 71 GEEAFAEARRTGRPVFLSVGYSTCHWCHVMEGESFEDEEIARVLNERYVPIKVDREERPD 130
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPED-KYGRP-GFKTILRKVKDA 116
VD +YMT VQ L GGGGWP+SV+L+P+ +P GGTYFP D G P GF +ILR++ D
Sbjct: 131 VDGLYMTAVQLLTGGGGWPMSVWLTPEKEPFFGGTYFPARDGDRGAPRGFLSILRELADL 190
Query: 117 WDKKRDMLAQSGAFAIEQLSEALSASAS-SNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
+ + + + + + + AL+ + +P + L ++D+ GG
Sbjct: 191 YARDAGRVQAATSSLVGAVRAALAPRGEPAASVPG---ADVLEAAFRGFRDAFDAAHGGL 247
Query: 176 GSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 234
APKFP + ++ +L YH + E +E +M TL+ MA GG+HD +GGGFH
Sbjct: 248 RGAPKFPSSLPVRFLLRYHRRARE--------AEALRMATVTLERMAAGGLHDQIGGGFH 299
Query: 235 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 294
RYS D W VPHFEKMLYD LA Y +A+ +T + + R LDYL R+M P G
Sbjct: 300 RYSTDATWLVPHFEKMLYDNALLAVAYAEAWQVTGRRELARVVRQTLDYLGREMTSPEGG 359
Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMS 354
++SA DADS EG +EG F+VW + E+ LG A F + GN
Sbjct: 360 LYSATDADS---EG----EEGRFFVWDAAELRQRLGADAERFMRFHGATDAGN------- 405
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
F+G+NVL + P E L R L+ R +RPRP D+K++
Sbjct: 406 -----FEGRNVL-----------HVPRPDEDEWEALAPQRALLYAAREERPRPLRDEKIL 449
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQT 473
WNGL IS+ A ++L E Y++ A SAA F+ R + D
Sbjct: 450 AGWNGLAISALAFGGRVLGEE----------------RYVKAAASAAEFVLGRMIVD--- 490
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
RL+ ++ +G + PGFLDD+AF+ GLLDLYE +WL A+EL + LF D G
Sbjct: 491 GRLRRAWLDGAAGVPGFLDDHAFVAQGLLDLYEATFDARWLEAAVELSERLEVLFGDPRG 550
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
G +F T + +L R K HDGAEPSG SV+++N +RL++ + D +R AE +L
Sbjct: 551 GAWFGTAADHERLLAREKPTHDGAEPSGASVALVNALRLSAF---TTDDRWRVRAEGALR 607
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
+ L + A M A D + +R+ VVLV + E LA S+ N+ +
Sbjct: 608 HYGRALAEHPSAFTEMLLAVDFATDVARE-VVLVWPEEGPSPEPFLAVLRRSFLPNRALA 666
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFS-ADKVVALVCQNFSCSPPVTDPISLEN 708
E A +A + +V A VC+ CS P P L +
Sbjct: 667 GAAEGAA------IERLGRVALVAAEKVALGGRVTAYVCERGQCSLPAIAPEKLAS 716
>gi|429507366|ref|YP_007188550.1| hypothetical protein B938_19420 [Bacillus amyloliquefaciens subsp.
plantarum AS43.3]
gi|429488956|gb|AFZ92880.1| hypothetical protein B938_19420 [Bacillus amyloliquefaciens subsp.
plantarum AS43.3]
Length = 689
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 253/692 (36%), Positives = 364/692 (52%), Gaps = 78/692 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIAGILNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R +E ++E
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+++ +
Sbjct: 165 AAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ +T + Y I I+ +++R+M+ G FSA DAD TEG +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ KE+ ++LG E L+ + Y + GN + + PH F + ++E ++ +
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+L LE E R KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F+ P +++ +AE+A F+ RHL + R+ +R G K GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI L+LYE G +L A L + ELF D GG+F T + ++L+R KE +DG
Sbjct: 482 LIWAYLELYEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS + + L+RL + + AE +VF+ ++ + +
Sbjct: 542 AVPSGNSATAVQLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 598
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
++P +K +V+ G K D + + A H PA T EH A +
Sbjct: 599 TMP-QKEIVVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPEELAGI 645
Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTD 702
+ +F+A K +C+NF+C P TD
Sbjct: 646 S--DFAAGYQMIDGKTTVYICENFACRRPTTD 675
>gi|384267593|ref|YP_005423300.1| hypothetical protein BANAU_3964 [Bacillus amyloliquefaciens subsp.
plantarum YAU B9601-Y2]
gi|380500946|emb|CCG51984.1| putative protein yyaL [Bacillus amyloliquefaciens subsp. plantarum
YAU B9601-Y2]
Length = 689
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 249/686 (36%), Positives = 361/686 (52%), Gaps = 66/686 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP KY RPGF +L + + + R +E ++E
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKYNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+++ +
Sbjct: 165 AAHLEVKIHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ +T + Y I I+ +++R+M+ G FSA DAD TEG +EG +
Sbjct: 278 LPAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ KE+ ++LG E L+ + Y + GN + + PH F + ++E ++ +
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
++L LE E R KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 387 NELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F+ P +++ +AE+A F+ RHL + R+ +R G K GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI L+LYE G +L A L + ELF D GG+F T + ++L+R KE +DG
Sbjct: 482 LIWAYLELYEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS + + L+RL + + AE +VF+ ++ + +
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 598
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
++P +K +V+ G K D + + A + T++ + D E +
Sbjct: 599 TMP-QKEIVVFGSKDDPDRKRFIEALQEHFTPAYTILAAEHPD--------ELKGISDFA 649
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTD 702
A K +C+NF+C P TD
Sbjct: 650 AGYQMIDGKTTVYICENFACRRPTTD 675
>gi|91772578|ref|YP_565270.1| hypothetical protein Mbur_0543 [Methanococcoides burtonii DSM 6242]
gi|91711593|gb|ABE51520.1| Protein of unknown function DUF255 [Methanococcoides burtonii DSM
6242]
Length = 703
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 233/684 (34%), Positives = 355/684 (51%), Gaps = 51/684 (7%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESF ++ VAK++ND FVSIKVDREERPD+D VYM Q + G GGWPL++
Sbjct: 56 TCHWCHVMAKESFRNKDVAKMMNDTFVSIKVDREERPDIDSVYMDICQKMNGSGGWPLTI 115
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++P+ P + TY P + +GR G I+ ++ W ++ + + + LSE
Sbjct: 116 IMTPEKVPFIAATYIPLKSGFGRKGMLEIIPWIEHLWKEEHNKIVEQTELIKTALSE--- 172
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S N +E+ + + L+ ++D+ GGFG++PKFP P I +L + K
Sbjct: 173 --KSENSHNEEVTEEIIHRTYTYLANNFDNENGGFGTSPKFPSPHNISYLLRYWK----- 225
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
++G + Q MV TLQ M KGGI+DH+G GFHRYS D W VPHFEKMLYDQ L
Sbjct: 226 -RTGNPTALQ-MVERTLQAMRKGGIYDHIGFGFHRYSTDSSWLVPHFEKMLYDQALLIIA 283
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A+ T YS +I++Y+ RDM P G + A DADS E EG FY W
Sbjct: 284 YTEAYQATNKEEYSNTANEIIEYILRDMTSPDGGFYCAGDADSEEV-------EGRFYTW 336
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
E+E IL E +F++ + ++P GN P+ GKN+L D + +
Sbjct: 337 ELSEIESILNREDHPIFRDAFNVRPEGNFLEESTHRPN----GKNILHLEKDLESIEKQY 392
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ ++ +I+ CR++LF R KR P DDK++ WNGL++++ + + +++ +
Sbjct: 393 NITRKEIDHIIERCRKQLFSTREKRIHPSKDDKILTDWNGLMLAALSISGRVMGN----- 447
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
K Y+++A+ A + E L H++ + GFLDDYAF
Sbjct: 448 -----------KRYIDIAKRNADLLISERMKENG-ELYHNYSSNKEPTIGFLDDYAFFTW 495
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GL++LYE +L A++L + E F D GG+F+T+ + ++L R KE +DGA P
Sbjct: 496 GLIELYEATFEVTYLAKALQLTDYMIENFKDTINGGFFHTSNKSETLLFRKKEVYDGAIP 555
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSV + NL++L+ + + + A + F + + M D+ P
Sbjct: 556 SGNSVEINNLLKLSKLTGNPELN---SEAIDTSNAFASTIYAMPFGYTHFIAGLDLALAP 612
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
S + +V+ G S D + ML + + KTVI + +E++ + S + +
Sbjct: 613 SVE-IVIAGELDSEDTQLMLNNINEEFIPGKTVIVKSEKNEKELERIAPYTSTLKTQNQ- 670
Query: 680 NFSADKVVALVCQNFSCSPPVTDP 703
K A VCQ C+ P TDP
Sbjct: 671 -----KATAYVCQGHECTLPTTDP 689
>gi|153939114|ref|YP_001390416.1| hypothetical protein CLI_1150 [Clostridium botulinum F str.
Langeland]
gi|384461487|ref|YP_005674082.1| hypothetical protein CBF_1122 [Clostridium botulinum F str. 230613]
gi|152935010|gb|ABS40508.1| conserved hypothetical protein [Clostridium botulinum F str.
Langeland]
gi|295318504|gb|ADF98881.1| conserved hypothetical protein [Clostridium botulinum F str.
230613]
Length = 680
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 245/693 (35%), Positives = 354/693 (51%), Gaps = 72/693 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 53 TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD P GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 113 LMTPDKNPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKVLESSNRILEQIER--- 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + + L ++D+++GGFG+ PKFP I +L Y+ KK
Sbjct: 170 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ +V TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I IL+Y+++ M G +SAEDADS EG EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKILNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFY 331
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+WT +E+ DILG E L+ + Y + GN F+ KN+ +N
Sbjct: 332 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVD 379
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LEK R KLF+ R KR P+ DDK++ SWN L+I +F++A + LK++
Sbjct: 380 NNKDKLEK-------IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND-- 430
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF
Sbjct: 431 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 475
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ L++LYE +L +IE+ ++ +LF +E GG++ + +L+R KE +DGA
Sbjct: 476 LWALIELYEASFDIYYLEKSIEVADSMIDLFWHKENGGFYLYSKNSEKLLVRPKEIYDGA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V+ + L L I D Y+ + F T +K M L A M +
Sbjct: 536 TPSGNAVASLALNLLYYITG---EDRYKYLVDKQFKFFATNIKSGPM-YHLFSVMAYMYN 591
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ K + L + DF + + Y V D ++ E N ++
Sbjct: 592 ILPVKEITLAYREKDEDFYKFINEVNNRYIPFSIVTLNDKSN--------EIEKINKNIK 643
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
DK +CQN++C P+TD + LL
Sbjct: 644 DKIAIKDKTTVYICQNYACREPITDLEEFKFLL 676
>gi|296330011|ref|ZP_06872495.1| hypothetical protein BSU6633_02824 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
gi|305676735|ref|YP_003868407.1| hypothetical protein BSUW23_20330 [Bacillus subtilis subsp.
spizizenii str. W23]
gi|296153050|gb|EFG93915.1| hypothetical protein BSU6633_02824 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
gi|305414979|gb|ADM40098.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii
str. W23]
Length = 695
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 245/692 (35%), Positives = 362/692 (52%), Gaps = 77/692 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 59 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 118
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 119 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 178
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 179 AAKSGEG-----LSKSAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYDHN 230
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 231 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 286
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 287 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 339
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
W+ +E+ LG+ +L+ + Y + GN F+GKN+ LI A
Sbjct: 340 WSKEEILKTLGDDLGMLYCQVYDITEEGN------------FEGKNIPNLIHTMQEQIKA 387
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G+ E+ L R++L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 388 DA-GLTKEELSLKLENARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ--- 443
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+Y+ +AE A +FI L + R+ +R+G K GF+DDYAF
Sbjct: 444 -------------EPKYLSLAEDAITFIENQLIIDG--RVMVRYRDGEVKNKGFIDDYAF 488
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ LDLYE +L A +L + LF D E GG++ T + ++++R KE +DG
Sbjct: 489 LLWAYLDLYEASFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDG 548
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV+ + L+RL V G S + AE +VF+ ++ + +
Sbjct: 549 AVPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDIEAYPSGHAFFMQSV-LK 604
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
V +K +V+ G + + A ++ N +++ EH +
Sbjct: 605 HVMPKKEIVIFGSADDPARKQITTALQKAFKPNDSIL------------VAEHPDQCKDI 652
Query: 677 ARNNFSAD------KVVALVCQNFSCSPPVTD 702
A F+AD K +C+NF+C P T+
Sbjct: 653 AP--FAADYRIIDGKTTVYICENFACQQPTTN 682
>gi|170761713|ref|YP_001786452.1| thymidylate kinase [Clostridium botulinum A3 str. Loch Maree]
gi|169408702|gb|ACA57113.1| thymidylate kinase [Clostridium botulinum A3 str. Loch Maree]
Length = 682
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 244/685 (35%), Positives = 351/685 (51%), Gaps = 72/685 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+ LN F+SIKVDREERPDVD +YM + QA G GGWPL++
Sbjct: 55 TCHWCHVMERESFEDEEVAEALNKNFISIKVDREERPDVDNIYMNFCQAYTGSGGWPLTI 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP KY PG +LR + + W + ++ + +S EQ+
Sbjct: 115 IMTPDKKPFFAGTYFPKWGKYNIPGIMDVLRSISNLWREDKNKILESSNRISEQIER--- 171
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + + L ++D+++GGFG+ PKFP I +L Y+ KK
Sbjct: 172 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 227
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ ++ TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 228 -------DKKILDVINKTLTNMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I IL+Y+++ M G +SAEDADS EG EG FY
Sbjct: 281 MAYTEAYEATKNPLFKDITEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKFY 333
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+WT +E+ DILG E L+ + Y + GN F+ KN+ +N +
Sbjct: 334 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKTVD 381
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LEK R KLF+ R KR PH DDK++ SWN L+I +F++A + LK++
Sbjct: 382 NNKDKLEK-------IREKLFEYREKRIHPHKDDKILTSWNALMIVAFSKAGRSLKND-- 432
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF
Sbjct: 433 --------------NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFF 477
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ L++LYE +L +IE+ ++ +LF +E GG++ + +L+R KE +DGA
Sbjct: 478 LWALIELYEASFDIYYLEKSIEVADSMIDLFWHKESGGFYLYSKNSEKLLVRPKEIYDGA 537
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V+ + L L I D Y+ + F + +K M L A M +
Sbjct: 538 TPSGNAVASLALNLLYYITG---EDRYKDLVDKQFKFFASNIKSGPM-YHLFSVMAYMYN 593
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
V K + L + DF + + Y V D ++ E N ++
Sbjct: 594 VLPVKEITLAYREKDEDFYKFINEVNNRYIPFSIVTLNDKSN--------EIEKINKNIK 645
Query: 678 RNNFSADKVVALVCQNFSCSPPVTD 702
DK +CQN++C P+TD
Sbjct: 646 DKIAIKDKATVYICQNYACREPITD 670
>gi|73667810|ref|YP_303825.1| hypothetical protein Mbar_A0261 [Methanosarcina barkeri str.
Fusaro]
gi|72394972|gb|AAZ69245.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 711
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 245/705 (34%), Positives = 361/705 (51%), Gaps = 54/705 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVM ESFEDE +A+L+N FV IKVDREERPD
Sbjct: 47 GEEAFEKARKENKPIFLSIGYSTCHWCHVMAHESFEDEEIARLMNRAFVCIKVDREERPD 106
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYMT Q + G GGWPL++ ++PD+KP GTY P ++ + G ++ ++++ W+
Sbjct: 107 IDNVYMTVCQIILGRGGWPLNIIMTPDMKPFFAGTYIPKNSRFSQTGMLELVPRIEEIWN 166
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++ + +S + +S A + ++ + E+L S+D+ +GGFG A
Sbjct: 167 RQHTEVLESADKITSTIQNMISEPAGEG-----IGESIMEEAYEELLTSFDNEYGGFGRA 221
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP +I +L + + +SG E MV +TL+ M +GGIHDH+G GFHRYS
Sbjct: 222 PKFPTSHKIFFLLRYWR------RSGN-PEALHMVEYTLENMYRGGIHDHLGSGFHRYST 274
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W VPHFEKMLYDQ +A Y + + +T Y ILDY+ RD+ G +
Sbjct: 275 DNVWIVPHFEKMLYDQALIATAYTEIYQVTGKRLYKEAAEGILDYVLRDLTSQEGGFYCG 334
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
EDAD EG +EG +Y+WT +EV +L E + L + + L TGN + +
Sbjct: 335 EDAD---VEG----EEGKYYLWTLEEVRTVLSPEESELITKVFNLSETGNFE----EEIR 383
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
G N+ + A++L +P + + + + KL R KR RP DDK++ W
Sbjct: 384 GRKTGTNIFYMPRSLESLAAELNIPADDVDSRVKTAKAKLLLARDKRKRPAKDDKILTDW 443
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+I++ A+ F G ++ Y++ AE AA FI + LY+ RL
Sbjct: 444 NGLMIAALAKG--------------FQAFGEEK--YLKAAEKAADFILKVLYNPD-RRLL 486
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
H +R+G + G DDYAFLI GLL+LYE G +L A+ L E F D GG F
Sbjct: 487 HRYRDGKTGISGTADDYAFLIHGLLELYEAGFKLDYLKAALCLNREFLEHFWDPIQGGLF 546
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T + +++ R KE D A PSGNS+ ++NL+RL+ I A S+ + Q E + F
Sbjct: 547 FTADDSEALIFRKKEFSDAAIPSGNSIEMLNLLRLSRITADSELEDRAQGLERA---FSK 603
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
++ + A D P+ + VV+VG S D ML + NK +I
Sbjct: 604 LIQKIPSGYTQFLSALDFGLGPAYQ-VVIVGEHESPDTGQMLEELWTYFIPNKVLIFRPE 662
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
E+ ++ + K A VCQN+ C P T+
Sbjct: 663 GKDPEITKLAKYTEGQVPI------DGKATAYVCQNYQCQLPTTE 701
>gi|385266996|ref|ZP_10045083.1| hypothetical protein MY7_3797 [Bacillus sp. 5B6]
gi|385151492|gb|EIF15429.1| hypothetical protein MY7_3797 [Bacillus sp. 5B6]
Length = 689
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 253/692 (36%), Positives = 363/692 (52%), Gaps = 78/692 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R +E ++E
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQH--------VEDIAENA 164
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+++ +
Sbjct: 165 AAHLEVKVHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ +T + Y I I+ +++R+M+ G FSA DAD TEG +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ KE+ ++LG E L+ + Y + GN + + PH F + ++E + +
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--GTGLTG 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+L LE E R KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 434
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F+ P +++ +AE+A F+ RHL + R+ +R G K GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI L+LYE G +L A L + ELF D GG+F T + ++L+R KE +DG
Sbjct: 482 LIWAYLELYEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS + + L+RL + + AE +VF+ ++ + +
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 598
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
++P +K +V+ G K D + + A H PA T EH A +
Sbjct: 599 TMP-QKEIVVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPEELAGI 645
Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTD 702
+ +F+A K +C+NF+C P TD
Sbjct: 646 S--DFAAGYQMIDGKTTVYICENFACRRPTTD 675
>gi|442804077|ref|YP_007372226.1| N-acylglucosamine 2-epimerase family protein [Clostridium
stercorarium subsp. stercorarium DSM 8532]
gi|442739927|gb|AGC67616.1| N-acylglucosamine 2-epimerase family protein [Clostridium
stercorarium subsp. stercorarium DSM 8532]
Length = 679
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 248/687 (36%), Positives = 366/687 (53%), Gaps = 77/687 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA +LN FV+IKVDREERPD+D +YMT+ QA+ G GGWPL+
Sbjct: 57 STCHWCHVMERESFEDEEVADILNKHFVAIKVDREERPDIDHIYMTFCQAITGHGGWPLT 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTYFP D++G PG TIL+ AW++ + L + G EQ+ ++
Sbjct: 117 IIMTPDKKPFFAGTYFPKNDRHGMPGLVTILKSAHRAWEENKKDLERLG----EQILNSV 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S ++ + L + + +QL S+D +GGFG+APKFP P + +L +
Sbjct: 173 -YSEDNDYQHEVLSETIIDDIYKQLESSFDPVYGGFGNAPKFPAPHNLLFLLRYWY---- 227
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+GE + +MV TL M KGGI+DH+G GF RYS D +W +PHFEKMLYD LA
Sbjct: 228 --ATGE-KKALEMVEKTLDSMHKGGIYDHIGFGFCRYSTDRKWLIPHFEKMLYDNALLAM 284
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ TK Y+ I +I Y+ RDM P G +SAEDADS EG EG FY
Sbjct: 285 AYSEAYQATKKDKYARIAAEIYKYIERDMTSPEGAFYSAEDADS---EGV----EGFFYT 337
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +EV +LG E F + + P+GN F+G+N+ +N + +
Sbjct: 338 WTYEEVMSVLGDEDGKRFCGIFDITPSGN------------FEGRNIPNLINADPSDSDF 385
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ + CR+KLF+ R KR RP DDK++ SWN L+ +S A +ILK
Sbjct: 386 IEI-----------CRKKLFETREKRIRPFKDDKILTSWNALMAASLAVGGRILKD---- 430
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ +A+ A SFI+ L E RL +R+G + P FLDDYA+L
Sbjct: 431 ------------MNLINMAKKAVSFIKAKLVREDG-RLLARYRDGSADIPAFLDDYAYLQ 477
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
++LY+ +L+ A+ + + LFLD E GG+F + ++ R K+ +DGA
Sbjct: 478 WAYIELYQSTHEPGYLIDAVSINEEINGLFLDDEKGGFFFYGNDAERLITRPKDAYDGAM 537
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV +NL++L+ I Y + E+ + F + + M +
Sbjct: 538 PSGNSVMAMNLLKLSQITGDLS---YSDSFENQIDAFSGEISQNPLGYVYMLTSFLGYIQ 594
Query: 619 PSRKHVVLVGHKSS---VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
P ++ V LV +S + F N++ + + TV+ + + + ++ H + +
Sbjct: 595 PDQR-VFLVSDESESRLMPFINVINENYRPF----TVLILYGSRYKRLEDVIPHIKDYTA 649
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
A K A VC+NF+C+ PV+D
Sbjct: 650 ------PAGKTAAYVCENFTCNEPVSD 670
>gi|295695073|ref|YP_003588311.1| hypothetical protein [Kyrpidia tusciae DSM 2912]
gi|295410675|gb|ADG05167.1| protein of unknown function DUF255 [Kyrpidia tusciae DSM 2912]
Length = 716
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 242/608 (39%), Positives = 328/608 (53%), Gaps = 52/608 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED VA+LLN FV+IKVDREERPDVD +YM QAL G GGWPL+
Sbjct: 53 STCHWCHVMERESFEDPEVAELLNRHFVAIKVDREERPDVDHLYMAACQALTGQGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P GTYFP +YGRPG +L +V W+K D + +G Q+ EAL
Sbjct: 113 VFLTPEKEPFYAGTYFPKRSRYGRPGLMELLTRVAQLWEKGADRVKDAGRHLTGQIGEAL 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A E+ L EQL SYD FGGFG APKFPRP ++ +L + +
Sbjct: 173 GRAAQG-----EVDAGTLTRAFEQLLASYDHTFGGFGHAPKFPRPHDLLFLLRYGVR--- 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+G+ E MV TL+ M +GGI DHVG GF RYS D RW +PHFEKMLYD L
Sbjct: 225 SGR----REAFDMVQGTLEGMRRGGIWDHVGFGFARYSTDRRWLIPHFEKMLYDNALLVL 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+A+ D ++ R+I+ Y+RR+M PGG +SAEDADS EG +EG FYV
Sbjct: 281 TYLEAYQALGDQRWAQTAREIVTYVRREMTDPGGGFYSAEDADS---EG----EEGKFYV 333
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASAS 377
WT +E+ + +G E + ++ + GN + G++VL E++ D A
Sbjct: 334 WTPQEITEAVGPEDGEVLCRYFGVTEEGNFE-----------GGRSVLNEIDTDVDLLAR 382
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+LGM E+ + L VR +R PH DDK++ +WNGL+I++ AR +++L
Sbjct: 383 ELGMTPEEIDRKVRRGLEILHSVRDRRVHPHKDDKILTAWNGLMIAALARGARVLGD--- 439
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+Y+ A AA ++ R L + RL +R+G + G+LDDYAF
Sbjct: 440 -------------ADYLVSARRAAEWLWRTL-RQGDGRLLARYRDGEAGILGYLDDYAFY 485
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I GLL+LY+ WL AI L LF D + GG F T + ++ R K DGA
Sbjct: 486 IWGLLELYQADGDVAWLRRAIRLAQDVRTLFWDEKEGGCFLTGSDAEALWSRPKTAEDGA 545
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV ++L+ L + + + AE L F + A D
Sbjct: 546 LPSGNSVLALDLLWLGRLTGDPA---WERWAEAQLRAFAGAVSRYPAGYTFFLTAWDFAL 602
Query: 618 VPSRKHVV 625
PS + VV
Sbjct: 603 GPSEEIVV 610
>gi|421076735|ref|ZP_15537717.1| hypothetical protein JBW_0882 [Pelosinus fermentans JBW45]
gi|392525347|gb|EIW48491.1| hypothetical protein JBW_0882 [Pelosinus fermentans JBW45]
Length = 628
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 250/686 (36%), Positives = 358/686 (52%), Gaps = 65/686 (9%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME E FED+ VA LLN F++IKVDREERPDVD +YM+ QAL G GGWPL++ ++PD K
Sbjct: 1 MERECFEDQEVADLLNQHFIAIKVDREERPDVDGIYMSVCQALTGQGGWPLTIIMAPDKK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP K GR G +L + W+K R + ++G + L S
Sbjct: 61 PFFAGTYFPKHRKMGRMGLLELLTTLHQHWEKNRSEILKAGNEIVNILQRPKPPSGEGQI 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
D L Q L +L SYD ++GGFGSAPKFP P +I +L + + ++
Sbjct: 121 GEDLLKQAYL-----ELENSYDPQYGGFGSAPKFPTPHKITFLLRYWQHFKE-------P 168
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
+ MV TL M +GGI+DH+G GF RYS D++W VPHFEKMLYD L YL+A+
Sbjct: 169 KALAMVEKTLMSMWQGGIYDHLGYGFARYSTDQKWLVPHFEKMLYDNALLCTSYLEAYQC 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T + ++ I DIL Y+ RDM+ G +SAEDADS EG EG FYV+T K+V +
Sbjct: 229 TGNQEFARIAEDILTYVMRDMMDKNGGFYSAEDADS---EGV----EGKFYVFTRKQVVE 281
Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
ILG E LF + Y++ GN + S H G+N+ A + +E
Sbjct: 282 ILGEEEGALFADFYHISSHGNFEHG-TSILH--LIGRNL-------EEYARVVNKTVENL 331
Query: 387 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 446
+L + R KL+ VR R P+ DDK++ +WNGL+I++FA+A+++LK
Sbjct: 332 SEVLKKGREKLYQVREARIHPYKDDKILTAWNGLMIAAFAKAARVLK------------- 378
Query: 447 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 506
+ +Y +VAE +FI L RL +R G + +LDDYAFL+ L+++YE
Sbjct: 379 ---QSKYAKVAEQGIAFIYEKLMGSNG-RLLARYREGEAAHLAYLDDYAFLLMALIEVYE 434
Query: 507 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 566
+L A L ELF DR GG++ + ++ R KE +DGA PSGNSV+
Sbjct: 435 TTCNDYYLQQAAILAKDMGELFGDRTEGGFYFYGNDGEELIARPKEIYDGAIPSGNSVAA 494
Query: 567 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
L +LA + ++ + AE L F + A A D + K +V+
Sbjct: 495 FALQKLADM---TEDRSFSDTAERLLGHFAGEVSRYAAGYTYFMMAVDYYLADNTK-IVI 550
Query: 627 VGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKV 686
VG K + D ++M VI+ + M F++ H+ N + K
Sbjct: 551 VGDKEAADTKSMF-----------DVINNCFLPSAAMRFYDRHSRENVEYKEID---HKA 596
Query: 687 VALVCQNFSCSPPVTDPISLENLLLE 712
A +C+NF+C PP+T+ L NLL++
Sbjct: 597 TAYICKNFACQPPITNVEKLRNLLMK 622
>gi|253699928|ref|YP_003021117.1| hypothetical protein GM21_1299 [Geobacter sp. M21]
gi|251774778|gb|ACT17359.1| protein of unknown function DUF255 [Geobacter sp. M21]
Length = 750
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 250/698 (35%), Positives = 365/698 (52%), Gaps = 56/698 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE +A+ LN F++IKVDREERPDVD VYMT V A+ GGWPL++
Sbjct: 98 TCHWCHVMEEESFEDEEIARFLNANFIAIKVDREERPDVDTVYMTAVHAMGMQGGWPLNI 157
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F +P+ KP GGTYFPP D G GF ++LR++++ + + D + +G QL+EA+
Sbjct: 158 FATPERKPFYGGTYFPPSDYAGGIGFLSLLRRIRETYQQAPDRVTHAGL----QLTEAIR 213
Query: 141 ASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + E P+ + L E + +D++ GG APKF L L
Sbjct: 214 GILAP--MGGEPPEKEISLERVIEAYQERFDAKNGGVVGAPKF------PSSLPLGLLLR 265
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D + GE + M +TL+ MA GGI+D GGGFHRY+ D W +PHFEKMLYD +LA
Sbjct: 266 DYLRRGEKN-SLFMAQYTLRRMAAGGIYDQAGGGFHRYATDSTWLIPHFEKMLYDNARLA 324
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL+ + T D ++ + R+IL YL+RDM+ P G +SA DADS G ++EG F+
Sbjct: 325 AAYLEGYQATGDRHFAQVAREILRYLQRDMMSPEGAFYSATDADSLTESG--HREEGIFF 382
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
WT +E++ LG E A + Y + GN F+G+++L A
Sbjct: 383 TWTPEELDAALGAERARVVAACYGVTDEGN------------FEGRSILHREKSMQHLAE 430
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+L +P E+ +L E R +L+ R +RP P D+K++ SWNGL IS+FAR +L + A
Sbjct: 431 ELMLPKEELERLLDEAREELYLARQRRPLPLRDEKILASWNGLAISAFARGGLVLNAPA- 489
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
++ A AA+F+ ++ ++ RL HS++ G +K GFLDDYAF
Sbjct: 490 ---------------LLDTARGAANFMLENMMSQE--RLCHSYQEGEAKGEGFLDDYAFF 532
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I+GL+DL+E WL A+E E F D E GG+F T ++ R K +DG
Sbjct: 533 IAGLIDLFEATGELPWLKRALEQARQVQEQFEDSETGGFFMTGPHHEELISREKPAYDGV 592
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV ++NL+RL ++ A+ +L F T+L A+ M A D L
Sbjct: 593 IPSGNSVMIMNLLRLNALTGEQGMP---DQAQRALDAFSTQLASAPTALSEMLLALDYLQ 649
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
R+ V++ +L + N+ ++ E D E+ +
Sbjct: 650 DVPREIVIVAPQGKREAAGPLLEKLRGVFLPNRALVVFC-----EGDELEQAGELLPLVR 704
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
+ +A +C++ SC P +DP L E S
Sbjct: 705 EKKADGGRAMAYLCESRSCRRPTSDPEEFHRQLQETRS 742
>gi|310641971|ref|YP_003946729.1| cellulase catalitic domain protein and a thioredoxin domain protein
[Paenibacillus polymyxa SC2]
gi|386040955|ref|YP_005959909.1| hypothetical protein PPM_2265 [Paenibacillus polymyxa M1]
gi|309246921|gb|ADO56488.1| cellulase catalitic domain protein and a thioredoxin domain protein
[Paenibacillus polymyxa SC2]
gi|343096993|emb|CCC85202.1| hypothetical protein PPM_2265 [Paenibacillus polymyxa M1]
Length = 691
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 250/695 (35%), Positives = 357/695 (51%), Gaps = 64/695 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED+ VA++LN +VSIKVDREERPDVD +YM+ + + G GGWPL+
Sbjct: 53 STCHWCHVMERESFEDQEVAEVLNQDYVSIKVDREERPDVDHIYMSICETMTGHGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEA 138
+ ++PD KP GTY P E K+GR G +L KV W ++ D L + S E +
Sbjct: 113 IMMTPDQKPFFAGTYLPKEQKFGRVGLLELLGKVGIRWKEQPDELMELSEQVLTEHERQD 172
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L A EL L + S ++D +GGFG APKFP P + +L +++
Sbjct: 173 LLAGYRG-----ELDDQCLNKAFHEYSHTFDHEYGGFGEAPKFPSPHNLSFLLRYAQH-- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG + +MV TL M++GGI+DHVG GF RYSVDE+W VPHFEKMLYD LA
Sbjct: 226 -TGN----QQALEMVEKTLDAMSRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNALLA 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ +T Y I I Y+ RDM GG +SAEDADS EG +EG FY
Sbjct: 281 ITYTEAWQVTGKRLYRQITEQIFTYIARDMTDAGGAFYSAEDADS---EG----EEGRFY 333
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
VW+ E++ +LG E A F + Y + P GN F+G N+ LI++N A
Sbjct: 334 VWSDSEIKAVLGDEDASFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAY 380
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
+K + + + E + KLF R +R P DDK++ SWNGL+I++ A+A +
Sbjct: 381 GNKHDLTEPELEQRVSELKDKLFTAREQRVHPQKDDKILTSWNGLMIAALAKAGQ----- 435
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
G R Y E A A +F+ HL E RL +R+G + G++DDYA
Sbjct: 436 ---------AFGDTR--YTEQARKAETFLWNHLRREDG-RLLARYRDGQAAYLGYVDDYA 483
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F + GL++LY+ ++L A+ L +LF D E G F T + ++ R KE +D
Sbjct: 484 FYVWGLIELYQATFDVQYLQRALTLNQNMIDLFWDEERDGLFFTGSDSEQLISRPKEIYD 543
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNS++ N VRLA + ++ + Y A F + + A +
Sbjct: 544 GAIPSGNSIAAHNFVRLARLTGETRLEDY---AAKQFKAFGGMVAHYPSGHSALLSAL-L 599
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ +V+VG ++ + A + N VI D E +
Sbjct: 600 YATGKTSEIVIVGQRNDPQTAQFVQEVQAGFRPNMVVIFKDKGQPEIAEI-------APY 652
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + K VC++F+C PVT L+++L
Sbjct: 653 IHDYDLVDGKPAVYVCEHFACQAPVTHIDDLKHML 687
>gi|374324300|ref|YP_005077429.1| hypothetical protein HPL003_22410 [Paenibacillus terrae HPL-003]
gi|357203309|gb|AET61206.1| hypothetical protein HPL003_22410 [Paenibacillus terrae HPL-003]
Length = 631
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 253/692 (36%), Positives = 361/692 (52%), Gaps = 74/692 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFEDE VA+LLN +VSIKVDREERPDVD +YM+ Q + G GGWPL++ ++PD K
Sbjct: 1 MERESFEDEEVAELLNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLTILMTPDHK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTY P E K+GR G +L KV W ++ D L +E + L+ +K
Sbjct: 61 PFFAGTYLPKEQKFGRVGLMELLPKVAARWKEQPDEL-------VELSEQVLTEHERHDK 113
Query: 148 LPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
L EL +++L Q S ++D +GGFG APKFP P + +L +++ TG
Sbjct: 114 LASYQGELDEHSLNKAFHQFSYAFDKDYGGFGEAPKFPSPHNLSFLLRYAQH---TGN-- 168
Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
+ +M TL M +GGI+DHVG GF RY+VDE+W VPHFEKMLYD LA Y +A
Sbjct: 169 --QQALEMAEKTLDAMYRGGIYDHVGMGFSRYAVDEKWLVPHFEKMLYDNALLAIAYTEA 226
Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
+ +T Y I I Y+ RDM GG +SAEDADS EG +EG FYVW E
Sbjct: 227 WQVTGKELYRRIAEQIFTYIARDMTDAGGAFYSAEDADS---EG----EEGKFYVWDESE 279
Query: 325 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGM 381
V ILG+ A F + Y + P GN F+G N+ LI++N A K +
Sbjct: 280 VRAILGDKDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGIKHDL 326
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
++ E R KLF R +R PH DDK++ SWNGL+I++ A+A +
Sbjct: 327 TEQELEQRASELRAKLFTTREQRTHPHKDDKILTSWNGLMIAALAKAGQAFGE------- 379
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
+Y E A+ A SF+ HL + RL FR+G + PG++DDYAF + GL
Sbjct: 380 ---------AQYTEQAQRAESFLWNHLRRDDG-RLLARFRDGDAAYPGYVDDYAFYVWGL 429
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
++LY+ ++L A+ L +LF D E GG F + ++ + KE +DGA PSG
Sbjct: 430 IELYQATFDVQYLQRALTLNQDMIDLFWDEERGGLFFYGPDGEQLIAKPKEVYDGAIPSG 489
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
NS++ NLVRLA ++ S+ + Y + VF + + + + + +
Sbjct: 490 NSIAAHNLVRLARLMGESRLEDY---SAKQFKVFGGLVVQYPTGYSALLSSL-LYATGTT 545
Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID---PADTEEMDFWEEHNSNNASMAR 678
K +V+VGH+ + + A A + N VI D PA + + + ++ +
Sbjct: 546 KEIVIVGHRDAPQTVQFIRAVQAGFRPNTVVILKDEGQPAIADIVPYIRDYTLVDG---- 601
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K VC++F+C PVT L+ LL
Sbjct: 602 ------KPAVYVCEHFACQAPVTRLDDLKALL 627
>gi|154688185|ref|YP_001423346.1| hypothetical protein RBAM_037900 [Bacillus amyloliquefaciens FZB42]
gi|154354036|gb|ABS76115.1| YyaL [Bacillus amyloliquefaciens FZB42]
Length = 689
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 252/692 (36%), Positives = 363/692 (52%), Gaps = 78/692 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R +E ++E
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+++ +
Sbjct: 165 AAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGYGFARYSTDNEWLVPHFEKMLYDNALL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ +T + Y I I+ +++R+M G FSA DAD TEG +EG +
Sbjct: 278 LTAYTEAYQVTGNERYKQIAMQIVMFIQREMTHEDGSFFSALDAD---TEG----REGKY 330
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ KE+ ++LG E L+ + Y + GN + + PH F + ++E ++ +
Sbjct: 331 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+L LE E R KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 387 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMITGLAKAAKV----- 434
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F+ P +++ +AE+A F+ RHL + R+ +R G K GF+DDYAF
Sbjct: 435 ----FHEP-------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAF 481
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI L+LYE G +L A L + ELF D GG+F T + ++L+R KE +DG
Sbjct: 482 LIWAYLELYEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 541
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS + + L+RL + + AE +VF+ ++ + +
Sbjct: 542 AVPSGNSAAAVQLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 598
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
++P +K +V+ G K D + + A H PA T EH A +
Sbjct: 599 TMP-QKEIVVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPEELAGI 645
Query: 677 ARNNFSA------DKVVALVCQNFSCSPPVTD 702
+ +F+A + +C+NF+C P TD
Sbjct: 646 S--DFAAGYQMIDGRTTVYICENFACRRPTTD 675
>gi|386760793|ref|YP_006234010.1| hypothetical protein MY9_4222 [Bacillus sp. JS]
gi|384934076|gb|AFI30754.1| hypothetical protein MY9_4222 [Bacillus sp. JS]
Length = 689
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 243/687 (35%), Positives = 362/687 (52%), Gaps = 67/687 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVENIAENAAKHLQTKT 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A K + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 173 AA-----KTGEGLSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYYHN 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFVQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
W+ +E+ LG E L+ + Y + GN F+GKN+ LI A
Sbjct: 334 WSREEILKTLGDELGTLYCQVYDITEEGN------------FEGKNIPNLIHSKREQIKA 381
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G+ E+ L + R++L R +R PH+DDKV+ SWN L+I+ A+A+K+
Sbjct: 382 DA-GLTEEELRLKLEDARQRLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVY---- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ +Y+ +A+ A +FI HL + R+ +R+G K GF+DDYAF
Sbjct: 437 ------------EEPKYLSLAQDAITFIENHLIIDG--RVMVRYRDGEVKNKGFIDDYAF 482
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ LDLYE +L A +L + LF D E GG++ + + ++++R KE +DG
Sbjct: 483 LLWAYLDLYEASFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFSGHDAEALIVREKEVYDG 542
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV+ + L+RL V G S + AE +VF+ + +
Sbjct: 543 AVPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDIDAYPSGHAFFMQSVLRH 599
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+P +K +V+ G + ++ ++ N +++ + E + A
Sbjct: 600 LMP-KKEIVIFGSADDPARKQIITELQKAFKPNDSILVAEQP---------EQCKDIAPF 649
Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTD 702
A + D K +C+NF+C P T+
Sbjct: 650 AADYRIIDGKTTVYICENFACQQPTTN 676
>gi|418030673|ref|ZP_12669158.1| hypothetical protein BSSC8_01020 [Bacillus subtilis subsp. subtilis
str. SC-8]
gi|351471732|gb|EHA31845.1| hypothetical protein BSSC8_01020 [Bacillus subtilis subsp. subtilis
str. SC-8]
Length = 664
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 239/691 (34%), Positives = 361/691 (52%), Gaps = 75/691 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 28 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 87
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 88 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 147
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 148 AAKSGEG-----LSESAISRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 199
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 200 TGQDNALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 255
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 256 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 308
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ +E+ LG+ L+ + Y + GN F+GKN+ ++ +
Sbjct: 309 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 356
Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
EK L++ L + R++L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 357 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 412
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+Y+ +A+ A +FI L + R+ +R+G K GF+DDYAFL
Sbjct: 413 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 458
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LDLYE +L A +L + LF D E GG++ T + ++++R KE +DGA
Sbjct: 459 LWAYLDLYEASFDLSFLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 518
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ + L+RL + S + AE +VF+ + +
Sbjct: 519 VPSGNSVAAVQLLRLGQVTGDSS---LIEKAETMFSVFKQHIDAYPSGHAFFMQSVLRHL 575
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+P +K +V+ G + ++ ++ N +++ EH +A
Sbjct: 576 MP-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA 622
Query: 678 RNNFSAD------KVVALVCQNFSCSPPVTD 702
F+AD K +C+NF+C P T+
Sbjct: 623 --PFAADYRIIDGKTTVYICENFACQQPTTN 651
>gi|163782790|ref|ZP_02177786.1| hypothetical protein HG1285_15681 [Hydrogenivirga sp. 128-5-R1-1]
gi|159881911|gb|EDP75419.1| hypothetical protein HG1285_15681 [Hydrogenivirga sp. 128-5-R1-1]
Length = 697
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 267/720 (37%), Positives = 380/720 (52%), Gaps = 59/720 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFEDE +A++LN+ +V IKVDREERPD
Sbjct: 31 GEEAFEKAEREDKPVFLSIGYSTCHWCHVMERESFEDEEIARILNENYVPIKVDREERPD 90
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD VYM+ Q + G GGWPL+V ++PD KP GTYFP E YGRPG + IL ++ + W
Sbjct: 91 VDSVYMSVCQMMTGSGGWPLTVIMTPDKKPFFAGTYFPKEGMYGRPGLRDILLRIAELWR 150
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
R Q A EQ+ +AL+ + + + L ++ L +L +YD +GGFG+A
Sbjct: 151 NDR----QKVLTAAEQVVDALAKGEEESYIGERLDESILHKGFAELYHTYDEAYGGFGNA 206
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L + ++ TG +G+A E MV TL+ M GGI DHVG GFHRYS
Sbjct: 207 PKFPIPHNLMFLLRYYRR---TG-NGKALE---MVKHTLKKMRLGGIWDHVGFGFHRYST 259
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W +PHFEKMLYD L VY +AF T D F++ + +I +YL+RDM+ P G +SA
Sbjct: 260 DREWLLPHFEKMLYDNALLMLVYTEAFQATGDEFFAQVVEEIAEYLQRDMLSPEGAFYSA 319
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPH 357
EDADS EG +EG FY WT E+E++L E + + + GN + +
Sbjct: 320 EDADS---EG----EEGKFYTWTLAELEELLTEEELGIALRLFGIAEEGNF----LEEAT 368
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
GKNVL + A +LG + L E R KLF R KR RP D+KV+ W
Sbjct: 369 RRKVGKNVLHMKKELEKYAEELGYEPDVLKQKLEEIRSKLFKRREKRVRPLRDEKVLTDW 428
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL I++F++A V RK+++ VA+ A F+ + D++ +L
Sbjct: 429 NGLAIAAFSKAG----------------VALGRKDFLAVAKRTADFLLNTMVDDEG-KLL 471
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
H ++ G + P FL+DYA+LI GL++LY+ ++L A EL + E F D E G++
Sbjct: 472 HRYKEGEAGIPAFLEDYAYLIWGLMELYQGSFEGEYLKRAKELTDFALEHFWDEENLGFY 531
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T VL+R KE +DGA PSGNSV NLVRL ++ + Y + A+ +L F
Sbjct: 532 QTPDFGERVLVRKKEIYDGATPSGNSVMAYNLVRLGRLLGLQE---YERRADQTLNAFSQ 588
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
+ A A D+L V +V VG + A + +L + +
Sbjct: 589 VIASFPGAHTFSLLALDIL-VKGSFELVAVGDREE--------AIQSLLELERDFLPEGL 639
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
++ E S + + +C+NFSC P TD + N L+ + S T
Sbjct: 640 FAVKD----ETLQSLSGFFDSLREMDGRTTYYLCRNFSCESPATDIEDIRNRLVPQESGT 695
>gi|452913203|ref|ZP_21961831.1| hypothetical protein BS732_1003 [Bacillus subtilis MB73/2]
gi|452118231|gb|EME08625.1| hypothetical protein BS732_1003 [Bacillus subtilis MB73/2]
Length = 664
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 240/686 (34%), Positives = 365/686 (53%), Gaps = 65/686 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 28 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 87
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 88 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQ--- 144
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ ++ K + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 145 --TKTAAKTGEGLSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYDHN 199
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 200 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 255
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 256 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 308
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ +E+ LG+ L+ + Y + GN F+GKN+ ++ +
Sbjct: 309 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 356
Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
EK L++ L + R++L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 357 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 412
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+Y+ +A+ A +FI L + R+ +R+G K GF+DDYAFL
Sbjct: 413 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 458
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LDLYE +L A +L + LF D E GG++ T + ++++R KE +DGA
Sbjct: 459 LWAYLDLYEASFDLSYLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 518
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ + L+RL V G S + AE +VF+ ++ +
Sbjct: 519 VPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDIEAYPSGHAFFMQSVLRHL 575
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+P +K +V+ G + ++A ++ N +++ + E + A A
Sbjct: 576 MP-KKEIVIFGSADDPARKQIIAELQKAFKPNDSILVAEQP---------EQCKDIAPFA 625
Query: 678 RNNFSAD-KVVALVCQNFSCSPPVTD 702
+ D K +C+NF+C P T+
Sbjct: 626 ADYRIIDGKTTVYICENFACQQPTTN 651
>gi|321313642|ref|YP_004205929.1| hypothetical protein BSn5_11430 [Bacillus subtilis BSn5]
gi|320019916|gb|ADV94902.1| hypothetical protein BSn5_11430 [Bacillus subtilis BSn5]
Length = 689
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 239/691 (34%), Positives = 361/691 (52%), Gaps = 75/691 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 173 AAKSGEG-----LSESAISRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 225 TGQDNALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ +E+ LG+ L+ + Y + GN F+GKN+ ++ +
Sbjct: 334 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 381
Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
EK L++ L + R++L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 382 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 437
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+Y+ +A+ A +FI L + R+ +R+G K GF+DDYAFL
Sbjct: 438 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 483
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LDLYE +L A +L + LF D E GG++ T + ++++R KE +DGA
Sbjct: 484 LWAYLDLYEASFDLSFLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 543
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ + L+RL + S + AE +VF+ + +
Sbjct: 544 VPSGNSVAAVQLLRLGQVTGDSS---LIEKAETMFSVFKQHIDAYPSGHAFFMQSVLRHL 600
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+P +K +V+ G + ++ ++ N +++ EH +A
Sbjct: 601 MP-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA 647
Query: 678 RNNFSAD------KVVALVCQNFSCSPPVTD 702
F+AD K +C+NF+C P T+
Sbjct: 648 --PFAADYRIIDGKTTVYICENFACQQPTTN 676
>gi|350268373|ref|YP_004879680.1| hypothetical protein GYO_4496 [Bacillus subtilis subsp. spizizenii
TU-B-10]
gi|349601260|gb|AEP89048.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii
TU-B-10]
Length = 689
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 247/692 (35%), Positives = 362/692 (52%), Gaps = 77/692 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 173 AAKSGEG-----LSESAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T E V TL MA GGI+DH+G GF RYS DE W VPHFEKMLYD L
Sbjct: 225 T----EQENALYNVTKTLDSMANGGIYDHIGYGFARYSTDEEWLVPHFEKMLYDNALLLT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
W+ +E+ LG+ L+ + Y + GN F+GKN+ LI A
Sbjct: 334 WSKEEILRTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKRKQIKA 381
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G+ E+ L R+ L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 382 DA-GLTEEELSLKLEGARQLLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ--- 437
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+Y+ +A+ A +FI HL + R+ +R+G K GF+DDYAF
Sbjct: 438 -------------EPKYLSLAKDAITFIENHLIIDG--RVMVRYRDGEVKNKGFIDDYAF 482
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ LDLYE +L A +L + LF D E GG++ T + ++++R KE +DG
Sbjct: 483 LLWAYLDLYEASFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDG 542
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV+ + L+RL V G S + AE +VF+ + D + + +
Sbjct: 543 AVPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDI-DAYPSGHAFFMQSVLK 598
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
V +K +V+ G + ++ A ++ N +++ EH +
Sbjct: 599 HVMPKKEIVIFGSADDPARKQIITALQKAFKPNDSIL------------VAEHPDQCKDI 646
Query: 677 ARNNFSAD------KVVALVCQNFSCSPPVTD 702
A F+AD K +C+NF+C P T+
Sbjct: 647 AP--FAADYRIIDGKTTVYICENFACQQPTTN 676
>gi|307107988|gb|EFN56229.1| hypothetical protein CHLNCDRAFT_145019 [Chlorella variabilis]
Length = 648
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 221/533 (41%), Positives = 305/533 (57%), Gaps = 37/533 (6%)
Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 271
M F+L+ MA GG+ DHVGGGFHRYSVDE WHVPHFEKMLYD QLA YL AF +T+D
Sbjct: 114 MATFSLRQMAAGGMWDHVGGGFHRYSVDEYWHVPHFEKMLYDNPQLAATYLAAFQITRDA 173
Query: 272 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 330
Y+ + R I DYL R M PGG +F+AEDADS + KKEG FYVW+ +E++ +LG
Sbjct: 174 QYAGVARGIFDYLLRGMTHPGGGLFAAEDADSLDPASGD-KKEGWFYVWSWEELQQLLGP 232
Query: 331 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 390
E A F HYY K GNCDLS SDPH EF G N LI+ + +A+ L
Sbjct: 233 EDAPAFCAHYYAKQGGNCDLSPRSDPHGEFVGLNCLIQRQSLAQTAAAAARGEADTAAAL 292
Query: 391 GECRRKLFDVRSKRPRPHLDDK-----------------------VIVSWNGLVISSFAR 427
CR KLF R +RPRPH DDK ++ +WNG+ IS++A
Sbjct: 293 AACREKLFRARERRPRPHRDDKARARGRGGAWPRILSNPWQHRLLIVAAWNGMAISAYAL 352
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
AS+IL E A FPV G +Y++ A AA+F+R+HL+D +T RL+ F GPS
Sbjct: 353 ASRILPHEQPPAARCFPVEGRPPGDYLQAALQAAAFVRQHLWDGETGRLRRCFTTGPSAV 412
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
GF DDYA++++GLLDL+ WA++LQ T DE+ D GG YF+ D S+L
Sbjct: 413 EGFADDYAWMVAGLLDLHSTTGD-----WALQLQGTMDEVLWDEAGGAYFSGVAGDASIL 467
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
LR+KED+DGAEP+ +S+++ NL RLA + +S +R+ A A F RL + +A+P
Sbjct: 468 LRMKEDYDGAEPAASSIALANLWRLAGLCGTEESARWRERAAKCAAAFAERLGEAPVALP 527
Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
M + +L++ + V++ G + + D + +L AA S+ + VI +DP ++ MDFW
Sbjct: 528 QMAASLHLLTLGHPRQVIIAGAQGAPDTQALLDAAFYSFTPDMVVIQLDPGSSQVMDFWR 587
Query: 668 EHNSNNASMAR--NNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
+ N ++ + D A + Q P DP ++ +L E S A
Sbjct: 588 QRNPEAVAVVEVMGMQAGDPATAFIYQA-----PTRDPEKVKQVLAEPRISAA 635
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/63 (57%), Positives = 42/63 (66%), Gaps = 3/63 (4%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVME ESFE E A L+N FV++KVDREERPD
Sbjct: 42 GEEAFERARKEDKPIFLSVGYSTCHWCHVMERESFESEETAALMNQLFVNVKVDREERPD 101
Query: 59 VDK 61
VDK
Sbjct: 102 VDK 104
>gi|407768088|ref|ZP_11115467.1| hypothetical protein TH3_01375 [Thalassospira xiamenensis M-5 = DSM
17429]
gi|407288801|gb|EKF14278.1| hypothetical protein TH3_01375 [Thalassospira xiamenensis M-5 = DSM
17429]
Length = 683
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 251/704 (35%), Positives = 369/704 (52%), Gaps = 80/704 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED+G+A L+N+ FV+IK+DREERPD+D VY + L GGWPL++
Sbjct: 52 ACHWCHVMAHESFEDDGIAALMNELFVNIKLDREERPDLDSVYQNALALLGQQGGWPLTM 111
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW----DKKRDMLAQSGAFAIEQLS 136
FL+PD +P GGTYFP E +YGRPGF +L+ V + + D R +AQ G A+ +++
Sbjct: 112 FLTPDGEPFWGGTYFPKEARYGRPGFGDVLKSVSEIYTQQPDNIRHNVAQIGQ-ALIKMN 170
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+ S S + D+ C + D GG APKFP+P + ++ +
Sbjct: 171 SGATGSMPSLAMIDQ--------CGHGCLQIMDGENGGTNGAPKFPQPSILALIWRVGVR 222
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
DT + +++V +L M +GGI+DHVGGGF RY+VD++W VPHFEKMLYD Q
Sbjct: 223 TNDT-------DLKRIVRHSLDRMCQGGIYDHVGGGFARYAVDDQWLVPHFEKMLYDNAQ 275
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L ++ D + T + Y + +D++ RDM PGG ++ DADS EG EG
Sbjct: 276 LIDLLCDVWRETGNPLYEARISETIDWILRDMRVPGGAFAASLDADS---EGV----EGK 328
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FYVW E+ ILG A LFK+ Y + P+GN ++ KN+L + +
Sbjct: 329 FYVWDEAEINAILGNDAALFKDIYDVSPSGN------------WEHKNIL------NRTQ 370
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
S LG+ L E R KL VR+KR P DDK + WN + I++ A A+ + K
Sbjct: 371 SGLGLADRTTEKKLSETRTKLLAVRNKRIWPGWDDKALTDWNAMTIAALAEAAMVFK--- 427
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKAPGFLDDY 494
R ++++ A+ A +F+ L +++ R HS+RNG ++ G L+DY
Sbjct: 428 -------------RADWLDYAKLAYNFVINSLMTGESNDRRFLHSYRNGKAQHAGMLEDY 474
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A +I L LYE +L A E + LF D + GGYF + + +++R K
Sbjct: 475 AHMIRAALRLYECFGEDAYLREATEWCEAVENLFADTK-GGYFQSASDADDLVVRQKPHM 533
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D A P+GNSV NL RL ++ +K YR AE ++A F RL + +P + AA+
Sbjct: 534 DNAVPAGNSVMAQNLARLYALTGDTK---YRDRAEITIAAFAGRLNEQFPNMPGLLLAAE 590
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
ML P + +VL+ + S + M A A+Y N+ + + ADT+ + +
Sbjct: 591 MLQNPLQ--IVLIAKERSQMYMEMRRAIFAAYLPNRAITIL--ADTDALP--------DL 638
Query: 675 SMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
A+ + D A VCQ CS PVT+ L LL P+ +
Sbjct: 639 HPAKGKTAIDGHETAYVCQGSVCSAPVTNVADLAKLLANLPNKS 682
>gi|16081134|ref|NP_391962.1| hypothetical protein BSU40820 [Bacillus subtilis subsp. subtilis
str. 168]
gi|221312064|ref|ZP_03593911.1| hypothetical protein Bsubs1_22036 [Bacillus subtilis subsp.
subtilis str. 168]
gi|221316389|ref|ZP_03598194.1| hypothetical protein BsubsN3_21942 [Bacillus subtilis subsp.
subtilis str. NCIB 3610]
gi|221321302|ref|ZP_03602596.1| hypothetical protein BsubsJ_21895 [Bacillus subtilis subsp.
subtilis str. JH642]
gi|221325585|ref|ZP_03606879.1| hypothetical protein BsubsS_22051 [Bacillus subtilis subsp.
subtilis str. SMY]
gi|402778252|ref|YP_006632196.1| protein YyaL [Bacillus subtilis QB928]
gi|586842|sp|P37512.1|YYAL_BACSU RecName: Full=Uncharacterized protein YyaL
gi|467366|dbj|BAA05212.1| unknown [Bacillus subtilis]
gi|2636629|emb|CAB16119.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
str. 168]
gi|402483431|gb|AFQ59940.1| YyaL [Bacillus subtilis QB928]
gi|407962936|dbj|BAM56176.1| hypothetical protein BEST7613_7245 [Bacillus subtilis BEST7613]
gi|407966948|dbj|BAM60187.1| hypothetical protein BEST7003_3986 [Bacillus subtilis BEST7003]
Length = 689
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 241/686 (35%), Positives = 364/686 (53%), Gaps = 65/686 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A K + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 173 AA-----KTGEGLSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYDHN 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ +E+ LG+ L+ + Y + GN F+GKN+ ++ +
Sbjct: 334 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 381
Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
EK L++ L + R++L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 382 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 437
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+Y+ +A+ A +FI L + R+ +R+G K GF+DDYAFL
Sbjct: 438 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 483
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LDLYE +L A +L + LF D E GG++ T + ++++R KE +DGA
Sbjct: 484 LWAYLDLYEASFDLSYLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 543
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ + L+RL V G S + AE +VF+ ++ +
Sbjct: 544 VPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKPDIEAYPSGHAFFMQSVLRHL 600
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+P +K +V+ G + ++A ++ N +++ + E + A A
Sbjct: 601 MP-KKEIVIFGSADDPARKQIIAELQKAFKPNDSILVAEQP---------EQCKDIAPFA 650
Query: 678 RNNFSAD-KVVALVCQNFSCSPPVTD 702
+ D K +C+NF+C P T+
Sbjct: 651 ADYRIIDGKTTVYICENFACQQPTTN 676
>gi|354559793|ref|ZP_08979037.1| hypothetical protein DesmeDRAFT_2750 [Desulfitobacterium
metallireducens DSM 15288]
gi|353540319|gb|EHC09795.1| hypothetical protein DesmeDRAFT_2750 [Desulfitobacterium
metallireducens DSM 15288]
Length = 653
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 260/712 (36%), Positives = 373/712 (52%), Gaps = 93/712 (13%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFED VA+LLN F++IKVDREERPD+D +YM + QAL G GGWPL++ ++P+ +
Sbjct: 1 MERESFEDTEVAELLNRSFLAIKVDREERPDIDHLYMEFCQALTGSGGWPLTILMTPEKQ 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS----------- 136
P GTYFP YGRPG +L ++ + WDK + L +S ++ ++
Sbjct: 61 PFFTGTYFPKSSHYGRPGLIDLLSQISELWDKDENKLRKSAEEIVKAITSHQKRSSEEVN 120
Query: 137 ----EALS----------ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFP 182
AL ASA +EL + + + L +++DSR+GGFG APKFP
Sbjct: 121 PVEVHALQGFLNVQNGGDASADFQSWANELIEQSY----QALIQNFDSRYGGFGQAPKFP 176
Query: 183 RPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
P + +L ++K D S+ + M+ L M +GGI+DH+G GF RYS D++W
Sbjct: 177 SPHNLTFLLRYAKDHPD-------SQAEAMIRKNLDTMGQGGIYDHIGFGFARYSTDQQW 229
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
VPHFEKMLYD LA Y++A+ K+ + ++IL Y+ RDM P G +SAEDAD
Sbjct: 230 LVPHFEKMLYDNALLAIAYIEAYQSQKEPRDAQKAQEILTYVLRDMTSPEGGFYSAEDAD 289
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
S EG EG FYVWT +E+ +LGE + LF + + + P GN F+
Sbjct: 290 S---EGI----EGKFYVWTPEEITSVLGEKRSALFCDVFNITPEGN------------FE 330
Query: 362 GKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
GK++ L+ D A K + E IL E R KL+ R R PH DDK++ SWNGL
Sbjct: 331 GKSIPNRLSGDIGELARKHHLNPETLNYILEEDRLKLWQSREHRIHPHKDDKILTSWNGL 390
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
+I + A+ ++ FN D K Y+ AE AA F+ +LY + RL F
Sbjct: 391 MIVALAKGGQV---------FN------DNK-YILAAEQAAHFVLENLYPNE--RLLARF 432
Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
R+G + G+LDDYAF I GLL+LY + +L A+ LQ + LF D E GGY+ T
Sbjct: 433 RDGNAAYLGYLDDYAFFIWGLLELYTASGKSDYLKSALSLQEQLETLFKDEEAGGYYLTG 492
Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
+ +LLR KE +DGA PSGNS++ +NL+ LA + + ++ AE L F + L
Sbjct: 493 SDGEELLLRPKEIYDGALPSGNSITALNLLHLARLTGDER---WKLQAEKQLLSFRSTLT 549
Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM--LAAAHASYDLNKTVIHIDPA 658
A PS++ ++LVG S++ E + L + L + +
Sbjct: 550 SNPAGYTAFLQALQYALHPSQE-LLLVG---SLNHEGISPLRQTFFTIFLPYSSLLYHEG 605
Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
E+ W + F +KV+A +C NF+C PV P L+ LL
Sbjct: 606 RLGELLPW---------VKDYPFDPNKVLAYLCTNFTCQKPVESPEELKALL 648
>gi|21226721|ref|NP_632643.1| hypothetical protein MM_0619 [Methanosarcina mazei Go1]
gi|20905010|gb|AAM30315.1| conserved protein [Methanosarcina mazei Go1]
Length = 700
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 238/705 (33%), Positives = 358/705 (50%), Gaps = 54/705 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCH+M ESFEDE VA L+N+ FVSIKVDREERPD
Sbjct: 36 GEEAFEKARKENKPVFLSIGYSTCHWCHMMAHESFEDEEVAGLMNEAFVSIKVDREERPD 95
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +YMT Q + G GGWPL++ ++P KP GTY P ++ + G ++ ++K+ W+
Sbjct: 96 IDNIYMTVCQIILGRGGWPLNIIMTPGKKPFFAGTYIPKNTRFNQIGMLELVPRIKEIWE 155
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++ + + S + E + S+ L + + E+L S+D+ +GGF A
Sbjct: 156 QQHEEVLDSAEKITSTIQEMIKESSGEG-----LGEEVIEEVYEELLSSFDTEYGGFSGA 210
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P +I +L + ++ + E M +TL M +GGI+DH+G GFHRYS
Sbjct: 211 PKFPTPHKISFLLRYWRRSRN-------PEALHMAEYTLDKMRRGGIYDHLGSGFHRYST 263
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W +PHFEKMLYDQ A Y +A+ +T Y ILDY+ RD+ P G +
Sbjct: 264 DSMWLLPHFEKMLYDQALTAIAYTEAYQVTGKDLYKETAEGILDYVLRDLTSPEGGFYCG 323
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
EDAD ++EG +Y+WT +E+ IL E + L + + L+ GN + +
Sbjct: 324 EDAD-------VEREEGKYYLWTLEEIRSILDPEDSELIIKMFNLREEGNFE----EEIR 372
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
G N+ + A+K+ +P+E+ + R KL R +R RP LDDK++ W
Sbjct: 373 GRETGTNLFYMARSPGSLAAKMKIPVEEVEKKVKAAREKLLKARYERKRPSLDDKILTDW 432
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+I++FA+ + V G R Y++ AE AA FI LY L
Sbjct: 433 NGLMIAAFAKG--------------YQVFGEQR--YLKAAEKAADFILMALYS-PGDGLL 475
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
H +R+G + G DDYAFLI GLL+LYE G ++L A+ L + E F D GG +
Sbjct: 476 HRYRDGVAGISGTSDDYAFLIHGLLELYEAGFKMRYLKAAVSLNSELLECFWDPVNGGLY 535
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T + +++ R KE D A P+GNS ++NL+RL+ I+A + + A+ F
Sbjct: 536 FTANDSEALIFRKKEFMDSAIPTGNSFEMLNLLRLSRIIADPGLE---ETADKLERAFSK 592
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
++ A D PS + V++ G + D E ML + + NK +I
Sbjct: 593 QIMKAPSGYTQFLSAFDFRLGPSYE-VIISGKAEASDTEQMLKELWSYFVPNKVLIFRPE 651
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ E+ ++ + K A VCQN+ C P T+
Sbjct: 652 REKPEITELAKYTEEQVPI------EGKATAYVCQNYECQLPTTE 690
>gi|406830400|ref|ZP_11089994.1| hypothetical protein SpalD1_02134 [Schlesneria paludicola DSM
18645]
Length = 883
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 240/617 (38%), Positives = 333/617 (53%), Gaps = 63/617 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL ++C+WCHVME + F +E +AK LN FV IKVDREERPD
Sbjct: 92 GPEAFEKAKKEGKMIFLSVGYSSCYWCHVMERKVFMNEAIAKTLNQDFVCIKVDREERPD 151
Query: 59 VDKVYMTYVQALY------GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRK 112
VD +YMT +Q Y GGWPLS+FL+PD KP+ GGTYFPPE G GF IL K
Sbjct: 152 VDDIYMTALQVYYQAIKAPASGGWPLSMFLTPDGKPIAGGTYFPPEATEGNEGFPAILAK 211
Query: 113 VKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRF 172
+ D W + + + + + S P E+ + ++ S+D F
Sbjct: 212 LTDLWKNNHEQMVGNADIVANETRRLMRPKLSLK--PVEVNAKLVESVFAAVAGSFDPEF 269
Query: 173 GGFG------SAPKFPRPVEI---QMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKG 223
GG PKFP P ++ Q MLY S ED K++ TL +A G
Sbjct: 270 GGIDFNPNRPDGPKFPTPTKLSFLQQMLYRSPN-EDV---------SKLLDVTLLQLACG 319
Query: 224 GIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDY 283
GI DHVGGGFHRYSVD RW VPHFEKMLYDQ QLA+VY +A+ + + + ++ ++
Sbjct: 320 GIRDHVGGGFHRYSVDRRWDVPHFEKMLYDQAQLADVYAEAYRTSHQPLHKQVAEELFEF 379
Query: 284 LRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLK 343
+ RD+ P G +SA D AET G EG FYVW + E++ ILG A FKE Y +K
Sbjct: 380 VARDLTAPEGGFYSAID---AETNGI----EGEFYVWDATEIDHILGRSAAAFKEAYRVK 432
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
+ + + + K I+ + ASA+ G +++ + R+KL +VR+K
Sbjct: 433 ELSDFEHGNVLRLSQKRLPKAEAIKAVATPASAT--GSEKDEFTS----SRQKLLEVRNK 486
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
R +P D+K++ WNGL+I ++ARA +A N P EY+E+A AA F
Sbjct: 487 RKKPLRDEKLLTCWNGLMIGAYARA---------AAPLNHP-------EYVEIAARAAEF 530
Query: 464 IRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
I D Q RL H++ +G +K +LDDYAFLI GL+ LY+ KWL A +LQ+
Sbjct: 531 ILTKARDSQG-RLLHTYASGQAKLNAYLDDYAFLIDGLISLYDATEDVKWLKVAKQLQDD 589
Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
Q LFLD GG+F T+ +L R K DG P+GNSVS NL+RLA++ +K
Sbjct: 590 QLRLFLDESNGGFFFTSHHHEELLTRTKNCFDGVVPAGNSVSARNLIRLAAL---TKISS 646
Query: 584 YRQNAEHSLAVFETRLK 600
Y A ++ +F + ++
Sbjct: 647 YADEARATVELFASNIE 663
>gi|170757692|ref|YP_001780692.1| hypothetical protein CLD_3500 [Clostridium botulinum B1 str. Okra]
gi|169122904|gb|ACA46740.1| conserved hypothetical protein [Clostridium botulinum B1 str. Okra]
Length = 680
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 243/693 (35%), Positives = 353/693 (50%), Gaps = 72/693 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN F+SIKVDREERPD+D +YM + QA G GGWPL++
Sbjct: 53 TCHWCHVMERESFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTI 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD P GTYFP KY PG ILR + + W + ++ + +S +EQ+
Sbjct: 113 LMTPDKNPFFAGTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER--- 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
N EL + + + L ++D+++GGFG+ PKFP I +L Y+ KK
Sbjct: 170 --FQDNHREGELEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK-- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ +V TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L+
Sbjct: 226 -------DKKILDIVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLS 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK+ + I IL+Y+++ M G +SAEDADS EG EG FY
Sbjct: 279 MAYTEAYEATKNPLFKDITEKILNYVKKSMTSDEGGFYSAEDADS---EGV----EGKFY 331
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+WT +E+ DILG E L+ + Y + GN F+ KN+ +N
Sbjct: 332 LWTKEEIMDILGEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVD 379
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LEK R+KLF+ R KR P+ DDK++ SWN L+I +F++A + K++
Sbjct: 380 NNKDKLEK-------MRKKLFEYREKRIHPYKDDKILTSWNALMIIAFSKAGRSFKND-- 430
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y+E+A+ +A+FI +L DE+ L R G GF+DDYAF
Sbjct: 431 --------------NYIEIAKKSANFIIENLMDERG-TLYARIREGERGNEGFIDDYAFF 475
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ L++LYE +L +IE+ ++ +LF +E GG++ + +L+R KE +DGA
Sbjct: 476 LWALIELYEASFDIYYLEKSIEVADSMIDLFWHKENGGFYLYSKNSEKLLVRPKEIYDGA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V+ + L L I D Y+ + F T +K M L A M +
Sbjct: 536 TPSGNAVASLALNLLYYITG---EDRYKYLVDKQFKFFATNIKSGPM-YHLFSVMAYMYN 591
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ K + L + DF + + Y V D ++ E N ++
Sbjct: 592 ILPVKEITLAYREKDEDFYKFINELNNRYIPFSIVTLNDKSN--------EIEKINKNIK 643
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
DK +CQN++C P+ D + LL
Sbjct: 644 DKIAIKDKTTVYICQNYACREPIADLEEFKFLL 676
>gi|119184130|ref|XP_001243004.1| hypothetical protein CIMG_06900 [Coccidioides immitis RS]
Length = 797
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 244/611 (39%), Positives = 336/611 (54%), Gaps = 49/611 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN FV IK+DREERPD+D+VYM YVQA+ G GGWPL+
Sbjct: 69 SACHWCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLN 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
VFL+PDL+P+ GGTY+P P F IL K++D W+ ++ +S
Sbjct: 129 VFLTPDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEI 188
Query: 132 IEQLSEALSASASSNKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
QL E + + + P ++L L + YD GGF APKFP P
Sbjct: 189 TRQLRE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPAN 247
Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
+ +L Y ++ G+ E + +MV TL MA+GGIHD +G GF RYSV W
Sbjct: 248 LSFLLRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDW 306
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
+PHFEKMLYDQ QL +VY+D F +T++ DI+ Y+ ++ P G S+EDA
Sbjct: 307 SLPHFEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDA 366
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
DS T K+EGAFYVWT KE++ ILG+ A + H+ + P GN ++R +DPH+EF
Sbjct: 367 DSFPNSNDTEKREGAFYVWTLKEMQQILGQRDAEVCAHHWGVLPDGN--VARGNDPHDEF 424
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
+NVL A G+ ++ + ++ R+KL + R + R RP LDDK+IVSWNG
Sbjct: 425 INQNVLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNG 484
Query: 420 LVISSFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
L I + A+ S +L K +AE A VAE AA FIR +L+D +T +L
Sbjct: 485 LAIGALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWR 533
Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
+R+G + PGF DDYA+L SGL+ LYE +L +A LQ + FL
Sbjct: 534 VYRDGRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTT 593
Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
GY+ N G+ P L R+K D A PS N V NL+RLAS++ + D Y+ A
Sbjct: 594 PAGYYMTPQNMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALA 650
Query: 589 EHSLAVFETRL 599
H+ + F +
Sbjct: 651 RHTCSAFAAEM 661
>gi|392865908|gb|EAS31753.2| hypothetical protein CIMG_06900 [Coccidioides immitis RS]
Length = 799
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 244/611 (39%), Positives = 336/611 (54%), Gaps = 49/611 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN FV IK+DREERPD+D+VYM YVQA+ G GGWPL+
Sbjct: 69 SACHWCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLN 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
VFL+PDL+P+ GGTY+P P F IL K++D W+ ++ +S
Sbjct: 129 VFLTPDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEI 188
Query: 132 IEQLSEALSASASSNKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
QL E + + + P ++L L + YD GGF APKFP P
Sbjct: 189 TRQLRE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPAN 247
Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
+ +L Y ++ G+ E + +MV TL MA+GGIHD +G GF RYSV W
Sbjct: 248 LSFLLRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDW 306
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
+PHFEKMLYDQ QL +VY+D F +T++ DI+ Y+ ++ P G S+EDA
Sbjct: 307 SLPHFEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDA 366
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
DS T K+EGAFYVWT KE++ ILG+ A + H+ + P GN ++R +DPH+EF
Sbjct: 367 DSFPNSNDTEKREGAFYVWTLKEMQQILGQRDAEVCAHHWGVLPDGN--VARGNDPHDEF 424
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
+NVL A G+ ++ + ++ R+KL + R + R RP LDDK+IVSWNG
Sbjct: 425 INQNVLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNG 484
Query: 420 LVISSFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
L I + A+ S +L K +AE A VAE AA FIR +L+D +T +L
Sbjct: 485 LAIGALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWR 533
Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
+R+G + PGF DDYA+L SGL+ LYE +L +A LQ + FL
Sbjct: 534 VYRDGRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTT 593
Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
GY+ N G+ P L R+K D A PS N V NL+RLAS++ + D Y+ A
Sbjct: 594 PAGYYMTPQNMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALA 650
Query: 589 EHSLAVFETRL 599
H+ + F +
Sbjct: 651 RHTCSAFAAEM 661
>gi|51892001|ref|YP_074692.1| hypothetical protein STH863, partial [Symbiobacterium thermophilum
IAM 14863]
gi|51855690|dbj|BAD39848.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM
14863]
Length = 623
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 254/681 (37%), Positives = 362/681 (53%), Gaps = 76/681 (11%)
Query: 27 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 86
+ME ESF D A+++N FV IKVDREERPD+D +Y T Q + GGWPLSV+L+P+
Sbjct: 1 MMERESFADPETAEIMNRHFVCIKVDREERPDLDDIYQTICQLVTRSGGWPLSVWLTPEQ 60
Query: 87 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR---DMLAQSGAFAIEQLSEALSASA 143
KP GTYFPP ++YGRPGF+ +L + AW +KR + +A+S A I Q E L
Sbjct: 61 KPFYVGTYFPPVERYGRPGFRQVLLALAQAWREKRQEVEKVAESWARGIAQTDELLP--- 117
Query: 144 SSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGK 202
+ +PD L +A R AE++ D + GGFG APKFP + + +ML H K D
Sbjct: 118 PAGPMPDHRLVADAARALAERI----DRQHGGFGGAPKFPNTMALDLMLRHWKATGD--- 170
Query: 203 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 262
+V TL+ MA+GGI+D +GGGFHRYSVD RW VPHFEKMLYD L VYL
Sbjct: 171 ----DLFLHLVTLTLRKMAEGGIYDQLGGGFHRYSVDARWAVPHFEKMLYDNALLPAVYL 226
Query: 263 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 322
A+ T + + I + LDY+ R+M P G FS DADS EG +EG +YVW
Sbjct: 227 AAWQATGEPLFRRIVEETLDYVLREMTHPEGGFFSTTDADS---EG----EEGRYYVWDP 279
Query: 323 KEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
+EV +LG + L HY + GN E GK VL ++ AS LG+
Sbjct: 280 REVTAVLGPDLGALICRHYGVTEAGNF----------ERTGKTVLHIAEPAADLASSLGL 329
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
P+E+ L E RR+L + RS+R P D+K++ WNGL+IS+ ARA +IL+
Sbjct: 330 PVEEVERRLAEGRRRLLEARSRRVPPFRDEKILAGWNGLMISALARAGRILR-------- 381
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
R +Y E A AA+F+ L D + L+ +++G + PG+L+D+AF+ +GL
Sbjct: 382 --------RPDYAEAARRAATFVLDRLADGEGGLLRR-YKDGHAGIPGYLEDHAFMAAGL 432
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
+DLYE ++L A+ L F D G + +G +P ++ R ++ D + PSG
Sbjct: 433 IDLYECTFDERFLQEAMRLTEETLRRFYDGSGSFHLTQSGAEP-LIHRPRDTTDQSVPSG 491
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSVPS 620
+V+V+NL+RL + D +R+ A+ + + + A + A D+ L P+
Sbjct: 492 AAVAVVNLLRLQPY---RRDDRFREVADTAFRAHRDLMARVPGATATLLQALDLYLDGPT 548
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
V LVG E L A Y+ N + I E ++A +
Sbjct: 549 --EVTLVGDPP----EAWLEALGRRYEPNLVLTRI------------EAPRDDAPIWAGK 590
Query: 681 FSADKVVALVCQNFSCSPPVT 701
+ VA VC+NF+CSPP T
Sbjct: 591 AAGTGPVAYVCRNFACSPPAT 611
>gi|430756760|ref|YP_007207432.1| hypothetical protein A7A1_1268 [Bacillus subtilis subsp. subtilis
str. BSP1]
gi|430021280|gb|AGA21886.1| Hypothetical protein YyaL [Bacillus subtilis subsp. subtilis str.
BSP1]
Length = 689
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 241/691 (34%), Positives = 362/691 (52%), Gaps = 75/691 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 173 AAKSGEG-----LSESAISRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 225 TGQDNALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ +E+ LG+ L+ + Y + GN F+GKN+ ++ +
Sbjct: 334 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKE 381
Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
EK L++ L + R++L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 382 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 437
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+Y+ +A+ A +FI L + R+ +R+G K GF+DDYAFL
Sbjct: 438 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFL 483
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LDLYE +L A +L + LF D E GG++ T + ++++R KE +DGA
Sbjct: 484 LWAYLDLYEASFDLSYLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 543
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ + L+RL V G S + AE +VF+ + +
Sbjct: 544 VPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKLDIDAYPSGHAFFMQSVLRHL 600
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+P +K +V+ G + ++ ++ N +++ EH +A
Sbjct: 601 MP-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA 647
Query: 678 RNNFSAD------KVVALVCQNFSCSPPVTD 702
F+AD K +C+NF+C P T+
Sbjct: 648 --PFAADYRIIDGKTTVYICENFACQQPTTN 676
>gi|161528699|ref|YP_001582525.1| hypothetical protein Nmar_1191 [Nitrosopumilus maritimus SCM1]
gi|160340000|gb|ABX13087.1| protein of unknown function DUF255 [Nitrosopumilus maritimus SCM1]
Length = 675
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 249/686 (36%), Positives = 370/686 (53%), Gaps = 74/686 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFE+E VAK +N+ FV+IKVDREERPD+D +Y Q G GGWPLS
Sbjct: 49 SSCHWCHVMAHESFENEEVAKFMNENFVNIKVDREERPDIDDIYQKACQIATGQGGWPLS 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+PD KP GTYFP D YGRPGF +I R++ AW +K + +S ++ L++
Sbjct: 109 IFLTPDQKPFYVGTYFPILDSYGRPGFGSICRQLSQAWKEKPKDIEKSADNFLDALNKTE 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S SS +L + L A L + DS +GGFGSAPKFP + + ++K
Sbjct: 169 KVSISS-----KLERTILDEAAMNLFQLGDSAYGGFGSAPKFPNAANVSFLFRYAKI--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+G S G K TL+ MA GGI D +GGGFHRYS D +W VPHFEKMLYD +
Sbjct: 221 SGLSKFTEFGLK----TLKKMANGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPV 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +AF +TKD FY + + LD++ R+M P G +SA DADS EG EG FYV
Sbjct: 277 NYAEAFQITKDPFYLDVLKKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKFYV 329
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W E+++ILG+ A +F Y GN ++G N+L + S A
Sbjct: 330 WKKSEIKEILGDDADIFCLFYDATDGGN------------WEGNNILCNNLNISTVAFNF 377
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G EK IL C +KL DVRSKR P LDDK++VSWN L+I++FA+ ++
Sbjct: 378 GTTEEKVREILQACSKKLLDVRSKRVAPGLDDKILVSWNSLMITAFAKGYRV-------- 429
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
++ Y++ A+ SFI +L+ +L +++N +K G+L+DY++ ++
Sbjct: 430 --------TNESRYLDAAKDCISFIENNLF--SGDKLLRTYKNKTAKIDGYLEDYSYFVN 479
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LLD++E K+L A++L + E F D E +F T+ +++R K ++D + P
Sbjct: 480 CLLDVFEIEPDPKYLKLALKLGHHLVEHFWDSENNSFFMTSDNHEKLIIRPKSNYDLSLP 539
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAAD 614
SGNSVS ++RL +Q + + + E++ + MA P L+ +
Sbjct: 540 SGNSVSAFVMLRLFHFSQE------QQFLDIATKIMESQAQ-MAAENPFGFGYLLNTISI 592
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
L P + ++ ++S +++L Y N V+ I ++ ++ E+
Sbjct: 593 YLEKPVE--ITIINTENSQLCDSIL----LEYLPNSIVVTIQ--NSTQLSALSEY----P 640
Query: 675 SMARNNFSADKVVALVCQNFSCSPPV 700
A +F +K A VC+NF+CS P+
Sbjct: 641 FFAGKSFE-EKTSAFVCKNFTCSLPL 665
>gi|407462858|ref|YP_006774175.1| hypothetical protein NKOR_06800 [Candidatus Nitrosopumilus
koreensis AR1]
gi|407046480|gb|AFS81233.1| hypothetical protein NKOR_06800 [Candidatus Nitrosopumilus
koreensis AR1]
Length = 675
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 247/684 (36%), Positives = 366/684 (53%), Gaps = 70/684 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFE+E VA+ +N+ FV+IKVDREERPD+D +Y Q G GGWPLS
Sbjct: 49 SSCHWCHVMAHESFENEEVAQFMNENFVNIKVDREERPDIDDIYQKVCQIATGQGGWPLS 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+PD KP GTYFP D YGRPGF +I R++ AW +K + +S ++ L++
Sbjct: 109 IFLTPDQKPFYVGTYFPVLDSYGRPGFGSICRQLAQAWKEKPHDIEKSANNFLDALNKTE 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S P +L + L A L + DS +GGFGSAPKFP + + ++K
Sbjct: 169 KIST-----PSKLERTILDEAAMNLFQLGDSTYGGFGSAPKFPNAANVSFLFRYAKL--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+G S G K TL+ MA GGI D +GGGFHRYS D +W VPHFEKMLYD +
Sbjct: 221 SGLSKFTEFGLK----TLKKMANGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPV 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +AF +TKD FY I + LD++ R+M P G +SA DADS EG EG FYV
Sbjct: 277 NYAEAFQITKDPFYLDILKKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKFYV 329
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W E+++ILG+ + +F +Y + GN ++G N+L + S A
Sbjct: 330 WKKSEIKEILGDDSDIFCLYYDVTDGGN------------WEGNNILCNNLNISTVAFNF 377
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ EK IL C +KL DVRSKR P LDDK++VSWN L+I++FA+ ++
Sbjct: 378 GITEEKVREILQSCSKKLLDVRSKRIAPGLDDKILVSWNALMITAFAKGCRV-------- 429
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
++ Y+ A++ SFI +L+ +L +++N +K G+L+DY++ ++
Sbjct: 430 --------TNDSRYLNAAKTCISFIEDNLF--SGDKLLRTYKNKTAKIDGYLEDYSYFVN 479
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LLD++E K+L A++L + + F D E +F T+ +++R K ++D + P
Sbjct: 480 CLLDVFEIEPDPKYLKLALKLGHHLVDHFWDSENNSFFMTSDNHEKLIIRPKSNYDLSLP 539
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-MCCAADMLSV 618
SGNSVS ++RL + K E + + E++ + MA P + +S+
Sbjct: 540 SGNSVSAFAMLRLFHLSQEKKF------LEITEKIMESQAQ-MAAENPFGFGYLLNTISI 592
Query: 619 PSRKHVVLVGHKSSVDFEN--MLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
K + + + ++ EN + + Y N V+ I D S
Sbjct: 593 YLEKPIEI----TIINTENSPLCKSILLEYLPNSIVVTIQNPDQLSA------LSQYPFF 642
Query: 677 ARNNFSADKVVALVCQNFSCSPPV 700
A +F DK VC+NF+CS P+
Sbjct: 643 AGKSFE-DKTSVFVCKNFTCSLPL 665
>gi|375308642|ref|ZP_09773925.1| hypothetical protein WG8_2450 [Paenibacillus sp. Aloe-11]
gi|375079269|gb|EHS57494.1| hypothetical protein WG8_2450 [Paenibacillus sp. Aloe-11]
Length = 690
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 251/698 (35%), Positives = 356/698 (51%), Gaps = 70/698 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM+ ESFEDE +A++LN +VSIKVDREERPDVD +YM+ Q + G GGWPL+
Sbjct: 55 SSCHWCHVMKRESFEDEEIAEILNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLT 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTY P E K+GR G +L KV W ++ + L +E + L
Sbjct: 115 ILMTPDQKPFFAGTYLPKEQKFGRVGLLELLDKVGTRWKEQPEEL-------VELSEQVL 167
Query: 140 SASASSNKLP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+ + L EL + +L Q S ++D +GGFG APKFP P + +L +++
Sbjct: 168 TEHERQDMLAGYRGELDEQSLNKAFHQYSHTFDKEYGGFGEAPKFPSPHILSFLLRYAQH 227
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
TG + +MV TL M +GGI+DHVG GF RYSVDE+W VPHFEKMLYD
Sbjct: 228 ---TGN----QQALEMVEKTLDAMYRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNAL 280
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
LA Y + + +T Y I I Y+ R+M GG +SAEDADS EG +EG
Sbjct: 281 LAIAYTETWQVTGKELYRQITEQIFTYIAREMTDAGGAFYSAEDADS---EG----EEGR 333
Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSS 373
FYVW EV +LG E A F + Y + P GN F+G N+ LI++N
Sbjct: 334 FYVWDDSEVRAVLGDEDASFFNDLYGITPYGN------------FEGHNIPNLIDIN-LE 380
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A K + ++ + + E R KLF R KR PH DDK++ SWNGL+I + A+A +
Sbjct: 381 AYGLKHDLTKQELEDRVRELRDKLFAAREKRVHPHKDDKILTSWNGLMIVALAKAGQAFG 440
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
Y E A+ A SF+ HL RL +R+G + PG+LDD
Sbjct: 441 DVT----------------YTERAQKAESFLWSHL-RRVDGRLLARYRDGDAAYPGYLDD 483
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAF + GL++LY+ ++L A+ L +LF D E G F + ++ + KE
Sbjct: 484 YAFYVWGLIELYQATFDVQYLQRALTLNQNMIDLFWDEEHHGLFFYGKDSEQLIAKPKEI 543
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
+DGA PSGNS++ NLVRLA + ++ + Y A F + + +
Sbjct: 544 YDGAIPSGNSIAAHNLVRLARLTGEARLEDY---AAKQFKAFGGMVSYDPPGYSALLSSL 600
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ + + K +V+VG + + A A + N I D + D
Sbjct: 601 -LYATGTTKEIVIVGQRDDPQTLQFIRAIQAGFRPNTVAILKDEGQSAIADI-------- 651
Query: 674 ASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
R+ D K VC++F+C PV L+ LL
Sbjct: 652 VPYIRDYTLVDGKPAVYVCEHFACQAPVMTLDDLKALL 689
>gi|157690983|ref|YP_001485445.1| thioredoxin [Bacillus pumilus SAFR-032]
gi|157679741|gb|ABV60885.1| possible thioredoxin [Bacillus pumilus SAFR-032]
Length = 687
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 250/691 (36%), Positives = 361/691 (52%), Gaps = 77/691 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+ Q + G GGWPL+V
Sbjct: 54 TCHWCHVMAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNV 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++PD KP GTYFP YGRPGF L ++ DA+ RD IE L+E +
Sbjct: 114 FVTPDQKPFYAGTYFPKRSAYGRPGFIEALTQLLDAYHNDRD--------HIESLAEKAT 165
Query: 141 AS---ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ ++ + + L Q + QL S+D+ GGFG+APKFP P M+ + +
Sbjct: 166 NNLRIKAAGQTENTLTQETIHKAYYQLMSSFDTLHGGFGTAPKFPAP---HMLSFLMRYY 222
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
E TG+ K TL +A GGI+DHVG GF RYS DE+W VPHFEKMLYD L
Sbjct: 223 EWTGQENALYAVTK----TLDGIANGGIYDHVGSGFSRYSTDEKWLVPHFEKMLYDNALL 278
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ LT+ Y + ++ +++RDM+ P G +SA DADS EG KEG F
Sbjct: 279 MEAYTEAYQLTQQPTYEKLVHRLIHFIKRDMMNPDGSFYSAIDADS---EG----KEGQF 331
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW+ E+ LGE LF Y++ GN + + PH + +D AS
Sbjct: 332 YVWSKDEIMTHLGEDLGALFCAVYHITDEGNFEGENI--PH------TISTSFDDIKASF 383
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
S L+ L E R L VR +RP P +DDKV+ SWN L+IS+ A+ ++
Sbjct: 384 SIDDQTLQSKLQ---EARYILQSVRQQRPAPLVDDKVLTSWNALMISALAKTGRVF---- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
D +E + +A+ A SF+ HL Q RL +R G K GF++DYA
Sbjct: 437 ------------DAEEAIRMAKQAISFLETHLV--QHDRLMVRYREGDVKHLGFIEDYAH 482
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
++ + LYE WL A + ELF D+E GG+F + + ++L+R KE +DG
Sbjct: 483 MLKAYMSLYEATFELAWLEKATAIAENMFELFWDKEKGGFFFSGSDAEALLVREKEVYDG 542
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCA--- 612
A PSGNS ++ +L+ L+ + RQN +L +F+ D++ + P A
Sbjct: 543 AMPSGNSTALKHLLILSRLTG-------RQNWLDTLEQMFQAFYVDVS-SYPSGHTAFLQ 594
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
+ +++ ++++G E +L A L K + D T E + +
Sbjct: 595 GLLAQYATKREIIILGKNGDPQKEQLLQA------LQKRFMPFDIILTAETG---QELAK 645
Query: 673 NASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
A ++ + D K +C+N+SC P+TD
Sbjct: 646 LAPFTKDYKTIDGKTTVYICENYSCRQPITD 676
>gi|373849972|ref|ZP_09592773.1| hypothetical protein Opit5DRAFT_0827 [Opitutaceae bacterium TAV5]
gi|372476137|gb|EHP36146.1| hypothetical protein Opit5DRAFT_0827 [Opitutaceae bacterium TAV5]
Length = 785
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 248/717 (34%), Positives = 371/717 (51%), Gaps = 74/717 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM E+F VA LN+ F+ +K+DREERPD+D++Y+ +V G GGWPL+
Sbjct: 112 STCHWCHVMRRETFSRADVAAFLNEHFIPVKLDREERPDIDRIYLAFVAGTTGRGGWPLN 171
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PDLKP +GGTY+PPED+ G+PGF T+ R + W + R+ +A A
Sbjct: 172 VWLTPDLKPFLGGTYYPPEDQPGQPGFLTVARVAAEGWARDREKVAAH-----ADRIAAA 226
Query: 140 SASASSNKLPDE---------LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
AS + PD+ + A A QL + +D GGFG KFP +I+ +
Sbjct: 227 LASLAGAAGPDQRSGRSGAATIDNAAWSAAAAQLFEEFDPEHGGFGRDAKFPHASKIRFL 286
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
+ ++ +GEA+ +++ +L+ + GG+ DH+GGGFHRY+VD W +PHFEKM
Sbjct: 287 FRFA--VQPGVPAGEAARAREVAFASLEALTGGGLRDHLGGGFHRYTVDRGWRLPHFEKM 344
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYDQ +A + +DA+ L+ D + R+ L ++ + P G ++A DA+SA A
Sbjct: 345 LYDQALVAGLLVDAYQLSGDTRRFDLLRETLAFVEAALTSPDGAFYAALDAESALPGAAE 404
Query: 311 -RKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-- 366
K EGAFY W+ E+ L + A L Y GN + + + +NVL
Sbjct: 405 GDKAEGAFYTWSLDEITAALPPDEAALVIARYGFTAEGNA--TSLEERAGVLHNRNVLVP 462
Query: 367 ------IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
+ + +A KL L+ +L +RS R P D+K+I +WNG
Sbjct: 463 ASSAAATAVTKAPGAAEKLSRALD-----------RLRAIRSTRQPPARDEKIITAWNGY 511
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
+IS+ ARA + V G R ++++A AA+ + + ++ +T L+
Sbjct: 512 MISALARAHQ--------------VTGESR--WLDLATRAATHLWQTAWNGKTATLRRI- 554
Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-----GGG 535
P GF +DYA I GLLDLYE G +WL A+ LQ T D F D GGG
Sbjct: 555 -AAPGGGDGFAEDYAAFIQGLLDLYEAGFDPRWLDRALALQATLDTRFADPAPASAGGGG 613
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
YF T VL+R+KED DGAEP+ +S++ NL RLA + Y A LA F
Sbjct: 614 YFGTAAGASGVLVRMKEDFDGAEPAASSLAADNLRRLAVFTGDAA---YEHRARAVLAAF 670
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSR-KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
+ + A+P++ AA L+ ++ + +V+ G + D +LA A + T++
Sbjct: 671 APQHRRAPAAMPVLLAAAFGLAEGAKPRQIVIAGRAGADDTRALLAEARRRFQPFATILL 730
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
AD D+ + N A+M SAD + A VC+NF+C PV+DP +L LL
Sbjct: 731 ---ADGASGDWLAQRNEAVAAMR----SADGQATAFVCENFACDAPVSDPAALGRLL 780
>gi|167043013|gb|ABZ07725.1| putative protein of unknown function, DUF255 [uncultured marine
microorganism HF4000_ANIW141A21]
Length = 678
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 254/711 (35%), Positives = 380/711 (53%), Gaps = 83/711 (11%)
Query: 10 TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
+K +R + +I +TCHWCHVM E+FE++ A++LN F+ IKVDREERPD+D++Y
Sbjct: 39 SKAKRENKIIFLSIGYSTCHWCHVMAHETFENDEAAEILNQNFIPIKVDREERPDIDELY 98
Query: 64 MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-D 122
M V ++ G GGWPL+VFL+PDLKP GGTY+P FK++L V + W+K+R D
Sbjct: 99 MKAVTSMGGQGGWPLTVFLTPDLKPFYGGTYYP------LSSFKSLLGSVTEIWNKQRKD 152
Query: 123 MLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFP 182
+ Q+ + +E L + S+ E P +A L L S+D R+GGFG +PKFP
Sbjct: 153 VFGQANSI-VENLRRMYTPQEQSS--ISEYPIDAAYL---NLVDSFDDRWGGFGDSPKFP 206
Query: 183 RPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
P + ++L + D K+ +A + MV+ TL M+ GGI DH+ GGFHRYSVD W
Sbjct: 207 TPSNLILLL----RYYDRSKNHKALD---MVVKTLDAMSSGGIQDHLAGGFHRYSVDRMW 259
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
+ HFEKMLYD L YL+A+ + + R L+++ R+M G +SA+DAD
Sbjct: 260 VISHFEKMLYDNALLTIAYLEAYRCKPNDAFEKTARMTLNWILREMQSKDGAFYSAQDAD 319
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
S + EGA+YVW+ E+ DILG ++ ++ E + + GN + K
Sbjct: 320 SPDG-------EGAYYVWSKAEISDILGPKNGMIVAEWFGVGDEGNFE-----------K 361
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
K+VL + A K+G+ +K + ++ + + L RS R +P DDK++ SWNGL
Sbjct: 362 EKSVLTTRTNLDDLAKKVGLTPKKLVALMDKSKAALLQARSHRVKPSTDDKILTSWNGLT 421
Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
IS+ A +++L DR EY+E A+ AASF+ L + RL +R
Sbjct: 422 ISALALGAQVL---------------GDR-EYLEAAKRAASFLMETL--SEKGRLLRRYR 463
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 541
+G + G L+DYAF I GLLDLYE KWL A+ L + ELF D GG+F G
Sbjct: 464 DGEAALGGTLEDYAFFIQGLLDLYEADLQIKWLQEAMRLADKMIELFWDDSSGGFF-FNG 522
Query: 542 EDPS--VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
+D S +++++KE +DGA PSGNSV + L++L S+ D YR+ ++ F R+
Sbjct: 523 KDSSDNMIVKIKEAYDGATPSGNSVGALALLKLGVF---SERDEYREKGVKTIMSFFGRI 579
Query: 600 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
+ MA M A D SR+ +++ G +++ +ML Y NK V+ +
Sbjct: 580 ESNPMAHSHMLSAVDFHLRGSRE-IIVAGSDANL-INDMLHEIWRRYIPNK-VLALSGKA 636
Query: 660 TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
E+ M + V +C+NF C PV+ L +L
Sbjct: 637 VEK----------TIPMVKGKIGT-PVSVYICENFVCKRPVSKLKELTAML 676
>gi|384170788|ref|YP_005552166.1| hypothetical protein BAXH7_04212 [Bacillus amyloliquefaciens XH7]
gi|341830067|gb|AEK91318.1| hypothetical protein BAXH7_04212 [Bacillus amyloliquefaciens XH7]
Length = 664
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 246/696 (35%), Positives = 357/696 (51%), Gaps = 70/696 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 28 STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 87
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R +E ++E
Sbjct: 88 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQH--------VEDIAENA 139
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+L+ +
Sbjct: 140 AAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLLFLLRYY 196
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 197 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 252
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ Y +A+ +T + Y I I+ +++R+M+ G FSA DAD TEG +EG +
Sbjct: 253 LSAYTEAYQVTNNERYKQIATQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 305
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSA 374
Y+W+ KE+ ++LG+ L+ + Y + GN F+G+N+ LI A
Sbjct: 306 YIWSKKEIMNLLGDQLGSLYCKVYNITEQGN------------FEGENIPNLI-FTRREA 352
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
+ G+ + L R+KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 353 ILEETGLTEHELTERLEGARKKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKVFHE 412
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
++ +AE+A F+ RHL + R+ +R G K GF+DDY
Sbjct: 413 PG----------------FLSMAETAIRFLERHLIPDG--RVMVRYREGEVKNKGFIDDY 454
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
AFLI L+LYE G +L A L + +LF D GG+F T + ++L+R KE +
Sbjct: 455 AFLIWAYLELYEAGFNPSYLKKAKTLCTSMLDLFWDERHGGFFFTGNDAETLLVREKEVY 514
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DGA PSGNS + + L+RL + + AE +VF+ ++ + +
Sbjct: 515 DGAVPSGNSAAAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSV- 570
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
+ + +K +V+ G K D + + A + T++ + EE +
Sbjct: 571 LAHIMPQKEIVVFGSKDDPDRKWFIEALQEHFTPAYTILAAENP--------EELAGISD 622
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A K +C+NF+C P TD N+L
Sbjct: 623 FAAGYEMIDGKTTVYICENFTCRRPTTDIDEAMNVL 658
>gi|443631576|ref|ZP_21115757.1| hypothetical protein BSI_08280 [Bacillus subtilis subsp.
inaquosorum KCTC 13429]
gi|443349381|gb|ELS63437.1| hypothetical protein BSI_08280 [Bacillus subtilis subsp.
inaquosorum KCTC 13429]
Length = 689
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 244/698 (34%), Positives = 364/698 (52%), Gaps = 89/698 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDAEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + L ++A QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 173 AAKSGEG-----LSESATHRTFLQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIE------LN 370
W+ E+ LG+ L+ + Y + GN F+GKN+ LI +
Sbjct: 334 WSKDEILKTLGDDLGTLYCQVYDITEKGN------------FEGKNIPNLIHTKREQLIA 381
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
D+S + +L + LE + R++L +R +R PH+DDKV+ SWN L+I+ A+A+K
Sbjct: 382 DASLTKEELNLKLE-------DARQQLLKIREERTYPHVDDKVLTSWNALMIAGLAKAAK 434
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+ + +Y+ +A+ A +FI L + R+ +R+G K GF
Sbjct: 435 VYQ----------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGF 476
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
+DDYAFL+ LDLYE +L A +L + LF D E GG++ T + ++++R
Sbjct: 477 IDDYAFLLWAYLDLYEASFDLSYLRKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVRE 536
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
KE +DGA PSGNSV+ + L+RL V G S + AE +VF+ +
Sbjct: 537 KEVYDGAMPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAESMFSVFKPDIDAYPSGHAFFM 593
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
+ +P +K +V+ G+ + ++ A ++ N +++ EH
Sbjct: 594 QSVLKHLMP-KKEIVIFGNADDPARKQIITALQKAFKPNDSIL------------VAEHP 640
Query: 671 SNNASMARNNFSAD------KVVALVCQNFSCSPPVTD 702
+A F+AD K +C+NF+C P T+
Sbjct: 641 DECTDIAP--FAADYRIIDGKTTVYICENFACQQPTTN 676
>gi|407478214|ref|YP_006792091.1| hypothetical protein Eab7_2389 [Exiguobacterium antarcticum B7]
gi|407062293|gb|AFS71483.1| Hypothetical protein Eab7_2389 [Exiguobacterium antarcticum B7]
Length = 677
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 244/729 (33%), Positives = 375/729 (51%), Gaps = 97/729 (13%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F T + FL +TCHWCHV+ ESFEDE A++LN+ FVSIKVDREERPD
Sbjct: 28 GEEAFSLARATNKPIFLSIGYSTCHWCHVLAHESFEDEETARMLNERFVSIKVDREERPD 87
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YMT Q + G GGWPLSVFLSPD P GTYFP ++ RP F+ ++ ++ + +
Sbjct: 88 IDQIYMTAAQLMNGQGGWPLSVFLSPDQTPFYIGTYFPKTPQFNRPSFRQVILQLSEHYR 147
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ + + G I+ L++ SA ++ +L D L + +Q + +D + GGFG A
Sbjct: 148 TDPEKIKRVGNELIQALTDVTSAD-TTGQLDDTLIHDTF----DQAMRQFDVQNGGFGEA 202
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L D + E +MV+ TL M GGI D +G G RY+V
Sbjct: 203 PKFPSPSLLTFLL-------DYYRFAEDETALQMVMRTLTAMRDGGITDQIGFGLCRYTV 255
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DERW VPHFEKMLYD A + ++ + ++ + ++ Y+ RD++ P G +SA
Sbjct: 256 DERWDVPHFEKMLYDNALFATLCIETYQVSGRERFKQYAEEVFTYIERDLLSPDGAFYSA 315
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADS EG +EG FY +T E+ D+LGE A LF Y P GN
Sbjct: 316 EDADS---EG----REGTFYTFTYDELLDVLGEDA-LFPRFYQATPQGN----------- 356
Query: 359 EFKGKNVLIELNDSSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
F G+NV N S A G ++K L L + R+ L VRS+R RP DDK++ +W
Sbjct: 357 -FDGRNVFRRTNQSVQQFADDNGRTVQKTLFQLEQERQTLLHVRSQRIRPFRDDKILTAW 415
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
N L+IS++A+A ++ D Y +VA A +F+ HL D+ RL+
Sbjct: 416 NALMISAYAKAGRVF----------------DDHHYTDVAIRALTFLETHLMDDD--RLR 457
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+R G + GFLDDY+FL L+L++ T ++ A+ L + + F D E G +F
Sbjct: 458 VRYREGHIQGNGFLDDYSFLTEAYLELHQTTQQTVYIQQALRLTDRMIQDFGD-EQGSFF 516
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T+ E+ ++L+R K+ +DG +P+GNS +V+NL+RL+ + + YR+ A+H +
Sbjct: 517 FTSVEEETLLVRPKDIYDGVKPAGNSTAVLNLIRLSQLTGRTD---YRECAQHVFSALAL 573
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVG------------HKSSVDFENMLAAAHAS 645
+ + A + ++ ++L HK + ++LA
Sbjct: 574 EVASQPTGFASLLSAYVRTWLEPKELIMLTDSLETIGPFLADLHKRRLPELSVLAGK--- 630
Query: 646 YDLNKTVIHIDP--ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
+T++ + P AD + +D + A +CQ+F C P T+
Sbjct: 631 ---KETLLKVAPFIADYDLID-------------------SRPTAYLCQDFQCERPTTNL 668
Query: 704 ISLENLLLE 712
L + ++E
Sbjct: 669 SELLHQIIE 677
>gi|255306584|ref|ZP_05350755.1| hypothetical protein CdifA_08327 [Clostridium difficile ATCC 43255]
Length = 678
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 237/698 (33%), Positives = 364/698 (52%), Gaps = 83/698 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+++N FV+IKVD+EERPDVD VYMT QA+ G GGWP+++
Sbjct: 54 TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP +Y RPG +L V + W+ RD+L +SG IE L +
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
+ L E+ +++R+ YD ++GGFG+APKFP P + ++ Y +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D KMV TL M +GG+ DH+G GF RYS D++W PHFEKMLYD L
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+LDA+ +TK Y I +DY+ R+M G +SA+DADS EG +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
+ E+ ++LGE I F ++ + +GN F+GK++ LI+
Sbjct: 334 TFNPLEIIEVLGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK------- 374
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
E++ + + +K+F+ R +R H DDK++ SWN L+I + +A LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y+E + +FI +L +E + RL +R+G S +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FLI ++LYE K+L A+ L + LF D E G++ + +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV + NL+RLA I ++ + + + L ++ +K + M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589
Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ S K ++ + + S + F+ +++ + P T + E N+
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ DK+ VCQ+ SCS P+ D L++++L
Sbjct: 638 GFLNNYRLKDDKISYYVCQSNSCSQPINDLQKLKDMIL 675
>gi|384161675|ref|YP_005543748.1| YyaL [Bacillus amyloliquefaciens TA208]
gi|328555763|gb|AEB26255.1| YyaL [Bacillus amyloliquefaciens TA208]
Length = 689
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 246/696 (35%), Positives = 357/696 (51%), Gaps = 70/696 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R +E ++E
Sbjct: 113 VFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 164
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+L+ +
Sbjct: 165 AAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLLFLLRYY 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 222 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ Y +A+ +T + Y I I+ +++R+M+ G FSA DAD TEG +EG +
Sbjct: 278 LSAYTEAYQVTNNERYKQIATQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 330
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSA 374
Y+W+ KE+ ++LG+ L+ + Y + GN F+G+N+ LI A
Sbjct: 331 YIWSKKEIMNLLGDQLGSLYCKVYNITEQGN------------FEGENIPNLI-FTRREA 377
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
+ G+ + L R+KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 378 ILEETGLTEHELTERLEGARKKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKVFHE 437
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
++ +AE+A F+ RHL + R+ +R G K GF+DDY
Sbjct: 438 PG----------------FLSMAETAIRFLERHLIPDG--RVMVRYREGEVKNKGFIDDY 479
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
AFLI L+LYE G +L A L + +LF D GG+F T + ++L+R KE +
Sbjct: 480 AFLIWAYLELYEAGFNPSYLKKAKTLCTSMLDLFWDERHGGFFFTGNDAETLLVREKEVY 539
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DGA PSGNS + + L+RL + + AE +VF+ ++ + +
Sbjct: 540 DGAVPSGNSAAAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSV- 595
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
+ + +K +V+ G K D + + A + T++ + EE +
Sbjct: 596 LAHIMPQKEIVVFGSKDDPDRKWFIEALQEHFTPAYTILAAENP--------EELAGISD 647
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A K +C+NF+C P TD N+L
Sbjct: 648 FAAGYEMIDGKTTVYICENFTCRRPTTDIDEAMNVL 683
>gi|67517751|ref|XP_658661.1| hypothetical protein AN1057.2 [Aspergillus nidulans FGSC A4]
gi|40747019|gb|EAA66175.1| hypothetical protein AN1057.2 [Aspergillus nidulans FGSC A4]
gi|259488639|tpe|CBF88239.1| TPA: DUF255 domain protein (AFU_orthologue; AFUA_1G12370)
[Aspergillus nidulans FGSC A4]
Length = 774
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 240/602 (39%), Positives = 331/602 (54%), Gaps = 37/602 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA +LN+ F+ IKVDREERPDVD +YM YVQA G GGWPL+
Sbjct: 66 SACHWCHVMEKESFMSQEVASILNESFIPIKVDREERPDVDDIYMNYVQATTGSGGWPLN 125
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPG-----FKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+P + G F IL K++D W +R +S +Q
Sbjct: 126 VFLTPDLEPVFGGTYWPGPNAASLLGPETVSFIEILEKLRDVWQTQRQRCLESAKEITKQ 185
Query: 135 L---SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
L +E + + ++ ++L L + + YD GGF APKFP P + +L
Sbjct: 186 LREFAEEGTHTFQGDQSDEDLDVELLEEAYQHFASRYDINNGGFSRAPKFPTPANLSFLL 245
Query: 192 ---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
+ + D E M + TL MA+GGI DH+G GF RYSV W +PHFE
Sbjct: 246 RLGIYPSAVTDIVGQEECENATAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHFE 305
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETE 307
KMLYDQ QL +VY DAF +T + + D++ YL I G S+EDADS T
Sbjct: 306 KMLYDQAQLLDVYADAFKITHNPEFLGAVYDLITYLTSAPIQSTTGGFHSSEDADSLPTP 365
Query: 308 GATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
T K+EGAFYVWT KE+ +LG A + H+ + GN ++ +DPH+EF +NVL
Sbjct: 366 NDTEKREGAFYVWTLKELTQVLGPRDAGVCARHWGVLSDGN--IAPENDPHDEFMDQNVL 423
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSF 425
S A + G+ ++ + I+ R++L + R K R RP LDDK+IV+WNGL I +
Sbjct: 424 SIKVTPSKLAKEFGLGEDEVVRIIKSGRQRLREYRDKNRVRPDLDDKIIVAWNGLAIGAL 483
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP- 484
A+ S +L E +S S + E A A +FI+ LYD+ T +L +R+G
Sbjct: 484 AKCS-VLFEEIDS---------SKSAQCREAAAKAINFIKETLYDKATGQLWRIYRDGSK 533
Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNTTG 541
PGF +DYAFL SGLLD+YE +L +A +LQ +E FL G GY+ T
Sbjct: 534 GTTPGFAEDYAFLTSGLLDMYEATFDDSYLQFAEQLQRYLNENFLAYAGSSPAGYYTTPS 593
Query: 542 E----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
P+ LLR+K + A PS N V NL+RL+SI+ + + YR A + F
Sbjct: 594 TSAPGSPATLLRLKTGTESAVPSVNGVIARNLLRLSSIL---EENSYRVLARQTCQSFAV 650
Query: 598 RL 599
+
Sbjct: 651 EI 652
>gi|153953760|ref|YP_001394525.1| hypothetical protein CKL_1135 [Clostridium kluyveri DSM 555]
gi|219854377|ref|YP_002471499.1| hypothetical protein CKR_1034 [Clostridium kluyveri NBRC 12016]
gi|146346641|gb|EDK33177.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
gi|219568101|dbj|BAH06085.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
Length = 633
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 231/697 (33%), Positives = 367/697 (52%), Gaps = 74/697 (10%)
Query: 17 FLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGW 76
+LI TCHWCHVM ESF+D VA++LN +F+S+KVDREERPDVD +YM Q++ G GGW
Sbjct: 4 YLICTCHWCHVMAKESFQDNEVAEILNKYFISVKVDREERPDVDSIYMKVCQSITGSGGW 63
Query: 77 PLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
PL++ ++P+ KP GTYFP + G IL ++ AW + L + G ++ +
Sbjct: 64 PLTIIMTPEQKPFFAGTYFPKNNVGEALGLIAILEYIQKAWKDNKAQLLKEGD-SLLDII 122
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L+ ++S EL Q+ L+ + +++D+ +GGFG PKFP + +L + K
Sbjct: 123 NTLNKNSSG-----ELSQDILKKAFLEFKQNFDTLYGGFGGYPKFPSAHNLLFLLRYFHK 177
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+D + +MV TL+ M +GG++DH+G GF RYSVD +W +PHFEKMLYD
Sbjct: 178 TKD-------AFALEMVEKTLESMYRGGMYDHIGYGFSRYSVDRKWLIPHFEKMLYDNAL 230
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+A YL+ F +T + Y+ + +I +Y+ RDM G +SAEDADS EG +EG
Sbjct: 231 IAMAYLETFQVTGNKKYAKVAEEIFEYVLRDMTSKEGGFYSAEDADS---EG----EEGK 283
Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY+W+ +E++DILG E F ++ + GN F+GKN+ + +S
Sbjct: 284 FYMWSQEEIKDILGQEQGSKFCCYFNVTSQGN------------FRGKNIPNLIGNS--- 328
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
LE+ + + CR KLF R KR PH DDK++ SWNGL+I++ A A ++L
Sbjct: 329 ------ILEEDVQFIKNCREKLFKYREKRVHPHKDDKILTSWNGLMIAAMALAGRVL--- 379
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ +Y A+ + FI ++L + RL +R G S G+ DDYA
Sbjct: 380 -------------NNSKYTLAAKKSVDFIYKNLI-RKDGRLLARYREGDSSFLGYADDYA 425
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FLI GL++LYE ++L A+EL E+F D E GG+F + +++R KE +D
Sbjct: 426 FLIWGLIELYETTYNPEYLKNALELNQNFLEIFWDSENGGFFLYGKDSEKLIIRPKEIYD 485
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G P GNS + +NL+RL+ + + + + F ++ ++ A
Sbjct: 486 GPTPCGNSAAALNLLRLSYLATSYE---FEDKVKQLFENFADEIESSPISCSFSLVALLF 542
Query: 616 LSVPSRKHVVLVGH--KSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
P R+ ++ G + +M+ ++ + ++ H++ +E +
Sbjct: 543 SKYPVRQIIISAGENINEARKVLDMINKKYSPFTVSVLYSHLN----------KELKNIC 592
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + KV VC+NF+C P+T+ L+ +L
Sbjct: 593 PSIEQYIAIRGKVTVYVCENFTCKEPITNMDLLKEVL 629
>gi|425767540|gb|EKV06109.1| hypothetical protein PDIG_78870 [Penicillium digitatum PHI26]
gi|425780454|gb|EKV18461.1| hypothetical protein PDIP_27280 [Penicillium digitatum Pd1]
Length = 752
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 244/605 (40%), Positives = 330/605 (54%), Gaps = 40/605 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN+ FV IKVDREERPD+D +YM YVQA G GGWPL+
Sbjct: 32 SACHWCHVMEKESFMSSEVASILNESFVPIKVDREERPDIDDIYMNYVQATTGSGGWPLN 91
Query: 80 VFLSPDLKPLMGGTYF--PPEDKYGRP---GFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+ P + P GF IL K++D W ++ S +Q
Sbjct: 92 VFLTPDLEPVFGGTYWQGPNSTTFTGPEAIGFVEILEKLRDVWQTQQQRCLDSAKEITKQ 151
Query: 135 LSEALSASASSNK------LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 188
L E S + +++ L + + YDS GGFG APKFP P +
Sbjct: 152 LREFAEEGTHSQQGDRDDDNDEDMDIELLEEAYQHFASRYDSVNGGFGRAPKFPTPSNLS 211
Query: 189 MMLY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
+L + ++ D E + M + TL MA+GGI DH+G GF RYSV W +P
Sbjct: 212 FLLRLGAYPTQVMDVVGHDECEQATAMAVTTLVNMARGGIRDHIGHGFARYSVTTDWGLP 271
Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSA 304
HFEKMLYDQ QL +VY+DAF LT D D+ YL I P G FS+EDADS
Sbjct: 272 HFEKMLYDQAQLLDVYVDAFRLTHDPELLGAVYDLAAYLTSAPIQSPTGGFFSSEDADSY 331
Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
T K+EGAFYVW+ KE+ +LG A + +H+ + P GN + DPH+EF +
Sbjct: 332 PHPNDTEKREGAFYVWSLKELTSVLGPRDAPVCAKHWGVLPDGN--VPPEYDPHDEFMNQ 389
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVI 422
NVL S A G+ E+ + I+ ++KL D R + R RP LDDK+IV+WNGL I
Sbjct: 390 NVLSIRATPSKLAKDFGLSEEEVVKIIKSSKQKLHDYRERSRGRPDLDDKIIVAWNGLAI 449
Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 482
+ A+ S +L E ES+ + E A A SFI+ L+D+ T +L +R
Sbjct: 450 GALAKCS-VLFEEIESSKAVY---------CREAAARAISFIKDKLFDKTTGQLWRIYRG 499
Query: 483 GP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG---GYFN 538
G PGF DDYA+L SGLLD+Y+ +L +A LQ +E FL + G GY++
Sbjct: 500 GNRGDTPGFADDYAYLASGLLDMYDATYDDSYLQFAERLQKYLNEYFLAQSGSTATGYYS 559
Query: 539 T----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
T T P LLR+K + A PS N V NL+RL++++ + + YR A +
Sbjct: 560 TPSVITPGMPGPLLRLKTGTESATPSVNGVIARNLLRLSALL---EDESYRTLARQTCNT 616
Query: 595 FETRL 599
F +
Sbjct: 617 FAVEI 621
>gi|340345243|ref|ZP_08668375.1| Thioredoxin [Candidatus Nitrosoarchaeum koreensis MY1]
gi|339520384|gb|EGP94107.1| Thioredoxin [Candidatus Nitrosoarchaeum koreensis MY1]
Length = 675
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 220/561 (39%), Positives = 319/561 (56%), Gaps = 49/561 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE++ VAK +N+ FV+IKVDREERPD+D +Y Q G GGWPLS
Sbjct: 49 SACHWCHVMAHESFENDEVAKFMNENFVNIKVDREERPDLDDIYQKVCQIATGQGGWPLS 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+PD KP GTYFP D YGRPGF +I R++ AW +K + +S + L +A
Sbjct: 109 IFLTPDQKPFYVGTYFPVLDSYGRPGFGSITRQLAQAWKEKPKDIEKSADNFLSALQKAE 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ K+P +L + L A L + D+ +GGFGSAPKFP + + ++K
Sbjct: 169 TV-----KIPSKLEKVILDEAAMNLFQLGDAAYGGFGSAPKFPNAANVSFLFRYAKL--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG S+ + L TL MAKGGI D +GGGFHRYS D +W VPHFEKMLYD +
Sbjct: 221 TG----LSKFNEFALKTLNKMAKGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPV 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T+D FY + L ++ R+M G +SA DADS EG EG FYV
Sbjct: 277 NYAEAYQITQDQFYLEVLHKTLGFVLREMTSKEGGFYSAYDADS---EGV----EGKFYV 329
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W E+++ILG+ A +F +Y + GN ++G ++L + SA A
Sbjct: 330 WKKSEIKEILGDDAEIFCLYYDVTDGGN------------WEGNSILCNNINISAVAFHF 377
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
GMP EK IL C KL +VRSKR P LDDKV+ SWN L+I++FA+ ++
Sbjct: 378 GMPEEKIKEILVRCSEKLLNVRSKRVPPGLDDKVLTSWNALMITAFAKGYRV-------- 429
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ +Y++ A++ SFI L D+ +L +++N +K G+L+DY++ +
Sbjct: 430 --------TGETKYLDAAKNCVSFIETKLLDDT--KLLRTYKNNVAKIDGYLEDYSYFAN 479
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LLD++E K+L A++L + + F D E +F T+ + +++R K ++D + P
Sbjct: 480 ALLDVFEIEPEAKYLNLAVKLGHHLVDHFWDPESSSFFMTSDDHEKLIIRPKSNYDLSLP 539
Query: 560 SGNSVSVINLVRLASIVAGSK 580
SGNSVS ++RL + K
Sbjct: 540 SGNSVSCFVMLRLYHLTQEEK 560
>gi|384177739|ref|YP_005559124.1| hypothetical protein I33_4252 [Bacillus subtilis subsp. subtilis
str. RO-NN-1]
gi|349596963|gb|AEP93150.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
str. RO-NN-1]
Length = 689
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 241/691 (34%), Positives = 361/691 (52%), Gaps = 75/691 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 53 STCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP K+ RPGF +L + + + R+ + A + L
Sbjct: 113 VFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKT 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +
Sbjct: 173 AAKSGEG-----LSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHN 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L
Sbjct: 225 TGQENALYNVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YV
Sbjct: 281 AYTEAYQVTQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYV 333
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ +E+ LG+ L+ + Y + GN F+GKN+ ++ +
Sbjct: 334 WSKEEILKTLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKWEQIKE 381
Query: 379 LGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
EK L++ L + R++L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 382 DAGLTEKELSLKLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ---- 437
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+Y+ +A+ A +FI L + R+ +R G K GF+DDYAFL
Sbjct: 438 ------------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRGGEVKNKGFIDDYAFL 483
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LDLYE +L A +L + LF D E GG++ T + ++++R KE +DGA
Sbjct: 484 LWAYLDLYEASFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDGA 543
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ + L+RL V G S + AE +VF+ + +
Sbjct: 544 VPSGNSVAAVQLLRLGQ-VTGDLS--LIEKAETMFSVFKLDIDAYPSGHAFFMQSVLRHL 600
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+P +K +V+ G + ++ ++ N +++ EH +A
Sbjct: 601 MP-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA 647
Query: 678 RNNFSAD------KVVALVCQNFSCSPPVTD 702
F+AD K +C+NF+C P T+
Sbjct: 648 --PFAADYRIIDGKTTVYICENFACQQPTTN 676
>gi|320031949|gb|EFW13906.1| DUF255 domain-containing protein [Coccidioides posadasii str.
Silveira]
Length = 799
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 243/611 (39%), Positives = 335/611 (54%), Gaps = 49/611 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN FV IK+DREERPD+D+VYM YVQA+ G GGWPL+
Sbjct: 69 SACHWCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLN 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
VFL+PDL+P+ GGTY+P P F IL K++D W+ ++ +S
Sbjct: 129 VFLTPDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEI 188
Query: 132 IEQLSEALSASASSNKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
QL E + + + P ++L L + YD GGF APKFP P
Sbjct: 189 TRQLRE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPAN 247
Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
+ +L Y ++ G+ E + +MV TL MA+GGIHD +G GF RYSV W
Sbjct: 248 LSFLLRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDW 306
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
+PHFEKMLYDQ QL +VY+D F +T++ DI+ Y+ ++ P G S+EDA
Sbjct: 307 SLPHFEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDA 366
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
DS T K+EGAFYVWT KE++ ILG+ A + H+ + P GN ++R +DPH+EF
Sbjct: 367 DSFPNSNDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGNDPHDEF 424
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
+NVL A G+ ++ + ++ R+KL + R + R RP LDDK+IVSWNG
Sbjct: 425 INQNVLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNG 484
Query: 420 LVISSFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
L I + A+ S +L K +AE A VAE AA FIR +L+D +T +L
Sbjct: 485 LAIGALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWR 533
Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
+R+G + PGF DDYA+L SGL+ LYE +L +A LQ + FL
Sbjct: 534 VYRDGRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTT 593
Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
GY+ N + P L R+K D A PS N V NL+RLAS++ + D Y+ A
Sbjct: 594 PAGYYMTPQNMPEDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALA 650
Query: 589 EHSLAVFETRL 599
H+ + F +
Sbjct: 651 RHTCSAFAAEM 661
>gi|255937427|ref|XP_002559740.1| Pc13g13260 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584360|emb|CAP92395.1| Pc13g13260 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 788
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 247/605 (40%), Positives = 329/605 (54%), Gaps = 40/605 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN+ FV IKVDREERPD+D VYM YVQA G GGWPL+
Sbjct: 68 SACHWCHVMEKESFMSSEVASILNESFVPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 127
Query: 80 VFLSPDLKPLMGGTYF--PPEDKYGRP---GFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+P L+P+ GGTY+ P + P GF IL K++D W ++ S +Q
Sbjct: 128 VFLTPSLEPVFGGTYWQGPNSTTFRGPEAIGFVEILEKLRDVWQTQQQRCLDSAKEITKQ 187
Query: 135 LSEALSASASS------NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 188
L E + N +E+ L + + YDS GGFG APKFP P +
Sbjct: 188 LREFAEEGTHTQQGDRDNDKDEEMDIELLEEAYQHFASRYDSVNGGFGRAPKFPTPSNLS 247
Query: 189 MMLY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
+L + ++ D E + M + TL MA+GGI DH+G GF RYSV W +P
Sbjct: 248 FLLRLGAYPTQVMDVVGHDECEQATAMAVTTLVNMARGGIRDHIGHGFARYSVTADWGLP 307
Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSA 304
HFEKMLYDQ QL +VY+DAF LT D D+ YL I P G FS+EDADS
Sbjct: 308 HFEKMLYDQAQLLDVYVDAFRLTHDPELLGAVYDLSAYLTSAPIQSPTGGFFSSEDADSY 367
Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
T K+EGAFYVW+ KE+ +LG A + +H+ + P GN + DPH+EF +
Sbjct: 368 PHPNDTEKREGAFYVWSLKELTSVLGPRDAPVCAKHWGVLPDGN--VPPEYDPHDEFMNQ 425
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVI 422
NVL S A G+ E+ + I+ ++KL D R + R RP LDDK+IV+WNGL I
Sbjct: 426 NVLSIRATPSKLAKDFGLSEEEVVKIIKSSKQKLHDHREQTRGRPDLDDKIIVAWNGLAI 485
Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 482
+ A+ S +L E ES S E A A FI+ L+D+ T +L +R+
Sbjct: 486 GALAKCS-VLFEEIES---------SKAVHCREAAARAIGFIKDKLFDKATGQLWRIYRD 535
Query: 483 GP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFN 538
G PGF DDYA+L SGLLD+Y+ +L +A LQ +E FL + G GY++
Sbjct: 536 GNRGDTPGFADDYAYLASGLLDMYDATYDDSYLQFAERLQKYLNEYFLAQSGSTAAGYYS 595
Query: 539 ----TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
TT P LLR+K + A PS N V NL+RL++++ G +S YR A +
Sbjct: 596 TPSVTTPGMPGPLLRLKTGTESATPSVNGVIARNLLRLSALL-GDES--YRTLARQTCNT 652
Query: 595 FETRL 599
F +
Sbjct: 653 FAVEI 657
>gi|126699171|ref|YP_001088068.1| hypothetical protein CD630_15680 [Clostridium difficile 630]
gi|115250608|emb|CAJ68432.1| conserved hypothetical protein [Clostridium difficile 630]
Length = 678
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 236/698 (33%), Positives = 364/698 (52%), Gaps = 83/698 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+++N FV+IKVD+EERPDVD VYMT QA+ G GGWP+++
Sbjct: 54 TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP +Y RPG +L V + W+ RD+L +SG IE L +
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
+ L ++ +++R+ YD ++GGFG+APKFP P + ++ Y +K +
Sbjct: 174 VKNTEGDLSKDMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D KMV TL M +GG+ DH+G GF RYS D++W PHFEKMLYD L
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+LDA+ +TK Y I +DY+ R+M G +SA+DADS EG +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
+ E+ ++LGE I F ++ + +GN F+GK++ LI+
Sbjct: 334 TFNPLEIIEVLGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK------- 374
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
E++ + + +K+F+ R +R H DDK++ SWN L+I + +A LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y+E + +FI +L +E + RL +R+G S +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FLI ++LYE K+L A+ L + LF D E G++ + +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV + NL+RLA I ++ + + + L ++ +K + M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589
Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ S K ++ + + S + F+ +++ + P T + E N+
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ DK+ VCQ+ SCS P+ D L++++L
Sbjct: 638 GFLNNYRLKDDKISYYVCQSNSCSQPINDLQKLKDMIL 675
>gi|303320203|ref|XP_003070101.1| hypothetical protein CPC735_032920 [Coccidioides posadasii C735
delta SOWgp]
gi|240109787|gb|EER27956.1| hypothetical protein CPC735_032920 [Coccidioides posadasii C735
delta SOWgp]
Length = 799
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 243/611 (39%), Positives = 335/611 (54%), Gaps = 49/611 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN FV IK+DREERPD+D+VYM YVQA+ G GGWPL+
Sbjct: 69 SACHWCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLN 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
VFL+PDL+P+ GGTY+P P F IL K++D W+ ++ +S
Sbjct: 129 VFLTPDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEI 188
Query: 132 IEQLSEALSASASSNKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
QL E + + + P ++L L + YD GGF APKFP P
Sbjct: 189 TRQLRE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPAN 247
Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
+ +L Y ++ G+ E + +MV TL MA+GGIHD +G GF RYSV W
Sbjct: 248 LSFLLRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDW 306
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
+PHFEKMLYDQ QL +VY+D F +T++ DI+ Y+ ++ P G S+EDA
Sbjct: 307 SLPHFEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDA 366
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
DS T K+EGAFYVWT KE++ ILG+ A + H+ + P GN ++R +DPH+EF
Sbjct: 367 DSFPNSNDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGNDPHDEF 424
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNG 419
+NVL A G+ ++ + ++ R+KL + R + R RP LDDK+IVSWNG
Sbjct: 425 INQNVLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNG 484
Query: 420 LVISSFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
L I + A+ S +L K +AE A VAE AA FIR +L+D +T +L
Sbjct: 485 LAIGALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWR 533
Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
+R+G + PGF DDYA+L SGL+ LYE +L +A LQ + FL
Sbjct: 534 VYRDGRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTT 593
Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
GY+ N + P L R+K D A PS N V NL+RLAS++ + D Y+ A
Sbjct: 594 PAGYYMTPQNMPEDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALA 650
Query: 589 EHSLAVFETRL 599
H+ + F +
Sbjct: 651 RHTCSAFAAEM 661
>gi|86157370|ref|YP_464155.1| hypothetical protein Adeh_0943 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85773881|gb|ABC80718.1| protein of unknown function DUF255 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 718
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 248/633 (39%), Positives = 351/633 (55%), Gaps = 70/633 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F +T R FL +TCHWCHVME ESFEDE +A++LN+ +V+IKVDREERPD
Sbjct: 64 GDEAFEEARRTGRPVFLSVGYSTCHWCHVMERESFEDEEIARVLNERYVAIKVDREERPD 123
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRP--GFKTILRKVKDA 116
VD VYMT VQ L G GGWP+SV+L+PD +P GGTYFPP D P G +IL ++ D
Sbjct: 124 VDAVYMTAVQLLTGSGGWPMSVWLTPDREPFFGGTYFPPRDGVRGPARGLLSILHEIADL 183
Query: 117 WDKKRDML-AQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGG 174
W + D + + +GA + A ++ +P P ++A+ L L +S+D R GG
Sbjct: 184 WARDPDRIRSATGALVEAVRTALAPAGPAAADVPGPEPIEHAVTL----LERSFDERHGG 239
Query: 175 FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 234
APKFP V ++++L H + ++GE +M TL+ MA GG+HD VGGGFH
Sbjct: 240 LRRAPKFPSNVPVRLLLRHHR------RTGE-ERSLRMATVTLERMAAGGLHDQVGGGFH 292
Query: 235 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 294
RYS D +W VPHFEKMLYD LA Y +A+ T ++ + R LDYL R++ P G
Sbjct: 293 RYSTDAQWLVPHFEKMLYDNALLAVAYAEAWQATGRRDFARVTRQTLDYLLRELTSPEGG 352
Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMS 354
++SA DADS EG +EG F+ WT E+ + LG+ A F + ++P GN
Sbjct: 353 LYSATDADS---EG----EEGRFFTWTEAELREALGDRAEAFLRFHGVRPEGN------- 398
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
F+G+NVL + P E R L+ +R +RPRP D+KV+
Sbjct: 399 -----FEGRNVL-----------HVPAPDEDAWESFAPDRAALYALRERRPRPLRDEKVL 442
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
WNGL IS+ A ++L SEA +++ A AA F+ + +
Sbjct: 443 AGWNGLAISALALGGRVL-SEA---------------RWVDAAARAADFVLTRMVKDG-- 484
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
RLQ S+ G + P +L+D+AFL+ GLLDL+E +WL A++L QD LF D GG
Sbjct: 485 RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEASFDPRWLRSALQLAEAQDRLFGDPAGG 544
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
G+F + + +L R K HDGAEPSG SV+ +N +RL + + + +R+ A+ +L
Sbjct: 545 GWFQSATDHERLLAREKPTHDGAEPSGASVAALNALRLEAFTSDPR---WRRAADGALRH 601
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
L + +A+ + A D S R+ VVLV
Sbjct: 602 HARTLAEQPLAMSELLLALDFASDAVRE-VVLV 633
>gi|423090012|ref|ZP_17078355.1| hypothetical protein HMPREF9945_01541 [Clostridium difficile
70-100-2010]
gi|357557317|gb|EHJ38868.1| hypothetical protein HMPREF9945_01541 [Clostridium difficile
70-100-2010]
Length = 678
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 236/698 (33%), Positives = 363/698 (52%), Gaps = 83/698 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+++N FV+IKVD+EERPDVD VYMT QA+ G GGWP+++
Sbjct: 54 TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP +Y RPG +L V + W+ RD+L +SG IE L +
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
+ L E+ +++R+ YD ++GGFG+APKFP P + ++ Y +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D KMV TL M +GG+ DH+G GF RYS D++W PHFEKMLYD L
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+LDA+ +TK Y I +DY+ R+M G +SA+DADS EG +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
+ E+ ++LGE F ++ + +GN F+GK++ LI+
Sbjct: 334 TFNPLEIIEVLGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK------- 374
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
E++ + + +K+F+ R +R H DDK++ SWN L+I + +A LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y+E + +FI +L +E + RL +R+G S +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FLI ++LYE K+L A+ L + LF D E G++ + +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV + NL+RLA I ++ + + + L ++ +K + M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589
Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ S K ++ + + S + F+ +++ + P T + E N+
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ DK+ VCQ+ SCS P+ D L++++L
Sbjct: 638 GFLNNYRLKDDKISYYVCQSNSCSQPINDLQKLKDMIL 675
>gi|187778206|ref|ZP_02994679.1| hypothetical protein CLOSPO_01798 [Clostridium sporogenes ATCC
15579]
gi|187775134|gb|EDU38936.1| hypothetical protein CLOSPO_01798 [Clostridium sporogenes ATCC
15579]
Length = 683
Score = 394 bits (1011), Expect = e-106, Method: Compositional matrix adjust.
Identities = 237/687 (34%), Positives = 351/687 (51%), Gaps = 74/687 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA++LN+ F+SIKVDREERPD+D +YM + QA G GGWPL+
Sbjct: 55 STCHWCHVMERESFEDEDVAEILNENFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLT 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTYFP K+ PG IL+ + W + ++ + +S +EQ+
Sbjct: 115 ILMTPDKKPFFAGTYFPKWGKHNIPGIMDILKSINKLWREDKNKVLESSNRILEQIER-- 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
N DEL + + A+ L ++DS++GGFG+ PKFP I +L Y+ KK
Sbjct: 173 ---FQDNHGEDELEEYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKK- 228
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ ++ TL M KGGI DH+G GF RYS D +W VPHFEKMLYD L
Sbjct: 229 --------DKKVLDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALL 280
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ Y +A+ TK+ Y + IL+Y+++ M G +SAEDADS EG EG F
Sbjct: 281 SMAYTEAYEATKNPLYKVVTEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKF 333
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
Y+WT KE+ DILGE F C L ++ N F+ KN+ LI+ +
Sbjct: 334 YLWTKKEIMDILGEEDGAFY----------CKLYDITSRGN-FEKKNIANLIQTDLKDVD 382
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
+K + L R KLF+ R KR PH DDK++ SWN L+I +F RA + K++
Sbjct: 383 NNK---------DKLERIREKLFEYREKRIHPHKDDKILTSWNALMIIAFCRAGRSFKND 433
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y+++A+ +A FI ++L DE+ L R GF+DDYA
Sbjct: 434 ----------------NYIDIAKQSADFIIKNLMDEKG-TLYARIREEERGNEGFIDDYA 476
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F + L++LYE +L +IE+ ++ +LF +E GG++ + +++R KE +D
Sbjct: 477 FFLWALIELYEASFDIYYLEKSIEVADSMIDLFWHKEKGGFYLYSKNSEKLIVRPKEIYD 536
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGN+V+ + L L I D Y+ + F +K M L A M
Sbjct: 537 GAMPSGNAVASLALSLLYYITG---EDKYKNLVDKQFKFFAANIKSGPM-YHLFSVIAYM 592
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
++ + + L + F + + Y + D ++ E N +
Sbjct: 593 YNISPVQEITLAYSEKDEAFYEFINELNNRYIPFSIITLNDKSNKIE--------KINKN 644
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
+ DK +CQ+++C P+ D
Sbjct: 645 LKDKTPIKDKTTVYICQDYACKEPIMD 671
>gi|254975197|ref|ZP_05271669.1| hypothetical protein CdifQC_07775 [Clostridium difficile QCD-66c26]
gi|255092587|ref|ZP_05322065.1| hypothetical protein CdifC_07992 [Clostridium difficile CIP 107932]
gi|255314324|ref|ZP_05355907.1| hypothetical protein CdifQCD-7_08235 [Clostridium difficile
QCD-76w55]
gi|255517004|ref|ZP_05384680.1| hypothetical protein CdifQCD-_07809 [Clostridium difficile
QCD-97b34]
gi|255650105|ref|ZP_05397007.1| hypothetical protein CdifQCD_07959 [Clostridium difficile
QCD-37x79]
gi|260683234|ref|YP_003214519.1| hypothetical protein CD196_1491 [Clostridium difficile CD196]
gi|260686830|ref|YP_003217963.1| hypothetical protein CDR20291_1466 [Clostridium difficile R20291]
gi|306520110|ref|ZP_07406457.1| hypothetical protein CdifQ_08874 [Clostridium difficile QCD-32g58]
gi|384360839|ref|YP_006198691.1| hypothetical protein CDBI1_07695 [Clostridium difficile BI1]
gi|260209397|emb|CBA62859.1| conserved hypothetical protein [Clostridium difficile CD196]
gi|260212846|emb|CBE04045.1| conserved hypothetical protein [Clostridium difficile R20291]
Length = 678
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 236/698 (33%), Positives = 363/698 (52%), Gaps = 83/698 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+++N FV+IKVD+EERPDVD VYMT QA+ G GGWP+++
Sbjct: 54 TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP +Y RPG +L+ V + W+ RD+L +SG IE L +
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLKNVSEKWNTSRDILIKSGDEIIEALKDDFG 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
+ L E+ +++R+ YD ++GGFG+APKFP P + ++ Y +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D KMV TL M +GG+ DH+G GF RYS D++W PHFEKMLYD L
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+LDA+ +TK Y I +DY+ R+M G +SA+DADS EG +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
+ E+ ++LGE F ++ + +GN F+GK++ LI+
Sbjct: 334 TFNPLEIIEVLGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK------- 374
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
E++ + + +K+F+ R +R H DDK++ SWN L+I + +A LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y+E + +FI +L +E + RL +R+G S +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FLI ++LYE K+L A+ L + LF D E G++ + +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV + NL+RLA I ++ + + + L ++ +K + M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589
Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ S K ++ + + S + F+ +++ + P T + E N+
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ DK VCQ+ SCS P+ D L++++L
Sbjct: 638 GFLNNYRLKDDKTSYYVCQSNSCSQPINDLQKLKDMIL 675
>gi|189218169|ref|YP_001938811.1| Highly conserved protein containing a thioredoxin domain
[Methylacidiphilum infernorum V4]
gi|189185027|gb|ACD82212.1| Highly conserved protein containing a thioredoxin domain
[Methylacidiphilum infernorum V4]
Length = 724
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 232/645 (35%), Positives = 351/645 (54%), Gaps = 34/645 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFE+ VA+LLN +F+ IKVDREERPD+D+ YM +VQA G GGWP++
Sbjct: 47 STCHWCHVMAKESFENPIVAQLLNSFFIPIKVDREERPDIDQFYMEFVQAFTGQGGWPMN 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+L+P GGTYFP E K+G+PGF IL+K+ + W R +L Q G ++ E +
Sbjct: 107 VWLTPNLEPFFGGTYFPLESKWGKPGFVDILKKIAELWQYNRSLLEQQGQEIFHKMREVI 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+S P+ A R EQL S+D GGF +PKFPRP + L+ + L D
Sbjct: 167 QSSFEPKSPPNL--AIASRKAVEQLWGSFDRTHGGFSPSPKFPRP-SLFYFLFRAGSLAD 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ + Q M L++LQ M+ GGIHD + GGFHRYSVDE+W +PHFEKMLYDQ L
Sbjct: 224 FSEDYKKKSLQ-MALYSLQKMSGGGIHDQLEGGFHRYSVDEKWRLPHFEKMLYDQATLGL 282
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YLDA+ T D + +++YL + P G +SAEDADS G +++EGA+Y+
Sbjct: 283 SYLDAYQATDDPLFKDTFESLVEYLLSHLHHPSGGFYSAEDADSLNASG--QEEEGAYYL 340
Query: 320 WTSKE----VEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
WT +E +E I+G+ H++ GN +S+ KN+L+ S
Sbjct: 341 WTFQELQQTLEPIVGKDRSKILAHFFGATEQGNLPGGLISE--EALAKKNILLMEKPLSD 398
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A +LG+ LE+ I+ + + L R KR +P LDDK+I +WNG +S+ A+A
Sbjct: 399 LAHELGISLEEAREIVLKAKEGLKKERLKRSKPFLDDKIICAWNGYTLSALAKA------ 452
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
+ V+G R + A+ A+F+ +L+D + L +RNG PGF DY
Sbjct: 453 --------YMVIGDGR--LINEAKKTATFLLENLWDPSSKTLYRIYRNG-RGTPGFSSDY 501
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A L +L L+E KWL A Q +E F+D Y E + ++ +E++
Sbjct: 502 ASLALSMLHLFEADQDEKWLSLAKLFQELLEEKFVDPYRHNYMVEAVEISAKSIQTREEY 561
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DGAEP+ S++ +L++L ++ K +R+ E + L+ A+P +
Sbjct: 562 DGAEPATLSLAAHSLLKLYTLTGEEK---WRKRLEELFSYAWPILERFPTALPYLLGVYC 618
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD 659
P + ++LVG K + + + + + N+ ++ +DP +
Sbjct: 619 EYRAPLVE-IILVGEKKNEETKRLFHSLSKLLIPNRLLVVLDPQE 662
>gi|394994118|ref|ZP_10386849.1| YyaL, partial [Bacillus sp. 916]
gi|393805058|gb|EJD66446.1| YyaL, partial [Bacillus sp. 916]
Length = 607
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 240/637 (37%), Positives = 346/637 (54%), Gaps = 58/637 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+
Sbjct: 23 STCHWCHVMAHESFEDEEIADMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLN 82
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD KP GTYFP KY RPGF +L + + + R +E ++E
Sbjct: 83 VFVTPDQKPFYAGTYFPKTSKYNRPGFIDVLEHLSETFANDRQ--------HVEDIAENA 134
Query: 140 SASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A P E L + A+ QL+ +D+ +GGFG APKFP P M+++ +
Sbjct: 135 AAHLEVKVHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYY 191
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TGK +A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L
Sbjct: 192 SYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALL 247
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y +A+ +T + Y I I+ +++R+M+ G FSA DAD TEG +EG +
Sbjct: 248 LTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKY 300
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ KE+ ++LG E L+ + Y + GN + + PH F + ++E ++ +
Sbjct: 301 YIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTG 356
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+L LE E R KL + R R PH DDKV+ SWN L+I+ A+A+K+
Sbjct: 357 HELAERLE-------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV----- 404
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F+ P +++ +AE+A F+ RHL + R+ +R G K GF+DDYAF
Sbjct: 405 ----FHEP-------DFLSMAETAIRFLERHLMPDA--RVMVRYREGEVKNKGFIDDYAF 451
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI L+LYE G +L A L + ELF D GG+F T + ++L+R KE +DG
Sbjct: 452 LIWAYLELYEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDG 511
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS + + L+RL + G S + AE +VF+ ++ + +
Sbjct: 512 AVPSGNSAAAVQLLRLGRLT-GDIS--LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAH 568
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
++P +K +V+ G K D + + A + T++
Sbjct: 569 TMP-QKEIVVFGRKDDPDRKRFIEALQEHFTPAYTIL 604
>gi|255100682|ref|ZP_05329659.1| hypothetical protein CdifQCD-6_07712 [Clostridium difficile
QCD-63q42]
Length = 678
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 237/698 (33%), Positives = 362/698 (51%), Gaps = 83/698 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+++N FV+IKVD+EERPDVD VYMT QA+ G GGWP+++
Sbjct: 54 TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP +Y RPG +L V + W+ RD+L +SG IE L +
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
+ L E+ +++R+ YD +GGFG+APKFP P + ++ Y +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDENYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D KMV TL M +GG+ DH+G GF RYS D++W PHFEKMLYD L
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+LDA+ +TK Y I +DY+ R+M G +SA+DADS EG +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFY 333
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
+ E+ ++LGE I F ++ + +GN F+GK++ LI+
Sbjct: 334 TFNPLEIIEVLGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK------- 374
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
E++ + + +K+F+ R +R H DDK++ SWN L+I + +A LK++
Sbjct: 375 ----NKEYERHNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKND 430
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y+E + +FI +L +E + RL +R+G S +LDDYA
Sbjct: 431 I----------------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FLI ++LYE K+L A+ L + LF D E G++ + +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV + NL+RLA I ++ + + + L ++ +K + M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589
Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ S K ++ + + S + F+ +++ + P T + E N+
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ DK VCQ+ SCS P+ D L++++L
Sbjct: 638 GFLNNYILKDDKTSYYVCQSNSCSQPINDLQKLKDMIL 675
>gi|115491785|ref|XP_001210520.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114197380|gb|EAU39080.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 787
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 236/569 (41%), Positives = 323/569 (56%), Gaps = 37/569 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA +LN+ F+ IKVDREERPD+D VYM YVQA G GGWPL+
Sbjct: 70 SACHWCHVMEKESFMSQEVASILNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 129
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKT-----ILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+P + PG +T IL K++D W ++ +S +Q
Sbjct: 130 VFLTPDLEPVFGGTYWPGPNATTNPGHETIGFVDILEKLRDVWQTQQQRCRESAKDITKQ 189
Query: 135 L---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L +E + S ++ DE L L + YD+ GGF APKFP P + +
Sbjct: 190 LREFAEEGTHSYQGDRAADEDLDIELLEEAYQHFVSRYDTAHGGFSKAPKFPTPANLSFL 249
Query: 191 L----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
L Y S ++ GK E M + TL MA+GGIHDH+G GF RYSV W +PH
Sbjct: 250 LRLGVYPSAVVDVVGKE-ECENATAMAVNTLINMARGGIHDHIGHGFARYSVTADWGLPH 308
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 305
FEKMLYDQ QL +VY+DAF +T + D++ YL + G S+EDADS
Sbjct: 309 FEKMLYDQAQLLDVYIDAFKITHNPELLGAVYDLVTYLTTAPLQSSTGAFHSSEDADSLP 368
Query: 306 TEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
T K+EGAFYVWT KE+ +LG A + H+ + P GN +S +DPH+EF +N
Sbjct: 369 MPNDTEKREGAFYVWTLKELTQVLGSRDAGVCARHWGVLPDGN--ISPANDPHDEFMNQN 426
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVIS 423
VL S A + G+ ++ + IL ++KL + R K R RP LDDK+IV+WNGL I
Sbjct: 427 VLSIKVTPSKLAREFGLGEDEVVRILRSAKQKLREYREKNRVRPDLDDKIIVAWNGLAIG 486
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
+ A+AS + + +S+M + + E A A SFI+ L+++ T +L +R+G
Sbjct: 487 ALAKASALF-DQIDSSMAS---------KCREAAARAVSFIKETLFEKSTGQLWRIYRDG 536
Query: 484 P-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT 539
PGF DDYA+L SGLL++YE +L +A +LQ +E FL G GY++T
Sbjct: 537 SRGDTPGFADDYAYLTSGLLEMYEATFDDSYLQFAEQLQKYLNEKFLAYVGSTPAGYYST 596
Query: 540 ----TGEDPSVLLRVKEDHDGAEPSGNSV 564
T P LLR+K + A PS N V
Sbjct: 597 PSTMTPGMPGPLLRLKTGTESATPSINGV 625
>gi|404493392|ref|YP_006717498.1| thioredoxin domain-containing protein YyaL [Pelobacter carbinolicus
DSM 2380]
gi|77545446|gb|ABA89008.1| thioredoxin domain protein YyaL [Pelobacter carbinolicus DSM 2380]
Length = 711
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 243/684 (35%), Positives = 352/684 (51%), Gaps = 61/684 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED VA++LN F+ IKVDREERPD+D +YMT Q + GGGGWPL+
Sbjct: 76 STCHWCHVMEQESFEDREVAEVLNKLFIPIKVDREERPDIDNLYMTACQLVTGGGGWPLN 135
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD P TY P + PG IL K+ W RD L Q+G E L +
Sbjct: 136 VFLTPDKAPFYAATYMPRRPRGQMPGIIAILTKIGAMWQSDRDQLLQTGREIGETL---I 192
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+S+ + L + L E+ ++D GGFG APKFP P + ++ + +++
Sbjct: 193 RLESSAAPVASSLTEAPLTEAFERFKANFDHERGGFGKAPKFPMPHNLSLLFHIAQRF-- 250
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G+ + + M + TLQ + GG++DH+G G HRYSVD W VPHFEKMLYDQ +
Sbjct: 251 ----GQET-AEAMAIKTLQHIRLGGMYDHIGFGMHRYSVDAFWRVPHFEKMLYDQALVTL 305
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
LDA+ +T D F+ + + Y+ RD+ P G S EDAD TEGA EG FY+
Sbjct: 306 AALDAYQVTHDTFFESLADQTMSYVLRDLSLPEGGFCSGEDAD---TEGA----EGTFYL 358
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT ++VE++LG + A +F Y + GN F+G N+ D A
Sbjct: 359 WTPQQVEEVLGHQQATIFCTCYEISEAGN------------FEGSNIPRLEMDLKEWAQW 406
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G ++ +L + RRKL R R RPH DDKV+V+WNGL I++ AR ++++
Sbjct: 407 FGTDTDELGAVLEDGRRKLLQARKLRVRPHRDDKVLVAWNGLAIAAMARTARLI------ 460
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
EY+E A AA FI ++ +E+ L+ R + P FL+DYA LI
Sbjct: 461 ----------GHPEYLEGATRAADFILSNMRNEEGRLLRRWRRG-QAGIPAFLEDYAALI 509
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++LY+ G ++L A++L E F G Y++T + VL+R + HDGA
Sbjct: 510 LGLIELYQAGFNARYLAEAVQLGRDMQERF-GTPDGVYYDTGTDAEEVLVRKRTLHDGAM 568
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
SGNS++ + L+RL S+ + ++AE L + D A + A D L++
Sbjct: 569 ISGNSMAAMALLRLGSL---TGEPALEEHAEKILLASSKQWTDAPTASGQLLMALD-LAL 624
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
R+ +V+ K + M+ AAH + N ++ P D S + R
Sbjct: 625 SQREVLVIAAPKDDPEGTRMVKAAHTGFRPNLIILWHTPDDNAL--------SEVTPLVR 676
Query: 679 -NNFSADKVVALVCQNFSCSPPVT 701
K A +C+ +C P T
Sbjct: 677 GKTMQNGKATAYLCRGQTCMAPAT 700
>gi|121701517|ref|XP_001269023.1| DUF255 domain protein [Aspergillus clavatus NRRL 1]
gi|119397166|gb|EAW07597.1| DUF255 domain protein [Aspergillus clavatus NRRL 1]
Length = 788
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 240/598 (40%), Positives = 329/598 (55%), Gaps = 35/598 (5%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHV+E ESF + VA LLN+ F+ IKVDREERPD+D VYM YVQA G GGWPLS
Sbjct: 69 SACHWCHVIEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLS 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP-----GFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+P + GF IL K++D W ++ +S Q
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNSSTLSGPHTIGFVDILEKLRDVWKTQQQRCRESAKEITRQ 188
Query: 135 L---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L +E + S ++ DE L L + + YD+ GGF APKFP P + +
Sbjct: 189 LREFAEEGTHSQQGDREADEDLDIELLEEAYQHFASRYDAVNGGFSRAPKFPTPANLSFL 248
Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + + D E + M + TL MA+GGI DH+G GF RYSV W +PHF
Sbjct: 249 LRLKTYPSAVSDIVGQEECDKATTMAVSTLVSMARGGIRDHIGHGFARYSVTSDWSLPHF 308
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
EKMLYDQ QL +VY+DAF +T + D+ YL I G S+EDADS
Sbjct: 309 EKMLYDQAQLLDVYVDAFQITHNPELLGAVYDLATYLTTAPIQSSTGAFHSSEDADSLPA 368
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN ++ DPH+EF +NV
Sbjct: 369 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQNV 426
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
L S A + G+ E+ + I+ ++KL + R K R RP LDDK+IV+WNGL I +
Sbjct: 427 LSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKIIVAWNGLAIGA 486
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A+ S + + E ES S E E A A SFI+ +L+++ T +L +R+G
Sbjct: 487 LAKCSALFE-EIES---------SKAVECREAAARAISFIKENLFEKVTGQLWRIYRDGS 536
Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT- 539
PGF DDYA+L GLLD+YE +L +A +LQ + FL G GY++T
Sbjct: 537 RGDTPGFADDYAYLTQGLLDMYEATFEDSYLQFAEQLQRYLNRNFLAYIGSTPAGYYSTP 596
Query: 540 ---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
T P LLR+K + A PS N V NL+RL++++ + + HS +V
Sbjct: 597 STMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALLEDEEYRTLARQTCHSFSV 654
>gi|258569036|ref|XP_002585262.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237906708|gb|EEP81109.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 818
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 240/611 (39%), Positives = 333/611 (54%), Gaps = 49/611 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA +LN F+ IK+DREERPD+D+VYM YVQA G GGWPL+
Sbjct: 58 SACHWCHVMEKESFMSQEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLN 117
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
VFL+PDL+P+ GGTY+P P F IL K++D W+ ++ +S
Sbjct: 118 VFLTPDLEPVFGGTYWPGPHSSSVPRLGGEEPITFVDILEKLRDVWNSQQLRCMESAKEI 177
Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCA-----EQLSKSYDSRFGGFGSAPKFPRPVE 186
QL E + + + PD + L + + YD GGF APKFP P
Sbjct: 178 TRQLRE-FAEEGTHLRRPDSEGEEDLEVELLEEAYQHFVSRYDPVNGGFSRAPKFPTPAN 236
Query: 187 IQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
+ +L Y ++ G+ E + +MV TL M +GGIHD +G GF RYSV W
Sbjct: 237 LSFLLRLGRYPGAVMDIVGQE-ECARATEMVSKTLLQMVRGGIHDQIGHGFARYSVTADW 295
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDA 301
+PHFEKMLYDQ QL +VY+D F T+D DI+ Y+ M+ P G S+EDA
Sbjct: 296 SLPHFEKMLYDQAQLLDVYVDCFEATQDPELLGAVYDIVAYMTSPPMLSPEGAFHSSEDA 355
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
DS T T K+EGAFYVWT KE++ ILG+ A + H+ + P GN ++R DPH+EF
Sbjct: 356 DSLPTPKDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGYDPHDEF 413
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNG 419
+NVL A LG+ ++ + I+ R+KL + R ++R RP LDDKVIVSWNG
Sbjct: 414 INQNVLSIKATPRHIAKDLGLSEDEVVRIIKSSRKKLQEFRDTQRVRPDLDDKVIVSWNG 473
Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYM-EVAESAASFIRRHLYDEQTHRLQH 478
L I + A+ S +L + D+ E+ A +AA+FI+ L+D T +L
Sbjct: 474 LAIGALAKCSVLLDR-----------IDPDKAEHCRRSAATAAAFIKEKLFDADTGQLWR 522
Query: 479 SFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---- 533
+R+G + PGF DDYA+L +GL+ LYE +L +A +LQ + FL
Sbjct: 523 VYRDGVRGETPGFGDDYAYLTAGLIQLYEATFDDSYLRFAEQLQKYMNTHFLAMAADGST 582
Query: 534 -GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
GY+ N G+ P L R+K D A PS N V NLVRL S++ + + Y A
Sbjct: 583 PAGYYMTQENMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLVRLGSLL---EDESYSVLA 639
Query: 589 EHSLAVFETRL 599
+ + + F +
Sbjct: 640 KQTCSAFAAEI 650
>gi|167043802|gb|ABZ08492.1| hypothetical protein ALOHA_HF4000APKG3D24ctg2g4 [uncultured marine
crenarchaeote HF4000_APKG3D24]
Length = 620
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 239/686 (34%), Positives = 364/686 (53%), Gaps = 69/686 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE +AK++N+ FV+IKVDREERPD+D +Y Q G GGWPLSVFL+P+ +
Sbjct: 1 MAHESFEDEEIAKIMNENFVNIKVDREERPDLDDIYQKVCQMSTGQGGWPLSVFLTPEQR 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFA--IEQLSEALSASAS 144
P GTYFP D YGRPGF ++ R++ +W +K +D+ + F +++L + + S
Sbjct: 61 PFYVGTYFPAIDSYGRPGFGSLCRQMAQSWKEKPKDIEKAADNFMQNLDKLKQFPTPSEI 120
Query: 145 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
+ DE N L++ D +GGFG APKFP + M +SK SG
Sbjct: 121 DKSILDEAAINLLQIA--------DITYGGFGQAPKFPNASNLSFMFRYSKL------SG 166
Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
S+ +K L TL+ MAKGGI D +GGGFHRYS D RW VPHFEKMLYD L VY +A
Sbjct: 167 -ISKFEKFALLTLKKMAKGGIFDQIGGGFHRYSTDARWLVPHFEKMLYDNALLPIVYSEA 225
Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
+ +TKD F+ + R LDY+ R+M G FSA+DAD+ EG T +VW +E
Sbjct: 226 YQITKDPFFENVVRKTLDYIIREMTSSDGMFFSAQDADTNGEEGQT-------FVWKKRE 278
Query: 325 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
+E ILGE + +F +Y + GN F+G +L ++S+ K G
Sbjct: 279 IEKILGEDSEIFCIYYDVTDGGN------------FEGNTILANNINASSLGFKFGKSES 326
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
+ NI+ +C KL +VR+KR +P DDKVI SWNGL+IS+F +I
Sbjct: 327 EIQNIILKCSDKLLEVRNKREQPGKDDKVITSWNGLMISAFLSGYQI------------- 373
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
+D +Y+++A+ + F + ++ H L +F+NG K G+LDDYA++ + +D+
Sbjct: 374 ---TDNSKYLDMAKKSIDFFESNF--KENHILHRTFKNGEPKLNGYLDDYAYMANASIDM 428
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
+E S K+L++A L N F D G+F T+ +++R K ++D + PSGNSV
Sbjct: 429 FENTSDPKYLLFATNLANYLVTHFWDDSTHGFFFTSDNHEKLIIRPKNNYDLSMPSGNSV 488
Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
+ L++L I +Q E + + E++ A P + +
Sbjct: 489 AACVLLKLYHITQD------KQFLEIAKKIIESQAT-AAAENPFAFGYLLNVLYLYYQKP 541
Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD 684
+ + +FE ++++ + ++ + A+ +D ++ A + F D
Sbjct: 542 TEITIINDKNFE-LVSSLRKKFLPESIMVLV--ANKNNLDALSKY----AFFSGKEFQDD 594
Query: 685 KVVALVCQNFSCSPPVTDPISLENLL 710
K +VC+NFSCS P++D +E L
Sbjct: 595 KTNVIVCKNFSCSLPLSDLSEIEKEL 620
>gi|423083522|ref|ZP_17072052.1| hypothetical protein HMPREF1122_03047 [Clostridium difficile
002-P50-2011]
gi|423088427|ref|ZP_17076810.1| hypothetical protein HMPREF1123_03965 [Clostridium difficile
050-P50-2011]
gi|357542999|gb|EHJ25034.1| hypothetical protein HMPREF1123_03965 [Clostridium difficile
050-P50-2011]
gi|357544282|gb|EHJ26286.1| hypothetical protein HMPREF1122_03047 [Clostridium difficile
002-P50-2011]
Length = 678
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 235/698 (33%), Positives = 361/698 (51%), Gaps = 83/698 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+++N FV+IKVD+EERPDVD VYMT QA+ G GGWP+++
Sbjct: 54 TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP +Y RPG +L V + W+ RD+L +SG I+ L +
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIKALKDDFD 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
+ L E+ +++R+ YD ++GGFG+APKFP P + ++ Y +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D KMV TL M +GG+ DH+G GF RYS D++W PHFEKMLYD L
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+LDA+ +TK Y I +DY+ R+M G +SA+DADS EG +EG FY
Sbjct: 281 IAFLDAYKITKKELYKEIAIKTIDYVVREMKDKDGGFYSAQDADS---EG----EEGKFY 333
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
++ E+ ++LGE F ++ + +GN F+GK++ LI+
Sbjct: 334 IFNPLEIIEVLGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK------- 374
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
E++ + + K+F+ R +R H DDK++ SWN L+I + +A L+++
Sbjct: 375 ----NKEYERHNEKIADLSEKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLEND 430
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y+E + FI +L +E + RL +R+G S +LDDYA
Sbjct: 431 I----------------YLEYSNKCLDFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FLI ++LYE K+L A+ L LF D E G++ + +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNENCINLFWDYEKSGFYIYGKDSENLIARPKDLYD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV + NL+RLA I S+ + + + L ++ +K + M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDSRLE---EMSYKQLKLYVDNVKSSPTGYSFYMLSL-M 589
Query: 616 LSVPSRKHVVLVGHKSS--VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ S K ++ + + S + F+ +++ + P T + E N+
Sbjct: 590 FELYSTKEIICIFKEDSDLIAFKELISE------------NFIPNATFLAKKYNEENTII 637
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ + DK VCQ+ SCS P+ D L++++L
Sbjct: 638 SFLNNYRLKDDKTSYYVCQSNSCSQPINDLQKLKDMIL 675
>gi|448382091|ref|ZP_21561926.1| hypothetical protein C478_06099 [Haloterrigena thermotolerans DSM
11522]
gi|445662325|gb|ELZ15095.1| hypothetical protein C478_06099 [Haloterrigena thermotolerans DSM
11522]
Length = 731
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 240/696 (34%), Positives = 359/696 (51%), Gaps = 59/696 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEEESFADEAVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVRGQGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
+L+P+ KP GTYFP E K G+PGF + ++ D+W+ + D Q A +
Sbjct: 113 AWLTPEGKPFFIGTYFPREGKRGQPGFLDLCERISDSWESEEDREEMQHRAQQWTDAATD 172
Query: 134 QLSEAL-SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
+L E SA + + + L A+ + +S D ++GGFG+ KFP+P ++++
Sbjct: 173 RLEETPDSAGVDAGGAAEPPSSDVLEAAADAVLRSADRQYGGFGTGQKFPQPSRLRVL-- 230
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
++ + TG+ E ++++ TL MA GG+ DHVGGGFHRY VD W VPHFEKMLY
Sbjct: 231 -ARTYDRTGR----EEYREVLAETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKMLY 285
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D ++ +L + LT + Y+ D L ++ R++ G FS DA S + E R
Sbjct: 286 DNAEIPRAFLAGYQLTGEDRYAETVADTLAFVDRELTHDEGGFFSTLDAQSEDPETGER- 344
Query: 313 KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EGAFYVWT +EV D++ + A LF Y + +GN F+G+N +
Sbjct: 345 EEGAFYVWTPEEVHDVIADETDASLFCARYDITESGN------------FEGQNQPNRIA 392
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
S AS+ + + L L R++LF+ R +RPRP D+K++ WNGL+IS++A A+
Sbjct: 393 RVSELASQFDLAESEVLKRLDSARKRLFEAREERPRPDRDEKILAGWNGLMISTYAEAAL 452
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+L G D EY E A A F+R L+D+++ RL ++ G K G+
Sbjct: 453 VL--------------GED--EYAETAVDALEFVRDRLWDDESQRLSRRYKAGDVKVDGY 496
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYAFL G LD Y+ L +A+EL + F D + G + T S++ R
Sbjct: 497 LEDYAFLARGALDCYQATGEVDHLAFALELARVIETEFWDADRGTLYFTPESGESLVTRP 556
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
+E D + PS V+V L+ L A D A L +L+ A+ +C
Sbjct: 557 QELGDQSTPSSTGVAVETLLALDEFAASEFGDI----AATVLETHANKLEANALEHATLC 612
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH- 669
AAD L+ + + V ++ + A AS L + + P ++ W E
Sbjct: 613 LAADRLAAGALEVTV-----AADELPTEWREAFASQYLPDRLFALRPPTEAGLETWLETL 667
Query: 670 ---NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
++ R + + VC++ +CSPP D
Sbjct: 668 GLADAPPIWAGREARDGEPTL-YVCRDRTCSPPTHD 702
>gi|52078696|ref|YP_077487.1| hypothetical protein BL00131 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|319649027|ref|ZP_08003236.1| YyaL protein [Bacillus sp. BT1B_CT2]
gi|52001907|gb|AAU21849.1| conserved protein YyaL [Bacillus licheniformis DSM 13 = ATCC 14580]
gi|317389021|gb|EFV69839.1| YyaL protein [Bacillus sp. BT1B_CT2]
Length = 625
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 254/698 (36%), Positives = 362/698 (51%), Gaps = 105/698 (15%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE VAKLLN+ FVSIKVDREERPDVD +YMT Q + G GGWPL+VFL+PD K
Sbjct: 1 MAHESFEDEEVAKLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLNVFLTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP ++ RPGF +++++ D + K R+ + E+ + L A S+
Sbjct: 61 PFYAGTYFPKTSRFNRPGFVEVVKQLSDTFAKNREHVEDIA----EKAANNLRIKAKSDA 116
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEA 206
D L ++ LR +QL S+D+ +GGFGSAPKFP P + +L YH SGE
Sbjct: 117 -GDSLGEDILRRTYQQLINSFDAAYGGFGSAPKFPIPHMLTFLLRYHQ-------YSGEE 168
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
+ V+ TL MA GGI+DHVG GF RYS D+ W VPHFEKMLYD L Y +A+
Sbjct: 169 N-ALYSVMKTLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNALLLIAYTEAYQ 227
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
+TK+ Y I I+ ++RR+M G +SA DAD TEG EG +YVW+ +EV
Sbjct: 228 ITKNERYKQISEQIITFVRREMTDEKGAFYSALDAD---TEGV----EGKYYVWSKEEVL 280
Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN----VLIELNDSSASASKLGM 381
+ LG E L+ Y + GN F+G N + L D + +
Sbjct: 281 ETLGDELGELYCAVYNITQEGN------------FEGHNIPNLIYTRLEDIK---DEFAL 325
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
E+ N L E R KLF+ R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 326 TDEELQNKLEEARTKLFEKRQERTYPHVDDKVLTSWNALMIAGLAKAAKV---------Y 376
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
N P EY+E+A +AA FI L Q R+ +R+G K GF+DDYAFL+
Sbjct: 377 NAP-------EYLEMARAAAEFIENKLI--QDGRIMVRYRDGEVKNKGFIDDYAFLLWAY 427
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
++LYE L A +L+ LF D E GG++ T + ++++R KE +DGA PSG
Sbjct: 428 IELYEASLDLTDLRKAKKLEADMKGLFWDEEHGGFYFTGSDAEALIVRDKEVYDGALPSG 487
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS- 620
N V + L RL + + L D A A D+ + PS
Sbjct: 488 NGVLAVQLSRLGRLTG------------------DLSLHDQA-AKMFAAFHGDVSAYPSG 528
Query: 621 --------------RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE--MD 664
+K +V++G ++ D + +++A ++ N V+ + D + D
Sbjct: 529 HTNFLQGLLSQFMPQKEIVVLGKRNDPDRQKIVSALQQAFQPNYAVLAAESPDDFKGIAD 588
Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
F E+ + + +K +C+NF+C P T+
Sbjct: 589 FAAEYKAVD----------NKTTVYICENFACRQPTTN 616
>gi|317030461|ref|XP_001392621.2| hypothetical protein ANI_1_728074 [Aspergillus niger CBS 513.88]
Length = 791
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 236/580 (40%), Positives = 323/580 (55%), Gaps = 35/580 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA +LN F+ IKVDREERPD+D VYM YVQA G GGWPL+
Sbjct: 73 SACHWCHVMEKESFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 132
Query: 80 VFLSPDLKPLMGGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+P + G GF IL K+ D W ++ +S +Q
Sbjct: 133 VFLTPDLEPVFGGTYWPGPNSSTLTGNGTIGFVEILEKLSDVWQTQQLRCRESAKEITKQ 192
Query: 135 LSEALSASASS----NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L E S + ++L L + YD GGF +APKFP P + +
Sbjct: 193 LREFAEEGTHSYQGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFL 252
Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + + D E ++ M + TL MA+GGI DH+G GF RYSV W +PHF
Sbjct: 253 LRLGIYPTAVADIVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHF 312
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
EKMLYDQ QL +VY+DAF +T + D+ YL I P G S+EDADS T
Sbjct: 313 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPT 372
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN ++ +DPH+EF +NV
Sbjct: 373 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNV 430
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
L S A G+ E+ + I+ ++KL D R + R RP LDDK+IV+WNGL I +
Sbjct: 431 LSVKVTPSRLAKDFGLGEEEVVRIIRAAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGA 490
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A+ S + + E ES S + E A A +FI+ +L+++ T +L +R+G
Sbjct: 491 LAKCSALFE-EIES---------SKAVQCREAAAKAINFIKENLFEKPTGQLWRIYRDGG 540
Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT- 539
PGF DDYA+LI GLLD+YE +L +A +LQ ++ FL G GY++T
Sbjct: 541 RGNTPGFADDYAYLIGGLLDMYEATFDDSYLQFAEQLQKYLNDNFLAYVGTTPAGYYSTP 600
Query: 540 ---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
T P LLR+K + A P+ N V NL+RL S++
Sbjct: 601 STMTSGAPGPLLRLKTGTESATPAVNGVIARNLLRLGSLL 640
>gi|397775180|ref|YP_006542726.1| hypothetical protein NJ7G_3432 [Natrinema sp. J7-2]
gi|397684273|gb|AFO58650.1| hypothetical protein NJ7G_3432 [Natrinema sp. J7-2]
Length = 732
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 235/697 (33%), Positives = 363/697 (52%), Gaps = 60/697 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF+DE VA++LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEEESFQDEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
+L+P+ +P GTYFP E + G+PGF+ + +++ D+W+ D Q A +
Sbjct: 113 AWLTPEGEPFFIGTYFPREGQRGQPGFRELCKRISDSWESDADREEMENRAQQWTDAATD 172
Query: 134 QLSEALSASASSN-KLPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMML 191
+L E A+ + P+ + L A+ + +S D +GGFGS+ PKFP+P I+++
Sbjct: 173 RLEETPDAAGGGTVEAPEPPSSDVLETAADAVVRSADREYGGFGSSGPKFPQPSRIRVL- 231
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
++ + TG+ E ++++ TL MA GG++DHVGGGFHRY VD W VPHFEKML
Sbjct: 232 --ARTYDRTGR----DEYREVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKML 285
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD ++ +L + LT + Y+ + D L ++ R++ G FS DA SA E R
Sbjct: 286 YDNAEIPRAFLSGYQLTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSASPETGER 345
Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EGAFYVWT EV D+L + A LF Y + GN F+G+N +
Sbjct: 346 -EEGAFYVWTPAEVHDVLEDETDAALFCARYDITEAGN------------FEGRNQPNRV 392
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
S A++ + + L L R++LF+ R +RPRP+ D+K++ WNGL+IS++A A+
Sbjct: 393 ARVSELAAQFDLAEHEILKRLASARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAA 452
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L G+D +Y + A A F+R L+D+ RL +++G K G
Sbjct: 453 LVL--------------GAD--DYADTAVDALEFVRDELWDDDEQRLSRRYKDGDVKVDG 496
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
+L+DYAFL G LD Y+ L +A+EL F D + G + T +++ R
Sbjct: 497 YLEDYAFLARGALDCYQATGEVDHLAFALELARVIKAEFWDADRGTLYFTPESGEALVTR 556
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+E D + PS V+V L+ L A + + A L +L+ A+ +
Sbjct: 557 PQELSDQSTPSATGVAVETLLALDEFAA----EDFEPIAATVLETHANKLETNALEHATL 612
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
C AAD L + + V + + + L + + + + P + +D W E
Sbjct: 613 CLAADRLEAGALE-VTVAADDLPTAWRDRLTSQY----FPDRLFALRPPTEDGLDAWLET 667
Query: 670 ----NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
++ R + + VC++ +CSPP D
Sbjct: 668 LGLADAPPIWAGREARDGEPTL-YVCRDRTCSPPSHD 703
>gi|238498046|ref|XP_002380258.1| DUF255 domain protein [Aspergillus flavus NRRL3357]
gi|317141806|ref|XP_003189401.1| hypothetical protein AOR_1_504164 [Aspergillus oryzae RIB40]
gi|220693532|gb|EED49877.1| DUF255 domain protein [Aspergillus flavus NRRL3357]
Length = 787
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 233/568 (41%), Positives = 318/568 (55%), Gaps = 35/568 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN+ F+ IKVDREERPD+D +YM YVQA G GGWPL+
Sbjct: 69 SACHWCHVMEKESFMSPEVATILNESFIPIKVDREERPDIDDIYMNYVQATTGSGGWPLN 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYG-----RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+P + GF IL K+++ W ++ S +Q
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNSSTLLGNETIGFVDILEKLREVWQTQQQRCLDSAKEITKQ 188
Query: 135 L---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L +E + S +K DE L L + YDS GGF APKFP P + +
Sbjct: 189 LREFAEEGTHSYQGDKEADEDLDIELLEEAYQHFVSRYDSVHGGFSRAPKFPTPANLSFL 248
Query: 191 LY---HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + + D E + M + TL MA+GGI DH+G GF RYSV W +PHF
Sbjct: 249 LRLGAYPNAVSDIVGREECEKATAMAVHTLISMARGGIRDHIGHGFARYSVTADWSLPHF 308
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
EKMLYDQ QL +VY+DAF +T + D+ YL I P G S+EDADS +
Sbjct: 309 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPS 368
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN +S +DPH+EF +NV
Sbjct: 369 PKDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVHPDGN--ISPENDPHDEFMNQNV 426
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
L S A + G+ E+ + I+ +++L + R + R RP LDDK+IV+WNGLVI +
Sbjct: 427 LSVKVTPSKLAREFGLGEEEVVRIIRSAKQRLREYRERTRVRPDLDDKIIVAWNGLVIGA 486
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A+ S + + + S + E A A SFI+ +L+D+ T +L +R+G
Sbjct: 487 LAKCSALFER----------IESSKAVQCREAAAKAISFIKNNLFDKATGQLWRIYRDGG 536
Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT- 539
PGF DDYA+LISGLLD+YE +L +A +LQ +E FL G GY++T
Sbjct: 537 RGDTPGFADDYAYLISGLLDMYEATFDDSYLQFAEQLQKYLNENFLAYVGSTPAGYYSTP 596
Query: 540 ---TGEDPSVLLRVKEDHDGAEPSGNSV 564
T + P LLR+K + A PS N V
Sbjct: 597 SNMTSDMPGPLLRLKTGTESATPSVNGV 624
>gi|325845722|ref|ZP_08169003.1| hypothetical protein HMPREF9402_0744 [Turicibacter sp. HGF1]
gi|325488252|gb|EGC90680.1| hypothetical protein HMPREF9402_0744 [Turicibacter sp. HGF1]
Length = 614
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 242/685 (35%), Positives = 353/685 (51%), Gaps = 73/685 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFEDE VA LN+ F+SIKVDREERPD+D VYM+ QAL G GGWPL++F++P +
Sbjct: 1 MEHESFEDEDVATYLNEHFISIKVDREERPDIDTVYMSICQALTGQGGWPLTIFMTPTQQ 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
GTYFP +YGRPGF +L+ + W+ R + + +
Sbjct: 61 AFYAGTYFPKTSRYGRPGFLDVLKTIDFNWNHHRAKVTDITKQIASHFKDLEGIETEGDS 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
L + QN + QL +SYD RFGGFG+APKFP P ++ +L + ++ +D
Sbjct: 121 LSMAIIQNGVN----QLKQSYDPRFGGFGTAPKFPTPHKLMFLLRYDEQTKDKSV----- 171
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
Q MV TL M KGGI DH+G GF RYS DE W VPHFEKMLYD L Y +A+ +
Sbjct: 172 --QDMVTQTLDHMYKGGIFDHLGYGFSRYSTDEIWLVPHFEKMLYDNALLMISYTEAYQV 229
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T++ Y I +Y+ + P G + AEDADS EG +EG FYV+T E+
Sbjct: 230 TREPRYLSIAMQTAEYVLTQLTSPEGGFYCAEDADS---EG----EEGKFYVFTPAEIIQ 282
Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
ILG E F E Y + GN F+GKN+L L+ LE
Sbjct: 283 ILGPEKGHWFNEFYNVTEEGN------------FEGKNILNRLHHKK---------LELD 321
Query: 387 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 446
+ L CR L R +R H DDK++ SWNGL+I++FA+ +
Sbjct: 322 IKELEACRETLLTYRLERTHLHKDDKILTSWNGLMIAAFAK-----------------LY 364
Query: 447 GSDRKE-YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
G +K Y++ A A +FI++HL+DE RL +R G S +LDDYAFL GL++L+
Sbjct: 365 GQTQKMIYLDAASKAVTFIKQHLFDET--RLLARYREGESHFKAYLDDYAFLSYGLIELH 422
Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
+ + ++L AI+L +LF D E GG++ T + +++LR KE +DGA PSGNSV+
Sbjct: 423 QSTAEVEYLELAIQLNKEMLDLFKD-EAGGFYLTGHDAETLMLRPKELYDGAMPSGNSVA 481
Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
NL+RLA + + + AE + ++K M AA +++ ++
Sbjct: 482 AYNLIRLAKLTGDT---LFETEAEKQIQYLAKQVKHYEMNHTFYLIAALFALSDTKELMI 538
Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADK 685
V + + + +L + + N T++ P + ++ S A ++ D+
Sbjct: 539 TVPKQEQI--KEILKQLNETPHFNTTLLFKTPENQTQL-------SKLAPYTKDYPIGDQ 589
Query: 686 VVALVCQNFSCSPPVTDPISLENLL 710
+C N +C P + SL+N+L
Sbjct: 590 PTYYLCSNGTCQAPTSSLESLKNIL 614
>gi|196232510|ref|ZP_03131362.1| protein of unknown function DUF255 [Chthoniobacter flavus Ellin428]
gi|196223272|gb|EDY17790.1| protein of unknown function DUF255 [Chthoniobacter flavus Ellin428]
Length = 428
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 197/366 (53%), Positives = 242/366 (66%), Gaps = 16/366 (4%)
Query: 11 KTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYM 64
K RR H I +TCHWCHVM ESFE+ AKL+N+ FV+IKVDREERPDVD+VYM
Sbjct: 57 KARREHKPIFLSIGYSTCHWCHVMAHESFENPATAKLMNENFVNIKVDREERPDVDRVYM 116
Query: 65 TYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML 124
TYVQA G GGWP+SVFL+PDLKP GGTYFPPED+YGRPGF TIL+++ +AW + +
Sbjct: 117 TYVQATTGSGGWPMSVFLTPDLKPFYGGTYFPPEDRYGRPGFPTILQRLAEAWKDDHEKV 176
Query: 125 AQSGAFAIEQLSE-ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 183
+ AI L++ S A S + E A+ L QL++S+D GGFG APKFPR
Sbjct: 177 LGAANDAIRALNDYTASGPAQSTAVGKE----AIALALNQLTRSFDDELGGFGGAPKFPR 232
Query: 184 PVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
PV + + + + + G+A+ G M L TLQ MA GG+HDH+GGGFHRYSVD+ WH
Sbjct: 233 PVTLNFLFHVFAREGHESRDGKAALG--MALITLQKMADGGMHDHLGGGFHRYSVDKFWH 290
Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS 303
VPHFEKMLYDQ QLA+ YLDAF +T D Y RDI DY+RRDM GG +SAEDADS
Sbjct: 291 VPHFEKMLYDQAQLASSYLDAFQVTHDTVYERTARDIFDYVRRDMTDAGGGFYSAEDADS 350
Query: 304 AETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKG 362
+G EGAFYVWT E+ +LGE A +F Y + GN SDP EF+G
Sbjct: 351 LLEKGKPEHSEGAFYVWTKDEIVHVLGEDAAAVFDRVYGVDAEGNA--PEGSDPQGEFRG 408
Query: 363 KNVLIE 368
KN+LI+
Sbjct: 409 KNILIQ 414
>gi|70995702|ref|XP_752606.1| DUF255 domain protein [Aspergillus fumigatus Af293]
gi|19309415|emb|CAD27314.1| hypothetical protein [Aspergillus fumigatus]
gi|41581314|emb|CAE47963.1| hypothetical protein, conserved [Aspergillus fumigatus]
gi|66850241|gb|EAL90568.1| DUF255 domain protein [Aspergillus fumigatus Af293]
Length = 799
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 244/620 (39%), Positives = 334/620 (53%), Gaps = 55/620 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA LLN+ F+ IKVDREERPD+D VYM YVQA G GGWPLS
Sbjct: 63 SACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLS 122
Query: 80 VFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+P+L+P+ GGTY+P + + GF IL K++D W ++ S Q
Sbjct: 123 VFLTPNLEPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAKEITRQ 182
Query: 135 LSEALSASASSN----KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L E S + ++L L + + YD+ GGF APKFP P + +
Sbjct: 183 LREFAEEGTHSQQGDRQAGEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPANLSFL 242
Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + + D E M + TL MA+GGI DH+G GF RYSV W +PHF
Sbjct: 243 LRLKTYPSAVSDIVGQEECDRAAAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHF 302
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
EKMLYDQ QL +VY+DAF +T + D+ YL I P G S+EDADS T
Sbjct: 303 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDADSLPT 362
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN ++ DPH+EF +NV
Sbjct: 363 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQNV 420
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
L S A + G+ E+ + I+ ++KL + R K R RP LDDKVIV+WNGL I +
Sbjct: 421 LSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKVIVAWNGLAIGA 480
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A+ S + + E ES S + E A A +FI+ +L+++ T +L +R+G
Sbjct: 481 LAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRIYRDGS 530
Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN-------------TQDEL--- 527
+ PGF DDYA+LI GLLD+YE +L +A +LQ+ TQ E
Sbjct: 531 RGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTQAEYLND 590
Query: 528 -FLDREG---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
FL G GY++T T P LLR+K + A PS N V NL+RL++++
Sbjct: 591 NFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALL--- 647
Query: 580 KSDYYRQNAEHSLAVFETRL 599
+ + YR A + F +
Sbjct: 648 EEEEYRTLARQTCLSFSVEI 667
>gi|325288476|ref|YP_004264657.1| hypothetical protein Sgly_0289 [Syntrophobotulus glycolicus DSM
8271]
gi|324963877|gb|ADY54656.1| protein of unknown function DUF255 [Syntrophobotulus glycolicus DSM
8271]
Length = 752
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 263/759 (34%), Positives = 383/759 (50%), Gaps = 92/759 (12%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL +TCHWCHVME ESFED+ VA+ LN F+++KVDREERPD
Sbjct: 34 GIEAFEKAAKENKPVFLSIGYSTCHWCHVMERESFEDKEVAEKLNKSFIAVKVDREERPD 93
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D YMT+ QAL G GGWPL++ ++PD KP GTYF GR G +L + W
Sbjct: 94 IDHTYMTFCQALTGAGGWPLTILMTPDKKPFFAGTYFAKNSGGGRVGLIDVLDYTSEKWK 153
Query: 119 KKRDMLA------------------QSGAFAIEQLSEALSASASSNKLPDEL---PQNAL 157
+++ + Q F E L E + + + + D++ + +
Sbjct: 154 NEKEKILTSAEELYTVVSSHYGGKDQETVFKKEGLLEEVRYADARKQTKDDIMVWGKQMI 213
Query: 158 RLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTL 217
E L+K++D +FGGFG APKFP P + ++ D +MV TL
Sbjct: 214 EKGYEMLAKTFDPKFGGFGHAPKFPSPHTLGFLMRCHLDRPD-------QNALEMVRKTL 266
Query: 218 QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYIC 277
MA GGI+D +G GF RYS D W VPHFEKMLYD LA YL+A+ LT + Y +
Sbjct: 267 DLMADGGIYDQIGYGFSRYSTDRFWLVPHFEKMLYDNATLAYTYLEAYQLTHEQRYGQVA 326
Query: 278 RDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFK 337
R+I Y+ R+M P G +SAEDADS EG +EG +Y+WT +EV + L + +
Sbjct: 327 REIFSYVLREMCSPEGGFYSAEDADS---EG----EEGKYYIWTYQEVMETLTAELLRIQ 379
Query: 338 E-------------------HYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL-NDSSASAS 377
E H + P C+ +++ N F+GKN+L L +D A
Sbjct: 380 ENRASLDQPDGRDIFQSQFAHPDVLPGLYCEAYQITKEGN-FEGKNILNRLFSDWRDLAR 438
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K +P ++++ + C L VR +R RP DDK++VSWNGL+I++ A+ +++L
Sbjct: 439 KASIPFDEFVRAIRYCNTILLRVRERRVRPIRDDKILVSWNGLMIAALAKGAQVL----- 493
Query: 438 SAMFNFP----VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+FP V + Y+ AE AA+FI ++ RL +R+G ++ P +LDD
Sbjct: 494 ----SFPDQTFAVHENASLYLTQAEKAANFIDDNMRSSDG-RLFARYRHGEAQYPAYLDD 548
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAF I GLL+LY +L AIELQ Q+ LF D E GGYF T + +L R KE
Sbjct: 549 YAFYIFGLLELYTACGKPVYLQRAIELQQQQENLFRDTEKGGYFFTGKDSEELLFRPKEV 608
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
+DGA PSGNS++V+NL +L + +K ++ AE ++ F +K+ A
Sbjct: 609 YDGALPSGNSLAVLNLTKLWKMTGDNK---WKNIAEGNIQSFHAEMKEYP--------AG 657
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT--VIHIDPADTEEMDFWEEHNS 671
+ + S +H + G E +L A + LNK V D + + E
Sbjct: 658 HLAFLRSIQHYISDGD------ELILGGALNNEVLNKMKEVFFRDFRPYAVLLYHEGTVQ 711
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+K A +C+NFSC PV L+++L
Sbjct: 712 ELVPELAGYPQQEKAAAYLCRNFSCLNPVFSVEELQHVL 750
>gi|448343975|ref|ZP_21532892.1| hypothetical protein C486_20033 [Natrinema gari JCM 14663]
gi|445622058|gb|ELY75523.1| hypothetical protein C486_20033 [Natrinema gari JCM 14663]
Length = 732
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 233/697 (33%), Positives = 363/697 (52%), Gaps = 60/697 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF+DE VA++LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEAESFQDEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
+L+P+ +P GTYFP E + G+PGF+ + +++ D+W+ D Q A +
Sbjct: 113 AWLTPEGEPFFIGTYFPREGQRGQPGFRELCKRISDSWESDADREEMENRAQQWTDAATD 172
Query: 134 QLSEALSASASSN-KLPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMML 191
+L E A+ + P+ + L A+ + +S D +GGFGS+ PKFP+P I+++
Sbjct: 173 RLEETPDAAGGGTVEAPEPPSSDVLETAADAVVRSADREYGGFGSSGPKFPQPSRIRVL- 231
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
++ + TG+ E ++++ TL MA GG++DHVGGGFHRY VD W VPHFEKML
Sbjct: 232 --ARTYDRTGR----DEYREVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKML 285
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD ++ +L + LT + Y+ + D L ++ R++ G FS DA SA E R
Sbjct: 286 YDNAEIPRAFLSGYQLTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSASPETGER 345
Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EGAFYVWT EV D+L + A LF + + GN F+G+N +
Sbjct: 346 -EEGAFYVWTPAEVHDVLEDETDAALFCARFDITEAGN------------FEGRNQPNRV 392
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
S A++ + + L L R++LF+ R +RPRP+ D+K++ WNGL+IS++A A+
Sbjct: 393 ARVSELAAQFDLAEHEILKRLASARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAA 452
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L G+D +Y + A A F+R L+D+ RL +++G K G
Sbjct: 453 LVL--------------GAD--DYADTAVDALEFVRDELWDDDEQRLSRRYKDGDVKVDG 496
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
+L+DYAFL G LD Y+ L +A+EL + F D + G + T +++ R
Sbjct: 497 YLEDYAFLARGALDCYQATGEVDHLAFALELARVIEAEFWDADRGTLYFTPESGEALVTR 556
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+E D + PS V+V L+ L A + + A L +L+ A+ +
Sbjct: 557 PQELGDQSTPSATGVAVETLLALDEFAA----EDFEPIAATVLETHANKLETNALEHATL 612
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
C AD L + + V + + + L + + + + P + +D W E
Sbjct: 613 CLVADRLEAGALE-VTVAADDLPTAWRDRLTSQY----FPDRLFALRPPTEDGLDAWLET 667
Query: 670 ----NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
++ R + + VC++ +CSPP D
Sbjct: 668 LGLADAPPIWAGREARDGEPTL-YVCRDRTCSPPSHD 703
>gi|300087365|ref|YP_003757887.1| hypothetical protein Dehly_0239 [Dehalogenimonas
lykanthroporepellens BL-DC-9]
gi|299527098|gb|ADJ25566.1| protein of unknown function DUF255 [Dehalogenimonas
lykanthroporepellens BL-DC-9]
Length = 669
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 247/694 (35%), Positives = 367/694 (52%), Gaps = 77/694 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE A ++N F++IKVDREERPD+D +YM VQA+ G GGWP++
Sbjct: 48 SACHWCHVMAHESFEDEATAAVMNRHFINIKVDREERPDIDSIYMAAVQAMTGHGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD KP GGTY+PPED++G P F IL V +A+ ++ D +A + + +++
Sbjct: 108 VFLTPDGKPFYGGTYYPPEDRHGLPAFTRILEAVAEAYRERPDEVAATATRLVTAVADKP 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLE 198
A + L EL A + L++ +D GFG APKFP+P+ + +L YH +
Sbjct: 168 VGDAGESSLTVELLDRAF----QALTRDFDENHAGFGGAPKFPQPLVLDFLLRYHYRT-- 221
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++ +MV TL+ M +GG++DH+GGGFHRYSVD+ W VPHFEKMLYD LA
Sbjct: 222 ------SSARALEMVEKTLEAMYRGGMYDHLGGGFHRYSVDDAWQVPHFEKMLYDNALLA 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE-IFSAEDADSAETEGATRKKEGAF 317
VYL AF +T Y + DILDY+ +M P +SA+DADS EG +EG +
Sbjct: 276 RVYLHAFQITGKAQYRLVTEDILDYVLEEMTDPATSGFYSAQDADS---EG----EEGRY 328
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+WT E+E +LG E A +F Y + GN F+G+N+L + S A
Sbjct: 329 YIWTPDEIESVLGRESAEIFGRRYGVTQAGN------------FEGRNILHLTGEFSVEA 376
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
S G+ + R +L R KR P D K++VSWN + + A A
Sbjct: 377 SA-GVSAD---------RARLLAERRKRVPPGTDTKILVSWNAMTQLALASAG------- 419
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
V DR +Y+ AE+ A+F+ +L D + RL+H+ S A GFL+DYA
Sbjct: 420 ---------VALDRPDYLAAAEANAAFLLDNLLD--SGRLRHTV----SVAEGFLEDYAL 464
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L LL L++ +WL A+ L ELF D + G +++T + + R + DG
Sbjct: 465 LTESLLALHKATLTPRWLRQAMALGAAMVELFWDEDEGVFYDTPADAGQLFQRPRNFQDG 524
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSG SV+ + L+RL+ + + Y Q A +L + + + L A D
Sbjct: 525 AVPSGASVASLALLRLSRL---ADERSYWQTAGRALKGVSSFMGRYPLGFGLWLGALDFY 581
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
P ++ V ++G + ++A ++ N + +D D+E + ++
Sbjct: 582 LGP-QQEVAVIGPAADDASRRLVAVVGRAFRPNTVLAGLDAGDSEGI-------ASLPLF 633
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC++F+C PPVT P+ LE +L
Sbjct: 634 QGRGQTAGQPTAWVCRSFTCYPPVTAPVDLEQVL 667
>gi|407465214|ref|YP_006776096.1| hypothetical protein NSED_06780 [Candidatus Nitrosopumilus sp. AR2]
gi|407048402|gb|AFS83154.1| hypothetical protein NSED_06780 [Candidatus Nitrosopumilus sp. AR2]
Length = 675
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 244/686 (35%), Positives = 362/686 (52%), Gaps = 74/686 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFE+E VAK +N+ F++IKVDREERPD+D +Y Q G GGWPLS
Sbjct: 49 SSCHWCHVMAHESFENEDVAKFMNENFINIKVDREERPDIDDIYQKVCQIATGQGGWPLS 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD KP GTYFP D YGRPGF +I R++ AW +K + + S I+ L++
Sbjct: 109 VFLTPDQKPFYVGTYFPVLDSYGRPGFGSICRQLSQAWKEKPNDIETSAKRFIDALTK-- 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A + ++P +L + L A L + D+ +GGFGSAPKFP I L+ KL
Sbjct: 167 ---AEAIQVPSKLERILLDEAAMNLFQLGDATYGGFGSAPKFPNAANIS-FLFRYAKLSG 222
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
K E L TL+ MA GGI D +GGGF RYS D +W VPHFEKMLYD ++
Sbjct: 223 LTKFNE------FALKTLKKMANGGIFDQIGGGFSRYSTDAKWLVPHFEKMLYDNALISV 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +AF +TKD FY + R LD++ R+M P G +SA DADS EG EG +YV
Sbjct: 277 NYAEAFQITKDPFYLEVLRKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKYYV 329
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W E+++ILG+ A LF +Y + GN ++G N+L + S A
Sbjct: 330 WKKSEIKEILGDDADLFCLYYDVTDGGN------------WEGNNILCNNLNISTVAFNF 377
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ + I+ C +KL VRS R P LDDK++VSWN L+I++ A+ ++
Sbjct: 378 GISETEVKKIINLCSKKLLKVRSSRIPPGLDDKILVSWNSLMITALAKGYRV-------- 429
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ Y+ A++ SFI +L +L +++NG +K G+L+DY++ I+
Sbjct: 430 --------TGDILYLNAAKNCISFIENNLL--VNDKLLRTYKNGTAKIDGYLEDYSYFIN 479
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LLD++E K+L +++L + F D + +F T+ + +++R K ++D + P
Sbjct: 480 ALLDVFEIEPDEKYLKLSLKLAHHLVNHFWDSKNNNFFMTSDDHEKLIIRPKSNYDLSLP 539
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD----MAMAVPL-MCCAAD 614
SGNSVS L+RL Y + + + T++ + MA P +
Sbjct: 540 SGNSVSAFALLRL-----------YHLSQDSTFLKITTKIMESQAQMAAENPFGFGYLLN 588
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
+S+ +K V + + ++ EN D I I D ++ E+ S
Sbjct: 589 TISMYIQKPVEI----TIINTENPKICESLLLDYLPNSIMITIRDASQL----ENLSEYP 640
Query: 675 SMARNNFSADKVVALVCQNFSCSPPV 700
A +F DK VC++F+CS P+
Sbjct: 641 FFAGKSFE-DKTTVFVCKDFTCSLPL 665
>gi|134077135|emb|CAK45476.1| unnamed protein product [Aspergillus niger]
Length = 765
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 235/577 (40%), Positives = 322/577 (55%), Gaps = 39/577 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA +LN F+ IKVDREERPD+D VYM YVQA G GGWPL+
Sbjct: 57 SACHWCHVMEKESFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+P + G GF IL K+ D W ++ +S +Q
Sbjct: 117 VFLTPDLEPVFGGTYWPGPNSSTLTGNGTIGFVEILEKLSDVWQTQQLRCRESAKEITKQ 176
Query: 135 LSEALSASASS----NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L E S + ++L L + YD GGF +APKFP P + +
Sbjct: 177 LREFAEEGTHSYQGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFL 236
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
L+ + E ++ M + TL MA+GGI DH+G GF RYSV W +PHFEKM
Sbjct: 237 LHIVGR-------DECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHFEKM 289
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGA 309
LYDQ QL +VY+DAF +T + D+ YL I P G S+EDADS T
Sbjct: 290 LYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPTPND 349
Query: 310 TRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN ++ +DPH+EF +NVL
Sbjct: 350 TEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNVLSV 407
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFAR 427
S A G+ E+ + I+ ++KL D R + R RP LDDK+IV+WNGL I + A+
Sbjct: 408 KVTPSRLAKDFGLGEEEVVRIIRAAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGALAK 467
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SK 486
S + + E ES S + E A A +FI+ +L+++ T +L +R+G
Sbjct: 468 CSALFE-EIES---------SKAVQCREAAAKAINFIKENLFEKPTGQLWRIYRDGGRGN 517
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT---- 539
PGF DDYA+LI GLLD+YE +L +A +LQ ++ FL G GY++T
Sbjct: 518 TPGFADDYAYLIGGLLDMYEATFDDSYLQFAEQLQKYLNDNFLAYVGTTPAGYYSTPSTM 577
Query: 540 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
T P LLR+K + A P+ N V NL+RL S++
Sbjct: 578 TSGAPGPLLRLKTGTESATPAVNGVIARNLLRLGSLL 614
>gi|442323509|ref|YP_007363530.1| hypothetical protein MYSTI_06573 [Myxococcus stipitatus DSM 14675]
gi|441491151|gb|AGC47846.1| hypothetical protein MYSTI_06573 [Myxococcus stipitatus DSM 14675]
Length = 697
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 248/704 (35%), Positives = 357/704 (50%), Gaps = 69/704 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE A+L+N+ F++IKVDREERPD+D++Y VQ + GGGWPL+
Sbjct: 57 SACHWCHVMAHESFESPDTARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP GGTYFPPED+YGRPGF +L ++DAW KR+ + + A E L E
Sbjct: 117 VFLTPDLKPFYGGTYFPPEDRYGRPGFPRLLMALRDAWKNKREDIHRQAAQFEEGLGEL- 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A+ + P L + ++++ DS GGFG APKFP P+ ++L ++
Sbjct: 176 -AAYGLDAAPGVLSVEDVLSMGQRMALQVDSVHGGFGGAPKFPNPMNFSLLLRAWRR--- 231
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G + V TL+ MA GGI+D +GGGFHRYSVD RW VPHFEKMLYD QL +
Sbjct: 232 ----GGGDSLRDAVFLTLERMALGGIYDQLGGGFHRYSVDARWLVPHFEKMLYDNAQLMH 287
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y +A + + + + ++Y+RR+M GG ++A+DADS EG +EG F+V
Sbjct: 288 LYSEAQQVAPRPLWRKVVEETVEYVRREMTDAGGGFYAAQDADS---EG----EEGKFFV 340
Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W +E++ +L E A L H+ + P GN + G VL + + A +
Sbjct: 341 WRPEEIQAVLPPERAELVMRHFRVTPLGNFE-----------HGATVLEVVVPAETLARE 389
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ LE L E R+ LF R +R +P DDK++ WNGL+I A A+++
Sbjct: 390 RSLSLEAVERELAETRQVLFQARERRVKPGRDDKILAGWNGLMIRGLALAARVF------ 443
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
DR ++ +A SAA F+ L+D RL S++ G ++ GFL+DY L
Sbjct: 444 ----------DRPDWTRLAVSAADFVLAKLWD--GTRLARSYQEGQARIDGFLEDYGDLA 491
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
SGL LY+ K+L A L +ELF D E Y +++ D A
Sbjct: 492 SGLTALYQATFDVKYLEAAKALVKRAEELFWDAEKQAYLTAPRGQKDLVVATYGLFDNAF 551
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSG S V LA++ + +++ + +A L AM + AAD L +
Sbjct: 552 PSGASTLTEAQVALAAL---TGDEHHLELPSKYVARMREGLVANAMGYGHLGLAADSL-L 607
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
V G +V +L+AA+ Y A T W+E ++ +
Sbjct: 608 DGGAGVTFSGSSDAV--APLLSAANHVY-----------APTFAFG-WKEEGRPVPALLK 653
Query: 679 NNFS-----ADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
F A K A +C+ F+C P TD +L L EKP
Sbjct: 654 ELFEGREPVAGKGAAYLCRGFACELPRTDAKALAERLTEKPKGA 697
>gi|159131360|gb|EDP56473.1| DUF255 domain protein [Aspergillus fumigatus A1163]
Length = 799
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 244/620 (39%), Positives = 333/620 (53%), Gaps = 55/620 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA LLN+ F+ IKVDREERPD+D VYM YVQA G GGWPLS
Sbjct: 63 SACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLS 122
Query: 80 VFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+P+L P+ GGTY+P + + GF IL K++D W ++ S Q
Sbjct: 123 VFLTPNLDPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAKEITRQ 182
Query: 135 LSEALSASASSN----KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L E S + ++L L + + YD+ GGF APKFP P + +
Sbjct: 183 LREFAEEGTHSQQGDRQAGEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPANLSFL 242
Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + + D E M + TL MA+GGI DH+G GF RYSV W +PHF
Sbjct: 243 LRLKTYPSAVSDIVGQEECDRAAAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHF 302
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
EKMLYDQ QL +VY+DAF +T + D+ YL I P G S+EDADS T
Sbjct: 303 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDADSLPT 362
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN ++ DPH+EF +NV
Sbjct: 363 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQNV 420
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
L S A + G+ E+ + I+ ++KL + R K R RP LDDKVIV+WNGL I +
Sbjct: 421 LSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKVIVAWNGLAIGA 480
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A+ S + + E ES S + E A A +FI+ +L+++ T +L +R+G
Sbjct: 481 LAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRIYRDGS 530
Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN-------------TQDEL--- 527
+ PGF DDYA+LI GLLD+YE +L +A +LQ+ TQ E
Sbjct: 531 RGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTQAEYLND 590
Query: 528 -FLDREG---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
FL G GY++T T P LLR+K + A PS N V NL+RL++++
Sbjct: 591 NFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALL--- 647
Query: 580 KSDYYRQNAEHSLAVFETRL 599
+ + YR A + F +
Sbjct: 648 EEEEYRTLARQTCLSFSVEI 667
>gi|429217838|ref|YP_007179482.1| thioredoxin domain-containing protein [Deinococcus peraridilitoris
DSM 19664]
gi|429128701|gb|AFZ65716.1| thioredoxin domain protein [Deinococcus peraridilitoris DSM 19664]
Length = 677
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 246/692 (35%), Positives = 361/692 (52%), Gaps = 69/692 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE VA +N FV+IKVDREERPDVD VYM+ VQA G GGWP++
Sbjct: 47 STCHWCHVMAHESFEDETVAGFMNTHFVNIKVDREERPDVDAVYMSAVQATTGSGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL +P GTYFPP D +G P F +L V AW+ +R L Q+ E L++ L
Sbjct: 107 VFLDAQGRPFYAGTYFPPRDAHGMPSFSRVLAGVAQAWNGRRQDLMQNA----ETLTQHL 162
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SA + + LP + Q+ K +D+R GGFGSAPKFP P + +L
Sbjct: 163 Q-SAGRREGSEALPADFTARGLAQVRKLFDARHGGFGSAPKFPAPTTLAYLLTQ------ 215
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ + + L TLQ MA GG++D +GGGFHRYSVDERW VPHFEKMLYD QLA
Sbjct: 216 -------PQARDISLTTLQKMAAGGLYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLAR 268
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYL A+ LT + ++ R+ L+YL R+M+ P G +SA+DADS EG EG F+V
Sbjct: 269 VYLQAYQLTGEASFTQFARETLEYLEREMLSPEGGFYSAQDADS---EGI----EGKFFV 321
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASK 378
WT +E++ ILG+ A L + + GN DPH+ +F ++VL + + A +
Sbjct: 322 WTPQELQAILGDDAALAARFWGVTAEGN-----FMDPHHPDFGRRSVLSVVASPTELAEQ 376
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+ L RR+L++ R R P D KV+ SWNGL + +FA A+++L+ E
Sbjct: 377 FGLSEPDVRRRLEAARRRLWEERELRVHPGTDTKVLTSWNGLALGAFALAARVLREE--- 433
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+++VA A F+R HL E L+HS+++G ++ G L+D+A
Sbjct: 434 -------------RFLDVARRNADFVRSHLRSEDA-TLRHSYKDGQARVQGLLEDHALYA 479
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++LY+ L WA EL N F D+EGG +++T+ +++ R K+ D A
Sbjct: 480 LGLIELYQASGHLPHLEWARELWNVVATEFWDQEGGAFWSTSARAETLITRQKDAFDSAV 539
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
S N+ + + + + + + + A ++ F + + A +L+
Sbjct: 540 MSDNAAAALLGLWMGRYYGDPRGE---ELATRTIGTFAADMLAAPSGFGGLWQAHALLTA 596
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P + VL ++ FE LA + + P++ + +
Sbjct: 597 PHVEVAVLGSSQARAPFEAELARHFLPF------AALAPSEA------------GSGLPV 638
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + VA VC+NF+C P D +L L
Sbjct: 639 LEGRSGEGVAYVCRNFACDLPARDTATLGQQL 670
>gi|383762697|ref|YP_005441679.1| hypothetical protein CLDAP_17420 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381382965|dbj|BAL99781.1| hypothetical protein CLDAP_17420 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 689
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 244/692 (35%), Positives = 358/692 (51%), Gaps = 63/692 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE A L+N+ FV+IKVDREERPD+D +YM VQA+ G GGWP+S
Sbjct: 53 SACHWCHVMERESFEDEETAALMNELFVNIKVDREERPDLDAIYMDAVQAMTGQGGWPMS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PD KP GGTYFP E +YG P F+ +LR V +A+ ++R+M+ E+L+ L
Sbjct: 113 VWLTPDGKPFYGGTYFPKEPRYGMPSFQQVLRAVAEAYRERREMVEGQA----ERLASML 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+AS EL + L Q+ + +D GGFGS PKFP+P+ + L +
Sbjct: 169 QRTASLRAEGGELGEEILEEALGQMRQYFDEEEGGFGSQPKFPQPMTLDFALTQYLR--- 225
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG + M TL+ MA GGI+D +GGGFHRYSVD W VPHFEKMLYD QL
Sbjct: 226 TGN----LDALYMAELTLEKMAHGGIYDQLGGGFHRYSVDAIWLVPHFEKMLYDNAQLLR 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL A+ +T+ + + + +DY+ R+M P G +SA+DADS EG EG F++
Sbjct: 282 TYLHAWQVTQRPLFRRVVEETIDYVLREMTAPDGGFYSAQDADS---EG----HEGKFFL 334
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ +EVE +L H A +F ++Y + GN F+GKN+L + A +
Sbjct: 335 WSQQEVESLLDPHTAAIFCDYYGVSAHGN------------FEGKNILSVVRSIEQVAQR 382
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ + + L R LF R KR +P D+K++ WNGL+I + A +L
Sbjct: 383 FRIGEAEVEDALRRARAILFAHREKRIKPARDEKILTEWNGLMIHALAECGVVL------ 436
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+R++ + A AA FI + + RL S+++G ++ +L+DYA LI
Sbjct: 437 ----------ERQDALAAAVRAAEFILAQM-SQPDGRLYRSYKDGRARFNAYLEDYASLI 485
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL+ LYE +WL A L E F D GG+F T + ++ R K+ D A
Sbjct: 486 RGLIALYEATFDLRWLGEATRLAQIMFEQFHD-PAGGFFQTGVDHEQLVARRKDFVDNAV 544
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNS++ L+RL+ + + YR A L + + + + C D
Sbjct: 545 PSGNSLAAEALLRLSVFLDKPE---YRTEAGRILLMMKDAMARQPTGFGRLLCVLDAYLS 601
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
PS++ + +VG + +LA + + + +P E S +
Sbjct: 602 PSQE-IAIVGRRDDPATAALLAEVRRRFLPHAILALKEP----------EQESVLPLLQG 650
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K A VC+N++C PVT +L +L
Sbjct: 651 RTLVDGKATAYVCENYACKLPVTSAEALAAML 682
>gi|222056570|ref|YP_002538932.1| hypothetical protein Geob_3488 [Geobacter daltonii FRC-32]
gi|221565859|gb|ACM21831.1| protein of unknown function DUF255 [Geobacter daltonii FRC-32]
Length = 705
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 247/685 (36%), Positives = 344/685 (50%), Gaps = 76/685 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFED VAK LND FV+IKVDREERPD+D +M Q + G GGWPL+V
Sbjct: 80 TCHWCHVMAHESFEDREVAKALNDSFVAIKVDREERPDIDDQFMAVAQMISGSGGWPLNV 139
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF---AIEQLSE 137
L+PD KP TY P E + G PG +L ++ W ++RD + +S + ++E+L+
Sbjct: 140 LLTPDKKPFFAATYLPKERRMGVPGIIDLLERISRFWQRERDKVEESCSTIMASLERLNR 199
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A A EL + A QL+ YD +GGFG APKFP P I +L
Sbjct: 200 TEPAYAGG-----ELEEAAF----NQLAAMYDDDWGGFGQAPKFPMPHYISFLL------ 244
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
K+G E +M TL M +GGI+D +G G HRYSVD +W VPHFEKMLYDQ +
Sbjct: 245 -RCWKAGR-PEALQMAEHTLTRMRQGGIYDQLGFGIHRYSVDRQWLVPHFEKMLYDQALV 302
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A + +AF T +Y + R+IL+Y +M G G SA+DAD TEG +EG F
Sbjct: 303 AIAFAEAFQATGKNYYREVVREILNYCLVEMTGIDGGFCSAQDAD---TEG----QEGKF 355
Query: 318 YVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W + EV+++LGE A LF + + GN F+GKN+L ++ A
Sbjct: 356 YLWAAAEVKEVLGEEAARLFCRLFDITEKGN------------FEGKNILHLPVSIASFA 403
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ G+ E + L + R KL VR KR RP D KV+ +WNGL+I++ A+ + E
Sbjct: 404 DREGLIAESFKGELIKWRAKLLTVRQKRVRPLRDAKVLTAWNGLLIAALAKGYGVTGDET 463
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y+ AESA + I L ++ RL S+ G +K P FL+DYAF
Sbjct: 464 ----------------YLRAAESAVTIILEKLQTKEG-RLSRSYHLGQAKIPAFLEDYAF 506
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L GLL+LY+ +L A+ L LF GGG+++ + VL+R K +DG
Sbjct: 507 LGWGLLELYQVSLHQGYLFQALRLARDMIRLF-SAPGGGFYDNGMDAEEVLIRQKNAYDG 565
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS++ +NL+RL I+ K D EH + F + A D
Sbjct: 566 AMPSGNSIAAMNLLRLGKIL---KDDSLETAGEHGVGAFLGNALQQPAGYLQLIMAHDYQ 622
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ + L G + + +LA + + + H + D A
Sbjct: 623 HA-EKIEITLAGAREGAEIRALLATVNRHFIAGLVLRHAEDGD--------------AGA 667
Query: 677 ARNNFSADKVVALVCQNFSCSPPVT 701
A A +C + +C PPVT
Sbjct: 668 GTMEAPAVGAAAYICASGACRPPVT 692
>gi|284045681|ref|YP_003396021.1| hypothetical protein Cwoe_4232 [Conexibacter woesei DSM 14684]
gi|283949902|gb|ADB52646.1| protein of unknown function DUF255 [Conexibacter woesei DSM 14684]
Length = 666
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 253/695 (36%), Positives = 351/695 (50%), Gaps = 81/695 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFED A L+N+ FV IKVDREERPDVD +YM VQA+ G GGWPL+
Sbjct: 48 SACHWCHVMERESFEDPQTAALMNERFVCIKVDREERPDVDAIYMDAVQAMTGHGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
F +P+ P GTYFPP+ ++G P ++ +L + DAW +RD + + LS
Sbjct: 108 AFATPEQVPFYAGTYFPPQPRHGLPSWRQVLEAISDAWRARRDEILAQNDRIVAHLSAGA 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S + L +A+ + L + D GGFGSAPKFP+ I+++L
Sbjct: 168 RLAPSGAMVDPGLLDDAV----DSLRMAADPVNGGFGSAPKFPQASVIELLL-------- 215
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ GE Q + L L+ MA+GGIHD +GGGF RY+VD W VPHFEKMLYD LA
Sbjct: 216 --RRGE----QTVALDALRAMARGGIHDQLGGGFSRYTVDAAWVVPHFEKMLYDNALLAR 269
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL + ++ D +C D LD+ R+M GP G SA DADS EG EG FYV
Sbjct: 270 AYLHGWQVSGDPLLRQVCEDTLDWALREMRGPEGGFHSALDADS---EGV----EGKFYV 322
Query: 320 WTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
W+ E+ LG+ + + Y GN F+G N+L+ +SA+
Sbjct: 323 WSLAELRSALGDDELYDVAVAWYGATVAGN------------FEGLNILVRAGSASAAE- 369
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
P E L E RR+L RS R RP LDDK + SWN L+I++ A A +L
Sbjct: 370 ----PPE-----LPEIRRRLLAARSTRVRPGLDDKRLTSWNALMIAALAEAGAVL----- 415
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+R +Y++ A ASF+ L RL S+++G + PG+L+D+A+
Sbjct: 416 -----------ERDDYLDAARGTASFLLDSLATSDG-RLLRSWKDGRATLPGYLEDHAYA 463
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LL LYE +W A L + F D E GG+F T + ++ R K+ D
Sbjct: 464 LEALLTLYEATFEERWFTAARALADATIAHFADAEHGGFFMTADDHEQLVARRKDLEDTP 523
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNS + L+RLA + +DY R+ AE +A+ AMA + A D
Sbjct: 524 IPSGNSAAAFGLLRLARLT--GSADYERE-AERVIALLHPLAAGHAMAFAHLLAAID-FQ 579
Query: 618 VPSRKHVVLVGHKSSVD-FENMLAAAHASYDLNKTVIHIDPA-DTEEMDFWEEHNSNNAS 675
+ V +VG +++ E ++ A K H+ A T E D E +
Sbjct: 580 LGEVHEVAIVGDRAAAKPLERVVRA--------KLRPHVVLAGGTGEGDRDAEASVVPLL 631
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
R+ K A VC+ F+C PVTDP +L LL
Sbjct: 632 EGRHAVGG-KPAAYVCERFACRAPVTDPDALAELL 665
>gi|430745763|ref|YP_007204892.1| thioredoxin domain-containing protein [Singulisphaera acidiphila
DSM 18658]
gi|430017483|gb|AGA29197.1| thioredoxin domain protein [Singulisphaera acidiphila DSM 18658]
Length = 811
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 237/608 (38%), Positives = 324/608 (53%), Gaps = 60/608 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++C+WCHVME E F+D +AKL+N FV IKVDREERPD+D++YM +QA +G GGWP+S
Sbjct: 86 SSCYWCHVMERECFKDPQIAKLMNQKFVCIKVDREERPDIDQIYMAALQA-FGNGGWPMS 144
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+PD +P GGTYFPP+D+ G GF T+L V DAW ++ + +S + + +L
Sbjct: 145 MFLTPDGRPFFGGTYFPPKDRNGIRGFPTVLAGVADAWRDEKAQIEESADRLTDLVRRSL 204
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQMMLYH 193
+ S P L + E+L++ +D +GGFG PKFP PV + +L
Sbjct: 205 AKSNDKRHAP--LTRAVAAQGREELTEQFDPEYGGFGFNPENARRPKFPEPVNLVFLLDE 262
Query: 194 SKKLEDTGKSGEASEGQK-------MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
++ GK EGQ+ MVL TL MA+GGI D + GG+HRY+ W VPH
Sbjct: 263 HRRGAAAGKK----EGQEASSNALAMVLKTLDQMARGGIRDQLAGGYHRYATSRYWIVPH 318
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
FEKMLYD QLA+ +L AF LT D + ++ R M P G +SA D AET
Sbjct: 319 FEKMLYDNAQLASTHLLAFELTADPRWRLEAESTFAFIARSMTSPEGGFYSAID---AET 375
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
+G EG +YVWT EVE LG F + Y LK N + K +
Sbjct: 376 DG----DEGQYYVWTRDEVEKTLGAGPDYEAFAQVYGLKREPNFE-----------KERY 420
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
VL+E + A+ L + R KL VR +RP P LDDKV+ SWNGL+I++
Sbjct: 421 VLLEPRSRADQAATLKTTPAALEATMAPLRAKLLAVRERRPAPLLDDKVLTSWNGLMIAA 480
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
+A +IL +Y + A+ AA FI L RL S+R G
Sbjct: 481 YADGFRILHD----------------AKYRQAADKAADFILAKLRSPDG-RLLRSYRLGQ 523
Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 544
+K G+L+DYAFL+ GLL L+ K L A EL + F D E GG+F T
Sbjct: 524 AKLAGYLEDYAFLVHGLLRLHAATGDPKRLTQARELTDRMIADFSDPEEGGFFYTADGHE 583
Query: 545 SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 604
S+L R K+ +DGA PSGNSV++ NLV LAS ++ Y A+ +L F + L
Sbjct: 584 SLLARPKDPYDGALPSGNSVAIRNLVALASATGEAR---YLDQAQKALDAFSSTLAQNPG 640
Query: 605 AVPLMCCA 612
++PL+ A
Sbjct: 641 SLPLLVVA 648
>gi|172058552|ref|YP_001815012.1| hypothetical protein Exig_2546 [Exiguobacterium sibiricum 255-15]
gi|171991073|gb|ACB61995.1| protein of unknown function DUF255 [Exiguobacterium sibiricum
255-15]
Length = 677
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 239/709 (33%), Positives = 359/709 (50%), Gaps = 77/709 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHV+ ESFEDE A++LND F+SIKVDREERPD
Sbjct: 28 GEEAFAAARSANKPIFLSIGYSTCHWCHVLAHESFEDEETARMLNDRFISIKVDREERPD 87
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YMT Q + G GGWPLSVF+SPD P GTYFP ++ RP F+ +L ++ + +
Sbjct: 88 IDQIYMTAAQMMNGQGGWPLSVFMSPDQTPFYIGTYFPKTPQFNRPSFRQVLLQLSEHYR 147
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
D + + G +++ +AL+A + + D L + + +Q + YD GGFG+A
Sbjct: 148 TDPDKIKRVG----QEIIQALTAVTTFDS-EDPLDEALVHETFDQAMRQYDVENGGFGTA 202
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L D + E +MV+ TL M GGI DHVG G +RY+V
Sbjct: 203 PKFPSPSLLTFLL-------DYYRFAEDETALQMVMRTLTAMRDGGITDHVGFGLYRYTV 255
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DERW +PHFEKMLYD A + ++ + ++ + +I Y+ RD+ P G +SA
Sbjct: 256 DERWEIPHFEKMLYDNALFATLCIETYQVSGRERFKQYAEEIFAYIERDLSSPDGAFYSA 315
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADS EG +EG FY +T E+ D+LG+ A+ F Y P GN
Sbjct: 316 EDADS---EG----REGLFYTFTFDELTDLLGQDAV-FPLLYQATPQGN----------- 356
Query: 359 EFKGKNVLIELNDSSASASK-LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
F+G+ V S S ++ L L + RR L RS+R RP DDKV+ SW
Sbjct: 357 -FEGRIVFRRTGQSIQQLSADRNTAVQDILIQLEQERRTLLLFRSQRTRPFRDDKVLTSW 415
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
N L+IS++A+A ++ E Y + A A +F+ HL D+ RL
Sbjct: 416 NALMISAYAKAGRVFNDE----------------RYTKFARQALTFLETHLMDDD--RLH 457
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+R G + G+LDDY+FL L+L++ +L AI L F D E G +F
Sbjct: 458 VRYRQGHIQGNGYLDDYSFLTEAYLELHQTTQHIPYLKQAIRLTERMIGDFSD-EDGSFF 516
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T+ ED ++L+R K+ +D +P+GNS +V NL+RL+ + + YR A+ + + +
Sbjct: 517 FTSFEDETLLMRPKDVYDVVKPAGNSTAVSNLLRLSQLTGRTD---YRDQAQRNFSTLAS 573
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHV----VLVGHKSSVDFENMLAAAHASYDLNKTVI 653
+K A +LSV +R + ++V +S D + L H +++
Sbjct: 574 EIKSQPTGF------ASLLSVYTRTLMEPKELIVLTESYTDVASFLTQLHQRRLPELSLL 627
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
D E+ +A + + A +C +F C P T+
Sbjct: 628 VGSKTDLLEI---------APFLATYDAPTQQPTAYLCHDFQCDRPTTN 667
>gi|212538503|ref|XP_002149407.1| DUF255 domain protein [Talaromyces marneffei ATCC 18224]
gi|210069149|gb|EEA23240.1| DUF255 domain protein [Talaromyces marneffei ATCC 18224]
Length = 783
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 238/612 (38%), Positives = 333/612 (54%), Gaps = 51/612 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LND F+ IKVDREERPD+D VYM YVQA G GGWPL+
Sbjct: 68 SACHWCHVMEKESFMSTEVATILNDSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 127
Query: 80 VFLSPDLKPLMGGTYFP-----PEDKYGRP---GFKTILRKVKDAW--------DKKRDM 123
VFL+PDL+P+ GGTY+P + ++G GF IL K++D W D +++
Sbjct: 128 VFLTPDLEPVFGGTYWPGPQASSQSQWGAEGPIGFVDILEKLRDVWQTQQARCLDSAKEI 187
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 183
Q FA E A L EL + A + + YD +GGFG APKF
Sbjct: 188 TKQLREFAEEGTHTQQGAKGGGEDLEIELIEEAF----QHFASRYDPLYGGFGRAPKFHT 243
Query: 184 PVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
P + ++ + + D E M TL +A+GGI DH+G G RYSV
Sbjct: 244 PANLSFLIRLGMYPSAVSDIVGQDECVRATAMATNTLLNIARGGIRDHIGHGVARYSVTA 303
Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAE 299
W +PHFEKMLYDQ QL +VY+DAF T + D++ YL + I G +S+E
Sbjct: 304 DWLLPHFEKMLYDQAQLLDVYVDAFRATHEPELLGAVYDLVSYLTSEPIQASTGGYYSSE 363
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHN 358
DADS T T K+EGAFYVWT KE++ +LG+ A + H+ + GN ++ +DPH+
Sbjct: 364 DADSLPTPNDTEKREGAFYVWTMKELKQVLGQRDAGVCARHWGVLADGN--IAPENDPHD 421
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSW 417
EF +NVL S A + G+ E+ + I+ ++KL D R K R RP LDDK+IV+W
Sbjct: 422 EFMDQNVLSIKVTPSKLAKEFGLSEEEVIKIIKSGKQKLRDYREKIRVRPDLDDKIIVAW 481
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL I + A+AS +L+ + ++ + A A FIR+ L++ + +L
Sbjct: 482 NGLTIGALAKASVLLEE----------IDKVKAQQCRDSAHKAVEFIRKTLFEPSSGQLW 531
Query: 478 HSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG--- 533
+R+G PGF DDYAFL SGL+ +YE +L +A +LQ ++ F+ G
Sbjct: 532 RIYRDGHRGNTPGFADDYAFLTSGLIAMYEATFDDSYLQFAEQLQKHLNQYFMAPGGESG 591
Query: 534 --GGYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
GY+ T+ E +P LLR+K D A PS N + NLVRL +++ + D YR+
Sbjct: 592 TSAGYYTTSSEPISGEPGPLLRLKSGTDSATPSINGIIARNLVRLGTLL---EDDNYRRL 648
Query: 588 AEHSLAVFETRL 599
A + + F L
Sbjct: 649 ARQTCSTFSVEL 660
>gi|255655589|ref|ZP_05400998.1| hypothetical protein CdifQCD-2_07782 [Clostridium difficile
QCD-23m63]
gi|296451580|ref|ZP_06893315.1| thymidylate kinase [Clostridium difficile NAP08]
gi|296878837|ref|ZP_06902837.1| thymidylate kinase [Clostridium difficile NAP07]
gi|296259645|gb|EFH06505.1| thymidylate kinase [Clostridium difficile NAP08]
gi|296430109|gb|EFH15956.1| thymidylate kinase [Clostridium difficile NAP07]
Length = 678
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 233/696 (33%), Positives = 356/696 (51%), Gaps = 79/696 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA+++N FV+IKVD+EERPDVD VYMT QA+ G GGWP+++
Sbjct: 54 TCHWCHVMEKESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTI 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD KP GTYFP +Y RPG +L V + W+ RD+L +SG IE L +
Sbjct: 114 IMTPDKKPFFAGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFG 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 198
+ L E+ +++R+ YD ++GGFG+APKFP P + ++ Y +K +
Sbjct: 174 VKNTEGDLSKEMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDK 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D KMV TL M +GG+ DH+G GF RYS D++W PHFEKMLYD L
Sbjct: 230 DV---------LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLT 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+LDA+ +T Y I +DY+ R+M G +SA+DADS EG +EG FY
Sbjct: 281 IAFLDAYKITNKELYKEIAMKTIDYVVREMQDKDGGFYSAQDADS---EG----EEGKFY 333
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSAS 375
+ E+ ++LGE F ++ + +GN F+GK++ LI+
Sbjct: 334 TFNPLEIIEVLGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK------- 374
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
E++ + +K+F+ R +R H DDK++ SWN L++ + +A LK++
Sbjct: 375 ----NKEYERHNEKIDNLSKKVFEYRKERTSLHKDDKILTSWNALMVVALTKAYSTLKND 430
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
Y++ + FI +L +E + RL +R+G S +LDDYA
Sbjct: 431 M----------------YLDYSNKCLDFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYA 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FLI ++LYE K+L A+ L + +LF D E G++ + +++ R K+ +D
Sbjct: 474 FLIWAYIELYESTFNMKYLEKALNLNESCIDLFWDYEKSGFYIYGKDSENLIARPKDLYD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV + NL+RLA I +K + + + L ++ +K + M
Sbjct: 534 GAIPSGNSVQLYNLIRLAKITGDNKLE---EMSYKQLKLYVNNVKSSPTGYSFYMLSL-M 589
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ S K ++ + K D ++ N T + + E N+
Sbjct: 590 FELYSTKEIICI-FKEDSDLSAFKELISENFIPNTTFLAKK---------YNEENTIIGF 639
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ DK VCQ+ SCS P+ + L++++L
Sbjct: 640 LNNYKLKEDKTSYYVCQSNSCSQPINNLQKLKDMIL 675
>gi|358371871|dbj|GAA88477.1| DUF255 domain protein [Aspergillus kawachii IFO 4308]
Length = 784
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 235/580 (40%), Positives = 321/580 (55%), Gaps = 35/580 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA +LN F+ IKVDREERPD+D VYM YVQA G GGWPL+
Sbjct: 66 SACHWCHVMEKESFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 125
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP-----GFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+P + GF IL K+ D W ++ +S +Q
Sbjct: 126 VFLTPDLEPVFGGTYWPGPNSSTLTGNETIGFVEILEKLSDVWQTQQLRCRESAKEITKQ 185
Query: 135 LSEALSASASS----NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L E S + ++L L + YD GGF +APKFP P + +
Sbjct: 186 LREFAEEGTHSYQGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFL 245
Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + + D E ++ M + TL MA+GGI DH+G GF RYSV W +PHF
Sbjct: 246 LRLGIYPTAVADIVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHF 305
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
EKMLYDQ QL +VY+DAF +T + D+ YL I P G S+EDADS T
Sbjct: 306 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPT 365
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN ++ +DPH+EF +NV
Sbjct: 366 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNV 423
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
L S A G+ E+ + I+ ++KL D R + R RP LDDK+IV+WNGL I +
Sbjct: 424 LSVKVTPSRLAKDFGLGEEEVVRIIRTAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGA 483
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A+ S + + E ES S + E A A SFI+ +L+++ T +L +R+G
Sbjct: 484 LAKCSALFE-EIES---------SKAVQCREAAAKAISFIKENLFEKSTGQLWRIYRDGG 533
Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT- 539
PGF DDYA+LI GLLD+YE +L +A +LQ ++ FL G GY++T
Sbjct: 534 RGNTPGFADDYAYLIGGLLDMYEATFDDSYLQFAEQLQKYLNDNFLAYVGTTPAGYYSTP 593
Query: 540 ---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
T P LLR+K + P+ N V NL+RL S++
Sbjct: 594 STMTSGAPGPLLRLKTGTESVTPAVNGVIARNLLRLGSLL 633
>gi|297622269|ref|YP_003703703.1| hypothetical protein [Truepera radiovictrix DSM 17093]
gi|297163449|gb|ADI13160.1| protein of unknown function DUF255 [Truepera radiovictrix DSM
17093]
Length = 704
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 222/535 (41%), Positives = 303/535 (56%), Gaps = 50/535 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE+ +A L+N FV++KVDREERPDVD VYM+ VQA+ G GGWP++V
Sbjct: 74 ACHWCHVMAHESFENPEIADLMNAHFVNVKVDREERPDVDAVYMSAVQAMTGSGGWPMTV 133
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
L+PD KP GGTY+PPED+ G PGFK +L + +AW +RD + ++ L++
Sbjct: 134 ALTPDGKPFFGGTYYPPEDRLGHPGFKRVLLSLAEAWRSRRDEVLRAAETLTNHLADLNK 193
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A+ P L + L L +++D + GGFG APKFP + +L +
Sbjct: 194 LPAAGEPSPGALGEEVLAEAVRALQRTFDPQHGGFGGAPKFPPHGALAFLLRRPE----- 248
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
E ++M TL MA GGI D +GGGF RYSVD RW VPHFEKMLYD QL V
Sbjct: 249 ------PEAREMAYVTLDKMAAGGIFDQLGGGFARYSVDARWLVPHFEKMLYDNAQLVGV 302
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A++ T+ Y + L +++R++ P G +SA DADS EG +EG FYVW
Sbjct: 303 YAEAYAQTRRARYREVVEATLAFVQRELTSPEGCFYSALDADS---EG----EEGKFYVW 355
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+ E D+LGE A L K ++ + GN F+G+NVL + +A A + G
Sbjct: 356 RADEF-DVLGEDAALAKVYFGVSAAGN------------FEGRNVLFVPHPPAAVAERFG 402
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ L +R LF++RS+R RP LDDKV+ SWNGL+I +FARA ++L +A
Sbjct: 403 LSEAALAARLARVKRALFEIRSRRTRPGLDDKVLASWNGLMIGAFARAGRVLAEDA---- 458
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
Y+E A AA +R L E RL H+FR G +K G L+DYA L G
Sbjct: 459 ------------YLEAARRAARGVRSALLREG--RLWHTFRGGEAKVEGLLEDYALLGLG 504
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
LL+LY WL+WA+EL F D E GG+F+T + ++++R KE D
Sbjct: 505 LLELYRATLEGPWLLWALELAEVIAARFTDPE-GGFFSTAADAEALVVRPKELFD 558
>gi|452209206|ref|YP_007489320.1| hypothetical protein MmTuc01_0632 [Methanosarcina mazei Tuc01]
gi|452099108|gb|AGF96048.1| hypothetical protein MmTuc01_0632 [Methanosarcina mazei Tuc01]
Length = 690
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 231/684 (33%), Positives = 348/684 (50%), Gaps = 51/684 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
N WCH+M ESFEDE VA L+N+ FVSIKVDREERPD+D +YMT Q + G GGWPL+
Sbjct: 47 NKPDWCHMMAHESFEDEEVAGLMNEAFVSIKVDREERPDIDNIYMTVCQIILGRGGWPLN 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++P KP GTY P ++ + G ++ ++K+ W+++ + + S + E +
Sbjct: 107 IIMTPGKKPFFAGTYIPKNTRFNQIGMLELVPRIKEIWEQQHEEVLDSAEKITSTIQEMI 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S+ L + + E+L S+D+ +GGF APKFP P +I +L + ++ +
Sbjct: 167 KESSGEG-----LGEEVIEEVYEELLSSFDTEYGGFSGAPKFPTPHKISFLLRYWRRSRN 221
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
E M +TL M +GGI+DH+G GFHRYS D W +PHFEKMLYDQ A
Sbjct: 222 -------PEALHMAEYTLDKMRRGGIYDHLGSGFHRYSTDSMWLLPHFEKMLYDQALTAI 274
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T Y ILDY+ RD+ P G + EDAD ++EG +Y+
Sbjct: 275 AYTEAYQVTGKDLYKETAEGILDYVLRDLTSPEGGFYCGEDAD-------VEREEGKYYL 327
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +E+ IL E + L + + L+ GN + + G N+ + A+K
Sbjct: 328 WTLEEIRSILDPEDSELIIKMFNLREEGNFE----EEIRGRETGTNLFYMARSPGSLAAK 383
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ +P+E+ + R KL R +R RP LDDK++ WNGL+I++FA+
Sbjct: 384 MKIPVEEVEKKVKAAREKLLKARYERKRPSLDDKILTDWNGLMIAAFAKG---------- 433
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ V G R Y++ AE AA FI LY L H +R+G + G DDYAFLI
Sbjct: 434 ----YQVFGEQR--YLKAAEKAADFILMALYS-PGDGLLHRYRDGVAGISGTSDDYAFLI 486
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL+LYE G ++L A+ L + E F D GG + T + +++ R KE D A
Sbjct: 487 HGLLELYEAGFKMRYLKAAVSLNSELLECFWDPVNGGLYFTANDSEALIFRKKEFMDSAI 546
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
P+GNS ++NL+RL+ I+A + + A+ F ++ A D
Sbjct: 547 PTGNSFEMLNLLRLSRIIADPGLE---ETADKLERAFSKQIMKAPSGYTQFLSAFDFRLG 603
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
PS + V++ G + D E ML + + NK +I + E+ ++ +
Sbjct: 604 PSYE-VIISGKAEASDTEQMLKELWSYFVPNKVLIFRPEREKPEITELAKYTEEQVPI-- 660
Query: 679 NNFSADKVVALVCQNFSCSPPVTD 702
K A VCQN+ C P T+
Sbjct: 661 ----EGKATAYVCQNYECQLPTTE 680
>gi|327357546|gb|EGE86403.1| DUF255 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 833
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 242/638 (37%), Positives = 339/638 (53%), Gaps = 64/638 (10%)
Query: 11 KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
K R FL + CHWCHVME ESF VA +LN F+ IK+DREERPD+D+VYM YV
Sbjct: 59 KLNRMVFLSIGYSACHWCHVMEKESFMSPEVAAILNKSFIPIKLDREERPDIDEVYMNYV 118
Query: 68 QALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPG--------FKTILRKVKDAWDK 119
QA G GGWPL+VFL+PDL+P+ GGTY+P P F IL K++D W
Sbjct: 119 QATTGSGGWPLNVFLTPDLEPVFGGTYWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQT 178
Query: 120 KRDMLAQSGAFAIEQLSE-ALSASASSNKLPDELPQNALRLCA---EQLSKSYDSRFGGF 175
++ +S +QL E A + S K D + L + + +D GGF
Sbjct: 179 QQLRCRESAKDITKQLREFAEEGTHSKQKAADADEDLEVELLEESYQHFASRFDPVNGGF 238
Query: 176 GSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 232
APKF P + ++ S+ + D E S +M TL M++GGIHD +G G
Sbjct: 239 SRAPKFATPANLSFLINLSRYPSAVSDIVGYDECSRALEMATKTLISMSRGGIHDQIGHG 298
Query: 233 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGP 291
F RYSV W +PHFEKMLYDQ QL NVY+DAF + DI Y+ ++ P
Sbjct: 299 FARYSVTADWSLPHFEKMLYDQAQLLNVYVDAFDSAHNPELLGAIYDIATYITSPPILSP 358
Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDL 350
G +S+EDADS T T K+EGAFYVWT KE + ILG+ A + H+ + P GN +
Sbjct: 359 TGGFYSSEDADSLPTPSDTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGN--V 416
Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHL 409
+R +DPH+EF +NVL + A + G+ E+ + I+ R KL + R SKR RP L
Sbjct: 417 ARGNDPHDEFINQNVLSIKVTPAKLAKEFGLSEEEVVKIIKASREKLREYRESKRVRPGL 476
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
DDK+IVSWNGL I + A+ S +L++ V + +E+ AE+AA FIR++L+
Sbjct: 477 DDKIIVSWNGLAIGALAKCSVVLEN----------VDRAKAQEFRLAAENAAKFIRQNLF 526
Query: 470 DEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
D + +L +R+G PGF DDY++L SGL+DLYE +L +A +LQ + F
Sbjct: 527 DPASGQLWRIYRDGERGDTPGFADDYSYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYF 586
Query: 529 LDR---------------------EGGGYFNT------TGEDPSVLLRVKEDHDGAEPSG 561
L + GY+ T P+ L R+K D + PS
Sbjct: 587 LAQGPTPTPSPRTSITTESTPAPSSSTGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSP 646
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
N V NL+RL++++ + D Y++ A ++ F +
Sbjct: 647 NGVIAQNLLRLSTLL---EDDTYKRLARETVNAFAVEI 681
>gi|225848123|ref|YP_002728286.1| thymidylate kinase [Sulfurihydrogenibium azorense Az-Fu1]
gi|225644610|gb|ACN99660.1| thymidylate kinase [Sulfurihydrogenibium azorense Az-Fu1]
Length = 684
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 243/695 (34%), Positives = 363/695 (52%), Gaps = 67/695 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME ESFEDE VA++LN +FV IKVDREERPD+D VYM G GGWPL+
Sbjct: 51 SSCHWCHVMEKESFEDEEVAEILNKYFVPIKVDREERPDIDAVYMNVCMLFNGSGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
+ ++PD KP GTYFP + R G +L V W + K D++++S E++
Sbjct: 111 IIMTPDKKPFFAGTYFPKHSRPNRIGVVDLLLSVAKYWQENKEDLISRS-----EKVLGY 165
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSK 195
L SN EL ++ + L +D+ +GGF + PKFP P I +L YH+K
Sbjct: 166 LKEDNKSNY--GELKKDYIHAGFYDLKGRFDNTYGGFSNKPKFPTPHNIMFLLRYYYHTK 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ E +MV TL M GGI+DHVG GFHRYS D +W +PHFEKM YDQ
Sbjct: 224 E----------EEALQMVEKTLTNMRLGGIYDHVGFGFHRYSTDRQWLLPHFEKMHYDQA 273
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
L Y + + +TK Y ++I++Y+ RDM G FSAEDADS EG +EG
Sbjct: 274 MLLMAYTETYQITKKDLYKQTVQEIIEYVIRDMTNEEGVFFSAEDADS---EG----EEG 326
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY WT +E++DIL E + L + + +K GN P G+N++
Sbjct: 327 KFYTWTFQEIKDILKEESDLAIKIFNIKEEGNYLEEATGHP----TGRNIIYLSKTLRDY 382
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A LG+ L + R+KLF R KR P DDKV+ WNGL+I++ ++A K ++
Sbjct: 383 AIDLGIDENTLKQKLEQIRKKLFKEREKRVHPLKDDKVLTDWNGLMIAALSKAGKAFSNQ 442
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+Y+ A+ AA FI ++ + +L H +++ K G LDDYA
Sbjct: 443 ----------------DYISYAQKAADFIIHNMIIDG--KLYHLYKDKEVKIEGMLDDYA 484
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FL+ GL++LY+ K+L A++L N + D + GG+F + +D +++ KE D
Sbjct: 485 FLVWGLIELYQATGELKYLKTAVDLTNKAIQPLYDEKNGGFFLSKSQD--LIVNPKESFD 542
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNSV NL RL I A + ++Y+++ E +L F +K + + A M
Sbjct: 543 GAIPSGNSVMAYNLYRLYLITA--QEEFYKKSYE-TLTAFAGDIKRLPSYHTMFLIALMM 599
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
P+ + +++ K ++ N L + + N +I P + EE+ S +
Sbjct: 600 HFFPTSE--IVISGKGWIEALNQL---NREFLPNTVIIVKTPENKEEL-------SKISH 647
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ + +C+NF+C+ P D + N+L
Sbjct: 648 YTQSMEVPEDFYIYLCKNFACNLPTKDLEYVINML 682
>gi|119495483|ref|XP_001264525.1| hypothetical protein NFIA_013170 [Neosartorya fischeri NRRL 181]
gi|119412687|gb|EAW22628.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 805
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 241/615 (39%), Positives = 335/615 (54%), Gaps = 52/615 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA LLN+ F+ IKVDREERPD+D VYM YVQA G GGWPLS
Sbjct: 69 SACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLS 128
Query: 80 VFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+P+L+P+ GGTY+P + + GF IL K++D W ++ S Q
Sbjct: 129 VFLTPNLEPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAKEITRQ 188
Query: 135 L---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L +E + S ++ DE L L + + YD+ GGF APKFP P + +
Sbjct: 189 LREFAEEGTHSQQGDRQTDEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPANLSFL 248
Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + + D E + M + TL MA+GGI DH+G GF RYSV W +PHF
Sbjct: 249 LRLKTYPSAVSDIVGQEECDKAAAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHF 308
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
EKMLYDQ QL +VY+DAF +T + D+ YL I P G S+EDADS T
Sbjct: 309 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDADSLPT 368
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN ++ DPH+EF +NV
Sbjct: 369 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQNV 426
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISS 424
L S A + G+ E+ + I+ ++KL + R + R RP LDDKVIV+WNGL I +
Sbjct: 427 LSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYRETTRVRPDLDDKVIVAWNGLAIGA 486
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A+ S + + E ES S + E A A +FI+ +L+++ T +L +R+G
Sbjct: 487 LAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRIYRDGS 536
Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT-----------------QDE 526
+ PGF DDYA+LI GLLD+YE +L +A +LQ+ ++
Sbjct: 537 RGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTHAEYLND 596
Query: 527 LFLDREG---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
FL G GY++T T P LLR+K + A PS N V NL+RL++++
Sbjct: 597 NFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALLEEE 656
Query: 580 KSDYYRQNAEHSLAV 594
+ + HS +V
Sbjct: 657 EYRTLARQTCHSFSV 671
>gi|46446752|ref|YP_008117.1| hypothetical protein pc1118 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46400393|emb|CAF23842.1| conserved hypothetical protein [Candidatus Protochlamydia
amoebophila UWE25]
Length = 718
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 226/568 (39%), Positives = 320/568 (56%), Gaps = 54/568 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG-GWPLS 79
TCHWCHVME ESFED VA +N FVSIKVDREE P+VD +YM + Q++ G GWPL+
Sbjct: 85 TCHWCHVMERESFEDIEVADSMNQTFVSIKVDREELPEVDSLYMEFSQSMMAGAAGWPLN 144
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAFAIEQLSEA 138
V L+PDL+P TY P +G G +++++ + W ++R+ + +E S+A
Sbjct: 145 VILTPDLQPFFATTYLPSHSSHGMMGLIDLIQRIAELWSSEEREKIITQAEKIVEVFSKA 204
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + +PDE + + A+ L K D +GG APKFP + ML + ++
Sbjct: 205 VHTTGED--IPDE---EQISITADLLYKMADPTYGGIKGAPKFPIGYQYSFMLRYYANMK 259
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D S +V TL + +GGI+DH+GGGF RYS+DE+W VPHFEKMLYD LA
Sbjct: 260 D-------SRALFLVERTLDMLHRGGIYDHLGGGFSRYSIDEKWLVPHFEKMLYDNAILA 312
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL+A+ LTK Y + ++IL+Y+ RDM G +SAEDADS EG EG FY
Sbjct: 313 QSYLEAWQLTKKNLYKEVAQEILNYILRDMTYSDGGFYSAEDADS---EG----HEGFFY 365
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W +EV++ILG+H+ LF E+Y + GN F+G+N+L + ASK
Sbjct: 366 TWKEEEVKEILGDHSQLFCEYYDITAEGN------------FEGRNILHTPLNLEEFASK 413
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+++ I R+KL+ R KR P DDK++ SWNGL+I SFA A +
Sbjct: 414 HQQDIDQLRIIFDNQRKKLWSAREKRIHPLKDDKILSSWNGLMIYSFAEA---------A 464
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
F+ P+ Y+E A AA FI+ L+ Q +L +R G + LD+YAF+I
Sbjct: 465 FTFDCPL-------YLEAAVKAARFIKNKLWKNQ--KLLRRWREGQAMFQAGLDEYAFMI 515
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
G L L+E +GT+WL WAIE+ + + E G ++ T G D ++LLR + DGAE
Sbjct: 516 KGALSLFEANAGTEWLEWAIEMATLLKDQY-KAEEGAFYQTDGGDKNLLLRKCQFSDGAE 574
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQ 586
PSGN+V NL+RL + ++ DY Q
Sbjct: 575 PSGNAVHCENLLRLYQLT--NEEDYLAQ 600
>gi|326474295|gb|EGD98304.1| hypothetical protein TESG_05683 [Trichophyton tonsurans CBS 112818]
gi|326479253|gb|EGE03263.1| DUF255 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 774
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 227/605 (37%), Positives = 334/605 (55%), Gaps = 42/605 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN F+ IK+DREERPD+D VYM YVQA G GGWPL+
Sbjct: 69 SACHWCHVMEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLN 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
VFL+PDL+P+ GGTY+P + P GF +L K++D W+ ++ +S
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEI 188
Query: 132 IEQLSEALS-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
QL E + + ++ ++L + L + YD+ GGF +PKFP PV
Sbjct: 189 TRQLREFAEEGIHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVN 248
Query: 187 IQMMLYHSKKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
+ +L S+ E D E ++ +M + T+ +A+GGI D +G GF RYSV W
Sbjct: 249 LSFLLRLSRYPEEVMDIVGREECAKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWS 308
Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDAD 302
+PHFEKMLYDQ QL +V++D F + + D++ Y+ ++ P G +S+EDAD
Sbjct: 309 LPHFEKMLYDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSPPILSPMGCFYSSEDAD 368
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
S + T K+EGA+YVWT KE++ ILG+ A + H+ + P GN ++R++DPH+EF
Sbjct: 369 SQPSPEDTEKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFM 426
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGL 420
+NVL + A + G+ E+ + IL R KL + R +KR RP LDDK+IV+WNGL
Sbjct: 427 NRNVLRIATTPAQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGL 486
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
VI + A+ + +L+ + K +A +A FI+ +L+D ++ +L +
Sbjct: 487 VIGALAKCAILLED----------IDAEKSKHCRLMAGNAVKFIKENLFDAESGQLWRIY 536
Query: 481 R-NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG----- 534
R + PGF DDYA+LISGLL LYE L +A +LQ ++ F+
Sbjct: 537 RADSRGDTPGFADDYAYLISGLLQLYEATFDDAHLQFADKLQQYLNKYFISVSASDSSIC 596
Query: 535 -GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
G++ T E PS L R+K D A PS N V NL+RL+S++ +
Sbjct: 597 TGFYMTPSEAVTDTPSALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDESYKLKARQTC 656
Query: 590 HSLAV 594
H+ AV
Sbjct: 657 HAFAV 661
>gi|325107403|ref|YP_004268471.1| hypothetical protein Plabr_0826 [Planctomyces brasiliensis DSM
5305]
gi|324967671|gb|ADY58449.1| protein of unknown function DUF255 [Planctomyces brasiliensis DSM
5305]
Length = 686
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 227/607 (37%), Positives = 334/607 (55%), Gaps = 51/607 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE++ +A L+N WFV++KVDREERPD+D++YMT VQ + G GGWP+S
Sbjct: 52 SACHWCHVMERESFENDQIAALMNQWFVNVKVDREERPDIDQIYMTAVQLVTGQGGWPMS 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P +P GGTY+PP ++G PGF IL+K+ W++ R+ GA +L A+
Sbjct: 112 VFLAPSGEPFYGGTYWPPTSRHGMPGFADILQKIHQYWEEHREECLAKGA----ELVTAI 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ L ++ LR +L +S D + GGFG APKFP P++++++L ++
Sbjct: 168 DQLHHHEQEKSPLQEDLLRHAQHRLMQSADMQEGGFGHAPKFPHPIDLRVLLRSWRRF-- 225
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
GE E + +V TL MA GGI+DH+ GGF RYS D W VPHFEKMLYD QLA
Sbjct: 226 ----GEV-ESRNVVTLTLDKMADGGIYDHLAGGFARYSTDRYWLVPHFEKMLYDNSQLAT 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+ + T + Y+ + R+ LD++ RDM +S DADS EG EG FYV
Sbjct: 281 AYLEGYQATGEERYAEVVRETLDFVLRDMTSSEHGFYSTLDADS---EGV----EGKFYV 333
Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W+ EV+++L + A FK Y + GN ++G N+L A +
Sbjct: 334 WSEAEVDELLEAKAAEWFKHVYNVSAQGN------------WEGHNILHRTKPLQELAGE 381
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG E L + R L VR +R P D+K+IV+WNGL++S+FA+A +IL
Sbjct: 382 LGTDRETLSASLMQSRETLLKVREQRIWPGRDEKIIVAWNGLMLSAFAQAGRIL------ 435
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
G DR Y + A +AA F+ L E L H ++G ++ GFLDDYA L+
Sbjct: 436 --------GEDR--YTQAACNAADFLLDTLRREDG-SLWHCRKDGRNRFNGFLDDYACLV 484
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL DLY K+L A+EL + LF D E + T + +++RV++ +D A
Sbjct: 485 DGLNDLYLTTLEPKYLQAALELADVMQRLFYDDEQKAFHYTPSDHEELVVRVRDRYDSAI 544
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSG ++++ L++L I + DY + + L ++ + A D+L
Sbjct: 545 PSGTNLAIHALLKLGWIAG--REDYVTRAGD-CLDSVSGTMRQQPSGMGQAVVALDLLLG 601
Query: 619 PSRKHVV 625
P+ + ++
Sbjct: 602 PTEEFIL 608
>gi|415885100|ref|ZP_11547028.1| hypothetical protein MGA3_07690 [Bacillus methanolicus MGA3]
gi|387590769|gb|EIJ83088.1| hypothetical protein MGA3_07690 [Bacillus methanolicus MGA3]
Length = 625
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 245/689 (35%), Positives = 366/689 (53%), Gaps = 74/689 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFEDE VAKLLN+ FVSIKVDREERPD+D +YM Q + G GGWPLSVF++PD K
Sbjct: 1 MERESFEDEEVAKLLNERFVSIKVDREERPDIDSIYMNICQLMNGHGGWPLSVFMTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP E +YG PGFK ++ ++ D + K R + + + A E L + SA SS +
Sbjct: 61 PFFAGTYFPKESRYGVPGFKDVITQLYDQYMKNRSHIEKIASDAAEALKQ--SARESSAE 118
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
LP + L +QL+ S++S +GGFG APKFP P + +L + K TG
Sbjct: 119 LPS---VDVLHKTYQQLAGSFNSVYGGFGDAPKFPIPHHLMFLLKYYKW---TG----TE 168
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
KMV TL MA GGI+DH+G GF RYSVD W VPHFEKMLYD L Y +A+ +
Sbjct: 169 MALKMVEKTLVSMANGGIYDHIGFGFARYSVDAMWLVPHFEKMLYDNALLLYTYSEAYQV 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
TK+ Y I I++++ R+M G FSA DADS EG +EG +YVW+ +E+ D
Sbjct: 229 TKNSKYKEIAEQIIEFITREMTNEEGAFFSAIDADS---EG----EEGKYYVWSKEEILD 281
Query: 328 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 385
+LGE F C + ++ N F+GKN+ LI N + ++ G+ LE+
Sbjct: 282 VLGEKDGEF----------YCKVYDITSGGN-FEGKNIPNLIHTN-MVKTFAEAGLKLEE 329
Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
L E R+KLF+ R +R PHLDDK++ SWN L+I+ A+A + +++
Sbjct: 330 GKAKLEESRQKLFEKRQERVYPHLDDKILTSWNALMIAGLAKAGQAFQNQ---------- 379
Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
+Y+E AE A FI L L +R+G SK +LDD+AFL+ L+LY
Sbjct: 380 ------DYVEKAEKALRFIEEKLM--VNGELMARYRDGESKYSAYLDDWAFLLWAYLELY 431
Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
E ++L A +LF D + GG++ T + ++++R K+ +DGA PSGNSV+
Sbjct: 432 EATFSMEYLDKAQNTAEKMKKLFWDEQDGGFYFTRSDGEALIVREKQVYDGALPSGNSVA 491
Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
+N +RL +K + + F+ ++ + + + P + V+
Sbjct: 492 AVNFLRLGHFTGETK---WFDVVDEIHRFFKDDVESYGPGHTFLLQSLLLKEFPMSEVVI 548
Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA-- 683
+ + + ++ A+ I P + ++ + + + ++A
Sbjct: 549 VGTPEKRSELAGIIQKAYTP--------EIAPVTS-------KNQEDLVKIYQRGYTATD 593
Query: 684 DKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ +C+NF+C P+ D LE++L E
Sbjct: 594 SDLTVYICENFTCQKPMND---LEDVLKE 619
>gi|451982157|ref|ZP_21930485.1| conserved hypothetical protein, contains Thioredoxin domain
[Nitrospina gracilis 3/211]
gi|451760626|emb|CCQ91765.1| conserved hypothetical protein, contains Thioredoxin domain
[Nitrospina gracilis 3/211]
Length = 727
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 235/697 (33%), Positives = 357/697 (51%), Gaps = 64/697 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFE E AKL+N+ FV+IKVDREERPD+D +YM V AL G GGWP+S
Sbjct: 53 SSCHWCHVMAHESFESEETAKLMNELFVNIKVDREERPDIDAIYMKSVIALNGHGGWPMS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P +GGTY+PPE K+ RPGF +L++ D + ++D + A +E+L+
Sbjct: 113 VFLTPEQEPYLGGTYYPPEPKFNRPGFPQVLQQAADIYRNQKDRMKSVSARLMEKLTTPP 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
D L A+ L E+ +D +GGFGS KFP P+ ++L H +K ED
Sbjct: 173 PIPQGQGAGTDALIPQAVELMKEK----FDETYGGFGSGMKFPEPMLYTLLLRHWQKRED 228
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++ M +L MA+GG++D VGGGFHRYS D +W VPHFEKMLYD LA
Sbjct: 229 -------NDAILMADKSLTKMAEGGMYDQVGGGFHRYSTDRKWLVPHFEKMLYDNALLAR 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
++++ F TK Y I R++ Y+ R+M P +S++DAD T EG F+
Sbjct: 282 LFVEMFQATKQEIYERIAREVFHYIGREMTSPEWAFYSSQDAD-------TDAGEGHFFT 334
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT KEV DILG H+ +F Y + TGN F+ +NVL +
Sbjct: 335 WTMKEVLDILGPRHSKVFARVYGMTATGN------------FEKRNVLHIAETMEKVSES 382
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+P+ + +I+ R+ L + R KR P DDK++ WNG++I++FA + + +
Sbjct: 383 EGVPIFEVDHIIRNGRQTLLESRGKRQNPGRDDKILTGWNGMMIAAFAAGAVVFRDRV-- 440
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
Y + A AA F+ ++ + +L +++G + G L+DYA+ I
Sbjct: 441 --------------YRDHAVQAARFLWDTMWKDG--KLFRVYKDGKVRVDGCLEDYAWFI 484
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL ++E +W+ A + + + F D + G+F T + ++ R+K D A
Sbjct: 485 EGLLGVFEATGEGEWIDKAQAVADALIDRFWDDKDNGFFMTAADQEKLITRLKNPEDEAI 544
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML-S 617
PS N V+ + L +L + D Y + ++ F R++ A + A D + S
Sbjct: 545 PSANGVAALALAKLGRLTG---KDAYFEKGRDTVRAFADRIEHRPTAYTSLLAAMDFIES 601
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+P V + G + + +L A +A Y +K V+ T + W E
Sbjct: 602 LPM--EVTISGPEGDPQYGKLLEAVYADYRPDKLVVRYSGDATVQRVPWAE--------G 651
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
R S V VC+ +C PPV D +L N + P
Sbjct: 652 RGPVSGQPTV-YVCRQGTCYPPVHDAEALMNQMGRPP 687
>gi|327293790|ref|XP_003231591.1| hypothetical protein TERG_07891 [Trichophyton rubrum CBS 118892]
gi|326466219|gb|EGD91672.1| hypothetical protein TERG_07891 [Trichophyton rubrum CBS 118892]
Length = 774
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 226/605 (37%), Positives = 333/605 (55%), Gaps = 42/605 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN F+ IK+DREERPD+D VYM YVQA G GGWPL+
Sbjct: 69 SACHWCHVMEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLN 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
VFL+PDL+P+ GGTY+P + P GF +L K++D W+ ++ +S
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNATPLPKLGGEDPVGFIDVLEKLRDVWNTQQLRCRESAKEI 188
Query: 132 IEQLSEALS-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
QL E + + ++ ++L + L + YD+ GGF +PKFP PV
Sbjct: 189 TRQLREFAEEGIHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVN 248
Query: 187 IQMMLYHSKKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
+ +L S+ E D E ++ +M + T+ +A+GGI D +G GF RYSV W
Sbjct: 249 LSFLLRLSRYPEEVMDIVGREECAKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWS 308
Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDAD 302
+PHFEKMLYDQ QL +V++D F + + D++ Y+ ++ P G +S+EDAD
Sbjct: 309 LPHFEKMLYDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSPPILSPKGCFYSSEDAD 368
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
S + T K+EGA+YVWT KE++ ILG+ A + H+ + P GN ++R++DPH+EF
Sbjct: 369 SQPSPEDTEKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFM 426
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGL 420
+NVL + A + G+ E+ + IL R KL + R +KR RP LDDK+IV+WNGL
Sbjct: 427 NRNVLRIATTPAQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGL 486
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
VI + A+ + +L+ + K +A +A FI+ +L+D ++ +L +
Sbjct: 487 VIGALAKCAILLED----------IDAEKSKHCRLMAGNAVKFIKENLFDAESGQLWRIY 536
Query: 481 R-NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG----- 534
R + PGF DDYA+LISGLL LYE L +A +LQ ++ F+
Sbjct: 537 RADSRGDTPGFADDYAYLISGLLQLYEATFDDAHLQYADKLQQYLNKYFISVSASDSSIC 596
Query: 535 -GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
G++ T E P L R+K D A PS N V NL+RL+S++ +
Sbjct: 597 TGFYMTPSEAVTDTPGALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDESYKLKARQTC 656
Query: 590 HSLAV 594
H+ AV
Sbjct: 657 HAFAV 661
>gi|149174989|ref|ZP_01853613.1| hypothetical protein PM8797T_11454 [Planctomyces maris DSM 8797]
gi|148846326|gb|EDL60665.1| hypothetical protein PM8797T_11454 [Planctomyces maris DSM 8797]
Length = 876
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 228/630 (36%), Positives = 340/630 (53%), Gaps = 58/630 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY------GG 73
++C+WCHVME FE+ +AK +N+ FV+IKVDREERPD+D +YMT + +
Sbjct: 103 SSCYWCHVMERLVFENPEIAKYMNENFVNIKVDREERPDIDDIYMTSLSVYFHLIGAPDN 162
Query: 74 GGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIE 133
GGWPLS+FL+PD +P GGTYFPP D+ G+ F +L+KV + W + + QS +
Sbjct: 163 GGWPLSMFLTPDREPFAGGTYFPPTDQGGQMSFPRVLQKVNELWSGDKAKVQQSATIIAK 222
Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEI 187
+++ ++ +P E ++ ++ S+DS +GG + PKFP ++
Sbjct: 223 EVARLQKEEGATEAIPIE--DRLVKAGVRSINASFDSEYGGIDFSEVSPNGPKFPTSSKL 280
Query: 188 QMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
++ Y + ++ S E++ K++ TL MA GGI+DH+GGGFHRYS D WHVPHF
Sbjct: 281 VLLQYDIESMDAESTSAESA---KVLYQTLDAMANGGIYDHLGGGFHRYSTDRYWHVPHF 337
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETE 307
EKMLYD GQLA++Y A+ T + Y + I+D++ R++ G +SA D AET+
Sbjct: 338 EKMLYDNGQLASLYAKAYGQTGNEQYKQVAAGIIDFVLRELTDTQGGFYSALD---AETD 394
Query: 308 GATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
G EG Y W+ +E+++IL E LF E Y L ++P F+ VL
Sbjct: 395 GV----EGEHYAWSQEELKEILDEGYPLFAEFYGL-----------NEP-VRFEHGYVLH 438
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
+ A A K E + L R+KL VR++R DDK++ SWNGL+I+ A
Sbjct: 439 RVTTLKALAEKQKTTPEALESQLAAMRKKLHTVRNQRQPLLKDDKILTSWNGLMITGMAN 498
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
A +ILK R +Y AE AA FI + D+Q H L S+R ++
Sbjct: 499 AGRILK----------------RPDYTAAAEKAAQFILDQMRDKQGH-LYRSYRADQARL 541
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
+LDDYAFL+ GLL LYE +WL A L + Q +LF D++ G+F TT + ++
Sbjct: 542 NAYLDDYAFLVQGLLALYEATGKQQWLDQAQALTDLQIKLFWDQKEHGFFFTTHDHEQLI 601
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA-MAV 606
R K +D A PSGNS+S NL++L + K YRQ+A+ +L +F +K
Sbjct: 602 ARTKNAYDAAIPSGNSISTRNLIQLTQLTGDPK---YRQHADQTLQLFGRVIKRYPNRCA 658
Query: 607 PLMCCAADMLSV-PSRKHVVLVGHKSSVDF 635
L+ + L+ P++K L+ S F
Sbjct: 659 QLVQAVGEFLTTPPAQKQSALLAPTSDAGF 688
>gi|296816653|ref|XP_002848663.1| DUF255 domain-containing protein [Arthroderma otae CBS 113480]
gi|238839116|gb|EEQ28778.1| DUF255 domain-containing protein [Arthroderma otae CBS 113480]
Length = 781
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 232/610 (38%), Positives = 336/610 (55%), Gaps = 45/610 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN F+ IK+DREERPD+D VYM YVQA G GGWPL+
Sbjct: 69 SACHWCHVMEKESFMSLEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLN 128
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFA 131
VFL+PDL+P+ GGTY+P + P GF +L K++D W+ ++ +S
Sbjct: 129 VFLTPDLEPVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEI 188
Query: 132 IEQLSEALS-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
QL E A A+ + ++L L + YD+ GGF ++PKFP PV
Sbjct: 189 TRQLREFAEEGTHLAQANKKEQMEDLEIELLEEAFVHFAARYDATNGGFSTSPKFPTPVN 248
Query: 187 IQMMLYHSKKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
+ +L S+ E D E ++ +M + TL +A+GGI D +G GF RYSV W
Sbjct: 249 LSFLLRLSRYPEEVMDIVGREECTKATEMAVNTLIKVARGGIRDQIGYGFSRYSVTPDWS 308
Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDAD 302
+PHFEKMLYDQ QL +VY+D F + + D++ Y+ ++ P G +S+EDAD
Sbjct: 309 LPHFEKMLYDQAQLLDVYIDGFEASHEPELLGAIYDLVTYITSPPILSPMGCFYSSEDAD 368
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK 361
S + T K+EGA+YVWT KE++ ILG A + H+ + P GN ++R++DPH+EF
Sbjct: 369 SQPSPDDTDKREGAYYVWTLKELKQILGHRDADVCARHWGVLPDGN--VARVNDPHDEFM 426
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGL 420
+NVL + A + G+ E+ + IL R KL + R +KR RP LDDK+IVSWNGL
Sbjct: 427 NRNVLRIATTPAQVAKEFGLHEEETIRILKNSRVKLREYRETKRVRPELDDKIIVSWNGL 486
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
VI + A+ + +L+ + K +A +A FI+ +L D ++ +L +
Sbjct: 487 VIGALAKCAILLED----------IDAEKSKHCKLMASNAVKFIKENLLDAESGQLWRIY 536
Query: 481 R-NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG----- 534
R + PGF DDYA+LISGL+ LYE +L +A +LQ ++ F+
Sbjct: 537 RADSRGNTPGFADDYAYLISGLIQLYEATFDDSYLQFADKLQQYLNKYFISVSTSDSSIC 596
Query: 535 -GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
GY+ T E PS L R+K D A PS N V NL+RL+S++ + + Y+ A
Sbjct: 597 TGYYMTPSEAVTNTPSALFRLKTGTDSATPSTNGVIAQNLLRLSSLL---EDESYKVKAR 653
Query: 590 HSLAVFETRL 599
+ F +
Sbjct: 654 QTCNAFAVEI 663
>gi|404329401|ref|ZP_10969849.1| hypothetical protein SvinD2_04859 [Sporolactobacillus vineae DSM
21990 = SL153]
Length = 731
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 260/708 (36%), Positives = 362/708 (51%), Gaps = 82/708 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED+ A LLN+ +VSIKVDREERPD+D VYM Q L G GGWPL+
Sbjct: 94 SACHWCHVMAGESFEDQETAALLNENYVSIKVDREERPDIDAVYMKVCQTLTGQGGWPLN 153
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD P GTYFP YG P FK +LR++K +D+ D +A G+ Q+ AL
Sbjct: 154 VFLTPDQTPFYAGTYFPLHAAYGHPAFKDVLRELKKQYDQNPDKIAAIGS----QIMTAL 209
Query: 140 S-ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ S S KL DE +R E LS+++D RFGGFG APKFP P ++ +L
Sbjct: 210 AKQSRSGRKLTDE----TVRKAYEALSENFDPRFGGFGDAPKFPAPHQLIFLLRFGSL-- 263
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TGK + M + TL+ +A+GGI DH+GGGF RY+ D +W VPHFEKMLYDQ LA
Sbjct: 264 -TGK----KQAMDMAVRTLRALAEGGIRDHIGGGFCRYATDRQWQVPHFEKMLYDQAMLA 318
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+ +A+ T + + + I DY RD++ P G + +EDADS EG +EG +Y
Sbjct: 319 AAFTEAYQATGEAAFRDVVATIFDYCERDLLSPAGGFYCSEDADS---EG----EEGKYY 371
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
+W EV +LG A LF E Y++ GN S PH G ++ A A+
Sbjct: 372 LWNPGEVRAVLGADAGLFCEVYHITDAGN--FHGQSIPH--LSGSDL-----GRIAEANH 422
Query: 379 LGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
L +P LN L R KLF R KR P DDK++ SWN L+I+ A A ++L +
Sbjct: 423 LSLPA---LNQQLAASRHKLFAARQKRVHPFKDDKILTSWNALMIAVLAEAGRVLHN--- 476
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
K Y+ +A+S FI HL + T L +R+ ++ +LDDYAFL
Sbjct: 477 -------------KHYVNLAKSCFHFIDTHLVQDST--LLARYRDEEARFSAYLDDYAFL 521
Query: 498 ISGLLDLYEFGSGTKWL----VWAIELQNTQDELFLDREGGGYFNTTGEDP--SVLLRVK 551
+YE +L VW + F+DRE GG+F E+P ++++R K
Sbjct: 522 TLACEAMYEATFDLTYLEKMKVWGDRMTGR----FMDREHGGFFM---EEPQSTLIIRNK 574
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
E +D A PSGNS +V+ L+RL+ +Y A + A + + M
Sbjct: 575 EAYDSAVPSGNSAAVLALLRLSERTGDQNYIHYADQAFAAFA---DEVSEYPAGYTFMLS 631
Query: 612 AADM-LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A + LS PS + V L G K L ++ Y + DP +
Sbjct: 632 ALMLRLSGPS-ELVALQGAKGEAAVAE-LRSSDLPYLPGLALYAGDPCRL---------S 680
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
+ N ++ + A + CQNF C PVT+ L+ L ++ T+
Sbjct: 681 AFNENIGIYSPIAGRTTYFFCQNFICHLPVTEFAKLKTQLNDEAQKTS 728
>gi|242806544|ref|XP_002484765.1| DUF255 domain protein [Talaromyces stipitatus ATCC 10500]
gi|218715390|gb|EED14812.1| DUF255 domain protein [Talaromyces stipitatus ATCC 10500]
Length = 791
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 239/612 (39%), Positives = 335/612 (54%), Gaps = 51/612 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF VA +LN+ F+ IKVDREERPD+D VYM YVQA G GGWPL+
Sbjct: 70 SACHWCHVMEKESFMSTEVATILNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 129
Query: 80 VFLSPDLKPLMGGTYFP-----PEDKYGRP---GFKTILRKVKDAW--------DKKRDM 123
VFL+PDL+P+ GGTY+P + ++G GF IL K++D W D +++
Sbjct: 130 VFLTPDLEPVFGGTYWPGPHSSSQSQWGVEGPIGFVDILEKLRDVWQTQQARCLDSAKEI 189
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 183
Q FA E A + L EL + A + + YD +GGFG APKFP
Sbjct: 190 TKQLREFAEEGTHVQQGAKSGGEDLEIELIEEAF----QHFASRYDPVYGGFGRAPKFPT 245
Query: 184 PVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
P + ++ + + D E M TL +A+GGI DH+G G RYSV
Sbjct: 246 PANLGFLIRLGMYPTAVSDIVGQDECVRATAMATKTLLNIARGGIRDHIGHGVARYSVTT 305
Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAE 299
W +PHFEKMLYDQ QL +VY+DAF T + D++ YL + I G +S+E
Sbjct: 306 DWLLPHFEKMLYDQAQLLDVYVDAFRATHEPELLGAVYDLVSYLTSEPIQASTGGYYSSE 365
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHN 358
DADS + T K+EGAFYVWT KE++ +LG+ A + H+ + GN ++ +DPH+
Sbjct: 366 DADSLPSPNDTEKREGAFYVWTLKELKQVLGQRDAGVCARHWGVLADGN--IAPENDPHD 423
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSW 417
EF +NVL S A + G+ E+ + I+ ++KL + R K R RP LDDK+I +W
Sbjct: 424 EFMDQNVLSIKVTPSKLAKEFGLSEEEVIKIIKSGKQKLREYREKARVRPDLDDKIIAAW 483
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL I + A+AS IL E ++ ++ + A+ A FI+ L++ T +L
Sbjct: 484 NGLAIGALAKAS-ILLEEIDTI---------KAQQCRDSAQRAVEFIKTTLFEPSTGQLW 533
Query: 478 HSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL-----DR 531
+R+G PGF DDYAFLISGL+ +YE +L +A +LQ ++ F+
Sbjct: 534 RIYRDGSRGNTPGFADDYAFLISGLITMYEATFDDSYLQFAEQLQEHLNKYFIAPGDEPD 593
Query: 532 EGGGYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
GY+ T+ E +P LLR+K D A PS N + NLVRL S++ + D YRQ
Sbjct: 594 TYAGYYTTSSEPIPDEPGPLLRLKSGTDSATPSINGIIARNLVRLGSLL---EDDTYRQL 650
Query: 588 AEHSLAVFETRL 599
A + + F L
Sbjct: 651 ARQTCSTFSVEL 662
>gi|390452556|ref|ZP_10238084.1| hypothetical protein PpeoK3_00885 [Paenibacillus peoriae KCTC 3763]
Length = 628
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 247/692 (35%), Positives = 357/692 (51%), Gaps = 74/692 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFEDE +A++LN +VSIKVDREERPDVD +YM+ Q + G GGWPL++ ++PD K
Sbjct: 1 MERESFEDEEIAEILNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLTILMTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTY P E K+GR G +L KV W ++ + L +E + L+ +
Sbjct: 61 PFFAGTYLPKEQKFGRIGLLELLDKVGTRWKEQPEEL-------VELSEQVLTEHERQDM 113
Query: 148 LP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
L EL + +L Q S ++D +GGFG APKFP P + +L +++ SG
Sbjct: 114 LAGYRGELDEQSLNKAFHQYSHTFDKEYGGFGEAPKFPAPHNLSFLLRYAQ------HSG 167
Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
+ +M TL M +GGI+DHVG GF RYSVDE+W VPHFEKMLYD LA Y +
Sbjct: 168 N-QQALEMAEKTLDAMYRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNALLAIAYTET 226
Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
+ +T Y I I Y+ RDM GG +SAEDADS EG +EG FYVW E
Sbjct: 227 WQVTGKGLYRQIAEQIFTYIARDMTDVGGAFYSAEDADS---EG----EEGRFYVWNEAE 279
Query: 325 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGM 381
+ +LG+ A F + Y + P GN F+G N+ LI++N A K +
Sbjct: 280 IRAVLGDRDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGLKHDL 326
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
++ + + E R KLF VR KR PH DDK++ SWNGL+I++ A+A +
Sbjct: 327 TKQELEDRVRELRDKLFAVREKRVHPHKDDKILTSWNGLMIAALAKAGQAFGD------- 379
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
V+ Y E A+ A SF+ HL RL +R+G + PG+LDDYAF + GL
Sbjct: 380 ---VI------YTERAQKAESFLWNHL-RRANGRLLARYRDGDAAYPGYLDDYAFYVWGL 429
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
++LY+ ++L A+ L +LF D E G F + ++ + KE +DGA PSG
Sbjct: 430 IELYQATFDVQYLQRALTLNQNMIDLFWDEEHHGLFFYGKDSEQLIAKPKEIYDGAIPSG 489
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
NS++ NLVRLA + ++ + Y A F + A + + + + +
Sbjct: 490 NSIAAHNLVRLARLTGEARLEDY---AAKQFKAFGGMVSYDPSAYSALLSSL-LYATGTT 545
Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID---PADTEEMDFWEEHNSNNASMAR 678
K +V+VG + + A A + N VI D PA + + + ++ +
Sbjct: 546 KEIVVVGQRDDPQTLQFIRAIQAGFRPNTVVILKDAGQPAIADIVPYIHDYTLIDG---- 601
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K +C++F+C PVT L+ LL
Sbjct: 602 ------KPAVYMCEHFACQAPVTSLDDLKALL 627
>gi|350629727|gb|EHA18100.1| hypothetical protein ASPNIDRAFT_47529 [Aspergillus niger ATCC 1015]
Length = 769
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 237/591 (40%), Positives = 324/591 (54%), Gaps = 46/591 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF + VA +LN F+ IKVDREERPD+D VYM YVQA G GGWPL+
Sbjct: 60 SACHWCHVMEKESFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLN 119
Query: 80 VFLSPDLKPLMGGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 134
VFL+PDL+P+ GGTY+P + G GF IL K+ D W ++ +S +Q
Sbjct: 120 VFLTPDLEPVFGGTYWPGPNSSTLTGNGTIGFVEILEKLSDVWQTQQLRCRESAKEITKQ 179
Query: 135 LSEALSASASS----NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
L E S + ++L L + YD GGF +APKFP P + +
Sbjct: 180 LREFAEEGTHSYQGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFL 239
Query: 191 L---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 247
L + + D E ++ M + TL MA+GGI DH+G GF RYSV W +PHF
Sbjct: 240 LRLGIYPTAVADIVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHF 299
Query: 248 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAET 306
EKMLYDQ QL +VY+DAF +T + D+ YL I P G S+EDADS T
Sbjct: 300 EKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPT 359
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
T K+EGAFYVWT KE+ +LG+ A + H+ + P GN ++ +DPH+EF +NV
Sbjct: 360 PNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNV 417
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISS 424
L S A G+ E+ + I+ ++KL D R + R RP LDDK+IV+WNGL I +
Sbjct: 418 LSVKVTPSRLAKDFGLGEEEVVRIIRAAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGA 477
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 484
A+ S + + E ES S + E A A +FI+ +L+++ T +L +R+G
Sbjct: 478 LAKCSALFE-EIES---------SKAVQCREAAAKAINFIKENLFEKPTGQLWRIYRDGG 527
Query: 485 -SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDEL-----------FLDRE 532
PGF DDYA+LI GLLD+YE +L +A +LQ+ + L FL
Sbjct: 528 RGNTPGFADDYAYLIGGLLDMYEATFDDSYLQFAEQLQSKRLALLTFLLEYLNDNFLAYV 587
Query: 533 G---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
G GY++T T P LLR+K + A P+ N V NL+RL S++
Sbjct: 588 GTTPAGYYSTPSTMTSGAPGPLLRLKTGTESATPAVNGVIARNLLRLGSLL 638
>gi|14548135|gb|AAK66792.1|U40238_13 Highly conserved protein containing a thioredoxin domain
[uncultured crenarchaeote 4B7]
Length = 674
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 233/689 (33%), Positives = 364/689 (52%), Gaps = 68/689 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFE++ VAK++N+ FV+IKVDREERPD+D +Y Q G GGWPLSV
Sbjct: 49 SCHWCHVMAHESFENDDVAKIMNENFVNIKVDREERPDLDDIYQKICQMSTGQGGWPLSV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ KP GTYFP D YGRPGF ++ R++ AW++K + S + L++
Sbjct: 109 FLTPEQKPFYVGTYFPVLDSYGRPGFGSLCRQLAQAWNEKPKDVGTSAEQFMSNLTKLEK 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S E+ ++ L A L + D+ +GGFG APKFP + M +SK
Sbjct: 169 VSDGG-----EIEKSILDEAAVNLLQVADTNYGGFGQAPKFPNAANLSFMFRYSK----- 218
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
SG ++ Q+ L TL+ MAKGGI D +GGGFHRYS D RW VPHFEKMLYD L V
Sbjct: 219 -LSG-ITKFQEFALMTLKKMAKGGIFDQIGGGFHRYSTDARWLVPHFEKMLYDNALLPPV 276
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A+ +TKD FY + LDY+ R+M G +SA+DAD+ EG T +VW
Sbjct: 277 YAEAYQITKDPFYLDVVTKTLDYIMREMTSASGLFYSAQDADTNGEEGQT-------FVW 329
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+E+E+ILG+ + +F +Y + GN F+G +L + S+ + K
Sbjct: 330 KKREIENILGDDSEIFCIYYDVTDGGN------------FEGNTILANNINISSLSFKFN 377
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
++ +L +KL DVRS R +P DDK+I SWN ++IS+FA+ +I
Sbjct: 378 KTEDEITKLLKRSSKKLLDVRSNRDQPGTDDKIITSWNSMMISAFAKGYRI--------- 428
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH-SFRNGPSKAPGFLDDYAFLIS 499
S ++Y+ VA +AA + H H +F+N K G+LDDY++L++
Sbjct: 429 -------SGNEKYLNVAVNAAKYFSEQF---SKHGFIHRTFKNDTPKLNGYLDDYSYLVN 478
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
L+D++E S +L A ++ + E F + ++ T S+++R K +D + P
Sbjct: 479 SLIDVFEITSDAYFLDIAQKITHYMIEHFWNETEKSFYFTADTHESLIVRPKNYYDLSVP 538
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSV 618
SGNSV+ L++L +V + + + ++ L + T + A + ++ L
Sbjct: 539 SGNSVAANALLKLHHLVNDEE---FLKISKQILELNGTSAAENPFAFGYLLNVMNLYLKH 595
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P+ + ++ ++S ++ + + + +I I D E + ++
Sbjct: 596 PTE--ITIINSENS----EIVNSLYKKFIPEGIIIQI--KDEENLKLLSKY----PFFEG 643
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLE 707
FS DK +C+NF+CS P+++ +E
Sbjct: 644 KEFS-DKTSVTICKNFTCSLPLSELSKIE 671
>gi|448365504|ref|ZP_21553884.1| hypothetical protein C480_03514 [Natrialba aegyptia DSM 13077]
gi|445655043|gb|ELZ07890.1| hypothetical protein C480_03514 [Natrialba aegyptia DSM 13077]
Length = 717
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 244/693 (35%), Positives = 355/693 (51%), Gaps = 57/693 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF DE VA LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMADESFADETVAAQLNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
+L+PD KP GTYFP E K G+PGF IL V ++W+ R+ + Q A A ++L
Sbjct: 113 AWLTPDGKPFYVGTYFPREAKRGQPGFLDILENVTNSWESDREEIENRADQWTAAATDRL 172
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
E A +S ++ L A +S D FGGFGS PKFP+P ++++ +
Sbjct: 173 EETPDAVGASQPPSSDV----LEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---A 225
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ + TG+ E +++ TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD
Sbjct: 226 RAADRTGR----DEFSDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDN 281
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
++ +L + T D Y+ + + LD++ R++ G FS DA S + E R +E
Sbjct: 282 AEIPRAFLLGYQQTGDERYAEVVAETLDFVERELTHEAGGFFSTLDAQSEDPETGER-EE 340
Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
GAFYVWT +V D+L + A LF Y + +GN F+GKN +
Sbjct: 341 GAFYVWTPDDVRDVLADETDAELFCSRYDITESGN------------FEGKNQPNRVASI 388
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
++ +P ++ L RR LF+ R +RPRP+ D+KV+ WNGL+I++ A A+ +L
Sbjct: 389 DDLTNRSELPADETRERLESARRDLFEARERRPRPNRDEKVLAGWNGLMIATCAEAALVL 448
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
G D +Y E+A A +F+R L+D RL +++ G+L+
Sbjct: 449 --------------GED--DYAEMATDALAFVRDRLWDADEQRLSRRYKDHDVAIDGYLE 492
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL G L YE L +A+EL + F D G + T S++ R +E
Sbjct: 493 DYAFLARGALGCYEATGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQE 552
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D + PS V+V L+ L AG ++ R A L RL+ ++ +C A
Sbjct: 553 LGDQSTPSAAGVAVETLLELDGF-AGESGEFERI-ATTVLETHANRLETNSLEHATLCLA 610
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW--EEHN 670
AD L + + + ++ D AS L + PA +E++ W E
Sbjct: 611 ADRLESGALEVTI-----AADDLPAEFVEPFASRYLPDRLFARRPATDDELEPWLDELEL 665
Query: 671 SNNASMARNNFSADKVVAL-VCQNFSCSPPVTD 702
++ ++ + D L VC++ +CSPP D
Sbjct: 666 ADEPAIWAGREARDGEPTLYVCRDRTCSPPTHD 698
>gi|373488750|ref|ZP_09579414.1| protein of unknown function DUF255 [Holophaga foetida DSM 6591]
gi|372005695|gb|EHP06331.1| protein of unknown function DUF255 [Holophaga foetida DSM 6591]
Length = 660
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 235/556 (42%), Positives = 315/556 (56%), Gaps = 69/556 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE+ VA LN FV IKVDREERPD+D++YM VQ L G GGWP+S
Sbjct: 48 SACHWCHVMERESFENADVAAFLNKHFVPIKVDREERPDLDELYMGAVQLLAGRGGWPMS 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEA 138
V+L+P+L+P GGTYFPP + G PGF +L V W ++R D+LAQ+G +L A
Sbjct: 108 VWLTPELEPFYGGTYFPPVSRGGMPGFLDVLEGVARVWQERRQDVLAQAG-----ELVAA 162
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L A P + L + LS S+D+R+GGFG APKFP + ++L
Sbjct: 163 LRAGRGIGGDPPG--EGLLEVAIRHLSYSFDARWGGFGGAPKFPPIPALTLLLGRGD--- 217
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ M + TL MA GGI DH+GGGF RYSVDERW VPHFEKML D QLA
Sbjct: 218 --------PKALDMAIRTLDAMAAGGIRDHLGGGFARYSVDERWKVPHFEKMLCDNAQLA 269
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VYL+AF +T +V + R+ILDY +M G FS+EDADS EG +EG FY
Sbjct: 270 WVYLEAFRVTGEVRHGERAREILDYFLGEMRDASGGFFSSEDADS---EG----EEGRFY 322
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
++ EV+++LG A LF Y + P GN + G+++L + S+
Sbjct: 323 TFSWGEVQEVLGPGADLFCRAYGVTPEGNFE-----------GGRSLLHRMEVGDFPESE 371
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
L + R ++ R +R RPH DDK++V+WNGL +S+ A+ S +L
Sbjct: 372 LAI-----------LRERIRLYRDRRVRPHRDDKILVAWNGLALSALAKGSALL------ 414
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
G R Y+E AE+ A F++R L+ + T L ++R G PGFL+DY LI
Sbjct: 415 --------GEPR--YLEAAEACADFLQRELWRDGT--LLRTWRQGRGHTPGFLEDYGALI 462
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLLDLY+ G ++WL WA EL E F + E GG+F T D V+LR D A
Sbjct: 463 LGLLDLYQTGFHSRWLHWAQELGEALLERFHEAE-GGFFGTEALD--VILRQCPVFDHAI 519
Query: 559 PSGNSVSVINLVRLAS 574
PSGN+++ + L+RL +
Sbjct: 520 PSGNALAALALLRLGN 535
>gi|417766154|ref|ZP_12414108.1| PF03190 family protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
gi|400351608|gb|EJP03827.1| PF03190 family protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
Length = 691
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 252/699 (36%), Positives = 363/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G IFSAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGIFSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G SDYYR+ AE F L A++ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|455791360|gb|EMF43176.1| PF03190 family protein [Leptospira interrogans serovar Lora str. TE
1992]
Length = 691
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 252/699 (36%), Positives = 363/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G IFSAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGIFSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAISLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G SDYYR+ AE F L A++ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|433591712|ref|YP_007281208.1| thioredoxin domain protein [Natrinema pellirubrum DSM 15624]
gi|448334040|ref|ZP_21523224.1| hypothetical protein C488_11564 [Natrinema pellirubrum DSM 15624]
gi|433306492|gb|AGB32304.1| thioredoxin domain protein [Natrinema pellirubrum DSM 15624]
gi|445620768|gb|ELY74256.1| hypothetical protein C488_11564 [Natrinema pellirubrum DSM 15624]
Length = 731
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 234/696 (33%), Positives = 356/696 (51%), Gaps = 59/696 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEEESFADEAVAEILNENFVPIKVDREERPDVDSIYMTVCQLVRGQGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
+L+P+ KP GTYFP + + G+PGF + +++ D+W+ + D Q A +
Sbjct: 113 AWLTPEGKPFFIGTYFPRDGERGQPGFPDLCQRISDSWESEEDREEMQHRAQQWTDAAKD 172
Query: 134 QLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
+L E ++ + E P + L A+ + +S D ++GGFG+ KFP+P ++++
Sbjct: 173 RLEETPDSAGVDAGVAAEPPSSDVLETAADAVLRSADRQYGGFGTGQKFPQPSRLRVL-- 230
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
++ + TG+ E ++++ TL MA GG+ DHVGGGFHRY VD W VPHFEKMLY
Sbjct: 231 -ARTYDRTGR----EEYREVLEETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKMLY 285
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D ++ +L + LT + Y+ D L ++ R++ G FS DA S + E R
Sbjct: 286 DNAEIPRAFLAGYQLTGEDRYAETVADTLAFVDRELTHDEGGFFSTLDAQSEDPETGER- 344
Query: 313 KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EGAFYVWT +EV D++ + A LF Y + +GN F+G+N +
Sbjct: 345 EEGAFYVWTPEEVHDVIADETDASLFCARYDITESGN------------FEGQNQPNRIA 392
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
S AS+ + + L L R++LF+ R +RPRP D+K++ WNGL+IS++A A+
Sbjct: 393 RVSELASQFDLAESEVLKRLDSARKRLFEAREERPRPDRDEKILAGWNGLMISTYAEAAL 452
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+L G D EY E A A F+R L+D ++ RL ++ G K G+
Sbjct: 453 VL--------------GED--EYAETAVDALEFVRDRLWDTESQRLSRRYKAGDVKVDGY 496
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYAFL G LD Y+ L +A+EL + F D + G + T S++ R
Sbjct: 497 LEDYAFLARGALDCYQATGDVDHLAFALELARVIEAEFWDADRGTLYFTPESGESLVTRP 556
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
+E D + PS V+V L+ L D + + A L L+ A+ +C
Sbjct: 557 QELGDQSTPSSTGVAVETLLALDEFA----DDDFSEIAATVLETHANELEANALEHATLC 612
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH- 669
AD + + V ++ + A AS + + P ++ W E
Sbjct: 613 IGADRFEAGALEVTV-----AADELPTEWREAFASRYFPDRLFALRPPTEAGLETWLETL 667
Query: 670 ---NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
++ R + + VC++ +CSPP D
Sbjct: 668 GLADAPPIWAGREARDGEPTL-YVCRDRTCSPPTHD 702
>gi|338812196|ref|ZP_08624385.1| hypothetical protein ALO_08830 [Acetonema longum DSM 6540]
gi|337275852|gb|EGO64300.1| hypothetical protein ALO_08830 [Acetonema longum DSM 6540]
Length = 633
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 243/685 (35%), Positives = 358/685 (52%), Gaps = 59/685 (8%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFED+ VA LLN +++IKVDREERPDVD +YM QAL G GGWPL++ ++PD
Sbjct: 1 MERESFEDQEVADLLNQDYIAIKVDREERPDVDHIYMQVCQALTGQGGWPLTIMMTPDKS 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP K+GRPG IL + W ++RD L E++ +++ A +
Sbjct: 61 PFFAGTYFPKNSKWGRPGLMAILTALSQQWRQQRDSLNDYA----EEILKSIDAREPGSP 116
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
L + + L++ +DS +GGF SAPKFP P + ++ + + +GEA
Sbjct: 117 Y-SLLSEEQVHAAFHGLARYFDSEYGGFSSAPKFPTPHNLLFLMRYWR------HTGEA- 168
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
+ MV TLQ M +GGI+DH+G GF RYSVD +W VPHFEKMLYD L +Y +AF
Sbjct: 169 KAMDMVEKTLQSMRRGGIYDHLGFGFARYSVDHQWLVPHFEKMLYDNALLCYIYAEAFQA 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T + Y+ + +I+ Y++RDM GP G +SAEDADS EG +EG FY+WT +E+
Sbjct: 229 TGNKEYAQVAEEIIAYVQRDMTGPAGGFYSAEDADS---EG----EEGKFYLWTKEEILR 281
Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEK 385
LG +F ++Y++ GN D G ++L + + A+K+GM ++
Sbjct: 282 ALGWTQGTIFADYYHVTAEGNFD-----------AGSSILHTIGREPGEYAAKVGMKPDE 330
Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
+ +L + R KL ++R++R P DDKV+ SWN L+I++ A+A+++L
Sbjct: 331 FQAMLQDGREKLRELRNQRVHPFKDDKVLTSWNALMIAALAKAARVL------------- 377
Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
D+ +Y+ A A +FI HL Q RL R G S +LDDYA+L+ +++LY
Sbjct: 378 ---DKPQYLFAASQALNFIEIHL-TRQDGRLLARHRAGESAYLAYLDDYAYLLWAVIELY 433
Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
E +L A L ELF D + GG+F T + ++ R KE +DGA PSGNS +
Sbjct: 434 ETTLSAAYLEMAKGLAGNMVELFWDEKQGGFFFTGSDAEKLISRPKEIYDGATPSGNSAA 493
Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
L+RLA I + E F + A A D +P ++++
Sbjct: 494 AYALLRLARITEDAD---LLTVVERLFEYFAGEVSQAPRAFTFFLMAFDYYLMPP-QNII 549
Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADK 685
+ G K + ++L A Y ++ P E + H + + + R+
Sbjct: 550 IAGVKDDIATVSLLKQARKYYMPEVVLVLNSPDQAETL----RHTAPHVT-GRDRLDG-L 603
Query: 686 VVALVCQNFSCSPPVTDPISLENLL 710
A VC FSC PVT LE LL
Sbjct: 604 ATAYVCHKFSCQRPVTSVRDLERLL 628
>gi|448339114|ref|ZP_21528145.1| hypothetical protein C487_15484 [Natrinema pallidum DSM 3751]
gi|445621085|gb|ELY74571.1| hypothetical protein C487_15484 [Natrinema pallidum DSM 3751]
Length = 727
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 234/694 (33%), Positives = 357/694 (51%), Gaps = 59/694 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE VA+++N+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMAEESFEDEAVAEVINENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW------DKKRDMLAQSGAFAIE 133
+L+P+ KP GTYFP E + G+PGF+ + +++ D+W ++ + Q A +
Sbjct: 113 AWLTPEGKPFFIGTYFPREGQRGQPGFRDLCQRISDSWESEEDREEMENRAQQWTDAAKD 172
Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
QL E + + P + L A+ + +S D ++GGFGS KFP+P ++++
Sbjct: 173 QLEETPDTAGVGAEPPS---SDVLETAADMVLRSADRQYGGFGSGQKFPQPSRLRVL--- 226
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
++ + TG+ E +++ TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD
Sbjct: 227 ARAYDRTGR----EEYREVFEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYD 282
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
++ +L + LT + Y+ + + L+++ R++ G FS DA S E R +
Sbjct: 283 NAEIPRAFLSGYQLTGEDRYATVVSETLEFVDRELTHDEGGFFSTLDAQSESPETGER-E 341
Query: 314 EGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
EGAFYVWT EV + L + A LF + + +GN F+G+N +
Sbjct: 342 EGAFYVWTPAEVHEALDDETDAALFCARFDISESGN------------FEGRNQPNRVAT 389
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
S A + + + L L R+ LF+ R +RPRP+ D+K++ WNGL+IS++A A+ +
Sbjct: 390 VSELADQFDLAEHEILKRLDSARQTLFEAREERPRPNRDEKILAGWNGLLISTYAEAALV 449
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L G+D +Y + A A F+R L+DE RL +++G K G+L
Sbjct: 450 L--------------GAD--DYADTAVDALEFVRDRLWDEDDQRLSRRYKDGDVKVDGYL 493
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYAFL G LD Y+ L +A+EL + F D + G + T S++ R +
Sbjct: 494 EDYAFLARGALDCYQATGEVDHLAFALELARVIEAEFWDADRGTLYFTPESGESLVTRPQ 553
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
E D + PS V+V L+ L A D A L L+ A+ +C
Sbjct: 554 ELGDQSTPSATGVAVETLLALDEFAAEDFEDI----AATVLETHANELESNALEHATLCL 609
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHN 670
AAD L+ + + V + + + LA+ + + + P + ++ W E
Sbjct: 610 AADRLAAGALE-VTVAADDLPTAWRDRLASQY----YPDRLFALRPPTEDGLEAWLETLG 664
Query: 671 SNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
NA A D+ VC+ +CSPP D
Sbjct: 665 LENAPPIWADREARDDEPTLYVCRERTCSPPTHD 698
>gi|408403905|ref|YP_006861888.1| hypothetical protein Ngar_c12930 [Candidatus Nitrososphaera
gargensis Ga9.2]
gi|408364501|gb|AFU58231.1| protein of unknown function DUF255 [Candidatus Nitrososphaera
gargensis Ga9.2]
Length = 695
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 253/714 (35%), Positives = 362/714 (50%), Gaps = 101/714 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED+ +AK++N+ F++IKVDREERPD+D +Y Q G GGWPLS
Sbjct: 57 SACHWCHVMAHESFEDDEIAKIMNEHFINIKVDREERPDLDDIYQRVCQLATGTGGWPLS 116
Query: 80 VFLSPDLKPLMGGTYFPPE-DKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAF--AIEQL 135
VFL+PD KP GTYFP E Y PGFKTIL ++ A+ KK+++ A SG F A+ Q
Sbjct: 117 VFLTPDQKPFYVGTYFPKEGGHYNMPGFKTILLQLATAYKSKKQEIEAASGEFMDALAQT 176
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+ ++ A+ L ++ L A L + D +GGFG APKFP + +L +
Sbjct: 177 ARDVALGAAGKA---SLERSILDEAAVGLLQMGDPIYGGFGQAPKFPNASNLMFLL---R 230
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ +G S + V FT MA GGIHD +GGGF RY+ D++W VPHFEKMLYD
Sbjct: 231 YYDISGMSC----FKDFVAFTADKMAAGGIHDQLGGGFARYATDQKWLVPHFEKMLYDNA 286
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
LA +Y + + +TK Y I R LD++ R+M P G +SA+DADS EG +EG
Sbjct: 287 LLAQLYSELYQITKAEKYLQITRKTLDFVIREMTHPEGGFYSAQDADS---EG----EEG 339
Query: 316 AFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
FYVW+ KE+ ILG+ A +F EHY + GN F+GKN+L S
Sbjct: 340 KFYVWSKKEIASILGDQAATDIFCEHYGVTEGGN------------FEGKNILNVRVPVS 387
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
+ + G E+ I+ + KLF R KR RP D+K++ SWNGL+IS FA+ I
Sbjct: 388 SVGLRYGKTPEQTAQIIADASAKLFAAREKRVRPARDEKILTSWNGLMISGFAKGYGI-- 445
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+ ++Y++ A+ A FI + RL H+F++G SK +LDD
Sbjct: 446 --------------TGDQKYLQAAKDAVKFIETKIVTGDG-RLLHTFKDGKSKLNAYLDD 490
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAF GLLDL+ S ++L A++ + F D + F T+ + +++R K
Sbjct: 491 YAFYTGGLLDLFAIDSRQEYLDKAVKYTDFMLAHFWDEKEENLFFTSDDHEKLIVRTKSF 550
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
+D A PSGNSV+ NL+RL +Y QN + + CA
Sbjct: 551 YDLAIPSGNSVAASNLLRLY---------HYTQNNSY------------------LDCAV 583
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD-TEEMDFWEEH--- 669
++ ++ ++ F ML + V I D + +M W
Sbjct: 584 KIMKASAKP-----AAENPFGFGQMLNTIYLYVKKPVEVTVITRNDHSSKMAEWLNQQFV 638
Query: 670 -NSNNASMARNNFSA------------DKVVALVCQNFSCSPPVTDPISLENLL 710
+ NA ++ N ++ D A VC+NF+CS P+ LE L
Sbjct: 639 PDGINAIVSTNELASLQKYAYFKGRVGDGETAFVCRNFTCSLPIKSQQELERQL 692
>gi|448363039|ref|ZP_21551643.1| hypothetical protein C481_13364 [Natrialba asiatica DSM 12278]
gi|445647661|gb|ELZ00635.1| hypothetical protein C481_13364 [Natrialba asiatica DSM 12278]
Length = 717
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 241/693 (34%), Positives = 355/693 (51%), Gaps = 57/693 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF DE VA LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMADESFADEAVAAELNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
+L+P+ KP GTYFP E K G+PGF +L V ++W+ R+ + Q A A ++L
Sbjct: 113 AWLTPEGKPFYVGTYFPREAKRGQPGFLDVLENVTNSWESDREEIENRADQWTAAATDRL 172
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
E A +S ++ L A +S D FGGFGS PKFP+P ++++ +
Sbjct: 173 EETPDAVGASQPPSSDV----LEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---A 225
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ + TG+ E ++++ TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD
Sbjct: 226 RATDRTGR----DEFSEVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDN 281
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
++ +L + T D Y+ + + LD++ R++ G FS DA S + E R +E
Sbjct: 282 AEIPRAFLLGYQQTGDERYAEVVAETLDFVERELTHDAGGFFSTLDAQSEDPETGER-EE 340
Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
GAFYVWT EVE + + A LF+ Y + +GN F+G N +
Sbjct: 341 GAFYVWTPDEVEAAVTDETDAELFRSRYDITQSGN------------FEGTNQPNRVASI 388
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
A + +P ++ + L RR LF R +RPRP+ D+KV+ WNGL+I++ A A+ +L
Sbjct: 389 DELADRFDLPADEVEDRLESARRDLFQAREQRPRPNRDEKVLAGWNGLMIATCAEAALVL 448
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
G D +Y E+A A +F+R L+D RL +++ G+L+
Sbjct: 449 --------------GED--DYAEMATDALAFVRERLWDGDEKRLSRRYKDDDVAIDGYLE 492
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL G L YE L +A+EL + F D G + T S++ R +E
Sbjct: 493 DYAFLARGALGCYEATGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQE 552
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D + PS V+V L++L AG ++ R A L RL+ ++ +C A
Sbjct: 553 LGDQSTPSAAGVAVETLLQLDGF-AGESGEFERI-ATTVLETHANRLETNSLEHATLCLA 610
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW--EEHN 670
AD L + + + ++ + AS L + PA +E+ W E
Sbjct: 611 ADRLESGALEITI-----AADELPEAFVEPFASRYLPDRLFARRPATDDELAAWLDELEL 665
Query: 671 SNNASMARNNFSADKVVAL-VCQNFSCSPPVTD 702
++ ++ + D L VC++ +CSPP D
Sbjct: 666 ADEPAIWAGRATRDGEPTLYVCRDRTCSPPTHD 698
>gi|165970642|gb|AAI58572.1| Spata20 protein [Rattus norvegicus]
Length = 550
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 200/487 (41%), Positives = 289/487 (59%), Gaps = 46/487 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + LLN+ FVS+ VDREERPD
Sbjct: 90 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNENFVSVMVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L ++ D W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S ++ G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRVTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QL+ VY AF ++ D F+S + + IL Y+ R++ G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAFQISGDEFFSDVAKGILQYVTRNLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADS G + +EGA Y+WT KEV+ +L E L +HY L
Sbjct: 381 GFYSAEDADSPPERG-VKPQEGALYLWTVKEVQQLLPEPVGGASEPLTSGQLLMKHYGLS 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + ++ D + E G+NVL +A++ G+ +E +L KLF R
Sbjct: 440 EAGNINPTQ--DVNGEMHGQNVLTVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
RP+ HLD+K++ +WNGL++S FA A +L E + + A + A F
Sbjct: 498 RPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME----------------KLVTQATNGAKF 541
Query: 464 IRRHLYD 470
++RH++D
Sbjct: 542 LKRHMFD 548
>gi|387900736|ref|YP_006331032.1| hypothetical protein MUS_4478 [Bacillus amyloliquefaciens Y2]
gi|387174846|gb|AFJ64307.1| conserved hypothetical protein YyaL [Bacillus amyloliquefaciens Y2]
Length = 629
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 244/678 (35%), Positives = 355/678 (52%), Gaps = 66/678 (9%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+VF++PD K
Sbjct: 1 MAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP KY RPGF +L + + + R +E ++E +A
Sbjct: 61 PFYAGTYFPKTSKYNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKI 112
Query: 148 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
P E L + A+ QL+ +D+ +GGFG APKFP P M+++ + TGK +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168
Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L Y +A+
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLPAYTEAY 225
Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
+T + Y I I+ +++R+M+ G FSA DAD TEG +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278
Query: 326 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
++LG E L+ + Y + GN + + PH F + ++E ++ + ++L LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGNELAERLE 334
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
E R KL + R R PH DDKV+ SWN L+I+ A+A+K+ F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
+++ +AE+A F+ RHL + R+ +R G K GF+DDYAFLI L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLEL 429
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
YE G +L A L + ELF D GG+F T + ++L+R KE +DGA PSGNS
Sbjct: 430 YEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489
Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
+ + L+RL + G S + AE +VF+ ++ + + ++P +K +
Sbjct: 490 AAVQLLRLGRLT-GDIS--LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEI 545
Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD 684
V+ G K D + + A + T++ + D E + A
Sbjct: 546 VVFGSKDDPDRKRFIEALQEHFTPAYTILAAEHPD--------ELKGISDFAAGYQMIDG 597
Query: 685 KVVALVCQNFSCSPPVTD 702
K +C+NF+C P TD
Sbjct: 598 KTTVYICENFACRRPTTD 615
>gi|338733047|ref|YP_004671520.1| hypothetical protein SNE_A11520 [Simkania negevensis Z]
gi|336482430|emb|CCB89029.1| uncharacterized protein yyaL [Simkania negevensis Z]
Length = 676
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 242/714 (33%), Positives = 360/714 (50%), Gaps = 78/714 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + FL TCHWCHVM ESF + +A L+N+ F+++KVDREE P+
Sbjct: 29 GDEAFEAAKKLDKPIFLSIGYATCHWCHVMSRESFANSEIATLMNETFINVKVDREELPE 88
Query: 59 VDKVYMTYVQALYGGG-GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
+D +YM + QAL G GWPL++ L+P+LKP TY PP + G K ++ +K W
Sbjct: 89 IDSLYMEFAQALMASGSGWPLNLILTPELKPFYATTYMPPTTRQELMGIKELVSHIKQLW 148
Query: 118 DK-KRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG 176
+R++L ++ A S +LP+E L EQ ++ D +GG
Sbjct: 149 KSAERELLLDQAEKLVDLF--ARSVQTRGEELPNE---EHLDAAVEQFYEAVDPVYGGIK 203
Query: 177 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
APKFP +I L H+++ D S TL M +GGI+D VGGGF RY
Sbjct: 204 GAPKFPLGYQILFFLEHARREHD-------SRSLFFAELTLSMMHRGGIYDQVGGGFSRY 256
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
SVDE+W +PHFEKMLYD +A +LDA+ LTK Y +C +ILDYL RDM GG +
Sbjct: 257 SVDEKWIIPHFEKMLYDNALMALAFLDAWKLTKKPLYRQVCEEILDYLLRDMQHQGGGFY 316
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSD 355
SAED AET+G +EGA+Y W ++E++ +L + LF E++ + P+GN
Sbjct: 317 SAED---AETDG----EEGAYYTWHAQEIQKLLPPADLDLFCEYFDVTPSGN-------- 361
Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
F GKNVL A G+ L C LFD R R RP DDK++V
Sbjct: 362 ----FGGKNVLYRTMTIQEFAELRGLDPLMIQTRLDSCLNLLFDARKGRKRPFKDDKILV 417
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
+WN + I F +A + ++EA Y++ +AASFIR++L+ + +
Sbjct: 418 TWNAMAIDVFIKAGRAFQNEA----------------YLKSGLAAASFIRQNLW--KGGK 459
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L+ FR G + G LDDYA+LI L+ L E G WL WA+EL + ++ F EG
Sbjct: 460 LKRRFREGQTDYEGGLDDYAYLIRALITLSEADLGNVWLQWALELADFLEKEFKADEGA- 518
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
F TG + S+LLR E D A+PSGN++ NL+RL+ + +++ R AE L V
Sbjct: 519 -FYQTGPEYSILLRRPELFDSAQPSGNAIHAENLIRLSQL---TQNRELRIQAEDILKVA 574
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
+ ++ P C H++ + H + ++ A L + ++ +
Sbjct: 575 TSYIE----TYPQGACY----------HLIALQHYLDKEALTIVVALDEKESLKEEILEV 620
Query: 656 DPAD--TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
+ + FW+ H ++ N K +C++ C P+T +L+
Sbjct: 621 LSTEFIPHHVVFWKRH--SDKEFEENIPLEGKTTVYLCKHGKCEAPITSTDALQ 672
>gi|329765558|ref|ZP_08257134.1| hypothetical protein Nlim_0902 [Candidatus Nitrosoarchaeum limnia
SFB1]
gi|329137996|gb|EGG42256.1| hypothetical protein Nlim_0902 [Candidatus Nitrosoarchaeum limnia
SFB1]
Length = 675
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 218/561 (38%), Positives = 315/561 (56%), Gaps = 49/561 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE+E VAK +N+ F++IKVDREERPD+D +Y Q G GGWPLS
Sbjct: 49 SACHWCHVMAHESFENEDVAKFMNENFINIKVDREERPDLDDIYQKVCQIATGQGGWPLS 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD KP GTYFP D YGRPGF +I R++ AW +K + +S I L +
Sbjct: 109 VFLTPDQKPFYVGTYFPVLDSYGRPGFGSICRQLAQAWKEKSKDIEKSADKFIVALQK-- 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ K+P +L + L A L + D+ +GGFGSAPKFP + + ++K
Sbjct: 167 ---TDTVKVPSKLDKTILDEAAMNLFQLGDAAYGGFGSAPKFPNAANVSFLFRYAKL--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG S+ + L TL MA+GGI D +GGGFHRYS D +W VPHFEKMLYD +
Sbjct: 221 TG----LSKFNEFALKTLNKMARGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPV 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y++A+ +T+D FY + LD++ R+M G +SA DADS EG EG FYV
Sbjct: 277 NYVEAYQITQDPFYLEVLNKTLDFVLREMTAKNGGFYSAYDADS---EGI----EGKFYV 329
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W +++ ILG+ + LF +Y + GN ++G N+L + SA +
Sbjct: 330 WKKSDIKVILGDDSDLFCLYYDVTDGGN------------WEGNNILCNNINISAVSFHF 377
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
GMP EK IL C +KL RS R P LDDK++ SWN L+I++FA+ +
Sbjct: 378 GMPEEKIKKILTMCSQKLLKSRSMRVAPGLDDKILTSWNALMITAFAKGYGV-------- 429
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+D +Y++ A++ FI L + +L + +NG +K G+L+DY++ +
Sbjct: 430 --------TDDLKYLDAAKNCIHFIETTLLVDD--KLLRTSKNGITKIDGYLEDYSYFAN 479
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LLD++E +K+L A++L N + F D E +F T+ +++R K ++D + P
Sbjct: 480 ALLDVFEVEPDSKYLDLALKLGNYLVDHFWDSESSSFFMTSDNHEKLIIRPKSNYDLSLP 539
Query: 560 SGNSVSVINLVRLASIVAGSK 580
SGNSVS ++RL + K
Sbjct: 540 SGNSVSCSVMLRLYHLTHDEK 560
>gi|418679291|ref|ZP_13240555.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
str. RM52]
gi|400320416|gb|EJO68286.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
str. RM52]
Length = 696
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 247/696 (35%), Positives = 360/696 (51%), Gaps = 68/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 63 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + F ++K + DI+ YL RDM GG I SAEDADS EG +EG
Sbjct: 294 FLEILAEYFLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 346
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E +
Sbjct: 347 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 390
Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S K+L+ L + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 391 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 444
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ R++++++AE SFI ++L D + R+ FR G S G+ +DYA
Sbjct: 445 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 493
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +
Sbjct: 494 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 551
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DG EPS NS +LV+L+ + G SD YR+ AE F L A++ P + A
Sbjct: 552 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYW 609
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SR+ V++ K+S ++LA + + + ++ + EE +
Sbjct: 610 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 660
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + S + VC+NFSC P+ + LE +
Sbjct: 661 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 696
>gi|357632813|ref|ZP_09130691.1| hypothetical protein DFW101_0683 [Desulfovibrio sp. FW1012B]
gi|357581367|gb|EHJ46700.1| hypothetical protein DFW101_0683 [Desulfovibrio sp. FW1012B]
Length = 737
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 255/702 (36%), Positives = 345/702 (49%), Gaps = 65/702 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE +A L+ V++KVDREERPD+D +YMT+ QAL G GGWPL+
Sbjct: 80 STCHWCHVMEHESFEDEDIAALMRATVVAVKVDREERPDLDNLYMTFCQALTGRGGWPLN 139
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFP E +GR G + +L++V AW R + + ++ + L
Sbjct: 140 VFLTPDGQPFFAGTYFPKESGFGRTGMRELLQRVHMAWTSNRQAVIGNATQILDAVRSQL 199
Query: 140 SA-SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A A P E +A R +L+ +YD+ GGFG APKFP P + +L ++
Sbjct: 200 EARDAGETAEPGEAQLDAAR---NELAAAYDAANGGFGGAPKFPSPHNLLFLL---REFR 253
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ E MV TL M +GG+ D +G G HRYS D W VPHFEKMLYDQ A
Sbjct: 254 RTGR----EENLAMVTATLDAMRRGGVFDQIGLGLHRYSTDAHWFVPHFEKMLYDQALTA 309
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+A+ T D + + RDI +Y+ RD+ GP G +SAEDADS EG EG FY
Sbjct: 310 MAATEAYLATGDAEWRRMARDIFEYVHRDLTGPDGAFYSAEDADS---EGV----EGKFY 362
Query: 319 VWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT E+ +L G+ A LF + Y + P GN + + G N+ +A A
Sbjct: 363 VWTESEIRAVLAGDEAGLFMDVYGIAPGGNFH----DEATGQATGANIPFLEEPIAAVAG 418
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K G+ + + L R L R KR RP DDKV+ NGL+I++ A+A++
Sbjct: 419 KKGLGPAELASRLERSRELLLAARQKRVRPLCDDKVLTDMNGLMIAALAKAARAF----- 473
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
D +E A+ A+ F+ + + RL H R G + G LDDYAFL
Sbjct: 474 -----------DDEELAGRAKRASDFLLAKMLLPDS-RLLHRLRLGEAAVTGMLDDYAFL 521
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
GLL+LY+ +L A+ L F D GG F T + ++LLR K +D A
Sbjct: 522 AWGLLELYQTVFDPAYLAQAVALAKAMVRHFGD-AAGGLFLTPDDGEALLLRQKTYYDAA 580
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA--------MAVPLM 609
PSGNSV+ + L L YR E S +RL A
Sbjct: 581 IPSGNSVAFLVLTTL-----------YRLTGEKSFMEEASRLARAAGPWVAGHPSGFTFF 629
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
C + PS V + G + D + A Y L + + + PA E D E
Sbjct: 630 LCGLSQMLAPS-AEVTIAGDPDAPDTHALARALFERY-LPEVAVVLRPAGEEPND--EPD 685
Query: 670 NSNNASMARNNFS-ADKVVALVCQNFSCSPPVTDPISLENLL 710
A R D+ A VC+ SC PP DP ++ LL
Sbjct: 686 IVALAPFTRFQLPMGDRAAAHVCRAGSCQPPTPDPAAMLALL 727
>gi|225571461|ref|ZP_03780457.1| hypothetical protein CLOHYLEM_07559 [Clostridium hylemonae DSM
15053]
gi|225159937|gb|EEG72556.1| hypothetical protein CLOHYLEM_07559 [Clostridium hylemonae DSM
15053]
Length = 669
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 240/693 (34%), Positives = 346/693 (49%), Gaps = 94/693 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED+ A +LN+ F+SIKVDREERPD+D VYM+ QAL G GGWP+S
Sbjct: 61 STCHWCHVMAHESFEDKRTADILNENFISIKVDREERPDIDSVYMSVCQALTGSGGWPMS 120
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI------E 133
+F++ + KP TY PP+++YG GF+ +L ++ W K+ L +S + E
Sbjct: 121 IFMTAEQKPFYAATYIPPDNRYGMKGFRELLLEISGHWKYKKSELLESAEQILDHIDTKE 180
Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
+ ++ + LP+ A AE ++++D ++GGFG+APKFP P + ++ +
Sbjct: 181 ERAKKKTLKRVGAGTDTTLPERA----AELFAQAFDEKYGGFGAAPKFPTPHNLLFLMIY 236
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
S L+D G S EA + TL+ M +GGI DH+G GF RYS D + VPHFEKMLYD
Sbjct: 237 S-SLQDAGMSYEAEK-------TLEQMRRGGIFDHIGYGFSRYSTDRFYLVPHFEKMLYD 288
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
L Y A+ ++ + +Y+ R+M GP GE +SA+DADS EG +
Sbjct: 289 NALLMIAYSAAYKVSGKTMFLETAEKTAEYILREMTGPDGEFYSAQDADS---EG----R 341
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG +YVW +E+ ILG E F +Y + GN F+GKN+ EL+
Sbjct: 342 EGLYYVWDEEEICGILGAERGTEFCRYYGITEEGN------------FEGKNIPNELDGK 389
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ + + R L+D R +R R HLDDKV+ SWN L+IS+ A +L
Sbjct: 390 EIT------------DRFHKERELLYDYRKRRARLHLDDKVLTSWNSLMISAMA----VL 433
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+ V G +R Y+E AE A FI +L D T R+ S R G GFLD
Sbjct: 434 ----------YRVTGKER--YLEAAERARRFIEHNLADGNTLRV--SCRGGSGSVKGFLD 479
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYA+ + LL LYE S L A ++ + F D EGGG+F + S++ R KE
Sbjct: 480 DYAYYTAALLSLYEAVSDVDHLTRAEQICREARQQFADEEGGGFFLYGSRNDSLITRPKE 539
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
+DGA PSGNS +LVRL I + Y+ A+ LA ++ + A
Sbjct: 540 TYDGALPSGNSTMAYDLVRLYQITGNEE---YKDAAKRQLAFMSGEAQEYPAGYSMFLTA 596
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
+ P +K V++ NK I + + E N
Sbjct: 597 LLLYENPPQKITVVLADGD-----------------NKEEI------MSRLPLYAEINIL 633
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPIS 705
+ + VC+N++C PP + +S
Sbjct: 634 SGETREYKLLNGRTTYYVCKNYTCLPPSNELMS 666
>gi|383458464|ref|YP_005372453.1| hypothetical protein COCOR_06500 [Corallococcus coralloides DSM
2259]
gi|380730954|gb|AFE06956.1| hypothetical protein COCOR_06500 [Corallococcus coralloides DSM
2259]
Length = 696
Score = 380 bits (976), Expect = e-102, Method: Compositional matrix adjust.
Identities = 241/703 (34%), Positives = 353/703 (50%), Gaps = 70/703 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE +A+L+N+ F++IKVDREERPD+D++Y VQ + GGGWPL+
Sbjct: 56 SACHWCHVMAHESFEHPDIARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDL+P GGTYFPP D+YGRPGF +L ++DAW+ K D + + E L E
Sbjct: 116 VFLTPDLRPFYGGTYFPPSDRYGRPGFPRLLTALRDAWENKADEIEEQAKRFQEGLGEL- 174
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
++ + P L + + + K D GGFG APKFP P+ + ++L ++
Sbjct: 175 -STHGLDAAPAHLSAEDIVAMGQSMLKRMDPVNGGFGGAPKFPNPMNVALLLRAWRR--- 230
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G + V TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD QL +
Sbjct: 231 ----GGGEPLKAAVFRTLERMALGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLLH 286
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y +A + + + + ++Y+RR+M P G ++ +DADS EG +EG F+V
Sbjct: 287 LYSEAEQVESRPLWRKVVEETVEYVRREMTDPAGGFYATQDADS---EG----EEGKFFV 339
Query: 320 WTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
W +EV L G+ A H+ +KP GN + G VL + A
Sbjct: 340 WHPEEVRAALSVGQQADTVLRHFGIKPGGNFE-----------HGATVLEVVVPVEQLAK 388
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ G P+E L E RR LF +R +R +P DDK++ WNGL+I A AS++
Sbjct: 389 EQGRPVEAVEKELAEARRVLFLLREQRVKPGRDDKILAGWNGLMIRGLALASRVF----- 443
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
DR ++ ++A AA F+ ++D + RL S+++G + GFL+DY
Sbjct: 444 -----------DRPDWAKLAADAADFVLAKMWDGK--RLLRSYQHGQGRIDGFLEDYGDF 490
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
SGL LY+ K+L A L + ELF D E Y + +++ D A
Sbjct: 491 ASGLTALYQATFDAKYLDAADALAHRAVELFWDEEKQAYLSAPRGQKDLVVAAFSLFDNA 550
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSG S V L+++ + + EH +A +L M + AAD L
Sbjct: 551 FPSGASTLTEAQVTLSAL---TGDVCHLDQPEHYVAKLHDQLVRNPMGYGHLGLAADSL- 606
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
V V G + +V +LAAA+ +Y V W + ++ +
Sbjct: 607 VDGASGVTFAGTREAV--APLLAAANRTY---APVFSFG---------WHDTSAPPPARL 652
Query: 678 RNNFSA-----DKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
+ F K A +C+ F C P+T+ L L+ P
Sbjct: 653 QELFEGRDPVEGKGAAYLCRGFVCERPITEQGLLAERLVAAPG 695
>gi|375364488|ref|YP_005132527.1| hypothetical protein BACAU_3798 [Bacillus amyloliquefaciens subsp.
plantarum CAU B946]
gi|371570482|emb|CCF07332.1| conserved hypothetical protein YyaL [Bacillus amyloliquefaciens
subsp. plantarum CAU B946]
Length = 629
Score = 380 bits (976), Expect = e-102, Method: Compositional matrix adjust.
Identities = 248/692 (35%), Positives = 358/692 (51%), Gaps = 78/692 (11%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE +A +LND F+++KVDREERPDVD VYM Q + G GGWPL+VF++PD K
Sbjct: 1 MAHESFEDEEIAGMLNDKFIAVKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP K+ RPGF +L + + + R +E ++E +A
Sbjct: 61 PFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKV 112
Query: 148 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
P E L + A+ QL+ +D+ +GGFG APKFP P M+++ + TGK +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168
Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L Y +A+
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAY 225
Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
+T + Y I I+ +++R+M+ G FSA DAD TEG +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278
Query: 326 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
++LG E L+ + Y + GN + + PH F + ++E ++ + +L LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE 334
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
E R KL + R R PH DDKV+ SWN L+I+ A+A+K+ F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
+++ +AE+A F+ RHL + R+ +R G K GF DDYAFLI G L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFNDDYAFLIWGYLEL 429
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
YE G +L A L ELF D GG+F T + ++L+R KE +DGA PSGNS
Sbjct: 430 YEAGFHPSYLQKAKTLCTNMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489
Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
+ + L+RL + + AE +VF+ ++ + + ++P +K +
Sbjct: 490 AAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAHTMP-QKEI 545
Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA- 683
V+ G K D + + A H PA T EH A ++ +F+A
Sbjct: 546 VVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPVELAGIS--DFAAG 591
Query: 684 -----DKVVALVCQNFSCSPPVTDPISLENLL 710
K +C+NF+C P TD N+L
Sbjct: 592 YQMIDGKTTVYICENFACRRPTTDIDEAMNIL 623
>gi|80978835|gb|ABB54669.1| SSP411 [Homo sapiens]
Length = 521
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 196/436 (44%), Positives = 270/436 (61%), Gaps = 30/436 (6%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 87 GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPD 146
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W
Sbjct: 147 VDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWK 206
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 207 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGF 262
Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S +L G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 263 AEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 317
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D FYS + + IL Y+ R + G
Sbjct: 318 HRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSG 377
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLK 343
+SAEDADS G R KEGA+YVWT KEV+ +L E + L +HY L
Sbjct: 378 GFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLT 436
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN S+ DP E +G+NVL +A++ G+ +E +L KLF R
Sbjct: 437 EAGNISPSQ--DPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKH 494
Query: 404 RPRPHLDDKVIVSWNG 419
RP+PHLD K++ +WNG
Sbjct: 495 RPKPHLDSKMLAAWNG 510
>gi|120603287|ref|YP_967687.1| hypothetical protein Dvul_2244 [Desulfovibrio vulgaris DP4]
gi|120563516|gb|ABM29260.1| protein of unknown function DUF255 [Desulfovibrio vulgaris DP4]
Length = 715
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/704 (35%), Positives = 366/704 (51%), Gaps = 57/704 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED VA+ LN+ FV +KVDREERPD+D +YM Q L G GGWPL+
Sbjct: 59 STCHWCHVMAHESFEDAEVAQALNEGFVCVKVDREERPDIDALYMNACQMLTGTGGWPLT 118
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLS 136
+F PD P TY P + GR G ++ +V+D + +R + S A A+ + +
Sbjct: 119 IFALPDGTPFFAATYLPKRSRGGRAGLLDLIPRVRDIYATRRADVEASAADIAKAMRERA 178
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L S + P LR L ++D+ GGFG APKFP P + +L H ++
Sbjct: 179 AELLQSPPDGRTP---AAGTLRAAFNDLVANFDTAHGGFGGAPKFPSPHLLLFLLRHGRR 235
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
D S Q M L TL+ M +GG+ D +GGG HRYS D RW +PHFEKML+DQ
Sbjct: 236 TGD-------SRSQDMALATLRGMLRGGLWDRLGGGIHRYSTDARWLLPHFEKMLHDQAM 288
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + T++ DY+ RDM GG + +AEDADS EG +++EGA
Sbjct: 289 FMLATAETWLATREDDMREAALATADYILRDMALSGGGLAAAEDADSLTPEG--KRREGA 346
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSAS 375
FY +T EV + G++A L + + GN + +G NVL + L D +
Sbjct: 347 FYTFTFDEVREAAGDNADLAVRLFGITGEGNI----ADESTGRREGHNVLHLPLGDD--A 400
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ LG+ ++ + L +R+ R RPH DDK++ WNGL I++ AR +
Sbjct: 401 ATTLGIDADELAFRHDDILAGLRSLRATRRRPHRDDKLLTDWNGLAIAALARCGHV---- 456
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDD 493
F+ P + ++AAS L + T L HS G PGFLDD
Sbjct: 457 -----FDAP----------HLTDAAASLADAVLTLQHTPDGGLLHSRFEGTGSTPGFLDD 501
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP-SVLLRVKE 552
YAF+I GLL+LY + +WL AI LQ+ QD+ FLD GGY++T + P + LR+KE
Sbjct: 502 YAFVIWGLLELYTATNQPQWLEEAIRLQHAQDDRFLDPVDGGYWHTPADAPRTAALRLKE 561
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
DGA PSGN+ +++NL+RLA ++ + Y + A + F ++++ + + C
Sbjct: 562 ARDGALPSGNAAALLNLLRLARLLGDAS---YEEKAHGLIRAFASQVRHNPLGAAMFLCG 618
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT-EEMDFWEEHNS 671
D ++ + V++ G + D E ML A SY N TV+H+ +T E + S
Sbjct: 619 VD-FALTGGRLVIIAGEAQAPDTEAMLDAVRRSYSPN-TVMHLRDGNTAERLAMLAPFTS 676
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISL-ENLLLEKP 714
+ A + K A +CQ+ +CS P+ DP +L E L +P
Sbjct: 677 HLAPI------DGKTTAWLCQDNACSAPIQDPAALAERLAGARP 714
>gi|289582639|ref|YP_003481105.1| hypothetical protein Nmag_2991 [Natrialba magadii ATCC 43099]
gi|448281932|ref|ZP_21473225.1| hypothetical protein C500_05433 [Natrialba magadii ATCC 43099]
gi|289532192|gb|ADD06543.1| protein of unknown function DUF255 [Natrialba magadii ATCC 43099]
gi|445577561|gb|ELY31994.1| hypothetical protein C500_05433 [Natrialba magadii ATCC 43099]
Length = 722
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 242/692 (34%), Positives = 354/692 (51%), Gaps = 51/692 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 55 SACHWCHVMEDESFADEQVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLS 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLS 136
+L+P+ KP GTYFP K G+PGF IL V ++W+ RD + A+ A +
Sbjct: 115 AWLTPEGKPFYVGTYFPKNAKRGQPGFLDILENVTNSWEGDRDEVENRAEQWTDAAKDRL 174
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
E S S+++ P + L A +S D +FGGFGS PKFP+P ++++ +
Sbjct: 175 EETPDSVSASQPPS---SDVLEAAANASLRSADRQFGGFGSDGPKFPQPSRLRVLARAAA 231
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ TG+ + Q + + TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD
Sbjct: 232 R---TGR----DDFQDVFVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNA 284
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
+ +L + T D Y+ + + L ++ R++ G FS DA S + + R +EG
Sbjct: 285 AIPRAFLVGYQQTGDERYAEVVAETLTFVERELTHEEGGFFSTLDAQSEDPDTGER-EEG 343
Query: 316 AFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
+FYVWT EV D+L A LF + Y + +GN F+G N + S
Sbjct: 344 SFYVWTPDEVHDVLENETDADLFCDRYDITESGN------------FEGSNQPNRVASVS 391
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A++ + L R KLF R +RPRP+ D+KV+ WNGL+I++ A A+ +L
Sbjct: 392 DLAAEYDLDATDVRERLESAREKLFAAREQRPRPNRDEKVLAGWNGLMIATCAEAALVLG 451
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
G D EY +A A F+R L+DE RL +++ G+L+D
Sbjct: 452 G------------GEDGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDEDVAIDGYLED 499
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAFL G L YE L +A++L ++ F D + G + T S++ R +E
Sbjct: 500 YAFLARGALGCYEATGEVDHLAFALDLARVIEDEFWDADRGTLYFTPESGESLVTRPQEL 559
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D + PS V+V L+ L V + D + + A L R++ ++ +C AA
Sbjct: 560 GDQSTPSAAGVAVETLLALEGFV--DQGDEFEEIATTVLETHANRIETNSLEHATLCLAA 617
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHNSN 672
D L + + V ++ D + A A L + PA +E++ W +E +
Sbjct: 618 DRLESGALEITV-----AADDLPDEWREAFAGRYLPDRLFARRPATDDELESWLDELDLA 672
Query: 673 NAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
+A A S + VC++ +CSPP D
Sbjct: 673 DAPPIWAGREASDGEPTLYVCRDRTCSPPTHD 704
>gi|46579138|ref|YP_009946.1| hypothetical protein DVU0725 [Desulfovibrio vulgaris str.
Hildenborough]
gi|387152533|ref|YP_005701469.1| hypothetical protein Deval_0667 [Desulfovibrio vulgaris RCH1]
gi|46448551|gb|AAS95205.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
Hildenborough]
gi|311232977|gb|ADP85831.1| hypothetical protein Deval_0667 [Desulfovibrio vulgaris RCH1]
Length = 715
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/704 (35%), Positives = 366/704 (51%), Gaps = 57/704 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED V++ LN+ FV +KVDREERPD+D +YM Q L G GGWPL+
Sbjct: 59 STCHWCHVMAHESFEDAEVSQALNEGFVCVKVDREERPDIDALYMNACQMLTGTGGWPLT 118
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLS 136
+F PD P TY P + GR G ++ +V+D + +R + S A A+ + +
Sbjct: 119 IFALPDGTPFFAATYLPKRSRGGRAGLLDLIPRVRDIYATRRADVEASAADIAKAMRERA 178
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L S + P LR L ++D+ GGFG APKFP P + +L H ++
Sbjct: 179 AELLQSPPDGRTP---AAGTLRAAFNDLVANFDTAHGGFGGAPKFPSPHLLLFLLRHGRR 235
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
D S Q M L TL+ M +GG+ D +GGG HRYS D RW +PHFEKML+DQ
Sbjct: 236 TGD-------SRSQDMALATLRGMLRGGLWDRLGGGIHRYSTDARWLLPHFEKMLHDQAM 288
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + T++ DY+ RDM GG + +AEDADS EG +++EGA
Sbjct: 289 FMLATAETWLATREDDMREAALATADYILRDMALSGGGLAAAEDADSLTPEG--KRREGA 346
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSAS 375
FY +T EV + G++A L + + GN + +G NVL + L D +
Sbjct: 347 FYTFTFDEVREAAGDNADLAVRLFGITGEGNI----ADESTGRREGHNVLHLPLGDD--A 400
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ LG+ E+ + L +R+ R RPH DDK++ WNGL I++ AR +
Sbjct: 401 ATTLGIDAEELAFRHDDILAGLRSLRATRRRPHRDDKLLTDWNGLAIAALARCGHV---- 456
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDD 493
F+ P + ++AAS L + T L HS G PGFLDD
Sbjct: 457 -----FDAP----------HLTDAAASLADAVLTLQHTPDGGLLHSRFEGTGSTPGFLDD 501
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP-SVLLRVKE 552
YAF+I GLL+LY + +WL AI LQ+ QD+ FLD GGY++T + P + LR+KE
Sbjct: 502 YAFVIWGLLELYTATNQPQWLEEAIRLQHAQDDRFLDPVDGGYWHTPADAPRTAALRLKE 561
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
DGA PSGN+ +++NL+RLA ++ + Y + A + F ++++ + + C
Sbjct: 562 ARDGALPSGNAAALLNLLRLARLLGDAS---YEEKAHGLIRAFASQVRHNPLGAAMFLCG 618
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT-EEMDFWEEHNS 671
D ++ + V++ G + D E ML A SY N TV+H+ +T E + S
Sbjct: 619 VD-FALTGGRLVIIAGEAQAPDTEAMLDAVRRSYSPN-TVMHLRDGNTAERLAMLAPFTS 676
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISL-ENLLLEKP 714
+ A + K A +CQ+ +CS P+ DP +L E L +P
Sbjct: 677 HLAPI------DGKTTAWLCQDNACSAPIQDPAALAERLAGARP 714
>gi|448352262|ref|ZP_21541053.1| hypothetical protein C484_22028 [Natrialba taiwanensis DSM 12281]
gi|445631642|gb|ELY84871.1| hypothetical protein C484_22028 [Natrialba taiwanensis DSM 12281]
Length = 717
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 240/693 (34%), Positives = 346/693 (49%), Gaps = 57/693 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF DE VA LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMADESFADEAVAAQLNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
+L+P+ KP GTYFP E K G+PGF IL V ++W+ R+ + Q A A ++L
Sbjct: 113 AWLTPEGKPFYVGTYFPREAKRGQPGFLEILENVTNSWENDREEIETRADQWTAAATDRL 172
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
E A +S ++ L A +S D FGGFGS PKFP+P ++++ +
Sbjct: 173 EETPDAVGASQPPSSDV----LEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---A 225
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ + TG+ E +++ TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD
Sbjct: 226 RAADRTGR----DEFSDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDN 281
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
++ +L + T D Y+ + + LD++ R+++ G FS DA S E R +E
Sbjct: 282 AEIPRAFLLGYQQTGDERYAEVVAETLDFVERELMHEAGGFFSTLDAQSEAPETGER-EE 340
Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
GAFYVWT +V D+L + A LF Y + +GN F+G N +
Sbjct: 341 GAFYVWTPDDVRDVLADETDAELFCSRYDITESGN------------FEGTNQPNRVASI 388
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
A + +P ++ L R F R +RPRP+ D+KV+ WNGL+I++ A A+ +L
Sbjct: 389 DELADRFDLPTDEVEERLDSARETAFQAREQRPRPNRDEKVLAGWNGLMIATCAEAALVL 448
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
G D +Y E+A A +F+R L+D RL +++ G+L+
Sbjct: 449 --------------GKD--DYAEMATDALAFVRDRLWDADEKRLSRRYKDDDVAIDGYLE 492
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL G L YE L +A+EL + F D G + T S++ R +E
Sbjct: 493 DYAFLARGALGCYEATGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQE 552
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D + PS V+V L+ L ++D + + A L RL+ ++ +C A
Sbjct: 553 LGDQSTPSAAGVAVETLLELDGFAG--ETDEFERIATTVLETHANRLETNSLEHATLCLA 610
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEH 669
AD L + + + ++ D AS L + PA +E+ W E
Sbjct: 611 ADRLESGALEVTI-----AADDLPEEFVEPFASRYLPDRLFARRPATDDELAAWLDELEL 665
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
A A K VC++ +CSPP D
Sbjct: 666 MDAPAIWAGREARDGKPTLYVCRDRTCSPPTHD 698
>gi|417784564|ref|ZP_12432270.1| PF03190 family protein [Leptospira interrogans str. C10069]
gi|421127859|ref|ZP_15588077.1| PF03190 family protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|421133342|ref|ZP_15593490.1| PF03190 family protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|409952381|gb|EKO06894.1| PF03190 family protein [Leptospira interrogans str. C10069]
gi|410022350|gb|EKO89127.1| PF03190 family protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|410434326|gb|EKP83464.1| PF03190 family protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
Length = 691
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G SDYYR+ AE F L A++ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KFSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|418670392|ref|ZP_13231763.1| PF03190 family protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|418689642|ref|ZP_13250763.1| PF03190 family protein [Leptospira interrogans str. FPW2026]
gi|418725255|ref|ZP_13283931.1| PF03190 family protein [Leptospira interrogans str. UI 12621]
gi|418729313|ref|ZP_13287860.1| PF03190 family protein [Leptospira interrogans str. UI 12758]
gi|421118286|ref|ZP_15578631.1| PF03190 family protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|421121658|ref|ZP_15581951.1| PF03190 family protein [Leptospira interrogans str. Brem 329]
gi|400361321|gb|EJP17288.1| PF03190 family protein [Leptospira interrogans str. FPW2026]
gi|409961637|gb|EKO25382.1| PF03190 family protein [Leptospira interrogans str. UI 12621]
gi|410010134|gb|EKO68280.1| PF03190 family protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|410345509|gb|EKO96605.1| PF03190 family protein [Leptospira interrogans str. Brem 329]
gi|410753774|gb|EKR15432.1| PF03190 family protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|410775491|gb|EKR55482.1| PF03190 family protein [Leptospira interrogans str. UI 12758]
gi|456824626|gb|EMF73052.1| PF03190 family protein [Leptospira interrogans serovar Canicola
str. LT1962]
Length = 691
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G SDYYR+ AE F L A++ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|294827769|ref|NP_711139.2| hypothetical protein LA_0958 [Leptospira interrogans serovar Lai
str. 56601]
gi|386073252|ref|YP_005987569.1| hypothetical protein LIF_A0779 [Leptospira interrogans serovar Lai
str. IPAV]
gi|293385614|gb|AAN48157.2| conserved protein containing a thioredoxin domain [Leptospira
interrogans serovar Lai str. 56601]
gi|353457041|gb|AER01586.1| conserved protein containing a thioredoxin domain [Leptospira
interrogans serovar Lai str. IPAV]
Length = 714
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 78 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 137
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 138 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 197
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 198 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 255
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 256 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 308
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 309 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 361
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 362 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 409
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 410 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 459
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 460 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 509
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 510 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 567
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G SDYYR+ AE F L A++ P + A
Sbjct: 568 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 622
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 623 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 672
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 673 KFSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 711
>gi|168703256|ref|ZP_02735533.1| hypothetical protein GobsU_27241 [Gemmata obscuriglobus UQM 2246]
Length = 698
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/694 (36%), Positives = 357/694 (51%), Gaps = 62/694 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVME ESFEDE A ++N+ FV IKVDREERPD+D +YMT +Q + GGGWPL
Sbjct: 53 SACHWCHVMEHESFEDEATAAIMNEHFVCIKVDREERPDLDTIYMTALQVMTREGGGWPL 112
Query: 79 SVFLSPDLKPLMGGTYFPPEDKY---GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 135
SVFL+PDLKP GTY+PP+D+Y GRPGFK +L + +AW +RD + + G + L
Sbjct: 113 SVFLAPDLKPFFAGTYYPPDDRYAAQGRPGFKKLLLGIHNAWQTQRDRVHEIGTSVVGDL 172
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+ + + EL A L +SYD RFGGFGS PKFP +E++++L S
Sbjct: 173 QRMGALGDADGPVAPELLAGA----LAALRRSYDPRFGGFGSQPKFPHALELKLLLRLSD 228
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ D MV TL MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD
Sbjct: 229 RFND-------PVALDMVKHTLTTMARGGIYDQLGGGFARYSVDAKWLVPHFEKMLYDNA 281
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
LA+ +A+ T D F+ I R+ LDY+ R+M GG FS +DADS EG +EG
Sbjct: 282 LLASALAEAYQRTGDPFFQQIGRETLDYVVREMWAEGGAFFSTQDADS---EG----EEG 334
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVW+ E+ +LG F + G F+G+N+L +
Sbjct: 335 KFYVWSLDELRAVLGAEDAEFACKVWGATRG-----------GNFEGRNILFRTLSDADE 383
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
G E + L + L+ R+KR P D+K++ +WNGL+I++FA+
Sbjct: 384 GKAHGTSEEAFRARLRAVKDTLYAARAKRVWPGRDEKILTAWNGLMIAAFAQ-------- 435
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
F G D A+ I R + + + P K G+L+DYA
Sbjct: 436 -----FGMATGGEDAACAAVAADH----ILRTMRTADGRLYRTAGVGQPPKLSGYLEDYA 486
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FL L+ LYE KWL A+EL + F D G G+F T + ++ R K+ HD
Sbjct: 487 FLADALVTLYEATFEVKWLRAALELAEALLKHFADPNGPGFFFTADDHEELIARTKDLHD 546
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G+ PSGN+V+V L+RLA++ + D + AE +L + + + A M A D
Sbjct: 547 GSTPSGNAVAVTVLLRLAALT--GRRD-LAEPAERTLRGYRETMAEHPAASGQMLIALDF 603
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
P ++ V +VG + + A A++ + V DPA + A+
Sbjct: 604 HLGPVQQ-VAIVGPEHDQATRRAIEAVRATFGPRRVVAFHDPASGAP-------PAELAT 655
Query: 676 MARNNFSADKVVAL-VCQNFSCSPPVTDPISLEN 708
+ + D V + VC+NF+C P+T ++E+
Sbjct: 656 LFEGKEALDGAVTVYVCENFACRAPLTGAEAIES 689
>gi|386875180|ref|ZP_10117368.1| lanthionine synthetase C-like protein, partial [Candidatus
Nitrosopumilus salaria BD31]
gi|386807022|gb|EIJ66453.1| lanthionine synthetase C-like protein, partial [Candidatus
Nitrosopumilus salaria BD31]
Length = 539
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 214/539 (39%), Positives = 304/539 (56%), Gaps = 49/539 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFE++ VAK +N+ FV+IKVDREERPD+D +Y Q G GGWPLS+
Sbjct: 50 SCHWCHVMAHESFENDEVAKFMNENFVNIKVDREERPDIDDIYQKVCQIATGQGGWPLSI 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD KP GTYFP D YGRPGF +I R++ AW +K + +S E AL
Sbjct: 110 FLTPDQKPFYVGTYFPVLDSYGRPGFGSICRQLSQAWKEKPKDIEKSA----ENFLNALH 165
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + + P +L + L A L + D+ +GGFGSAPKFP I + ++ E T
Sbjct: 166 KTETVHT-PSKLEKIILDEAAMNLFQLGDATYGGFGSAPKFPNAANISFLFRYA---ELT 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G S+ + L TL MAKGGI D +GGGFHRYS D +W VPHFEKMLYD +
Sbjct: 222 G----LSKFNEFALKTLNKMAKGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPVN 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y++A+ +TKD FY + + LD++ R+M P G +SA DADS EG EG FYVW
Sbjct: 278 YVEAYQITKDPFYLEVLQKTLDFVLREMTTPEGGFYSAYDADS---EGV----EGKFYVW 330
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
E+++ILG A +F Y + GN ++G +L + S A G
Sbjct: 331 KKSEIKEILGSDADIFCLFYDVTDGGN------------WEGNTILCNNLNISTVAFNFG 378
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
++ +IL C KL VRS R P LDDK++VSWN L+I++FA+
Sbjct: 379 KSEQEIHDILNSCAEKLLKVRSTRISPGLDDKILVSWNSLMITAFAKG------------ 426
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+ V G R Y+ A+ SFI ++L + +LQ +++N +K G+L+DY++ I+
Sbjct: 427 --YRVTGDQR--YLSAAKDCISFIEKNLLVGE--KLQRTYKNNTAKIDGYLEDYSYFINA 480
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LLD++E S K+L ++ L N E F D + +F T+ +++R K ++D + P
Sbjct: 481 LLDVFEIESDQKYLQLSLNLANYLLEHFWDSDANSFFMTSDNHEKLIIRPKSNYDLSLP 539
>gi|456972139|gb|EMG12591.1| PF03190 family protein [Leptospira interrogans serovar
Grippotyphosa str. LT2186]
Length = 699
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 63 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 294 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 346
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 347 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 394
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 395 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 444
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 445 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 494
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 495 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSSAGVFFDTGIDGEVLLRRSVDGYD 552
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G S+YYR+ AE F L A++ P + A
Sbjct: 553 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 607
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 608 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 657
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 658 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 696
>gi|452857673|ref|YP_007499356.1| Uncharacterized protein yyaL [Bacillus amyloliquefaciens subsp.
plantarum UCMB5036]
gi|452081933|emb|CCP23707.1| Uncharacterized protein yyaL [Bacillus amyloliquefaciens subsp.
plantarum UCMB5036]
Length = 629
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 248/684 (36%), Positives = 357/684 (52%), Gaps = 78/684 (11%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE +A +LND F++IKVDREERPDVD VYM Q + G GGWPL+VF++PD K
Sbjct: 1 MAHESFEDEEIAGILNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP K+ RPGF +L + + + R +E ++E +A
Sbjct: 61 PFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQH--------VEDIAENAAAHLEVKV 112
Query: 148 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
P E L + A+ QL+ +D+ +GGFG APKFP P M+++ + TGK +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168
Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
A G V TL MA GGI DH+G GF RYS D W VPHFEKMLYD L Y +A
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAC 225
Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
+T + Y I I+ +++R+M+ G FSA DAD TEG +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278
Query: 326 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
++LG E L+ + Y + GN + + PH F + ++E ++ + +L LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE 334
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
E R KL + R R PH DDKV+ SWN L+I+ A+A+K+ F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
+++ +AE+A F+ RHL + R+ +R G K GF+DDYAFLI L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLEL 429
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
YE G +L A L + ELF D GG+F T + ++L+R KE +DGA PSGNS
Sbjct: 430 YEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489
Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
+ + L+RL + G S + AE +VF+ ++ + + ++P +K +
Sbjct: 490 AAVQLLRLGRLT-GDIS--LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEI 545
Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA- 683
V+ G K D + + A H PA T EH A ++ +F+A
Sbjct: 546 VVFGRKDDPDRKRFIEALQE---------HFTPAYT---ILAAEHPDELAGIS--DFAAG 591
Query: 684 -----DKVVALVCQNFSCSPPVTD 702
K +C+NF+C P TD
Sbjct: 592 YQLIDGKTTVYICENFACRRPTTD 615
>gi|226356002|ref|YP_002785742.1| hypothetical protein Deide_10920 [Deinococcus deserti VCD115]
gi|226317992|gb|ACO45988.1| conserved hypothetical protein [Deinococcus deserti VCD115]
Length = 696
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 214/549 (38%), Positives = 302/549 (55%), Gaps = 43/549 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE A +N+ FV +KVDREERPDVD VYMT QA+ G GGWP++
Sbjct: 62 STCHWCHVMAHESFEDEATAAQMNEHFVCVKVDREERPDVDAVYMTATQAMTGQGGWPMT 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPP+D YG P F+ +L + +AW R+ L + + + EA
Sbjct: 122 VFLTPDGEPFYAGTYFPPQDGYGLPSFRRLLASIANAWQNDREKLTGNARALTDHIREAS 181
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S LP Q A ++L + +D+ GGFG APKFP P ++ +L
Sbjct: 182 RPRPSQGDLPAGFLQQA----PDKLRRVFDADLGGFGGAPKFPAPTLLEFLLTR------ 231
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
EG+ M L TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD QL
Sbjct: 232 -------PEGRDMALHTLRRMAAGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLTR 284
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
V + A+ T D ++ + R+ L YL R+M+ P G +SA+DAD+ G EG +
Sbjct: 285 VLVQAYQHTDDEDFARLARETLTYLEREMLSPAGGFYSAQDADTPTDHGGV---EGLTFT 341
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPH-NEFKGKNVLIELNDSSASASK 378
WT E+ +LG + L + Y + GN DPH E+ +NVL A
Sbjct: 342 WTPAEIRAVLGGDSALIERVYGVTDQGN-----FLDPHRREYGSRNVLHLPTPLEQLARD 396
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG + + + + + R +L + R +R +P DDKV+ SWNGL +++FA A+++L
Sbjct: 397 LGEDPQAFHSRVDQARARLLEAREQRTQPGTDDKVLTSWNGLALAAFADAARVL------ 450
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
G R Y+E+A A F+RR L L+H+F++G ++ G L+D+A
Sbjct: 451 --------GEPR--YLEIARQNAEFVRRELRLPDG-TLRHTFKDGQARVEGLLEDHALYG 499
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL+ L++ G L WA EL F D + G + +T G+ +L R + D A
Sbjct: 500 LGLVALFQAGGDLGHLEWARELWTLVRRDFWDEDAGVFHSTGGQAEPLLSRQVQGFDSAV 559
Query: 559 PSGNSVSVI 567
S N+ + +
Sbjct: 560 LSDNAAAAL 568
>gi|188585586|ref|YP_001917131.1| hypothetical protein Nther_0959 [Natranaerobius thermophilus
JW/NM-WN-LF]
gi|179350273|gb|ACB84543.1| protein of unknown function DUF255 [Natranaerobius thermophilus
JW/NM-WN-LF]
Length = 686
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 240/690 (34%), Positives = 350/690 (50%), Gaps = 84/690 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED +A +LN F+SIKVDREERPD+D +YM+ QAL G GGWPL+
Sbjct: 56 STCHWCHVMEQESFEDHEIAGILNKNFISIKVDREERPDIDAIYMSACQALTGRGGWPLT 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+ D P GTYFP E++ G PG K IL KV W R L G + +
Sbjct: 116 VFLNHDKNPFYAGTYFPKENRLGMPGLKDILEKVSSKWQNDRYELINIGNEITQAVEHHF 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
A P + + +L + QL +++D +GGFGSAPKFP P + +L YH
Sbjct: 176 FTHA-----PGNVTEESLHIAFSQLEENFDEEYGGFGSAPKFPSPHNLYFLLRYYHL--- 227
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG MV TL M +GGI+DH+G GF RYS D++W VPHFEKMLYD L
Sbjct: 228 --TGNES----ALHMVKKTLTSMYRGGIYDHIGYGFCRYSTDKKWLVPHFEKMLYDNALL 281
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A YL+ + +T++ F+ I ++I Y+ R++ P G +SAEDADS EG +EG F
Sbjct: 282 AIAYLEVYEITRNNFFKEIAQEIFTYVSRELTSPEGGFYSAEDADS---EG----EEGKF 334
Query: 318 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YV+T +EV ++LGE F + Y + GN F+ N + L +
Sbjct: 335 YVFTPQEVIEVLGEVRGQEFCKQYNITANGN------------FEHGNSIPNLIGKNPEK 382
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L +KLF+ R +R P DDK++ SWNGL+I++ A+ S++L E
Sbjct: 383 DEFQKDL-----------KKLFEYREQREHPFKDDKILTSWNGLMIAALAKGSRVLNDE- 430
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y+ +A+S+ FI ++L RL +R+G + PGFLDDYA+
Sbjct: 431 ---------------RYLNMAQSSYRFIEKNLIT-NNQRLLTRYRDGEASIPGFLDDYAY 474
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ GL++LY +L A+ + +LF D++ GG + + +++ R KE D
Sbjct: 475 LVWGLIELYNASFEPYYLEKALIFNDEMIKLFWDQDQGGLYLYGHDSETLVSRPKEIDDS 534
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNSV+ NL+ L + + + + AE + F + + A L
Sbjct: 535 ALPSGNSVATRNLLELFHLTGKTSLE---ELAERQINSFGGSVNKSPIYYTHFLTAV-YL 590
Query: 617 SVPSRKHVVLVG-----HKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
+ + + + +V +SV E ++ H + L EE+
Sbjct: 591 VLTTTEEITVVSDPEPDEATSVLVEALIKGFHPNRFLLVKTEDRKGRQLEEL-------- 642
Query: 672 NNASMARN-NFSADKVVALVCQNFSCSPPV 700
A + N N +K VC++F+C PV
Sbjct: 643 --APIVNNRNQKDNKPTIYVCKDFTCLTPV 670
>gi|302342409|ref|YP_003806938.1| hypothetical protein Deba_0974 [Desulfarculus baarsii DSM 2075]
gi|301639022|gb|ADK84344.1| protein of unknown function DUF255 [Desulfarculus baarsii DSM 2075]
Length = 681
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/688 (36%), Positives = 353/688 (51%), Gaps = 59/688 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFED+ VA LLN +V++KVDREERPD+D +YMT QAL G GGWPL+
Sbjct: 49 TCHWCHVMAHESFEDQAVADLLNQHYVAVKVDREERPDLDAIYMTACQALSGAGGWPLTA 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAFAIEQLSEAL 139
L+PD P + GTYFP + GRPG IL +V W+ +R + Q+G ++++ A+
Sbjct: 109 LLTPDGLPFIAGTYFPKTARLGRPGLLEILAEVARRWNGPERARMIQAG----QEVARAI 164
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A +L AL + QL +S+D +FGGFG APKFP P + +L +
Sbjct: 165 QPQAGPKT---DLDPRALGMAYSQLRQSFDDQFGGFGQAPKFPTPHNLLFLLRWQAR--- 218
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
S+ MV TL MA GG+ D VG GFHRYSVD W PHFEKMLYDQ LA
Sbjct: 219 ----NPGSDALAMVEKTLTAMADGGLFDQVGFGFHRYSVDRPWLTPHFEKMLYDQALLAM 274
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YL+A LT ++ R + Y+ M GP G ++AEDADS EG EG +YV
Sbjct: 275 AYLEAHQLTGREDFAATARQVFTYVLTRMTGPEGGFYAAEDADS---EGV----EGKYYV 327
Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +EV G+ LF + + + GN + S PH + L + A++
Sbjct: 328 WTPQEVLAAAGQADGRLFNDFHGITADGNFEHG-TSIPHR----RQSLADF------ATQ 376
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+ ++ L R L R +R P DDK+I +WNGL+I++ A+A + L EA +
Sbjct: 377 HGLDADQAAQALERARLALLAARQQRIPPLKDDKIITAWNGLMIAALAKAGQALADEALT 436
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
A ++ A + RL S R+G + PGFL+DYAF+I
Sbjct: 437 AAAA-----RAATFILQTARATGG------------RLARSQRDGQASGPGFLEDYAFMI 479
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++L+E L A+EL + ELF D GGYF + + +++R K+D+DGA
Sbjct: 480 WGLIELFEATFELDHLEAALELTDKCCELFWDEADGGYFFSPADGEKLIMRDKDDYDGAT 539
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
P+GNS +NL+RLA + + + Q ++A RL MA ++ A D
Sbjct: 540 PAGNSTMTLNLLRLARLTGRRQLEDMAQQLMQTMAAQTMRLP---MAHTMLLMALDFAQG 596
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P+ K +V+ G K+ + M+A A + + ++ P E + A
Sbjct: 597 PT-KEIVICGAKNDPAAQAMIAKAQQKFIPARALLWRPPEGPEAARL----AALAPFTAG 651
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISL 706
+ A VCQ+ C+ PVTDP L
Sbjct: 652 MTTVGGRATAYVCQDHVCARPVTDPDEL 679
>gi|418701443|ref|ZP_13262368.1| PF03190 family protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|410759525|gb|EKR25737.1| PF03190 family protein [Leptospira interrogans serovar Bataviae
str. L1111]
Length = 691
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/699 (35%), Positives = 361/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASEFSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G SDYYR+ AE F L A+ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALNYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|418710447|ref|ZP_13271218.1| PF03190 family protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|410769383|gb|EKR44625.1| PF03190 family protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
Length = 691
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSSAGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G S+YYR+ AE F L A++ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|418715817|ref|ZP_13275928.1| PF03190 family protein [Leptospira interrogans str. UI 08452]
gi|410788318|gb|EKR82040.1| PF03190 family protein [Leptospira interrogans str. UI 08452]
Length = 691
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++A+ SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAKETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G SDYYR+ AE F L A++ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|428281760|ref|YP_005563495.1| hypothetical protein BSNT_06256 [Bacillus subtilis subsp. natto
BEST195]
gi|291486717|dbj|BAI87792.1| hypothetical protein BSNT_06256 [Bacillus subtilis subsp. natto
BEST195]
Length = 629
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 235/690 (34%), Positives = 354/690 (51%), Gaps = 89/690 (12%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM Q + G GGWPL+VF++PD K
Sbjct: 1 MAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP K+ RPGF +L + + + R+ + A + L +A +
Sbjct: 61 PFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG- 119
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
L ++A+ +QL+ +D+ +GGFG APKFP P M++Y + +TG+
Sbjct: 120 ----LSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHNTGQENALY 172
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
K TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L Y +A+ +
Sbjct: 173 NVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQV 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T++ Y IC I+ +++R+M G FSA DAD TEG +EG +YVW+ +E+
Sbjct: 229 TQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILK 281
Query: 328 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELN------DSSASASK 378
LG+ L+ + Y + GN F+GKN+ LI D+ + +
Sbjct: 282 TLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKWEQIKADAGLTEKE 329
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
L + LE E R++L R +R PH+DDKV+ SWN L+I+ A+A+K+ +
Sbjct: 330 LSLKLE-------EARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ----- 377
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+Y+ +A+ A +FI L + R+ +R+G K GF+DDYAFL+
Sbjct: 378 -----------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFLL 424
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
LDLYE +L A +L + LF D E GG++ T + ++++R KE +DGA
Sbjct: 425 WAYLDLYEASFDLSYLQKAKKLTDDIISLFWDEEHGGFYFTGHDAEALIVREKEVYDGAV 484
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV+ + L+RL + S + AE +VF+ + + +
Sbjct: 485 PSGNSVAAVQLLRLGQVTGDSS---LIEKAETMFSVFKPDIDAYPSGHAFFMQSVLRHLM 541
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P +K +V+ G + ++ ++ N +++ EH +A
Sbjct: 542 P-KKEIVIFGSADDPARKQIITELQKAFKPNDSIL------------VAEHPDQCKDIA- 587
Query: 679 NNFSAD------KVVALVCQNFSCSPPVTD 702
F+AD K +C+NF+C P T+
Sbjct: 588 -PFAADYRIIDGKTTVYICENFACQQPTTN 616
>gi|441505288|ref|ZP_20987276.1| Thymidylate kinase [Photobacterium sp. AK15]
gi|441427143|gb|ELR64617.1| Thymidylate kinase [Photobacterium sp. AK15]
Length = 732
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 242/688 (35%), Positives = 362/688 (52%), Gaps = 59/688 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED VA LLN FV+IKVDREERPD+D+++M Q++ GGGGWPL+
Sbjct: 72 TCHWCHVMERESFEDTEVAALLNRDFVAIKVDREERPDIDQLHMAACQSMTGGGGWPLNC 131
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
L+P+ + TY P + +YGRPG ++ + AW K+RD+L +GA + + +ALS
Sbjct: 132 VLTPEGQVFYATTYLPKQGQYGRPGMMELIPTIALAWQKQRDVLL-NGAIQLNKQLQALS 190
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+++ L + + A L EQ ++D GGFG APKFP P + +L + + T
Sbjct: 191 GVSAAGVLDENIEHQAY-LWFEQ---TFDPEHGGFGDAPKFPLPHQYFFLLRYWYR---T 243
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ S MV +LQ M GG+ DH+G GFHRYS D W VPHFEKMLYDQ L
Sbjct: 244 GQRQALS----MVEESLQAMRLGGLFDHIGYGFHRYSTDNCWLVPHFEKMLYDQSLLLMA 299
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A++ T + FY ++++YL+ M+ P G FSAEDADS EG +EG FY+W
Sbjct: 300 YSEAYAATGNEFYKQTAEEVVEYLKSRMLHPDGGFFSAEDADS---EG----EEGKFYIW 352
Query: 321 TSKEVEDILGEHAILF-KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+E++ +L E + + ++HY + P GN + + G N+L SA K
Sbjct: 353 RYEELKAVLEESELTWLEQHYCIFPQGN----YVDEVSGRMTGANILHLSMHPLVSADKK 408
Query: 380 G------MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
G E + N R+KL+ R +R P LDDKV+ WNGL I++ AR S ++
Sbjct: 409 GKVDHDKATPECWRNQWQLIRQKLYQHRERREHPLLDDKVLSDWNGLTIAALARCSLLI- 467
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
D + +E+A A FIR +L DE +H L +RNG + P LDD
Sbjct: 468 ---------------DSSDCLEMARKAFEFIRLNLVDENSH-LMKRYRNGNAGLPAHLDD 511
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA LI L+L++ +L A+ + F D + G++ T + + +R KE
Sbjct: 512 YASLIWAALELHQATLNNDYLQQALNWTEMAVDKFWDSDNHGFYFTEA-NTDLAVRAKEI 570
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
+DGA PSGN+V NL L + S+ ++ +A F +L L+ A
Sbjct: 571 YDGAIPSGNAVMARNLAFLYRLTGESR---WQTKFNKLIAAFAPQLNRYPAGYTLLLTAV 627
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
D+++ P +H++ G + E++L Y N + ++ D + N+
Sbjct: 628 DLMNSPG-QHLLFSGAGVA---EDILRPLKGKYLPNTLWLAVNDKDRVQGG----KNTAV 679
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVT 701
+ + +FS ++ V CQ+ +C P+T
Sbjct: 680 PASFKLSFSGNEPVLCFCQDSACELPIT 707
>gi|389572654|ref|ZP_10162736.1| yyaL [Bacillus sp. M 2-6]
gi|388427679|gb|EIL85482.1| yyaL [Bacillus sp. M 2-6]
Length = 627
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 240/684 (35%), Positives = 361/684 (52%), Gaps = 77/684 (11%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+ Q + G GGWPL+VF++PD K
Sbjct: 1 MAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNVFVTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS---AS 144
P GTYFP YGRPGF L ++ DA+ RD IE L+E + + +
Sbjct: 61 PFYAGTYFPKRSAYGRPGFIEALTQLLDAYHSDRD--------HIESLAEKATNNLRIKA 112
Query: 145 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
+ + + L Q ++ QL S+D+ +GGFGSAPKFP P M+ + + E TG+
Sbjct: 113 AGQTENTLTQESIHKAYYQLMSSFDTLYGGFGSAPKFPAP---HMLTFLMRYFEWTGQEN 169
Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
K TL MA GGI+DH+G GF RYS DE+W VPHFEKMLYD L + Y +A
Sbjct: 170 ALYAVTK----TLNGMANGGIYDHIGSGFTRYSTDEKWLVPHFEKMLYDNALLIDAYTEA 225
Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
+ +T+ Y + +D++ +++RDM+ G +SA DADS EG KEG +YVWT KE
Sbjct: 226 YQITQHPEYEKLVQDLIQFIKRDMMNRDGSFYSAIDADS---EG----KEGQYYVWTKKE 278
Query: 325 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 383
+ LG+ LF Y++ GN + + PH + +D A+ S +
Sbjct: 279 IMTHLGDDLGTLFCAVYHITEEGNFEGQNI--PH------TISTSFDDIKAAYS---IDD 327
Query: 384 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
+ + L R L VR +RP P +DDKV+ SWN L+IS+ A+A + E
Sbjct: 328 QTLYSKLQSARNILLTVRQQRPAPLIDDKVLTSWNALMISALAKAGSVFHEE-------- 379
Query: 444 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 503
E + +A+ A SF+ HL Q RL +R G K GF++DYA +++ +
Sbjct: 380 --------EAIRMAKQAMSFLETHLV--QQERLMVRYREGDVKHLGFIEDYAHMLTAYMS 429
Query: 504 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 563
LYE WL A + ELF D + GG+F + + ++++R KE +DGA PSGNS
Sbjct: 430 LYEATFDLDWLTKARAVGENMFELFWDEQIGGFFFSGSDAETLIVREKEVYDGAMPSGNS 489
Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCA--ADMLS-VP 619
++ L++L+ ++ RQ+ +L +F D++ + P A +LS
Sbjct: 490 TALQQLLKLSRMIG-------RQDWIETLEKMFSAFYVDVS-SYPSGHTAFLQGLLSQYA 541
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
+++ ++++G K E +L A L K + D T E + + A A++
Sbjct: 542 AKREIIILGKKGDPQKEQLLQA------LQKRFMPFDLILTAETG---QELARLAPFAKD 592
Query: 680 NFSA-DKVVALVCQNFSCSPPVTD 702
+ D +C+N+SC P+T+
Sbjct: 593 YKTINDSTTVYICENYSCRQPITN 616
>gi|417761487|ref|ZP_12409496.1| PF03190 family protein [Leptospira interrogans str. 2002000624]
gi|417772112|ref|ZP_12420002.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
Pomona]
gi|417776397|ref|ZP_12424235.1| PF03190 family protein [Leptospira interrogans str. 2002000621]
gi|418671976|ref|ZP_13233322.1| PF03190 family protein [Leptospira interrogans str. 2002000623]
gi|418680449|ref|ZP_13241698.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
Kennewicki LC82-25]
gi|418703630|ref|ZP_13264514.1| PF03190 family protein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|400327807|gb|EJO80047.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
Kennewicki LC82-25]
gi|409942568|gb|EKN88176.1| PF03190 family protein [Leptospira interrogans str. 2002000624]
gi|409946069|gb|EKN96083.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
Pomona]
gi|410573764|gb|EKQ36808.1| PF03190 family protein [Leptospira interrogans str. 2002000621]
gi|410581098|gb|EKQ48913.1| PF03190 family protein [Leptospira interrogans str. 2002000623]
gi|410766766|gb|EKR37449.1| PF03190 family protein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|455668123|gb|EMF33372.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
Fox 32256]
Length = 691
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G S+YYR+ AE F L A++ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|452972836|gb|EME72663.1| hypothetical protein BSONL12_20380 [Bacillus sonorensis L12]
Length = 627
Score = 377 bits (969), Expect = e-101, Method: Compositional matrix adjust.
Identities = 247/695 (35%), Positives = 360/695 (51%), Gaps = 99/695 (14%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE VA+LLN+ FVSIKVDREERPDVD +YMT Q + G GGWPL+VFL+P+ K
Sbjct: 1 MAHESFEDEEVAQLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLNVFLTPEQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP +Y RPGF +L+++ + K RD + E+ + L A SN
Sbjct: 61 PFYAGTYFPKTSRYNRPGFVEVLKQLSATFAKNRDHVEDIA----EKAANNLRIKAKSNA 116
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEA 206
+ L ++ L+ +QL S+D+ +GGFGSAPKFP P + +L YH SGE
Sbjct: 117 -GEALGEDILKRTYQQLINSFDTAYGGFGSAPKFPIPHMLTFLLRYHQ-------YSGEE 168
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
+ V TL MA GGI+DH+G GF RYS D+ W VPHFEKMLYD L Y +A+
Sbjct: 169 N-ALYSVTKTLDSMANGGIYDHIGYGFARYSTDQEWLVPHFEKMLYDNALLLMAYTEAYQ 227
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
+TK Y I I+ ++RR+M G FSA DAD TEG EG +Y+W+ E+
Sbjct: 228 VTKRERYKRISEQIIAFIRREMTDERGAFFSALDAD---TEGV----EGKYYIWSKDEIT 280
Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA-SKLGMPLE 384
+ LG E L+ C + ++D N F+G N+ + S + +
Sbjct: 281 ETLGDELGSLY-----------CAVYDITDEGN-FEGFNIPNLIYTSFEQVRDEFSLTET 328
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
+ N L R+KLF+ R R PH+DDKV+ SWN L+I+ A+ASK+ ++
Sbjct: 329 ELQNKLEAARQKLFEKRRGRIYPHVDDKVLTSWNALMIAGLAKASKVFEA---------- 378
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
EY+E+A +A SFI L + R+ +R+G K GF+DDYAFL+ L+L
Sbjct: 379 ------PEYLEMARTALSFIEDELI--KDGRVMVRYRDGEVKNKGFIDDYAFLLWSYLEL 430
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
YE L A EL +LF D + GG++ T + ++++R KE +DGA PSGN V
Sbjct: 431 YEASLNLPDLRKAKELAGDMIDLFWDEDHGGFYFTGKDAEALIVRDKEVYDGALPSGNGV 490
Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS---- 620
+ + L RL + L++ + R+ DM A D+ + PS
Sbjct: 491 AAVQLFRLGRLTG-------------DLSLID-RVSDMFSAF-----HGDVSAYPSGHTN 531
Query: 621 -----------RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE--MDFWE 667
+K +V++G + + +N++ A ++ N V+ + D + DF
Sbjct: 532 FLQSLLSQMMPQKEIVILGKRDDPNRQNIIRALQQAFQPNYAVLAAESPDDFKGIADFAA 591
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
++ + + DK +C+NF+C P +
Sbjct: 592 DYKAID----------DKTTVYICENFACQKPTAN 616
>gi|220916114|ref|YP_002491418.1| hypothetical protein A2cp1_1001 [Anaeromyxobacter dehalogenans
2CP-1]
gi|219953968|gb|ACL64352.1| protein of unknown function DUF255 [Anaeromyxobacter dehalogenans
2CP-1]
Length = 718
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 244/632 (38%), Positives = 350/632 (55%), Gaps = 69/632 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F +T R FL +TCHWCHVME ESFEDE +A++LN+ +V+IKVDREERPD
Sbjct: 64 GDEAFEEARRTGRPVFLSVGYSTCHWCHVMERESFEDEEIARVLNERYVAIKVDREERPD 123
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRP--GFKTILRKVKDA 116
VD +YMT VQ L G GGWP+SV+L+PD +P GGTYFPP D P GF +IL ++
Sbjct: 124 VDAIYMTAVQLLTGSGGWPMSVWLTPDREPFFGGTYFPPRDGVRGPARGFLSILHEIAGL 183
Query: 117 WDKKRDML-AQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGG 174
W++ D + + +GA + A ++ ++P P ++A+ L L +S+D R GG
Sbjct: 184 WERDPDRIRSATGALVEAVRTALAPAGPAAAQVPGPEPIEHAVAL----LERSFDERHGG 239
Query: 175 FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 234
APKFP V ++++L H + ++GEA +M TL+ MA GG+HD VGGGFH
Sbjct: 240 LRRAPKFPSNVPVRLLLRHHR------RTGEA-RSLRMATVTLERMAAGGLHDQVGGGFH 292
Query: 235 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 294
RYS D W VPHFEKMLYD LA Y +A+ +T ++ + R LDYL R++ P G
Sbjct: 293 RYSTDAEWLVPHFEKMLYDNALLALAYAEAWQVTGRRDFARVTRQTLDYLLRELTSPEGG 352
Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMS 354
++SA DADS EG +EG F+ WT E+ + LG+ A F + ++P GN
Sbjct: 353 LYSATDADS---EG----EEGRFFTWTEAELREALGDRAEAFLRFHGVRPEGN------- 398
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
F+G++VL + P E L R L+ +R +RPRP D+K++
Sbjct: 399 -----FEGRSVL-----------HVPAPDEDAWEALAPDRAALYALRERRPRPLRDEKIL 442
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
WNGL IS+ A + L +++ A AA F+ L +
Sbjct: 443 AGWNGLAISALAFGGRALAE----------------PRWVDAAARAADFVLTRLVKDG-- 484
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
RLQ S+ G + P +L+D+AFL+ GLLDL+E +WL A EL QD LF D EGG
Sbjct: 485 RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEATFDPRWLAAAAELAGAQDRLFGDPEGG 544
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
G+F + + +L R K HDGAEPSG SV+ +N +RL + + + +R+ A+ +L
Sbjct: 545 GWFQSATDHERLLAREKPTHDGAEPSGASVAALNALRLEAFTSDPR---WRRAADGALRH 601
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
L + +A+ + A D S R+ V++
Sbjct: 602 HARTLAEQPLAMSELLLALDYASDAVREVVLI 633
>gi|197121417|ref|YP_002133368.1| hypothetical protein AnaeK_1004 [Anaeromyxobacter sp. K]
gi|196171266|gb|ACG72239.1| protein of unknown function DUF255 [Anaeromyxobacter sp. K]
Length = 718
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 247/633 (39%), Positives = 350/633 (55%), Gaps = 70/633 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F +T R FL +TCHWCHVME ESFEDE +A++LN+ +V+IKVDREERPD
Sbjct: 64 GDEAFEEARRTGRPVFLSVGYSTCHWCHVMERESFEDEEIARVLNERYVAIKVDREERPD 123
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRP--GFKTILRKVKDA 116
VD +YMT VQ L G GGWP+SV+L+PD +P GGTYFPP D P GF +IL ++
Sbjct: 124 VDAIYMTAVQLLTGSGGWPMSVWLTPDREPFFGGTYFPPRDGVRGPARGFLSILHEIAGL 183
Query: 117 WDKKRDML-AQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGG 174
W++ D + + +GA + A ++ ++P P ++A+ L L +S+D R GG
Sbjct: 184 WERDPDRIRSATGALVEAVRTALAPAGPAAAEVPGPEPIEHAVAL----LERSFDERHGG 239
Query: 175 FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 234
APKFP V ++++L H + ++GE +M TL+ MA GG+HD VGGGFH
Sbjct: 240 LRRAPKFPSNVPVRLLLRHHR------RTGE-ERSLRMATVTLERMAAGGLHDQVGGGFH 292
Query: 235 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 294
RYS D W VPHFEKMLYD LA Y +A+ LT ++ + R LDYL R++ P G
Sbjct: 293 RYSTDAEWLVPHFEKMLYDNALLALAYAEAWQLTGRRDFARVTRQTLDYLLRELTSPEGG 352
Query: 295 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMS 354
++SA DADS EG +EG F+ WT E+ + LG+ A F + ++P GN
Sbjct: 353 LYSATDADS---EG----EEGRFFTWTEAELREALGDRAEAFLRFHGVRPEGN------- 398
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
F+G++VL + P E L R L+ +R +RPRP D+K++
Sbjct: 399 -----FEGRSVL-----------HVPAPDEDAWEALAPDRAALYALRERRPRPLRDEKIL 442
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
WNGL IS+ A + L +++ A AA F+ L +
Sbjct: 443 AGWNGLAISALAFGGRALAE----------------PRWVDAAARAADFVLTRLVKDG-- 484
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
RLQ S+ G + P +L+D+AFL+ GLLDL+E +WL A EL QD LF D EGG
Sbjct: 485 RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEATFDPRWLAAAAELAGAQDRLFGDPEGG 544
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
G+F + + +L R K HDGAEPSG SV+ +N +RL + + + +R+ A+ +L
Sbjct: 545 GWFQSATDHERLLAREKPTHDGAEPSGASVAALNALRLEAFTSDPR---WRRAADGALRH 601
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 627
L + +A+ + A D S R+ VVLV
Sbjct: 602 HARTLAEQPLAMSELLLALDCASDAVRE-VVLV 633
>gi|448397958|ref|ZP_21569896.1| hypothetical protein C476_03843 [Haloterrigena limicola JCM 13563]
gi|445672174|gb|ELZ24751.1| hypothetical protein C476_03843 [Haloterrigena limicola JCM 13563]
Length = 731
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 236/697 (33%), Positives = 347/697 (49%), Gaps = 60/697 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA++LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEAESFADEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVSGQGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
+L+P+ KP GTYFP E K G+PGF + ++ D+W D Q A +
Sbjct: 113 AWLTPEGKPFFIGTYFPREGKRGQPGFLDLCERISDSWASAEDRPEMESRAEQWTDAAKD 172
Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMMLY 192
+L E + A ++ L A+ + +S D R GGFGS+ PKFP+P ++++
Sbjct: 173 RLEETPTEDADTDASAGPPSSEVLETAADAIVRSADRRCGGFGSSGPKFPQPSRLRVLAR 232
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
+ +D E E TL MA GG++DHVGGGFHRY VD W VPHFEKMLY
Sbjct: 233 AHDRTDDETAYREVLEE------TLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLY 286
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D ++ +L + LT + Y+ + D L+++ R++ G FS DA S E R
Sbjct: 287 DNAEIPRAFLAGYQLTGENRYAEVVGDTLEFVERELTHDDGGFFSTLDAQSESPETGER- 345
Query: 313 KEGAFYVWTSKEVEDILGEH---AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
KEGAFYVWT EV D++ EH A LF + Y + +GN F+G++ +
Sbjct: 346 KEGAFYVWTPDEVHDVI-EHEPDAALFCKRYDITESGN------------FEGRSQPNRV 392
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
S A + + L L R++LF+ R +RPRP+ D+K++ WNGL+IS++A A+
Sbjct: 393 TPVSELAVGFDLEESEVLKRLDAIRQRLFEAREERPRPNRDEKILAGWNGLMISTYAEAA 452
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L G D +Y E A A F+R L+D RL ++ G G
Sbjct: 453 LVL--------------GED--DYAETAVDALEFVRDRLWDADEQRLSRRYKGGDVAIDG 496
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
+L+DYAFL G LD Y+ L +A+EL + F D + G + T S++ R
Sbjct: 497 YLEDYAFLARGALDCYQATGEVDHLAFALELARVIEVEFWDADHGTLYFTPASGESLVTR 556
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+E D + PS V+V L+ L ++ + + A L L+ A+ +
Sbjct: 557 PQELSDQSTPSAAGVAVETLLSLDEFA----TEDFEEIAATVLETHANTLEANALEHATL 612
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
C AAD L + + V ++ D S + + P + ++ W +
Sbjct: 613 CLAADRLESGALEVTV-----AADDLPATWRDRFTSRYFPDRLFALRPPTEDGLEAWLDR 667
Query: 670 ----NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
++ R + + VC+N +CSPP D
Sbjct: 668 LDLADAPPIWAGREARDGEPTL-YVCRNRTCSPPTHD 703
>gi|418695562|ref|ZP_13256581.1| PF03190 family protein [Leptospira kirschneri str. H1]
gi|409956647|gb|EKO15569.1| PF03190 family protein [Leptospira kirschneri str. H1]
Length = 711
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 246/696 (35%), Positives = 360/696 (51%), Gaps = 68/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 78 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNL 137
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 138 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 197
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 198 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 255
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 256 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 308
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM GG I SAEDADS EG +EG
Sbjct: 309 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 361
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E +
Sbjct: 362 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 405
Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S K+L+ L + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 406 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 459
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ R++++++AE SFI ++L D + R+ FR G S+ G+ +DYA
Sbjct: 460 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESRILGYSNDYA 508
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +
Sbjct: 509 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 566
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DG EPS NS +LV+L+ + G SD YR+ AE F L A++ P + A
Sbjct: 567 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSSALSYPFLLSAYW 624
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SR+ V++ K+S ++LA + + + ++ + EE +
Sbjct: 625 SYKHHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 675
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + S + VC+NFSC P+ + LE +
Sbjct: 676 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 711
>gi|124504310|gb|AAI28719.1| Spata20 protein [Rattus norvegicus]
Length = 550
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 199/487 (40%), Positives = 288/487 (59%), Gaps = 46/487 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL +TCHWCH+ME ESF++E + LLN+ FVS+ VDREERPD
Sbjct: 90 GQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNENFVSVMVDREERPD 149
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+P L+P +GGTYFPPED R GF+T+L ++ D W
Sbjct: 150 VDKVYMTFVQATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLMRICDQWK 209
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ ++++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 210 QNKNTLLENS----QRVTTALLARSEISVGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 265
Query: 176 GSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
APKFP PV + + + S ++ G S Q+M L TL+ MA GGI DHVG GF
Sbjct: 266 AEAPKFPTPVILNFLFSYWLSHRVTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGF 320
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D +WH+PHFEKMLYDQ QL+ VY AF ++ D F+S + + IL Y+ R++ G
Sbjct: 321 HRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAFQISGDEFFSDVAKGILQYVTRNLSHRSG 380
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLK 343
+SAEDADS G + +EGA Y+WT KEV+ +L E L +HY L
Sbjct: 381 GFYSAEDADSPPERG-VKPQEGALYLWTVKEVQQLLPEPVGGASEPLTSGQLLMKHYGLS 439
Query: 344 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 403
GN + ++ D + E G+NVL +A++ G+ +E +L KLF R
Sbjct: 440 EAGNINPTQ--DVNGEMHGQNVLTVRYSLELTAARYGLEVEAVRALLNTGLEKLFQARKH 497
Query: 404 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 463
R + HLD+K++ +WNGL++S FA A +L E + + A + A F
Sbjct: 498 RLKAHLDNKMLAAWNGLMVSGFAVAGSVLGME----------------KLVTQATNGAKF 541
Query: 464 IRRHLYD 470
++RH++D
Sbjct: 542 LKRHMFD 548
>gi|448345120|ref|ZP_21534020.1| hypothetical protein C485_05016, partial [Natrinema altunense JCM
12890]
gi|445636069|gb|ELY89233.1| hypothetical protein C485_05016, partial [Natrinema altunense JCM
12890]
Length = 589
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 212/568 (37%), Positives = 315/568 (55%), Gaps = 46/568 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF+DE VA+++N+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEEESFQDEAVAEVINENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
+L+P+ KP GTYFP E + G+PGF+ + +++ D+W+ D Q A +
Sbjct: 113 AWLTPEGKPFFIGTYFPREGQRGQPGFRDLCQRISDSWESDADREEMENRAQQWTDAATD 172
Query: 134 QLSEALSASASSN-KLPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMML 191
+L E A+ S + P+ + L A+ + +S D +GGFGS+ PKFP+P ++++
Sbjct: 173 RLEETPDAAGGSPVEAPEPPSSDVLETAADAVVQSADREYGGFGSSGPKFPQPSRLRVL- 231
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
++ + TG+ E +++ TL MA GG+ DHVGGGFHRY VD W VPHFEKML
Sbjct: 232 --ARTYDRTGR----EEYREVFEETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKML 285
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD ++ +L + LT + Y+ + D L ++ R++ G FS DA S E R
Sbjct: 286 YDNAEIPRAFLSGYQLTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSDSPETGER 345
Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EGAFYVWT EV D+L + A LF Y + GN F+G+N +
Sbjct: 346 -EEGAFYVWTPDEVHDVLEDETDAALFCARYDITEAGN------------FEGRNQPNRV 392
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
S A++ + + L L R++LF+ R +RPRP+ D+K++ WNGL+IS++A A+
Sbjct: 393 ARVSELAAQFDLADHEILKRLESARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAA 452
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L G+D +Y + A A F+R L+DE RL +++G K G
Sbjct: 453 LVL--------------GAD--DYADTAVDALGFVRDELWDEDEQRLSRRYKDGDVKIDG 496
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
+L+DYAFL G LD Y+ L +A+EL + F D + G + T +++ R
Sbjct: 497 YLEDYAFLARGALDCYQATGEVDHLAFALELARVIEAEFWDADSGTLYFTPESGEALVTR 556
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVA 577
+E D + PS V+V L+ L A
Sbjct: 557 PQELGDQSTPSATGVAVETLLALDEFAA 584
>gi|45658527|ref|YP_002613.1| hypothetical protein LIC12692 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|45601770|gb|AAS71250.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
Length = 716
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 80 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 139
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 140 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 199
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 200 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 257
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 258 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 310
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 311 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 363
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 364 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 411
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 412 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 461
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 462 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 511
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 512 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYD 569
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G S+YYR+ AE F L A++ P + A
Sbjct: 570 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 624
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 625 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 674
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 675 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 713
>gi|418741789|ref|ZP_13298163.1| PF03190 family protein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
gi|410751237|gb|EKR08216.1| PF03190 family protein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
Length = 688
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 245/696 (35%), Positives = 360/696 (51%), Gaps = 68/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM GG I SAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ G+ + L ++ + + GN F+GKN+L E +
Sbjct: 339 FYIWDLEEFREVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 382
Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S K+L+ +L + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 383 SNFTEEESKHLDGVLTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 436
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ R++++++AE SFI ++L D + R+ FR G S G+ +DYA
Sbjct: 437 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 485
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +
Sbjct: 486 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 543
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DG EPS NS +LV+L+ + G SD YR+ AE F L A++ P + A
Sbjct: 544 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYW 601
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SR+ V++ K+S ++LA + + + ++ + EE +
Sbjct: 602 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 652
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + S + VC+NFSC P+ + LE +
Sbjct: 653 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 688
>gi|421085457|ref|ZP_15546310.1| PF03190 family protein [Leptospira santarosai str. HAI1594]
gi|421103567|ref|ZP_15564164.1| PF03190 family protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410366530|gb|EKP21921.1| PF03190 family protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410432093|gb|EKP76451.1| PF03190 family protein [Leptospira santarosai str. HAI1594]
Length = 691
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 339 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 387 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 436
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 437 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 486
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 487 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYD 544
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G S+YYR+ AE F L A++ P + A
Sbjct: 545 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 599
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 600 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 649
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 650 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 688
>gi|448301393|ref|ZP_21491386.1| hypothetical protein C496_17562 [Natronorubrum tibetense GA33]
gi|445584129|gb|ELY38453.1| hypothetical protein C496_17562 [Natronorubrum tibetense GA33]
Length = 788
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 231/689 (33%), Positives = 347/689 (50%), Gaps = 51/689 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA LLN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 116 SACHWCHVMEDESFADEEVADLLNENFVPIKVDREERPDVDSIYMTVAQLVTGRGGWPLS 175
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+L+P KP GTYFP E K G+PGF +L ++ ++W++ RD + + + L
Sbjct: 176 AWLTPQGKPFYVGTYFPKEAKRGQPGFLDVLEQLANSWEQDRDEVENRAQQWTDAAKDRL 235
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
+ S + L A+ +S D + GGFGS PKFP+P + ++ ++ +
Sbjct: 236 EETPDSVAQAEPPSSEVLTTAADAALRSADRQHGGFGSGGPKFPQPSRLHVL---ARAYD 292
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ + ++++ +L MA GG++DHVGGGFHRY VD W VPHFEKMLYD ++
Sbjct: 293 RTGR----EQFREVLEESLDAMAAGGLYDHVGGGFHRYCVDADWTVPHFEKMLYDNAEIP 348
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+L + LT D Y+ + + L+++ R++ G FS DA S +G K+EG FY
Sbjct: 349 RAFLAGYQLTGDDRYAEVTAETLEFVDRELTHEEGGFFSTLDAQSKTEDG--EKEEGVFY 406
Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT E+ ++L E A LF Y + +GN F+G N + A
Sbjct: 407 VWTPDEISEVLEEETDAELFCARYDITESGN------------FEGTNQPNRVRSIPDLA 454
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ + + L R+ LF+ R +RPRP+ D+KV+ SWNGL+I++ A A+ +L
Sbjct: 455 DEFDLAEDDTEQRLESARKALFEARERRPRPNRDEKVLASWNGLLINTCAEAALVL---- 510
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
G D EY E+ A F+R L+D RL +++G K G+L+DYAF
Sbjct: 511 ----------GED--EYAEMGVDALDFVRERLWDADEGRLARRYKDGDVKVDGYLEDYAF 558
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L G L YE L +A++L T + F D E G + T S++ R +E D
Sbjct: 559 LARGALRCYEATGDVDHLAFALDLARTIEAEFWDEERGTLYFTPESGESLVTRPQELDDQ 618
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
+ PS V++ L+ L A + + A L R++ ++ +C AAD L
Sbjct: 619 STPSATGVALETLLALDGFAADEN---FEKIASTVLETHANRIEANSLQHASLCLAADRL 675
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNN 673
+ + + + + + + AA + + + P E ++ W E
Sbjct: 676 EAGALE-ITIAADELPAAWRDRFAAEYRP----DRLFALRPPTAEGLESWLEQLGLEEAP 730
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
A A + VC++ +CSPP D
Sbjct: 731 AIWAGREARDGEPTLYVCRDRTCSPPTHD 759
>gi|456984461|gb|EMG20516.1| PF03190 family protein [Leptospira interrogans serovar Copenhageni
str. LT2050]
Length = 699
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 250/699 (35%), Positives = 362/699 (51%), Gaps = 74/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 63 TCHWCHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM G I SAEDADS EG +EG
Sbjct: 294 FLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGL 346
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E S
Sbjct: 347 FYIWDLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFT 394
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L+K L + + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 395 EEELKQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------- 444
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ R++++++AE SFI ++L D R+ FR G S G+ +DYA
Sbjct: 445 ---------IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAE 494
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +D
Sbjct: 495 MIASSIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYD 552
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPS NS +LVRL+ + G S+YYR+ AE F L A++ P + A
Sbjct: 553 GVEPSANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA--- 607
Query: 616 LSVPSRKH----VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
S KH +VL+ K+S + ++MLA + + + + ++ + EE
Sbjct: 608 --YWSYKHHFREIVLI-RKNSEEGKDMLAWIQSRFLPDSVLAVVNEDELEEA-------R 657
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ + S + VC+NFSC PV + LE +
Sbjct: 658 KLSSLFDSRDSGGNALVYVCENFSCKLPVDNVSDLEKCM 696
>gi|418686893|ref|ZP_13248057.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
str. Moskva]
gi|410738600|gb|EKQ83334.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
str. Moskva]
Length = 713
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 245/696 (35%), Positives = 360/696 (51%), Gaps = 68/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 80 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 139
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 140 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 199
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 200 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 257
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 258 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 310
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM GG I SAEDADS EG +EG
Sbjct: 311 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 363
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ G+ + L ++ + + GN F+GKN+L E +
Sbjct: 364 FYIWDLEEFREVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 407
Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S K+L+ +L + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 408 SNFTEEESKHLDGVLTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 461
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ R++++++AE SFI ++L D + R+ FR G S G+ +DYA
Sbjct: 462 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 510
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +
Sbjct: 511 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 568
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DG EPS NS +LV+L+ + G SD YR+ AE F L A++ P + A
Sbjct: 569 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYW 626
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SR+ V++ K+S ++LA + + + ++ + EE +
Sbjct: 627 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 677
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + S + VC+NFSC P+ + LE +
Sbjct: 678 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 713
>gi|398339915|ref|ZP_10524618.1| hypothetical protein LkirsB1_10954 [Leptospira kirschneri serovar
Bim str. 1051]
Length = 696
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 246/696 (35%), Positives = 358/696 (51%), Gaps = 68/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 63 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM GG I SAEDADS EG +EG
Sbjct: 294 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 346
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E +
Sbjct: 347 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 390
Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S K+L+ L + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 391 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 444
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ R++++++AE SFI ++L D + R+ FR G S G+ +DYA
Sbjct: 445 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 493
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +
Sbjct: 494 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 551
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DG EPS NS +LV+L+ + G SD YR+ AE F L A+ P + A
Sbjct: 552 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALNYPFLLSAYW 609
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SR+ V++ K+S ++LA + + + ++ + EE +
Sbjct: 610 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 660
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + S + VC+NFSC P+ + LE +
Sbjct: 661 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 696
>gi|335427892|ref|ZP_08554812.1| hypothetical protein HLPCO_03015 [Haloplasma contractile SSD-17B]
gi|334893818|gb|EGM32027.1| hypothetical protein HLPCO_03015 [Haloplasma contractile SSD-17B]
Length = 682
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 227/686 (33%), Positives = 352/686 (51%), Gaps = 70/686 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE +++LLN F+SIKVDREERPD+D +YM QAL G GGWPL+
Sbjct: 53 STCHWCHVMERESFEDEEISELLNKDFISIKVDREERPDIDHIYMEVCQALTGRGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++ D KP GTYFP + G +L + W +D + S + L++
Sbjct: 113 IVMTADKKPFYAGTYFPKTTVGKQLGLTQLLPTITKQWKSNKDKILDSATEIYDVLNKYR 172
Query: 140 SASAS-SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
S KL ++ +N + L ++D+ +GGFG+APKFP P + +L++
Sbjct: 173 EEQESVRGKLSLDVVENLFK----NLRGAFDNLYGGFGTAPKFPSPHNLLFLLHY----- 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G + MV TL+ M KGGI+DH+G GF RYSVD +W VPHFEKMLYD L
Sbjct: 224 --GYINNNQDAVFMVERTLEQMYKGGIYDHIGYGFSRYSVDRKWLVPHFEKMLYDNALLT 281
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y++A+ L D Y + + L+Y+ R M G ++AEDADS EG +EG FY
Sbjct: 282 LAYIEAYQLKNDPLYKQVVEETLEYVSRVMTDKEGGFYTAEDADS---EG----EEGKFY 334
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSD-PHNEFKGKNVLIELNDSSASA 376
+T E++++L E A E+Y + GN + + + + H ++ ++L+D
Sbjct: 335 TFTKNEIKELLDKEDATFIIEYYNISEEGNFERTNILNLIHKDY------LDLDDKERER 388
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
L + + +LF+ R KR PH DDK++ SWN ++I+++ARA ++L ++A
Sbjct: 389 -------------LNKIKERLFNYRDKRVHPHKDDKILTSWNAMMITAYARAGRVLNNDA 435
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y+ A+ FI HL DE R+Q +R+G +K G++DDYA+
Sbjct: 436 ----------------YINKAKQGVQFISDHLIDENG-RIQARYRDGEAKFKGYIDDYAY 478
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L L++L+ S ++ A++L + ELF D E G++ + +L+R KE +DG
Sbjct: 479 LNWALIELFLGTSDQTYIHQALKLTDDMIELFWDDEKDGFYYYGNDSEYLLMRNKEIYDG 538
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGNS++ +N ++L+ I K Y + A F ++K + M
Sbjct: 539 AIPSGNSIATMNFIKLSEITDEIK---YEKYARKLFDAFAYKVKQSPSSHSYMLNTYLHA 595
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
S P K V++ H E +H L +I + + + ++ N +
Sbjct: 596 SHPKTKVVIVGKHDDPKLKEIKRKISHHYLPLGTVLILYKDLVSADDPIFGDYLVENKDI 655
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTD 702
A +CQ++SC P+ D
Sbjct: 656 A----------CYICQDYSCDEPIYD 671
>gi|381206676|ref|ZP_09913747.1| hypothetical protein SclubJA_13745 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 693
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 244/696 (35%), Positives = 363/696 (52%), Gaps = 66/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED A LN FV++KVDREERPD+D+V+M + AL GGWPL++
Sbjct: 52 TCHWCHVMERESFEDLETADYLNRNFVAVKVDREERPDIDQVFMDALHALGEQGGWPLNM 111
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F +PD +P GGTYFPP+ YGR F+ IL ++ W +++ + ++ +Q++ L
Sbjct: 112 FATPDGRPFTGGTYFPPKPMYGRQSFRQILESLRYYWQEEKAKIHETA----DQVTAYLR 167
Query: 141 ASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFG--SAPKFPRPVEIQMML-YHSKK 196
+ + L + LPQ N + + +++DS GGF KFP + +Q++L YH +
Sbjct: 168 RAPAPQPLDEPLPQWNCVEETVQAYRQAFDSEDGGFALQRPNKFPPSMGLQLLLRYHLRT 227
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
MV TL M GGI+D VGGG RYS D RW VPHFEKMLYD
Sbjct: 228 --------RIPSDLFMVELTLFKMRNGGIYDQVGGGLCRYSTDYRWLVPHFEKMLYDNAL 279
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
A L+ F +T + FY I DI Y+ RDM+ SAEDADS EG EG
Sbjct: 280 FAQTSLECFQVTSNPFYREIAEDIFQYVTRDMMAESSAFCSAEDADS---EG----HEGL 332
Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY+WT+ E + + +++ ++ + P GN F+G+N+L +
Sbjct: 333 FYLWTADEFKKTVEDKYSDSLANYWNVTPQGN------------FEGRNILNVSQSTKVF 380
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
+LG+ ++ I+ R L DVR++R RP DDK++VSWN L+ISSFA+A++IL
Sbjct: 381 GEQLGLEENEWQTIIKSARSNLQDVRAQRIRPLKDDKILVSWNALMISSFAQAARIL--- 437
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ EY A +A +FI HL + Q RL +R+G +K P +L DYA
Sbjct: 438 -------------EHNEYGITANNALAFIEEHLIN-QEGRLLRRYRDGDAKFPAYLSDYA 483
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
L LD+Y + ++++ A N + LFL+ + G YF T + VL+R + +D
Sbjct: 484 QLGLACLDIYAWNYEPQYVLKAHHWANEINRLFLNPD-GAYFETGFDAEEVLVRKADGYD 542
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
G EPSGN+ + + ++LAS GS ++AE L F L + M A +
Sbjct: 543 GVEPSGNTSTALLFLKLASFGMGSG---LLRDAERILHSFSPHLHQAGVNFSAMLNAL-I 598
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ +V+ G +S+++ + +L S+ L + V+ P+D + S
Sbjct: 599 WARKGGTEIVVSGDESNLETKEVLQWLRQSF-LPEVVVAFIPSDD------PDPVSQQIP 651
Query: 676 MARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
+A S D +++ VCQ C PV D SL+ L+
Sbjct: 652 IAEGRASLDERLLIHVCQGQLCHAPVQDLPSLKKLI 687
>gi|429193250|ref|YP_007178928.1| thioredoxin domain-containing protein [Natronobacterium gregoryi
SP2]
gi|448324467|ref|ZP_21513897.1| hypothetical protein C490_03868 [Natronobacterium gregoryi SP2]
gi|429137468|gb|AFZ74479.1| thioredoxin domain protein [Natronobacterium gregoryi SP2]
gi|445618899|gb|ELY72451.1| hypothetical protein C490_03868 [Natronobacterium gregoryi SP2]
Length = 741
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 241/704 (34%), Positives = 357/704 (50%), Gaps = 64/704 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT + G GGWPLS
Sbjct: 53 SACHWCHVMEEESFADEAVAEVLNENFVPIKVDREERPDVDSIYMTVCNLVTGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
+L+P+ KP GTYFP E K G+PGF +L + ++W+ R+ + Q A +QL
Sbjct: 113 AWLTPEGKPFYVGTYFPTEAKRGQPGFLDVLENITNSWENDREEVENRADQWTEAARDQL 172
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
E + A S D + L A+ +S D ++GGFGS PKFP+P +Q++ +
Sbjct: 173 EE--TPGAPSPGAADPPSSDLLERAADASLRSADRQYGGFGSDGPKFPQPSRLQVL---A 227
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ + TG E ++++ TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD
Sbjct: 228 RAYDRTGD----EEYRQVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDN 283
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
++ +L + LT + Y+ + + L ++ R++ G FS DA S + E R +E
Sbjct: 284 AEIPRAFLAGYQLTGEERYAEVVHETLAFVDRELTHEDGGFFSTLDAQSEDPETGER-EE 342
Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
G FYVWT EV D+L + A LF HY + +GN F+G N +
Sbjct: 343 GTFYVWTPAEVHDVLADETDADLFCAHYDITASGN------------FEGANQPNRVRSI 390
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ A + + + L + R++LF+ R KRPRP+ D+KV+ WNGL+I++ A A+ L
Sbjct: 391 ADLAGEFDLAEHEVKQRLEDARQQLFETREKRPRPNRDEKVLAGWNGLMIATCAEAALTL 450
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
E Y E+A A F+R L+D++ RL ++ G+L+
Sbjct: 451 GEE----------------RYAEMAVDALEFVRDRLWDDEEGRLSRRYKGEDVAIEGYLE 494
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL G L YE L +A+EL +E F D + G + T S++ R +E
Sbjct: 495 DYAFLARGALGCYEATGEVDHLAFALELGRAIEEEFWDADRGTLYFTPESGESLVTRPQE 554
Query: 553 DHDGAEPSGNSVSVINLVRLASIVA--GSKSDY---------YRQNAEHSLAVFETRLKD 601
D + PS V+V L+ L GSKS Y + A L+ RL+
Sbjct: 555 LGDQSTPSSAGVAVEILLALEKFAGSEGSKSPRGDGEVADADYEEIAATVLSTHANRLEA 614
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
++ +C AAD L + + V ++ + A A+ ++ P +
Sbjct: 615 NSLQHATLCLAADHLESGALEVTV-----TADELPEEWREAFATQYFPDRLLARRPTTDD 669
Query: 662 EMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVTD 702
+++ W + S A+ A + VC++ +CSPP D
Sbjct: 670 DLEAWLDRLSLAAAPPIWAGREARDGEPTLYVCRDRTCSPPTHD 713
>gi|421131211|ref|ZP_15591395.1| PF03190 family protein [Leptospira kirschneri str. 2008720114]
gi|410357462|gb|EKP04717.1| PF03190 family protein [Leptospira kirschneri str. 2008720114]
Length = 696
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 245/696 (35%), Positives = 359/696 (51%), Gaps = 68/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 63 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 122
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 123 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 182
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 183 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 240
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 241 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 293
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM GG I SAEDADS EG +EG
Sbjct: 294 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGL 346
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ G+ + L ++ + + GN F+GKN+L E +
Sbjct: 347 FYIWDLEEFREVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 390
Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S K+L+ L + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 391 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 444
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ R++++++AE SFI ++L D + R+ FR G S G+ +DYA
Sbjct: 445 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 493
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +
Sbjct: 494 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 551
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DG EPS NS +LV+L+ + G SD YR+ AE F L A++ P + A
Sbjct: 552 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYW 609
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SR+ V++ K+S ++LA + + + ++ + EE +
Sbjct: 610 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 660
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + S + VC+NFSC P+ + LE +
Sbjct: 661 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 696
>gi|386392363|ref|ZP_10077144.1| thioredoxin domain-containing protein [Desulfovibrio sp. U5L]
gi|385733241|gb|EIG53439.1| thioredoxin domain-containing protein [Desulfovibrio sp. U5L]
Length = 704
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 251/701 (35%), Positives = 343/701 (48%), Gaps = 67/701 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE +A L+ V++KVDREERPD+D +YMT+ QAL G GGWPL+
Sbjct: 51 STCHWCHVMEHESFEDEDIAALMRATVVAVKVDREERPDLDNLYMTFCQALTGRGGWPLN 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFP E +GR G + +L++V AW R + + ++ + + L
Sbjct: 111 VFLTPDGRPFFAGTYFPKESGFGRTGMRELLQRVHMAWTSNRQAVIGNATQILDAVRDQL 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A + + E Q L +L+ ++D+ GGFG APKFP P + +L ++
Sbjct: 171 EARDAGEAV--EPGQAQLGAARNELAAAFDTANGGFGGAPKFPSPHNLLFLLREYRR--- 225
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ + MV TL M +GG+ D +G G HRYS D RW VPHFEKMLYDQ A
Sbjct: 226 TGQ----EDNLAMVTATLDAMRRGGVFDQIGLGLHRYSTDARWFVPHFEKMLYDQALTAM 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+A+ T D + +I +Y+RRD+ GP G +SAEDADS EG EG FYV
Sbjct: 282 AATEAYLATGDAGLRRMAMEIFEYVRRDLTGPDGAFYSAEDADS---EGV----EGRFYV 334
Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT E+ +L G+ A LF + Y + P GN + + G N+ +A A K
Sbjct: 335 WTESEIRAVLPGDEAGLFMDVYGIAPGGNFH----DEATGQATGANIPFLEEPIAAVAGK 390
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G + L R L R KR RP DDKV+ NGL+I++ A+A++
Sbjct: 391 RGQEPAELAARLERSRELLLAARQKRVRPLCDDKVLTDMNGLMIAALAKAARAF------ 444
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
D +E A+ A+ F+ + + RL H R G + G LDDYAFL
Sbjct: 445 ----------DDEELAGRAKRASDFLLGKMLLPDS-RLLHRLRLGEAAVSGMLDDYAFLA 493
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL+LY+ +L A+ L F D GG F T + ++LLR K +D A
Sbjct: 494 WGLLELYQTVFDPAYLAQAVALAKAMVRHFGD-AAGGLFLTPDDGEALLLRQKTYYDAAI 552
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA--------MAVPLMC 610
PSGNSV+ + L L YR E S TRL A
Sbjct: 553 PSGNSVAFLVLTTL-----------YRLTGEKSFMEEATRLARAAGPWLAGHPSGFTFFL 601
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
C + PS V + G + D + + A Y L + + + PA E
Sbjct: 602 CGLSQMLAPS-AEVTIAGDPDAPDTQALARALFERY-LPEVAVVLRPAGG------EPDI 653
Query: 671 SNNASMARNNFS-ADKVVALVCQNFSCSPPVTDPISLENLL 710
A R D+ A VC+ SC PP TDP ++ LL
Sbjct: 654 VALAPFTRFQLPMGDRAAAHVCRAGSCQPPTTDPAAMLALL 694
>gi|302390271|ref|YP_003826092.1| hypothetical protein Toce_1734 [Thermosediminibacter oceani DSM
16646]
gi|302200899|gb|ADL08469.1| conserved hypothetical protein [Thermosediminibacter oceani DSM
16646]
Length = 670
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 242/687 (35%), Positives = 349/687 (50%), Gaps = 90/687 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE V +LN ++VSIKVDREE PDVD YM QAL G GGWPL+
Sbjct: 56 STCHWCHVMEKESFEDEEVGNILNRYYVSIKVDREEHPDVDNFYMEVCQALTGSGGWPLT 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD P+ TY P ED YGRPG KT+L K+ + W K R+ L +G + + +
Sbjct: 116 IIMTPDKHPVFAATYLPKEDSYGRPGLKTVLFKINELWQKDRERLITTGREIVSSIKKLE 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 197
EL + E L SYD ++GGF APKFP P + +L YH +K
Sbjct: 176 RTGHG------ELDPGVIDKAFEILKASYDRKYGGFFGAPKFPMPGTLLFLLGYYHYRK- 228
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
E +MV TL+ M KGGI+DH+G G RYS D RW VPHFEKMLYD +
Sbjct: 229 --------DPEALEMVENTLKNMYKGGIYDHIGFGLCRYSTDRRWLVPHFEKMLYDNALV 280
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ V +A+ + +D F+ +I+DY+ R++ P G ++AEDADS EG +EG F
Sbjct: 281 SFVCAEAYKIARDEFFKTFALEIIDYVLRNLRNPEGGFYTAEDADS---EG----EEGRF 333
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
Y WT +E+ +LG+ A F E Y + GN F+GKN+ +
Sbjct: 334 YTWTPQEIRHVLGDRADEFMESYNITERGN------------FEGKNI----------PN 371
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+G L ++ + R+KLF+ R +R +P D+K++VS N L+I+S R I K+E
Sbjct: 372 LIGRDLSCKMD--EDTRKKLFEYREQRVKPFRDEKILVSGNSLMIASLFRVYGITKNE-- 427
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y + AE A +FI + RL +R G KA DDY+ L
Sbjct: 428 --------------NYRKEAEVALNFILENARGSDG-RLHVGYREGIMKAKATFDDYSHL 472
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LL+ YE+ T +L A L + +LF D+E GG++ T + + R K+ +DGA
Sbjct: 473 LWALLEAYEYTLETSYLKKAKSLADEMIDLFYDKEAGGFYLTGSDVDHLPARAKDAYDGA 532
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNS++ +L RL+ ++ S + + A + VF + + + + + +
Sbjct: 533 VPSGNSMAAFSLARLSRLLFDSGME---ELARNQYRVFARTISENPVYHTFFLYSF-IYA 588
Query: 618 VPSRKHVVLVGHKSSVDFENMLAA---AHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
V V++ G + + F N LA +A + + I PA +E +
Sbjct: 589 VTGGTEVIIAGERPEM-FTNYLAENFFPYAVWAHADRLKEIVPA-------YENYGKIGG 640
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVT 701
A A VC+N SC PVT
Sbjct: 641 RTA----------AYVCKNGSCKSPVT 657
>gi|150016393|ref|YP_001308647.1| hypothetical protein Cbei_1515 [Clostridium beijerinckii NCIMB
8052]
gi|149902858|gb|ABR33691.1| protein of unknown function DUF255 [Clostridium beijerinckii NCIMB
8052]
Length = 680
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 250/719 (34%), Positives = 356/719 (49%), Gaps = 83/719 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVM ESFEDE +A ++ND F++IKVDREERPD
Sbjct: 32 GDEAFAKAKEEDKPIFLSIGYSTCHWCHVMAHESFEDEEIAGIMNDSFIAIKVDREERPD 91
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYMT QAL G GGWPL+V ++PD KP GTYFP + KY PG IL + W
Sbjct: 92 IDSVYMTVCQALTGHGGWPLTVIMTPDQKPFFAGTYFPKKAKYNMPGLMDILNSINKQWK 151
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+D L SG + +L S KL + +N Q+ +++ ++GGFG A
Sbjct: 152 DNKDKLISSGDSILSELGGYFDGETSKLKLTSKTLKNGYN----QILHAFEEKYGGFGDA 207
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P I M L K K+ E +E TL M +GGI DH+G GF RYS
Sbjct: 208 PKFPTP-HITMFLLRYYKSHKEIKALEMAEK------TLISMYRGGIFDHIGFGFSRYST 260
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D +W VPHFEKMLYD L YL+ + +TK+ Y + +L+Y+ R++ G + A
Sbjct: 261 DNKWLVPHFEKMLYDNALLVISYLEGYEVTKNEIYKEVATKVLEYVFRELTSKNGGFYCA 320
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG +EG +YV+ E+ +LGE F +++ + GN
Sbjct: 321 EDADS---EG----EEGKYYVFEPLEILSVLGEEDGTYFNDYFDITSDGN---------- 363
Query: 358 NEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
F+GK++ LI+ + S ++ + E+ L RS R H DDK++
Sbjct: 364 --FEGKSIPNLIKNKNFHKSDDRIKLLSEQILQ-----------YRSDRTELHKDDKILT 410
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
SWNGL+I++ +A K+++ E Y E A+ A FI +L DE R
Sbjct: 411 SWNGLMIAALGKAYKVIEDE----------------RYFEYAKKAVEFIFNNLMDENK-R 453
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L +R+ S+ +LDDYAFL GL++LYE ++L AIE+ LF D E G
Sbjct: 454 LLARYRDKDSRHKAYLDDYAFLCFGLIELYESSYDIEFLNKAIEINKDMINLFWDNEKDG 513
Query: 536 YFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
+F GED L+ R KE DGA PSGNSV+ NL++LA + + + AE
Sbjct: 514 FF-LYGEDSEKLIARPKELFDGAMPSGNSVAAYNLIKLARLTGDLTLE---EMAEKQFDF 569
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
+ + + AA S++ V + K + L + ++L T+I
Sbjct: 570 ICGSVFNEEINHSFFLMAASFALNESQELVCVTNDKGEEEKIKDLLSERPIFNLT-TIIK 628
Query: 655 IDPADTEEMD---FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
D E D F +E++ N +K +C+ SC PV D L +L
Sbjct: 629 NDENRNEIEDLAPFLKEYDLIN----------EKSTYYLCKGKSCMAPVNDIDELRKML 677
>gi|435846903|ref|YP_007309153.1| thioredoxin domain protein [Natronococcus occultus SP4]
gi|433673171|gb|AGB37363.1| thioredoxin domain protein [Natronococcus occultus SP4]
Length = 732
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 244/702 (34%), Positives = 361/702 (51%), Gaps = 66/702 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEEESFADEEVAEVLNEEFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+L+P+ KP GTYFP K G+PGF ++ + D+W R+ IE +E
Sbjct: 113 AWLTPEGKPFYVGTYFPKHSKRGQPGFLDLIEGLADSWKTDRE--------EIENRAEEW 164
Query: 140 SASASS--NKLPDEL------PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 190
+A+A+ + PD + + L A+ +S D + GGFGS PKFP+P ++++
Sbjct: 165 TAAATDRLEETPDSIGAAEPPSSDVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL 224
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
++ + TG+ E ++++ +L M +GG++DHVGGGFHRY VDE W VPHFEKM
Sbjct: 225 ---ARAYDRTGR----DEYREVLEGSLTAMIEGGLYDHVGGGFHRYCVDEDWTVPHFEKM 277
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD ++ L + LT D Y+ RD L+++ R++ G FS DA S E
Sbjct: 278 LYDNAEIPRALLAGYQLTGDERYADSVRDTLEFVSRELTHAEGGFFSTLDAQS-EDPATG 336
Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
++EGAF+VWT EV ++LG+ A LF Y + +GN F G+N
Sbjct: 337 EREEGAFFVWTPAEVREVLGDETDAELFCARYDITESGN------------FGGQNQPNV 384
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
+ S A + + E L + R +LF+ R +RPRP+ D+KV+ SWNGL+I++ A A
Sbjct: 385 VASISELAERFDLAAETVEQRLEDARAELFEAREERPRPNRDEKVLASWNGLMIATCAEA 444
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
L G DR Y +A A F+R L+D + RL F++G
Sbjct: 445 GLAL--------------GEDR--YAGMAVDALEFVRDRLWDAEEGRLSRRFKDGDVAVQ 488
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
G+L+DYAFL G L YE + L +A+EL + F D E + T S++
Sbjct: 489 GYLEDYAFLARGALGCYEATGEVEHLAFALELARVIEAEFYDAERETIYFTPESGESLVT 548
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA-----EHSLAVFET---RLK 600
R +E +D + PS V+V L+ L AG S R++ E + +V T RL+
Sbjct: 549 RPQELNDQSTPSATGVAVETLLALDGF-AGEGSTSPREDGDAEFEEIAASVLRTHAGRLE 607
Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
A+ +C AAD L + + V + + ++ A+ + L + +
Sbjct: 608 SNALQHATLCLAADRLESGALE-VTVAADEVPAEWRAAFASRYLPDRLFAPRPPTEDGLS 666
Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
E +D E ++ R + + VC+N +CSPP D
Sbjct: 667 EWLDELELESAPTIWAGREARDGEPTL-YVCRNRTCSPPTHD 707
>gi|239608009|gb|EEQ84996.1| DUF255 domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 823
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 235/621 (37%), Positives = 331/621 (53%), Gaps = 61/621 (9%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CHVME ESF VA +LN F+ IK+DREERPD+D+VYM YVQA G GGWPL+VFL+P
Sbjct: 66 CHVMEKESFMSPEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTP 125
Query: 85 DLKPLMGGTYFPPEDKYGRPG--------FKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
DL+P+ GGTY+P P F IL K++D W ++ +S +QL
Sbjct: 126 DLEPVFGGTYWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQTQQLRCRESAKDITKQLR 185
Query: 137 E-ALSASASSNKLPDELPQNALRLCA---EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
E A + S K D + L + + +D GGF APKF P + ++
Sbjct: 186 EFAEEGTHSKQKAADADEDLEVELLEESYQHFASRFDPVNGGFSRAPKFATPANLSFLIN 245
Query: 193 HSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
S+ + D E S +M TL M++GGIHD +G GF RYSV W +PHFEK
Sbjct: 246 LSRYPSAVSDIVGYDECSRALEMATKTLISMSRGGIHDQIGHGFARYSVTADWSLPHFEK 305
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEG 308
MLYDQ QL NVY+DAF + DI Y+ ++ P G +S+EDADS T
Sbjct: 306 MLYDQAQLLNVYVDAFDSAHNPELLGAIYDIATYITSPPILSPTGGFYSSEDADSLPTPS 365
Query: 309 ATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
T K+EGAFYVWT KE + ILG+ A + H+ + P GN ++R +DPH+EF +NVL
Sbjct: 366 DTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGN--VARGNDPHDEFINQNVLS 423
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFA 426
+ A + G+ E+ + I+ R KL + R SKR RP LDDK+IVSWNGL I + A
Sbjct: 424 IKVTPAKLAKEFGLSEEEVVKIIKASREKLREYRESKRVRPGLDDKIIVSWNGLAIGALA 483
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-S 485
+ S +L++ V + +E+ AE+AA FIR++L+D + +L +R+G
Sbjct: 484 KCSVVLEN----------VDRAKAQEFRLAAENAAKFIRQNLFDPASGQLWRIYRDGERG 533
Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-------------- 531
PGF DDY++L SGL+DLYE +L +A +LQ + FL +
Sbjct: 534 DTPGFADDYSYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYFLAQGPTPTPSPRTSITT 593
Query: 532 -------EGGGYFNT------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
GY+ T P+ L R+K D + PS N V NL+RL++++
Sbjct: 594 ESTPAPSSSTGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSPNGVIAQNLLRLSTLL-- 651
Query: 579 SKSDYYRQNAEHSLAVFETRL 599
+ D Y++ A ++ F +
Sbjct: 652 -EDDTYKRLARETVNAFAVEI 671
>gi|328950404|ref|YP_004367739.1| hypothetical protein Marky_0883 [Marinithermus hydrothermalis DSM
14884]
gi|328450728|gb|AEB11629.1| protein of unknown function DUF255 [Marinithermus hydrothermalis
DSM 14884]
Length = 667
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 229/577 (39%), Positives = 311/577 (53%), Gaps = 57/577 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVM ESFED VA+LLN FV +KVDREERPD
Sbjct: 27 GEEAFARAQQEGKPIFLSVGYATCHWCHVMARESFEDPEVARLLNAHFVPVKVDREERPD 86
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD YM +QAL G GGWP+S+FL+P+ KP GGTYFPP D+YG P F+ +L V +AW
Sbjct: 87 VDHAYMQALQALTGQGGWPMSLFLTPEGKPFYGGTYFPPTDRYGLPSFRRVLEAVAEAWT 146
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K+R+ + A +++++AL + LP +L AL E +++D + GGFG A
Sbjct: 147 KRRNEIETHAAALAQRIAQAL--TNRPGDLPPQLHAKAL----EAYRQAFDPQHGGFGGA 200
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP ++ +L + GEA+ G+ M+ TL M GG++D VGGGFHRY+V
Sbjct: 201 PKFPNAPALRYLLLQAWL-------GEAAAGE-MLRVTLDRMQAGGVYDQVGGGFHRYAV 252
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W VPHFEKMLYD QLA VYL AF L D Y R+ LDYL R+M G ++A
Sbjct: 253 DAVWRVPHFEKMLYDNAQLARVYLGAFRLFGDARYRRTARETLDYLLREMQDAAGGFYAA 312
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
+D AE+EG +EG +YVW E+ +LG ++ + GN
Sbjct: 313 QD---AESEG----EEGRYYVWRIPELRAVLGADFEAAARYFGVSDAGN----------- 354
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
++GKN+L A +LG+ + L + +L + R +R RP DDK++ WN
Sbjct: 355 -WEGKNILEARYPEPLLAQELGLDAAGFEAWLASVKARLLEARLRRVRPLTDDKILADWN 413
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL +++FA A + L G R Y+E A A F+ LY Q L+H
Sbjct: 414 GLALAAFAEAGRWL--------------GEAR--YLEAARKNAEFVLGALY--QDGLLRH 455
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
++R G +L D A GLL L+E +WL A L E F D E GG+F+
Sbjct: 456 AWRRGRLGRHAYLSDQAHYGLGLLALFEATGEMRWLEAARVLAEGILEHFRDPE-GGFFD 514
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASI 575
+P L R K+ DGA PSGN+ + LVRLA +
Sbjct: 515 ALEANP--LGRPKDVFDGAWPSGNAAAAELLVRLARL 549
>gi|403389033|ref|ZP_10931090.1| hypothetical protein CJC12_14629 [Clostridium sp. JC122]
Length = 593
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 219/593 (36%), Positives = 325/593 (54%), Gaps = 60/593 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM E FED+ VAK+LND F+SIKVDREERPDVD +YMT QA GGGGWPL++
Sbjct: 55 TCHWCHVMAHECFEDDEVAKILNDNFISIKVDREERPDVDSIYMTVCQAFTGGGGWPLNL 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++PD KP GTYFP KY PGF IL + D W ++ + + I QL A
Sbjct: 115 FITPDQKPFYAGTYFPKHAKYNVPGFMDILSSISDQWKSDKERIIDASEEVINQLENAFQ 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + +++ ++ + C E +D GGF APKFP P ++ +L + KLE+
Sbjct: 175 PTTTDDEIGKDIIEGGYLWCLE----FFDVVNGGFDKAPKFPTPHKLMFLLKYY-KLENE 229
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
K+ E MV TL M +GGI DH+G GF RYS D++W VPHFEKMLYD L
Sbjct: 230 PKALE------MVEKTLNQMYRGGIFDHIGYGFSRYSTDDKWLVPHFEKMLYDNALLTMA 283
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL+ +S+TK FY + +DY+ R++ G + A+DADS EG EG FYV+
Sbjct: 284 YLETYSITKKEFYKNVAIKTMDYVLRELTSDEGGFYCAQDADS---EG----DEGKFYVF 336
Query: 321 TSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
E+ ++LGE F ++ + +GN F+GK++ L ++S
Sbjct: 337 NPLEICEVLGEDDGKYFNNYFDITTSGN------------FEGKSIANLLKNNSFENDD- 383
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
EK + + R+K+F+ R +R H D+K++ SWN L+I++FA+A ILK E
Sbjct: 384 ----EK----INDLRKKVFNYRLERTTLHKDEKILTSWNALMITAFAKAYSILKDE---- 431
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+Y++V + A +FI +L + + +RL +++G +L+DYAFLI
Sbjct: 432 ------------KYLKVCKDAIAFIENNLVN-KDNRLLARYKDGDVAYFSYLEDYAFLIW 478
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
++LYE + ++L AI L + + F D G+F + ++ R KE +DGA P
Sbjct: 479 SFIELYEGTNEKEYLEKAISLNSEMIDKFWDENSSGFFLYGKDSEKLIARPKEIYDGAIP 538
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
SGNSV+ LV+L+ I +K + + L F + +K+ ++ + A
Sbjct: 539 SGNSVAAYVLVKLSKI---TKDKILKDITYNQLKYFSSTVKNSPISYTMYLIA 588
>gi|410462713|ref|ZP_11316275.1| thioredoxin domain containing protein [Desulfovibrio magneticus
str. Maddingley MBC34]
gi|409984165|gb|EKO40492.1| thioredoxin domain containing protein [Desulfovibrio magneticus
str. Maddingley MBC34]
Length = 697
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 251/694 (36%), Positives = 353/694 (50%), Gaps = 53/694 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE +A L+N VSIKVDREERPD+D +YM+ AL G GGWPL+
Sbjct: 52 STCHWCHVMERESFEDEDIAALMNAVAVSIKVDREERPDLDTLYMSVCHALTGRGGWPLT 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFP E YGR G + +L++V +W R + + ++ + E L
Sbjct: 112 VFLTPDKEPFFAGTYFPKESAYGRTGLRELLQRVHMSWKGNRQAVVNNAGQIMDAVREQL 171
Query: 140 SASASSNKL-PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+A+A + P E +A R QLS +D+R GGFG APKFP P + +L +
Sbjct: 172 TAAAGAASAEPGEAVLDAAR---AQLSGIFDARNGGFGGAPKFPSPHNLLFLLREYR--- 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++G+AS + MV TL M +GG++DHVG G HRY+ D +W +PHFEKMLYDQ
Sbjct: 226 ---RTGDAS-CRDMVCRTLDAMRRGGVYDHVGFGLHRYATDAQWFLPHFEKMLYDQALTV 281
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++A+ + D + + +IL+Y+RRD+ P G SAEDADS EG EG FY
Sbjct: 282 MACVEAYQASGDAAHKTMALEILEYVRRDLTSPEGLFHSAEDADS---EGV----EGKFY 334
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VW++ E+ +LG+ A L GN + E G N+L +A++
Sbjct: 335 VWSAAELRRLLGDEAALVMAAMGATEEGNAH----DEATGETTGSNILHLPRPLDETAAQ 390
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG+ +E L ECRR L R KR RP DDKV+ NGL++++ A+A++ E +
Sbjct: 391 LGLTVEALTTRLEECRRILLVEREKRVRPLCDDKVLTDNNGLMLAALAKAARAFDDEELA 450
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ AES + + R RL H R+G + GFLDDY FL
Sbjct: 451 G------------RAVTAAESLLTRLTR-----PNGRLLHRLRDGEAAIDGFLDDYVFLA 493
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++LY+ T +L A+ L + F D GG+F T + +L+R K D A
Sbjct: 494 WGLVELYQTVFDTAYLHRAVALLRAVADHFADPAEGGFFVTPDDGEQLLVRQKVFFDAAV 553
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADMLS 617
PSGNSV+ L L + + +++ A RL D A C + +L
Sbjct: 554 PSGNSVAYFVLTTLFRL---TGDPVFKEQATALARAMAPRLADHAAGHAFFLCGLSQVLG 610
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
PS V L G + D + + A Y L + + + P D E A
Sbjct: 611 KPS--EVTLAGDPAGPDTQALARAVFGRY-LPEVAVVLRP------DEGEPDIVALAPFT 661
Query: 678 RNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
R D + A VC+ SC P D ++ LL
Sbjct: 662 RYQLPLDGRTAAHVCRAGSCQPATADVETMLKLL 695
>gi|383625377|ref|ZP_09949783.1| hypothetical protein HlacAJ_18680 [Halobiforma lacisalsi AJ5]
gi|448700355|ref|ZP_21699463.1| hypothetical protein C445_15926 [Halobiforma lacisalsi AJ5]
gi|445779895|gb|EMA30810.1| hypothetical protein C445_15926 [Halobiforma lacisalsi AJ5]
Length = 746
Score = 374 bits (960), Expect = e-100, Method: Compositional matrix adjust.
Identities = 241/699 (34%), Positives = 346/699 (49%), Gaps = 60/699 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA LLND FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 57 SACHWCHVMEEESFADEDVADLLNDHFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLS 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA----FAIEQL 135
+L+P+ KP GTYFP E K G+PGF IL V D+W+ R+ + A ++L
Sbjct: 117 AWLTPEGKPFYVGTYFPKESKRGQPGFVDILENVIDSWETDREEIENRAQKWTDAARDEL 176
Query: 136 SEAL------SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQ 188
E A+ + + P + L A+ +S D +GGFGS PKFP+P ++
Sbjct: 177 EETPGTGGPGDAAVAESTEPTPPSSDLLETTADAAVRSADRGYGGFGSDGPKFPQPSRLR 236
Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
++ S + TG GE ++++ TL MA GG++DHVGGGFHRY VD W VPHFE
Sbjct: 237 VLARASDR---TG--GETY--REVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFE 289
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYD ++ +L + LT D Y+ + + L ++ R++ G F+ DA S + E
Sbjct: 290 KMLYDNAEIPRAFLTGYRLTGDDRYAEVVEETLAFVDRELTHDEGGFFATLDAQSEDPET 349
Query: 309 ATRKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
R +EGAFYVWT EV D+L + A LF E Y + +GN F+G+N
Sbjct: 350 GER-EEGAFYVWTPDEVRDVLEDETDAELFCERYDITASGN------------FEGENQP 396
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
+ + A + + L + R +LF R +RPRP+ D+KV+ WNGL+I++ A
Sbjct: 397 NRVRSVADLAESFDLEESEVRERLADARERLFAAREERPRPNRDEKVLAGWNGLMIATCA 456
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
A+ L G D EY +A A F+R L+D RL +++
Sbjct: 457 EAAMTL--------------GED--EYATMAVDALEFVRERLWDADERRLSRRYKDDDVA 500
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
G+L+DYAFL G L Y+ L +A++L + F D E G + T +
Sbjct: 501 IDGYLEDYAFLARGALACYQATGDVDHLAFALDLAREIEGEFWDEEAGTLYFTPESGEDL 560
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
+ R +E D + PS V+V L+ L S V + Y + AE L RL+ +
Sbjct: 561 VTRPQELGDQSTPSAAGVAVETLLALESFVPDAD---YAELAETVLGTHVDRLEGSPLQH 617
Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
+C AD L + + V + + ++ A H +I P + ++ W
Sbjct: 618 ATLCLGADRLESGALE-VTVAAEEVPDEWREAFATGH----YPDRLIARRPPTEDGLEAW 672
Query: 667 EEH---NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ A D+ VC+ +CSPP D
Sbjct: 673 LDRLGLEDAPPIWAGREARDDEPTLYVCRGRTCSPPTHD 711
>gi|297566141|ref|YP_003685113.1| hypothetical protein [Meiothermus silvanus DSM 9946]
gi|296850590|gb|ADH63605.1| protein of unknown function DUF255 [Meiothermus silvanus DSM 9946]
Length = 665
Score = 374 bits (960), Expect = e-100, Method: Compositional matrix adjust.
Identities = 227/583 (38%), Positives = 309/583 (53%), Gaps = 62/583 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED A+LLN++FV +KVDREE PDVD VYM +QAL G GGWP+S+
Sbjct: 49 TCHWCHVMERESFEDPETAQLLNEFFVPVKVDREELPDVDHVYMMALQALTGSGGWPMSL 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PDLKP GGTYFPPED++G P F +L+ + W +R+ + S + L + L
Sbjct: 109 FLTPDLKPFYGGTYFPPEDRHGLPSFARVLKTIASTWQNRREEVLGSADELTQHLHKLL- 167
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
LP +L AL+ QL++++D+ GGFG APKFP+ + +L + K +
Sbjct: 168 -VPRGGPLPQDLHAQALK----QLARAHDATHGGFGGAPKFPQAPTLTYLLALAWKGDPL 222
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
M+ TL MA+GGI+D VGGGFHRY+VD W VPHFEKMLYD QLA V
Sbjct: 223 AWG--------MLELTLDKMAEGGIYDQVGGGFHRYAVDGIWRVPHFEKMLYDNAQLAWV 274
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL LT Y + + LDYL R+M P G +SA+DADS EG EG FYVW
Sbjct: 275 YLGMSRLTGKTLYRRVTLETLDYLLREMQHPEGGFYSAQDADS---EGV----EGKFYVW 327
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+ +EV +LG A + + + GN ++G NVL A +LG
Sbjct: 328 SEQEVRAVLGSDAEAALKLFGVSQAGN------------WEGVNVLEARYPEPALRQELG 375
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ + L E + KL+ R +R P DDK++ WNGL + +FA A +IL EA
Sbjct: 376 LDEATFARWLEEVKAKLYQARRQRIPPLTDDKILADWNGLALRAFAAAGRILGKEA---- 431
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
Y+E A A F+ + + L+HS+R G + +L D A G
Sbjct: 432 ------------YLEAARKNAEFVTSRMMRDGL--LRHSWRGGKLRPEAYLSDQASYGLG 477
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LL+ Y+ +WL A L F D GG+F+ +G + LR K+ DG P
Sbjct: 478 LLETYQATGEMRWLEAARTLAEGILTHFRD-PNGGFFDASGG--GLPLRAKDVFDGPYPG 534
Query: 561 GNSVSVINLVRLASI--------VAGSKSDYYRQNAEHSLAVF 595
GNS + L+RLA++ A +++ Q HS + F
Sbjct: 535 GNSAAAELLIRLAALYEREDWAEAARGAIEFHAQGLAHSPSAF 577
>gi|421092713|ref|ZP_15553445.1| PF03190 family protein [Leptospira borgpetersenii str. 200801926]
gi|410364564|gb|EKP15585.1| PF03190 family protein [Leptospira borgpetersenii str. 200801926]
gi|456889958|gb|EMG00828.1| PF03190 family protein [Leptospira borgpetersenii str. 200701203]
Length = 700
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 252/713 (35%), Positives = 362/713 (50%), Gaps = 71/713 (9%)
Query: 10 TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
TK R LI TCHWCHVME ESFE++ VA LN FVSIKVDREERPD+D++Y
Sbjct: 46 TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 105
Query: 64 MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
M + A+ GGWPL++FL+PD KP+ GGTYFPPE YGR F +L ++ W +KR
Sbjct: 106 MDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKVWSEKRQE 165
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS--APK 180
L + + L ++ A + LP ++ YD+ FGGF + K
Sbjct: 166 LIVASSELSRYLKDSGEGRAIEKQEEGSLPSKDCFNFGFSLYESYYDAEFGGFKTNHVNK 225
Query: 181 FPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
FP + + +L YH S + +MV TL M +GGI+D VGGG RYS D
Sbjct: 226 FPPSMGLSFLLRYH--------HSSGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTD 277
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
RW VPHFEKMLYD ++ ++K + D++ YL RDM GG I SAE
Sbjct: 278 HRWMVPHFEKMLYDNSLFLETLVECSQVSKKISAESFALDVISYLHRDMRIVGGGICSAE 337
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
DADS EG +EG FY+W +E ++ GE + + ++ + + GN
Sbjct: 338 DADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTNKGN------------ 378
Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
F+GKN+L E A+KL K ++ +L R KL + RSKR RP DDK++ SWN
Sbjct: 379 FEGKNILHE--SYGGEATKLSEEEWKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWN 436
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL I + A+A + R++++++AE SFI R+L D R+
Sbjct: 437 GLYIKALAKAG----------------IAFRREDFLKLAEETYSFIERNLIDPDG-RILR 479
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
FR+G S G+ +DYA +IS + L+E G G ++L A+ LF R G F
Sbjct: 480 RFRDGESGILGYSNDYAEMISSSIVLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 537
Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
TG D VLLR D +DG EPS NS +LV+L+ + G S YR+ AE + F
Sbjct: 538 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTK 595
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
L +++ P + A S K +VL+ K + +++LAA + + ++
Sbjct: 596 ELSTHSLSYPHLLSAYWTYRYHS-KEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNE 653
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ EE +++ + S + VC+NFSC PV++ L+ +
Sbjct: 654 NELEEA-------RKLSALFDSRDSGGNALVYVCENFSCKLPVSNLADLQKWI 699
>gi|397690129|ref|YP_006527383.1| Thioredoxin domain protein [Melioribacter roseus P3M]
gi|395811621|gb|AFN74370.1| Thioredoxin domain protein [Melioribacter roseus P3M]
Length = 690
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 241/681 (35%), Positives = 347/681 (50%), Gaps = 72/681 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFEDE VA+LLN F+SIKVDREERPD+D +YM Q + G GGWPLS+
Sbjct: 67 TCHWCHVMAHESFEDEEVAELLNKNFISIKVDREERPDIDSIYMASCQLITGRGGWPLSI 126
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD KP GTYFP YGR GF +L ++ D W+K R++L ++ +++
Sbjct: 127 FLTPDGKPFYAGTYFPKYSYYGRIGFVDLLNRIIDLWNKDRNVLLRTSDEITAAINKHFE 186
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+SA D + A E L ++D +GGFGSAPKFP P + +L + D
Sbjct: 187 SSAKE-AFDDSVVDKAF----ETLKLNFDPEYGGFGSAPKFPSPHNLLFLLDRNNPQAD- 240
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+MV TL M KGGI D +G GFHRYS D +W +PHFEKM+YDQ L
Sbjct: 241 ----------EMVQKTLTEMRKGGIFDQLGFGFHRYSTDGKWFLPHFEKMIYDQASLIEA 290
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y AF+ T D Y+ +I ++++ +M G +SA DADS EG +EG FY+W
Sbjct: 291 YAYAFAKTGDALYADTINEIYEFIKNEMTSHEGAFYSALDADS---EG----EEGKFYLW 343
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
TS+E+ + G+ + KE + GN ++ + GKN+L K G
Sbjct: 344 TSEEIRSVAGDDYEIAKEIFNFTDEGN----HRNESNGNSTGKNILFLRKRPDKLYEKYG 399
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
KY +I R L + R KR P D+K++ WN +VISS A A I++++ A
Sbjct: 400 RS--KYDSI----RINLLEARKKRIPPMRDEKILTDWNAMVISSLANAGSIIENDDMVAW 453
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
AE A + +H + L H N + GFLDDYA+LI
Sbjct: 454 ----------------AERAYQCLMKHAF--VNGELYHYPENNIT---GFLDDYAYLIKA 492
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LDLY ++L A+EL + E F D+ EGG +FN G + +RVK+ +DGA P
Sbjct: 493 ALDLYRATLNEEYLFNALELNDLLSENFEDKSEGGYFFNKAGANT---IRVKDAYDGAVP 549
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNS+ + NL+ L + G+ S YR +AE+S+ F + L ++ L
Sbjct: 550 SGNSIQLSNLIELY-FITGNNS--YRLSAENSIKTFSSGLNKSSIGYTYFLRGIKKLYSK 606
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
+++ G K+ +F L+ + DL +H+ + E + +
Sbjct: 607 DTSLLLIAGKKTGREF---LSRLRKNTDL--YYLHVAEDNVERLI------KRAPWIEIY 655
Query: 680 NFSADKVVALVCQNFSCSPPV 700
++K V +C++F+C P
Sbjct: 656 KLDSEKTVYYLCRDFTCGIPT 676
>gi|448318308|ref|ZP_21507834.1| hypothetical protein C492_17600 [Natronococcus jeotgali DSM 18795]
gi|445599332|gb|ELY53367.1| hypothetical protein C492_17600 [Natronococcus jeotgali DSM 18795]
Length = 721
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 244/711 (34%), Positives = 355/711 (49%), Gaps = 65/711 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF DE VA+LLN+ FV IKVDREERPDVD +YMT Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFADEEVAELLNEEFVPIKVDREERPDVDSIYMTVCQLVSGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+ KP GTYFP K G+PGF +L + D+W+ R+ IE +E
Sbjct: 113 VWLTPEGKPFYVGTYFPKRSKRGQPGFLDLLEGLADSWETDRE--------EIENRAEEW 164
Query: 140 SASASS--NKLPDEL------PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 190
+A+A + PD + L A+ +S D + GGFGS PKFP+P ++++
Sbjct: 165 TAAARDRLEETPDSIGAAEPPSSEVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL 224
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
++ + TG E ++++ +L M +GG++DHVGGGFHRY VD W VPHFEKM
Sbjct: 225 ---ARAFDRTGN----DEYREVLEGSLTAMIEGGLYDHVGGGFHRYCVDADWTVPHFEKM 277
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD ++ L + LT D Y+ R+ L+++ R++ G FS DA S + E
Sbjct: 278 LYDNAEIPRALLAGYRLTGDERYADYVRETLEFVSRELTHAEGGFFSTLDAQSEDPETGE 337
Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
R +EGAFYVWT EV D+LG A LF Y + +GN F+G++
Sbjct: 338 R-EEGAFYVWTPAEVRDVLGSETDADLFCARYDITESGN------------FEGQSQPNL 384
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
S A + + + L RR+LF+ R +RPRP+ D+KV+ WNGL+I++ A A
Sbjct: 385 AASISELADRFDLEEREVEERLESARRELFEAREERPRPNRDEKVLAGWNGLMIATCAEA 444
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
+ L G DR Y +A A F+R L++ RL F++G
Sbjct: 445 ALAL--------------GEDR--YAGMAVDALEFVRDRLWNADEGRLSRRFKDGDVAVQ 488
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
G+L+DYAFL G L YE L +A+EL + F D E G + T S++
Sbjct: 489 GYLEDYAFLARGALGCYEATGEVDHLAFALELARAIEAEFYDAERGTLYFTPESGESLVT 548
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R +E +D + PS V+V L+ L + + D + + A L RL+ A+
Sbjct: 549 RPQELNDQSTPSATGVAVETLLALGDVAG--EDDGFEEIATSVLRTHAGRLESNALEHAT 606
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+C AAD L V + + + + + L + P + ++ W +
Sbjct: 607 LCLAADRLEA-GPLEVTVAAEEVPAAWRERFGSRY----LPDRLFAPRPPTEDGLESWLD 661
Query: 669 H---NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+ A A + VC+N +CSPP D + L E +S
Sbjct: 662 ELGLEAAPAIWAGREARDGEPTLYVCRNRTCSPPTRDVDEALDWLAESEAS 712
>gi|421108799|ref|ZP_15569331.1| PF03190 family protein [Leptospira kirschneri str. H2]
gi|410006082|gb|EKO59855.1| PF03190 family protein [Leptospira kirschneri str. H2]
Length = 688
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 245/696 (35%), Positives = 358/696 (51%), Gaps = 68/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 115 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 175 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 232
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 233 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 285
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM GG I SAED+DS EG +EG
Sbjct: 286 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDSDS---EG----EEGL 338
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E +
Sbjct: 339 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 382
Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S K+L+ L + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 383 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 436
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ R++++++AE SFI ++L D + R+ FR G S G+ +DYA
Sbjct: 437 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 485
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +
Sbjct: 486 EMIASSIVLFEAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 543
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DG EPS NS +LV+L+ + G SD YR+ AE F L A+ P + A
Sbjct: 544 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSSALIYPFLLSAYW 601
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SR+ V++ K+S ++LA + + + ++ + EE +
Sbjct: 602 SYKHHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 652
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + S + VC+NFSC P+ + LE +
Sbjct: 653 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 688
>gi|407980032|ref|ZP_11160833.1| thioredoxin [Bacillus sp. HYC-10]
gi|407413294|gb|EKF35013.1| thioredoxin [Bacillus sp. HYC-10]
Length = 627
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 240/684 (35%), Positives = 357/684 (52%), Gaps = 77/684 (11%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+ Q + G GGWPL+VF++PD K
Sbjct: 1 MAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNVFVTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS---AS 144
P GTYFP YGRPGF L ++ DA+ RD IE L+E + + +
Sbjct: 61 PFYAGTYFPKRSAYGRPGFIEALTQLLDAYHNDRD--------HIESLAEKATNNLRIKA 112
Query: 145 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 204
+ + + L Q ++ QL S+D+ +GGFGSAPKFP P M+ + + E TG+
Sbjct: 113 AGQTENTLTQESIHKAYYQLMSSFDTLYGGFGSAPKFPAP---HMLSFLMRYFEWTGQEN 169
Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
K TL MA GGI+DH+G GF RYS DE+W VPHFEKMLYD L + Y +A
Sbjct: 170 ALYAVTK----TLNGMANGGIYDHIGSGFTRYSTDEKWLVPHFEKMLYDNALLIDAYTEA 225
Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
+ +T+ Y + +D++ +++RDM+ G +SA DADS EG KEG +YVWT +E
Sbjct: 226 YQITQHPEYEKLVQDLIQFIKRDMMNRDGSFYSAIDADS---EG----KEGQYYVWTKEE 278
Query: 325 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 383
+ LG+ LF Y++ GN + + PH + +D A+ S L
Sbjct: 279 IMTHLGDDLGTLFCAVYHITEEGNFEGQNI--PH------TISTSFDDIKAAYSIDDKTL 330
Query: 384 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
L R L VR +RP P +DDKV+ SWN L+IS+ A+A + E
Sbjct: 331 HSKLQ---SARHILLTVRQQRPAPLIDDKVLTSWNALMISALAKAGSVFHVE-------- 379
Query: 444 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 503
E + +A+ A SF+ HL Q RL +R G K GF++DYA +++ +
Sbjct: 380 --------EAIRMAKQAMSFLETHLV--QQERLMVRYREGDVKHLGFIEDYAHMLTAYMS 429
Query: 504 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 563
LYE WL A ELF D + GG+F + + ++++R KE +DGA PSGNS
Sbjct: 430 LYEATFDLDWLTKARAAAENMFELFWDEQIGGFFFSGSDAEALIVREKEVYDGAMPSGNS 489
Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCA--ADMLS-VP 619
++ L++L+ ++ RQ+ +L +F D++ + P A +LS
Sbjct: 490 TALQKLLKLSRMIG-------RQDWIETLEKMFSAFYVDVS-SYPSGHTAFLQGLLSQYA 541
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
++ ++++G K E +L A L K + D T E + + A A++
Sbjct: 542 VKREIIILGEKGDPQKEQLLQA------LQKRFMPFDLILTAETG---QELARLAPFAKD 592
Query: 680 NFSA-DKVVALVCQNFSCSPPVTD 702
+ D +C+N+SC P+T+
Sbjct: 593 YKTINDSTTVYICENYSCRQPITN 616
>gi|261200020|ref|XP_002626411.1| DUF255 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239594619|gb|EEQ77200.1| DUF255 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 823
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 234/621 (37%), Positives = 331/621 (53%), Gaps = 61/621 (9%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CHVME ESF VA +LN F+ IK+DREERPD+D+VYM YVQA G GGWPL+VFL+P
Sbjct: 66 CHVMEKESFMSPEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTP 125
Query: 85 DLKPLMGGTYFPPEDKYGRPG--------FKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
DL+P+ GGTY+P P F IL K++D W ++ +S +QL
Sbjct: 126 DLEPVFGGTYWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQTQQLRCRESAKDITKQLR 185
Query: 137 E-ALSASASSNKLPDELPQNALRLCA---EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
E A + S K D + L + + +D GGF APKF P + ++
Sbjct: 186 EFAEEGTHSKQKAADADEDLEVELLEESYQHFASRFDPVNGGFSRAPKFATPANLSFLIN 245
Query: 193 HSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
S+ + D E + +M TL M++GGIHD +G GF RYSV W +PHFEK
Sbjct: 246 LSRYPSAVSDIVGYDECARALEMATKTLIYMSRGGIHDQIGHGFARYSVTADWSLPHFEK 305
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEG 308
MLYDQ QL NVY+DAF + DI Y+ ++ P G +S+EDADS T
Sbjct: 306 MLYDQAQLLNVYVDAFDSAHNPELLGAIYDIATYITSPPILSPTGGFYSSEDADSLPTPS 365
Query: 309 ATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
T K+EGAFYVWT KE + ILG+ A + H+ + P GN ++R +DPH+EF +NVL
Sbjct: 366 DTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGN--VARGNDPHDEFINQNVLS 423
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFA 426
+ A + G+ E+ + I+ R KL + R SKR RP LDDK+IVSWNGL I + A
Sbjct: 424 IKVTPAKLAKEFGLSEEEVVKIIKASREKLREYRESKRVRPGLDDKIIVSWNGLAIGALA 483
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-S 485
+ S +L++ V + +E+ AE+AA FIR++L+D + +L +R+G
Sbjct: 484 KCSVVLEN----------VDRAKAQEFRLAAENAAKFIRQNLFDPASGQLWRIYRDGERG 533
Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-------------- 531
PGF DDY++L SGL+DLYE +L +A +LQ + FL +
Sbjct: 534 DTPGFADDYSYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYFLAQGPTPTPSPRTSTTT 593
Query: 532 -------EGGGYFNT------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
GY+ T P+ L R+K D + PS N V NL+RL++++
Sbjct: 594 ESTPAPSSSTGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSPNGVIAQNLLRLSTLL-- 651
Query: 579 SKSDYYRQNAEHSLAVFETRL 599
+ D Y++ A ++ F +
Sbjct: 652 -EDDTYKRLARETVNAFAVEI 671
>gi|15805870|ref|NP_294568.1| hypothetical protein DR_0844 [Deinococcus radiodurans R1]
gi|6458560|gb|AAF10421.1|AE001938_7 conserved hypothetical protein [Deinococcus radiodurans R1]
Length = 690
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 241/688 (35%), Positives = 338/688 (49%), Gaps = 67/688 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFE+E A +N FV+IKVDREERPDVD VYM QAL G GGWP++
Sbjct: 62 STCHWCHVMAHESFENERTAAFMNAHFVNIKVDREERPDVDAVYMAATQALTGQGGWPMT 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPP++ G P F +L + D W +RD + + L+E +
Sbjct: 122 VFLTPDAEPFYAGTYFPPQEGMGMPSFMRVLASIDDVWQNRRDQALGNA----QALTEHV 177
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
++ + ELP AL E ++ YD++FGGFG APKFP P + +L
Sbjct: 178 RGASQPTRREGELPGGALARAVENAARLYDAQFGGFGRAPKFPAPSTLDFLLTQ------ 231
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+G++M L TL+ M GGI+D +GGGFHRYSVD +W VPHFEKMLYD QL
Sbjct: 232 -------PQGREMALHTLRMMGAGGIYDQLGGGFHRYSVDAQWLVPHFEKMLYDNAQLVR 284
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
L A+ LT + ++ + R+ L YL R+M+ P G +SA+DAD+ G EG +
Sbjct: 285 TLLRAYQLTGEDDFARLARETLAYLEREMLAPDGGFYSAQDADTPTEHGGV---EGLTFT 341
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKG-KNVLIELNDSSASASK 378
WT E+ +LGE A L + + GN DPH G +NVL A A +
Sbjct: 342 WTPDEIRAVLGEDADLALRSFNVTAQGN-----FRDPHQPAYGSRNVLHTPTPLPALARE 396
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG + L R KLF R RP+PH DDKV+ SWNGLV+++ A A++IL E
Sbjct: 397 LG---DDAAQRLQAARAKLFAARQVRPQPHTDDKVLTSWNGLVLAALADAARILGEE--- 450
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+Y+++A A F+ R L L+H+F++G + G L+D+A
Sbjct: 451 -------------KYLDLARRNADFVHREL-RLPGGTLRHTFKDGRASVEGLLEDHALYG 496
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL+ L++ G L WA EL N F D G ++++ G ++L R D A
Sbjct: 497 LGLVALFQAGGDLAHLHWARELWNIVRRDFWDEGAGVFYSSGGHAETLLTRQASFFDSAI 556
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
S N+ + + V + ++++ A ++ F L + + A L
Sbjct: 557 LSDNAAAALLGVWMNRYFGDAEAEAI---ARRTVQSFHAELLAAPTGLGGLWQVAAFLEA 613
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P + V+ E LA + + PAD E AR
Sbjct: 614 PHTEIAVIGTPAERQPLERELAWHFLPF------TALAPAD--------EGGDLPVLEAR 659
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISL 706
A VC N +C P DP L
Sbjct: 660 PGGGQ----AYVCVNHACQLPTRDPAEL 683
>gi|448310353|ref|ZP_21500197.1| hypothetical protein C493_01015 [Natronolimnobius innermongolicus
JCM 12255]
gi|445608208|gb|ELY62067.1| hypothetical protein C493_01015 [Natronolimnobius innermongolicus
JCM 12255]
Length = 729
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 236/694 (34%), Positives = 356/694 (51%), Gaps = 59/694 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA +LN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEEESFADEAVADVLNEHFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
+L+P+ KP GTYFP E+K G+PGF + R++ D+W D Q A +
Sbjct: 113 AWLTPEGKPFFVGTYFPKEEKRGQPGFLDLCRRISDSWSSPEDRPEMENRAEQWTDAAKD 172
Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
+L E + A + E+ L A+ +S D + GGFGS PKFP+P ++++
Sbjct: 173 RLEETPDSVAGAEPPTSEV----LTAAADAAVRSADHQHGGFGSGGPKFPQPSRLRVL-- 226
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
++ + TG+ E + ++ +L MA GG++DHVGGGFHRY VD W VPHFEKMLY
Sbjct: 227 -ARAYDRTGE----GEYRAVLEESLDAMAAGGLYDHVGGGFHRYCVDADWTVPHFEKMLY 281
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D ++ +L + LT D Y+ + + L+++ R++ GG FS DA S + E R
Sbjct: 282 DNAEIPRAFLAGYQLTGDERYAEVVAETLEFVDRELTHEGGGFFSTLDAQSEDPETGER- 340
Query: 313 KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EGAF+VWT E+ DIL + A LF E Y + +GN F+G+N +
Sbjct: 341 EEGAFFVWTPDEIRDILDDETTAELFCERYDVTESGN------------FEGQNQPNRVR 388
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
+ A + ++ L + R ++F+ R +RPRP+ D+KV+ SWNGL+I++ A A+
Sbjct: 389 SIDSLAEAYDLAEDELRERLEDAREQVFEAREERPRPNRDEKVLASWNGLMIATCAEAAL 448
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+L +A Y E+ A F+R L+D RL+ +++G G+
Sbjct: 449 VLGEDA----------------YAEMGVDALEFVRDRLWDADEGRLRRRYKDGDVAIQGY 492
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYAFL G L YE L +A+EL + + F D + G + T S++ R
Sbjct: 493 LEDYAFLARGALGCYEATGDVDHLAFALELARSIEAEFWDADAGTLYFTPESGESLVTRP 552
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
+E D + PS V+V L+ L G D A L ++ A+ +C
Sbjct: 553 QELDDQSTPSATGVAVETLLAL----DGFADDDLESIAVGVLRTHANEIQTNALQHASLC 608
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
AAD L + + + + + ++ + +A A Y ++ + P + ++ E N
Sbjct: 609 LAADRLEAGALE-ITVAADELPDEWRDRVADA---YRPDRLIARRPPTEDGLEEWLEALN 664
Query: 671 --SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
A A + VC+N +CSPP D
Sbjct: 665 LAEPPAIWAGREARDGEPTLYVCRNRTCSPPTHD 698
>gi|448305439|ref|ZP_21495370.1| hypothetical protein C495_14092 [Natronorubrum sulfidifaciens JCM
14089]
gi|445588825|gb|ELY43066.1| hypothetical protein C495_14092 [Natronorubrum sulfidifaciens JCM
14089]
Length = 727
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 233/704 (33%), Positives = 350/704 (49%), Gaps = 49/704 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF D+ VA+LLN+ FV IKVDREERPDVD +YMT Q + GGWPLS
Sbjct: 53 SACHWCHVMEDESFADDEVAELLNENFVPIKVDREERPDVDSIYMTVCQLVTSRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+L+P+ KP GTYFP E K G+PGF IL ++ + W+ R+ + + ++ L
Sbjct: 113 AWLTPEGKPFHIGTYFPKESKRGQPGFLDILERLAETWETDREEVENRAQQWTDAATDQL 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
+ + + + L A+ +S D ++GGFGS PKFP+P ++++ ++ +
Sbjct: 173 EETPDTVAAAEPPSSDVLETAADTALRSADRQYGGFGSGGPKFPQPSRLRVL---ARAFD 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ SE +++ +L M GG++DHVGGGFHRY VD W VPHFEKMLYD ++
Sbjct: 230 RTGQ----SEYLEVLEESLDAMIDGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIP 285
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
L + LT + Y+ + L ++ R++ G FS DA S + E R +EGAF+
Sbjct: 286 RALLAGYQLTGEERYAETVAETLAFVDRELTHDDGGFFSTLDAQSKDPETGER-EEGAFF 344
Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT +EV ++L + A LF E Y + +GN F+G+N + S+ A
Sbjct: 345 VWTPEEVSEVLEDQTTAELFCERYDITESGN------------FEGQNQPNRVQSISSLA 392
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ ++ L R +LF+ R +RPRP+ D+KV+ SWNGL+I+++A A+ +L
Sbjct: 393 EAFDLEEQEVETRLEAARERLFEAREQRPRPNRDEKVLASWNGLMIATYAEAALVL---- 448
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
G D EY E A A F+R L+D RL +++G G+L+DYAF
Sbjct: 449 ----------GDD--EYAETAVDALEFVRDRLWDADEKRLSRRYKDGDVAVDGYLEDYAF 496
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L + YE L +A+EL T + F D E G + T S++ R +E +D
Sbjct: 497 LARAAVGCYEATGEVDHLAFALELARTIEAEFWDAEAGTLYFTPESGESLVTRPQELNDQ 556
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
+ PS V+V L+ L S+ + A L R++ + +C AAD L
Sbjct: 557 STPSAAGVAVETLLALDRFAVDSEE--FEAIASTVLETHANRIEANPLQHASLCLAADRL 614
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNN 673
+ + V + H + + P + ++ W E
Sbjct: 615 ESGALEITVAADELPDAWRDRFAETYHPD-----RLFALRPPTDDGLEAWLEQLGLADAP 669
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSST 717
A A + VC+ +CSPP D L E S+T
Sbjct: 670 AIWAGREARDGEPTLYVCRGRTCSPPTNDVEDALEWLGENTSAT 713
>gi|308513297|ref|NP_952224.2| thioredoxin domain-containing protein YyaL [Geobacter
sulfurreducens PCA]
gi|409911713|ref|YP_006890178.1| thioredoxin domain-containing protein YyaL [Geobacter
sulfurreducens KN400]
gi|41152670|gb|AAR34547.2| thioredoxin domain protein YyaL [Geobacter sulfurreducens PCA]
gi|298505285|gb|ADI84008.1| thioredoxin domain protein YyaL [Geobacter sulfurreducens KN400]
Length = 710
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 240/695 (34%), Positives = 346/695 (49%), Gaps = 79/695 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESF+D+ VA +LN +V +KVDREERPD+D +M Q + G GGWPL++
Sbjct: 79 TCHWCHVMAAESFDDDEVAAVLNREYVPVKVDREERPDIDDTFMRVAQMMNGSGGWPLTI 138
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD +P TY P + G PG +L K+ + W ++RD++ Q+ + ++ LS S
Sbjct: 139 IMTPDRQPFFAATYIPRRSRGGMPGLIDLLEKIAEVWRQRRDVVRQNCSAIMDALSRFNS 198
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
++ + DE P + R +QL+ YD FGGFG APKFP + + +L + ++ D
Sbjct: 199 VRPAAAE--DEAPLHGAR---QQLADIYDKEFGGFGGAPKFPMAMNLSFLLRYGQRYGD- 252
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
E M TL MA+GGI DH+GGGFHRY+VD RW VPHFEKMLYDQ
Sbjct: 253 ------GEAVAMATDTLTAMAQGGIWDHLGGGFHRYTVDGRWLVPHFEKMLYDQALCTLA 306
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
++A +T + + + ++ ++ R++ P G +SA DADS EG +EGA Y+W
Sbjct: 307 LVEAAQVTGNSVFRELAKETCGFVLRELSAPAGGFYSALDADS---EG----REGACYLW 359
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
T +V DILG LF Y + GN F+G NVL A A
Sbjct: 360 TPAQVRDILGVADGELFCRLYAVTAWGN------------FEGANVLHLPLAPDAFARDE 407
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ + + + L + R +RPRP D+K+I WNGL+I++ AR I E
Sbjct: 408 GVDPLRLQEKIAQWHILLLEARERRPRPFRDEKIITGWNGLMIAALARTFLICGDEL--- 464
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDDYAFL 497
+E AE A +RR D +T RL S G + PGFL+DYAF
Sbjct: 465 -------------LLEGAERA---VRRVCIDLRTPAGRLVRSCHRGEASGPGFLEDYAFF 508
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I GLL+L+E + L A L + LF D GGG F+T + ++L+R K DGA
Sbjct: 509 IRGLLELHEATLDPRHLALARSLAHDMLRLFGD-SGGGLFDTGSDAETILVRGKGALDGA 567
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL--AVFETRLKDMAMAVPLMCCAADM 615
PSGN+++ L+RL I D + A + A + A + L+C ++
Sbjct: 568 IPSGNAMAASVLIRLGRIT----GDGVFEEAGRGIIRAFLAGAARQPAAHIHLLCALGEL 623
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L+ P FE ++AAA + + + + + + E + A
Sbjct: 624 LADP---------------FEVVIAAATRPHAVRELLCILGGRLIPGLVLMEREENAPAR 668
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S +A VC C PPVT P LE +L
Sbjct: 669 EGGGGGS----IARVCAGRVCLPPVTAPEGLEEIL 699
>gi|448307474|ref|ZP_21497369.1| hypothetical protein C494_07045 [Natronorubrum bangense JCM 10635]
gi|445595646|gb|ELY49750.1| hypothetical protein C494_07045 [Natronorubrum bangense JCM 10635]
Length = 727
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 229/689 (33%), Positives = 349/689 (50%), Gaps = 49/689 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT Q + GGWPLS
Sbjct: 53 SACHWCHVMESESFADEEVAEMLNENFVPIKVDREERPDVDSIYMTVCQLVTSRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+L+P+ KP GTYFP E K G+PGF IL ++ + W+ RD + + ++ L
Sbjct: 113 AWLTPEGKPFHIGTYFPKESKRGQPGFLDILERLAETWETDRDEVENRAQQWTDAATDQL 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
+ + + +AL A+ +S D ++GGFGS PKFP+P ++++ ++ +
Sbjct: 173 EETPDTVAAAEPPSSDALEAAADTAVRSADRQYGGFGSGGPKFPQPSRLRVL---ARAFD 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ E +++ +L M GG++DHVGGGFHRY VD W VPHFEKMLYD ++
Sbjct: 230 RTGR----EEYLEVLEESLDAMIDGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIP 285
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
L + LT + Y+ + L+++ R++ G FS DA S ++E R +EGAF+
Sbjct: 286 RALLAGYQLTDEERYAETVAETLEFVERELTHDEGGFFSTLDAQSEDSETGER-EEGAFF 344
Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT +EV ++L + A LF Y + +GN F+G+N + S+ A
Sbjct: 345 VWTPEEVSEVLADETDADLFCARYDITESGN------------FEGQNQPNRVQSISSLA 392
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ + L R +LF+ R +RPRP+ D+KV+ SWNGL+I+++A A+ +L
Sbjct: 393 GEFDLEESDVETRLEAARERLFEAREQRPRPNRDEKVLASWNGLMIATYAEAALVL---- 448
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
G D EY E A A F+R L+D RL +++G G+L+DYAF
Sbjct: 449 ----------GDD--EYAETAVDALEFVRDRLWDADEKRLSRRYKDGDVAVDGYLEDYAF 496
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L + YE L +A+EL + + F D E G + T S++ R +E +D
Sbjct: 497 LARAAVGCYEATGEVDHLAFALELARSIEAEFWDAEAGTLYFTPESGESLVTRPQELNDQ 556
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
PS V+V L+ L S++ + A L R++ + +C AAD L
Sbjct: 557 PTPSAAGVAVETLLALDGFAGDSEA--FEAIASTVLETHANRIEANPLQHASLCLAADRL 614
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNN 673
+ + V + + A +Y ++ P + + ++ W E
Sbjct: 615 ESGALEITVAADELPDA-WRDRFA---ETYRPDRLFARRPPTE-DGLEAWLEQLGLADAP 669
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
A A + VC+ +CSPP D
Sbjct: 670 AIWAGREARDGEPTLYVCRGRTCSPPTRD 698
>gi|115372663|ref|ZP_01459970.1| thymidylate kinase [Stigmatella aurantiaca DW4/3-1]
gi|310823874|ref|YP_003956232.1| hypothetical protein STAUR_6648 [Stigmatella aurantiaca DW4/3-1]
gi|115370384|gb|EAU69312.1| thymidylate kinase [Stigmatella aurantiaca DW4/3-1]
gi|309396946|gb|ADO74405.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length = 694
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 235/701 (33%), Positives = 345/701 (49%), Gaps = 69/701 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED +A ++N F++IKVDREERPD+D++Y VQ + GGGWPL+
Sbjct: 57 SACHWCHVMAHESFEDPAIASVMNAHFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDL+P GGTYFPP+DKYGRPGF +L + DAW +R+ + A E L E
Sbjct: 117 VFLTPDLRPFYGGTYFPPQDKYGRPGFPKVLESLHDAWMNQREKVLGQAADFREGLGEL- 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A+ P L + E++ + D GGFG APKFP P+ + +L ++
Sbjct: 176 -ATYGLEAAPAALSVEDVLKMGERMLRHVDPVNGGFGGAPKFPNPMNVSFLLRAWRR--- 231
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G + L TL+ MA GG++D +GGGFHRY+VD+RW VPHFEKMLYD QL +
Sbjct: 232 ----GGPEPLKDAALRTLERMALGGVYDQLGGGFHRYAVDDRWRVPHFEKMLYDNAQLLH 287
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y + + + + + +Y+RR+M G ++A+DADS EG +EG F+V
Sbjct: 288 LYAEGEQVESRPLWRKVVEETAEYVRREMTDARGGFYAAQDADS---EG----EEGRFFV 340
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +V +L EHA L H+ + P GN + +G VL + A +
Sbjct: 341 WTPAQVCSVLTPEHANLLLRHFRITPQGNFE-----------QGATVLEVAVPVAQIAHE 389
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+ E L R LF +R +R +P DDK++ WNGL+I A AS++
Sbjct: 390 RGLSQEALERTLTAAREALFGIREQRVKPGRDDKILSGWNGLMIRGLAFASRVF------ 443
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
R E+ ++A +A F+ H++D RL S+ G + GFL+DY
Sbjct: 444 ----------GRPEWAQLAAGSADFVLTHMWD--GTRLSRSYEEGGGRIDGFLEDYGDFA 491
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL LY+ K+L A L LF D E Y + +++ D A
Sbjct: 492 VGLTALYQATFEAKYLEAASALVKRAVALFWDEEKQAYLSAPKGQKDLVVATYSLFDNAF 551
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSG S V LA++ G KS + + E L+ L+D + + AAD +
Sbjct: 552 PSGASTLTEAQVALAALT-GDKS--HLELPERYLSRMRKALEDNPLGYGHLALAADTF-L 607
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
+ G + V +L A ++ V W+E + ++ +
Sbjct: 608 DGGAGITFAGTREQV--APLLEVAQRAFAPTFAV------------GWKEAGAPVPAVLK 653
Query: 679 NNFSADKVV-----ALVCQNFSCSPPVTDPISLENLLLEKP 714
F + V A VC+ F+C P+T+P L+ L +P
Sbjct: 654 ELFEGREPVEGKGAAYVCRGFACERPLTNPEQLKARLGARP 694
>gi|407772664|ref|ZP_11119966.1| hypothetical protein TH2_02165 [Thalassospira profundimaris WP0211]
gi|407284617|gb|EKF10133.1| hypothetical protein TH2_02165 [Thalassospira profundimaris WP0211]
Length = 679
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 236/701 (33%), Positives = 358/701 (51%), Gaps = 76/701 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDEG+A L+N+ F++IK+DREERPD+D +Y + L GGWPL++
Sbjct: 52 ACHWCHVMAHESFEDEGIAALMNELFINIKLDREERPDLDALYQNALALLGQQGGWPLTM 111
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GGTYFP E +YGRPGF +L+ V + +K D + + + Q+S AL
Sbjct: 112 FLTPDGEPFWGGTYFPKEARYGRPGFGDVLKTVAKIYAEKPDDVRHN----VSQISNALI 167
Query: 141 A--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
SA+ +P + C + D GG APKFP+P + + + +
Sbjct: 168 KMNSAAVGAVPS---LEMIDRCGHGCLQIMDGENGGTSGAPKFPQPSLLSYIWRTGVRTD 224
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D G +++V +L M +GGI+DH+GGG RY+VD++W VPHFEKMLYD QL
Sbjct: 225 DDGL-------KRIVKHSLDRMCQGGIYDHLGGGLARYAVDDQWLVPHFEKMLYDNAQLI 277
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++ D + + + Y+ + + ++ R+M PGG ++ DADS EG EG FY
Sbjct: 278 DLLCDVWRVDPNPLYAKRVEETIGWILREMRIPGGAFTASLDADS---EGV----EGKFY 330
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VW+ E++ ILG +A LFK+ Y + GN ++G +L + +AS
Sbjct: 331 VWSEDEIDQILGANADLFKKFYDVSKDGN------------WEGHTIL------NRTASG 372
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
L + + L E R KL R+KR RP DDK + WN + I++FA A+
Sbjct: 373 LELADDATEEKLAELRAKLLAERAKRIRPGWDDKALTDWNAMTIAAFAEAAMTFH----- 427
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
R ++++ A+ A F+ L + R HS+R+G + G L+DYA +I
Sbjct: 428 -----------RADWLDYAKLAYGFVINTLM--KGDRFLHSYRDGRVQHAGMLEDYAHMI 474
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L LYE +L AI + LF D + GGYF + + +++R K D A
Sbjct: 475 RAALRLYECFGEDAYLNEAIRWSAAVETLFADAK-GGYFQSASDASDLVVRQKPFMDNAV 533
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN++ NL +L ++ ++ YR AE +LA F R+ + +P + AA+ML
Sbjct: 534 PSGNAIMAQNLAKLYALTGDTQ---YRDQAEITLAAFGGRIGEQFPNMPGLMMAAEMLQN 590
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P + +VL+ S + +M A +Y N+ + + D + A+
Sbjct: 591 PVQ--IVLIAKDRSQTYLDMRRAIFGAYLPNRAITILSDGDPLP----------DGHPAQ 638
Query: 679 NNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
+ D K A +CQ CS PVT L +L + P+ A
Sbjct: 639 GKTAIDGKETAYICQGPVCSAPVTGVEELTEMLADLPAKAA 679
>gi|359728137|ref|ZP_09266833.1| hypothetical protein Lwei2_14957 [Leptospira weilii str.
2006001855]
Length = 724
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 257/718 (35%), Positives = 368/718 (51%), Gaps = 82/718 (11%)
Query: 10 TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
TK R + LI TCHWCHVME ESFE++ VA LN FVSIKVDREERPD+D++Y
Sbjct: 71 TKAREQNKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 130
Query: 64 MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
M + A+ GGWPL++FL+PD KP+ GGTYFPPE +YGR F IL ++ W++KR
Sbjct: 131 MDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPRYGRKSFLEILNILRKVWNEKR-- 188
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDE---LP-QNALRLCAEQLSKSYDSRFGGFGS-- 177
Q A +LS L S + + LP +N YD+ FGGF +
Sbjct: 189 --QELIVASSELSRYLKDSGEGRAIEKQEGSLPSENCFDSGFSLYESYYDAEFGGFKTNH 246
Query: 178 APKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
KFP + + +L YHS SG +MV TL M +GGI+D +GGG R
Sbjct: 247 VNKFPPSMGLSFLLRYYHS--------SGNP-RALEMVENTLLAMKQGGIYDQIGGGLCR 297
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
YS D W VPHFEKMLYD ++ ++K + D++ YL RDM GG I
Sbjct: 298 YSTDHHWMVPHFEKMLYDNSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGI 357
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSD 355
SAEDADS EG +EG FY+W +E ++ GE + + ++ + + GN
Sbjct: 358 CSAEDADS---EG----EEGLFYIWDFEEFREVCGEDSQILEKFWNVTKKGN-------- 402
Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVI 414
F+GKN+L E + A+K K ++ +L R KL + RSKR RP DDK++
Sbjct: 403 ----FEGKNILHE--SYRSEATKFSEEEWKRIDSVLERGRAKLLERRSKRVRPLRDDKIL 456
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
SWNGL I + A+A V R++++++AE SFI ++L D
Sbjct: 457 TSWNGLYIKALAKAG----------------VAFQREDFLKLAEETYSFIEKNLIDPNG- 499
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
R+ FR+G S G+ +DYA +IS + L+E G G ++L A+ +D + L R
Sbjct: 500 RILRRFRDGESGILGYSNDYAEMISSSIALFEAGCGIRYLKNAVLWM--EDAIRLFRSPA 557
Query: 535 GYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
G F TG D VLLR D +DG EPS NS +LV+L+ + G S Y + AE
Sbjct: 558 GVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSARYGEFAESIFL 615
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF-ENMLAAAHASYDLNKTV 652
F L +++ P + A S K +VL+ + DF +++LAA + + +
Sbjct: 616 YFTKELSTNSLSYPHLLSAYWTYRRHS-KEIVLI--RKDTDFGKDLLAAIQTRFLPDSVL 672
Query: 653 IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ + EE +++ + S + VC+NFSC PV+D L+ +
Sbjct: 673 AVVNENELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPVSDLADLKKWI 723
>gi|322371783|ref|ZP_08046326.1| hypothetical protein ZOD2009_19818 [Haladaptatus paucihalophilus
DX253]
gi|320548668|gb|EFW90339.1| hypothetical protein ZOD2009_19818 [Haladaptatus paucihalophilus
DX253]
Length = 713
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 238/700 (34%), Positives = 348/700 (49%), Gaps = 70/700 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE VA+LLN+ FV IKVDREERPD+D +YM+ Q + GGGGWPLS
Sbjct: 53 SACHWCHVMEEESFEDEDVAELLNEHFVPIKVDREERPDIDAIYMSICQQVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+L+PD KP GTYFP + GRPGF +L VK+ W + + + G EQ ++A+
Sbjct: 113 AWLTPDGKPFYVGTYFPKRSQQGRPGFIDLLENVKNTWQENPEEMKNRG----EQWTDAI 168
Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKL 197
S D+ P L AEQ ++ D +GGFG PKFP+P + ++L +
Sbjct: 169 EGELESTPEADDAPGPELLGSAAEQTVRTADREYGGFGRGGPKFPQPARLHLLL---RAY 225
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ TG A++ + + + L MA GG++DH+GGGFHRY+ D +W VPHFEKMLYD +L
Sbjct: 226 DRTG----ATQYRDVAVEALDAMADGGMYDHIGGGFHRYATDRKWTVPHFEKMLYDNAEL 281
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
YL + LT D Y+ + R+ L R+M P G +S DA S + G +EG F
Sbjct: 282 PRAYLAGYQLTGDERYAELVRETFASLEREMRHPEGGFYSTLDARSEDEAG--NYEEGPF 339
Query: 318 YVWTSKEV---------EDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
YVWT +V +DI E A + E Y + +GN F+GK VL
Sbjct: 340 YVWTPSDVYEAVEDERDDDIDTETRADIVCERYGVTQSGN------------FEGKTVLT 387
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
D A K + ++ ++L + R +F+ R +R RP D+K++ WNGL+I++ A
Sbjct: 388 LTTDVPDLAEKYDVSEDEVRDVLADARHSMFEAREERERPPRDEKILAGWNGLLIAALAE 447
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
+L + Y ++A A F+R L+DE +L F++
Sbjct: 448 GGFVLD-----------------EHYTDLAADALDFVREKLWDEADAKLSRRFKDEDVAI 490
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
G+L+DYAFL G LYE L +A++L + F D E + T ++
Sbjct: 491 DGYLEDYAFLARGAFALYESTGNPDHLEFALDLARAIEREFWDAERETLYFTPESGERLV 550
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM---AM 604
R +E D + PS V+ L L+ AE AV ET + +
Sbjct: 551 ARPQELADQSTPSSLGVATDVLAVLSEFAPDEAF------AEIPEAVLETHARTVESNPF 604
Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
+ AAD + S + + + G + + + LA + L V+ P + +
Sbjct: 605 QYATLVLAADRNATGSLE-LTVAGDELPEAWHDQLAETY----LPMRVLTRRPPTEDGVA 659
Query: 665 FWEEH--NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
W E N + + SA + VC++F+CSPPVTD
Sbjct: 660 AWCEKLGVENVPPIWADRESAGEPTLYVCRSFTCSPPVTD 699
>gi|320102044|ref|YP_004177635.1| hypothetical protein Isop_0491 [Isosphaera pallida ATCC 43644]
gi|319749326|gb|ADV61086.1| protein of unknown function DUF255 [Isosphaera pallida ATCC 43644]
Length = 723
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 241/739 (32%), Positives = 365/739 (49%), Gaps = 80/739 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL + CHWCHVME ESFE +A L+N WFV+IKVDREERPD
Sbjct: 40 GEEAFAKAKAENKPIFLSVGYSACHWCHVMERESFESPTIAALMNQWFVNIKVDREERPD 99
Query: 59 VDKVYMTYVQAL-YGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
+D++YM VQAL G GGWP+SVF++P+ +P GGTY+PP D G PGF IL + AW
Sbjct: 100 IDQIYMAAVQALNQGHGGWPMSVFMTPEGEPFFGGTYYPPHDARGMPGFPRILEGLATAW 159
Query: 118 DKKRDMLAQSGAFAIEQLSE------------ALSASASSNKLPDELPQNALRLCAEQLS 165
++ + ++ A +E L + AL A+ ++ D L + A L
Sbjct: 160 REREPEVREAAARLVEHLRKRNEPMPPLIKGPALDHPAADDR--DGLDPGWIAEAARALG 217
Query: 166 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGI 225
+ +DSR+GGFGSAPKFP P++++++L H ++++D MV+ TL M++GGI
Sbjct: 218 RVFDSRYGGFGSAPKFPHPMDLKLLLRHHQRVQD-------PRALAMVIQTLDHMSRGGI 270
Query: 226 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLR 285
+DH+GGGF RY+ DERW VPHFEKMLYD L + + D + + + LDYL
Sbjct: 271 YDHLGGGFARYATDERWLVPHFEKMLYDNALLISALAETIQCRPDPTLARVVVETLDYLA 330
Query: 286 RDMIGP--GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYL 342
M GP F+ EDADS EG EG +YVW+ E+ + LGE LF E Y +
Sbjct: 331 ERMTGPPEAPGFFATEDADS---EGV----EGKYYVWSRDEMLETLGEPLGSLFAEVYDV 383
Query: 343 KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS 402
GN ++G ++L A +LG P ++ L + R L R
Sbjct: 384 TEAGN------------WEGHSILNLPEPLDRVAQRLGRPTDQLAAELAQARALLKARRD 431
Query: 403 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 462
+R P D K++ SWNGL++++ A A+ ++ DR +++E AE AA
Sbjct: 432 RRIPPGKDTKILTSWNGLMLAAIAEAAWVV----------------DRPDHLERAEKAAG 475
Query: 463 FIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN 522
F+ HL + RL H F++G ++ G+L+DYA+LI GL L + T+W+ A +L
Sbjct: 476 FLLDHLR-QPDGRLFHVFKDGRARFNGYLEDYAYLIDGLTRLGQVTGTTRWIREARDLSR 534
Query: 523 TQDELFLDR--EGGGYFNTTG-EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 579
E F D +G G F TG +++ R ++ D A PS +++V L+RLA++ +
Sbjct: 535 LMIEEFGDEVIDGVGGFAFTGVRHETLVARPRDLFDNATPSAAAMAVTALLRLAAL---T 591
Query: 580 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD-FENM 638
R L +K A A D +V+ G D +
Sbjct: 592 DDQALRGRGLAGLRALAPLMKHAPTAAAQSLIALDFALRDPEIALVVPGQLDPSDTLAQV 651
Query: 639 LAAAHASYDLNKTVI--HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSC 696
L H + + ++ +DP ++ + + D V +C+ +C
Sbjct: 652 LRLLHRDFQPGRLLLVRSLDPPHPHDLHLL-------PPLQGRDHPHDHVTLYLCRGQTC 704
Query: 697 SPPVTDPISLENLLLEKPS 715
P+ ++ L P+
Sbjct: 705 QAPLVGVEAIAQALTSPPT 723
>gi|188475827|gb|ACD50089.1| hypothetical protein [uncultured crenarchaeote MCG]
Length = 684
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 248/710 (34%), Positives = 365/710 (51%), Gaps = 94/710 (13%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A +LN+ FV +KVDREERPD+D +YM AL G GGWP+SV
Sbjct: 49 ACHWCHVMAHESFEDELTASILNENFVCVKVDREERPDLDAIYMRATVALSGSGGWPMSV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PDL+P GTYFPP +Y PGF +LR + AW ++ I ++ +
Sbjct: 109 FLTPDLRPFYAGTYFPPARRYNLPGFPELLRALAQAWGTRQQ--------EIHAVAARVD 160
Query: 141 ASASSNKLPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S S+ LP L Q L L + D + GG+G+APKFP+P+ I+++L L
Sbjct: 161 QSLSTPDLPSHLGVVSQQLLEQAESWLVRHADRQHGGWGAAPKFPQPMAIELLL-----L 215
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ G ++G + +LQ MA+GG++D +GGGF RYS D WHVPHFEKMLYD QL
Sbjct: 216 QAAADPGAHADGLAVATQSLQAMARGGMYDVLGGGFSRYSTDTTWHVPHFEKMLYDNAQL 275
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A YL AF +T + + + + LD++ R+M P G +S+ DADS EG +EG +
Sbjct: 276 ALAYLHAFLVTGETSFRQVAAETLDFVAREMTHPEGGFYSSLDADS---EG----REGKY 328
Query: 318 YVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
YVWT E+ +++G+ ++ LF Y G S +G+ +L + +
Sbjct: 329 YVWTQAEIREVIGDPSMTELFLAAY---DAGTAPAS---------QGEIILQRAPNDANL 376
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
+++ + +L R +LF R RPRP LDDKVIV+WNGL++ +FA+A++
Sbjct: 377 SARFDKSASEIEELLQRARARLFRARQARPRPGLDDKVIVAWNGLMLQAFAQAARC---- 432
Query: 436 AESAMFNFPVVGSDRKE-YMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGFLDD 493
F GS + Y+EVA A+F+ +L + Q HR+ +R G + FL+D
Sbjct: 433 -------FGGAGSGTGDMYLEVATRNAAFLLGNLRNHGQLHRI---WRRGKTGQHVFLED 482
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG--GGYFNTTGEDPSVLLRVK 551
YA LI GLLDLY+ W + A +L DE+ L GG+F+T + L+R
Sbjct: 483 YAALILGLLDLYQADFSNAWFIAARQL---ADEMLLRFAAPDGGFFDTPDDSKPPLIRPM 539
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
E DGA P+G +++ L++LA++ + YR +AE +L + + ++
Sbjct: 540 ELQDGATPAGGALATEALLKLAALTGEAT---YRDHAERTLPLGLANAAESPLSYARWLA 596
Query: 612 AAD----------MLSVPSRKHVVLVGHKSSVDFEN-MLAAAHASYDLNKTVIHIDPADT 660
AA +L PS V +G +S + M+AA+ + D
Sbjct: 597 AAALALAGPRQLALLFPPSANPVAFLGVVNSAFRPHWMVAASPYPPPTGAPPLLQD---- 652
Query: 661 EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A+ A VC++F+C P+TDP L LL
Sbjct: 653 ------------------RPVVANLPTAFVCRDFACLRPITDPAELPALL 684
>gi|403380657|ref|ZP_10922714.1| hypothetical protein PJC66_12642 [Paenibacillus sp. JC66]
Length = 547
Score = 370 bits (950), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 226/594 (38%), Positives = 327/594 (55%), Gaps = 53/594 (8%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE VA LN ++++KVDREERPDVDK+YM+ QA+ G GGWPL+V ++PD K
Sbjct: 1 MAQESFEDEKVAAWLNAHYIAVKVDREERPDVDKLYMSVCQAMTGQGGWPLTVLMTPDKK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP +YG+PG I+ +V W ++R+ L E+++E + +
Sbjct: 61 PFFVGTYFPKTSQYGKPGVIDIVSQVHQKWTEQREELLDIA----EEIAETVR-NRQETA 115
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
L EL + L + E S+++DS++GGFG APKFP P ++ +L + K+ TG+
Sbjct: 116 LSGELSADMLDMAYELFSQAFDSQYGGFGDAPKFPSPHQLSFLLRYYKR---TGEQDALD 172
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
+K TL+ M +GG++DH+G GF R S DERW VPHFEKMLYD LA VYL+A+ +
Sbjct: 173 MAEK----TLEGMHRGGMYDHIGYGFARCSADERWLVPHFEKMLYDNALLAAVYLEAYEV 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T Y+ I I Y++RDM G FSAE + S EGA E FY+WT +EV
Sbjct: 229 TGKQEYAEIAEQIFAYVKRDMTSSEGFFFSAEGSHS---EGA----EEQFYLWTPEEVNA 281
Query: 328 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG-MPLEK 385
+LGE LF + + ++ G D G +V L + ++ ++L M +
Sbjct: 282 VLGEEDGELFCDVFDIQEDGPVD------------GYSVPNLLGLTRSTFARLQRMDPAE 329
Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
L R KLF R +R RPH DDK++ +WNGL+I + A+ +K+L+
Sbjct: 330 RERRLERSRVKLFQHRERRARPHKDDKMLTAWNGLMIMALAKGAKVLQ------------ 377
Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
+ E+ + A+ A FI + L E RL +R+G + P +LDDYAFL+ GL++LY
Sbjct: 378 ----KAEHADAAQKAVGFILQRLVREDG-RLLARYRDGDAAIPAYLDDYAFLVWGLIELY 432
Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
E T++L A+ LF D E GG++ + + +L R KE HDG PSGNS +
Sbjct: 433 EATRETEYLHQAVRFNQEMIRLFWDDESGGFYFSGIDGEKLLARSKEIHDGDMPSGNSAA 492
Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
+NL+RLAS+ +K Q A L F ++ + CA D + P
Sbjct: 493 AMNLLRLASLTEDTK---LLQLAHRQLRSFAAVVEQYPAGFSMYLCALDSILPP 543
>gi|448373972|ref|ZP_21557857.1| hypothetical protein C479_01326 [Halovivax asiaticus JCM 14624]
gi|445660649|gb|ELZ13444.1| hypothetical protein C479_01326 [Halovivax asiaticus JCM 14624]
Length = 760
Score = 369 bits (948), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 238/718 (33%), Positives = 348/718 (48%), Gaps = 65/718 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA +LN+ FV IKVDREERPDVD +YMT QA+ G GGWPLS
Sbjct: 53 SACHWCHVMEAESFADETVATVLNEGFVPIKVDREERPDVDSIYMTVCQAVTGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG----AFAIEQL 135
+L+PD +P GTYFP E + G PGF + R+++ +W + RD + A A ++L
Sbjct: 113 AWLTPDGRPFYVGTYFPREAQRGTPGFLELCRQIRVSWSENRDEIESRADEWTAMAADRL 172
Query: 136 SEALSASASSNKLP---------------DELPQNALRLCAEQLSKSYDSRFGGFG-SAP 179
A +A S+ P D +AL E ++ D GGFG P
Sbjct: 173 DSAAAAGNESSSTPAPISADTGSPIDGGLDADGPDALERVGEAALRASDDEHGGFGRGGP 232
Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
KFP+P ++ +L +L+ + + ++ L M GG++DHVGGGFHRY VD
Sbjct: 233 KFPQPRRVESLL----RLD---AAHDRPNARETATRALDAMCSGGLYDHVGGGFHRYCVD 285
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
E W VPHFEKMLYD + L + +T D Y+ R+ +D+L R++ P G +S
Sbjct: 286 EDWTVPHFEKMLYDNAAIPRALLAGYQVTGDDRYARTVRETVDFLERELRHPEGGFYSTL 345
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI------LFKEHYYLKPTGNCDLSRM 353
DA S ETE R +EGAFYVWT E+E + E + LF + + +GN
Sbjct: 346 DAQS-ETESGER-EEGAFYVWTPAEIESAVAEAGLSDESGALFCNRFGVTDSGN------ 397
Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
F+G VL A+ G+ + L R +F+ R+ RPRP D+K+
Sbjct: 398 ------FEGSTVLTVEASIEDLATDYGLAPSTVEDRLDAARTAVFEARATRPRPPRDEKI 451
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVG------SDRKEYMEVAESAASFIRRH 467
+ WNGL I A AS +L + A N G S Y ++A A +F+R +
Sbjct: 452 LAGWNGLAIDMLAEASIVLGTSGREAATNAASAGGASDGPSGDDRYAQLATDALAFVRTN 511
Query: 468 LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDEL 527
L+D+ T RL R+G G+L+DYAFL G L YE + L +A++L
Sbjct: 512 LWDDDTGRLARRVRDGDVGIDGYLEDYAFLARGALTCYEATGEVEPLAFALDLARAIRRD 571
Query: 528 FLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
F D + T S+L+R +E D + PS V+V L L A + + +
Sbjct: 572 FWDESAETLYFTPERGESLLVRPQELGDQSTPSPTGVAVEILAMLDPFTA----EPFGEM 627
Query: 588 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 647
A ++ T +++ + A D+++ V V +++E L +
Sbjct: 628 ARRVVSTHATEIEESPFEYVSLSLAQDLVTH-GPLEVTTVADGRPMEWERTLGRTY---- 682
Query: 648 LNKTVIHIDPADTEEMDFWEE---HNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
L + ++ PA + +D W + ++ A AD+ VC + CSPP D
Sbjct: 683 LPRRLLAPRPASSAMLDDWLDVIGLDTVPPIWADREQRADEPTVYVCADRVCSPPEHD 740
>gi|116327565|ref|YP_797285.1| hypothetical protein LBL_0795 [Leptospira borgpetersenii serovar
Hardjo-bovis str. L550]
gi|116120309|gb|ABJ78352.1| Conserved hypothetical protein containing a thioredoxin domain
[Leptospira borgpetersenii serovar Hardjo-bovis str.
L550]
Length = 692
Score = 369 bits (948), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 253/713 (35%), Positives = 361/713 (50%), Gaps = 71/713 (9%)
Query: 10 TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
TK R LI TCHWCHVME ESFE++ VA LN FVSIKVDREERPD+D++Y
Sbjct: 38 TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 97
Query: 64 MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
M + A+ GGWPL++FL+PD KP+ GGTYFPPE YGR F +L ++ W +KR
Sbjct: 98 MDALHAMDQQGGWPLNIFLTPDGKPIAGGTYFPPEPVYGRKSFLEVLNILRKVWSEKRQE 157
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGS--APK 180
L + + L ++ A + LP L +S YD+ FGGF + K
Sbjct: 158 LIVASSELSRYLKDSGEGRAIEKQEEGSLPSKDCFNSGFSLYESYYDAEFGGFRTNHVNK 217
Query: 181 FPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
FP + + +L YH S + +MV TL M +GGI+D VGGG RYS D
Sbjct: 218 FPPSMGLSFLLRYH--------HSSGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTD 269
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
RW VPHFEKMLYD ++ ++K + D++ YL RDM GG I SAE
Sbjct: 270 HRWMVPHFEKMLYDNSLFLETLVECSQVSKKISAESFALDVISYLHRDMRIVGGGICSAE 329
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
DADS EG +EG FY+W +E ++ GE + + ++ + + GN
Sbjct: 330 DADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTNKGN------------ 370
Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
F+GKN+L E A+KL K ++ +L R KL + RSKR RP DDK++ SWN
Sbjct: 371 FEGKNILHE--SYGGEATKLSEEEWKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWN 428
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL I + A+A + R++++++AE SFI R+L D R+
Sbjct: 429 GLYIKALAKAG----------------IAFQREDFLKLAEETYSFIERNLIDPDG-RILR 471
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
FR+ S G+ +DYA +IS + L+E G G ++L A+ LF R G F
Sbjct: 472 RFRDSESGILGYSNDYAEMISSSIVLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 529
Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
TG D VLLR D +DG EPS NS +LV+L+ + G S YR+ AE + F
Sbjct: 530 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTK 587
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
L +++ P + A S K +VL+ K + +++LAA + + ++
Sbjct: 588 ELSTHSLSYPHLLSAYWTYKYHS-KEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNE 645
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ EE + + + S + VC+NFSC PV++ L+ +
Sbjct: 646 NELEEA-------RKLSVLFDSRDSGGNALVYVCENFSCKLPVSNLADLQKWI 691
>gi|420158002|ref|ZP_14664826.1| PF03190 family protein [Clostridium sp. MSTE9]
gi|394755349|gb|EJF38596.1| PF03190 family protein [Clostridium sp. MSTE9]
Length = 685
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 244/706 (34%), Positives = 356/706 (50%), Gaps = 73/706 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G ++F + + FL +TCHWCHVM ESFED+ VA+ LN FV IKVDREERPD
Sbjct: 33 GEQAFEKAKREDKPIFLSIGYSTCHWCHVMAHESFEDDEVAEALNQGFVCIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYMT QA+ G GGWP+++ ++P+ +P GTY P + G +L +++ W
Sbjct: 93 IDAVYMTVCQAMTGSGGWPMTILMTPEQRPFWAGTYLPKMSTFRSTGLLELLAFIREQWS 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
R L +G L E S S K +L LR QLS SYDSR+GGFG A
Sbjct: 153 TNRQQLLNAGEEITNYLREQSGPSLGSAKPELDL----LRGAVAQLSASYDSRWGGFGGA 208
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L +S + + KS Q M +TL M +GG+ DH+GGGF RYS
Sbjct: 209 PKFPAPHNLLFLLRYS--VLEREKS-----AQSMAEYTLSQMFRGGLFDHIGGGFSRYST 261
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D +W VPHFEKMLYD LA YL+A+++T Y + + LDY+ R++ G +
Sbjct: 262 DVKWLVPHFEKMLYDNALLAYTYLEAYAVTGRPLYRSVAKRTLDYVLRELTDEQGGFYCG 321
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPH 357
+DADS +G EG +YV+T +EV+ +LG E LF + + GN
Sbjct: 322 QDADS---DGV----EGKYYVFTPQEVQGVLGKEDGELFCSRFGVTEAGN---------- 364
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
F+GK++ L+ S+ E+ +I C+R L++ R +R R H DDKV+ SW
Sbjct: 365 --FEGKSIPNLLDFSAYD--------EEDPHIAQLCQR-LYEYRLERTRLHRDDKVLTSW 413
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
N L+I++ A+A +L D EY++ A+ A F+ L DE+ RL
Sbjct: 414 NALMIAALAKAGWLL----------------DEPEYLQAAQKAQRFLEEKLVDERG-RLL 456
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+R G + G LDDYAF LL+LY +L+ A ++ ELF D E GG +
Sbjct: 457 LRWREGEAANDGQLDDYAFYAFSLLELYRSSFDCTYLLRAAQIAEQILELFSDAEQGGLY 516
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T + ++ R KE +DGA PSGNSV+ VRLA++ + +RQ E +
Sbjct: 517 LTAKDSEQLISRPKEVYDGAIPSGNSVAGEVFVRLAALTGEER---WRQAGERQIRFLTG 573
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLV-GHKSSVDFENMLAAAHASYDLNKTVIHID 656
+K+ + A + PS++ V G ++ + + L + L + +
Sbjct: 574 WIKEYPAGYGMSLIALSSVLYPSQELVCTAQGEEAFQEVRDFL----RRHSLPSLTVLLK 629
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
A E +E + D V +CQN +C+ PV +
Sbjct: 630 CAKNE-----QELAAAAPFTVEYPLPQDGVRYYLCQNGTCAAPVQE 670
>gi|418738150|ref|ZP_13294546.1| PF03190 family protein [Leptospira borgpetersenii serovar
Castellonis str. 200801910]
gi|410746324|gb|EKQ99231.1| PF03190 family protein [Leptospira borgpetersenii serovar
Castellonis str. 200801910]
Length = 692
Score = 369 bits (947), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 253/713 (35%), Positives = 361/713 (50%), Gaps = 71/713 (9%)
Query: 10 TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
TK R LI TCHWCHVME ESFE++ VA LN FVSIKVDREERPD+D++Y
Sbjct: 38 TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 97
Query: 64 MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
M + A+ GGWPL++FL+PD KP+ GGTYFPPE YGR F +L ++ W +KR
Sbjct: 98 MDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKVWSEKRQE 157
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGS--APK 180
L + + L ++ A + LP L +S YD+ FGGF + K
Sbjct: 158 LIVASSELSRYLKDSGEGRAIEKQEEGSLPSKDCFNSGFSLYESYYDAEFGGFKTNHVNK 217
Query: 181 FPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
FP + + +L YH S + +MV TL M +GGI+D VGGG RYS D
Sbjct: 218 FPPSMGLSFLLRYH--------HSSGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTD 269
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
RW VPHFEKMLYD ++ ++K + D++ YL RDM GG I SAE
Sbjct: 270 HRWMVPHFEKMLYDNSLFLETLVECSQVSKKISAESFALDVISYLHRDMRIVGGGICSAE 329
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
DADS EG +EG FY+W +E ++ GE + + ++ + + GN
Sbjct: 330 DADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTNKGN------------ 370
Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
F+GKN+L E A+KL K ++ +L R KL + RSKR RP DDK++ SWN
Sbjct: 371 FEGKNILHE--SYGGEATKLSEEEWKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWN 428
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL I + A+A + R++++++AE SFI R+L D R+
Sbjct: 429 GLYIKALAKAG----------------IAFRREDFLKLAEETYSFIERNLIDPDG-RILR 471
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
FR+G S G+ +DYA +IS + L+E G G ++L A+ LF R G F
Sbjct: 472 RFRDGESGILGYSNDYAEMISSSIVLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 529
Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
TG D VLLR D +DG EPS NS +LV+L+ + G S YR+ AE + F
Sbjct: 530 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTK 587
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
L +++ P + A K +VL+ K + +++LAA + + ++
Sbjct: 588 ELSTHSLSYPHLLSAYWTYRY-HFKEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNE 645
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ EE + + + S + VC+NFSC PV++ L+ +
Sbjct: 646 NELEEA-------RKLSVLFDSRDSGGNALVYVCENFSCKLPVSNLADLQKWI 691
>gi|405355793|ref|ZP_11024905.1| Thymidylate kinase [Chondromyces apiculatus DSM 436]
gi|397091065|gb|EJJ21892.1| Thymidylate kinase [Myxococcus sp. (contaminant ex DSM 436)]
Length = 696
Score = 369 bits (946), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 243/697 (34%), Positives = 345/697 (49%), Gaps = 69/697 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE A+L+N+ F++IKVDREERPD+D++Y VQ + GGGWPL+
Sbjct: 57 SACHWCHVMAHESFESPDTARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP GGTYFPP+DKYGRPGF +L ++DAW+ K+D + + A E L E
Sbjct: 117 VFLTPDLKPFYGGTYFPPQDKYGRPGFPRLLMALRDAWENKQDEVQRQSAQFEEGLGEL- 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
AS P L + + ++K D+ GGFG APKFP P+ +ML ++
Sbjct: 176 -ASYGLEAAPAVLTVADVVAMGQGMAKQVDAVNGGFGGAPKFPNPMNFALMLRAWRR--- 231
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G + + V TL+ MA+GGI+D +GGGFHRYSVDERW VPHFEKMLYD QL +
Sbjct: 232 ----GGGAALKDAVFLTLERMARGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLLH 287
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y A + + + + ++Y+RR+M GG ++A+DADS EG +EG F+V
Sbjct: 288 LYAQAQQVEPRPLWRKVVEETVEYVRREMTDAGGGFYAAQDADS---EG----EEGKFFV 340
Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W +EV L E A L H+ +KP GN + G VL + A A +
Sbjct: 341 WKPEEVRAALPEAQAELVLRHFGIKPGGNFE-----------HGATVLEVVVPVDALAKE 389
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G + + L R+ LF R +R +P DDK + WNGL+I A AS++
Sbjct: 390 RGGAEDVVASELAAARKTLFAAREQRVKPGRDDKQLSGWNGLMIRGLALASRVF------ 443
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
DR E+ A AA F+ +D RL S++ G ++ GFL+DY L
Sbjct: 444 ----------DRPEWARWAADAADFVLEKAWD--GTRLARSYQEGQARIDGFLEDYGNLA 491
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
SGL LY+ K+L A L +LF D E Y +++ D A
Sbjct: 492 SGLTALYQATFDVKYLEAADALVRRAVDLFWDAEKAAYLTAPRGQKDLVVATYGLFDNAF 551
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSG S V LA++ + + + E ++ L M + AAD L +
Sbjct: 552 PSGASTLTEAQVELAALTGDKR---HLELPERYVSRMHDGLVRNPMGYGYLGLAADAL-L 607
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
V L G + D + +A ++ +V W+ ++ +
Sbjct: 608 EGAAAVTLAGSRE--DVAPLRSALDHAFIPTVSV------------GWKAMGQPVPALLK 653
Query: 679 NNFSADKVV-----ALVCQNFSCSPPVTDPISLENLL 710
F + V A +C+ F C PVT+P L L
Sbjct: 654 ELFEGREPVKGKGAAYLCRGFVCELPVTEPDVLSQRL 690
>gi|116331824|ref|YP_801542.1| hypothetical protein LBJ_2312 [Leptospira borgpetersenii serovar
Hardjo-bovis str. JB197]
gi|116125513|gb|ABJ76784.1| Conserved hypothetical protein containing a thioredoxin domain
[Leptospira borgpetersenii serovar Hardjo-bovis str.
JB197]
Length = 692
Score = 369 bits (946), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 252/713 (35%), Positives = 361/713 (50%), Gaps = 71/713 (9%)
Query: 10 TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
TK R LI TCHWCHVME ESFE++ VA LN FVSIKVDREERPD+D++Y
Sbjct: 38 TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 97
Query: 64 MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
M + A+ GGWPL++FL+PD +P+ GGTYFPPE YGR F +L ++ W +KR
Sbjct: 98 MDALHAMDQQGGWPLNIFLTPDGRPIAGGTYFPPEPVYGRKSFLEVLNILRKVWSEKRQE 157
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGS--APK 180
L + + L ++ A + LP L +S YD+ FGGF + K
Sbjct: 158 LIVASSELSRYLKDSGEGRAIEKQEEGSLPSKDCFNSGFSLYESYYDAEFGGFRTNHVNK 217
Query: 181 FPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
FP + + +L YH S + +MV TL M +GGI+D VGGG RYS D
Sbjct: 218 FPPSMGLSFLLRYH--------HSSGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTD 269
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
RW VPHFEKMLYD ++ ++K + D++ YL RDM GG I SAE
Sbjct: 270 HRWMVPHFEKMLYDNSLFLETLVECSQVSKKISAESFALDVISYLHRDMRIVGGGICSAE 329
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
DADS EG +EG FY+W +E ++ GE + + ++ + + GN
Sbjct: 330 DADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTNKGN------------ 370
Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
F+GKN+L E A+KL K ++ +L R KL + RSKR RP DDK++ SWN
Sbjct: 371 FEGKNILHE--SYGGEATKLSEEEWKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWN 428
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL I + A+A + R++++++AE SFI R+L D R+
Sbjct: 429 GLYIKALAKAG----------------IAFQREDFLKLAEETYSFIERNLIDPDG-RILR 471
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
FR+ S G+ +DYA +IS + L+E G G ++L A+ LF R G F
Sbjct: 472 RFRDSESGILGYSNDYAEMISSSIVLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 529
Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
TG D VLLR D +DG EPS NS +LV+L+ + G S YR+ AE + F
Sbjct: 530 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTK 587
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
L +++ P + A S K +VL+ K + +++LAA + + ++
Sbjct: 588 ELSTHSLSYPHLLSAYWTYKYHS-KEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNE 645
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ EE + + + S + VC+NFSC PV++ L+ +
Sbjct: 646 NELEEA-------RKLSVLFDSRDSGGNALVYVCENFSCKLPVSNLADLQKWI 691
>gi|418746293|ref|ZP_13302623.1| PF03190 family protein [Leptospira santarosai str. CBC379]
gi|410792840|gb|EKR90765.1| PF03190 family protein [Leptospira santarosai str. CBC379]
Length = 699
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 252/711 (35%), Positives = 360/711 (50%), Gaps = 73/711 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVME ESFE+ VA LN FVSIKVDREERPD
Sbjct: 41 GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 100
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM + A+ GGWPL+VFL+PD KP+ GGTYFPPE YGR F +L ++ W+
Sbjct: 101 IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWN 160
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
+KR L A +LS+ L S + + LP A L +S YDS FGG
Sbjct: 161 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 216
Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
F + KFP + + +L YH +S + +M TL M +GGI+D VGG
Sbjct: 217 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 268
Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
G RYS D RW VPHFEKMLYD ++ S++K + D++ YL RDM
Sbjct: 269 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLVECSSVSKKISAKSFALDVISYLHRDMRNE 328
Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
G I SAEDADS EG +EG FYVW +E ++ GE + + ++ + + GN
Sbjct: 329 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 377
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+GKN+L E + S +A + ++L R KL + RSKR RP DD
Sbjct: 378 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 428
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL + +A V +++++++AE SFI R+L D
Sbjct: 429 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 471
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
R+ FR+G S G+ +DYA +I+ + L+E G G ++L A+ LF R
Sbjct: 472 SNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 529
Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
G F TG D VLLR D +DG EPS NS V +LV+L+ + G S YR+ AE
Sbjct: 530 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGVDSARYRKFAES 587
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ F L ++ P + A S K +VL+ K + +++LA + +
Sbjct: 588 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 645
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ ++ + EE +++ + S + VC+NFSC P+
Sbjct: 646 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 689
>gi|448359615|ref|ZP_21548265.1| hypothetical protein C482_16798 [Natrialba chahannaoensis JCM
10990]
gi|445642250|gb|ELY95319.1| hypothetical protein C482_16798 [Natrialba chahannaoensis JCM
10990]
Length = 811
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 217/604 (35%), Positives = 317/604 (52%), Gaps = 43/604 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA+ LN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 55 SACHWCHVMEDESFADEQVAEALNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLS 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLS 136
+L+P+ KP GTYFP K G+PGF IL V ++W++ RD + A+ A +
Sbjct: 115 AWLTPEGKPFYVGTYFPKNAKRGQPGFLDILENVTNSWERDRDEVENRAEQWTNAAKDRL 174
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
E + S+++ P + L A +S D +FGGFGS PKFP+P ++++ +
Sbjct: 175 EETPDTVSASQPPS---SDVLDAAANASFRSADRQFGGFGSDGPKFPQPSRLRVLARAAD 231
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ E + Q +++ TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD
Sbjct: 232 RT-------EREDFQDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNA 284
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
+ +L + T D Y+ + + L ++ R++ G FS DA S + + R +EG
Sbjct: 285 AIPRAFLIGYQQTGDERYAEVVAETLAFVERELTHEEGGFFSTLDAQSEDPDTGER-EEG 343
Query: 316 AFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
FYVWT E+ D+L A LF + Y + +GN F+G N + S
Sbjct: 344 TFYVWTPDEIHDVLENETTADLFCDRYDITESGN------------FEGSNQPNRVRSVS 391
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A++ + + L R +LF R +RPRP+ D+KV+ WNGL+I++ A A+ +L
Sbjct: 392 DLAAEYDLEAPDVQDRLESAREELFAAREQRPRPNRDEKVLAGWNGLMIATCAEAALVLG 451
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
G D EY +A A F+R L+DE RL +++G G+L+D
Sbjct: 452 G------------GEDGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDGDVAIDGYLED 499
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAFL L YE L +A++L ++ F D + G + T S++ R +E
Sbjct: 500 YAFLARAALGCYEATGEVDHLAFALDLARVIEDEFWDADRGTLYFTPESGESLVTRPQEL 559
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D + PS V+V L+ L + D + + A L R++ ++ +C AA
Sbjct: 560 GDQSTPSAAGVAVETLLALEGFA--DQGDEFEEIATTVLETHANRIETNSLEHATLCLAA 617
Query: 614 DMLS 617
D L+
Sbjct: 618 DRLA 621
>gi|448328363|ref|ZP_21517675.1| hypothetical protein C489_04491 [Natrinema versiforme JCM 10478]
gi|445615887|gb|ELY69525.1| hypothetical protein C489_04491 [Natrinema versiforme JCM 10478]
Length = 729
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 238/718 (33%), Positives = 362/718 (50%), Gaps = 68/718 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE VA++LN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEDESFEDEAVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+L+P+ KP GTYFP E K G+PGF + ++ D+W+ + D EQ ++A
Sbjct: 113 AWLTPEGKPFFVGTYFPREGKQGQPGFLDLCERISDSWESEEDRAEMEN--RAEQWTDA- 169
Query: 140 SASASSNKLPDEL---------PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
A + PD + L A+ + +S D + GGFGS KFP+P ++++
Sbjct: 170 -AKDQLEETPDAAGAGTGAAPPSSDVLETAADMVLRSADRQHGGFGSGQKFPQPSRLRVL 228
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
++ + TG+ E ++ TL MA GG++DHVGGGFHRY VD W VPHFEKM
Sbjct: 229 ---ARAYDRTGR----EEYLEVFEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKM 281
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD ++ +L + LT + Y+ + + L+++ R++ G FS DA S E+
Sbjct: 282 LYDNAEIPRAFLSGYQLTGEDRYATVVSETLEFVDRELTHDEGGFFSTLDAQS-ESPETG 340
Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
+EGAFYVWT ++V + L A LF + + +GN F+G+N
Sbjct: 341 EHEEGAFYVWTPEDVHEALESETDAALFCARFDISESGN------------FEGRNQPNR 388
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
+ S A + + + L L R+ LF+ R +RPRP D+KV+ WNGL+IS++A A
Sbjct: 389 VATVSELADQFDLEESEILKRLDSARQTLFEAREERPRPARDEKVLAGWNGLLISTYAEA 448
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
+ +L G+D +Y A A F+R L++E RL +++G K
Sbjct: 449 ALVL--------------GAD--DYAATAVDALEFVRDRLWNEADQRLSRRYKDGDVKVD 492
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
G+L+DYAFL G LD Y+ L +A+EL + F D + G + T S++
Sbjct: 493 GYLEDYAFLARGALDCYQATGEVAHLAFALELARVIEAEFWDEDRGTLYFTPESGESLVT 552
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R +E D + PS V+V L+ L + + A L +L+ A+
Sbjct: 553 RPQELGDQSTPSATGVAVEVLLALDEFA----DEDFEDIAATVLETHANKLESSALEHAT 608
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+C AAD L+ + + V + + ++ A+ + L + P +D W E
Sbjct: 609 LCLAADRLAAGALE-VTVAADELPTEWREGFASRY----LPDRLFARRPPTEAGLDDWLE 663
Query: 669 H---NSNNASMARNNFSADKVVALVCQNFSCSPP---VTDPISL--ENLLLEKPSSTA 718
+ A + VC++ +CSPP VT+ + EN +E S+++
Sbjct: 664 TLGLDDAPPIWAGREARDGEPTLYVCRDRTCSPPTHEVTEALEWLGENAAVEGSSASS 721
>gi|108757716|ref|YP_634091.1| hypothetical protein MXAN_5954 [Myxococcus xanthus DK 1622]
gi|108461596|gb|ABF86781.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
Length = 696
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 246/699 (35%), Positives = 347/699 (49%), Gaps = 73/699 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE A+L+N+ F++IKVDREERPD+D++Y VQ + GGGWPL+
Sbjct: 57 SACHWCHVMAHESFESPETARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA-QSGAFAIEQLSEA 138
VFL+PDLKP GGTYFPP+D+YGRPGF +L ++DAW+ K+D + QSG F E L E
Sbjct: 117 VFLTPDLKPFYGGTYFPPQDRYGRPGFPRLLMALRDAWENKQDEVQRQSGQFE-EGLGEL 175
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A+ P L + ++++K D+ GGFG APKFP P+ +ML ++
Sbjct: 176 --ATYGLEAAPAVLTAADVVGMGQRMAKQVDAVHGGFGGAPKFPNPMNFALMLRAWRR-- 231
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G + + V TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD QL
Sbjct: 232 -----GGGAPLKDAVFLTLERMALGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLL 286
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++Y A + + + + + Y+RR+M GG ++A+DADS EG +EG F+
Sbjct: 287 HLYAQAQQVEPRQLWRKVVEETVAYVRREMTDAGGGFYAAQDADS---EG----EEGKFF 339
Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VW +EV L E A L H+ +KP GN + G VL + S A
Sbjct: 340 VWRPEEVRAALPEAQAELVLRHFGIKPGGNFE-----------HGATVLEVVVPVSELAR 388
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ G+ + L ++ LFD R +R +P DDK++ WNGL+I A AS++
Sbjct: 389 ERGVSEDAMERELAAAKQTLFDARERRVKPGRDDKLLSGWNGLMIRGLALASRVF----- 443
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
R E+ + A AA F+ +D RL S++ G ++ GFL+DY L
Sbjct: 444 -----------GRPEWAKWAADAADFVLEKAWD--GTRLARSYQEGQARIDGFLEDYGDL 490
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
SGL LY+ K+L A L +LF D E Y +++ D A
Sbjct: 491 ASGLTALYQATFDVKYLEAADALVRRAVDLFWDAEKAAYLTAPRGQRDLVVATYGLFDNA 550
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSG S V LA++ + + + E +A L M + AAD L
Sbjct: 551 FPSGASTLTEAQVELAALTGDKQ---HLELPERYVARMHDGLVRNTMGYGYLGLAADAL- 606
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF-WEEHNSNNASM 676
L G S + A AS D+ +D A + W+ ++
Sbjct: 607 --------LEGAAS-------VTVAGASDDVAPLRAAMDRAFAPTVALAWKAPGQPVPAL 651
Query: 677 ARNNFSA-----DKVVALVCQNFSCSPPVTDPISLENLL 710
+ F + A +C+ F C PVT+P L L
Sbjct: 652 LQGTFEGREPVKGRAAAYLCRGFVCELPVTEPDVLTQRL 690
>gi|304314907|ref|YP_003850054.1| hypothetical protein MTBMA_c11480 [Methanothermobacter marburgensis
str. Marburg]
gi|302588366|gb|ADL58741.1| conserved hypothetical protein [Methanothermobacter marburgensis
str. Marburg]
Length = 677
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 209/558 (37%), Positives = 317/558 (56%), Gaps = 53/558 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED +A +LN+ FV++KVDREERPD+D +YM Q + G GGWPL+
Sbjct: 53 STCHWCHVMARESFEDPEIADILNENFVAVKVDREERPDIDAIYMKVCQMMTGTGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++P+ +P GTYFPP+D+ G PG +TIL +V W D + ++ + L +++
Sbjct: 113 IIMTPEGEPFFAGTYFPPDDRGGVPGLRTILERVVLLWKNDPDGIVKTARDVVSALKKSV 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLE 198
A ++KL E A E L +++D+R GGFGS KFP P I +L YH ++ +
Sbjct: 173 ---AKASKLKPETVDAAY----EYLRRNFDTRNGGFGSYQKFPTPHNIYFLLRYHLRRGD 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D E +MV TL+ M GGI+D +G GFHRY+V+ W VPHFEKMLYDQ +
Sbjct: 226 D--------EALRMVNLTLRRMRYGGIYDQLGYGFHRYAVEPTWTVPHFEKMLYDQALIL 277
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL+AF +T D Y +I++Y+ ++ P G +SAED AE+EG EG +Y
Sbjct: 278 KAYLEAFQVTCDDLYKKTALEIVEYVLGNLQSPEGAFYSAED---AESEGV----EGKYY 330
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
+W + E+ ++LG+ A + ++ + GN + +G+N+L + A +
Sbjct: 331 LWRASEIREVLGDDANVVMRYFNVLEDGNF--------AGDVRGENIL-HIGSPWRVADE 381
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ L++ I+ RR L + R +RP P LDDK++ WNGL++ + A +IL SE
Sbjct: 382 FNLTLDELNEIIENARRHLLERRMERPTPALDDKILTDWNGLMLGALAACGRILDSE--- 438
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
E + AE FI +L+ + L H +R+ + G LDDYAFLI
Sbjct: 439 -------------EALAAAERCLKFIMDNLHVDG--ELLHRYRDSEAGIDGKLDDYAFLI 483
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL+L++ ++ A+EL + ++ F +GG Y +DP +++R + DGA
Sbjct: 484 WGLLELHDATFREGYVEMALELSESLEDRFGAPDGGFYLT---DDPKLIVRPMDATDGAI 540
Query: 559 PSGNSVSVINLVRLASIV 576
PSGNSV ++NL+RL I+
Sbjct: 541 PSGNSVQMLNLLRLGGIL 558
>gi|448688002|ref|ZP_21693970.1| thioredoxin [Haloarcula japonica DSM 6131]
gi|445779793|gb|EMA30709.1| thioredoxin [Haloarcula japonica DSM 6131]
Length = 717
Score = 368 bits (945), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 230/690 (33%), Positives = 354/690 (51%), Gaps = 60/690 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFEDE +A+ LN+ FV IKVDREERPD+D VYM+ Q + GGGGWPLS +
Sbjct: 58 CHWCHVMEEESFEDEAIAEQLNEDFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEAL 139
L+P+ +P GTYFPPE+K G+PGF +L+++ D+W ++R+ + E + L
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRARQWTEAIESDL 177
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
A+ + P++ ++ ++ + D + GG+GS PKFP+ + +L +
Sbjct: 178 EATPAD---PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAHA 231
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D G+ + +V TL MA G++DHVGGGFHRY+ D++W VPHFEKMLYD ++
Sbjct: 232 DGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIP 287
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA---ETEGATRKKEG 315
+L + Y+ + R+ ++++R++ P G FS DA+SA E EG T +EG
Sbjct: 288 RAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPIDEPEGET--EEG 345
Query: 316 AFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
FYVWT ++V D + + A +F +++ + GN F+G VL S
Sbjct: 346 LFYVWTPEQVRDAVDDETDAEIFCDYFGVTARGN------------FEGATVLAVRKPVS 393
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A + +K L + F+ R++RPRP D+KV+ WNGL+I + A + +L
Sbjct: 394 VLAEEYDQSEDKITASLQRALNQTFEARTERPRPARDEKVLAGWNGLMIRTLAEGAIVLD 453
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+Y +VA A SF+R HL++E +RL +++G G+L+D
Sbjct: 454 D-----------------QYADVAADALSFVREHLWNEDENRLNRRYKDGDVAIDGYLED 496
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAFL G L L+E + L +A++L E F D E G F T S++ R +E
Sbjct: 497 YAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQEL 556
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D + PS V+V L+ L+ S D + + AE + R+ + + A
Sbjct: 557 TDQSTPSSTGVAVDLLLSLSHF---SDDDRFEEVAERVIRTHADRVSSNPLQHASLTLAT 613
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW----EEH 669
D + + + LVG +S D+ + A + + ++ PAD + W E
Sbjct: 614 DTYEQGALE-LTLVGDRS--DYPSEWTETLAERYVPRRLLAHRPADEGRFEQWLDALELD 670
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPP 699
S R K C+NF+CSPP
Sbjct: 671 ESPPIWAGREQIDG-KPTVYACRNFACSPP 699
>gi|284164956|ref|YP_003403235.1| hypothetical protein Htur_1677 [Haloterrigena turkmenica DSM 5511]
gi|284014611|gb|ADB60562.1| protein of unknown function DUF255 [Haloterrigena turkmenica DSM
5511]
Length = 733
Score = 368 bits (945), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 230/695 (33%), Positives = 354/695 (50%), Gaps = 57/695 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFED+ VA +LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEDESFEDDEVAAVLNENFVPIKVDREERPDIDSIYMTVAQLVSGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIE 133
+L+P+ KP GTYFP E + +PGF + +++ D+W+ D Q A +
Sbjct: 113 AWLTPEGKPFFVGTYFPKESQRNQPGFLELCQRISDSWESGEDREEMEHRADQWTEAAKD 172
Query: 134 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
+L E + ++ + L A+ +S D ++GGFGS PKFP+P + ++
Sbjct: 173 RLEETPDDAGTAGGAAEPPSSEVLETAADAALRSADRQYGGFGSGGPKFPQPSRLHVL-- 230
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
++ + TG+ E ++V +L MA GG++DHVGGGFHRY VD+ W VPHFEKMLY
Sbjct: 231 -ARAYDRTGR----EEYLEVVEESLDAMAAGGLYDHVGGGFHRYCVDKDWTVPHFEKMLY 285
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D ++ +L + LT + Y+ + + L +L R++ G FS DA S + E R
Sbjct: 286 DNAEIPRAFLAGYQLTGEERYAEVVDETLAFLERELTHDEGGFFSTLDAQSEDPETGER- 344
Query: 313 KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EG FYVWT EV ++L + A LF Y + +GN F+G+N +
Sbjct: 345 EEGVFYVWTPDEVSEVLEDETTADLFCARYDITESGN------------FEGRNQPNRVR 392
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
+ A + + + + L + R +LF+ R +RPRP+ D+KV+ WNGL+I++ A A+
Sbjct: 393 SLESLADEYDLAEAEIEDRLEDAREQLFEAREQRPRPNRDEKVLAGWNGLMINACAEAAL 452
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
VVG+D EY + A A F+R L+DE RL F++G K G+
Sbjct: 453 --------------VVGND--EYADQAVDALEFVRDRLWDEDEQRLSRRFKDGNVKVDGY 496
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYAFL G L Y+ L +A++L T + F D E G + T S++ R
Sbjct: 497 LEDYAFLARGALGCYQATGDVDHLGFALDLARTIEAEFWDEEQGTIYFTPESGESLVTRP 556
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
+E D + PS V+V L+ L D + + A L +++ ++ +C
Sbjct: 557 QELTDQSTPSAAGVAVETLLALDEFA----EDDFGEIAATVLETHANKIEANSLEHASLC 612
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH- 669
AAD L + + V + + ++ + A + + + P E ++ W +
Sbjct: 613 LAADRLEAGALE-VTVAADELPAEWRDRFADEYHP----DRLFALRPPTAEGLEAWLDQL 667
Query: 670 --NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
A A + VC++ +CSPP D
Sbjct: 668 GLEEPPAIWAGREARDGEPTLYVCRDRTCSPPTHD 702
>gi|359683227|ref|ZP_09253228.1| hypothetical protein Lsan2_00420 [Leptospira santarosai str.
2000030832]
Length = 691
Score = 368 bits (944), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 246/707 (34%), Positives = 355/707 (50%), Gaps = 65/707 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVME ESFE+ VA LN FVSIKVDREERPD
Sbjct: 33 GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM + A+ GGWPL+VFL+PD KP+ GGTYFPPE YGR F +L ++ W
Sbjct: 93 IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWS 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS- 177
+KR L + + + L ++ A + D +N YDS FGGF +
Sbjct: 153 EKRQELVVASSELSQYLKDSGEGRAVEKQEGDLPSENCFDSAFSLYESYYDSEFGGFKTN 212
Query: 178 -APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
KFP + + +L YH +S + +M TL M +GGI+D VGGG R
Sbjct: 213 HVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCR 264
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
YS D RW VPHFEKMLYD ++ S++K + D++ YL RDM G I
Sbjct: 265 YSTDPRWTVPHFEKMLYDNSLFLETLVECSSVSKKISAKSFALDVISYLHRDMRNEDGGI 324
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSD 355
SAEDADS EG +EG FYVW +E ++ GE + + ++ + + GN
Sbjct: 325 CSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN-------- 369
Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
F+GKN+L E + S +A + ++L R KL + RSKR RP DDK++
Sbjct: 370 ----FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILT 424
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
SWNGL + +A V +++++++AE SFI R+L D R
Sbjct: 425 SWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID-PNGR 467
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
+ FR+G S G+ +DYA +I+ + L+E G G ++L A+ LF R G
Sbjct: 468 ILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAG 525
Query: 536 YFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
F TG D VLLR D +DG EPS NS V +LV+L+ + G S YR+ AE +
Sbjct: 526 VFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGVDSARYRKFAESIFSY 583
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
F L ++ P + A S K +VL+ K + +++LA + + +
Sbjct: 584 FTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDSVLAV 641
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
++ + EE +++ + S + VC+NFSC P+
Sbjct: 642 VNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 681
>gi|388254779|gb|AFK24895.1| protein of unknown function DUF255 [uncultured archaeon]
Length = 691
Score = 368 bits (944), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 220/557 (39%), Positives = 310/557 (55%), Gaps = 48/557 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED+ VAK++N+ F++IKVDREERPD+D +Y Q G GGWPLS
Sbjct: 55 SACHWCHVMAHESFEDDEVAKIMNEHFINIKVDREERPDLDDIYQRVCQLATGTGGWPLS 114
Query: 80 VFLSPDLKPLMGGTYFPPE-DKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSE 137
VFL+ D KP GTYFP E +Y PGFKTIL ++ A+ KK+++ A SG F + L++
Sbjct: 115 VFLTSDQKPFYVGTYFPKEGGRYNMPGFKTILLQLATAYKSKKQEIEAASGEF-MGALAQ 173
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
AS L ++ + A L + D +GGFG APKFP P + +L +
Sbjct: 174 TAKDIASGMAEKASLERSIIDEAAMGLLQMGDPIYGGFGQAPKFPNPTNLMFLLRYYN-- 231
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
SG + + V FT MA GGIHD +GGGF RY+ D++W +PHFEKMLYD L
Sbjct: 232 ----LSG-LNRFKDFVAFTADKMAAGGIHDQLGGGFARYATDQKWLIPHFEKMLYDNALL 286
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A +Y + + +TK Y I R LD++ R+M+ P G +SA DADS EG +EG F
Sbjct: 287 AQLYSELYQITKADKYVQITRKTLDFVSREMMHPEGGFYSALDADS---EG----EEGKF 339
Query: 318 YVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
Y+W KE+ ILG+ +F EHY + GN F+G+N+L +
Sbjct: 340 YIWQKKEIASILGDQVATDIFCEHYGVTEGGN------------FEGQNILNVRVPLANV 387
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
+ G E+ I+ + KLF R KR RP D+K++ SWNGL+IS FA+ I
Sbjct: 388 GLRYGKTPEQAAQIIADASAKLFTAREKRVRPGRDEKILTSWNGLMISGFAKGYSI---- 443
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ +Y++ A++A FI + RL +F++G SK +LDDYA
Sbjct: 444 ------------TGDAKYLQAAKNAVDFIEAKI-AAGDGRLLRTFKDGHSKLNAYLDDYA 490
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F +SGLLDL+ S +L AI + + F D + G F T+ + +++R K +D
Sbjct: 491 FYVSGLLDLFAVDSKQAYLDKAIMHTDFMLKHFWDEKEGNLFFTSDDHEKLIVRTKSFYD 550
Query: 556 GAEPSGNSVSVINLVRL 572
A PSGNS++ +L+RL
Sbjct: 551 LAIPSGNSMAAADLLRL 567
>gi|16768044|gb|AAL28241.1| GH13403p [Drosophila melanogaster]
Length = 629
Score = 368 bits (944), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 231/669 (34%), Positives = 330/669 (49%), Gaps = 83/669 (12%)
Query: 78 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
+SV+L+P L PL+ GTYFPP+ +YG P F T+L+ + W+ ++ L +G+ + L +
Sbjct: 1 MSVWLTPTLAPLVAGTYFPPKSRYGMPSFNTVLKSIARKWETDKESLLATGSSLLSALQK 60
Query: 138 ALSASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQ 188
ASA +P+ A E+LS++ +D GGFGS PKFP +
Sbjct: 61 NQDASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGFGSEPKFPEVPRLN 112
Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
+ + +D + MV+ TL + KGGIHDH+ GGF RY+ + WH HFE
Sbjct: 113 FLFHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFARYATTQDWHNVHFE 165
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYDQGQL + +A+ +T+D Y I YL +D+ P G ++ EDADS T
Sbjct: 166 KMLYDQGQLMMAFANAYKVTRDEIYLRYADKIHKYLIKDLRHPLGGFYAGEDADSLPTHE 225
Query: 309 ATRKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDP 356
K EGAFY WT E++ DI E A ++ HY LKP GN + SDP
Sbjct: 226 DKVKVEGAFYAWTWDEIQAAFKDQAQRFDDITPERAFEIYAYHYGLKPPGN--VPAYSDP 283
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
H GKN+LI + + + +++ +L L +R KRPRPHLD K+I +
Sbjct: 284 HGHLTGKNILIVRGSEEDTCANFKLEEDRFKKLLATTNDILHVIRDKRPRPHLDTKIICA 343
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGLV+S + ++R++YM+ A+ F+R+ +YD + L
Sbjct: 344 WNGLVLSGLCKLGN--------------CYSANREQYMQTAKELLDFLRKEMYDPEQKLL 389
Query: 477 QHS----------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDE 526
S S+ GFLDDYAFLI GLLD Y+ L WA LQ+TQD+
Sbjct: 390 IRSCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDK 449
Query: 527 LFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
LF D G YF + + P+V++R+KEDHDGAEP GNSVS NLV LA YY +
Sbjct: 450 LFWDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSAHNLVLLAH--------YYDE 501
Query: 587 NA----EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 642
NA L F + A+P M A +L + +V V S D + +
Sbjct: 502 NAYLQKAGKLLNFFADVSPFGHALPEMLSA--LLMHENGLDLVAVVGPDSPDTQRFVEIC 559
Query: 643 HASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + ++H+DP++ EE SN + K +C +C PVTD
Sbjct: 560 RKFFIPSMIIVHVDPSNPEEA-------SNQRLQTKFKMVGGKTTVYICHERACRMPVTD 612
Query: 703 PISLENLLL 711
P LE+ L+
Sbjct: 613 PQQLEDNLM 621
>gi|422002946|ref|ZP_16350180.1| hypothetical protein LSS_05548 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|417258416|gb|EKT87804.1| hypothetical protein LSS_05548 [Leptospira santarosai serovar
Shermani str. LT 821]
Length = 691
Score = 367 bits (943), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 252/711 (35%), Positives = 360/711 (50%), Gaps = 73/711 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVME ESFE+ VA LN FVSIKVDREERPD
Sbjct: 33 GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM + A+ GGWPL+VFL+PD KP+ GGTYFPPE YGR F +L ++ W+
Sbjct: 93 IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWN 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
+KR L A +LS+ L S + + LP A L +S YDS FGG
Sbjct: 153 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 208
Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
F + KFP + + +L YH +S + +M TL M +GGI+D VGG
Sbjct: 209 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 260
Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
G RYS D RW VPHFEKMLYD ++ S++K + D++ YL RDM
Sbjct: 261 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLVECSSVSKKISAKSFALDVISYLHRDMRNE 320
Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
G I SAEDADS EG +EG FYVW +E ++ GE + + ++ + + GN
Sbjct: 321 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 369
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+GKN+L E + S +A + ++L R KL + RSKR RP DD
Sbjct: 370 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 420
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL + +A V +++++++AE SFI R+L D
Sbjct: 421 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 463
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
R+ FR+G S G+ +DYA +I+ + L+E G G ++L A+ LF R
Sbjct: 464 SNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 521
Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
G F TG D VLLR D +DG EPS NS V +LV+L+ + G S YR+ AE
Sbjct: 522 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGIDSARYRKFAES 579
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ F L ++ P + A S K +VL+ K + +++LA + +
Sbjct: 580 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 637
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ ++ + EE +++ + S + VC+NFSC P+
Sbjct: 638 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 681
>gi|295667924|ref|XP_002794511.1| spermatogenesis-associated protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226285927|gb|EEH41493.1| spermatogenesis-associated protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 791
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 219/536 (40%), Positives = 304/536 (56%), Gaps = 36/536 (6%)
Query: 11 KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
K R FL + CHWCHVME ESF +A +LN F+ IK+DREERPD+D+VYM YV
Sbjct: 58 KLNRLIFLSIGYSACHWCHVMEKESFMSPEIAAILNKSFIPIKLDREERPDIDEVYMNYV 117
Query: 68 QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDK 119
QA G GGWPL+VFL+PDL+P+ GG+Y+P P G+ F IL K++D W
Sbjct: 118 QATTGSGGWPLNVFLTPDLEPVFGGSYWPGPHSNALPTLGGEGQITFVDILEKLRDVWHT 177
Query: 120 KRDMLAQSGAFAIEQLSEALSASASSNKLPD-----ELPQNALRLCAEQLSKSYDSRFGG 174
++ +S +QL E + + +K D +L L + + YD+ GG
Sbjct: 178 QQLRCRESAKDITKQLRE-FAEEGTHSKQSDVETEEDLEIELLEEAYQHFASRYDAVNGG 236
Query: 175 FGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
F APKFP PV + +++ S+ + D E S ++ + TL M++GGIHD +G
Sbjct: 237 FSEAPKFPTPVNLSFLVHLSRYPSAVADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGH 296
Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIG 290
GF RYSV W +PHFEKMLYDQ QL +VY+DAF D DI Y+ M+
Sbjct: 297 GFARYSVTADWSLPHFEKMLYDQAQLLDVYVDAFDSAYDPELLGAMYDIATYITSPPMLS 356
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCD 349
P G S+EDADS + T K+EGAFYVWT KE++ ILG+ A + H+ + GN
Sbjct: 357 PTGGFHSSEDADSRPSPNDTEKREGAFYVWTLKELKQILGQRDADVCARHWGVLADGN-- 414
Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPH 408
++R++DPH+EF +NVL S A + G+ ++ + I+ R KL + R SKR RP
Sbjct: 415 VARINDPHDEFINQNVLSIQVTPSKLAKEFGLGEDEVVRIIKRSREKLREYRESKRVRPD 474
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
LDDK+IV+WNGL I + A+ S +L++ + F AE A FI+ +L
Sbjct: 475 LDDKIIVAWNGLAIGALAKCSVVLENLDRDKAYQF----------RRAAEEAVRFIKHNL 524
Query: 469 YDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
+DEQT +L +R G PGF DDYA+LISGL++LYE L +A +LQ+
Sbjct: 525 FDEQTGQLWRIYRGGVRGDTPGFADDYAYLISGLINLYEATFDDSHLQFAEQLQHA 580
>gi|239906990|ref|YP_002953731.1| hypothetical protein DMR_23540 [Desulfovibrio magneticus RS-1]
gi|239796856|dbj|BAH75845.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length = 697
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 243/692 (35%), Positives = 342/692 (49%), Gaps = 49/692 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE +A L+N VS+KVDREERPD+D +YM+ AL G GGWPL+
Sbjct: 52 STCHWCHVMERESFEDEDIAALMNAVVVSVKVDREERPDLDALYMSVCHALTGRGGWPLT 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFP E YGR G + +L++V W R + + ++ + E L
Sbjct: 112 VFLTPDKEPFFAGTYFPKESAYGRTGLRELLQRVHMFWKGNRQAVVNNAGQIMDAVREQL 171
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A+A + E Q AL QL+ +D+R GGFG APKFP P + +L ++ D
Sbjct: 172 AAAAGTASA--EPGQAALDAARTQLAGIFDARNGGFGGAPKFPSPHNLLFLLREYRRTGD 229
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ M TL M +GG++D VG G HRY+ D W +PHFEKMLYDQ
Sbjct: 230 V-------SCRDMACRTLVAMRRGGVYDQVGFGLHRYATDAHWFLPHFEKMLYDQALTVM 282
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
++A+ + DV + + +IL+Y+RRD+ P G +SAEDADS EG EG FYV
Sbjct: 283 ACVEAYQASGDVAHKTMALEILEYVRRDLTSPEGLFYSAEDADS---EGV----EGKFYV 335
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W++ E+ +LG+ A L GN + E G N+L +A++L
Sbjct: 336 WSAAELRRLLGDEAALIMAAMGATEEGNAH----DEATGETTGANILHLPRPLDETAARL 391
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ E L CR L R KR RP DDKV+ NGL++++ A+A++ E +
Sbjct: 392 GLTAEILAERLEACRHVLLAEREKRVRPLCDDKVLTDNNGLMLAALAKAARAFDDEDLAG 451
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ AE+ S + R Q RL H R+ + G LDDY FL
Sbjct: 452 ------------RAVTAAEALLSRLAR-----QNGRLLHRLRDDEAAIDGLLDDYVFLAW 494
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GL++LY+ T +L A+EL E F D GGYF + +L+R K D A P
Sbjct: 495 GLVELYQTVFDTAYLRRAVELMKAVAEHFADPNEGGYFLAPDDGEQLLVRQKIFFDAAVP 554
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGNSV+ L L + +++ A RL D A C + +
Sbjct: 555 SGNSVAYFVLTTLFRLTGDPA---FKEQATALARAMAPRLADHAAGYAFFLCGLSQV-LG 610
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
V L G + D + + A Y L + + + P D +E D + A R
Sbjct: 611 QASEVTLAGDPAGPDTQTLARAIFERY-LPEVAVVLRP-DEDEPDI-----AALAPFTRY 663
Query: 680 NFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
D + A VC+ SC PP + ++ LL
Sbjct: 664 QLPLDGRAAAHVCRAGSCQPPTAEVETMLKLL 695
>gi|432330863|ref|YP_007249006.1| thioredoxin domain protein [Methanoregula formicicum SMSP]
gi|432137572|gb|AGB02499.1| thioredoxin domain protein [Methanoregula formicicum SMSP]
Length = 708
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 255/709 (35%), Positives = 360/709 (50%), Gaps = 59/709 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVM ESFED VA+LLN F+++KVDREERPD
Sbjct: 38 GEEAFLRAAREDKPVFLSIGYATCHWCHVMAHESFEDLEVAELLNRDFIAVKVDREERPD 97
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D YM Q L G GGWPL++ ++P+ KP TY P E ++ PG +L ++ AW
Sbjct: 98 IDSTYMQVCQMLSGQGGWPLTIVMTPEKKPFFAATYLPKERRFAVPGLLDLLPRIAKAWR 157
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGS 177
++R L QS E +++AL ++ P+ P A L E L +D +GGF
Sbjct: 158 EQRGELLQSA----ESITQALETRDAAPAGPE--PDAALLDEGYEDLLLRFDPGYGGFSG 211
Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
APKFP P + +L + K+ TGK MV+ TL GGIHDH+GGGFHRYS
Sbjct: 212 APKFPTPHTLLFLLRYWKR---TGK----KRALDMVVKTLDAFRDGGIHDHIGGGFHRYS 264
Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
D +W VPHFEKMLYDQ L Y +AF T++ Y + Y+ RD+ P G FS
Sbjct: 265 TDAQWRVPHFEKMLYDQALLVIAYTEAFQATRNYRYRETAMSTVRYVLRDLTDPEGAFFS 324
Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDP 356
AEDADS R EGAFY+WT E+E +L + A + + ++ GN P
Sbjct: 325 AEDADS-------RGGEGAFYLWTMGELEAVLEKDDAAIAGRVFNVRDEGN-----FLSP 372
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
+ +N+L A S G+ E+ + R +LF R KR RP DDKV++
Sbjct: 373 EST-GAENILFRTRTDEALVSVTGIHQEELDERIASIRERLFAAREKRERPRRDDKVLLD 431
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+I++ A+A++ + G R E S +R RL
Sbjct: 432 WNGLMIAALAKAARAFGN------------GECRTAAERAMECILSRMR-----TGDGRL 474
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
H +R+G PGF DDYAFL L++LYE ++L A+ + T + FLDRE GG+
Sbjct: 475 YHRYRDGERAIPGFADDYAFLGLALIELYECTFDPRYLAEALAIMKTFRDHFLDRENGGF 534
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F T G+ ++L+R K +DGA PS NSV+ L+RL+ + ++ + S F
Sbjct: 535 FFTAGDAEALLVRDKVIYDGAVPSANSVACEVLLRLSRLTGTTEHEDLAAALARS---FA 591
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
R+++ A CA + PS + +V+ G S + LAA + Y + TVIH
Sbjct: 592 GRVRESPSAFCWFLCAIERAVGPS-QDIVIAGDSGSPAVQEFLAAVRSRYLPHCTVIHKP 650
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADK--VVALVCQNFSCSPPVTDP 703
+D + + E N AD+ A +C +CS P+TDP
Sbjct: 651 ASDPDTIAALEALTPFT-----RNILADRNTPAAYLCSGSTCSLPITDP 694
>gi|398331059|ref|ZP_10515764.1| hypothetical protein LalesM3_03040 [Leptospira alexanderi serovar
Manhao 3 str. L 60]
Length = 699
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 256/717 (35%), Positives = 363/717 (50%), Gaps = 80/717 (11%)
Query: 10 TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
TK R LI TCHWCHVME ESFE++ VA LN FVSIKVDREERPD+D++Y
Sbjct: 46 TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 105
Query: 64 MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
M + A+ GGWPL++FL+PD KP+ GGTYFPPE +YGR F IL ++ W +KR
Sbjct: 106 MDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPRYGRKSFLEILNILRKVWKEKRQE 165
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDE---LP-QNALRLCAEQLSKSYDSRFGGFGS-- 177
L A +LS L S + + LP +N YD+ FGGF +
Sbjct: 166 L----IVASSELSRYLKDSGEGRAIEKQEGSLPSENCFDSGFSLYESYYDAEFGGFKTNH 221
Query: 178 APKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
KFP + + +L YHS SG S +MV TL M +GGI+D +GGG R
Sbjct: 222 VNKFPPSMGLSFLLRYYHS--------SGNPS-ALEMVENTLLAMKQGGIYDQIGGGLCR 272
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
YS D W VPHFEKMLYD ++ ++K + D++ YL RDM GG I
Sbjct: 273 YSTDHHWMVPHFEKMLYDNSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGI 332
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSD 355
SAEDADS EG +EG FY+W +E ++ GE + + ++ + + GN
Sbjct: 333 CSAEDADS---EG----EEGLFYIWDFEEFREVCGEDSRILEKFWNVTKKGN-------- 377
Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVI 414
F+GKN+L E + A+K K ++ +L R KL + R+KR RP DDK++
Sbjct: 378 ----FEGKNILHE--SYRSEATKFSEEEWKRIDSVLERGRAKLLERRNKRVRPLRDDKIL 431
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 474
SWNGL I + A+A V R++++++AE SFI R+L D +
Sbjct: 432 TSWNGLYIKALAKAG----------------VAFQREDFLKLAEETYSFIERNLID-PSG 474
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
R+ FR+ S G+ +DYA +IS + L+E G G ++L A+ LF R
Sbjct: 475 RILRRFRDKESGILGYSNDYAEMISSSIALFEAGCGIRYLKNAVLWMEEAIRLF--RSPA 532
Query: 535 GYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
G F TG D VLLR D +DG EPS NS +LV+L+ + G S YR+ AE
Sbjct: 533 GVFFDTGNDGEVLLRRSVDSYDGVEPSANSSLAYSLVKLS--LFGIDSVRYREFAESIFL 590
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
F L +++ P + A S K +VL+ K + + +LAA + +
Sbjct: 591 YFTKELSTYSLSYPHLLSAYWTYRHHS-KEIVLI-RKDTDSGKELLAAIQTRFLPDSVFA 648
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ + EE +++ + S + VC+NFSC PV++ L+ +
Sbjct: 649 VVNENELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPVSNLADLKKWI 698
>gi|379010883|ref|YP_005268695.1| thymidylate kinase YyaL [Acetobacterium woodii DSM 1030]
gi|375301672|gb|AFA47806.1| thymidylate kinase YyaL [Acetobacterium woodii DSM 1030]
Length = 686
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 245/699 (35%), Positives = 348/699 (49%), Gaps = 74/699 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED VA+ LN +F+SIKVDREERPD+D++YMT+ Q G GGWPL+
Sbjct: 56 STCHWCHVMEKESFEDAEVAEYLNKYFISIKVDREERPDIDQIYMTFSQVSTGQGGWPLN 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+ + KP TY P +YG PG +L ++ W + + + S A + L L
Sbjct: 116 VFLTAERKPFYVTTYLPKRSRYGHPGLMDVLVGIEGQWRQNNEEIIYS-ADKMTSLLNDL 174
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
NKL + +A E S+D R+GGFG APKFP P +H L
Sbjct: 175 EIRKDENKLKRTIFFDAYDFFDE----SFDDRYGGFGKAPKFPTP-------HHLFYLLR 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++ + MV TL+ M +GG+ DH+G GF RYS DE+W VPHFEKMLYD L
Sbjct: 224 CYQAFNQPDALVMVEKTLKQMYQGGLFDHIGFGFSRYSTDEQWLVPHFEKMLYDNALLVM 283
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y + + +T + Y I + + Y+ RD+ G F AEDADS EG +EG FYV
Sbjct: 284 IYAETYQVTGNPLYKKIAQKTITYVNRDLRSEEGGFFCAEDADS---EG----EEGRFYV 336
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 376
W+ ++VE ILG + A +F + Y + GN F GKN+ +I ++ A
Sbjct: 337 WSMEKVEKILGKKRAAVFFKFYPMTAKGN------------FDGKNIPNMIPVDLDLIEA 384
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ LEK +L E + LF+ R KR PH DDK++ +WNGL+I++ A A +I
Sbjct: 385 NP---ELEK---VLDEMKADLFNQREKRIHPHKDDKILTAWNGLMITALAMAGRIF---- 434
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
D+ EY+ AE +FI + + RL +R G +K +LDDYA
Sbjct: 435 ------------DQPEYLIQAEETMAFIENKM-TRRNGRLYARYRLGEAKILAYLDDYAS 481
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHD 555
+I G L+LY+ T++L AI +F D G G+F + ++ R KE +D
Sbjct: 482 VIWGYLELYQATFKTEYLEKAILRAVDMINIFGDDFGMSGFFQYGNDAEKLIARPKEIYD 541
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A+PSGN+++ L++L I K Y A F L MA +M CA
Sbjct: 542 NAQPSGNALAACCLLKLGKITGEQK---YIDIVNGMFAYFAGNLNQAPMASTMMLCAKLF 598
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA--DTEEMDFWEEHNSNN 673
P+ + VV G++ M + LNK + + E D + N
Sbjct: 599 HEQPTTE-VVFAGYEKDPTIRAM------NQRLNKLFLPFSVVLFNKSEKDL----KTIN 647
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
A + A VC+N+ C PV D S ++ E
Sbjct: 648 AFAVNQQMIHGQPTAYVCKNYRCEEPVNDLESFLKIIEE 686
>gi|397780504|ref|YP_006544977.1| hypothetical protein BN140_1338 [Methanoculleus bourgensis MS2]
gi|396939006|emb|CCJ36261.1| putative protein yyaL [Methanoculleus bourgensis MS2]
Length = 719
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 242/707 (34%), Positives = 360/707 (50%), Gaps = 56/707 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL + CHWCHVME ESF D VAKLLND FV IKVDREERPD
Sbjct: 44 GEEAFLRAKEEAKPIFLSIGYSACHWCHVMEEESFADPMVAKLLNDVFVCIKVDREERPD 103
Query: 59 VDKVYMTYVQALYGGG-GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
+D++Y+ L G GWPL++F++ D +P +Y P E +YG G ++ ++ W
Sbjct: 104 IDQIYIDAAHVLSGVAVGWPLTIFMTHDGRPFFAASYIPKESRYGMTGLVDLIPRISRIW 163
Query: 118 DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
+R L Q+G+ ++ EAL ++A + EL + L + L + +D GGFG
Sbjct: 164 QTRRQELEQTGS----RVLEALQSAARTPPGESELSEATLDDAYDTLFRLFDGENGGFGD 219
Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
APKFP P + +L + + TGK+ MV TL M +GGI DH+G GFHRY+
Sbjct: 220 APKFPAPHNLIFLLRYGHR---TGKT----PAYTMVEKTLHAMRRGGIFDHIGWGFHRYT 272
Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
D W VPHFEKMLYDQ L Y +A+ T ++ R+ + Y+ R+M P G +S
Sbjct: 273 TDAEWLVPHFEKMLYDQALLIMAYTEAYLATGREEFARTARETIAYVLREMTDPDGGFYS 332
Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDP 356
AEDADS EG EG FY+WT + +LGE F + + GN + P
Sbjct: 333 AEDADS---EGV----EGKFYIWTKAGILQVLGEEDGERFSRIFGVTEPGNY----LEQP 381
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
G+NVL ++ A + MP E + + R++LF R +R RP DDK++
Sbjct: 382 GARRTGQNVLRLRRPLASWAHEFSMPEEDLAWFVEDARQRLFAAREERARPAKDDKILTD 441
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+I++ A A++ D EY+ AE AA+F+ L RL
Sbjct: 442 WNGLMIAALATAARAF----------------DDPEYLAAAEKAAAFVLTRLRGPDG-RL 484
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
H +RNG + LDDYAF++ L+++YE +L A++L + D + GG+
Sbjct: 485 LHRYRNGEAGITATLDDYAFMLWALIEVYEASFAPGYLRTAVKLARDLSARYWDCDHGGF 544
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F T +D + +R K DGA PSGNSV++ L L + A + + + A VF
Sbjct: 545 FFTP-DDVEIAVRQKPVFDGATPSGNSVAMYALFLLGRMTANLE---FEEMANRIRRVFA 600
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
+++ +A + + P+ + V++ G + + D M+ A + Y + VI
Sbjct: 601 DTVRESPIAYSYFLTGLEFMLGPNVE-VIISGVRDAEDTRAMIQAIRSRYTPDAVVI-FR 658
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
P+D EE + + A R+ + + K A VC N++C PVTD
Sbjct: 659 PSDEEEPEI-----TKVAGFTRDIVTIEGKATAYVCTNYACDIPVTD 700
>gi|410450937|ref|ZP_11304964.1| PF03190 family protein [Leptospira sp. Fiocruz LV3954]
gi|410015249|gb|EKO77354.1| PF03190 family protein [Leptospira sp. Fiocruz LV3954]
Length = 691
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 252/711 (35%), Positives = 359/711 (50%), Gaps = 73/711 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVME ESFE+ VA LN FVSIKVDREERPD
Sbjct: 33 GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM + A+ GGWPL+VFL+PD KP+ GGTYFPPE YGR F +L ++ W+
Sbjct: 93 IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWN 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
+KR L A +LS+ L S + + LP A L +S YDS FGG
Sbjct: 153 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 208
Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
F + KFP + + +L YH +S + +M TL M +GGI+D VGG
Sbjct: 209 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 260
Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
G RYS D RW VPHFEKMLYD + S++K + D++ YL RDM
Sbjct: 261 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLAECSSVSKKISAKSFALDVISYLHRDMRNE 320
Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
G I SAEDADS EG +EG FYVW +E ++ GE + + ++ + + GN
Sbjct: 321 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 369
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+GKN+L E + S +A + ++L R KL + RSKR RP DD
Sbjct: 370 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 420
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL + +A V +++++++AE SFI R+L D
Sbjct: 421 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 463
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
R+ FR+G S G+ +DYA +I+ + L+E G G ++L A+ LF R
Sbjct: 464 SNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 521
Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
G F TG D VLLR D +DG EPS NS V +LV+L+ + G S YR+ AE
Sbjct: 522 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGVDSARYRKFAES 579
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ F L ++ P + A S K +VL+ K + +++LA + +
Sbjct: 580 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 637
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ ++ + EE +++ + S + VC+NFSC P+
Sbjct: 638 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 681
>gi|433638443|ref|YP_007284203.1| thioredoxin domain protein [Halovivax ruber XH-70]
gi|433290247|gb|AGB16070.1| thioredoxin domain protein [Halovivax ruber XH-70]
Length = 759
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 244/713 (34%), Positives = 350/713 (49%), Gaps = 56/713 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA +LN+ FV IKVDREERPDVD +YMT QA+ G GGWPLS
Sbjct: 53 SACHWCHVMEAESFADETVAAVLNEGFVPIKVDREERPDVDSIYMTVCQAVTGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQL 135
+L+PD +P GTYFP E + G PGF + R+++ +W + RD + + A A ++L
Sbjct: 113 AWLTPDGRPFYVGTYFPREAQRGTPGFVELCRQIRVSWSENRDEIEARANEWAAMATDRL 172
Query: 136 SEALSASASSNKLPDELPQ---------------NALRLCAEQLSKSYDSRFGGFG-SAP 179
A S P+ + + L E ++ D GGFG P
Sbjct: 173 DSA-DGGGESASTPEPISADTDSPIDVGLDADGPDGLERVGEAALRASDDEHGGFGRGGP 231
Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
KFP+P ++ + +L+ T A E L M GG++DHVGGGFHRY VD
Sbjct: 232 KFPQPRRVEALF----RLDATHDRPTAHE---TATRALDAMCTGGLYDHVGGGFHRYCVD 284
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
E W VPHFEKMLYD + V L + +T D Y+ R+ +D+L R++ P G +S
Sbjct: 285 EDWTVPHFEKMLYDNAAIPRVLLAGYQVTGDDRYARTVRETVDFLERELRHPEGGFYSTL 344
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
DA S ETE R +EGAFYVWT E+E + E A L E L CD ++D N
Sbjct: 345 DAQS-ETESGER-EEGAFYVWTPAEIESAVAE-AGLSDESGAL----FCDRFGVTDSGN- 396
Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 419
F+G VL A+ G+ + L R +F+ R+ RPRP D+K++ WNG
Sbjct: 397 FEGSTVLTVEASIEDLATDYGLAPSTVEDRLDAARTAVFEARATRPRPPRDEKILAGWNG 456
Query: 420 LVISSFARASKILKSEAESAMFNFP--VVGSDR----KEYMEVAESAASFIRRHLYDEQT 473
L I A AS +L + A + V SD Y ++A A +F+R HL+D+ T
Sbjct: 457 LAIDMLAEASIVLGTSGREAAIDAASDVASSDEPSGDDRYAQLATDALAFVRTHLWDDDT 516
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
RL R+G G+L+DYAFL G L YE ++L +A++L F D
Sbjct: 517 GRLARRVRDGDVGIDGYLEDYAFLARGALTCYEATGEVEFLAFALDLARAIRRDFWDESA 576
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YRQNAEHSL 592
+ T S+L+R +E D + PS V+V L L A + +R + H+
Sbjct: 577 ETLYFTPERGESLLVRPQELGDQSTPSPTGVAVEILALLDPFTAEPFGEMAHRVVSTHAT 636
Query: 593 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 652
+ E+ + +++++ A L V V +++E L + L + +
Sbjct: 637 EIEESPFEYVSLSL------AQSLVTHGPLEVTTVADGRPMEWERTLGRTY----LPRRL 686
Query: 653 IHIDPADTEEMDFWEE---HNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ PA + +D W + ++ A AD+ VC + CSPP D
Sbjct: 687 LAHRPASSAMLDDWLDVIGVDTVPPIWADREQRADEPTVYVCADRVCSPPEHD 739
>gi|421111206|ref|ZP_15571685.1| PF03190 family protein [Leptospira santarosai str. JET]
gi|410803388|gb|EKS09527.1| PF03190 family protein [Leptospira santarosai str. JET]
Length = 699
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 252/711 (35%), Positives = 360/711 (50%), Gaps = 73/711 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVME ESFE+ VA LN FVSIKVDREERPD
Sbjct: 41 GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 100
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM + A+ GGWPL+VFL+PD KP+ GGTYFPPE YGR F +L ++ W+
Sbjct: 101 IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWN 160
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
+KR L A +LS+ L S + + LP A L +S YDS FGG
Sbjct: 161 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 216
Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
F + KFP + + +L YH +S + +M TL M +GGI+D VGG
Sbjct: 217 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 268
Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
G RYS D RW VPHFEKMLYD ++ S++K + D++ YL RDM
Sbjct: 269 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLVECSSVSKKISAKSFALDVISYLHRDMRNE 328
Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
G I SAEDADS EG +EG FYVW +E ++ GE + + ++ + + GN
Sbjct: 329 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 377
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+GKN+L E + S +A + ++L R KL + RSKR RP DD
Sbjct: 378 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 428
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL + +A V +++++++AE SFI R+L D
Sbjct: 429 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 471
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
R+ FR+G S G+ +DYA +I+ + L+E G G ++L A+ LF R
Sbjct: 472 PNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 529
Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
G F TG D VLLR D +DG EPS NS V +LV+L+ + G S YR+ AE
Sbjct: 530 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGIDSARYRKFAES 587
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ F L ++ P + A S K +VL+ K + +++LA + +
Sbjct: 588 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 645
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ ++ + EE +++ + S + VC+NFSC P+
Sbjct: 646 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 689
>gi|296121436|ref|YP_003629214.1| hypothetical protein Plim_1180 [Planctomyces limnophilus DSM 3776]
gi|296013776|gb|ADG67015.1| protein of unknown function DUF255 [Planctomyces limnophilus DSM
3776]
Length = 707
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 237/699 (33%), Positives = 350/699 (50%), Gaps = 76/699 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE+ +A+LLN WFVSIKVDREERPD+D++YM V A+ GGWP+S
Sbjct: 50 SACHWCHVMEHESFENPRIAELLNQWFVSIKVDREERPDLDQIYMAAVIAMTQQGGWPMS 109
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P P GGTYFPP +YGRPGF +L + DAW+ +R+++ + + QL+ +
Sbjct: 110 VFLTPQGHPFYGGTYFPPTSRYGRPGFAEVLAAIHDAWENRREVVTEQAS----QLTMTV 165
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S + P L +N L L + D GGFG APKFP +++++ + + + D
Sbjct: 166 HDQLSERQEPTTLHENLLEKAGRTLVRVCDRVNGGFGHAPKFPHAMDLRLAMRLAHRF-D 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T ++ E +E L MAKGGIHDH+GGGF RYS DE W VPHFEKMLYD L
Sbjct: 225 TTETAEVAE------LGLTAMAKGGIHDHLGGGFARYSTDEIWLVPHFEKMLYDNALLLQ 278
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI----FSAEDADSAETEGATRKKEG 315
YLD + K FY + I+ Y+ R+M P E+ +A+DADS EG +EG
Sbjct: 279 AYLDGWQFNKTDFYRRTAQSIVHYVLREMQVPRAELPGGFCAAQDADS---EG----EEG 331
Query: 316 AFYVWTSKEVEDIL------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
F+VW+ E+ D+L + + LF+ Y + GN ++G N+L
Sbjct: 332 RFFVWSQSEIRDVLSGSELGNDDSRLFERAYGVTSGGN------------WEGHNILNLP 379
Query: 370 NDSSASASKLGM---PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
+A +LGM LE+ L++L R KLF+ R R P D+K+IV+WNGL+IS+ A
Sbjct: 380 KTIAALGRELGMAETALEQKLSLL---RTKLFEHRKNRIAPGRDEKLIVAWNGLMISALA 436
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
RA +L + + +++AES + L HS + G K
Sbjct: 437 RAGLVLDDQEALQAAQ-----RAARVILDMAESL------------PYGLPHSIQKGQPK 479
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
+LDDY + L++L+ WL A+ L + F D E GG++ T+ + +
Sbjct: 480 HGAYLDDYGCFLEALIELFLADGDPSWLSRAVPLIDRLVNEFHDDEQGGFYFTSSQAEKL 539
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
+ R ++ D PSGN+ L++ I ++S+ + A L ++ MA
Sbjct: 540 ISRSRDFQDNVTPSGNAAVANALLKFGRITGDARSE---ELAHEVLQAASGLMQQSTMAT 596
Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENML---AAAHASYDLNKTVIHIDPADTEEM 663
A D PS + V + +S L A +++L + +
Sbjct: 597 AHSLAALDWWLGPSYECVYVPAETTSTTDSEPLKQDAVQRVAHELYLPNVLFLTGRAQ-- 654
Query: 664 DFWEEHNSNNASMARNNFS-ADKVVALVCQNFSCSPPVT 701
WE + A + + + A + V VCQ C PV
Sbjct: 655 --WE--GTLAAGLVQGRLAPASEPVLYVCQKGVCQLPVV 689
>gi|456873671|gb|EMF89033.1| PF03190 family protein [Leptospira santarosai str. ST188]
Length = 691
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 252/711 (35%), Positives = 358/711 (50%), Gaps = 73/711 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVME ESFE+ VA LN FVSIKVDREERPD
Sbjct: 33 GEEAFTKAKEQDKLIFLSIGYATCHWCHVMERESFENPTVADYLNSHFVSIKVDREERPD 92
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM + A+ GGWPL+VFL+PD KP+ GGTYFPPE YGR F +L ++ W
Sbjct: 93 IDRIYMDALHAMNQQGGWPLNVFLTPDGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWS 152
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE---LPQNALRLCAEQLSKS-YDSRFGG 174
+KR L A +LS+ L S + + LP A L +S YDS FGG
Sbjct: 153 EKRQEL----VVASSELSQYLKDSGEGRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGG 208
Query: 175 FGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 231
F + KFP + + +L YH +S + +M TL M +GGI+D VGG
Sbjct: 209 FKTNHVNKFPPSMGLSFLLRYH--------RSSGNPKALEMAENTLLAMKQGGIYDQVGG 260
Query: 232 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP 291
G RYS D RW VPHFEKMLYD + S++K + D++ YL RDM
Sbjct: 261 GLCRYSTDPRWTVPHFEKMLYDNSLFLETLAECSSVSKKISAKSFALDVISYLHRDMRNE 320
Query: 292 GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
G I SAEDADS EG +EG FYVW +E ++ GE + + ++ + + GN
Sbjct: 321 DGGICSAEDADS---EG----EEGLFYVWDLEEFREVCGEDSRILEKFWNVTEKGN---- 369
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+GKN+L E + S +A + ++L R KL + RSKR RP DD
Sbjct: 370 --------FEGKNILRE-SYPSGAAKFSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDD 420
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ SWNGL + +A V +++++++AE SFI R+L D
Sbjct: 421 KILTSWNGLYTKALTKAG----------------VAFQKEDFLKLAEETYSFIERNLID- 463
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
R+ FR+G S G+ +DYA +I+ + L+E G G ++L A+ LF R
Sbjct: 464 SNGRILRRFRDGESGILGYSNDYAEMIASSIALFEAGRGIRYLKNAVLWMEEAIRLF--R 521
Query: 532 EGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
G F TG D VLLR D +DG EPS NS V +LV+L+ + G S YR+ AE
Sbjct: 522 SPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLVYSLVKLS--LFGVDSARYRKFAES 579
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+ F L ++ P + A S K +VL+ K + +++LA + +
Sbjct: 580 IFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KEIVLI-RKDADSGKDLLAEIQTKFLPDS 637
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ ++ + EE +++ + S + VC+NFSC P+
Sbjct: 638 VLAVVNEDELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPIA 681
>gi|87310211|ref|ZP_01092343.1| hypothetical protein DSM3645_14105 [Blastopirellula marina DSM
3645]
gi|87287201|gb|EAQ79103.1| hypothetical protein DSM3645_14105 [Blastopirellula marina DSM
3645]
Length = 637
Score = 366 bits (939), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 219/564 (38%), Positives = 311/564 (55%), Gaps = 56/564 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME ESF DE +AK LN+ F+ IKVDREERPD+D VYMT VQ + GGGWPLS
Sbjct: 71 SSCHWCHVMEHESFTDEEIAKFLNEHFICIKVDREERPDIDHVYMTAVQIMTRGGGWPLS 130
Query: 80 VFLSPDLKPLMGGTYFPPE--DKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
VFL+P+ KP GGTY+P D+ + GF T++ +V W++K L +SG + + E
Sbjct: 131 VFLTPEGKPFYGGTYWPARDGDRDAQVGFLTVIDRVAQFWEEKEADLRKSGDGLSDLVKE 190
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQMML 191
AL + P L + L +++++D+ GGF + PKFP P +Q +L
Sbjct: 191 ALRPRVTLQ--PLTLDEQLLATADAAIAETFDAEHGGFNFSADDPNQPKFPEPATLQYLL 248
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ +SG A E QKM+ TL +A GGI DH+GGG HRYSVD W +PHFEKML
Sbjct: 249 ARA-------RSGSA-EAQKMLTTTLDGIAAGGIRDHIGGGLHRYSVDRFWRIPHFEKML 300
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD QLA++Y +A+ LT + Y + + D++ R+M GP G+ +SA DADS EG
Sbjct: 301 YDNAQLASLYAEAYQLTGNPQYRRVAAETCDFVLREMTGPDGQFYSAIDADS---EG--- 354
Query: 312 KKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EG +Y W+ E+ IL + L K Y L + N F+ + EL
Sbjct: 355 -EEGKYYRWSQAELTAILSPAQLELAKSVYGLGGSPN------------FEEVYFVPELQ 401
Query: 371 DSSASASK-LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
A + L + ++ L R L R+KR P +D K + +WNGL+I+ A A
Sbjct: 402 APIAELPQNLKLDADQLQTRLQTLRETLLAARAKRTPPAIDTKALTAWNGLMIAGLADAG 461
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+IL+ R++Y++ A +A FI ++ RL SF++G +K
Sbjct: 462 RILQ----------------RQDYLDAAARSADFILANVTSADG-RLLRSFKDGQAKITA 504
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
++DDYA L+ GL+ L+E KWL A L Q ELF D GG++ T + V++R
Sbjct: 505 YVDDYAMLVDGLIALHEATGEPKWLDAAERLTKQQIELFGDPRLGGFYFTAADAEEVIVR 564
Query: 550 VKEDHDGAEPSGNSVSVINLVRLA 573
K D A P+GNSV+ NL+ LA
Sbjct: 565 GKIATDNAIPAGNSVAAGNLLYLA 588
>gi|448355570|ref|ZP_21544321.1| hypothetical protein C483_16206 [Natrialba hulunbeirensis JCM
10989]
gi|445635098|gb|ELY88270.1| hypothetical protein C483_16206 [Natrialba hulunbeirensis JCM
10989]
Length = 722
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 238/696 (34%), Positives = 351/696 (50%), Gaps = 59/696 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA++LN+ FV IKVDREERPDVD +YMT Q + G GGWPLS
Sbjct: 55 SACHWCHVMEDESFADEQVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLS 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML-------AQSGAFAI 132
+L+P+ KP GTYFP K G+PGF IL + ++W RD + + +
Sbjct: 115 AWLTPEGKPFYVGTYFPKNAKRGQPGFLDILENLTNSWAGDRDEIENRAEQWTDAAKDRL 174
Query: 133 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMML 191
E+ +A+SAS + + L A +S D +FGGFGS PKFP+P ++++
Sbjct: 175 EETPDAVSASQPPSS-------DVLEAAANASLRSADRQFGGFGSDGPKFPQPSRLRVL- 226
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
++ + TG+ E Q +++ TL MA GG++DHVGGGFHRY VD W VPHFEKML
Sbjct: 227 --ARAADRTGR----DEFQDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKML 280
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD ++ +L + T D Y+ + + L ++ R++ G FS DA S E E
Sbjct: 281 YDNAEIPRAFLIGYQQTGDERYAEVVAETLAFVARELTHEEGGFFSTLDAQSEEPE-TGE 339
Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
++EGAFYVWT E+ D+L A LF + Y + +GN F+G +
Sbjct: 340 REEGAFYVWTPDEIHDVLENETTADLFCDRYDITESGN------------FEGSTQPNRV 387
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
S A++ + L R KLF R +RPRP+ D+KV+ WNGL+I++ A A+
Sbjct: 388 RSVSDLAAEYDLEAADVRARLESAREKLFAAREQRPRPNRDEKVLAGWNGLMIATCAEAA 447
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L D EY +A A F+R L+DE RL +++G G
Sbjct: 448 LVLGG------------SEDGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDGDVAIDG 495
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
+L+DYAFL L YE L +A++L ++ F D + G + T S++ R
Sbjct: 496 YLEDYAFLARAALGCYEATGEVDHLAFALDLARIIEDEFWDADRGTLYFTPESGESLVTR 555
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+E D + PS V+V L+ L + D + + A L R++ ++ +
Sbjct: 556 PQELGDQSTPSAAGVAVETLLALEGF--ADQDDEFEEIATTVLETHANRIETNSLEHATL 613
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW--E 667
C AAD L + + V ++ D A A L + PA +E++ W E
Sbjct: 614 CLAADRLESGALEITV-----AADDLPAAWREAFAGRYLPDRLFARRPATDDELESWLTE 668
Query: 668 EHNSNNASMARNNFSADKVVAL-VCQNFSCSPPVTD 702
++ + + D L VC++ +CSPP D
Sbjct: 669 LDLADAPPIWAGREARDGEPTLYVCRDRTCSPPTHD 704
>gi|74318745|ref|YP_316485.1| hypothetical protein Tbd_2727 [Thiobacillus denitrificans ATCC
25259]
gi|74058240|gb|AAZ98680.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
25259]
Length = 673
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 239/688 (34%), Positives = 351/688 (51%), Gaps = 73/688 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM + FED V ++N FV+IKVDREERPD+D++Y T Q L GGGWPL
Sbjct: 48 SACHWCHVMAHDCFEDAEVGAVMNRLFVNIKVDREERPDLDQIYQTAHQLLAQRGGGWPL 107
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSE 137
+VFL+PD P GTYFP +Y PGF ++ V AW +R ++LAQ+ A L++
Sbjct: 108 TVFLTPDQTPFFAGTYFPKTARYQLPGFPELMENVAHAWHARRGEVLAQNDAVRA-ALAQ 166
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ S A+S P L L L++++D +GGF APKFPRP E+ +L ++
Sbjct: 167 SQSQPAASASTP--LTAAPLEQGVRDLAQAFDPVWGGFSRAPKFPRPGELFFLLRRAQ-- 222
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
G ++ ++M LFTL+ MA GG+ D +GGGF RYSVDE W +PHFEKMLYD G L
Sbjct: 223 ------GGDAKAREMALFTLRKMASGGVVDQLGGGFCRYSVDEEWAIPHFEKMLYDNGPL 276
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++Y DA++L + + I+ +L R+M P G +SA DADS EG EG F
Sbjct: 277 LHLYADAWALRGETLFRETAEGIVAWLLREMRAPEGGFYSALDADS---EG----HEGKF 329
Query: 318 YVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
YVW+ +EV+ +L E+A+ + + P P+ E N L
Sbjct: 330 YVWSREEVKSLLTPDEYAVAAPFYGFDAP-----------PNFENTSWNPL-RARPLEEI 377
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ LG+ + RRKLF R R RP DDK + SWN L+I A A +++
Sbjct: 378 AAALGLFPTDAEARVAAARRKLFAARESRIRPGRDDKQLTSWNALMIGGLAHAGRVMA-- 435
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
R E++ A +A F+RR+L+ + RL+ +F+ G ++ +LDDYA
Sbjct: 436 --------------RPEWVAEAHAAIDFLRRNLW--RDGRLRATFKRGEARLNAYLDDYA 479
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FL+ LL+ + + WA EL + F DRE GG+F T+ + ++L R K +D
Sbjct: 480 FLVDALLETMQAAYREADMAWAQELADALLAHFEDREAGGFFFTSHDHEALLTRPKPGYD 539
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A PSGN V+ L RL ++ ++ Y + L +F ++ +A P + D
Sbjct: 540 NATPSGNGVAAFALQRLGHLLGETR---YLDASARCLRLFLPQVVQQPIAHPTLLAVLDE 596
Query: 616 LSVPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
P R +VL G + V ++ LA + D+ + N A
Sbjct: 597 ALRPPRV-IVLRGPDTPVQEWAANLAPRLGARDMLLAL----------------PNGEGA 639
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTD 702
A A + A +C +C PP+T+
Sbjct: 640 PGALAKPEAPQPTAWICSGTACQPPITE 667
>gi|291614213|ref|YP_003524370.1| hypothetical protein Slit_1752 [Sideroxydans lithotrophicus ES-1]
gi|291584325|gb|ADE11983.1| protein of unknown function DUF255 [Sideroxydans lithotrophicus
ES-1]
Length = 676
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 237/704 (33%), Positives = 366/704 (51%), Gaps = 87/704 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
+ CHWCHVM ESFEDE VA ++N+ F++IKVDREERPD+D++Y Q L GGWPL
Sbjct: 48 SACHWCHVMAHESFEDEAVAAVMNELFINIKVDREERPDLDQIYQNAHQLLSRRSGGWPL 107
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
++FL+PD P GTYFP + +YG PGF +++ + A+ ++R LA+ G +Q+ A
Sbjct: 108 TMFLAPDGTPFYSGTYFPKQARYGLPGFPALIQDIAHAYKEQRGELAEQG----KQIVAA 163
Query: 139 LSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
L+A D L + + Q S+++D GGFG APKF P E+ ++L +
Sbjct: 164 LAAWQPEKSATDSTLDASPIATSIRQHSENFDRVNGGFGGAPKFLHPAELDLLLQQTHAT 223
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
D ++ + +VLFTLQ MA+GG++D +GGGF RYSVD W +PHFEKMLYD G L
Sbjct: 224 HD-------AQTRHIVLFTLQQMAQGGLYDQLGGGFCRYSVDAEWDIPHFEKMLYDNGLL 276
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+Y DA+ + D F++ I ++ R+M P G +++ DADS +EG F
Sbjct: 277 LGLYSDAWLSSSDPFFARIVEQTAAWVMREMQSPQGGYYASLDADS-------EHEEGKF 329
Query: 318 YVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLS----RMSDPHNEFKGKNVLIELND 371
YVW ++ D+L E+A L + HY L T N + R+S P E
Sbjct: 330 YVWQRNDIRDLLSAAEYA-LIQPHYGLDSTPNFENHAWNLRVSQPLGEI----------- 377
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
A KLG+ E+ +L + KLF R +R RP D+K++ SWNGL+I+ A+A++I
Sbjct: 378 ----AQKLGLGEEQAAMLLAAAKTKLFAAREQRIRPGRDEKILGSWNGLMIAGMAKAARI 433
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
R++++ A+ A F+R L+ Q RL + ++G + +L
Sbjct: 434 FG----------------REDWLHSAQQAMDFVRTTLW--QDGRLLATHKDGKTHLNAYL 475
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
DD+A+L++ L+L + + L +A+++ + F D GG+F T+ + +++ R K
Sbjct: 476 DDHAYLLNAALELLQAEFRSPDLSFAVQIADALLARFEDVRNGGFFFTSHDHEALIQRNK 535
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
D A PSGN ++ L+RLA + + Y AE L +F ++ A +C
Sbjct: 536 TAQDNATPSGNGIATQGLLRLAELTGDIR---YTDAAERCLKLFFPIMQRAAGQFSSLCT 592
Query: 612 A-ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A + L PS +VL G + ++ AA A Y +I + N
Sbjct: 593 ALGEALQPPSM--LVLCG--AEIETAAWRAAVAAKYLPGLMIIVL--------------N 634
Query: 671 SNNASM--ARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ AS+ + + + A +C C PP+T SL+ LL E
Sbjct: 635 GDEASLPSSLDKPRSATTTAWLCHGTQCLPPIT---SLDELLTE 675
>gi|448393368|ref|ZP_21567693.1| hypothetical protein C477_15875 [Haloterrigena salina JCM 13891]
gi|445663783|gb|ELZ16525.1| hypothetical protein C477_15875 [Haloterrigena salina JCM 13891]
Length = 730
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 230/701 (32%), Positives = 355/701 (50%), Gaps = 70/701 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFED+ VA++LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEDESFEDDDVAEVLNENFVPIKVDREERPDIDSIYMTVAQLVSGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK--------KRDMLAQSGAFA 131
+L+P+ KP GTYFP E + +PGF + +++ D+W+ + D ++
Sbjct: 113 AWLTPEGKPFFVGTYFPKESQRNQPGFLELCQRISDSWESEDREEMEHRADQWTEAAKDR 172
Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 190
+E+ + A+ + + P L A + +S D ++GGFGS PKFP+P + ++
Sbjct: 173 LEETPDGAGAAGGAAEPPS---SEVLETAANAVLRSADRQYGGFGSGGPKFPQPSRLHVL 229
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
++ + TG+ E +++ TL MA GG+ DHVGGGFHRY VD+ W VPHFEKM
Sbjct: 230 ---ARAYDRTGR----EEYLEVIEETLDAMAAGGLSDHVGGGFHRYCVDKDWTVPHFEKM 282
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD ++ +L + LT D Y+ + + LD+L R++ G FS DA S E
Sbjct: 283 LYDNAEIPRAFLAGYQLTGDERYAEVVEETLDFLERELTHDEGGFFSTLDAQS-EDPATG 341
Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
++EGAFYVWT EV ++L + A LF Y + +GN F+G+N
Sbjct: 342 EREEGAFYVWTPGEVSEVLEDETTADLFCARYDITESGN------------FEGRNQPNR 389
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
+ + A + + + L + R LF+ R +RPRP+ D+KV+ WNGL+I++ A A
Sbjct: 390 VRSLESLAEEYDLEQSEIEERLEDARETLFEAREERPRPNRDEKVLAGWNGLMINACAEA 449
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
+ +L G DR Y E A A F+R L+D RL F++G K
Sbjct: 450 ALVL--------------GEDR--YAEQAVDALEFVRDRLWDADEQRLSRRFKDGDVKVD 493
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
G+L+DYAFL G L Y+ L +A++L T + F D E G + T ++
Sbjct: 494 GYLEDYAFLARGALGCYQATGDVDHLAFALDLARTIEAEFWDEEQGTIYFTPESGEPLVT 553
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R +E D + PS V+V L+ L D + A L +++ ++
Sbjct: 554 RPQELTDQSTPSAAGVAVETLLALDEFA----EDDLERIAATVLETHANKIEANSLEHAS 609
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+C AAD L + + V + + ++ + A + L + P + ++ W +
Sbjct: 610 LCLAADRLEAGALE-VTVAADELPDEWRDRFAEEYHPGRL----FALRPPTEDGLEAWLD 664
Query: 669 HNSNNAS-------MARNNFSADKVVALVCQNFSCSPPVTD 702
+ + + ARN + VC++ +CSPP D
Sbjct: 665 ELALDEAPPIWAGREARNG----EPTLYVCRDRTCSPPTHD 701
>gi|448627283|ref|ZP_21671896.1| thioredoxin [Haloarcula vallismortis ATCC 29715]
gi|445759112|gb|EMA10399.1| thioredoxin [Haloarcula vallismortis ATCC 29715]
Length = 733
Score = 365 bits (937), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 234/710 (32%), Positives = 356/710 (50%), Gaps = 78/710 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+E +A+ LN+ FV IKVDREERPD+D VYM+ Q + GGGGWPLS +
Sbjct: 58 CHWCHVMEEESFENEAIAEQLNEHFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
L+PD +P GTYFPPE+K G+PGF +L+++ D+W +++ +M AQ AIE
Sbjct: 118 LTPDGEPFYVGTYFPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRAQQWTEAIESDL 177
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
EA A P++ ++ ++ + D + GG+GS PKFP+ + +L +
Sbjct: 178 EATPAD------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
D G+ + +V TL MA G++DHVGGGFHRY+ D++W VPHFEKMLYD
Sbjct: 229 AHADGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA----------- 304
++ +L + Y+ + R+ ++++R++ P G FS DA+SA
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPHSESRSDSEQ 344
Query: 305 ------ETEGATRKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDP 356
E +EG FYVWT ++V D + + A +F ++Y + GN
Sbjct: 345 SSGESPRDEPGGETEEGLFYVWTPEQVHDAVDDETDAEVFCDYYGVTERGN--------- 395
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
F+G VL + A + ++ L + F+ R RPRP D+KV+
Sbjct: 396 ---FEGATVLAVRKPVAVLAEEYEQSEDEITASLQRALNQTFEARKDRPRPARDEKVLAG 452
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+I + A + +L ++Y +VA A SF+R HL+DE RL
Sbjct: 453 WNGLMIRTLAEGAIVLD-----------------EQYADVAADALSFVREHLWDEDERRL 495
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+++G G+L+DYAFL G L L+E + L +A++L E F D E G
Sbjct: 496 NRRYKDGDVAIDGYLEDYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTL 555
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F T S++ R +E D + PS V+V L+ L+ S +D + AE L
Sbjct: 556 FFTPTGGESLVARPQELTDQSTPSSTGVAVDLLLSLSHF---SDNDRFESVAERVLRTHA 612
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
R+ + + A D + + + LVG +S+ + A A + + + ++
Sbjct: 613 DRVSSNPLQHASLTLATDTYEQGALE-LTLVGDQSA--YPGEWAETLAEHYIPRRLLAHR 669
Query: 657 PADTEEMDFWEEHNSNNAS----MARNNFSADKVVALVCQNFSCSPPVTD 702
PAD E + W + + S R + V C+NF+CSPP D
Sbjct: 670 PADDSEFEQWLDALGLDESPPIWAGREQVDGEPTV-YACRNFACSPPKHD 718
>gi|150400057|ref|YP_001323824.1| hypothetical protein Mevan_1315 [Methanococcus vannielii SB]
gi|150012760|gb|ABR55212.1| protein of unknown function DUF255 [Methanococcus vannielii SB]
Length = 687
Score = 365 bits (936), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 237/713 (33%), Positives = 366/713 (51%), Gaps = 58/713 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL +TCHWCHVM +SFED VA LN F+SIKVDREERPD
Sbjct: 28 GEEAFKKAKLENKPIFLSIGYSTCHWCHVMAKDSFEDFDVADTLNKNFISIKVDREERPD 87
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +Y+ Q + G GGWPL++ ++PD KP T+ E ++G PG +L + + W
Sbjct: 88 LDDIYLKTCQLMTGSGGWPLTIIMTPDKKPFFAATFISKEPRFGSPGIIDLLEGISELWA 147
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K D + + + L E +S + S KL ++L + A QL + YD +GGFG
Sbjct: 148 IKHDEIVKRSDEILIHL-ENISKTTSKGKLDEKLLEKAFL----QLKEIYDKNYGGFG-V 201
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP I ++ + KK TG E +M + TL M GGI+DH+ GFHRY+V
Sbjct: 202 PKFPTAHLIIFLIKYWKK---TGN----DEALEMAIKTLDKMKMGGIYDHISYGFHRYAV 254
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DE W +PHFEKMLYDQ ++ YL+++ T++ + I ++ +Y+ + + P +SA
Sbjct: 255 DEMWKLPHFEKMLYDQALISMAYLESYRATRNEEHKKIVSEVFEYVLKVLKSPEKAFYSA 314
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPH 357
E+ AE+EG EG FY W E++ IL +FK+ Y +KP GN L ++
Sbjct: 315 EN---AESEGI----EGKFYTWNITEIDQILRNSENNIFKKVYNIKPEGNY-LGESTEAT 366
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
N G N+L AS++ M E+ IL + R+KL D R RP D K++ W
Sbjct: 367 N---GTNILYMERSIQEIASEMEMWPEEVDQILEKARKKLLDALENRKRPSKDYKILADW 423
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+I+S ++A +I K+E EY++ +E A SF+ + + +L
Sbjct: 424 NGLMIASLSKAGRIFKNE----------------EYIKASEDAMSFLLSKMVINE--KLY 465
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
HS+ K PGFLDDYAF+ GL++LY ++L A + ELF E GG+
Sbjct: 466 HSYIENELKVPGFLDDYAFITWGLIELYFATFNIEYLKKARDFAEKTLELFW--EDGGFN 523
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+ E + +V+ +DGA PSG S+ +NL++L+ I+ + D Y +
Sbjct: 524 FASKEVNDNIFKVRNIYDGAIPSGTSIMALNLLKLSHIL---RIDKYHEKVYELFENSAE 580
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
++ M A + + P+ V +VG + + ++ + Y N +++ I P
Sbjct: 581 KISKSPFTYLQMLSAYNFDNDPT--DVSIVGDLENKTTKEIIDEINRVYRPNMSLLFI-P 637
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+D+E + E+ AS + ++ V +C+ SC P T+P + NLL
Sbjct: 638 SDSERLKKLEKI----ASFVKEYPTSKDPVVYICKKDSCLNPETNPSQILNLL 686
>gi|448562484|ref|ZP_21635442.1| thioredoxin domain containing protein [Haloferax prahovense DSM
18310]
gi|445718802|gb|ELZ70486.1| thioredoxin domain containing protein [Haloferax prahovense DSM
18310]
Length = 709
Score = 365 bits (936), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 234/697 (33%), Positives = 350/697 (50%), Gaps = 74/697 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+ KP GTYFPPE + G PGF+ ++ ++W RD +A EQ + A+
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDRDEIANRA----EQWTSAI 168
Query: 140 SAS-ASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKK 196
+ + +P E P + L + + D GGFG PKFP+P I +L
Sbjct: 169 TDRLEETPDVPGEAPGSDVLDSTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL----- 223
Query: 197 LEDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
G A G++ L +L MA GG+ DH+GGGFHRY VD W VPHFEKMLY
Sbjct: 224 ------RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLY 277
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
DQ LA+ YLDA LT + Y+ + + +++RR++ G F+ DA S
Sbjct: 278 DQAGLASRYLDAARLTGNESYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------G 330
Query: 313 KEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+EG FYVWT +V D+L E A LF + Y + P GN F+ K ++ ++
Sbjct: 331 EEGTFYVWTPDDVRDLLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSA 378
Query: 372 SSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
++A A + + + + L + R+ LF R R RP D+KV+ WNGL+IS+FA+ S
Sbjct: 379 TTAELADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSV 438
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+L+ ++ + SD A A F+R L+D++T L NG K G+
Sbjct: 439 VLEDDS---------LASD-------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGY 482
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYAFL G DLY+ L +A++L F D + G + T S++ R
Sbjct: 483 LEDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRP 542
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
+E D + PS V+ + L + + A+ L F R++ + +
Sbjct: 543 QEPTDQSTPSSLGVATSLFLDLEQFAPDAD---FGDVADAVLGSFANRVRGSPLEHVSLA 599
Query: 611 CAADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-E 667
AA+ + VP + + + S ++ LA+ + L V+ P EE+D W +
Sbjct: 600 LAAEKAASGVP---ELTIAADEVSDEWRETLASRY----LPGLVVSRRPGTDEELDAWLD 652
Query: 668 EHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
E + A A + + C+NF+CS P D
Sbjct: 653 ELGLDEAPPIWAGREMADGEPTVYACENFTCSAPTHD 689
>gi|394990058|ref|ZP_10382890.1| hypothetical protein SCD_02483 [Sulfuricella denitrificans skB26]
gi|393790323|dbj|GAB72529.1| hypothetical protein SCD_02483 [Sulfuricella denitrificans skB26]
Length = 681
Score = 364 bits (935), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 240/688 (34%), Positives = 358/688 (52%), Gaps = 73/688 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+TCHWCHVM ESFED+ A L+N +++IKVDREERPD+D++Y + L G GGWPL
Sbjct: 48 STCHWCHVMAHESFEDQTTADLINRDYIAIKVDREERPDLDQIYQSAHNLLTGKSGGWPL 107
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
++FL+PD P GGTYFPPE +Y RPGFK +L KV A+ ++R +AQ L E+
Sbjct: 108 TLFLTPDQTPFYGGTYFPPEARYNRPGFKDLLPKVAQAYRERRHDIAQQNI----SLRES 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L++ + E L QL K++D GGFG APKFPRP EI L E
Sbjct: 164 LASGGPVPQAGIEPNPAPLAGAQSQLEKNFDPVHGGFGGAPKFPRPSEIAFCLRRYAAEE 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ ++ +M TL+ +A GGI+D +GGGF RYSVDERW +PHFEKMLYD G L
Sbjct: 224 N-------AQALEMARQTLRKIADGGINDQLGGGFCRYSVDERWLIPHFEKMLYDNGPLL 276
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+Y +A+ + D + + + + +L R+M P G +SA DADS EG FY
Sbjct: 277 ELYANAWCCSGDERFRRVAEETVAWLEREMRAPQGGFYSALDADSEHV-------EGKFY 329
Query: 319 VWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT +EV L E+A+L + HY L N + S H F + L ++ A
Sbjct: 330 VWTPQEVAATLSADEYAVLSR-HYGLDQPANFEGS-----HWHFYVAHPLDQV------A 377
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+L + L+ +L R KL +R++R RP D+K++ SWN L+I A A +
Sbjct: 378 RELSVELDDAWRLLESARTKLIALRAQRVRPGRDEKILTSWNALMIKGLAHAGRTF---- 433
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
R++++ +A+ A FI L+ + +RL S+++G S G+LDDYAF
Sbjct: 434 ------------GREDWIALAQQATDFIHAELW--RNNRLLASWKDGKSNLGGYLDDYAF 479
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ L++L + T L +A EL F D + GG++ T + +++ R K D
Sbjct: 480 LLDALVELLQARFRTADLTFACELAEALLVRFEDCDQGGFYFTAHDHETLIFRPKTGFDN 539
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM-AMAVPLMCCAADM 615
A PSGN+V+ L RL ++ ++ Y AE +L +F ++ A + + +
Sbjct: 540 ATPSGNAVAAFALQRLGHLLGETR---YLAAAERALKLFYPQIASQPAGFMSFLSVLEEY 596
Query: 616 LSVPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
L P + VL G V ++ LA Y + V+ + ++EM+
Sbjct: 597 LDPP--QIAVLRGPAEQVAAWQQTLA---KEYRPSTMVLAL----SDEME--------KL 639
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + + V A VCQ+ C P ++D
Sbjct: 640 PGSLDKPATSVVNAWVCQSVKCLPAISD 667
>gi|456865795|gb|EMF84112.1| PF03190 family protein [Leptospira weilii serovar Topaz str.
LT2116]
Length = 716
Score = 364 bits (935), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 250/714 (35%), Positives = 360/714 (50%), Gaps = 74/714 (10%)
Query: 10 TKTRRTHFLI------NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY 63
TK R LI TCHWCHVME ESFE++ VA LN FVSIKVDREERPD+D++Y
Sbjct: 63 TKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIY 122
Query: 64 MTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM 123
M + A+ GGWPL++FL+PD KP+ GGTYFPPE +YGR F IL ++ W +KR
Sbjct: 123 MDALHAMDQQGGWPLNMFLTPDGKPITGGTYFPPEPRYGRKSFLEILNILRKVWSEKRQE 182
Query: 124 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKF 181
L + + L ++ A ++ +N YD+ FGGF + KF
Sbjct: 183 LIVASSELSRYLKDSGEGRAIEKQVGSLPSENCFDSGFSLYESYYDAEFGGFKTNHVNKF 242
Query: 182 PRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
P + + +L YHS SG +MV TL M +GGI+D +GGG RYS D
Sbjct: 243 PPSMGLSFLLRYYHS--------SGNP-RALEMVENTLLAMKQGGIYDQIGGGLCRYSTD 293
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
W VPHFEKMLYD ++ ++K + D++ YL RDM GG I SAE
Sbjct: 294 HHWMVPHFEKMLYDNSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGICSAE 353
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
DADS EG +EG FY+W +E ++ GE + + ++ + + GN
Sbjct: 354 DADS---EG----EEGLFYIWDFEEFREVCGEDSQILEKFWNVTKKGN------------ 394
Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
F+GKN+L E + A+K K ++ +L R KL + RSKR RP DDK++ SWN
Sbjct: 395 FEGKNILHE--SYRSEATKFSEEEWKRIDSVLERGRAKLLERRSKRVRPLRDDKILTSWN 452
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL I + A+A V R++++++AE SFI ++L D R+
Sbjct: 453 GLYIKALAKAG----------------VAFQREDFLKLAEETYSFIEKNLIDPNG-RILR 495
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
FR+ S G+ +DYA +IS + L+E G G ++L A+ LF R G F
Sbjct: 496 RFRDNESGILGYSNDYAEMISSSIALFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFF 553
Query: 539 TTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
TG D VLLR D +DG EPS NS +LV+L+ + G S Y + AE F
Sbjct: 554 DTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSARYGEFAESIFLYFTK 611
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF-ENMLAAAHASYDLNKTVIHID 656
L +++ P + A S K +VL+ + DF +++LAA + + ++
Sbjct: 612 ELSTNSLSYPHLLSAYWTYRRHS-KEIVLI--RKDTDFGKDLLAAIQTRFLPDSVFAVVN 668
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ EE +++ + S + VC+NFSC PV++ L+ +
Sbjct: 669 ENELEEA-------RKLSTLFDSRDSGGNALVYVCENFSCKLPVSNLADLKKWI 715
>gi|114778919|ref|ZP_01453713.1| hypothetical protein SPV1_12250 [Mariprofundus ferrooxydans PV-1]
gi|114550835|gb|EAU53402.1| hypothetical protein SPV1_12250 [Mariprofundus ferrooxydans PV-1]
Length = 685
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 242/695 (34%), Positives = 335/695 (48%), Gaps = 77/695 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED VA++LN +F++IKVDREERPD+D VYM Q + GGWPL+
Sbjct: 62 STCHWCHVMEHESFEDPQVAEVLNRYFIAIKVDREERPDIDAVYMHAAQLMNVSGGWPLN 121
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ L+PD KP TY P E ++GR G + ++V W + R + S L++++
Sbjct: 122 LLLTPDKKPFYAATYLPKEGRFGRMGLIELAQRVGVMWKQDRQRIEASANSISSALTDSI 181
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A A + + L A R A++ +D GGFG AP FP P + +L +
Sbjct: 182 -AVAKTGAMDMALVDAAYRDTAQR----FDKGSGGFGGAPLFPSPQRLLFLLRY------ 230
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G + + MV +L M +GGIHD +GGGFHRYS D W +PHFEKML DQ L
Sbjct: 231 -GILKDQPQALTMVKESLTAMQRGGIHDQLGGGFHRYSTDAHWLLPHFEKMLSDQAMLMM 289
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y + + T D ++ RD +YL RDM ++AEDADS EG +EG FY+
Sbjct: 290 AYAEGWKATGDASFAATARDTAEYLLRDMRDKQDGFYTAEDADS---EG----EEGRFYL 342
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W++ E+ LG A F + Y ++ GN + +E G N+L + +A
Sbjct: 343 WSADEIRHALGRRADAFMQAYGVEADGNFS----DEASHEKTGANILHRTGEMDPAA--- 395
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
R KL R+KR RP DDKV+ WNGL I++ A +IL
Sbjct: 396 ----------FAAEREKLLASRAKRVRPFRDDKVLADWNGLTIAALAITGRIL------- 438
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
D Y+E A AA FI +L + L H +R G + G LDDY ++
Sbjct: 439 ---------DEPRYIEAATKAADFILHNLRRDDGS-LLHRWRRGEAGIAGQLDDYTDMVW 488
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GL +LYE +WL A+ L + F EGGG++ D ++ R + DGA P
Sbjct: 489 GLTELYEATFDARWLKQALALNHIMLSRF-KAEGGGFYQVERSD-DLIARPMQGFDGALP 546
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGN+V++ NL+RL+ + + A DMA P +
Sbjct: 547 SGNAVAMHNLLRLSRLTGDAAL-------AKQAAAVAGHFSDMAEQAPSGLLHLLSAELL 599
Query: 620 SR---KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ K VVLVG +SS MLA H Y N V+ D A TEE+ A
Sbjct: 600 AESPGKEVVLVGDRSSAGAGAMLAVLHERYRPNTVVLWHD-AQTEEL----------APF 648
Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
R + KV VC+N+ C P P + LL
Sbjct: 649 TRGQKAVQGKVTVYVCENYRCKLPSNAPAVVRELL 683
>gi|256419531|ref|YP_003120184.1| hypothetical protein Cpin_0485 [Chitinophaga pinensis DSM 2588]
gi|256034439|gb|ACU57983.1| protein of unknown function DUF255 [Chitinophaga pinensis DSM 2588]
Length = 680
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 215/563 (38%), Positives = 300/563 (53%), Gaps = 55/563 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE E A+++N+ F++IK+DREERPD+D +YM VQA+ G GGWPL+VF
Sbjct: 49 CHWCHVMERESFEHEETARIMNEHFINIKIDREERPDLDHIYMDAVQAMTGSGGWPLNVF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P GGTYFPP + RP + +L + A+ ++R+ L + L + A
Sbjct: 109 LTPDKLPFYGGTYFPPVKAFNRPSWTDVLLALSQAFKERREDLETQAQNMRDHL---VQA 165
Query: 142 SASSNKLP--DELPQNALRLCAE------QLSKSYDSRFGGFGSAPKFPRPVEIQMML-Y 192
S S K P D +P L A+ + + D +GGFGSAPKFP IQ +L Y
Sbjct: 166 SGFSGKAPGQDLVPHEELFTKAQCETIFNNMMQQGDKVWGGFGSAPKFPGTFIIQYLLRY 225
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
H S + + L +L M +GGI+D +GGGF RYS D +W PHFEKMLY
Sbjct: 226 H--------HSFNEPKALEQALLSLDKMIRGGIYDQLGGGFARYSTDAKWLAPHFEKMLY 277
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D L +V +A+ LT + Y+ D L ++ R+M GG +SA DADS EG
Sbjct: 278 DNALLVDVLSEAYQLTGNELYARTIADTLGFVAREMTDAGGGFYSALDADS---EGV--- 331
Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FY W+ +E+E ILG A LF Y + GN ++ N+L +
Sbjct: 332 -EGKFYTWSKEEIEHILGTDAALFCAFYDVTEEGN------------WEETNILWVTKPA 378
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ A++ G+ E L R KL VR+KR RP LDDK+I+ WN L+I + +A
Sbjct: 379 AVFAAEQGITEEALERSLAISREKLMAVRAKRIRPGLDDKIILGWNALMIHACCKA---- 434
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+ +G +R Y E+ +A F HL + H+F+ G +K P FLD
Sbjct: 435 ----------YAALGIER--YREMGVNAMKFCLEHLQNTDKQSFFHTFKGGVAKYPAFLD 482
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYA+++ L+ L E +WL A EL F D G ++ T V++R KE
Sbjct: 483 DYAWMVRALIALQEVSGEPEWLSKAKELTEYVVNNFSDEGGIYFYYTEAGQTDVIVRKKE 542
Query: 553 DHDGAEPSGNSVSVINLVRLASI 575
+DGA PSGN+V NL+ L+ +
Sbjct: 543 VYDGATPSGNAVMAANLLYLSVV 565
>gi|448666501|ref|ZP_21685146.1| thioredoxin domain-containing protein [Haloarcula amylolytica JCM
13557]
gi|445771632|gb|EMA22688.1| thioredoxin domain-containing protein [Haloarcula amylolytica JCM
13557]
Length = 717
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 228/691 (32%), Positives = 351/691 (50%), Gaps = 56/691 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+E +A+ LN+ FV IKVDREERPD+D VYM+ Q + GGGGWPLS +
Sbjct: 58 CHWCHVMEEESFENEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSEAL 139
L+P+ +P GTYFPPE+K G+PGF +L+++ D+W ++R+ + E + L
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLADSWADPEQREEMENRARQWTEAIESDL 177
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
A+ ++ P++ ++ ++ + D + GG+GS PKFP+ + +L +
Sbjct: 178 EATPAN---PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAYS 231
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D G+ + +V TL MA G++DHVGGGFHRY+ D++W VPHFEKMLYD ++
Sbjct: 232 DGGQQDHLN----VVQETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIP 287
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKEGAF 317
+L + Y+ + R+ ++++R++ P G FS DA+S E +EG F
Sbjct: 288 RAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESIPPEDPDGDSEEGLF 347
Query: 318 YVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
YVWT ++V D + + A +F CD +++P N F+G VL S
Sbjct: 348 YVWTPEQVHDAVDDETDADIF-----------CDYYGVTEPGN-FEGATVLAVRKPVSVL 395
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A + ++ L + F+ R +RPRP D+K++ WNGL+I + A + +L
Sbjct: 396 AEEYERSEDEITAGLQRALNETFEARKERPRPARDEKILAGWNGLMIRALAEGAIVLDD- 454
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
EY +VA A SF+R HL+DE RL +++G G+L+DYA
Sbjct: 455 ----------------EYADVAADALSFVREHLWDETEQRLNRRYKDGDVAIDGYLEDYA 498
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FL G L L+E L +A++L E F D + G F T S++ R +E D
Sbjct: 499 FLGRGALTLFEATGDVDHLAFAMDLGQAITEAFWDDDEGTLFFTPTGGESLVARPQELTD 558
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
+ PS V+V L+ L+ S D + + AE L R+ + + A D
Sbjct: 559 QSTPSSTGVAVDLLLSLSHF---SDDDRFEEVAERVLRTHADRVSSNPLQHASLTLATDT 615
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW----EEHNS 671
+ + + LVG +S D+ + A + + ++ PAD + W E +
Sbjct: 616 YEQGALE-LTLVGDQS--DYPSEWTETLAERYVPRRLLAHRPADEGRFEQWLDALELDEA 672
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
R D V C+NF+CSPP D
Sbjct: 673 PPIWAGREPVDGDPTV-YACRNFACSPPKHD 702
>gi|126180264|ref|YP_001048229.1| hypothetical protein Memar_2324 [Methanoculleus marisnigri JR1]
gi|125863058|gb|ABN58247.1| protein of unknown function DUF255 [Methanoculleus marisnigri JR1]
Length = 721
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 244/714 (34%), Positives = 357/714 (50%), Gaps = 55/714 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL + CHWCHVME ESF D+ VAKLLND FV IKVDREERPD
Sbjct: 47 GEEAFSRAREEGKPIFLSIGYSACHWCHVMEEESFADQQVAKLLNDVFVCIKVDREERPD 106
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D+VYM AL G GGWPL++ ++ D KP +Y P E +YG G ++ ++ W
Sbjct: 107 IDQVYMAAAHALTGAGGWPLTILMTADKKPFFAASYIPKESRYGMTGLLDLIPRISKVWQ 166
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+R L +G +Q+ +AL ++A + EL + L + +D GGFG A
Sbjct: 167 TQRQGLENAG----DQVLQALQSAARTPPEEGELAEAVLDEAYNMFFRVFDGENGGFGDA 222
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
P+FP P + +L + + TGK MV TL M +GGI D VG GFHRYS
Sbjct: 223 PRFPTPHNLIFLLRYGNR---TGK----EPAYTMVEKTLHAMRRGGIFDQVGYGFHRYST 275
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W VPHFEKMLYDQ L Y +A+ T ++ R+ + Y+ R+M P G +SA
Sbjct: 276 DAEWFVPHFEKMLYDQALLVMAYTEAYLATGREEFARTARETIAYVLREMTDPDGGFYSA 335
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPH 357
EDADS EG +EG FY+WT E+ +LGE F + + GN P
Sbjct: 336 EDADS---EG----EEGKFYLWTKDEILGVLGEEDGERFSRIFNVTEPGNY----REQPG 384
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
+ G+N+L ++ A + P + + E R+KL R +R RP DDK++ W
Sbjct: 385 GKRTGRNILRLRRPLASWAHEFETPEDDLAWSVEEGRQKLLAARKQRVRPGRDDKILTDW 444
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
N L+I++ A+A++ D +Y+ AE AA+F+ +L E RL
Sbjct: 445 NALMIAALAKAARAF----------------DEPDYLAAAERAAAFVLANLRREDG-RLL 487
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
H +R G + LDDYAF+I L+++YE +L A++L + D GG+F
Sbjct: 488 HRYRGGEAGLAATLDDYAFMIWALIEVYEASFAPGYLKTAVDLSRDLIARYWDCNEGGFF 547
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+D V +R K +DGA PSGNSV++ L L + A + + + AE VF
Sbjct: 548 FVP-DDGDVPVRQKPVYDGAIPSGNSVAMYALFVLGRMTANLELE---ETAERIRRVFAG 603
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
+ + A + + P+ + V++ G + D M+ A + Y + +I P
Sbjct: 604 TVSESPTACSHFLTGLEFMLGPNFE-VIISGVPDAEDTRAMIGAIRSHYAPDAVII-FRP 661
Query: 658 ADTEEMDFWEEHNSNNASMARN-NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+D EE + E A R+ +K A VC N++C P TDP + L+
Sbjct: 662 SDEEEPEIVE-----VAGFTRDIVMIEEKATAYVCTNYACDIPTTDPDEMVRLV 710
>gi|226291405|gb|EEH46833.1| DUF255 domain-containing protein [Paracoccidioides brasiliensis
Pb18]
Length = 804
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 234/585 (40%), Positives = 323/585 (55%), Gaps = 40/585 (6%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CHVME ESF +A +LN F+ IK+DREERPD+D+VYM YVQA G GGWPL+VFL+P
Sbjct: 67 CHVMEKESFMSPEIAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTP 126
Query: 85 DLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
DL+P+ GG+Y+P P G+ F IL K++D W ++ +S +QL
Sbjct: 127 DLEPVFGGSYWPGPHSNALPTLGGEGQITFVDILEKLRDVWHTQQLRCRESAKDITKQLR 186
Query: 137 EALSASASSNKLPD-----ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
E + + +K D +L L + + YD+ GGF APKFP PV + ++
Sbjct: 187 E-FAEEGTHSKQSDVEAEEDLEIELLEEAYQHFASRYDAVNGGFSEAPKFPTPVNLSFLV 245
Query: 192 YHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
+ S+ + D E S ++ + TL M++GGIHD +G GF RYSV W +PHFE
Sbjct: 246 HLSRYPGAVADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGHGFARYSVTADWSLPHFE 305
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETE 307
KMLYDQ QL +VY+DAF D DI Y+ M+ P G S+EDADS +
Sbjct: 306 KMLYDQAQLLDVYVDAFDSAYDPELLGAMYDIATYITSPPMLSPTGGFHSSEDADSRPSP 365
Query: 308 GATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
T K+EGAFYVWT KE++ ILG+ A + H+ + GN +SR++DPH+EF +NVL
Sbjct: 366 NDTEKREGAFYVWTLKELKQILGQRDADVCARHWGVLADGN--VSRINDPHDEFINQNVL 423
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSF 425
S A + G+ ++ + I+ R KL + R SKR RP LDDK+IV+WNGL I +
Sbjct: 424 SIQVTPSKLAKEFGLGEDEVVRIIKGSREKLREYRESKRVRPDLDDKIIVAWNGLAIGAL 483
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP- 484
A+ S +L++ + F AE A FI+ +L+DEQT +L +R G
Sbjct: 484 AKCSVVLENLDRDKAYQF----------RRAAEEAVRFIKHNLFDEQTGQLWRIYRGGVR 533
Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ---NTQDELFLDREGGGY----F 537
PGF DDYA+LISGL++LYE L +A +LQ T LF +
Sbjct: 534 GDTPGFADDYAYLISGLINLYEATFDDSHLQFAEQLQRYYTTPSTLFYSPSSSDFSTPTS 593
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD 582
T P LLR+K D A PS N V NL+RL++++ G D
Sbjct: 594 PNTPTLPPPLLRLKPGTDAATPSPNGVIARNLLRLSALLDGGDVD 638
>gi|240276138|gb|EER39650.1| DUF255 domain-containing protein [Ajellomyces capsulatus H143]
gi|325089996|gb|EGC43306.1| DUF255 domain-containing protein [Ajellomyces capsulatus H88]
Length = 766
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 232/615 (37%), Positives = 320/615 (52%), Gaps = 76/615 (12%)
Query: 11 KTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 67
K R FL + CHWCHVME ESF VA +LN F+ IK+DREERPD+D VYM YV
Sbjct: 57 KLNRMIFLSIGYSACHWCHVMEKESFMSPEVAAILNKAFIPIKLDREERPDIDDVYMNYV 116
Query: 68 QALYGGGGWPLSVFLSPDLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDK 119
QA G GGWPL+VFL+PDL+P+ GGTY+P P G+ F IL K++D W
Sbjct: 117 QATTGSGGWPLNVFLTPDLEPVFGGTYWPGPHSSASSTLGGEGQVTFIDILEKLRDVWQT 176
Query: 120 K--------RDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSR 171
+ +D+ Q FA E S + + + +L L + + YD
Sbjct: 177 QQLRCRESAKDITRQLQEFAEEGTYSKQSGAGADGEE--DLEVELLEEAYKHFASRYDPV 234
Query: 172 FGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
GGF APKFP P + ++ S+ + D E + +M + TL +++GGIHDH
Sbjct: 235 NGGFSRAPKFPTPANLSFLVNLSRFSNAVADIVGYEECAHALEMAIKTLISISRGGIHDH 294
Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-D 287
+G GF RYSV W +PHFEKMLYDQ QL VY DAF D DI Y+
Sbjct: 295 IGHGFARYSVTADWSLPHFEKMLYDQAQLLRVYTDAFDSAHDPELLGAMYDIAAYITSPP 354
Query: 288 MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTG 346
++ P S+EDADS T T K+EGAFYVWT KE + ILG+ A + H+ + P G
Sbjct: 355 VLSPTSGFHSSEDADSLPTPSDTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDG 414
Query: 347 NCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRP 405
N + R++DPH+EF +NVL A + G+ E+ + I+ KL + R SKR
Sbjct: 415 NVE--RVNDPHDEFINQNVLHIQTTPGKLAKEFGLSEEEVVRIIKASTEKLREYRESKRV 472
Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
RP LDDK+IV+WNGL I + A+ S +L + V +E+ AE+AA FIR
Sbjct: 473 RPALDDKIIVAWNGLAIGALAKCSVVLDN----------VDRIKAQEFRLAAENAAKFIR 522
Query: 466 RHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
+ L+D + +L +R PGF DDYA+LISGL+DLYE +L +A +LQ+
Sbjct: 523 QSLFDPASGQLWRIYRGEERGDTPGFADDYAYLISGLIDLYEATFDDSYLQFAEQLQH-- 580
Query: 525 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
+ PS N V NL+RL++++ + D Y
Sbjct: 581 -------------------------------ASTPSPNGVIARNLLRLSTLL---EDDTY 606
Query: 585 RQNAEHSLAVFETRL 599
R+ A +++ F +
Sbjct: 607 RRLARDTVSAFAVEI 621
>gi|330508169|ref|YP_004384597.1| hypothetical protein MCON_2284 [Methanosaeta concilii GP6]
gi|328928977|gb|AEB68779.1| protein of unknown function (DUF255) [Methanosaeta concilii GP6]
Length = 710
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 259/726 (35%), Positives = 359/726 (49%), Gaps = 78/726 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVM ESFED VA+LLN F+ IKVDREERPD
Sbjct: 43 GEEAFEAARREDKPIFLSVGYSTCHWCHVMAHESFEDPNVARLLNQSFICIKVDREERPD 102
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D++YM A+ G GGWPL+V ++PD KP TY P + G G ++ +VK+ WD
Sbjct: 103 IDQIYMAAAIAVSGRGGWPLTVMMTPDKKPFFAATYIPKKGHMGLTGLMELIAQVKEMWD 162
Query: 119 KKRDMLAQSGAFAIEQLSEALS---ASASSNKLPDELP-----QNALRLCAEQLSKSYDS 170
R+ L S ++ L S A D L + L LS YD
Sbjct: 163 NDRESLMSSANIIVDHLKGRQSGRGAGVQKEAHKDSLSGSPFDSSLLSRGYSALSSIYDP 222
Query: 171 RFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
GGFG+APKFP P I +L K+ ++ +M TLQ M GGI+DHVG
Sbjct: 223 ENGGFGTAPKFPTPHHILFLLRCWKRTKNILP-------LEMAKTTLQGMRMGGIYDHVG 275
Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 290
GFHRYS D W VPHFEKMLYDQ LA Y +A+ T + Y+ R+IL+Y+ RDM
Sbjct: 276 FGFHRYSTDPEWFVPHFEKMLYDQALLAMAYAEAYQATGEEEYAQTVREILEYILRDMTS 335
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCD 349
P G +SAEDADS EG +EG FY WT+ E+++ LGE L + + +GN +
Sbjct: 336 PEGGFYSAEDADS---EG----EEGKFYTWTAVELKESLGEEDFRLLIRLFDVYESGNYE 388
Query: 350 LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL 409
R N+L + + S +AS L +P E+ + + +L+ R KR P
Sbjct: 389 GER-----------NILRQRSSFSDAASVLKIPEEELYHRSSDMISRLYLAREKRVHPLK 437
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
DDK++ WNGL+I++ ARA+ L+ + A AA F+ +
Sbjct: 438 DDKILTDWNGLMIAALARAAGALQD----------------PDLATAASRAADFLLEVMR 481
Query: 470 DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 529
+ RL H +R G + LDDYAFLI GL++LYE K+L A+ L D+ F
Sbjct: 482 TPEG-RLMHRYRQG-ADIQANLDDYAFLIWGLIELYEATFDVKYLKAAVHLNEIMDKHFW 539
Query: 530 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
D E GG+F T + +L+R KE +DGA PSGNS++++NL+RL + + + E
Sbjct: 540 DGEAGGFFFTADDGEELLVRKKEYYDGALPSGNSIALLNLLRLLHLTGDT-------SLE 592
Query: 590 HSLAVFETRLKDMAMAVPL----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHAS 645
A+ A PL + CA D P+ + V LVG + MLAA
Sbjct: 593 EKAALLARSALPAVSAQPLGYTMLLCALDYALGPTYE-VALVGSLEDGGLKEMLAAIRIR 651
Query: 646 YDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPI 704
+ NK V+ ++ + A R+ K A VC + C P T+
Sbjct: 652 FLPNKAVVLASGSEIVML----------APFTRDLVPVKGKAAAYVCSDHVCQLPATNAA 701
Query: 705 SLENLL 710
L LL
Sbjct: 702 ELMALL 707
>gi|410941737|ref|ZP_11373531.1| PF03190 family protein [Leptospira noguchii str. 2006001870]
gi|410783286|gb|EKR72283.1| PF03190 family protein [Leptospira noguchii str. 2006001870]
Length = 698
Score = 362 bits (928), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 242/699 (34%), Positives = 357/699 (51%), Gaps = 75/699 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + + GGWPL++
Sbjct: 63 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHEMEQQGGWPLNM 122
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ KP+ GGTYFPPE KYGR GF +L ++ W +KR L + + +LS+ L
Sbjct: 123 FLTPEGKPITGGTYFPPESKYGRKGFLEVLNIIQKVWTEKRSELIAAAS----ELSQYLK 178
Query: 141 ASASSNKLPDE---LPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMMLYHSK 195
SA S E N YDS+FGGF + KFP + + +L +
Sbjct: 179 DSAESKSRAQETDFTSANCFDSGFLLYENYYDSQFGGFKTNQVNKFPPNMGLGFLLRYY- 237
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
S + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 238 ------LSSKNPRALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNS 291
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
+ + ++K + DI+ YL RDM GG I SAEDADS EG +EG
Sbjct: 292 LFLEILAEYSLVSKKISAESFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEG 344
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY+W +E ++ GE + L ++ + + GN F+GKN+L E N ++
Sbjct: 345 LFYIWDLEEFREVCGEDSFLLEKFWNVSKEGN------------FEGKNILHE-NFRGSN 391
Query: 376 ASKLGMPLEKYLNILGECRR---KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
++ E++ + G R KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 392 FTE-----EEFKQLDGALLRGKAKLLERRSKRIRPFRDDKILTSWNGLYIKALVKTG--- 443
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+ R++++++AE SFI ++L D + R+ FR G S G+ +
Sbjct: 444 -------------IAFQREDFLKLAEETYSFIEKNLIDSKG-RMLRRFREGESGILGYSN 489
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DY+ +I+ + L+E G G ++L A+ LF R G F TG D VLLR
Sbjct: 490 DYSEMIASSIVLFEAGRGIRYLRNAVLWMEEVIRLF--RSSAGVFFDTGIDGEVLLRRSV 547
Query: 553 D-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
D +DG EPS NS +L++L+ + G S+ Y + AE F L A++ P +
Sbjct: 548 DGYDGVEPSANSSLAHSLIKLSFL--GVNSERYLEIAESIFVYFRKELYSYALSYPYLLS 605
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
A S K +VL+ K+S +++ A+ + + + + ++ + EE
Sbjct: 606 AYWSYKHHS-KEIVLI-RKNSEAGKDLFASIRSRFLPDSVLAIVNEDELEEA-------R 656
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S+ S + VC+NFSC P+ + LE +
Sbjct: 657 KLSSLFDFKDSGGNALVYVCENFSCKLPIDNVSDLEKYM 695
>gi|392380898|ref|YP_005030094.1| conserved protein of unknown function; putative Thioredoxin and
glycosidase domains [Azospirillum brasilense Sp245]
gi|356875862|emb|CCC96610.1| conserved protein of unknown function; putative Thioredoxin and
glycosidase domains [Azospirillum brasilense Sp245]
Length = 672
Score = 362 bits (928), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 240/698 (34%), Positives = 348/698 (49%), Gaps = 80/698 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE+ +A L+N+ FV+IKVDREERPDVD++Y + + L GGWPL++
Sbjct: 50 ACHWCHVMAHESFENPEIAGLMNELFVNIKVDREERPDVDQIYQSALAMLGQQGGWPLTM 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P GGTYFPP +YGRPGF +LR V + + K + + ++ + L +AL
Sbjct: 110 FLTPEAEPFWGGTYFPPASRYGRPGFPDVLRGVAETYRNKPENVTRN----VAALKDALG 165
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A N+ E+ L A++L + D GG G APKFP+ V I +L+ + T
Sbjct: 166 KLA-ENRAAGEVDLAMLDQIADRLVREVDPFHGGIGHAPKFPQ-VPIFTLLW--RAWLRT 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
GK ++ V TL M++GGI+DH+GGGF RYSVDE W VPHFEKMLYD QL ++
Sbjct: 222 GK----EPYREAVTNTLAHMSQGGIYDHLGGGFARYSVDEMWLVPHFEKMLYDNAQLLDL 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+ ++ + R+ + ++ R+MI GG + +DADS EG +EG FY+W
Sbjct: 278 MTLVWQAEREPLFETRIRETVGWVLREMIAEGGGFAATQDADS---EG----EEGLFYIW 330
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-----IELNDSSAS 375
+E++ +LG A +FK Y + P GN ++G +L IE D+
Sbjct: 331 NEEEIDRLLGPGAEVFKRAYGVTPQGN------------WEGATILNRLHRIEALDAETE 378
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ L E R L+ R KR +P DDKV+ WNGL+I++ A+A +
Sbjct: 379 AT------------LAEQRAILWREREKRIKPGWDDKVLADWNGLMIAALAQAGMVF--- 423
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
D ++ A+SA +F+R + ++ RL HS+R G K LDDYA
Sbjct: 424 -------------DEPAWIAAAQSAYAFVRDRMTEDG--RLLHSWRAGQLKHRATLDDYA 468
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ L L+E L A D F D + GGYF T + +++R K D
Sbjct: 469 HMARAALALHEATGDAGALEQARAWVRVLDAHFWDAQAGGYFYTADDADDLIVRTKSAGD 528
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A PSGN L LA++ + YR+ A+ A F L +P AA++
Sbjct: 529 AATPSGNGTM---LAVLATLHHRTGEAAYRERADALAAAFSGELSRNFFPLPTYLNAAEL 585
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L +V+VG + D L A L ++ + P T D H ++
Sbjct: 586 LQ--KALQIVIVGDPQASD-TAALRRAVLDRPLPDRILSVLPPGT---DLPAGHPAHGKG 639
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
M A VC +CSPPVT P +L L +
Sbjct: 640 M-----QGGVATAYVCTGMTCSPPVTTPDALAAALTRR 672
>gi|320334089|ref|YP_004170800.1| hypothetical protein [Deinococcus maricopensis DSM 21211]
gi|319755378|gb|ADV67135.1| hypothetical protein Deima_1486 [Deinococcus maricopensis DSM
21211]
Length = 674
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 250/711 (35%), Positives = 337/711 (47%), Gaps = 110/711 (15%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFED A +N+ FV++KVDRE+RPDVD VYM VQA+ G GGWP++V
Sbjct: 48 TCHWCHVMAHESFEDAQTAAFMNEHFVNVKVDREQRPDVDAVYMRAVQAMTGAGGWPMTV 107
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GTYFPP D YG P F+T+L V +AW +RD L A A+ + A+S
Sbjct: 108 FLAPDRRPFYAGTYFPPRDAYGMPSFRTVLASVANAWADRRDQL-LGNADALTEHVRAMS 166
Query: 141 A--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A A+ LP++ L + +++D+R GGFGSAPKFP P + +L
Sbjct: 167 APKPAADGALPEDFAPRGL----DNARRTFDARHGGFGSAPKFPAPTFLTYLLTQ----- 217
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+G+ M + TL M +GG+ D +GGGFHRYSVDERW VPHFEKMLYD QL
Sbjct: 218 --------PDGRDMAVRTLDAMMRGGLMDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLV 269
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL A +T + R L Y+ R+++ P G A+DAD EG EG F+
Sbjct: 270 RAYLRAHVVTGRADFLDTARATLAYMERELLTPEGGFACAQDADQ---EGI----EGKFF 322
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASAS 377
VWT +E D+LG A L HY + GN DPH+ F ++VL + D A
Sbjct: 323 VWTPQEFRDLLGADADLALRHYGVTDAGN-----FQDPHHPAFGRRSVLSVVTDVPELAR 377
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ + LG R LF R R P LDDKV+ SWNGL + +FA A ++
Sbjct: 378 AFSLGEDDVRARLGRARETLFSARRARAHPGLDDKVLTSWNGLALMAFADAYRL------ 431
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ Y++VA A F+R L L H++R + G L+D A
Sbjct: 432 ----------TGETHYLDVARRNADFVRARLTAPDGAPL-HAYR---ADVRGLLEDAALY 477
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR-VKEDHDG 556
GL+ LY + L WA L + D + G F ++G D L+ E D
Sbjct: 478 GLGLVALYAAAGNLEHLQWARALWDRARRDHWD-DAAGVFYSSGPDAEALVAPTTETFDA 536
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A S N+ A+ + G D Y E A R+ L A DML
Sbjct: 537 AIMSDNA---------AACLLGLHIDRY--FGEDEGARITARV--------LAGTANDML 577
Query: 617 SVPS------RKH---------VVLVGH-KSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
+ PS + H + L+G + FE LAA + + + PA+
Sbjct: 578 THPSGFGGLWQAHAHLHAPHVEIALLGTPEQRAPFERALAAQDLPF------VTVAPAER 631
Query: 661 -EEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ E N VA VC+NF+C P DP + L
Sbjct: 632 GGGLPLLEGREGNG-------------VAYVCRNFTCDLPARDPAAFTAQL 669
>gi|398348235|ref|ZP_10532938.1| hypothetical protein Lbro5_13624 [Leptospira broomii str. 5399]
Length = 669
Score = 361 bits (926), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 258/723 (35%), Positives = 358/723 (49%), Gaps = 78/723 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + + FL TCHWCHVME ESFEDE A +LN +FVSIKVDREERPD
Sbjct: 10 GKDAFLKAKEEDKMIFLSIGYATCHWCHVMEKESFEDEATAAVLNQYFVSIKVDREERPD 69
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD++YM + A+ GGWPL++FL+ + KP+ GGTYFPP KYGR F +L + + W
Sbjct: 70 VDRIYMDALHAMNQQGGWPLNMFLTSEGKPITGGTYFPPVAKYGRKSFVEVLNILANLWK 129
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQL--------SKSYDS 170
+K+ L A E+L++ L S S L + Q+A +L ++++ + YD
Sbjct: 130 EKKGELID----ASEELTQYLKESEESKALNE---QSAFQLPSKKVFENAFGMYDRFYDP 182
Query: 171 RFGGFGS--APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
F GF S KFP + + +L K +GE + +MV TL M KGGI+D
Sbjct: 183 EFAGFKSNVTNKFPPSMGLFFLLRFYK------STGE-PKALEMVEETLVAMRKGGIYDQ 235
Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDM 288
+GGG RYS D +W VPHFEKMLYD ++ F T V Y D+L+YL RDM
Sbjct: 236 IGGGISRYSTDHKWLVPHFEKMLYDNSLFLEALVECFQTTGHVKYKEAAYDVLEYLSRDM 295
Query: 289 IGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNC 348
GG I SAEDADS EG +EG FY+W E ++ G AIL +E + + GN
Sbjct: 296 RLQGGGIASAEDADS---EG----EEGLFYLWKRNEFHEVCGSDAILLEEFWNVTEIGN- 347
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
F+G N+L E + + A G+ E+ + I+ R+KL RS R RP
Sbjct: 348 -----------FEGSNILHE-SFRTNFARLHGLEQEELIEIVDRNRKKLLARRSDRIRPL 395
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
DDKV++SWN L + + +A+ E + +AE FI +L
Sbjct: 396 RDDKVLLSWNCLYVKAATKAAMAFGD----------------GELLRLAEETFRFIENNL 439
Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
E RL FR+G ++ + DYA I L L++ G G ++L AI + +D +
Sbjct: 440 VREDG-RLLRRFRDGEARFLAYSGDYAEFILASLWLFQAGKGIRYLTLAI--RYAEDAVR 496
Query: 529 LDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
L R G F TG D LLR D +DG EPS NS L+ + G +SD Y
Sbjct: 497 LFRSPAGVFFDTGSDADDLLRRNVDGYDGVEPSANSSFAFAFTILSRL--GVESDKYSDF 554
Query: 588 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 647
A+ + F+ L+ M P M A + + S++ V+ + + D + A +
Sbjct: 555 ADAIFSYFKVELETHPMNYPYMLSAYWLKNSASKELAVV--YSTQEDLFPVWQGIGAMF- 611
Query: 648 LNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
L +TV D E E + RN S V A CQ F C PV+D ISL
Sbjct: 612 LPETVFAW-ATDKE-----AEEVGEKILLLRNRVSGGSVKAYYCQGFQCDLPVSDWISLR 665
Query: 708 NLL 710
L
Sbjct: 666 EKL 668
>gi|219852761|ref|YP_002467193.1| hypothetical protein Mpal_2172 [Methanosphaerula palustris E1-9c]
gi|219547020|gb|ACL17470.1| protein of unknown function DUF255 [Methanosphaerula palustris
E1-9c]
Length = 714
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 242/692 (34%), Positives = 341/692 (49%), Gaps = 64/692 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESF D VA LLND++++IKVDREERPD+D+VYM Q + G GGWPL++
Sbjct: 74 TCHWCHVMAEESFMDLKVAALLNDYYIAIKVDREERPDIDQVYMAVCQMMTGSGGWPLTI 133
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD +P TY P ++ G +L V W +K L + +E L +
Sbjct: 134 IMTPDRRPFFAATYIPKMSRFRGTGMLDLLPMVAQVWREKPGDLIEVATQVVEALHQPAR 193
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A A D L L A ++D GGFG APKFP P + +L + +
Sbjct: 194 AGAGPEPTIDLLIAGYRGLAA-----TFDPVRGGFGDAPKFPAPHNLLFLLRYWR----- 243
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+SGE MV TLQ M GGI+DH+ GGFHRYS D W VPHFEKMLYDQ L
Sbjct: 244 -RSGEPV-ALAMVEQTLQAMRHGGIYDHLAGGFHRYSTDGGWKVPHFEKMLYDQAMLVMA 301
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +AF T + Y + Y+ RD++ G +A+DADS EG +EG +Y+W
Sbjct: 302 YTEAFLATGNREYRKTAEATIQYVLRDLVTREGGFAAAQDADS---EG----EEGRYYLW 354
Query: 321 TSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASK 378
T EV +L + A F Y + GN +DP N + G+NVL D+
Sbjct: 355 TLAEVRGLLTQDEAATFTTAYQMTERGN-----FTDPSNPKLTGRNVLYRSPDA------ 403
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
PL+ L KL R +R P DDKV+ WNGL+I++ ARA +
Sbjct: 404 ---PLQDPDLHLVAADAKLAAARRERVPPLTDDKVLTGWNGLMIAALARAGRAFGV---- 456
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+Y++VA AA F+ + D Q RL H +R+G G +DYA LI
Sbjct: 457 ------------ADYIDVAGRAADFLLGTMRD-QGGRLLHRYRDGEVAISGQAEDYAALI 503
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLLDLY+ ++L A+E+ D GGG+F+ + +++R KE +DGA
Sbjct: 504 WGLLDLYQATFTVRYLADAVEVMKEFTARCWDPAGGGFFSAAEDATDLIVRQKEQYDGAM 563
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PS NSV+ ++L+ LA + + Y + AE L F T + + + + A ++
Sbjct: 564 PSANSVAFMDLLLLARL---TGEPAYEEQAEE-LGRFMTGVVEQSPLIATFFLAGLDFAL 619
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
+ VV+VG + +VD M+ A + L T + PA D ASM R
Sbjct: 620 GPAQEVVIVGDEGAVDTTAMVRALAERF-LPSTTVQFKPAAAGAEDL-TTVAPFTASMER 677
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + VC SC+PP + +E +L
Sbjct: 678 KD---GRATVYVCSGQSCAPPA---VGVEAML 703
>gi|392955811|ref|ZP_10321341.1| hypothetical protein A374_03694 [Bacillus macauensis ZFHKF-1]
gi|391878053|gb|EIT86643.1| hypothetical protein A374_03694 [Bacillus macauensis ZFHKF-1]
Length = 679
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 239/709 (33%), Positives = 348/709 (49%), Gaps = 83/709 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + ++ FL +TCHWCHVM+ ESF+D VA LLN+ FV+IKVDREERPD
Sbjct: 28 GEEAFEKARREKKPVFLSIGYSTCHWCHVMKKESFDDHEVAALLNERFVAIKVDREERPD 87
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D+VYM Q L G GGWPL+VFL+ D +P G YFP ED+YG PGFK+++ ++ + +
Sbjct: 88 LDQVYMAVCQGLTGQGGWPLNVFLTADQRPFYAGVYFPKEDRYGSPGFKSVITQLSEKYT 147
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++ + + ++L+E+L P L + L C QL + +DS +GGF A
Sbjct: 148 ERHEEIHDYS----KRLTESLQRKMKQE--PTALQETILHTCFNQLGQMFDSIYGGFSQA 201
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + +L + G+ +MV TL MA GGI+D +G GF RY+V
Sbjct: 202 PKFPAPTILTYLLRY-------GQWQGNDLALQMVERTLDAMADGGIYDQIGYGFSRYAV 254
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D+ W VPHFEKMLYD L Y++A+ +TK Y I +I+ Y+ M G + A
Sbjct: 255 DQMWLVPHFEKMLYDNALLLIAYVEAYQVTKKPRYQQIAAEIIQYVTTVMRDEQGGFYCA 314
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADS EG +EG +YV++ E+E L + + + C L ++D N
Sbjct: 315 EDADS---EG----EEGKYYVFSKTEIERQLPQE----------QASAFCALYDITDEGN 357
Query: 359 EFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
F+G NV LI A LG+ EK ++ + R+ L+ R R PH DDK++ S
Sbjct: 358 -FEGNNVPNLIHQRKERI-AQTLGITEEKLSTLVEQARQTLYRYRETRIPPHKDDKILTS 415
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WN L+I A+A+ D Y E A+SA SFI + L R+
Sbjct: 416 WNALMIVGLAKAA----------------AAWDEPAYREHAKSALSFIEKELVIHD--RV 457
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+R G + GF+DDYAFL L++YE +++ A L LF D GG+
Sbjct: 458 MVRYREGDVQGKGFIDDYAFLAWAYLEMYEATFDDRYISKAQTLTQDMLSLFWDESHGGF 517
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
+ + +++ KE +DGA PSGN V+ L +L + A + Y + E VF
Sbjct: 518 YYAGNDAEQLIVTGKEAYDGAMPSGNGVAAYVLWKLGKLTADPQ---YDEKLEALFDVFS 574
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HI 655
+ L + ML+ VVLV + V A + L KT + H+
Sbjct: 575 SDLSHYPTGHTQLLQVW-MLTQMKTAEVVLVAEQEQV--------ASSLRTLQKTFLPHV 625
Query: 656 -----DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPP 699
DP + + S + + + VC+NF C P
Sbjct: 626 VWFLQDPRE----------RAAFTSFQLVDRTKKHPMIYVCENFHCQRP 664
>gi|421098293|ref|ZP_15558964.1| PF03190 family protein [Leptospira borgpetersenii str. 200901122]
gi|410798561|gb|EKS00650.1| PF03190 family protein [Leptospira borgpetersenii str. 200901122]
Length = 691
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 243/690 (35%), Positives = 350/690 (50%), Gaps = 72/690 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ VA LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNI 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD KP+ GGTYFPPE YGR F +L ++ W++KR L + + +LS+ L
Sbjct: 115 FLTPDGKPITGGTYFPPEPMYGRKSFLEVLNILRKVWNEKRQELIAASS----ELSQYLK 170
Query: 141 ASASSNKLPDE----LPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML-YH 193
S + + +N YD+ FGGF + KFP + + +L YH
Sbjct: 171 DSGERRTIEKQEGGLSSENCFDSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYH 230
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+S +MV TL M +GGI+D VGGG RYS D W VPHFEKMLYD
Sbjct: 231 --------RSSGNPRALEMVENTLLAMKQGGIYDQVGGGLCRYSTDFYWMVPHFEKMLYD 282
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
++ ++K + D++ YL RDM G I SAEDADS EG K
Sbjct: 283 NSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVDGGICSAEDADS---EG----K 335
Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
EG FY+W +E ++ GE + + ++ + + GN F+GKN+L E
Sbjct: 336 EGLFYIWGLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILYE--SYR 381
Query: 374 ASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ A+KL K ++ +L R KL + R+KR RP DDK++ SWNGL I + +A
Sbjct: 382 SEATKLSEEEWKQIDSVLERGRAKLLERRNKRVRPLRDDKILTSWNGLYIKALTKAG--- 438
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
V R++++ +AE SFI R+L D + R+ FR+G S G+ +
Sbjct: 439 -------------VAFQREDFLRLAEETYSFIERNLID-PSGRMLRRFRDGESGILGYSN 484
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYA +I+ + L+E G G ++L A+ LF R G F G D VLLR
Sbjct: 485 DYAEMITSSIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDAGSDGEVLLRRSV 542
Query: 553 D-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
D +DG EPS NS +LV+L+ + G S YR+ AE F L +++ P +
Sbjct: 543 DGYDGVEPSANSSLAYSLVKLS--LFGIDSVRYRKFAESIFLYFTKELSTNSLSYPHLLS 600
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
A S K +VL+ K S +++LA + + I+ + EE
Sbjct: 601 AYWTYRHHS-KEIVLI-RKDSDSGKDLLAEIQTKFLPDSVFAVINEDELEEA-------R 651
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVT 701
+++ + S + +C+NFSC PV+
Sbjct: 652 KLSTLFDSRDSGGNALVYICENFSCKLPVS 681
>gi|239627004|ref|ZP_04670035.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
gi|239517150|gb|EEQ57016.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
Length = 638
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 213/560 (38%), Positives = 298/560 (53%), Gaps = 63/560 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+EG+A +LN ++ IKVDREERPDVD VYM+ QA+ G GGWPL+
Sbjct: 7 STCHWCHVMERESFENEGIAGILNRDYICIKVDREERPDVDSVYMSVCQAMNGQGGWPLT 66
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD +P GTYFPP+ +YGR G + +L V W R+ L + GA IE +
Sbjct: 67 IIMTPDCRPFFSGTYFPPKARYGRVGLEELLAAVSAQWKGGRERLLE-GAGRIEAFLKEQ 125
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S + E+ A RL +D + GGFG APKFP P I ++ + +
Sbjct: 126 EQADVSAEPGLEVVHRAFRL----FGDGFDKKNGGFGQAPKFPTPHNIMFLMEYGVRENK 181
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G M + TL M +GGI DH+GGGF RYS DE+W VPHFEKMLYD LA
Sbjct: 182 PGAV-------DMAMDTLVQMYRGGIFDHIGGGFSRYSTDEQWLVPHFEKMLYDNALLAM 234
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y A+ LT Y+ + + IL Y+ ++ G + +DADS EG +YV
Sbjct: 235 AYAKAYGLTGRGLYARVVQRILGYVEAELTHASGGFYCGQDADSDGV-------EGRYYV 287
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL-NDSSASAS 377
+T +E++ +LG E F + + GN F+GKN+ L N+ +A
Sbjct: 288 FTPEEIKQVLGPEDGADFCSQFGITGIGN------------FEGKNIPNLLGNEDYETAG 335
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K RRKL++ R +R H DDK++VSWNG +I + A A +L +
Sbjct: 336 KEA------------SRRKLYEYRIRRAHLHKDDKILVSWNGWMICACAMAGAVLGA--- 380
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+Y+++A A +FIR HL + RL +R+G + G LDDYA
Sbjct: 381 -------------GQYVDMAVRAEAFIRTHLVKD--GRLLVRYRDGDAAGQGKLDDYACY 425
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LL+LYE GT +L A+ T F DRE GG++ + +++R KE +DGA
Sbjct: 426 VLALLELYEVTFGTGYLEQAVYWAKTMVLQFFDRERGGFYLYAEDGEQLIVRTKEAYDGA 485
Query: 558 EPSGNSVSVINLVRLASIVA 577
PSGNS + L +LA I
Sbjct: 486 VPSGNSAAARVLQQLAQITG 505
>gi|448435859|ref|ZP_21586927.1| hypothetical protein C472_11724 [Halorubrum tebenquichense DSM
14210]
gi|445683294|gb|ELZ35694.1| hypothetical protein C472_11724 [Halorubrum tebenquichense DSM
14210]
Length = 739
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 241/719 (33%), Positives = 350/719 (48%), Gaps = 83/719 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA ++ND FV IKVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAGVINDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLS 136
+ +P+ KP GTYFPPE + +PGF+ + ++ D+W +++ +M ++ +A
Sbjct: 113 AWCTPEGKPFYVGTYFPPEARQNQPGFRDLCERIADSWSDPEQREEMKRRADQWAESARD 172
Query: 137 EALSASASSNKLP----DELPQNA--LRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
E S P D P L A +SYD +GGFGS KFP P I +
Sbjct: 173 ELESVPTPDAPGPDGEGDASPPGGDLLESAAASALRSYDDEYGGFGSGGAKFPMPGRIDL 232
Query: 190 MLY-HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
++ +++ D S A TL M++GG++D +GGGFHRY+VD W VPHFE
Sbjct: 233 LMRAYARSGRDALLSAAAG--------TLDGMSRGGMYDQIGGGFHRYAVDREWTVPHFE 284
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYD +L YLD + L D Y+ + + L +L R++ G FS DA S E
Sbjct: 285 KMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHDDGGFFSTLDARSRPPE- 343
Query: 309 ATRKK---------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHN 358
+R+ EGAFYVWT +EV+ +L E A L E Y ++ GN +
Sbjct: 344 -SRRDDDGHEAGDVEGAFYVWTPEEVDAVLDEPAASLAAERYGIRSGGNFE--------- 393
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
+G V A+ + E L E R LFD R RPRP D+KV+ SWN
Sbjct: 394 --RGTTVPTTAASVEELAADRDLSPEAVRQALTEARTALFDARESRPRPARDEKVLASWN 451
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRL 476
G IS+FA A+ L + Y ++A A F R LY D +T L
Sbjct: 452 GRAISAFADAAGTLG-----------------EPYADIAREALGFCRDRLYDADAETGAL 494
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+ +G + PG+LDDYAFL G LD Y + L +A+EL + F D + G
Sbjct: 495 ARRWLDGDVRGPGYLDDYAFLARGALDTYAATGDLEPLGFALELAEALVDEFYDADDGTI 554
Query: 537 FNT---------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YYRQ 586
+ T T + ++ R +E D + PS V+ L +++ G ++D +R+
Sbjct: 555 YFTRDPEGDGGQTDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGRFRE 610
Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 646
A + R++ +A + AAD++ V + + ++ L +
Sbjct: 611 IARRVVTTHADRIRGGPLAHASLVRAADLVET-GGVEVTIAADEVPDEWRETLGERY--- 666
Query: 647 DLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVTD 702
L ++ PA +D W + + A + + D+ A VCQ+F+CSPP TD
Sbjct: 667 -LPNALVAPRPATAAGLDEWLDRLDMAEAPPIWADRSATDDEPTAYVCQDFTCSPPRTD 724
>gi|320160551|ref|YP_004173775.1| hypothetical protein ANT_11410 [Anaerolinea thermophila UNI-1]
gi|319994404|dbj|BAJ63175.1| hypothetical protein ANT_11410 [Anaerolinea thermophila UNI-1]
Length = 684
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 243/689 (35%), Positives = 347/689 (50%), Gaps = 74/689 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED +A++LN FVSIKVDREERPDVD +YM V AL G GGWPLSV
Sbjct: 49 ACHWCHVMAHESFEDPQIAEILNQHFVSIKVDREERPDVDGIYMNAVIALTGQGGWPLSV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ KP GGTYFPP ++G P F+ +L AW+ RD L ++G EQL++ +
Sbjct: 109 FLTPEGKPFYGGTYFPPTPRHGLPAFRDVLHAALQAWENDRDDLFKAG----EQLAQHIH 164
Query: 141 ASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A +P L N L L SYD R+GG+G+AP+FP+P+ ++ +L + +
Sbjct: 165 AMNDWGSVPGLVLRANLLEQVTHALLASYDRRYGGWGNAPRFPQPMALEFLLLQVTRGNE 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ K V LQ M++GG++D +GGGF RYS D W VPHFEKMLYD Q+++
Sbjct: 225 --------DALKPVEHNLQVMSRGGLYDIIGGGFARYSTDNHWLVPHFEKMLYDNAQISS 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYL A L K+ ++ I LD+L +M P G FS+ DADS EG +EG FY+
Sbjct: 277 VYLHAGMLEKNPWFLRIATQTLDFLLEEMRHPLGGFFSSLDADS---EG----EEGKFYL 329
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLS--RMSDPHN-EFKGKNVLIELNDSSASA 376
W E+ I L+P G D S + P N F+GK +L D
Sbjct: 330 WDFDELRQI-------------LEPAGQWDFSCQVFNLPRNGNFEGKIILQIQEDWERLP 376
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
K G+ +L + R L+ RS R RP DDKVIVSWNG + + A A++ L
Sbjct: 377 EKTGLSETDFLKQMDTVRALLYQKRSLRVRPSTDDKVIVSWNGFALRALAEAARYL---- 432
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+R +Y+ A+ A F+ +LY + L ++R G + L+DYA
Sbjct: 433 ------------NRPDYLHAAQQNAHFLLENLYTPRG--LMRTWREGSPRQIALLEDYAS 478
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LI GLL LY+ W WA++L + D GG+++T + +++R K+ D
Sbjct: 479 LIIGLLALYQSDDNIVWYEWAVKLGEEMISRYRD-PAGGFYDTRDDQQDLIIRPKDFQDN 537
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A P GNS++ L+ L +G S Y Q A + + L A A D
Sbjct: 538 ATPCGNSLASYALLLLYEF-SGDDSIY--QLATRVFPLLQDSLVKYPTAFGFWLQAIDWA 594
Query: 617 SVPSRKHVVLVGHKSSVD---FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
PSR+ V L+ ++ + F+N+L + P ++ +
Sbjct: 595 MGPSRQ-VALLAPRTLEELQPFKNILWETYR------------PRLVCASSTFQPATNAP 641
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
A + + +V A +C+ F C P +D
Sbjct: 642 ALLQERSVLNGEVTAYLCEGFVCLQPTSD 670
>gi|262197654|ref|YP_003268863.1| hypothetical protein [Haliangium ochraceum DSM 14365]
gi|262081001|gb|ACY16970.1| protein of unknown function DUF255 [Haliangium ochraceum DSM 14365]
Length = 681
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 247/707 (34%), Positives = 364/707 (51%), Gaps = 86/707 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED +A ++N+ FV++K+DREERPDVD VYM +Q L GGGWPLS
Sbjct: 49 ACHWCHVMAHESFEDAEIAAVMNELFVNVKIDREERPDVDAVYMNALQILGEGGGWPLSA 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SE 137
F +PD KP GTYFPP+D+YGRPGF ++LR + ++ +RD + Q+ ++ L E
Sbjct: 109 FCTPDGKPYFLGTYFPPQDRYGRPGFASVLRTMAKVFEDQRDKVDQNTEAIVDGLRRVDE 168
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A S ++ L + L QL++ D + GG GS PKFP + L
Sbjct: 169 HFRRGALSGEV-GALRADLLITAGRQLAQRSDPQHGGLGSKPKFPSSTTHAL-------L 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
G+ + ++ L + MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD GQL
Sbjct: 221 ARAGRLAFGAPAREAFLKQARSMARGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNGQL 280
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+Y DA+++ +D ++ + + + +L +M P G +++++DADS EG +EG +
Sbjct: 281 LGIYGDAYAMDQDPAFARVIDETITWLEDEMQHPSGALYASQDADS---EG----EEGKY 333
Query: 318 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELND 371
YVWT +E+ +LG AI F+ Y + TGN + LSR+SDP + +D
Sbjct: 334 YVWTPEEIRAVLGPVDAIFFERAYGVSETGNFEHGTTVLSRVSDPGGD----------SD 383
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+A AS +L R +R P D KV+ WNGL + RA
Sbjct: 384 EAALASAR---------------ARLLAARKQRVAPETDTKVLAGWNGLAVRGAVRA--- 425
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
+ G+ R + +A A F+ H+ E RL F++G +K G L
Sbjct: 426 -----------WETTGNARA--LALAVRVAEFLAGHMLHEGGTRLWRVFKDGSTKLDGTL 472
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL-DREGGG-YFNTTGEDPSVLLR 549
DDYAF+ G L L E +W L +T E F +R+G G ++ T G+D ++ R
Sbjct: 473 DDYAFVAHGFLHLAEATGDARWWRHGAALIDTILERFYEERDGVGIFYMTPGDDTLLVHR 532
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+ + D A P+G SV+V L+RLA + ++ AE LA + + A +
Sbjct: 533 PESNSDHAIPAGASVAVACLLRLAQVAEDKRA---LDIAERYLAGRVPQAGENPFAFSRL 589
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A D+ VV+V D +LAAA Y + ++ PA E W
Sbjct: 590 LSALDLY---LHGQVVVVSAGEGAD--ELLAAARRVYAPARMLV---PALAES---W--- 635
Query: 670 NSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
+ ++ +A + +AD + A VC+ +CS PV+D +L LL P+
Sbjct: 636 -AADSLLAGKDAAADGRAQAYVCRGQTCSAPVSDAQALRELLTATPA 681
>gi|325283375|ref|YP_004255916.1| hypothetical protein Deipr_1147 [Deinococcus proteolyticus MRP]
gi|324315184|gb|ADY26299.1| hypothetical protein Deipr_1147 [Deinococcus proteolyticus MRP]
Length = 679
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 239/692 (34%), Positives = 347/692 (50%), Gaps = 89/692 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFE+E A L+N+ FV+IKVDREERPDVD +YM QA+ G GGWP++
Sbjct: 57 STCHWCHVMAHESFENEATAGLMNERFVNIKVDREERPDVDGIYMAATQAMTGQGGWPMT 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL +P GTY+PP + G P F+ ++ V DAW +R L ++ A A+ + +A+
Sbjct: 117 VFLDHQRRPFHAGTYYPPHEGLGLPSFRRVMTAVSDAWQNRRADL-EANAQALTEHIQAM 175
Query: 140 SA--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S SA + P EL Q L L L + +D GGFG APKFP P + +L
Sbjct: 176 SEPRSAGGQEWPAELLQAPLDL----LPQVFDPVHGGFGGAPKFPAPTTLDFLL------ 225
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
KSG+ +GQ+M L TL+ M +GGI+D +GGGFHRYSVD +W VPHFEKMLYD QL
Sbjct: 226 ----KSGD-EQGQQMALHTLRQMGRGGIYDQLGGGFHRYSVDAQWLVPHFEKMLYDNAQL 280
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
L A+ ++ D ++ R+ L YL R+M P G +SA+DAD+ EG T
Sbjct: 281 TRTLLAAYQVSGDPAFAEAARETLRYLEREMRHPSGSFYSAQDADTEGVEGLT------- 333
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+ WT E++ +LG E A Y + GN + DPH G+ ++
Sbjct: 334 FTWTPAELQAVLGAEDAEWLARFYGVTEGGNFE-----DPHRRDAGRRTVL--------- 379
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
S++G + + L E R +L R +RP+PH DDKV+ SWNGLV+++ A AS+IL
Sbjct: 380 SRVGELTPEQRSRLPELRARLLTAREERPQPHRDDKVLTSWNGLVLAALADASRILGE-- 437
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
++E+A A+++R + + L H++ +G + + G L+D+A
Sbjct: 438 --------------PHWLELARQNAAWVRETM-RQPDGTLWHTWLDGHAPSVEGLLEDHA 482
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
GL+ LY+ ++L WA EL F D G + ++ G+ ++L R D
Sbjct: 483 LYGLGLVALYQASGELEYLTWARELWTVVQRDFWDDAAGLFRSSGGKAEALLTRQSSAFD 542
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA--VFETRLKDMAMAVPLM---C 610
A S N+ + + + + YY +LA + L DM A M
Sbjct: 543 SAIISDNAAAALLALWI--------DRYYGDPQAQALAHRTVSSHLADMVQAPHGMGGLW 594
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
AA ML P + ++ S + L AA A + L + + PA T EH
Sbjct: 595 QAAAMLRAPHTELAII----GSAEERAPLEAAAARFLL--PYVALAPAPTPAGLPVLEHR 648
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ A +C N +C P D
Sbjct: 649 EGGGT------------AYLCVNRACQLPTQD 668
>gi|448321193|ref|ZP_21510673.1| hypothetical protein C491_09424 [Natronococcus amylolyticus DSM
10524]
gi|445604053|gb|ELY58004.1| hypothetical protein C491_09424 [Natronococcus amylolyticus DSM
10524]
Length = 724
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 219/600 (36%), Positives = 314/600 (52%), Gaps = 41/600 (6%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF DE VA LLN+ F+ IKVDREERPDVD +YMT Q + GGGGWPLS
Sbjct: 53 SACHWCHVMEEESFADEEVADLLNEEFIPIKVDREERPDVDSIYMTVCQLVSGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+L+P+ KP GTYFP K G+PGF +L + D+W+ R+ + + L
Sbjct: 113 AWLTPEGKPFYVGTYFPKRSKRGQPGFLDLLEGLADSWETDREEIESRADEWTAAARDQL 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
+ S + + L A+ +S D + GGFGS PKFP+P ++++ ++ +
Sbjct: 173 EETPDSIGAAEPPSSDVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL---ARAYD 229
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ E ++++ +L M +GG++DHVGGGFHRY VD W VPHFEKMLYD ++
Sbjct: 230 RTGR----DEYREVLEGSLTAMIEGGLYDHVGGGFHRYCVDADWTVPHFEKMLYDNAEIP 285
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
L + LT D Y+ R+ L+++ R++ G FS DA S + E R +EGAF+
Sbjct: 286 RALLAGYRLTGDERYAGYVRETLEFVSRELTHDEGGFFSTLDAQSEDPETGER-EEGAFF 344
Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT EV ++LG+ A LF Y + +GN F+G++ S A
Sbjct: 345 VWTPAEVREVLGDETDADLFCARYDITESGN------------FEGQSQPNLAASISELA 392
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ + + L R+KLF+ R +RPRP+ D+KV+ WNGL+IS+ A A+ L
Sbjct: 393 DRFDLEEREVEERLESARQKLFEAREERPRPNRDEKVLAGWNGLMISTCAEAALAL---- 448
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
G DR Y E+A A F+R L+D RL +++G G L+DYAF
Sbjct: 449 ----------GEDR--YAEMATDALEFVRDRLWDADEGRLSRRYKDGDVAVQGNLEDYAF 496
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L G L YE L +A+EL + F D E + T S++ R +E D
Sbjct: 497 LARGALGCYEATGEVDHLAFALELARGIEAEFYDAERETLYFTPESGESLVTRPQELTDQ 556
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
+ P+ V+V L+ L + D + A L RL+ A+ +C AAD L
Sbjct: 557 STPAAAGVAVETLLALEGFA--DEDDEFEGIAASVLGTHAGRLESNALQHVTLCLAADRL 614
>gi|448585374|ref|ZP_21647767.1| thioredoxin domain containing protein [Haloferax gibbonsii ATCC
33959]
gi|445726074|gb|ELZ77691.1| thioredoxin domain containing protein [Haloferax gibbonsii ATCC
33959]
Length = 709
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 231/693 (33%), Positives = 348/693 (50%), Gaps = 66/693 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+ KP GTYFPPE + G PGF+ ++ ++W RD + EQ + A+
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDRDEIENRA----EQWTSAI 168
Query: 140 SAS-ASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKK 196
+ + +P E P + L + + D GGFG PKFP+P I +L +
Sbjct: 169 TDRLEETPDVPGEAPGSDVLDSTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL---RG 225
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
TG+ E + +L MA GG+ DH+GGGFHRY VD W VPHFEKMLYDQ
Sbjct: 226 YAVTGR----REALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAG 281
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
LA+ YLDA LT + Y+ + + +++RR++ G F+ DA S +EG
Sbjct: 282 LASRYLDAARLTGNESYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GEEGT 334
Query: 317 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVWT +V D+L E A LF + Y + P GN F+ K ++ ++ ++A
Sbjct: 335 FYVWTPDDVRDLLPELDADLFCDRYGVTPGGN------------FERKTTVLNVSATTAE 382
Query: 376 -ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A + + + + L + R+ LF R R RP D+KV+ WNGL+IS+FA+ S +L+
Sbjct: 383 LAEEYELDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVVLED 442
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
++ + SD A A F+R L+D++T L NG K G+L+DY
Sbjct: 443 DS---------LASD-------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYLEDY 486
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
AFL G DLY+ L +A++L F D + G + T S++ R +E
Sbjct: 487 AFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPT 546
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D + PS V+ + L + + A+ L F R++ + + AA+
Sbjct: 547 DQSTPSSLGVATSLFLDLEQFAPDAD---FGGVADAVLGSFANRVRGSPLEHVSLALAAE 603
Query: 615 MLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHNS 671
+ VP + + + ++ LA+ + L V+ P EE+D W +E
Sbjct: 604 KAASGVP---ELTIAADEVPDEWRETLASRY----LPGLVVSRRPGTDEELDAWLDELGL 656
Query: 672 NNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
+ A A + + C+NF+CS P D
Sbjct: 657 DEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689
>gi|325262773|ref|ZP_08129509.1| dTMP kinase [Clostridium sp. D5]
gi|324031867|gb|EGB93146.1| dTMP kinase [Clostridium sp. D5]
Length = 668
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 240/720 (33%), Positives = 359/720 (49%), Gaps = 95/720 (13%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + R FL +TCHWCHVM ESFEDE VA++LN ++ IKVDREERPD
Sbjct: 28 GPEAFQKAKQEDRPVFLSIGYSTCHWCHVMAHESFEDEQVAEVLNSQYICIKVDREERPD 87
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D VYM+ QA+ G GGWPL+ L+P+ +P GTYFP +YG PG +L ++ W
Sbjct: 88 IDSVYMSACQAVTGAGGWPLTAILTPEQQPFFLGTYFPKHPRYGHPGLIELLEEIGSLWR 147
Query: 119 KKRDMLAQSGAFAIEQLSEALS-ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
+ R+ L ++G +Q++E +S +S +PD + L+ E + YDSR+GGFG
Sbjct: 148 ENRNKLIEAG----QQITEFISIPDHASGSIPD---KKGLKRAFELYRRQYDSRWGGFGK 200
Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGE-ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
APKFP P H+ E E +M TL MA GG++D +GGGF RY
Sbjct: 201 APKFPAP--------HNLLFLLHYSLLENEQEALEMAEHTLTAMAHGGMNDQIGGGFSRY 252
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
S DE+W VPHFEKMLYD LA YL+A+ + K Y+ R LDY+ R++ GP G+ +
Sbjct: 253 STDEKWLVPHFEKMLYDNALLAIAYLEAYHIKKRELYADTARRTLDYVLRELTGPSGQFY 312
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSD 355
+DADS EG EG +Y ++ +E+ +LG+ F Y + +GN
Sbjct: 313 CGQDADS---EGI----EGKYYFFSPEEIMSVLGDGDGEEFCRIYDITASGN-------- 357
Query: 356 PHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
F+G+++ LI ++ A + + ++++ R R H DDKV
Sbjct: 358 ----FEGRSIPNLIGQSELPWRADDIRL-------------NRIYNYRRNRTLLHRDDKV 400
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
I+SWN ++ + A+A++IL G R Y + A + FI+ H+ D+ +
Sbjct: 401 ILSWNSWMMIAMAKAAQIL--------------GDTR--YKDAAIAVHRFIQAHMTDD-S 443
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
RL H +R G + G LDDYA LL+LY +L A ELF DRE
Sbjct: 444 RRLYHRWREGEAAIEGQLDDYAVYGLALLELYRTAYEPVYLEEAAFFAGQMAELFEDREN 503
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
GGYF T + +++ R KE +DGA PSGNS + + L +LA + ++++ E +
Sbjct: 504 GGYFLTASDTEALITRPKETYDGAVPSGNSAAAVLLSQLAHYTC---TPFWQEALERQIN 560
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF--ENMLAAAHASYDLNKT 651
+ + A PS++ + + E +L LN++
Sbjct: 561 FLAGVVNEYPSGHSFGLQALMSALYPSQELICATSDNGMPEILKEYLLRVP----VLNRS 616
Query: 652 VIHIDPADTEEMD----FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLE 707
VI P + EE++ F +E+ + + +CQN C+ PV+D LE
Sbjct: 617 VILKTPENKEELEKAVPFLKEY----------PVPEEGAMFYLCQNGRCTAPVSDLRKLE 666
>gi|448570870|ref|ZP_21639381.1| thioredoxin domain containing protein [Haloferax lucentense DSM
14919]
gi|448595768|ref|ZP_21653215.1| thioredoxin domain containing protein [Haloferax alexandrinus JCM
10717]
gi|445722788|gb|ELZ74439.1| thioredoxin domain containing protein [Haloferax lucentense DSM
14919]
gi|445742222|gb|ELZ93717.1| thioredoxin domain containing protein [Haloferax alexandrinus JCM
10717]
Length = 703
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 239/708 (33%), Positives = 346/708 (48%), Gaps = 96/708 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+ KP GTYFPPE + G PGF+ ++ ++W RD + +++ L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDVVESFAESWRTDRDEIENRADQWTSAITDRL 172
Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 197
+ + P E P + L + + D GGFG PKFP+P I +L
Sbjct: 173 EETPDT---PGEAPGSDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL------ 223
Query: 198 EDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
G A G++ L +L MA GG+ DH+GGGFHRY VD W VPHFEKMLYD
Sbjct: 224 -----RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYD 278
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q LA+ YLDA LT + Y+ + + +++RR++ G F+ DA S +
Sbjct: 279 QAGLASRYLDAARLTGNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GE 331
Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT +V D+L E A LF + Y + P GN F+ K ++ ++ +
Sbjct: 332 EGTFYVWTPADVRDLLPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSAT 379
Query: 373 SAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+A A + + + + L + R+ LF R R RP D+KV+ WNGL+IS+FA+ S +
Sbjct: 380 TADLADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVV 439
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L+ ++ +A A A F+R L+D++T L NG K G+L
Sbjct: 440 LEDDSLAAD----------------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYL 483
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYAFL+ G DLY+ L +A++L F D + G + T S++ R +
Sbjct: 484 EDYAFLVRGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQ 543
Query: 552 EDHDGAEPSGNSVSVINLVRL------------ASIVAGSKSDYYRQNA-EH-SLAVFET 597
E D + PS V+ + L A V GS ++ R + EH SLA+
Sbjct: 544 EPTDQSTPSSLGVATSLFLDLKQFAPDAGFGEVADAVLGSFANRVRGSPLEHVSLALAAE 603
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
+ A VP + AAD VP L AS L V+ P
Sbjct: 604 K---AASGVPELTVAAD--EVPDEWRATL-----------------ASRYLPGLVVSRRP 641
Query: 658 ADTEEMDFW-EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
E+D W +E + A A + + C+NF+CS P D
Sbjct: 642 GTDAELDAWLDELGLDEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689
>gi|399574327|ref|ZP_10768086.1| hypothetical protein HSB1_01250 [Halogranum salarium B-1]
gi|399240159|gb|EJN61084.1| hypothetical protein HSB1_01250 [Halogranum salarium B-1]
Length = 723
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 242/705 (34%), Positives = 347/705 (49%), Gaps = 67/705 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE VA +LND FV IKVDREERPD+D+VY T Q + G GGWPLS
Sbjct: 53 SACHWCHVMADESFEDEAVADVLNDEFVPIKVDREERPDLDRVYQTICQLVSGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+ KP GTYFPP+ + G PGF +LR + ++WD + D +Q + AL
Sbjct: 113 VWLTPEGKPFYVGTYFPPQARQGAPGFLDLLRNISNSWDSEEDRAEMEN--RADQWTTAL 170
Query: 140 SASASSNKLP-DELPQ-NALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMMLYHSK 195
+ P DE P + L A+ + D GGFGS PKFP P I ++L +
Sbjct: 171 DDQLADTPDPADETPDVDVLGTAAQAALRGADREHGGFGSGEGPKFPHPGRIDLLL---R 227
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ +G+ E + TL MA GG++D VGGGFHRY+VD W VPHFEKMLYD
Sbjct: 228 TYDRSGR----GETLNVATETLDAMANGGLYDQVGGGFHRYTVDRSWTVPHFEKMLYDNA 283
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA-------DSAETEG 308
+L YL + +T + Y+ I ++ ++ R++ P G FS DA +SAE+
Sbjct: 284 ELPKSYLAGYQVTGEPRYARIAQETFAFVERELTHPDGGFFSTLDAQSEGFDDESAESAD 343
Query: 309 A-------TRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 360
++EGAFYVWT ++V ++L E A LF + Y + GN +
Sbjct: 344 GDDSEGGEAEREEGAFYVWTPEQVHEVLDEEDAELFCDRYGITKRGNFE----------- 392
Query: 361 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
G +VL A + + L R LF+ R +RPRP D+KV+ WNGL
Sbjct: 393 HGTSVLNISTPVEELAEEYDIDRADVSERLTNARVALFEAREERPRPPRDEKVLAGWNGL 452
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 480
+ISSFA +++L A AE A SF+R HL+D+ RL F
Sbjct: 453 MISSFAMGARVLDPALAGA-----------------AERALSFVREHLWDDDAKRLSRRF 495
Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
++ K G+L+DYAFL G +LY+ L +A++L + F D E G + T
Sbjct: 496 KDQDVKGDGYLEDYAFLARGAFELYQATGDVDHLAFALDLARVIEAEFWDDEKGTLYFTP 555
Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
++ R +E D + PS V+ LV L S +D + AE L R++
Sbjct: 556 ASGEQLVTRPQELTDSSTPSSLGVATDLLVDLDHF--DSDAD-FGDIAERVLKTHADRIR 612
Query: 601 DMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADT 660
+ + AA+ + + + V D+ +LA + L V+ P
Sbjct: 613 GSPLEHVSLALAAEKFARGGLELTLAVDELPD-DWWEVLAGRY----LPGAVVSQRPHSD 667
Query: 661 EEMDFWEE---HNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+E+D W + + A + K C++F+CSPP TD
Sbjct: 668 DELDEWLDVLGLDEVPPIWAGRDGKNGKATVYACESFACSPPQTD 712
>gi|374376399|ref|ZP_09634057.1| protein of unknown function DUF255 [Niabella soli DSM 19437]
gi|373233239|gb|EHP53034.1| protein of unknown function DUF255 [Niabella soli DSM 19437]
Length = 687
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 236/698 (33%), Positives = 350/698 (50%), Gaps = 75/698 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME ESFED A L+N+ F++IKVDREERPD+D +YM VQ + G GGWPL+V
Sbjct: 49 ACHWCHVMERESFEDAATAALMNEHFINIKVDREERPDIDHIYMDAVQTMTGSGGWPLNV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD KP GGTY+PP RP +K +L V DA+ KR + Q +QL +A S
Sbjct: 109 FLTPDKKPFYGGTYYPPVSYANRPSWKDVLTAVSDAFQNKRTAIQQQAEGLTQQLVDANS 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
D L C+ L ++ D+ +GGFG APKFP+ I+ +L + +D
Sbjct: 169 FGIGDGSGADFLRDEVDAACSAILKQA-DTSWGGFGRAPKFPQTQTIRFLLRYHYAEKDR 227
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
S A + L +L M +GGI+D VGGGF RY+ D W PHFEKMLYD L
Sbjct: 228 PDSF-ADNALQQALLSLDKMMEGGIYDQVGGGFARYATDTEWLAPHFEKMLYDNALLVVT 286
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+A+ +T+D Y + ++ R++ G ++A DADS EG +EG FYVW
Sbjct: 287 LSEAYQVTRDERYRGCIEQTIAFIERELTDASGGFYAALDADS---EG----EEGKFYVW 339
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNC---DLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+ KE+E++L E A LF +Y + +GN ++ R+ P EF N E+N++ A
Sbjct: 340 SKKEIEELLREDADLFCRYYDITESGNWEGKNILRILTPLKEFAATN---EINETLLEA- 395
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+L + R +L R+ R RP LDDK+I+ WN L+ +++++A + +EA
Sbjct: 396 -----------LLEKGRLQLLVARAHRIRPALDDKIILGWNALMNTAYSKAFEATGNEA- 443
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y++ A F+ + ++ H ++ G +K P FLDDYA+L
Sbjct: 444 ---------------YLQRATDNMRFL-LNAFENTDGSFAHVWKAGVAKYPAFLDDYAYL 487
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I LL L + +L A L E F + E G +F T V+LR KE +DGA
Sbjct: 488 IEALLQLARVTADYSYLEKARALCQGIQEHFAESETGYFFYTPQNQGDVILRKKEVYDGA 547
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V NL+ L+ + +R AE + +L + + P A ML+
Sbjct: 548 TPSGNAVMAANLLHLSVCFDLPE---WRVQAEQMI----VQLANAIIKYP-TSFGAWMLA 599
Query: 618 V----PSRKHVVLVG-HKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
K + L+G +KSS+ + +L + L +I P + +
Sbjct: 600 FYRVQQGSKEIALIGDYKSSL--QELL-----HHFLPGAIIMAGPNADAHYPLLADKRAG 652
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N ++ +C++++C PV + L NLL
Sbjct: 653 N-----------PLLIYLCEHYACRQPVDNLTELFNLL 679
>gi|421090081|ref|ZP_15550882.1| PF03190 family protein [Leptospira kirschneri str. 200802841]
gi|410001344|gb|EKO51958.1| PF03190 family protein [Leptospira kirschneri str. 200802841]
Length = 711
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 237/696 (34%), Positives = 353/696 (50%), Gaps = 68/696 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ +A LN FVSIKVDREERPD+D++YM + A+ GGWPL++
Sbjct: 78 TCHWCHVMEKESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNM 137
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P+ GGTYFPPE +YGR GF +L ++ W +KR L + + + L ++
Sbjct: 138 FLTPEGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGE 197
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKK 196
+ A + D P+N YDS+FGGF + KFP + + +L YHS
Sbjct: 198 SRAKEKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-- 255
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
SG + +MV TL M +GGI+D +GGG RYS D RW VPHFEKMLYD
Sbjct: 256 ------SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSL 308
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ + ++K + DI+ YL RDM GG I + + + ++EG
Sbjct: 309 FLEILAEYSLVSKKISAKSFALDIVSYLHRDMRMDGGGI-------CSAEDADSEEEEGL 361
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FY+W +E ++ GE + L ++ + + GN F+GKN+L E +
Sbjct: 362 FYIWDLEEFREVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRG 405
Query: 377 SKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S K+L+ L + KL + RSKR RP DDK++ SWNGL I + +
Sbjct: 406 SNFTEEESKHLDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------ 459
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+ R++++++AE SFI ++L D + R+ FR G S G+ +DYA
Sbjct: 460 ----------IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYA 508
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+I+ + L+E G G ++L A+ LF R G F TG D VLLR D +
Sbjct: 509 EMIASSIVLFEAGRGVRYLQNAVFWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGY 566
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
DG EPS NS +LV+L+ + G SD YR+ AE F L A+ P + A
Sbjct: 567 DGVEPSANSSLAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALNYPFLLSAYW 624
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
SR+ V++ K+S ++LA + + + ++ + EE +
Sbjct: 625 SYKYHSREIVLI--RKNSEAGRDLLAWIQSRFLPDSVFAVVNEDELEEA-------RKLS 675
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
S+ + S + VC+NFSC P+ + LE +
Sbjct: 676 SLFDSRDSGGNALVYVCENFSCKLPIDNVSDLEKYM 711
>gi|338532946|ref|YP_004666280.1| hypothetical protein LILAB_16495 [Myxococcus fulvus HW-1]
gi|337259042|gb|AEI65202.1| hypothetical protein LILAB_16495 [Myxococcus fulvus HW-1]
Length = 696
Score = 358 bits (920), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 241/698 (34%), Positives = 344/698 (49%), Gaps = 71/698 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE A+L+N+ F++IKVDREERPD+D++Y VQ + GGGWPL+
Sbjct: 57 SACHWCHVMAHESFESPETARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLT 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PDLKP GGTYFPP+D+YGRPGF +L ++DAW+ K+D + + A E L E
Sbjct: 117 VFLTPDLKPFYGGTYFPPQDRYGRPGFPRLLGALRDAWENKQDEVQRQAAQFEEGLGEL- 175
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A+ + P L + + ++K D GGFG APKFP P+ +ML ++
Sbjct: 176 -ATYGLDAAPSALTAADVVAMGQGMAKQVDPAHGGFGGAPKFPNPMNFALMLRAWRR--- 231
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G + + V TL+ MA GGI+D +GGGFHRYSVD RW VPHFEKMLYD QL +
Sbjct: 232 ----GGGAPLKDAVFLTLERMALGGIYDQLGGGFHRYSVDARWRVPHFEKMLYDNAQLLH 287
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y A + + + + + Y+RR+M GG ++A+DADS EG +EG F+V
Sbjct: 288 LYAQAQQVEPRPLWRKVVEETVAYVRREMTDAGGGFYAAQDADS---EG----EEGKFFV 340
Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W +EV L E A L H+ +KP GN + G VL + + A +
Sbjct: 341 WRPEEVRAALPEAQAELVLRHFGIKPEGNFE-----------HGATVLEVVVPVAELARE 389
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+ + L R+ LF+ R +R +P DDK++ WNGL+I A A+++
Sbjct: 390 RGLSEDAVARALAAARQTLFEARERRVKPGRDDKLLSGWNGLMIRGLALAARVF------ 443
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+R E+ A AA F+ +D RL S++ G ++ GFL+DY L
Sbjct: 444 ----------ERPEWATWAAEAADFVLAKAWD--GTRLARSYQEGQARIDGFLEDYGDLA 491
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
SGL LY+ K+L A L LF D E Y +++ D A
Sbjct: 492 SGLTALYQATFDVKYLEAADALVRRAVALFWDAEKAAYLTAPRGQKDLVVATYGLFDNAS 551
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSG S V LA++ G K + + E +A L AM + AAD L
Sbjct: 552 PSGASTLTEAQVELAALT-GDKQ--HLELPERYVARMREGLVRNAMGYGYLGLAADAL-- 606
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF-WEEHNSNNASMA 677
++ + A AS D+ +D A + W+ ++
Sbjct: 607 --------------LEGAAAVTVAGASDDVAPLCAAVDHAFAPTVALSWKAPGQPVPALL 652
Query: 678 RNNFSADKVV-----ALVCQNFSCSPPVTDPISLENLL 710
+ F + V A +C+ F C PVT+P L L
Sbjct: 653 QATFEGREPVKGRAAAYLCRGFVCELPVTEPDVLAQRL 690
>gi|410724261|ref|ZP_11363459.1| thioredoxin domain containing protein [Clostridium sp. Maddingley
MBC34-26]
gi|410602266|gb|EKQ56747.1| thioredoxin domain containing protein [Clostridium sp. Maddingley
MBC34-26]
Length = 617
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 240/689 (34%), Positives = 349/689 (50%), Gaps = 78/689 (11%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE VAK++ND FV++KVDREERPDVD VYMT QAL G GGWPL++ ++PD K
Sbjct: 1 MAHESFEDEEVAKIMNDNFVAVKVDREERPDVDSVYMTVCQALTGHGGWPLTIIMTPDQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTY+P + KY PG IL V W + ++ L + + +L + S +
Sbjct: 61 PFYAGTYYPKKSKYNIPGLMDILNAVVKQWSEDKNKLISTSDGILSELGQYFEGETSCVE 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
L + +N QL +++D +GGFG APKFP P +I +L + K ++ K+ E +
Sbjct: 121 LTSKTLENGYN----QLLQTFDKNYGGFGEAPKFPTPHKIMFLLRYYKNHKNI-KALEIA 175
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
E TL M +GG+ DH+G GF RYS D +W VPHFEKMLYD L YL+ + +
Sbjct: 176 EK------TLVSMYRGGMFDHIGYGFSRYSTDNKWLVPHFEKMLYDNALLILAYLEGYEI 229
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
TK+ Y + L+Y+ R++ G + AEDADS EG +EG +YV+ E+
Sbjct: 230 TKNELYKDVATKALEYIFRELSNKEGGFYCAEDADS---EG----EEGKYYVFEPSEILR 282
Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLE 384
+LG E F +++ + GN F+GK++ LI+ N+ + K
Sbjct: 283 VLGDEDGTYFNDYFDITLNGN------------FEGKSIPNLIKNNEFDKTNDK------ 324
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
I C + L RS R + H DDK++ SWNGL+I++ A+A K+++ E
Sbjct: 325 ----IKALCEQVLL-YRSDRYKLHKDDKILTSWNGLMIAALAKAYKVIEDE--------- 370
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
Y E A+ A +FI L DE +RL +R S+ +LDDYAFL GL++L
Sbjct: 371 -------RYFEYAKKAVNFIFEKLMDEN-NRLLARYREEESRHKAYLDDYAFLCFGLIEL 422
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNS 563
YE +L A+++ F D + G++ GED L+ R KE DGA PSGNS
Sbjct: 423 YESSFDISFLSKALDINKNMINFFWDYKNYGFY-LYGEDSEQLIARPKELFDGAMPSGNS 481
Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
V+ NL++LA I S + + A L + + AA S++
Sbjct: 482 VAAYNLIKLARITGDSNLE---EMAGKQLNFICGSILREEINHSFFLLAASFALSESKEL 538
Query: 624 VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE--MDFWEEHNSNNASMARNNF 681
V L+ KS + L + A ++L + + D E + F +E+ +F
Sbjct: 539 VCLIKDKSEEEKIKDLLSEKAIFNLTTIIKTNENKDEIEKLIPFVKEY----------DF 588
Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLL 710
DK +C+ SC PV D L NLL
Sbjct: 589 INDKSTYYLCKGKSCLAPVNDIDELINLL 617
>gi|433424873|ref|ZP_20406585.1| thioredoxin domain containing protein [Haloferax sp. BAB2207]
gi|432197957|gb|ELK54295.1| thioredoxin domain containing protein [Haloferax sp. BAB2207]
Length = 703
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 228/694 (32%), Positives = 343/694 (49%), Gaps = 68/694 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+ KP GTYFPPE + G PGF+ ++ ++W RD + +++ L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDVVESFAESWRTDRDEIENRADQWTSAITDRL 172
Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 197
+ + P E P + L + + D GGFG PKFP+P I +L
Sbjct: 173 EETPDT---PGEAPGSDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL------ 223
Query: 198 EDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
G A G++ L +L MA GG+ DH+GGGFHRY VD W VPHFEKMLYD
Sbjct: 224 -----RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYD 278
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q LA+ YLDA LT + Y+ + + +++RR++ G F+ DA S +
Sbjct: 279 QAGLASRYLDAARLTGNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GE 331
Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT +V D+L E A LF + Y + P GN F+ K ++ ++ +
Sbjct: 332 EGTFYVWTPADVRDLLPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSAT 379
Query: 373 SAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+A A + + + + L + R+ LF R R RP D+KV+ WNGL+IS+FA+ S +
Sbjct: 380 TADLADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVV 439
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L+ ++ +A A A F+R L+D++T L NG K G+L
Sbjct: 440 LEDDSLAAD----------------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYL 483
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYAFL G DLY+ L +A++L F D + G + T S++ R +
Sbjct: 484 EDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQ 543
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
E D + PS V+ + L + + + A+ L F R++ + +
Sbjct: 544 EPTDQSTPSSLGVATSLFLDLEQFAPDAG---FGEVADAVLGSFANRVRGSPLEHVSLAL 600
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHN 670
AA+ + + V ++ + + A AS L V+ P E+D W +E
Sbjct: 601 AAEKAASGVPELTV-----AADEIPDEWRATLASRYLPGLVVSRRPGTDAELDAWLDELR 655
Query: 671 SNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
+ A A + + C+NF+CS P D
Sbjct: 656 LDEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689
>gi|385803931|ref|YP_005840331.1| hypothetical protein Hqrw_2868 [Haloquadratum walsbyi C23]
gi|339729423|emb|CCC40679.1| YyaL family protein [Haloquadratum walsbyi C23]
Length = 768
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 236/717 (32%), Positives = 347/717 (48%), Gaps = 92/717 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ VA +LND FV IKVDREERPD+D++Y T Q + GGGGWPLSV+
Sbjct: 55 CHWCHVMAEESFEDDTVATILNDSFVPIKVDREERPDLDRIYQTICQLVTGGGGWPLSVW 114
Query: 82 LSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
L+PD KP GTYFP ++ R PGF I + AW+ R L + L +
Sbjct: 115 LTPDGKPFYVGTYFPKTERSDRGDTPGFLEICQSFATAWENDRSELESRANQWADTLQDR 174
Query: 139 LSASASSNKLPDEL------------PQNA-----------LRLCAEQLSKSYDSRFGGF 175
L +++ D PQ L + ++ D+ +GGF
Sbjct: 175 LEVDTNADTSIDVDDDDDVPAPDIASPQTDSDADDDSTMDLLTSVSTAAIRATDNEYGGF 234
Query: 176 GS-APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
GS PKFP+P I+ ++ H++ +T + TL MA GGI+DHVGGGF
Sbjct: 235 GSRGPKFPQPGRIEALIRAHAETNRETALDAATA--------TLDAMAAGGIYDHVGGGF 286
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRY+ D +W VPHFEKMLYD +L+ VYL A+ T Y+ + + +L R++ P G
Sbjct: 287 HRYATDRKWTVPHFEKMLYDNAELSRVYLSAYQHTGRDRYARVAHETFAFLSRELQHPEG 346
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLS 351
+S D A++EG +EG FYVWT + + + + + I + + + + GN
Sbjct: 347 GFYSTLD---AQSEG----EEGRFYVWTPETIRNAITDQQIADIAIDRFGVTEGGN---- 395
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+G VL S A+K + ++ ++ L + R LFD R R RP+ D+
Sbjct: 396 --------FEGSTVLTATASVSQLATKYSLTTDEIMSQLADARDSLFDARMDRERPNRDE 447
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ +WNGL ISS AR IL++E +Y E+A A SFIR HL+D
Sbjct: 448 KILTAWNGLAISSLARGGLILETE----------------QYTELANDALSFIRTHLWDS 491
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
+ RL +++G G+LDDYAFL G DLY+ + L +A+ L + ELF D
Sbjct: 492 DSGRLSRRYKDGDVDETGYLDDYAFLARGAFDLYQTTGAVEHLSFAVTLAESIVELFYDT 551
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
G + T + S++ R ++ D + S ++V L + + S +
Sbjct: 552 AGETLYLTPEDAESLVARPQDLRDQSTSSSAGIAVQTLNAVDPFTSTDFSGI-------A 604
Query: 592 LAVFETRLKDMAMAVPL----MCCAADMLSVPSRKH-VVLVGHKSSVDFENMLAAAHASY 646
AV +T D PL + AAD +R H V++ H + + + + AS
Sbjct: 605 GAVIDTH-ADEIRGRPLEHISLAMAADSR---ARGHDEVVIAHDTDTELSQPIRSDIAST 660
Query: 647 DLNKTVIHIDPADTEEMDFWEEH---NSNNASMARNNFSADKVVALVCQNFSCSPPV 700
L + PA ++ W + +S A A + K C +CSPP
Sbjct: 661 YLPGVPLSQRPATVSGLESWTDELGLDSPPAIWAGRHQRDSKATIYACSGRACSPPT 717
>gi|225679668|gb|EEH17952.1| DUF255 domain-containing protein [Paracoccidioides brasiliensis
Pb03]
Length = 865
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 214/525 (40%), Positives = 299/525 (56%), Gaps = 33/525 (6%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CHVME ESF +A +LN F+ IK+DREERPD+D+VYM YVQA G GGWPL+VFL+P
Sbjct: 67 CHVMEKESFMAPEIAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTP 126
Query: 85 DLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
DL+P+ GG+Y+P P G+ F IL K++D W ++ +S +QL
Sbjct: 127 DLEPVFGGSYWPGPHSNALPTLGGEGQITFVDILEKLRDVWHTQQLRCRESAKDITKQLR 186
Query: 137 EALSASASSNKLPD-----ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
E + + +K D +L L + + YD+ GGF APKFP PV + ++
Sbjct: 187 E-FAEEGTHSKQSDVEAEEDLEIELLEEAYQHFASRYDAVNGGFSEAPKFPTPVNLSFLV 245
Query: 192 YHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
+ S+ + D E S ++ + TL M++GGIHD +G GF RYSV W +PHFE
Sbjct: 246 HLSRYPGAVADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGHGFARYSVTADWSLPHFE 305
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETE 307
KMLYDQ QL +VY+DAF D DI Y+ M+ P G S+EDADS +
Sbjct: 306 KMLYDQAQLLDVYVDAFDSAYDPELLGAMYDIATYITSPPMLSPTGGFHSSEDADSRPSP 365
Query: 308 GATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
T K+EGAFYVWT KE++ ILG+ A + H+ + GN ++R++DPH+EF +NVL
Sbjct: 366 NDTEKREGAFYVWTLKELKQILGQRDAEVCARHWGVLADGN--VARINDPHDEFINQNVL 423
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSF 425
S A + G+ ++ + I+ R KL + R SKR RP LDDK+IV+WNGL I +
Sbjct: 424 SIQVTPSKLAKEFGLGEDEVVRIIKGSREKLREYRESKRVRPDLDDKIIVAWNGLAIGAL 483
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP- 484
A+ S +L++ + F AE A FI+ +L+DEQT +L +R G
Sbjct: 484 AKCSVVLENLDREKAYQF----------RRAAEEAVRFIKHNLFDEQTGQLWRIYRGGVR 533
Query: 485 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 529
PGF DDYA+LISGL++LYE L +A +LQ ++ FL
Sbjct: 534 GDTPGFADDYAYLISGLINLYEATFDDSHLQFAEQLQQYLNKHFL 578
>gi|448604533|ref|ZP_21657700.1| thioredoxin domain containing protein [Haloferax sulfurifontis ATCC
BAA-897]
gi|445743942|gb|ELZ95422.1| thioredoxin domain containing protein [Haloferax sulfurifontis ATCC
BAA-897]
Length = 708
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 233/698 (33%), Positives = 351/698 (50%), Gaps = 76/698 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAEVLNEQFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
V+L+P+ KP GTYFPPE + G PGF+ ++ ++W RD + A+ AI ++L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDRDEIENRAEQWTSAITDRL 172
Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
E ++ A +++ D Q ALR D GGFG PKFP+P I +L
Sbjct: 173 EETPDVAGEAPGSEVLDTTVQAALR--------GADRDHGGFGGDGPKFPQPGRIDALL- 223
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
+ +G+ E + +L MA GG+ DH+GGGFHRY VD W VPHFEKMLY
Sbjct: 224 --RGYAVSGR----HEALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLY 277
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
DQ LA YLDA LT + Y+ + + +++RR++ G +F+ DA S
Sbjct: 278 DQAGLAARYLDAARLTGNESYATVAAETFEFVRRELTHDDGGLFATLDAQSG-------G 330
Query: 313 KEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+EG FYVWT +V +L E A LF + Y + P GN F+ K ++ ++
Sbjct: 331 EEGTFYVWTPDDVRGLLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSA 378
Query: 372 SSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
++A A + + + + L + R+ LF R R RP D+KV+ WNGL+IS+FA+ +
Sbjct: 379 TTADLADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGAV 438
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+L+ ++ + A A F+R L+D++T L NG K G+
Sbjct: 439 VLEDDS----------------LADDARRALDFVRERLWDDETATLSRRVMNGEVKGDGY 482
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYAFL G DLY+ L +A++L F D + G + T S++ R
Sbjct: 483 LEDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRP 542
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
+E D + PS V+ + L + D + + A+ L F R++ + +
Sbjct: 543 QEPTDQSTPSSLGVATSLFLDLEQF---APEDGFGEVADAVLGSFANRVRGSPLEHVSLA 599
Query: 611 CAADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-E 667
AA+ + VP + + + ++ LA+ + L V+ P EE+D W +
Sbjct: 600 LAAEKAASGVP---ELTIAADEVPDEWRETLASRY----LPGLVVSRRPGTDEELDAWLD 652
Query: 668 EHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVTD 702
E + A R D V C+NF+CS P D
Sbjct: 653 ELGLDEAPPIWAGREAADGDPTV-YACENFTCSAPTHD 689
>gi|292655805|ref|YP_003535702.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
gi|448289792|ref|ZP_21480955.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
gi|291370452|gb|ADE02679.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
gi|445581309|gb|ELY35670.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
Length = 703
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 240/708 (33%), Positives = 344/708 (48%), Gaps = 96/708 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+ KP GTYFPPE + G PGF+ I+ ++W R+ + +++ L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDIVESFAESWLTDREEIENRAEQWTSAITDRL 172
Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 197
+ + P E P + L + + D GGFG PKFP+P I ML
Sbjct: 173 EETPDT---PGEAPGSDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDAML------ 223
Query: 198 EDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
G A G++ L +L MA GG+ DH+GGGFHRY VD W VPHFEKMLYD
Sbjct: 224 -----RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYD 278
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q LA+ YLDA LT + Y+ + + +++RR++ G F+ DA S +
Sbjct: 279 QAGLASRYLDAARLTGNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GE 331
Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT +V D+L E A LF + Y + P GN F+ K ++ ++ +
Sbjct: 332 EGTFYVWTPDDVRDLLPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSAT 379
Query: 373 SAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+A A + + + + L + R+ LF R R RP D+KV+ WNGL+IS+FA+ S +
Sbjct: 380 TADLADEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVV 439
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L+ ++ +A A A F+R L+D +T L NG K G+L
Sbjct: 440 LEDDSLAAD----------------ARRALDFVRERLWDAETATLSRRVMNGEVKGDGYL 483
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYAFL G DLY+ L +A++L F D + G + T S++ R +
Sbjct: 484 EDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQ 543
Query: 552 EDHDGAEPSGNSVSVINLVRL------------ASIVAGSKSDYYRQNA-EH-SLAVFET 597
E D + PS V+ + L A V GS ++ R + EH SLA+
Sbjct: 544 EPTDQSTPSSLGVATSLFLDLEQFAPDAGFGEVADAVLGSFANRVRGSPLEHVSLALAAE 603
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
+ A VP + AAD VP L AS V+ P
Sbjct: 604 K---AASGVPELTVAAD--EVPDEWRATL-----------------ASRYFPGLVVSRRP 641
Query: 658 ADTEEMDFW-EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
EE+D W +E + A A + + C+NF+CS P D
Sbjct: 642 GTDEELDAWLDELGLDEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689
>gi|398337804|ref|ZP_10522509.1| hypothetical protein LkmesMB_20984 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 630
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 242/687 (35%), Positives = 346/687 (50%), Gaps = 62/687 (9%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE++ +A LN ++SIKVDREERPD+D+++M + A+ GGWPL++FL+PD K
Sbjct: 1 MERESFENQTIADYLNSHYISIKVDREERPDIDRIFMDALHAMDQQGGWPLNMFLTPDGK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P+ GGTYFPPE +YGR F +L ++ W KR L + + L E+ AS +
Sbjct: 61 PITGGTYFPPEQRYGRKSFLEVLNVIQGVWSGKRQELIAASTELAQYLKESGEGRASEKQ 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKSG 204
P+N+ YD +FGGF + KFP + + +L YH S
Sbjct: 121 ESGFPPENSFDAGYSLYESYYDPQFGGFKTNHVNKFPPSMGLSFLLRYH--------HSS 172
Query: 205 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 264
+MV TL M +GGI+D VGGG RYS D W VPHFEKMLYD ++
Sbjct: 173 GNPRALEMVENTLLAMKQGGIYDQVGGGLCRYSTDHHWLVPHFEKMLYDNSLFLESLVEY 232
Query: 265 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 324
++K + D+++YL RDM GG I SAEDADS EG +EG FY+W E
Sbjct: 233 SQVSKKIPAESFALDVIEYLHRDMRISGGGICSAEDADS---EG----EEGLFYIWDLAE 285
Query: 325 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
++ GE + L ++ + + GN F+GKN+L E + SA A L+
Sbjct: 286 FREVCGEDSSLLEKFWNVTEKGN------------FEGKNILHE-SYRSAVAKLDAEELK 332
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
+ L R+KL + RSKR RP DDK++ SWNGL I + +A +
Sbjct: 333 RIDAALDRGRKKLLERRSKRIRPLRDDKILTSWNGLYIKALVKAGAAFQ----------- 381
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
R+E++ +AE SFI ++L D R+ FR+G S G+ +DYA +I+ + L
Sbjct: 382 -----REEFLRLAEETYSFIEKNLID-SNGRILRRFRDGESGILGYSNDYAEMIAASIAL 435
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNS 563
+E G G ++L A+ LF R G F TG D VLLR D +DG EPS NS
Sbjct: 436 FEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSANS 493
Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
+LV+L+ + G SD YR+ AE F L A++ P + A S K
Sbjct: 494 SLSYSLVKLS--LLGVHSDRYREIAESIFLYFTKELSTHALSYPFLLSAYWSYKNHS-KE 550
Query: 624 VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSA 683
+VL+ K+S +++LAA + N V + + E+ +S+ S
Sbjct: 551 IVLI-RKNSDAGKDLLAAIGKKFLPNSVVAVVSEDELEDA-------RKLSSLFDARDSG 602
Query: 684 DKVVALVCQNFSCSPPVTDPISLENLL 710
+ VC+NF+C PV + LE L
Sbjct: 603 GDALVYVCENFACKLPVNNVADLEKFL 629
>gi|302497930|ref|XP_003010964.1| hypothetical protein ARB_02862 [Arthroderma benhamiae CBS 112371]
gi|291174510|gb|EFE30324.1| hypothetical protein ARB_02862 [Arthroderma benhamiae CBS 112371]
Length = 714
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 223/614 (36%), Positives = 331/614 (53%), Gaps = 60/614 (9%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESF VA +LN F+ IK+DREERPD+D VYM YVQA G GGWPL+VFL+PDL+
Sbjct: 1 MEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLE 60
Query: 88 PLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
P+ GGTY+P + P GF +L K++D W+ ++ +S QL E
Sbjct: 61 PVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEITRQLREFA 120
Query: 140 S-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
+ + ++ ++L + L + YD+ GGF +PKFP PV + +L S
Sbjct: 121 EEGTHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVNLSFLLRLS 180
Query: 195 KKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ E D E + +M + T+ +A+GGI D +G GF RYSV W +PHFEKML
Sbjct: 181 RYPEEVMDIVGREECVKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWSLPHFEKML 240
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGAT 310
YDQ QL +V++D F + + D++ Y+ ++ P G +S+EDADS + T
Sbjct: 241 YDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSTPILSPMGCFYSSEDADSQPSPEDT 300
Query: 311 RKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
K+EGA+YVWT KE++ ILG+ A + H+ + P GN ++R++DPH+EF +NVL
Sbjct: 301 EKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFMNRNVLRIA 358
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARA 428
+ A + G+ E+ + IL R KL + R +KR RP LDDK+IV+WNGLVI + A+
Sbjct: 359 TTPTQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGLVIGALAKC 418
Query: 429 SKILKS-EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-NGPSK 486
+ +L+ +AE + K ++A +A FI+ +L+D ++ +L +R +
Sbjct: 419 AILLEDIDAEKS-----------KHCRQMASNAVKFIKENLFDAESGQLWRIYRADSRGD 467
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ--------------NTQ--DELFLD 530
PGF DDYA+LISGLL LYE L +A +LQ N + ++ F+
Sbjct: 468 TPGFADDYAYLISGLLQLYEATFDDAHLQFADKLQLCGKGKGVWLTARLNAEYLNKYFIS 527
Query: 531 REGG------GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
G++ T E P L R+K D A PS N V NL+RL+S++
Sbjct: 528 VSASDSSICTGFYMTPSEAVTDTPGALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDES 587
Query: 581 SDYYRQNAEHSLAV 594
+ H+ AV
Sbjct: 588 YKLKARQTCHAFAV 601
>gi|162450797|ref|YP_001613164.1| hypothetical protein sce2525 [Sorangium cellulosum So ce56]
gi|161161379|emb|CAN92684.1| hypothetical protein sce2525 [Sorangium cellulosum So ce56]
Length = 716
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 246/719 (34%), Positives = 351/719 (48%), Gaps = 84/719 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME ESFEDE +A+ +ND FV+IKVDREERPD+D +Y VQ + GGWPL+V
Sbjct: 52 ACHWCHVMERESFEDEAIARHMNDLFVNIKVDREERPDLDHIYQLVVQLMGRSGGWPLTV 111
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GTYFPP+D G PGF +L K+ DA+ +RD + Q E + A
Sbjct: 112 FLTPDQRPFFAGTYFPPKDALGMPGFPKVLDKIADAFRNRRDDVEQQAQEITEAIERAQR 171
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A A + + + LR + QL D R GG GS PKFP + + ++L D
Sbjct: 172 APARAAGVAAPASSDLLRRASRQLLARLDPRHGGIGSRPKFPNTMALDVLLRRGVLESDR 231
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
A+EG V TL M GGI DH+ GGFHRYS DERW VPHFEKMLYD L +
Sbjct: 232 ----VAAEG---VELTLDRMRDGGIWDHLRGGFHRYSTDERWLVPHFEKMLYDNALLLRL 284
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y D F K Y+ R+I+ YL +M P G ++++DADS EG +EG F+VW
Sbjct: 285 YADGFRAFKKPIYAETAREIVGYLFAEMRDPEGGFYASQDADS---EG----REGKFFVW 337
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRM----SDPHN-EFKGKNVLIELNDSSAS 375
T +++ D +GE + + D++R+ S+ N E G VL + +
Sbjct: 338 TLEQLRDAVGEDQLAY------------DMARLVFGISEEGNFEDSGATVLSQHRTLEQA 385
Query: 376 ASKL-----GMP---LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
A+ + G P L++ + L R + R RPRP DDKV+ SWNGL+I + A
Sbjct: 386 AAVIDDGAGGGPSTHLDRCRDALARARVAMLAARDARPRPARDDKVLASWNGLLIGALAD 445
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSK 486
A + L D +++ A A + + R L + R+ ++G P+
Sbjct: 446 AGRAL----------------DEPAWVDAAARAFALLERKLL--RGGRVGRYLKDGAPAG 487
Query: 487 A---------------PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
A PGFLDD A+L + LDLYE S +++ A + + D
Sbjct: 488 ANREHGGSGAAVGDVRPGFLDDQAYLGNAALDLYEATSDPRYVDVARAIADAMIAHHWDE 547
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
G+F T + +++ R ++ +D A PS S++ + +RL+ I + Y AE
Sbjct: 548 AAPGFFFTPDDGDALIARTQDIYDQAAPSAASMAALLCLRLSEIA----DERYLSPAERQ 603
Query: 592 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 651
L V + A + C D L+ + VV+VG S + A Y N+
Sbjct: 604 LDVLAPTALENAFGLGQTVCVLDRLTRGA-VTVVVVGEAGSASAAELTREAFKVYLPNRA 662
Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++ +DPA E E + D VA C+ +CS PVT L+ LL
Sbjct: 663 IVLVDPARPESAAAVEVVAEGKPA------RPDGAVAYACRGRTCSAPVTTAADLKALL 715
>gi|110668468|ref|YP_658279.1| thioredoxin domain-containing protein [Haloquadratum walsbyi DSM
16790]
gi|109626215|emb|CAJ52671.1| YyaL family protein [Haloquadratum walsbyi DSM 16790]
Length = 768
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 235/717 (32%), Positives = 346/717 (48%), Gaps = 92/717 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ VA +LND FV IKVDREERPD+D++Y T Q + GGGGWPLSV+
Sbjct: 55 CHWCHVMAEESFEDDTVATILNDSFVPIKVDREERPDLDRIYQTICQLVTGGGGWPLSVW 114
Query: 82 LSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
L+PD KP GTYFP ++ R PGF I + AW+ R L + L +
Sbjct: 115 LTPDGKPFYVGTYFPKTERSDRGDTPGFLEICQSFATAWENDRSELESRANQWADTLQDR 174
Query: 139 LSASASSNKLPDEL------------PQNA-----------LRLCAEQLSKSYDSRFGGF 175
L + + D PQ L + ++ D+ +GGF
Sbjct: 175 LEVDTNVDTNIDVDDDDDVPAPDIASPQTDSDADDDSTMDLLTSVSTAAIRATDNEYGGF 234
Query: 176 GS-APKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
GS PKFP+ I+ ++ H++ +T + TL MA GGI+DHVGGGF
Sbjct: 235 GSRGPKFPQTGRIEALIRAHAETNRETALDAATA--------TLDAMAAGGIYDHVGGGF 286
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRY+ D +W VPHFEKMLYD +L+ VYL A+ T Y+ + + +L R++ P G
Sbjct: 287 HRYATDRKWTVPHFEKMLYDNAELSRVYLSAYQHTGRDRYARVAHETFAFLSRELQHPEG 346
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLS 351
+S D A++EG +EG FYVWT + + + + + I + + + + GN
Sbjct: 347 GFYSTLD---AQSEG----EEGRFYVWTPETIRNAITDQQIADIAIDRFGVTEGGN---- 395
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+G VL S A+K + ++ ++ L + R LFD R R RP+ D+
Sbjct: 396 --------FEGSTVLTATASVSQLATKYSLTTDEIMSQLADARDSLFDARMDRERPNRDE 447
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ +WNGL ISS AR IL++E +Y E+A A SFIR HL+D
Sbjct: 448 KILTAWNGLAISSLARGGLILETE----------------QYTELANDALSFIRTHLWDS 491
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
+ RL +++G G+LDDYAFL G DLY+ + L +A+ L + ELF D
Sbjct: 492 DSGRLSRRYKDGDVDETGYLDDYAFLARGAFDLYQTTGAVEHLCFAVTLAESIVELFYDA 551
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
G + + S++ R ++ D + PS ++V L + + S +
Sbjct: 552 AGETLYLAPEDAESLVARPQDLRDQSTPSSAGIAVQTLNAVDPFTSTDFSGI-------A 604
Query: 592 LAVFETRLKDMAMAVPL----MCCAADMLSVPSRKH-VVLVGHKSSVDFENMLAAAHASY 646
AV +T D PL + AAD +R H V++ H + + ++ + AS
Sbjct: 605 GAVIDTH-ADEIRGRPLEHISLAMAADSR---ARGHDEVVIAHDTDTELSQLIRSDIAST 660
Query: 647 DLNKTVIHIDPADTEEMDFWEEH---NSNNASMARNNFSADKVVALVCQNFSCSPPV 700
L + PA ++ W + +S A A + K C +CSPP
Sbjct: 661 YLPGVPLSQRPATVSGLESWTDELGLDSPPAIWAGRHQRDSKATIYACSGRACSPPT 717
>gi|283778697|ref|YP_003369452.1| hypothetical protein Psta_0907 [Pirellula staleyi DSM 6068]
gi|283437150|gb|ADB15592.1| protein of unknown function DUF255 [Pirellula staleyi DSM 6068]
Length = 667
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 234/614 (38%), Positives = 324/614 (52%), Gaps = 74/614 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT----YVQALYG--G 73
++CHWCHVME ESF D +AKLLN+ F+ IKVDREERPD+D +YMT Y+Q G G
Sbjct: 81 SSCHWCHVMERESFLDPEIAKLLNENFICIKVDREERPDIDTIYMTAVQTYLQLTTGRRG 140
Query: 74 GGWPLSVFLSPDLKPLMGGTYFPPED--KYGRPGFKTILRKVKDAWDKKRDMLAQSGA-- 129
GGWP++VFL+P+ P GGTYFP D + G GF T+ KV + W K+ L
Sbjct: 141 GGWPMTVFLTPEGNPFFGGTYFPARDGDREGMTGFLTLSSKVSEMWKKEPVKLGDDATTL 200
Query: 130 --FAIEQLS--EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAP 179
F +QL + L A KL + + L+ +D R+GGFG P
Sbjct: 201 ARFIKDQLEGPKLLLAVVLDTKLTTSVEKG--------LAAQFDERYGGFGFDEIEWQRP 252
Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
KFP P +Q +L KK ASE + M++ TL MA GGI+DHVGGGFHRYSVD
Sbjct: 253 KFPEPSNLQFLLEIVKKTP-------ASESRAMLVHTLDRMAMGGIYDHVGGGFHRYSVD 305
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
W +PHFEKMLYD GQL VY +A++LT D Y I R+ +++ R+M G ++A
Sbjct: 306 RMWRIPHFEKMLYDNGQLLTVYSEAYALTGDENYQRIARETAEFMLREMRDTSGGFYAAL 365
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 359
D AETEG EG FY W EVE +L KE + L + LSR +
Sbjct: 366 D---AETEGV----EGKFYRWDKAEVEKLLT------KEEFELY-SAVYGLSRAPNFEET 411
Query: 360 FKGKNVLIELNDSSASASKLG-MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
F +I+L D+ +K + +EK +N L KL R+ R RP D K++ N
Sbjct: 412 F----YVIQLRDTLVDIAKTREITVEKLVNDLRPIHAKLLAARNARKRPLTDTKILAGEN 467
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
GL I+ A A K+LK Y E A +AA+ + + + RL
Sbjct: 468 GLAITGLATAGKLLKE----------------PRYTEAAATAATLVLSKMTAPE-GRLFR 510
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
++ +K +L DY+ L+ GLL L+E +WL AI+L + Q ELF D GG++
Sbjct: 511 TYSGEKAKLNAYLSDYSMLVEGLLALHEATGEQRWLDEAIKLTDQQVELFHDVPRGGFYF 570
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
T+ + S+L RVKE D A P+GNSV+ +NLV+L I ++ Y + AE ++ +
Sbjct: 571 TSKDHESLLARVKETVDSAMPAGNSVAAVNLVKLVKITGKNE---YLKLAEGAIQSAAGQ 627
Query: 599 LKDMAMAVPLMCCA 612
+++ P + A
Sbjct: 628 MQENPTVSPRLATA 641
>gi|448448658|ref|ZP_21591316.1| hypothetical protein C470_01183 [Halorubrum litoreum JCM 13561]
gi|445814276|gb|EMA64242.1| hypothetical protein C470_01183 [Halorubrum litoreum JCM 13561]
Length = 740
Score = 355 bits (912), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 236/721 (32%), Positives = 344/721 (47%), Gaps = 86/721 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA ++N+ FV IKVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
+ +P+ KP GTYFPPE + PGF+ + ++ D+W ++ D A+S
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARD 172
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
+E + + + + L A + YD GGFGS KFP P I +
Sbjct: 173 ELESVPTPEAVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDL 232
Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
++ A G+ +L TL MA GG++D +GGGFHRY+VD +W VP
Sbjct: 233 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 281
Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
HFEKMLYD +L YLD + L D Y+ + + L +L R++ GG FS DA S
Sbjct: 282 HFEKMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRP 341
Query: 306 TEG--------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDP 356
EG + EGAFYVWT +EV+ +L E A L KE Y ++ GN +
Sbjct: 342 PEGRRGDDTGDSDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE------- 394
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
+G V A+ ++ L R LFD R +RPRP D+KV+ +
Sbjct: 395 ----RGTTVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVLAA 450
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTH 474
WNG IS+FARA L + Y E+A A F R LYD +T
Sbjct: 451 WNGRAISAFARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESETG 493
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
L + +G + PG+LDDYAF+ G LD+Y + L +A+EL + + F D + G
Sbjct: 494 ALARRWLDGDVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDADDG 553
Query: 535 GYFNTTGEDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YY 584
+ T D ++ R +E D + PS V+ L +++ G ++D
Sbjct: 554 TIYFTRDRDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGEL 609
Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
R+ AE + R++ + + AA+++ V + + D+ L +
Sbjct: 610 REIAERVVTTHADRIRGSPLEHASLVRAANVVET-GGIEVTIAADEVPDDWRETLGERY- 667
Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVT 701
L ++ PA + +D W + A+ A + + A VC+ F+CSPP T
Sbjct: 668 ---LPGALVAPRPATEDGLDEWLDRLDMTAAPPIWADRGATDGEPTAYVCEGFTCSPPRT 724
Query: 702 D 702
D
Sbjct: 725 D 725
>gi|448529052|ref|ZP_21620367.1| hypothetical protein C467_01076 [Halorubrum hochstenium ATCC
700873]
gi|445709758|gb|ELZ61582.1| hypothetical protein C467_01076 [Halorubrum hochstenium ATCC
700873]
Length = 744
Score = 355 bits (912), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 239/721 (33%), Positives = 347/721 (48%), Gaps = 85/721 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA ++ND FV IKVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAGVINDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
+ +P+ KP GTYFP E + +PGF+ + ++ D+W ++ D A+S
Sbjct: 113 AWCTPEGKPFYVGTYFPLEARRNQPGFRDLCERIADSWSDPEQREEMRRRADQWAESARD 172
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
+E + +A L A + YD +GGFGS KFP P I +
Sbjct: 173 ELESVPTPDAADPDGEGDASPPGDGLLESAAASALRGYDDEYGGFGSGGAKFPMPGRIDL 232
Query: 190 MLY-HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
++ +++ D S A TL MA+GG++D +GGGFHRY+VD W VPHFE
Sbjct: 233 LMRAYARSGRDALLSAAAG--------TLDGMARGGMYDQIGGGFHRYAVDREWTVPHFE 284
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYD +L YLD + LT D Y+ + + L +L R++ G FS DA S E
Sbjct: 285 KMLYDNAELPMAYLDGYRLTGDPAYARVASESLAFLDRELRRDDGGFFSTLDARSRPPE- 343
Query: 309 ATRKK----------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPH 357
+R+ EGAFYVWT +EV+ +L E A L KE Y ++P GN +
Sbjct: 344 -SRRDGNESEEGEDVEGAFYVWTPEEVDAVLDEPAASLVKERYGIRPGGNFE-------- 394
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
+G V A+ + E+ L E R LFD R RPRP D+KV+ SW
Sbjct: 395 ---RGTTVPTLAASVDELAADRDLSPEEVREALTEARTALFDARESRPRPARDEKVLASW 451
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTHR 475
NG IS+FA A+ L + Y ++A A F R LYD +T
Sbjct: 452 NGRAISAFADAAGTLG-----------------EPYADIAREALDFCRDRLYDPEAETGA 494
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L + +G + PG+LDDYAFL G LD+Y + L +A+EL F D + G
Sbjct: 495 LARRWLDGDVRGPGYLDDYAFLARGALDVYAATGDLEPLGFALELAEALVAEFYDADDGT 554
Query: 536 -YFNTT---------GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YY 584
YF + G+ ++ R +E D + PS V+ L +++ G ++D +
Sbjct: 555 IYFTRSLDGRESGGDGDAGPLMARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGRF 610
Query: 585 RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 644
R A + R++ + + AAD++ V + + ++ L +
Sbjct: 611 RDVARRVVTTHADRIRGGPLEHASLVRAADLVET-GGIEVTVAADEVPDEWRETLGERY- 668
Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVT 701
L ++ PA +D W + + A + + + A VC++F+CSPP T
Sbjct: 669 ---LPSALVAPRPATEAGLDEWLDRLDMAEAPPIWAGRDATDGEPTAYVCRDFTCSPPRT 725
Query: 702 D 702
D
Sbjct: 726 D 726
>gi|358063474|ref|ZP_09150085.1| hypothetical protein HMPREF9473_02147 [Clostridium hathewayi
WAL-18680]
gi|356698267|gb|EHI59816.1| hypothetical protein HMPREF9473_02147 [Clostridium hathewayi
WAL-18680]
Length = 682
Score = 355 bits (912), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 216/580 (37%), Positives = 305/580 (52%), Gaps = 64/580 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ SF + + FL +TCHWCHVME ESFE+EG+A ++N FV +KVDREERPD
Sbjct: 36 GKESFEKAEREDKPIFLSIGYSTCHWCHVMEEESFENEGIAGIMNREFVCVKVDREERPD 95
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD VYM+ QA+ G GGWPL++ ++P+ +P GTY PP +YGR G +L V W
Sbjct: 96 VDSVYMSVCQAMTGQGGWPLTIIMTPECRPFFAGTYLPPVRRYGRMGLAELLNSVAKQWK 155
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ R L +S EQ+ +A + + E+ + + +QL +S+D GGFG A
Sbjct: 156 ENRQQLFRSA----EQI-QAFLRQQTEMDVEGEVSKALVSQGYQQLERSFDEIHGGFGGA 210
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P +H L D G + E MV TL M +GGI DH+GGGF RYS
Sbjct: 211 PKFPTP-------HHLLFLMDYGVRRDVPEAFYMVDRTLVQMYRGGIFDHIGGGFSRYST 263
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DERW VPHFEKMLYD L Y A+ +T Y+ + IL Y++ ++ GG +
Sbjct: 264 DERWLVPHFEKMLYDNALLTLAYAKAYGITGKKLYAEVAGRILGYVKAELTDEGGGFYCG 323
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
+DADS EG +YV+T +E+ +LG F Y + +GN
Sbjct: 324 QDADSDGV-------EGKYYVFTPEEIRAVLGNADGERFLARYGMTGSGN---------- 366
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
F+GK + L D ++ P E R+L++ R R R H DDK++VSW
Sbjct: 367 --FEGKWI-PNLLDYQGDLEEM-QP---------EKDRRLYEYRLARARLHKDDKILVSW 413
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NG +I++ RA +L+ +A Y+E+A A +F+R L + RL
Sbjct: 414 NGWMITACGRAGAVLEEDA----------------YVEMAVRAEAFLREKLVKD--GRLM 455
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+R+G + G LDDYA L++LYE T +L A EL + E F D E GG++
Sbjct: 456 VRYRDGEAAGEGKLDDYACYCQALVELYEVTYETDYLRRARELADVMVEQFFDGERGGFY 515
Query: 538 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 577
+ +++R KE +DGA PSGNSV+ + L +L I
Sbjct: 516 LYAKDGEELIVRTKETYDGAMPSGNSVAALVLEQLGRITG 555
>gi|83649209|ref|YP_437644.1| hypothetical protein HCH_06582 [Hahella chejuensis KCTC 2396]
gi|83637252|gb|ABC33219.1| Highly conserved protein containing a thioredoxin domain [Hahella
chejuensis KCTC 2396]
Length = 762
Score = 355 bits (912), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 222/599 (37%), Positives = 320/599 (53%), Gaps = 68/599 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF++E VA+ LN +F+ IKVDRE+RPD+D++YMT VQ + G GGWP+S
Sbjct: 80 STCHWCHVMEEESFDNEEVAQTLNGYFIPIKVDREQRPDLDEIYMTAVQIITGHGGWPMS 139
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
FL+P+ P G TYFP RP F +LRKV + W+++++ L + G +LSEA+
Sbjct: 140 SFLTPEGNPFFGATYFP------RPRFINLLRKVHELWEEQQENLLEQG----RRLSEAV 189
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S + + L +N + E+L D +GGFGS PKFP+ + +L +E
Sbjct: 190 SVYLRPKPISETLAENLIETAMEKLIGYSDREWGGFGSEPKFPQEPNLLFLL---DIIER 246
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ + +V L + GG++D GGGFHRY+VD+RW VPHFEKMLY+Q QLA
Sbjct: 247 DSRPLDRQPAWTVVKTALDALLAGGVYDQAGGGFHRYAVDQRWLVPHFEKMLYNQAQLAR 306
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
++ A+ L++D Y ICR+ LDY+ R+M P G +SA DADS EG +EG ++V
Sbjct: 307 CFIRAYKLSQDPEYLRICRETLDYVLREMRSPEGVFYSATDADS---EG----EEGKYFV 359
Query: 320 WTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W +E+ +L + E Y + GN F+G N+L SA+
Sbjct: 360 WAYQELSQLLDTPGLALAEQVYGVTRKGN------------FEGANILYLPRPLQKSAAT 407
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG+ E+ L L + + L RS+R P DDKVI WNG++I++ A + I A
Sbjct: 408 LGLTYEELLQQLADLKAILLQTRSQRVPPLRDDKVITEWNGMMIAALAETAAITGISA-- 465
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDDYAF 496
Y + A AA+ + R E HR+ S N PS L+DY
Sbjct: 466 --------------YGDAAVIAANQLWRSQRGEDGLFHRI--SLDNLPSDD-ALLEDYVH 508
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT--TGEDPSVLLRVKEDH 554
+ GLL LY++ WL L T +E FLD E GG+F T + + P +L+R K
Sbjct: 509 YMEGLLQLYDYTHDHLWLERLEALTTTLEEQFLDAEQGGFFITPQSAQGP-LLVRSKHCS 567
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSK---SDYYRQN-AEHSLAVFETRLKDMAMAVPLM 609
D A SGNS +LAS++A + D Q AE+ +A F ++ ++ P+
Sbjct: 568 DNATISGNS-------QLASVLAALRLRTGDLNVQRMAENQIAAFTGQINRHPLSAPVF 619
>gi|313126304|ref|YP_004036574.1| hypothetical protein Hbor_15590 [Halogeometricum borinquense DSM
11551]
gi|448286147|ref|ZP_21477382.1| hypothetical protein C499_05218 [Halogeometricum borinquense DSM
11551]
gi|312292669|gb|ADQ67129.1| hypothetical protein containing a thioredoxin domain
[Halogeometricum borinquense DSM 11551]
gi|445575198|gb|ELY29677.1| hypothetical protein C499_05218 [Halogeometricum borinquense DSM
11551]
Length = 725
Score = 355 bits (911), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 233/700 (33%), Positives = 338/700 (48%), Gaps = 81/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED+ VA +LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFEDDDVAAVLNESFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
V+L+P KP GTYFP E++ R PGF + R +AW+ R+ + +
Sbjct: 113 VWLTPQGKPFYVGTYFPKEERRDRGNVPGFLDLCRSFAEAWENDREEIENRAQQWTAAIQ 172
Query: 137 EALSASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHS 194
+ L A+ P E P L A+ + D +GGFGS PKFP+P ++ +L
Sbjct: 173 DQLEATPDD---PGESPGTEILGEVAKAALRGADREYGGFGSGGPKFPQPGRVEALLRSY 229
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
SGE E + + TL MA GG++DHVGGGFHRY+ D +W VPHFEKMLYD
Sbjct: 230 V------HSGE-DEPLTVAMETLDAMAGGGMYDHVGGGFHRYATDRQWTVPHFEKMLYDN 282
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
++ VYL A LT Y+ + R+ D++ R++ P G FS DA S +E
Sbjct: 283 AEIPRVYLAAHRLTGRADYAEVARETFDFVARELRHPDGGFFSTLDAQSG-------GEE 335
Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
G FYVWT ++V + L + A +F ++Y + GN + G VL
Sbjct: 336 GTFYVWTPEQVHEALADETRAEVFCDYYGVTSGGNFE-----------NGTTVLTVSATV 384
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ A + G+ ++ + L R LFD R R RP D+KV+ WNGL+ISS A+ + +L
Sbjct: 385 DSVADEHGLTTDEVTDHLDAARETLFDTRESRTRPPRDEKVLAGWNGLMISSLAQGALVL 444
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
EY E+A A F R HL+DE RL F++G K G+L+
Sbjct: 445 GD-----------------EYAELAADALGFAREHLWDESEGRLSRRFKDGDVKGEGYLE 487
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL G DLY+ L +A+EL F D G + T + +++ R +E
Sbjct: 488 DYAFLARGAFDLYQATGDVDHLAFAVELAREIVASFYDDAAGTLYFTPDDGEALVTRPQE 547
Query: 553 DHDGAEPSGNSVSVINLVRL--------ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 604
D + PS V+ L+ L + VAGS D + R++ +
Sbjct: 548 LQDQSTPSSVGVATSLLLDLDAFAPDADFAAVAGSVLDTHAD-----------RIRGRPL 596
Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
+ AA+ + +V+ G F LA + + V+ I P +++
Sbjct: 597 EHVSLALAAEKRAR-GGSEIVVAGDSLPDSFRQSLAERY----VPDAVLSIRPPTDDDLT 651
Query: 665 FWEE----HNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
W + ++ R + V C+ +CSPP
Sbjct: 652 PWLDTLGVEDAPPVWQGREMRDGEPTV-YACEGRACSPPT 690
>gi|395645901|ref|ZP_10433761.1| hypothetical protein Metli_1447 [Methanofollis liminatans DSM 4140]
gi|395442641|gb|EJG07398.1| hypothetical protein Metli_1447 [Methanofollis liminatans DSM 4140]
Length = 690
Score = 355 bits (911), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 237/692 (34%), Positives = 342/692 (49%), Gaps = 65/692 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED GVA++LN+ FV++KVDREERPD+D VYM AL G GGWPL+
Sbjct: 55 STCHWCHVMAEESFEDAGVAEVLNEGFVAVKVDREERPDIDAVYMQVCLALTGRGGWPLT 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD P TY P E + G G +L+K++ W+ +RD L S ++ + L
Sbjct: 115 IVMTPDRLPFFAATYLPKETRLGVTGLIDVLKKIRHLWETRRDDLVGSA----REIVDDL 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A AS L + LR ++ + YD +GGF +PKFP P M+++ +
Sbjct: 171 GAGAS---LRGKAETALLREGYAEMKRRYDPSYGGFDRSPKFPSP---HMIIFLIRYWHW 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG + ++ TL+ + GGI D +G G HRY+ D +W VPHFEKMLYDQ LA
Sbjct: 225 TGDPMALAMAEQ----TLREVRGGGIFDQIGFGVHRYATDRKWLVPHFEKMLYDQAMLAL 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+ +A T D FY +I Y++RD+ P G ++AEDADS EG EG FY+
Sbjct: 281 AFTEAHMATGDAFYLSAADEIFTYVQRDLASPEGAFYTAEDADS---EGV----EGKFYL 333
Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT++EV + GE A LF E Y + G+ D+ PH + + +
Sbjct: 334 WTAEEVRSAVGGEDAALFIEAYGIG-EGSGDI-----PHRAVSPQVL----------SRT 377
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+P ++ L R KL VR R RPH D+K+++ WN L++++ ARA +
Sbjct: 378 TGIPEDEIRRRLEAVREKLLSVRKGRARPHRDEKILLDWNALMVAALARAGRY------- 430
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
S R Y+ A+ AA + L L H + +G + G L DYA+L+
Sbjct: 431 ---------SGRTGYVAAAQGAAGVLLDRLRRPDGG-LLHRYMDGEAAVSGMLADYAYLV 480
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L ++YE + L A L + E F D GGG++ + + ++LR KE HDGA
Sbjct: 481 WALAEVYEASFDPEILREACRLADAMIERFGDPSGGGFYTVSADGEQLILRQKEIHDGAL 540
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNS+++ LV L + S+ Y + + S F A A S
Sbjct: 541 PSGNSMALFALVTLFRLTGLSR---YWEASSSSFDAFAGDAGRNPSAHAWYMAALLAAST 597
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
S +V+ G ML +SY N TV+ D D E + A M+
Sbjct: 598 KS-DELVIAGEGDDPATRKMLDLVASSYRPNLTVLL---KDRRSADVLAEVAPHTALMSA 653
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
K A +C+ +C PVT P L+ +L
Sbjct: 654 QG---GKATAYLCRGTACEQPVTSPEDLDKIL 682
>gi|448414488|ref|ZP_21577557.1| hypothetical protein C474_02196 [Halosarcina pallida JCM 14848]
gi|445682054|gb|ELZ34478.1| hypothetical protein C474_02196 [Halosarcina pallida JCM 14848]
Length = 725
Score = 355 bits (910), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 230/695 (33%), Positives = 343/695 (49%), Gaps = 71/695 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE VA++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMAEESFEDEAVARVLNESFVPVKVDREERPDLDRIYQTICQLVSGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
V+L+P+ KP GTYFP E++ R PGF + +AW+ R+ + EQ +
Sbjct: 113 VWLTPEGKPFYVGTYFPKEERRDRGNVPGFLDLCESFANAWETDREEIENRA----EQWT 168
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKS----YDSRFGGFGS-APKFPRPVEIQMML 191
+AL + PDE+ + +++K+ D +GGFGS PKFP+P I+ +L
Sbjct: 169 DALKDQL--EETPDEVGEAPGTEVLGEVTKAALRGADREYGGFGSGGPKFPQPGRIEALL 226
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
SGE E + + L MA GG++DHVGGGFHRY+ D +W VPHFEKML
Sbjct: 227 RSYV------HSGE-EEPLDVAMEALDAMAGGGMYDHVGGGFHRYATDRQWTVPHFEKML 279
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD ++ VYL A LT Y+ + R+ D++ R++ P G +S DA S
Sbjct: 280 YDNAEIPRVYLAAHRLTGREAYADVARETFDFVARELRHPDGGFYSTLDAQS-------D 332
Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EG FYVWT +EV + L + A +F ++Y + GN + G VL
Sbjct: 333 GEEGTFYVWTPEEVRETLDDETRADVFCDYYGVTADGNFE-----------NGTTVLTVS 381
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
A + G+ E+ ++ L R LF+ R R RP D+KV+ WNGL++SS A+ S
Sbjct: 382 APIDEVAEERGLTTEEAVDHLDAARETLFEARESRTRPPRDEKVLAGWNGLMVSSLAQGS 441
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L EY E+A A F+R HL+D RL F++G K G
Sbjct: 442 LVLGD-----------------EYAELAADALGFVREHLWDSDEKRLSRRFKDGDVKGDG 484
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
+L+DYAFL G DLY+ L +A++L E F D G + T + +++ R
Sbjct: 485 YLEDYAFLARGAFDLYQATGDVDHLAFAVDLSRALVESFYDESAGTLYFTPADGETLVTR 544
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+E D + PS V+ L+ L S + + A L R++ + +
Sbjct: 545 PQELQDQSTPSSVGVAASLLLDLDSFAPDAD---FASVAGSVLDTHADRIRGRPLEHVSL 601
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW--- 666
A++ + + VV S+ + A A+ + +V+ + P +E+ W
Sbjct: 602 ALASEKRARGGSEIVV-----SADALPDSFREALATRYVPGSVLSVRPPTDDELAPWLDV 656
Query: 667 -EEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
+ + R + V C+ +CSPP
Sbjct: 657 LDLTEAPPVWKGREMRDGEPTV-YACEGRACSPPA 690
>gi|448639421|ref|ZP_21676747.1| thioredoxin [Haloarcula sinaiiensis ATCC 33800]
gi|445762700|gb|EMA13918.1| thioredoxin [Haloarcula sinaiiensis ATCC 33800]
Length = 717
Score = 355 bits (910), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 226/693 (32%), Positives = 349/693 (50%), Gaps = 60/693 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFEDE +A+ LN+ FV IKVDREERPD+D VYM+ Q + GGGGWPLS +
Sbjct: 58 CHWCHVMEEESFEDEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
L+P+ +P GTYFPPE+K G+PGF +L+++ ++W +++ +M AQ AIE
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLANSWSDPEQREEMENRAQQWTEAIESDL 177
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
EA A P++ ++ ++ + D + GG+GS PKFP+ + +L +
Sbjct: 178 EATPAD------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
D G+ + +V TL MA G++DHVGGGFHRY+ D++W VPHFEKMLYD
Sbjct: 229 AYSDGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKE 314
++ +L + Y+ + R+ ++++R++ P G FS DA+SA + +E
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPPDDPDGDSEE 344
Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
G FYVWT +EV + + + A +F +++ + GN F+G VL
Sbjct: 345 GLFYVWTPEEVHEAVDDETDAEVFCDYFGVTERGN------------FEGATVLAVRKPV 392
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ A + + L + F R RPRP D+KV+ WNGL+I + A + +L
Sbjct: 393 AVLAEEYDRSEDDITASLQRALNETFKARKSRPRPARDEKVLAGWNGLMIRALAEGAIVL 452
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+Y +VA A SF+R+HL+D RL +++ G+L+
Sbjct: 453 DD-----------------QYADVAADALSFVRKHLWDADAGRLNRRYKDDDVAIDGYLE 495
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL G L L+E + L +A++L E F D E G F T S++ R +E
Sbjct: 496 DYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQE 555
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D + PS V+V L+ L+ S+ D + AE + R+ + + A
Sbjct: 556 LTDQSTPSSTGVAVDLLLSLSHF---SEDDRFESVAERVIRTHADRVSSNPLQHASLTLA 612
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEH 669
D + + + LVG +S D+ A + + ++ PA+ + W E
Sbjct: 613 TDTYEQGALE-LTLVGDQS--DYPTEWTETLAEQYIPRRLLAHRPAEKSRFEQWLDTLEV 669
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + A D+ C+NF+CSPP D
Sbjct: 670 DESPPIWAGRTQVDDRPTVYACRNFACSPPKHD 702
>gi|317122770|ref|YP_004102773.1| hypothetical protein [Thermaerobacter marianensis DSM 12885]
gi|315592750|gb|ADU52046.1| hypothetical protein Tmar_1963 [Thermaerobacter marianensis DSM
12885]
Length = 738
Score = 355 bits (910), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 262/728 (35%), Positives = 362/728 (49%), Gaps = 101/728 (13%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME E FED +A+ +N FV++KVDREERPD+D+VY T Q L GGGWPL+V
Sbjct: 55 ACHWCHVMERECFEDPAIAEQMNRGFVNVKVDREERPDLDQVYQTAAQILGSGGGWPLTV 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PDLKP GTYFPPED++G PGF +L V DA+ +RD + + +E L +
Sbjct: 115 FLTPDLKPFFAGTYFPPEDRHGLPGFPKVLDAVLDAYRHRRDDVERVANRVVEILRRSAG 174
Query: 141 ASASSNKLPDELPQNA-----LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---- 191
++ + P ++ A ++++ YD ++GGFG APKFP + ++L
Sbjct: 175 GPGAAEEPAGAAPAREAARQWIQRAATRIARRYDPQYGGFGRAPKFPHATGLAVLLRAGV 234
Query: 192 --------------YHSKKLEDTGKSGEA-------SEGQK----MVLFTLQCMAKGGIH 226
S T +SG A E + M L TLQ MA GG+
Sbjct: 235 ARTPGGPGPSGTTGSGSSGSPGTARSGTADLVAGDVPENPRRHLDMALHTLQAMALGGLF 294
Query: 227 DHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR 286
DH+ GGFHRY+ D W +PHFEKMLYDQ QL +YLDA+ LT D FY+ + R L ++
Sbjct: 295 DHLAGGFHRYATDRAWLIPHFEKMLYDQAQLVPLYLDAYRLTGDPFYAGVARQTLHFVLD 354
Query: 287 DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKP 344
+M P G S DADS EG +EGA+YVWT ++ + LG + A L + +
Sbjct: 355 EMTAPEGGFISTLDADS---EG----REGAYYVWTPDQLREALGDPDEAALAARWFGVTE 407
Query: 345 TGNCDLSRMSDPHNEFKGKNVL---IELNDSSASASKLGMPLEKYLNILGECRRKLFDVR 401
GN + G VL + D A A + G ++ L RR+L D R
Sbjct: 408 EGNFE-----------DGTTVLYRAVADQDLPALAREWGTNRDELQRRLESIRRRLLDAR 456
Query: 402 SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAA 461
+R P DDK++V WNGL+I++FA+A+ +L D Y A AA
Sbjct: 457 RRRTPPGRDDKILVGWNGLMIAAFAQAAPVL----------------DEPGYAAAARRAA 500
Query: 462 SFIRRHLYDEQTH-RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 520
FI L + H RL H++R P PGFL DYAFLI GLL L+ +WL A L
Sbjct: 501 EFILGTL--RRPHGRLLHAYRGRPLDVPGFLPDYAFLIGGLLALHAADGDPRWLEEADRL 558
Query: 521 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 580
E F D G +++ E + L+R E D A P+G++ + L RLA I +
Sbjct: 559 ARPMIETFWDDAAGVFYDAPEEAGTPLVRPVELFDQALPAGSAAAATVLARLAVI---TG 615
Query: 581 SDYYRQNAEHSLAVFETRLKDMAMAVP-LMCCAADMLSVPSRKHVVLVGHKSS---VDFE 636
+ YR+ AE L + +A+ + AD L V LVG ++ ++
Sbjct: 616 DEEYRRIAEAYLRRAAALAAEQPLAMASTVLLQADQLE--GYTEVTLVGDPAAPVLAEWR 673
Query: 637 NMLAAAHASYDLNKTVIHIDPAD--TEEMDFWEEHNSNNASMARNNFSADKVVALVCQNF 694
L A + L V+ + P D TE WE + + + VA VC+NF
Sbjct: 674 RRL----AGFYLPGLVLTVRPPDAGTERRAVWEGRDPVDG----------RPVAYVCRNF 719
Query: 695 SCSPPVTD 702
SCS P TD
Sbjct: 720 SCSLPQTD 727
>gi|448658484|ref|ZP_21682884.1| thioredoxin [Haloarcula californiae ATCC 33799]
gi|445761209|gb|EMA12458.1| thioredoxin [Haloarcula californiae ATCC 33799]
Length = 717
Score = 355 bits (910), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 226/693 (32%), Positives = 349/693 (50%), Gaps = 60/693 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFEDE +A+ LN+ FV IKVDREERPD+D VYM+ Q + GGGGWPLS +
Sbjct: 58 CHWCHVMEEESFEDEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
L+P+ +P GTYFPPE+K G+PGF +L+++ +W +++ +M AQ AIE
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLSGSWSDPEQREEMENRAQQWTEAIESDL 177
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
EA A P++ ++ ++ + D + GG+GS PKFP+ + +L +
Sbjct: 178 EATPAD------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
D G+ + +V TL MA G++DHVGGGFHRY+ D++W VPHFEKMLYD
Sbjct: 229 AYADGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKE 314
++ +L + Y+ + R+ ++++R++ P G FS DA+SA + +E
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPPDDPDGDSEE 344
Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
G FYVWT +EV + + + A +F +++ + GN F+G VL
Sbjct: 345 GLFYVWTPEEVHEAVDDETDAEVFCDYFGVTERGN------------FEGATVLAVRKPV 392
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ A + + L + F+ R RPRP D+KV+ WNGL+I + A + +L
Sbjct: 393 AVLAEEYDRSEDDITASLQRALNETFEARKSRPRPARDEKVLAGWNGLMIRALAEGAIVL 452
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+Y +VA A SF+R+HL+D RL +++ G+L+
Sbjct: 453 DD-----------------QYADVAADALSFVRKHLWDADAGRLNRRYKDDDVAIDGYLE 495
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL G L L+E + L +A++L E F D E G F T S++ R +E
Sbjct: 496 DYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQE 555
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D + PS V+V L+ L+ S+ D + AE + R+ + + A
Sbjct: 556 LTDQSTPSSTGVAVDLLLSLSHF---SEDDRFESVAERVIRTHADRVSSNPLQHASLTLA 612
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEH 669
D + + + LVG +S D+ A + + ++ PA+ + W E
Sbjct: 613 TDTYEQGALE-LTLVGDQS--DYPTEWTETLAEQYIPRRLLAHRPAEKSRFEQWLDTLEV 669
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + A D+ C+NF+CSPP D
Sbjct: 670 DESPPIWAGRTQVDDRPTVYACRNFACSPPKHD 702
>gi|336477876|ref|YP_004617017.1| hypothetical protein [Methanosalsum zhilinae DSM 4017]
gi|335931257|gb|AEH61798.1| protein of unknown function DUF255 [Methanosalsum zhilinae DSM
4017]
Length = 704
Score = 355 bits (910), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 231/698 (33%), Positives = 347/698 (49%), Gaps = 62/698 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED +A ++N F+ IKVDREERPD+D +YM Q + GWP++
Sbjct: 55 STCHWCHVMEEESFEDPKIADMMNRTFICIKVDREERPDIDSMYMKICQQMTERCGWPMT 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V ++P P TY P + G ++ ++ + W ++D + ++L+
Sbjct: 115 VIMTPGKVPFFISTYVPKKSGLAGIGMADLIPQIAEIWKTRQDEIVNKTEEIKQRLNRIT 174
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + + P++ ++ L+ YD +GGFG APKFP P I +L H +
Sbjct: 175 AAPEGAEYIS---PKDVIQKGYHLLAHYYDQNYGGFGRAPKFPAPHNIMFLLRHWNYTGN 231
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
T + KM TL M GGI DHVG GFHRYS DE+W +PHFEKML DQ LA
Sbjct: 232 T-------DALKMAETTLTSMQLGGIFDHVGYGFHRYSTDEKWKLPHFEKMLNDQALLAL 284
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ T Y R IL Y+ RDM G +SAEDADS EG EG FY+
Sbjct: 285 AYTEAYQATGKKVYENTARKILRYVLRDMRSEKGGFYSAEDADS---EGV----EGKFYL 337
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT E+ IL E A L + +K GN + + G N+L ++S
Sbjct: 338 WTEDEIRYILTPEEADLVCRVFNVKREGNF----AEESTGKLTGNNILYMKGETSEIVEP 393
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
E+ +L + KL++VRS R P DDK++ WNGL+I++ A+A S
Sbjct: 394 TEKENEEIQKLLNQALDKLYEVRSARVHPLKDDKILTDWNGLMIAALAKA---------S 444
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
F P EY+E A++ FI ++YD + +L H + + GF+DDYA +
Sbjct: 445 GAFQEP-------EYVEYAKTCTKFILDNMYD-GSGKLLHRYHRENAGIDGFVDDYAAFV 496
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG-YFNTTGEDPSVLLRVKEDHDGA 557
GL++LYE K+L A+E+ + F D +G G YF + +++R E D +
Sbjct: 497 WGLIELYEATFEEKYLQKALEINDYFISHFQDEKGRGFYFTSNDRSGDLIVRSMEICDTS 556
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNS++V+N++RLA + + A LA + ++ + A S
Sbjct: 557 MPSGNSMAVLNILRLAKMTGDHNLESVASEAIRHLAA---AISHNPISSTYLLSAFYFAS 613
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPAD-----TEEMDFWEEHNSN 672
P + V+ ++ D M+ A ++ + + V + PAD TE + + +E
Sbjct: 614 EPGCEVVIAAEIDNAKD---MIEALQTNF-IPQCVYLLRPADSSESFTETIGYLKEMKGI 669
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N A A VC+N++CS PVTD + + +L+
Sbjct: 670 NGRPA----------AYVCRNYTCSSPVTDAVEMMDLI 697
>gi|118575698|ref|YP_875441.1| thioredoxin [Cenarchaeum symbiosum A]
gi|118194219|gb|ABK77137.1| thioredoxin [Cenarchaeum symbiosum A]
Length = 676
Score = 354 bits (909), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 238/700 (34%), Positives = 349/700 (49%), Gaps = 84/700 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE+E +A ++N+ F++IKVDREERPD+D +Y Q G GGWPLS
Sbjct: 52 SACHWCHVMAHESFENENIADIMNENFINIKVDREERPDIDDIYQKGCQLATGQGGWPLS 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
FL+PD KP GTY PP +GR GF++ILR++ AW +K + + +E L
Sbjct: 112 AFLTPDRKPFYIGTYIPPSSSHGRNGFESILRQLSQAWKEKPGDIKGTAEKFLETLRGGE 171
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A+A P E ++ L A L + D+ GGFG APKFP I + +
Sbjct: 172 RATA-----PAEPDRSVLDEAAVNLLQMADTTHGGFGRAPKFPGSANISFLFRY------ 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
GK S+ + L TL MA+GGI D VGGGFHRYS DERW PHFEKMLYD +
Sbjct: 221 -GKLSGISKFTRFALLTLDRMARGGIFDQVGGGFHRYSTDERWLAPHFEKMLYDNALIPV 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y +A+ +T Y I LDY+ R++ P G +S++DAD TEG +EG +YV
Sbjct: 280 NYAEAYQVTGSPAYLRIMEKTLDYVLRELSSPEGGFYSSQDAD---TEG----EEGRYYV 332
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W+ KEV++ILG A F Y + GN ++GK +L SA A +
Sbjct: 333 WSKKEVKEILGADADAFCMFYDVTDGGN------------WEGKTILYNGAAPSAVAFQC 380
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ + + I+ KL + RS R P LDDKV+ SWN L++++ AR +
Sbjct: 381 GITVGELDGIIERSAAKLLEARSGRVPPGLDDKVLASWNSLMVTALARGYR--------- 431
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR---LQHSFRNGPSKAPGFLDDYAF 496
S Y++ A FI D + HR L +++ G ++ PG+LDD+A+
Sbjct: 432 -------ASGEARYLDAARRCLGFI-----DAKMHRDGALMRTYK-GEARIPGYLDDHAY 478
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
LLD +E + ++L A E+ + + F D E GG+F T+ +++R + +D
Sbjct: 479 YGCALLDAFEVDAEERYLRRASEIGSHLVQNFWDEERGGFFMTSDVHEGLIVRPRSGYDL 538
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADM 615
+ PSGNS + ++RL Y+ E L E + A A A M
Sbjct: 539 SLPSGNSAAAHLMLRL----------YHLTGDESCLKTAERTMSSQAQAAAENPFAFGHM 588
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L+V H++ + +D + A L + ++ I+ A ++D +
Sbjct: 589 LNV-MYMHILGPAEITVLDKGGEIPRGLAEKFLPEALL-INVASQGQLD----------A 636
Query: 676 MARNNFSADK-----VVALVCQNFSCSPPVTDPISLENLL 710
++R F A K A +C+N +CS P +E LL
Sbjct: 637 LSRYPFFAGKSFGGNSTAYICRNKTCSAPQDTMNGVEALL 676
>gi|448424193|ref|ZP_21582319.1| hypothetical protein C473_04874 [Halorubrum terrestre JCM 10247]
gi|445682858|gb|ELZ35271.1| hypothetical protein C473_04874 [Halorubrum terrestre JCM 10247]
Length = 742
Score = 354 bits (909), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 236/723 (32%), Positives = 344/723 (47%), Gaps = 88/723 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA ++N+ FV IKVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
+ +P+ KP GTYFPPE + PGF+ + ++ D+W ++ D A+S
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARD 172
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
+E + + + + + L A + YD GGFGS KFP P I +
Sbjct: 173 ELESVPTPEAVGSDGEETASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDL 232
Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
++ A G+ +L TL MA GG++D +GGGFHRY+VD +W VP
Sbjct: 233 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 281
Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
HFEKMLYD +L YLD + L D Y+ + + L +L R++ GG FS DA S
Sbjct: 282 HFEKMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRP 341
Query: 306 TEG----------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMS 354
EG EGAFYVWT +EV+ +L E A L KE Y ++ GN +
Sbjct: 342 PEGRRGDDTGDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE----- 396
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
+G V A+ ++ L R LFD R +RPRP D+KV+
Sbjct: 397 ------RGTTVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVL 450
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQ 472
+WNG IS+FARA L + Y E+A A F R LYD +
Sbjct: 451 AAWNGRAISAFARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESE 493
Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
T L + +G + PG+LDDYAF+ G LD+Y + L +A+EL + + F D +
Sbjct: 494 TGALARRWLDGDVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDAD 553
Query: 533 GGGYFNTTGEDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD- 582
G + T D ++ R +E D + PS V+ L +++ G ++D
Sbjct: 554 DGTIYFTRDRDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDG 609
Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 642
R+ AE + R++ + + AA+++ V + + D+ L
Sbjct: 610 ELREIAERVVTTHADRIRGSPLEHASLVRAANVVET-GGIEVTIAADEVPDDWRETLGER 668
Query: 643 HASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPP 699
+ L ++ PA + +D W + A+ A + + A VC+ F+CSPP
Sbjct: 669 Y----LPGALVAPRPATEDGLDEWLDRLDMTAAPPIWADRGATDGEPTAYVCEGFTCSPP 724
Query: 700 VTD 702
TD
Sbjct: 725 RTD 727
>gi|335436727|ref|ZP_08559519.1| hypothetical protein HLRTI_06517 [Halorhabdus tiamatea SARL4B]
gi|335437369|ref|ZP_08560149.1| hypothetical protein HLRTI_09692 [Halorhabdus tiamatea SARL4B]
gi|334896155|gb|EGM34310.1| hypothetical protein HLRTI_09692 [Halorhabdus tiamatea SARL4B]
gi|334897442|gb|EGM35575.1| hypothetical protein HLRTI_06517 [Halorhabdus tiamatea SARL4B]
Length = 715
Score = 354 bits (909), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 234/700 (33%), Positives = 349/700 (49%), Gaps = 70/700 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ A +LN+ FV IKVDREERPDVD++Y T Q L GGWPLSV+
Sbjct: 55 CHWCHVMAEESFEDDETAAVLNENFVPIKVDREERPDVDRIYQTLAQLLDQQGGWPLSVW 114
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD +P GTYFPP+ + GRPGF +L ++ W+ R+ + Q + +S L
Sbjct: 115 LTPDGRPFYVGTYFPPDSRGGRPGFAELLEDLQATWENDREGIEQRADQWADAISGELEG 174
Query: 142 S--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
+ A+ + DEL LR A+ ++ D GGFGS PKFP+P +Q++L +
Sbjct: 175 TPDAARDTAGDEL----LRSGADAAVRTADREQGGFGSGGPKFPQPGRLQLLLRADARFG 230
Query: 199 DT----GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
D G++ EA+E + ++ TL M GG++DHVGGGFHRY+ D W VPHFEKMLYD
Sbjct: 231 DARREEGENAEATEYRSILTETLDAMVDGGLYDHVGGGFHRYATDRSWTVPHFEKMLYDN 290
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
++ V L+A+ T D Y+ + R+ D+L R++ P G +S DA S EG +E
Sbjct: 291 AEIPRVLLEAYRATGDERYARVARETFDFLDRELGHPEGGFYSTLDARS---EG----EE 343
Query: 315 GAFYVWTSKEVEDILGEHA--ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
G FYVWT +V +++ + L E Y + GN + G+ VL
Sbjct: 344 GKFYVWTPAQVREVIDDETDVSLVCERYGITEEGNFE-----------DGQTVLTIAASV 392
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
A++ G+ + L R +LFD RS+R RP D+K++ WNGL IS+ A S L
Sbjct: 393 DELAARSGLGAGEVRERLDRAREELFDARSERTRPPRDEKILAGWNGLAISALAEGSLTL 452
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
G+D +++ A A F+R L+D+ L+ + +G + G+L+
Sbjct: 453 --------------GND---FLDRAVDALEFVRETLWDDDAGLLKRRYIDGDVRVDGYLE 495
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS----VLL 548
DYAFL G LD Y L +A++L + F D++ G + T S +L
Sbjct: 496 DYAFLARGALDCYGASGDLDHLAFALDLAREIETRFFDKDVGTLYFTEAPGESRETDLLA 555
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R +E D + PS V+V LV L V + + E + AV ET +A A PL
Sbjct: 556 RPQELTDRSTPSSAGVAVDVLVTLDEFVP------HDRFGEIASAVLETHHSAIA-AEPL 608
Query: 609 ----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
+ A D + S + + + + + + + L V+ P ++
Sbjct: 609 QHASLVLAGDRDANGS-TELTVASDEIPAAWRDRIGETY----LPARVLARRPPTEAGLE 663
Query: 665 FWEEHN--SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
W E + + + C++F+CS P+ D
Sbjct: 664 TWLEQFELGEAPPIFAGRLAEEDATIYACRDFTCSRPLHD 703
>gi|336254491|ref|YP_004597598.1| hypothetical protein Halxa_3105 [Halopiger xanaduensis SH-6]
gi|335338480|gb|AEH37719.1| protein of unknown function DUF255 [Halopiger xanaduensis SH-6]
Length = 730
Score = 354 bits (909), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 233/698 (33%), Positives = 347/698 (49%), Gaps = 65/698 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF+DEGVA++LN+ FV IKVDREERPD+D +YMT Q + G GGWPLS
Sbjct: 53 SACHWCHVMEEESFQDEGVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVSGRGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK--------KRDMLAQSGAFA 131
+L+P+ KP GTYFP E + G+PGF + ++ D+W+ + D ++
Sbjct: 113 AWLTPEGKPFFIGTYFPREGQRGQPGFLDLCERISDSWNSEDREEMEHRADQWTEAAKDR 172
Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 190
+E E A ++ E+ L A +S D +GGFGS PKFP+P +Q +
Sbjct: 173 LEDTPEGAGAGGAAEPPSSEV----LETAASAALRSADREYGGFGSDGPKFPQPARLQAL 228
Query: 191 LYHSKKLEDTGKSGEASEGQKMVL-FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
++ + TG+ E + VL TL MA GG++DHVG GFHRY VD W VPHFEK
Sbjct: 229 ---ARAYDRTGR-----EAYREVLEETLDAMAAGGLYDHVGSGFHRYCVDRDWTVPHFEK 280
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
MLYD ++ +L + LT D Y+ + + L ++ R++ G FS DA S + E
Sbjct: 281 MLYDNAEIPRAFLTGYQLTGDERYAEVVAETLAFVDRELTHEEGGFFSTLDAQSEDPETG 340
Query: 310 TRKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
R +EGAFYVWT EV + L + A LF + Y + +GN F+G+N
Sbjct: 341 ER-EEGAFYVWTPDEVREALEDETTADLFCDRYDITESGN------------FEGRNQPN 387
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
+ A + + + L R +LF R RPRP+ D+KV+ WNGL+I++ A
Sbjct: 388 RVRPIDDLADEYDLEESEVQKRLETAREQLFAAREGRPRPNRDEKVLAGWNGLMIATCAE 447
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 487
A+ +L G D +Y ++A A F+R L++E RL +++G K
Sbjct: 448 AALVL--------------GDD--QYADMAVDALDFVRDRLWNESEQRLNRRYKDGDVKV 491
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
G+L+DYAFL G L YE L +A+EL + F D + G + T S++
Sbjct: 492 DGYLEDYAFLARGALGCYEATGEVDHLRFALELARVVEAEFWDADRGTLYFTPESGESLV 551
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
R +E D + P+ V+V L+ L + + A L +++ ++
Sbjct: 552 TRPQELGDQSTPAATGVAVEVLLALDEFT----DEDFEGIAATVLETHANKIEANSLEHT 607
Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW- 666
+C AAD L + + V ++ D + AS + PA E ++ W
Sbjct: 608 TLCLAADRLESGALEVTV-----AADDLPDEWRDRFASRYFPDRLFARRPATEEGLEDWL 662
Query: 667 EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
+E A A + VC++ +CSPP D
Sbjct: 663 DELGLEEAPPIWAGREARDGEPTLYVCRDRTCSPPTHD 700
>gi|448506299|ref|ZP_21614409.1| hypothetical protein C465_02621 [Halorubrum distributum JCM 9100]
gi|448525080|ref|ZP_21619498.1| hypothetical protein C466_12493 [Halorubrum distributum JCM 10118]
gi|445699949|gb|ELZ51967.1| hypothetical protein C465_02621 [Halorubrum distributum JCM 9100]
gi|445700052|gb|ELZ52067.1| hypothetical protein C466_12493 [Halorubrum distributum JCM 10118]
Length = 742
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 236/723 (32%), Positives = 343/723 (47%), Gaps = 88/723 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA ++N+ FV IKVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
+ +P+ KP GTYFPPE + PGF+ + ++ D+W ++ D A+S
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARD 172
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
+E + + + + L A + YD GGFGS KFP P I +
Sbjct: 173 ELESVPTPETVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDL 232
Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
++ A G+ +L TL MA GG++D +GGGFHRY+VD +W VP
Sbjct: 233 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 281
Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
HFEKMLYD +L YLD + L D Y+ + + L +L R++ GG FS DA S
Sbjct: 282 HFEKMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRP 341
Query: 306 TEG----------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMS 354
EG EGAFYVWT +EV+ +L E A L KE Y ++ GN +
Sbjct: 342 PEGRRGDDTGDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE----- 396
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
+G V A+ ++ L R LFD R +RPRP D+KV+
Sbjct: 397 ------RGTTVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVL 450
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQ 472
+WNG IS+FARA L + Y E+A A F R LY D +
Sbjct: 451 AAWNGRAISAFARAGDTLG-----------------EPYAEIAREALEFCRERLYDADRE 493
Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
T L + +G + PG+LDDYAF+ G LD+Y + L +A+EL + + F D +
Sbjct: 494 TGALARRWLDGDVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDAD 553
Query: 533 GGGYFNTTGEDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD- 582
G + T D ++ R +E D + PS V+ L +++ G ++D
Sbjct: 554 DGTIYFTRDRDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDG 609
Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 642
R+ AE + R++ + + AA+++ V + + D+ L
Sbjct: 610 ELREIAERVVTTHADRIRGSPLEHASLVRAANVVET-GGIEVTIAADEVPDDWRETLGER 668
Query: 643 HASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPP 699
+ L ++ PA + +D W + A+ A + + A VC+ F+CSPP
Sbjct: 669 Y----LPGALVAPRPATEDGLDEWLDRLDMTAAPQIWADRGATDGEPTAYVCEGFTCSPP 724
Query: 700 VTD 702
TD
Sbjct: 725 RTD 727
>gi|53803351|ref|YP_114889.1| hypothetical protein MCA2477 [Methylococcus capsulatus str. Bath]
gi|53757112|gb|AAU91403.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath]
Length = 679
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 238/696 (34%), Positives = 350/696 (50%), Gaps = 76/696 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
+ CHWCHVM ESFEDE A+++N FV+IKVDREERPD+D++Y T Q L GGGWPL
Sbjct: 53 SACHWCHVMAHESFEDEATAEVMNRLFVNIKVDREERPDLDRIYQTVHQLLSRRGGGWPL 112
Query: 79 SVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
+V L+P DL P GTYFP E +YG P F ++L + + + R LA++G E L E
Sbjct: 113 TVCLNPHDLVPFFTGTYFPKEPRYGMPAFVSVLHHLAAFYAEHRGDLARNGQVLREAL-E 171
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A+ +PD L + L S+D+ GGFG APKFPR +++++L
Sbjct: 172 AMGREGDGALMPD---AGLLARATQALRTSFDASHGGFGGAPKFPRTADLELLLRSD--- 225
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
EG +M+ TL MA+GGI+DH+GGGF RYSVDERW +PHFEKMLYD G L
Sbjct: 226 ---------GEGVEMLRTTLDGMARGGIYDHLGGGFARYSVDERWEIPHFEKMLYDNGPL 276
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+Y + T D Y+ + +++ R+M P G ++A DADS EG EG F
Sbjct: 277 LELYARMAAQTGDPAYAVVATGTAEWVIREMQSPEGGYYAALDADS---EGG----EGRF 329
Query: 318 YVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W +EV+ +L + ++F Y L N F+G L A A
Sbjct: 330 YLWDRQEVQGLLSADEYLVFSLRYGLDGPPN------------FEGHWHLRVARSLEAVA 377
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ G ++ +L R +L R +R RP DDKVI +WNGL++ A ++L
Sbjct: 378 AATGKGGDEVTRLLESARTRLRRAREQRVRPGRDDKVIAAWNGLMVRGMTVAGRLLG--- 434
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
R ++ME A+ A F+RR + + RL +R+G ++ +LDD+AF
Sbjct: 435 -------------RADFMESADRALGFVRRTM--DAGGRLMSVYRDGRARFDAYLDDHAF 479
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ L++ + T L WA+ L + E F D E GG+F T + +++ R K D
Sbjct: 480 LLDAALEILQTRWSTDDLEWAVSLADRLLERFEDAEHGGFFFTAADHETLIQRPKPWMDE 539
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAADM 615
+ PSGN V++ L+RLA + S+ Y AE L + A LM +
Sbjct: 540 SMPSGNGVAIRALIRLAGLTGESR---YADAAERGLRAAHGAMARYPHAHCALMNAVREW 596
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L+ P V+L G + ++ A A + +++ P+D + ++
Sbjct: 597 LTPPPL--VILRGGREALK----QWCAKAREAAPEALVYAIPSDAVGL---------PSA 641
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+A VA VC+ C+ P TD + N +L
Sbjct: 642 LAARMPGPGGPVAYVCRGRVCAAP-TDSLGTLNEIL 676
>gi|448479213|ref|ZP_21604065.1| hypothetical protein C462_01682 [Halorubrum arcis JCM 13916]
gi|445822491|gb|EMA72255.1| hypothetical protein C462_01682 [Halorubrum arcis JCM 13916]
Length = 742
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 236/723 (32%), Positives = 343/723 (47%), Gaps = 88/723 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA ++N+ FV IKVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
+ +P+ KP GTYFPPE + PGF+ + ++ D+W ++ D A+S
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARD 172
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
+E + + + + L A + YD GGFGS KFP P I +
Sbjct: 173 ELESVPTPEAVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDL 232
Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
++ A G+ +L TL MA GG++D +GGGFHRY+VD +W VP
Sbjct: 233 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 281
Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
HFEKMLYD +L YLD + L D Y+ + + L +L R++ GG FS DA S
Sbjct: 282 HFEKMLYDNAELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRP 341
Query: 306 TEG----------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMS 354
EG EGAFYVWT +EV+ +L E A L KE Y ++ GN +
Sbjct: 342 PEGRRGDDTGDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE----- 396
Query: 355 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 414
+G V A+ ++ L R LFD R +RPRP D+KV+
Sbjct: 397 ------RGTTVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVL 450
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQ 472
+WNG IS+FARA L + Y E+A A F R LYD +
Sbjct: 451 AAWNGRAISAFARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESE 493
Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 532
T L + +G + PG+LDDYAF+ G LD+Y + L +A+EL + + F D +
Sbjct: 494 TGALARRWLDGDVRGPGYLDDYAFVACGALDVYAATGDPEPLGFALELADALVDEFYDAD 553
Query: 533 GGGYFNTTGEDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD- 582
G + T D ++ R +E D + PS V+ L +++ G ++D
Sbjct: 554 DGTIYFTRDRDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDG 609
Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 642
R+ AE + R++ + + AA+++ V + + D+ L
Sbjct: 610 ELREIAERVVTTHADRIRGSPLEHASLVRAANVVET-GGIEVTIAADEVPDDWRETLGER 668
Query: 643 HASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPP 699
+ L ++ PA + +D W + A+ A + + A VC+ F+CSPP
Sbjct: 669 Y----LPGALVAPRPATEDGLDEWLDRLDMTAAPPIWADRGATDGEPTAYVCEGFTCSPP 724
Query: 700 VTD 702
TD
Sbjct: 725 RTD 727
>gi|448540737|ref|ZP_21623658.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-646]
gi|448549039|ref|ZP_21627815.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-645]
gi|448555786|ref|ZP_21631715.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-644]
gi|445708890|gb|ELZ60725.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-646]
gi|445713728|gb|ELZ65503.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-645]
gi|445717309|gb|ELZ69027.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-644]
Length = 703
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 227/694 (32%), Positives = 343/694 (49%), Gaps = 72/694 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAEVLNEEFVPVKVDREERPDLDRIYQNICQQVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+P+ KP GTYFPPE + G PGF+ I+ ++W RD + +++ L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDIVESFAESWRTDRDEIENRADQWTSAITDRL 172
Query: 140 SASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 197
+ + P E P + L + + D GGFG PKFP+P I +L
Sbjct: 173 EETPDT---PGEAPGSDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL------ 223
Query: 198 EDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
G A G++ L +L MA GG+ DH+GGGFHRY VD W VPHFEKMLYD
Sbjct: 224 -----RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYD 278
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q LA+ YLDA LT + Y+ + + +++RR++ G F+ DA S +
Sbjct: 279 QAGLASRYLDAARLTGNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GE 331
Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT +V D+L E A LF + Y + P GN F+ K ++ ++ +
Sbjct: 332 EGTFYVWTPDDVRDLLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSAT 379
Query: 373 SAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+A + + + + L + R+ LF R R RP D+KV+ WNGL+IS+FA+ S +
Sbjct: 380 TAELVDEYDLDESEVEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVV 439
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L+ ++ + SD A A F+R L+D++T L NG K G+L
Sbjct: 440 LEDDS---------LASD-------ARRALDFVRERLWDDETETLSRRAMNGEVKGDGYL 483
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYAFL G DLY+ L +A++L F D + G + T S++ R +
Sbjct: 484 EDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQ 543
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
E D + PS V+ + L + + + A+ L F R++ + +
Sbjct: 544 EPTDQSTPSSLGVATSLFLDLEQFAPNAD---FGEVADAVLGSFANRVRGSPLEHVSLAL 600
Query: 612 AADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EE 668
AA+ + VP + + + ++ LA+ + L V+ P E+D W +E
Sbjct: 601 AAEKAASGVP---ELTVAADEVPDEWRATLASRY----LPGLVVSRRPGTDAELDAWLDE 653
Query: 669 HNSNNAS--MARNNFSADKVVALVCQNFSCSPPV 700
+ A A + + C+NF+CS P
Sbjct: 654 LGLDEAPPIWAGREAADGEPTVYACENFTCSAPT 687
>gi|448726262|ref|ZP_21708672.1| hypothetical protein C448_06453 [Halococcus morrhuae DSM 1307]
gi|445795880|gb|EMA46400.1| hypothetical protein C448_06453 [Halococcus morrhuae DSM 1307]
Length = 709
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 228/688 (33%), Positives = 338/688 (49%), Gaps = 53/688 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF+D VA+ LN FV IKVDREERPD+D++Y T + G GGWPLS
Sbjct: 51 SACHWCHVMADESFDDPVVAERLNKDFVPIKVDREERPDLDRLYQTVAAMVSGQGGWPLS 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PD +P GTYFP + K G+PGF +L + D+WD +R+ + + ++ L
Sbjct: 111 VWLTPDGRPFYVGTYFPRKAKRGQPGFLDLLDSIADSWDDEREDIEGRADQWADAMAGEL 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S P E+ L A++ D GGFG KFP+ + +++ + E
Sbjct: 171 EGTPDS---PGEVSPGLLETAAQRAVSDADREHGGFGRGQKFPQTGRLHLLM---QAYER 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ +++ + L MA GG+ DH GGGFHRY D W VPHFEKMLYD +L
Sbjct: 225 TGRDA----FREVAVEALDAMADGGLRDHAGGGFHRYVTDREWTVPHFEKMLYDNAELVR 280
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y+ + LT + Y+ I R+ L ++ R++ P G FS DA S + +EGAFYV
Sbjct: 281 AYIAGYRLTGEERYAEIARETLGFVERELRHPDGGFFSTLDAQSEGE--SGEHEEGAFYV 338
Query: 320 WTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
WT EV + + + A LF E Y + GN + GK VL A
Sbjct: 339 WTPPEVHEAIDDEFAADLFCERYGITEAGNFE-----------DGKTVLTLDTAIDGLAD 387
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ G E+ L R +F R+ R RP D+KV+ WNGL+IS+FA A L
Sbjct: 388 EHGTTTEEIEADLERAREAIFAARTDRDRPARDEKVLAGWNGLMISAFAEAGLALD---- 443
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ Y E A +A F+R L+DE +L F+ G K G+L+DYAFL
Sbjct: 444 -------------ETYGETAVAALDFVREQLWDEDEQQLARRFKGGEVKIDGYLEDYAFL 490
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L+ YE ++L +A++L F D E G + T S++ R +E D +
Sbjct: 491 ARGALNCYEATGEVEYLTFALDLGRAVVREFFDAEEGTLYFTPQSGESLVARPQELDDQS 550
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PS V+V L+ L+ G + + + AE L ++ + + AAD +
Sbjct: 551 TPSSTGVAVDTLLALSQFAPGEE---FGEIAETVLETHAESIEASPLRRASLALAADRHT 607
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS-- 675
S + + +V + ++ + + L K ++ P E+D W + S + +
Sbjct: 608 AGSLE-LTIVADELPTEWRERIGRTY----LPKRLLARRPPTDAELDGWLDRLSLDDAPP 662
Query: 676 -MARNNFSADKVVALVCQNFSCSPPVTD 702
A + A VC+ F+CSPP T+
Sbjct: 663 IWADRTGENGEPTAYVCRAFTCSPPQTE 690
>gi|225559995|gb|EEH08277.1| DUF255 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 804
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 226/593 (38%), Positives = 313/593 (52%), Gaps = 71/593 (11%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
CHVME ESF VA +LN F+ IK+DREERPD+D VYM YVQA G GGWPL+VFL+P
Sbjct: 112 CHVMEKESFMSPEVAAILNKAFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTP 171
Query: 85 DLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 136
DL+P+ GGTY+P P G+ F IL K++D W ++ +S QL
Sbjct: 172 DLEPVFGGTYWPGPHSSASSTLGGEGQVTFIDILEKLRDVWQTQQLRCRESAKDITRQLQ 231
Query: 137 EALSASASSNKL-------PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQM 189
E + + +KL ++L L + + YD GGF APKFP P +
Sbjct: 232 E-FAEEGTYSKLRGAGADEEEDLEVELLEEAYKHFASRYDPVNGGFSRAPKFPTPANLSF 290
Query: 190 MLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
++ S+ + D E + +M + TL +++GGIHDH+G GF RYSV W +PH
Sbjct: 291 LVNLSRFPSAVADIVGYEECAHALEMAIKTLISISRGGIHDHIGHGFARYSVTTDWSLPH 350
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 305
FEKMLYDQ QL VY DAF D DI Y+ ++ P G S+EDADS
Sbjct: 351 FEKMLYDQAQLLGVYTDAFDSAHDPELLGAMYDIAAYITSPPVLSPTGGFHSSEDADSLP 410
Query: 306 TEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
T T K+EGAFYVWT KE + ILG+ A + H+ + P GN + R++DPH+EF +N
Sbjct: 411 TPSDTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGNVE--RVNDPHDEFINQN 468
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVIS 423
VL A + G+ E+ + I+ KL + R SKR RP LDDK+IV+WNGL I
Sbjct: 469 VLNIQTTPGKLAKEFGLSEEEVVRIIKASTEKLREYRESKRVRPALDDKIIVAWNGLAIG 528
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
+ A+ S +L + V +E+ AE+AA FIR+ L+D + +L +R
Sbjct: 529 ALAKCSVVLDN----------VDRIKAQEFRLAAENAAKFIRQSLFDPASGQLWRIYRGE 578
Query: 484 P-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 542
PGF DDYA+LISGL+DLYE +L +A +LQ+
Sbjct: 579 ERGDTPGFADDYAYLISGLIDLYEATFDDSYLQFAEQLQH-------------------- 618
Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
+ PS N V NL+RL++++ + D YR+ A +++ F
Sbjct: 619 -------------ASTPSPNGVIARNLLRLSTLL---EDDTYRRLARDTVSAF 655
>gi|448624555|ref|ZP_21670503.1| thioredoxin domain containing protein [Haloferax denitrificans ATCC
35960]
gi|445749760|gb|EMA01202.1| thioredoxin domain containing protein [Haloferax denitrificans ATCC
35960]
Length = 703
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 233/701 (33%), Positives = 348/701 (49%), Gaps = 82/701 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
V+L+P+ KP GTYFPPE + G PGF+ ++ ++W R+ + A+ AI ++L
Sbjct: 113 VWLTPEGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDREEIENRAEQWTSAITDRL 172
Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
E ++ A +++ D Q ALR D GGFG PKFP+P I +L
Sbjct: 173 EETPDVAGEAPGSEVLDTTVQAALR--------GADRDHGGFGGDGPKFPQPGRIDALL- 223
Query: 193 HSKKLEDTGKSGEASEGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
G A G++ L +L MA GG+ DH+GGGFHRY VD W VPHFE
Sbjct: 224 ----------RGYAVSGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFE 273
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYDQ LA YLDA LT + Y+ + + ++RR++ G F+ DA S
Sbjct: 274 KMLYDQAGLAARYLDAARLTGNESYATVAAETFAFVRRELTHDDGGFFATLDAQSG---- 329
Query: 309 ATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
+EG FYVWT +V ++L E A LF + Y + P GN F+ K ++
Sbjct: 330 ---GEEGTFYVWTPDDVRELLPELDADLFCDRYGVTPGGN------------FENKTTVL 374
Query: 368 ELNDSSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
++ ++A A + + + L + R+ LF R R RP D+KV+ WNGL+IS+FA
Sbjct: 375 NVSATTADLAEEYDLAESEVEARLEKARKALFAAREGRDRPARDEKVLAGWNGLMISAFA 434
Query: 427 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
+ S +L+ ++ + A A F+R L+D++T L NG K
Sbjct: 435 QGSVVLEDDS----------------LADDARRALDFVRERLWDDETETLSRRVMNGEVK 478
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
G+L+DYAFL G DLY+ L +A++L F D + G + T S+
Sbjct: 479 GDGYLEDYAFLARGAFDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESL 538
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
+ R +E D + PS V+ + L + D + A+ L F R++ +
Sbjct: 539 VTRPQEPTDQSTPSSLGVATSLFLDLEQF---APEDGFGDVADAVLGSFANRVRGSPLEH 595
Query: 607 PLMCCAADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
+ AA+ + VP + + + ++ LA+ + L V+ P EE+D
Sbjct: 596 VSLALAAEKAASGVP---ELTVAADEVPDEWRETLASRY----LPGLVVSRRPGTDEELD 648
Query: 665 FW-EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
W +E + A A + + C+NF+CS P D
Sbjct: 649 AWLDELGLDEAPPIWAGREAADGEPTVYACENFTCSAPTHD 689
>gi|403747071|ref|ZP_10955267.1| hypothetical protein URH17368_2612 [Alicyclobacillus hesperidum
URH17-3-68]
gi|403120377|gb|EJY54770.1| hypothetical protein URH17368_2612 [Alicyclobacillus hesperidum
URH17-3-68]
Length = 628
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 241/693 (34%), Positives = 341/693 (49%), Gaps = 68/693 (9%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFEDE VA+ LN ++SIKVDREERPD+D +YMTY QA+ G GGWPL+V L+PD
Sbjct: 1 MAHESFEDEQVAQYLNQHYISIKVDREERPDIDHIYMTYCQAVTGEGGWPLTVILTPDGH 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFP +YGRPG ILR ++ WD++R+ L + A + ++ +A
Sbjct: 61 PFFAGTYFPKNARYGRPGLLEILRVMRQKWDEEREKLVSASAELVTRMQPIFAA------ 114
Query: 148 LPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
+P E+ ++A R A L + +D +GGFG APKFP ++ +L +S+ D G
Sbjct: 115 MPGEVDGKHAARQAASTLRERFDHAYGGFGDAPKFPAFHQVMFLLRYSRFASDQG----- 169
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
++M L TL + +GGI DHVGGG RYS D W VPHFEKMLYD Y +A+
Sbjct: 170 --ARQMALDTLDAIMRGGIADHVGGGIARYSTDAFWRVPHFEKMLYDNALAITAYTEAYQ 227
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
+T++ Y I+ +L R++ G +SA DADS EG +EG FYVW ++V
Sbjct: 228 VTRNPRYRRFVEQIVTFLERELTSREGAFYSALDADS---EG----QEGRFYVWRPEDVT 280
Query: 327 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEK 385
LG+ E Y C ++D N F+G +V ++ D A AS M +
Sbjct: 281 AALGDED---GEWY-------CAFYDITDEGN-FEGYSVPNYVDRDIPAFASARNMSEGE 329
Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
L E RKL++ R R P LDDK++ +WN L IS A+A + E
Sbjct: 330 LWQWLDEANRKLYEWREHREHPGLDDKILTAWNALAISGLAKAGAVFADE---------- 379
Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
++ +A A + L + RL +R+ + + DD+A+LI+ LDLY
Sbjct: 380 ------HWLGLAVRAVQALETLLVRKPDGRLLARYRDQDAAVFAYADDHAYLIAAYLDLY 433
Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
E +L A Q+ D LF D EG GYF + ++ + K +DGA PS NSV+
Sbjct: 434 EATLDPFYLRRAQHWQSVLDTLFWDSEGSGYFLYGRDAERLIAQPKTVYDGATPSANSVA 493
Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
NL RL ++V + Y + L F T L + A L A ML VV
Sbjct: 494 AHNLQRLYALVG---DEAYADRLDRLLHAFGTWLME-APVDHLWLVTAAMLRDLGTTEVV 549
Query: 626 LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADK 685
D M A H ++ L + V+ A N NA +AD+
Sbjct: 550 WSSVPGRGDVRAMATAFHLAF-LPEAVLLTPSA---------RPNGENAYPP----AADE 595
Query: 686 VVALVCQNFSCSPPVTD-PISLENLLLEKPSST 717
+ VC++F C P D ++ NL+ P T
Sbjct: 596 ALVYVCRHFHCERPEADVAATIANLVANPPRLT 628
>gi|55377924|ref|YP_135774.1| thioredoxin [Haloarcula marismortui ATCC 43049]
gi|55230649|gb|AAV46068.1| thioredoxin domain containing protein [Haloarcula marismortui ATCC
43049]
Length = 733
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 229/709 (32%), Positives = 352/709 (49%), Gaps = 76/709 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFEDE +A+ LN+ FV IKVDREERPD+D VYM+ Q + GGGGWPLS +
Sbjct: 58 CHWCHVMEEESFEDEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
L+P+ +P GTYFPPE+K G+PGF +L+++ +W +++ +M AQ AIE
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLSGSWSDPEQRAEMENRAQQWTEAIESDL 177
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
EA A P++ ++ ++ + D + GG+GS PKFP+ + +L +
Sbjct: 178 EATPAD------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
D G+ + +V TL MA G++DHVGGGFHRY+ D++W VPHFEKMLYD
Sbjct: 229 AYADGGQ----EDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA----------E 305
++ +L + Y+ + R+ ++++R++ P G FS DA+SA +
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPHSESRSDSEQ 344
Query: 306 TEGATRK-------KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDP 356
+ G + + +EG FYVWT ++V D + + A +F ++Y + GN
Sbjct: 345 SSGESPRDDPDGETEEGLFYVWTPEQVHDAVDDETDADIFCDYYGVTEQGN--------- 395
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
F+G VL A + ++ L + F+ R RPRP D+KV+
Sbjct: 396 ---FEGATVLAVRKPVPVLAEEYERSEDEITASLQRALNETFEARKDRPRPARDEKVLAG 452
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+I + A + +L +Y +VA A SF+R HL+D RL
Sbjct: 453 WNGLMIRALAEGAIVLDD-----------------QYADVAADALSFVREHLWDADAGRL 495
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+++ G+L+DYAFL G L L+E + L +A++L E F D E G
Sbjct: 496 NRRYKDDDVAIDGYLEDYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTL 555
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F T S++ R +E D + PS V+V L+ L+ S+ D + AE +
Sbjct: 556 FFTPTGGESLVARPQELTDQSTPSSTGVAVDLLLSLSHF---SEDDRFESVAERVIRTHA 612
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
R+ + + A D + + V LVG +S D+ A + + ++
Sbjct: 613 DRVSSNPLQHASLTLATDTYEQGALE-VTLVGDQS--DYPTEWTETLAEQYIPRRLLAHR 669
Query: 657 PADTEEMDFW---EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
PA+ + W E + + A D+ C+NF+CSPP D
Sbjct: 670 PAEKSRFEQWLDTLEVDESPPIWAGRTQVDDRPTVYACRNFACSPPKHD 718
>gi|448729708|ref|ZP_21712022.1| hypothetical protein C449_08002 [Halococcus saccharolyticus DSM
5350]
gi|445794670|gb|EMA45214.1| hypothetical protein C449_08002 [Halococcus saccharolyticus DSM
5350]
Length = 721
Score = 352 bits (904), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 236/689 (34%), Positives = 346/689 (50%), Gaps = 54/689 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE VA+ LND FV IKVDREERPD+D++Y T + G GGWPLS
Sbjct: 52 SACHWCHVMEDESFEDEAVAERLNDDFVPIKVDREERPDLDRLYQTICGMVSGQGGWPLS 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
V+L+PD +P GTYFP + K G+PGF +L + ++W D + D+ ++ +A E
Sbjct: 112 VWLTPDGRPFYVGTYFPRDAKRGQPGFLDLLDSIAESWEDDREDVEGRADQWAGAMAGE- 170
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A+ + D + L A+Q +S D +GGFG KFP+ + +++ + E
Sbjct: 171 --LEATPEQPGDPPGSDLLETAAQQAVESADREYGGFGRGQKFPQTGRLHLLM---RAAE 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG++ ++ TL MA GG+ DHVGGGFHRY+ D W VPHFEKMLYD +L
Sbjct: 226 RTGRAV----FDEVARETLDAMADGGLRDHVGGGFHRYTTDREWTVPHFEKMLYDNAELV 281
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL + T+ Y+ + R+ L ++ R++ P G FS DA S + G +EGAFY
Sbjct: 282 RAYLAGYRRTEAERYAEVARETLGFVERELHHPDGGFFSTLDAQSEDESG--EHEEGAFY 339
Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT EV D + + A LF E Y + TGN + G VL D A
Sbjct: 340 VWTPDEVHDAVDDEFAADLFCERYGVTETGNFE-----------DGTTVLTLSADIEDLA 388
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ E+ L R +F R++R RP D+K++ WNGL+IS+FA A L +
Sbjct: 389 DEHDTTAEEIEAELERARETVFAARAERARPARDEKILAGWNGLMISAFAEAGLTLDA-- 446
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ + A +A FIR HL+D++ RLQ +++ K G+L+DYAF
Sbjct: 447 ---------------RFADTAVTALDFIREHLWDDEEKRLQRRYKDEDVKIDGYLEDYAF 491
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L G L+ YE L +A++L T + F D E + T S++ R +E D
Sbjct: 492 LARGALNCYEATGDVDHLAFALDLARTIETEFWDSEEETLYFTPQTGESLVARPQELDDQ 551
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
+ PS V+V L+ L + D + A SL ++ + + AAD
Sbjct: 552 STPSSTGVAVDVLLALDHF---TPDDRFEGIATTSLETHAKTVESSPLRRASLALAADRH 608
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNN 673
+ S + V+ E + SY L + ++ P +E+ W + +
Sbjct: 609 AAGSLEWTVVSDGVPDAWRERI----GRSY-LPRRLLARRPPSDKELATWCDRLGLDDPP 663
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
A A + + A VC++F+CSPP TD
Sbjct: 664 AIWADRDQRDGEPTAYVCRSFTCSPPQTD 692
>gi|337293410|emb|CCB91399.1| uncharacterized protein yyaL [Waddlia chondrophila 2032/99]
Length = 691
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 213/555 (38%), Positives = 295/555 (53%), Gaps = 53/555 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLS 79
TCHWCHVME ESF++ VA+ LN F++IKVDREE P+VD++YM + QAL GWPL+
Sbjct: 55 TCHWCHVMEEESFQNLEVAEQLNRAFINIKVDREELPEVDQLYMDFAQALMPNSAGWPLN 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
VFL+PDL P TY PP + G PG +++ + + W K D + ++ +
Sbjct: 115 VFLTPDLLPFFATTYLPPRNASGLPGMIDLIQHIHELWIGKGHDQILMQAQQIVDLFQQN 174
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ LPD + + L + L + D +GG APKFP + + L H LE
Sbjct: 175 IQVYGID--LPD---RKCVPLAVDTLLQISDPVWGGVKGAPKFPIGYQY-VFLMHYSALE 228
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G+ +V TL+ M +GGI+DH+G GF RYS+DE+W +PHFEKMLYD LA
Sbjct: 229 KDGRP------MFLVEKTLELMYRGGIYDHLGSGFSRYSIDEQWQIPHFEKMLYDNALLA 282
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK + +C +++DY+ + G G SAEDADS EG EG FY
Sbjct: 283 ECYCEAWKATKRSLHRRVCCEVIDYVLSKLTGEQGAFLSAEDADS---EGV----EGKFY 335
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
WT E++D+LG + + LF Y TGN F+GKN+L AS
Sbjct: 336 TWTMDEIDDVLGSDDSELFCSVYGATATGN------------FEGKNILHLPALLEHYAS 383
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
M + + E + KL+ VR KR P DDKV+ SWNGL+I S A K +
Sbjct: 384 DNQMDHFELEARIAELKEKLYKVREKRGHPLKDDKVLSSWNGLMIHSIVEAGKAFEI--- 440
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y++ AA FI HL+ + RL +R G G LDDYAF+
Sbjct: 441 -------------SRYVDAGRRAARFIYGHLW--KNGRLLRRYREGKVDFSGGLDDYAFM 485
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I L L+E G GT+WL WA ++ + F EGG ++ T G+DP++++R DGA
Sbjct: 486 IRASLTLFEAGCGTEWLEWAFSMERVLRDAF-KAEGGAFYQTDGKDPNLIIRQCLFADGA 544
Query: 558 EPSGNSVSVINLVRL 572
EPSGN+V NL+R+
Sbjct: 545 EPSGNAVHCENLLRI 559
>gi|344211988|ref|YP_004796308.1| thioredoxin domain-containing protein [Haloarcula hispanica ATCC
33960]
gi|343783343|gb|AEM57320.1| thioredoxin domain-containing protein [Haloarcula hispanica ATCC
33960]
Length = 717
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 227/693 (32%), Positives = 349/693 (50%), Gaps = 60/693 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+E +A+ LN+ FV IKVDREERPD+D VYM+ Q + GGGGWPLS +
Sbjct: 58 CHWCHVMEEESFENEAIAEQLNEHFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLS 136
L+P+ +P GTYFPPE+K G+PGF +L+++ D+W +++ +M AQ AIE
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRAQQWTEAIESDL 177
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSK 195
EA A+ P++ ++ ++ + D + GG+GS PKFP+ + +L +
Sbjct: 178 EATPAN------PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---R 228
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
D G+ + +V TL MA G++DHVGGGFHRY+ D++W VPHFEKMLYD
Sbjct: 229 AHADGGQEDYLT----VVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNA 284
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKE 314
++ +L + Y+ + R+ ++++R++ P G FS DA+S E +E
Sbjct: 285 EIPRAFLAGYQAIGSERYASVVRETFEFVQRELQHPDGGFFSTLDAESVPPEDPDGDSEE 344
Query: 315 GAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
G FYVWT ++V D + + A +F CD +++P N F+G VL
Sbjct: 345 GLFYVWTPEQVHDAVDDETDADIF-----------CDYYGVTEPGN-FEGATVLAVRKPV 392
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
S A + ++ L + F+ R +RPRP D+KV+ WNGL+I + A + +L
Sbjct: 393 SVLAEEYEQSEDEITASLQRALNETFEAREERPRPARDEKVLAGWNGLMIRALAEGAIVL 452
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
A SF+R HL+D RL +++G G+L+
Sbjct: 453 DDAYADVA-----------------ADALSFVREHLWDADAERLNRRYKDGDVAIDGYLE 495
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL G L L+E + L +A++L E+F D + G F T S++ R +E
Sbjct: 496 DYAFLGRGALTLFEATGNVEHLAFAMDLGQAITEVFWDDDEGTLFFTPTGGESLVARPQE 555
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D + PS V+V L+ L+ S D + AE + R+ + + A
Sbjct: 556 LTDQSTPSSTGVAVDLLLSLSHF---SDDDRFETVAERVIRTHADRVSSNPLQHASLTLA 612
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEH 669
D + + + LVG +S D+ + A + + ++ PAD + W E
Sbjct: 613 TDTYEQGALE-LTLVGDQS--DYPSEWTETLAQRYVPRRLLAHRPADDTGFEQWLDALEL 669
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + A D+ C+NF+CSPP D
Sbjct: 670 DESPPIWAGREQVDDEPTVYACRNFACSPPKHD 702
>gi|347735180|ref|ZP_08868108.1| hypothetical protein AZA_58766 [Azospirillum amazonense Y2]
gi|346921671|gb|EGY02301.1| hypothetical protein AZA_58766 [Azospirillum amazonense Y2]
Length = 686
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 232/684 (33%), Positives = 332/684 (48%), Gaps = 72/684 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE++ ++ L+ND F++IKVDREERPDVD+VY + L GGWPL++F
Sbjct: 59 CHWCHVMAHESFENQAISSLMNDLFINIKVDREERPDVDQVYQQALSLLGQQGGWPLTMF 118
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+P +P GGTYFPP +YGRPGF +L+ V + + + ++++ ++ L +AL+
Sbjct: 119 LTPKGEPFWGGTYFPPATRYGRPGFPDVLQGVAETYAQDPGKVSRN----VKALGDALAR 174
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ N D + +L A++L + D GG APKFP+P ++ + T
Sbjct: 175 LSRGNP-GDAVTVGSLNAVADRLVREVDPFLGGINGAPKFPQPSIFDLLWRAHLRTART- 232
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ + V+ TL MA GGI+DH+ GGF RYS DE+W VPHFEKMLYD QL +
Sbjct: 233 ------DLRDAVITTLTHMANGGIYDHLAGGFARYSTDEQWLVPHFEKMLYDNAQLVALM 286
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
+ T+D R+ + ++ +M PGG + DADS EG +EG FYVWT
Sbjct: 287 TQVWQGTRDPLLEVRVRETVGWVLNEMKVPGGAFGATLDADS---EG----EEGRFYVWT 339
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
E++ +LGE A LF HY + GN ++G + LN + A
Sbjct: 340 KAEIDRLLGEDAELFCAHYDVTELGN------------WEGHTI---LNRRTPLA----- 379
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA--ESA 439
P N L R +L R+ R RP DDKV+ WNGL+I++ ARA + + E+A
Sbjct: 380 PGSAEENRLAHARARLLKARALRIRPGWDDKVLADWNGLMIAALARAGFVFEQPGWIEAA 439
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ Y V S H + RL HS R G ++ G L+DYA +
Sbjct: 440 I----------DAYRHVVTSLG-----HTGRDGLDRLYHSGRGGRARHAGLLEDYANMGK 484
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
L L+E +L A +T D F D GGY+ T + +L+R + D A P
Sbjct: 485 AALTLHEITGDVAFLDQAARWTDTLDRHFWDAADGGYYTTADDVGDLLVRPRHAQDNAVP 544
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
+GN + NL RL + + D YR A+ ++ F L + A+ L
Sbjct: 545 AGNGTQLGNLTRLWLL---TGQDRYRAQADTLMSAFSGELGRNFFPLSTFLNMAETLL-- 599
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
+ H VLVG D E A A V + P + E H + +M
Sbjct: 600 NGMHAVLVGEGD--DLEPFNAVLRAQSRPTLVVSRLAPG----QNLPEPHPAAGKAMVDG 653
Query: 680 NFSADKVVALVCQNFSCSPPVTDP 703
+ A VCQ+ CS PVT P
Sbjct: 654 -----RATAYVCQDMRCSLPVTTP 672
>gi|282889930|ref|ZP_06298465.1| hypothetical protein pah_c008o011 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|338175432|ref|YP_004652242.1| hypothetical protein PUV_14380 [Parachlamydia acanthamoebae UV-7]
gi|281500123|gb|EFB42407.1| hypothetical protein pah_c008o011 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|336479790|emb|CCB86388.1| uncharacterized protein yyaL [Parachlamydia acanthamoebae UV-7]
Length = 692
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 218/585 (37%), Positives = 310/585 (52%), Gaps = 63/585 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL TCHWCHVME ESFE+ VA+ LN+ F++IKVDREE P+
Sbjct: 33 GDEAFLAAKEADKPIFLSVGYATCHWCHVMEQESFENLEVAQALNEAFINIKVDREELPE 92
Query: 59 VDKVYMTYVQALY-GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
VD +YM + Q++ G GWPL+V L+PDL P TY PP + +G G ++ ++ +AW
Sbjct: 93 VDSLYMEFAQSMMSGAAGWPLNVILTPDLYPFFAATYLPPVNSHGLIGMLELVERIHEAW 152
Query: 118 --DKKRDMLAQSGAFAIEQLSEALS--ASASSNKLPDELPQNALRLCAEQLSKSYDSRFG 173
D++ +L QS E++ E S LP P + E L K D G
Sbjct: 153 QGDERERILMQS-----EKIVEVFEQHVHTSGELLP---PPEVIEKTIEMLIKLADPVNG 204
Query: 174 GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
G APKFP + +L +S + +D S +V TL+ M +GGI+DH+GGGF
Sbjct: 205 GMKGAPKFPIAYQSVFLLRYSMEKKD-------SRPLFLVERTLEMMRRGGIYDHLGGGF 257
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
RYSVDE W +PHFEKMLYD LA+ Y +A+ T++ Y +C +IL Y+ RDM G
Sbjct: 258 SRYSVDEAWQIPHFEKMLYDNALLADCYFEAWQATQNPQYKKVCEEILHYVLRDMSHFRG 317
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWT--SKEVEDILGEHAILFKEHYYLKPTGNCDLS 351
+SAEDADS EG EG FY WT E + LF ++ + P GN
Sbjct: 318 GFYSAEDADS---EG----HEGRFYTWTLEEVEELLGGENESELFVHYFDITPEGN---- 366
Query: 352 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 411
F+G+NVL A K+GM ++ + E + L+ R KR P DD
Sbjct: 367 --------FEGRNVLHTPLSLEEFAKKMGMDAQQLDLLFTEQKHILWKAREKRVHPFKDD 418
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 471
K++ +WNGL+I + A A D++ ++ A+++A FI+ L++E
Sbjct: 419 KILTAWNGLMIQAMAEAG---------------CAFCDQR-FLSAAQNSAKFIKAKLWNE 462
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
H L +R+ + LD+YAFLI LL L+E G GT+WL WA+EL F
Sbjct: 463 --HGLLRRWRDDEAMFSAGLDEYAFLIRSLLTLFEAGCGTEWLQWALELNEILKNQF-KA 519
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
G Y+ T G+D S+++R + DGAEPSGN++ NL+RL +
Sbjct: 520 LNGAYYQTNGQDLSLVIRKCQFSDGAEPSGNAIQCENLLRLYQLT 564
>gi|10438196|dbj|BAB15192.1| unnamed protein product [Homo sapiens]
Length = 491
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 209/518 (40%), Positives = 283/518 (54%), Gaps = 48/518 (9%)
Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 271
M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA Y AF L+ D
Sbjct: 1 MALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDE 60
Query: 272 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 331
FYS + + IL Y+ R + G +SAEDADS G R KEGA+YVWT KEV+ +L E
Sbjct: 61 FYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLLPE 119
Query: 332 HAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
+ L +HY L GN +S DP E +G+NVL +A++ G+
Sbjct: 120 PVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGL 177
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
+E +L KLF R RP+PHLD K++ +WNGL++S +A +L
Sbjct: 178 DVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------- 228
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDD 493
G DR + A + A F++RH++D + RL + GP S P GFL+D
Sbjct: 229 -----GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGTVEHSNPPCWGFLED 281
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKE 552
YAF++ GLLDLYE + WL WA+ LQ+TQD LF D +GGGYF + E + L LR+K+
Sbjct: 282 YAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKD 341
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D DGAEPS NSVS NL+RL G K + L F R++ + +A+P M A
Sbjct: 342 DQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRA 398
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
+ K +V+ G + + D + ++ H+ Y NK +I AD + F
Sbjct: 399 LSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVLIL---ADGDPSSFLSRQLPF 454
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+++ R D+ A VC+N +CS P+TDP L LL
Sbjct: 455 LSTLRRLE---DQATAYVCENQACSVPITDPCELRKLL 489
>gi|418720670|ref|ZP_13279866.1| PF03190 family protein [Leptospira borgpetersenii str. UI 09149]
gi|410742944|gb|EKQ91689.1| PF03190 family protein [Leptospira borgpetersenii str. UI 09149]
Length = 631
Score = 352 bits (902), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 242/689 (35%), Positives = 351/689 (50%), Gaps = 65/689 (9%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE++ VA LN FVSIKVDREERPD+D++YM + A+ GGWPL++FL+PD K
Sbjct: 1 MEKESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P+ GGTYFPPE YGR F +L ++ W +KR L + + L ++ A +
Sbjct: 61 PITGGTYFPPEPGYGRKSFLEVLNILRKVWSEKRQELIVASSELSRYLKDSGEGRAIEKQ 120
Query: 148 LPDELPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKS 203
LP L +S YD+ FGGF + KFP + + +L YH S
Sbjct: 121 EEGSLPSKDCFNSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYH--------HS 172
Query: 204 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 263
+ +MV TL M +GGI+D VGGG RYS D RW VPHFEKMLYD ++
Sbjct: 173 SGNPKALEMVENTLLAMKRGGIYDQVGGGLCRYSTDHRWMVPHFEKMLYDNSLFLETLVE 232
Query: 264 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 323
++K + D++ YL RDM GG I SAEDADS EG +EG FY+W +
Sbjct: 233 CSQVSKKISAESFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYIWDFE 285
Query: 324 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 383
E ++ GE + + ++ + + GN F+GKN+L E A+KL
Sbjct: 286 EFREVCGEDSRILEKFWNVTNKGN------------FEGKNILHE--SYGGEATKLSEEE 331
Query: 384 EKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 442
K ++ +L R KL + RSKR RP DDK++ SWNGL I + A+A
Sbjct: 332 WKRIDSVLERARAKLLERRSKRVRPLRDDKILTSWNGLYIKALAKAG------------- 378
Query: 443 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 502
+ R++++++AE SFI R+L D R+ FR+G S G+ +DYA +IS +
Sbjct: 379 ---IAFRREDFLKLAEETYSFIERNLIDPDG-RILRRFRDGESGILGYSNDYAEMISSSI 434
Query: 503 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 561
L+E G G ++L A+ LF R G F TG D VLLR D +DG EPS
Sbjct: 435 VLFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEPSA 492
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
NS +LV+L+ + G S YR+ AE + F L +++ P + A S
Sbjct: 493 NSSLAYSLVKLS--LLGIDSVRYRKFAELIFSYFTKELSTHSLSYPHLLSAYWTYRYHS- 549
Query: 622 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNF 681
K +VL+ K + +++LAA + + ++ + EE +++ +
Sbjct: 550 KEIVLI-RKDANSGKDLLAAIQTRFLPDSVFAVVNENELEEA-------RKLSALFDSRD 601
Query: 682 SADKVVALVCQNFSCSPPVTDPISLENLL 710
S + VC+NFSC PV++ L+ +
Sbjct: 602 SGGNALVYVCENFSCKLPVSNLADLQKWI 630
>gi|389847202|ref|YP_006349441.1| hypothetical protein HFX_1748 [Haloferax mediterranei ATCC 33500]
gi|448614853|ref|ZP_21663881.1| hypothetical protein C439_01752 [Haloferax mediterranei ATCC 33500]
gi|388244508|gb|AFK19454.1| highly conserved protein containing a thioredoxin domain [Haloferax
mediterranei ATCC 33500]
gi|445752940|gb|EMA04359.1| hypothetical protein C439_01752 [Haloferax mediterranei ATCC 33500]
Length = 703
Score = 352 bits (902), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 232/698 (33%), Positives = 350/698 (50%), Gaps = 76/698 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPEIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
V+L+P KP GTYFPPE + G PGF+ ++ ++W RD + A+ AI ++L
Sbjct: 113 VWLTPQGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWRTDRDEIENRAEQWTHAITDRL 172
Query: 136 SEALSASASS--NKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY 192
E + + +++ D+ Q ALR + D GGFGS PKFP+P I +L
Sbjct: 173 EETPDTTGETPGSEILDQTVQAALR--------AADRDHGGFGSGGPKFPQPGRIDALL- 223
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
+ TG+ + + + L MA GG+ DH+GGGFHRY VD +W VPHFEKMLY
Sbjct: 224 --RGYAITGR----RQALDVAVEALDAMANGGLRDHLGGGFHRYCVDRQWTVPHFEKMLY 277
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
DQ LA+ YLDA+ LT + Y+ + R+ +++RR++ G F+ DA S
Sbjct: 278 DQAGLASRYLDAYRLTGNESYATVARETFEFVRRELSHDDGGFFATLDAQSG-------G 330
Query: 313 KEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+EG FYVWT ++V L E A LF + Y + P GN F+ K ++ ++
Sbjct: 331 EEGTFYVWTPEDVRSHLPELEADLFCDRYGVTPGGN------------FENKTTVLNVSA 378
Query: 372 SSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
++A A + + + L E +LF R+ R RP D+KV+ WNGL+IS+FA+ +
Sbjct: 379 TTADLAEEYDLTESEVEERLEEAHEELFAARTDRERPARDEKVLAGWNGLMISAFAQGAV 438
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
L ++ + A A F+R HL+DE + L NG K G+
Sbjct: 439 ALTDDS----------------LADDARRALDFVREHLWDEASETLSRRVMNGEVKGDGY 482
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYAFL G DLY+ + L +AI+L F D G + T +++ R
Sbjct: 483 LEDYAFLARGAFDLYQATGDLEPLSFAIDLARATHREFYDDAAGTLYFTPESGEALVTRP 542
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
+E D + PS V+ + L + + A+ L F R++ + +
Sbjct: 543 QEATDQSTPSSLGVATSLFLDLEHFAPDAG---FGDAADAVLESFANRVRGSPLEHVSLV 599
Query: 611 CAADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-- 666
AA+ + VP + + + ++ +A+ + L V+ PA +E+D W
Sbjct: 600 LAAEKAASGVP---ELTVAADEMPDEWRETIASRY----LPGLVVSRRPATDDELDAWLD 652
Query: 667 --EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
E + AR + V C+NF+CS P D
Sbjct: 653 ELELDEAPPIWAAREATDGEPTV-YACENFTCSAPTHD 689
>gi|300710941|ref|YP_003736755.1| hypothetical protein HacjB3_07890 [Halalkalicoccus jeotgali B3]
gi|448296966|ref|ZP_21487016.1| hypothetical protein C497_14832 [Halalkalicoccus jeotgali B3]
gi|299124624|gb|ADJ14963.1| hypothetical protein HacjB3_07890 [Halalkalicoccus jeotgali B3]
gi|445580643|gb|ELY35021.1| hypothetical protein C497_14832 [Halalkalicoccus jeotgali B3]
Length = 709
Score = 351 bits (901), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 235/688 (34%), Positives = 343/688 (49%), Gaps = 55/688 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE +AK LN+ FV IKVDREERPD+D +Y T Q + GGWPLS
Sbjct: 51 SACHWCHVMEEESFEDEDIAKQLNENFVPIKVDREERPDLDSIYQTICQLVTRRGGWPLS 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PD +P GTYFP E + G PGF +L + ++W+ R+ + +Q + A+
Sbjct: 111 VWLTPDGRPFYVGTYFPRESRRGTPGFGDLLGNLAESWEGDREEIENRA----DQWTRAI 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKLE 198
+ E P+ L A+ + D GGFG + PKFP+ ++++L + +
Sbjct: 167 TDQLEEVPEAGERPEGVLIEAADAALRGADREHGGFGQNGPKFPQTARLEVLL---RAYD 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ ++V TL M G++D +GGGFHRY+ D W VPHFEKMLYD +L
Sbjct: 224 RTGR----GPYDEVVRETLDAMGSRGMYDQLGGGFHRYATDREWVVPHFEKMLYDNAELP 279
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL + +T Y+ I R+ L ++ R++ P G +S DA S + E R +EGAFY
Sbjct: 280 RSYLAGYRVTGQERYARIVRETLAFVERELGHPDGGFYSTLDAQSEDPETGER-EEGAFY 338
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT VE++L E A LF E Y + GN F+GK VL + A
Sbjct: 339 VWTPAAVEEVLDEERAALFCERYGVDKRGN------------FEGKTVLTLARSVGSLAE 386
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ G+ ++ + L E R+LF+ R +RPRP D+KV+ WNGL+ISSFA A L
Sbjct: 387 EYGLDEDEVEDRLVEAERRLFEAREERPRPRRDEKVLAGWNGLMISSFAEAGLTLD---- 442
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
GS Y + A A F+R L+D + RL F++ K G+L+DYAFL
Sbjct: 443 ---------GS----YAKRAAEALEFVREQLWDTEGKRLSRRFKDREVKIDGYLEDYAFL 489
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G D Y+ + L +A++L + F D E + T ++ R +E +D +
Sbjct: 490 ARGAFDTYQATGDVEHLKFALDLARAIEREFWDEERETLYFTPEAGEELVARPQELNDQS 549
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PS V+ L+ L+ + E LA R++ + + AD
Sbjct: 550 TPSSLGVACDVLLSLSQFADAD----FEGIVERVLARHGDRIRGNPLEHATLALVADRFE 605
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EEHNSNNAS- 675
S + V + ++ L A+ L V+ P E ++ W +E A
Sbjct: 606 NGSLE-VTVAADVLPTEWRERLGEAY----LPGRVLARRPPTEEGLEGWLDELGLEEAPP 660
Query: 676 -MARNNFSADKVVALVCQNFSCSPPVTD 702
A + A VC++F+CSPPVTD
Sbjct: 661 IWADREAREGEATAYVCRSFTCSPPVTD 688
>gi|448677622|ref|ZP_21688812.1| thioredoxin [Haloarcula argentinensis DSM 12282]
gi|445773297|gb|EMA24330.1| thioredoxin [Haloarcula argentinensis DSM 12282]
Length = 717
Score = 351 bits (900), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 225/696 (32%), Positives = 346/696 (49%), Gaps = 66/696 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+E +A+ LN+ FV IKVDREERPD+D VYM+ Q + GGGGWPLS +
Sbjct: 58 CHWCHVMEEESFENEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAW 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEAL 139
L+P+ +P GTYFPPE+K G+PGF +L+++ +W ++R+ + E + L
Sbjct: 118 LTPEGEPFYVGTYFPPEEKRGQPGFGDLLQRLSGSWSDPEQREEMENRARQWTEAIESDL 177
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 198
A+ + P++ ++ ++ + D + GG+GS PKFP+ + +L
Sbjct: 178 EATPAD---PEDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL------- 227
Query: 199 DTGKSGEASEGQK----MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
A GQ+ +V TL MA G++DHVGGGFHRY+ D++W VPHFEKMLYD
Sbjct: 228 ----RAHAGGGQEDYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDN 283
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA---ETEGATR 311
++ +L + Y+ + R+ ++++R+M P G FS DA+SA E EG T
Sbjct: 284 AEIPRAFLAGYQAIGSERYASVVRETFEFVQREMQHPEGGFFSTLDAESAPIDEPEGET- 342
Query: 312 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EG FYVWT ++V + + + A +F +++ + GN F+G VL
Sbjct: 343 -EEGLFYVWTPEQVHEAVDDETDAEIFCDYFGVTERGN------------FEGATVLAVR 389
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
S A + ++ L + F+ R RPRP D+KV+ WNGL+I + A +
Sbjct: 390 KPVSVLAEEYDQSEDEITGSLQRALNEAFEARENRPRPARDEKVLAGWNGLMIRTLAEGA 449
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L A SF+R +L+D+ RL +++G G
Sbjct: 450 IVLDDAYADVA-----------------ADALSFVREYLWDDDAGRLNRRYKDGDVAIDG 492
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
+L+DYAFL G L L+E + L +A++L E F D E G F T S++ R
Sbjct: 493 YLEDYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVAR 552
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+E D + PS V+V L+ L+ S D + AE + R+ + +
Sbjct: 553 PQELTDQSTPSSTGVAVDLLLSLSHF---SDDDRFESVAERVIRTHADRVSSNPLQHASL 609
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A D + + + LVG +S D+ A + + ++ PAD + + W +
Sbjct: 610 TLATDTYEQGALE-LTLVGDQS--DYPTEWTETLAERYVPRRLLAHRPADEDRFEQWLDT 666
Query: 670 NSNNAS---MARNNFSADKVVALVCQNFSCSPPVTD 702
N S A D+ C+NF+CSPP D
Sbjct: 667 LGLNESPPIWAGRTQVDDRPTVYACRNFACSPPKHD 702
>gi|297621186|ref|YP_003709323.1| thymidylate kinase [Waddlia chondrophila WSU 86-1044]
gi|297376487|gb|ADI38317.1| putative thymidylate kinase [Waddlia chondrophila WSU 86-1044]
Length = 691
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 212/555 (38%), Positives = 294/555 (52%), Gaps = 53/555 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLS 79
TCHWCHVME ESF++ VA+ LN F++IKVDREE P+VD++YM + QAL GWPL+
Sbjct: 55 TCHWCHVMEEESFQNLEVAEQLNRAFINIKVDREELPEVDQLYMDFAQALMPNSAGWPLN 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEA 138
VFL+PDL P TY PP + G PG +++ + + W K D + ++ +
Sbjct: 115 VFLTPDLLPFFATTYLPPRNASGLPGMIDLIQHIHELWIGKGHDQILMQAQQIVDLFQQN 174
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ LPD + + L + L + D +GG APKFP + + L H LE
Sbjct: 175 IQVYGID--LPD---RKCVPLAVDTLLQISDPVWGGVKGAPKFPIGYQY-VFLMHYSALE 228
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G+ +V TL+ M +GGI+DH+G GF RYS+DE+W +PHFEKMLYD LA
Sbjct: 229 KDGRP------MFLVEKTLELMYRGGIYDHLGSGFSRYSIDEQWQIPHFEKMLYDNALLA 282
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y +A+ TK + +C +++DY+ + G G SAEDADS EG EG FY
Sbjct: 283 ECYCEAWKATKRSLHRRVCCEVIDYVLSKLTGEQGAFLSAEDADS---EGV----EGKFY 335
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
WT E++D+LG + + LF Y GN F+GKN+L AS
Sbjct: 336 TWTMDEIDDVLGSDDSELFCSVYGATAIGN------------FEGKNILHLPALLEHYAS 383
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
M + + E + KL+ VR KR P DDKV+ SWNGL+I S A K +
Sbjct: 384 DNQMDHFELEARIAELKEKLYKVREKRGHPLKDDKVLSSWNGLMIHSIVEAGKAFEI--- 440
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y++ AA FI HL+ + RL +R G G LDDYAF+
Sbjct: 441 -------------SRYVDAGRRAARFIYGHLW--KNGRLLRRYREGKVDFSGGLDDYAFM 485
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I L L+E G GT+WL WA ++ + F EGG ++ T G+DP++++R DGA
Sbjct: 486 IRASLTLFEAGCGTEWLEWAFSMERVLRDAF-KAEGGAFYQTDGKDPNLIIRQCLFADGA 544
Query: 558 EPSGNSVSVINLVRL 572
EPSGN+V NL+R+
Sbjct: 545 EPSGNAVHCENLLRI 559
>gi|448738600|ref|ZP_21720623.1| hypothetical protein C451_13731 [Halococcus thailandensis JCM
13552]
gi|445801484|gb|EMA51818.1| hypothetical protein C451_13731 [Halococcus thailandensis JCM
13552]
Length = 709
Score = 350 bits (898), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 226/689 (32%), Positives = 341/689 (49%), Gaps = 55/689 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF+D VA+ LN+ FV IKVDREERPD+D++Y T + G GGWPLS
Sbjct: 51 SACHWCHVMADESFDDPAVAEQLNEEFVPIKVDREERPDLDRLYQTVAAMVSGRGGWPLS 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PD +P GTYFP E K G+PGF +L + D+W+ +R+ + +Q ++A+
Sbjct: 111 VWLTPDGRPFYVGTYFPREAKRGQPGFLDLLDSIADSWNDEREDIESRA----DQWADAM 166
Query: 140 SAS-ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + P E+ L A++ D GGFG KFP+ + +++ + E
Sbjct: 167 AGELEGTPDTPGEVSPGLLETAAQRAVSEADREHGGFGRGQKFPQTGRLHLLM---QAHE 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ +++ + L +A GG+ DH GGGFHRY D W VPHFEKMLYD +L
Sbjct: 224 RTGRDA----FREVAVEALDAIADGGLRDHAGGGFHRYVTDREWTVPHFEKMLYDNAELV 279
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL + LT + Y+ I R+ L ++ R++ P G FS DA S + +EGAFY
Sbjct: 280 RAYLAGYRLTGEERYAEIARETLGFVERELRHPDGGFFSTLDAQSEGE--SGEHEEGAFY 337
Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT +EV + + + A LF E Y + GN + GK VL A
Sbjct: 338 VWTPQEVHEAVDDEFAADLFCERYGITEAGNFE-----------NGKTVLTIDTTIDGLA 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ G E+ L R +F R+ R RP D+K++ WNGL+IS+FA A L
Sbjct: 387 DEHGTTTEEIEADLERAREAIFAARADRERPARDEKILAGWNGLMISAFAEAGLALD--- 443
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ Y E A +A F+ L+DE +L F++G K G+L+DYAF
Sbjct: 444 --------------ETYSETAVAALGFVHEQLWDEDEQQLARRFKDGEVKIDGYLEDYAF 489
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L G L+ YE L +A++L F D E G + T S++ R +E D
Sbjct: 490 LARGALNCYEATGEVAQLEFALDLGRAIVREFFDGEEGTLYFTPRSGESLVARPQELDDQ 549
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
+ PS V+V L+ L+ + + + AE L ++ + + AAD
Sbjct: 550 STPSSTGVAVDTLLALSQF---APDEEFEDVAETVLETHAESIEASPLRRASLALAADRH 606
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS- 675
+ S + + +V + ++ + A+ L K ++ P+ E+D W + S + +
Sbjct: 607 TAGSLE-LTVVADELPGEWRERIGRAY----LPKRLLARRPSTNAELDDWLDRLSVDDAP 661
Query: 676 --MARNNFSADKVVALVCQNFSCSPPVTD 702
A + A VC+ F+CSPP T+
Sbjct: 662 PIWAERTGEDGEPTAYVCRAFTCSPPQTE 690
>gi|448502781|ref|ZP_21612730.1| hypothetical protein C464_11620 [Halorubrum coriense DSM 10284]
gi|445693844|gb|ELZ45985.1| hypothetical protein C464_11620 [Halorubrum coriense DSM 10284]
Length = 745
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 251/747 (33%), Positives = 349/747 (46%), Gaps = 106/747 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA ++ND FV IKVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAAVVNDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
+ +P+ KP GTYFPPE + +PGF+ + ++ D+W ++ D QS
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRRNQPGFRGLCERIADSWSDPEQREEMKRRADQWTQSARD 172
Query: 131 AIEQLSEALSASAS--SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEI 187
+E + AS + L D ALR YD +GGFGS KFP P I
Sbjct: 173 ELESVPTPAEGDASPPGSDLLDTAAAAALR--------GYDEEYGGFGSGGAKFPMPGRI 224
Query: 188 QMMLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWH 243
+++ A G+ +L TL MA GG++D VGGGFHRY+VD +W
Sbjct: 225 DLLM-----------RAYAGRGRDALLSAATGTLDGMADGGMYDQVGGGFHRYAVDRQWT 273
Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA-- 301
VPHFEKMLYD +L YLD + LT D Y+ + + L +L R++ GG FS DA
Sbjct: 274 VPHFEKMLYDNAELPMAYLDGYRLTGDPRYARVASESLAFLDRELRHEGGGFFSTLDARS 333
Query: 302 --------DSAETEGATRKK--------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKP 344
DS E A EGAFYVWT +EV+ +L E A L K+ Y ++
Sbjct: 334 RRPASRGSDSEADEEADVDAGNVGGDDVEGAFYVWTPEEVDAVLDEPAASLAKDRYGIRS 393
Query: 345 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 404
GN + +G V A+ + E L E R LFD R R
Sbjct: 394 GGNFE-----------RGTTVPTIAASVEGLAADRDLSPEAVRETLVEARTALFDARESR 442
Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
PRP D+KV+ SWNG IS+FARA L + Y E+A A F
Sbjct: 443 PRPARDEKVLASWNGRAISAFARAGDSLG-----------------EPYAEIAREALDFC 485
Query: 465 RRHLYDEQTHRLQHSFR--NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN 522
R LYD + R +G + PG+LDDYAFL G LD Y + L +A++L
Sbjct: 486 RERLYDADADAGALARRWLDGDVRGPGYLDDYAFLARGALDTYAATGDPEPLGFALDLAG 545
Query: 523 TQDELFLDREGGGYFNT------TGEDPS----VLLRVKEDHDGAEPSGNSVSVINLVRL 572
E F D + G + T T +D + ++ R +E D + PS V+ L L
Sbjct: 546 ALVEEFYDADDGTIYFTRDLDDGTADDRADAGPLIARPQEFTDRSTPSSLGVAAETLALL 605
Query: 573 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 632
A + +R+ AE + R++ + + AAD++ V + +
Sbjct: 606 DGFRADGE---FREIAERVVTTHGDRIRGSPLEHASLVRAADLVET-GGIEVTIAAAEVP 661
Query: 633 VDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVAL 689
++ L + L ++ P +D W + + A + + + A
Sbjct: 662 REWRETLGERY----LPGALVAPRPLTETGLDEWLDRLGMAEAPPIWADRDATDGEPTAY 717
Query: 690 VCQNFSCSPPVTD-PISLENLLLEKPS 715
VC+ F+CSPP TD +LE L +PS
Sbjct: 718 VCEGFTCSPPRTDLDAALEWLETREPS 744
>gi|448469568|ref|ZP_21600250.1| hypothetical protein C468_14982 [Halorubrum kocurii JCM 14978]
gi|445808905|gb|EMA58956.1| hypothetical protein C468_14982 [Halorubrum kocurii JCM 14978]
Length = 740
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 239/721 (33%), Positives = 349/721 (48%), Gaps = 86/721 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE +A +LND FV +KVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESIAAVLNDEFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
+ +P+ +P GTYFPPE + +PGF+ + ++ D+W +++ D S
Sbjct: 113 AWCTPEGEPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMERRADQWTTSARD 172
Query: 131 AIEQLSE-ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQ 188
+E + + +L+ A ++ P N L A + YD +GGFGS KFP P I
Sbjct: 173 ELESVPDPSLAGDAGGSEAPG---PNLLDEAAAAAVRGYDDEYGGFGSGGAKFPMPGRID 229
Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
+++ + TG+ + TL MA+GG++D +GGGFHRY+VD +W VPHFE
Sbjct: 230 VLM---RAYARTGRDAALT----AATGTLDGMARGGMYDQIGGGFHRYAVDRQWTVPHFE 282
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS----- 303
KMLYD +L YLDA LT D Y+ + + L ++ R++ G FS DA S
Sbjct: 283 KMLYDNAELPMAYLDAHRLTGDASYARVASETLGFIDRELRHDDGGFFSTLDARSRPPES 342
Query: 304 ----AETEGATRKK-----EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRM 353
A ++G+ + EGAFYVWT EV+ L E A L KE Y + GN +
Sbjct: 343 RRGNAGSDGSDAAEDVADVEGAFYVWTPGEVDAALDEPAASLAKERYGIASGGNFE---- 398
Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
+G V A + M L R LF+ R RPRP D+KV
Sbjct: 399 -------RGTTVPTIAASVPELADQRDMSTADVREALTAARVALFEARESRPRPARDEKV 451
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
+ SWNG IS+FA A ++L K Y ++A A +F R LYDE+T
Sbjct: 452 LASWNGRAISAFAAAGQVLG-----------------KPYADIASDALAFCRERLYDEET 494
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
L + +G + PG+LDD+AFL G LD Y L +A++L T F D +
Sbjct: 495 GGLARRWLDGDVRGPGYLDDHAFLARGALDAYSATGDPAALGFALDLAETVVSDFYDADD 554
Query: 534 GG-YFN------TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YR 585
G YF T D ++ R +E D + PS V+ L +++ G ++D +
Sbjct: 555 GTIYFTRDPDEETEQGDDTLFARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDREFA 610
Query: 586 QNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD-FENMLAAAHA 644
AE + R++ + + AAD ++ S V V + D + LA +
Sbjct: 611 DVAERVVTTHADRIRASPLEHVSLVRAADRVA--SGGIEVTVAADAVPDAWRETLAERY- 667
Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSPPVT 701
L ++ P + + W + + + A + + A VC+ +CSPP T
Sbjct: 668 ---LPGALVAPRPPTEDGLAAWLDRLGMDEAPPIWADRDAVDGEPTAYVCEGRTCSPPET 724
Query: 702 D 702
D
Sbjct: 725 D 725
>gi|448491519|ref|ZP_21608359.1| hypothetical protein C463_07017 [Halorubrum californiensis DSM
19288]
gi|445692519|gb|ELZ44690.1| hypothetical protein C463_07017 [Halorubrum californiensis DSM
19288]
Length = 746
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 242/729 (33%), Positives = 348/729 (47%), Gaps = 96/729 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA ++ND FV +KVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAGVVNDSFVPVKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAF 130
+ +P+ KP GTYFPPE + PGF+ + ++ D+W ++ D QS
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWTQSARD 172
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRF-GGFGSAPKFPRPVEIQM 189
+E + S + + L A + YD + G G KFP P I +
Sbjct: 173 ELESVPNP-DTPGSDGEAASPPGDDLLDTAAAAALRGYDEEYGGFGGGGAKFPMPGRIDL 231
Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
++ A G+ +L TL MA GG++D +GGGFHRY+VD +W VP
Sbjct: 232 LM-----------RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVP 280
Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA---- 301
HFEKMLYD +L YLD + L+ D Y+ + + L +L R++ GG FS DA
Sbjct: 281 HFEKMLYDNAELPMAYLDGYRLSGDPAYARVAGESLAFLDRELRHEGGAFFSTLDARSRP 340
Query: 302 --------DSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSR 352
DS E +G EGAFYVWT +EV+ +L E A L K+ Y ++ GN +
Sbjct: 341 PESRRDGSDSDEGDGEG-DVEGAFYVWTPEEVDAVLDEPAASLAKKRYGIRSGGNFE--- 396
Query: 353 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 412
+G V A+ + EK IL E R LFD R RPRP D+K
Sbjct: 397 --------RGTTVPTLAASVEELAADRDLSPEKVREILTEARTTLFDARESRPRPARDEK 448
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD-- 470
V+ SWNG IS+FARA L +EY E+A A F LYD
Sbjct: 449 VLASWNGRAISAFARAGDTLG-----------------EEYAEIAREALDFCHERLYDAE 491
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT-QDELF- 528
+T L + +G + PG+LDDYAFL G LD+Y + L +A+EL + DE +
Sbjct: 492 NETGALARRWLDGDVRGPGYLDDYAFLARGALDVYAATGDPEPLGFALELADALVDEFYD 551
Query: 529 -----------LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 577
LD EG G + + ++ R +E D + PS V+ L +++
Sbjct: 552 ADDGTIYFTRDLDGEGAGGGSRNADSGPLIARPQEFTDRSTPSSLGVAAETL----ALLD 607
Query: 578 GSKSD-YYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFE 636
G ++D +R+ AE L R++ + + AAD++ V + + ++
Sbjct: 608 GFRTDGEFREIAERVLTTHADRIRGSPLEHASLVRAADVVET-GGIEVTIAADEVPDEWR 666
Query: 637 NMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQN 693
L + L ++ PA + +D W + + A + + + A VC+
Sbjct: 667 ETLGERY----LPGALVAPRPATEDGLDAWLDALGMAEAPPIWADRDATDGEPTAYVCEG 722
Query: 694 FSCSPPVTD 702
F+CSPP TD
Sbjct: 723 FTCSPPRTD 731
>gi|448439398|ref|ZP_21588039.1| hypothetical protein C471_00950 [Halorubrum saccharovorum DSM 1137]
gi|445691449|gb|ELZ43640.1| hypothetical protein C471_00950 [Halorubrum saccharovorum DSM 1137]
Length = 751
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 250/734 (34%), Positives = 355/734 (48%), Gaps = 101/734 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA +LN+ FV +KVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAAVLNEEFVPVKVDREERPDVDSAFMTVSQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
+ +P+ +P GTYFPPE + +PGF+ + ++ D+W ++ D S
Sbjct: 113 AWCTPEGEPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMQRRADQWTTSARD 172
Query: 131 AIEQLSEALSASAS-------SNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSA-PKF 181
+E + +A + A ++ E P + L A + YD +GGFGS KF
Sbjct: 173 ELESVPDAEAGPAGGADDAGGTDGADGEAPGPDLLDEAAAAAIRGYDDEYGGFGSGGAKF 232
Query: 182 PRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
P P I +++ + TG+ + TL MA+GG++D +GGGFHRY+VD +
Sbjct: 233 PMPGRIDVLMRAYAR---TGRDAALT----AATGTLDGMARGGMYDQIGGGFHRYAVDRQ 285
Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
W VPHFEKMLYD +L +LDA LT D Y+ + + L +L R++ G FS DA
Sbjct: 286 WTVPHFEKMLYDNAELPMAFLDAARLTGDASYARVASETLGFLDRELRHDDGGFFSTLDA 345
Query: 302 DSAETEGATRKK----------------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKP 344
S E TR+ EGAFYVWT EV+ +L E A L KE Y ++
Sbjct: 346 RSRPPE--TRRGGVGSDGSDGSGHAADVEGAFYVWTPGEVDAVLDEPAASLAKERYGIES 403
Query: 345 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 404
GN + +G V A M E L E R LF+ R R
Sbjct: 404 GGNFE-----------RGTTVPTVAASIEELADDHDMSPEAVREALTEARVALFEARESR 452
Query: 405 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 464
PRP D+KV+ SWNG IS+FA A ++L + Y ++A A +F
Sbjct: 453 PRPARDEKVLASWNGRAISAFAAAGQVLG-----------------EPYADIAGDALAFC 495
Query: 465 RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 524
R +LYDE T L + +G + PG+LDD+AFL G LD+Y L +A++L T
Sbjct: 496 RENLYDESTGDLARRWLDGDVRGPGYLDDHAFLARGALDVYAATGDPDALGFALDLAETV 555
Query: 525 DELFLDREGGGYFNT------TGED--PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
F D E G + T GED ++ R +E D + PS V+ LV ++
Sbjct: 556 VADFYDDEDGTIYFTRDPDEAAGEDGDDTLFARPQEFTDRSTPSSLGVAAETLV----LL 611
Query: 577 AGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL----MCCAADMLSVPSRKHVVLVGHKSS 632
G ++D R+ AE + AV T D A PL + AAD ++ S V V +S
Sbjct: 612 DGFRTD--REFAEVAEAVVTTH-ADRIRASPLEHVSLVRAADRVA--SGGIEVTVAAESV 666
Query: 633 VD-FENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH---NSNNASMARNNFSADKVVA 688
D + L + L ++ P + + W + + A + + + A
Sbjct: 667 PDAWRETLGERY----LPGALVAPRPPTEDGLAVWLDRLDMDEAPPVWADRDAADGEPTA 722
Query: 689 LVCQNFSCSPPVTD 702
VC+ +CSPP TD
Sbjct: 723 YVCEGRTCSPPETD 736
>gi|118579433|ref|YP_900683.1| hypothetical protein Ppro_0998 [Pelobacter propionicus DSM 2379]
gi|118502143|gb|ABK98625.1| protein of unknown function DUF255 [Pelobacter propionicus DSM
2379]
Length = 705
Score = 348 bits (894), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 223/697 (31%), Positives = 331/697 (47%), Gaps = 78/697 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLS 79
TCHWCHVM ESFED VA ++N + +KVDREERPD+D +YMT + L G G GWPL+
Sbjct: 80 TCHWCHVMARESFEDPEVAAIINRHLIPVKVDREERPDIDSLYMTAARILTGSGAGWPLT 139
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+P+ KP TY P G G + K+ + W+ RD++ ++ + L E +
Sbjct: 140 IFLTPERKPFYCATYIPKTGSNGVLGIVETVEKISEIWNTNRDLINENSDTVVRALREIV 199
Query: 140 ---SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
SA ++ DE L YD GGFG KFP P + +L ++
Sbjct: 200 APVSADTDFGRVLDE--------AQASLQGMYDYLNGGFGGGAKFPLPHNLSFLLRMWRR 251
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
++ + ++MV +TL+ M GGI+D +G GFHRY+VD W VPHFEKMLYDQ
Sbjct: 252 TQN-------QDIEEMVAYTLRMMRDGGIYDQLGFGFHRYAVDPEWRVPHFEKMLYDQAL 304
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+A L+AF D F + +I ++ ++ P G S ADS EG
Sbjct: 305 IAITCLEAFQAYGDEFLKDMAMEIFSFVFDELTSPDGGFCSGLGADSG-------GGEGY 357
Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
+Y+W+ E++ L GE + LF E + + TGN F+G N+L + +
Sbjct: 358 YYLWSRGEIDRNLDGETSRLFCEAFGVTDTGN------------FEGGNILYQPRSVALL 405
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A + G+ + L R KL +VR++R RP D+K++V+WNGL++++ AR + +
Sbjct: 406 ARENGLDAGELDRRLETARAKLLEVRAERVRPFRDEKILVAWNGLMVAALARGAAV---- 461
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
S + +E A SA FI R+L+ RL S+ + P FL+DYA
Sbjct: 462 ------------SGEQRLLEAARSAVRFIARNLH-TPAGRLLRSYHQSVASVPAFLEDYA 508
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FL G+++LY+ L A+ L +LF D G +++T E VL+R+K HD
Sbjct: 509 FLCWGMVELYQVDGDPVMLQGALGLARGMLDLFSDAVTGAFYDTASEAEQVLVRMKNAHD 568
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA PSGNS++ + L++L I + E L + L + +A M A D
Sbjct: 569 GAIPSGNSIACLCLLKLGKICG---DEALTHAGERCLVSWMGSLAEQPIAHIQMVTALDF 625
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
P + + L+G + +L H + + D M
Sbjct: 626 FLGPDVE-ITLIGDRDKPGVRELLNVIHRYFIPGLVLRFKGDGDVYPM------------ 672
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
A VC +C PPV D LE LL E
Sbjct: 673 ------VGGLPTAYVCARGACRPPVNDAAQLEQLLSE 703
>gi|386856660|ref|YP_006260837.1| hypothetical protein DGo_CA1452 [Deinococcus gobiensis I-0]
gi|380000189|gb|AFD25379.1| hypothetical protein DGo_CA1452 [Deinococcus gobiensis I-0]
Length = 680
Score = 348 bits (893), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 207/545 (37%), Positives = 281/545 (51%), Gaps = 46/545 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE A +N FV+IKVDREERPD+D VYM QAL G GGWP++
Sbjct: 47 STCHWCHVMAHESFEDEATAAQMNAGFVNIKVDREERPDIDAVYMAATQALTGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-MLAQSGAFAIEQLSEA 138
VFL+PD +P GTYFPP + G P F +L V AW +RD ML + + L+
Sbjct: 107 VFLTPDAEPFYAGTYFPPREGLGMPSFGRVLGSVSGAWTTQRDKMLGNA-----QALTAH 161
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ +++ + D LP A L E L + YD+ GGFG APKFP P + +L S
Sbjct: 162 IQEASAPRRGEDPLPDGATGLAVEHLRRVYDADLGGFGGAPKFPSPATLDFLLTQSA--- 218
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G+ M L TL+ M GGIHD +GGGFHRYSVD +W VPHFEKMLYD QLA
Sbjct: 219 ----------GRDMALHTLRRMGAGGIHDQLGGGFHRYSVDAQWLVPHFEKMLYDNAQLA 268
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
L AF ++ D ++ + R L YL R+M+ G FSA+DAD+ G EG +
Sbjct: 269 RTLLRAFQVSGDGAFADLARTTLGYLEREMLSAEGGFFSAQDADTPTDHGGV---EGLTF 325
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASAS 377
WT E+ ++LG L+ G + DPH E+ +NVL S
Sbjct: 326 TWTPAEIREVLGAGG---DTDLALRAYGVTEEGNFLDPHRPEYGRRNVLHLPTPVSQLTR 382
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LG + L R++ P DDKV+ SWNGL +++FA A+++L
Sbjct: 383 DLGPDVPTRLEAARAHLLAARQARTQ---PGTDDKVLTSWNGLALAAFADAARVLGD--- 436
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ +EVA A F+RR L L+H++++G ++ G L+D+
Sbjct: 437 -------------TQLLEVARRNADFVRRELRLPDG-TLRHTYKDGQARVEGLLEDHVLY 482
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
GL+ L++ G L WA EL F D E G + + G ++L R + D A
Sbjct: 483 ALGLVALFQAGGDLAHLHWARELWTVVRRDFWDAEAGVFHSAGGRAETLLTRQAQGFDSA 542
Query: 558 EPSGN 562
S N
Sbjct: 543 ILSDN 547
>gi|257051594|ref|YP_003129427.1| hypothetical protein Huta_0507 [Halorhabdus utahensis DSM 12940]
gi|256690357|gb|ACV10694.1| protein of unknown function DUF255 [Halorhabdus utahensis DSM
12940]
Length = 717
Score = 348 bits (892), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 205/568 (36%), Positives = 298/568 (52%), Gaps = 48/568 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A +LN+ FV IKVDREERPDVD++Y T Q L GGWPLSV
Sbjct: 54 ACHWCHVMAEESFEDEATAAVLNENFVPIKVDREERPDVDRIYQTLAQLLGQQGGWPLSV 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
+L+PD +P GTYF P+ + GRPGF +L +K+ W+ RD + Q + +S L
Sbjct: 114 WLTPDGRPFYVGTYFAPDSRGGRPGFADLLEDLKETWENDRDGIEQRADQWADAISGELE 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMML-----YHS 194
+ + D LR A+ ++ D GGFGS PKFP+P +Q++L + S
Sbjct: 174 GTPTPADPSDVRSDELLRAGADAAVRTADREQGGFGSGGPKFPQPGRLQLLLRADARFGS 233
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
++ D G + E + ++ +L M GG++DHVGGGFHRY+ D W VPHFEKMLYD
Sbjct: 234 ERSAD-GDGADPGEYRAVLTESLDAMVDGGLYDHVGGGFHRYATDRSWTVPHFEKMLYDN 292
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
++ ++ + +T D Y+ + + ++L R++ P G +S DA S EG +E
Sbjct: 293 AEIPRALIEGYRVTGDERYARVAGETFEFLDRELGHPEGGFYSTLDARS---EG----EE 345
Query: 315 GAFYVWTSKEVEDILGEHA--ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
G FYVWT +EV +G+ L + Y + GN + G+ VL
Sbjct: 346 GKFYVWTPEEVRAAVGDETDVSLVLDRYGITEDGNFE-----------DGQTVLTIAASV 394
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
A++ G+ ++ + L R +LFD RS+R RP D+K++ WNGL IS+ A S L
Sbjct: 395 DELAAQSGLEVDDVQDRLDRAREQLFDARSERTRPPRDEKILAGWNGLAISALAEGSLAL 454
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+ + ++ A A F+R L+DE + L+ F +G + G+L+
Sbjct: 455 ED-----------------DILDRAVDALEFVRETLWDEDSGLLKRRFIDGDVRVEGYLE 497
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT--TGEDPS--VLL 548
DYAFL G LD Y+ L +A++L + F D + G + T G D +L
Sbjct: 498 DYAFLARGALDCYQASGDPDQLAFALDLAEEIESRFFDEDAGTLYFTEEAGSDAGTDLLA 557
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIV 576
R +E D + PS V+V LV L V
Sbjct: 558 RPQELTDRSTPSSAGVAVDVLVTLDEFV 585
>gi|374293368|ref|YP_005040403.1| hypothetical protein AZOLI_3026 [Azospirillum lipoferum 4B]
gi|357425307|emb|CBS88194.1| conserved protein of unknown function; putative Thioredoxin and
glycosidase domains [Azospirillum lipoferum 4B]
Length = 683
Score = 348 bits (892), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 234/697 (33%), Positives = 344/697 (49%), Gaps = 75/697 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE+ +A L+N+ FV+IKVDREERPD+D +Y + + L GGWPL++F
Sbjct: 56 CHWCHVMAHESFENPEIAGLMNELFVNIKVDREERPDLDTIYQSALALLGQQGGWPLTMF 115
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD +P GGTYFPP +YGR GF +LR + + ++D + ++ ++ L ALS
Sbjct: 116 LTPDAEPFWGGTYFPPAPRYGRAGFPDVLRGIAGTYANEQDKVGKN----VDALKSALS- 170
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
N+ + L A++L + D GG G+APKFP+ V + +L+ + + TG
Sbjct: 171 GMGENRSAGAVDAGVLDQVAQRLLREVDPIHGGIGTAPKFPQ-VPLFELLW--RAWQRTG 227
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ ++ V TL MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD +L ++
Sbjct: 228 R----EPFREAVTHTLANMAQGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNAELLDLM 283
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
+ T+D R+ + +L R+MI GG + DADS EG +EG FY+W
Sbjct: 284 TLVWQETRDPLLETRIRETVGWLLREMIADGGGFAATLDADS---EG----EEGLFYIWN 336
Query: 322 SKEVEDIL-----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+EV+ +L + FK Y + P GN + + N G + L D + A
Sbjct: 337 EEEVDRLLTPALGADGLATFKHVYEVLPQGNWEGVTIL---NRLGG----LSLADDATEA 389
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ L + R L R+KR RP DDKV+ WNGL+I++ A+
Sbjct: 390 T------------LAKGREILLRARAKRVRPGWDDKVLADWNGLMIAALTHAALA----- 432
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
D E+++ A A +F+R + ++ RL HS+R+G K G LDDYA
Sbjct: 433 -----------LDEPEWLDAAGRAFAFVRDRM--DKNGRLCHSWRHGQGKHTGMLDDYAH 479
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ L L+E L A T D F D GGYF T + +++R K D
Sbjct: 480 MARAALALHEATGDPAALDQAKLWVATLDAHFWDGANGGYFFTADDAEGLIVRTKTAFDN 539
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGN L LA++ + D YR+ A+ A F L + + +++
Sbjct: 540 ATPSGNGTM---LAVLATLFQRTGEDAYRERADALAAAFSGELTRNFFPLTTFLNSVELM 596
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ P + +V+VG + + E + N+ + + P D H + M
Sbjct: 597 TAPLQ--IVVVGPPKAAETEALRRTVLDHSLPNRILTVLAPG----ADLPANHPAQGKGM 650
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
A VC+ +CS PVT P L LL K
Sbjct: 651 RDG-----AATAYVCRGMTCSAPVTAPADLAALLSTK 682
>gi|163786447|ref|ZP_02180895.1| hypothetical protein FBALC1_14717 [Flavobacteriales bacterium
ALC-1]
gi|159878307|gb|EDP72363.1| hypothetical protein FBALC1_14717 [Flavobacteriales bacterium
ALC-1]
Length = 705
Score = 348 bits (892), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 224/689 (32%), Positives = 353/689 (51%), Gaps = 84/689 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE++ VA+L+N+ F+SIKVDREERPDVD++YM+ VQ + G GGWPL+
Sbjct: 81 CHWCHVMEEESFENDSVARLMNENFISIKVDREERPDVDQIYMSAVQLMTGSGGWPLNCI 140
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTYF +P + IL + + + + A+A E+L+E +
Sbjct: 141 TLPDGRPVFGGTYFT------KPQWTKILEDMSSLYKTNPEKVI---AYA-EKLTEGVKN 190
Query: 142 SASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ N + + N L++ ++L KS D + GG +APKFP P + +L +S + +D
Sbjct: 191 ADLINVNKEGIQFNKLQIESTVDELKKSLDFKLGGQKNAPKFPMPSNLDFLLRYSFQNDD 250
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ Q+ V+ +L MA GGI+D +GGGF RYSVD+RWH+PHFEKMLYD QL +
Sbjct: 251 -------KDLQQFVMTSLNKMANGGIYDQIGGGFSRYSVDDRWHIPHFEKMLYDNAQLVS 303
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y A+ TK+ + I + L+++ R++ G +S+ DADS EG +EG FY
Sbjct: 304 LYSKAYQFTKNEDFKTIVTETLNFIDRELTQEEGAFYSSLDADSKTKEGEL--EEGVFYT 361
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRM----SDPHNEF-KGKNVLIELNDSSA 374
WT +++ LGE LFK +Y + TG + + + NEF K N+ I+
Sbjct: 362 WTKDDLKTELGEDFDLFKSYYNINATGKWEKDQFILYKTKTDNEFIKTNNITIK------ 415
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
E + +L ++KL++VR+KR RP LDDK + SWN L++ ++ A ++
Sbjct: 416 ---------ELHSKVLA-WKKKLYEVRAKRERPRLDDKALTSWNALMLKAYVDAYRVF-- 463
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
+++ Y++ A A FI+ + + L H+++N S GF +DY
Sbjct: 464 --------------NKQSYLDKAIDNAKFIKENQI-QNNGSLFHNYKNKKSTIEGFSEDY 508
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A I+ ++LY+ +WL A EL + F ++E ++ T+ + +++ R E
Sbjct: 509 AHTITAYIELYQATFNEQWLNTAKELMDYAIAHFSNKETSMFYFTSDNETNLITRKTEVF 568
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D PS NSV L +L YY A LA K M L D
Sbjct: 569 DNVIPSSNSVLADCLFKLGH--------YYSNKAYTDLA------KQM-----LSNVYDD 609
Query: 615 MLSVPS--RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
+ PS + L + ++ +E ++ + A L + + P + + S+
Sbjct: 610 IEKAPSAYTNWLKLYLNYANPYYEVAISGSEADSKLKELNMFYLP----NILISGSNKSS 665
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVT 701
N + +N F D+ VC N +C PVT
Sbjct: 666 NLPLLKNKFIEDETFIYVCVNGTCKLPVT 694
>gi|168702337|ref|ZP_02734614.1| hypothetical protein GobsU_22617 [Gemmata obscuriglobus UQM 2246]
Length = 793
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 234/633 (36%), Positives = 322/633 (50%), Gaps = 66/633 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K ++ FL + CHWCHVME ESF VAK+LN FV IKVDREERPD
Sbjct: 64 GPEAFERAKKEKKLIFLSIGYSACHWCHVMERESFSRADVAKILNANFVCIKVDREERPD 123
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPED-KYGR---PGFKTILRKVK 114
VD +YMT + GGWPL++FL+PD KP+ G TYFPP+D K G PGFKT+L KV
Sbjct: 124 VDDIYMTALNTTGEQGGWPLNMFLTPDGKPIFGATYFPPDDRKIGDDTVPGFKTVLNKVM 183
Query: 115 DAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGG 174
+ +DK R L + + EAL A++ + L +P + + D GG
Sbjct: 184 E-FDKDRADLEKQADRVAKATVEALDANSRAIAL---VPLKRDLVSDGLDAFDIDPEHGG 239
Query: 175 FGS------APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDH 228
GS KFPRP +L +KK G A K+ TL + +GGI+DH
Sbjct: 240 TGSKKRDYKGTKFPRPPVWGFVLTQTKK---PGNERLA----KLTHNTLAKILEGGIYDH 292
Query: 229 VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDM 288
+GGGFHRYS + W VPHFEKMLYD QL +Y +A++L Y + + L+++RR+M
Sbjct: 293 LGGGFHRYSTERTWTVPHFEKMLYDNAQLVELYSEAYALAPRPEYKRVVAETLEFVRREM 352
Query: 289 IGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNC 348
P +SA DADS + KEG FYVWT+ EV +LG A + +K
Sbjct: 353 TAPEKGFYSALDADSND-------KEGEFYVWTADEVAKVLGTDA----DTAIVKAVYGV 401
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
D + + L E+ A +L + + L L ++KLFD R+KR RP
Sbjct: 402 TAPNFEDKFHILRLPKPLAEI------AKELKLTEDALLTKLEPLKKKLFDHRAKRERPF 455
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
LD KVI +WNG +I+ +ARA + K A Y+ A AA F+ L
Sbjct: 456 LDTKVITAWNGQMIAGYARAGGVFKEPA----------------YVRAAADAADFLLTKL 499
Query: 469 YDEQTHRLQHSFRNGPSKAP-----GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT 523
D+ RL + P P FLDDYA+LI GLL+L++ KWL A L +
Sbjct: 500 RDKD-GRLYRMYAAAPGGKPAPKGAAFLDDYAYLIHGLLNLHDATGEPKWLDAAKGLTDL 558
Query: 524 QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
+ + D GG++ T + + R K+ +DG +PSGNS NL+RL + +K +
Sbjct: 559 AVKHYADPVNGGFYFTAADGEKLFARAKDSYDGVQPSGNSQMARNLLRLGT---KTKDEG 615
Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
YR ++ F L+ ++PLM D L
Sbjct: 616 YRDRGIRTVKAFSFALRTAPTSMPLMLRTLDEL 648
>gi|448410530|ref|ZP_21575235.1| hypothetical protein C475_12927 [Halosimplex carlsbadense 2-9-1]
gi|445671566|gb|ELZ24153.1| hypothetical protein C475_12927 [Halosimplex carlsbadense 2-9-1]
Length = 719
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 225/697 (32%), Positives = 335/697 (48%), Gaps = 60/697 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESF DE +A+LLN+ FV IKVDREERPD+D +YM+ Q + G GGWPL+ +
Sbjct: 57 CHWCHVMEEESFADEDIAELLNENFVPIKVDREERPDIDSIYMSICQQVSGRGGWPLNAW 116
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P GTYFPPE K G PGF+ +L + ++W D Q ++A++
Sbjct: 117 LTPDGDPFYVGTYFPPEPKRGAPGFRQLLDDISESWADSEDRAEMED--RARQWTDAIAN 174
Query: 142 S-ASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
++ P + P ++ L A + D FGG+G KFP+P +++++
Sbjct: 175 DLETTPDQPGDAPGEDVLDTTASAALRGADREFGGWGKGQKFPQPGRLRVLMR------- 227
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+SG +++V TL M GG++DHVGGGFHRY+ D W VPHFEKMLYD +LA
Sbjct: 228 AHRSGGRDAYREVVGETLDAMGDGGLYDHVGGGFHRYTTDREWVVPHFEKMLYDNAELAR 287
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
V+L + T Y R+ L+++ R++ P G +S DA+S ++EGAFY
Sbjct: 288 VFLTGYQFTGRERYRETARETLEFVERELTHPDGGFYSTLDAESEGE--EGEREEGAFYA 345
Query: 320 WTSKEVEDILGEH--------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
WT V+D + E+ A +F+E Y + TGN + G+ V
Sbjct: 346 WTPDGVDDAVAEYGPEHGVPGEQASLAAEIFRERYGVTATGNFE-----------GGETV 394
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
L + A G+ L ++L +F R +RPRP D+KV+ WNGL++S+F
Sbjct: 395 LTRSASVESLADDYGLSLGDAEDLLDAATTAVFAAREERPRPPRDEKVLAGWNGLMVSAF 454
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
A A+ + D + + A A F R HL+D + RL F++G
Sbjct: 455 AEAAVV-----------------DDESWAGTATEALDFARDHLWDADSGRLSRRFKDGDV 497
Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
G+L+DYAFL G D Y+ + L +A+EL T + F D E + T S
Sbjct: 498 DIRGYLEDYAFLARGAFDTYQATGEVEHLAFALELARTIETEFWDAEEETLYFTPQSGES 557
Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
++ R +E D + PS V+ L+ L V D + A LA R++
Sbjct: 558 LVARPQELADQSTPSSAGVAAELLLALDHFV---DHDRFETVASGVLATHGGRVESNPQQ 614
Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
P + AAD + + + L + LA + L D A +D
Sbjct: 615 HPSLALAADAYRSGAHE-LTLAADPLPESWRETLAETYIPRRLLAPRPPTDDALAAWLDA 673
Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
E ++ +R + V C++ +CSPP D
Sbjct: 674 LELADAPPIWASREARDGEPTV-YACRSRTCSPPTQD 709
>gi|448608928|ref|ZP_21660207.1| hypothetical protein C440_00355 [Haloferax mucosum ATCC BAA-1512]
gi|445747305|gb|ELZ98761.1| hypothetical protein C440_00355 [Haloferax mucosum ATCC BAA-1512]
Length = 702
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 235/707 (33%), Positives = 345/707 (48%), Gaps = 95/707 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A++LN+ F+ +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPEIAEVLNEHFIPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
V+L+P KP GTYFPPE + G PGF+ ++ + W RD + A+ AI ++L
Sbjct: 113 VWLTPQGKPFFVGTYFPPEPRRGAPGFRDLVESFAETWQTDRDEIENRAEQWTHAITDRL 172
Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
E A +++ D+ Q ALR PKFP+P I +L
Sbjct: 173 EETPDTPGEAPGSEILDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDAIL-- 222
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+ TG+ E + + L MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYD
Sbjct: 223 -RGYAITGR----REALDVAVEALDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYD 277
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q LA YLDA+ LT + Y+ + R+ +++RR++ G F+ DA S +
Sbjct: 278 QAGLAARYLDAYRLTGNESYAAVARETFEFVRRELSHDDGGFFATLDAQS-------DGE 330
Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT + V L E A LF + Y + P GN F+ K ++ ++ +
Sbjct: 331 EGTFYVWTPEAVRSHLPELEADLFCDRYGVTPGGN------------FENKTTVLNVSAT 378
Query: 373 -SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
S A++ + ++ + L E ++ LF R+ R RP D+KV+ WNGL+IS+FA+ +
Sbjct: 379 LSDLAAEYDLSEDEVEDHLEEAKKTLFAARADRERPARDEKVLAGWNGLMISAFAQGAVA 438
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L+ ++ +A A A F+R HL+DE + L NG K G+L
Sbjct: 439 LEDDSLAAD----------------ARRALDFVREHLWDEASETLSRRVMNGEVKGDGYL 482
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYAFL G DLY+ + L +AI+L + F D G + T +++ R +
Sbjct: 483 EDYAFLARGAFDLYQATGDLEPLSFAIDLARATNREFYDAAAGTLYFTPESGEALVTRPQ 542
Query: 552 EDHDGAEPSGNSVSVINLVRL------------ASIVAGSKSDYYRQNA-EHSLAVFETR 598
E D + PS V+ + L A V S ++ R + EH V T
Sbjct: 543 EATDQSTPSSLGVATSLFLDLEHFAPDAGFGEAADAVLESYANRIRGSPLEHVSLVLAT- 601
Query: 599 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA 658
+ A VP + AAD + R+ + AS L V+ PA
Sbjct: 602 -EKAASGVPELTAAADEMPDEWRETL-------------------ASRYLPGLVVSRRPA 641
Query: 659 DTEEMDFW-EEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
+E+D W +E + A A + K C++F+CS P D
Sbjct: 642 TDDELDVWLDELELDEAPPIWAAREATDGKPTVYACESFTCSAPTHD 688
>gi|431930442|ref|YP_007243488.1| thioredoxin domain-containing protein [Thioflavicoccus mobilis
8321]
gi|431828745|gb|AGA89858.1| thioredoxin domain protein [Thioflavicoccus mobilis 8321]
Length = 683
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 230/685 (33%), Positives = 343/685 (50%), Gaps = 63/685 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESFED A L+N FV+IKVDREERPD+D++Y T Q L GGWPL
Sbjct: 54 SACHWCHVMAHESFEDPATAALMNRLFVNIKVDREERPDLDRIYQTAHQLLSSRAGGWPL 113
Query: 79 SVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
+VFL+P+ L+P GTYFP E ++G P F+ +L V+ A+ ++R+ + + + L+E
Sbjct: 114 TVFLTPETLEPFFCGTYFPREPRHGLPAFRQLLEGVERAFREQREAIREQSQGLMAALAE 173
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ + +PD P R QL+ S+D+ GGFG APKFPR +++++L H
Sbjct: 174 L---APRAGAIPDSAPLEGAR---RQLAASFDAARGGFGGAPKFPRVPDLELLLRHWAAT 227
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ G+ + MV FTL+ M GGI+D VGGGF+RYSVD+ W +PHFEKMLYD QL
Sbjct: 228 DAAGQPD--ARALAMVTFTLERMIAGGINDQVGGGFYRYSVDDAWMIPHFEKMLYDNAQL 285
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ DA+ T + + D++ +M G +SA DADS EG +EG +
Sbjct: 286 LALCCDAWQATSEPVFRAAAEATADWVIGEMQSDEGGYYSALDADS---EG----QEGRY 338
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT +E+E L Y + P N F+G+ L + A
Sbjct: 339 YVWTREELEGTLAPEEFAAFAARY----------GLDGPAN-FEGRWHLHAQAMPAEVAG 387
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+LG+ + + ++ RRKL +VR R RP D+KV+ +WN L+I ARA+++L
Sbjct: 388 RLGLTVAQVEGLIDGARRKLLEVRRARVRPACDEKVLTAWNALMIKGMARAARVLA---- 443
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
R +Y+ AE A +R L+ + RL S+ +G + P +LDD+A L
Sbjct: 444 ------------RPDYLASAERALGLVRSTLW--RDGRLLASYMDGTAHLPAYLDDHAML 489
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I LL+L + L +AIEL F D GG+F T + +++ R K D +
Sbjct: 490 IDALLELLQVRWRRDDLRFAIELAEILLARFEDSGEGGFFFTASDHETLIHRPKPLADES 549
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
P+GN+V+ RL ++ + Y + A LAV ++ A + A D
Sbjct: 550 LPAGNAVAARVFQRLGHLLGEPR---YLEAAARVLAVAGGDMRRAPYAHASLLMALDEHL 606
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
P VV + LA +Y ++ + I PAD +++ N ASM
Sbjct: 607 EPGETVVV---RAPPTELPPWLAELQQTYRPRRSALGI-PADEQDL------PGNLASMG 656
Query: 678 RNNFSADKVVALVCQNFSCSPPVTD 702
A +C+ C P+ +
Sbjct: 657 ----PGPGARAYLCRGTHCEAPIEE 677
>gi|372487318|ref|YP_005026883.1| thioredoxin domain-containing protein [Dechlorosoma suillum PS]
gi|359353871|gb|AEV25042.1| thioredoxin domain-containing protein [Dechlorosoma suillum PS]
Length = 682
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 242/704 (34%), Positives = 352/704 (50%), Gaps = 82/704 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM E F D VA +N F++IKVDREERPD+D+VY T Q L G GGWPL
Sbjct: 48 SACHWCHVMAHECFADATVAAEMNRLFINIKVDREERPDLDQVYQTAHQMLVGRPGGWPL 107
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
++FL+PD P GGTYFP E ++G P F +L V A+ +K+ +A+ G E
Sbjct: 108 TMFLTPDAMPFFGGTYFPREPRHGLPAFVEVLHSVARAFTEKQSEIAEQGRTMREAFGST 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L + L + P L +L +YD R GGFG APKFPRP + +L
Sbjct: 168 LPRAVRGEPLFNADP---LAQAVAELDTNYDRRRGGFGGAPKFPRPAALDFLLRRHAATG 224
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D G M L TL+ MA+GGIHDH+GGGF+RYSVD +W +PHFEKMLYD QL
Sbjct: 225 DPHARG-------MALTTLERMAEGGIHDHLGGGFYRYSVDAQWSIPHFEKMLYDNAQLL 277
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++Y +A++L++ + I+ +L+ +M PGG +A DADS EG +EG FY
Sbjct: 278 HLYAEAWALSRKQVFRQAAEGIVAWLQHEMALPGGAFAAALDADS---EG----EEGRFY 330
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSR----MSDPHNEFKGKNVLIELNDSSA 374
+WT++EV HA+L P D++ + P N + L ++
Sbjct: 331 LWTAREV------HALL--------PPQQWDVASIHWGLDGPPNFEDAEWHLRQVQPLEQ 376
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A +L + + L R L R++R RP DDKV+ N L I ARA++
Sbjct: 377 VAERLRLTPGEARQQLEGARHTLLAARNERIRPGRDDKVLTGCNALAIKGLARAARAF-- 434
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
R E++ +A AA F++R L+ + RL ++++G ++ P +LDD+
Sbjct: 435 --------------GRPEWLGLACGAADFLQRELWRDG--RLLAAWKDGRARLPAYLDDH 478
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
AFL+ +L+L + G A+ L + + F DRE GG+F T + +++ R K
Sbjct: 479 AFLLEAMLELLQAGWRDADYRCAVALADALLQHFEDREEGGFFFTAHDHETLIYRTKPVE 538
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-LMCCAA 613
D A PSGN V+ L RLA + S Y A +LA+F L+ A P L+
Sbjct: 539 DHATPSGNGVAAFALGRLALL---SGEPRYAAAARRALALFLPDLRQHPGAHPGLLNVLG 595
Query: 614 DMLSVPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
D LS P+ VL G + + +++ + A + ++ + P +E
Sbjct: 596 DELSPPAL--AVLQGPAAELARWQDEIGRLPAPW-----LLAVAPTGGDER--------- 639
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL--LEKP 714
++V A VC +C PP+ LE LL L KP
Sbjct: 640 --PPPLRKPETERVNAWVCAGVTCLPPID---GLEALLGMLAKP 678
>gi|359690220|ref|ZP_09260221.1| hypothetical protein LlicsVM_17604 [Leptospira licerasiae serovar
Varillal str. MMD0835]
gi|418751442|ref|ZP_13307728.1| PF03190 family protein [Leptospira licerasiae str. MMD4847]
gi|418758573|ref|ZP_13314755.1| PF03190 family protein [Leptospira licerasiae serovar Varillal str.
VAR 010]
gi|384114475|gb|EIE00738.1| PF03190 family protein [Leptospira licerasiae serovar Varillal str.
VAR 010]
gi|404274045|gb|EJZ41365.1| PF03190 family protein [Leptospira licerasiae str. MMD4847]
Length = 695
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 248/704 (35%), Positives = 349/704 (49%), Gaps = 76/704 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE A++LN +VSIKVDREERPDVD++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFEDETTAEVLNRDYVSIKVDREERPDVDRIYMDALHAMGQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE--- 137
FL+P+ KP+ GGTYFPP KYGR F +L + W K++ L ++ + L E
Sbjct: 115 FLTPEGKPITGGTYFPPVPKYGRKSFTEVLGILTGLWKDKKEELLEASEDLTKHLKESEE 174
Query: 138 --ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF--GSAPKFPRPVEIQMML-Y 192
AL+ +A + E+ +N L + YD + GF S KFP + + +L Y
Sbjct: 175 TRALAGTADISSPGSEVFENGFLL----YDRLYDPEYAGFKSNSVNKFPPSMGLSFLLRY 230
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
H KS + +MV TL M KGGI+D +GGG RYS D W VPHFEKMLY
Sbjct: 231 H--------KSTGEPKALEMVEETLTAMKKGGIYDQIGGGLCRYSTDHHWLVPHFEKMLY 282
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D ++ + + Y D+++YL RDM PGG I SAEDADS EG
Sbjct: 283 DNSLFLEALVECYQAVGEEKYKDYAYDVIEYLHRDMRLPGGGIASAEDADS---EG---- 335
Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
+EG FY+WT +EV ++ G+ + L E + + GN F+ KN+L E
Sbjct: 336 EEGLFYLWTKEEVREVCGQDSSLLDEFWNITEKGN------------FEEKNILHE--SF 381
Query: 373 SASASKL-GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+ S+L G+ + I+ R+KL + RS R RP DDK++ SWN L I + +A+
Sbjct: 382 RMNFSRLHGLEPSELEEIVSRNRKKLLEKRSTRIRPLRDDKILFSWNCLYIKALTKAAMA 441
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
+ + AE F+ ++L E RL FR G +K +
Sbjct: 442 FGD----------------GDLLREAEETYKFLEKNLIREDG-RLLRRFREGEAKILAYS 484
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
DYA + L L++ G G ++L +I + T++ + L R G F +G D LLR
Sbjct: 485 TDYAEFVLASLYLFQAGKGFRYLENSI--RYTEEAIRLFRSPAGVFFDSGIDGEALLRRT 542
Query: 552 ED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
D +DG EPS NS V L S + S+ Y Q A+ + F+ L+ M+ P M
Sbjct: 543 VDGYDGVEPSANSSFATAFV-LLSKLGVVDSEKYLQYADSIFSYFKPELEAYPMSYPYML 601
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLA--AAHASYDLNKTVIHIDPADTEEMDFWEE 668
A + P R+ V+ + E +L S L +TV+ + D E E
Sbjct: 602 SALWLRKSPGRELAVVYSSQ-----EELLPFWKGVGSLFLPETVL-VWANDKE-----AE 650
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
N + +N S V A VC F C PV+D SL L+E
Sbjct: 651 ENGEKFLLLKNRNSGGGVKAYVCVGFHCELPVSDWPSLRARLVE 694
>gi|209966075|ref|YP_002298990.1| hypothetical protein RC1_2806 [Rhodospirillum centenum SW]
gi|209959541|gb|ACJ00178.1| conserved hypothetical protein [Rhodospirillum centenum SW]
Length = 688
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 233/699 (33%), Positives = 345/699 (49%), Gaps = 80/699 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED +A ++ND FV++KVDREERPDVD++Y + + L GGWPL++F
Sbjct: 53 CHWCHVMAHESFEDPTIAAMMNDLFVNVKVDREERPDVDQIYQSALGLLGQQGGWPLTMF 112
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF---AIEQLSEA 138
L+P+ +P GGTYFPPE ++GRPGF +L V + ++ D + ++ A+ +L++
Sbjct: 113 LTPEGEPFWGGTYFPPERRWGRPGFPDVLLGVSTTYRQEPDKVVRNTTALKDALHRLAQN 172
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ L DE+ A +L + D GG GSAPKFP+ ++++ K+
Sbjct: 173 RPGAGVDVDLLDEV--------AARLVQEVDPVHGGIGSAPKFPQTGIVELLWRAWKR-- 222
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ + + V+ TL M++GGI+DH+GGG+ RYS D+ W VPHFEKMLYD QL
Sbjct: 223 -TGR----EDCRAAVVTTLTQMSQGGIYDHLGGGYARYSTDQEWLVPHFEKMLYDNAQLI 277
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIG----PGGEIFSAE-DADSAETEGATRKK 313
++ + T+D + R+ + ++ R+M+ P G F+A DADS EG +
Sbjct: 278 DLLTTVWQDTRDPLFEARVRETVGWVLREMVSEPGRPVGGGFAATLDADS---EG----E 330
Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
EG FYVWT EV+ +LG+ A F Y + GN ++G +L L
Sbjct: 331 EGRFYVWTWAEVDRLLGDRAETFARAYDVTERGN------------WEGTTILNRLKRPE 378
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
G P E+ L E R LF R R RP DDKV+ WNGL+I++ ARA +
Sbjct: 379 P-----GTPAEE--GALAEMRAVLFQARGARVRPGWDDKVLADWNGLMIAALARAGAVF- 430
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
D +++ A A F+R H+ D RL HS+R G + G LDD
Sbjct: 431 ---------------DEPDWIAAARRAYDFVRTHMQDAD-GRLWHSWRAGTLRHRGTLDD 474
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
A + L L+E + A D F D E GGYF T + +++R +
Sbjct: 475 QAAMARAALALFEVTGDGTCVEQARRWAAVADAQFWDTESGGYFLTAADATDLIVRPRNA 534
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D A PSGN + L RL I + + +R+ A+ + F + PL
Sbjct: 535 QDNAVPSGNGTMLGVLARLWLI---TGEEGWRRRADALVTAFGG--EPGRNFFPLATFLN 589
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
++ + VV+ G ++ D +L A H + + + P + H +
Sbjct: 590 NVELLHRAVQVVVAGDPAAADTGALLRAVHGAGLPTLVLTPVTPGTALP----DGHPAAG 645
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
M + A VC+ +CS PVTDP +L LL E
Sbjct: 646 KGMV-----GGRAAAYVCRAMACSLPVTDPAALAALLRE 679
>gi|222479721|ref|YP_002565958.1| hypothetical protein Hlac_1296 [Halorubrum lacusprofundi ATCC
49239]
gi|222452623|gb|ACM56888.1| protein of unknown function DUF255 [Halorubrum lacusprofundi ATCC
49239]
Length = 744
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 240/724 (33%), Positives = 343/724 (47%), Gaps = 88/724 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE +A +LN+ FV +KVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESIAAVLNEKFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
+ +P KP GTYFPPE + +PGF+ + ++ D+W ++ D S
Sbjct: 113 AWCTPKGKPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRADQWTTSARD 172
Query: 131 AIEQLSEALSAS-ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQ 188
+E + E +A AS + L A + YD +GGFGS KFP P I
Sbjct: 173 ELESVPEPDAAGDASGTGGAGPPGPDLLDEAAAAAIRGYDDEYGGFGSGGAKFPMPGRID 232
Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
++L + G+A+ TL MA+GG++D +GGGFHRY+VD +W VPHFE
Sbjct: 233 VLLRAYAR-----SGGDAA--LTAATGTLDGMARGGMYDQIGGGFHRYAVDRQWTVPHFE 285
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYD +L YLD + LT D Y+ + + L +L R++ G FS DA S E
Sbjct: 286 KMLYDNAELPMAYLDGYRLTGDASYARVASETLGFLDRELRHDDGGFFSTLDARSRPPEN 345
Query: 309 --------------ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRM 353
EGAFYVWT EV+ +L E A L K+ Y ++ GN +
Sbjct: 346 RRGNAGSDESDDADDVADVEGAFYVWTPAEVDAVLDEPAASLAKDRYGIRSGGNFE---- 401
Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
+G V + A + M E L R LF+ R RPRP D+KV
Sbjct: 402 -------RGTTVPTIAASIAELADEHDMSTEAVREALTAARVALFEARESRPRPARDEKV 454
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
+ SWNG IS+FA A ++L + Y ++A A SF R LYDE+T
Sbjct: 455 LASWNGRAISAFATAGQVLG-----------------EPYADIASDALSFCRERLYDEET 497
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
L + +G + PG+LDD+AFL G LD+Y + L +A++L T F D
Sbjct: 498 ETLARRWLDGDVRGPGYLDDHAFLARGALDVYSVTGDPEALGFALDLAATVVSDFYDEAD 557
Query: 534 GGYFNTT--------GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
G + T G D ++ R +E D + PS V+ L +++ G ++D R
Sbjct: 558 GTIYFTRDPDGNAGHGGDDTLFARPQEFTDQSTPSSLGVAAETL----ALLDGFRTD--R 611
Query: 586 QNAEHSLAVFETRLKDMAMAVPL----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 641
+ AE + V T D A PL + AAD ++ + + V E +
Sbjct: 612 EFAEVAETVVTTH-ADRIRASPLEHVSLVRAADRVASGGIEVTIAVDAVPDAWRETL--- 667
Query: 642 AHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQNFSCSP 698
L ++ P + + W + + + A + + A VC+ +CSP
Sbjct: 668 --GERYLPGALVAPRPPTEDGLAAWLDRLDMDEAPPIWADRDAVDGEPTAYVCEGRTCSP 725
Query: 699 PVTD 702
P TD
Sbjct: 726 PETD 729
>gi|448591505|ref|ZP_21650993.1| hypothetical protein C453_10720 [Haloferax elongans ATCC BAA-1513]
gi|445733479|gb|ELZ85048.1| hypothetical protein C453_10720 [Haloferax elongans ATCC BAA-1513]
Length = 702
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 228/696 (32%), Positives = 339/696 (48%), Gaps = 73/696 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A+ LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAETLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
V+L+P KP GTYFPPE + G PGF+ ++ ++W RD + AQ AI +QL
Sbjct: 113 VWLTPQGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWQTDRDEIENRAQQWTSAIHDQL 172
Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
+ A +++ D+ Q ALR PKFP+P I +L
Sbjct: 173 EDTPDTPGEAPGSEILDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDSLL-- 222
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+ TG+ E + + +L MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYD
Sbjct: 223 -RGYAITGR----REALDVAVESLDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYD 277
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q L YLD + LT Y+ + + +++RR++ G F+ DA S +
Sbjct: 278 QAGLVPRYLDTYRLTGTEAYADVAVETFEFVRRELSHDDGGFFATLDAQSG-------GE 330
Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT EV +L E A LF + Y + P GN F+ K ++ ++ +
Sbjct: 331 EGTFYVWTPDEVRSLLPELEADLFCDRYGITPGGN------------FENKTTVLNVSAT 378
Query: 373 -SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
S A + + ++ + L E R+ LF RS R RP D+K+I WNGL+IS+FA+ +
Sbjct: 379 VSDLAEEYDLSEDEVEDKLAEARKALFAARSGRERPARDEKIIAGWNGLMISAFAQGAVA 438
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L+ ++ + A A FIR HL+D L NG K G+L
Sbjct: 439 LEDDS----------------LADDARRALDFIREHLWDADAEHLSRRVMNGEVKGDGYL 482
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYAFL G DLY+ + L +A++L F D G + T +++ R +
Sbjct: 483 EDYAFLARGAFDLYQATGDVEPLAFALDLGRAIHREFYDDAAGTLYFTPESGEALVTRPQ 542
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
E D + PS V+ + L + + + A+ L R++ + +
Sbjct: 543 EATDQSTPSSLGVATSLFLDLEHFAPDAG---FGEAADAVLETHANRIRGSPLEHVSLAL 599
Query: 612 AADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EE 668
AA+ + VP + + + ++ LA+ + L V+ PA +E+D W +E
Sbjct: 600 AAEKAASGVP---ELTIAADEIPAEWRETLASRY----LPGLVVAPRPATDDELDAWLDE 652
Query: 669 HNSNNASMARNNFSAD--KVVALVCQNFSCSPPVTD 702
+ A AD + C+NF+CS P D
Sbjct: 653 LELDEAPPIWAAREADGGEPTVYACENFTCSAPTHD 688
>gi|418753914|ref|ZP_13310150.1| PF03190 family protein [Leptospira santarosai str. MOR084]
gi|409965755|gb|EKO33616.1| PF03190 family protein [Leptospira santarosai str. MOR084]
Length = 630
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 241/682 (35%), Positives = 346/682 (50%), Gaps = 70/682 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+ VA LN FVSIKVDREERPD+D++YM + A+ GGWPL+VFL+PD K
Sbjct: 1 MERESFENPTVADYLNSHFVSIKVDREERPDIDRIYMDALHAMNQQGGWPLNVFLTPDGK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P+ GGTYFPPE YGR F +L ++ W++KR L A +LS+ L S
Sbjct: 61 PITGGTYFPPEPGYGRKSFLEVLNILRKIWNEKRQEL----VVASSELSQYLKDSGEGRA 116
Query: 148 LPDE---LPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDT 200
+ + LP A L +S YDS FGGF + KFP + + +L YH
Sbjct: 117 VEKQEGNLPSENCFDSAFSLYESYYDSEFGGFKTNHVNKFPPSMGLSFLLRYH------- 169
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+S + +M TL M +GGI+D VGGG RYS D RW VPHFEKMLYD
Sbjct: 170 -RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCRYSTDPRWTVPHFEKMLYDNSLFLET 228
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
++ S++K + D++ YL RDM G I SAEDADS EG +EG FYVW
Sbjct: 229 LVECSSVSKKISAKSFALDVISYLHRDMRNEDGGICSAEDADS---EG----EEGLFYVW 281
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+E ++ GE + + ++ + + GN F+GKN+L E + S +A
Sbjct: 282 DLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILRE-SYPSGAAKFSE 328
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ ++L R KL + RSKR RP DDK++ SWNGL + +A
Sbjct: 329 EEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILTSWNGLYTKALTKAG----------- 377
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
V +++++++AE SFI R+L D R+ FR+G S G+ +DYA +I+
Sbjct: 378 -----VAFQKEDFLKLAEETYSFIERNLID-SNGRILRRFRDGESGILGYSNDYAEMIAS 431
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 559
+ L+E G G ++L A+ LF R G F TG D VLLR D +DG EP
Sbjct: 432 SIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEP 489
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
S NS V +LV+L+ + G S YR+ AE + F L ++ P + A
Sbjct: 490 SANSSLVYSLVKLS--LFGVDSARYRKFAESIFSYFTKELSSYSLGYPHLLSAYWTYRFH 547
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
S K +VL+ K + +++LA + + + ++ + EE +++ +
Sbjct: 548 S-KEIVLI-RKDADSGKDLLAEIQTKFLPDSVLAVVNEDELEEA-------RKLSALFDS 598
Query: 680 NFSADKVVALVCQNFSCSPPVT 701
S + VC+NFSC P+
Sbjct: 599 RDSGGNALVYVCENFSCKLPIA 620
>gi|409730794|ref|ZP_11272353.1| hypothetical protein Hham1_16314 [Halococcus hamelinensis 100A6]
gi|448723490|ref|ZP_21706008.1| hypothetical protein C447_10082 [Halococcus hamelinensis 100A6]
gi|445787756|gb|EMA38495.1| hypothetical protein C447_10082 [Halococcus hamelinensis 100A6]
Length = 719
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 204/556 (36%), Positives = 298/556 (53%), Gaps = 44/556 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA+ LN+ FV IKVDREERPD+D++Y T + + G GGWPLS
Sbjct: 52 SSCHWCHVMADESFEDERVAERLNEDFVPIKVDREERPDLDRLYQTVIGMVSGRGGWPLS 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+L+PD +P GTYFPPE K G+PGF +L + +AW+ +R+ + +Q ++A+
Sbjct: 112 VWLTPDGRPFYIGTYFPPEAKRGQPGFLDLLDSITEAWETEREDIEGRA----DQWADAM 167
Query: 140 SASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + P + P + L A ++ D +GG G KFP+ +++++ + +++
Sbjct: 168 TGELEATPEPGDPPGSELLETAARSAVRNADREYGGSGRGQKFPQTGRLRLLMEAADRID 227
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D A E L MA GG+ DHVGGGFHRY+ D W VPHFEKMLYD +L
Sbjct: 228 DEEFGTVAREA-------LDAMADGGLRDHVGGGFHRYTTDREWTVPHFEKMLYDNAELV 280
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YLD + L D Y+ + R+ L ++ R++ P G FS DA S + G ++EGAFY
Sbjct: 281 RAYLDGYRLFGDERYAEVARETLGFVERELTSPEGGFFSTLDAQSVDESG--EREEGAFY 338
Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT EV D +G+ A LF E Y + +GN + G VL D A
Sbjct: 339 VWTPDEVHDAVGDDRAAELFCERYGISESGNFE-----------NGTTVLTLAADVQGLA 387
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ +E+ L R +F R++R RP D+KV+ WNGL++++FA A L
Sbjct: 388 DEYDTTVEEVEADLERAREAVFAARAERSRPDRDEKVLAGWNGLMVAAFAEAGLALD--- 444
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ E A +A F+R L++E+ RL +++G K G+L+DYAF
Sbjct: 445 --------------PRFAETAVAALDFVREELWNEEEERLSRRYKDGEVKIDGYLEDYAF 490
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L G L YE L +A++L + F D E G + T S++ R +E D
Sbjct: 491 LARGALACYEATGDVHHLGFALDLARAIESEFWDPEEGTLYFTPSSGESLVARPQELDDQ 550
Query: 557 AEPSGNSVSVINLVRL 572
+ PS V+V L+ L
Sbjct: 551 STPSSTGVAVETLLAL 566
>gi|441496345|ref|ZP_20978578.1| Thymidylate kinase [Fulvivirga imtechensis AK7]
gi|441439862|gb|ELR73159.1| Thymidylate kinase [Fulvivirga imtechensis AK7]
Length = 680
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 208/560 (37%), Positives = 296/560 (52%), Gaps = 57/560 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME ESFE++ +A ++N+ F+SIK+DREERPDVD++YM VQA+ GGWPL+
Sbjct: 58 SSCHWCHVMERESFENDSIAAIMNEHFISIKIDREERPDVDQIYMDAVQAMGQSGGWPLN 117
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+ D KP GGTYFPPE + +L++V +++KR + +S +QL+ A+
Sbjct: 118 VFLTSDQKPFYGGTYFPPE------SWAQLLKQVARVYNEKRSEVEESA----DQLTNAI 167
Query: 140 SASASSN-KLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+ S +L D E L E+LS +D GGF APKFP P +L +
Sbjct: 168 ATSEVIKFRLKDNGTEYTTTTLEKMYEKLSMKFDGNKGGFKGAPKFPMPGNWLFLLRYYN 227
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
D E + + TL +A+GGI+D +GGGF RYSVD W VPHFEKMLYD G
Sbjct: 228 ATND-------QEALRQLEVTLSEIARGGIYDQIGGGFARYSVDADWLVPHFEKMLYDNG 280
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
QL ++Y +A++ TK Y + +D+L R+M G +SA DADS EG +EG
Sbjct: 281 QLVSLYAEAYTATKLELYKEVVYQTIDWLEREMTSKEGGFYSALDADS---EG----EEG 333
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVWT EVE +LG A L +Y ++ GN + +GKN+L
Sbjct: 334 KFYVWTKDEVEHVLGAEANLIMSYYNIEKEGNWE-----------EGKNILHMHVSDEEF 382
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A + + + + + + L + RSKR RP LDDKV+ WNGL+ A
Sbjct: 383 AKRHDLGVAELKEKVWKADELLLEERSKRVRPGLDDKVLAGWNGLMQKGLVDA------- 435
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
V +++++A A F+ +H+ + RL SF++G + G+L+DYA
Sbjct: 436 ---------YVAFGEPKFLDLALRNAHFLDQHMIHD--FRLNRSFKSGKASIDGYLEDYA 484
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F+I LYE +WL A L + E F D +F T ++ R KE D
Sbjct: 485 FVIDAYTALYEATFDEQWLKKAKGLMDYTIEHFYDNSEKLFFFTDDRSEKLIARKKEVFD 544
Query: 556 GAEPSGNSVSVINLVRLASI 575
P+ NS +NL RL I
Sbjct: 545 NVIPASNSQMALNLYRLGKI 564
>gi|436836357|ref|YP_007321573.1| protein of unknown function DUF255 [Fibrella aestuarina BUZ 2]
gi|384067770|emb|CCH00980.1| protein of unknown function DUF255 [Fibrella aestuarina BUZ 2]
Length = 682
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 231/690 (33%), Positives = 341/690 (49%), Gaps = 72/690 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE+E +AK++N+ FV IKVDREERPDVD VYM VQA+ GGWPL+
Sbjct: 47 SACHWCHVMERESFENEQIAKIMNERFVCIKVDREERPDVDAVYMEAVQAMGVQGGWPLN 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL PD +P G TY PP++ + ++ V+ A+D+ RD L +S E L+ +
Sbjct: 107 VFLMPDARPFYGLTYAPPQN------WANLMVGVRQAFDENRDELLRSAEGFAEHLNTSE 160
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S Q + +L+ +D+ GG G APKFP P +L ++
Sbjct: 161 STRFQLQTAEPVYAQETVETMYRKLATRFDTELGGTGRAPKFPMPSIYTFLLRYAD---- 216
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+G+ S Q++ L TL MA GGI+D +GGGF RYS D+ W PHFEKMLYD QL
Sbjct: 217 --LTGDPSAFQQLTL-TLNRMALGGIYDQLGGGFARYSTDKHWFAPHFEKMLYDNAQLLT 273
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y +AF++T Y + +++L R+++ P G +SA DADS EG EG FY
Sbjct: 274 LYSEAFAMTGSALYRFTVYHTIEFLERELLSPDGGFYSALDADS---EGI----EGKFYT 326
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W++ E++ ILG+ F + Y + P GN D+ H + N+L + A A +L
Sbjct: 327 WSADELQSILGDDYDWFAQLYTITPEGNWDIG-----HGHGR-TNILHRTETNPAFADQL 380
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G + L + KL VRS+R RP LDDK++ SWNGL + A ++
Sbjct: 381 GWTAAELNERLTTAKEKLLAVRSQRVRPGLDDKLLCSWNGLALKGLVSAYRV-------- 432
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT-HRLQHSFRNGP-----SKAPGFLDD 493
FN P E++ +A A FI++ L D + RL HS++ GP ++ GFL+D
Sbjct: 433 -FNEP-------EFLSMALRLAFFIKQKLTDGRNGGRLWHSYKTGPDGVGRARQLGFLED 484
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA +I G + LY+ +WL A L F D + F T ++ R KE
Sbjct: 485 YAAVIDGYVALYQATFADEWLTEADRLTQYVLAHFNDPDEPLLFFTDKSGEELIARKKEL 544
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D P+ NS+ NL L+ ++ + Y + + L + + L + V + A
Sbjct: 545 FDNVIPASNSIMAQNLYTLSLLLERPE---YAERVDQMLGLIQPLLDN---EVNYLTNWA 598
Query: 614 DMLSVPSR--KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
+ ++ R + +VG D + A + NK + D S
Sbjct: 599 SLYTLRVRPTAEIAIVGP----DAQEFRRDIDAKFFPNKVLAGTD------------SRS 642
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ +A+ + VC N +C PVT
Sbjct: 643 SLPLLAQRGPIDGQTAIYVCYNRACQLPVT 672
>gi|148264330|ref|YP_001231036.1| hypothetical protein Gura_2283 [Geobacter uraniireducens Rf4]
gi|146397830|gb|ABQ26463.1| protein of unknown function DUF255 [Geobacter uraniireducens Rf4]
Length = 700
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 238/695 (34%), Positives = 340/695 (48%), Gaps = 78/695 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME E+FED VA + N +F+ IKVDREERPD+D+ YM Q + G GGWPL++
Sbjct: 79 TCHWCHVMEHEAFEDREVAAVFNRFFICIKVDREERPDIDEQYMAVAQMMTGSGGWPLNI 138
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++P+ KP TY P + G PG IL +V + W +R L Q IE L+
Sbjct: 139 FMTPEKKPFFAATYMPRTPRMGMPGIIQILERVAELWRTERQKLEQDSDVTIEALTHHFQ 198
Query: 141 ASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S LPD L QNA +QL++ YD +GGFG+ PKFP P+ + +L K
Sbjct: 199 PHPGS--LPDMVLVQNAY----QQLTEMYDDLWGGFGNVPKFPMPLYLTFLLRFWK---- 248
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+SG + MV TL+ + +GGI+D +G GFHRY+VD +W VPHFEKMLYDQ +A
Sbjct: 249 --RSGNGAS-LAMVEHTLRMLRQGGIYDQIGFGFHRYAVDRQWLVPHFEKMLYDQALIAI 305
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
YLDAF T FY + ++ Y+ +M P G F+ +DAD TEG +EG +Y+
Sbjct: 306 GYLDAFQATAVPFYRQVAEEVFAYVLGEMTSPEGGFFAGQDAD---TEG----EEGNYYI 358
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT E+ +G + A +F C L +++ N F+G+N+L A++
Sbjct: 359 WTPAEIAAAIGHDEAQVF-----------CRLFDVTEKGN-FEGRNILHLPVPPETFAAR 406
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ E L R L VR R RP D+KV+ +WNGL+I++ AR +
Sbjct: 407 EAILTEVLTADLERWRHTLLKVRGNRIRPFRDEKVLTAWNGLMIAALARGYAL------- 459
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
S + ++ A+ AA+FI L RL SF G + P FLDDYAF +
Sbjct: 460 ---------SGEERFLAAAKRAAAFIGTRL-TSPGGRLMRSFHLGEASVPAFLDDYAFFV 509
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGA 557
GL++L++ ++L A L + LF +GG Y TG D L +++ DG
Sbjct: 510 WGLIELHQVTLEPEFLDSARFLADEMLRLFHSGKGGLY--ETGLDSEQLPVIRQSARDGV 567
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGNSV+ +L RL I + + ++ E + F + +A A+D
Sbjct: 568 LPSGNSVAAFDLFRLGRITGDGR---FLESGEAVVRTFMGDVTRQPLASLNFLSASDYHL 624
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
P V L G++ + ML A H + N + +
Sbjct: 625 GPEVT-VTLAGNREELG--GMLDAVHRRFIPNLALRY------------------GGEGG 663
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ A VC +C P VT +L LL E
Sbjct: 664 ESPTVGGLPTAYVCAKGACRPSVTRADALGALLDE 698
>gi|298206807|ref|YP_003714986.1| hypothetical protein CA2559_01090 [Croceibacter atlanticus
HTCC2559]
gi|83849439|gb|EAP87307.1| hypothetical protein CA2559_01090 [Croceibacter atlanticus
HTCC2559]
Length = 681
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 216/686 (31%), Positives = 350/686 (51%), Gaps = 70/686 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFED +A+++N F++IKVDREERPDVD+VYM +Q + G GGWPL++
Sbjct: 56 CHWCHVMEHESFEDISIAEVMNANFINIKVDREERPDVDQVYMKALQLMTGQGGWPLNIV 115
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ G TY P + +K L ++ D + + + E+LS+ ++
Sbjct: 116 ALPDGRPIWGATYLP------KKQWKGSLHQLADLYRSNSEHMITYA----EKLSKGMAQ 165
Query: 142 SASSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ K ++ + L+ + S +D +GG +PKF P Q +L ++ + +D
Sbjct: 166 VSLVTKTDSNTDISKAFLKDSLQTWSNQFDYTYGGTQRSPKFMMPNNYQFLLRYAHQTKD 225
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
V+ TL ++ GG++DH+GGGF RY+VD +WHVPHFEKMLYD QL +
Sbjct: 226 KSL-------LDYVILTLNKISYGGVYDHIGGGFSRYAVDSKWHVPHFEKMLYDNAQLVS 278
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y A++LTKD +Y + + L+++ ++ G +S+ DADS TEG + +EGAFYV
Sbjct: 279 LYSKAYTLTKDPWYKTVVTNTLNFIETELTRDNGSFYSSLDADSLNTEG--KLEEGAFYV 336
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT E++ +L E LF+ +Y + G+ + HN + VLI +S A+
Sbjct: 337 WTKAELKSLLNEDYPLFEAYYNINEYGHWE-------HNNY----VLIRTKSNSEIANDF 385
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+P+ L + L + R KR +P LDDK + SWN L+I+ + A K +
Sbjct: 386 SIPISTLDKKLTSWKALLNNNRQKRAQPRLDDKSLTSWNALMINGYIDAYKAFQIN---- 441
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+Y+E+A A++FI + ++ L HS+ +K G+L+DYAF I
Sbjct: 442 ------------DYLEIALKASNFILDKML-QKDGSLTHSYNKNEAKINGYLEDYAFTIE 488
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+ L+E +KWL A EL + F D E ++ + D +++ R E D P
Sbjct: 489 AFISLFEVTFNSKWLSKAEELTTYALKHFYDEEQHIFYFNSNLDDALVTRPIEQQDNVIP 548
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
+ NS NL +L+ ++ G KS Y++ AE L K A S P
Sbjct: 549 ASNSTMAKNLFKLSHLL-GIKS--YKEIAEQQLKTVLQDAKTYASGYSNWLDVIMNFSFP 605
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
+ +V+ G +S +++ +LN I A +E +N+ + +N
Sbjct: 606 YHE-IVITGKNASNYVKDL--------NLNYIPNSITAATEKE--------NNDLLIFKN 648
Query: 680 NFSADKVVALVCQNFSCSPPVTDPIS 705
+ ++ + VC++ +C+ P TD +S
Sbjct: 649 RYVDEQTLIYVCKDNTCNVP-TDKVS 673
>gi|338741363|ref|YP_004678325.1| hypothetical protein HYPMC_4552 [Hyphomicrobium sp. MC1]
gi|337761926|emb|CCB67761.1| conserved protein of unknown function [Hyphomicrobium sp. MC1]
Length = 682
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 231/696 (33%), Positives = 340/696 (48%), Gaps = 74/696 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED A+++ND FV+IKVDREERPD+D +YM + L GGWPL++F
Sbjct: 51 CHWCHVMAHESFEDPETARVMNDLFVNIKVDREERPDIDAIYMGALHRLGEQGGWPLTMF 110
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L + KP GGTYFP E +YGRP F T+L ++ +A+ + + +A++ + L E S
Sbjct: 111 LDSEAKPFWGGTYFPRESRYGRPSFVTVLLRIAEAYQSQPENVAKNTEALVAALKEEAST 170
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ PD +P R ++++ D GG APKFP+ ++ + + D
Sbjct: 171 TDRVEAGPD-VPDLVAR-----ITRAVDRDHGGINGAPKFPQWNIFWLLWRGAMRFGD-- 222
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ ++ V+ TL+ + +GGI+DH+GGGF RYSVD W VPHFEKMLYD L ++
Sbjct: 223 -----EDAKQAVITTLRNICQGGIYDHLGGGFARYSVDPFWLVPHFEKMLYDNALLIDLI 277
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
+ + T+D + + + +L+R+MIG G ++ DADS EG +EG FYVW
Sbjct: 278 TEVWRETQDPLFKIRIAETVAWLKREMIGEAGGFAASLDADS---EG----EEGKFYVWH 330
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
KE+ D+LG E A +F + Y + GN +G +L L S S+ +
Sbjct: 331 KKEIVDVLGPEDAAIFGKVYGVTRDGNFSEHAAITASGRIEGPTILNRLESQSFSSDEAE 390
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
L E R KL R+ R RP DDK++ WNGL+I++ +RA+ +
Sbjct: 391 ARLS-------EMRAKLLTRRAGRVRPGWDDKILADWNGLMIAAMSRAAIVF-------- 435
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
D+ E++ +AE+A + + L RL HS+R G +KAP DYA +I
Sbjct: 436 --------DQPEWLGMAEAAFTCVATKL-SAGGDRLYHSYRGGLAKAPATASDYANMIWA 486
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L LYE S ++L A D + D + GGYF + V++R+K D A PS
Sbjct: 487 ALRLYEATSSDRYLSQAQRWAAVLDTHYWDGDSGGYFTAADDTSDVVVRLKSASDDATPS 546
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCA--ADMLS 617
N++ + NL+ LA++ D A TR+ A+A P C A
Sbjct: 547 ANAIQLSNLITLAAMTGDLTYD--------DRAAELTRVFSGAVARAPTGHCGLIAAGFD 598
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS--NNAS 675
+ V ++G S DL K + +I + F E S ++
Sbjct: 599 LGRLVQVAVIGEGRS--------------DLQKALTNISVPGA--VSFISETGSFTEGSA 642
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+A K A VC C PV D L LL
Sbjct: 643 LAGKASIGGKSTAYVCVGPVCGMPVQDAQELRKELL 678
>gi|312115384|ref|YP_004012980.1| hypothetical protein Rvan_2669 [Rhodomicrobium vannielii ATCC
17100]
gi|311220513|gb|ADP71881.1| hypothetical protein Rvan_2669 [Rhodomicrobium vannielii ATCC
17100]
Length = 685
Score = 345 bits (885), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 231/699 (33%), Positives = 349/699 (49%), Gaps = 85/699 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE E A+L+N F++IKVDREERPDVD +YMT +Q L GGWPL++F
Sbjct: 51 CHWCHVMAHESFEKEDTAELMNRLFINIKVDREERPDVDTLYMTALQELGEQGGWPLTMF 110
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P GGTYFP + ++G+P FK +L V + ++++ +AQ+ A+ ++L+ L+
Sbjct: 111 LTPDGMPFFGGTYFPDKSRFGKPSFKDVLVNVARVYAQEKETIAQNTAYLKQRLTPRLNY 170
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-----YHSKK 196
A+ E + L A + + D GG APKFP Q + Y+ K
Sbjct: 171 GAAP-----EFSEEQLAAIAAKFIGAIDPTNGGLRGAPKFPNTTIFQFLWRAGLRYNLKT 225
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ K+ TL + +GGI+DH+GGGF RY+VDERW VPHFEKMLYD
Sbjct: 226 CIEEVKN------------TLLHICQGGIYDHLGGGFSRYTVDERWLVPHFEKMLYDNAL 273
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L + + T+ + + +L+RDMI PGG ++ DADS EG +EG
Sbjct: 274 LIEFMTEVWKETQSDRLKTRVAETIGWLKRDMIVPGGAFAASYDADS---EG----EEGK 326
Query: 317 FYVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
FYVWT++E+ DIL GE A +F + Y + GN ++GK +L L
Sbjct: 327 FYVWTAREITDILGHGEEAAIFAQTYDVTEGGN------------WEGKTILNRLK---- 370
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
+ + L E+ ++ ECR KLF R +R +P DDKV+ WNGL I + ARA
Sbjct: 371 ALALLNGGEERAMD---ECRAKLFAERERRVKPGWDDKVLADWNGLAIRALARAGDAFA- 426
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
+ +++ +A A F++ + + RL HS+R+G K P DY
Sbjct: 427 ---------------QPDWIVLAADAYGFVKSRMI--ENGRLFHSWRDGKLKGPATAADY 469
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A +IS L L++ ++L A+E + + D E GGY+ + ++LR
Sbjct: 470 ANIISAALVLHQVTGEPRYLDDAVEWTAIMNRHY-DAEQGGYYFAADDTSDLILRPLSAS 528
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D A P+ N+ + NL L ++ + Y + A+ L F+ + MA+ + A
Sbjct: 529 DDAVPNANATMLQNLADLYTLTGDAA---YLKRADGLLTAFQGAAQTMAIGYTGLLSGA- 584
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
L++ S + + + G ++ D A TV ++P + N +
Sbjct: 585 -LTLISPQSIAIAGDRAGPDAAAWRRALAEVSLPGATVQWVNP----------DENLPAS 633
Query: 675 SMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
S A + D K A +C CS P+TDP L++ L E
Sbjct: 634 SPAFGKKAIDGKTTAYICFGPRCSEPITDPAILKDRLKE 672
>gi|448474014|ref|ZP_21601982.1| hypothetical protein C461_06214 [Halorubrum aidingense JCM 13560]
gi|445818294|gb|EMA68153.1| hypothetical protein C461_06214 [Halorubrum aidingense JCM 13560]
Length = 735
Score = 345 bits (885), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 235/720 (32%), Positives = 352/720 (48%), Gaps = 86/720 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ +A +LND FV +KVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDDSIAAVLNDQFVPVKVDREERPDVDSTFMTVCQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
+ +P+ KP GTYFPPE + +PGF+ + ++ D+W ++ + S
Sbjct: 113 AWCTPEGKPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRAEQWTTSARD 172
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 189
+E + E A + + P + L A + YD +GGFGS KFP P I +
Sbjct: 173 ELESVPEPGDADDADDTGPSG--SDLLEEAAAAAIRGYDDEYGGFGSGGAKFPMPGRIDL 230
Query: 190 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 249
++ + + + A+ TL MA+GG++D +GGGFHRY+VD +W +PHFEK
Sbjct: 231 LMRAAARSGRSAALTAATG-------TLDGMARGGVYDQIGGGFHRYAVDRQWTIPHFEK 283
Query: 250 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS------ 303
MLYD +L VYLD + LT D Y+ + + L +L R++ G FS DA S
Sbjct: 284 MLYDNAELPMVYLDGYRLTGDPSYARVASESLGFLDRELRHADGGFFSTLDARSRPPAGR 343
Query: 304 ---------AETEGATRKKEGAFYVWTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSRM 353
+ EG EGA+YVWT +EV+ +L E A L K + ++ GN +
Sbjct: 344 GGGRGNDEGGDGEGDAPAVEGAYYVWTPEEVDAVLDEPASSLAKARFGIRSGGNFE---- 399
Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
+G V A + P ++ IL + R LF+ R RPRP D+KV
Sbjct: 400 -------RGTTVPTVAASIEELADEYDRPADEVREILTDARVALFEARETRPRPARDEKV 452
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
+ SWNG IS+FARA +L Y +A A +F R LYDE T
Sbjct: 453 LASWNGRAISAFARAGDVLG-----------------DSYAAIASDALAFCRDRLYDEDT 495
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
L + +G + PG+LDDYAFL G LD+Y + L +A++L + + F +
Sbjct: 496 GELARRWLDGDVRGPGYLDDYAFLARGALDVYAATGDPEPLGFALDLAESLVDAFYEAAD 555
Query: 534 GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YRQNA 588
G + T +D ++ R +E D + PS V+ L +++ G ++D +R+ A
Sbjct: 556 GTIYFTRDPDASDDDTLFARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDREFREIA 611
Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHAS--- 645
E + R++ A PL + V + +HV G + ++ + + AA +
Sbjct: 612 EAVVTTHADRIR----ASPLEHVSL----VRAAEHVETGGVEVTIAADEVPAAWRETLGE 663
Query: 646 -YDLNKTVIHIDPADTEEMDFWEEHNSNNAS--MARNNFSADKVVALVCQNFSCSPPVTD 702
Y V P D + ++ + A A + + A VC+ F+CSPP TD
Sbjct: 664 RYLPGALVAPRPPTDAGLAAWLDDLGLDEAPPIWADRDALDGEPTAYVCEGFACSPPRTD 723
>gi|417781210|ref|ZP_12428962.1| PF03190 family protein [Leptospira weilii str. 2006001853]
gi|410778461|gb|EKR63087.1| PF03190 family protein [Leptospira weilii str. 2006001853]
Length = 630
Score = 345 bits (884), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 243/694 (35%), Positives = 354/694 (51%), Gaps = 76/694 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE++ VA LN FVSIKVDREERPD+D++YM + A+ GGWPL++FL+PD K
Sbjct: 1 MEKESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P+ GGTYFPPE +YGR F IL ++ W++KR L A +LS L S
Sbjct: 61 PITGGTYFPPEPRYGRKSFLEILNILRKVWNEKRQEL----IVASSELSRYLKDSGEGRA 116
Query: 148 LPDE---LP-QNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLED 199
+ + LP +N YD+ FGGF + KFP + + +L YHS
Sbjct: 117 IEKQEGSLPSENCFDSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYYHS----- 171
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
SG +MV TL M +GGI+D +GGG RYS D W VPHFEKMLYD
Sbjct: 172 ---SGNP-RALEMVENTLLAMKQGGIYDQIGGGLCRYSTDHHWMVPHFEKMLYDNSLFLE 227
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
++ ++K + D++ YL RDM GG I SAEDADS EG +EG FY+
Sbjct: 228 TLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYI 280
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W +E ++ GE + + ++ + + GN F+GKN+L E + A+K
Sbjct: 281 WDFEEFREVCGEDSQILEKFWNVTKKGN------------FEGKNILHE--SYRSEATKF 326
Query: 380 GMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
K ++ +L R KL + RSKR RP DDK++ SWNGL I + A+A
Sbjct: 327 SEEEWKRIDSVLERGRAKLLERRSKRVRPLRDDKILTSWNGLYIKALAKAG--------- 377
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
V R++++++AE SFI ++L D R+ FR+G S G+ +DYA +I
Sbjct: 378 -------VAFQREDFLKLAEETYSFIEKNLIDPNG-RILRRFRDGESGILGYSNDYAEMI 429
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGA 557
S + L+E G G ++L A+ +D + L R G F TG D VLLR D +DG
Sbjct: 430 SSSIALFEAGCGIRYLKNAVLWM--EDAIRLFRSPAGVFFDTGSDGEVLLRRSVDGYDGV 487
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
EPS N +LV+L+ + G S Y + AE F L +++ P + A
Sbjct: 488 EPSANGSLAYSLVKLS--LFGIDSARYGEFAESIFLYFTKELSTNSLSYPHLLSAYWTYR 545
Query: 618 VPSRKHVVLVGHKSSVDF-ENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
S K +VL+ + DF +++LAA + + + ++ + EE +++
Sbjct: 546 RHS-KEIVLI--RKDTDFGKDLLAAIQTRFLPDSVLAVVNENELEEA-------RKLSTL 595
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ S + VC+NFSC PV++ L+ +
Sbjct: 596 FDSRDSGGNALVYVCENFSCKLPVSNLADLKKWI 629
>gi|404447779|ref|ZP_11012773.1| hypothetical protein A33Q_00490 [Indibacter alkaliphilus LW1]
gi|403766365|gb|EJZ27237.1| hypothetical protein A33Q_00490 [Indibacter alkaliphilus LW1]
Length = 674
Score = 345 bits (884), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 236/694 (34%), Positives = 346/694 (49%), Gaps = 89/694 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE A+L+N +FV IK+DREERPD+D +YM VQA+ GGWPL+
Sbjct: 47 SACHWCHVMEKESFEDEATAQLMNQYFVCIKIDREERPDLDNIYMDAVQAMGLQGGWPLN 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIE-QLSE 137
VFL P+ KP GGTYFP +K +L+ + +A+ + D LA+S F Q SE
Sbjct: 107 VFLMPNQKPFYGGTYFP------NAQWKALLQNIGEAYQEHYDQLAKSAEEFGNSLQTSE 160
Query: 138 ALSASASSNKL---PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
L S P EL + A++L Q +D +GG PKFP P ++ ++
Sbjct: 161 FLKYGLSHGTFQLDPKELAE-AIKLLENQ----FDLDWGGMNRKPKFPMPAIWSFVMDYA 215
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
KS E + V FTL+ + GGI+DH+ GGF RYSVD W PHFEKMLYD
Sbjct: 216 -----LAKSDEVLLAK--VFFTLKKIGMGGIYDHLRGGFARYSVDGEWFAPHFEKMLYDN 268
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
GQL ++Y A++++ + FY + + +L+ +M+ G ++A+DADS EG E
Sbjct: 269 GQLLDLYSKAYAVSGEYFYKEKILETIAWLKSEMLHKEGGFYAAQDADS---EGV----E 321
Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
G FY WT +E+E I+GE F + Y LK GN + G N+L +
Sbjct: 322 GKFYTWTYEELESIVGEDLHWFAKLYNLKYQGNWE-----------DGVNILFQTESYEK 370
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A + E Y+ L E + KL VR++R P LDDK++ WNGL+IS A L
Sbjct: 371 LAESSELSEEGYIQRLNEIKAKLLSVRNQRIFPGLDDKILSGWNGLMISGLVSAYTSLGD 430
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
E E +E++ + A+FI +Y ++ L S++NG + P FL+DY
Sbjct: 431 E----------------EALELSLNNATFILDKMYKDKV--LYRSYKNGHAYTPAFLEDY 472
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A +I G + LY+ +KWL+ A EL + E F D E G ++ + ++ KE
Sbjct: 473 AAVIRGFISLYQATLDSKWLLKAKELSDKVIEAFYDEEEGFFYFNNPQAEKLIANKKELF 532
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-- 612
D P+ NS+ NL+ L+ D Y A++ L +K + + P C
Sbjct: 533 DNVIPASNSIMARNLLDLSMFFY---EDNYAAIAKNMLGT----MKKLIIKEPGFLCNWA 585
Query: 613 ---ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
DML +P + V +VG + + A ++ + L+ + E+
Sbjct: 586 SLYLDML-LP-KAEVAIVGEGAEKLGQEFFAKRNSGFILSAS---------------EKT 628
Query: 670 NSNNASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
N+ + D + VC N SC PV+D
Sbjct: 629 NTEIPLLEGKKPDTDGNALIYVCFNRSCQRPVSD 662
>gi|77166007|ref|YP_344532.1| hypothetical protein Noc_2549 [Nitrosococcus oceani ATCC 19707]
gi|254436399|ref|ZP_05049905.1| conserved hypothetical protein [Nitrosococcus oceani AFC27]
gi|76884321|gb|ABA59002.1| Protein of unknown function DUF255 [Nitrosococcus oceani ATCC
19707]
gi|207088089|gb|EDZ65362.1| conserved hypothetical protein [Nitrosococcus oceani AFC27]
Length = 694
Score = 345 bits (884), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 234/693 (33%), Positives = 351/693 (50%), Gaps = 58/693 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESFED A ++N +F++IKVDREERPD+D++Y Q L G GGWPL
Sbjct: 53 SACHWCHVMAHESFEDSETAAVMNQYFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPL 112
Query: 79 SVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
++FL P P GGTYFPPE+++G PGFK +L++V + + +R+ + ++ +
Sbjct: 113 TMFLEPIKQAPFFGGTYFPPEERHGLPGFKDLLQRVAEYFHTRREAIQSQNERLLDAFGD 172
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
L A + ++ + L + L+ QL++++DSR GGF APKFP P I+ L ++
Sbjct: 173 -LDARLPAAEV-EGLNRAPLQAAHRQLAQAFDSRHGGFRGAPKFPNPSSIERCLRDARGE 230
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
T E + M TL+ MA+GGI+D +GGGF RYSVDE W +PHFEKMLYD GQL
Sbjct: 231 HLT--EDEKQQALTMARLTLEQMAQGGIYDQLGGGFCRYSVDEEWRIPHFEKMLYDNGQL 288
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+Y DA+ L + I + + R+M P G +S+ DADS EG EG F
Sbjct: 289 LVLYRDAYRLWGSGLFRRILEETGHWAVREMQSPEGGYYSSLDADS---EG----HEGKF 341
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT ++V +LGE Y+ + P N F+G L A A
Sbjct: 342 YVWTREQVRALLGEEEYALAARYF----------GLDQPAN-FEGYWHLYAATVPEALAQ 390
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
++ +P L ++KLF R R RP DDK++ +WNGL+I A A + L
Sbjct: 391 EMKVPAPGLQEQLTAAKQKLFAAREARIRPGRDDKILTAWNGLMIKGMAAAGQALAQ--- 447
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
PV ++ AE A F+R HL+ Q RL S+++G ++ G+LDDYAFL
Sbjct: 448 ------PV-------FIASAERAVDFVRAHLW--QKGRLLVSYKDGRAQHRGYLDDYAFL 492
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LL+L + L +A++L E F D+ GG++ T + ++ R D A
Sbjct: 493 LDALLELLQVRWRDGDLSFAVDLAEAVLERFEDKAQGGFYFTADDHEILIHRPVPLMDDA 552
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
P+GN V +L+RL ++ + Y + AE +L ++ A + +
Sbjct: 553 TPAGNGVLAWSLLRLGHLLGEVR---YLKAAESTLKAAWKSIQQTPHAHCSLLKTLEEWL 609
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+P + V+L G + E A A A Y + + I P + +++ +
Sbjct: 610 IPPQI-VILRG--GGEELETWRAVAAAEYAPRRVALAI-PLEAQDLP---------GILG 656
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
V A VC +CS P+T +L+ L
Sbjct: 657 EYRPQGTAVTAYVCSGHTCSAPLTRREALKEHL 689
>gi|418053652|ref|ZP_12691708.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
1NES1]
gi|353211277|gb|EHB76677.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
1NES1]
Length = 677
Score = 345 bits (884), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 209/596 (35%), Positives = 315/596 (52%), Gaps = 72/596 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED G A+++N+ FV+IKVDREERPD+D +YM + L GGWPL++F
Sbjct: 51 CHWCHVMAHESFEDSGTAEVMNELFVNIKVDREERPDIDAIYMGALHRLGEQGGWPLTMF 110
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L D KP GGTYFP E +YGRP F T+L ++ +A+ + D I + +EAL A
Sbjct: 111 LDSDAKPFWGGTYFPREARYGRPAFVTVLLRIAEAYQNQPDN--------IRKNTEALLA 162
Query: 142 SASSNKLPDELPQNALRLCAEQ----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ + P+E +A R + ++++ D GG APKFP+ ++ + +
Sbjct: 163 ALKES--PNETSADASRPMTKDVVAAIARAVDREHGGLSGAPKFPQWSVFWLLWRGAIRY 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+D Q+ V+ TL+ + +GGI+DH+GGGF RYSVDE W VPHFEKMLYD L
Sbjct: 221 DD-------PNAQEAVVTTLRHICQGGIYDHLGGGFARYSVDEFWLVPHFEKMLYDNALL 273
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++ + + T+D + + + +L+R+MIG G ++ DADS EG +EG F
Sbjct: 274 IDLLTEVWRETQDPIFKTRIAETVTWLKREMIGEAGGFAASLDADS---EG----EEGKF 326
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW++ E+ED+LG E A F Y + P GN F+G +L LN
Sbjct: 327 YVWSAAEIEDVLGAEDAAFFSRVYGVTPEGN------------FEGHTILNRLN------ 368
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
L + + L + R KL + R+ R RP DDK++ WNGL+I++ +RA+ + +
Sbjct: 369 -SLALLTNEEEAHLAKLRAKLLERRASRIRPGWDDKILADWNGLMIAALSRAAVVFEC-- 425
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+++ +AE A I L RL H++R G +KAP DYA
Sbjct: 426 --------------SDWLALAERAFDCIVTKLAAPDG-RLFHAYRKGLAKAPAIASDYAN 470
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ S L L+ ++L A + D+ + D + GGYF + V++R+K D
Sbjct: 471 MTSAALRLFAATGSERYLEHARQWTRILDKHYWDVQRGGYFTAADDTGDVVVRLKVASDD 530
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
A PS N++ + NL+ LA++ Q+ E + + E MA+ P+ CA
Sbjct: 531 AAPSANAIQLSNLIALAAVTGDV------QHHERARQLLEAFAPAMALG-PIGHCA 579
>gi|448731719|ref|ZP_21714012.1| hypothetical protein C450_00645, partial [Halococcus salifodinae
DSM 8989]
gi|445805618|gb|EMA55820.1| hypothetical protein C450_00645, partial [Halococcus salifodinae
DSM 8989]
Length = 580
Score = 345 bits (884), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 200/565 (35%), Positives = 295/565 (52%), Gaps = 43/565 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE VA+ LND FV IKVDREERPD+D++Y T + G GGWPLS
Sbjct: 52 SACHWCHVMEDESFEDERVAERLNDEFVPIKVDREERPDLDRLYQTICGMVSGQGGWPLS 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEA 138
V+L+PD +P GTYFP ++K G+PGF +L + ++W+ R D+ ++ +A E
Sbjct: 112 VWLTPDGRPFYVGTYFPRDEKRGQPGFLDLLDSIAESWENDREDIEGRADQWAGAMAGEL 171
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ ++PD + L A+Q ++ D +GGFG KFP+ + +++ + E
Sbjct: 172 EATPEQPGEVPD---SDLLETAAQQAVENADREYGGFGHGQKFPQTGRLHLLM---RAAE 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG+ ++ L M++GG+ DH GGGFHRY+ D W VPHFEKMLYD +L
Sbjct: 226 RTGRES----FDEVAHEALDAMSEGGLRDHAGGGFHRYTTDREWTVPHFEKMLYDNAELT 281
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
YL + T Y+ + R+ L ++ R++ P G FS DA S + G ++EGAFY
Sbjct: 282 RAYLAGYRRTGAERYAEVARETLGFVERELRHPDGGFFSTLDAQSEDESG--EREEGAFY 339
Query: 319 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
VWT V D + + A LF E Y + GN + GK VL + A
Sbjct: 340 VWTPNGVHDAVDDEFAADLFCERYGVTEAGNFE-----------DGKTVLTVSTEIEDLA 388
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ E+ L R +F R++R RP D+KV+ WNGL+IS+FA A L +
Sbjct: 389 DEHDTTTEEVSAELERAREAVFAARAERERPERDEKVLAGWNGLMISAFAEAGLALDA-- 446
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ + A + F+ HL++++ RLQ +++G K G+L+DYAF
Sbjct: 447 ---------------RFADTAVAGIEFVHEHLWNDEKRRLQRRYKDGDVKIEGYLEDYAF 491
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L G L+ YE L +A++L + F D + + T S++ R +E D
Sbjct: 492 LARGALNCYEATGEVDHLAFALDLARAIETEFWDSDEETLYFTPQTGESLVARPQELDDQ 551
Query: 557 AEPSGNSVSVINLVRLASIVAGSKS 581
+ PS V+V L+ L A S
Sbjct: 552 STPSSTGVAVDVLLALDHFAADRPS 576
>gi|317470765|ref|ZP_07930149.1| hypothetical protein HMPREF1011_00496 [Anaerostipes sp. 3_2_56FAA]
gi|316901754|gb|EFV23684.1| hypothetical protein HMPREF1011_00496 [Anaerostipes sp. 3_2_56FAA]
Length = 679
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 236/699 (33%), Positives = 349/699 (49%), Gaps = 85/699 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVME ESFED VA+LLN F+SIKVDREERPD+D VYM+ QA+ G GGWP+SV
Sbjct: 53 SCHWCHVMEEESFEDHEVAELLNKHFISIKVDREERPDIDSVYMSVCQAMTGSGGWPMSV 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++PD KP TY P +Y G +L ++ W + R+ L + G + L+
Sbjct: 113 FMTPDQKPFFAATYLPKTSRYHLTGLMDLLPRISLLWKQDRERLLKIGNEITDHLNTDQR 172
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S + + L +++P AL L+ S+D+ GGFG+APKFP P + ++ K D
Sbjct: 173 PSETVS-LSEDVPAQAL----ADLNASFDNVNGGFGTAPKFPTPAVLLFLIQQYKLCGD- 226
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+ M TL M +GGI DH+GGGF RYS D+RW VPHFEKMLYD L
Sbjct: 227 ------KDSLAMAEHTLLRMYRGGIFDHIGGGFSRYSTDDRWLVPHFEKMLYDNALLLEA 280
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A++ ++ + I ++ + ++ P G + ++DADS EG +EG +Y +
Sbjct: 281 YAEAYACCENPLFPEIADAVVSCVLNELSHPDGGFYCSQDADS---EG----EEGKYYTF 333
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
T EV +LG E+ LF C L ++D N F+GK++ L S
Sbjct: 334 TRDEVLHVLGEENGSLF-----------CSLYDITDRGN-FEGKSIPNLLKQSPFPNDHE 381
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G L +R L+ R KR D K++ SWN L+IS+ +AS+I
Sbjct: 382 G---------LKRMKRTLYLYRKKRTSLSTDKKILTSWNCLMISALTKASRIF------- 425
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
R++++ A+ A SF+ +HL + RL + +G + G L+DYAF
Sbjct: 426 ---------GREKFLAAAQKAESFLDKHLRKDDG-RLFLRWCDGEAAYDGQLEDYAFYSL 475
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+L LY ++L A++ + LF DRE GG+F + E +++L+ KE +DGA P
Sbjct: 476 SMLSLYRSTFLEEYLEKAVQAADLMISLFFDREHGGFFLYSSESEALILKPKELYDGAMP 535
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV- 618
SGNS ++ L L+ I S YR + + + F L A C A +LS
Sbjct: 536 SGNSAALHVLFILSKITGKS---IYRDCMDQTFSYFSPELSVHPSAY---CYALSVLSSQ 589
Query: 619 --PSRKHVVLVGHKS-SVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
PSR+ V+ +S F +L+ +N + + E++ A+
Sbjct: 590 FHPSRQLVITTKKESLPKKFMELLSKPQ----MNDFTVLVKT---------EQNKDTLAA 636
Query: 676 MA----RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A ADK +C+ +C PV D SLE LL
Sbjct: 637 IAPFTKEYPVLADKTSCYLCRGGACQAPVFDAESLETLL 675
>gi|288956849|ref|YP_003447190.1| hypothetical protein AZL_000080 [Azospirillum sp. B510]
gi|288909157|dbj|BAI70646.1| hypothetical protein AZL_000080 [Azospirillum sp. B510]
Length = 685
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 232/691 (33%), Positives = 335/691 (48%), Gaps = 80/691 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE+ +A L+N+ F++IKVDREERPD+D +Y + + L GGWPL++F
Sbjct: 51 CHWCHVMAHESFENPEIAGLMNELFINIKVDREERPDLDTIYQSALALLGQQGGWPLTMF 110
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD +P GGTYFPP +YGR GF +LR + + + D + ++ +E L AL+
Sbjct: 111 LTPDAEPFWGGTYFPPAQRYGRAGFPDVLRGIAGTYTDEPDKVGKN----VEALRSALAG 166
Query: 142 SAS--SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S + L A++L + D GG GSAPKFP+ V + +L+ + +
Sbjct: 167 IGENRSAGAAGTIDAGMLDQVAQRLLREVDPIHGGIGSAPKFPQ-VPLFELLWRAWR--R 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ + V TL MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD +L +
Sbjct: 224 TGR----EPFRDAVTHTLANMAQGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNAELLD 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+ + T+D R+ + +L R+MI GG + DADS EG +EG FY+
Sbjct: 280 LMTLVWQETRDPLLETRIRETVGWLLREMIAEGGGFAATLDADS---EG----EEGLFYI 332
Query: 320 WTSKEVEDILG-----EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
W +EV+ +LG + FK Y + P GN ++G +L L +
Sbjct: 333 WREEEVDRLLGPALGADGLATFKRVYEVLPQGN------------WEGVTILNRLGGLTP 380
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
+ E +L + R L R+KR RP DDKV+ WNGL+I++ A+
Sbjct: 381 AD-------ESTEAMLAKGREALSRARAKRVRPGWDDKVLADWNGLMIAALTHAALA--- 430
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
D E+++ A A +F+R + + RL HS+R+G K G LDDY
Sbjct: 431 -------------LDEPEWLDAAGRAFAFVRDRM--DSGGRLCHSWRHGQGKHAGMLDDY 475
Query: 495 AFLISGLLDLYEFGSGTKWL----VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
A + L L+E L VWA L D F D GGYF T + +++R
Sbjct: 476 AHMARAALALHEATGDPAALDQAKVWAAAL----DAHFWDDANGGYFFTADDAEGLIVRT 531
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
K +D A PSGN L L + + D YR AE F L +P
Sbjct: 532 KTAYDNATPSGNGTM---LAVLTILFQRTGEDAYRDRAEALATAFSGELTRNFFPLPTFL 588
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A ++++ P +V+VG + + E + N+ + + P D H
Sbjct: 589 NAVELMTAP--LQIVIVGPPRTAETEALRRTVLDRSLPNRILTVLAPKGDFPADLPAGHP 646
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ M A VC+ +CS PVT
Sbjct: 647 AQGKGMRDGT-----ATAYVCRGMTCSAPVT 672
>gi|355673311|ref|ZP_09058908.1| hypothetical protein HMPREF9469_01945 [Clostridium citroniae
WAL-17108]
gi|354814777|gb|EHE99376.1| hypothetical protein HMPREF9469_01945 [Clostridium citroniae
WAL-17108]
Length = 688
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 225/632 (35%), Positives = 326/632 (51%), Gaps = 97/632 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED+ +A++LN FV +KVDREERP++D VYM+ QA+ G GGWPL+
Sbjct: 48 STCHWCHVMAHESFEDKEIARILNTHFVPVKVDREERPEIDMVYMSVCQAMTGRGGWPLT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ------------- 126
+ ++PD KP GTY PP +YG G +L KV W+ R+ L Q
Sbjct: 108 IIMTPDKKPFFAGTYLPPRSRYGMTGLTELLEKVSGLWETDREQLLQMSRQVMSLIHGRE 167
Query: 127 -SGAFAIEQLSEALSASASS-NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP 184
+GA + + + + ++ ++ D + ++LS +D + GGFG APKFP P
Sbjct: 168 GNGADGMGTAGDGMDGTGTAGDRTEDSVSWELAHEGFKELSAMFDKKHGGFGRAPKFPAP 227
Query: 185 VEIQ-MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 243
+ +M+Y++ + ED M TL MA+GGIHD +GGGF RYS DE W
Sbjct: 228 HNLLFLMMYYAARDED--------HAMDMAEQTLTAMARGGIHDQIGGGFSRYSTDEAWL 279
Query: 244 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS 303
VPHFEKMLYD LA YL+ + LT + +Y I IL Y+ R++ G + +DADS
Sbjct: 280 VPHFEKMLYDNALLALAYLEGYRLTDNPYYRQIAERILIYVERELSDSDGGFYCGQDADS 339
Query: 304 AETEGATRKKEGAFYVWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNEFK 361
EG EG FYV++ E+ IL F + + + GN F+
Sbjct: 340 ---EGV----EGKFYVFSKDEIRQILDTPREYDDFCQWFGITEKGN------------FE 380
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
GKN+ L++ + +G +K++D R KR H DDK++ SWN ++
Sbjct: 381 GKNIPNLLHNPGYKDT---------FPFMGPVCKKVYDHRIKRMALHRDDKILTSWNSMM 431
Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
I+++A+A +L D+K Y + A +A F+ +HL DE HR+ +R
Sbjct: 432 ITAYAKAGLLL----------------DQKAYEKKARNAQMFVEQHLVDE-NHRMFVRYR 474
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD-REGGGYFNTT 540
+G PG LDDYA+ GLL LYE +L A++ +LF D R+GG YF
Sbjct: 475 DGERAFPGNLDDYAYYCLGLLALYEATLEVDYLELALKRAAQMADLFWDSRQGGFYF--Y 532
Query: 541 GEDPSVLL-RVKEDHDGAEPSGNSVSVINLV-----------------RLASIVAGSKSD 582
G D L+ R KE +DGA PSGNS + L+ +LA + AG+K
Sbjct: 533 GRDVQELIHRPKEIYDGAVPSGNSAAAHVLLALASLTAEPRWQEFADRQLAFLAAGAKG- 591
Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
Y SL F +K ++++ L+C +AD
Sbjct: 592 -YPSAHCFSLMAF---MKALSISRELVCVSAD 619
>gi|257092092|ref|YP_003165733.1| hypothetical protein CAP2UW1_0453 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257044616|gb|ACV33804.1| protein of unknown function DUF255 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
Length = 734
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 226/638 (35%), Positives = 335/638 (52%), Gaps = 71/638 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + R FL +TCHWCHVME ESFEDE +A+ LN +V+IKVDREERPD
Sbjct: 72 GDEAFAEARRLGRPVFLSIGYSTCHWCHVMEAESFEDEAIARFLNRHYVAIKVDREERPD 131
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPED--KYGRPGFKTILRKVKDA 116
+D VYM+ VQ L G GGWP+SV+L+ +P GGTYFPP D + G+ GF +L + D
Sbjct: 132 IDAVYMSAVQQLTGAGGWPMSVWLTAAREPFFGGTYFPPRDGGRDGQRGFLPLLGALSDT 191
Query: 117 WDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE--LPQ-NALRLCAEQLSKSYDSRFG 173
+ + + + Q+ +E + + + + LP + + +S+D+R G
Sbjct: 192 FHRDPERVGQACTALVEAIRHDMQGAYGTGGADAAIGLPAGDVIDATVAHYRQSFDARHG 251
Query: 174 GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 233
G APKFP + ++++L + ++ D ++ +M TL+ MA GG++D +GGGF
Sbjct: 252 GLSRAPKFPSHIPVRLLLRYHQRTGD-------ADALRMATLTLEKMAAGGLYDQLGGGF 304
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRYS D RW VPHFEKMLYD L Y +AF +T ++ + R+ DY+ R+M GG
Sbjct: 305 HRYSTDVRWLVPHFEKMLYDNALLVVAYAEAFQVTDRADFARVARETCDYILREMTDAGG 364
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVE---DILGEHAIL--FKEHYYLKPTGNC 348
+SA DADS EG +EG F+VW E+ D LG+ F HY + P GN
Sbjct: 365 GFYSATDADS---EG----EEGRFFVWREDEIRRELDALGDGDTTEHFLAHYDVHPGGN- 416
Query: 349 DLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH 408
++G +L + P E L R +L+ VR++R P
Sbjct: 417 -----------WEGHTIL-----------NVPRPDEAAWEALAAARARLYAVRARRTPPL 454
Query: 409 LDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHL 468
D+K++ WNGL+IS+ A A ++L D Y+ A AA F+ HL
Sbjct: 455 RDEKILAGWNGLMISALAVAGRVL----------------DAPRYVAAAVRAADFVLTHL 498
Query: 469 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 528
L+ SF++G ++ FLDD+AFL +GL+DLYE + L A+ L T + LF
Sbjct: 499 RGADGG-LRRSFKDGQARQAAFLDDHAFLAAGLIDLYEATFDVRHLRDALALAETTEHLF 557
Query: 529 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
D G +F ++ S++ R K +DGAEPSG SV+++N +RL + + + +RQ A
Sbjct: 558 AD-PAGAWFMSSEAHESLIAREKPAYDGAEPSGTSVALLNALRLGVL---TDDERWRQIA 613
Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
E L L + +A+ A D L+ R+ V+
Sbjct: 614 ERGLRAHARVLGERPIAMTEALLAVDFLATTPRQIAVV 651
>gi|303245350|ref|ZP_07331634.1| protein of unknown function DUF255 [Desulfovibrio fructosovorans
JJ]
gi|302493199|gb|EFL53061.1| protein of unknown function DUF255 [Desulfovibrio fructosovorans
JJ]
Length = 702
Score = 343 bits (881), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 241/693 (34%), Positives = 333/693 (48%), Gaps = 50/693 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE +A L+ V+IKVDREERPD+D +YMT+ QAL G GGWPL+
Sbjct: 51 STCHWCHVMERESFEDEDIAALMRAIVVAIKVDREERPDLDTLYMTFCQALTGRGGWPLN 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFP E +GR G + +L++V AW R + + A + + + +
Sbjct: 111 VFLTPDGEPFFAGTYFPKESGFGRTGMRELLQRVHMAWKSNRQAVIGNAAQLLGAVRDQI 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + E L +L+ S+D GGFGSAPKFP P +L ++
Sbjct: 171 TARDGTGAA--EPGTVELEAATGELAASFDVENGGFGSAPKFPAP---HNLLLLLREYRR 225
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG + MV TL M +GG++DHVG GFHRYS D W VPHFEKMLYDQ
Sbjct: 226 TGN----KDLLAMVTATLSAMRRGGVYDHVGFGFHRYSTDAGWLVPHFEKMLYDQALCVM 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
++A+ T +V+ + L+Y+RRD+ P G +SAEDADS EG EG FYV
Sbjct: 282 ACVEAWQATGEVWLKDTALEALEYVRRDLTSPDGVFYSAEDADS---EGV----EGKFYV 334
Query: 320 WTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT E+ + L E A L + Y ++ TGN + G N+L +A+
Sbjct: 335 WTEAEIREALPPEDAQLVVDVYGVEATGNF----RDEATGVATGTNILHLPRSLEDAAAG 390
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G + L CR L VR KR RP DDKV+ NG + +
Sbjct: 391 RGTSVAALAARLETCRAALLAVREKRARPLCDDKVLTDNNG---------LMLAALAKAA 441
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
FN D +A + + E RL H R G + G LDDYAF
Sbjct: 442 RAFN------DEALAARAVAAADFLLEKMALPED--RLLHRLRQGEAAVAGMLDDYAFFA 493
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++LY+ ++L A L F D GG+F + + S+LLR K +D A
Sbjct: 494 WGLVELYQTVFAPRYLERAAALAKAMIAHFGD-GAGGFFLSPDDGESLLLRQKTFYDAAV 552
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGNSV+ L L + G KS +R+ A R+ + C+ +
Sbjct: 553 PSGNSVAFFVLTTLFRLT-GEKS--FREEAAKLAKAAGGRVAEHPSGYAFFLCSLSQMLA 609
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P+ V L G + D + + Y L + + + PA ++ + + A R
Sbjct: 610 PA-AEVTLAGDPDAADTQVLARTIFDRY-LPEVAVVLRPAGEDDPEI-----AAIAPFTR 662
Query: 679 NNFSADKVVAL-VCQNFSCSPPVTDPISLENLL 710
D A VC+ SC PP D +L L+
Sbjct: 663 FQLPLDGAAAAHVCRAGSCQPPTADAATLLELI 695
>gi|451980948|ref|ZP_21929330.1| conserved hypothetical protein, contains Thioredoxin domain
[Nitrospina gracilis 3/211]
gi|451761870|emb|CCQ90575.1| conserved hypothetical protein, contains Thioredoxin domain
[Nitrospina gracilis 3/211]
Length = 697
Score = 343 bits (881), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 236/695 (33%), Positives = 341/695 (49%), Gaps = 64/695 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED +A+ LN FV IKVDREERPDVD +YM VQA GGWPL+V
Sbjct: 54 TCHWCHVMERESFEDPEIAEYLNAHFVPIKVDREERPDVDSIYMKSVQAFGQQGGWPLNV 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++PD P GGTY+P +YG P F +L + W ++ + + + I L +
Sbjct: 114 FVTPDGVPFYGGTYYPSVGRYGLPSFLEVLTFLDKTWREEPEKVEKQSTALINYLKDVSK 173
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGG--FGSAPKFPRPVEIQMMLYHSKKLE 198
++ D+L + E ++SYD G F KFP + + ++L H +
Sbjct: 174 QEQNTEGTVDDLGFHGENKTREFYTQSYDRLHHGFLFQQQNKFPPSMGLSLLLRHHHRTG 233
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D + +MV TL+ M +GGI+D +GGG RYS D +W VPHFEKMLYD G
Sbjct: 234 D-------ALSLEMVENTLRAMKQGGIYDQIGGGLARYSTDHQWLVPHFEKMLYDNGLFV 286
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++ + +T ++ D+L Y+ RDM G +SAEDADS EG EG FY
Sbjct: 287 TALIETYQVTGKREFADYANDVLQYIDRDMTSAEGAFYSAEDADS---EGV----EGKFY 339
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT +E+E +LG E A + +Y + P GN ++GKN+L A
Sbjct: 340 VWTQEEIEKVLGRETASIAIPYYNVLPNGN------------WEGKNILHVKRPPEQIAK 387
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
LG+PL+ + E R KL VRS+R RP LDDK++ SWNGL+I + A+ ++L
Sbjct: 388 DLGLPLDHVEAKIAEAREKLLAVRSQRIRPLLDDKILTSWNGLMIRAMAQVGRVL----- 442
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
D + + AE A FI +L + +L +R G ++ G+L DY +
Sbjct: 443 -----------DDADRIAKAEKALHFIWNNLRTPEG-KLLRRWREGEARYDGYLCDYTSI 490
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
DLYE ++ A L T +E F ++ G Y+ T + +++R +DG
Sbjct: 491 ALACCDLYEATYNPDYINKAEALMKTVEEKFGNQ--GAYYETASDAEELIVRQVSGYDGV 548
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
EPSGNS + + L++LA++ DY R+ AE F + + + M A L
Sbjct: 549 EPSGNSSAAMALLKLAALT--QNVDYERR-AEKIFLAFSDEVTEYGINSSFMMQALH-LY 604
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHID-PADTEEMDFWEEHNSNNAS 675
+ K V + G S + + N +D AD + +
Sbjct: 605 LGGCKQVAVRGVNSDKGLDAFWPLMRRRFFPNAVFAFSLDGDADAQRVPL---------- 654
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A K A VCQ+ SC PPVT L+NL+
Sbjct: 655 LAGKESLQGKTTAYVCQHGSCLPPVTQVTELKNLV 689
>gi|363583054|ref|ZP_09315864.1| hypothetical protein FbacHQ_16672 [Flavobacteriaceae bacterium
HQM9]
Length = 705
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 226/700 (32%), Positives = 349/700 (49%), Gaps = 82/700 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFED VA ++N FV+IK+DREERPD+D+VYM+ VQ + G GGWPL+V
Sbjct: 80 CHWCHVMEHESFEDSTVAAVMNTNFVNIKIDREERPDIDQVYMSAVQLMTGRGGWPLNVI 139
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTYFP ++ G L++++ ++ L + +L+E + +
Sbjct: 140 ALPDGRPVWGGTYFPKDEWMGA------LKQIQKIYEDNPAKLEEYAT----KLTEGIQS 189
Query: 142 SASSNKLPDEL--PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ P+ L ++ + +K +D + GG APKF P +L ++ +
Sbjct: 190 VSLVKPNPNTLIFEKDTIENAVANWAKKFDYKKGGLDYAPKFMMPNNYHFLLRYAHQ--- 246
Query: 200 TGKSGEASEGQK-MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
A+E K V+ TL ++ GG++DHVGGGF RYS DE+WHVPHFEKMLYD QL
Sbjct: 247 -----SANEKLKEYVITTLNQISYGGVYDHVGGGFARYSTDEKWHVPHFEKMLYDNAQLV 301
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++Y DA+ +TK+ +Y + + LD++ R++ G +S+ DADS G + +EGAFY
Sbjct: 302 SLYSDAYLITKNDWYKQVVYETLDFVARELTNDEGAFYSSLDADSLTPSG--KLEEGAFY 359
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VW +E LGE LFK++Y + G + HN + VLI + K
Sbjct: 360 VWQKPALETALGEDFPLFKDYYNINTYGLWE-------HNNY----VLIRKESDANFVEK 408
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
M ++ +L + ++ L +RSKR RP LDDK + SWN L++ +A A ++
Sbjct: 409 HEMEMDAFLQKQKKWKQLLLGIRSKRERPRLDDKTLTSWNALMLKGYADAYRVF------ 462
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIR-RHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
D ++++ A + A FI+ + L + + +L H+++NG S G+L+DYA
Sbjct: 463 ----------DNAKFLKAALANAEFIKTKQL--KGSGQLMHNYKNGKSTINGYLEDYAAT 510
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I + LY+ +WL + ++ + F D YF T+ ED +++ R E D
Sbjct: 511 IEAFIALYQVTFDQQWLDLSKKMIDYVHTHFYDSASEMYFFTSDEDAALVTRNIESSDNV 570
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA----VPLMCCAA 613
P+ NS+ NL L+ S DY + + L +T + + + LM
Sbjct: 571 IPASNSIMAKNLYHLSHYY--SNKDYLVR-SRKMLHNIQTNITEYPSGYSNWLDLMLNFT 627
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
D VV++G + E A Y NK + A T+
Sbjct: 628 DDFY-----EVVIIGAAA----EEKRVAVQQKYYPNKIMAGSATASTQ------------ 666
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
+ N FS +C N +C PVT+ NLL EK
Sbjct: 667 -PLLLNRFSDTDTHIFICVNNACKYPVTEVSEAFNLLNEK 705
>gi|448455362|ref|ZP_21594542.1| hypothetical protein C469_02259 [Halorubrum lipolyticum DSM 21995]
gi|445813964|gb|EMA63937.1| hypothetical protein C469_02259 [Halorubrum lipolyticum DSM 21995]
Length = 747
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 240/729 (32%), Positives = 349/729 (47%), Gaps = 95/729 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA +LN+ FV +KVDREERPDVD +MT Q + GGGGWPLS
Sbjct: 53 SSCHWCHVMAEESFEDESVAAVLNESFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAF 130
+ +P+ +P GTYFPPE + +PGF+ + ++ D+W ++ D S
Sbjct: 113 AWCTPEGEPFYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRADQWTTSARD 172
Query: 131 AIEQL---------SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APK 180
+E + +A S + PD L + A + YD +GGFGS K
Sbjct: 173 ELESVPDSGPVGGAGDAGDMSGAEAPGPDLLDEAAAAAI-----RGYDDEYGGFGSGGAK 227
Query: 181 FPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 240
FP P I ++L K TG++ + TL MA+GG++D VGGGFHRY+VD
Sbjct: 228 FPMPGRIDVLLRAYAK---TGRNAALT----AATGTLDGMARGGMYDQVGGGFHRYAVDR 280
Query: 241 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAED 300
+W VPHFEKMLYD +L YLDA LT D Y+ + + L +L R++ G FS D
Sbjct: 281 QWTVPHFEKMLYDNAELPMAYLDAHRLTGDASYARVANETLGFLDRELRHDEGGFFSTLD 340
Query: 301 ADS---------AETEGATRKK-----EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPT 345
A S A ++G+ R EGAFYVWT EV+ +L E A L K+ Y ++
Sbjct: 341 ARSRPPASRRGDAGSDGSGRDDDANDVEGAFYVWTPGEVDAVLDEPAASLAKDRYGIESG 400
Query: 346 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 405
GN + +G V + A M + L R LF+ R RP
Sbjct: 401 GNFE-----------RGTTVPTIAASVAELAEAHDMSTDDVRETLTAARVALFEARESRP 449
Query: 406 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 465
RP D+KV+ SWNG IS+FA A ++L + Y ++A A +F R
Sbjct: 450 RPARDEKVLASWNGRAISAFAAAGRVLG-----------------EPYADIASDALAFCR 492
Query: 466 RHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQD 525
LYDE+T L + +G + PG+LDD+AFL G LD Y + L +A++L T
Sbjct: 493 ERLYDEETGALARRWLDGDVRGPGYLDDHAFLARGALDAYSATGDPEALGFALDLAETIV 552
Query: 526 ELFLDREGGG-YFN-----TTG--EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 577
F D E G YF T G D ++ R +E D + PS V+ L +++
Sbjct: 553 SDFYDEEDGTIYFTRDPDETAGGDGDDTLFARPQEFTDRSTPSSLGVAAETL----ALLD 608
Query: 578 GSKSDY-YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFE 636
G ++D + + AE + R++ + + AAD ++ V + +
Sbjct: 609 GFRTDREFAEVAERVVTTHADRIRASPLEHVSLVRAADRVAS-GGIEVTVATDAVPEAWR 667
Query: 637 NMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS---MARNNFSADKVVALVCQN 693
L + L ++ P + + W + + + A + + A VC+
Sbjct: 668 ETLGERY----LPGALVAPRPPTEDGLAAWLDRLGMDEAPPIWADRDAVDGEPTAYVCEG 723
Query: 694 FSCSPPVTD 702
+CSPP TD
Sbjct: 724 RTCSPPETD 732
>gi|76802617|ref|YP_327625.1| hypothetical protein NP3966A [Natronomonas pharaonis DSM 2160]
gi|76558482|emb|CAI50074.1| YyaL family protein [Natronomonas pharaonis DSM 2160]
Length = 698
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 231/692 (33%), Positives = 330/692 (47%), Gaps = 62/692 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF+D A +LN+ FV IKVDREERPDVD VYM Q + G GGWPLSV+
Sbjct: 50 CHWCHVMADESFDDPDTADVLNEHFVPIKVDREERPDVDNVYMQVCQMVRGSGGWPLSVW 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEAL 139
L+P+ KP GTYFPPE PGFK++L + +AWD ++R L Q +Q + ++
Sbjct: 110 LTPEGKPFHVGTYFPPEPTKNTPGFKSVLEDIAEAWDDTERRQQLEQQA----DQWATSI 165
Query: 140 SASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S+ P P + L A + D GG+G KFP P I ++L ++
Sbjct: 166 SSELEDTPEPVAEPPGEEFLDTAANAAVGNADREHGGWGRGQKFPHPGRIHLLLCAYQQT 225
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ A E TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD ++
Sbjct: 226 DRETYRDVAVE-------TLDAMASGGLYDHVGGGFHRYCVDREWTVPHFEKMLYDNAEI 278
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+L + +T D Y+ I + ++ R++ P G +S DA+S ++ G ++EGAF
Sbjct: 279 PRAFLAGYQVTGDDRYAEIVAETFAFVDRELTHPDGGFYSTLDAESEDSTGT--REEGAF 336
Query: 318 YVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
YVWT + V + A LF E Y + GN + VL E
Sbjct: 337 YVWTPEVVAAAVDNETDAELFCERYGVTDAGNFE-----------NATTVLTESRPPEEL 385
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A++ M + R +LF+ R++R RP D+KV+ WNGL+IS+ A + +L
Sbjct: 386 AAERVMDTATVEERIERAREQLFESRAERSRPPRDEKVLAGWNGLMISALAEGALVLD-- 443
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
EY + A +A SF R L+DE L F G G+L DYA
Sbjct: 444 ---------------PEYADDAAAALSFCREQLWDETEEVLNRRFEGGTVGIDGYLQDYA 488
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG-YFNTTGEDPSVLLRVKEDH 554
FL G LDLY+ + L +A+ L F D + G YF G D S+L R ++
Sbjct: 489 FLGRGALDLYQATGDVEQLSFALSLGRVIQSEFYDADAGTLYFTAEGGD-SLLARPQQLA 547
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAA 613
D + PS V+V L RLA+ + D AE + + L+ ++ L+ A
Sbjct: 548 DSSTPSSTGVAVELLSRLAAFDPDAGFD---DVAETVIETHASTLESNPLSHTSLVAAAH 604
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW---EEHN 670
D S R + + + LA + L ++ P + +D W + +
Sbjct: 605 D--SAAGRIELTVAAADLPETWRTSLAETY----LPGRLLSRRPPTDDGLDPWLAALDVD 658
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
A + + C++F+CSPP D
Sbjct: 659 DVPPIWANRDAKDGEPTVYACRSFTCSPPKHD 690
>gi|375150037|ref|YP_005012478.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361064083|gb|AEW03075.1| hypothetical protein Niako_6853 [Niastella koreensis GR20-10]
Length = 685
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 204/571 (35%), Positives = 295/571 (51%), Gaps = 71/571 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME ESFE+E A ++N F+++K+DREERPD+D +YM VQA+ G GGWPL++
Sbjct: 52 ACHWCHVMEKESFENEETASMMNAHFINVKIDREERPDLDHIYMDAVQAMTGSGGWPLNI 111
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-----------MLAQSGA 129
FL+PD +P GGTYFPP+ Y RP + +L V +AW +KRD + QS +
Sbjct: 112 FLTPDGRPFYGGTYFPPKAIYNRPSWHDVLTGVANAWTEKRDDIDAQATNLTGHIVQSNS 171
Query: 130 FAIEQLSEALSASA-SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 188
F + + ++ A S ++ D + N + + D GGFGSAPKFP+ I
Sbjct: 172 FGQQAVEGDINMDALFSKEIADTMFNNIM--------GTADKEEGGFGSAPKFPQTFTIG 223
Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
+L + K + +A +L M +GG++DH+GGGF RYS D W VPHFE
Sbjct: 224 YLLRYYHKTGNEQALAQAC-------LSLDKMIRGGLYDHLGGGFARYSTDREWLVPHFE 276
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYD L +V DA+ LT+ Y + L ++ R++ P +SA DADS EG
Sbjct: 277 KMLYDNALLVSVLCDAWQLTQQPLYKQAVEETLAFVERELHSPEKGFYSALDADS---EG 333
Query: 309 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGN---CDLSRMSDPHNEFKGKNV 365
EG FYVW+ E+E IL + A +F Y + GN ++ + P +F N
Sbjct: 334 V----EGKFYVWSKPEIEAILQQDAAVFCAFYDVTEGGNWEHTNILNIRKPLKQFAADN- 388
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
+P + +L + R KL R+ R RP LDDK+++ WN L+ +++
Sbjct: 389 --------------NIPEARLQELLQQGREKLLQHRAGRIRPQLDDKILLGWNALMNTAY 434
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
++A + F P +Y EVAE FI + H+++ +
Sbjct: 435 SKAYSV---------FGNP-------QYAEVAEENMKFIMNR-FTRDGLEFFHTYKKEIA 477
Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE-DP 544
+ P FLDDYA+LI L+ L E +L A L + F EG GYF T +
Sbjct: 478 RYPAFLDDYAYLIQALIHLQEITGKAAYLYKAKALTQQVIDQF-SEEGTGYFFYTHQGQQ 536
Query: 545 SVLLRVKEDHDGAEPSGNSVSVINLVRLASI 575
V++R KE +DGA PSGN++ NL L +
Sbjct: 537 DVIVRKKEVYDGAIPSGNAIMAFNLQYLGVV 567
>gi|392399485|ref|YP_006436086.1| thioredoxin domain-containing protein [Flexibacter litoralis DSM
6794]
gi|390530563|gb|AFM06293.1| thioredoxin domain protein [Flexibacter litoralis DSM 6794]
Length = 712
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 223/705 (31%), Positives = 344/705 (48%), Gaps = 77/705 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+E VAK +N+ F+ IKVDREERPDVD +YM VQ + GGWPL+VF
Sbjct: 49 CHWCHVMEHESFENEDVAKAMNENFICIKVDREERPDVDAIYMEAVQMMGVSGGWPLNVF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+ D KP GGTYFP ++ + I+ ++ + KR+ + +S + LS +
Sbjct: 109 LTSDAKPFWGGTYFPAKE------WIDIVEQIGKTYKNKRNEVEESANKVTKVLSISTLE 162
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ + D + L + L K +D+ FGG G APKFP P +L + L+
Sbjct: 163 RYNLKDVSD-FDDSILAKAFQSLEKKFDTEFGGIGEAPKFPMPSYYLFLLRYYDYLDKNN 221
Query: 202 KSGEASEGQK-----MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ + K + TL M +GGI+D +GGGF RYSVD+ W PHFEKMLYD Q
Sbjct: 222 QDQNITNPTKNKILSQIHLTLNKMDQGGIYDQIGGGFARYSVDKEWFAPHFEKMLYDNAQ 281
Query: 257 LANVYLDAFSLTKDVFYSYICRDIL----DYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
L ++Y +A+++T+D ++ ++I+ ++L R++ G ++A DADS EG
Sbjct: 282 LLSLYAEAYTITEDKVQKHVYKEIIEQTTEFLTRELQDKNGGFYAALDADS---EG---- 334
Query: 313 KEGAFYVWTSKEVEDILGEHAI-----------LFKEHYYLKPTGNCDLSRMSDPHNEFK 361
KEG FY WT E+E + H LFK++Y + GN PH +
Sbjct: 335 KEGKFYTWTIDEIEQVFTNHTFSTSINQEEDLQLFKKYYSITAIGN-----WQSPHAT-E 388
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
G N+L N A + + L + E + L ++R + P LDDK++ SWN L+
Sbjct: 389 GANILYRNNTDEEFAQENNIELNNLKCKVKEWQNYLLEIRKTKVSPSLDDKILTSWNALL 448
Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH-----RL 476
I F + L + K+Y+ +A A FI ++L+D+Q +L
Sbjct: 449 IKGFCNSYSSL----------------NDKKYLNLALQTAEFIEKNLFDKQNTKNNKLKL 492
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG-G 535
H+F++G ++ GFL+DYA LI + LY+ KWL+ A EL F D+E
Sbjct: 493 HHTFKDGTAEIDGFLEDYALLIESYIALYQVCFDEKWLLRADELTKYVFTNFYDKEEKLF 552
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
YF E ++ + KE D S NSV NL L ++ +++ Y++ ++ L+
Sbjct: 553 YFTNQNESEKLVAQKKELFDNVISSSNSVMATNLYFLGILL---ENNLYKETSKEMLSKV 609
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
+ + V P+ + +VG K ++ +L + Y NK ++
Sbjct: 610 ASLIAAEPRHVSNWASLFTYFLTPT-PEIAIVGEK----YQEVLQEISSFYIPNKVIV-- 662
Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
+EE E S+ + ++ VC+N C PV
Sbjct: 663 -ATKSEE----EGQKSSLPLLEMRPVMNNQTTIYVCKNKMCQLPV 702
>gi|291295832|ref|YP_003507230.1| hypothetical protein [Meiothermus ruber DSM 1279]
gi|290470791|gb|ADD28210.1| protein of unknown function DUF255 [Meiothermus ruber DSM 1279]
Length = 672
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 227/622 (36%), Positives = 319/622 (51%), Gaps = 71/622 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + FL TCHWCHVME ESFED VA+ LN FV IKVDREERPD
Sbjct: 27 GEEAFAKARAENKPIFLSVGYATCHWCHVMERESFEDPEVAQFLNAHFVPIKVDREERPD 86
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW- 117
VD+VYM+ +QA+ G GGWP+++FL PDL+P GGTY+PPED+ G P F+ +L V +AW
Sbjct: 87 VDQVYMSALQAMTGSGGWPMNMFLMPDLRPFFGGTYWPPEDRQGFPSFRRVLAGVHNAWL 146
Query: 118 DKKRDMLAQSGAFAIEQLSEAL--SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
+++++L + EQL+ L LPD+L AL LS+ +D GGF
Sbjct: 147 HQQKEVLENA-----EQLTTYLQDQLKPRGGALPDDLHSTAL----AGLSRIFDPAHGGF 197
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
G APKFP+ + +L + + K + TL MA+GG++D VGGGFHR
Sbjct: 198 GGAPKFPQSPALGYLLTQAWLGHEA--------AWKHLQLTLDRMAEGGLYDQVGGGFHR 249
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDA-----FSLTKDVFYSYICRDILDYLRRDMIG 290
Y+VD W VPHFEKMLYD QLA +Y A SL + Y I ++ LDY+ R++ G
Sbjct: 250 YTVDHIWRVPHFEKMLYDNAQLARLYAAASRMPQASLEQARRYQRIAQETLDYVLRELTG 309
Query: 291 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDL 350
P G +SA+DADS EG EG FYVW ++E +LG A + + GN
Sbjct: 310 PEGGFWSAQDADS---EGV----EGKFYVWQAEEFRRVLGAEAEAAMLLFGVSEAGN--- 359
Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
++ NVL +A LG+ E + + R +L+ R +R P D
Sbjct: 360 ---------WEHTNVLERRIPDAALMQHLGLGPEAFERWVQSVRHRLYAARQQRTPPLTD 410
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
DKV+ WNGL++ + A + L + Y+E A A+F+ + +Y
Sbjct: 411 DKVLADWNGLMLRALADVGRWL----------------EEPRYIEAARKNAAFVMQEMYR 454
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
+ L+HS+R G K +L D A GLL L+E WL A +L F
Sbjct: 455 DGL--LRHSWRQGQLKPQAYLSDQAHYGLGLLALFEATGEVGWLEGARQLAEAILTHF-- 510
Query: 531 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
+E G F + D ++ + + +DG PSGN+V+ L RLA++ + D++ Q A
Sbjct: 511 KEPTGAFRDS-LDQTLPVVALDAYDGPYPSGNAVAAELLFRLAALY--ERPDWH-QAALT 566
Query: 591 SLAVFETRLKDMAMAVPLMCCA 612
++ RL A P M A
Sbjct: 567 TVESNAQRLLHNAFGFPAMLQA 588
>gi|257076883|ref|ZP_05571244.1| thymidylate kinase [Ferroplasma acidarmanus fer1]
Length = 638
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 214/572 (37%), Positives = 302/572 (52%), Gaps = 63/572 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVME ESF D VAK +N FV IKVDREE PDVD +YMT+ Q + G GGWPL+V
Sbjct: 48 SCHWCHVMEQESFTDPEVAKRMNSTFVCIKVDREEMPDVDSLYMTFSQVMTGTGGWPLNV 107
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
L+PD KP+ TY P + G + + W KR + ++G AI +L
Sbjct: 108 ILTPDRKPIFAFTYIPRVSRNNMIGIMELAENIDYLWKNKRGEMEKNGDEAISRLRNM-- 165
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
N P + + A+ E L ++YDS +GGFG+APKFP I +L + K
Sbjct: 166 ERKEENNSPVDYKK-AIEATYESLKRNYDSEYGGFGNAPKFPSFHNIIFLLNYYKA---H 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
GK E +MV +L+ M GG++DHVGGGFHRYS D + +PHFEKM YDQ
Sbjct: 222 GK----EEALEMVKHSLRMMYIGGMYDHVGGGFHRYSTDPFFRIPHFEKMTYDQAMAIIA 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y A+ +T D FY + +I +L+++M G ++A DADS EG +EG +Y W
Sbjct: 278 YSYAYDVTGDTFYKNVVYEIYKFLKQEMFSRG--FYTAMDADS---EG----QEGKYYTW 328
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T +E+ + G+ F + + P GN D ++ G+N+L D G
Sbjct: 329 TYEELVENAGKK---FVYDFNILPEGN-----FYDANSRQTGRNILYMGRDIQ------G 374
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
P Y N L ++ R KR +P DDK++ NGLVI + + AS I
Sbjct: 375 DPTTLYKNELEALKKS----REKRIKPLTDDKILTDINGLVIKALSIASMIF-------- 422
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+ K+ + AE +A FI +Y ++ +L HS+RNG S G LDDY+F++SG
Sbjct: 423 --------NDKDMLNTAEGSADFIMNDMYTDK--KLMHSYRNGKSSINGMLDDYSFMVSG 472
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LL LYE +L +A +LQ T + F D+ GG++N G ++L+R+KE +D A PS
Sbjct: 473 LLSLYEASLNDIYLDYARDLQKTIMDTFYDKTSGGFYNGMG---NLLVRLKESYDNAIPS 529
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
G S + N++ I D YR E S+
Sbjct: 530 GFSFEIGNMIVFNYI-----DDKYRVELEKSI 556
>gi|258405434|ref|YP_003198176.1| hypothetical protein Dret_1310 [Desulfohalobium retbaense DSM 5692]
gi|257797661|gb|ACV68598.1| protein of unknown function DUF255 [Desulfohalobium retbaense DSM
5692]
Length = 615
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 206/569 (36%), Positives = 300/569 (52%), Gaps = 45/569 (7%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME E FED VA +LN V IKVDREERPD+D YM+ QAL G GGWPL++
Sbjct: 53 TCHWCHVMERECFEDTEVAHILNTVCVPIKVDREERPDLDTFYMSCCQALSGRGGWPLNL 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P TY P + ++ +PG +L V++ W + R+ + QS + + + S
Sbjct: 113 FLTPDGRPFFAATYIPKQSRFSQPGLLDLLVSVQEDWVRNREQIEQSATRLVSHIHDLFS 172
Query: 141 ASASSNKLPDELPQNAL-RLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S+ LP+NA+ ++L +++D FGGFG APKFP P + +L +D
Sbjct: 173 DSSGP------LPENAIFEQAVQELRQNHDDDFGGFGKAPKFPTPHVLLFLLRLYDLSQD 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
MV TL+ + +GGI DH+GGGFHRYS D WH+PHFEKMLYDQ L
Sbjct: 227 RSLL-------NMVDSTLEAICRGGIRDHIGGGFHRYSTDRAWHLPHFEKMLYDQALLLM 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+ + T+ + + +Y+ + G ++ EDAD TEG +EGAFY
Sbjct: 280 ALAEGHARTRRDLFRREAVAVAEYMLERLHDGDGGLYCGEDAD---TEG----EEGAFYQ 332
Query: 320 WTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT E+E L + + ++ GN + + + GKNVL + D++ +A +
Sbjct: 333 WTETELEAALPPDTFRVVQTVAGIRSDGNI----LDEATRQRTGKNVLARVADTADAAER 388
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG+ E+ L +R++RP+P LDDK + SWNGL +++ AR+ +L E
Sbjct: 389 LGLSEEQVRLEWHRAMATLGGLRAQRPQPFLDDKQLTSWNGLAVAALARSGILLGEE--- 445
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ A A ++ + E RL H RN + PGFL+DYA+ I
Sbjct: 446 -------------HLIAAARETADWVLETMQPEPG-RLWHRARNRHAGIPGFLEDYAYFI 491
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL+L + G + A+ L +T F D + GG+F T LLR+K+ D A
Sbjct: 492 WGLLELVQTSEGQDYRRIALRLADTVLSEFADLKEGGFFQTHAAAQEPLLRLKKVFDDAL 551
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQN 587
PS N+V + NLVRL +G +D R++
Sbjct: 552 PSENAVMLYNLVRLYG--SGPTNDCARKH 578
>gi|113867298|ref|YP_725787.1| hypothetical protein H16_A1279 [Ralstonia eutropha H16]
gi|113526074|emb|CAJ92419.1| highly conserved protein containing a thioredoxin domain [Ralstonia
eutropha H16]
Length = 673
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 243/690 (35%), Positives = 341/690 (49%), Gaps = 92/690 (13%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFE+ +A L+ND F+SIKVDR+ERPD+D +Y Q + GGGWPL+V
Sbjct: 50 TCHWCHVMAHESFENPRIAGLMNDRFISIKVDRQERPDLDDIYQKVPQMMGQGGGWPLTV 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA-- 138
FL+P +P GGTYFPP+D+YGRPG +L + +AW +R+ L + IEQ +
Sbjct: 110 FLTPQGEPFYGGTYFPPDDRYGRPGLARVLLSLSEAWTHRREALRDT----IEQFQQGFR 165
Query: 139 -LSASASSNKLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
L + S + +E Q+ A L+++ D GG G APKFP ++L +
Sbjct: 166 QLDDTVLSREDAEEAAEVQDLPAQTALALARNTDPTHGGLGGAPKFPNASAYDLVLRICQ 225
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ + TL MA GGIHD +GGGF RYSVDERW VPHFEKMLYD G
Sbjct: 226 RTHEPALLDALER-------TLDGMAAGGIHDQLGGGFARYSVDERWAVPHFEKMLYDNG 278
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
QL +Y +A+ LT + + + Y+ RDM P G ++ EDADS EG +EG
Sbjct: 279 QLVTLYANAYRLTGKQAWRRVFEGTIAYIVRDMTHPDGGFYAGEDADS---EG----EEG 331
Query: 316 AFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
FYVWT+ EV+ +LGE L Y + GN + G++VL
Sbjct: 332 RFYVWTAPEVKAVLGESEGALACRAYGVTEGGNFE-----------PGRSVL-------Q 373
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A L PLE+ L R +L R++R RP DD ++ WNGL+I A + +
Sbjct: 374 RAVTL-TPLEE--ARLEGWRERLLAARAQRVRPGRDDNILAGWNGLMIQGLCAAYQATGN 430
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKAPGFLD 492
A ++ A AASFI+ L D +R +++G K PGFL+
Sbjct: 431 PA----------------HLAAARRAASFIQDKLTMPDGGVYRY---WKDGTVKVPGFLE 471
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL + L+DLYE ++L A EL + F D G YF +P ++ R +
Sbjct: 472 DYAFLANALIDLYESCFDRRYLDRAAELVALIIDNFWD--DGLYFTPNDGEP-LIHRPRA 528
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
HDGA PSG S SV + +RL + S D YR AEH + A
Sbjct: 529 PHDGAWPSGISASVFSFLRLHEL---SGEDRYRDLAEHEFQRYRAAASAAPAGFVHFLAA 585
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
AD + ++L G K++ ++ + H +Y L V+ +
Sbjct: 586 ADFAQRGAFG-IILAGDKAAA--AALVESVHRTY-LPARVLAF---------------AE 626
Query: 673 NASMARNNFSAD-KVVALVCQNFSCSPPVT 701
+ + + D + A VC++ +CS PVT
Sbjct: 627 DVPVGQGRLPVDGRPAAYVCRHRACSAPVT 656
>gi|408826725|ref|ZP_11211615.1| hypothetical protein SsomD4_06008 [Streptomyces somaliensis DSM
40738]
Length = 651
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 234/702 (33%), Positives = 333/702 (47%), Gaps = 86/702 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE A LN+ FVS+KVDREERPDVD VYM VQA G GGWP+S
Sbjct: 22 SACHWCHVMAHESFEDEATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMS 81
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++PD +P GTYFPPE ++G P F+ +L V AW +RD + + + +LS
Sbjct: 82 VFMTPDGEPFYFGTYFPPEARHGMPSFRQVLEGVHHAWTSRRDEVDEVAGSIVRELSGRS 141
Query: 140 SASASSNKLPDEL-PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A P E P AL L++ YD R GGFG APKFP + ++ +L H +
Sbjct: 142 LALGGDGGAPGEAEPAQALL----ALTREYDERHGGFGGAPKFPPSMVVEFLLRHHAR-- 195
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 196 -TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 250
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ P G SA DADS +G R EGA+Y
Sbjct: 251 RVYTHLWRATGSDLARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYY 308
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VWT ++ ++LGE + ++ +++ +G +VL D+ + ++
Sbjct: 309 VWTPAQLREVLGEEDAAYAARFH----------GVTEEGTFEEGASVLRLPVDAGVAGAE 358
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
L RR+L R +R RP DDK++ +WNGL +++ A
Sbjct: 359 R----------LAGIRRRLLAARDERARPGRDDKIVAAWNGLAVAALAETGACF------ 402
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFL 497
DR + +E A AA + R DE RL + ++G + A G L+DY +
Sbjct: 403 ----------DRPDLVERATEAADLLVRVHLDEGG-RLARTSKDGRAGANAGVLEDYGDV 451
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKEDH 554
G L L WL +A L + LDR E G ++T + ++ R ++
Sbjct: 452 AEGFLALAAVTGEGVWLEFAGLLLDG----VLDRFRGEDGELYDTAHDAEQLIRRPQDPT 507
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPL 608
D A PSG + + L+ S A + S+ +R AE +L V R +AV
Sbjct: 508 DNAAPSGWTAAAGALL---SYAAHTGSEAHRSAAERALGVVRALGPRAPRFVGWGLAV-- 562
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+L P + V +VG D + + AA V +P ++E E+
Sbjct: 563 ---TEALLDGP--REVAVVGPAGDADTDALRRAALLGTAPGAVVAVGEPG-SDEFPLLED 616
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VC+ F+C P TDP L L
Sbjct: 617 ----------RPLVGGRPAAYVCRRFTCDAPTTDPERLAREL 648
>gi|294102620|ref|YP_003554478.1| hypothetical protein [Aminobacterium colombiense DSM 12261]
gi|293617600|gb|ADE57754.1| protein of unknown function DUF255 [Aminobacterium colombiense DSM
12261]
Length = 595
Score = 342 bits (877), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 212/578 (36%), Positives = 306/578 (52%), Gaps = 63/578 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F + + FL +TCHWCHVME E F DE VA+LLND VSIKVDREERPD
Sbjct: 30 GKEAFTKAQEENKPIFLSIGYSTCHWCHVMEKECFSDEEVAQLLNDACVSIKVDREERPD 89
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D V M + G GGWPL++FL+P+ KP +Y P E PG ++ +VK W
Sbjct: 90 IDHVCMAVSLIMNGSGGWPLNLFLTPNGKPFFAASYIPKETSGRIPGLMDMVPRVKWLWL 149
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNK--LPDELPQNALRLCAEQLSKSYDSRFGGFG 176
+++ + +S E + AL ++ K PD +N + ++LS+++D +GGF
Sbjct: 150 MQKEDVLKSA----ESIMNALEKEMTNQKGTCPD---KNLAKKAFQELSRNFDPLWGGFS 202
Query: 177 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 236
APKFP P + +L + GK + + KMV TL CMA GGI DH+GGGF RY
Sbjct: 203 KAPKFPMPPVLLFLL-------EYGKIFKEEKAIKMVEKTLDCMAMGGIRDHLGGGFARY 255
Query: 237 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 296
S D W +PHFEKMLYDQ L Y A+ +T Y I +I Y+ RD+ P G F
Sbjct: 256 STDREWKIPHFEKMLYDQALLLKAYTAAWEMTGRDIYKKIAFEIAAYVLRDLRSPEGVFF 315
Query: 297 SAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSD 355
+AEDADS EG EG FYVWT +E+ ++ E LF + Y + GN ++
Sbjct: 316 AAEDADS---EGV----EGRFYVWTEEEIRRLVPSEDRQLFLQAYGIHGEGNV----LAL 364
Query: 356 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 415
P + L EL A+ + L+K L + R LF+ R++R RPH D K++
Sbjct: 365 PAS-------LEEL------AATYNVELQKLDQSLQKSRALLFEARNRRVRPHCDRKILT 411
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTH 474
WN L+I + A A +I + ++++E A +A F + + +Y E+
Sbjct: 412 DWNALMIEALAFAGRIF----------------EERQFIEAARNAVDFLLEKAVYQEK-- 453
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
+ HS +G PG L+DY+F I LL+L E + + L + +++F D + G
Sbjct: 454 EVYHSVADGKGHIPGLLNDYSFFIRALLELEEATGEEDYGEKGMGLLRSMNDIFYDPKRG 513
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 572
GYF +G D + R DG SGNSV+++NL+R
Sbjct: 514 GYFMNSGLDELLFFRPWSGEDGVMVSGNSVAMMNLLRF 551
>gi|257388360|ref|YP_003178133.1| hypothetical protein Hmuk_2314 [Halomicrobium mukohataei DSM 12286]
gi|257170667|gb|ACV48426.1| protein of unknown function DUF255 [Halomicrobium mukohataei DSM
12286]
Length = 715
Score = 342 bits (876), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 213/694 (30%), Positives = 331/694 (47%), Gaps = 63/694 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF D A LLN+ FV IKVDREERPD+D +YM+ Q + G GGWPLS
Sbjct: 56 SACHWCHVMEDESFSDPETATLLNEHFVPIKVDREERPDLDAIYMSICQQVTGRGGWPLS 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLS 136
+L+PD +P GTYFPPE++ G P F +L + +W +++ +M ++ Q +
Sbjct: 116 AWLTPDGEPFYVGTYFPPEERRGMPAFGQLLEDIAGSWSDSEQREEMYNRA-----RQWT 170
Query: 137 EALSASASSNKLPDELPQN-ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+A+ + P ++P + AL+ + ++ D GG+G+ PKFP+P + ++
Sbjct: 171 DAIESDVGDVGQPGDVPDDEALQAAVDAAIRAADREHGGWGNGPKFPQPGRLHYLMREVA 230
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ + + + +V TL MA GG+ DHVGGGFHRY D W VPHFEKMLYD
Sbjct: 231 R-------SDRDDVRSVVTETLDAMADGGLFDHVGGGFHRYCTDREWVVPHFEKMLYDNA 283
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA-----T 310
L YL + LT D Y+ + R+ ++ R++ G FS DA S G
Sbjct: 284 TLPRAYLAGYQLTGDERYAEVARETFAFVERELTHEDGGFFSTLDAQSVPPAGRREDADA 343
Query: 311 RKKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
+EGA++VW EV + A L + + + +GN F+GK VL
Sbjct: 344 EPEEGAYFVWIPDEVRAAVDSETAADLLCDRFGITESGN------------FEGKTVLTV 391
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
A + G+ L R ++F+ R +RPRP D+KV+ WNGL+I++ A
Sbjct: 392 DASIEALSESSGLEASDVERTLASAREQVFEAREERPRPARDEKVLAGWNGLMITAIAEG 451
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
+ +L A +F+R HL+DE RL +++G
Sbjct: 452 AIVLDDVDPDPA-----------------ADALAFVREHLWDESEQRLARRYKDGDVAID 494
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
G+L+DYAFL G L L+E + L +A++L + + F D + G + T S++
Sbjct: 495 GYLEDYAFLARGALTLFEATGEVEHLAFALDLAHAIEREFWDADDGTLYFTPTSGESLVA 554
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R +E D + PS V+V L+ L++ V D + A L +++ M
Sbjct: 555 RPQELTDQSTPSSTGVAVQALLSLSAFV---PHDRFETIAAGVLETHANKIEANPMQHAS 611
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+ AAD + + LV + ++ LA + L ++ P ++D W +
Sbjct: 612 LVVAADRY-LRGDLELTLVADEVPAEWRTTLAETY----LPDRLLAWRPPGDGDLDAWLD 666
Query: 669 ---HNSNNASMARNNFSADKVVALVCQNFSCSPP 699
+ A + C+ F+CSPP
Sbjct: 667 VLGLDDVPPIWADRTERDGEATVYACRQFTCSPP 700
>gi|398343191|ref|ZP_10527894.1| hypothetical protein LinasL1_09021 [Leptospira inadai serovar Lyme
str. 10]
Length = 692
Score = 342 bits (876), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 248/701 (35%), Positives = 344/701 (49%), Gaps = 75/701 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE A +LN +FVSIKVDREERPDVD++YM + A+ GGWPL++
Sbjct: 55 TCHWCHVMEKESFEDEATAAVLNQYFVSIKVDREERPDVDRIYMDALHAMNQQGGWPLNM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+ + KP+ GGTYFPP KYGR F IL + W +K++ L A E+L++ L
Sbjct: 115 FLTSEGKPITGGTYFPPVAKYGRKSFTDILNILATLWKEKKEELID----ASEELAQYLK 170
Query: 141 ASASSNKLPDELPQNALRLCAEQL--------SKSYDSRFGGFGS--APKFPRPVEIQMM 190
S S L + Q+AL+L ++ + + YD F GF S KFP + + +
Sbjct: 171 ESEESKALSE---QSALQLPSKTVFENAFGMYDRFYDPEFAGFKSNVTNKFPPSMGLSFL 227
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
L K +GE + +MV TL M KGGI+D +GGG RYS D +W VPHFEKM
Sbjct: 228 LRFYK------STGE-PKALEMVEETLVAMKKGGIYDQIGGGISRYSTDHKWLVPHFEKM 280
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD ++ F T + Y D+L+Y+ RDM GG I SAEDADS EG
Sbjct: 281 LYDNSLFLEALVECFQTTGHLKYKEAAYDVLEYISRDMRLQGGGIASAEDADS---EG-- 335
Query: 311 RKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EG FY+W E ++ AIL + + + GN F+G N+L E +
Sbjct: 336 --EEGLFYLWKRNEFHEVCDSDAILLEAFWNVTEIGN------------FEGSNILHE-S 380
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
+ A G+ E+ + I+ ++KL RS R RP DDKV++SWN L + + +A+
Sbjct: 381 FRTNFARLHGLEEEELIEIVNRNKKKLLARRSDRIRPLRDDKVLLSWNCLYVKAATKAAM 440
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
E + +AE FI +L E RL FR G ++ +
Sbjct: 441 AFGD----------------GELLRLAEETFRFIENNLVREDG-RLLRRFREGEARFLAY 483
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
DYA I L L++ G G ++L AI LF R G F TG D LLR
Sbjct: 484 SGDYAEFILASLWLFQAGKGIRYLTLAIRYAEEAVRLF--RSPAGVFFDTGSDAEDLLRR 541
Query: 551 K-EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
E +DG EPS NS + L+ + G +S Y A+ + F+ L+ M P M
Sbjct: 542 NVEGYDGVEPSANSSFALAFTILSRL--GVESGRYSDFADAIFSYFKVELETHPMNYPYM 599
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A + + S++ V+ + + D + A + L +TV D E E
Sbjct: 600 LSAYWLKNSDSKELAVV--YSTQEDLFPIWQGIGAMF-LPETVFAW-ATDKE-----AEE 650
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ +N S V A CQ F C PV+D SL +L
Sbjct: 651 AGEKILLLKNRKSGGSVKAYFCQGFRCDLPVSDWNSLRAIL 691
>gi|390953615|ref|YP_006417373.1| thioredoxin domain-containing protein [Aequorivita sublithincola
DSM 14238]
gi|390419601|gb|AFL80358.1| thioredoxin domain-containing protein [Aequorivita sublithincola
DSM 14238]
Length = 704
Score = 341 bits (875), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 223/694 (32%), Positives = 346/694 (49%), Gaps = 77/694 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFED VA ++N F+S+KVDREERPDVD+ Y+ VQ + G GWPL+V
Sbjct: 78 CHWCHVMEHESFEDSTVAAVMNKNFISVKVDREERPDVDQTYINAVQLMTGSAGWPLNVV 137
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTYF D + L +++ ++++ + L A+A +L E + +
Sbjct: 138 TLPDGRPVWGGTYFRKND------WIDALEQIQKVYNEEPEKLM---AYA-NRLEEGIKS 187
Query: 142 S--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
N + + E LS+++D++ GGF APKF P ++ +L + + +
Sbjct: 188 MDLVHLNTEDVDFAKYPTSEIVENLSQNFDAKNGGFKGAPKFMMPNNLEFLLRQAVQENN 247
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G V TL MA GG++D +GGGF RYS DE+WHVPHFEKMLYD QL +
Sbjct: 248 ADLLG-------YVTLTLDKMAYGGLYDQIGGGFARYSTDEKWHVPHFEKMLYDNAQLVS 300
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y +A+ +TK Y + + LD++ RDM G +S+ DADS + G + +EGAFYV
Sbjct: 301 LYSNAYLVTKKPLYKEVVEETLDFIARDMTNDEGGFYSSLDADSKDENG--KLEEGAFYV 358
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+TS+E++ IL + +FKE+Y + G + K VLI +
Sbjct: 359 FTSEELQKILKDDFDIFKEYYNVNSYGKWE-----------KNHYVLIRKKTDDEIEKEF 407
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ E + + + L R+KRP+P LDDK + SWN +++ + A K
Sbjct: 408 GITSEAFQQKKEDWKNTLLAYRNKRPKPRLDDKTLTSWNAMMLKGYVDAYKTF------- 460
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
++EY++ A A+FI ++ L H++++G S GFL+DYAF I
Sbjct: 461 ---------GKREYLDAALKNAAFISEKQL-QKNGALFHNYKDGKSSINGFLEDYAFTIE 510
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+DLY+ KWL + ++ + F D E ++ T+ ED +++ R E D P
Sbjct: 511 AFIDLYQATLDEKWLTLSKKMADYAKTNFFDEEKQMFYFTSKEDAAIVTRNFEYRDNVIP 570
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
+ NSV NL L+ + D E S +F+ ++ D+LS
Sbjct: 571 ASNSVMAKNLFVLSKYFEETGFD------EISHQMFKNVSVEIEQYPSGFSNWLDLLSSF 624
Query: 620 SRK--HVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HIDPADTEEMDFWEEHNSNNASM 676
VV+VG S + +LNK + +I A ++ N+ +
Sbjct: 625 QNDFYEVVIVGKDVSEKIK----------ELNKHYLPNIIIAGSK--------GENSGPL 666
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDP-ISLENL 709
N ++ D + VC N +C PV D I++E+L
Sbjct: 667 FENRYTPDATLIYVCVNNACKLPVEDTKIAIESL 700
>gi|448576201|ref|ZP_21642244.1| hypothetical protein C455_04761 [Haloferax larsenii JCM 13917]
gi|445729881|gb|ELZ81475.1| hypothetical protein C455_04761 [Haloferax larsenii JCM 13917]
Length = 702
Score = 341 bits (874), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 224/696 (32%), Positives = 338/696 (48%), Gaps = 73/696 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF D +A+ LN+ FV +KVDREERPD+D++Y T Q + GGGGWPLS
Sbjct: 53 SACHWCHVMADESFSDPDIAETLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQL 135
V+L+P KP GTYFPPE + G PGF+ ++ ++W RD + AQ AI +QL
Sbjct: 113 VWLTPQGKPFFVGTYFPPEPRRGAPGFRDLVESFAESWQTDRDEIENRAQQWTSAIHDQL 172
Query: 136 SEA--LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
+ A +++ D+ Q ALR PKFP+P I +L
Sbjct: 173 EDTPDTPGEAPGSEILDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDALL-- 222
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+ TG+ + + + +L MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYD
Sbjct: 223 -RGYAITGR----RQALDVAVESLDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYD 277
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q L + YLD + LT Y+ + + +++RR++ G F+ DA S +
Sbjct: 278 QAGLVSRYLDTYRLTGTEAYADVAAETFEFVRRELSHDDGGFFATLDAQSG-------GE 330
Query: 314 EGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVWT EV +L E A LF + Y + P GN F+ K ++ ++ +
Sbjct: 331 EGTFYVWTPDEVRSLLPELEADLFCDRYGVTPGGN------------FENKTTVLNVSAT 378
Query: 373 -SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
S A + + ++ + L E R+ LF RS R RP D+K++ WNGL+IS+FA+ +
Sbjct: 379 LSDLAEEYDISEDEVEDKLAEARKALFAARSGRERPARDEKILAGWNGLMISAFAQGAVA 438
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
L+ ++ + A A F+R HL+D L NG K G+L
Sbjct: 439 LEDDS----------------LADDARRALDFVREHLWDADAGHLSRRVMNGEVKGDGYL 482
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYAFL G DLY+ L +A++L F D G + T +++ R +
Sbjct: 483 EDYAFLARGAFDLYQATGDVDPLAFALDLARAIHREFYDDAAGTLYFTPESGEALVTRPQ 542
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
E D + PS V+ + L + + + A+ L R++ + +
Sbjct: 543 EATDQSTPSSLGVATSLFLDLEHFAPDAG---FGEAADTVLETHANRIRGSPLEHVSLAL 599
Query: 612 AADMLS--VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW-EE 668
AA+ + VP + + + ++ LA+ + L V+ PA + +D W +E
Sbjct: 600 AAEKAASGVP---ELTVAADEMPAEWHETLASRY----LPGLVVAPRPATDDGLDAWLDE 652
Query: 669 HNSNNASMARNNFSAD--KVVALVCQNFSCSPPVTD 702
+ A AD + C+NF+CS P D
Sbjct: 653 LELDEAPPIWAAREADGGEPTVYACENFTCSAPTHD 688
>gi|345864005|ref|ZP_08816211.1| uncharacterized protein YyaL [endosymbiont of Tevnia jerichonana
(vent Tica)]
gi|345124912|gb|EGW54786.1| uncharacterized protein YyaL [endosymbiont of Tevnia jerichonana
(vent Tica)]
Length = 799
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 216/628 (34%), Positives = 316/628 (50%), Gaps = 62/628 (9%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL +TCHWCHVME ESFE+E +A+ LN+ F++IKVDRE PD
Sbjct: 91 GEAAFAKAKRENKPIFLSIGYSTCHWCHVMERESFENESIARFLNEHFIAIKVDRESHPD 150
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D+ YMT V + G GGWP+S L+P+ KP GGTYFPP+ F ++L++++ W+
Sbjct: 151 IDETYMTAVMLMTGSGGWPMSSLLTPEGKPFFGGTYFPPQQ------FASVLQQIQTIWE 204
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
++ + Q E++++A+ A+ S L A Q+ +S+D GGF A
Sbjct: 205 ERPEDTRQQA----ERVAKAVEAANSQRGKAKALDSQAADKAVAQMLRSFDELQGGFSQA 260
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP + ++L D + E + + TL MA+GGI+D GGGFHRYS
Sbjct: 261 PKFPHEPWLFLLL-------DQLQRQPHPEALQALEVTLDAMARGGIYDQAGGGFHRYST 313
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W VPHFEKMLY+Q QLA +YL A+ LT Y + LDY+ R+M P G +SA
Sbjct: 314 DNEWLVPHFEKMLYNQAQLARIYLLAWRLTGKEQYRRVVTQTLDYVLREMTAPSGGFYSA 373
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPH 357
DADSA +EG F+ W E+ D L A L E Y + GN
Sbjct: 374 TDADSA-------GEEGLFFTWIPAEIRDALEPRDAGLAIELYAISERGN---------- 416
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
F+G+N+L A M LE + + L +R +R P DDK++ +W
Sbjct: 417 --FEGRNILHLPQSLEEYAETKSMNLEALHQRIDHINQVLRQIREQREHPLRDDKIVTAW 474
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NG++I++FA+A+ +L S++ Y + AE AA F+ +H + +L
Sbjct: 475 NGMMITAFAQAADLLDSDS----------------YRQAAERAAEFLWQH-NRKGAGQLW 517
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+G S +DYA+L GL LY+ KWL + EL + F +++GG Y
Sbjct: 518 RVHLDGKSSISANQEDYAYLGEGLSYLYDLTGDPKWLSRSRELADAMLARFQEKDGGFYM 577
Query: 538 NTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
+ GED + D D A SG+SV++ L RL + +G Y+ AE +A F
Sbjct: 578 SEAGEDHFNAMGRPRDGGSDNAIASGSSVALHLLQRLW-LRSGHLD--YKTAAESLIAYF 634
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKH 623
++ M A D L+ R H
Sbjct: 635 AANIERQPNGYTYMLSAVDNLNQGERTH 662
>gi|320101644|ref|YP_004177235.1| N-acylglucosamine 2-epimerase [Isosphaera pallida ATCC 43644]
gi|319748926|gb|ADV60686.1| N-acylglucosamine 2-epimerase [Isosphaera pallida ATCC 43644]
Length = 909
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 227/631 (35%), Positives = 311/631 (49%), Gaps = 75/631 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME E F D +A LN FV IK+DREERPDVD+ Y+T ++ +G GGWP+S+
Sbjct: 113 ACHWCHVMERECFRDPAIAARLNRDFVCIKLDREERPDVDQTYLTALRT-FGTGGWPMSI 171
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ KP GGTYFPPED+ G GF T+L +V AW + RD + + + L
Sbjct: 172 FLTPEGKPFYGGTYFPPEDRPGLTGFSTVLDRVARAWREDRDRIERVAGELDAMVGRILV 231
Query: 141 ASASSNKL--PDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQMMLY 192
A+S+ L P L + C L +D +GGFG PKFP P + +L
Sbjct: 232 RRAASSVLGPPPVLSSDLTDACYLILCGEFDPEYGGFGFDRTNPRRPKFPEPSRLLFLLE 291
Query: 193 HSKKLEDTGKS-------------GEASEGQ------KMVLFTLQCMAKGGIHDHVGGGF 233
L++ + G A+ M LFTL +A+GG+ DHVGGG+
Sbjct: 292 RHAALKERPRPVKTPARSLLMLDPGPAAAPLIRRAPLDMALFTLDRIARGGLRDHVGGGY 351
Query: 234 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 293
HRY V W VPHFEK LYD QLA V++ AF LT D + I D++ R+M P G
Sbjct: 352 HRYCVSRFWIVPHFEKTLYDNAQLARVFVRAFELTGDPRWRDEAEAIFDFVAREMTLPEG 411
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG---EHAILFKEHYYLKPTGNCDL 350
SA DA+S + +G G +Y+WT +VE L E I+ + + L+
Sbjct: 412 GFLSALDAESRDEDG------GEYYLWTRPQVEQALANPEESRIVLQVYGMLR------- 458
Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLD 410
DP+ E G+ VL+E + S A LG+ L + L RR+L VR +RP P D
Sbjct: 459 ----DPNFE-GGRYVLLEPRERSEHARALGLELPELTRRLDAARRRLHQVRDQRPAPRKD 513
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
DK I WNGL+I++ A A + V +R Y++ A+ AA F
Sbjct: 514 DKAIAGWNGLMIAALAEAGR--------------VCDHNRDRYLKAAQRAAEFAWTQFRR 559
Query: 471 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 530
EQ RL ++R G +K GF +DYAFL GLL LY +WL A L F D
Sbjct: 560 EQ-DRLARTWRQGVAKGEGFAEDYAFLAEGLLRLYRADGDPRWLERARRLTERMRHDFGD 618
Query: 531 REG--GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
+ GG F + D + R K+ D PS N+V+ L+ L + D Q
Sbjct: 619 PDPNRGGLFFASRRDARLPARFKDPLDSVLPSANAVAARVLIELGRL------DDDPQRY 672
Query: 589 EHSLAVFETRLKDMAM---AVPLMCCAADML 616
+ + A+ L D+A P+M A + L
Sbjct: 673 DQAEAILREFLPDLARRPGVWPMMMVALEEL 703
>gi|392966241|ref|ZP_10331660.1| protein of unknown function DUF255 [Fibrisoma limi BUZ 3]
gi|387845305|emb|CCH53706.1| protein of unknown function DUF255 [Fibrisoma limi BUZ 3]
Length = 677
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 210/561 (37%), Positives = 295/561 (52%), Gaps = 50/561 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE E VA+++N+ FV IKVDREERPDVD +YM VQA+ GGWPL+
Sbjct: 48 SACHWCHVMERESFEKEPVARVMNENFVCIKVDREERPDVDAIYMEAVQAMGVQGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIEQLSEA 138
VFL PD KP G TY PP++ + +L ++DA+D+ R LAQS FA E
Sbjct: 108 VFLMPDAKPFYGVTYLPPQN------WVNLLGNIRDAFDEHRADLAQSAEGFATEL---N 158
Query: 139 LSASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSK 195
LS S P + L + ++ D GG APKFP P Q +L Y+
Sbjct: 159 LSDSERFGLQPADPLFSAETLDVLYRKVHVKADDEKGGMRRAPKFPMPSIWQFLLRYYDS 218
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ T ++ A ++V TL MA GGI+D +GGGF RYS D W PHFEKMLYD G
Sbjct: 219 TVASTTENETA---LRLVTLTLDRMALGGIYDQLGGGFARYSTDADWFAPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
QL +Y +A+SLTK Y ++ + + +R+++ P G +SA DADS EG EG
Sbjct: 276 QLLTLYSEAYSLTKSPLYKHVVYQTIAFAQRELLSPEGGFYSALDADS---EGV----EG 328
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY +T+ E+ D LG+ F E Y L GN + G+N+L +
Sbjct: 329 KFYTFTTSELRDALGDEFDWFAELYNLSEDGNWE-----------HGRNILHRTESDESF 377
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A ++G L +L +R++R RP LDDK++ SWNGL++ A A ++
Sbjct: 378 AERMGWSAADLSVRLDATHLRLLKIRNERIRPGLDDKILCSWNGLMLKGLATAYRV---- 433
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
F P E++ +A A F+ + + D + RL H+++ G ++ PGFL+DYA
Sbjct: 434 -----FGEP-------EFLTLALRNAYFLLQKMRDNRNGRLWHTYKEGRARQPGFLEDYA 481
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+I GLL LY+ WL A L + F D +F T ++ R KE D
Sbjct: 482 TVIDGLLALYQATFTESWLTEADRLTQYVFDSFSDPNDDLFFFTDKNGEELIARRKELFD 541
Query: 556 GAEPSGNSVSVINLVRLASIV 576
PS NS+ NL ++ ++
Sbjct: 542 NVIPSSNSIMAGNLYAMSLLL 562
>gi|114319387|ref|YP_741070.1| hypothetical protein Mlg_0225 [Alkalilimnicola ehrlichii MLHE-1]
gi|114225781|gb|ABI55580.1| protein of unknown function DUF255 [Alkalilimnicola ehrlichii
MLHE-1]
Length = 697
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 236/693 (34%), Positives = 344/693 (49%), Gaps = 57/693 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESFED +A+L+N+ F++IKVDREERPD+D++Y T Q L GGWPL
Sbjct: 51 SACHWCHVMAHESFEDPAIARLMNERFINIKVDREERPDLDRIYQTAHQLLTRRPGGWPL 110
Query: 79 SVFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
++ L+PD + P+ GTYFPP+ + G PGF +LR+V +A + +A L
Sbjct: 111 TLVLTPDDQTPVFAGTYFPPDTRGGMPGFADVLRQVDEAIRSQPQAVADQNRALRHALGR 170
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A A L LR + L+ S+D GGFG+APKFP P I+ +L H
Sbjct: 171 LAHAPADGGDA--ALGNAPLRAARDALADSFDRVHGGFGAAPKFPHPGGIERLLRHYALT 228
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G + M TL+ MA GGI+D VGGGF RYSVDE W +PHFEKML D L
Sbjct: 229 LVTG-DGPDRDALHMACHTLRRMALGGIYDQVGGGFARYSVDEYWMIPHFEKMLCDNALL 287
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+Y DA+ T D Y+ + ++ +++R +M P G ++ DADS EG EG +
Sbjct: 288 LGLYADAWHATGDGLYARVVQETAEWVRAEMERPEGGYCTSLDADS---EGG----EGRY 340
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
Y+WT EV ++L E EH + + +P N F+G+ L S SA
Sbjct: 341 YLWTPDEVRELLDEDEWRLVEHRF----------GLDEPAN-FEGRWHLHVQASFSESAR 389
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+LG P E+ + + R+KL R +R RP DDKV+ +WNGL+I++ ARA ++L
Sbjct: 390 RLGRPREQVVALWQSARQKLQRARGQRVRPGRDDKVLTAWNGLMIAALARAGRLL----- 444
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
D + A A F+R L D+Q RL S+R G + L+DYA+L
Sbjct: 445 -----------DEPAWTASALRALGFLRERLADDQG-RLYASWRAGRAAHQACLEDYAYL 492
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ G+L+ + L +A+ L +T E F D++ GG++ T + ++ R + D +
Sbjct: 493 LEGVLECLQSEWSDDRLGFALHLADTLLERFQDKDEGGFWMTADDHEPLIHRPRPLADDS 552
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
PSGN+V++ L RL ++ + Y + L + M A + A +
Sbjct: 553 LPSGNAVALRALQRLGHLLGEPR---YLEAVARGLRAAAGAIARMPEAHASLLTALEEYL 609
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
P + VV+ G A + Y N+ V + PAD + A+ A
Sbjct: 610 YPP-EIVVIRGAPEVTGPWRTRALKY--YTPNRLVFAL-PADAAPPGVLSGRQTEGAAPA 665
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A VC +C PV LE +L
Sbjct: 666 ----------AWVCSGKTCRAPVRSLDELERVL 688
>gi|149369679|ref|ZP_01889531.1| hypothetical protein SCB49_07627 [unidentified eubacterium SCB49]
gi|149357106|gb|EDM45661.1| hypothetical protein SCB49_07627 [unidentified eubacterium SCB49]
Length = 703
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 205/554 (37%), Positives = 294/554 (53%), Gaps = 49/554 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFED VA +N+ F+S+KVDREERPD+D++Y+ VQ + G GWPL+V
Sbjct: 80 CHWCHVMEHESFEDSLVAATMNENFISVKVDREERPDLDQIYINAVQLMTGSAGWPLNVV 139
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTYF ED + T+L+K++ + + L + QL E +
Sbjct: 140 TLPDGRPVWGGTYFKKED------WITVLQKIQKINTENPEKLNEIAG----QLEEGIKN 189
Query: 142 --SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ N +L L S+D RFGG+ APKF P + +L ++ + +D
Sbjct: 190 LDLVALNTEDVDLKNYNLDEVIHTWKSSFDHRFGGYKRAPKFMMPSNYEYLLRYAVQDKD 249
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
E Q VLFTL MA GGI+D +GGGF RYSVDE+WHVPHFEKMLYD QL +
Sbjct: 250 -------QELQDYVLFTLDQMAYGGIYDAIGGGFSRYSVDEKWHVPHFEKMLYDNAQLVS 302
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y +A+ LTK Y I + L ++ +M G +S+ DADS +G +EGAFYV
Sbjct: 303 LYSNAYKLTKKPLYKEIITETLAFIFEEMTTEEGAFYSSLDADSLTEDGTL--EEGAFYV 360
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+T++E++ LG LF +Y + G + GK VLI D ++ A L
Sbjct: 361 YTAQELKSQLGTDFDLFAAYYNVNNFGKWE-----------DGKYVLIRDEDDASIAKDL 409
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ E + + L R R +P LDDK + SWNGL++ + +A +A
Sbjct: 410 GISTEALQRKVANWKAILKAYRGFRSKPRLDDKTLTSWNGLMLKGYV--------DAYTA 461
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ N KEY++ A A FI+ E L H+++ G S G+L+DYA +IS
Sbjct: 462 LGN--------KEYLDAALKNAVFIKDKQLKEDG-SLYHNYKEGRSTINGYLEDYASVIS 512
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
G + LYE + +WL A +L + F D E G ++ T+ EDP ++ R E D
Sbjct: 513 GFISLYEVTADVQWLDLAKKLTDYTFTKFYDTESGMFYFTSSEDPKLVARSVEYRDNVIA 572
Query: 560 SGNSVSVINLVRLA 573
S N++ N+ L
Sbjct: 573 SSNAIMAQNIFVLG 586
>gi|408680345|ref|YP_006880172.1| Thymidylate kinase [Streptomyces venezuelae ATCC 10712]
gi|328884674|emb|CCA57913.1| Thymidylate kinase [Streptomyces venezuelae ATCC 10712]
Length = 676
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 233/702 (33%), Positives = 342/702 (48%), Gaps = 87/702 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ +A L+N+ FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 51 SSCHWCHVMAHESFEDDAIAGLVNEHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD P GTYFPPE ++G P F +L VKDAW +RD + + ++ L+ +
Sbjct: 111 VFLTPDAAPFYFGTYFPPEPRHGMPSFPEVLEGVKDAWADRRDEVGEVAERIVKDLAGRS 170
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L+ +EL Q L L++ YD+ GGFG APKFP + ++ +L H +
Sbjct: 171 LAYGGEGVPGEEELAQALL-----GLTREYDATRGGFGGAPKFPPSMTLEFLLRHHAR-- 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RY+VD W VPHFEKMLYD L
Sbjct: 224 -TGAEG----ALQMAADTCEAMARGGIYDQLGGGFARYAVDRAWVVPHFEKMLYDNALLC 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y + T + + D++ R++ P G SA DADS +G R EGA+Y
Sbjct: 279 RAYAHLWKATGSDLARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYY 336
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT ++ ++LG E A L HY + G F+ + +++L + A
Sbjct: 337 VWTPAQLTEVLGAEDAALAAAHYGVTEAGT------------FEHGSSVLQLPQQAGPAE 384
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ + +L R +R RP DDKV+ +WNGL I++ A +
Sbjct: 385 A---------DRIASIAARLLAAREERERPGRDDKVVAAWNGLAIAALAETGALF----- 430
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
DR + +E A AA + R DE RL + ++G + G L+DYA
Sbjct: 431 -----------DRPDLVERATEAADLLVRVHMDESA-RLTRTSKDGRAGTNAGVLEDYAD 478
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKED 553
+ G L L WL +A L + + LDR EGG ++T + +++ R ++
Sbjct: 479 VAEGFLALAAVTGEGAWLEFAGFLLD----IVLDRFTAEGGALYDTAHDAEALIRRPQDP 534
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL----- 608
D A PSG + + L+ S A + SD +R AE +L V +K + P
Sbjct: 535 TDNATPSGWTAAAGALL---SYAAHTGSDAHRAAAEGALGV----VKALGPRAPRFIGWG 587
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+ + +L P + + +VG F+ + A + + P D+EE
Sbjct: 588 LAVSEALLDGP--REIAVVGAPGDEVFQELRRTALRATAPGAVLASGAP-DSEEFPL--- 641
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A A VC++F+C PVTDP L L
Sbjct: 642 -------LGDRPLVAGGAAAYVCRHFTCDAPVTDPEELRRKL 676
>gi|120434573|ref|YP_860266.1| hypothetical protein GFO_0204 [Gramella forsetii KT0803]
gi|117576723|emb|CAL65192.1| protein containing DUF255 [Gramella forsetii KT0803]
Length = 682
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 203/580 (35%), Positives = 300/580 (51%), Gaps = 52/580 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFEDE VA+L+N ++ IKVDREERPDVD+VYM VQ + G GGWP+++
Sbjct: 57 CHWCHVMEHESFEDEAVAELMNVNYICIKVDREERPDVDQVYMNAVQIMTGMGGWPMNIV 116
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTYF E + L+++ ++ + + L + E+L + L
Sbjct: 117 ALPDGRPVWGGTYFRKEQ------WMEALQQISHLFNSQPEKLLEYA----EKLEQGLKQ 166
Query: 142 SASSNKLPDE-LPQNALRL-CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ ++ P + E+ +S+D + GG+ +PKF P + +L ++ + D
Sbjct: 167 IQIIEPVKEQNKPHKDFFIPIIEKWKRSFDPKNGGYQRSPKFMMPNNYEFLLRYAFQNSD 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
E + L TL ++ GG+ D + GGF RYSVDE+WHVPHFEKMLYD QL
Sbjct: 227 -------KELKSHCLLTLNRISWGGVFDPIEGGFSRYSVDEKWHVPHFEKMLYDNAQLVQ 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y + +TK+ +Y + + L ++ +M G +SA DADSA G +K+EGA+YV
Sbjct: 280 LYSKTYKITKNNWYKEVVKQTLQFISAEMTDESGAFYSALDADSANENG--KKEEGAYYV 337
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT + ++ ILG +F E+Y + G + VLI + L
Sbjct: 338 WTKENLKSILGNEFEIFSEYYNINNYGKWEADNY-----------VLIRTKSLDQLSQDL 386
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+P E + +C KL +SKR +P LDDK + SWN L+IS + A K ++
Sbjct: 387 DIPREDLQQRIAQCNLKLKKAKSKREKPGLDDKSLTSWNALMISGYTEAYKAFRN----- 441
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
EY+E AE A+FI + E RL HS++NG S G+L+DYAF IS
Sbjct: 442 -----------GEYLEAAEKNAAFILENQLQENG-RLYHSYKNGKSTINGYLEDYAFSIS 489
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LDLYE ++L A L + D+ F D G YF T+ +D ++ + E D P
Sbjct: 490 AFLDLYECTFEQEYLGRARNLIDVTDKDFTDSVSGLYFFTSDKDRELVTKTIEISDNVIP 549
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
+ NS N+ R + K Y AE L + ++
Sbjct: 550 ASNSEMAKNIFRFGKLTGDMK---YVGKAEKMLQIVMDKI 586
>gi|300024782|ref|YP_003757393.1| hypothetical protein Hden_3279 [Hyphomicrobium denitrificans ATCC
51888]
gi|299526603|gb|ADJ25072.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
ATCC 51888]
Length = 678
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 227/694 (32%), Positives = 340/694 (48%), Gaps = 78/694 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED G A+++N++F++IKVDREERPD+D +YM + L GGWPL++
Sbjct: 50 ACHWCHVMAHESFEDPGTAEVMNEFFINIKVDREERPDIDAIYMGALHQLGEQGGWPLTM 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL D KP GGTYFP E +YGRP F T+L ++ +A+ +RD + + E L AL
Sbjct: 110 FLDSDAKPFWGGTYFPREARYGRPAFVTVLLRIAEAYANQRDDVRNN----TEALLAALK 165
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ N P + P+ A A +S++ D +GG APKFP+ I +L+
Sbjct: 166 TAPGDNA-PRQ-PRPATEDVAAAISRAVDREYGGLSGAPKFPQ-WSIFWLLWR------V 216
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G + ++ + V+ TL+ + +GGI+DH+GGGF RYSVDE W VPHFEKMLYD L ++
Sbjct: 217 GIRDDNADAKNGVITTLRHICQGGIYDHLGGGFSRYSVDEYWLVPHFEKMLYDNALLIDL 276
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+ + T+D + + + ++ R+MIG G ++ DADS EG +EG FYVW
Sbjct: 277 MTEVWRETQDPLFKTRVAETIAWIEREMIGEAGGFAASLDADS---EG----EEGKFYVW 329
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+ E+ED+LG E A F Y + P GN F+G +L L L
Sbjct: 330 NADEIEDVLGAEDAAFFSRVYGVVPGGN------------FEGHTILNRLG-------SL 370
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
E+ L R KL + R+ R RP DDK++ WNGL I++ +RA+ +L+ A
Sbjct: 371 AFLSEEDEARLTSLRAKLLERRASRIRPGWDDKILADWNGLAIAAISRAAIVLEQPA--- 427
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
++ +AE A S I L RL H++R+G +KAP DYA +
Sbjct: 428 -------------WLALAERAFSAITTKLA-ASDGRLFHAYRSGLAKAPATASDYANMTW 473
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+ L+ ++L A + D+ + D + GGYF + V++R+K D A P
Sbjct: 474 AAIRLFTATGSERYLDQAQQWTRILDKHYWDEDRGGYFTAADDTLDVVVRLKSATDDAAP 533
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA--ADMLS 617
+ N++ + NL+ LA++ + D + + A P+ CA A L
Sbjct: 534 NANAIQLSNLIALAALTGDAAYDDRARRLSQAFA-------SAVAHTPISHCALLAAELD 586
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
V + D +L + I P E + E + ++
Sbjct: 587 ADRVVQVAIQAPPGPCDLRG---------ELQRLSI---PGALEFVGLSEAQSGQSSLFG 634
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
+ K A VC CS P+ +P L LL
Sbjct: 635 GKSMIDGKSTAYVCVGPVCSAPIQEPEKLRQALL 668
>gi|386826330|ref|ZP_10113437.1| thioredoxin domain-containing protein [Beggiatoa alba B18LD]
gi|386427214|gb|EIJ41042.1| thioredoxin domain-containing protein [Beggiatoa alba B18LD]
Length = 700
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 222/697 (31%), Positives = 340/697 (48%), Gaps = 64/697 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESFED A+++N+ F++IKVDREERPD+DK+Y Q L GGWPL
Sbjct: 56 SACHWCHVMAHESFEDPETAQVMNELFINIKVDREERPDLDKIYQMAHQILTRRAGGWPL 115
Query: 79 SVFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA---QSGAFAIEQ 134
++FL+PD P GGTYFP E ++ P FK IL +V + + + R + Q A AIE
Sbjct: 116 TMFLTPDAHYPFFGGTYFPKEPRFNLPAFKNILYRVAEFYRQNRHGIVEQCQQLAQAIEY 175
Query: 135 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
+ S + EL L +Q+ +S+DS +GGF APKFP ++ + +H
Sbjct: 176 HDTPRTEGVSITTISPEL----LNTARQQIEQSFDSEWGGFSKAPKFPHLTNVERLFHHY 231
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
E +G ++ + TL MA GGI+D VGGGF RYSVD+ W +PHFEKMLYD
Sbjct: 232 HITAHQENPDE--DGLQIAMHTLTRMALGGIYDQVGGGFCRYSVDDYWMIPHFEKMLYDN 289
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
+Y +A+ L K Y + + D++ R+M G +S DADS EG E
Sbjct: 290 APFLTIYSEAWQLAKIPLYKQVAQATADWVLREMQLSEGGFYSTLDADS---EGV----E 342
Query: 315 GAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
G FYVWT +E++ +L E F + L N + + L +D
Sbjct: 343 GKFYVWTPEEIKGLLSPELYAPFAYQFGLNRPANFEETHWH-----------LFGWHDRE 391
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A A K + LE+ L + LF R +R P D+K++ +WNG++I + A A +I K
Sbjct: 392 AVAVKFDLSLEEVNARLDKALAILFQAREQRVHPQRDEKILTAWNGMMIKALATAGRIFK 451
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
R +Y+ AE + +FIR L+ + +L ++++G + +LDD
Sbjct: 452 ----------------RTDYIHAAEQSLNFIRSTLW--KNGKLLATYKDGKAHLNAYLDD 493
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAFLI G+L L + + +EL + F D+E GG+F T ++ R+K
Sbjct: 494 YAFLIEGILTLLQCRWNNSDYAFMLELVDVLLHEFEDKEKGGFFFTGNHHEQLIARLKPL 553
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D A PSGN V+ + L RL ++ +D Y + A ++ + ++ +A A + A
Sbjct: 554 ADEAIPSGNGVAAVVLGRLGHLLG---NDEYLRAAARTVNIALPAIEQIAYAHNTLLLAV 610
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
+ P + ++ K +++ A Y + I +E +
Sbjct: 611 EDYLFPPQLIIIRADAKHLAEWQ---AVCQHDYAPQRLCFAIPNHLSEPL---------- 657
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ N + VA +C + CS P+ +LE L
Sbjct: 658 TGVLANCKPQGEAVAYICHGYQCSAPIHSLTALEEAL 694
>gi|357391644|ref|YP_004906485.1| hypothetical protein KSE_47490 [Kitasatospora setae KM-6054]
gi|311898121|dbj|BAJ30529.1| hypothetical protein KSE_47490 [Kitasatospora setae KM-6054]
Length = 687
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 243/699 (34%), Positives = 338/699 (48%), Gaps = 79/699 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDEG A LN+ FV++KVDREERPDVD VYM VQA G GGWP++V
Sbjct: 49 ACHWCHVMAHESFEDEGTAGFLNERFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P GTYFPPE ++G P F+ +L V AW +R + + L+E S
Sbjct: 109 FLTPEKEPFYFGTYFPPEPRHGMPSFRQVLEGVDKAWTGRRAEVGEVAGRISRDLAERAS 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A + + + L +L+KSYD R GGFG APKFP + ++ +L H + T
Sbjct: 169 VYAVGSGVAGVPGEGELGAAVAELAKSYDERRGGFGGAPKFPPSMVLEFLLRHHAR---T 225
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G + +M T + MA+GGIHD +GGGF RY+VD W VPHFEKM YD L V
Sbjct: 226 GSAA----ALRMAGRTCEAMARGGIHDQLGGGFARYAVDATWTVPHFEKMCYDNALLLRV 281
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL + T + + D+L R++ P G SA DADS + E R EGA+Y W
Sbjct: 282 YLHLWRATGEERARRVALSTADFLLRELRTPEGGFASALDADSLD-EATGRTAEGAYYAW 340
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
T +++E +LG A E + + G + G +VL L D
Sbjct: 341 TPEQLERVLGAADAGYAAELFGVTANGTFE-----------HGSSVLQLLADPEDR---- 385
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
++Y ++ R KLF+ RS RP P DDKV+ +WNGL I++ A A +L+
Sbjct: 386 ----DRYESV----RAKLFEARSHRPAPARDDKVVAAWNGLAIAALAEAGALLE------ 431
Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFL 497
R E +E AE AA I HL + RL + R+G + A G L+DYA
Sbjct: 432 ----------RPELVEAAERAADLLIAVHLTPDG--RLLRTSRDGRAGANAGVLEDYADT 479
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L LY + WL A EL + F D G ++T + ++ R ++ D A
Sbjct: 480 AEGFLALYAVTGESSWLQLAGELLDLVLRHFTDEASGALYDTADDAEQLIRRPQDPTDNA 539
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMCC 611
PSG + + L+ A+ + SD +R AE +L + T R +AV
Sbjct: 540 TPSGWTAAAGALLTYAAY---TGSDRHRTAAERALGIVSTLGTRAPRFTGWGLAV----- 591
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
A +L P + V +VG + AA + V +P DTE
Sbjct: 592 AEALLDGP--REVAVVGAPDDPARAALHLAALRATAPGAVVAVGEPGDTE---------- 639
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC++F+C P D L + L
Sbjct: 640 -VPLLADRPLLDGRPAAYVCRHFACERPTADAADLADRL 677
>gi|312143535|ref|YP_003994981.1| glutamate--cysteine ligase [Halanaerobium hydrogeniformans]
gi|311904186|gb|ADQ14627.1| putative glutamate--cysteine ligase/putative amino acid ligase
[Halanaerobium hydrogeniformans]
Length = 647
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 193/579 (33%), Positives = 305/579 (52%), Gaps = 68/579 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFEDE VA++LN +F+SIKVDREERP++D +YM Q + G GGWPLS+
Sbjct: 51 TCHWCHVMEKESFEDEEVAQMLNQFFISIKVDREERPEIDSLYMDVCQTMTGSGGWPLSI 110
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++ D KP TY P E+KYGR G TIL ++ W ++R L Q+ + LS+
Sbjct: 111 FMTADKKPFYAATYIPKENKYGRKGLLTILPEIHYLWTEERKKLLQASENIVSHLSKINQ 170
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ EL N E + +YD ++GGFGS+PKFP + +L++ KK T
Sbjct: 171 NQKA------ELASNIFEKTVEAIESNYDHQYGGFGSSPKFPMYQYLLFLLHYWKK---T 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ S ++ TLQ M GGI+D + GFHRYS D W +PHFEKMLYDQ + +
Sbjct: 222 GEDKYLS----ILETTLQQMRAGGIYDQLAFGFHRYSTDREWKMPHFEKMLYDQALMIYI 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y A+ T Y+ + ++I+ +L +M+ G F+A DADS +EG +Y+W
Sbjct: 278 YTAAYQATAKEIYADVVKEIVSFLESEMLAKEGAFFTAIDADSG-------GEEGKYYLW 330
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
E++ IL E +R++ + KN+ + L +
Sbjct: 331 EKSELKSILNE----------------AQFNRLNKIFDIQANKNINLSLKN--------- 365
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
++ Y N L E + KL R +R P D K++ WNGL+I++ A+A +LK
Sbjct: 366 --VQDY-NQLAELKDKLLKHRKERIHPSKDKKILTDWNGLLIAALAKAGFVLK------- 415
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
DR Y+++A+ FI ++ + RL HS+ G L+DY+FL+ G
Sbjct: 416 -------EDR--YLKLADDVEKFIHNNMKTNKG-RLAHSYYEGEKSKIDNLNDYSFLLWG 465
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L++LY+ ++L+ A + E F D++ ++ + ++ + ++ +D + PS
Sbjct: 466 LIELYQATLKDEYLIKAEKTAKIMKEYFWDQKEEAFYFSAKDNEDLFIKQINANDHSLPS 525
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
NS++ N ++LA + Y+++A+ +A F ++
Sbjct: 526 ANSIAAFNFLKLAHLKDNLA---YQKDAQKIIAAFSDQI 561
>gi|395774413|ref|ZP_10454928.1| hypothetical protein Saci8_31786 [Streptomyces acidiscabies 84-104]
Length = 682
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 235/704 (33%), Positives = 338/704 (48%), Gaps = 90/704 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A LN+ FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDQHTADYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-A 138
VFL+PD +P GTYFPPE ++G P F+ +L V+ AW +RD +A+ + L E
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGSPSFRQVLEGVRQAWTGRRDEVAEVAGKIVRDLGERE 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
LS + +EL L L++ YD + GGFG APKFP + I+ +L H +
Sbjct: 168 LSFGDAQPPGEEELAAALL-----GLTREYDPQRGGFGGAPKFPPSMVIEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGSEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T I + D++ R++ P G SA DADS +G + EGA+Y
Sbjct: 276 RVYAHLWRSTGSELARRIALETADFMVRELRTPEGGFASALDADS--DDGTGKHVEGAYY 333
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
VWT E+ D LGE A L ++ + G + +G +VL + + A
Sbjct: 334 VWTMAELRDTLGEDADLAAHYFGVTEDGTFE-----------EGASVLQLPQTEGVFDAD 382
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K + +L R++RP P DDK++ +WNGL I++ A
Sbjct: 383 K-----------IASIHARLLAKRAERPAPGRDDKIVAAWNGLAIAALAETGAYF----- 426
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
DR + +E A +AA + R D+ H + S P G L+DY +
Sbjct: 427 -----------DRPDLIEAALTAADLVVRIHLDDHAHLSRTSKDGQPGANAGVLEDYGDV 475
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L L + WL +A L + F D E G ++T + ++ R ++ D A
Sbjct: 476 AEGFLALAAVTAEGVWLDFAGLLLDHVLARFTDPESGALYDTASDAEQLIRRPQDPMDNA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCA 612
PSG + + L S A + ++ +R AE +L V +K + VP + A
Sbjct: 536 TPSGWTAAASA---LLSYAAHTGAEPHRTAAEKALGV----VKALGPRVPRFIGWGLSVA 588
Query: 613 ADMLSVPSRKHVVLVGHK------SSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
+L P + V +V + ++ + +LA A + V+ D++E
Sbjct: 589 EALLDGP--REVAVVARELTDPAGKNLHRQALLATAPGA------VVAYGVTDSDEFPL- 639
Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A S + A VC+NF+C P TDP L L
Sbjct: 640 ---------IADRPLSGSEATAYVCRNFTCDLPTTDPDRLRTAL 674
>gi|154150757|ref|YP_001404375.1| hypothetical protein Mboo_1214 [Methanoregula boonei 6A8]
gi|153999309|gb|ABS55732.1| protein of unknown function DUF255 [Methanoregula boonei 6A8]
Length = 723
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 231/714 (32%), Positives = 343/714 (48%), Gaps = 62/714 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + R FL + CHWCHVM ESFE+ VA +LN FV IKVDREERPD
Sbjct: 54 GGEAFSRAKREDRPLFLSIGYSACHWCHVMARESFENNEVAGILNKHFVCIKVDREERPD 113
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD VYM Q L G GGWPL++ ++P+ KP GTYFP + G PG IL + + W+
Sbjct: 114 VDSVYMGICQQLTGQGGWPLTIIMTPEKKPFFAGTYFPKTGRAGMPGLTDILITIANLWE 173
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+RD L A A + LS+A S + PD ++ L +L+ +DS GGFG A
Sbjct: 174 TRRDELY---AAAEQILSDAHLLHKSPSGDPD---RHLLDKGFRELAAQFDSANGGFGRA 227
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P I +L + + +GE + M TL + +GGI DHVGGG HRY+
Sbjct: 228 PKFPAPHNILFLLRYWQ------MTGE-NRALDMAEQTLDAIRQGGIWDHVGGGMHRYAT 280
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D RW VPHFEKML DQ L +A++ T + Y I + + Y+ R++ PGG ++A
Sbjct: 281 DARWLVPHFEKMLSDQAMLVLASTEAYAATGKIRYRTIAEECIAYVLRELRDPGGGFYTA 340
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
EDADS EGA+Y+WT +E+ ILG A + L P P +
Sbjct: 341 EDADSP-------AGEGAYYLWTEEEIARILGLDAAFASILFSLTPL----------PGS 383
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
E K +++ LG+ ++ ++ R+L R KRP+P D K++ N
Sbjct: 384 E-KHASIISAAGPDPVLLKNLGITEQELISRRAGILRRLAHEREKRPKPARDTKILTDTN 442
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
L ++ ARA ++L + + Y + A F+ +++ + + L H
Sbjct: 443 ALFCTALARAGRVLGNPS----------------YTDAAACTLRFLLQNMRNGEGRILHH 486
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
S G PGF DDYA L++ ++LY+ S + A+ + + D+EGGG+F
Sbjct: 487 S-GGGEHAVPGFADDYAHLVAAHIELYKATSDIACIKEAVTINALLLTHYRDKEGGGFFT 545
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
T + ++ KE +DGA PS N+ + NL L + +D + + A
Sbjct: 546 TADTAVDLPVQKKEWYDGAVPSANTTAFENLTALYRLTG---NDVFNEAALECARFITGA 602
Query: 599 LKDMAMAVP--LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
AV L A L+ + + +V+ G ++ + +LA A Y L +I +
Sbjct: 603 ASRAPHAVTGFLAALACSPLT-GNTQDLVIAGDPANAGTQTLLAVARRQY-LPGLLILLR 660
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
P +E ++ + K A +C +C PPV+DP L N L
Sbjct: 661 PPGKAG----DEVDTVFPVVQGKVPHEGKATAYLCTGLACLPPVSDPQELVNQL 710
>gi|110638981|ref|YP_679190.1| hypothetical protein CHU_2595 [Cytophaga hutchinsonii ATCC 33406]
gi|110281662|gb|ABG59848.1| conserved hypothetical protein; thioredoxin domain [Cytophaga
hutchinsonii ATCC 33406]
Length = 681
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 201/557 (36%), Positives = 294/557 (52%), Gaps = 49/557 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME E FE E VA ++ND F++IK+DREERPD+D++YM V A+ GGWPL+
Sbjct: 56 SACHWCHVMEHECFEKEEVAAVMNDLFINIKIDREERPDLDQIYMDAVSAMGLRGGWPLN 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD KP GGTYFP + + +L ++ +A+ R+ + +S E L+++
Sbjct: 116 VFLTPDAKPFYGGTYFPQDH------WLNLLGQISNAYLNHREDILKSAESFTESLNQSD 169
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ ++ L L +++S+ +D+ GG APKFP P + LY +
Sbjct: 170 VFKYGLVDDAETFHKDELDLAYDRISQQFDTDMGGMNKAPKFPMP---SIYLYLLRDYAL 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ G + V TL MA GGI+D +GGGF RYSVD W PHFEKMLYD GQL +
Sbjct: 227 TGRQGSL----QHVELTLDKMAMGGIYDTIGGGFARYSVDGAWFAPHFEKMLYDNGQLLS 282
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y +A+++TK Y + + +L+R+M+ P G +SA DADS EG EG FY
Sbjct: 283 LYSEAYTVTKKPLYKEVIEETYTWLKREMLSPEGGFYSALDADS---EGV----EGKFYC 335
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W +E+ ++ E LF +Y + GN + G N+L + A A+
Sbjct: 336 WQYEELAQLIQEDFALFCAYYAITENGNWE-----------HGMNILYKRMSDEAFAAAH 384
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ E + + LF R R P LDDK++ SWNG+++ A +IL ++A
Sbjct: 385 SISAEALRESVSRWKNILFSERDPREHPGLDDKILASWNGIMLKGLCDAYRIL---GDAA 441
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ N ++ A FI LYD +T L HS++N + PGFL+DY +I
Sbjct: 442 ILNTALMN-------------AEFILTKLYDGKT--LFHSYKNKKATIPGFLEDYTHVID 486
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
G L LYE +WL AI L N + F D + G +F T+ ++ R KE D P
Sbjct: 487 GYLALYEVSLDEQWLRQAITLVNHVIDHFYDDDEGLFFYTSRTSEKLIARKKEIFDNVIP 546
Query: 560 SGNSVSVINLVRLASIV 576
+ NS NL L ++
Sbjct: 547 ASNSSLARNLYHLGKLL 563
>gi|431797737|ref|YP_007224641.1| thioredoxin domain-containing protein [Echinicola vietnamensis DSM
17526]
gi|430788502|gb|AGA78631.1| thioredoxin domain protein [Echinicola vietnamensis DSM 17526]
Length = 678
Score = 338 bits (868), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 207/557 (37%), Positives = 296/557 (53%), Gaps = 55/557 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE AK++N FV IK+DREERPD+D +YM VQ++ GGWPL+
Sbjct: 51 SACHWCHVMEHESFEDEATAKIMNAHFVCIKIDREERPDLDNIYMDAVQSMGLQGGWPLN 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS--GAFAIEQLSE 137
VFL P+ KP GGTYFP P +K +L+ + +A+ D LA+S G +L E
Sbjct: 111 VFLMPNQKPFYGGTYFP------NPNWKGLLQNIAEAYATHHDELAKSAEGFGNSIKLKE 164
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ + P L L A++++ D ++GGF +PKFP P +L ++
Sbjct: 165 REKYRLADD--PSRLTAEDLTHMAQKIASQMDPQWGGFNRSPKFPMPAVWDFLLRYA--- 219
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
G+AS +K VLFTL + GGI+DH+ GGF RYSVD W PHFEKMLYD GQL
Sbjct: 220 ---ALKGDASLIEK-VLFTLTKIGMGGIYDHLRGGFARYSVDSEWFAPHFEKMLYDNGQL 275
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++Y AF L+ D + + +++L+ +M+ G ++A DADS EG +EG F
Sbjct: 276 LSLYAKAFQLSGDALFKEKINETVNWLQAEMLQEEGGFYAALDADS---EG----EEGKF 328
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
Y WT E+E +L + F E + + GN + KG N+L + + A
Sbjct: 329 YTWTHDELESMLDDEDAWFYECFNISEKGNWE-----------KGVNILFQTHTYEEIAH 377
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K G+ E+ L E + +L +R+ R P LDDKVI WNGL IS A+A +
Sbjct: 378 KHGLEEEQLAQNLNEVKERLLKIRNLRTPPGLDDKVIAGWNGLTISGLAQAYWATAN--- 434
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRH-LYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
P+ S +A +FI H L EQ +R S++NG + P FL+DYA
Sbjct: 435 ------PLAKS-------LAIQNGTFILDHMLKGEQLYR---SYKNGEAYTPAFLEDYAA 478
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+I G + LY+ S +WL+ A L E F D + G ++ + +++ KE D
Sbjct: 479 IIQGFIHLYQLTSEPRWLLVAKRLTAFVLEHFFDEDDGLFYFNNPDSETLIANKKEIFDN 538
Query: 557 AEPSGNSVSVINLVRLA 573
PS N++ NL +L
Sbjct: 539 VIPSSNALMATNLHQLG 555
>gi|255531347|ref|YP_003091719.1| hypothetical protein Phep_1443 [Pedobacter heparinus DSM 2366]
gi|255344331|gb|ACU03657.1| protein of unknown function DUF255 [Pedobacter heparinus DSM 2366]
Length = 670
Score = 338 bits (867), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 209/585 (35%), Positives = 300/585 (51%), Gaps = 60/585 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE+ VA+++N FV IKVDREERPD+D++YM +Q + G GGWPL+
Sbjct: 51 SACHWCHVMERESFENHEVAEVMNRHFVCIKVDREERPDIDQIYMLAIQLMTGSGGWPLN 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
PD +P+ GGTYF D + +L V W + D ++ A+A ++L++ +
Sbjct: 111 CICLPDQRPIYGGTYFRKAD------WVNVLESVAAMWANEPD---KAIAYA-DRLTDGI 160
Query: 140 SASASSNKLP----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+ +P DE + L E + +D GG+ APKFP P Q ML +S
Sbjct: 161 QNA--EKIIPQIKVDEYTKAHLTAITEPWKRYFDMAEGGYNRAPKFPLPNNWQFMLRYSH 218
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
++D A L TL+ MA GGI+DHV GGF RYSVD WHVPHFEKMLYD G
Sbjct: 219 LMQDDATHVSA-------LLTLEKMAMGGIYDHVAGGFSRYSVDGDWHVPHFEKMLYDNG 271
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
QL ++Y +A+ ++ + + + + +++L R+M+ P G ++A DADS EG EG
Sbjct: 272 QLISLYAEAYQYSRSLLFKEVAEESIEWLEREMMSPEGLFYAALDADS---EGV----EG 324
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVW + E +LG+ A L +++ + GN E + N+L+
Sbjct: 325 KFYVWDKPDFEAVLGDDADLLSDYFNVTDEGNW----------EEEQTNILLRKFTEEEY 374
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A G+ + + L + + KL RSKR RP LDDK + +WN + I A +++I
Sbjct: 375 AEVKGISVVELLQKIKTAKIKLLQERSKRIRPGLDDKCLTAWNAMAIKGLAESAEIF--- 431
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
D Y E+A+ AASFI H+ + L +F+N + PGFLDDYA
Sbjct: 432 -------------DHPHYYEMAKKAASFILAHV-NTADGGLYRNFKNDKASIPGFLDDYA 477
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F I L+ LYE WL A L + F D F T+ +++ R E D
Sbjct: 478 FFIEALIALYEADFDENWLKEAKRLCDYVLLNFEDEHSPMLFYTSAAGETLIARKHEIMD 537
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
P+ NSV NL +L + D Y AE LA ++K
Sbjct: 538 NVVPASNSVMAQNLHKLGLLF---DEDVYSIKAEEMLAAVLPQIK 579
>gi|116754985|ref|YP_844103.1| hypothetical protein Mthe_1697 [Methanosaeta thermophila PT]
gi|116666436|gb|ABK15463.1| protein of unknown function DUF255 [Methanosaeta thermophila PT]
Length = 669
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 234/694 (33%), Positives = 344/694 (49%), Gaps = 91/694 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFEDE +A++LN FV +KVDREERPD+D +YM Q + G GGWPL+
Sbjct: 51 STCHWCHVMARESFEDERIAEMLNRAFVCVKVDREERPDIDAIYMEACQIITGRGGWPLT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ +SPD P TY P + + G G + ++ V++ W +R L G + + +A
Sbjct: 111 IIMSPDGIPFFAATYIPKDGRLGMMGLRELIPLVEELWRNRRSELTSLGFKVLNAMRKAD 170
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ +SN L + L +LS +D GGFG APKFP Q +L+ +
Sbjct: 171 THLQASNADESTLSRAYL-----ELSGIFDWTSGGFGRAPKFPLA---QNLLFLLRYWHR 222
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ + +MV TL+ M GGI+D + GFHRYS D W VPHFEKMLYDQ ++
Sbjct: 223 TGE----MKALEMVELTLREMRCGGIYDQLAYGFHRYSTDSSWGVPHFEKMLYDQALMSV 278
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VYL+A+ T Y+ + +IL ++ D+ P G SA DA+S EG +Y+
Sbjct: 279 VYLEAYQATGKRDYAIVADEILGFVAEDLRSPDGAFCSALDAESDNI-------EGGYYL 331
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASK 378
WT ++ D LG+ E + L+P G D GKNVL I L +
Sbjct: 332 WTMDQLRDALGDDLKKALEVFVLEPIGGSD------------GKNVLRISLKGELSEFKH 379
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
P+ RRKL D RS R +P D+KV+ WNGL+I++F+R +++L E
Sbjct: 380 TSEPI----------RRKLLDARSLRRKPFRDEKVLADWNGLMIAAFSRGAQVLGDE--- 426
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
++ +A AA F+ ++ + L HS++ LDDYAFLI
Sbjct: 427 -------------RWLRIASEAADFVLSSMHRDGM--LMHSYKGSRVS---ILDDYAFLI 468
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GL++LY+ G ++L A L + F D +GG Y+ T E ++L+ KE DGA
Sbjct: 469 FGLIELYQAGFDGRYLERAEILCDEMVSHFSDPDGGFYY-TMKEQSDIILQRKEIRDGAI 527
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEH--SLAVFETRLKDMAMAVPLMCCAADML 616
PSG S++ ++++ L I+ R + E S+++ + + V L+ A D+
Sbjct: 528 PSGYSMATMDMLLLGKILG-------RPDLEEIASMSLRHISMASLPAQVGLL-IALDLA 579
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
PS + + +VG + ML A + Y K V+ D AS
Sbjct: 580 LGPSHE-IAIVGDADNT--RTMLRALWSVYAPRKVVVSGD------------RPPEWASS 624
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
R K A VC ++CS P TD S+ LL
Sbjct: 625 LRP--VDKKATAYVCSRYTCSFPATDIRSMIELL 656
>gi|313667030|gb|ADR72969.1| DUF255 family protein [Streptomyces sp. OH-4156]
Length = 673
Score = 338 bits (866), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 234/703 (33%), Positives = 344/703 (48%), Gaps = 89/703 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A L+N+ FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDDATAALVNENFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD P GTYFPPE ++G P F +L VK AW +RD + + ++ L+
Sbjct: 108 VFLTPDAAPFYFGTYFPPEPRHGMPSFPEVLEGVKGAWSDRRDEVGEVAERIVKDLA-GR 166
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S + + +P +EL Q L L++ YD+ GGFG APKFP + ++ +L H +
Sbjct: 167 SLAYGGDGVPGEEELAQALL-----GLTREYDATHGGFGGAPKFPPSMTLEFLLRHHAR- 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T + MA+GGI+D +GGGF RY+VD W VPHFEKMLYD L
Sbjct: 221 --TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYAVDRAWVVPHFEKMLYDNALL 274
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y + T + + D+L R++ P G SA DADS +G R EGA+
Sbjct: 275 CRAYAHLWKATGSDLARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGAY 332
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVWT ++ ++LG E A L HY + G F+ + +++L + +A
Sbjct: 333 YVWTPAQLTEVLGAEDAALAAAHYGVTEDGT------------FEHGSSVLQLPREAGTA 380
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ +L R +R RP DDKV+ +WNGL I++ A +
Sbjct: 381 DA---------GRIASIAARLLAAREERERPGRDDKVVAAWNGLAIAALAETGALF---- 427
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
DR + +E A AA + R DE RL + ++G + G L+DYA
Sbjct: 428 ------------DRPDLVERATEAADLLVRVHMDESA-RLTRTSKDGRAGTNDGVLEDYA 474
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKE 552
+ G L L WL +A L + L +DR EGG ++T + +++ R ++
Sbjct: 475 DVAEGFLALAAVTGEGAWLDFAGFLLD----LVIDRFTAEGGALYDTAHDAEALIRRPQD 530
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---- 608
D A PSG + + L+ S A + SD +R AE +L V +K + P
Sbjct: 531 PTDNATPSGWTAAAGALL---SYAAHTGSDAHRAAAEGALGV----VKALGPRAPRFIGW 583
Query: 609 -MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
+ + +L P + + +VG F+ + A + V+ D+EE
Sbjct: 584 GLAVSEALLDGP--REIAVVGAPGDEAFQELRRTALLAT-APGAVLAFGAPDSEEFPLLR 640
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + A A VC++F+C PVTDP +L L
Sbjct: 641 DRPLVSGGPA----------AYVCRHFTCDAPVTDPDALRRKL 673
>gi|374585294|ref|ZP_09658386.1| hypothetical protein Lepil_1460 [Leptonema illini DSM 21528]
gi|373874155|gb|EHQ06149.1| hypothetical protein Lepil_1460 [Leptonema illini DSM 21528]
Length = 685
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 230/701 (32%), Positives = 349/701 (49%), Gaps = 81/701 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED+ A LLN+ +V+IKVDREE PDVD +YM + A+ GGWPL++
Sbjct: 51 TCHWCHVMERESFEDQSTADLLNEHYVAIKVDREELPDVDSIYMKALHAMGQPGGWPLNL 110
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P+ GGTYFPP+ +GRP FK +L + W R L ++ + E L+E
Sbjct: 111 FLTPDRRPITGGTYFPPQPAHGRPSFKQMLGTLAQMWKNDRPRLLEAASSITEFLNE--- 167
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF-GSAP-KFPRPVEIQMMLYHSKKLE 198
+A ++ LPD P R E + +++D + GGF G+ P KFP + + ++L +L
Sbjct: 168 QNALASDLPD--PSIFARFIGE-MEQAFDVQRGGFYGNGPNKFPPSMALMLLL----RLH 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ + G +S MV TL+ M++GGI+D +GGG RYS D W VPHFEKMLYD
Sbjct: 221 ERDRQGSSSV-LVMVEKTLEAMSRGGIYDQLGGGLCRYSTDPAWLVPHFEKMLYDNALFL 279
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+A+ +T + FY + D++ YLRRD++ P G + AEDADS EG EG FY
Sbjct: 280 QALTEAYRITGNDFYRRMAYDVIAYLRRDLMSPEGAFYCAEDADS---EGV----EGKFY 332
Query: 319 VWTSKEVEDILGEHAI------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
VW++ E + L + L ++ + GN F+GKN+L
Sbjct: 333 VWSAAEFRETLRSSGLSDDEIRLLSLYWNVTEAGN------------FEGKNILHLTGSD 380
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
AS+ + L + + R+ LF VR +R RP DDK++ SWN L+IS+ +RAS +
Sbjct: 381 EDFASQHSLTLTSLNEMTQKARQALFAVRERRIRPLRDDKILTSWNALMISALSRASIVF 440
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+ + M A + A F+ HL Q +L +R+G ++ L
Sbjct: 441 GDASLADM----------------AVACADFVESHLM--QDGQLMRRYRDGEARFKATLT 482
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIE-LQNTQDELFLDREGGGYFNTTGEDPS--VLLR 549
D+A L L+DL+ + ++ A+E + F D G T ED S + LR
Sbjct: 483 DHALLGCALIDLFRVTGKSVYMRRALERAEAIMSSFFAD----GRLYETAEDDSDDLFLR 538
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
+ +DG PSG S ++ V L+ G + Y + A+ L F A A P M
Sbjct: 539 PIDSYDGVMPSGPSAALRLFVTLSRY--GESARIYEETAKVILRQFSPEWAQAARAYPAM 596
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A S +R+ + + G + L + L+ ++ D+
Sbjct: 597 VSAFLTFSDEARE-IAITGEADFIGQALKLIGSR----LDGDAVYAFSVDS--------- 642
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+S + +A + S + +CQ+F+C P + L+ L
Sbjct: 643 DSPVSLIAGKDRSRSAIY--LCQDFACQTPFSSVQQLDQAL 681
>gi|398893990|ref|ZP_10646420.1| thioredoxin domain-containing protein [Pseudomonas sp. GM55]
gi|398183122|gb|EJM70617.1| thioredoxin domain-containing protein [Pseudomonas sp. GM55]
Length = 662
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 237/691 (34%), Positives = 337/691 (48%), Gaps = 88/691 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE+ +A+L+N+ F++IKVDR+ERPD+D +Y VQ + GGGWPL+V
Sbjct: 49 ACHWCHVMAHESFENPEIARLMNERFINIKVDRQERPDLDDIYQKIVQMMGQGGGWPLTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P +P GGTYFPP++ YGR GF +LR + +AW R L Q+ A + Q A+
Sbjct: 109 FLTPRREPFFGGTYFPPQESYGRAGFPQLLRGLSEAWQNNRAALEQNVAQFL-QGYRAMD 167
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV--EIQMMLYHSKKLE 198
P E Q A A +++ D GG G+APKFP ++ + LY
Sbjct: 168 TQMLEGDTPLEQDQPA--AAARLFARNTDPVHGGLGNAPKFPNVACHDLVLRLYQRLHEP 225
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D +S E TL +A GG++DH+GGGF RY VDE W VPHFEKMLYD GQL
Sbjct: 226 DLLRSLE---------LTLDQVAAGGLYDHLGGGFARYCVDEHWAVPHFEKMLYDNGQLV 276
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+Y DA+ T + + + + +DY+ RDM P G +++EDADS EG +EG FY
Sbjct: 277 KLYADAWRATGEPAWRRVFEETIDYILRDMTHPEGGFYASEDADS---EG----EEGKFY 329
Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT +V+ +LG+ A L + Y + +GN + G VL A+
Sbjct: 330 VWTPAQVQAVLGDPDAALACQAYGVTASGNFE-----------HGTTVL-------HRAA 371
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
L E L L R KL R++R RP D+ ++ SWN L+I A +
Sbjct: 372 TLDTAQEAQLAGL---RDKLLVARAQRIRPGRDENILTSWNALMIQGLCAAYQ------- 421
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ +++ A AA FI L L ++R +K PGFL+DYAFL
Sbjct: 422 ---------ATGTATHLDAARRAADFILDRLSTPDGG-LYRAWREDTAKVPGFLEDYAFL 471
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR--EGGGYFNTTGEDPSVLLRVKEDHD 555
+ LLDLYE +L A L EL L++ E G YF +P ++ R + D
Sbjct: 472 ANALLDLYECEFDQLYLERATRLV----ELILEKFWEDGLYFTPKDGEP-LVHRPRAPQD 526
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A PSG S SV +RL + + + YR+ AE L ++ + A D
Sbjct: 527 NAWPSGTSTSVFAFLRLFEL---TGRELYRERAEQVLTMYRAAAAQNPFGFAHLLAAQDF 583
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ +V+ G +S+ + L A+ L V+ A E++
Sbjct: 584 VQR-GPISIVIAGERSAA---SALVASLQRRYLPARVL----AFAEDVPI---------- 625
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISL 706
A + + A VC+N +C PVT L
Sbjct: 626 GAGRHMLKGQTSAYVCRNRTCENPVTSAAEL 656
>gi|326800931|ref|YP_004318750.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326551695|gb|ADZ80080.1| protein of unknown function DUF255 [Sphingobacterium sp. 21]
Length = 672
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 220/699 (31%), Positives = 349/699 (49%), Gaps = 81/699 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE++ VA+++N ++SIKVDREERPD+D++YMT VQ + GGWPL+
Sbjct: 48 SACHWCHVMERESFENKEVAQVMNRHYISIKVDREERPDIDQIYMTAVQLMTNSGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
PD +P+ GGTYF P D + +L +V+ W + + + E+L++ +
Sbjct: 108 CICLPDGRPVYGGTYFRPAD------WVNVLNQVQALWANEPETAIEYA----EKLAQGI 157
Query: 140 SASAS--SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ S + +K+P++ ++ L+ + +++D GG+ APKFP P L +
Sbjct: 158 TESETFKISKIPEKYSEDDLKEIVKPWQQTFDPIDGGYKRAPKFPLPNNWLFFLRY---- 213
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
G ++ + FTLQ +A GG++D VGGGF RY+VD +WH+PHFEKMLYD QL
Sbjct: 214 ---GHLANDADILEHTHFTLQHIAAGGLYDQVGGGFARYAVDGQWHIPHFEKMLYDNAQL 270
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++Y +A+ + Y + + L ++ R+M G +SA DADS EG EG +
Sbjct: 271 ISLYAEAYLQKPEPLYKRVVEETLQWVDREMTSAEGAFYSALDADS---EGV----EGKY 323
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
Y + E++++LG+ A LF ++ + GN + NVL D+ A
Sbjct: 324 YTFQQDEIDNLLGKDADLFISYFSITAAGNWPEEKT----------NVLKTRLDADKLAE 373
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ G E++ L + ++K+ R +R RP LD+K++ SWN +++ ++ A +
Sbjct: 374 QAGYSKEEWETYLKDIKKKIRHYREQRIRPGLDNKILTSWNAMMLKAYIDAYRTF----- 428
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ---THRLQHSFRNGPSKAPGFLDDY 494
++KEY+ VAE A FI R L E+ H+ Q F+ FLDDY
Sbjct: 429 -----------NKKEYLTVAERNAHFILRKLITEEGTLLHQPQTPFKT----ITAFLDDY 473
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
AF+I + LYE WL A L + F DR+ G ++ T+ ++ R E
Sbjct: 474 AFVIEAFIALYEVTFNKAWLDQAKSLADYTLAQFYDRQAGAFYYTSDLTEVLITRKFEIM 533
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D PS NSV L +L I S Y++ A LA +++ A A
Sbjct: 534 DNVIPSSNSVMAHQLNKLGVIFEDST---YKEIAAQLLANVFPQIRTYGSAYS--NWAIR 588
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
+L H + + S D +A Y NK ++ EE N
Sbjct: 589 LLEEVYGFHEIAITGPQSNDLR--IAIDQKIYSPNKVIL----GGVEE----------NL 632
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
+ RN + ++ + VC+N +CS PV + +ENL+L++
Sbjct: 633 PLLRNRVT-ERSLIYVCKNNTCSLPVDNLKDVENLILKQ 670
>gi|302652658|ref|XP_003018175.1| hypothetical protein TRV_07811 [Trichophyton verrucosum HKI 0517]
gi|291181788|gb|EFE37530.1| hypothetical protein TRV_07811 [Trichophyton verrucosum HKI 0517]
Length = 511
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 197/514 (38%), Positives = 292/514 (56%), Gaps = 32/514 (6%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESF VA +LN F+ IK+DREERPD+D VYM YVQA G GGWPL+VFL+PDL+
Sbjct: 1 MEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLE 60
Query: 88 PLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
P+ GGTY+P + P GF +L K++D W+ ++ +S QL E
Sbjct: 61 PVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEITRQLREFA 120
Query: 140 S-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
+ + ++ ++L + L + YD+ GGF +PKFP PV + +L S
Sbjct: 121 EEGIHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVNLSFLLRLS 180
Query: 195 KKLE---DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ E D E ++ +M + T+ +A+GGI D +G GF RYSV W +PHFEKML
Sbjct: 181 RYPEEVMDIVGREECAKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWSLPHFEKML 240
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGAT 310
YDQ QL +V++D F + + D++ Y+ ++ P G +S+EDADS + T
Sbjct: 241 YDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSTPILSPMGCFYSSEDADSQPSPEDT 300
Query: 311 RKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
K+EGA+YVWT KE++ ILG+ A + H+ + P GN ++R++DPH+EF +NVL
Sbjct: 301 EKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFMNRNVLRIA 358
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARA 428
+ A + G+ E+ + IL R KL + R +KR RP LDDK+IV+WNGLVI + ++
Sbjct: 359 TTPAQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGLVIGALSKC 418
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-NGPSKA 487
+ +L+ + K +A +A FI+ +L+D ++ +L +R +
Sbjct: 419 AILLED----------IDAEKSKHCRLMAGNAVKFIKENLFDAESGQLWRIYRADSRGDT 468
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
PGF DDYA+LISGLL LYE L +A +LQ
Sbjct: 469 PGFADDYAYLISGLLQLYEATFDDAHLQFADKLQ 502
>gi|374852688|dbj|BAL55616.1| hypothetical conserved protein [uncultured gamma proteobacterium]
Length = 723
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 215/566 (37%), Positives = 306/566 (54%), Gaps = 63/566 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + + FL ++CHWCHVME ESFEDE +A +LN FV +K+DRE+RPD
Sbjct: 34 GEEAFAKARREAKPIFLSSGYSSCHWCHVMERESFEDEEIAAILNRDFVPVKLDREQRPD 93
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD VYM VQ L G GGWPLS FL+PD +P GGTYFPP+ FK +L++V +AW
Sbjct: 94 VDAVYMHAVQLLTGHGGWPLSAFLTPDGRPFFGGTYFPPQ------AFKRLLQQVAEAWR 147
Query: 119 KKR-DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 177
+R ++ AQ+ E+L +AL S++ P E+ + ++ +D R GGFG+
Sbjct: 148 SRRAEIEAQA-----ERLKQALLELESTH--PGEIGPETVEAAIAEILAPFDPRHGGFGA 200
Query: 178 APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 237
APKFP + +++ D G+ + ++V TL MA+GG+ D +G GFHRY
Sbjct: 201 APKFPNEPWLALLI-------DELWRGDDPKVLEVVRKTLDAMARGGLCDQIGDGFHRYC 253
Query: 238 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 297
VD + +PHFEKMLY+Q QL +Y A +LTKD ++Y R D++ R++ P G ++
Sbjct: 254 VDAAFQIPHFEKMLYNQAQLGRLYARAAALTKDALFAYAARCTFDFVLRELTAPEGGFYA 313
Query: 298 AEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDP 356
A DADS EG +EG FY+WT +E+ L + A L E + + +GN
Sbjct: 314 AIDADS---EG----EEGKFYLWTPEEIRAALPKDDAELAIELFGVSASGN--------- 357
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
F+GKNVL + A GM E+ L L R++L+ VR +R P DDK++ +
Sbjct: 358 ---FEGKNVLHLPRPLAEIAQAKGMTEEELLACLDRIRQRLYQVRRRRVPPLRDDKIVTA 414
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNG++I++ A A++ +F+ P +Y+ A AA F+ RH Q RL
Sbjct: 415 WNGMMIAALAEAAR---------LFHEP-------KYLLAARRAAEFLSRHHL--QGERL 456
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+ RNG G +DYAFL G L LY+ + WL A L F D G
Sbjct: 457 LRASRNGRPAGEGLQEDYAFLAEGFLALYDVSADPVWLQEAEALTAAMLAQFWDEARGAC 516
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGN 562
F D + +R K+ DGA PSGN
Sbjct: 517 FMNRA-DERLAVRPKDLFDGAYPSGN 541
>gi|114326678|ref|YP_743835.1| thymidylate kinase [Granulibacter bethesdensis CGDNIH1]
gi|114314852|gb|ABI60912.1| thymidylate kinase [Granulibacter bethesdensis CGDNIH1]
Length = 679
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 232/691 (33%), Positives = 330/691 (47%), Gaps = 96/691 (13%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ A +N+ F+ IKVDREERPD+D +YM+ + A+ GGWPL++F
Sbjct: 62 CHWCHVMAHESFEDQATADEMNNAFICIKVDREERPDIDHIYMSALHAMGQQGGWPLTMF 121
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+P+ +P GGTYFPPE ++GRP F+ +L ++DAW +R + Q+ + QL+ A++
Sbjct: 122 LTPEGQPFWGGTYFPPEPRFGRPSFRQVLAAIRDAWATRRSAIEQN----LGQLTRAMNR 177
Query: 142 SASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + P D L NA+ L ++ D GGF APKFP + + ++
Sbjct: 178 LSETAAGPEVDVLLLNAVDAA---LLRNLDPEKGGFTGAPKFP---NAPVFRFFWQEFHR 231
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG+ E V L MA+GGI+DH+GGGF RYS D W VPHFEKM YD GQ+
Sbjct: 232 TGR----PELSDAVHAVLSHMARGGIYDHLGGGFARYSTDAEWLVPHFEKMAYDNGQILE 287
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGP---GGEIFSA-EDADSAETEGATRKKEG 315
+ ++ Y+ + + +L RDM P GG F+A EDADS EG +EG
Sbjct: 288 LLSLGYAQNPTPLYARCIEETVGWLIRDMSVPVEGGGTAFAASEDADS---EG----EEG 340
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY+W E++ +LGE A FK+ + + GN ++G +L L S
Sbjct: 341 RFYIWHEDEIDALLGEAATGFKQAFDVTREGN------------WEGHTILRRLTISP-- 386
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
E + RR LF R RPRP DDKV+ WNGLVI RA+ L
Sbjct: 387 --------EADAESWAQERRILFQSRENRPRPGRDDKVLADWNGLVIVGLVRAAIAL--- 435
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
DR +++ AESA +R L E R+ H++R G A G LDD A
Sbjct: 436 -------------DRADWLSAAESAYEAVRAALGSEDG-RIAHAWRLGRITAAGLLDDQA 481
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+I L LYE ++L A+ L + F G Y D L R D
Sbjct: 482 SMIRAALSLYEATGQERYLSDAVTLAQSARSFFSSETGAFYTTAHDADDVPLTRPCTASD 541
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A PSGN + L RL + + + + A + F R + +A + P + AAD+
Sbjct: 542 NAVPSGNGMMADALARLYHLTGEQR---WYEAASGLIRAFTGRPQSLA-SSPYLLMAADL 597
Query: 616 LSVPSRKHVVLV-GHKSSVDFENM----LAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
L +R +V + G ++M LA S + + +H P
Sbjct: 598 L---TRGTLVSIHGQADDPHLQSMVREVLALGDPSVLVCRKPLHAAPDR----------- 643
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ + A LVC+ CS P+T
Sbjct: 644 -------QTDHVAQTFFVLVCRQTLCSAPLT 667
>gi|408529633|emb|CCK27807.1| hypothetical protein BN159_3428 [Streptomyces davawensis JCM 4913]
Length = 682
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 230/701 (32%), Positives = 334/701 (47%), Gaps = 86/701 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFEDE A LN+ FV++KVDREERPDVD VYM VQA G GGWP++V
Sbjct: 55 SCHWCHVMAHESFEDEATAAYLNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTV 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GTYFPP ++G P F+ +L V+ AW +RD +A+ + L+E
Sbjct: 115 FLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVQQAWTGRRDEVAEVAGKIVRDLAEREI 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ S +E AL L++ YD++ GGFG APKFP + I+ +L H + T
Sbjct: 175 SYGDSQAPGEEELAGALL----GLTREYDAQRGGFGGAPKFPPSMVIEFLLRHHAR---T 227
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L V
Sbjct: 228 GSEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRV 283
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y + T + + D++ R++ G SA DADS +G + EGA+YVW
Sbjct: 284 YAHLWRSTGSELARRVALETADFMVRELRTNEGGFASALDADS--DDGTGKHVEGAYYVW 341
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T ++ ++LG+ A +++ + G + AS L
Sbjct: 342 TPQQFREVLGDDAERAAQYFGVTEEGTFE------------------------EGASVLQ 377
Query: 381 MPLEKYLNI---LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+P + L + + R +L R++RP P DDKV+ +WNGL I++ A
Sbjct: 378 LPQHEGLFVAEKVASVRERLLAARAERPAPGRDDKVVAAWNGLAIAALAETGAYF----- 432
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
DR + +E A AA + R DE + S G L+DYA +
Sbjct: 433 -----------DRPDLVEAAVCAADLLVRLHLDEHVQIARTSKDGQVGANAGVLEDYADV 481
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L L WL +A L + F+D G ++T + ++ R ++ D A
Sbjct: 482 AEGFLALASVTGEGVWLEFAGFLLDHVLARFVDERSGALYDTAVDAERLIRRPQDPTDNA 541
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCA 612
PSG + + L+ S A + ++ +R AE +L V +K + VP + A
Sbjct: 542 APSGWTAAAGALL---SYAAQTGAEPHRAAAERALGV----VKALGPRVPRFIGWGLAAA 594
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN---KTVIHIDPADTEEMDFWEEH 669
L P K V +VG ++D + A H + L V+ D++E+
Sbjct: 595 EAWLDGP--KEVAVVG--PALD-DPATRALHRTALLGIAPGAVVAAGTPDSDELPL---- 645
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC+NF+C P TDP L L
Sbjct: 646 ------LAGRPLVGGEPAAYVCRNFTCDAPTTDPERLRAAL 680
>gi|409122619|ref|ZP_11222014.1| thioredoxin domain-containing protein [Gillisia sp. CBA3202]
Length = 620
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 197/560 (35%), Positives = 305/560 (54%), Gaps = 63/560 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFEDE VA+++N + +IKVDREERPDVD VYM+ VQ + G GGWP+++
Sbjct: 55 CHWCHVMEHESFEDEDVAEIMNTHYYNIKVDREERPDVDMVYMSAVQIMTGSGGWPMNIV 114
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS--EAL 139
PD +P+ GGTYF ED +K L ++ + + + L + E L + +
Sbjct: 115 ALPDGRPVWGGTYFRKED------WKNSLLQIAKLYKENPEKLYEYADKLNEGLKNIQLI 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM-----LYHS 194
++S S N + L L +E+L K++D ++GG PKF P + + LY+
Sbjct: 169 ASSKSENDID-------LNLISEKLEKNFDWQYGGTKQTPKFVIPSNFEFLLKYSQLYNH 221
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
K ++D V +L ++ GGI+DH+ GGF RYSVDE+WH+PHFEKMLYD
Sbjct: 222 KNIKD------------FVKLSLTKISFGGIYDHIEGGFSRYSVDEKWHIPHFEKMLYDN 269
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ ++Y A+++TK +Y + L+++ ++ G +S+ DADS + G R E
Sbjct: 270 AQMVSLYSKAYAVTKIGWYREVVEQTLEFIENNLKTKEGSFYSSLDADSIDKNGKLR--E 327
Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
GAFY W E++++L + LFKE+Y + G + NE+ VLI D ++
Sbjct: 328 GAFYTWEVDELKELLKDEFSLFKEYYNVNSYGKWE-------DNEY----VLIRTEDEAS 376
Query: 375 SASKLGMPLEKYLNILGECRRKL-FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
+K + ++ I L + R+KR +P LDDK + SWN L++S + A KI
Sbjct: 377 FLNKNQLDSMEFKAIKAHWLEVLSSEERNKREKPRLDDKQLTSWNALMLSGYVDAYKI-- 434
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+ K+Y+ A A+FI+ HLY + + L SF+NG S G+L+D
Sbjct: 435 --------------TQNKDYLATALQNATFIQEHLYKSEGN-LHRSFKNGISSINGYLED 479
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAF I + LYE +WL ++ +L + ++F + E G ++ T+ +D ++ R E
Sbjct: 480 YAFTIEAFIKLYEITLDFEWLHFSKKLMDYSIQIFYEPETGLFYFTSKQDKPLITRNYEL 539
Query: 554 HDGAEPSGNSVSVINLVRLA 573
D P+ NSV NL +L+
Sbjct: 540 SDNVIPASNSVMAQNLFKLS 559
>gi|29829838|ref|NP_824472.1| hypothetical protein SAV_3296 [Streptomyces avermitilis MA-4680]
gi|29606947|dbj|BAC71007.1| hypothetical protein SAV_3296 [Streptomyces avermitilis MA-4680]
Length = 675
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 235/700 (33%), Positives = 339/700 (48%), Gaps = 82/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE A LN+ FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 47 SSCHWCHVMAHESFEDETTAAYLNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPPE ++G P F+ +L V+ AW +RD +A+ + L+
Sbjct: 107 VFLTPDAEPFYFGTYFPPEPRHGMPSFRQVLEGVRSAWTDRRDEVAEVAGKIVRDLAGRE 166
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+S SS +EL Q L L++ YD+R GGFG APKFP + ++ +L H +
Sbjct: 167 ISYGDSSTPGEEELAQALL-----GLTRDYDARRGGFGGAPKFPPSMVVEFLLRHHAR-- 219
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 220 -TGSEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 274
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS +G+ R EGA+Y
Sbjct: 275 RVYAHLWRATGSELARRVALETADFMVRELRTGEGGFASALDADS--DDGSGRHVEGAYY 332
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
VWT +++E LG E A L + + G + +G +VL + D A
Sbjct: 333 VWTPEQLEQALGREDAELAARCFGVTRDGTFE-----------EGASVLQLPQQDVVFDA 381
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ + R +L R++RP P DDKV+ +WNGL I++ A
Sbjct: 382 ER-----------IASVRARLLGRRAERPAPGRDDKVVAAWNGLAIAALAETGAYF---- 426
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
DR + +E A AA + R DE RL + ++G + A G L+DY
Sbjct: 427 ------------DRPDLVEAAIGAADLLVRLHLDEHA-RLARTSKDGRAGAHAGVLEDYG 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + F D E G ++T + ++ R ++ D
Sbjct: 474 DVAEGFLALASVTGEGVWLEFAGFLLDHVLAQFTDPESGALYDTAADAEKLIRRPQDPTD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A PSG S + L+ S A + ++ +R AE +L V +K + P +
Sbjct: 534 NATPSGWSAAAGALL---SYAAHTGAEPHRTAAERALGV----VKALGPRAPRFVGWGLA 586
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A +L P + V +VG A+ L++T + + A + +
Sbjct: 587 VAEALLDGP--REVSVVGPADD----------PATGTLHRTAL-LGTAPGAVVAVGTPGS 633
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC+NF+C P+TD L L
Sbjct: 634 DEFPLLADRPLVGGGPAAYVCRNFTCDAPITDADRLRTAL 673
>gi|189424638|ref|YP_001951815.1| hypothetical protein Glov_1579 [Geobacter lovleyi SZ]
gi|189420897|gb|ACD95295.1| protein of unknown function DUF255 [Geobacter lovleyi SZ]
Length = 610
Score = 335 bits (860), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 209/560 (37%), Positives = 294/560 (52%), Gaps = 66/560 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFED+ VA +LN FV +KVDREERPD+D+ M Q+L GGWPL+
Sbjct: 72 TCHWCHVMAHESFEDDEVADILNHAFVPVKVDREERPDLDEFCMAACQSLTNSGGWPLNC 131
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL PD P TY P E K G PGF +L + W K++ + ++ +E L + ++
Sbjct: 132 FLKPDGTPFYALTYLPKEPKRGMPGFLELLENIARVWQHKQEAVERNARSLMEALGQ-MA 190
Query: 141 ASASSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A+ PD EL +A+ L K +D R+ GFG APKFP P + +L ++E
Sbjct: 191 AAPVQTTAPDLKELADSAV----ATLRKIHDPRYHGFGKAPKFPMPPYLLFLLGRDNRIE 246
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
Q++ L TLQ M +GGI D +GGG HRYS D+ W VPHFEKMLYDQ +A
Sbjct: 247 -----------QELALNTLQAMRQGGIWDQLGGGIHRYSTDQHWLVPHFEKMLYDQALVA 295
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
L A++LTK+ Y + ++L+++ ++ P G + DADS EG +EGA Y
Sbjct: 296 YTALKAYALTKENRYLEMADNLLEFVLAELTAPEGGFYCGLDADS---EG----REGACY 348
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VW +E+E ILG+ A F ++Y + GN E G+NVL + ++ +
Sbjct: 349 VWKKQELEQILGDQAAFFCQYYGVTEQGNF----------EEPGENVLFQALPAAEEPAA 398
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ +KL VR+ R +P D KV+ WNGL+I++ AR + +
Sbjct: 399 IKA-----------AGQKLLQVRAMRQQPLRDLKVLSGWNGLMIAALARGAAL------- 440
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
++ + ++E A AA+FI L RL S+ PS GFL+DYAFL
Sbjct: 441 ---------TNNRRWLEAARRAATFISSAL-TRADGRLLRSWCGTPSTIAGFLEDYAFLG 490
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGA 557
G L+L++ G L A +L +D L L R T G D L L + ++HDG
Sbjct: 491 WGYLELFKAGGDAADLATAEQL--CRDALHLFRTEDERLVTAGNDQEQLPLALSDNHDGV 548
Query: 558 EPSGNSVSVINLVRLASIVA 577
PSG + V+NLV LA A
Sbjct: 549 IPSGPAALVMNLVALAKCTA 568
>gi|75674298|ref|YP_316719.1| hypothetical protein Nwi_0099 [Nitrobacter winogradskyi Nb-255]
gi|74419168|gb|ABA03367.1| Protein of unknown function DUF255 [Nitrobacter winogradskyi
Nb-255]
Length = 676
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 228/691 (32%), Positives = 335/691 (48%), Gaps = 74/691 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED+ VA ++N+ FV IKVDREERPD+D++YM+ + L GGWPL++
Sbjct: 59 ACHWCHVMAHESFEDDDVAAVMNELFVCIKVDREERPDIDQIYMSALHHLGEQGGWPLTM 118
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FLSPD P GGTYFP +GRP F +L+ V + + D +A+ I +LSE
Sbjct: 119 FLSPDGSPFWGGTYFPKLPDFGRPAFTDVLQSVARVFRDQPDQIARHRDTLIARLSE--- 175
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
++ K P L L A + +S D GG APKFP+ ++++ + D
Sbjct: 176 --RATTKSPANLGVAELNNAAVAIMRSTDPVNGGLRGAPKFPQCSVLELLWRAGARTRDD 233
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+ TL M++GGI+DH+GGG+ RYSVD+RW VPHFEKMLYD Q+ ++
Sbjct: 234 RFFAATT-------LTLTRMSQGGIYDHIGGGYARYSVDDRWLVPHFEKMLYDNAQILDL 286
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
++ +K+ Y + +D+LRR+M+ G S+ DADS EG +EG FYVW
Sbjct: 287 LALDYARSKNPLYRERAIETVDWLRREMLTAEGGFASSLDADS---EG----EEGRFYVW 339
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+ E++D+LG Y T N + R + P N K +V ND SA L
Sbjct: 340 SLSEIDDVLGAADAADFAARY-DITANGNFERRNIP-NRLKSIDV---ANDDSAHMRAL- 393
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
R+KL R R RP LDDK++ WNGL+I++ + +
Sbjct: 394 -------------RKKLLVRRESRVRPGLDDKILADWNGLMIAALVHGACVF-------- 432
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
D+ +++ +A +A FIR + + RL HS+R G P DYA +
Sbjct: 433 --------DKPDWLRIARAAYDFIRTMM--TRDGRLGHSWREGRLLIPALASDYATMARA 482
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L L+E +L A+ Q+T D + D GGY+ T + +++R D A P+
Sbjct: 483 ALALFEATGDGTFLEQALRWQSTLDTHYADAAHGGYYLTADDAEGLIVRPHSSEDDAIPN 542
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
+ V NLVRLA++ +K +R + A R + + A D+ +
Sbjct: 543 HDGVIAQNLVRLAALTGDAK---WRDRIDSHFAALLPRATEKGFGQLSLMNALDLRLTGA 599
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
++V + + + AA Y V+H AD D AR
Sbjct: 600 E---IVVAGEDAQAAALLGAARKLPY-ATSIVLHAPHADALPADH----------PARAK 645
Query: 681 FSA-DKVVALVCQNFSCSPPVTDPISLENLL 710
SA + A +C+ SCS PVT P +L L+
Sbjct: 646 LSAVAQSAAFICRGQSCSLPVTQPDALNELM 676
>gi|344940058|ref|ZP_08779346.1| hypothetical protein Mettu_0287 [Methylobacter tundripaludum SV96]
gi|344261250|gb|EGW21521.1| hypothetical protein Mettu_0287 [Methylobacter tundripaludum SV96]
Length = 754
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 217/602 (36%), Positives = 315/602 (52%), Gaps = 61/602 (10%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F K + L +TC+WCHVME E FE+ +AKL+N+ VSIK+DRE+RPD
Sbjct: 36 GEEAFAKARKENKPILLSIGYSTCYWCHVMEREIFENPEIAKLMNESIVSIKIDREQRPD 95
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD +YMT Q + GGWP +VF++PDLKP GTYFPP F ++++++ W
Sbjct: 96 VDDLYMTATQMMTHSGGWPNNVFVTPDLKPFYAGTYFPP------AAFSSLIQQIHYIWM 149
Query: 119 KKRDML---AQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
+ + L A+ A AI ++ + +A S+ LP AL S YD+R GGF
Sbjct: 150 QDQVPLKAQAERLASAIIRIKQQ-ENNAQSSSLPGSRLVEAL---ISHFSDYYDNRLGGF 205
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
APKFP + + L + +L E + G TL+ MA+GGIHDHVGGGFHR
Sbjct: 206 YQAPKFPNE-DALLFLLEAYRLTSNNTCLEMARG------TLEKMAEGGIHDHVGGGFHR 258
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
Y+ D +W +PHFEKMLY+Q L Y + ++L+ + I D+ R M G
Sbjct: 259 YATDAQWRIPHFEKMLYNQALLGRAYTELYALSNKPDDRVVAEGIFDFTLRQMTHKDGGF 318
Query: 296 FSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRM 353
+SA DA+ T EGA+Y WT E++D L +A L K HY G ++ ++
Sbjct: 319 YSALDAE-------TDAVEGAYYAWTDAELQDALDTDSYAWLMK-HY-----GLAEIPKI 365
Query: 354 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 413
H G+ VL + S SA+ G+ E + L + R KR PHLD+K+
Sbjct: 366 PG-HKHVDGR-VLYLIQPLSESATAEGLSYEDAVKKQQAVMTSLRESRDKRKLPHLDNKI 423
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 473
I SWNGL+I +FARA ++ + EY E + AA FI +L +Q
Sbjct: 424 ITSWNGLMIDAFARAGLCMR----------------KLEYTEASRRAADFILANL-RKQD 466
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 533
L ++R+G ++ + +DYAF+I GL+ +Y ++L A EL +LF D +
Sbjct: 467 GSLYRTWRDGQAEISAYFEDYAFMIQGLVSIYRAAKDNRYLQAAKELAAKAKQLFWDEKH 526
Query: 534 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
GGY+ T G + +L+R+K D A PSGN+V L+ L I ++ ++Q AE L
Sbjct: 527 GGYYFTDGSE-LLLVRMKNAVDSAIPSGNAVMAQALLDLYEITGDAE---WKQQAEALLI 582
Query: 594 VF 595
F
Sbjct: 583 AF 584
>gi|339325405|ref|YP_004685098.1| hypothetical protein CNE_1c12630 [Cupriavidus necator N-1]
gi|338165562|gb|AEI76617.1| hypothetical protein CNE_1c12630 [Cupriavidus necator N-1]
Length = 666
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 238/701 (33%), Positives = 340/701 (48%), Gaps = 104/701 (14%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESFE+ +A L+N+ F+SIKVDR+ERPD+D +Y Q + GGGWPL+V
Sbjct: 49 TCHWCHVMAHESFENPRIAALMNERFISIKVDRQERPDLDDIYQKVPQLMGQGGGWPLTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P +P GGTYFPP+D+YGRPG +L + +AW +R L + IEQ +
Sbjct: 109 FLTPQGEPFYGGTYFPPDDRYGRPGLPRVLLSLSEAWRHRRQELRDT----IEQFQQGFR 164
Query: 141 A----------SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
+ + ++ D Q AL L+++ D GG G APKFP ++
Sbjct: 165 HLDEGVLSREDAEQAAEVQDLPAQTAL-----ALARNTDPTHGGLGGAPKFPNASAYDLV 219
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
L ++ + TL MA GGIHD +GGGF RYSVDERW VPHFEKM
Sbjct: 220 LRICQRTHEPALLDALER-------TLDGMAAGGIHDQLGGGFSRYSVDERWAVPHFEKM 272
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD GQL +Y +A+ LT + + + Y+ RDM P G + EDADS EG
Sbjct: 273 LYDNGQLVTLYANAYRLTGKQAWRRVFEGTIAYILRDMTHPDGGFHAGEDADS---EG-- 327
Query: 311 RKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EG FYVWT+ EV+ +LGE L Y + GN + G++VL
Sbjct: 328 --EEGRFYVWTAAEVKAVLGESEGALACRAYGVTEGGNFE-----------PGRSVL--- 371
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
A L PLE+ L R +L R++R RP DD ++ WNGL+I A
Sbjct: 372 ----HRAVTL-TPLEE--ARLEGWRERLLAARARRVRPGRDDNILAGWNGLMIQGLCAAY 424
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKA 487
+ + A ++ A AASF++ L D +R ++NG K
Sbjct: 425 QATGNPA----------------HLAAARRAASFVQDKLTMPDGGVYRY---WKNGTVKV 465
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSV 546
PGFL+DYAFL + L+DLYE ++L A EL L +DR G G + T + +
Sbjct: 466 PGFLEDYAFLANALIDLYESCFDRRYLDRAAELVT----LIIDRFRGDGLYFTPNDGEPL 521
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
+ R + +DGA PSG S SV +RL + + D YR AE +
Sbjct: 522 IHRPRGPYDGAWPSGISASVFAFLRLHEL---TGEDRYRDLAEQEFQRYRAAATAAPAGF 578
Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
+ AAD + ++L G K++ ++ + H +Y L V+
Sbjct: 579 VHLLAAADFAQRGAFG-IILAGDKAAA--AALVESVHRTY-LPARVLAF----------- 623
Query: 667 EEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISL 706
+ + + + D + A VC++ +C+ PVT +L
Sbjct: 624 ----AEDVPVGQGRLPVDGRPAAYVCRHRTCTAPVTSGQAL 660
>gi|427427562|ref|ZP_18917606.1| Thymidylate kinase [Caenispirillum salinarum AK4]
gi|425883488|gb|EKV32164.1| Thymidylate kinase [Caenispirillum salinarum AK4]
Length = 678
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 211/574 (36%), Positives = 290/574 (50%), Gaps = 64/574 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED A ++ND F++IKVDREERPDVD +YM+ +Q + GGWPL++
Sbjct: 51 ACHWCHVMAHESFEDAETAAVMNDLFINIKVDREERPDVDAIYMSALQLMGQRGGWPLTM 110
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GGTYFP + +GRPGFK +LR+V DA+ + + ++ + ++ L + L+
Sbjct: 111 FLTPDGEPFWGGTYFPKDSAFGRPGFKDVLRQVADAYHQSPEKVSNNTGALVDALRKGLN 170
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
SS P L + AE L+ D +GG APKFP + + T
Sbjct: 171 LPQSSEP-PAALALPVVDQLAESLAGHVDPEWGGLRGAPKFPVVFAFDALW---RSWHRT 226
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ E VL TL + +GGI+DH+GGGF RYS D +W VPHFEKMLYD QL ++
Sbjct: 227 GR----QELHDAVLLTLDRLCQGGIYDHLGGGFARYSTDAQWLVPHFEKMLYDNAQLIDL 282
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+ T+ + +D+L R+MI G S+ DAD TEG +EG FYVW
Sbjct: 283 MTSVWQETRSPLLQARVEETVDWLEREMIAENGAFASSLDAD---TEG----EEGRFYVW 335
Query: 321 TSKEVEDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL----IELNDSSA 374
T E++ +LG A LFK Y ++P GN ++GK VL ++ D A
Sbjct: 336 TKDEIDRVLGTDADAALFKRAYDVRPGGN------------WEGKTVLNRNFSDVGDEPA 383
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
+K L R L R KR P DDKV+ WNGL+I + ARA
Sbjct: 384 LETK-----------LYRARMLLLRERDKRVMPGRDDKVLADWNGLMIHALARA------ 426
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
A F P E++++A SA IR + RL HSFR G + LDDY
Sbjct: 427 ---GAAFGRP-------EWVDLARSAYDGIRDTM-SRPGDRLGHSFRKGRLQDVAMLDDY 475
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A + L L++ ++ A D + D GGYF T + ++LR K
Sbjct: 476 ANMARAALTLHQVTGVADFIDHASRWVAVLDAEYWDDAAGGYFLTAADATDLILRTKSAQ 535
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
D A PSGN + L L + + YR+ A
Sbjct: 536 DNATPSGNGTMAVVLATLWHLTGEER---YRRRA 566
>gi|367469960|ref|ZP_09469682.1| Thymidylate kinase [Patulibacter sp. I11]
gi|365814937|gb|EHN10113.1| Thymidylate kinase [Patulibacter sp. I11]
Length = 685
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 231/699 (33%), Positives = 327/699 (46%), Gaps = 71/699 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED A ++N FV +KVDREERPDVD + M VQA+ G GGWPL+
Sbjct: 48 SACHWCHVMAHESFEDPATASVMNAHFVCVKVDREERPDVDAICMEAVQAITGQGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P+ GGTYFPP+ + G P ++ +L V +AW ++ + + + ++LS A
Sbjct: 108 VFLTPEQQPIHGGTYFPPQPRQGMPSWRMVLDAVAEAWRERSGEIREQLSDVADRLSGAS 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + EL A+R L + YDS GGFG APKFP + +L +
Sbjct: 168 RLTPADAVPGPELLDAAVR----GLGERYDSVQGGFGGAPKFPPHPSLLFLLQRAADERP 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
SG A M TL+ MA GGI+D +GGGF RY+VD W VPHFEKMLYD LA
Sbjct: 224 GEDSGTAGRAAAMARHTLRSMASGGINDQIGGGFARYAVDGTWTVPHFEKMLYDNALLAR 283
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y++ F L D L +L ++ GP G SA DADS EG EG FYV
Sbjct: 284 AYVEGFRLWGDERLRETAERTLAFLADELRGPEGGFLSALDADS---EGV----EGRFYV 336
Query: 320 WTSKEVEDIL----GEHAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
WT ++V L E AI + EH + R P +E
Sbjct: 337 WTPEQVRAALSSADAEAAIAWLGVTEHGNFEDGATVLEDRGERPDDE------------- 383
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ R L RS+R RP DDK + WNGL I +FA AS +L
Sbjct: 384 ----------------TVARIRAGLLAARSQRIRPGTDDKRVAGWNGLAIHAFAEASAVL 427
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
E + V ++ + +RR D +T S G ++ L+
Sbjct: 428 GRE------DLLEVARRAAAFVRRDLTVDGRLRRTWSDRETAGADTSGHGGRARHAAVLE 481
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
D+ FL+ + L+E G + L WA EL +T F D E G +F T + ++L+R KE
Sbjct: 482 DHGFLLEAAVALFEAGGDPEDLAWARELADTILNRFADPERGAFFATADDAEALLVRRKE 541
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D PSG + + L+RLA++ ++ Y A+ L + T + + AV A
Sbjct: 542 LDDAPIPSGGASASRGLLRLAALTGEAR---YADAADGWLRLAATVAERIPQAVAYALLA 598
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
D P R+ V +VG ++ + + L V + +
Sbjct: 599 LDERHRPPRE-VAIVGPPAARAALVAVVRERSRPGLVLAV-------------GDGLDDR 644
Query: 673 NASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
++ R + D + A VC+ FSC PVT+P +L L
Sbjct: 645 GVALLRGRPTVDGQATAYVCERFSCRAPVTEPDALRAAL 683
>gi|92115739|ref|YP_575468.1| hypothetical protein Nham_0107 [Nitrobacter hamburgensis X14]
gi|91798633|gb|ABE61008.1| protein of unknown function DUF255 [Nitrobacter hamburgensis X14]
Length = 682
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 232/696 (33%), Positives = 338/696 (48%), Gaps = 74/696 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ VA ++N+ FV IKVDREERPD+D++YM + L GGWPL++F
Sbjct: 60 CHWCHVMAHESFEDDEVAAVMNELFVCIKVDREERPDIDQIYMNALHLLGEQGGWPLTMF 119
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
LSPD P GGTYFP +GRP F +L+ V + K + + + I +LSE
Sbjct: 120 LSPDGSPFWGGTYFPKLPDFGRPAFTDVLQSVARVFHDKPERVTLNRDAVIARLSERAKV 179
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ +N L L A +++S D GG APKFP+ ++ L G
Sbjct: 180 GSPAN-----LGVAELNTAAVSIARSTDPVNGGLHGAPKFPQCSVLEF-------LWRAG 227
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ TL M++GGI+DH+GGG+ RYSVD+RW VPHFEKMLYD Q+ ++
Sbjct: 228 ARTGSDRFYAATTLTLTQMSQGGIYDHLGGGYARYSVDDRWLVPHFEKMLYDNAQILDLL 287
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
++ +K+ Y + + +L R+M+ G S+ DADS EG KEG FYVW+
Sbjct: 288 ALDYARSKNPLYRERAIETVAWLLREMLTGEGGFASSLDADS---EG----KEGKFYVWS 340
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
E+E++LG A F Y + GN F+G+N+ L SS S G
Sbjct: 341 LSEIEEVLGATDAADFAARYDITANGN------------FEGRNIPNRLK-SSDLVSDDG 387
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ R KL R+ R RP LDDKV+ WNGL+I++ +
Sbjct: 388 AHMRT-------LRAKLLARRAGRVRPGLDDKVLADWNGLMIAALVHG---------ACA 431
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
F P +++E A +A FIR+ + + RL HS+R G P DYA ++
Sbjct: 432 FGLP-------DWLETARTAFEFIRKTM--TRGDRLGHSWREGRLLVPALACDYAAMVRA 482
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L L E T +L A+ Q T D + D E GGY+ T + +++R D A P+
Sbjct: 483 ALALSEATGDTAYLEQALRWQATLDTHYADVEHGGYYLTADDAEGLIVRPHSTIDDAIPN 542
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
N + NLVRLA++ SK +R + +R + + A D+ +
Sbjct: 543 YNGLIAQNLVRLAALTGDSK---WRDRIDALFGALLSRAAENGFGHLALLSALDLRLTGA 599
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
+V+VG + + A A V+H+ D EH + +
Sbjct: 600 --EIVVVGEGAQAEALLAAARALPHA--TSIVLHVSRGDALP----AEHPARAKAD---- 647
Query: 681 FSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
S A VC+N SCS PVT P +L +L++++ S+
Sbjct: 648 -SVQGAAAFVCRNQSCSLPVTTPQALVDLVMQRTSA 682
>gi|266619634|ref|ZP_06112569.1| dTMP kinase [Clostridium hathewayi DSM 13479]
gi|288868801|gb|EFD01100.1| dTMP kinase [Clostridium hathewayi DSM 13479]
Length = 622
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 234/685 (34%), Positives = 329/685 (48%), Gaps = 71/685 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE++ +A LLN +V +KVDREERPDVD VYM+ QA+ G GGWPL++ ++PD K
Sbjct: 1 MEQESFENDRIAALLNREYVCVKVDREERPDVDAVYMSVCQAMNGQGGWPLTIIMTPDCK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFPP +YGR G + +L V W R+ S L + S+
Sbjct: 61 PFFSGTYFPPYARYGRVGLEELLTAVAGQWKADRETFLDSAGQIEAHLKAQERITMSAEP 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
D + Q A R Q ++D + GGFG APKFP P + ++ + G +
Sbjct: 121 GVDAVHQ-AFR----QFLGNFDKKNGGFGGAPKFPTPHNLIFLM-------EYGVREKKR 168
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
E M TL M +GGI DH+GGGF RYS DE W VPHFEKMLYD L Y++AF L
Sbjct: 169 EALAMAETTLVQMYRGGIFDHIGGGFSRYSTDETWLVPHFEKMLYDNALLVMAYVEAFGL 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T Y + R IL Y+ ++ G + +DADS EG EG +YV+T +E+
Sbjct: 229 TGRNGYKRVARRILAYVEAELTDEKGGFYCGQDADS---EGL----EGKYYVFTPQEICR 281
Query: 328 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 387
ILG A T C +++ N F+GK++ L + + A +
Sbjct: 282 ILGPDA----------GTDFCSCYGITERGN-FEGKSIPNLLKNEAYEAV--------WE 322
Query: 388 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 447
N +KL+D R R R H DDK++VSWNG +I + A+A +L
Sbjct: 323 NHESPDLKKLYDYRITRTRLHRDDKILVSWNGWMICACAKAGAVL--------------- 367
Query: 448 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 507
D Y+++A A +FI +L + RL +R G S G LDDYA I LL+LY
Sbjct: 368 -DDTNYLDMAVRAETFIHENLV--RDGRLMVRYREGDSAGEGKLDDYACYILALLELYRV 424
Query: 508 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 567
T +L A + T + F DRE GG++ T + +++R KE +DGA PSGNS + +
Sbjct: 425 TFQTDYLTRAAQWAETMVQQFFDRERGGFWMTAEDGEPLIVRTKETYDGAVPSGNSAAAL 484
Query: 568 NLVRLASIVAGSK-SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 626
L +LA I +K D Q + E + A+ M + PSR+ V
Sbjct: 485 GLYQLARITGETKWQDVLNQQLHYLAGAMEGYPSGHSFALLTMM----NVLYPSRELVCT 540
Query: 627 VGHKSSVDFENMLA--AAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD 684
V S + ++LA A+ + + + + AD E E +
Sbjct: 541 VSPDESGEALSILARRLAYLAETVPGLTVVVKTADNE-----TELTKLAPYIGDYPLPEA 595
Query: 685 KVVALVCQNFSCSPPVTDPISLENL 709
+ +C C PPV SLE L
Sbjct: 596 GSLFYLCSGSRCMPPVK---SLEEL 617
>gi|332292243|ref|YP_004430852.1| N-acylglucosamine 2-epimerase [Krokinobacter sp. 4H-3-7-5]
gi|332170329|gb|AEE19584.1| N-acylglucosamine 2-epimerase [Krokinobacter sp. 4H-3-7-5]
Length = 679
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 220/684 (32%), Positives = 338/684 (49%), Gaps = 73/684 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVME ESFE+ VA+L+N F +IKVDREERPDVD VYM VQ + GGWPL+
Sbjct: 53 SCHWCHVMEHESFENTEVAQLMNAHFKNIKVDREERPDVDNVYMNAVQLMTSRGGWPLNA 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
PD +P+ GGTYFP E+ + + L ++ + + L + A +EQ + +
Sbjct: 113 IALPDGRPVWGGTYFPKEE------WTSALEQIAKLYQTAPEKLIEY-AEKLEQGMQEMD 165
Query: 141 ASASSNKLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A ++ PD E QNA+ Q S+ +D+R GG APKF P +L ++ +
Sbjct: 166 AIIPNDSSPDFKLETLQNAI----SQWSRQWDTRQGGLNRAPKFMMPNNYLFLLRYAHQN 221
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+D E + V TL+ +A GGI+DHVGGGF RYSVD +WHVPHFEKMLYD QL
Sbjct: 222 QD-------QEILEYVNTTLEQIAFGGINDHVGGGFARYSVDTKWHVPHFEKMLYDNAQL 274
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++Y A++ TK+ Y L ++ R+M G +SA DADS +G +EGA+
Sbjct: 275 VSLYALAYTKTKNPLYKQTVYQTLTFIAREMTTEDGAFYSAIDADSLTADGIL--EEGAY 332
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT KE++ ++G+ LFKE+Y + G + K VLI + +
Sbjct: 333 YVWTEKELQTLVGDDFDLFKEYYNINSYGKWE-----------KDNYVLIRQDTDQDFSK 381
Query: 378 KLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ + +E+ ++ + L R S + +P LDDK++ SWNGL+I + A + +A
Sbjct: 382 ECDISVEEIISKKNKWHEDLLRFRESNKEKPRLDDKILTSWNGLMIKGYVDAYRAFNEDA 441
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
++ A A+F+ +L E L +F+NG S G+L+DYA
Sbjct: 442 ----------------FLTAALKNATFLSTNLMREDG-GLNRTFKNGKSTINGYLEDYAA 484
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
++ + LYE + +WL A EL + + F + + +F + +DPS+ R E +D
Sbjct: 485 IVDAFIALYEVTADNQWLNKAKELTDYTFQHFQNPKNDLFFFKSNQDPSLASRNTEFYDN 544
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
PS NS+ N+ L+ + YR A+ L + ++ +
Sbjct: 545 VIPSSNSIMAKNIFTLSHYYGDNT---YRDTAKAMLHNIQPSIEQSPTSFSNWMDGMLNY 601
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
++P + +V+VG + + L SY + +I ++ F
Sbjct: 602 TMPFYE-LVIVGKDAEI-----LRKEFNSYYIPNKLIATSTIKSDHDIF----------- 644
Query: 677 ARNNFSADKVVALVCQNFSCSPPV 700
+ F DK VC N +C PV
Sbjct: 645 -KGRFHKDKTFIYVCVNNTCQLPV 667
>gi|313203107|ref|YP_004041764.1| hypothetical protein Palpr_0623 [Paludibacter propionicigenes WB4]
gi|312442423|gb|ADQ78779.1| hypothetical protein Palpr_0623 [Paludibacter propionicigenes WB4]
Length = 680
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 226/697 (32%), Positives = 338/697 (48%), Gaps = 102/697 (14%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME E FEDE VA+ +N+ FV+IKVDREERPD+D++YMT VQ L GGWPL+
Sbjct: 57 CHWCHVMERECFEDEEVARYMNEHFVAIKVDREERPDIDQIYMTAVQLLTERGGWPLNCV 116
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF------AIEQL 135
PD +P+ GGTYFP K W DML Q F E
Sbjct: 117 ALPDGRPIYGGTYFP-----------------KAQW---LDMLNQVSGFIQLHPDKTENQ 156
Query: 136 SEALSASASSNK------LPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 188
+ AL+ +N+ LP E N + D+ GG+G+APKFP P +Q
Sbjct: 157 ARALTEGVQNNEMIYRADLPGLEATVNDQEDIFYHIQAGIDTVNGGYGTAPKFPMPSSLQ 216
Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
+L H L SG ++ K + TL MA GGI+D +GGGF RY+ DE W +PHFE
Sbjct: 217 FLL-HFHHL-----SGN-NDALKALTTTLDRMAFGGIYDQIGGGFARYATDEAWKIPHFE 269
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KMLYD L +VY AF ++ Y + + L+++ ++ P G +S+ DADS EG
Sbjct: 270 KMLYDNALLVSVYASAFQYNRNPHYEKVLHETLEFVSSELTSPDGGFYSSLDADS---EG 326
Query: 309 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
EG FYVWT E++ ILG++A L +++ + GN + S +N+L
Sbjct: 327 V----EGKFYVWTFDELQTILGKNAGLIMDYFQVTAAGNWEES-----------QNILYR 371
Query: 369 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 428
+ A K + + + + R L VR+KR +P LDDK++ SWN L++ + A
Sbjct: 372 KGNDEEIARKHNLSTVELSESIAQARELLQTVRAKRQKPMLDDKILTSWNALMLKGYCDA 431
Query: 429 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
++ + + EY++ A A+FI R++ + L +++NG + P
Sbjct: 432 YRV----------------TAKAEYLQAALRNANFILRYM-KSADNGLFRNYKNGKASIP 474
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
FLDDYAF+I + LY+ +WLV A EL F D E G ++ T+ +P+++
Sbjct: 475 AFLDDYAFIIQAFISLYQNTFDEQWLVEASELTEYTVSHFYDPESGMFYYTSDTEPALIA 534
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R E D PS NS NL L +D Y +E L ++ A+ +
Sbjct: 535 RKMEISDNVIPSSNSEMGKNLFVLGHYF---YNDQYITMSEKML----NNVRQNALQGGI 587
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY---DLNKTVIHIDPADTEEMDF 665
D +L+G +S +E + ++ +LN +H + +
Sbjct: 588 YYANWD----------ILMGWFASAPYEVSVVGKNSDLLRKELNTHYLHNIILSGTKFE- 636
Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+N + + +SAD+ + VC+N C PV+D
Sbjct: 637 ------SNLPVLKGKWSADETLIYVCRNHVCQAPVSD 667
>gi|282899862|ref|ZP_06307823.1| protein of unknown function DUF255 [Cylindrospermopsis raciborskii
CS-505]
gi|281195132|gb|EFA70068.1| protein of unknown function DUF255 [Cylindrospermopsis raciborskii
CS-505]
Length = 689
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 232/708 (32%), Positives = 349/708 (49%), Gaps = 115/708 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDLAIAEYMNANFIPIKVDREERPDIDSIYMQSLQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
FLSP DL P GTYFP +YGRPGF +L+ ++ +D +++ Q A +E L
Sbjct: 108 AFLSPDDLVPFYAGTYFPVAPRYGRPGFLEVLQAIRHYYDHQKEDFRQRKASILEAL--- 164
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPK-----FPRPVEIQMMLYH 193
LS++ N D+ + L + +++ G PK FP Q++L
Sbjct: 165 LSSTVLQNHDLDQFAHSQFH---RFLKQGWETAIGVI--TPKQMGNSFPMIPYCQLVLQG 219
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
++ A++G +M +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD
Sbjct: 220 TR-----FNYPSANDGLQMATQRGLDLALGGIYDHVGGGFHRYTVDATWTVPHFEKMLYD 274
Query: 254 QGQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
GQ+ + +S ++ + + +L R+MI P G ++A+DADS
Sbjct: 275 NGQIVEYLANLWSAGVEEPAFKRAVAGTVSWLEREMISPTGYFYAAQDADSFNCSTDMEP 334
Query: 313 KEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+EGAFYVW+ +E++++L + +L KEH+ L GN F+GKNVL L
Sbjct: 335 EEGAFYVWSYRELQELLSDQELLEVKEHFSLSLEGN------------FEGKNVLQRL-- 380
Query: 372 SSASASKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHLDDKV 413
SA +L LE L L CR R + ++ R P D K+
Sbjct: 381 ---SAGELSSSLELILGRLFLCRYGQTAETLTIFPPARNNHEAKTNPWHGRIPPVTDTKM 437
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQ 472
IV+WN L+IS ARAS++ + + Y+++A A FI H + D +
Sbjct: 438 IVAWNSLMISGLARASEVFQ----------------QPSYLQLAVQATRFILDHQFVDGR 481
Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDR 531
HRL + +G +DYA I LLDL++ SG + WL AI LQ+ +E L
Sbjct: 482 FHRLNY---DGEPTVLAQSEDYALFIKALLDLHQADSGSSNWLEQAITLQDEFNEFLLSV 538
Query: 532 EGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
E GGYFNT+ ++ +++R + D A PS N V++ NL++L + + + YY AE
Sbjct: 539 ELGGYFNTSSDNSQDLIIRERNFVDNATPSANGVAIANLIKLCLL---TDNLYYLDLAES 595
Query: 591 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 650
+L F T ++ + P + A D ++ LV +SS+D +LA + +
Sbjct: 596 ALKAFSTIIEKSPQSCPSLLIAIDWY-----RNSTLV--RSSIDNIKILAGKYLPTTIFD 648
Query: 651 TVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSP 698
+ + P +T + LVCQ C P
Sbjct: 649 VISKL-PGNT--------------------------IGLVCQGLKCLP 669
>gi|85714094|ref|ZP_01045083.1| hypothetical protein NB311A_08058 [Nitrobacter sp. Nb-311A]
gi|85699220|gb|EAQ37088.1| hypothetical protein NB311A_08058 [Nitrobacter sp. Nb-311A]
Length = 714
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 224/691 (32%), Positives = 335/691 (48%), Gaps = 74/691 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE VA ++N+ FV IKVDREERPD+D++YM + L GGWPL++
Sbjct: 94 ACHWCHVMAHESFEDEDVAAVMNELFVCIKVDREERPDIDQIYMNALHHLGEQGGWPLTM 153
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL PD P GGTYFP +GRP F +L+ V + ++ D +A+ I +LSE
Sbjct: 154 FLFPDGSPFWGGTYFPKLPDFGRPAFTDVLQSVARVFREQPDKIARHRDALIARLSERAR 213
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A +N EL NA L A+ S D GG APKFP+ ++ + + D
Sbjct: 214 ADNPANIGLAEL-DNAAALIAQ----STDPVHGGLRGAPKFPQCSVLEFLWRAGARTHD- 267
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
V T+ M++GGI+DH+GGG+ RYSVD++W VPHFEKMLYD Q+ ++
Sbjct: 268 ------DHFFAAVTLTMTRMSQGGIYDHLGGGYARYSVDDKWLVPHFEKMLYDNAQILDL 321
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+ +K+ Y + +D+LRR+M+ P G S+ DADS EG +EG FY+W
Sbjct: 322 LALDHARSKNPLYRERATETVDWLRREMLTPAGGFASSLDADS---EG----EEGRFYIW 374
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+ KE+E++LG A F Y + GN F+G+N+ L ++
Sbjct: 375 SLKEIEEVLGTTDAADFAARYDITANGN------------FEGRNIPNRLRSIEVASDD- 421
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
++ L R KL R R RP LDDK++ WNGL+I++ A+ +
Sbjct: 422 ----SAHMRAL---REKLLARRESRVRPGLDDKILADWNGLMIAALVHAACVF------- 467
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
DR +++++A + F+R + + RL HS+R G P DYA +
Sbjct: 468 ---------DRPDWLQIARAVYDFVRTTM--TRDGRLGHSWREGRLLVPALASDYAAMGR 516
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
L L+E LV A+ Q+T D + D E GGY+ T + +++R D A P
Sbjct: 517 AALALFEATGDNDCLVQALRWQSTLDTHYADVEHGGYYLTAADAEGLIVRPHSSDDDATP 576
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
+ + + NLVRLA++ +K +R + + + A D+
Sbjct: 577 NHDGLIAQNLVRLAALTGDTK---WRARIDGLFTALLPSATEKGFGQLSLMNALDLRLTG 633
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
+ +V+VG + +L AA V+H A+ D + + + A
Sbjct: 634 A--EIVVVGEDAQAG--ALLNAARKLPHATSIVLHAPHAEALAADHPAQAKARSVRGA-- 687
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A VC+ CS PV+ P +L L+
Sbjct: 688 -------AAFVCRQQRCSLPVSIPKTLIELV 711
>gi|346977780|gb|EGY21232.1| spermatogenesis-associated protein [Verticillium dahliae VdLs.17]
Length = 801
Score = 333 bits (855), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 226/675 (33%), Positives = 342/675 (50%), Gaps = 91/675 (13%)
Query: 25 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 84
C + ++SF A LLN+ FV + VDREERPD+D +YM YVQA+ G GGWPL++FL+P
Sbjct: 70 CRLTAIDSFSHPECASLLNEAFVPVIVDREERPDLDTIYMNYVQAVNGAGGWPLNLFLTP 129
Query: 85 DLKPLMGGTYFP---------PEDKYGRPGFKTILRKVKDAWDKK--------RDMLAQS 127
+L+P+ GGTY+P PE++ G F IL+ ++ W ++ +++L++
Sbjct: 130 ELEPVFGGTYWPGPGAHTKTGPEEEEGV-DFLAILKNLRKVWQEQEPRCRQEAKEVLSKL 188
Query: 128 GAFAIE---------QLSE--------ALSASASSNKLP----------DELPQNALRLC 160
FA E Q+S+ A ASA S + P EL + L
Sbjct: 189 REFAAEGTLGTRSTVQMSKIGLTSSSTAPVASAVSTENPGAGKTAADVSSELDLDQLEEA 248
Query: 161 AEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTL 217
++ ++D +GGFG APKFP P ++ +L ++ ++D E + +M LFTL
Sbjct: 249 YSHIAGTFDPVYGGFGLAPKFPVPAKLSFLLRLPHYLHPVQDVVGPTECAHATEMALFTL 308
Query: 218 QCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT---KDVFY 273
+ + G+ DHVGG GF RYS+ W +PHFEK+ D L +YLDA+ ++ KD
Sbjct: 309 RKIRDSGLRDHVGGCGFARYSITPDWSIPHFEKLTSDNALLLGLYLDAWLISNGDKDGEL 368
Query: 274 SYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-E 331
+ ++ DY M PGG S+E ADS G T +EGAF++WT KE + ++G E
Sbjct: 369 YDVVVELADYFSSPPMRLPGGGFASSEAADSYYRRGDTDVREGAFHLWTRKEFDAVIGDE 428
Query: 332 H-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 390
H A + ++ + GN + + DP++EF +N+ L + S + G+ E+ ++
Sbjct: 429 HEATIAATYWNILEHGNVEPDQ--DPNDEFMNQNIPRVLKEQSEIGKQFGISGEEVARVI 486
Query: 391 GECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFAR--ASKILKSEAESAMFNFPVVG 447
+ KL R + R RP LDDK+I WNGLVIS+ AR A+ +K A+SA
Sbjct: 487 ASAKAKLKAHRGRERVRPELDDKIISGWNGLVISALARTGAALAVKDAAKSA-------- 538
Query: 448 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 507
+Y+ A +A F+R L+DE+ L FR F +DYA+ I GL+DLYE
Sbjct: 539 ----QYLGAAIQSAEFVRAQLWDEKEKTLYKVFRGTRGSTKAFAEDYAYFIEGLIDLYEA 594
Query: 508 GSGTKWLVWAIELQNTQDELFLDREG----------------GGYFNTTGEDPSVLLRVK 551
+ +A ELQ TQ +LF D G +F TT + +LR+K
Sbjct: 595 TGEENCIAFADELQQTQIKLFYDASAPTTSASPNPLPAHSSCGAFFATTEDAKHTILRLK 654
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
+ D A PS N+VSV NL RL +A ++ Y A +L FE + P +
Sbjct: 655 DGMDTAFPSNNAVSVSNLFRLGVALA---TETYTALARETLNAFEAEILQYPWLFPGLLS 711
Query: 612 AADMLSVPSRKHVVL 626
+ R ++V+
Sbjct: 712 GVVSSRLGGRTYIVV 726
>gi|452207570|ref|YP_007487692.1| YyaL family protein [Natronomonas moolapensis 8.8.11]
gi|452083670|emb|CCQ36982.1| YyaL family protein [Natronomonas moolapensis 8.8.11]
Length = 709
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 226/698 (32%), Positives = 333/698 (47%), Gaps = 68/698 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED +A+ LN+ FV IKVDREERPDVD +YM Q + G GGWPLSV+
Sbjct: 50 CHWCHVMADESFEDPEIAETLNEAFVPIKVDREERPDVDTLYMNVCQMVRGSGGWPLSVW 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK---KRDMLAQSGAFAIEQLSEA 138
L+P+ KP GTYFPPE P F ++L + D+W+ + + +Q+ +A E
Sbjct: 110 LTPEGKPFHVGTYFPPEATANMPSFGSVLGDIADSWNDPEGRSRLESQADQWASSTKGEL 169
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKL 197
S + P E L A + D GG+G KFP P I ++L +
Sbjct: 170 EGTPDRSGEAPGE---GFLDTAANAAVRGADREAGGWGQGQKFPHPGRIHLLLRAYDATD 226
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
DT + + L TL MA GG++DHVGGGFHRY VD W VPHFEKMLYD ++
Sbjct: 227 RDTYR--------DVALETLDAMASGGLYDHVGGGFHRYCVDREWTVPHFEKMLYDNAEI 278
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+L + LT + Y+ I + +L R++ P G +S DA+S ++ G+ ++EGAF
Sbjct: 279 PRAFLAGYRLTGEERYAEIASETFAFLERELTHPDGGFYSTLDAESEDSTGS--REEGAF 336
Query: 318 YVWTSKEVEDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
YVWT + V + + + A LF E Y + +GN + G VL E
Sbjct: 337 YVWTPETVREAVDDPTAAELFCERYGVTDSGNFE-----------NGTTVLTESTPIGEL 385
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ M + +L R +LF+ R RPRP D KV+ WNGL+IS+ A + L
Sbjct: 386 AADAVMDTDSVEALLETARSQLFEARESRPRPPRDGKVLAGWNGLMISALAEGALALN-- 443
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH-----RLQHSFRNGPSKAPG 489
Y ++AE+A F R L+ DE T RL F G G
Sbjct: 444 ---------------PTYADLAEAALEFCRDRLWEDEGTQDGDVGRLNRRFERGEVGISG 488
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIEL-QNTQDELFLDREGGGYFNTTGEDPSVLL 548
+L+DYA+L G DLY+ + L +A++L + + + + EG YF TG + ++
Sbjct: 489 YLEDYAYLGRGAFDLYQATGDVEHLQFALQLGRAIRASFYEESEGTLYFTPTGGE-ELIA 547
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R ++ D + PS V+V L L++ + D + L + L+ +
Sbjct: 548 RPQQLADSSTPSSTGVAVQLLAALSAFDPDAGFDAV---VDSVLETHASTLESNPITHTS 604
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+ AA SV S + V G + L+ + L + + P + W +
Sbjct: 605 LTLAAIDRSVGSPELTVAAGELPPA-WREALSGTY----LPGRTLSVRPPTESGLSAWLD 659
Query: 669 ----HNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
++ R+ + V C++F+CSPP D
Sbjct: 660 AIGLEDAPPIWAGRDAVDGRETV-YACRSFTCSPPTHD 696
>gi|389690661|ref|ZP_10179554.1| thioredoxin domain containing protein [Microvirga sp. WSM3557]
gi|388588904|gb|EIM29193.1| thioredoxin domain containing protein [Microvirga sp. WSM3557]
Length = 676
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 241/702 (34%), Positives = 355/702 (50%), Gaps = 87/702 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED VA ++N+ FV+IKVDREERPDVD VYM+ + L GGWPL++
Sbjct: 48 ACHWCHVMAHESFEDADVAAVMNELFVNIKVDREERPDVDHVYMSALHLLGEPGGWPLTM 107
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P GGTYFP E ++GRPGF +LR++ + + + + ++ + L+ +
Sbjct: 108 FLTPEGEPFWGGTYFPKEPRFGRPGFVGVLREISRLYRSEPERILKNRDAIKQHLARSDR 167
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ L D L +L++ D+ GG APKFP P ++ + ++
Sbjct: 168 GDGGTLGLVD------LDRLGARLAELIDTENGGLQGAPKFPNPPILECLYRYA------ 215
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G++G+ E ++ L TL+ MA GGIHDH+GGGF RYSVDERW VPHFEKMLYD QL +
Sbjct: 216 GRTGDG-EAKRRFLLTLERMALGGIHDHLGGGFARYSVDERWLVPHFEKMLYDNAQLLEL 274
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y A++ T + I+ +L R+M P G S+ DADS EG +EG FYVW
Sbjct: 275 YGLAYAETGRALFRDAAEGIVIWLGREMTTPEGGFASSLDADS---EG----EEGLFYVW 327
Query: 321 TSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+ E+ ++LGE A F + Y + GN F+G+N+ L A
Sbjct: 328 SLAEIREVLGEEDAAFFGQVYDITEEGN------------FEGRNIPNRLLSGVAP---- 371
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ +E+ L L R KL + RS R RP LDDKV+ WNGL+I++ RAS +L
Sbjct: 372 -LAIEERLAAL---RAKLLERRSARVRPGLDDKVLADWNGLMIAALVRASPLL------- 420
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
DR +++ +A+ A F+ + + RL HS+R G PGF D+A ++
Sbjct: 421 ---------DRPDWIALAQRAYRFVTEAM--TRDGRLGHSWRGGALIVPGFALDHAAMMR 469
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLD---REGGGYFNTTGEDPSVLLRVKEDHDG 556
L L+E + +L + Q +D L D + G T +++R + D
Sbjct: 470 AALALFEVTADQAYLR---DAQTWRDRLMSDYRIEDTGALAMTARNADPLVVRPQPTQDD 526
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-MCCAADM 615
A P+ N V LVRLA + ++ D + A L T+L +A + PL +
Sbjct: 527 AVPNANGVCAEALVRLAQL---TEMDGDLRQASEVL----TKLGGIARSSPLGHTSILNA 579
Query: 616 LSVPSRKHVVLV-GHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
L + R +LV G+ + FE L + + + EE+D + H +
Sbjct: 580 LDLHLRGLTILVTGNGADALFEAGLKIPYPIRSIRRL------KSDEELD--DNHPAKAL 631
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+ S ALVC CS PVTD L+ +LE S+
Sbjct: 632 AA-----SGAGPRALVCAGMRCSLPVTDADGLKAQVLEMSSA 668
>gi|344340301|ref|ZP_08771227.1| hypothetical protein ThimaDRAFT_2966 [Thiocapsa marina 5811]
gi|343799959|gb|EGV17907.1| hypothetical protein ThimaDRAFT_2966 [Thiocapsa marina 5811]
Length = 691
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 246/711 (34%), Positives = 362/711 (50%), Gaps = 91/711 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESFED G A+L+N FV+IKVDREERPD+DK+Y T Q L GGWPL
Sbjct: 58 SACHWCHVMAHESFEDPGTAELMNRLFVNIKVDREERPDLDKIYQTAHQLLAQRPGGWPL 117
Query: 79 SVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
+VFL PD KP GTYFP E ++G P FK +++ V+ A+ +++ AIE +E
Sbjct: 118 TVFLMPDDQKPFFAGTYFPREPRHGLPAFKQLMQGVERAYREQKT--------AIESQNE 169
Query: 138 ALSASAS------SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
+L A+ + S+ LP+ ++A+ +QL S+D GGFG APKFP P + ++L
Sbjct: 170 SLMAALAELEPHASDALPE---RSAIDAALQQLDTSFDPEHGGFGDAPKFPHPTNLDLLL 226
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
H+ TG ++ + ++TL+ M +GG+ D +GGGF+RYSVD W +PHFEKML
Sbjct: 227 RHATDAPQTGAPDRSALAK--AVWTLERMVRGGLTDQLGGGFYRYSVDALWMIPHFEKML 284
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD G L + DAF++T+D + D++ R+M P G +S+ DADS EG
Sbjct: 285 YDNGPLLALCCDAFAVTEDPVFRDAAVMTADWVLREMQSPEGGYWSSLDADS---EG--- 338
Query: 312 KKEGAFYVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EG FYVW +E+ +L E+A F Y L NC+ G+ L
Sbjct: 339 -EEGKFYVWDREEIRALLAPAEYAP-FAAVYRLDRPANCE------------GRWHLHGY 384
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
A A LG+ + +L R L+ R +R RP D+KV+ +WN L+I ARA+
Sbjct: 385 RTPEAVAVDLGLEPARVQALLAAARATLYVARERRVRPGRDEKVLTAWNALMIKGLARAA 444
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+ DR +Y+E AE A +FIR L+ E RL ++++G +
Sbjct: 445 RTF----------------DRPDYLESAEQALAFIRGTLWREG--RLLATYKDGTAHLNA 486
Query: 490 FLDDYAFLISGLLDLYEFGSGTKW----LVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
+LDDYA L+ LL+L + T+W L +A+ L + F D GGG++ T + +
Sbjct: 487 YLDDYANLLDALLELLQ----TRWSRADLDFALALAEVLLDQFEDPIGGGFWFTGRDHET 542
Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
++ R K D A PSGN V+ + L RL +V + Y AE +L + ++ M A
Sbjct: 543 LIHRTKPLGDEAIPSGNGVAALALERLGHLVGEPR---YLAAAERTLKLAAESIRRMPYA 599
Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
+ A D P V+ G + + A Y + V+ I PAD +
Sbjct: 600 HATLLFALDEWLDPPETLVIRAGDER---LDAWRREAQRGYRPRRFVLGI-PADESHL-- 653
Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
A+MA ++ C C PP SL +++ KP+S
Sbjct: 654 ----PGTLAAMA----PGERPRIYRCSGTRCEPPTE---SLADVV--KPTS 691
>gi|289548374|ref|YP_003473362.1| hypothetical protein Thal_0601 [Thermocrinis albus DSM 14484]
gi|289181991|gb|ADC89235.1| protein of unknown function DUF255 [Thermocrinis albus DSM 14484]
Length = 655
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 205/589 (34%), Positives = 314/589 (53%), Gaps = 56/589 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM E FE+ +A+++N+ FV+IKVDR+ERPD+D+ Y V +L G GGWPL+VF
Sbjct: 58 CHWCHVMAKECFENPEIAQIINENFVAIKVDRDERPDIDRRYQEVVVSLTGSGGWPLTVF 117
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD K GGTYFPPED++GRPGFK++L ++ W + RD + +S E L +
Sbjct: 118 LTPDGKAFFGGTYFPPEDRWGRPGFKSLLLRIAQLWKEDRDRVIRSAEHIFELLR---NY 174
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
S+SS+K D + + L L S D ++GG G+APKF +++LYH TG
Sbjct: 175 SSSSHK--DNVGEELLNRGIANLLASVDYQYGGIGTAPKFHHARAFELLLYHHFF---TG 229
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++ + V TL MA+GGI+DH+GGGF RYS D+RW VPHFEKML D +L VY
Sbjct: 230 QTLPV----EAVEITLDSMARGGIYDHLGGGFFRYSTDDRWIVPHFEKMLSDNAELLLVY 285
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
AF +TK Y Y+ IL+Y +R GG ++++DAD + + EG +Y ++
Sbjct: 286 SLAFQVTKKDLYRYVVEGILNYYQRFGFDEGGGFYASQDADIGDLD------EGGYYTFS 339
Query: 322 SKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+E+ IL E + Y+ + P G DP KNVL A+ G
Sbjct: 340 LEELRGILTEEELKVTSLYFDIHPKGEMH----HDP-----SKNVLFIAMSEEEVATATG 390
Query: 381 MPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+PLE+ +L RRK+ R S R +P +D + +WNGL++ + + K+
Sbjct: 391 IPLERVRQLLESARRKMLSYRESTRQQPFIDKTIYTNWNGLMLEALSTCYKV-------- 442
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
F P V S AE A + + ++ + +L H++ G +DY FL
Sbjct: 443 -FRIPWVLSS-------AEKTADRLMKEMWKDG--QLMHTY-----GVKGMAEDYIFLAR 487
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAE 558
GLL L+E ++L ++ L + + F D +G G+F+T +D +L +R+K D
Sbjct: 488 GLLSLFEVTQKREYLEASVMLAHEAIKKFWDPQGWGFFDTEEKDEGLLRIRLKTLQDTPT 547
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
S N + + L S+ ++ + + AE +L F ++++ + P
Sbjct: 548 QSVNGAAPYLYLVLGSVTPYTE---FLEYAEKNLQAFARMVREIPLISP 593
>gi|256005004|ref|ZP_05429976.1| protein of unknown function DUF255 [Clostridium thermocellum DSM
2360]
gi|255991073|gb|EEU01183.1| protein of unknown function DUF255 [Clostridium thermocellum DSM
2360]
Length = 482
Score = 333 bits (853), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 189/467 (40%), Positives = 263/467 (56%), Gaps = 59/467 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFEDE VA++LN FVSIKVDREERPD+D +YMT QAL G GGWPL+
Sbjct: 53 STCHWCHVMESESFEDEEVAEILNKNFVSIKVDREERPDIDSIYMTACQALTGHGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTYFP +D+ G PG +IL+ V + W ++D LA+ + + +SE++
Sbjct: 113 IIMTPDKKPFFAGTYFPKKDRMGMPGLISILKSVHNTWVNEKDSLAKYSSKVVSVISESI 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ DE+ ++ Q +D+ +GGFG+APKFP P + +L + K
Sbjct: 173 DDDYYYS--VDEITEDIFEDAFSQFKYDFDNIYGGFGNAPKFPMPHNLYFLLRYWHK--- 227
Query: 200 TGKSGEASEGQKMVLF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
A E +V+ TL M GGI+DH+G GF RYS DE+W VPHFEKMLYD L
Sbjct: 228 ------AKEEYALVMVEKTLDSMYSGGIYDHIGFGFCRYSTDEKWLVPHFEKMLYDNALL 281
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A YL+ + TK+ Y+ I ++I Y+ RDM P G +SAEDADS EG +EG F
Sbjct: 282 AIAYLETYQATKNKKYADIAKEIFTYVLRDMTSPEGGFYSAEDADS---EG----EEGKF 334
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W+ E++++LGE F ++Y + GN F+G N+ +N +
Sbjct: 335 YIWSPTEIKEVLGESDGEKFCKYYNITEEGN------------FEGLNIPNLINSTIPDE 382
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
K + L CR+KLFD R KR PH DDK++ +WNGL+I++ A ++L E
Sbjct: 383 DKEFVEL---------CRKKLFDHREKRVHPHKDDKILTAWNGLMIAALAIGGRVLGIE- 432
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
+Y AE A+ FI L RL +R+G
Sbjct: 433 ---------------KYTLAAEKASEFIFSKLV-RPDGRLLARYRDG 463
>gi|160935413|ref|ZP_02082795.1| hypothetical protein CLOBOL_00308 [Clostridium bolteae ATCC
BAA-613]
gi|158441771|gb|EDP19471.1| hypothetical protein CLOBOL_00308 [Clostridium bolteae ATCC
BAA-613]
Length = 642
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 206/557 (36%), Positives = 289/557 (51%), Gaps = 49/557 (8%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+E +A++LN +V +KVDREERPDVD VYM+ QA+ G GGWPL++ ++PD +
Sbjct: 1 MERESFENEVIAEILNREYVCVKVDREERPDVDSVYMSVCQAMNGQGGWPLTIIMTPDCR 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAFAIEQLSEALSASASSN 146
P GTYFPP +YGRPG + +L W KK +L Q+G Q+ + L + +
Sbjct: 61 PFFSGTYFPPRARYGRPGLEELLTAAAGQWKVKKEKLLDQAG-----QIEKYLKSQERTE 115
Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
+ E A+ QL+ +DS+ GGFGSAPKFP P + ++ + G +
Sbjct: 116 RQA-EPELGAVHQAFRQLADCFDSKNGGFGSAPKFPAPHNLIFLM-------EYGAREKR 167
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
E M TL M +GGI DH+GGGF RYS D +W VPHFEKMLYD L Y+ A+
Sbjct: 168 PEALAMAEKTLVQMYRGGIFDHIGGGFSRYSTDGQWLVPHFEKMLYDNSLLVMAYIKAYG 227
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
T Y + IL+Y+RR++ G + +DADS EG +YV+T +E+
Sbjct: 228 STGRKMYGCVAEKILEYVRRELTDSQGGFYCGQDADSDGV-------EGKYYVFTREEIR 280
Query: 327 DILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 385
++LGE A F Y + TG+ + S P N + N + + G
Sbjct: 281 EVLGEKAGRDFCRQYGI--TGHGNFEGRSIP-NLLENDNYEEICEEPWGNGDHGGNICHG 337
Query: 386 YLNILG-----ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ +G ECRR L+ R R R H DDK++VSWN +I + A A +L E
Sbjct: 338 SCDTIGGRENEECRR-LYQYRIDRARLHKDDKILVSWNSWMICACAMAGAVLGEE----- 391
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+Y+++A A +FI+ HL E RL +R+G + G LDDYA
Sbjct: 392 -----------QYVDMAVRADAFIKSHLVKE--GRLMVRYRDGDAAGEGKLDDYACYSLA 438
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LL+LY +L A E F DRE GG++ + +++R KE +DGA PS
Sbjct: 439 LLELYRVTFRVDYLKRAAAWAEIMTEQFFDRERGGFYLYAKDGEQLIVRTKETYDGAMPS 498
Query: 561 GNSVSVINLVRLASIVA 577
GNSV+ L RL I
Sbjct: 499 GNSVAAQVLYRLTRITG 515
>gi|402773173|ref|YP_006592710.1| thioredoxin domain-containing protein [Methylocystis sp. SC2]
gi|401775193|emb|CCJ08059.1| Thioredoxin domain protein [Methylocystis sp. SC2]
Length = 675
Score = 332 bits (851), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 216/697 (30%), Positives = 341/697 (48%), Gaps = 79/697 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE+ +A L+N+ F+++KVDREERPDVD +Y + + GGWPL++
Sbjct: 52 ACHWCHVMAHESFENPEIAALMNESFINVKVDREERPDVDYLYQQALMMMGQRGGWPLTM 111
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P GGTYFPP + GRPGF +L+ + + W + + + + + +LS L+
Sbjct: 112 FLTPEGQPFWGGTYFPPFAQGGRPGFAELLKTIAELWRARANAIEHN----VAELSAGLA 167
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + + P +CA QL++ D GGFG+APKFP+ + + K
Sbjct: 168 SLSETTPGEPVSPHLVESICA-QLAQRLDRVDGGFGAAPKFPQTTSLDFLWRAWK----- 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
++G S Q +VL TL +++GG++DH+GGGF RYS D RW VPHFEKMLYD QL +
Sbjct: 222 -RTGRDSLRQAVVL-TLDHISQGGVYDHLGGGFARYSTDNRWLVPHFEKMLYDNAQLIEL 279
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+ + + Y + ++++ R+M PGG S+ DADS EG +EG FY W
Sbjct: 280 LTEVWQDERRELYRLRVTETIEWMTREMRAPGGGFASSLDADS---EG----EEGKFYAW 332
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-----IELNDSSAS 375
+ E+ + LG A F+ Y + GN + GK+VL IEL D
Sbjct: 333 SQTEIREALGARAPFFERAYGVSREGNWE-----------HGKSVLNRLGSIELLDEETE 381
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A+ +L R++R RP DDKV+ WNGL I++ A+A+ +
Sbjct: 382 AALARDRAALFL------------ARARRVRPGCDDKVLADWNGLTIAAIAKAACVF--- 426
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+R++++++A +A F++ + ++ RL HS+R ++ LDDY
Sbjct: 427 -------------EREDWLDIAIAAFDFVKSAMTTDEG-RLLHSWRCARARHMAVLDDYG 472
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ L LYE +L A + + DR GGYF + +++ RVK D
Sbjct: 473 AMCRAALALYEAAGAPSYLECARRWVEHVEHHYRDRT-GGYFYAADDADTLIARVKIAED 531
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A PSGN + + L +L + S YR+ AE F +++ + + +M
Sbjct: 532 SALPSGNGMMLQALAQLYYLTGES---VYRERAEAIAQDFAGTIRERILGFSSLLNGMEM 588
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L +V++G + D + + + + I PA T H + +
Sbjct: 589 LR--EALQIVVIGENDAADTAALKRVIYGVSQPGRVLNVIAPAAT----LPRAHPAFGKT 642
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
M + A VC+ CS P+ +P +L L E
Sbjct: 643 ML-----GARATAYVCRGMVCSLPIIEPDALAAALRE 674
>gi|345850486|ref|ZP_08803482.1| hypothetical protein SZN_12143 [Streptomyces zinciresistens K42]
gi|345638083|gb|EGX59594.1| hypothetical protein SZN_12143 [Streptomyces zinciresistens K42]
Length = 637
Score = 332 bits (851), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 234/700 (33%), Positives = 332/700 (47%), Gaps = 81/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED+ A LN+ FVS+KVDREERPDVD VYM VQA G GGWP+S
Sbjct: 8 SACHWCHVMAHESFEDDDTAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMS 67
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VF++PD +P GTYFPP + G P F+ +L V+ AW +RD +A+ + L+
Sbjct: 68 VFMTPDGEPFYFGTYFPPAPRQGMPSFRQVLEGVRGAWTDRRDEVAEVAGKIVRDLAGRE 127
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+S EL Q L L++ YD + GGFG APKFP + I+ +L H +
Sbjct: 128 ISYGGPEAPGEQELSQALL-----GLTREYDPQRGGFGGAPKFPPSMVIEFLLRHHAR-- 180
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 181 -TGAEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 235
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS +G+ R EGA+Y
Sbjct: 236 RVYAHLWRATGSELARRVALETADFMVRELRTGEGGFASALDADS--DDGSGRHVEGAYY 293
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
VWT ++ ++LG E A L H+ + G + G +VL + D A
Sbjct: 294 VWTPAQLREVLGDEDAGLAARHFGVTEEGTFE-----------HGASVLQLPRQDEVFDA 342
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+++ R +L R+ RP P DDKV+ +WNGL +++ A
Sbjct: 343 ARIA-----------SVRERLLSHRAGRPAPGRDDKVVAAWNGLAVAALAETGAYF---- 387
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
DR + +E A AA + R +D+Q RL + R+G + A G L+DYA
Sbjct: 388 ------------DRPDLVEAALGAADLLVRLHFDDQA-RLTRTSRDGQAGANSGVLEDYA 434
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + F D E G ++T + ++ R ++ D
Sbjct: 435 DVAEGFLALASVTGEGVWLDFAGFLLDHVLTRFSDEESGALYDTAADAERLIRRPQDPTD 494
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A PSG S + L+ A+ A + +R AE +L V +T + VP +
Sbjct: 495 NAVPSGWSAAAGALLGYAAQTASAP---HRHAAERALGVVKT----LGPRVPRFIGWGLA 547
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A L P + V +VG + + L V+ D+ E +
Sbjct: 548 VAEARLDGP--REVAVVGPALTDEATRALHRTALLGTAPGAVVAAGTPDSGEFPLLADRT 605
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A A VC++F+C P TDP L L
Sbjct: 606 LRQGAPA----------AYVCRDFTCDAPTTDPERLRAAL 635
>gi|402494465|ref|ZP_10841206.1| thioredoxin domain-containing protein [Aquimarina agarilytica ZC1]
Length = 706
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 215/694 (30%), Positives = 342/694 (49%), Gaps = 69/694 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFED VA ++N +++IK+DREERPD+D+VYM+ VQ + G GGWPL+V
Sbjct: 80 CHWCHVMEHESFEDSTVAAVMNKNYINIKIDREERPDIDQVYMSAVQLMTGRGGWPLNVI 139
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTY+P + G L++++ ++ L + E +
Sbjct: 140 ALPDGRPVWGGTYYPKAEWMGA------LQQIQKIYEDDPSKLEEYATKLTEGIQSVSLV 193
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ + N L E + + E +K +D + GG APKF P +L ++ + +
Sbjct: 194 TPNPNALKFE--NSTIESAVETWAKKFDYKKGGLDYAPKFMMPNNYHFLLRYAHQTNN-- 249
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ + V+ TL ++ GG++DHVGGGF RY+ DE+WHVPHFEKMLYD QL ++Y
Sbjct: 250 -----EKLKDYVITTLNQISYGGVYDHVGGGFARYATDEKWHVPHFEKMLYDNAQLVSLY 304
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
DA+ LTK+ +Y + + LD+++R++ G +S+ DADS G + +EGAFYVW
Sbjct: 305 SDAYLLTKNEWYKQVVYETLDFVQRELTNAEGVFYSSLDADSVTHSG--KLEEGAFYVWQ 362
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+E LG E LF ++Y + G + HN + VLI + K
Sbjct: 363 KPALETALGVEDFKLFADYYNVNAYGIWE-------HNNY----VLIRNESDADFIEKHK 411
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ +L + +++L +RSKR RP LDDK + SWN L++ +A A +
Sbjct: 412 LDKGDFLQKQKKWKQRLLSIRSKRERPRLDDKTLTSWNALMLKGYADAYSVF-------- 463
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+ +++VA + A+FI+ +L H+++ G S G+L+DYA I
Sbjct: 464 --------NDANFLKVALTNAAFIKNKQM-ASNGQLMHNYKEGKSTINGYLEDYAATIDA 514
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
+ LY+ +WL + + + + F D G +F T+ ED +++ R E D P+
Sbjct: 515 FIALYQVTFDQQWLDLSKTMTDYVFDHFYDDASGLFFFTSDEDAALVTRNIESSDNVIPA 574
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV-FETRLKDMAMAVPLMCCAADMLSVP 619
NS+ NL +L+ + K + Q H++ V E + + LM +
Sbjct: 575 SNSMMAKNLYKLSHYFSNKKYLEHSQKMLHNIQVNIEEYPSGYSNWLDLMLNYTEDFY-- 632
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
VV+VG + E A Y NK + +E +N + +N
Sbjct: 633 ---EVVIVGAAA----EEKRVAIQKQYYPNKII----AGSAKE---------SNQPLLQN 672
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
FS +C N +C PVT+ + LL +K
Sbjct: 673 RFSEKDTHIFICVNNACKYPVTEVEAAFKLLNDK 706
>gi|288941778|ref|YP_003444018.1| hypothetical protein Alvin_2064 [Allochromatium vinosum DSM 180]
gi|288897150|gb|ADC62986.1| protein of unknown function DUF255 [Allochromatium vinosum DSM 180]
Length = 688
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 237/687 (34%), Positives = 339/687 (49%), Gaps = 67/687 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
+ CHWCHVM ESFED A+ +N FV+IKVDREERPD+DKVY T Q L GGWPL
Sbjct: 59 SACHWCHVMAHESFEDPATAERMNRLFVNIKVDREERPDLDKVYQTAHQLLSQRAGGWPL 118
Query: 79 SVFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
+VFL+PD P GTYFP E ++G P F +L V+ A+ ++ GA EQ
Sbjct: 119 TVFLTPDDHTPFFAGTYFPREPRHGLPSFTQLLVGVERAYREQ-------GAAIREQNRS 171
Query: 138 ALSASAS-SNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
L A A + ELP+ L A QL+ S+D+ GGFG APKFP +++++L
Sbjct: 172 LLEALAGLEPQGGAELPEAGLLEAAFHQLALSFDAEHGGFGRAPKFPHATDLELLLRRQA 231
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+L G + M FTL+ M +GG+ D +GGGF RYSVD+ W +PHFEKMLYD G
Sbjct: 232 RLAANGGDPD-PRPLHMAGFTLERMIRGGLTDQLGGGFCRYSVDDEWMIPHFEKMLYDNG 290
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
L + DAFS T + + D++ R+M P G +S DADS EG EG
Sbjct: 291 PLLALCCDAFSATGESIFRDAALATADWVMREMQSPEGGYYSTLDADS---EG----HEG 343
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVW V HA L Y L + + P N F+G+ L + +
Sbjct: 344 TFYVWDRDAV------HARLSAAEYPL----FAAVYGLDRPPN-FEGRWHLHGYRTPTQA 392
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A LG+ L + +L R LF R +R P D+K++ +WN L+I ARA+++L
Sbjct: 393 AESLGLNLPQAEALLASARATLFSAREQRVHPGRDEKILTAWNALMIKGMARAARVL--- 449
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
DR +Y+E AE A +FIR L+ + RL + ++G + +LDDYA
Sbjct: 450 -------------DRPDYLESAEQALAFIRSTLWHDG--RLLATCKDGVAHLNAYLDDYA 494
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
LI LL+L + + L +A+EL + F D E GG++ T ++ R K D
Sbjct: 495 NLIDALLELLQVRWSSADLAFAVELAEVLLDEFHDAERGGFWFTGRSHEPLIHRAKPLGD 554
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAAD 614
+ P+GN V+ + L RL ++ + Y + A+ +L + ++ M A L+ D
Sbjct: 555 DSMPAGNGVAALALQRLGHLIGEVR---YLEAADGTLRLAAESMRRMPHAHASLLMALDD 611
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
L P +LV + E A Y ++ V I P+ + + A
Sbjct: 612 WLDPPE----MLVIRAADDRLETWQRLAQQGYRPHRLVFAI-PSGIDAL------PGTLA 660
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVT 701
SM ++ + C+ C PPV
Sbjct: 661 SMR----GGERPLIYRCRGTHCEPPVA 683
>gi|386842157|ref|YP_006247215.1| hypothetical protein SHJG_6075 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374102458|gb|AEY91342.1| hypothetical protein SHJG_6075 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451795451|gb|AGF65500.1| hypothetical protein SHJGH_5837 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 677
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 235/703 (33%), Positives = 334/703 (47%), Gaps = 88/703 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED A LN+ FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDRATADYLNEHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-A 138
VFL+PD +P GTYFPP ++G P F+ +L V+ AW +RD +A + L++
Sbjct: 108 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVQQAWTTRRDEVADVAGKIVRDLAQRE 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ A+ EL Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 168 IVRQAAEAPGEQELAQALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGAEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + D +L R++ G SA DADS +G+ R EGA+Y
Sbjct: 276 RVYTHLWRATGSDLARRVALDTAQFLLRELRTAEGGFASALDADS--DDGSGRHVEGAYY 333
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
VW ++ + LG+ A L +++ + G + G++VL + + A
Sbjct: 334 VWRPDQLREALGDDAELAAQYFGVTDEGTFE-----------HGQSVLQLPQTEGVFEAE 382
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K + + +L R++RP P DDKV+ +WNGL I++ A
Sbjct: 383 K-----------IASVKDRLLAARARRPAPGRDDKVVAAWNGLAIAALAETGACF----- 426
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKAPGFLDDYA 495
DR + E A +AA + R DE R R GP+ G L+DYA
Sbjct: 427 -----------DRPDLTEAAVAAADLLVRVHLDEHGRLARTSKDGRVGPNA--GVLEDYA 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + F D E G ++T + ++ R ++ D
Sbjct: 474 DVAEGFLALASVTGEGVWLDFAGLLLDHVLARFTDTETGALYDTASDAEQLIRRPQDPTD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A PSG + + L+ S A + S+ +R AE +L V +T + VP +
Sbjct: 534 NAAPSGWTAAAGALL---SYAAHTGSEPHRAAAERALGVVKT----LGPRVPRFIGWGLA 586
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFWE 667
A +L P + V +VG + AA H + L+ V+ D+EE
Sbjct: 587 VAEALLDGP--REVAVVGPAPD---DERTAALHRTALLSTAPGAVVACGTPDSEEFPL-- 639
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC+ F C PVTDP +L L
Sbjct: 640 --------LADRTLVEGAPTAYVCRGFVCDLPVTDPDALRTKL 674
>gi|225871957|ref|YP_002753411.1| hypothetical protein ACP_0267 [Acidobacterium capsulatum ATCC
51196]
gi|225793798|gb|ACO33888.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length = 702
Score = 332 bits (850), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 218/694 (31%), Positives = 337/694 (48%), Gaps = 61/694 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM+ ES+E+ +A ++N+ F++IKVDR+ERPDVD Y VQA+ G GGWPL+
Sbjct: 53 CHWCHVMDRESYENPAIAAVINEHFIAIKVDRDERPDVDSRYQAAVQAMAGQGGWPLTAI 112
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+P+ KP GGTYFPPED+YGRPGF+ +LR + D W +R ++ + + S
Sbjct: 113 LTPEGKPFFGGTYFPPEDRYGRPGFERVLRSLADVWQNRRGEALETANSVLGAIEHGESF 172
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ S L + + + +Q +D+R+GGFGS PKFP P + M++ DT
Sbjct: 173 AGRSGTLSISIVEKLVSSAVQQ----FDARYGGFGSQPKFPHPSAMDMLI-------DTA 221
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++ TL+ MA GG++D + GGFHRYSVDE+W VPHFEKMLYD L + Y
Sbjct: 222 SRTGNERVREAATVTLRKMAAGGVYDQLAGGFHRYSVDEQWIVPHFEKMLYDNAGLLSNY 281
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+ AF + ++ + DI+ ++ + G ++++DAD +G ++ W
Sbjct: 282 VHAFQSFVEPEFAAVAVDIIRWMDECLSDRERGGFYASQDAD------INLDDDGDYFTW 335
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T E +L + Y+ D+ M D H+ + KNVL + A+ L
Sbjct: 336 TLAEARAVLSNEELAVAASYF-------DIGEMGDMHHNPQ-KNVLHSKRTLAEVAAALS 387
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ E+ L + KL R +RP P +D + SWN L IS++ +A+++L
Sbjct: 388 LSAEEAQKKLDSAKSKLLAARRERPTPFIDTTIYTSWNALAISAYLQAARVLDLPHAR-- 445
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-----GFLDDYA 495
F ++ DR I R + E T L H K+P G LDDYA
Sbjct: 446 -TFALLTLDR-------------ILREAWSE-TSGLSHVVAYADGKSPAAWVAGVLDDYA 490
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP---SVLLRVKE 552
FL L+ +E K+ A ++ + F D+ G +F+T + ++ R K
Sbjct: 491 FLTDACLEAWESTGDRKYYDAAAQIADAMIARFYDQTSGAFFDTEIQGSKLGALAARRKP 550
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D P+GN + L+RLAS+ + + + AE +L F ++ + A
Sbjct: 551 LQDTPTPAGNPAAASALLRLASLSGEKR---HAELAEDTLEAFAGVVEHFGLYAGTYGLA 607
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
+P + +++ G + A A A Y +NK+V+ D A E
Sbjct: 608 LLRFLLPPAQ-IIVAGDGPRA--RELAAMAVARYAVNKSVVQFDAAQLAV----ENLPPA 660
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
A + + VALVCQ SC PP+T+P +L
Sbjct: 661 LAETLPHLSGFTEPVALVCQGMSCQPPITEPQAL 694
>gi|336427724|ref|ZP_08607719.1| hypothetical protein HMPREF0994_03725 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336008885|gb|EGN38889.1| hypothetical protein HMPREF0994_03725 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 655
Score = 331 bits (849), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 198/560 (35%), Positives = 291/560 (51%), Gaps = 59/560 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+ +A LLN +V IKVDREERPD+D VYM+ QA+ G GGWPL++ ++PD +
Sbjct: 1 MERESFENAAIAGLLNREYVCIKVDREERPDIDSVYMSVCQAMTGQGGWPLTIIMTPDCR 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFPP +YG G + +L W +++ + S +E ++A +
Sbjct: 61 PFFAGTYFPPTARYGSVGLQELLTAAAAQWKLEKEKILDS--------AEQITAYVKEQE 112
Query: 148 LPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
P E ++ + L Q + ++D + GGFG APKFP P + +L + G
Sbjct: 113 QPTAAEPGKDMVHLAFRQFADNFDKKNGGFGGAPKFPTPHNLMFLL-------EYGIREN 165
Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
+ E M TL M +GGI DH+GGGF RYS D+RW VPHFEKMLYD LA YL+A+
Sbjct: 166 SREALDMAETTLTQMYRGGIFDHIGGGFSRYSTDDRWLVPHFEKMLYDNALLAIAYLEAY 225
Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
S T Y + + +L Y+ R++ G + +DADS EG +YV+T +E+
Sbjct: 226 SRTGRKLYECVAKKVLRYVERELTDAQGGFYCGQDADSDGV-------EGKYYVFTQEEI 278
Query: 326 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGK---NVLIELNDSSASASKLGM 381
ILG E F Y + GN F+GK N+L + + G
Sbjct: 279 RRILGKEEGEAFCVRYGITANGN------------FEGKSIPNLLGNKDYERICEEQCGC 326
Query: 382 PLEKYLNILG-ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+++ +G E +KL++ R +R H DDK++VSWNG +I ++A+A +
Sbjct: 327 DGGGHMDGIGREAFQKLYEYRIRRTPLHKDDKILVSWNGWMICAYAKAGAVFGD------ 380
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
K Y+++A A F+R++L + RL +R+G + G LDDY I
Sbjct: 381 ----------KRYVDMAVRAEGFVRQNLMKD--GRLLVRYRDGDAAGEGKLDDYTCYILA 428
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LL+LY+ T +L A E F D+E GG++ + + +R KE++DGA PS
Sbjct: 429 LLELYQVTFQTAYLEQAARCAEILLEQFFDQEKGGFYLYAEDGEQLFMRTKENYDGAMPS 488
Query: 561 GNSVSVINLVRLASIVAGSK 580
GNSV L +LA I +K
Sbjct: 489 GNSVGARVLHKLAQITGETK 508
>gi|329935309|ref|ZP_08285275.1| hypothetical protein SGM_6792 [Streptomyces griseoaurantiacus M045]
gi|329305132|gb|EGG48991.1| hypothetical protein SGM_6792 [Streptomyces griseoaurantiacus M045]
Length = 675
Score = 331 bits (849), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 235/700 (33%), Positives = 338/700 (48%), Gaps = 83/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE A LN+ FVS+KVDREERPDVD VYM VQA G GGWP+S
Sbjct: 48 SACHWCHVMAHESFEDEATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMS 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEA 138
VFL+P+ +P GTYFPPE ++G P F+ IL+ V AW ++R+ +A +G +
Sbjct: 108 VFLTPEAEPFYFGTYFPPEPRHGSPSFRQILQGVHQAWTERREEVADVAGKITRDLAGRE 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L+ + E+ Q L L++ YD+R GGFG APKFP + ++ +L H +
Sbjct: 168 LAHGGAQVPGEQEMAQALL-----GLTREYDARRGGFGGAPKFPPSMVLEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GG++D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGSEG----ALQMAADTCERMARGGLYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + +++ R++ G SA DADS +G R EGA+Y
Sbjct: 276 RVYAHLWRATGSDLARRVALETAEFMVRELGTAEGGFASALDADS--DDGTGRHVEGAYY 333
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
VWT +++ ++LGE A L ++ + G + G++VL + D A
Sbjct: 334 VWTPEQLAEVLGEDAGLAARYFGVTEEGTFE-----------HGQSVLQLPQTDGVFDAE 382
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ + R +L RS RP P DDKV+ +WNGL I++ A
Sbjct: 383 R-----------VASVRERLLGARSARPAPGRDDKVVAAWNGLAIAALAETGAYF----- 426
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
DR + ++ A AA + R DE RL + ++G + A G L+DYA
Sbjct: 427 -----------DRPDLVDAAVRAADLLVRLHLDEHG-RLTRTSKDGRAGAHAGVLEDYAD 474
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ G L L + WL +A L F E G F+T + ++ R ++ D
Sbjct: 475 VAEGFLALAQVTGEGVWLEFAGLLLGHVRTRFTGEE-GTLFDTASDAEKLIRRPQDPTDN 533
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMC 610
A PSG + + L+ S A + S+ +R AE +L V T R +AV
Sbjct: 534 ATPSGWTAAAGALL---SYAAHTGSEAHRTAAEQALGVVRTLGPRAPRFVGWGLAV---- 586
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A +L P + V +VG S+D + A L++T + + A + E +
Sbjct: 587 -AEALLDGP--REVAVVG--PSLDDPDTSA-------LHRTAL-LGTAPGAVVAAGAEGS 633
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC+NF C P +D L L
Sbjct: 634 EEFPLLADRPLRRGAPAAYVCRNFVCEAPTSDAEELRAAL 673
>gi|302553816|ref|ZP_07306158.1| spermatogenesis-associated protein 20 [Streptomyces
viridochromogenes DSM 40736]
gi|302471434|gb|EFL34527.1| spermatogenesis-associated protein 20 [Streptomyces
viridochromogenes DSM 40736]
Length = 677
Score = 331 bits (849), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 235/701 (33%), Positives = 343/701 (48%), Gaps = 83/701 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A+ LN+ +VS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDQQTAEYLNEHYVSVKVDREERPDVDAVYMEAVQAATGHGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P GTYFPP + G P F+ +L V+ AWD++RD + + + L+
Sbjct: 108 VFLTPEAEPFYFGTYFPPAPRQGMPSFRQVLEGVRQAWDERRDEVTEVAGKIVRDLA-GR 166
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S ++ P EL Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 167 EISYGDDQAPGEQELAQALL-----ALTREYDPQRGGFGGAPKFPPSMALEFLLRHHAR- 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 --TGAEG----ALQMARDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALL 274
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY + T + + D++ R++ G SA DADS +G + EGA+
Sbjct: 275 CRVYAHLWRATGSELARRVALETADFMVRELRTTEGGFASALDADS--DDGTGKHVEGAY 332
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSAS 375
YVWT ++ ++LGE A L +++ + G + G++VL + DS
Sbjct: 333 YVWTPGQLREVLGEQDAELAAQYFGVTEEGTFE-----------HGQSVLQLPQQDSLFD 381
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A K + R +L R++RP P DDKV+ +WNGL I++ A
Sbjct: 382 AGK-----------IASVRERLLAKRAERPAPGRDDKVVAAWNGLAIAALAET------- 423
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDY 494
A F+ P +A +R HL DEQ RL + ++G + A G L+DY
Sbjct: 424 --GAYFDRP------DLVEAAVAAADLLVRLHL-DEQA-RLTRTSKDGHAGANAGVLEDY 473
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A + G L L WL +A L + F D E G F+T + ++ R ++
Sbjct: 474 ADVAEGFLALASVTGEGVWLQFAGFLLDHVLVRFTDAESGALFDTAADAERLIRRPQDPT 533
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC---C 611
D A PSG + + L+ S A + S+ +R A +L V +K + VP
Sbjct: 534 DNAAPSGWTAAAGALL---SYAAHTGSEPHRTAARKALGV----VKALGPRVPRFIGWGL 586
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY--DLNKTVIHIDPADTEEMDFWEEH 669
AA ++ + V +VG S+D E A H + V+ + +EE
Sbjct: 587 AAAEAALDGPREVAIVG--PSLDHEGTRALHHTALLGTAPGAVVAVGTPGSEEFPL---- 640
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC+NF+C P T+ L +L
Sbjct: 641 ------LADRPLVGGEPAAYVCRNFTCDVPTTEVDRLRAVL 675
>gi|332663431|ref|YP_004446219.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332332245|gb|AEE49346.1| protein of unknown function DUF255 [Haliscomenobacter hydrossis DSM
1100]
Length = 686
Score = 331 bits (848), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 223/693 (32%), Positives = 342/693 (49%), Gaps = 74/693 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFE+ VA ++N+ F++IKVDREERPDVD +YM + G GGWPL+
Sbjct: 47 STCHWCHVMERESFENADVAAIMNENFINIKVDREERPDVDHIYMEACVIMTGSGGWPLN 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
FL+PD +P + GTY+PP + RP + +L V D + +R + + + I + +
Sbjct: 107 CFLTPDGRPFLAGTYYPPLAAFNRPSWPQLLHHVTDVYRNRRKDVEEQASRLIGNIEQTN 166
Query: 140 SASASSN--KLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHS 194
S + N +L P N + L + L K++D + GGFG+APKFP + +Q +L YH
Sbjct: 167 SYFLAKNEAELSGINPFNPVVLHNVFQTLKKNFDLQDGGFGAAPKFPGSMALQFLLDYHH 226
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+GE E + +F+L M +GGI+D +GGGF RY+ D W VPHFEKMLYD
Sbjct: 227 -------FTGE-KEALEHTVFSLDRMIRGGIYDQLGGGFARYATDRAWLVPHFEKMLYDN 278
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
L + D + +T+ + + L ++ R+M G +SA DADS EG +E
Sbjct: 279 ALLVGLLSDTYKVTQQPIFRRAIEETLGWIEREMTSADGGFYSALDADS---EG----EE 331
Query: 315 GAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
G FYVW+++E+ + E A LF +Y ++P GN ++G N+L
Sbjct: 332 GKFYVWSAEEIAAVCPSVEDAALFSSYYGVEPLGN------------WEGHNILWCPLPL 379
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+A A + G E R +L VR +R RP LDDK+++SWN L+ S++A+A L
Sbjct: 380 AAFAVEAGQSPEALEARFAPIRTQLMAVRDERIRPGLDDKILLSWNALMASAYAKAYTAL 439
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR----NGPSKAP 488
+E Y A F+ ++ L H+++ ++
Sbjct: 440 GNET----------------YKVAALRNVDFLLEKFKRDEIGGLYHTYKKVKDQDQAQYA 483
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
FLDDYA+ I+ L+D+YE T++L A +L FLD ++ T+ + V+L
Sbjct: 484 AFLDDYAYFIAALIDVYEISLETRYLRQAADLTEYTLAHFLDDTRNLFYFTSKDQQDVVL 543
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R E +D A PSGNS V NL RL + + Y + A L + L+ +
Sbjct: 544 RKIELYDNALPSGNSSMVQNLQRLGLLWGKMQ---YIELAAAMLKEMLSGLERYPSSFAR 600
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
A + P + V +VG ++ E + +Y NK ++ AD
Sbjct: 601 WANALIYMVYPMHE-VAIVGPEA----EELSRELQKNYIPNKVLMGALEAD--------- 646
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ + + VCQN++C PV+
Sbjct: 647 ---DTFPLLAGRQTQGMTQIFVCQNYTCQLPVS 676
>gi|323693373|ref|ZP_08107588.1| hypothetical protein HMPREF9475_02451 [Clostridium symbiosum
WAL-14673]
gi|323502578|gb|EGB18425.1| hypothetical protein HMPREF9475_02451 [Clostridium symbiosum
WAL-14673]
Length = 639
Score = 331 bits (848), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 206/561 (36%), Positives = 290/561 (51%), Gaps = 57/561 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+ +A+LLN ++ +KVDREERPD+D VYM+ QA+ G GGWPL++ ++PD +
Sbjct: 1 MERESFENREIAQLLNREYICVKVDREERPDIDSVYMSVCQAMNGQGGWPLTIIMTPDGR 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFPP +YGR G +L W +KR+ L S L E + SS
Sbjct: 61 PFFSGTYFPPRARYGRIGLDGLLAAAAKQWKEKREKLLDSADQIEAFLKEQEQLTVSSEP 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
P E+ + A R Q + S+D + GGFG APKFP P + ++ + G +
Sbjct: 121 GP-EIVRQAYR----QFAGSFDKQNGGFGGAPKFPAPHNLMFLM-------EYGIREDRP 168
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
E M TL M +GGI DH+GGGF RYS DERW VPHFEKMLYD L Y+ A++L
Sbjct: 169 EALSMAETTLTQMYRGGIFDHIGGGFSRYSTDERWLVPHFEKMLYDNALLVMAYVKAYAL 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T Y +L Y+ ++ P G + +DADS EG +YV+T +E+ +
Sbjct: 229 TGRKLYGCAAEMVLKYIEAELTDPQGGFYCGQDADSDGV-------EGKYYVFTPEEINE 281
Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
ILG + F +Y + GN F+GK++ L + + + P +
Sbjct: 282 ILGTKQGKAFCRNYGITGPGN------------FEGKSIPNLLGNEAYESVCEERPGAEE 329
Query: 387 LNILGECRR-------KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ + RR KL+ R KR R H DDK++VSWNG +IS+ A+A +L
Sbjct: 330 EDGRSKSRREADEVYEKLYAYRLKRTRLHKDDKILVSWNGWMISACAKAGAVL------- 382
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
K+Y+++A A FIR L + RL +R+G + G LDDYA
Sbjct: 383 ---------GEKKYVDMAVRAEEFIRTALV--RNGRLLVRYRDGEAAGEGKLDDYACYSL 431
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LL+LY T +L A + E FLDRE GG+F + +++R KE +DGA P
Sbjct: 432 ALLELYRVTFRTDYLDRAAGWADKMVEQFLDRERGGFFLNAKDAERLIVRTKETYDGAMP 491
Query: 560 SGNSVSVINLVRLASIVAGSK 580
SGNS + L LA + +K
Sbjct: 492 SGNSAAARVLQHLAQLTGEAK 512
>gi|387790403|ref|YP_006255468.1| protein containing a thioredoxin domain [Solitalea canadensis DSM
3403]
gi|379653236|gb|AFD06292.1| protein containing a thioredoxin domain [Solitalea canadensis DSM
3403]
Length = 674
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 194/574 (33%), Positives = 295/574 (51%), Gaps = 73/574 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE VA ++N+ FV IKVDREERPD+D+VYM VQ + GGGGWPL+
Sbjct: 51 SACHWCHVMEHESFEDEQVASIMNEHFVCIKVDREERPDIDQVYMNAVQLMTGGGGWPLN 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYG-----RPGFKTILRKVKDAWDKKRDMLAQSGA--FAI 132
F PD +P GGTYF +D + F ++ ++ D+ + QS F
Sbjct: 111 CFCLPDQRPFYGGTYFRKQDWMRLLNDLQAFFVNKPKEAEEYADRLHKGIKQSDVVGFVA 170
Query: 133 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
EQ E N L+ + ++ +D GG+ APKFP P Q +L
Sbjct: 171 EQ---------------KEYSVNTLKEIVDPWTRYFDYSDGGYNRAPKFPLPNNFQFLLR 215
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
+++ +D + + TL MA GGI+D +GGGF RYSVD W VPHFEKMLY
Sbjct: 216 YARLAKDQASN-------VITRLTLDKMAYGGIYDQLGGGFARYSVDSVWLVPHFEKMLY 268
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D GQL ++Y +A+ + + Y + + L+++RR++ P G +SA DADS EG
Sbjct: 269 DNGQLVSLYAEAYQYSGSLLYKNVVAETLEFIRRELTSPEGGFYSALDADS---EGV--- 322
Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FY WT E++ IL + +F +Y + GN ++ N+L D
Sbjct: 323 -EGKFYCWTRDELKGILSDDEEIFSTYYNVTEEGN------------WEETNILHRKEDD 369
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
A+ G+ ++ I+ C+ KL VR R RP LDDK++ SWNG+++ + A ++
Sbjct: 370 KVIANAHGLSEDELTVIIDRCKAKLMKVREHRVRPGLDDKILTSWNGIMLKGYIDAYRVF 429
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
+ + EY++ A + ASF+ +L + + +++NG + FLD
Sbjct: 430 RVD----------------EYLQTALTNASFLLENL-KQADGSWKRNYKNGNATINAFLD 472
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DY + ++LY+ +WL A + + E F D++ G ++ T+ D ++ R E
Sbjct: 473 DYVLVAEAFIELYQATFDEQWLAEAKAIVDYCIEHFYDQQSGMFYYTSNTDEQLITRKFE 532
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
D PS NSV L+++ + YY+Q
Sbjct: 533 LMDSVIPSSNSVLARVLLKIGT--------YYQQ 558
>gi|373956291|ref|ZP_09616251.1| protein of unknown function DUF255 [Mucilaginibacter paludis DSM
18603]
gi|373892891|gb|EHQ28788.1| protein of unknown function DUF255 [Mucilaginibacter paludis DSM
18603]
Length = 718
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 201/560 (35%), Positives = 295/560 (52%), Gaps = 57/560 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE VA+++N+ FV IKVDREERPD+D++YM+ VQ + G GGWPL+
Sbjct: 92 SACHWCHVMENESFEDEQVAEIMNEHFVCIKVDREERPDIDQIYMSAVQLMTGRGGWPLN 151
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
PD +P+ GGTYF D + +L + + W++K D ++ +A+ +L+E +
Sbjct: 152 CVCLPDQRPIYGGTYFRKTD------WMALLFNLANFWEQKPD---EAKEYAV-KLTEGI 201
Query: 140 SASASSNKLPDELPQNA--LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ + +++ L + +SYD + GG APKFP P Q ++ ++ +
Sbjct: 202 HQYENIGFVNEQMENTPADLEAIVKPWKQSYDFKEGGLNRAPKFPMPNNWQFLMRYAYLM 261
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+D E +V TL+ MAKGGI+DH+GGGF RYSVD WHVPHFEKMLYD QL
Sbjct: 262 QD-------EETNVIVRLTLEKMAKGGIYDHIGGGFARYSVDGHWHVPHFEKMLYDNAQL 314
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+Y +AF+ D Y + + + +++R++ P +SA DADS EG EG F
Sbjct: 315 IGLYSEAFTWCGDELYKKVVAETIAFIQRELTSPENGFYSALDADS---EGV----EGKF 367
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
Y +T EVE ILG+ A LF +Y + GN E + N+ +D + A
Sbjct: 368 YTFTLAEVEAILGDDAGLFAIYYNVTNEGNW----------EEEHTNIFFRRDDDAVLAE 417
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
KLG+P + ++ + R ++ + R+KR P LD K++ SWN L++ A +
Sbjct: 418 KLGIPADALVDKIAGLRNQVLEARAKRVLPGLDYKILTSWNALMLKGLCDAYRAF----- 472
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE--QTHRLQHSFRNGPSK--APGFLDD 493
D Y+E+A A FI+ +L ++ Q R+ ++ G K A FLDD
Sbjct: 473 -----------DEPAYLELALKNAHFIKDNLINKNNQLSRV-YAKPTGDEKLDAIAFLDD 520
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA LI + LYE WL A L + F D G +F T ++ R E
Sbjct: 521 YALLIDAFIALYEVTFDEAWLHQAKALTEHTLDHFYDNATGMFFYTPDYGEQLIARKFEV 580
Query: 554 HDGAEPSGNSVSVINLVRLA 573
D PS NSV N +L+
Sbjct: 581 MDNVMPSSNSVMARNFKKLS 600
>gi|282897059|ref|ZP_06305061.1| Protein of unknown function DUF255 [Raphidiopsis brookii D9]
gi|281197711|gb|EFA72605.1| Protein of unknown function DUF255 [Raphidiopsis brookii D9]
Length = 657
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 228/717 (31%), Positives = 353/717 (49%), Gaps = 108/717 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 16 SSCHWCTVMEGEAFSDLAIAEYMNANFIPIKVDREERPDIDSIYMQSLQMMTGQGGWPLN 75
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
FLSP DL P GTYFP +YGRPGF +L+ ++ +D +++ Q A +E L
Sbjct: 76 AFLSPDDLVPFYAGTYFPVSPRYGRPGFLEVLQAIRHYYDHQKEDFRQRKASILESL--- 132
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
LS++ N + + +Q ++ FP Q++L ++
Sbjct: 133 LSSTVLQNHGSGQFAHSQFHRFLKQGWETAIGVITPRQMGNSFPMIPYCQLVLQGTRF-- 190
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
A++G +M +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 191 ---NYPSANDGLEMATQRGLDLALGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGQIV 247
Query: 259 NVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ +S +++ + + +L R+MI P G ++A+DADS +EGAF
Sbjct: 248 EYLANLWSAGVEELAFKRAVAGTVSWLEREMISPTGYFYAAQDADSFNYSTDMEPEEGAF 307
Query: 318 YVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW+ E++++L + +L KEH+ + GN F+GKNVL L SA
Sbjct: 308 YVWSYGELQELLSDQELLELKEHFSVSLEGN------------FEGKNVLQRL-----SA 350
Query: 377 SKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHLDDKVIVSWN 418
+LG LE L L R R ++ ++ R P D K+IV+WN
Sbjct: 351 GELGSSLELILGRLFLSRYGQTAETLTIFPPARNNYEAKTNPWHGRIPPVTDTKMIVAWN 410
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR-RHLYDEQTHRLQ 477
L+IS ARAS++ + + Y+++A A FI R + + HRL
Sbjct: 411 SLMISGLARASQVFQ----------------QPSYLKLAVKATRFILDRQFVNGRFHRLN 454
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDREGGGY 536
+ +G +DYA I LLDL++ SG + WL AI LQ+ +E L E GGY
Sbjct: 455 Y---DGEPTVLAQSEDYALFIKALLDLHQADSGSSSWLEQAIALQDEFNEFLLSVELGGY 511
Query: 537 FNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
FNT+ ++ +++R + D A PS N V++ NL++L+ + + + YY AE +L F
Sbjct: 512 FNTSSDNSQDLIIRERNFVDNATPSANGVAIANLIKLSLL---TDNLYYLDLAESALKAF 568
Query: 596 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
T ++ + P + A+D ++ LV +S++D +LA+ + + + +
Sbjct: 569 STMIEKSPQSCPSLLIASDWY-----RNSTLV--RSNIDNIKILASQYLPTTVFDVISKL 621
Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
P +T + LVCQ C P P+ + LL +
Sbjct: 622 -PTNT--------------------------IGLVCQGLKCLPA---PVDFDELLAQ 648
>gi|330465851|ref|YP_004403594.1| n-acylglucosamine 2-epimerase [Verrucosispora maris AB-18-032]
gi|328808822|gb|AEB42994.1| n-acylglucosamine 2-epimerase [Verrucosispora maris AB-18-032]
Length = 679
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 224/701 (31%), Positives = 340/701 (48%), Gaps = 78/701 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE+EGV +LLN+ FVSIKVDREERPDVD VYMT QA+ G GGWP++
Sbjct: 47 SACHWCHVMAHESFENEGVGRLLNEGFVSIKVDREERPDVDAVYMTATQAMTGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF +PD P GTYFP R F +L V AW ++RD + + GA +E + A
Sbjct: 107 VFATPDGTPFYCGTYFP------RQNFVRLLESVGTAWREQRDAVLRQGAAVVEAVGGAQ 160
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + L +L L A QL+ YD GGFG APKFP + + +L H ++
Sbjct: 161 AVGGPTAPLTADL----LDAAATQLAGEYDETNGGFGGAPKFPPHLNLLFLLRHHQR--- 213
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG + + +MV T + MA+GGIHD + GGF RYSVD W VPHFEKMLYD L
Sbjct: 214 TG----SPQSLEMVRHTCEAMARGGIHDQLAGGFARYSVDGHWTVPHFEKMLYDNALLLR 269
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + LT D + RDI +L ++ PG SA DAD+ EG T YV
Sbjct: 270 VYTQLWRLTGDALALRVARDIARFLADELHRPGQGFASALDADTEGVEGLT-------YV 322
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT ++ ++LG+ + DL +++ G +VL D + +
Sbjct: 323 WTPAQLVEVLGDEDGRWA----------ADLFAVTESGTFEHGTSVLKLARDVDDADPAV 372
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK------ 433
E++ +++ R+L R RP+P DDKV+ +WNGL +++ A ++++
Sbjct: 373 ---RERWQDVV----RRLLAARDTRPQPARDDKVVAAWNGLAVTALAEFVRLVETSGRIG 425
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLD 492
+E E+ + + +D + ++A R H+ D RL+ + R+G P G L+
Sbjct: 426 TEGEANLLEGVTIVADGA----MRDTAEYLARVHMVD---GRLRRASRDGRVGEPAGVLE 478
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DY + +++ +WL WA +L +T F GG +++T + ++ R +
Sbjct: 479 DYGCVAEAFCAMHQVTGEGRWLEWAGQLLDTALAHFA-APGGAFYDTADDAEQLVARPAD 537
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D A PSG S LV +++ + +YR+ AE +L+ + A
Sbjct: 538 PTDNATPSGRSAIAAALVAYSAL---TGQTHYREVAEAALSTVAPIVGRHARFTGYAATV 594
Query: 613 AD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
+ +LS P VV + ++AAAH ++ P +
Sbjct: 595 GEALLSGPYEIAVVTADPAG----DPLVAAAHRHAPPGAVIVAGQP-----------DQA 639
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+A + A VC+ F C PV ++E+L+ +
Sbjct: 640 GVPLLADRPLLDGESAAYVCRGFVCQRPVD---TVEDLVAQ 677
>gi|110635801|ref|YP_676009.1| hypothetical protein Meso_3473 [Chelativorans sp. BNC1]
gi|110286785|gb|ABG64844.1| protein of unknown function DUF255 [Chelativorans sp. BNC1]
Length = 676
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 223/611 (36%), Positives = 304/611 (49%), Gaps = 79/611 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM E FED VA+L+N FV+IKVDREERPD+D++YMT + A+ GGWPL++
Sbjct: 53 ACHWCHVMAHECFEDNEVAELMNSLFVNIKVDREERPDIDQIYMTALSAMGEQGGWPLTM 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ KP GGTYFP +YGRPGF +L+ V AW K D L +S + L+
Sbjct: 113 FLTPEAKPFWGGTYFPKRSRYGRPGFIDVLKAVHSAWQTKEDELLRSADTLSIHVRTHLA 172
Query: 141 A--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+SN++P LR AE++ +D + GG APKFP + ++ + LE
Sbjct: 173 PMQGTTSNEVP-------LRALAEKIRAVFDPQLGGLRGAPKFPNAPFLDLLWLN--WLE 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ +S + VL TL+ M GGI+DHVGGG RYSVD +W VPHFEKMLYD QL
Sbjct: 224 NGAESD-----RDTVLLTLRSMLAGGIYDHVGGGLARYSVDAQWLVPHFEKMLYDNAQLI 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+ A+ T D + D + +L R+M GG S+ DADS EG +EG FY
Sbjct: 279 RLCSYAYGGTHDRLFRVRIEDTVKWLLREMTVEGGGFASSLDADS---EG----EEGKFY 331
Query: 319 VWTSKEVEDILG--EHAILFKEHYYLKPT---GNCDLSRMSDPHNEFKGKNVLIELNDSS 373
+WT E+ED+LG + L + P GN L R P L+DSS
Sbjct: 332 LWTRAEIEDVLGVGDARELLAIYDLANPEEWEGNPILHRRRHPE----------VLDDSS 381
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
E+ L L + +L R R RP DDKV+V WNGL I++ A A +
Sbjct: 382 ----------EQRLRTLLD---RLMAAREARTRPGRDDKVLVDWNGLAIAAIAVAGRQFA 428
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
R E++E A A F+ L + RL HS R P D
Sbjct: 429 ----------------RPEWIEAAARAFRFV---LESMEEGRLPHSIRGEKRLFPALSSD 469
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA +IS + LY ++ A + + D +LD G GYF T + +R++ D
Sbjct: 470 YAAMISAAIALYGATHDDSYVDQARQWLDKLDAWYLDDAGSGYFLTASDSADTPMRIRGD 529
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE---TRLKDMAMAVPLMC 610
D PS + V LV LA+ V+GS Y +H + V E R ++ A +
Sbjct: 530 MDDPIPSATAQIVTALVHLAA-VSGSHELY-----QHGVRVSEAALARAQNQAYGQLGII 583
Query: 611 CAADMLSVPSR 621
CAA + P +
Sbjct: 584 CAAALAQRPMK 594
>gi|323484029|ref|ZP_08089400.1| hypothetical protein HMPREF9474_01149 [Clostridium symbiosum
WAL-14163]
gi|323402646|gb|EGA94973.1| hypothetical protein HMPREF9474_01149 [Clostridium symbiosum
WAL-14163]
Length = 639
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 206/561 (36%), Positives = 289/561 (51%), Gaps = 57/561 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+ +A+LLN ++ +KVDREERPD+D VYM+ QA+ G GGWPL++ ++PD +
Sbjct: 1 MERESFENREIAQLLNREYICVKVDREERPDIDSVYMSVCQAMNGQGGWPLTIIMTPDGR 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFPP +YGR G +L W +KR+ L S L E + SS
Sbjct: 61 PFFSGTYFPPRARYGRIGLDGLLAAAAKQWKEKREKLLDSADQIEAFLKEQEQLTVSSEP 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
P E+ + A R Q + S+D + GGFG APKFP P + ++ + G +
Sbjct: 121 GP-EIVRQAYR----QFAGSFDKQNGGFGGAPKFPAPHNLMFLM-------EYGIREDRP 168
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
E M TL M +GGI DH+GGGF RYS DERW VPHFEKMLYD L Y+ A+ L
Sbjct: 169 EAVSMAETTLTQMYRGGIFDHIGGGFSRYSTDERWLVPHFEKMLYDNALLVMAYVKAYGL 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T Y +L Y+ ++ P G + +DADS EG +YV+T +E+ +
Sbjct: 229 TGRKLYGCAAEMVLKYIEAELTDPQGGFYCGQDADSDGV-------EGKYYVFTPEEINE 281
Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
ILG + F +Y + GN F+GK++ L + + + P +
Sbjct: 282 ILGTKQGKAFCRNYGITGPGN------------FEGKSIPNLLGNEAYESICEERPGAEE 329
Query: 387 LNILGECRR-------KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ + RR KL+ R KR R H DDK++VSWNG +IS+ A+A +L
Sbjct: 330 EDGRSKSRREADEVYEKLYAYRLKRTRLHKDDKILVSWNGWMISACAKAGAVL------- 382
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
K+Y+++A A FIR L + RL +R+G + G LDDYA
Sbjct: 383 ---------GEKKYVDMAVRAEEFIRTALV--RNGRLLVRYRDGEAAGEGKLDDYACYSL 431
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LL+LY T +L A + E FLDRE GG+F + +++R KE +DGA P
Sbjct: 432 ALLELYRVTFRTDYLDRAAGWADKMVEQFLDRERGGFFLNAKDAERLIVRTKETYDGAMP 491
Query: 560 SGNSVSVINLVRLASIVAGSK 580
SGNS + L LA + +K
Sbjct: 492 SGNSAAARVLQHLAQLTGEAK 512
>gi|149279373|ref|ZP_01885504.1| hypothetical protein PBAL39_13682 [Pedobacter sp. BAL39]
gi|149229899|gb|EDM35287.1| hypothetical protein PBAL39_13682 [Pedobacter sp. BAL39]
Length = 674
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 202/581 (34%), Positives = 292/581 (50%), Gaps = 52/581 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE+ VA ++N +V IKVDREERPD+D++YM +Q + G GGWPL+
Sbjct: 51 SACHWCHVMERESFENHEVAAVMNQHYVCIKVDREERPDIDQIYMLAIQLMTGSGGWPLN 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
PD +P+ GGTYF +D + +IL V W + D Q + + A
Sbjct: 111 CICLPDQRPVYGGTYFKKDD------WTSILENVAALWLHEPDKALQYADRLTDGIRNAE 164
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ K P LR + + D GG+ APKFP P Q +L +S D
Sbjct: 165 KIIPNEKKEPYNYTH--LREITDPWKRELDMTDGGYNRAPKFPMPNNWQFLLRYSLLTGD 222
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
L +L+ MA GGI+D +GGGF RYSVD RWHVPHFEKMLYD Q+
Sbjct: 223 NAT-------HVATLLSLEKMALGGIYDQIGGGFARYSVDGRWHVPHFEKMLYDNAQMIA 275
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y +A+ T+ ++ + + + ++ R+M P G ++A DADS EG EG FYV
Sbjct: 276 LYAEAYQYTQLPLFNSVVAETIGWMAREMRSPEGLFYAALDADS---EGV----EGKFYV 328
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W +E E + +L K +Y + +GN E + N+L+ A++
Sbjct: 329 WDEEEFEVVTQGDHLLMKAYYQVTSSGNW----------EEEETNILMRRFADEDFAAQQ 378
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ LE+ + R KL + RSKR P LDDK +++WN + I A + +
Sbjct: 379 GITLEELDLKVSAAREKLLEHRSKRVTPALDDKCLLAWNAMAIKGLASCASVF------- 431
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
R++Y E+A +AA FI + + EQ RL +F+NG + GFLDDYAF I
Sbjct: 432 ---------GRQDYYEMARTAADFILQPM-QEQDGRLYRNFKNGKATISGFLDDYAFFID 481
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
L+ LY++ +WL+ A + T F D + +F T S++ R E D P
Sbjct: 482 ALIALYQYDFDEQWLLEARKYAETVLGQFADPDSPMFFYTPSGAESLIARKHELMDNVIP 541
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 600
+ NSV NL L + D Y + A LA + ++K
Sbjct: 542 ASNSVMAQNLHLLGLLF---DDDSYTERASAMLAAIQPQIK 579
>gi|355621830|ref|ZP_09046381.1| hypothetical protein HMPREF1020_00460 [Clostridium sp. 7_3_54FAA]
gi|354823297|gb|EHF07630.1| hypothetical protein HMPREF1020_00460 [Clostridium sp. 7_3_54FAA]
Length = 639
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 206/561 (36%), Positives = 288/561 (51%), Gaps = 57/561 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+ +A+LLN ++ +KVDREERPD+D VYM+ QA+ G GGWPL++ ++PD +
Sbjct: 1 MERESFENREIAQLLNREYICVKVDREERPDIDSVYMSVCQAMNGQGGWPLTIIMTPDGR 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTYFPP +YGR G +L W +KR+ L S L E + SS
Sbjct: 61 PFFSGTYFPPRARYGRIGLDGLLAAAAKQWKEKREKLLDSADQIEAFLKEQEQLTVSSEP 120
Query: 148 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 207
P E+ A R Q + S+D + GGFG APKFP P + ++ + G +
Sbjct: 121 GP-EIVSQAYR----QFAGSFDKQNGGFGGAPKFPAPHNLMFLM-------EYGIREDRP 168
Query: 208 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
E M TL M +GGI DH+GGGF RYS DERW VPHFEKMLYD L Y+ A+ L
Sbjct: 169 EALSMAETTLTQMYRGGIFDHIGGGFSRYSTDERWLVPHFEKMLYDNALLVMAYVKAYGL 228
Query: 268 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
T Y +L Y+ ++ P G + +DADS EG +YV+T +E+ +
Sbjct: 229 TGRKLYGCAAEMVLKYIEAELTDPQGGFYCGQDADSDGV-------EGKYYVFTPEEINE 281
Query: 328 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
ILG + F +Y + GN F+GK++ L + + + P +
Sbjct: 282 ILGTKQGKAFCRNYGITGPGN------------FEGKSIPNLLGNEAYESVCEERPGAEE 329
Query: 387 LNILGECRR-------KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ + RR KL+ R KR R H DDK++VSWNG +IS+ A+A +L
Sbjct: 330 EDGRSKSRREADEVYEKLYAYRLKRTRLHKDDKILVSWNGWMISACAKAGAVL------- 382
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
K+Y+++A A FIR L + RL +R+G + G LDDYA
Sbjct: 383 ---------GEKKYVDMAVRAEEFIRTALV--RNGRLLVRYRDGEAAGEGKLDDYACYSL 431
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LL+LY T +L A + E FLDRE GG+F + +++R KE +DGA P
Sbjct: 432 ALLELYRVTFRTDYLDRAAGWADKMVEQFLDRERGGFFLNAKDAERLIVRTKETYDGAMP 491
Query: 560 SGNSVSVINLVRLASIVAGSK 580
SGNS + L LA + +K
Sbjct: 492 SGNSAAARVLQHLAQLTGEAK 512
>gi|440749562|ref|ZP_20928808.1| Thymidylate kinase [Mariniradius saccharolyticus AK6]
gi|436481848|gb|ELP37994.1| Thymidylate kinase [Mariniradius saccharolyticus AK6]
Length = 674
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 210/612 (34%), Positives = 311/612 (50%), Gaps = 59/612 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE A L+N FV IK+DREERPD+D +YM +QA+ GGWPL+
Sbjct: 47 SACHWCHVMERESFEDEETADLMNAHFVCIKIDREERPDLDNIYMEALQAMGVQGGWPLN 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL P+ KP GGTYFP + +K +L + +A+ L +S + +
Sbjct: 107 VFLMPNQKPFYGGTYFPNKQ------WKNLLGSIANAYKNHHGQLLESAEGFGRSIGRSE 160
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
L + + L ++L+ +D +GG PKFP P +L D
Sbjct: 161 LEKYGLKAAETGLEKADIELVLDKLTAQFDLEWGGMNRKPKFPMPAVWLFVL-------D 213
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G+ E + V FTL+ + GGI+DH+ GG+ RYSVD W PHFEKMLYD GQL +
Sbjct: 214 AALLGKDQELLEKVFFTLKKIGMGGIYDHLRGGWARYSVDGEWFAPHFEKMLYDNGQLLD 273
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y A+ ++ D F+ + +D++ +M+ G F+A+DADS EG EG FY
Sbjct: 274 LYAKAYQVSGDEFFKEKVLETVDWIEAEMLLSEGGFFAAQDADS---EGV----EGKFYT 326
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W +E+E ILGE FK+ Y LK GN + G N+L + + A+++
Sbjct: 327 WKYEELEAILGEDLSWFKKLYNLKYQGNWE-----------DGVNILFQTEPYADLAAEI 375
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ + Y L + + KL VR++R P LDDKV+ WNGL I+ A+
Sbjct: 376 GLSEKAYRERLQQIKTKLLTVRNRRIYPGLDDKVLSGWNGLAIAGLAQV----------- 424
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
F GS++ + +A+ F+ ++ Q L S+++G + P FL+DYA +I
Sbjct: 425 ---FLATGSEKA--LSLAKRNGKFLWEKMFKGQV--LYRSYKDGQAYTPAFLEDYAAVIR 477
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
G + LY+ T+WL+ A EL + E + D G +F + ++ KE D P
Sbjct: 478 GYISLYQASFETEWLLKAKELTDLVLEQYYDEGDGFFFFNNPKAEKLIANKKELFDNVIP 537
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC--AADML- 616
+ NSV NL L + Y+ AEH LA +K + + P C A+ ML
Sbjct: 538 ASNSVMARNLQDLGLYFY---QEEYQAIAEHMLA----SVKRLILTEPGFLCNWASLMLH 590
Query: 617 SVPSRKHVVLVG 628
++ + V +VG
Sbjct: 591 TLVPKAEVAVVG 602
>gi|336172537|ref|YP_004579675.1| hypothetical protein [Lacinutrix sp. 5H-3-7-4]
gi|334727109|gb|AEH01247.1| hypothetical protein Lacal_1399 [Lacinutrix sp. 5H-3-7-4]
Length = 679
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 215/686 (31%), Positives = 333/686 (48%), Gaps = 76/686 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+E VA ++N F++IK+DREERPD+D+VYM VQ + G GGWP++V
Sbjct: 54 CHWCHVMEHESFENEDVAIVMNSNFINIKIDREERPDIDQVYMNAVQLMTGSGGWPMNVV 113
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTYF E + L ++ D + K D L + +L++ + A
Sbjct: 114 ALPDGRPVWGGTYFKKEQ------WVNALNQISDLYKKNPDKLYEYAT----KLAKGIKA 163
Query: 142 S--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
N + L+ S +D+ GG G PKF P Q +L
Sbjct: 164 MDLIKPNTNEPKFDTTFLKEIIADWSVYFDTNKGGIGKEPKFMMPNNYQFLL-------- 215
Query: 200 TGKSGEASEGQKMVLF---TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ G + +K++ F TL MA GGI+D +GGGF RYSVD++WHVPHFEKMLYD Q
Sbjct: 216 --RYGYQKQDKKILDFVNTTLTKMAYGGIYDQIGGGFSRYSVDDKWHVPHFEKMLYDNAQ 273
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L ++Y +AF+LTK+ Y + + L++++R++ G G +S+ DADS + +EGA
Sbjct: 274 LVSLYAEAFALTKNELYENVVIETLEFIKRELTGTNGIFYSSLDADSLTEDNVL--EEGA 331
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+YVW +E++ +L + LF +Y + G + H + VLI +
Sbjct: 332 YYVWKKEELQTLLKDDFKLFSTYYNVNNYGYWE-------HKNY----VLIRDKNDLKFT 380
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
++ + LEK + L R KR P LDDK + SWN L++ + A ++L+ E
Sbjct: 381 NQENITLEKLKEKKKRWKSILLKEREKRNLPRLDDKTLTSWNALMLKGYVDAYRVLQDE- 439
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y++ A A FI + E L H+++NG S GFL+DYA
Sbjct: 440 ---------------NYLDCAIKNAEFILNNQLKEDG-SLYHNYKNGASSINGFLEDYAT 483
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
I L LY+ S KWL A L + + F D E +F T+ +D ++++ E D
Sbjct: 484 TIDAFLALYQVTSTIKWLDNAKALTDYCFDTFFDTESQLFFFTSNQDKKLIVQTIEYRDN 543
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
P+ NS+ L L+ ++YY + +++ L + + A
Sbjct: 544 VIPASNSIMANCLYMLSHFY---NNNYYLKTSKNMLNNIKPEIHQYGSAFSNWMSLMLNF 600
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ P + V + G K+++ + DLNK + E + NN +
Sbjct: 601 TEPFYE-VAITGDKANIKVK----------DLNKEYLPNKIVACSERN-------NNLPL 642
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTD 702
N + +K + VC N +C PV +
Sbjct: 643 LHNRYVENKTLIYVCVNNTCKLPVIN 668
>gi|345006662|ref|YP_004809515.1| hypothetical protein [halophilic archaeon DL31]
gi|344322288|gb|AEN07142.1| hypothetical protein Halar_3548 [halophilic archaeon DL31]
Length = 727
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 227/708 (32%), Positives = 341/708 (48%), Gaps = 67/708 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED VA+ +N+ FV +KVDREERPD+D+VY T Q + GGGGWPLS
Sbjct: 50 SACHWCHVMAEESFEDPAVAETINENFVPVKVDREERPDLDRVYQTVCQLVTGGGGWPLS 109
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGR--PGFKTILRKVKDAW---DKKRDM---LAQSGAFA 131
+L+P+ KP GTYFPPE R PGF+ + R++ D+W +++++M Q A A
Sbjct: 110 AWLTPEGKPFYIGTYFPPEPHPQRNAPGFQDLCRQIADSWSDPEQRQEMENRAEQWTAAA 169
Query: 132 IEQLSEALSASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGS-APKFPRPVEIQ 188
++L A + + ++ E + L A + + D GGFGS PKFP P ++
Sbjct: 170 RDRLEPASTGRNTESETATETLSSTELLDDAAAAVVRGADRTNGGFGSGGPKFPHPGRVE 229
Query: 189 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 248
++L ++ G GE + L M GG++DH+GGGFHRY VD W VPHFE
Sbjct: 230 LLL----RVAALGDDGEP---LSVARNALNAMGSGGLYDHLGGGFHRYCVDAEWTVPHFE 282
Query: 249 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 308
KM YD G + +L + + + R+ L+++ R++ P G +S DA S ET
Sbjct: 283 KMAYDNGTIPAAFLAGYRAMGRERDAEVVRETLEFVSRELRHPDGGFYSTLDARS-ETPA 341
Query: 309 A-------TRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGN----CDLSRMSDP 356
+ ++EGAFYVWT E+ ++ E A LF Y + GN + + P
Sbjct: 342 SRLEDDEEPEREEGAFYVWTPAEIRAVVDEPAATLFCRRYGVISGGNFEGGTSVLNETVP 401
Query: 357 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
E G E ++ +A S+ E +L ++LF+ R +RPRP D+KV+
Sbjct: 402 IAELVGA----EFDEGTAPDSE-----EAVEELLQTATQELFEARGERPRPLRDEKVLAG 452
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNGL+IS+FA A +L +Y E A++A SF+R HL+D RL
Sbjct: 453 WNGLLISTFAEAGLVLDD-----------------QYTEDAQAALSFVREHLWDADARRL 495
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
F++G G+L+DYAFL G + Y+ + L +A+EL + F D + G
Sbjct: 496 SRRFKDGDVAVSGYLEDYAFLGRGAFETYQATGNVEPLSFALELAEVIADAFYDADDGTL 555
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
+ T + ++ R +E D + PS +V L+ L S R +LA
Sbjct: 556 YFTANDAEELVARPQELTDQSTPSSVGAAVSLLLELDSFTDRDLGAVARD----TLATHR 611
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
R++ + + AAD + V G E + Y +
Sbjct: 612 DRIEASPVEHVSLVLAADAADRGPLELTVAAGELP----EEWRETLRSRYLPGAVLARRP 667
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSA--DKVVALVCQNFSCSPPVTD 702
P ++ +E A N A + C++F+CSPP TD
Sbjct: 668 PTKAGLKEWLDELGLEEAPPIWANREAREGEPTVYACRSFTCSPPETD 715
>gi|418471574|ref|ZP_13041379.1| hypothetical protein SMCF_4347 [Streptomyces coelicoflavus ZG0656]
gi|371547815|gb|EHN76170.1| hypothetical protein SMCF_4347 [Streptomyces coelicoflavus ZG0656]
Length = 680
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 240/709 (33%), Positives = 341/709 (48%), Gaps = 88/709 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED A+ LN FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SACHWCHVMAHESFEDGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPPE ++G P F+ +L+ V+ AW ++RD +++ + L+
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGMPSFRQVLQGVQQAWAERRDEVSEVAGKIVRDLAGRE 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+S + ++L Q L L++ YD++ GGFG APKFP + I+ +L H +
Sbjct: 168 ISYGDAEAPGEEQLGQALL-----GLTREYDAQRGGFGGAPKFPPSMAIEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GG++D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGAEG----ALQMAADTCERMARGGLYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS +G + EGA+Y
Sbjct: 276 RVYAHLWRATGSDLARRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAYY 333
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT ++ ++LG E A L +++ + G + H AS
Sbjct: 334 VWTPAQLTEVLGAEDAELAAQYFGVTEEGTFE-------HG-----------------AS 369
Query: 378 KLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
L +P ++ + + R +L R RP P DDKV+ +WNGL I++ A
Sbjct: 370 VLQLPQQEGVFDAARIASVRERLLAARDGRPAPGRDDKVVAAWNGLAIAALAET------ 423
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDD 493
A F P +A +R HL DEQ R+ + ++G P G L+D
Sbjct: 424 ---GAYFERP------DLVEAAVAAADLLVRLHL-DEQV-RITRTSKDGRPGANAGVLED 472
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA G L L WL +A L + F D G G T D L+R +D
Sbjct: 473 YADAAEGFLALASVTGEGVWLDFAGFLLDHVLTRFTD--GSGSLYDTAADAEQLIRRPQD 530
Query: 554 -HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---- 608
D A PSG S + L+ A A + S+ +R AEH+L V +K + VP
Sbjct: 531 PTDNATPSGWSAAAGALLTYA---AHTGSEPHRTAAEHALGV----VKALGPRVPRFIGW 583
Query: 609 -MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
+ A +L P + V +VG A A+ L++T + + A + F
Sbjct: 584 GLAAAEALLDGP--REVAVVGPAP---------ADPAARGLHRTAL-LGTAPGAVVAFGT 631
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
E + +A A VC+NF+C P TDP L L P+
Sbjct: 632 EGSDEFPLLADRPLVGGAAAAYVCRNFTCDAPTTDPERLRAALGAAPTG 680
>gi|455649958|gb|EMF28748.1| hypothetical protein H114_12956 [Streptomyces gancidicus BKS 13-15]
Length = 679
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 232/700 (33%), Positives = 336/700 (48%), Gaps = 81/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A +N FVSIKVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDQATADEMNAHFVSIKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEA 138
VFL+PD +P GTYFPP ++G P F+ +L V AW ++RD + + +G +
Sbjct: 108 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVAQAWAERRDEVGEVAGKITRDLAGRE 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
LS EL Q L L++ YD++ GGFG APKFP + ++ +L H +
Sbjct: 168 LSVGGDEVPGEQELAQALL-----GLTREYDAQRGGFGGAPKFPPSMVLEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGAEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ P G SA DADS +G R EGA+Y
Sbjct: 276 RVYTHLWRTTGSELARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYY 333
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
VWT ++ ++LG+ Y+ +++ +G +VL + D A A+
Sbjct: 334 VWTPAQLREVLGDADAEPAARYF----------GVTEEGTFEEGASVLQLPQRDEVADAA 383
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ + R +L R +RP P DDKV+ +WNGL I++ A
Sbjct: 384 R-----------IDGIRERLLAARDRRPAPGRDDKVVAAWNGLAIAALAETGACFG---- 428
Query: 438 SAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
R + +E A +A +R HL D R+ + ++G A G L+DYA
Sbjct: 429 ------------RPDLVEAAVAAGDLLVRVHLDDHA--RIARTSKDGQVGANAGVLEDYA 474
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + FLD E G ++T + ++ R ++ D
Sbjct: 475 DVAEGFLALASVTGEGVWLDFAGLLVDHILARFLDAESGALYDTASDAERLIRRPQDPTD 534
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A PSG + + L A + S+ +R AE +L V +K + VP +
Sbjct: 535 NAAPSGWTAAAGA---LLGYAAHTGSEPHRTAAERALGV----VKALGPRVPRFIGWGLA 587
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A +L P + V +VG A A+ +L++T + + A + E +
Sbjct: 588 VAEAVLDGP--REVAVVGRG---------ADDPATAELHRTAL-LGTAPGAVVAVGTEGS 635
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC+NF+C P TDP L L
Sbjct: 636 DEFPLLADRPLVDGAPAAYVCRNFTCDAPTTDPDRLRTAL 675
>gi|322435300|ref|YP_004217512.1| hypothetical protein AciX9_1682 [Granulicella tundricola MP5ACTX9]
gi|321163027|gb|ADW68732.1| hypothetical protein AciX9_1682 [Granulicella tundricola MP5ACTX9]
Length = 702
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 224/702 (31%), Positives = 345/702 (49%), Gaps = 60/702 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM+ ES+E+ A+L+N+ F++IKVDR+ERPDVD Y V A+ G GGWPL+ F
Sbjct: 48 CHWCHVMDRESYENAETARLINEHFIAIKVDRDERPDVDARYQAAVAAISGQGGWPLTAF 107
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEAL 139
L+P +P GGTYFPP D++GRPG + +L + +A+ KR+ + + I + +E+
Sbjct: 108 LTPQGQPYFGGTYFPPLDQHGRPGLRRVLMTMAEAFQNKREEVMDTAGSVIAAIEHNESF 167
Query: 140 SASASS--NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
SAS+ +L D+L +AL + +D R GGFGS PKFP + +++ + ++
Sbjct: 168 DGSASNPGTELVDKLIASAL--------QQFDRRNGGFGSQPKFPNSGALDLLIDAASRV 219
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ G A+ + FTL+ M+KGGI+DH+ GGFHRYSVDERW VPHFEKM YD +L
Sbjct: 220 --GSQDGIAAAARATAAFTLEKMSKGGIYDHLAGGFHRYSVDERWVVPHFEKMSYDNSEL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG-GEIFSAEDADSAETEGATRKKEGA 316
Y+ A+ + + I R+I+ ++ M G ++++DAD A +G
Sbjct: 278 LKNYVHAYQTFVEPECARIAREIIRWVEEVMSDRELGGFYASQDAD------ANLDDDGD 331
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
++ WT E L + + +Y D+ + D H+ + KN L A
Sbjct: 332 YFTWTLAEARAALTKKELAVTAPFY-------DIGELGDMHHNPQ-KNTLHVDQPLETVA 383
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G+ L++ +L KL+ R RP P++D + +WN ++IS+ A+++L A
Sbjct: 384 KAAGVSLDQASALLQTSLPKLYAARKTRPTPYIDKTLYTAWNAMMISAHLEAARVL---A 440
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ A F + DR + A S Y E + PG LDDYAF
Sbjct: 441 DPATRLFALKTLDR--VLSTAWHEGSLDHVIAYGESSEPTD--------PIPGILDDYAF 490
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL------LRV 550
LD +E + A+ L + F D E GG+F+T P L R
Sbjct: 491 TGHAALDAWEATGHISYFNSALALADAAITKFYDEEKGGFFDTETPAPGELRLGALSTRR 550
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
K D P+GN V+ L + A + + ++Q A+ +L F ++ +
Sbjct: 551 KPLQDSPTPAGNPVAAAL---LLRLEALTGREDFKQMAKATLECFAAVVEHFGLYAATFG 607
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A L +P + VV+VG S D + AA Y +NKTV+ + P+ +
Sbjct: 608 LALQRLLLPPIQ-VVIVGEDSVAD--RLERAALGRYAVNKTVVRLTPSQLTTLP------ 658
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ A + + A VC F+C PPV P +L +LLE
Sbjct: 659 PSLAQTLPHFLTTLGSYAAVCTGFTCRPPVNTPEALAEILLE 700
>gi|441179453|ref|ZP_20970097.1| hypothetical protein SRIM_39324 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440614431|gb|ELQ77705.1| hypothetical protein SRIM_39324 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 641
Score = 328 bits (842), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 228/712 (32%), Positives = 343/712 (48%), Gaps = 103/712 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE VA ++N+ FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 10 SACHWCHVMAHESFEDEAVAAVINEHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMT 69
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---- 135
VFL+PD +P GTYFPP ++G P F IL+ V+ AW ++RD + + + L
Sbjct: 70 VFLTPDAEPFYFGTYFPPAPRHGMPSFPQILQGVRGAWAERRDEVGEVAGRIVADLSARS 129
Query: 136 -SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
SE L+ P++L L L++ +D+ GGFG APKFP + ++ +L H
Sbjct: 130 VSETLAKGGQVPPGPEDLASALL-----ALTRDFDAVHGGFGGAPKFPPSMALEFLLRHH 184
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ E+ +MV T + MA+GGI+D +GGGF RY+VD W VPHFEKMLYD
Sbjct: 185 ART-------ESEAALQMVQATAEAMARGGIYDQLGGGFARYAVDATWTVPHFEKMLYDN 237
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
L Y + +T + + D++ R++ G SA DADS +G+ + E
Sbjct: 238 ALLCRTYAHLWRVTGSDLARRVAVETADFMVRELRTEEGGFASALDADS--DDGSGKHVE 295
Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
GA+YVWT +++ +LGE HY+ G + F+ +++L D+
Sbjct: 296 GAYYVWTPEQLRAVLGEKDAAVAAHYF----GVTE-------EGTFEEGASVLQLPDTDD 344
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
E+ +I + +L R RPRP DDKV+ +WNGL I++ A
Sbjct: 345 LVDA-----ERIASI----KERLRAARDSRPRPGRDDKVVAAWNGLAIAALAETGAYF-- 393
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDD 493
DR + ++ A AA + R D Q RL + R+G + A G L+D
Sbjct: 394 --------------DRPDLVQAATDAADLLVRVHMDWQA-RLHRTSRDGVAGANSGVLED 438
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-------EGGGYFNTTGEDPSV 546
YA + G L L W+ +A LFLD E G ++T + +
Sbjct: 439 YADVAEGFLALASVTGEGVWVDFA--------GLFLDTVIVHFTAEDGTLYDTADDAEQL 490
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
+ R ++ D A PSG + + L+ A++ + S +R+ AE +L V +K ++
Sbjct: 491 IRRPQDPTDNATPSGWTAAAGALLSYAAL---TGSGPHREAAERALGV----VKALSGRA 543
Query: 607 PL-----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPA 658
P + A L P + V +VG D + A H + L V+ +
Sbjct: 544 PRFIGWGLAVAEAALDGP--REVAVVGP----DGDPATRALHRAALLGTAPGAVVALGAP 597
Query: 659 DTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++E+ ++ + A A VC++F+C P TDP L L
Sbjct: 598 GSDEVPLLKDRPLVDGRPA----------AYVCRHFTCERPTTDPEELGEKL 639
>gi|320107222|ref|YP_004182812.1| N-acylglucosamine 2-epimerase [Terriglobus saanensis SP1PR4]
gi|319925743|gb|ADV82818.1| N-acylglucosamine 2-epimerase [Terriglobus saanensis SP1PR4]
Length = 714
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 227/695 (32%), Positives = 345/695 (49%), Gaps = 66/695 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM+ ES+E+ A L+N +F++IKVDR+ERPDVD Y V A+ G GGWPL+ F
Sbjct: 62 CHWCHVMDRESYENADTADLINRYFIAIKVDRDERPDVDTRYQAAVSAISGQGGWPLTAF 121
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+P+ KP GGTYFPPED++GRP F+ +L+ + DA+ +R + S ++ + S
Sbjct: 122 LTPEGKPFFGGTYFPPEDRFGRPSFQRVLQTMADAFQDRRSEVEDSADSVMQAIEFNESF 181
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
S S+ L +L + AE + K +D ++GGFGS PKFP P + + L D
Sbjct: 182 SGRSSDLGPDL----VNKLAESMLKQFDPQYGGFGSQPKFPHPGALDL-------LTDIA 230
Query: 202 KSGE--ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G A + +V TL MA GG+ D +GGGFHRYSVDERW VPHFEKM YD +L
Sbjct: 231 SRGGPLAEQASNVVRVTLDKMALGGMRDQIGGGFHRYSVDERWVVPHFEKMAYDNAELLK 290
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFY 318
Y+ AF Y+ + R+IL ++ + G +S++DAD T +G ++
Sbjct: 291 SYVRAFRTFLVPEYAEVAREILRWMDGTLSDRERGGFYSSQDAD------LTLDDDGDYF 344
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT E +L + E YY D+ + D H++ +NVL + + +
Sbjct: 345 TWTRDEAAAVLSPEELAVAEIYY-------DIGEIGDMHHD-PSRNVLHVRYTLAEVSRR 396
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+G+ E+ ++L R KL RS+R P +D + WNGL I+++ A + L ++ E+
Sbjct: 397 IGITEEEVQSLLLSLRGKLASARSERAAPFVDRTMYTGWNGLCIAAYLEAGRALHNQ-ET 455
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT---HRLQHSFRNGPSKA-PGFLDDY 494
F + DR + + ++E+T H + ++ + P++A G L+DY
Sbjct: 456 VQFGLRSL--DR-------------LLQEAWNEETGLGHVISYADGHVPAQAVAGVLEDY 500
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL----LRV 550
AF + +E ++WL A L F D GGG+F+T L R
Sbjct: 501 AFAGLACVAAWEVTGESRWLRHAEALAARMIRDFADAVGGGFFDTARGSGVALGALSARR 560
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
K D P+GNS + + L++LA K + A +L F ++ +
Sbjct: 561 KPLQDSPTPAGNSAAALFLLQLADWTMDEK---LQAKAADTLETFAGIVEHFGLYAATFG 617
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM--DFWEE 668
A L +P + VV+ SS E AAA A Y K+V+ + + E++ E
Sbjct: 618 LALQRLLLPEIQIVVVGEDDSSAVLE---AAALAGYSATKSVLRLKRSQLEDLRGPMAET 674
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
A M N+F A+VC + C PP +DP
Sbjct: 675 LPHLPAEMFENSF------AMVCGDGRCQPPTSDP 703
>gi|386360498|ref|YP_006058743.1| thioredoxin domain-containing protein [Thermus thermophilus JL-18]
gi|383509525|gb|AFH38957.1| thioredoxin domain-containing protein [Thermus thermophilus JL-18]
Length = 639
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 219/592 (36%), Positives = 303/592 (51%), Gaps = 83/592 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESF+DE VA+LLN FV +KVDREERPDVD YM + +L G GGWP+S
Sbjct: 47 HTCHWCHVMHRESFQDEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMS 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+P+ KP GGTYFP ED+ G PGFK +L V +AW KR+ + + E+L+ AL
Sbjct: 107 LFLTPEGKPFFGGTYFPKEDRMGLPGFKRVLVAVAEAWTGKREAVLEEA----ERLTRAL 162
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S + P LP+ A + L +++D +GGF APKFP+ + +L + + E+
Sbjct: 163 WKSLTPP--PGPLPEGAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+++ TL+ MA GG++D VGGGFHRYSVD W +PHFEKMLYD LA
Sbjct: 221 --------RAARLLRPTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLAR 272
Query: 260 VYLDAFSLTKDVFYSYICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
VYL A+ L + + + R+ LD+L RR+ G +A D AE+EG +EG
Sbjct: 273 VYLGAYKLFGEDLFLRVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEG 320
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
+Y WT E+ + LGE L + ++ L DL ++VL ++
Sbjct: 321 RYYTWTEAELREALGEDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEVR 366
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
+ LG E + R KL R +R P LDDKV+ W+ L + + A A ++ E
Sbjct: 367 EA-LG---EGFFAWREGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEE 422
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
A Y+E A+ A F+ H+Y + L+H++R G +L D A
Sbjct: 423 A----------------YLEAAKRGARFLLAHMY--RGGLLRHTWR-GSLGEEAYLSDQA 463
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
F L+LY +L WA LF REG PS+ L KE +
Sbjct: 464 FAALAFLELYAATGEWPYLDWAQRFAEAGWRLF--REG----------PSLPLPAKEVEE 511
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
GA PSG S LVRL ++ G YR+ AE LA L A+P
Sbjct: 512 GALPSGESALAEALVRLGAVFGGD----YRERAEEVLAEKARWLARYPHALP 559
>gi|390957418|ref|YP_006421175.1| thioredoxin domain-containing protein [Terriglobus roseus DSM
18391]
gi|390412336|gb|AFL87840.1| thioredoxin domain protein [Terriglobus roseus DSM 18391]
Length = 710
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 219/700 (31%), Positives = 338/700 (48%), Gaps = 68/700 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM+ ES+E+ A L+N++FV++KVDR+ERPDVD Y V A+ G GGWPL+ F
Sbjct: 54 CHWCHVMDRESYENAETAALINEYFVAVKVDRDERPDVDTRYQAAVAAISGQGGWPLTAF 113
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL------ 135
L+PD +P GGTYFPPE++YGRP F+ +L + ++ K + +S + +E +
Sbjct: 114 LTPDGRPYFGGTYFPPEERYGRPSFRRVLMTMAGSFYDKHHEVEESASSVMEAIEYSETF 173
Query: 136 ---SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 192
+ L AS +S L D+L AL K +D GGFGS PKFP P ++M+L
Sbjct: 174 TGDATDLDASGASLALLDKLIDGAL--------KQFDPIHGGFGSQPKFPHPAALEMLLD 225
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
+ + A + + L +L+ MA+GGI D + GGFHRYSVDERW VPHFEKM Y
Sbjct: 226 AASR-----PGPNAPQCAEAALVSLKKMARGGIFDQLAGGFHRYSVDERWVVPHFEKMAY 280
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATR 311
D +L Y+ AF D + R + ++ + G + ++DAD +
Sbjct: 281 DNSELLRAYVHAFQTFVDPECADAARATMQWMDEWLSDRERGGFYGSQDAD------LSL 334
Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+G ++ W+ E +L E E YY D+ + D H++ +NVL
Sbjct: 335 DDDGGYFTWSRDEAAAVLTEDEAKLAELYY-------DIGAVGDMHHD-PARNVLFRPMT 386
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+A + G+ E +L R KL R +RP P +D + WN + IS++ RA ++
Sbjct: 387 LEQAAQQAGVDAEIAPMMLKVMRSKLLAARLQRPTPFVDKTIYTGWNAMCISAYVRAGRV 446
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGF 490
L+ A F DR ++VA + H + +S P + G
Sbjct: 447 LQVPGAVA---FACKSLDR--VLDVALVEGTL---------KHVVAYSDPAAPHTDVAGV 492
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL--- 547
LDDY FL LD++E + A L T F D +GGG+F+ + +
Sbjct: 493 LDDYVFLGHACLDVWEATGEIVYFEAARVLATTLLRKFYDGKGGGFFDMASDSTETIGAL 552
Query: 548 -LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
R K D P+GN L+RL ++ + + YR+ A+ +L F ++ + +
Sbjct: 553 STRRKPVQDAPTPAGNPAGAALLLRLHAL---TGDETYRETAQETLETFAVIVEHLGLYG 609
Query: 607 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
P A L+ P+ + V++ G + E + A A + +NK+V+ I A +
Sbjct: 610 PTFGLALGRLARPAVQVVIVGGGAKAAQLEMV---ALARFAVNKSVVRIARAQLGAL--- 663
Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
A + +D+ +ALVC +C PP+ D L
Sbjct: 664 ---PPALAETLPHLPDSDEAIALVCSGMTCQPPIRDAAEL 700
>gi|452943278|ref|YP_007499443.1| thymidylate kinase [Hydrogenobaculum sp. HO]
gi|452881696|gb|AGG14400.1| thymidylate kinase [Hydrogenobaculum sp. HO]
Length = 634
Score = 328 bits (840), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 214/612 (34%), Positives = 306/612 (50%), Gaps = 85/612 (13%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
+F K + FL ++CHWCHVME ESFEDE VA LN +FVSIKVD+EERPD
Sbjct: 29 SEEAFDKAIKENKPVFLSIGYSSCHWCHVMEKESFEDEEVASFLNKYFVSIKVDKEERPD 88
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +YM Y L GGWPLS FL+P +P GTYFP + F +L+++KD WD
Sbjct: 89 IDSLYMEYCVLLNNSGGWPLSAFLTPTKEPFFAGTYFP------KASFLKLLQQIKDLWD 142
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K + + +EQL + +++ EL ++ + L+ YD FGGF A
Sbjct: 143 KDSKNIIEKSKRLVEQLKQFMNSFEKR-----ELNESFIDKALFGLANRYDEEFGGFSEA 197
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP + ++L K+ Q M L TL M +GGI DHVGGGFHRYS
Sbjct: 198 PKFPSLHNVLLLLKSQKQ-----------PFQDMALSTLLNMRRGGIWDHVGGGFHRYST 246
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W +PHFEKMLYDQ Y +A+ LTK+ + +++++ ++ G +++
Sbjct: 247 DRYWLLPHFEKMLYDQAMAILAYSEAYRLTKNEIFKDTVYKTINFVKENLY-ENGFFYTS 305
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
DAD TEG +EG FY+WT +E++DIL E A F E + +K GN + +
Sbjct: 306 MDAD---TEG----EEGGFYLWTYQEIKDILKEKADKFIEFFNIKKEGNF----LDEAKR 354
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
+ GKNVL A + + E+ L IL R KR +P +DDK+++ N
Sbjct: 355 VYTGKNVLY--------AKEPSLAFEEELKILKA-------FREKRKKPLIDDKILLDQN 399
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
++ + A + D K+++++A ++L + H LQH
Sbjct: 400 AMMDFALIEAYLVF----------------DDKDFLDMA-------TKNLNNISKHPLQH 436
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
+ + P LDDYA+LI L LY+ L AI L E D+ GG++
Sbjct: 437 ALNHNKLIEP-MLDDYAYLIKAYLSLYKATFSKDALEKAISLTEETIEKLWDKNAGGFYL 495
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
+ G+D VL+ K +DGA PSGNSV +NLV L I +K D Y E+ + +
Sbjct: 496 SVGKD--VLIPQKTLYDGAIPSGNSVMGLNLVELFFI---TKEDTY----ENRYQILSSI 546
Query: 599 LKDMAMAVPLMC 610
DM P C
Sbjct: 547 YSDMLSRNPTAC 558
>gi|407781159|ref|ZP_11128379.1| hypothetical protein P24_03046 [Oceanibaculum indicum P24]
gi|407208585|gb|EKE78503.1| hypothetical protein P24_03046 [Oceanibaculum indicum P24]
Length = 680
Score = 328 bits (840), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 224/705 (31%), Positives = 335/705 (47%), Gaps = 86/705 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED+ A L+N FV++KVDREERPD+D +Y + + L GGWPL++
Sbjct: 50 ACHWCHVMAHESFEDDETAALMNRLFVNVKVDREERPDIDHIYQSALAILGEQGGWPLTM 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD P GGTYFP E +YGRPGFK +L+ + DA + D ++++ + + L +
Sbjct: 110 FLTPDGDPFWGGTYFPKEARYGRPGFKAVLQAIADAHAEGSDKVSRNASALRQALRQLAE 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+A N P L + AE+L + D GG G APKFP+P + ++ H
Sbjct: 170 PAAGENIEPALLDR-----IAERLHREIDPIHGGIGGAPKFPQPGMLMLLWRHWL----- 219
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+SG + + VL TL+ M +GGI+DH+GGGF RYS D +W PHFEKMLYD QL +
Sbjct: 220 -RSGN-QDSRDYVLLTLERMCQGGIYDHLGGGFARYSTDAQWLAPHFEKMLYDNAQLIEM 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
A T + + + ++ R+MI G S+ DADS EG +EG FYVW
Sbjct: 278 LTHAALETGRPLFRQRLEETIGWVLREMITDEGGFASSLDADS---EG----EEGKFYVW 330
Query: 321 TSKEVEDIL----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
E++ +L GE FK Y + P GN + + +N +L + +A +
Sbjct: 331 REAEIDQLLAHLPGEALESFKRAYDVTPEGNWEGVTILH-------RNRRPDLGNGAAES 383
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
L + R+ LF+ R +R RP DDKV+ WNGL+I + A+AS
Sbjct: 384 Q------------LAQVRQLLFEHREQRERPGWDDKVLADWNGLMIRALAQAS------- 424
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F F +++ A A ++ + + RL+HS R + P L+DYA
Sbjct: 425 ----FAFA-----HADWLRAAIRAFDYVVEKMTLDG--RLRHSRRGDILRHPATLEDYAN 473
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ S L L++ ++L AI + D + D EGGGYF T + V+LR K D
Sbjct: 474 MASAALALFQITRHQRFLGQAIAWVDVLDRHYWDHEGGGYFTTADDTNDVVLRAKNAQDN 533
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A P+GN + L L + + D YR A+ + F + + D+
Sbjct: 534 AVPAGNGTMLQVLTTLYHL---TGDDSYRGKADLLIPRFAGEIGRNFFPLATFLNGCDIA 590
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
P + + L G ++ + +L A AD ++
Sbjct: 591 QRPLQ--ITLTGDPTTPTYVGLLRAI---------------ADVSAPGLILHQLGQKGAL 633
Query: 677 ARNNFSADKV------VALVCQNFSCSPPVTDPISLENLLLEKPS 715
N+ ++ + A +C CS P+ +P +L LL S
Sbjct: 634 PSNHPASTALEGTLQSAAYLCVGQRCSLPLREPKALSEALLAARS 678
>gi|94969411|ref|YP_591459.1| hypothetical protein Acid345_2384 [Candidatus Koribacter versatilis
Ellin345]
gi|94551461|gb|ABF41385.1| protein of unknown function DUF255 [Candidatus Koribacter
versatilis Ellin345]
Length = 705
Score = 328 bits (840), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 216/699 (30%), Positives = 340/699 (48%), Gaps = 62/699 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM+ ES++D VA +LN F++IKVDR+ERPDVD Y T V A+ G GGWPL+ F
Sbjct: 53 CHWCHVMDRESYDDPEVADILNREFIAIKVDRDERPDVDSRYQTAVAAITGQGGWPLTAF 112
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+ + KP GGTYFPP D +GRPGFK IL + DA+ +RD + + + L A
Sbjct: 113 LTTEGKPFYGGTYFPPRDAHGRPGFKKILLAIADAYKNRRDDVLREADGMMTALHHAEGL 172
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDT 200
+ + + + + S+D + GGFGSAPKFP ++++L ++++ T
Sbjct: 173 AGHGG----DFNPRVITMMVQSALNSFDPKNGGFGSAPKFPHASIVEVLLDWYAR----T 224
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ G A+ + TL+ MA+GG++D + GGFHRYSVDE W VPHFEKM YD +L
Sbjct: 225 GEDGAANVART----TLEKMAQGGVYDQIAGGFHRYSVDENWIVPHFEKMSYDNSELLRN 280
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y+ A L D ++ +DI+ ++ + G ++++DAD + +G ++
Sbjct: 281 YVHAAQLFPDAAFAETAKDIIRWVDSTLTDREHGGFYASQDAD------INLEDDGDYFT 334
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT E + L +Y D++ + + H+ KNVL + A +L
Sbjct: 335 WTVDEAKAALTAQEFEVAALHY-------DINEVGEMHHN-SAKNVLWIRAEVEEIAMRL 386
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ ++ +L ++K+ R +RP P++D V V+WN + +S++ A ++L +
Sbjct: 387 SLKPDQIRMLLNSAKQKMLVARLQRPTPYIDKTVYVNWNAMFVSAYLAAGRVLGMKDAH- 445
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSK-APGFLDDYAF 496
+F + DR I D+Q H + +S N + + G LDDY F
Sbjct: 446 --HFALRTLDR-------------ILGQWNDKQQLPHVIAYSDPNAVLRESRGLLDDYVF 490
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-----RVK 551
LD YE + A ++ +T F D GG+F+ V L R K
Sbjct: 491 TALACLDAYEATGDLTYFRCAQQIADTAIAKFGDATSGGFFDAEPTTEQVALGALSVRRK 550
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
D P+GN + I ++RL + ++ YR AE +L F ++ +
Sbjct: 551 AFQDSPTPAGNPAAAILMLRLHAYTNDTR---YRDKAEDTLETFAGAVEQFGIYAGTYGR 607
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
AA S P + V++ S+ D E AA ++ N +VI + AD +
Sbjct: 608 AAIWFSKPHTQVVIIGTDASAADLER---AAFQTFAENLSVIRLAQADAHLLPPALAETI 664
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
N + + VA+VC NF+C PP+T L + L
Sbjct: 665 PNVPGVNDG----RAVAVVCSNFACQPPITSAQDLTDTL 699
>gi|389645929|ref|XP_003720596.1| spermatogenesis-associated protein 20 [Magnaporthe oryzae 70-15]
gi|351637988|gb|EHA45853.1| spermatogenesis-associated protein 20 [Magnaporthe oryzae 70-15]
Length = 865
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 224/669 (33%), Positives = 336/669 (50%), Gaps = 133/669 (19%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CH+C + ESF ++ VA LLN F+ I VDREERPD+D +YM Y+QA+ GGWPL+VF
Sbjct: 96 CHYCRLTTQESFRNKNVAALLNSSFIPILVDREERPDIDSIYMNYIQAVNSAGGWPLNVF 155
Query: 82 LSPDLKPLMGGTYFPP---------EDKYGRPGFKTILRKVKDAWDKK--------RDML 124
L+P+L+P+ GGTY+P ED F IL+K++ W ++ +D++
Sbjct: 156 LTPELEPVFGGTYWPGPGRSTSSAVEDGEEPLDFLGILKKLQKVWTEQEAKCRKEAQDIV 215
Query: 125 AQSGAFA-----------------------------------IEQLSEALSASASSNKLP 149
Q FA E + ++ASAS+ L
Sbjct: 216 LQLREFAAEGTMGVGNTEKVPSVATTGATVNISTGVAAPTTSTETPKKTVTASASATDLD 275
Query: 150 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLED-TGKSGE 205
+L Q L +S+S+D GGF +PKFP P ++ +L + ++ D G E
Sbjct: 276 VDLDQ--LEEAYANISRSFDRVNGGFNLSPKFPTPPKLSFLLRLAHLPPEVGDIVGGPEE 333
Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
+ M L TL+ + GG+ DH+G GFHRYSV W VPHFEKM+ D L VYLDA+
Sbjct: 334 IARATHMALATLRALRDGGLRDHIGAGFHRYSVTADWSVPHFEKMIADNALLLGVYLDAW 393
Query: 266 ---------SLTKDVFYSYICRDILDYLRRDMIGPGGEIFS-----------AEDADSAE 305
+ T + ++ + ++ DYL PG E S +E +DS +
Sbjct: 394 LGQAAKEGRAPTLEDEFADVVLELGDYLG----NPGSEFGSSSTCQDSLLPTSEASDSYQ 449
Query: 306 TEGATRKKEGAFYVWTSKEVEDIL----------GEH-----AILFKEHYYLKPTGNCDL 350
+ +EGAFY+WT +E + + G+H A + ++ +K GN +
Sbjct: 450 RKSDKHMREGAFYLWTRREFDATVSNTEDGDLTNGKHDGDFYARVAAAYWNVKEHGN--I 507
Query: 351 SRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHL 409
DPH+EF +NVL + + ++ G+ +++ IL E RRKL R S R RP +
Sbjct: 508 PEEQDPHDEFINQNVLRVVKTPAELSTSFGIAVDEVNQILAEARRKLRARRDSDRVRPEV 567
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR---KEYMEVAESAASFIRR 466
D+K +V++N + +S+ ARA +L S G D+ +M A+ AA ++
Sbjct: 568 DEKQVVAYNAMAMSALARAGVVLWS-----------TGLDKHRGSAWMMCAKQAAIEMKG 616
Query: 467 HLYDEQTHRL-QHSFRNGPSKAPGFLDDYAFLISGLLDLYE-FGSGTKWLVWAIELQNTQ 524
LYD++T +L +H FRN S +DYAFLI LLDLY+ G + +L WA +LQ+ Q
Sbjct: 617 RLYDQETGKLSRHWFRNKKSSTDALAEDYAFLIEALLDLYDATGDESAYLDWAKQLQDKQ 676
Query: 525 DELFLDREG-----------------GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 567
E+F DR GG+++T E P V+LR+K+ D ++PS N+VS
Sbjct: 677 IEMFYDRVAPSSQNLDSDAAKTKSGSGGFYSTAEEAPDVILRLKDGMDTSQPSTNAVSAS 736
Query: 568 NLVRLASIV 576
NL RLA I+
Sbjct: 737 NLFRLALIL 745
>gi|167772692|ref|ZP_02444745.1| hypothetical protein ANACOL_04074 [Anaerotruncus colihominis DSM
17241]
gi|167665170|gb|EDS09300.1| hypothetical protein ANACOL_04074 [Anaerotruncus colihominis DSM
17241]
Length = 614
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 234/698 (33%), Positives = 336/698 (48%), Gaps = 102/698 (14%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFED A +LN F+SIKVDREERPD+D VYM QA+ G GGWPL++ ++P+ K
Sbjct: 1 MERESFEDAQAADVLNSGFISIKVDREERPDIDAVYMAVCQAMTGSGGWPLTILMTPEQK 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 147
P GTY P +YG+PG +L++V W +R+ L Q+G E + A
Sbjct: 61 PFWAGTYLPKYSRYGQPGLIDLLKRVSLLWRTEREQLLQAG-------DEIAAYIAQRGP 113
Query: 148 LPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
+ PQ A L A QL ++D GGFG APKFP P + ++ +++ ++
Sbjct: 114 GGAQAPQPALLHTAAGQLRAAFDPADGGFGDAPKFPSPHNLLFLMNYARW-------EKS 166
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
++ + M TL MA+GG+ D VGGGF RYS D RW PHFEKMLYD LA YLDAFS
Sbjct: 167 ADARSMAERTLTQMARGGLFDQVGGGFSRYSTDRRWLAPHFEKMLYDNALLAYAYLDAFS 226
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
F+ R LDY+ R++ P G + +DADS +EGA+Y+ T + VE
Sbjct: 227 QDGRPFWETTARRTLDYVLRELTSPEGAFYCGQDADSG-------GEEGAYYLLTPQSVE 279
Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 385
LG + A F Y + +GN F+G+++ L +++ G
Sbjct: 280 QALGAQDAARFCRWYGITESGN------------FEGRSIANLLENTAYEQEPEG----- 322
Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
G R +L D R R H DDKV+ +WN L+I++ ++A + L
Sbjct: 323 ----FGRLRERLLDFRRSRAALHRDDKVLTAWNALMIAALSKAYRTL------------- 365
Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 505
G R Y++ A AA+F+ +L RL +R+G + G LDDYAF LL+LY
Sbjct: 366 -GDAR--YLDAARRAAAFLHANLTGPDG-RLWLRWRDGEAANMGQLDDYAFYAWALLELY 421
Query: 506 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
L A+ + T F D + GG+F T + ++ R KE +DGA PSGN+ +
Sbjct: 422 AADFDAAHLEEAVSMMQTLQVHFWDGQEGGFFLTADDAERLITRPKEIYDGAMPSGNAAA 481
Query: 566 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC----AADMLSVPSR 621
+ L RL + + ++ A+ LA ++ A+ P C A PSR
Sbjct: 482 GLVLERLWKL---TGDPVWQTRADGQLAFLASK----ALPYPAGHCFSLLAMGEALYPSR 534
Query: 622 KHVVLVGHKSSVDFENMLAAAHAS--YDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR- 678
+ LV S + +LA A + L KT SN A + R
Sbjct: 535 E---LVCATSGTVPDGLLALAERRRLHTLIKT------------------PSNAALLERL 573
Query: 679 NNFSA------DKVVALVCQNFSCSPPVTDPISLENLL 710
F+A D + +CQN +C+ P +L LL
Sbjct: 574 APFTAAYPIPEDGALFYLCQNGACAAPAGSVQALVRLL 611
>gi|300113281|ref|YP_003759856.1| hypothetical protein Nwat_0572 [Nitrosococcus watsonii C-113]
gi|299539218|gb|ADJ27535.1| protein of unknown function DUF255 [Nitrosococcus watsonii C-113]
Length = 694
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 236/699 (33%), Positives = 367/699 (52%), Gaps = 70/699 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESFE+ A ++N+ F++IKVDREERPD+D++Y Q L G GGWPL
Sbjct: 53 SACHWCHVMAHESFENPETAAVMNEHFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPL 112
Query: 79 SVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
++FL P P GGTYFPPE+++G PGFK +L++V + + +R+++ QS + E
Sbjct: 113 TMFLEPVKQAPFFGGTYFPPEERHGLPGFKDLLQRVAEYFHTRREVI-QSQNERLLDAFE 171
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
L +S+ ++ + L + L+ +QL++++DSR+GGF APKFP P I+ L +
Sbjct: 172 KLDGRSSAAEV-EGLNRAPLQAAHQQLAQAFDSRYGGFRGAPKFPNPSIIERCLRDAHGE 230
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
T E + M TL+ MA+GGI+D +GGGF RYSVDE+W +PHFEKMLYD GQL
Sbjct: 231 HIT--EDEKQQALTMARLTLEQMAQGGIYDQLGGGFCRYSVDEKWRIPHFEKMLYDNGQL 288
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+Y DA+ L + + I + ++ R+M P G +S+ DADS EG EG F
Sbjct: 289 LVLYRDAYRLWGNGIFRRILEETGHWVVREMQSPEGGYYSSLDADS---EG----HEGKF 341
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT ++V +L + Y+ + P N F+G L A A
Sbjct: 342 YVWTREQVRALLDDEKYTLAVRYF----------SLDQPAN-FEGHWHLYAAMTPEALAE 390
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
++ +P L ++KLF R R RP DDK++ +WN L+I A A + L
Sbjct: 391 EMKVPAPGLQEQLTAAKQKLFAAREARIRPGRDDKILTAWNSLMIKGMAAAGQALAQ--- 447
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
PV ++ AE A F+R HL+ Q RL S+++G ++ G+LDDYAFL
Sbjct: 448 ------PV-------FIASAEKAVDFVRAHLW--QKGRLLVSYKDGRAQHQGYLDDYAFL 492
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LL+L + L +A++L F D+ GG++ T + +++ R D A
Sbjct: 493 LDALLELLQVRWRDGDLAFAVDLAEAVLGHFEDKAQGGFYFTADDHETLIHRPVPLMDNA 552
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCAADML 616
P+GN + +L+RL ++ + Y + AE++L A +E+ + L+ + L
Sbjct: 553 TPAGNGILAWSLLRLGHLLGEMR---YLKAAENTLKAAWESLQQTPHAHCSLLKALEEWL 609
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM-----DFWEEHNS 671
+ P + V+L G S + E+ A A A+Y + + I P + + + ++W + +
Sbjct: 610 TPP--QIVILRG--SGEELESWRAVAAAAYAPRRVTLAI-PLEAQYLPGILGEYWPQEAA 664
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
V A VC +CS P+T +L+ L
Sbjct: 665 --------------VTAYVCSGHTCSAPLTQREALKEHL 689
>gi|344344146|ref|ZP_08775011.1| hypothetical protein MarpuDRAFT_1824 [Marichromatium purpuratum
984]
gi|343804430|gb|EGV22331.1| hypothetical protein MarpuDRAFT_1824 [Marichromatium purpuratum
984]
Length = 683
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 220/613 (35%), Positives = 321/613 (52%), Gaps = 58/613 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESF D VA L+N FV+IKVDREERPD+D +Y Q L G GGGWPL
Sbjct: 58 SACHWCHVMAHESFADPEVATLMNRAFVNIKVDREERPDLDGLYQRAHQLLNGRGGGWPL 117
Query: 79 SVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
+VFLSP DL+P GTYFPP ++G P F +L V+ A+ ++ D + Q G E L E
Sbjct: 118 TVFLSPHDLRPFFAGTYFPPTPRHGLPAFTQLLAGVERAYREQHDKILQQG----ENLIE 173
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A A +N + QL+ S+D R GGFG APKFP E+ ++L + +
Sbjct: 174 AF-AGLEPEPGERPPERNLIGAALNQLAVSFDPRHGGFGGAPKFPHAPELALLLRCAARG 232
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ G+ +A E +M +L+ M + G++D +GGGF RY+VD +W +PHFEKMLYD L
Sbjct: 233 DRPGE--DAPEPLEMARVSLERMIRSGLNDQLGGGFCRYAVDAQWMIPHFEKMLYDNAAL 290
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ D + T + + D++ R+M P G +S+ DADS EG +EG F
Sbjct: 291 LALCCDLHACTGEQLFRSAAESTADWVLREMQSPEGGYYSSLDADS---EG----EEGRF 343
Query: 318 YVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y+W ++V +L E F Y L N F+G+ L +A A
Sbjct: 344 YLWEREQVRALLPEAEYRPFAAVYGLDRPPN------------FEGRWHLHGHLTPAAVA 391
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ G+ LE+ ++LG R LF R +R RP DDKV+ +WN L+I + ARA+++L
Sbjct: 392 AAQGLTLEQVQSLLGAARATLFAERERRVRPGRDDKVLGAWNALMIGAMARAARVL---- 447
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+R +Y+E AE A +R L+ + RL S R+G +LDD+A
Sbjct: 448 ------------ERDDYLESAEQALGCVRERLWRDG--RLLASCRDGRVAFDAYLDDHAL 493
Query: 497 LISGLLDLYEFGSGTKW----LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
L++ +L+L + T+W L +AIEL T F D E GG++ T + ++ R K
Sbjct: 494 LLATVLELLQ----TRWSSADLAFAIELAETLLARFHDPEAGGFWFTAHDHERLIHRTKP 549
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
D P+GN V+ + L RL +V + Y E +L + T ++ + A + CA
Sbjct: 550 LADETLPAGNGVAALALQRLGHLVGEPR---YLAAVESTLRLAATAMRRLPHAHATLLCA 606
Query: 613 ADMLSVPSRKHVV 625
D P + V+
Sbjct: 607 LDEWLDPPEQLVI 619
>gi|297202044|ref|ZP_06919441.1| transmembrane protein [Streptomyces sviceus ATCC 29083]
gi|297148022|gb|EDY58354.2| transmembrane protein [Streptomyces sviceus ATCC 29083]
Length = 570
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 210/576 (36%), Positives = 297/576 (51%), Gaps = 59/576 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A LLN+ FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 51 SSCHWCHVMAQESFEDQATADLLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPP + G P F+ +L V+ AW +RD +A+ + L+
Sbjct: 111 VFLTPDAEPFYFGTYFPPSPRQGMPSFRQVLEGVRAAWTDRRDEVAEVAGKIVRDLA-GR 169
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S ++ P E A L L++ YD++ GGFG APKFP + ++ +L H +
Sbjct: 170 EISYGDSQAPGEEQLAAALLG---LTREYDAQRGGFGGAPKFPPSMVVEFLLRHHAR--- 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG G +M T + MA+GGIHD +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 224 TGAEG----ALQMAQDTCERMARGGIHDQLGGGFARYSVDRDWIVPHFEKMLYDNALLCR 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T + D D++ R++ G SA DADS +G R EGA+YV
Sbjct: 280 VYAHLWRATGSDLARRVALDTADFMVRELRTAEGGFASALDADS--DDGTGRHVEGAYYV 337
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
WT +++ ++LGE A L +++ + G + G++VL + D+ A
Sbjct: 338 WTPEQLREVLGEQDAELAAQYFGVTEEGTFE-----------HGQSVLQLPQQDTVFDAE 386
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K + RR+L D R++RP P DDKV+ +WNGL I++ A
Sbjct: 387 K-----------VESIRRRLLDARAQRPAPGRDDKVVAAWNGLAIAALAETGAYF----- 430
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
DR + ++ A AA + R DEQ RL + ++G A G L+DYA
Sbjct: 431 -----------DRPDLVDAALGAADLLVRLHLDEQA-RLSRTSKDGQVGANAGVLEDYAD 478
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ G L L WL +A L + F E G F+T + ++ + D
Sbjct: 479 VAEGFLALASVTGEGVWLDFAGFLLDHVLTRFTGPE-GALFDTAADAERLIPPPQNPTDN 537
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
A PSG + + + S A + S+ +R+ AE +L
Sbjct: 538 AVPSGWTAAAPAPL---SYAAQTGSENHREGAEKAL 570
>gi|292493652|ref|YP_003529091.1| hypothetical protein Nhal_3684 [Nitrosococcus halophilus Nc4]
gi|291582247|gb|ADE16704.1| protein of unknown function DUF255 [Nitrosococcus halophilus Nc4]
Length = 694
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 234/691 (33%), Positives = 349/691 (50%), Gaps = 70/691 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESFE +A +N+ F++IKVDREERPD+D++Y Q L G GGWPL
Sbjct: 53 SACHWCHVMAHESFESPEIAAAMNEHFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPL 112
Query: 79 SVFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
++FL P+ + P GGTYFPPE ++G PGFK +L ++ + + R+ + + + E
Sbjct: 113 TMFLEPENQVPFFGGTYFPPEGRHGLPGFKDLLERIAEFFHAHREEIQSQNSRLLAAFEE 172
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ +++ P+ L L+ +QL++S+D R+GGF APKFP P I+ L + +
Sbjct: 173 LDTRTSAVE--PEMLGPAPLKAAQQQLAQSFDPRYGGFKGAPKFPNPSSIERCL---RDV 227
Query: 198 EDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
S EA + + TL+ MA+GGI+D +GGGF RY+VD +W +PHFEKMLYD GQ
Sbjct: 228 RGEHLSAEARQKALDLARLTLEQMAQGGIYDQLGGGFCRYAVDSQWRIPHFEKMLYDNGQ 287
Query: 257 LANVYLDAFSLTKDVFYSYICRDILD----YLRRDMIGPGGEIFSAEDADSAETEGATRK 312
L +Y DA+ L + S CR +L+ + R+M P G +S+ DADS EG
Sbjct: 288 LLALYADAYEL----WGSERCRRVLEETGHWAIREMQSPEGGYYSSLDADS---EG---- 336
Query: 313 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
+EG FYVWT ++V+ +L E Y+ + P N F+G L
Sbjct: 337 REGKFYVWTREQVQALLEEDEYPLVARYF----------GLDQPAN-FEGHWHLYGAITP 385
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
A A +L + L ++KLF R +R RP DDK++ SWNGL+I A A + L
Sbjct: 386 EALAQELNLSPRILEETLATAKQKLFAAREERIRPGRDDKILTSWNGLMIKGMAAAGQAL 445
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
A ++ AE A F+R HL+ E RL S+++G + PG+LD
Sbjct: 446 AEPA----------------FIASAERALDFVRGHLWREG--RLLVSYKDGRVQHPGYLD 487
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYAFL+ LL L + L +A+EL F D GG++ T + +++ R
Sbjct: 488 DYAFLLDALLALLQARWREGDLAFAVELAEAALAHFEDPAQGGFYFTADDHETLIHRPVP 547
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCC 611
D A P+GN V +L RL ++ + Y + AE +L ++ A L+
Sbjct: 548 LMDNATPAGNGVLAWSLQRLGHLLGEMR---YLKAAERTLKASWASIQHTPHAHCSLLKT 604
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
+ L P + V+L G + ++ + A A Y + + I D W
Sbjct: 605 LEEWLYPP--QMVILRGPEENLG--SWRAIATGEYAPRRVSLAIPKGAR---DLW----- 652
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ D+V A VC +CSPP+T
Sbjct: 653 --GQLEEYRPEGDRVTAYVCSGHTCSPPLTQ 681
>gi|313675015|ref|YP_004053011.1| hypothetical protein Ftrac_0901 [Marivirga tractuosa DSM 4126]
gi|312941713|gb|ADR20903.1| hypothetical protein Ftrac_0901 [Marivirga tractuosa DSM 4126]
Length = 675
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 229/690 (33%), Positives = 336/690 (48%), Gaps = 87/690 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVME ESFEDE VAK++N+ ++ IK+DREERPD+D++YM +Q + GGWPL+V
Sbjct: 51 ACHWCHVMEHESFEDEEVAKVMNENYICIKLDREERPDIDQIYMDAIQTMGLHGGWPLNV 110
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL P+ KP GGTYFP + + IL KV A+ R+ L +S + ++AL+
Sbjct: 111 FLIPNQKPFYGGTYFP------KNKWLEILDKVAIAFQSSRNQLEESA----NKFAQALN 160
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSY-------DSRFGGFGSAPKFPRPVEIQMML-- 191
A+ L NA ++ LS++Y D GG APKFP PV Q ++
Sbjct: 161 AADGEKLSLGAL--NAENFNSKILSEAYQKLGSFLDWDNGGTLGAPKFPMPVIWQFLMKY 218
Query: 192 -YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
+HS+ E +K + FTL +A GGI+D +GGGF RYSVD W PHFEKM
Sbjct: 219 AFHSQN----------PEAKKALEFTLTSLADGGIYDQIGGGFARYSVDAEWFAPHFEKM 268
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
LYD GQL ++Y DAF TK+ ++ I D + + R+++ P +SA DADS EG
Sbjct: 269 LYDNGQLISLYADAFRFTKNPYFKEIFEDSIRFSAREIMDPYCRFYSALDADS---EG-- 323
Query: 311 RKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EG FY WT E+E ILG+ A + Y GN + G+N+L +
Sbjct: 324 --EEGKFYTWTYTELEQILGDKAEPILKFYNATEKGNWE-----------NGRNILFRHS 370
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
+ EK+ L E + L D R R RP +DDK++ WN L + A K
Sbjct: 371 SIEDFCKAEKIDQEKFKAQLIEAKDSLLDAREDRVRPAMDDKILTGWNALQMKGICDAYK 430
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+ K+Y +A+ F+ ++D ++L SF+N K +
Sbjct: 431 AYQD----------------KKYKAIAQDNFVFLSEFVWD--GNQLFRSFKNEQPKIKAY 472
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYA I + L+E S +K L +A +L N + F D + +F T ++ R
Sbjct: 473 LEDYALAIQASISLFEISSDSKALDFAEKLTNYAIQNFYDEKEKLFFYTDKSSEKLIARK 532
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
KE D P+ NSV + NL L I+ G+ S + + +E L + L +
Sbjct: 533 KEIFDNVIPASNSVMIENLHWLG-ILKGNSS--FTEISEQMLKQIQHLLPREPKFLANYA 589
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A + + S +V+VG K++ L S+ L T I P ++++ W+
Sbjct: 590 SAYALKAFRSY-DIVIVGTKAT-----ELQKELWSHYLPNTFIMAIPEESKDQLVWKGKE 643
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPV 700
N K VC+N +C PV
Sbjct: 644 IINT----------KTTIYVCENNACQQPV 663
>gi|456389199|gb|EMF54639.1| hypothetical protein SBD_4307 [Streptomyces bottropensis ATCC
25435]
Length = 686
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 230/698 (32%), Positives = 331/698 (47%), Gaps = 76/698 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED A+ LN FV+IKVDREERPDVD VYM VQA G GGWP++
Sbjct: 52 SSCHWCHVMAHESFEDGETAEYLNAHFVNIKVDREERPDVDAVYMEAVQAATGQGGWPMT 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPP ++G P F+ +L V+ AW +RD +A+ + L+
Sbjct: 112 VFLTPDGEPFYFGTYFPPAPRHGMPSFRQVLEGVRAAWADRRDEVAEVAGKIVRDLAGRE 171
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L +A DEL Q L L++ YD+ GGFG APKFP + I+ +L H+ +
Sbjct: 172 LKFAAVDVPGEDELAQALL-----GLTREYDAARGGFGRAPKFPPSMVIEFLLRHAAR-- 224
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 225 -TGSEG----ALQMARDTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 279
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS + G + EGA+Y
Sbjct: 280 RVYAHLWRATGSELARRVALETADFMVRELRTNEGGFASALDADSDDGTGTGKHVEGAYY 339
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VWT +++ ++LGE H++ + E AS
Sbjct: 340 VWTPEQLTEVLGEEDARLAAHHF-----------------------GVTEEGTFEEGASV 376
Query: 379 LGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
L +P + + + + R +L R +RP P DDKV+ +WNGL +++ A
Sbjct: 377 LQLPQREGVFDADKIESIRERLLAARVRRPAPGRDDKVVAAWNGLAVAALAET------- 429
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDY 494
A F+ P +A +R HL DE+ RL + ++G A G L+DY
Sbjct: 430 --GAYFDRP------DLVDAAIAAADLLVRLHL-DERA-RLARTSKDGRVGANAGVLEDY 479
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A + G L L WL +A L + F+D E G ++T + ++ R ++
Sbjct: 480 ADVAEGFLALASVTGEGVWLEFAGFLLDHVLVRFVDEESGALYDTASDAEKLIRRPQDPT 539
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D A PSG S + L A + S+ +R AE +L V + + A+
Sbjct: 540 DNATPSGWSAAAGA---LLGYAAHTGSEPHRTAAERALGVVKALGPRAPRFIGWGLATAE 596
Query: 615 MLSVPSRKHVVL--VGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
L R+ VL GH + + A V+ + P D++E+
Sbjct: 597 ALLDGPREVAVLGPQGHPGTRELHRTALLGTAP----GAVVAVGPPDSDELPL------- 645
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC+NF+C P TD L L
Sbjct: 646 ---LADRPLVGGEPTAYVCRNFTCDAPTTDVDRLRTAL 680
>gi|46198930|ref|YP_004597.1| hypothetical protein TTC0622 [Thermus thermophilus HB27]
gi|46196554|gb|AAS80970.1| hypothetical conserved protein [Thermus thermophilus HB27]
Length = 642
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 219/591 (37%), Positives = 300/591 (50%), Gaps = 83/591 (14%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESF+DE VA+LLN FV +KVDREERPDVD YM + +L G GGWP+S+
Sbjct: 49 SCHWCHVMHRESFQDEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSL 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ KP GGTYFP ED+ G PGFK +L V +AW KR+ + + E+L+ AL
Sbjct: 109 FLTPEGKPFFGGTYFPKEDRMGLPGFKRVLVAVAEAWAGKREAILEEA----ERLTRALW 164
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S + P LP+ A + L +++D +GGF APKFP+ + +L + + E+
Sbjct: 165 KSLTPP--PGPLPEGAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE- 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+++ TL+ MA GG++D VGGGFHRYSVD W +PHFEKMLYD LA V
Sbjct: 222 -------RAARLLRPTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARV 274
Query: 261 YLDAFSLTKDVFYSYICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
YL A+ L + + + R+ LD+L RR+ G +A D AE+EG +EG
Sbjct: 275 YLGAYKLFGEDLFLRVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEGR 322
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+Y W E+ + LGE L + ++ L DL ++VL ++ A
Sbjct: 323 YYTWAEVELREALGEDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEAR- 367
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
LG E + R KL R +R P LDDKV+ W+ L + + A A ++ E
Sbjct: 368 KVLG---EGFFAWREGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEE- 423
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y+E A A F+ H+Y E L+H++R G +L D AF
Sbjct: 424 ---------------RYLEAARRGARFLLAHMYREGL--LRHTWR-GSLGEEAYLSDQAF 465
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+LY +L WA L LF REG PS+ L KE +G
Sbjct: 466 AALAFLELYAATGEWPYLDWAQRLAEAGWRLF--REG----------PSLPLPAKEVEEG 513
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
A PSG S LVRL ++ G YR+ AE LA L A+P
Sbjct: 514 ALPSGESALAEALVRLGAVFGGD----YRERAEEVLAEKARWLARYPHALP 560
>gi|296445985|ref|ZP_06887935.1| protein of unknown function DUF255 [Methylosinus trichosporium
OB3b]
gi|296256503|gb|EFH03580.1| protein of unknown function DUF255 [Methylosinus trichosporium
OB3b]
Length = 679
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 226/698 (32%), Positives = 340/698 (48%), Gaps = 76/698 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE++ +A L+N F+++KVDREERPD+D +Y +Q L GGWPL++F
Sbjct: 52 CHWCHVMAAESFENDRIAALMNANFINVKVDREERPDIDHLYQQALQMLGRRGGWPLTMF 111
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS-GAFA--IEQLSEA 138
L+PD +P GGTYFPPE ++G PGF IL+ V + W +K ++ ++ GA A +++L+E+
Sbjct: 112 LTPDGEPFWGGTYFPPEPRHGMPGFADILQAVAELWREKPAVVTRNVGAIANGLDRLAES 171
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A S L L E+L + D GG APKFP+P ++ + K
Sbjct: 172 APAEPISPVL--------LETITERLEELIDREHGGIRGAPKFPQPPSLEFLWRAWK--- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++G AS ++ VL TL + +GGI+DH+GGGF RYS DERW PHFEKMLYD GQL
Sbjct: 221 ---RTGRASL-REAVLTTLDHICQGGIYDHIGGGFARYSTDERWLAPHFEKMLYDNGQLV 276
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+ + + Y+ + +D+ R+M P G S+ DADS +EG FY
Sbjct: 277 ELLTLVWQDERKPLYAARVEETIDWALREMRLPEGVFASSLDADS-------EHEEGKFY 329
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VW++ E++ LGE A F+ Y + GN + E N L+E+ SA A
Sbjct: 330 VWSAAEIDAALGERAGAFRAAYDVTEAGNWE---------EKNIPNRLLEMALGSAEAEA 380
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
L L E R RP DDK + WNGL+I++ A A++
Sbjct: 381 ALAADRAALLALRETRV----------RPGRDDKALADWNGLMIAALAAAAQA------- 423
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
R +++ VA +A FI + RL HS+R G +K LDDYA L
Sbjct: 424 ---------FARPDWLAVATAAFDFIATSMTTADG-RLLHSYRAGRAKHMAVLDDYADLC 473
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L L+E +L E + + D GGYF T + +++ R K D
Sbjct: 474 RAALTLHEATGDDAYLTRCREWAEIVETHYRD-PAGGYFFTADDAEALIRRAKIAEDAPL 532
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN L RL + + YR+ AE +L F ++ + + A++L
Sbjct: 533 PSGNGAMTQVLARLYHLTGETA---YRERAEATLTAFAGTVRRGLLGYSTLLSGAEILR- 588
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
+V++G +++ D +L H + ++++ P D H + +
Sbjct: 589 -DGLQIVIIGARAAEDTAALLRVLHETSLPGRSLLVAAPGAALPPD----HPAAGKTQVD 643
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+ A +C+ +CS P+ +P SL L +P +
Sbjct: 644 G-----RAAAYMCRGTTCSLPIVEPASLALALRGEPQT 676
>gi|381190578|ref|ZP_09898097.1| hypothetical protein RLTM_06066 [Thermus sp. RL]
gi|384431187|ref|YP_005640547.1| tmk1; thymidylate kinase [Thermus thermophilus SG0.5JP17-16]
gi|333966655|gb|AEG33420.1| tmk1; thymidylate kinase [Thermus thermophilus SG0.5JP17-16]
gi|380451573|gb|EIA39178.1| hypothetical protein RLTM_06066 [Thermus sp. RL]
Length = 642
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 218/591 (36%), Positives = 302/591 (51%), Gaps = 83/591 (14%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESF+DE VA+LLN FV +KVDREERPDVD YM + +L G GGWP+S+
Sbjct: 49 SCHWCHVMHRESFQDEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSL 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ KP GGTYFP ED+ G PGFK +L V +AW KR+ + + E+L+ AL
Sbjct: 109 FLTPEGKPFFGGTYFPKEDRMGLPGFKRVLVAVAEAWAGKREAVLEEA----ERLTRALW 164
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S + P LP+ A + L +++D +GGF APKFP+ + +L + + E+
Sbjct: 165 KSLTPP--PGPLPEGAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE- 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+++ TL+ MA GG++D VGGGFHRYSVD W +PHFEKMLYD LA V
Sbjct: 222 -------RAARLLRPTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARV 274
Query: 261 YLDAFSLTKDVFYSYICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
YL A+ L + + + R+ LD+L RR+ G +A D AE+EG +EG
Sbjct: 275 YLGAYKLFGEDLFLRVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEGR 322
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+Y WT E+ + LGE L + ++ L DL ++VL ++
Sbjct: 323 YYTWTEAELREALGEDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEVRE 368
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ LG E + R KL R +R P LDDKV+ W+ L + + A A ++ EA
Sbjct: 369 A-LG---EGFFAWREGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEEA 424
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
Y+E A+ A F+ H+Y + L+H++R G +L D AF
Sbjct: 425 ----------------YLEAAKRGARFLLAHMY--RGGLLRHTWR-GSLGEEAYLSDQAF 465
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+LY +L WA LF REG PS+ L KE +G
Sbjct: 466 AALAFLELYAATGEWPYLDWAQRFAEAGWRLF--REG----------PSLPLPAKEVEEG 513
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
A PSG S LVRL ++ G YR+ AE LA L A+P
Sbjct: 514 ALPSGESALAEALVRLGAVFGGD----YRERAEEVLAEKARWLARYPHALP 560
>gi|289769445|ref|ZP_06528823.1| conserved hypothetical protein [Streptomyces lividans TK24]
gi|289699644|gb|EFD67073.1| conserved hypothetical protein [Streptomyces lividans TK24]
Length = 680
Score = 326 bits (836), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 234/701 (33%), Positives = 337/701 (48%), Gaps = 72/701 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED A+ LN FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SACHWCHVMAHESFEDGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPPE ++G P F+ +L+ V+ AW ++RD + + + L+
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGMPSFRQVLQGVRQAWAERRDEVDEVAGKIVRDLAGRE 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+S + ++L Q L L++ YD R GGFG APKFP + I+ +L H +
Sbjct: 168 ISYGDAEAPGEEQLGQALL-----GLTREYDERRGGFGGAPKFPPSMVIEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGAEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS +G + EGA Y
Sbjct: 276 RVYAHLWRATGSDLARRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAHY 333
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
VWT ++ ++LG E A L +++ + G + G +VL + +S A
Sbjct: 334 VWTPAQLTEVLGAEDAELAAQYFGVTQEGTFE-----------HGASVLQLPQQESVFDA 382
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
++ + R +L R RP P DDKV+ +WNGL I++ A
Sbjct: 383 AR-----------IASVRERLLAARDGRPAPGRDDKVVAAWNGLAIAALAET-------- 423
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
A F P +A +R HL DEQ RL + ++G + A G L+DYA
Sbjct: 424 -GAYFERP------DLVEAAVAAADLLVRLHL-DEQV-RLTRTSKDGRAGANAGVLEDYA 474
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + F D E G ++T + ++ R ++ D
Sbjct: 475 DVAEGFLALASVTGEGVWLDFAGFLLDHVLTRFTD-ESGSLYDTAADAERLIRRPQDPTD 533
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A PSG S + L+ S A + S +R AE +L V + + + AA+
Sbjct: 534 NATPSGWSAAAGALL---SYAAHTGSAPHRAAAERALGVVKALGPRVPRFIGWGLAAAEA 590
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L R+ V+ + A+ L++T + + A + F E +
Sbjct: 591 LLDGPREVAVVAPDPAD----------PAARGLHRTAL-LGTAPGAVVAFGTEGSDEFPL 639
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+A A VC+NF+C P TDP L L P+
Sbjct: 640 LADRPLVGGAPAAYVCRNFTCDAPTTDPDRLRTALGVAPTG 680
>gi|375097065|ref|ZP_09743330.1| thioredoxin domain containing protein [Saccharomonospora marina
XMU15]
gi|374657798|gb|EHR52631.1| thioredoxin domain containing protein [Saccharomonospora marina
XMU15]
Length = 673
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 226/693 (32%), Positives = 327/693 (47%), Gaps = 78/693 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ A +N FV+IKVDREERPD+D VYMT QA+ G GGWP++ F
Sbjct: 49 CHWCHVMAHESFEDDETAAFMNAHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD KP GTY+PP ++G P F+ +L V AW ++ D L Q + + E +
Sbjct: 109 LTPDGKPFHCGTYYPPTPRHGMPSFRQVLTAVARAWSERADELRQGATKIVSHIQEQTAP 168
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
A + + A+ L D GGFG APKFP + ++ +L H E TG
Sbjct: 169 LAQR-----PVDEEAIATAVSTLRGQIDPGHGGFGGAPKFPPAMVMEFLLRH---YERTG 220
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++E +V T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L Y
Sbjct: 221 ----SAEALSVVELTAEGMARGGIYDQLAGGFARYSVDAAWVVPHFEKMLYDNALLLRCY 276
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + + ++L RD+ G ++ DAD TEG EG YVWT
Sbjct: 277 AHLARRTSSALATRVAAETAEFLLRDLRTQEGGFAASLDAD---TEGV----EGLTYVWT 329
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
++ ++LG + + R+++ G + L D +A
Sbjct: 330 PAQLVEVLGPEDGSWAAEVF----------RVTEEGTFEHGASTLQLPRDPDETA----- 374
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
++L + L + R+ RP+P DDKV+ +WNGL I++ A A L
Sbjct: 375 ---RWLRV----STALLEARNGRPQPSRDDKVVTAWNGLAITALAEAGVAL--------- 418
Query: 442 NFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLIS 499
+R +++E A SAA + RHL D RL+ S R G +A G L+DYA L
Sbjct: 419 -------ERPDWVEAAVSAAELLLDRHLVDA---RLRRSSRGGVVGEAAGVLEDYACLAE 468
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAE 558
GLL +++ + WL A L +T ELF D E G F+ T D L+ R + D A
Sbjct: 469 GLLAVHQASGESVWLTQATLLLDTALELFSDDELPGAFHDTAADAEALVHRPSDPTDNAT 528
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA-MAVPLMCCAADMLS 617
PSG S L+ +++ ++ YRQ E +L T + A + A +L+
Sbjct: 529 PSGASALAGALLTASALAGPDRAGEYRQACERALDRAGTIVAQAPRFAGHWLSVAEALLA 588
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
P + V +VG ++ + ++ AA + V+ P + + +A
Sbjct: 589 GPVQ--VAVVGPDAAARSDLLVEAAREVH--GGGVVLAGPPEAGGVPL----------LA 634
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A VC + C PVT P L L
Sbjct: 635 DRPLVDGNAAAYVCHGYVCERPVTTPQRLAAAL 667
>gi|288932323|ref|YP_003436383.1| hypothetical protein Ferp_1971 [Ferroglobus placidus DSM 10642]
gi|288894571|gb|ADC66108.1| protein of unknown function DUF255 [Ferroglobus placidus DSM 10642]
Length = 628
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 205/611 (33%), Positives = 314/611 (51%), Gaps = 73/611 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM + FE+E +AK++N+ FV++KVDR+ERPD+D+ Y +V A G GGWPL+VF
Sbjct: 50 CHWCHVMAKKCFENEDIAKIINENFVAVKVDRDERPDIDRRYQEFVFATTGTGGWPLTVF 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD +P GGTYFPPED +G GFKT+L K+ + W+K R+ L +S +E L +
Sbjct: 110 LTPDGEPFFGGTYFPPEDGFGMIGFKTLLLKISEMWEKDRESLLKSAKQIVESLKKFSER 169
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLED 199
SSN L + ++ + + D GG G APKF +++L Y+ K ED
Sbjct: 170 DFSSN-FDFTLIEKGIKAVLDNM----DYVNGGIGRAPKFHHAKAFELLLTHYYFTKDED 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
K+ E TL MAKGG++D + GGF RYS D+RWHVPHFEKMLYD +L
Sbjct: 225 LIKAVE---------LTLDAMAKGGVYDQLIGGFFRYSTDDRWHVPHFEKMLYDNAELLK 275
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y A+ +TK Y + + I+DY R+ + G ++++DAD E E EG +Y+
Sbjct: 276 LYTIAYQITKKELYRKVAKGIVDYYRKFGVDERGGFYASQDADIGELE------EGGYYI 329
Query: 320 WTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
++ +E++++L + Y+ L+ +GKNVL D + +
Sbjct: 330 FSLEEIKEVLNDEEFRIASLYFGLR-----------------EGKNVLHVSLDENEISEI 372
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
LG+P+ + I+ + KL +VR +R P +D + +WNGL+I + K
Sbjct: 373 LGIPVRRVKEIIESAKEKLLEVRERRETPFIDKTIYTNWNGLMIEAMCDYYK-------- 424
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
FN P +EVAE + R L L H+ GF +DY F
Sbjct: 425 -SFNDPWA-------VEVAEKSGE---RLLKFWDGDVLLHT-----DDVEGFSEDYIFFA 468
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGA 557
GL+ L+E K+L A+E+ +LF D + GG+F+ +L L+VK+ D
Sbjct: 469 KGLIALFEITQKGKYLNAAVEITKRAVDLFWDHKRGGFFDRKSSGNGLLSLKVKDIQDSP 528
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCA 612
+ S N ++ + L L+S+ ++ + A+ SL F L+ + P L
Sbjct: 529 QQSVNGIAPLLLTTLSSVTG---TEEFGALAKKSLRAFAGILEKYPLISPSYMISLYAYI 585
Query: 613 ADMLSVPSRKH 623
+ V +R+H
Sbjct: 586 RGIYLVKTRRH 596
>gi|354612894|ref|ZP_09030833.1| thioredoxin domain protein [Saccharomonospora paurometabolica YIM
90007]
gi|353222771|gb|EHB87069.1| thioredoxin domain protein [Saccharomonospora paurometabolica YIM
90007]
Length = 667
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 232/698 (33%), Positives = 336/698 (48%), Gaps = 91/698 (13%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF D A +N+ FV+IKVDREERPD+D VYMT QA+ G GGWP++ F
Sbjct: 49 CHWCHVMAHESFSDADTAAYMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD +P GTY+PP K+G P F +L V AW ++RD L + + ++E
Sbjct: 109 LTPDGEPFHCGTYYPPVSKHGLPSFVQVLTAVTQAWTERRDELVEGAGRIVTHIAE--QT 166
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
S DE AL +L + D GGFG+APKFP + ++ +L H ++ TG
Sbjct: 167 GPLSEHPVDE---QALSSAVAKLRQEADPANGGFGTAPKFPPSMVLEFLLRHHER---TG 220
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++E +V T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L Y
Sbjct: 221 ----SAEALSLVELTAERMARGGIYDQLGGGFARYSVDVAWVVPHFEKMLYDNALLLRAY 276
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + + ++L RD+ G ++ DAD+ EG T YVWT
Sbjct: 277 AHLARRTGSAIATRVAGETAEFLLRDLRTAEGGFAASLDADTDGVEGLT-------YVWT 329
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+++ ++LG E E + + G + KG + L +D A
Sbjct: 330 PEQLVEVLGPEDGAWAAELFGVTEEGTFE-----------KGASTLRLPHDPDDPA---- 374
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
++L + LF R RP+P DDKVI +WNGL I++ A A L+
Sbjct: 375 ----RWLRV----STALFQARGTRPQPARDDKVIAAWNGLAITALAEAGTALR------- 419
Query: 441 FNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLI 498
R E+++ A SA ++ + RHL D RL+ S RNG A G L+D+ L
Sbjct: 420 ---------RPEWVDAAVSAGAYLLDRHLVD---GRLRRSSRNGEVGAANGVLEDHGCLA 467
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGA 557
GLL L++ + WL+ A L + E F + G F+ T +D L+ R + D A
Sbjct: 468 DGLLALHQATGESVWLLEATRLLDIARERFAVADTPGAFHDTADDAEALVHRPSDPTDNA 527
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCA 612
PSG S L+ +++V K+ YR AE ++ +R + VP + A
Sbjct: 528 SPSGASTVAGALLTASALVGPEKASDYRAAAEQAV----SRAGALVAQVPRFAGHWLSVA 583
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
M + P + V +VG + E + AAH + V+ P ++E + +
Sbjct: 584 EAMAAGPVQ--VAVVGPDAEARSELLSTAAHDVH--GGGVVLGGPPESEGVPLLADRPLV 639
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ S A A VC + C PVT + E LL
Sbjct: 640 DGSAA----------AYVCHGYVCDRPVT---TTEELL 664
>gi|82701479|ref|YP_411045.1| hypothetical protein Nmul_A0345 [Nitrosospira multiformis ATCC
25196]
gi|82409544|gb|ABB73653.1| Protein of unknown function DUF255 [Nitrosospira multiformis ATCC
25196]
Length = 700
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 228/705 (32%), Positives = 340/705 (48%), Gaps = 81/705 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM E FED VA+++N +F++IKVDREERPD+D++Y T + L GGWPL
Sbjct: 48 SACHWCHVMAHECFEDAEVAEVMNRYFINIKVDREERPDIDQIYQTALYMLTQRSGGWPL 107
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
++FL+PD KP GGTYFP ++ PGF +L +V + + +R + + A ++ +
Sbjct: 108 TLFLTPDQKPFFGGTYFPKTPRHSLPGFLDLLPRVAETYRVRRPEIERQSASLLKSFANM 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L + A + E P L +L +DS GGFG PKF E+ L ++
Sbjct: 168 LPSKAPEAPVFSERP---LEQALAELKNRFDSENGGFGEPPKFLHLTELDFCL---RRYF 221
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G S E M TL+ MA+GGI+D VGGGF+RYS D++W +PHFEKMLYD G L
Sbjct: 222 TAGNS----EALHMATLTLEKMAEGGIYDQVGGGFYRYSTDKQWQIPHFEKMLYDNGPLL 277
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIG--------PGGEIFSAEDADSAETEGAT 310
++Y DA+ + + ++ I + ++ R+M G +S DADS
Sbjct: 278 HLYADAWIASGNPLFARIVEETATWVMREMQPEYEENEKRTGAGYWSTLDADSENV---- 333
Query: 311 RKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
EG FYVW E IL + +Y LS+ ++ N + V L
Sbjct: 334 ---EGKFYVWDRSEASHILSRREYVVAASHY-------GLSQPANFGNRYWHLAVAQSLP 383
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
+ A G+ + L R+KL R R RP D+K++ SWNGL+I ARA +
Sbjct: 384 E---IAENFGVTYAEARQWLESGRKKLLAQRQCRVRPGRDEKILTSWNGLMIKGMARAGR 440
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+ R +++ A A FIR L+ + RL ++++G ++ +
Sbjct: 441 VF----------------GRDDWVRSAICAVDFIRSTLW--KNGRLLATWKDGNARLNAY 482
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
LDDYAFL+ GLL+L + L +AI L + F D+E GG+F T+ + +++ R
Sbjct: 483 LDDYAFLLDGLLELMQTTFRPVDLDFAIALAEVLLDQFEDKEAGGFFFTSHDHENLIHRP 542
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 610
K +D A PSGN V+ L R+ ++ + Y Q AE +L +F L + P C
Sbjct: 543 KPGYDNATPSGNGVAAHTLQRMGYLLGEFR---YLQAAERALRLFYPAL----LRHPDSC 595
Query: 611 C----AADMLSVPSRKHVVLVGHKSSVDFENML-AAAHASYDLNKTVIHIDPADTEEMDF 665
C A + P ++ + +EN L + L V + PA
Sbjct: 596 CSLLLALEQWLTPPPVVILRGKAEPMAKWENALRQRVPIALVLALPVERVTPA------- 648
Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ S+A+ S V A VC C P VTD L+ LL
Sbjct: 649 -----ALPPSLAKPVPSGMGVNAWVCHGVKCLPEVTD---LQELL 685
>gi|299133196|ref|ZP_07026391.1| protein of unknown function DUF255 [Afipia sp. 1NLS2]
gi|298593333|gb|EFI53533.1| protein of unknown function DUF255 [Afipia sp. 1NLS2]
Length = 683
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 238/715 (33%), Positives = 349/715 (48%), Gaps = 110/715 (15%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFEDE A ++N+ FV IKVDREERPD+D++YM + L GGWPL++F
Sbjct: 56 CHWCHVMAHESFEDETTAAVMNELFVPIKVDREERPDIDQIYMNALHLLGEQGGWPLTMF 115
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P+ GGTYFP +YGR F +LR++ + + D +A + A + LS+ SA
Sbjct: 116 LTPDGAPVWGGTYFPKTAQYGRAAFVEVLRELARIFRDEPDKIAANKAAIEKSLSQRSSA 175
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
A+S L N L A ++++ D GG APKFP+ LE
Sbjct: 176 DAASIGL------NELDNAAGSIARATDPTNGGLRGAPKFPQ----------CSMLEFLW 219
Query: 202 KSGEASEGQKMVLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++G + ++ + T L M++GGI+DH+GGG+ RYSVD RW VPHFEKMLYD Q+
Sbjct: 220 RAGARTGDERYFITTNLALTQMSQGGIYDHLGGGYARYSVDARWLVPHFEKMLYDNAQIL 279
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++ + + Y + + +L+R+M+ G S+ DADS EG +EG FY
Sbjct: 280 DMLALEHARAPNELYRQRAEETVGWLKREMLTKEGGFASSLDADS---EG----EEGKFY 332
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VW+ ++ +LG + A F Y + GN F+G N+L L+D S +A+
Sbjct: 333 VWSQADIAHLLGPDDATFFAAKYGVSAEGN------------FEGHNILNRLDDGSETAT 380
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ L R LF R KR P LDDKV+ WNGL I++
Sbjct: 381 E--------AEQLAALRAILFRAREKRVHPGLDDKVLADWNGLTIAA---------LAHA 423
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ FN R +++ +A +A F+ + + RL HS+R G P D+A +
Sbjct: 424 ANAFN-------RPDWLTLATTAFGFVTTTM--SRRDRLGHSWRAGKLLQPALASDHAAM 474
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I L LYE +L AI Q D + D + GGYF T+ + ++LR D A
Sbjct: 475 IRAALALYEATGDHLFLDQAILWQADLDTHYGDPQHGGYFLTSDDAEGLILRPHSTVDDA 534
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-- 615
P+ ++ NL RLA + + + RQ DM L AA+M
Sbjct: 535 IPNHVGLTAQNLARLAVLTGDER--WRRQ-------------LDMLFKHMLPVAAANMFG 579
Query: 616 -LSVPSRKHVVLVGHKSSVD-----FENMLAAAHASYDLNKTVIHI-DPADTEEMDFWEE 668
LS+ + + L G + V E +L AA A V+ + DP
Sbjct: 580 HLSLLNALDLYLAGSEIVVTGQGEGVEALLKAARALPHATTIVLRVPDP----------- 628
Query: 669 HNSNNASMARNNFSADKV-----VALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
A + ++ +ADKV A VC+ +CS PVT+P +L L+L + +S+A
Sbjct: 629 -----AKLPPHHPAADKVAPGGGAAFVCRGQTCSLPVTEPDALTALVLREDASSA 678
>gi|452958537|gb|EME63890.1| hypothetical protein H074_04714 [Amycolatopsis decaplanina DSM
44594]
Length = 688
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 240/716 (33%), Positives = 333/716 (46%), Gaps = 97/716 (13%)
Query: 10 TKTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 65
K R L++ CHWCHVM ESFEDE A L+N FV+IKVDREERPD+D VYM
Sbjct: 54 AKRRNVPILLSVGYAACHWCHVMAHESFEDEATATLMNANFVNIKVDREERPDIDSVYMA 113
Query: 66 YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA 125
QA+ G GGWP++ FL+P+ +P GTY+PP + G P F +L V +AWD++ L
Sbjct: 114 ATQAMTGQGGWPMTCFLTPEGEPFHCGTYYPPSPRPGMPSFSQLLVAVAEAWDERPGELR 173
Query: 126 QSGAFAIEQLSEALSASASSNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRP 184
I L+E S LP+ + A L L K YD+ GGFG APKFP
Sbjct: 174 SGARQIIAHLTE------KSGPLPESVVDGAVLESAVASLRKEYDAENGGFGGAPKFPPT 227
Query: 185 VEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 244
+ + +L H ++ TG G MV T + MA GG++D + GGF RYSVD RW V
Sbjct: 228 MALNFLLRHHER---TGS------GLSMVEHTAEAMALGGLNDQLAGGFARYSVDARWEV 278
Query: 245 PHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA 304
PHFEKMLYD G L Y +T + + ++L RD+ G ++ DAD+
Sbjct: 279 PHFEKMLYDNGLLLRFYARFHGVTGYEYARRTVEETAEFLLRDLGTAEGGFAASLDADTD 338
Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGN----CDLSRMSDPHNE 359
EG T YVWT ++ ++LGE E + + GN R+ +PH E
Sbjct: 339 GVEGLT-------YVWTPAQLAEVLGEEDGAWAAELFQVAEPGNFEHGASTLRLREPHPE 391
Query: 360 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 419
E+Y + RR L R +RP+P DDKVI +WNG
Sbjct: 392 DA----------------------ERYERV----RRALLAARGQRPQPARDDKVIAAWNG 425
Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR-RHLYDEQTHRLQH 478
L I +FA A L R ++++ A AA+F+ +H D RL+
Sbjct: 426 LAIGAFANAGSRLG----------------RPQWIDAATRAAAFLMDKHFVD---GRLRR 466
Query: 479 SFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+ R+G G L+DYA L GLL+L++ +WL AI L + F + G +
Sbjct: 467 TSRDGVVGTTAGVLEDYACLAEGLLELHQSTGEPRWLADAITLLDLALAHFGVPDSPGAY 526
Query: 538 NTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVRLASIVAG-SKSDYYRQNAEHSLAVF 595
T +D VL+ R + D A PSG S ++ N + AS++AG + YR+ AE +LA
Sbjct: 527 YDTADDAEVLVQRPSDPTDNASPSGAS-ALANALLTASVLAGHDQVGRYREAAEQALARA 585
Query: 596 ETRLKDMA-MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
A + A + P + VV S D +LAAA AS V+
Sbjct: 586 GRLAAHAPRFAGHWLTVAEAAAAGPVQVAVVGPDAASRAD---LLAAAVASSPDGAVVVS 642
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
P D + + +A A VC+ + C PV L + L
Sbjct: 643 GTP-DADGVPL----------LADRPLVEGAAAAYVCRGYVCERPVATAEELRSQL 687
>gi|291009338|ref|ZP_06567311.1| hypothetical protein SeryN2_32865 [Saccharopolyspora erythraea NRRL
2338]
Length = 683
Score = 326 bits (835), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 229/693 (33%), Positives = 321/693 (46%), Gaps = 89/693 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A ++N+ FV+IKVDREERPDVD VYM QA+ G GGWP++
Sbjct: 51 ACHWCHVMAHESFEDEATAAVMNENFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTC 110
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GTY+P +G P F+ +L V AW ++ + Q+ +EQL
Sbjct: 111 FLTPDAEPFHCGTYYPSAPLHGMPSFRQLLDAVASAWRERGGEVRQAATRVVEQL----- 165
Query: 141 ASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SA LP+ L + +L D GFG APKFP + ++ +L H ++
Sbjct: 166 -SAQRTALPESFLDDEVIATAVSRLHAESDPDHAGFGGAPKFPPSMVLEFLLRHQERQSA 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G A E M T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L
Sbjct: 225 PGSGHTALE---MAEATCEAMARGGIYDQLAGGFARYSVDSAWVVPHFEKMLYDNALLLR 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + + R+ +L RD+ P G ++ DAD TEG EG YV
Sbjct: 282 VYAHLARRRESPLAERVARETAAFLLRDLRTPEGGFAASLDAD---TEGV----EGLTYV 334
Query: 320 WTSKEVEDILGE-----HAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
WT +++ ++LGE A LF + + + T L R DP + + + V
Sbjct: 335 WTPEQLAEVLGEADGAWAAELFEVTESGTFEQGTSTLQLKR--DPDDPARWRRV------ 386
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
R L++ RS+RP+P DDKV+ SWNG+ I++ AS
Sbjct: 387 ----------------------RDALYEARSRRPQPGKDDKVVTSWNGMAITALVEASTA 424
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPG 489
L E++ AE AA + RHL D+ RL+ S R+G A G
Sbjct: 425 LGE----------------PEWLAAAEQAAKLLVERHLVDQ---RLRRSSRDGVVGAAAG 465
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLL 548
L+DY L GLL L++ +WL A L +T E F D + G YF+T + ++
Sbjct: 466 VLEDYGCLADGLLSLHQATGEPRWLDVACSLLDTALEQFADSDNPGAYFDTAADSEELVR 525
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R + D A PSG S L+ +++ GS + YR AE +L+ + A
Sbjct: 526 RPSDPTDNASPSGASSLTSALLTASALAGGSAAQRYRHAAEQALSRAGLLAERAARFAGH 585
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
A+ L+ V + G + D +L AA V+ +P T
Sbjct: 586 WLSTAEALA-HGPLQVAVAGPEDDGDRAALLEAAWRHSPGGAVVLAGEPEAT-------- 636
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+A A VC+ + C PVT
Sbjct: 637 ---GVPLLADRPLVGGSAAAYVCRGYLCDRPVT 666
>gi|134097521|ref|YP_001103182.1| hypothetical protein SACE_0923 [Saccharopolyspora erythraea NRRL
2338]
gi|133910144|emb|CAM00257.1| protein of unknown function DUF255 [Saccharopolyspora erythraea
NRRL 2338]
Length = 681
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 229/693 (33%), Positives = 321/693 (46%), Gaps = 89/693 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A ++N+ FV+IKVDREERPDVD VYM QA+ G GGWP++
Sbjct: 49 ACHWCHVMAHESFEDEATAAVMNENFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTC 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GTY+P +G P F+ +L V AW ++ + Q+ +EQL
Sbjct: 109 FLTPDAEPFHCGTYYPSAPLHGMPSFRQLLDAVASAWRERGGEVRQAATRVVEQL----- 163
Query: 141 ASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
SA LP+ L + +L D GFG APKFP + ++ +L H ++
Sbjct: 164 -SAQRTALPESFLDDEVIATAVSRLHAESDPDHAGFGGAPKFPPSMVLEFLLRHQERQSA 222
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G A E M T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L
Sbjct: 223 PGSGHTALE---MAEATCEAMARGGIYDQLAGGFARYSVDSAWVVPHFEKMLYDNALLLR 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + + R+ +L RD+ P G ++ DAD TEG EG YV
Sbjct: 280 VYAHLARRRESPLAERVARETAAFLLRDLRTPEGGFAASLDAD---TEGV----EGLTYV 332
Query: 320 WTSKEVEDILGE-----HAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
WT +++ ++LGE A LF + + + T L R DP + + + V
Sbjct: 333 WTPEQLAEVLGEADGAWAAELFEVTESGTFEQGTSTLQLKR--DPDDPARWRRV------ 384
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
R L++ RS+RP+P DDKV+ SWNG+ I++ AS
Sbjct: 385 ----------------------RDALYEARSRRPQPGKDDKVVTSWNGMAITALVEASTA 422
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPG 489
L E++ AE AA + RHL D+ RL+ S R+G A G
Sbjct: 423 LGE----------------PEWLAAAEQAAKLLVERHLVDQ---RLRRSSRDGVVGAAAG 463
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLL 548
L+DY L GLL L++ +WL A L +T E F D + G YF+T + ++
Sbjct: 464 VLEDYGCLADGLLSLHQATGEPRWLDVACSLLDTALEQFADSDNPGAYFDTAADSEELVR 523
Query: 549 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 608
R + D A PSG S L+ +++ GS + YR AE +L+ + A
Sbjct: 524 RPSDPTDNASPSGASSLTSALLTASALAGGSAAQRYRHAAEQALSRAGLLAERAARFAGH 583
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
A+ L+ V + G + D +L AA V+ +P T
Sbjct: 584 WLSTAEALA-HGPLQVAVAGPEDDGDRAALLEAAWRHSPGGAVVLAGEPEAT-------- 634
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+A A VC+ + C PVT
Sbjct: 635 ---GVPLLADRPLVGGSAAAYVCRGYLCDRPVT 664
>gi|383785408|ref|YP_005469978.1| hypothetical protein LFE_2175 [Leptospirillum ferrooxidans C2-3]
gi|383084321|dbj|BAM07848.1| hypothetical protein LFE_2175 [Leptospirillum ferrooxidans C2-3]
Length = 694
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 235/700 (33%), Positives = 350/700 (50%), Gaps = 77/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPL 78
+ CHWCHVM ESFED A ++N+ F++IKVDREERPD+D +Y M + GGWPL
Sbjct: 48 SACHWCHVMAHESFEDPETASVMNESFINIKVDREERPDLDHIYQMAHTVITKRNGGWPL 107
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
++FL+PD P GGTYFP ++G PGF ++L +++ +D+ ++ L+ + E LS +
Sbjct: 108 TMFLTPDQVPFAGGTYFPKSPRFGLPGFISVLHQIRQFYDENKEALSGTKHPVTELLSRS 167
Query: 139 LSASASSNKLPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+ +N P L P+ LR + L +DS GGF APKFP P++I +
Sbjct: 168 DALGEGANPDPSSLTIEPEARLR---DSLRARFDSEDGGFTPAPKFPHPMDI------AA 218
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
L + + GE + M TL+ MA GGI+D +GGGF RYSVD W +PHFEKMLYD
Sbjct: 219 CLREYEREGEVFD-LWMARHTLERMASGGIYDQIGGGFSRYSVDGTWTIPHFEKMLYDNA 277
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
L VY + L++D + +C I+ +L R+M G +A DADS EG +EG
Sbjct: 278 LLLCVYAEGAHLSEDAGLASVCDGIVTWLFREMRDSSGAFHAALDADS---EG----EEG 330
Query: 316 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKG--KNVLIELNDS 372
+YVWT +EV IL E + Y L T N + +EF KN+
Sbjct: 331 KYYVWTREEVSRILTPEEYQVVSLTYGLSETPNFE--------HEFWHFRKNLPF----- 377
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
S AS+L + + ++L + KL VRS+R P DDKV+ WNGL+ RA +IL
Sbjct: 378 SEVASRLSLTEGPFHSLLSSAKEKLLSVRSQRIPPGKDDKVLTGWNGLLARGLIRAGRIL 437
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 492
DR E++ + +R L+ L G S+ +LD
Sbjct: 438 ----------------DRPEWIMEGQKILDILRETLW--TGDHLLAVRTKGESRLNAYLD 479
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYA+++ L++ L WA+ L + F D GG+ T+ + ++ R K
Sbjct: 480 DYAYVLDALVESLATVYRPSDLAWALSLADVLVSKFWDDAAGGFHFTSHDHEQLIHRPKS 539
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 612
HD A PSG++V+ L RLA + + D+ + +LA++ + + M M A
Sbjct: 540 GHDAAIPSGSAVTCRALNRLAHL--SGRMDWL-EKVGRTLALYSKPMLEQPMGYASMIMA 596
Query: 613 -ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM-DFWEEHN 670
+ LS P +VLV KSS+++ +A A L+ +I + D+ + DF ++
Sbjct: 597 LGEYLSPPV---IVLVRGKSSLEWS---LSARAKSPLDTLIIDLGERDSLSLPDFLQKPP 650
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ S + A VC C PVTD L++LL
Sbjct: 651 ATGVSF--------ETQADVCGGGVCLSPVTD---LKDLL 679
>gi|124002212|ref|ZP_01687066.1| thymidylate kinase [Microscilla marina ATCC 23134]
gi|123992678|gb|EAY32023.1| thymidylate kinase [Microscilla marina ATCC 23134]
Length = 681
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 221/690 (32%), Positives = 333/690 (48%), Gaps = 85/690 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFED+ VA ++N +F+ IKVDREERPDVD +YM VQA+ GGWPL+
Sbjct: 55 SACHWCHVMERESFEDDEVAAIMNRYFICIKVDREERPDVDAIYMDAVQAMGQRGGWPLN 114
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
L+P+ KP TY P E + +L+ V + + KRD L QS E EA+
Sbjct: 115 ALLTPEAKPFYALTYLPKE------SWVQLLQNVAEVYQTKRDELEQSA----EAYREAI 164
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRF-------GGFGSAPKFPRPVEIQMMLY 192
+ S + +L N +R E L K + S + GG APKFP P Q +L+
Sbjct: 165 ATSEAKKY---DLKPNDIRYAREDLDKMFQSVYNDVDHTRGGTNRAPKFPMPSIWQFLLH 221
Query: 193 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 252
+ + + E + V TL MAKGGI+D +GGGF RYSVD W PHFEKMLY
Sbjct: 222 YY-------QITKKEEALRTVEVTLNEMAKGGIYDQIGGGFARYSVDADWFAPHFEKMLY 274
Query: 253 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
D GQL ++Y DA+++T++ Y + +D++ R++ G FSA DADS EG
Sbjct: 275 DNGQLLSLYADAYNVTQNPLYQQVVMQTVDFVARELTSEEGGFFSALDADS---EGV--- 328
Query: 313 KEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
EG FYVW ++++G E A + ++Y + N ++ N+L
Sbjct: 329 -EGKFYVWEKTAFDEVIGVEDAAIAADYYQVTSQAN------------WEEGNILHRSIG 375
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
A A K + +E + + +L RSKR RP LDDK++ SWNGL++ A ++
Sbjct: 376 DLAFAEKHQIDVESLKQKVTQWNERLLTARSKRIRPGLDDKILTSWNGLMLKGLVDAYRV 435
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
D + + +A + A FI L E ++L HS++NG + +L
Sbjct: 436 F----------------DSPKLLNLALANAQFIAEKLTTE-NYQLYHSYKNGKASINAYL 478
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
+DYA ++ + LY+ +WL A L + F D+E G +F T ++ R K
Sbjct: 479 EDYAAVVDAYIALYQATFDEQWLTKAKSLTDYALANFYDKEEGLFFFTDVNAEKLIARKK 538
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
E D P+ NS+ NL L + +SD Y+Q A L + + + +
Sbjct: 539 ELFDNVIPASNSMMAKNLYWLG--LYYEQSD-YQQKASQMLGQMQKIIVENPESAANWAT 595
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI-HIDPADTEEMDFWEEHN 670
+ P+ + V +VG ++ + A+ Y NK + + P D+ + +
Sbjct: 596 LYTYFAQPTAE-VAIVGEQA----QEYRASLDKYYYPNKILAGTLQPQDS--LGLLQNRG 648
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPV 700
+ N + VC N +C PV
Sbjct: 649 TING----------QTTVYVCYNKTCQLPV 668
>gi|225418720|ref|ZP_03761909.1| hypothetical protein CLOSTASPAR_05944, partial [Clostridium
asparagiforme DSM 15981]
gi|225041746|gb|EEG51992.1| hypothetical protein CLOSTASPAR_05944 [Clostridium asparagiforme
DSM 15981]
Length = 506
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 197/519 (37%), Positives = 264/519 (50%), Gaps = 64/519 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVM ESFED VAK LN +V +KVDREERP++D VYM+ QA+ G GGWPL+
Sbjct: 48 STCHWCHVMAHESFEDREVAKRLNADYVPVKVDREERPEIDMVYMSVCQAMTGQGGWPLT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+ ++PD KP GTY P + G +L V + W R L + L A
Sbjct: 108 IIMTPDKKPFFAGTYLPKTSRRNMTGLLELLSAVSEIWKSDRKRLLNMSDQILAVLRRAP 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
AS+ ++ P+ R E+L ++D +GGFG APKFP P + ++ +
Sbjct: 168 DASSPAD------PETLARRGYEELRAAFDRTYGGFGRAPKFPAPHNLLFLMRY------ 215
Query: 200 TGKSGEASEGQKMVLF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
A E Q + + TL MA+GGIHDH+GGGF RYS D+ W VPHFEKMLYD L
Sbjct: 216 ---RAWADEPQALAMAEKTLSSMARGGIHDHLGGGFSRYSTDQMWLVPHFEKMLYDNALL 272
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
A YL+ + LT + FY R ILDY+RR++ GP G + +DADS EG +
Sbjct: 273 ALAYLEGYRLTGNRFYQRTARQILDYVRRELTGPEGGFYCGQDADSQGV-------EGKY 325
Query: 318 YVWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
YV++ +E+ +LG F Y + GN F+G N+ +++
Sbjct: 326 YVFSEEEIGRVLGSRKDQEKFCRRYGITKEGN------------FEGANIPNLIHNPDYE 373
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
L M CRR L++ R KR H DDK++ SWN L+I + ARA +L
Sbjct: 374 QRDLEMD--------ALCRR-LYEYRLKRLPLHRDDKILASWNALMIIACARAGFLL--- 421
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
D Y+E+A A F+ + L+DE RL +R G S PG LDDYA
Sbjct: 422 -------------DDPGYLEMAGRAQMFVEQKLFDENG-RLLVRYRQGESAFPGNLDDYA 467
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 534
F LL LYE +L A+ ELF D E G
Sbjct: 468 FYCLALLTLYEVTLDASYLELAVNRAEQMVELFWDEERG 506
>gi|30248134|ref|NP_840204.1| hypothetical protein NE0103 [Nitrosomonas europaea ATCC 19718]
gi|30180019|emb|CAD84014.1| putative similar to unknown proteins [Nitrosomonas europaea ATCC
19718]
Length = 689
Score = 325 bits (832), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 222/685 (32%), Positives = 340/685 (49%), Gaps = 66/685 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
+ CHWCHVM ESFED VA +N+ FV+IKVDREERPD+D++Y + L + GGWPL
Sbjct: 48 SACHWCHVMAHESFEDAQVATAMNEHFVNIKVDREERPDIDQIYQSAHYTLNHRSGGWPL 107
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
++FL+P+ KP GGTYFP E +Y PGF +L KV + + ++ + + A ++ L+++
Sbjct: 108 TMFLTPEQKPFFGGTYFPKEARYSMPGFLELLPKVAELYRTRKTDIEKQNAVLLKLLAQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L A + L + + EQL++ +D GGFG APKF P E+Q L
Sbjct: 168 LPAPDTR---ASALSRQPIDRAWEQLNRLFDETDGGFGDAPKFLHPAELQFCLRRYVTDN 224
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
DT +V TL+ MA+GG++D +GGGF RYS D W +PHFEKMLYD +
Sbjct: 225 DT-------RALHVVTHTLEKMAQGGLYDQLGGGFCRYSTDHSWQIPHFEKMLYDNALML 277
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDM---IGPGGEIFSAEDADSAETEGATRKKEG 315
+Y + + +T + + + + ++ R+M I G FS+ DADS +EG
Sbjct: 278 PLYAETWLVTGNPLFKQVVEETAAWVIREMQSGIDGEGGYFSSLDADS-------EHEEG 330
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVW + V IL YY D S + H+ IE
Sbjct: 331 KFYVWDRQAVSAILTPEEYRVTAAYY-----GLDRSPNFENHHWHLAVTESIE-----TV 380
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A++ + E ++ RRKL + R +R RP D+K++ SWN L+I RA +I
Sbjct: 381 AARHQISQEAVQQLIDSARRKLLNEREQRIRPGRDEKILTSWNALMIKGMTRAGQIF--- 437
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
+R+E++ A A FIR L+ Q RL +F++ + +LDD+A
Sbjct: 438 -------------EREEWISSAVRALDFIRSRLW--QNDRLLATFKDDKAHLNAYLDDHA 482
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
FL+ LL L + L +AI L + F D+ GG+F T+ + +++ R K HD
Sbjct: 483 FLLDSLLTLLQADFRQTDLDFAITLADVLLTRFEDKTSGGFFFTSHDHETLIHRPKTGHD 542
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GA P+GN ++ L RL ++ + Y + AE +L VF + L A + + +
Sbjct: 543 GAIPAGNGIAATTLQRLGHLLNEQR---YLEAAERTLNVFSSGLSLHASSHCSLLITLEE 599
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
P+ K V+L G++ + + A Y L+K VI + P + E+ S
Sbjct: 600 FLEPT-KTVILHGNRPEL---QIWLKALLPYSLDKIVIAL-PLELSELP---------DS 645
Query: 676 MARNNFSADKVVALVCQNFSCSPPV 700
+ + K+ A VC+ C P +
Sbjct: 646 LKMRSTPDGKISARVCEGRRCLPEI 670
>gi|344203206|ref|YP_004788349.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343955128|gb|AEM70927.1| hypothetical protein Murru_1888 [Muricauda ruestringensis DSM
13258]
Length = 699
Score = 325 bits (832), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 210/686 (30%), Positives = 329/686 (47%), Gaps = 78/686 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME E FED VA+++N FV+IK+DREERPDVD++YM +Q + G GGWPL++
Sbjct: 77 CHWCHVMEKECFEDAEVAEVMNKNFVNIKIDREERPDVDQIYMDAIQMISGQGGWPLNIV 136
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P G TY P ++ + L ++ + + K + + Q A L+ L A
Sbjct: 137 ALPDGRPFWGATYVPKDN------WIKSLEQLAELYKKDKPRVTQYAA----DLANGLHA 186
Query: 142 S--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
++K D + L + + ++ +D+ GG APKF P +L+++ +
Sbjct: 187 INLVENDKDSDLYSLDQLDVAIQNWTQYFDTFLGGHKRAPKFMMPNNWDFLLHYATAV-- 244
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ E + V TL MA GG++DHVGGGF RY+VD +WHVPHFEKMLYD GQL +
Sbjct: 245 -----DKPEIMEFVDTTLTRMAYGGVYDHVGGGFSRYAVDTKWHVPHFEKMLYDNGQLTS 299
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y A++ TK+ Y + + +++++ + + G +S+ DADS + EGA+YV
Sbjct: 300 LYAKAYAATKNELYKNVVEETINFVQEEFLDRSGGFYSSLDADSLDENAELV--EGAYYV 357
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT KE+ +LG+ LF+E++ + G + + VLI A K
Sbjct: 358 WTKKELSGLLGDDFELFQEYFNINSYGYWE-----------EENYVLIRDKSDEEVADKF 406
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ + + + E KL R KRP+P LDDK++ SWNGL++ A + L E
Sbjct: 407 NITIPELKTTITESLAKLKGEREKRPKPRLDDKILTSWNGLMLKGLVDAYRYLGEE---- 462
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+Y+ +A A FI R + + L + + G S GFL+DYA +I
Sbjct: 463 ------------DYLNLALKNAEFIEREMI-KSDGSLYRNHKEGKSTINGFLEDYATVID 509
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LYE KWL A L + F D G +F T+ ED S++ R E D
Sbjct: 510 AYFSLYEATFDEKWLDLAKNLLEYSKKHFWDETSGMFFYTSDEDQSLIRRTIEVDDNVIS 569
Query: 560 SGNSVSVINLVRLASIVA----GSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
S NS+ INL + + G+ S+ +N + F+ R + A + L+ +
Sbjct: 570 SSNSIMAINLYKFHKLYPEESYGNMSEQMLKNVQKD---FDRRAQGFANWLHLV-----L 621
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ ++G D++N+ Y N ++ N
Sbjct: 622 FQNQDFYEIAILGE----DYKNLGQQISKEYVPNSILVG-------------SQKEGNLE 664
Query: 676 MARNNFSADKVVALVCQNFSCSPPVT 701
+ +N + +K + VC +C PVT
Sbjct: 665 LLKNRGNPNKTLVYVCIEGACKLPVT 690
>gi|21223348|ref|NP_629127.1| hypothetical protein SCO4975 [Streptomyces coelicolor A3(2)]
gi|20520976|emb|CAD30960.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
Length = 686
Score = 325 bits (832), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 234/701 (33%), Positives = 337/701 (48%), Gaps = 72/701 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED A+ LN FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 54 SACHWCHVMAHESFEDGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 113
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPPE ++G P F+ +L+ V+ AW ++RD + + + L+
Sbjct: 114 VFLTPDAEPFYFGTYFPPEPRHGMPSFRQVLQGVQQAWAERRDEVDEVAGKIVRDLAGRE 173
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+S + ++L Q L L++ YD R GGFG APKFP + I+ +L H +
Sbjct: 174 ISYGDAEAPGEEQLGQALL-----GLTREYDERRGGFGGAPKFPPSMVIEFLLRHHAR-- 226
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 227 -TGAEG----ALQMAADTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 281
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS +G + EGA Y
Sbjct: 282 RVYAHLWRATGSDLARRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAHY 339
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
VWT ++ ++LG E A L +++ + G + G +VL + +S A
Sbjct: 340 VWTPAQLTEVLGAEDAELAAQYFGVTQEGTFE-----------HGASVLQLPQQESVFDA 388
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
++ + R +L R RP P DDKV+ +WNGL +++ A
Sbjct: 389 AR-----------IASVRERLLAARDGRPAPGRDDKVVAAWNGLAVAALAET-------- 429
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
A F P +A +R HL DEQ RL + ++G + A G L+DYA
Sbjct: 430 -GAYFERP------DLVEAAVAAADLLVRLHL-DEQV-RLTRTSKDGRAGANAGVLEDYA 480
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + F D E G ++T + ++ R ++ D
Sbjct: 481 DVAEGFLALASVTGEGVWLDFAGFLLDHVLTRFTD-ESGSLYDTAADAERLIRRPQDPTD 539
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A PSG S + L+ S A + S +R AE +L V + + + AA+
Sbjct: 540 NATPSGWSAAAGALL---SYAAHTGSAPHRAAAERALGVVKALGPRVPRFIGWGLAAAEA 596
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
L R+ V+ A A+ L++T + + A + F E +
Sbjct: 597 LLDGPREVAVVAPDP----------ADPAARGLHRTAL-LGTAPGAVVAFGTEGSDEFPL 645
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+A A VC+NF+C P TDP L L P+
Sbjct: 646 LADRPLVGGAPAAYVCRNFTCDAPTTDPDRLRTALGVAPTG 686
>gi|113474681|ref|YP_720742.1| hypothetical protein Tery_0863 [Trichodesmium erythraeum IMS101]
gi|110165729|gb|ABG50269.1| protein of unknown function DUF255 [Trichodesmium erythraeum
IMS101]
Length = 693
Score = 325 bits (832), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 218/632 (34%), Positives = 324/632 (51%), Gaps = 93/632 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F DE +A+ LN+ F+ IKVDREERPDVD +YM +Q L G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDEKIAQYLNEKFLPIKVDREERPDVDSIYMQALQMLTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P +GGTYFP E +YGRPGF +L+K++ +D +++ L +E L ++
Sbjct: 108 IFLTPDDLIPFVGGTYFPIEPRYGRPGFLEVLQKIRSFYDLEKNKLDTLKVEMLEGLRKS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + + L +E+ Q L + + + Y S FP Q L KKL
Sbjct: 168 VLLPEAED-LKEEILQQGLEVITKIIGDRY--------SQQSFPMIPYAQAAL-QGKKLN 217
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ-- 256
++ K+ L +A GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 218 FKSQNN----SNKVCLERGLNLALGGIYDHVAGGFHRYTVDPNWTVPHFEKMLYDNGQIV 273
Query: 257 --LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
LAN++ + K F I + ++L+R+M P G ++A+DADS T +E
Sbjct: 274 EYLANLWSAGYH--KPAFKRGIIGTV-NWLKREMTAPTGFFYAAQDADSFTTPDEVEPEE 330
Query: 315 GAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFY+W+ KE+E++L + + + ++++P GN F+GK VL
Sbjct: 331 GAFYIWSYKELENLLTKEELSELSKQFFIEPNGN------------FEGKIVL-----QR 373
Query: 374 ASASKLGMPLEKYLNILGECRRKL--FDVRSKRPRPH----------------LDDKVIV 415
A +L +E L+ L + R + F++ + P + D K+IV
Sbjct: 374 KQAEELSKTVENSLSKLFKLRYGVQPFNIETFPPATNNKEAKNNNWPGKIPAVTDTKMIV 433
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTH 474
+WN L+IS AR + + S EY+E+A +AA F I D + H
Sbjct: 434 AWNSLMISGLARTATVFNS----------------LEYLELAMNAAHFIITNQQIDGRFH 477
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE----------FGSGTK-WLVWAIELQNT 523
RL + G +DYA I LLDL + + T WL AI+LQ+
Sbjct: 478 RLNYE---GKPAVTAQSEDYALFIKALLDLQQASISLETLSKLNTNTNFWLETAIKLQDE 534
Query: 524 QDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD 582
DE +E GY+NT+ E ++LR + D A P+ N +++ NLVRL+ + ++
Sbjct: 535 FDEFLWSQETAGYYNTSYEVTGELILRERNYIDNATPAANGIAIANLVRLSLL---TEEL 591
Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
YY AE +L F + +K A P + A D
Sbjct: 592 YYLDRAESALTAFSSIMKKSPQACPSLFVALD 623
>gi|298293757|ref|YP_003695696.1| hypothetical protein Snov_3807 [Starkeya novella DSM 506]
gi|296930268|gb|ADH91077.1| protein of unknown function DUF255 [Starkeya novella DSM 506]
Length = 672
Score = 324 bits (831), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 238/690 (34%), Positives = 340/690 (49%), Gaps = 89/690 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A ++N+ FV+IKVDREERP+VD++YM+ +Q L GGWP+++
Sbjct: 49 ACHWCHVMAHESFEDEATAAVMNELFVNIKVDREERPEVDQIYMSALQQLGVQGGWPMTM 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL + P GGTYFP E +YG+P F +L+ + +A+ +A + + +L + +
Sbjct: 109 FLDAEGAPFWGGTYFPKEARYGQPAFTDVLKTMANAYGSGDPRIASNREALLARLRQKAA 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
P+EL A R+ DS+ GG +PKFP ++++ + E T
Sbjct: 169 PVGKVTIGPNELDDVAGRILG-----IMDSQHGGLQGSPKFPNTPFLELLW---RAWERT 220
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ + L L M++GGI+DHVGGG+ RYSVDERW VPHFEKMLYD Q+ +
Sbjct: 221 GR----QRLRDAALHALDGMSEGGIYDHVGGGYARYSVDERWLVPHFEKMLYDNAQILEL 276
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
A+S T + + + +L+R+M+ G ++ DADS EG EG +YVW
Sbjct: 277 LGLAYSETLADLFRARAEETVGWLQREMLTTSGAFAASLDADS---EG----HEGRYYVW 329
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
T K+V D LG E A F HY + P GN + +S P N L E+ S A +L
Sbjct: 330 TLKQVLDALGAEDAEFFARHYDIAPFGNWE--GVSIP-------NRLKEMERSPADEMRL 380
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
M R KL VR R P DDKV+ WNGL+I++ A +
Sbjct: 381 AM-----------LRDKLLKVRETRVPPGRDDKVLADWNGLMIAALANVA---------- 419
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
P G R E++E+A A FI + E RL HS+R G PG DYA +I
Sbjct: 420 ----PRFG--RPEWVELAARAFRFIAESMAREG--RLGHSWREGRLVFPGLSSDYAAMIG 471
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
L L++ + A+ Q Q E E GGY+ T + ++LR D A
Sbjct: 472 AALALHQATGEASYFDHAVAWQ-AQLEAHHAAEDGGYYLTADDAEGLILRPDAAADDAVT 530
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD--MAMAVPLMCCAADMLS 617
+ N++ NLVRLA++ + D YR+ A+ RL D + A P + A +L+
Sbjct: 531 NPNALIARNLVRLAAV---TGDDGYRERAD--------RLFDGLLPRAAPSLYSHAGLLN 579
Query: 618 VPSRK----HVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI-DPADTEEMDFWEEHNSN 672
+ +V+VG D +L AA ++ + + DPA E N
Sbjct: 580 ALDTRLRAPEIVVVGSGEVAD--ALLDAARRLPRVDLMIERVSDPASLPE---------N 628
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + A+ S D A VC CS PVTD
Sbjct: 629 HPARAKAE-SIDGAAAFVCAGSVCSLPVTD 657
>gi|206603590|gb|EDZ40070.1| Protein of unknown function [Leptospirillum sp. Group II '5-way
CG']
Length = 689
Score = 324 bits (831), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 220/697 (31%), Positives = 338/697 (48%), Gaps = 64/697 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLSV 80
CHWCHVM ESFE +AK++N++FV+IKVDREERPD+D++Y M + GGWPL++
Sbjct: 50 CHWCHVMAHESFERPDIAKVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLTM 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P P GGTYFP + ++G PGF +L +++D + R+ L + ++ L +
Sbjct: 110 FLTPSQVPFAGGTYFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTNP 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ S+ D P AL L +D FGGFG APKFP +++ + ++
Sbjct: 170 VADSTGFELDLSPSEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFHRK 223
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G S A M TL M +GGI DHVGGGF RYSVDERW +PHFEKMLYD L
Sbjct: 224 GDSTAA----HMATLTLSAMKRGGIWDHVGGGFARYSVDERWLIPHFEKMLYDNALLLEA 279
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
S++++ YS +++ +L R+M G +S+ DADS EG +EG FYV+
Sbjct: 280 LALGASVSRNPVYSRTAEELVGWLFREMRSEHGVYYSSLDADS---EG----EEGRFYVF 332
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
++EV IL E + +HY L S+P N L E + +
Sbjct: 333 QAEEVRSILSDEEYRVVSKHYGL-----------SEPPNFESHAWHLYEARSIGELSKEF 381
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+P + + R+KLF RS R RP LDDK++ SWN L+ A++
Sbjct: 382 HLPESDIESRIDSARQKLFTYRSLRVRPGLDDKILASWNALM--------------AKAL 427
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+F+ ++G ++E+M ++ R+++ L + P +LDDYAFL+
Sbjct: 428 LFSGRILG--KQEWMTAGRKTIDYMHRNMWKNGV--LMAVYSKKEPFLPAYLDDYAFLLL 483
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+L+ + L +A + + F D E GG++ T +++ R K HDGA P
Sbjct: 484 AVLESIRIDFRPEDLSFATAIADVLLTEFYDPESGGFYFTGKNHEALIHRPKNGHDGALP 543
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SGN+ +V L+ L ++ Y A+ +L ++ ++K+ M A + S
Sbjct: 544 SGNAAAVQGLLWLGTLTGHLP---YTSAADQTLRLYFAQMKEQPAGYTTMISALETYS-- 598
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
+ V+L+ + D++N + D VI + A + E R
Sbjct: 599 DSQPVILLAGPQAEDWKNTI---RQGLDPEAFVIDLTSAVRNSLPLPEG--------MRK 647
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+F +K VC+ C P SL+ L P S
Sbjct: 648 HFPENKTTGWVCRGTMCLPSADSLESLQEQLRLWPLS 684
>gi|429201724|ref|ZP_19193171.1| hypothetical protein STRIP9103_06317 [Streptomyces ipomoeae 91-03]
gi|428662694|gb|EKX62103.1| hypothetical protein STRIP9103_06317 [Streptomyces ipomoeae 91-03]
Length = 687
Score = 324 bits (831), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 225/692 (32%), Positives = 338/692 (48%), Gaps = 82/692 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED A LN FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 52 SSCHWCHVMAHESFEDRETADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 111
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPP ++G P F+ +L V+ AW +RD + + + L+
Sbjct: 112 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVRAAWADRRDEVTEVAGKIVRDLAGRE 171
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L +A ++L + L L++ YD+ GGFG APKFP + I+ +L H +
Sbjct: 172 LQFAAVEVPGEEDLARALL-----GLTREYDAVHGGFGGAPKFPPSMVIEFLLRHYAR-- 224
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 225 -TGSEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 279
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS +G + EGA+Y
Sbjct: 280 RVYAHLWRATGSELARRVALETADFMVRELGTGEGGFASALDADS--DDGTGKHVEGAYY 337
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 376
VWT ++ ++LG+ A L + + + G + G++VL + ++ A
Sbjct: 338 VWTPAQLREVLGDQDADLAAQFFGVTEEGTFE-----------HGQSVLRLPQHEGVFDA 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
K + + +L R++RP P DDKV+ +WNGL +++ A
Sbjct: 387 EK-----------IASIKDRLNRARAQRPAPGRDDKVVAAWNGLAVAALAETGAYF---- 431
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYA 495
DR + +E A +AA + R DE+ +L + ++G A G L+DYA
Sbjct: 432 ------------DRPDLVEAAIAAADLLVRLHLDEKA-QLARTSKDGRVGANAGVLEDYA 478
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + F+D E G ++T + ++ R ++ D
Sbjct: 479 DVAEGFLALASVTGEGVWLEFAGFLLDHVLVRFVDEESGALYDTAADAEKLIRRPQDPTD 538
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A PSG S + L+ S A + S+ +R AE +L + +K + VP +
Sbjct: 539 NATPSGWSAAAGALL---SYTAHTGSEPHRAAAERALGI----VKALGPRVPRFIGWGLA 591
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A +L P + V +VG + + AA V+ + A+++E+
Sbjct: 592 TAEALLDGP--REVAVVGPEGHPGTRALHRAALLG-TAPGAVVAVGTAESDELPL----- 643
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+A + A VC+NF+C P TD
Sbjct: 644 -----LADRPLVGGEPAAYVCRNFTCDAPTTD 670
>gi|357055989|ref|ZP_09117045.1| hypothetical protein HMPREF9467_04017 [Clostridium clostridioforme
2_1_49FAA]
gi|355381481|gb|EHG28604.1| hypothetical protein HMPREF9467_04017 [Clostridium clostridioforme
2_1_49FAA]
Length = 646
Score = 324 bits (831), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 205/561 (36%), Positives = 286/561 (50%), Gaps = 51/561 (9%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
ME ESFE+E +A++LN +V +KVDREERPDVD VYM+ QA+ G GGWPL++ ++PD +
Sbjct: 1 MERESFENEVIAEILNREYVCVKVDREERPDVDSVYMSVCQAMNGQGGWPLTIIMTPDCR 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-MLAQSGAFAIEQLSEALSASASSN 146
P GTYFPP +YGRPG + +L D W K+D +L Q+G Q+ + L + +
Sbjct: 61 PFFSGTYFPPRARYGRPGLEELLTAAADQWKAKKDKLLEQAG-----QIEKYLRSQEQTG 115
Query: 147 KLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 205
+ + EL A+ Q + S+D + GGFGSAPKFP P + ++ + G +
Sbjct: 116 RWAEPELA--AVHQAFRQFADSFDRKNGGFGSAPKFPTPHSLIFLM-------EYGARQK 166
Query: 206 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 265
E M TL M +GGI DH+GGGF RYS D +W VPHFEKMLYD L Y+ A+
Sbjct: 167 RPEALAMAETTLVQMYRGGIFDHIGGGFSRYSTDGQWLVPHFEKMLYDNSLLVMAYIKAY 226
Query: 266 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
T Y + +L+Y+RR++ G + +DADS EG +YV+T +E+
Sbjct: 227 GRTGRKMYGCVAEKVLEYVRRELTDSQGGFYCGQDADSDGV-------EGKYYVFTQEEI 279
Query: 326 EDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
+LGE A F Y + GN S P N + +N + G
Sbjct: 280 RAVLGEKAGRDFCRQYGITRHGN--FEGRSIP-NLLENENYEEICEEPWGGDDHGGNVCH 336
Query: 385 KYLNILG-----ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
N G +C +KL+ R R R H DDK++VSWNG +I + A A +L
Sbjct: 337 GVRNSFGGRKNEDC-KKLYQYRLDRARLHKDDKILVSWNGWMICACAMAGAVLGE----- 390
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
K Y+++A A +FI L + RL R+G + G LDDYA
Sbjct: 391 -----------KRYVDMAVRAEAFINSRLV--KNGRLMVRCRDGDAAGEGKLDDYACYSL 437
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LL+LY +L A E F DRE GG++ + +++R KE +DGA P
Sbjct: 438 ALLELYRVTFQADYLKRAAAWAEIMTEQFFDRERGGFYLYAEDGEQLIVRTKETYDGAMP 497
Query: 560 SGNSVSVINLVRLASIVAGSK 580
SGNSV+ L RL I K
Sbjct: 498 SGNSVAAQVLHRLTQITGEVK 518
>gi|374987022|ref|YP_004962517.1| hypothetical protein SBI_04265 [Streptomyces bingchenggensis BCW-1]
gi|297157674|gb|ADI07386.1| hypothetical protein SBI_04265 [Streptomyces bingchenggensis BCW-1]
Length = 677
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 228/703 (32%), Positives = 333/703 (47%), Gaps = 86/703 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE A LN FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMARESFEDEATADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P GTYFPP ++G P F+ +L V+ AW +RD + + L+E
Sbjct: 108 VFLTPEAEPFYFGTYFPPAPRHGMPSFQQVLEGVQAAWADRRDEVKDVAERIVRDLAERG 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
AS + P++ L L++ +D+ GGFG APKFP + ++ +L H +
Sbjct: 168 GASLAYGAAQPPGPED-LHTALMTLTREFDAVHGGFGGAPKFPPSMVLEFLLRHHAR--- 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG ++V T + MA+GGI+D +GGGF RY+VD W VPHFEKMLYD L
Sbjct: 224 TGSQA----ALQIVQATCEAMARGGIYDQLGGGFARYAVDATWTVPHFEKMLYDNALLCR 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T + + ++L R++ G SA DADS + +G EGA+YV
Sbjct: 280 VYAHLWRATGSDLARRVAVETAEFLVRELRTEQGGFASALDADSDDGKGG--HAEGAYYV 337
Query: 320 WTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +++ + LGE A L E++ + G F+ + ++ L D A A
Sbjct: 338 WTPEQLSEALGEKDAELAAEYFGVTEEGT------------FEQSSSVLRLPDREALADA 385
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ R +L R +RPRP DDKV+ +WNGL +++ A
Sbjct: 386 ---------ERIASVRERLLAARGQRPRPGRDDKVVAAWNGLAVAALAETGAYF------ 430
Query: 439 AMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
DR + +E A +AA +R HL D RL + +G + A G L+DYA
Sbjct: 431 ----------DRPDLVEAATAAADLLVRVHLDDRG--RLARTSLDGTAGAHAGVLEDYAD 478
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+ G L L W+ A L +T F +G Y T +D L+R +D D
Sbjct: 479 VAEGFLALSSVTGEGAWVGLAGLLLDTVQRHFAAEDGMLY--DTADDAEALIRRPQDPTD 536
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A PSG + + L+ A++ + D R+ AE +L V + + VP +
Sbjct: 537 NAAPSGWTAAAGALLSYAAV---TGEDRPREAAERALGVVQA----LGARVPRFIGWGLA 589
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFWE 667
A +L P + V +VG D + A H + L V+ + + E+
Sbjct: 590 VAEALLDGP--REVAVVGP----DGDPATRALHRAALLGTAPGAVVAVGEPGSREVPL-- 641
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + A VC+ F+C P D +L L
Sbjct: 642 --------LLDRPLLEGRPAAYVCRRFTCDAPTADVGTLAGKL 676
>gi|443624623|ref|ZP_21109091.1| putative Spermatogenesis-associated protein 20 [Streptomyces
viridochromogenes Tue57]
gi|443341889|gb|ELS56063.1| putative Spermatogenesis-associated protein 20 [Streptomyces
viridochromogenes Tue57]
Length = 680
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 227/699 (32%), Positives = 325/699 (46%), Gaps = 79/699 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A LN FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 51 SSCHWCHVMAHESFEDQETADYLNAHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPP ++G P F+ +L V AW +RD +A+ + L+
Sbjct: 111 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVHSAWADRRDEVAEVAGKIVRDLAGRE 170
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+S + EL Q L L++ YD + GGFG APKFP + I+ +L H +
Sbjct: 171 ISFGGTEAPGEQELAQALL-----GLTREYDPQRGGFGGAPKFPPSMVIEFLLRHHAR-- 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 224 -TGSEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALLC 278
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y + T + + D++ R++ G SA DADS +G R EGA+Y
Sbjct: 279 RGYAHLWRATGSELARRVALETADFMVRELRTNEGGFSSALDADS--DDGTGRHVEGAYY 336
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VWT +++ + LG+ Y+ + E +S
Sbjct: 337 VWTPRQLRETLGDDDAELAARYF-----------------------GVTEEGTFEHGSSV 373
Query: 379 LGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
L +P + L + + R++L D RS+RP P DDK++ +WNGL I++ A
Sbjct: 374 LQLPQQDELFDADRVASIRQRLLDRRSERPAPGRDDKIVAAWNGLAIAALAET------- 426
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDY 494
A F+ P +A +R HL D RL + ++G A G L+DY
Sbjct: 427 --GAYFDRP------DLVDAALAAADLLVRLHLDD--AARLARTSKDGQVGANAGVLEDY 476
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
+ G L L WL +A L + F D E G ++T + ++ R ++
Sbjct: 477 GDVAEGFLALASVTGEGVWLDFAGFLLDHVLARFTDEESGALYDTAADAEQLIRRPQDPT 536
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC---C 611
D A PSG S + L+ S A + S +R AE +L V +K + VP
Sbjct: 537 DNAAPSGWSAAAGALL---SYAAQTGSAPHRAAAEKALGV----VKALGPRVPRFVGWGL 589
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
A ++ + V +VG L V+ + D++E+
Sbjct: 590 AVAEANLDGPREVAIVGPSLDEQATRTLHRTALLATAPGAVVAVGTPDSDELPL------ 643
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC+NF+C P TDP L L
Sbjct: 644 ----LADRPLVGGEPAAYVCRNFTCDAPTTDPERLRTAL 678
>gi|312194562|ref|YP_004014623.1| N-acylglucosamine 2-epimerase [Frankia sp. EuI1c]
gi|311225898|gb|ADP78753.1| N-acylglucosamine 2-epimerase [Frankia sp. EuI1c]
Length = 686
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 216/623 (34%), Positives = 312/623 (50%), Gaps = 71/623 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFEDE A +N+ FV+IKVDREERPDVD VYM AL G GGWP++V
Sbjct: 49 SCHWCHVMAHESFEDEATAAFMNEHFVNIKVDREERPDVDAVYMDVTVALTGHGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P +P GTYFPP+ + G P F +L+ + +AW +RD + SGA +L+EA +
Sbjct: 109 FLTPAGEPFFAGTYFPPQGRPGMPAFSQVLQALSEAWVTRRDEIESSGADIARKLAEA-A 167
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S + L + L +QL+ +D R GGFG+APKFP + +++L H
Sbjct: 168 ESPVGGRAGTRLDADLLDRAVDQLAGRFDPRNGGFGAAPKFPPSMVAELLLRHH------ 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+SG+A +V T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD QL V
Sbjct: 222 ARSGDA-RALDLVALTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLLRV 280
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS-----------AET--- 306
YL + T + + R+ ++L D+ G SA DAD+ AE+
Sbjct: 281 YLHLWRATGSGLAARVVRETAEFLLADLRTAEGGFASALDADAVPPAAPDGPGGAESGPG 340
Query: 307 -EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
E + EGA YVWT ++ +L + A E + + P G F+ +
Sbjct: 341 DEHGSHPVEGASYVWTPAQLAAVLAPDDAAWAAELFAVTPEGT------------FEHGS 388
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
+++L A ++ L R +L R+ RP+P DDKV+ SWN
Sbjct: 389 SVLQLPADPADPAR-----------LARVRDELAAARALRPQPARDDKVVASWN------ 431
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG 483
I A+F P ++E AE AAS +R HL D + R + G
Sbjct: 432 ---GLAIAALAEAGALFEVPA-------WIEAAERAASLLRDVHLVDGRLRRTSRHGKVG 481
Query: 484 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 543
P+ G LDDY + GLL LY+ WL A EL + F + GG+++T +
Sbjct: 482 PNA--GVLDDYGNVAEGLLALYQVTGELAWLELARELLDVARARFRAPD-GGFYDTADDA 538
Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA-VFETRLKDM 602
++L R +E D PSG S L+ A++ + S +R++AE ++ + +D
Sbjct: 539 ETLLRRPREISDSPTPSGQSAFAGALLTYAAL---TGSADHREDAEATVGLLAALLARDA 595
Query: 603 AMAVPLMCCAADMLSVPSRKHVV 625
+ A A +L+ P+ VV
Sbjct: 596 SFAGYAGAVAEALLAGPAEVAVV 618
>gi|402848267|ref|ZP_10896531.1| Thymidylate kinase [Rhodovulum sp. PH10]
gi|402501421|gb|EJW13069.1| Thymidylate kinase [Rhodovulum sp. PH10]
Length = 710
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 231/702 (32%), Positives = 344/702 (49%), Gaps = 70/702 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED A ++N+ FV IKVDREERPD+D++YM + L GGWPL++
Sbjct: 57 ACHWCHVMAHESFEDPATAAVMNELFVPIKVDREERPDIDQIYMAALHHLGDQGGWPLTM 116
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P +P+ GGTYFP ++G+P F +LR+V + ++ + + Q+ + +L+
Sbjct: 117 FLTPSGEPVWGGTYFPRVSRFGKPAFVDVLREVSRLFREEPEKIEQNRRALMGRLAHRAQ 176
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED- 199
A+ EL + A Q++ + D GG APKFP+P ++ ++ + + ED
Sbjct: 177 AAGRPVIGLAELDR-----MAAQIAGAIDLVNGGLRGAPKFPQPTMLE-TIWRAGEREDA 230
Query: 200 -TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG + + +V TL+ M +GGI DH+GGGF RYSVD+RW VPHFEKMLYD QL
Sbjct: 231 RTGFAHPTNLFYDLVALTLERMCEGGIFDHLGGGFARYSVDDRWLVPHFEKMLYDNAQLL 290
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+ A + T + + + +L R+M P G ++ DADS EG +EG FY
Sbjct: 291 ELLALAHARTGHELFRQRAEETVGWLLREMTTPEGAFCASLDADS---EG----EEGKFY 343
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND-SSASA 376
VWT +E+ +LG E A F HY ++P GN F+GK +L L A+
Sbjct: 344 VWTLEEIVGVLGPEDAARFAAHYDVEPAGN------------FEGKTILDRLPGLDQAAQ 391
Query: 377 SKLGMP--LEKYLNI-----LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
++ G+P L KY + L R++LFD RS R RP DDK++ WNGL I++ A A
Sbjct: 392 ARTGLPFALHKYADARIEADLAAMRQRLFDARSTRVRPGTDDKILADWNGLTIAALANAG 451
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+L D +++A A +F+ + + RL HS+R+G PG
Sbjct: 452 TLL----------------DVPASIDLARRAFAFVATEM--TRHGRLGHSWRDGRLLFPG 493
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
DYA +I L L+E ++L A+ Q D D E G Y+ + + +++R
Sbjct: 494 LASDYAAMIRAALALHEATGEKEFLDRAVAWQEAFDHHHQDVETGTYYLSADDAEGLVVR 553
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
D A P+ N ++ NLVRLA + + D +R+ A+ L R D +
Sbjct: 554 PSATTDDAIPNPNGLAAQNLVRLAVL---TGDDRWRERADALLEGLLPRAADNLFGHLSV 610
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A D+ + +VG + L A ++ P+ E
Sbjct: 611 MNALDLRL--RGLEIAIVGEGPHI---AALTGAAQHIPFGSRILFRAPS--------PEA 657
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLL 711
N +A + A VC CS PVT P L +L
Sbjct: 658 LPENHPARAQAAAAPEGAAFVCAGERCSLPVTTPEGLREAIL 699
>gi|218437933|ref|YP_002376262.1| hypothetical protein PCC7424_0938 [Cyanothece sp. PCC 7424]
gi|218170661|gb|ACK69394.1| protein of unknown function DUF255 [Cyanothece sp. PCC 7424]
Length = 687
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 241/731 (32%), Positives = 343/731 (46%), Gaps = 126/731 (17%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDGAIAEYMNANFLPIKVDREERPDLDSIYMQALQMMIGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-SE 137
+FL+P DL P GGTYFP E +Y RPGF +L+ V+ +D +++ L +E L +
Sbjct: 108 IFLTPDDLVPFYGGTYFPVEPRYNRPGFLQVLQSVRHFYDTEKEKLKSFKQEILEVLHNS 167
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-K 196
+ + +N EL L+ + ++KS G FG P FP ++L S+ K
Sbjct: 168 TILPLSDTNLQAHELFYRGLKTNTQVITKS----VGDFGR-PSFPMIPYASLILQGSRFK 222
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
E +A+E + L A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 223 FESDYDGKQAAEARGADL------ALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQ 276
Query: 257 LANVYLDAFSLTKDVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
+ + +S Y R I +L+R+M P G ++A+DAD+ +
Sbjct: 277 IIEYLANLWSSGSQ--YPSFQRAIAGTAQWLKREMTAPEGYFYAAQDADNFVHSEDAEPE 334
Query: 314 EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGAFYVW ++E +L E + K + + P GN F+G NVL
Sbjct: 335 EGAFYVWRYSDLEKLLSEDELEALKTAFTITPEGN------------FEGSNVL------ 376
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVR-------------------------SKRPRP 407
++ G E + IL KLF VR R P
Sbjct: 377 --QRTQEGTFTEDFEEILD----KLFGVRYGASSQDIEHFPPARNNQEAKTGNWQGRIPP 430
Query: 408 HLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH 467
D K+IV+WN L+IS ARA + + P+ Y E+A AA FI ++
Sbjct: 431 VTDTKMIVAWNSLMISGLARAYGVFRE---------PL-------YWELATGAAEFICQN 474
Query: 468 LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE-FGSGTKWLVWAIELQNTQDE 526
+ Q RL G + +DYAFLI LLDL F S T+WL AIE+Q D
Sbjct: 475 QW--QNGRLHRLNYEGQATVLAQSEDYAFLIKALLDLQTAFPSKTEWLNKAIEIQEEFDN 532
Query: 527 LFLDREGGGYFNT-TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
LF E GGY+N T +L+R + D A PS N +++ NL+RL + +++ Y
Sbjct: 533 LFCSVEMGGYYNNATDNSEDLLVRERSYLDNATPSANGIAITNLIRLGRL---TENLSYF 589
Query: 586 QNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHAS 645
+ AE +L F + L A P + A D +H + V S +
Sbjct: 590 EQAERALQAFSSILSQSPQACPSLFTALDWY-----RHGISVRATSQI------------ 632
Query: 646 YDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPIS 705
L + + P +D A + +D+ V LVCQ SC P T
Sbjct: 633 --LERLIFQYFPTAVYRVD---------AEL------SDQTVGLVCQGLSCLEPATTLEK 675
Query: 706 LENLLLEKPSS 716
L+ + + SS
Sbjct: 676 LQTQMKQATSS 686
>gi|354611184|ref|ZP_09029140.1| hypothetical protein HalDL1DRAFT_1849 [Halobacterium sp. DL1]
gi|353196004|gb|EHB61506.1| hypothetical protein HalDL1DRAFT_1849 [Halobacterium sp. DL1]
Length = 724
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 228/709 (32%), Positives = 333/709 (46%), Gaps = 56/709 (7%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESF D+GVA LN+ FV +KVDREERPDVD +YM Q + GGGGWPLS
Sbjct: 53 SACHWCHVMEEESFSDDGVAAALNENFVPVKVDREERPDVDSLYMKVCQVVRGGGGWPLS 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
FL+PD KP GTYFP E K +PGF +L V D+W +R L + L
Sbjct: 113 AFLTPDRKPFFVGTYFPKEPKRNQPGFTQLLDDVADSWQTERGDLEDRAEQWLSAAKGEL 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ L D+ P L A L+++ D GGFG APKFP+ + +L +D
Sbjct: 173 EDLPDATDLGDDSP---LDEAANALARTADRDNGGFGRAPKFPQAGRVDALLRAHDASDD 229
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ G+ +V L MA GG++DH+GGGFHRY D W VPHFEKMLYDQ L
Sbjct: 230 GKQYGD------IVREALDAMAGGGLYDHLGGGFHRYCTDADWTVPHFEKMLYDQATLVR 283
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK-EGAFY 318
Y+D + + Y+ + L ++ R++ P G ++ DA S + ++ EGAFY
Sbjct: 284 TYVDGYRSFGEERYADEVGETLAFVDRELGHPDGGFYATLDARSPPIDDPEGERVEGAFY 343
Query: 319 VWTSKEVEDILGEHA-------------ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
VWT ++VE+ + ++A LF+ Y + GN + G+ V
Sbjct: 344 VWTPEQVENAVADYADEAPADVDPGDLVDLFRARYGVDEAGNFE-----------HGQTV 392
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
L A + G ++ +L +L R RPRP DDKV+ WNGL+ ++
Sbjct: 393 LTVSASREELADEFGYQEDEVAELLAAAETRLRAARDDRPRPARDDKVLAGWNGLMARAY 452
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
A A F+ +D Y E A A +R L+D + RL +G
Sbjct: 453 AEA---------GLAFDGAEARADEDSYAERAAEAIDHVRSELWDGE--RLARRVIDGDV 501
Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
G+ +DYA+L +G L YE L +A++L + + D E G + T
Sbjct: 502 AGIGYAEDYAYLAAGALATYEATGDHAHLGFALDLADALLDACYDAETGALYQTPASVQD 561
Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
V +R + G PS V+ L+ L + ++ Y AE L + R++ A
Sbjct: 562 VDVRSQAVDGGPTPSPVGVAAETLLALDAFDPDAE---YANAAEAMLERYGERVQRSPAA 618
Query: 606 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
P + AADML V + V + V++ + A+ L ++ P E+D
Sbjct: 619 HPTLVLAADML-VTGHREVTVAADSLPVEWRRTVGTAY----LPDRLLSRRPRSAVELDE 673
Query: 666 WEEH--NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
W ++ + S + A VC+ +CSPP++ +E L E
Sbjct: 674 WLAALGLADAPPIWAGRQSHEAATAYVCRR-ACSPPLSTAEEIEEWLAE 721
>gi|294631112|ref|ZP_06709672.1| conserved hypothetical protein [Streptomyces sp. e14]
gi|292834445|gb|EFF92794.1| conserved hypothetical protein [Streptomyces sp. e14]
Length = 676
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 231/702 (32%), Positives = 326/702 (46%), Gaps = 85/702 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED+ A LN+ FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 47 SACHWCHVMAHESFEDQATAGYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPP ++G P F+ +L V+ AW +RD + + + L++
Sbjct: 107 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVRQAWATRRDEVTEVAGKIVRDLAQ-R 165
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+LP +EL Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 166 EIGYGGVQLPGEEELAQALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR- 219
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 220 --TGSEG----ALQMARDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALL 273
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY + T + + D++ R++ G SA DADS +G R EGA+
Sbjct: 274 CRVYAHLWRATGSELARRVALETADFMVRELRTGEGGFASALDADS--DDGTGRHVEGAY 331
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT +++ D LGE Y+ + E +S
Sbjct: 332 YVWTPEQLRDALGEEDAQLAAQYF-----------------------GVTEEGTFEHGSS 368
Query: 378 KLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
L +P ++ + + RR L + R+ RP P DDK++ +WNGL I++ A
Sbjct: 369 VLQLPQQEGVFDAERIESVRRLLLERRAGRPAPGRDDKIVAAWNGLAIAALAETGAYF-- 426
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDD 493
DR + +E A AA + R DE L + R+G A G L+D
Sbjct: 427 --------------DRPDLVEAALGAADLLVRLHMDEHAG-LARTSRDGQVGANAGVLED 471
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA + G L L WL +A L F D + G ++T + ++ R ++
Sbjct: 472 YADVAEGFLALASVTGEGVWLDFAGLLLGHVLTRFTDPDSGALYDTAADAEQLIRRPQDP 531
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL----- 608
D A PSG S + L A + S+ +R AE +L V +K + VP
Sbjct: 532 TDNATPSGWSAAAGA---LLGYAAHTGSEAHRTAAEKALGV----VKALGPRVPRFIGWG 584
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+ A L P VV S D A L++T + + A + + E
Sbjct: 585 LAVAEAALDGPREVAVVA---PSLAD--------EAGRVLHRTAL-LGTAPGAVVAYGTE 632
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC++F+C P TDP L L
Sbjct: 633 GGEEFPLLADRPLVGGAPAAYVCRDFTCDAPTTDPERLRAAL 674
>gi|307154410|ref|YP_003889794.1| hypothetical protein Cyan7822_4611 [Cyanothece sp. PCC 7822]
gi|306984638|gb|ADN16519.1| protein of unknown function DUF255 [Cyanothece sp. PCC 7822]
Length = 685
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 240/730 (32%), Positives = 344/730 (47%), Gaps = 123/730 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDAAIAEYMNTHFLPIKVDREERPDLDSIYMQALQMMIGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP E +Y RPGF +L+ V+ +D ++D L +E L A
Sbjct: 108 IFLTPDDLVPFYGGTYFPVEPRYNRPGFLQVLQSVRHFYDNEKDKLKSFKKEILEVLQSA 167
Query: 139 -LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-- 195
+ +N + ++L + ++ S + FG P FP + L S+
Sbjct: 168 TVLPLGDANLVSNDLFYRGIETNTAVITNSAND----FGR-PSFPMIPYANLTLQGSRFE 222
Query: 196 -KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ ++ GK G+ + L GGI+DH+GGGFHRY+VD W VPHFEKMLYD
Sbjct: 223 FQSQNDGKQAAIQRGEDLAL--------GGIYDHIGGGFHRYTVDSTWTVPHFEKMLYDN 274
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
GQ+ + +S +V + R I + +L+R+M P G ++A+DADS T
Sbjct: 275 GQIVEYLANLWS--SEVQKPSLARAIAGTVQWLKREMTAPEGYFYAAQDADSFTTPEDVE 332
Query: 312 KKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EGAFYVW+ +++ +L + K + + P GN F+GKNVL
Sbjct: 333 PEEGAFYVWSYSDIQQLLSTDELEALKTAFTVTPEGN------------FEGKNVL---- 376
Query: 371 DSSASASKLGMPLEKYLNILGECR--------------RKLFDVRS----KRPRPHLDDK 412
AS K E L+ L R R + +S R P D K
Sbjct: 377 -QRASEGKFAEDFEAVLDKLFAVRYGASSSTLDRFPPARNNAEAKSGNWPGRIPPVTDTK 435
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DE 471
+IV+WN L+IS ARA + + P+ Y E+A A FI H + +
Sbjct: 436 MIVAWNSLMISGLARAYGVFRE---------PL-------YWELAVGATEFIFTHQWKNG 479
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLD 530
+ HRL + G + +DYAFLI LLDL T+WL AI +Q D LF
Sbjct: 480 RLHRLNYE---GETGVLAQSEDYAFLIKALLDLQTASPAETEWLNKAISVQQEFDNLFWS 536
Query: 531 REGGGYFNTTGEDPSVLLRVKEDH--DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
E GGY+N + ++ L+ VKE D A PS N V+V NL+RLA + + Y A
Sbjct: 537 VEMGGYYNNSTDNSQDLI-VKERSYIDNATPSANGVAVTNLIRLARLTENLE---YLSQA 592
Query: 589 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 648
E +L F + LK A P + A D ++ + V K + L
Sbjct: 593 EQTLQAFSSILKQSPQACPSLFTALDWY-----RYSISVRSKPDI--------------L 633
Query: 649 NKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLEN 708
+ + P +D H AD+V LVCQ SC P SLE
Sbjct: 634 ERLIFQYFPTAVYRVD----HQ-----------LADQVEGLVCQGLSCLEPAR---SLEK 675
Query: 709 LLLEKPSSTA 718
L + +T+
Sbjct: 676 LQQQIKQATS 685
>gi|345008957|ref|YP_004811311.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
gi|344035306|gb|AEM81031.1| hypothetical protein Strvi_1280 [Streptomyces violaceusniger Tu
4113]
Length = 678
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 218/689 (31%), Positives = 322/689 (46%), Gaps = 74/689 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A LN FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDKATADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P GTYFPP + G F+ +L V AW +R+ + +E L++
Sbjct: 108 VFLTPEAQPFYFGTYFPPRPRPGMASFRQVLEGVSAAWTDRREEVVDVAGRIVEDLAQRT 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S+ P + L L++ +D+ GGFG APKFP + ++ +L H +
Sbjct: 168 GIALGSDA-PAPPGEEDLHAALMGLTREFDATRGGFGGAPKFPPSMALEFLLRHHAR--- 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG G +MV T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 224 TGSEG----ALQMVSATCEAMARGGIYDQLGGGFARYSVDAGWTVPHFEKMLYDNALLCR 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T + + D++ R++ G SA DADS +G R EGA+YV
Sbjct: 280 VYAHLWRATGSDLARRVALETADFMVRELRTAQGGFASALDADS--DDGTGRHVEGAYYV 337
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT + + ++LGE F Y+ F+ +++L D A
Sbjct: 338 WTPERLREVLGEADAEFAAGYF-----------GVTQEGTFEQGASVLQLPDGKRPADA- 385
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ R +L R +R RP DDK++ +WNGL +++ A
Sbjct: 386 --------GRVASVRERLLAARERRARPGRDDKIVAAWNGLAVAALAETGAYF------- 430
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLI 498
DR + ++VA AA + R L+ +Q RL + +G + G L+DYA +
Sbjct: 431 ---------DRPDLVDVATEAAELLMR-LHMDQRGRLARTSLDGTAGGHAGVLEDYADVA 480
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
G L L W+ +A L +T F E G F+T + +++ R ++ D A
Sbjct: 481 EGFLALSAVTGDGAWVDFAGLLLDTVLTRFT-AEDGTLFDTADDAEALIRRPQDPTDNAA 539
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCAA 613
PSG + + L+ A+I S+ +R+ AE +LAV ++ + VP + A
Sbjct: 540 PSGWTAAAGALLSYAAITGSSR---HRETAERALAV----VRALGPRVPRFIGWGLAVAE 592
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
L P + V +VG + AA + V +P E
Sbjct: 593 ARLDGP--REVAVVGPGDDPATRALHRAALLATAPGAVVAVGEPGSGE-----------V 639
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + A VC+ F+C P D
Sbjct: 640 PLLQDRPLLEGRPAAYVCRGFTCDAPTAD 668
>gi|440700552|ref|ZP_20882794.1| hypothetical protein STRTUCAR8_07071 [Streptomyces turgidiscabies
Car8]
gi|440276815|gb|ELP65027.1| hypothetical protein STRTUCAR8_07071 [Streptomyces turgidiscabies
Car8]
Length = 677
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 233/701 (33%), Positives = 336/701 (47%), Gaps = 83/701 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A LN+ FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDQATADYLNENFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPPE + G P F+ +L V+ AW +RD +A+ + L+
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRSGMPSFREVLEGVRSAWTDRRDEVAEVAQKIVRDLA-GR 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + P E Q L L++ YD++ GGFG APKFP + ++ +L H +
Sbjct: 167 EIGYGATEAPTEEDQARALLG---LTREYDAQRGGFGGAPKFPPSMVLEFLLRHGAR--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 TGSEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCR 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T + + D+L R++ G SA DADS +G + EGA+YV
Sbjct: 277 VYAHLWRATGSELARRVALETADFLVRELRTAEGGFASALDADS--DDGTGKHVEGAYYV 334
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASAS 377
WT ++ ++LG E A L +++ + G + +G +VL + ++ A
Sbjct: 335 WTPAQLTEVLGAEDAELAAQYFGVTADGTFE-----------EGASVLQLPQHEGVFDAE 383
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
K+ + +L R +RP P DDKV+ +WNGL I++ A
Sbjct: 384 KVDY-----------VKARLLAARGERPAPGRDDKVVAAWNGLAIAALAET--------- 423
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
A F P +A +R HL D++ H L + ++G A G L+DYA
Sbjct: 424 GAYFERP------DLVDAALAAADLLVRVHL-DDRAH-LARTSKDGQVGANAGVLEDYAD 475
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ G L L WL +A L + F+D E G F+T + ++ R ++ D
Sbjct: 476 VAEGFLALASVTGEGVWLEFAGFLLDHVLVRFVDEESGALFDTASDAEQLIRRPQDPTDN 535
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCC 611
A PSG + + L+ A+ +R AE +L V +K + VP +
Sbjct: 536 AVPSGWTAAAGALLGYAAQTGAVP---HRAAAERALGV----VKALGPRVPRFIGWGLAV 588
Query: 612 AADMLSVPSRKHVV--LVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A +L P VV +G ++V A A V+ + D+EE+
Sbjct: 589 AEALLDGPREVAVVGPSLGDPATVALHRTALLATAP----GAVVAVGSVDSEELPL---- 640
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC+NF+C P TDP L L
Sbjct: 641 ------LAGRPLVGGAAAAYVCRNFTCDAPTTDPERLRIAL 675
>gi|284989523|ref|YP_003408077.1| hypothetical protein Gobs_0945 [Geodermatophilus obscurus DSM
43160]
gi|284062768|gb|ADB73706.1| protein of unknown function DUF255 [Geodermatophilus obscurus DSM
43160]
Length = 665
Score = 322 bits (825), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 229/691 (33%), Positives = 318/691 (46%), Gaps = 78/691 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A +N FV +KVDREERPDVD VYM QAL G GGWP++V
Sbjct: 49 ACHWCHVMAHESFEDEATAGQMNADFVCVKVDREERPDVDSVYMAATQALTGHGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F +PD +P GTYFPP +G P F+ +L V DAW +R+ L +G E +S L
Sbjct: 109 FTTPDGRPFYCGTYFPPRPAHGMPSFRQLLSAVSDAWRSRREDLETAGTRIAEGISSRLD 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
P L L L+ YD R+GGFG APKFP + ++ +L H+ + D
Sbjct: 169 LGP-----PAPLAAEVLDHAVAALAGEYDERWGGFGGAPKFPPSMVLEFLLRHAARTGD- 222
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+M TL MA+GGIHD + GGF RYSVD RW VPHFEKMLYD L +
Sbjct: 223 ------DRALRMARGTLGAMARGGIHDQLAGGFARYSVDARWVVPHFEKMLYDNALLLRL 276
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL + T D + + +L RD+ P G SA DAD+ EG T YVW
Sbjct: 277 YLHLWRATGDEWARRVADATAAFLVRDLDTPEGGFASALDADAEGVEGLT-------YVW 329
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T E+ ++LGE + + ++D G + L L D A
Sbjct: 330 TPAELVEVLGEDDGRWAAAVF----------EVTDAGTFEHGTSTLQLLRDPGDPAR--- 376
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
L R +L R++RP+P DDKV+ +WNGL I++ A + S +
Sbjct: 377 ---------LASVRERLGAARARRPQPARDDKVVTAWNGLAIAALAEHGVLTGSPS---- 423
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLIS 499
+ + +V H D RL+ + RNG + AP G L+DY L
Sbjct: 424 -SVDAARRAAELLADV----------HWGD---GRLRRASRNGVAGAPSGVLEDYGDLAE 469
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
GLL L++ +WL A +L + F+D + G+ +T + +++ R + DG P
Sbjct: 470 GLLALHQATGEGRWLELAGDLLDVVAGQFIDAD--GWHDTAADAEALVHRPFDPADGPTP 527
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
SG + V A++ + + A SLA R M +L+ P
Sbjct: 528 SGLAAVAGAAVTYAALAGAPRHRELGEAAVGSLARLAERAPQAVGWA--MAVGEALLAGP 585
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
V V + D + ++AAA AS V+ +P D + +A
Sbjct: 586 LE---VAVSGPAGPDRDALVAAARASTSPGAVVVVGEP-DAPGVPL----------LAGR 631
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VC+ F C+ PVTD +L L
Sbjct: 632 PLVGGRPAAYVCRGFVCAAPVTDVSALGAAL 662
>gi|269125325|ref|YP_003298695.1| hypothetical protein Tcur_1071 [Thermomonospora curvata DSM 43183]
gi|268310283|gb|ACY96657.1| protein of unknown function DUF255 [Thermomonospora curvata DSM
43183]
Length = 662
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 235/691 (34%), Positives = 323/691 (46%), Gaps = 90/691 (13%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFEDE A+L+ND FV+IKVDREERPDVD VYM QA+ G GGWP++VF
Sbjct: 49 CHWCHVMAHESFEDEATARLMNDLFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTVF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
+PD +P GTYFP R F+ +L V AW ++R+ + + G +E L+ A
Sbjct: 109 ATPDGEPFYCGTYFP------RQQFRALLMAVARAWREEREDVLKQGRKVVEALTARGPA 162
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ E A+R L+ SYD+ +GGFG APKFP + ++ +L H + +D
Sbjct: 163 PGETEPPSPERLSAAVR----SLAASYDTAYGGFGGAPKFPPSMVLEFLLRHYARTQD-- 216
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++ M TL+ MA+GGI+D +GGGF RYSVDE W VPHFEKMLYD LA VY
Sbjct: 217 -----AQALAMATGTLEAMARGGIYDQLGGGFARYSVDEAWVVPHFEKMLYDNALLARVY 271
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
+ LT I + +++ RD+ P G + SA DADS EG +EG +YVWT
Sbjct: 272 AHWWRLTGSPLAKRIALETCEWMLRDLRTPQGGLASALDADS---EG----QEGKYYVWT 324
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGN--CDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+++ +LGE GN +L +++ G +VL D
Sbjct: 325 PEQLRRVLGEA------------DGNAAAELLGVTESGTFEHGTSVLRLPGDPGDQ---- 368
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
R +L R++R P DDKV+ +WNGL I++ A +L
Sbjct: 369 --------EWWSRVRARLLAARAERVPPARDDKVVTAWNGLAIAALAECGALLG------ 414
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
R + + AE A +R HL D RL + R+G P G L+DYA
Sbjct: 415 ----------RPDLVGAAEEIARLLREVHLRD---GRLTRTSRDGVPGANAGVLEDYADF 461
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDG 556
GLL L+ + A L T F D GG F T +D L R +D D
Sbjct: 462 AEGLLALHAVTGDPAHVRLAGTLLETVLTHFPDDRGG--FYDTADDAERLFRRPQDPTDN 519
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSG + L+ A++ S+ +RQ A +LA A A+ L
Sbjct: 520 ATPSGQFAAAGALLSYAALTGSSR---HRQAAASALAAATLLAGRHARFAGWGLAVAEAL 576
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
V + +VG + + AA AS PA + + + +
Sbjct: 577 -VSGPLEIAIVGDPADARTRALHGAALAS-----------PAPGAVITVGTGEAAGDVPL 624
Query: 677 ARNNFSADKV-VALVCQNFSCSPPVTDPISL 706
R D A VC+NF+C PVT P L
Sbjct: 625 LRGRTPVDGAPAAYVCRNFTCRLPVTTPADL 655
>gi|383649966|ref|ZP_09960372.1| hypothetical protein SchaN1_31668 [Streptomyces chartreusis NRRL
12338]
Length = 677
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 229/700 (32%), Positives = 331/700 (47%), Gaps = 81/700 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A+ LN +VS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDQQTAEYLNAHYVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPP + G P F+ +L+ V AW+++RD + + + L+
Sbjct: 108 VFLTPDAEPFYFGTYFPPAPRQGMPSFRQVLQGVHQAWEERRDEVTEVAGKIVRDLAGRE 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+S + EL Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 168 ISYGDAQTPGEQELAQALL-----ALTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGAEG----ALQMAQDTCERMARGGIYDQIGGGFARYSVDRDWIVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS +G + EGA+Y
Sbjct: 276 RVYAHLWRATGSEPARRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAYY 333
Query: 319 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT ++ ++LGE A L ++ + G + R S
Sbjct: 334 VWTPAQLREVLGEQDAELAARYFGVTEEGTFEHGR------------------------S 369
Query: 378 KLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
L +P + L + + R +L RS RP P DDKV+ +WNGL I++ A
Sbjct: 370 VLQLPQQDGLFDADRIASIRERLLAARSGRPAPGRDDKVVAAWNGLAIAALAET------ 423
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDD 493
A F+ P +A +R HL DEQ RL + ++G + A G L+D
Sbjct: 424 ---GAYFDRP------DLVEAALAAADLLVRLHL-DEQA-RLTRTSKDGHAGANAGVLED 472
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA + G L L WL +A L + F D E G F+T + ++ R ++
Sbjct: 473 YADVAEGFLALASVTGEGVWLEFAGFLLDHVLARFTDEESGALFDTAADAERLIRRPQDP 532
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC--- 610
D A PSG + + L+ S A + S +R AE +L V +K + VP
Sbjct: 533 TDNAAPSGWTAAAGALL---SYAAHTGSQPHRTAAEKALGV----VKALGPRVPRFIGWG 585
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
AA ++ + V +VG + L V+ + ++E
Sbjct: 586 LAAAEAALDGPREVAVVGPSLEHEGTRTLHRTALLGTAPGAVVAVGAPGSDEFPL----- 640
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC+NF+C P T+ L L
Sbjct: 641 -----LADRPLVGGEPAAYVCRNFTCDAPTTEADRLRATL 675
>gi|118579500|ref|YP_900750.1| hypothetical protein Ppro_1067 [Pelobacter propionicus DSM 2379]
gi|118502210|gb|ABK98692.1| protein of unknown function DUF255 [Pelobacter propionicus DSM
2379]
Length = 687
Score = 322 bits (824), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 234/691 (33%), Positives = 320/691 (46%), Gaps = 87/691 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM + FED+ VA LLN FV IKVDREERPD+D YMT Q L G GGWPL++
Sbjct: 76 TCHWCHVMAHDGFEDDQVADLLNRHFVCIKVDREERPDIDDFYMTASQVLTGSGGWPLNI 135
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++PD +P TY P R F +L + W + + ++ + +E +
Sbjct: 136 FMTPDRRPFFAMTYLP------RQRFMELLAGIVTLWQQHPGEVEKNCSAIMEGIERLSR 189
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + EL A EQLS +D +GGFG APKFP P+ + L
Sbjct: 190 GNDHECPVLAELDSLAF----EQLSAIHDRTWGGFGPAPKFPLPLSLGW-------LAGQ 238
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G +G E +M TL + +GGI D +GGG HRYSVDERW VPHFEKMLYDQ LA
Sbjct: 239 GMNGN-QEALEMAQKTLGMIRQGGIWDQLGGGVHRYSVDERWLVPHFEKMLYDQALLAMA 297
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
LD D + + DI ++ R++ G FSA DADS +EGA+Y+W
Sbjct: 298 CLDVCLAGNDPAFLTMAEDIFRFVGRELTSTEGAFFSALDADSG-------GEEGAYYLW 350
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T ++E+ILG LF + + GN F+G+N+L D + G
Sbjct: 351 TRDDIEEILGRDGELFCRFFDVGEKGN------------FQGQNILHMPVDLETFCT--G 396
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
E+ IL +CR +L + R +R P D+K+I SWNGL+I++ AR +
Sbjct: 397 EDPERTGEILDDCRERLLEYREERSYPLRDEKIITSWNGLMIAALARGGAL--------- 447
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+EY+E A AA FI ++L Q RL S+ GPS P FL+DYAFL G
Sbjct: 448 -------GGEQEYIESASRAARFILKNLR-RQDGRLLRSYLAGPSSTPAFLEDYAFLCCG 499
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEP 559
L++L+E + W A+ L + LF D F T G D + + D DG P
Sbjct: 500 LIELFEATLDSFWQEQALLLADEMLRLFRD-PVRCVFVTVGLDAEQMAGQSPRDSDGVLP 558
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 619
S S + +RL A + D A + ++ + + A + P
Sbjct: 559 SPFSRAAHCFIRLG--YACDRDDLLDHAHLLLGAPLDDAAENPLSHLGALQALAMLEQEP 616
Query: 620 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARN 679
+ H G + S ++LA+ S+ L VI D E
Sbjct: 617 TIIH--FRGQRDSRRIASLLASTR-SFPLPNLVIRFTETDHEGE---------------- 657
Query: 680 NFSADKVVALVCQNFSCSPPVTDPISLENLL 710
ALVC SC P D SLE L
Sbjct: 658 --------ALVCAQGSCHGPFPDESSLERQL 680
>gi|322702606|gb|EFY94241.1| hypothetical protein MAA_10309 [Metarhizium anisopliae ARSEF 23]
Length = 738
Score = 322 bits (824), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 213/670 (31%), Positives = 333/670 (49%), Gaps = 71/670 (10%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+C +M ESF + A +LN+ FV + +DREERPDVD +YM YVQA+ GG
Sbjct: 78 HIGYKACHFCRLMTQESFSNPECAAILNESFVPVIIDREERPDVDTIYMNYVQAVSNVGG 137
Query: 76 WPLSVFLSPDLKPLMGGTYFP---------PEDKYGRPGFKTILRKVKDAWDKKR----- 121
WPL+VF++P+L+P+ GGTY+P E + P TI RKV+D W +
Sbjct: 138 WPLNVFVTPNLEPVFGGTYWPGPGTSRRVTTESEDESPDCLTIFRKVRDIWHDQETRCRK 197
Query: 122 ---DMLAQSGAFAIEQL-----------------------SEALSASASSNKLPDELPQN 155
++LAQ FA E + + A ++ EL +
Sbjct: 198 EASEVLAQLREFAAEGTLGTRGLTGTHPIATPSWNIPSNPTTPIRARDKDAQVSSELDLD 257
Query: 156 ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKSGEASEGQKM 212
L ++ ++D +GGFG APKF P ++ +L+ + ++D E +M
Sbjct: 258 QLEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLAFLLHLNTFPSAVQDVVGEAECKHATEM 317
Query: 213 VLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--- 268
+ TL+ + G +HDH+G GF R SV W +P+FEK++ D L +Y+DA+ +
Sbjct: 318 AVDTLRKIRDGALHDHIGATGFARCSVTPDWSIPNFEKLVVDNALLLALYVDAWRIAGGK 377
Query: 269 KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
D + I ++ DYL I P G + ++E ADS G +EGA+Y+WT +E +
Sbjct: 378 ADSEFYDIVLELADYLSSPPIALPSGGLATSEAADSFMRRGDREMREGAYYLWTRREFDS 437
Query: 328 ILG------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
++ + + + H+ ++ GN D DP+++F N+L + + + +
Sbjct: 438 VVDASGHDKQISQVAAAHWDVQEGGNVDEDH--DPNDDFINHNILRVVKTQDELSRQFNI 495
Query: 382 PLEKYLNILGECRRKL-FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ + R++L +R RP LDDKVI +WNGL IS+ A+AS LK
Sbjct: 496 SPDTVRQHIQAARKELKARRERERVRPELDDKVITAWNGLAISALAQASSALK------- 548
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
PV + +Y+ AESAA+FI+ L+DE + L +R G + GF DDY +LI G
Sbjct: 549 ---PVDSARSDKYLHAAESAAAFIKASLWDESSKLLYRIYREG-RETKGFADDYTYLIHG 604
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LLDL+ S L +A LQ TQ+ LF D + G +F+TT P +LR+K+ D + PS
Sbjct: 605 LLDLFAATSDEGHLAFADALQKTQNSLFHDSDSGAFFSTTASSPQAILRLKDGMDTSLPS 664
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
N+V+ NL RL +++ + Y A ++ FE + P + +
Sbjct: 665 VNAVAASNLFRLGALL---DDERYSALARGTVNAFEAEMLQHPWLFPGLLSGVVTARLGP 721
Query: 621 RKHVVLVGHK 630
R+ V V +K
Sbjct: 722 RESVSDVKYK 731
>gi|375012491|ref|YP_004989479.1| thioredoxin domain-containing protein [Owenweeksia hongkongensis
DSM 17368]
gi|359348415|gb|AEV32834.1| thioredoxin domain-containing protein [Owenweeksia hongkongensis
DSM 17368]
Length = 675
Score = 321 bits (823), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 227/706 (32%), Positives = 337/706 (47%), Gaps = 107/706 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME +SFED A L+N+ F+SIKVDREERPDVD+VYMT VQ + G GGWPL+
Sbjct: 64 SACHWCHVMEHQSFEDSAAAALMNEHFISIKVDREERPDVDQVYMTAVQLMTGRGGWPLN 123
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V PD +P+ GGTYFP + G+ L+ + + + + + + E+L+E +
Sbjct: 124 VITLPDGRPIWGGTYFP------KDGWMQSLQSIVEVYHDDPEKVLEYA----EKLTEGV 173
Query: 140 SAS--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S S N+ P + + + L + SK++D + GG APKFP PV + +L
Sbjct: 174 VQSELVSPNETPGDYSKEEIDLLFKNWSKNFDKKEGGSAGAPKFPMPVGYEFLL------ 227
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ G E + + TL+ MA GGI+D VGGGF RYSVD+ W VPHFEKMLYD GQL
Sbjct: 228 -EYGSLTGNEEAMQQLNLTLRKMAFGGIYDQVGGGFSRYSVDDEWKVPHFEKMLYDNGQL 286
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++Y A+ TK+ Y I +++L RDM+GP GE +SA DADS EG +EG +
Sbjct: 287 VSLYSRAYQKTKNPLYKSIVIQTIEWLERDMLGPDGEFYSALDADS---EG----EEGKY 339
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVW E+++I+G+ +Y+ DL + +++G+ VL+ +DS + S
Sbjct: 340 YVWPEVELKEIIGDSDWEDFTNYF-------DLKK-----GKWEGRIVLMRSDDSENTDS 387
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
E ++L VR R P LDDK + SWN L+I+ A K
Sbjct: 388 AKVKAWE----------QELLKVRENRVPPGLDDKSLTSWNALMITGLVDAYKAFGD--- 434
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
Y+++A+ ++ ++ + L HS++ G S G ++DY F
Sbjct: 435 -------------SHYLDLAKKNGEWLLKNQV-RKDESLFHSYKKGKSSIDGLIEDYTFA 480
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ G LDLYE K+L A F D G +F + ++ + E HD
Sbjct: 481 VQGFLDLYEATFDVKYLEQANAWMKYAKANFEDEGTGLFFTRSKNAKQLIAKSMEVHDNV 540
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
P+ NSV NL L Y+ E LA E L M
Sbjct: 541 IPAANSVMAHNLFHL----------YHLTGNESYLAQSEKMLAQM--------------- 575
Query: 618 VPSRKHVVLVGHKSSV-DFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN---- 672
V LV + S ++ +L + Y + I + AD + M++ ++ N
Sbjct: 576 ----DKVRLVTYPESFSNWARLL--LNFKYPFYEVAIVGNEADEKYMEWQKQFVPNVLIQ 629
Query: 673 ------NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ + N F + VC+N C PV + +LLL+
Sbjct: 630 GSWKESDLPLLENRFVKGSTMIYVCENRVCQLPVEEVSKALDLLLK 675
>gi|358457848|ref|ZP_09168063.1| N-acylglucosamine 2-epimerase [Frankia sp. CN3]
gi|357078866|gb|EHI88310.1| N-acylglucosamine 2-epimerase [Frankia sp. CN3]
Length = 673
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 220/615 (35%), Positives = 308/615 (50%), Gaps = 62/615 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFED+ A +N+ FV+IKVDREERPDVD VYM AL G GGWP++V
Sbjct: 49 SCHWCHVMAHESFEDDTTAAYMNEHFVNIKVDREERPDVDSVYMDVTMALTGHGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P +P GTYFPP + G F+ +L V AWD +R+ + SGA +L+EA
Sbjct: 109 FLTPTGEPFFAGTYFPPTPRPGMGSFRQVLSAVSSAWDTRREEIESSGADIARKLAEAAE 168
Query: 141 ASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A + + P L L +QL+ +D R GGFG APKFP + +++L H +
Sbjct: 169 APVAGGRGPAIRLDGELLDTAVDQLAARFDPRHGGFGGAPKFPPSMVAELLLRHHAR--- 225
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG E S G MV T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD QL
Sbjct: 226 TGN--ERSLG--MVALTCERMARGGIYDQLTGGFARYSVDATWTVPHFEKMLYDNAQLLR 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS-----AETEGATRKK- 313
VYL + T D + + R+ +L D+ P G SA DAD+ ++T+G +
Sbjct: 282 VYLHLWRTTGDALAARVVRETAAFLLTDLRTPQGGFASALDADAVPPSDSDTDGHPHQPV 341
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGA YVWT ++ D LG + A + + TG + G +VL D
Sbjct: 342 EGASYVWTPGQLADALGPDDAAWAANLFEVTATGTFE-----------HGSSVLALPADP 390
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
+ + R L R+ RP+P DDKV+ SWN +
Sbjct: 391 DDA------------DRFARVRATLAATRAARPQPARDDKVVASWN---------GLAVA 429
Query: 433 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFL 491
A+F P E++ AE AA +R HL D + R R GP+ G L
Sbjct: 430 ALAEAGALFEEP-------EWVTAAERAAVLLRDVHLVDGRLRRTSRDGRVGPNV--GVL 480
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
DDY + G L L++ +WL A +L + F + GG+++T + P++L R +
Sbjct: 481 DDYGNVADGFLALHQVTGAVEWLELAGQLLDVARARFRAAD-GGFYDTADDAPTLLRRPR 539
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMC 610
E D A PSG S L+ A++ + S +R++AE ++ + L +D A
Sbjct: 540 EVSDSATPSGQSAFAGALLTYAAL---TGSAGHREDAEATIGLLAPLLARDARFAGHAGT 596
Query: 611 CAADMLSVPSRKHVV 625
A +L+ P VV
Sbjct: 597 VAEALLAGPPEVAVV 611
>gi|302530109|ref|ZP_07282451.1| transcriptional regulator [Streptomyces sp. AA4]
gi|302439004|gb|EFL10820.1| transcriptional regulator [Streptomyces sp. AA4]
Length = 663
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 224/695 (32%), Positives = 329/695 (47%), Gaps = 103/695 (14%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE EG A L+N FV+IKVDREERPD+D VYM QA+ G GGWP++
Sbjct: 49 ACHWCHVMAHESFEHEGTAALMNAHFVNIKVDREERPDIDAVYMAATQAMTGQGGWPMTC 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P GTY+PP + G P F +L V +AW+++ D L + + L+E
Sbjct: 109 FLTPEGEPFHCGTYYPPAPRPGIPSFTQLLLAVAEAWEERPDDLREGAKQIVGHLAE--- 165
Query: 141 ASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S L + + +AL +L++ D GGFG APKFP + ++ +L H ++
Sbjct: 166 ---QSGPLKEAAVDADALAEAVTKLAQEADPVHGGFGGAPKFPPSMVLEFLLRHHER--- 219
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG +++ + + MA+GGIHD +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 220 TG----SAQAYALAESAAEAMARGGIHDQLGGGFARYSVDAEWIVPHFEKMLYDNALLLR 275
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + + I+ +L D++ P G ++ DAD+ EG T YV
Sbjct: 276 VYAH-LARRGSASARRVAEGIVRFLEHDLLTPQGGFAASLDADTEGVEGLT-------YV 327
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELNDSSA 374
WT ++ ++LGE E + + G + L +DP + + + V
Sbjct: 328 WTPAQLNEVLGEDGPWAAELFSVTEEGTFEEGASTLQLRADPDDFARFERV--------- 378
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
R+ L + R+ RP+P DDKV+ +WNGL IS+ A A L
Sbjct: 379 -------------------RQALLEARAARPQPGRDDKVVAAWNGLAISALAEAGVAL-- 417
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKAP-GFLD 492
+R +++E+A +AAS + HL D RL+ S R+G AP G L+
Sbjct: 418 --------------ERPQWIELARNAASLLLDLHLVD---GRLRRSSRDGAVGAPVGVLE 460
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVK 551
DYA L GLL L++ +WL A L + F G ++ T +D VL+ R
Sbjct: 461 DYACLADGLLALHQATGEPRWLTEATRLLDVALTHFASDSAPGAYHDTADDAEVLVQRPS 520
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
+ D A PSG S L+ +++ ++ YR AE +L R+ +A VP
Sbjct: 521 DPTDNASPSGASALAGALLTASALAGSDQAARYRDAAELAL----RRVGLLAARVPRF-- 574
Query: 612 AADMLSVPSRK-----HVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFW 666
A LSV V +VG + + ++ AA V+ +P D +
Sbjct: 575 AGHWLSVAEAAQSGPVQVAVVGGERA----QLVTAAAQHIHGGGIVLGGEP-DAPGVPL- 628
Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+A + A VC+ + C PVT
Sbjct: 629 ---------LADRPLVGGEAAAYVCRGYVCERPVT 654
>gi|289209063|ref|YP_003461129.1| hypothetical protein TK90_1902 [Thioalkalivibrio sp. K90mix]
gi|288944694|gb|ADC72393.1| protein of unknown function DUF255 [Thioalkalivibrio sp. K90mix]
Length = 677
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 231/696 (33%), Positives = 343/696 (49%), Gaps = 73/696 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPL 78
+ CHWCHVM ESFED A+++N F++IKVDREERPD+D++Y L GGWPL
Sbjct: 47 SACHWCHVMAHESFEDPATAEVMNRRFINIKVDREERPDLDRIYQNAHMLLSQRPGGWPL 106
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+VFL+PD P GTYFP ++G P F ++ +V D + D + + E L +A
Sbjct: 107 TVFLTPDQVPFFAGTYFPSTPRHGLPSFVDLMNRVADFLAEHPDEIQRQN----ESLQQA 162
Query: 139 LSA--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L+ + +P L +L++++D +FGGFG APKFP P ++ + +H+ +
Sbjct: 163 LARIYRPAGGAIP---AIGVLDKARAELAQTFDDQFGGFGDAPKFPHPASLEWLAWHAAR 219
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
D +E ++M+ TL MA GGI D VGGGF RYSVD RW +PHFEKMLYD G
Sbjct: 220 HND-------AEAERMLERTLAAMAAGGIFDQVGGGFCRYSVDARWMIPHFEKMLYDNGP 272
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L +Y + + D + + +L R+M P G +S+ DADS EG +EG
Sbjct: 273 LLGLYAERAAAGDDR-ARRVAEQTVAWLEREMRDPSGAFYSSLDADS---EG----EEGR 324
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
FYVW + VE +L E + + ++ P N F+G+ L E+ + A
Sbjct: 325 FYVWDPEMVEGLLPEDEWVVASRVW----------GLNGPAN-FEGRWHLHEVAPIATVA 373
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
LG+ + LG R +L R +R RPH DDK++ +WN L+I+ ARA++ L
Sbjct: 374 DALGIDESEAETRLGRARERLLAAREQRVRPHRDDKILGAWNALMINGLARAARAL---- 429
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAP-GFLDDY 494
+R +++ +A +A +R L+ + RL SFR G S+ P +LDD+
Sbjct: 430 ------------ERHDWLGLARAAMRAVRERLWHDG--RLFASFREGATSELPRAYLDDH 475
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A L+ L L E L WA L F D E GG+F T + +++ R K
Sbjct: 476 ALLLEATLALLEVEWDGDLLGWATTLAEALLADFEDTEHGGFFYTARDHEALIQRPKVYA 535
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D A +GN ++ L +L ++A + Y + AE +LA ++ + + A D
Sbjct: 536 DDAMAAGNGIAAQALQKLGYLLAEPR---YLEAAERTLANAGPMIEQAPLGHMSLLVALD 592
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
M P VVL G + AH D V I PA +++
Sbjct: 593 MHQQPP-PLVVLRGAADELAPWQQRLRAH---DAPMWVFAI-PAQADDL---------PP 638
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
++A V A +C+ C PVTDP +LE +L
Sbjct: 639 ALAEKAAPETGVRAYLCRGLHCEVPVTDPAALEGVL 674
>gi|195952439|ref|YP_002120729.1| hypothetical protein HY04AAS1_0059 [Hydrogenobaculum sp. Y04AAS1]
gi|195932051|gb|ACG56751.1| protein of unknown function DUF255 [Hydrogenobaculum sp. Y04AAS1]
Length = 634
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 214/612 (34%), Positives = 305/612 (49%), Gaps = 85/612 (13%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
+F K + FL ++CHWCHVME ESFEDE VA LN FVSIKVD+EERPD
Sbjct: 29 SEEAFDKAIKENKPVFLSIGYSSCHWCHVMEKESFEDEEVASFLNKCFVSIKVDKEERPD 88
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +Y+ Y L GGWPLSVFL+P +P GTYFP + F +L ++KD WD
Sbjct: 89 IDSLYIEYCVLLNNSGGWPLSVFLTPTKEPFFAGTYFP------KASFLKLLNQIKDLWD 142
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
K + + +EQL + +++ EL ++ + L+ YD FGGF A
Sbjct: 143 KDSKNIIEKSKRMVEQLKQFMNSFEKR-----ELNESFIDKALFGLANRYDEEFGGFSEA 197
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP + ++L K+ Q M L TL M +GGI DHVGGGFHRYS
Sbjct: 198 PKFPSLHNVLLLLKSQKQ-----------PFQDMALSTLLNMRRGGIWDHVGGGFHRYST 246
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D W +PHFEKMLYDQ Y +A+ LTK+ + +++++ ++ G +++
Sbjct: 247 DRYWLLPHFEKMLYDQAMAILAYSEAYRLTKNEIFKDTVYKTINFVKENLY-ENGFFYTS 305
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHN 358
DAD TEG +EG FY+WT +E++DIL E F E + +K GN + +
Sbjct: 306 MDAD---TEG----EEGGFYLWTYQEIKDILKEKTDKFIEFFNIKKEGNF----LDEAKR 354
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
+ GKNVL A + M E L +L K F R KR +P +DDK+++ N
Sbjct: 355 VYTGKNVLY--------AKEPTMLFENELQVL-----KAF--REKRKKPLIDDKILLDQN 399
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
++ + A + + K+++++A ++L + H LQH
Sbjct: 400 AMMDWALIEAYLVFED----------------KDFLDMA-------TKNLNNISKHPLQH 436
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
+ + P LDDYA+LI L LY+ L AI L E D+ GG++
Sbjct: 437 ALNHNKLIEP-MLDDYAYLIKAYLSLYKATFSKDALEKAISLTEEAIEKLWDKNAGGFYL 495
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
+ G+D VL+ K +DGA PSGNSV +NLV L I +K D Y E+ + +
Sbjct: 496 SVGKD--VLIPQKTLYDGAIPSGNSVMGLNLVELFFI---TKEDTY----ENRYQILSSI 546
Query: 599 LKDMAMAVPLMC 610
DM P C
Sbjct: 547 YSDMLSRNPTAC 558
>gi|302542885|ref|ZP_07295227.1| conserved hypothetical protein [Streptomyces hygroscopicus ATCC
53653]
gi|302460503|gb|EFL23596.1| conserved hypothetical protein [Streptomyces himastatinicus ATCC
53653]
Length = 678
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 235/703 (33%), Positives = 332/703 (47%), Gaps = 86/703 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED A+ LN FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDAETAEYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPP + G P F+ +L V+ AW +RD + +E L+
Sbjct: 108 VFLTPDAQPFYFGTYFPPRPRPGMPSFRQVLEGVRAAWADRRDEVRDVAGKIVEDLAGRT 167
Query: 140 SASASSNKLPDELPQNALRLCAE--QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ S P A L A L++ +D+ GGFG APKFP + ++ +L H +
Sbjct: 168 GIALGSGA---PQPPGAEDLAAGLMGLTREFDAVRGGFGGAPKFPPSMALEFLLRHHAR- 223
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +MV T + MA+GGI+D +GGGF RY+VD W VPHFEKMLYD L
Sbjct: 224 --TGSEG----ALQMVQATCEAMARGGIYDQLGGGFARYAVDAEWIVPHFEKMLYDNALL 277
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY + T + + D+L R+M G SA DADS +G R EGA+
Sbjct: 278 CRVYAHLWRATGSDLARRVALETADFLVREMRTEQGGFASALDADS--DDGTGRHVEGAY 335
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT +++ + LGE Y+ +++ KG +VL +L D + A
Sbjct: 336 YVWTPEQLREALGEADAEQAAAYF----------GVTEEGTFEKGASVL-QLPDGARPAD 384
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
L R +L R +R RP DDK++ +WNGL I++ A
Sbjct: 385 A---------AQLASVRERLLAARERRERPGRDDKIVAAWNGLAIAALAETGAYF----- 430
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAF 496
DR + +E A AA + R L+ + RL + G A G L+DYA
Sbjct: 431 -----------DRPDLVEAATEAADLLVR-LHMDNGGRLARTSLGGAVGAHAGVLEDYAD 478
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HD 555
+ G L L W+ +A L +T F +G Y T +D L+R +D D
Sbjct: 479 VAEGFLALSAVSGEGVWVDFAGLLLDTVLHHFAAEDGTLY--DTADDAEALIRRPQDPTD 536
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A PSG + + L+ A++ S S +R+ AE +L V ++ +A VP +
Sbjct: 537 NAVPSGWTAAAGALLSYAAV---SGSGRHREAAERALGV----VRALAGRVPRFIGWGLA 589
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFWE 667
A L P + V +VG D + A H + L VI + ++E+ E
Sbjct: 590 VAEARLDGP--REVAVVGP----DDDPATRALHRAALLGTAPGAVIAVGAPGSDEVPLLE 643
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VC++F+C P D +L L
Sbjct: 644 G----------RVLLEGRPAAYVCRHFTCDAPTADVAALTAKL 676
>gi|357411497|ref|YP_004923233.1| hypothetical protein Sfla_2286 [Streptomyces flavogriseus ATCC
33331]
gi|320008866|gb|ADW03716.1| hypothetical protein Sfla_2286 [Streptomyces flavogriseus ATCC
33331]
Length = 675
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 225/687 (32%), Positives = 324/687 (47%), Gaps = 75/687 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFED VA LN FV +KVDREERPDVD VYM VQA G GGWP++V
Sbjct: 49 SCHWCHVMAHESFEDPSVADYLNAHFVPVKVDREERPDVDAVYMEAVQAATGQGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+ + +P GTYFPPE ++G P F+ +L V AW +R+ +A+ + L+ S
Sbjct: 109 FLTAEAEPFYFGTYFPPESRHGMPSFQQVLEGVAAAWTDRREEVAEVAGRIVRDLA-GRS 167
Query: 141 ASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+A+ LP E L Q LRL ++ YD R GGFG APKFP + I+ +L H +
Sbjct: 168 LAAAEGGLPGEPELAQALLRL-----TRDYDERHGGFGGAPKFPPSMVIEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGAEG----ALQMAADSCAAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS + +G R EGAFY
Sbjct: 276 RVYAHLWRATGSDLARRVALETADFMVRELRTAEGGFASALDADSEDAQG--RHVEGAFY 333
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VWT ++ ++LGE F Y+ +++ +G +VL + A +
Sbjct: 334 VWTPAQLREVLGEDDAAFAAEYF----------GVTEEGTFEEGSSVLRLVPAGEAEPAD 383
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
E+ + G +L R RPRP DDKV+ +WNGL I++ A
Sbjct: 384 D----ERIAGVRG----RLLAARELRPRPERDDKVVAAWNGLAIAALAETGAYF------ 429
Query: 439 AMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAF 496
DR + +E A AA +R H+ D RL + ++G G L+DY
Sbjct: 430 ----------DRPDLVERATEAADLLVRVHMGD--VARLCRTSKDGRAGDNSGVLEDYGD 477
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ G L L WL +A L + + F E G F+T + ++ R ++ D
Sbjct: 478 VAEGFLALASVTGEGAWLEFAGFLLDIVLQHFTG-EKGQLFDTADDAEQLIRRPQDPTDN 536
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A P+G + + L+ S A + S+ +R AE +L V + A+ L
Sbjct: 537 ATPAGWTAAAGALL---SYAAHTGSEAHRAAAEGALGVVGALGPKAPRFIGWGLAVAEAL 593
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHIDPADTEEMDFWEEHNSNNAS 675
R+ V A + +L++T ++ P + + S
Sbjct: 594 LDGPREVAV---------------AGPVAGELHRTALLGRAPGAVVAVGVGPDAGSEFPL 638
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTD 702
+ + A VC++F C P TD
Sbjct: 639 LVDRPLAGGAPTAYVCRHFVCDAPTTD 665
>gi|409096974|ref|ZP_11216998.1| hypothetical protein PagrP_00615 [Pedobacter agri PB92]
Length = 686
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 204/586 (34%), Positives = 292/586 (49%), Gaps = 56/586 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+ VA+++N FV IKVDREERPD+D++YM +Q + G GGWPL+
Sbjct: 69 CHWCHVMERESFENFEVAEVMNKHFVCIKVDREERPDIDQIYMYAIQLMTGSGGWPLNCI 128
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEAL 139
PD +P+ GGTYF D + IL V W + + Q + SE +
Sbjct: 129 CLPDQRPIYGGTYFRKND------WVNILENVAALWSNEPEKAIQYAERLTSGIRDSEKI 182
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S + DE L E + +D FGG+ APKFP P +L + L+D
Sbjct: 183 IPSVTKEDYTDE----HLTEIIEPWKRHFDISFGGYNRAPKFPLPNNWVFLLRYG-YLKD 237
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
A V TL+ M++GGI+D +GGGF RYSVD++WHVPHFEKMLYD QL +
Sbjct: 238 DESVFTA------VCHTLEEMSRGGIYDQIGGGFARYSVDDKWHVPHFEKMLYDNAQLIS 291
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y +A+ TK + + ++++ +M P G +SA DADS EG EG FYV
Sbjct: 292 LYAEAYQCTKFNSFKQTAVESINWVFNEMTSPEGLFYSALDADS---EGI----EGKFYV 344
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W E D+LG+ A L E++ + GN E + N+L ++ SK
Sbjct: 345 WDKTEFYDLLGDDAQLLGEYFNITEEGNW----------EEEQTNILRKILSDDDILSKH 394
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ E + + KL ++R++R RP LDDK + +WNG++I + A A+ +L +
Sbjct: 395 NIDAETLYTKVESAKAKLLNIRNQRIRPGLDDKCLTAWNGMMIKALADAATVLSHDL--- 451
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
Y + A +AA FI +L + L + +NG + FLDDYAFLI
Sbjct: 452 -------------YYQKAAAAARFILVNL-KTASGGLYRNCKNGKASITAFLDDYAFLIE 497
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
L+ LYE+ WL A + E F D E +F T+ S++ R E D P
Sbjct: 498 ALIALYEYDFDENWLNEAKSFTDYVLENFSDSESPMFFYTSATGESLIARKHEVMDNVIP 557
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
+ NS NL +L + + Y A LA + ++K A
Sbjct: 558 ASNSTMAQNLTKLGLLF---DLEGYNNKAAEMLAAVQPKIKTYGSA 600
>gi|398782996|ref|ZP_10546612.1| hypothetical protein SU9_09379 [Streptomyces auratus AGR0001]
gi|396996281|gb|EJJ07275.1| hypothetical protein SU9_09379 [Streptomyces auratus AGR0001]
Length = 623
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 224/683 (32%), Positives = 323/683 (47%), Gaps = 70/683 (10%)
Query: 28 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 87
M ESFED A LLND FV++KVDREERPDVD VYM VQA G GGWP++VFL+PD +
Sbjct: 1 MAHESFEDPATAALLNDHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAE 60
Query: 88 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEALSASASSN 146
P GTYFPPE ++G P F IL V+ AW +RD + + +G + +LSAS ++
Sbjct: 61 PFYFGTYFPPEPRHGMPSFAQILEGVRSAWADRRDEVGEVAGRIVADLAGRSLSASLPAD 120
Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
+ P + L L++ +D+ GGFG APKFP P+ ++ +L H + G
Sbjct: 121 RRPPRAEE--LHTALMGLTREFDAAHGGFGGAPKFPPPMVLEFLLRHHARTASAGA---- 174
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
+MV T MA+GGI+D +GGGF RY+VD W VPHFEKMLYD L Y +
Sbjct: 175 ---LEMVQATCAAMARGGIYDQLGGGFARYAVDATWTVPHFEKMLYDNALLCRTYAHLWR 231
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
T + D++ R++ G SA DADS +G R EGA+YVWT ++
Sbjct: 232 STGSEEARRTAVETADFMVRELRTDQGGFASALDADS--DDGTGRHVEGAYYVWTPGQLR 289
Query: 327 DILGEHAILF-KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 385
+LGE F H+ + G + +G +VL +L D+ E+
Sbjct: 290 AVLGEEDAEFAAAHFGVTEEGTFE-----------EGASVL-QLPDTEGLVDA-----ER 332
Query: 386 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 445
+ R++L R +RPRP DDKV+ WNGL I++ A
Sbjct: 333 VARV----RQRLLAAREERPRPGRDDKVVACWNGLAIAALAETGAYF------------- 375
Query: 446 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDL 504
DR + ++ A AA + R D Q RL + R+G P G L+DYA + G L L
Sbjct: 376 ---DRPDLIQAATDAADLLVRVHMDAQV-RLHRTSRDGTPGANSGVLEDYADVAEGFLTL 431
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
W+ +A L +T L E G ++T + +++ R ++ D A PSG +
Sbjct: 432 ASVTGEGVWVEFAGFLLDTV-LLQFTTEDGALYDTAADAEALIRRPQDPTDNATPSGWTA 490
Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 624
+ L+ A++ + S +R AE +L + T L A A ++ + V
Sbjct: 491 AAGALLSYAAL---TGSGRHRDAAERALGIV-TALAGRAPRFIGWGLAVAEAALDGPREV 546
Query: 625 VLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSAD 684
+VG + AA V P ++ + +N D
Sbjct: 547 AVVGPPGDPATAALHHAALLGTAPGAVVAMGAP------------GADEVPLLQNRPLVD 594
Query: 685 -KVVALVCQNFSCSPPVTDPISL 706
K A VC++F+C P TDP L
Sbjct: 595 GKPAAYVCRHFTCERPTTDPAEL 617
>gi|340619141|ref|YP_004737594.1| hypothetical protein zobellia_3176 [Zobellia galactanivorans]
gi|339733938|emb|CAZ97315.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
Length = 703
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 224/687 (32%), Positives = 340/687 (49%), Gaps = 86/687 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVME E+FE+E VAK++N+ F++IKVDREERPDVD+VYMT +Q + G GGWPL+
Sbjct: 84 SSCHWCHVMEDETFENEEVAKIMNENFINIKVDREERPDVDQVYMTALQLISGSGGWPLN 143
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V P+ KPL GGTY + R + +L K+ + L ++ E+ S+ +
Sbjct: 144 VITLPNGKPLYGGTY------HTREQWMQVLTKISE--------LYKNDPKKAEEYSDMV 189
Query: 140 SASASSNKLP------DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 193
+A + L + + + AL+ S ++D GG KF P + +L +
Sbjct: 190 AAGIAEANLVEPAKGFESITKEALKTSVANWSPNWDLEEGGEKGVQKFMIPSNLSFLLDY 249
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+ D + ++ V TL MA GG++D +GGGF+RYS D W VPHFEKMLYD
Sbjct: 250 AVLTGD-------DKAKRHVRNTLDKMALGGVYDQIGGGFYRYSTDAFWKVPHFEKMLYD 302
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q+ ++Y A++L KD Y + + +D+L R+M G +A DADS EG +
Sbjct: 303 NAQVLSLYSKAYTLFKDDAYKNVVWETIDFLDREMKDTNGGYHAALDADS---EG----E 355
Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
EG FYVW +E++ +LGE LF +Y + + GK VL D +
Sbjct: 356 EGKFYVWKEEELKSVLGEGFELFSAYYNINKEAVWE-----------DGKYVLHRKVDDA 404
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
+ + K I E +KL R+KR P DDK+I SWN L+++ F A K
Sbjct: 405 EFVKEHDIEQGKLNFIKSEWNKKLLAERNKRVFPRSDDKIITSWNALLVNGFVDAYKAF- 463
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+K ++E AES SFIR + Y Q +L H+F+ G + GF++D
Sbjct: 464 ---------------GQKRFLEKAESVFSFIRSNAY--QNGKLVHTFKKGSKRKEGFIED 506
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YAF+I L+LY T++L +A EL + F D G Y G D ++ R+ +
Sbjct: 507 YAFMIDASLELYGLTLNTEYLDFAKELNAKAEAGFADEASGMYHYNEGND--LIARIIKT 564
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
DG PS N+V NL RL + +++ E + ++ VP + +A
Sbjct: 565 DDGVLPSPNAVMAHNLFRLGHL-------------DYNTGYTEKAKRMLSAMVPALTESA 611
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
S+ + +L+ H FE + A L K + I +T + E +N
Sbjct: 612 PSY---SKWNALLLNHTYPY-FEIAVVGKDAEV-LIKALNEIHLPNTLVVGSKVE---SN 663
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPV 700
A + ++ + AD VC+N +C PV
Sbjct: 664 APLFKDRYVADGTFIYVCRNTTCKLPV 690
>gi|420252291|ref|ZP_14755426.1| thioredoxin domain protein [Burkholderia sp. BT03]
gi|398055929|gb|EJL47977.1| thioredoxin domain protein [Burkholderia sp. BT03]
Length = 664
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 247/706 (34%), Positives = 344/706 (48%), Gaps = 111/706 (15%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE+ +A L+N+ +VSIKVDR+ERPD+D++Y Q + GGGWPL+V
Sbjct: 49 ACHWCHVMAHESFENPRIASLMNERYVSIKVDRQERPDIDEIYQQVSQMMGQGGGWPLTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P +P GGTYFPP+D+YGRP F +L + +AW + D L + I Q+ +
Sbjct: 109 FLTPQGEPFFGGTYFPPDDRYGRPAFARVLIALSEAWRHRHDELRDT----IVQIQQGFR 164
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + P ++ A L++ D GG G APKFP P +ML ++
Sbjct: 165 QLDQAQQGPTAAVEDLPAQTARALTRDTDPAHGGLGGAPKFPNPSCYDLMLRVYER---- 220
Query: 201 GKSGEASEGQKMVLF-----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
++ LF TL MA GGI+D VGGGF RYSVD W VPHFEKMLYD G
Sbjct: 221 --------SREPTLFDALERTLDHMAAGGIYDQVGGGFARYSVDAHWAVPHFEKMLYDNG 272
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
QL +Y DA+ LT + I + L Y+ RDM P G +++EDADS EG +EG
Sbjct: 273 QLVKLYADAYRLTGKRTWRRIFEETLAYILRDMTHPEGGFYASEDADS---EG----QEG 325
Query: 316 AFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---IELND 371
FY W E++ +LGE L Y + GN + G VL +EL+
Sbjct: 326 KFYCWMPAEIKAVLGESEGALACRAYGVTERGNFE-----------HGATVLHRAVELD- 373
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
LE+ L R +L R++R RP DD ++ WNGL+I+ A
Sbjct: 374 ----------ALEE--TQLAGWRERLLAARARRVRPARDDNILTGWNGLMIAGLCAA--- 418
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKAPG 489
F G EY+ A+ AA+FI L D R+ +++G +K PG
Sbjct: 419 -----------FQATGV--PEYLSAAKRAANFIGNELTLADGGVFRV---WKDGVAKVPG 462
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR--EGGGYFNTTGEDPSVL 547
FL+DYAFL + LLDLYE ++L AIEL L LD+ E G YF +P ++
Sbjct: 463 FLEDYAFLCNALLDLYESCFDRRYLDRAIELAT----LILDKFWEDGLYFTPCDGEP-LV 517
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
R + +D A PSG S S VRL ++ + D Y AEH +ET + A
Sbjct: 518 HRPRAPYDSASPSGISSSAFAFVRLHAL---TGRDLYLDRAEHEFRRYETAAGSVPSAFA 574
Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
+ A D + + +V G K S + H +Y L V+
Sbjct: 575 HLIAARDFVQRGPLE-IVFAGEKYSAAV--LATGVHRAY-LPARVLAF------------ 618
Query: 668 EHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLLLE 712
+ + + R D + A VC+N +C+ P+T+ N LLE
Sbjct: 619 ---AEHVPIGRECHPVDGRAAAYVCRNRTCAAPMTE----GNALLE 657
>gi|342883561|gb|EGU84024.1| hypothetical protein FOXB_05444 [Fusarium oxysporum Fo5176]
Length = 870
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 210/661 (31%), Positives = 335/661 (50%), Gaps = 100/661 (15%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+C +M +E+F + A +LN+ F+ + VDREERPD+D +YM YVQA+ GG
Sbjct: 208 HIGYKACHFCRLMSIETFSNPDSASVLNESFIPVIVDREERPDLDAIYMNYVQAVSNVGG 267
Query: 76 WPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFK--------------TILRKVKDAW---- 117
WPL+VFL+P+L+P+ GGTY+ +G G + TI +KV+D W
Sbjct: 268 WPLNVFLTPNLEPVFGGTYW-----FGPAGRRHLSDDSTEEVLDSLTIFKKVRDIWIDQE 322
Query: 118 ----DKKRDMLAQSGAFAIEQL----------------------SEALSASASSNKLPDE 151
+ +++ Q FA E S A +A S + +E
Sbjct: 323 ARCRKEATEVVGQLKEFAAEGTLGTRSISAPSALGPAGWGAPAPSHASTAKEKSTAVSEE 382
Query: 152 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK---LEDTGKSGEASE 208
L + L ++ ++D FGGFG APKF P ++ +L K ++D E
Sbjct: 383 LDLDQLEEAYTHIAGTFDPVFGGFGLAPKFLTPPKLAFLLGLLKSPGAVQDVVGEAECKH 442
Query: 209 GQKMVLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 267
++ L T++ + G +HDH+GG GF R SV W +P+FEK++ D QL ++Y+DA+ +
Sbjct: 443 ATEIALDTMRHIRDGALHDHIGGTGFSRCSVTADWSIPNFEKLVTDNAQLLSLYIDAWKV 502
Query: 268 T----KDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 322
+ KD F + ++ +YL ++ P G S+E ADS +G K+EGA+YVWT
Sbjct: 503 SGGGEKDEFLDVVL-ELAEYLTSSPIVLPEGGFASSEAADSYYRQGDKEKREGAYYVWTR 561
Query: 323 KEVEDILGE----HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
+E + +L E + + ++ + GN + SDP+++F +N+L + +++
Sbjct: 562 REFDSVLDEIDSHMSPILASYWNVNQDGNVE--EESDPNDDFIDQNILRVKSTIEQLSTQ 619
Query: 379 LGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
P+EK + + RR L R + R RP LDDK++V WNGLVIS+ ++A+ LK+
Sbjct: 620 FSTPVEKIKEYIEQGRRALRKRREQERVRPDLDDKIVVGWNGLVISALSKAASSLKT--- 676
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ + +AE AA+ IR+ L+D R+ + +G F DDYA++
Sbjct: 677 -------LRPEQSSKCRAIAEQAAACIRKKLWD-GNERILYRIWSGGRGNTAFADDYAYM 728
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQN-------------------TQDELFLDREGGGYFN 538
I GLLDL E ++L +A LQ TQ LF D + G +F+
Sbjct: 729 IQGLLDLLELTGNQEYLEFADILQRESSQFPSHLTHPADHAITETQTSLFYDAD-GAFFS 787
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
T P +LR+K+ D + PS N+VSV NL RLA++++ +D A ++ FE
Sbjct: 788 TQANSPYTILRLKDGMDTSLPSTNAVSVANLFRLANLLS---NDDLAAKARQTINAFEVE 844
Query: 599 L 599
+
Sbjct: 845 V 845
>gi|23100033|ref|NP_693499.1| hypothetical protein OB2578 [Oceanobacillus iheyensis HTE831]
gi|22778264|dbj|BAC14534.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
Length = 691
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 223/709 (31%), Positives = 342/709 (48%), Gaps = 78/709 (11%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G ++F K ++ FL ++C WCH M ESF D+ VA LLN ++VSIKVDREERPD
Sbjct: 32 GEKAFNKARKEQKPIFLSIGYSSCTWCHNMNRESFMDQEVAALLNQYYVSIKVDREERPD 91
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
+D +YM Q + G GGWPL++ ++ D P GTYFP YG PG IL + +
Sbjct: 92 IDGLYMKACQMMTGHGGWPLTIIMTDDQVPFFAGTYFPKHQNYGLPGLMDILPTIAKKYA 151
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ +A+ ++++ +AL + S ++++R +QL++ +D +GGF
Sbjct: 152 EDPQQIAE----YMKKVEDALQDTLSKKSNESLTSEDSVR-TYQQLNELFDYPYGGFYKE 206
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP P + ++++ K D KMV TL+ + + DHVG G RY+
Sbjct: 207 PKFPSPHNLSFLIHYYYKTGD-------KNALKMVDMTLKSIFQSSTWDHVGFGVFRYAT 259
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
D +W PHFEKMLYDQ L +V +D F +TKD FY +I+ +++R+M G +++
Sbjct: 260 DRKWMFPHFEKMLYDQAFLLDVSVDMFLITKDPFYQLKVNEIIQFVKREMTAENGCFYAS 319
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPH 357
ADS +EGA+Y+W+ +E+ ILGE LF E Y + P G
Sbjct: 320 LSADS-------NGEEGAYYLWSLEEIYSILGEDEGDLFAEAYGIVPVG----------- 361
Query: 358 NEFKGKNVLIELNDSSAS-ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 416
+GKN+ S S AS G+ +EK L + KL R R P DDK++ S
Sbjct: 362 -VHQGKNLPYRSGISLESLASTYGIQVEKVKTTLTKSVDKLQKARLLRTAPATDDKILTS 420
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 476
WNG +I++ A+A + + E ++ A + + L + +R
Sbjct: 421 WNGYMIAALAKAGSVFQEE----------------NWINHAINTMKNLSDILIKD--NRW 462
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
++R G + GFLDDYA ++ G ++L++ L A + N +LF D GG+
Sbjct: 463 FANYRQGKTNTKGFLDDYAAILWGYIELHQATMEIDHLKKAKTIANDMIKLFWDSNDGGF 522
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
F + ++ R KE +D PSGNS++ I L RLA++ G S Y + + F
Sbjct: 523 FFVANDAEQLISREKEIYDSPIPSGNSLASIQLSRLANLT-GEMS--YYSYVDTMMYTFY 579
Query: 597 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHID 656
L+D L K V+++G + F ++ Y N IHI
Sbjct: 580 RELQDEPSGASFFMRNL-FLQQDQTKQVIIIGENTEAFFNHI----RKRYLPN---IHII 631
Query: 657 PADTEEMDFWEEHNSNNASMARNNFSADKV----VALVCQNFSCSPPVT 701
A TE +S+ A++ N + KV VC NF C+ P T
Sbjct: 632 SA-TE--------SSSLATLLPNGENYKKVNGQTTYYVCSNFHCNRPTT 671
>gi|428781674|ref|YP_007173460.1| thioredoxin domain-containing protein [Dactylococcopsis salina PCC
8305]
gi|428695953|gb|AFZ52103.1| thioredoxin domain protein [Dactylococcopsis salina PCC 8305]
Length = 678
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 220/619 (35%), Positives = 319/619 (51%), Gaps = 76/619 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ LN+ F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDSTIAQYLNENFIPIKVDREERPDLDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P D P GGTYFP E +YGRPGF IL+ ++ +D++++ L +F E ++
Sbjct: 108 IFLTPHDRVPFYGGTYFPLEPRYGRPGFLQILQAIRRFYDQEKEKL---NSFKGEVMT-L 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG---GFGSAPKFPRPVEIQMMLYHSK 195
L SA+ LP + L E L K ++ G G+ P FP Q+ ++
Sbjct: 164 LQRSAT-------LPSSETPLNRELLIKGLETAVGITSSRGTPPSFPMIPHAQLARRKTQ 216
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+++ EA Q+ + TL GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 217 FSDESRYDAEAITTQRGMDLTL-----GGIYDHVGGGFHRYTVDGTWTVPHFEKMLYDNG 271
Query: 256 QLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q+ + +S + + F S I + +L+R+M P G ++++DADS T +
Sbjct: 272 QIMEYLANLWSSGVKEPAFASAIAHAV-QWLQREMTAPEGYFYASQDADSFTTSEEAEPE 330
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI----- 367
EGAFYVW+ +E+E +L E + + + GN F+G NVL
Sbjct: 331 EGAFYVWSYQELESLLTPEELNALQSEFTVTSEGN------------FEGNNVLQRQTGG 378
Query: 368 ELNDSSASASK---------LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
EL+ S +A K L P+ + K + P P D K+I +WN
Sbjct: 379 ELSSPSETALKKLFNARYGNLSSPVTPFPPATNNTEAKQTAWEGRIP-PVTDTKMITAWN 437
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQ 477
L+IS ARA + V G K Y E A AA+FI + + + +RL
Sbjct: 438 SLMISGLARA--------------YAVFG--EKTYWECAVKAANFIGENQWVAGRFYRLN 481
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLY-EFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+ +G + +DYA I LLDLY T+WL A +LQ T DE E GGY
Sbjct: 482 Y---DGKATVSAQSEDYALFIKALLDLYCCHPEQTQWLDQATQLQATFDEYLWSSETGGY 538
Query: 537 FNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
FNT ++ S +++R + D A P+ N V+V NLVRL + K+DY +AE +L F
Sbjct: 539 FNTAKDNSSDLIIRERTYIDNATPAANGVAVANLVRLFELT--EKTDYV-ASAEKTLQAF 595
Query: 596 ETRLKDMAMAVPLMCCAAD 614
+ ++ A P + D
Sbjct: 596 SSIMEQSPQACPGLFSGLD 614
>gi|23014746|ref|ZP_00054548.1| COG1331: Highly conserved protein containing a thioredoxin domain
[Magnetospirillum magnetotacticum MS-1]
Length = 671
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 230/696 (33%), Positives = 334/696 (47%), Gaps = 75/696 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDEG+A L+ND F++IKVDREERPD+D +Y + + GGWPL+
Sbjct: 49 SACHWCHVMAHESFEDEGIAGLMNDLFINIKVDREERPDLDALYQNALGLIGQHGGWPLT 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+PD +P GGTYFP + +YGR F +L + ++ K D + + + ++ E+L
Sbjct: 109 MFLTPDAEPFWGGTYFPAQARYGRAAFPDVLEGISHSFHKDPDKIGHN----VARIRESL 164
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A S P L + L A Q + D GG APKFP+P + L+HS
Sbjct: 165 EQMARSPG-PLSLDMEVVDLGAAQCLRLIDFEDGGTVGAPKFPQPGLFR-FLWHSYL--- 219
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++G +S + V TL + +GGI+DH+GGGF RYS DE W VPHFEKMLYD QL +
Sbjct: 220 --RTGNSSL-KDAVTVTLDHICQGGIYDHLGGGFMRYSTDETWLVPHFEKMLYDNAQLVS 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+ + T Y + + +L RDM+ GG +A DADS EG +EG FY
Sbjct: 277 LLTKVWKQTGSPLYRARIFETVGWLLRDMMAEGGAFAAALDADS---EG----EEGLFYT 329
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WTS+E+ +L E A F Y ++ GN ++G+N+L N
Sbjct: 330 WTSEELSALLDIETATRFGHLYGVQAHGN------------WEGRNIL-HRNHPRGGGDD 376
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ L E + L R KR P DDKV+ WN ++I++ A A+
Sbjct: 377 ---------HDLAEAKMVLLAERDKRIWPGRDDKVLADWNAMMITALAEAALTF------ 421
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
DR +++ AE A I + R HS G ++ LDDYA+ I
Sbjct: 422 ----------DRPDWLAAAEHAFQVITTRMVRPDG-RPAHSLCRGRAETNAVLDDYAWAI 470
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L LYE +G ++L AI D +GGGYF + + V++R K D A
Sbjct: 471 FAALTLYETTTGPEYLDQAIAWAEQVHAHHWDGQGGGYFLSADDATDVVIRTKPAFDSAV 530
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN V L RL +V G + +R+ A+ AV + M +P M D ++
Sbjct: 531 PSGNGVMAEVLARL-WLVTGEER--WRERAQ---AVIDAFGAAMPEQIPHMTSLLDAFAI 584
Query: 619 PSRK-HVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ VV+VG +L A A+ +++ + + + H ++ S+
Sbjct: 585 LAEPLQVVIVGPLDDPGGLALLRAFAATSLPPASLLRVQDGNALPVG----HPAHGKSLV 640
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
A +C+ +C PVTD L L EK
Sbjct: 641 DGC-----AAAYICRGSTCRAPVTDSDRLMAQLCEK 671
>gi|227537485|ref|ZP_03967534.1| possible thioredoxin [Sphingobacterium spiritivorum ATCC 33300]
gi|227242622|gb|EEI92637.1| possible thioredoxin [Sphingobacterium spiritivorum ATCC 33300]
Length = 672
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 190/567 (33%), Positives = 282/567 (49%), Gaps = 57/567 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE++ +A+ +N ++V +K+DREERPD+D++YMT VQ + GGWPL+
Sbjct: 48 SACHWCHVMERESFENDAIAQTMNKFYVPVKIDREERPDIDQIYMTAVQLMTNAGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
PD +P+ GGTYF P D ++ IL ++ W+++ + + + +
Sbjct: 108 CICLPDGRPIYGGTYFKPHD------WQNILLQIAQMWEEQPQVAIEYATKLTNGIQQ-- 159
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S N +PD+ + L +D++ GG+ APKFP P +L
Sbjct: 160 SERLPINPIPDQYDSSDLSAIITPWVALFDTKDGGYNRAPKFPLPNNWIFLL-------- 211
Query: 200 TGKSGEASEGQKM---VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ G + +K+ V FTLQ MA GGI+D +GGGF RYSVD WH+PHFEKMLYD GQ
Sbjct: 212 --RYGVLAGDEKIIDHVHFTLQKMASGGIYDQIGGGFARYSVDPYWHIPHFEKMLYDNGQ 269
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L +++ +A+ FY I ++ + + R+M+ P + A DADS EG EG
Sbjct: 270 LLSLFSEAYQQRPSPFYKRIVQETIQWANREMLAPNNGFYCALDADS---EGV----EGK 322
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+Y ++ E+EDILGE A LF ++ + GN + N+ I D+ A
Sbjct: 323 YYSFSKSEIEDILGEDAPLFISYFNITEEGNW----------AEESTNIPILDPDADQMA 372
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G E++ L E + KL+ R R RP LD K + +WN L++ A +I
Sbjct: 373 LDAGYSAEEWETCLAEAKEKLYSYRETRIRPGLDHKQLATWNALMLKGLTDAYRIF---- 428
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
D Y++ A A FI L + R+ H ++ + GFLDDYAF
Sbjct: 429 ------------DNSSYLDTAIKNAHFIIDELI-KSDGRILHQPKDANREIFGFLDDYAF 475
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ LYE KWL A +L + ELF D ++ T ++ R E D
Sbjct: 476 TTEAFIALYEATFDEKWLDLARQLADKALELFYDSNQKTFYYTADSSGELIARKSEIMDN 535
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDY 583
P+ S V+ L +L + K DY
Sbjct: 536 VIPASTSTIVLQLKKLGLLF--DKEDY 560
>gi|381211526|ref|ZP_09918597.1| hypothetical protein LGrbi_16484 [Lentibacillus sp. Grbi]
Length = 582
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 217/637 (34%), Positives = 317/637 (49%), Gaps = 76/637 (11%)
Query: 72 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 131
G GGWPLS+F++PD P GTYFP KYG PG +L ++ + + ++ D + +
Sbjct: 4 GQGGWPLSIFMTPDKVPFYAGTYFPRVSKYGMPGIMDVLTQLYERYKQEPDHIDEVTKSV 63
Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
+ L + ++A S N+L E+ + QL K +D +GGFGSAPKFP P Q +L
Sbjct: 64 TDALEKTVTAK-SENRLTQEMTDKVFK----QLGKRFDFTYGGFGSAPKFPTP---QNLL 115
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
Y + TG + KM TLQ MAKGGI+DHVG GF RYS DE+W VPHFEKML
Sbjct: 116 YLLRYYHFTGNTA----ALKMTESTLQAMAKGGIYDHVGFGFARYSTDEKWLVPHFEKML 171
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD L Y + + +TK+ Y I I+ ++ R+M G SA DADS EG
Sbjct: 172 YDNALLLMAYTECYQITKNPLYKTISEQIITFVVREMHCSEGGFNSAIDADS---EGI-- 226
Query: 312 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
EG +YVW E+ +ILGE ++ Y + P GN F+GKN+ LN
Sbjct: 227 --EGKYYVWDYDEIFNILGEELGDIYAAVYGITPDGN------------FEGKNIPNLLN 272
Query: 371 -DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
DS A A M + + + L E R +L R KR PH+DDK++ SWN ++I++ A+A
Sbjct: 273 TDSEAIAKANDMSVSELHHRLDEAREQLLSAREKRVYPHVDDKILTSWNSMMIAALAKAG 332
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
K +Y + AE++ +FI ++L Q R+ +R+G K G
Sbjct: 333 KAFA----------------EPKYTKAAENSMNFIEQNLI--QNGRVMARYRDGEVKYNG 374
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
+LDDYAFL+ +LYE K+L A L N +LF D + GG+F + +L R
Sbjct: 375 YLDDYAFLLWAYTELYETTFSLKYLKQARTLANDMIDLFWDNDQGGFFFNGHDSEELLSR 434
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 609
K +DGA PSGN V+ + LV++ + +DY + E +E ++ V +
Sbjct: 435 EKAVYDGALPSGNGVAGVMLVKMGYLTG--DTDYLDKLEEMYHTFYEDIIQVPVAGVHFI 492
Query: 610 CCAADMLSVPSRKHVVLVGHKS--SVDFENMLAAAHASYDLNKTVIHIDPADT--EEMDF 665
ML K VV++G + +VD + + T++ + AD E F
Sbjct: 493 QSL--MLMENPTKEVVVLGESNPFTVDLQQTFLP-------DVTLLAGNNADKLGEVAPF 543
Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
E+ + ++ VC+NF+C P TD
Sbjct: 544 VSEYRQLDNAL----------TIYVCENFACHQPTTD 570
>gi|284037137|ref|YP_003387067.1| hypothetical protein Slin_2247 [Spirosoma linguale DSM 74]
gi|283816430|gb|ADB38268.1| protein of unknown function DUF255 [Spirosoma linguale DSM 74]
Length = 700
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 207/575 (36%), Positives = 299/575 (52%), Gaps = 60/575 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE E VA+++N FV IKVDREERPDVD +YM VQA+ GGWPL+
Sbjct: 48 SACHWCHVMERESFEKEAVAQVMNKHFVCIKVDREERPDVDAIYMDAVQAMGVQGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIE-QLSE 137
VFL PD KP G TY P ++ + +L + +A+++ R LAQS FA E LS+
Sbjct: 108 VFLMPDAKPFYGVTYLPQKN------WVNLLESIDNAFNEHRADLAQSAEGFARELNLSD 161
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A + N P P+ L + +++ D GG APKFP P + +L +
Sbjct: 162 AERYGLTQND-PLFAPET-LAVLYRKVAVKADDEKGGMRRAPKFPMPSVWRFLLRYYAVA 219
Query: 198 EDTGKSGEAS----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+ + EA+ + +V TL MA GGI+D +GGGF RYS D W PHFEKMLYD
Sbjct: 220 SSSRQIAEAADTSDQALNLVRITLDRMALGGIYDQLGGGFARYSTDADWFAPHFEKMLYD 279
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
GQL +Y +A+SLTK Y ++ + + +R+++ P G +SA DADS EG
Sbjct: 280 NGQLLTLYSEAYSLTKSKLYKHVVYQTIAFAQRELLSPEGGFYSALDADS---EGV---- 332
Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
EG FY +T+ E+++ILG F + Y + GN + G+N+L +
Sbjct: 333 EGKFYTFTTPELKEILGADFDWFADLYSISENGNWE-----------HGRNILHRIEADD 381
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A+++G + L +L VR++R RP LDDK++ SWNGL++ A ++
Sbjct: 382 EFAARMGWSVADLNVRLDATHTRLLRVRNERIRPGLDDKILCSWNGLMLKGLVTAYRV-- 439
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-----NGPSKAP 488
F P E++ +A A F+ + + D + RL H+++ G ++
Sbjct: 440 -------FGEP-------EFLTLALRLAYFLLKKMRDSRNGRLWHTYKVSEGGTGRARQA 485
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ----NTQDELFLDREGGG---YFNTTG 541
GFLDDYA +I GLL LY+ WL A +L +L +D G F T
Sbjct: 486 GFLDDYAAVIDGLLALYQATFTRNWLTEADQLMQYVLTNFADLSVDELTGPEPLLFFTDK 545
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
++ R KE D PS NS+ NL L+ ++
Sbjct: 546 NSEELIARRKELFDNVIPSSNSMMAENLYVLSLLL 580
>gi|55980955|ref|YP_144252.1| hypothetical protein TTHA0986 [Thermus thermophilus HB8]
gi|55772368|dbj|BAD70809.1| conserved hypothetical protein [Thermus thermophilus HB8]
Length = 642
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 219/587 (37%), Positives = 300/587 (51%), Gaps = 75/587 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESF+DE VA+LLN FV +KVDREERPDVD YM + +L G GGWP+S+
Sbjct: 49 SCHWCHVMHRESFQDEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSL 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ KP GGTYFP ED+ G PGFK +L V +AW KR+ + + E+L+ AL
Sbjct: 109 FLTPEGKPFFGGTYFPKEDRMGLPGFKRVLVAVAEAWAGKREAILEEA----ERLTRALW 164
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S S LP+ A + L +++D +GGF APKFP+ + +L + + E+
Sbjct: 165 KSLSPPP--GPLPEGAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE- 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+++ TL+ MA GG++D VGGGFHRYSVD W +PHFEKMLYD LA V
Sbjct: 222 -------RAARLLRPTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARV 274
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
YL A+ L + + + R+ LD+L GG +A D AE+EG +EG +Y W
Sbjct: 275 YLGAYKLFGEDLFLRVARETLDWLLSMQRREGG-FHTALD---AESEG----EEGRYYTW 326
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T E+ + LGE L + ++ L DL ++VL ++ A + LG
Sbjct: 327 TEAELREALGEDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEARKA-LG 371
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
E + R KL R +R P LDDKV+ W+ L + + A A ++ E
Sbjct: 372 ---EGFFAWREGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEE----- 423
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
Y+E A+ A F+ H+Y E L+H++R G +L D AF
Sbjct: 424 -----------RYLEAAKRGARFLLAHMYREGL--LRHTWR-GSLGEEAYLSDQAFAALA 469
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L+LY +L WA L LF REG PS+ L KE +GA PS
Sbjct: 470 FLELYAATGEWPYLDWAQRLAEAGWRLF--REG----------PSLPLPAKEVEEGALPS 517
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
G S LVRL ++ G YR+ AE LA L A+P
Sbjct: 518 GESALAEALVRLGAVFGGD----YRERAEEVLAEKARWLARYPHALP 560
>gi|271969730|ref|YP_003343926.1| hypothetical protein [Streptosporangium roseum DSM 43021]
gi|270512905|gb|ACZ91183.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021]
Length = 682
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 232/713 (32%), Positives = 328/713 (46%), Gaps = 109/713 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDEG A L+N+ FV++KVDREERPDVD VYM QA+ G GGWP++
Sbjct: 47 SACHWCHVMAHESFEDEGTAALMNEHFVNVKVDREERPDVDAVYMAATQAMTGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF +P P GTYFP RP F+ +L V +AW+ R+ + + + +E L+E
Sbjct: 107 VFATPGGHPFYTGTYFP------RPQFQRLLAGVSNAWNGDREAVLEQSSKIVEALNERS 160
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + PD L + + LS+S+D GGFG APKFP + ++ +L + E
Sbjct: 161 ALPSGPLPTPDTLAR-----AVQSLSRSFDQVRGGFGGAPKFPPSMALEFLLRYGAAAEP 215
Query: 200 -TGKSGEASEGQK-----------------MVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
TG G E ++ M TL+ MA+GGI+D +GGGF RYSVD
Sbjct: 216 RTGAEGGEPEDRREPGAGAGAGAGAPTATAMAGRTLEAMARGGIYDQLGGGFARYSVDAD 275
Query: 242 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 301
W VPHFEKMLYD L VY + LT + + D+L +M P G SA DA
Sbjct: 276 WVVPHFEKMLYDNALLLRVYAHWWRLTGSALGRRVALETADWLLAEMRTPEGGFASALDA 335
Query: 302 DSAETEGATRKKEGAFYVWTSKEVEDILGEH----AILFKEHYYLKPTGNCDLSRMSDPH 357
DS EG EG FY WT +E+ ++LGE A+ E G L +SDP
Sbjct: 336 DS---EGV----EGKFYAWTPEEIHEVLGEEDGAWAVALYEVTGTFEHGTSVLQLLSDP- 387
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
+D+ SA R +L R+ R RP DDKV+ +W
Sbjct: 388 ------------DDAERSA---------------RVRAELLAARAHRVRPGRDDKVVAAW 420
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL I++ A + DR + +E A +AA + D RL
Sbjct: 421 NGLAIAALAETGALF----------------DRPDLVEAARAAAVLLDGSHMDGD--RLL 462
Query: 478 HSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
+ R+G + A G L+DYA L GLL LY +W A L T + F D GG+
Sbjct: 463 RTSRDGRAGANAGVLEDYADLAEGLLTLYGVTGEVRWFHRAGALLETVLDRFADGS-GGF 521
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF- 595
F+T + + R ++ D A PSG + L+ A++ ++ + A ++ V
Sbjct: 522 FDTADDAERLFQRPQDPTDNATPSGQFAAAGALLSYAALTGSARHREAAEAALGTVTVLA 581
Query: 596 --ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
R +AV A +S P +V +D A+ L++T +
Sbjct: 582 DKHARFAGWGLAV-----AQAAVSGPVEAAIV-----GPLD-------DPATSALHRTAL 624
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
+ PA + E ++ + A VC+ F+C PVT P L
Sbjct: 625 -LSPAPGLVVALGEPGSAEVPLLEGRGLLDGAPAAYVCRGFTCRMPVTTPAGL 676
>gi|85817359|gb|EAQ38539.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
Length = 705
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 212/685 (30%), Positives = 337/685 (49%), Gaps = 75/685 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVME ESFED VA+ +N+ F++IKVDREERPDVD VYM VQ + G GGWPL+
Sbjct: 79 SCHWCHVMEHESFEDTLVAQFMNENFINIKVDREERPDVDNVYMNAVQLMTGRGGWPLNA 138
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
PD +P+ GGTYF ED + L +V D + + L + L++
Sbjct: 139 VALPDGRPVWGGTYFSKED------WLNALGQVADIYTSDPNKLVEYADKLGTGLAQMDL 192
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + NK + L+ E+ S+ +D+R GG APKF P + +L ++ + D
Sbjct: 193 VTPNPNK--PSFVIDTLQTSIEKWSRQWDTRQGGLNRAPKFMMPNNYEFLLRYAHQNND- 249
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
E + V TL+ +A GG++D VGGGF RYSVD +WH+PHFEKMLYD QL ++
Sbjct: 250 ------DEILEYVNTTLEQIAFGGVNDQVGGGFARYSVDTKWHIPHFEKMLYDNAQLVSL 303
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y +A+ TK+ Y + L++++R+M G +SA DADS +G +EGA+YVW
Sbjct: 304 YSNAYLKTKNPLYKETVYETLEFIKREMTTSQGGFYSALDADSLTPDGEL--EEGAYYVW 361
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T +E+++++G+ LF +Y + D + + H VLI + + +
Sbjct: 362 TEEELKNLVGDDFKLFSAYYNIN-----DYGKWENDH------YVLIRQDLDTDFVKEHQ 410
Query: 381 MPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ LE+ + R L R SK+ +P LDDK++ SWNGL+ + A ++
Sbjct: 411 ISLEELTTKKSKWREDLLRFRESKKEKPRLDDKILTSWNGLMTKGYVDAYRVF------- 463
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
D KE+++ A A+F+ +L + L ++++G S +L+DYA I
Sbjct: 464 ---------DEKEFLDAALKNANFVVDNLL-RKDGGLNRTYKDGKSTINAYLEDYAATID 513
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+ L+E +WL A L + F + E ++ T+ EDP++ R E +D P
Sbjct: 514 AFIALFEVTMDEQWLEKAKSLTDYTFTHFQNAENKLFYFTSNEDPTLSSRNTEFYDNVIP 573
Query: 560 SGNSVSVINLVRLASIVAGSKSDYY--RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
S NS+ N+ L S YY + + + A+ + + D++
Sbjct: 574 SSNSIMAKNIFTL--------SHYYLDKTYTDTAAAMLNNMQPNFTQSPTSFSNWMDLML 625
Query: 618 VPSRKH--VVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
++ + +V+VG D +N+LA Y NK + A +E
Sbjct: 626 NYTKPYYELVVVGP----DAQNILAELEQEYLPNKLIAATTTASKQE------------- 668
Query: 676 MARNNFSADKVVALVCQNFSCSPPV 700
+ + + + VC N +C PV
Sbjct: 669 IFEGRYLEGETLIYVCVNNACKLPV 693
>gi|347535413|ref|YP_004842838.1| hypothetical protein FBFL15_0482 [Flavobacterium branchiophilum
FL-15]
gi|345528571|emb|CCB68601.1| Protein of unknown function YyaL [Flavobacterium branchiophilum
FL-15]
Length = 674
Score = 318 bits (814), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 216/694 (31%), Positives = 332/694 (47%), Gaps = 74/694 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+ VA+++N FV+IK+DREERPD+D +YM +Q + G GGWPL++
Sbjct: 50 CHWCHVMEHESFENLEVAQVMNSHFVNIKIDREERPDLDALYMKALQIMTGQGGWPLNMV 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTYF ED + T L+++++ ++ + + + E+L + +
Sbjct: 110 CLPDGRPVWGGTYFRKED------WTTALKQIQEVFENQPERMLDYA----EKLQKGIDT 159
Query: 142 SASSNKLPDEL--PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ D+L + L + +S+D FGG APKF P ++L ++ + +D
Sbjct: 160 IGFKPQFHDDLVFSKKTLEDLISKWKRSFDLDFGGMARAPKFMMPNNYVLLLRYADQNQD 219
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
E V TL MA GG+ D +GGGF RYSVD +WHVPHFEKMLYD QL
Sbjct: 220 -------EELLDFVHLTLTKMAYGGLFDVLGGGFSRYSVDMKWHVPHFEKMLYDNAQLLF 272
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y AF T D Y + + ++ ++ +A DADS ++ +EGAFY+
Sbjct: 273 LYAQAFQKTGDPLYQEVVEKTIQFIEKEWFTDNKSFCAAYDADSINSQNVL--EEGAFYI 330
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT E+ +LG+ +LF + + + G+ + G VLI+ + A K
Sbjct: 331 WTQDELIALLGDDYVLFSKIFNINEFGHWE-----------HGHYVLIQNQTLAYWAEKE 379
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ L N E +KL+ R +RP+P LD+KVI SWN L I A K +
Sbjct: 380 SIDLAVLKNKKQEWEQKLYQKRQQRPKPRLDNKVITSWNALTIKGLVEAYKTFGT----- 434
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
K+Y+++A A FI L+ H L H ++NG K GFL+DYAF+I
Sbjct: 435 -----------KKYLQMALQNAQFIAHTLWSPDGH-LWHIYQNGTCKINGFLEDYAFVIE 482
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+ +YE WL+ A L + + F D + + +DP ++ + E D P
Sbjct: 483 AFIHIYEVTFDEDWLLKAKTLTDYTFDYFFDTSKQMFRFNSRKDPELIAQHFEIEDNVIP 542
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-LMCCAADMLSV 618
S NSV NL + ++ + + Y Q H++ + T D A + D L
Sbjct: 543 SSNSVMAHNL----NYLSLAFDNLYYQKTAHNMLLQATANVDYPSAFSNWLWLQMDNLYF 598
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
S +VL + V+ + H Y + D ++ + ++ SN
Sbjct: 599 TSE--MVLNSENAVVE----ASEIHRHYHPENRI--FGCFDHSKIPYLKDKTSN------ 644
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
K + C+N C PVTD L+ L+E
Sbjct: 645 ------KSMYYFCKNKECHLPVTDFQLLKKKLME 672
>gi|182436351|ref|YP_001824070.1| hypothetical protein SGR_2558 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178464867|dbj|BAG19387.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
NBRC 13350]
Length = 672
Score = 318 bits (814), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 208/582 (35%), Positives = 298/582 (51%), Gaps = 61/582 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA LN FV +KVDREERPD+D VYM VQA G GGWP++
Sbjct: 47 SSCHWCHVMAHESFEDETVATYLNAHFVPVKVDREERPDIDAVYMEAVQAATGHGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPPE ++G P F+ +L V AW +R+ +A+ + L+
Sbjct: 107 VFLTPDAEPFYFGTYFPPEARHGSPSFQQVLEGVVAAWTDRREEVAEVAERIVADLA-GR 165
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S + +P E+ Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 166 SLVHGGDGVPGESEIAQALL-----GLTREYDEQHGGFGGAPKFPPSMVVEFLLRHYAR- 219
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 220 --TGSEG----ALQMAADTCSAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALL 273
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY + T I + D++ R++ G SA DADS + +G R EGA+
Sbjct: 274 CRVYAHLWRTTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--RHVEGAY 331
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT ++ ++LGE F Y+ +++ +G +VL D+
Sbjct: 332 YVWTPAQLREVLGEDDAAFAAAYF----------GVTEKGTFEEGASVLRLPGDTG---- 377
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
P++ + + R +L R +RPRP LDDKV+ +WNGL I++ A
Sbjct: 378 ----PVDA--ARVADVRGRLLAAREERPRPGLDDKVVAAWNGLAIAALAETGAYF----- 426
Query: 438 SAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPS-KAPGFLDDYA 495
DR + +E A AA +R HL + RL + ++G + G L+DY
Sbjct: 427 -----------DRPDLVERATEAADLLVRVHL--GEVARLARTSKDGQAGDNAGVLEDYG 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + E F EGG ++T + ++ R ++ D
Sbjct: 474 DVAEGFLTLAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTD 532
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
A PSG + + L+ S A + S+ +R AE +L V +
Sbjct: 533 SATPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGVVKA 571
>gi|209883527|ref|YP_002287384.1| thioredoxin domain-containing protein [Oligotropha carboxidovorans
OM5]
gi|337739402|ref|YP_004631130.1| hypothetical protein OCA5_c01570 [Oligotropha carboxidovorans OM5]
gi|386028421|ref|YP_005949196.1| hypothetical protein OCA4_c01570 [Oligotropha carboxidovorans OM4]
gi|209871723|gb|ACI91519.1| highly conserved protein contAining a thioredoxin domain
[Oligotropha carboxidovorans OM5]
gi|336093489|gb|AEI01315.1| hypothetical protein OCA4_c01570 [Oligotropha carboxidovorans OM4]
gi|336097066|gb|AEI04889.1| hypothetical protein OCA5_c01570 [Oligotropha carboxidovorans OM5]
Length = 684
Score = 318 bits (814), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 221/694 (31%), Positives = 334/694 (48%), Gaps = 83/694 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED A+++N+ FV IKVDREERPD+D++YM + L GGWP+++F
Sbjct: 56 CHWCHVMAHESFEDAATAEVMNELFVCIKVDREERPDIDQIYMRALHLLGQQGGWPMTMF 115
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
LSPD P+ GGTYFP +YGRP F I+R+ + + D +A + L+E
Sbjct: 116 LSPDGAPIWGGTYFPNTPQYGRPSFVGIMREFIRIYRDEPDKIAANKTAIERSLAERSPT 175
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+S L N L A +++S D GG APKFP+ LE
Sbjct: 176 DTASIGL------NELDNVAGSIARSTDPDNGGLRGAPKFPQ----------CSMLEFLW 219
Query: 202 KSGEASEGQKMVLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
++G + + + T L M++GGI+DH+GGG+ RY+VD++W VPHFEKMLYD Q+
Sbjct: 220 RAGARTGDDRFFITTNLALTRMSQGGIYDHLGGGYARYTVDDKWLVPHFEKMLYDNAQIL 279
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++ + + Y + + +L+R+M+ G S+ DADS EG +EG FY
Sbjct: 280 DLLALEHARAPNALYHQRAEETVGWLKREMLTREGGFASSLDADS---EG----EEGRFY 332
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+W+ E+E++LG + A F Y + GN F+G+N+L L D S +A+
Sbjct: 333 IWSQSEIEELLGKDDATFFAAKYGVTADGN------------FEGRNILNRLGDDSDTAT 380
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ L R LF R KR RP LDDKV+ WNGL I++ A++
Sbjct: 381 E--------AEQLAAMRAILFRAREKRVRPGLDDKVLADWNGLTIAALVHAAQAFA---- 428
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
R +++ +A +A FI + + RL HS+R G P D A +
Sbjct: 429 ------------RPDWLTLAATAFGFITTTM--SRHGRLGHSWRAGKLLQPALASDNAAM 474
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
I L L+E +L A+ Q D + D GGYF T+ + ++LR D A
Sbjct: 475 IRAALALHEATGDHLFLDQAVLWQADLDTHYGDPRHGGYFLTSDDAEGLILRPHSSVDDA 534
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
P+ ++ NL RLA + + D +R+ + + + + A D+
Sbjct: 535 TPNHIGLTAQNLARLAVL---TGDDRWRKQLDTLFSRMLAVAGENVFGHLSLLNALDLYL 591
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI-DPADTEEMDFWEEHNSNNASM 676
+ +V+ G E +L AA A V+H+ DPA H +N+ +
Sbjct: 592 AGAE--IVVTGEGEEA--EALLKAARALPHATTIVLHVPDPAKLP-----AHHPANDKVV 642
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
VA VC+ +CS PV++ +L L+
Sbjct: 643 -----PGGGAVAFVCRGQTCSLPVSETDALAALV 671
>gi|399928052|ref|ZP_10785410.1| hypothetical protein MinjM_13607 [Myroides injenensis M09-0166]
Length = 665
Score = 318 bits (814), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 223/683 (32%), Positives = 326/683 (47%), Gaps = 75/683 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFED VA L+N+ F+SIK+DREE PD+D YM VQ + GGWPL+V
Sbjct: 48 TCHWCHVMEHESFEDNKVATLMNNHFISIKIDREEFPDIDAFYMKAVQIMTKQGGWPLNV 107
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
PD +P+ GGTYFP + + L ++ + + K + + FA EQL E +S
Sbjct: 108 VCLPDGRPIWGGTYFP------KQTWLDSLTQLNELYQTKPETVID---FA-EQLHEGIS 157
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
SS + + + L + E+ SKS+D GG+G APKF P +LY L+
Sbjct: 158 L-LSSGPIENSETRFNLEVLIEKWSKSFDWENGGYGRAPKFMMPSN---LLY----LQKL 209
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G + + + TL MA GG+ D V GGF RYSVD RWH+PHFEKMLYD QL V
Sbjct: 210 GVYSHTKDILEYIDLTLTKMAWGGLFDTVEGGFSRYSVDMRWHIPHFEKMLYDNAQLLTV 269
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y DA+ TK+ Y + + Y+ + G +SA DADS + + KEGA+YVW
Sbjct: 270 YADAYKRTKNNLYKEVIAKTITYIENNWANKEGGYYSALDADSLNHDN--QLKEGAYYVW 327
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T KE++DI+ + +FK+ + + G + + VLI+ D + A++
Sbjct: 328 TEKELQDIINKEYDIFKQVFNINDNGYWE-----------ENNYVLIQTQDLHSIANQNN 376
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ + + E L R R P LDDK + SWN + I+ + L
Sbjct: 377 IEYSHLVTLKKEWEELLLQARKNRKAPRLDDKTLTSWNAMYINGLLNSYTAL-------- 428
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+ KEY+ +A FI L+DE L H+++NG +LDDYA+ IS
Sbjct: 429 --------NNKEYLVLAIKTFDFITAKLWDEDK-GLYHTYKNGQKTIKAYLDDYAYYISA 479
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
++LYE +L A + + F D + +F + ++ + E D PS
Sbjct: 480 AIELYEHTGEDNYLTIAKNCTDYVFDHFYDDKTKFFFYSQDIQEYIIKNI-ETEDNVIPS 538
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 620
N++ +NL +LA + +YR + + L + +T++ D A A S P+
Sbjct: 539 SNAIMCLNLQKLAVLYDNL---HYRNTSINMLEIIKTQI-DYPSAYSHWLLADLYQSHPA 594
Query: 621 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNN 680
+ LVG A S L K VI T F E S + + N
Sbjct: 595 E--ITLVGK----------GALKTSLLLRKKVI------THTFVFPVEQESKIPYLNKEN 636
Query: 681 FSADK-VVALVCQNFSCSPPVTD 702
DK ++ +C N +C P D
Sbjct: 637 ---DKHLLVYLCANSTCYKPEED 656
>gi|295838670|ref|ZP_06825603.1| conserved hypothetical protein [Streptomyces sp. SPB74]
gi|197699107|gb|EDY46040.1| conserved hypothetical protein [Streptomyces sp. SPB74]
Length = 683
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 229/693 (33%), Positives = 323/693 (46%), Gaps = 77/693 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED G A +N+ FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 47 SACHWCHVMARESFEDVGTAAYVNEHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P +P GTYFPP +G P F+ +L V+ AW +R + + A L
Sbjct: 107 VFLTPGGEPFYFGTYFPPRPLHGTPAFRQVLEGVRAAWADRRAEVDEVAARVTADL---- 162
Query: 140 SASASSNKLPD-ELPQNALRLCAE--QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
+ LPD P A L A L++ YDSR GGFG APKFP + ++ +L H +
Sbjct: 163 --TGRGLGLPDGAAPPGADALGAALLGLTRDYDSRHGGFGGAPKFPPVMVLEFLLRHHAR 220
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
TG G +M T + MA+GGI+D +GGGF RY+VD W VPHFEKML D
Sbjct: 221 ---TGAEG----ALQMAADTAEHMARGGIYDQLGGGFARYAVDREWTVPHFEKMLSDNAL 273
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L Y + T + + D+L R++ P G SA DADS +G R EGA
Sbjct: 274 LCRFYAHLWRATGSALARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGA 331
Query: 317 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
YVWT +++ ++LGE A L HY + P G F+ + ++ L +
Sbjct: 332 SYVWTPEQLREVLGEADAALAAAHYGVTPEGT------------FEHGSSVLRLPRTDGF 379
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
S P++ L RR L R +RP P DDKV+ +WNGLVI++ A
Sbjct: 380 DSP---PVDA--ARLDRIRRALLAAREERPAPGRDDKVVAAWNGLVIAALAE-------- 426
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 495
A F R + + A AA + R D + H + S P G L+DYA
Sbjct: 427 -TGAYFG-------RPDLVAAATGAADLLVRVHLDTRGHLTRTSRDGRPGGNAGVLEDYA 478
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L W +A L + F D + G ++T + +++ R ++ D
Sbjct: 479 DVAEGFLTLASVTGEGVWTDFAGLLLDQVLARFRD-DTGALYDTAADAEALIHRPQDPTD 537
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A PSG + + L+ A++ + S +R AE +L+V + +A P +
Sbjct: 538 NATPSGWNAAAGALLTYAAL---TGSTAHRAAAEQALSV----VAALAPRAPRFVGHGLA 590
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A +L+ P V +VG + AA + V P+ E
Sbjct: 591 VAEALLAGP--YEVAVVGAPEDPRTRALHCAALLATSPGAVVAAGPPSAEPEFPL----- 643
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
+A A +C+ F C P TDP
Sbjct: 644 -----LADRPLVEGAPAAYLCRGFVCDRPETDP 671
>gi|404497256|ref|YP_006721362.1| thioredoxin domain-containing protein YyaL [Geobacter
metallireducens GS-15]
gi|418065852|ref|ZP_12703222.1| protein of unknown function DUF255 [Geobacter metallireducens RCH3]
gi|78194859|gb|ABB32626.1| thioredoxin domain protein YyaL [Geobacter metallireducens GS-15]
gi|373561650|gb|EHP87881.1| protein of unknown function DUF255 [Geobacter metallireducens RCH3]
Length = 706
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 225/692 (32%), Positives = 328/692 (47%), Gaps = 81/692 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVM ESF D VA +LN FV+IKVDREERPD+D YM Q + G GGWPL+V
Sbjct: 79 TCHWCHVMAHESFGDHEVAAVLNRDFVAIKVDREERPDIDDTYMRVAQLMNGSGGWPLTV 138
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
++PD +P TY P + G PG IL ++ + W +R+++ Q+ ++ L
Sbjct: 139 CMTPDREPFFVATYIPKHSRGGMPGLVEILGRIAEVWKTRRELVHQNCTAILDSLRNLSV 198
Query: 141 ASASSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A P E+P LR QL+ +D GFG APKFP P+ + +L + ++ D
Sbjct: 199 AK------PGEIPGAEPLRAARSQLAGMFDPVNAGFGQAPKFPMPLNLSFLLRYGRRFGD 252
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G + MV+ TL+ + +GGI D +G G HRYSVD RW VPHFEKMLYDQ +A
Sbjct: 253 PGAT-------VMVVATLEALRRGGIFDQLGFGLHRYSVDSRWLVPHFEKMLYDQALVAM 305
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
++AF T + + D++ R++ P G +SA DAD TEG +EG +Y+
Sbjct: 306 AAVEAFQATGQESLREMAEQLCDFVLRELAAPEGGFYSALDAD---TEG----EEGRYYL 358
Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT +V +LGE LF + + GN F+G N+L A +
Sbjct: 359 WTPAQVRSVLGETEGELFCRLFDVTGKGN------------FEGANILNLPVLLHEFAQR 406
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
GM E + R L R+KR RP D+K++ +WNGL+I++ AR
Sbjct: 407 EGMSPENLEEKVEGWRLLLLAERAKRERPFRDEKIVTAWNGLMIAALARL---------- 456
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
F G +R ++ AE+A I R L RL S G + P FL+DYA L+
Sbjct: 457 ----FLAGGGER--FLVAAEAALVRILRDLR-RADGRLLRSIHRGEGEVPAFLEDYAALL 509
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL L++ ++ A L LF E G ++T + +VL+R + D+DG
Sbjct: 510 HGLLALHDATLDPRYREEACSLARDMLRLF-SGEDRGLYDTGNDAETVLMRSRVDYDGVM 568
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN ++ LVRL + + + + + E + F +A A D+L
Sbjct: 569 PSGNGLAATGLVRLGRM---ADEERFVEAGEEIIRAFMAGAGRQPVAHLQTLMALDLLRG 625
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P + + G + V + MLA + + V+ +P
Sbjct: 626 PQVEVAISGGSRGKV--QGMLAEIGKRF-IPGFVLRGEPD-------------------- 662
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VC +C PV P +L +L
Sbjct: 663 ---QGRRATAQVCAAGACHIPVESPAALGGIL 691
>gi|429859406|gb|ELA34188.1| duf255 domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 811
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 213/652 (32%), Positives = 314/652 (48%), Gaps = 85/652 (13%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+ + E F AK+LN+ FV + +DREERP++D +YM YVQA+ G GG
Sbjct: 76 HIGFKPCHYSRLTSTECFTHSECAKILNESFVPVIIDREERPELDTIYMNYVQAVSGNGG 135
Query: 76 WPLSVFLSPDLKPLMGGTYFP-PEDKYGRPG------FKTILRKVKDAWDKKR------- 121
WPL++FL+P+L+P+ GGTY+P PE G G F IL+K++ W ++
Sbjct: 136 WPLNLFLTPELEPVFGGTYYPAPEPNNGSSGDDERLDFLAILKKLQKVWKEQEARCRQEA 195
Query: 122 --------DMLAQSGAFAIEQLSEALSASASSN------------------KLPDELPQN 155
D A+ A + ++ S S+ + EL
Sbjct: 196 KEVVVKLHDFAAEGTLGATSTVEPGVAGSQSATLARSETGLEHPGTGRTAAVVSSELDLE 255
Query: 156 ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL---EDTGKSGEASEGQKM 212
L ++ ++D +GGFG APKFP P ++ +L + L +D E + +M
Sbjct: 256 HLEEAYTHIAGTFDPVYGGFGLAPKFPTPPKLSFLLRLPRYLAPVQDVVGETECAHAAEM 315
Query: 213 VLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--- 268
LFTL+ + G+ DHVGG GF RYSV W VP FEK++ L +YLDA+ +
Sbjct: 316 ALFTLRKIRDSGLRDHVGGHGFARYSVTADWSVPRFEKLVVHNALLLGLYLDAWLIATGG 375
Query: 269 --KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
FY + +++DYL I P G S+E ADS G +EGA+ +WT +E
Sbjct: 376 EKNGEFYDVVV-ELVDYLTSAPISLPDGGFVSSEAADSYR-RGDRHLREGAYSLWTRREF 433
Query: 326 EDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 383
+ ++G+ A L ++ + GN + + DP++EF +N+L + D + + G+ +
Sbjct: 434 DSVIGDDHEAALAASYWNVLEDGNIEPDQ--DPNDEFVNENILRVVKDKAEIGRQAGITI 491
Query: 384 EKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 442
+ +L ++KL R K R RP D K++ NGLVI + AR L
Sbjct: 492 DDVERVLASAKQKLKAHREKERTRPEADTKIVAGRNGLVIGALARTGSALA--------- 542
Query: 443 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 502
P+ E A AA+FIR L+DE L + G G DDYA LI GL+
Sbjct: 543 -PIDADRSNACFEAASKAAAFIRAQLWDENERILYRIYNEGRGDTKGLADDYAHLIEGLI 601
Query: 503 DLYEFGSGTKWLVWAIELQNTQDELFLD--------------REGGGYFNTTGED-PSVL 547
DLYE KW +A ELQ Q ++F D R G F TT E+ P +
Sbjct: 602 DLYEATGEEKWAEFADELQKVQIDMFYDSTSVPATTPTSPTARSSCGAFYTTPENAPHTI 661
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
LR+K+ D A PS N+VSV NL RL +++ + Y A S+ FE +
Sbjct: 662 LRLKDGMDTALPSTNAVSVSNLFRLGIMLS---DEAYTALARESINAFEAEI 710
>gi|326776975|ref|ZP_08236240.1| hypothetical protein SACT1_2812 [Streptomyces griseus XylebKG-1]
gi|326657308|gb|EGE42154.1| hypothetical protein SACT1_2812 [Streptomyces griseus XylebKG-1]
Length = 672
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 208/582 (35%), Positives = 297/582 (51%), Gaps = 61/582 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE VA LN FV +KVDREERPD+D VYM VQA G GGWP++
Sbjct: 47 SSCHWCHVMAHESFEDETVATYLNAHFVPVKVDREERPDIDAVYMEAVQAATGHGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPPE ++G P F+ +L V AW +R+ +A+ + L
Sbjct: 107 VFLTPDAEPFYFGTYFPPEARHGSPSFQQVLEGVVAAWTDRREEVAEVAERIVADLG-GR 165
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S + +P E+ Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 166 SLVHGGDGVPGESEIAQALL-----GLTREYDEQHGGFGGAPKFPPSMVVEFLLRHYAR- 219
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 220 --TGSEG----ALQMAADTCSAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALL 273
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY + T I + D++ R++ G SA DADS + +G R EGA+
Sbjct: 274 CRVYAHLWRTTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--RHVEGAY 331
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT ++ ++LGE F Y+ +++ +G +VL D+
Sbjct: 332 YVWTPAQLREVLGEDDAAFAAAYF----------GVTEKGTFEEGASVLRLPGDTG---- 377
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
P++ + + R +L R +RPRP LDDKV+ +WNGL I++ A
Sbjct: 378 ----PVDA--ARVADVRGRLLAAREERPRPGLDDKVVAAWNGLAIAALAETGAYF----- 426
Query: 438 SAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPS-KAPGFLDDYA 495
DR + +E A AA +R HL + RL + ++G + G L+DY
Sbjct: 427 -----------DRPDLVERATEAADLLVRVHL--GEVARLARTSKDGQAGDNAGVLEDYG 473
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G L L WL +A L + E F EGG ++T + ++ R ++ D
Sbjct: 474 DVAEGFLTLAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTD 532
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
A PSG + + L+ S A + S+ +R AE +L V +
Sbjct: 533 SATPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGVVKA 571
>gi|302519353|ref|ZP_07271695.1| transmembrane protein [Streptomyces sp. SPB78]
gi|302428248|gb|EFL00064.1| transmembrane protein [Streptomyces sp. SPB78]
Length = 578
Score = 317 bits (812), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 213/581 (36%), Positives = 296/581 (50%), Gaps = 60/581 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED A +N FV +KVDREERPDVD VYM VQA G GGWP++
Sbjct: 47 SSCHWCHVMARESFEDAETAAYMNAHFVCVKVDREERPDVDAVYMEAVQAATGHGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+P +P GTYFPP +G P F+ +L V+ AW +R+ +A A L+ A
Sbjct: 107 VFLTPGGEPFYFGTYFPPRPLHGTPAFRQVLEGVRAAWADRREEVADVAARVTADLTGRA 166
Query: 139 LSASA-SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
L A +S PD L L L++ YDSR GGFG APKFP + ++ +L H +
Sbjct: 167 LGLPADASPPGPDALGAALL-----GLTRDYDSRHGGFGGAPKFPPVMVLEFLLRHHAR- 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T + MA+GGI+D +GGGF RY+VD W VPHFEKML D L
Sbjct: 221 --TGAEG----ALQMAADTAEHMARGGIYDQLGGGFARYAVDREWIVPHFEKMLSDNALL 274
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y + T + + D+L R++ P G SA DADS +G R EGA
Sbjct: 275 CRFYAHLWRATGSALARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGAS 332
Query: 318 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVWT +++ ++LGE A L HY + P G F+ + ++ L + S
Sbjct: 333 YVWTPEQLREVLGEDDAALAAAHYGVTPEGT------------FEHGSSVLRLPRTDGSD 380
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
S P++ L RR L R +RP P DDKV+ +WNGL I++ A
Sbjct: 381 SP---PVDA--ARLDRIRRALLAARDERPAPGRDDKVVAAWNGLAIAALAETGAYF---- 431
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTH-RLQHSFRNGPSKA-PGFLDD 493
DR + +E A AA +R HL TH RL + R+G + G L+D
Sbjct: 432 ------------DRPDLVEAALGAADLLVRVHL---DTHGRLSRTSRDGRTGTNTGVLED 476
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA + G L L W +A L + + F D + G ++T + +++ R ++
Sbjct: 477 YADVAEGFLTLASVTGEGVWTDFAGLLLDHVLDRFRD-DSGALYDTAADAETLIHRPQDP 535
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
D A PSG + + L+ A++ AGS +R +E L+V
Sbjct: 536 TDNATPSGWNAAAGALLTYAAL-AGSTP--HRAASEQGLSV 573
>gi|441511562|ref|ZP_20993411.1| hypothetical protein GOAMI_01_00780 [Gordonia amicalis NBRC 100051]
gi|441453542|dbj|GAC51372.1| hypothetical protein GOAMI_01_00780 [Gordonia amicalis NBRC 100051]
Length = 674
Score = 317 bits (812), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 201/575 (34%), Positives = 284/575 (49%), Gaps = 65/575 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFEDE A +N FV IKVDREERPD+D +YM A+ G GGWP++ F
Sbjct: 60 CHWCHVMAHESFEDETTAAQMNRDFVCIKVDREERPDIDAIYMAATVAMTGQGGWPMTCF 119
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P GTY+PP + P F+ +L V +AW ++R L + A E + S
Sbjct: 120 LTPDSDPFYTGTYYPPRPRGQMPSFRQVLTAVTEAWTQRRADLDDTAAKVREHIVVNTSP 179
Query: 142 -SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A + + D L + +R ++ D GGFG APKFP + ++ H+++ DT
Sbjct: 180 LPAGTVPVDDRLLAHGVRTVLDE----EDREHGGFGGAPKFPPSALLDALIRHTERTGDT 235
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
A T+ M +GGI+D +GGGF RYSVD W VPHFEKMLYD QL
Sbjct: 236 AAIEAAGR-------TMHAMGRGGIYDQLGGGFARYSVDAGWVVPHFEKMLYDNAQLLRA 288
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y T D + + + +LRRD+ PGG S+ DAD+ EG+T YVW
Sbjct: 289 YAHLARRTGDALAHRVVEETVTFLRRDLRVPGG-FASSLDADAGGVEGST-------YVW 340
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T E+ ++LG A + V+ E S L
Sbjct: 341 TPDELAEVLGPEAGRRAAELF-----------------------VVTEQGTFEHGRSTLQ 377
Query: 381 MPLE-KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+P + + + LG R LFD R++R +P DDKV+ +WN + I++ A A L E+
Sbjct: 378 LPADPEDRDRLGTVRAALFDARARRVQPTRDDKVVTAWNAMTITALAEAGAGL---GETG 434
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
+ V +D +R HL RL+ S G A G LDD+A L +
Sbjct: 435 FVDDAVRCAD------------ELLRGHLVG---GRLRRSSLGGAVGADGGLDDHAALST 479
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGAE 558
LL L++ T+WL + L +T ELF D E G +F+ TGE ++ R ++ DGA
Sbjct: 480 ALLTLFQVTGETRWLGAGLGLLDTAIELFADPEAPGAWFDATGE--GLIARPRDPIDGAT 537
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
PSG S+ L+ + + ++ Y + EHSL+
Sbjct: 538 PSGASLMAEALLTASMLADPERAVGYAELLEHSLS 572
>gi|126659475|ref|ZP_01730608.1| hypothetical protein CY0110_07109 [Cyanothece sp. CCY0110]
gi|126619209|gb|EAZ89945.1| hypothetical protein CY0110_07109 [Cyanothece sp. CCY0110]
Length = 686
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 243/719 (33%), Positives = 346/719 (48%), Gaps = 110/719 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A LND F+ IKVDREERPD+D +YM+ +Q + GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIATYLNDNFLPIKVDREERPDLDSIYMSSLQMMGIQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP E +YGRPGF +L+ ++ +D +++ L F E L +
Sbjct: 108 IFLTPGDLVPFYGGTYFPVEPRYGRPGFLQVLQSIRHFYDVEKEKL---NGFKQEIL-KG 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQL-SKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKK 196
L SA+ LP + + + QL + D +A F RP M+ Y +
Sbjct: 164 LQQSAT-------LPMSEIDVNNAQLIYRGVDVNTKIIQVTAEDFGRPC-FPMIPYSNLA 215
Query: 197 LEDTG-KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
LE T GE E QK+V+ Q +A GGI DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 216 LEGTRFLFGEPEERQKLVIQRGQDLALGGIFDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 Q----LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
Q LAN++ + ++ + + +L+R+M P G ++A+DADS T+
Sbjct: 276 QIMEYLANLWSNG---QQEPAFERAIALTVQWLQREMTSPEGYFYAAQDADSFATKEDKE 332
Query: 312 KKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EG FYVW +++E +L + E + + P GN F+GKNVL N
Sbjct: 333 PEEGTFYVWKYEQLEQLLNTKKLEELTEVFTITPEGN------------FEGKNVLQRRN 380
Query: 371 DSSASASKLGMPLEK-YLNILGECRRKL---FDVRSKRPRPHL----------DDKVIVS 416
S S S + + L+K + G R L ++ + + D K+IV+
Sbjct: 381 GSKFSDS-IEIILDKLFQERYGTSRNNLETFLPAKNNQEAQEINWPGRIPAVTDTKMIVA 439
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHR 475
WN L+IS ARA I K P+ Y ++ +A FI + + + HR
Sbjct: 440 WNSLMISGLARAYAIFKQ---------PL-------YWQLGCNATQFILNKQWLNGRLHR 483
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDREGG 534
+ + G +DY FLI LLDL+ + T+WL AIE+Q DE F E G
Sbjct: 484 INYE---GNPSILAQSEDYGFLIKALLDLHAANAQETQWLDKAIEIQQEFDEFFWSLEMG 540
Query: 535 GYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
GY+N ++ + +L+R + D A PS N +++ NLVRLA + Y AE L
Sbjct: 541 GYYNNAADNSNDLLVRERSYIDNATPSANGIAISNLVRLARLTDNLD---YLDKAEQGLQ 597
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
F L + A P + A D LV ++ L K +
Sbjct: 598 AFSHILSESPRACPSLLTALDWYHFG-----CLVRTNETL--------------LPKLMT 638
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
P +D NN D V LVCQ SC P T L N ++E
Sbjct: 639 QYFPTTAYCLD--------------NNL-PDNAVGLVCQGLSCLEPATTEEQLLNQIIE 682
>gi|302536490|ref|ZP_07288832.1| conserved hypothetical protein [Streptomyces sp. C]
gi|302445385|gb|EFL17201.1| conserved hypothetical protein [Streptomyces sp. C]
Length = 687
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 223/700 (31%), Positives = 329/700 (47%), Gaps = 74/700 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED+ A +N+ FV+IKVDREERPD+D VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAGESFEDDLAAAYMNEHFVNIKVDREERPDIDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+PD +P GTYFPPE ++G P F +L V+ AW +R+ +++ + L+
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGMPSFMQVLEGVRTAWAGRREEVSEVAQRIVRDLAGRQ 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L + P+EL + L L++ YD+ GGFG APKFP + ++ +L H +
Sbjct: 168 LDYGRAGLPGPEELGRALL-----GLTREYDAARGGFGGAPKFPPSMVLEFLLRHHAR-- 220
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 -TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 275
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS E + + EGA+Y
Sbjct: 276 RVYAHLWRATGSDLARRVALETADFMVRELRTEQGGFASALDADS-EDPSSGKHVEGAYY 334
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT E+ ++LGE Y+ G + F+ +++L
Sbjct: 335 AWTPAELAEVLGEEDGAVAAAYF----GVTE-------EGTFEHGRSVLQLPQ------- 376
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G P+ + + R +L R +RP P DDKV+ +WNGL +++ A
Sbjct: 377 -GGPVVEAGKV-ASIRERLLAARGRRPAPGRDDKVVAAWNGLAVAALAECGAFF------ 428
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKA-PGFLDDYA 495
+R + +E A AA + R +D RL + R+G G L+DY
Sbjct: 429 ----------ERPDLVERAIEAADLLVRVHFDSTAGMARLARTSRDGRVGVNAGVLEDYG 478
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-H 554
+ G L L WL +A L + F G G T D L+R +D
Sbjct: 479 DVAEGFLALASVTGEGVWLEFAGFLVDLVMARFT--AGDGSLYDTAHDAEQLIRRPQDPT 536
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
D A PSG + + L+ S A + S +R+ AE +L V + A+
Sbjct: 537 DTAAPSGWTAAAGALL---SYAAHTGSAPHREAAERALGVVHALGPRAPRFIGHGLAVAE 593
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHIDPADTEEMDFWEEHNSNN 673
L V + V +VGH A+ L++T ++ P + + + +
Sbjct: 594 AL-VDGPREVAVVGHPED----------PATVALHRTALLATAPGAVVAVGLPRKADGSG 642
Query: 674 AS---MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A D A VC++F C+ P T+P+SL L
Sbjct: 643 GEFPLLAERTLVRDLPTAYVCRHFVCARPTTEPVSLAEQL 682
>gi|255033843|ref|YP_003084464.1| hypothetical protein Dfer_0027 [Dyadobacter fermentans DSM 18053]
gi|254946599|gb|ACT91299.1| protein of unknown function DUF255 [Dyadobacter fermentans DSM
18053]
Length = 671
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 224/685 (32%), Positives = 326/685 (47%), Gaps = 75/685 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME E FE E +A+++N +FV IKVDREERPDVD VYM VQA+ GGWPL+
Sbjct: 47 SACHWCHVMERECFEKEPIAEVMNAYFVCIKVDREERPDVDAVYMDAVQAMGVRGGWPLN 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL PD KP G TY PP++ + +L+ + A+ D LA S ++ + +
Sbjct: 107 VFLLPDSKPFYGVTYLPPQN------WVQLLKSINQAFTNHFDELADSAEGFVQNMIASE 160
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S + + L + EQ+ + +D++ GG APKF P + +L + D
Sbjct: 161 SQKYGLVEGTVHFNADDLDVMFEQIQRHFDTQKGGMDRAPKFMMPSIYKFLL----RYFD 216
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
++ EA V +L +A GGI+DHVGGG+ RYSVDE W +PHFEKMLYD QL +
Sbjct: 217 VSQNPEA---LAQVELSLNRIALGGIYDHVGGGWARYSVDEDWFIPHFEKMLYDNAQLLS 273
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY +A+SLT++ Y+ + +L +M G FSA DADS EG EG FY+
Sbjct: 274 VYAEAYSLTQNPLYASRIEQTIQWLSAEMRSADGGFFSALDADS---EGI----EGKFYI 326
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT +E++ +LGE F + Y + GN + G N L +A
Sbjct: 327 WTQQELQSVLGEDFDWFSKLYNISAQGNWE-----------HGYNHLHLTEPVEHAAKTA 375
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ + + KL + R +R RP LDDK++ SWNGL+I + L E
Sbjct: 376 GILTDDFAGRYENAVTKLAEKRRERVRPGLDDKILASWNGLLIKGLTDCYRALGHE---- 431
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
E E+A FI + +L HSF+NG + GFL+DYA +I
Sbjct: 432 ------------EIRELAIGTGHFIAGKM--TTGSKLNHSFKNGVATVTGFLEDYAAVIE 477
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
G L LY+ WL A +L F D+ G + T +++ R KE D P
Sbjct: 478 GYLGLYQITFEEDWLQKAQQLTEYALSNFYDQSEGFFHFTDAYGEALIARKKELFDNVIP 537
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV---PLMCCAADML 616
+ NS+ NL L ++ + DY + + + + L D+ L C A
Sbjct: 538 ASNSIMAQNLYTLGKML--DRDDYIEISDKMLSKMTKLLLADVQWVTNWAALYCQRA--- 592
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
VP+ + ++ G D + M + NK V+ + T + +
Sbjct: 593 -VPTAEIAIVGG-----DADAMRKDLDRFFIPNKIVMGTSTSSTLPL-----------LL 635
Query: 677 ARNNFSADKVVALVCQNFSCSPPVT 701
R + +A K VC + +C PVT
Sbjct: 636 NRTDINA-KTAIYVCYDKTCQLPVT 659
>gi|402820063|ref|ZP_10869630.1| hypothetical protein IMCC14465_08640 [alpha proteobacterium
IMCC14465]
gi|402510806|gb|EJW21068.1| hypothetical protein IMCC14465_08640 [alpha proteobacterium
IMCC14465]
Length = 751
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 225/724 (31%), Positives = 346/724 (47%), Gaps = 100/724 (13%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE+E +A ++ND FV+IKVDREERPD+D +YM+ + + GGWPL++F
Sbjct: 61 CHWCHVMAHESFENEDIASVMNDLFVNIKVDREERPDIDDIYMSALHMMGEQGGWPLTMF 120
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILR-----------KVKDAWDKKRDMLAQSGAF 130
L PD +P GGTYFPP K+GRPGF I R KV++ DK L
Sbjct: 121 LLPDGRPFWGGTYFPPIAKFGRPGFPDICREIARICTEETDKVQENADKLTQALQNKNNA 180
Query: 131 AIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM 190
A + ++ + S LP LP++ +E L++ D +GG APKFP+P+ +++
Sbjct: 181 AFKAANQKTALEQLSPNLPLGLPEDLASEASENLARQIDLTYGGMQGAPKFPQPLIYELL 240
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
+D ++G ++ VL TL + GGI DH+ GGF RYSVDE W VPHFEKM
Sbjct: 241 ------WQDWLRNGR-DVSREAVLITLSGLCHGGIFDHIRGGFSRYSVDEEWLVPHFEKM 293
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-------GPGGEIFSAED--- 300
+YD G + ++ + + T+D + +D+L DM+ G S +D
Sbjct: 294 IYDNGLILDLMGNVWKSTRDPMLTDRISKTVDWLLDDMLTNATNNSTDGAAALSKDDTPK 353
Query: 301 ---ADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPH 357
A +A + + +EG +YVWT E+ +LGE+ F Y + GN P
Sbjct: 354 PPAAFAASLDADSEGEEGKYYVWTVAELTSLLGENFPDFARTYRVTDAGNF-------PE 406
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLE----KYLNILGECRRKLFDVRSKRPRPHLDDKV 413
G NV I LN S G E + LNIL + ++ R RP DDK+
Sbjct: 407 GGGAGDNVNI-LNRLPPSLHNEGFDEEARHAQSLNILAQA-------QALRTRPERDDKI 458
Query: 414 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ- 472
+ WNGLVI++ AR S + ++ K+++E AE A + + + E+
Sbjct: 459 LADWNGLVIAALARLSPVFQN----------------KKWLETAERAYRDVMQTMSYEEG 502
Query: 473 -THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
+L H+ R +DY+ + L L+ +L A L T ++ + D
Sbjct: 503 GCLKLAHAARGESKLNISMAEDYSNMADAALALFSATGTASYLASAEALTKTLEQFYTD- 561
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
+ GG++ T+ + +++ R +DGA P+ N ++I + R ++ G + YR + E
Sbjct: 562 DVGGFYMTSSQAETLITRPHTSYDGATPNANG-TMIGVYRRLAVFTGKQD--YRDSLE-- 616
Query: 592 LAVFETRLKDMAMAVPLMC-CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA------ 644
A+ +T P M + + + V+VG S DF+ +L AHA
Sbjct: 617 -ALIKTHAIAAIKHYPQMPRYLTETENTRHQASCVIVGDPSDNDFKLLLETAHAHPCPGL 675
Query: 645 -------SYDLNKTV-IHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSC 696
DL + IH PA+ + NA+ + F+ D+ A VC + +C
Sbjct: 676 IVHPVGLGQDLPTHIPIHETPANP----------TKNATDDKMPFAFDQPTAYVCTHNTC 725
Query: 697 SPPV 700
PP
Sbjct: 726 LPPA 729
>gi|322697732|gb|EFY89508.1| DUF255 domain protein [Metarhizium acridum CQMa 102]
Length = 724
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 207/641 (32%), Positives = 322/641 (50%), Gaps = 74/641 (11%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+C +M ESF + A +LN+ FV + +DREERPD+D +YM YVQA+ GG
Sbjct: 63 HIGYKACHFCRLMTQESFSNPECAAILNESFVPVIIDREERPDIDTIYMNYVQAVSNVGG 122
Query: 76 WPLSVFLSPDLKPLMGGTYFP---------PEDKYGRPGFKTILRKVKDAWDKKR----- 121
WPL+VF++P+L+P+ GGTY+P E + P TI +KV+D W +
Sbjct: 123 WPLNVFVTPNLEPVFGGTYWPGPGTSRRVAAESEDESPDCLTIFKKVRDIWHDQETRCRK 182
Query: 122 ---DMLAQSGAFAIEQL------------------------SEALSASASSNKLPDELPQ 154
++LAQ FA E + + A ++ EL
Sbjct: 183 EASEVLAQLREFAAEGTLGTRGLTGTHPIATPSWNIPSNPENTPIRARDKDAQVSSELDL 242
Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY---HSKKLEDTGKSGEASEGQK 211
+ L ++ ++D +GGFG APKF P ++ +L+ ++D E
Sbjct: 243 DQLEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLAFLLHLNTFPSAVQDVVGEAECRHATV 302
Query: 212 MVLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL--- 267
M + TL+ + G +HDH+G GF R SV W +P+FEK++ D L +YLDA+ +
Sbjct: 303 MAVDTLRKIRDGALHDHIGATGFARCSVTPDWSIPNFEKLVVDNALLLVLYLDAWGIAGG 362
Query: 268 -TKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 325
FY + ++ DYL I P G + ++E ADS G +EGA+Y+WT +E
Sbjct: 363 KADSEFYDTVL-ELADYLSSPPIALPSGGLATSEAADSFMRRGDREMREGAYYLWTRREF 421
Query: 326 EDILG------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+ ++ + + + H+ ++ GN D DP+++F N+L + + +
Sbjct: 422 DSVVDASGQDKQISQVAAAHWDVQEGGNVDEDH--DPNDDFINHNILRVVKTPDELSRQF 479
Query: 380 GMPLEKYLNILGECRRKL-FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ + + R++L +R RP LDDKVI +WNGL IS+ A+AS LK
Sbjct: 480 NISTDTVRQHIQAARKELKARRERERVRPELDDKVITAWNGLAISALAQASSALK----- 534
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
PV + ++Y+ AESAA FI+ L+DE + L +R G + GF DDY +LI
Sbjct: 535 -----PVDPARSEKYLHAAESAAGFIKASLWDESSKLLYRIYREG-RETKGFADDYTYLI 588
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLLDL+ S L +A LQ TQ+ LF D + G +F+TT P +LR+K+ D +
Sbjct: 589 HGLLDLFAATSDESHLAFADALQKTQNSLFHDSDSGAFFSTTASSPQAILRLKDGMDTSL 648
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
PS N+V+ NL RL +++ + Y A ++ FE +
Sbjct: 649 PSINAVAASNLFRLGALL---DDEPYSTLARGTVNAFEAEM 686
>gi|428319651|ref|YP_007117533.1| hypothetical protein Osc7112_4848 [Oscillatoria nigro-viridis PCC
7112]
gi|428243331|gb|AFZ09117.1| hypothetical protein Osc7112_4848 [Oscillatoria nigro-viridis PCC
7112]
Length = 695
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 217/628 (34%), Positives = 319/628 (50%), Gaps = 82/628 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIAQYMNSHFIPIKVDREERPDIDSIYMQTLQMMTGQGGWPLN 107
Query: 80 VFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD + P GGTYFP E +YGRPGF +L+ ++ +D ++ + A + L ++
Sbjct: 108 VFLTPDERVPFYGGTYFPVEPRYGRPGFLEVLQAIRRFYDTEKGKVEAFKAEILSNLQQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ S + +L EL Q L + ++ G P FP M+ Y L
Sbjct: 168 AALSGVTAELNRELFQKGLEINTGIVA--------GHNPGPSFP------MIPYAELALR 213
Query: 199 DTGKSGEASEGQKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
T + E+ K V +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 214 GTRFNFESKYDSKQVCTQRGLDLALGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGQI 273
Query: 258 ANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
+ +S + + F + I + ++L+R+MI P G ++A+DADS T +EG
Sbjct: 274 VEYLANLWSAGIQEPAFETAIAGTV-EWLKREMIAPTGYFYAAQDADSFNTSEEVEPEEG 332
Query: 316 AFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----EL 369
AFYVWT E+E +L E K + + +GN F+GKNVL L
Sbjct: 333 AFYVWTYAELEQLLTAEELAEIKAQFTVSRSGN------------FEGKNVLQRRHPGRL 380
Query: 370 NDSSASA-SKL------GMP-LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
+D+ +A +KL G P K + D R D K+I +WN L+
Sbjct: 381 SDTVETALAKLFAVRYGGNPNTVKTFPPARNNQEAKNDSWPGRIPAVTDTKMIAAWNSLM 440
Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
IS ARA+ + + EY+E+A AA+FI + + E R Q
Sbjct: 441 ISGLARAAAVFGN----------------LEYLELAVKAANFILDNQWTE--GRFQRLNY 482
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYE----FGSGTK---------WLVWAIELQNTQDELF 528
+G S +DYA + LLDL++ G+G + WL A+++Q DE
Sbjct: 483 DGQSAVTAQSEDYALFVKALLDLHQASLTLGNGEEAKQLPNSQFWLEKALQVQEEFDEFL 542
Query: 529 LDREGGGYFNTTGEDPS--VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
E GGY+N T +D S +L+R + D A P+ N +++ +LVRLA + G +Y +
Sbjct: 543 WSVELGGYYN-TAQDASGDLLVRERSYIDNATPAANGIAIASLVRLA--LLGPNLEYLDR 599
Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAAD 614
AE L F + ++D A P + A D
Sbjct: 600 -AEQGLQAFSSIVQDSPQACPSLLSAID 626
>gi|336120019|ref|YP_004574797.1| hypothetical protein MLP_43800 [Microlunatus phosphovorus NM-1]
gi|334687809|dbj|BAK37394.1| hypothetical protein MLP_43800 [Microlunatus phosphovorus NM-1]
Length = 669
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 225/692 (32%), Positives = 319/692 (46%), Gaps = 78/692 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A LN+ FVS+KVDREERPDVD V+M QAL G GGWP++V
Sbjct: 49 ACHWCHVMAHESFEDETTAAYLNEHFVSVKVDREERPDVDAVFMAATQALAGQGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GTYFPP + G P F +L + AW +RD + S A +L
Sbjct: 109 FLTPDRRPFYAGTYFPPRARQGMPAFADVLAAIASAWRDRRDEVLSSVAHISGELERR-- 166
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ KLP E+ + L + L + +D GGFG APKFP + ++ +L +L D
Sbjct: 167 ---HAPKLPGEVTRAGLDVARANLQREFDEVRGGFGGAPKFPPSMVLEGLL----RLGD- 218
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
E MV T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L V
Sbjct: 219 ------DESMAMVDVTCEAMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLGV 272
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y + T++ + + +++L ++ P G ++ DADS + +G EGA+Y W
Sbjct: 273 YTHWWRRTQNPIGERVVAETVEWLVAELRTPQGGFAASLDADSLDEQG--HSAEGAYYAW 330
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+ +LGE + + ++D G++ L L D
Sbjct: 331 DPVGLTAVLGEDDGRWAAEVF----------GVTDQGTFEHGRSTLRLLGDPD------- 373
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
P+ L R +L R +RPRP DDKV+ +WNG +I+S A+ +
Sbjct: 374 -PVR-----LASARERLRTTREQRPRPGRDDKVVAAWNGWLIASLVEAAGVFG------- 420
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLI 498
R +++ +A AA I R H D RL+ + R+G A G L+DYA +
Sbjct: 421 ---------RPDWLALAREAAELIWRVHWVD---GRLRRTSRDGEVGSAAGVLEDYAAMT 468
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+ L + WL A L F D G G+F+T S+ LR ++ D A
Sbjct: 469 MAAVRLGCAEADATWLTRAEALAEVILAEFGD--GDGFFDTASGAESLYLRPQDPTDNAT 526
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSG S +V L LA +SD + + + A L+ AA L
Sbjct: 527 PSGLSATVHALALLAETT--GRSDLAERAERAAATAGGLVDRAPRFAGWLLAYAASRLVS 584
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
P V +VG S + + A+ +VI + D ++ +A
Sbjct: 585 PP-VQVAIVGDASDTGTQELARTAYRCAPAG-SVIMVGVPDEPGLEL----------LAD 632
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VC+ F C PVTD L + L
Sbjct: 633 RPLLDGRPTAYVCRGFVCRLPVTDSQELADQL 664
>gi|88604224|ref|YP_504402.1| hypothetical protein Mhun_2996 [Methanospirillum hungatei JF-1]
gi|88189686|gb|ABD42683.1| protein of unknown function DUF255 [Methanospirillum hungatei JF-1]
Length = 700
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 225/688 (32%), Positives = 310/688 (45%), Gaps = 77/688 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME FEDE VA LLN FVS+KVDREERPD+D+VYM QA+ G GGWPL VF
Sbjct: 53 CHWCHVMETVCFEDEVVASLLNTHFVSVKVDREERPDIDQVYMAVCQAMTGSGGWPLHVF 112
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD +P T+ P PG +L + W +R+ ++ +Q+ A+
Sbjct: 113 LTPDKRPFYAATFIPKMSSPNMPGMLDLLPYLASVWRDEREKVSDLS----DQIMSAIQE 168
Query: 142 SASSNKL--PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
L PDEL A R +L+ YD ++GGF APKFP + +L ++ +D
Sbjct: 169 QTRRGTLHDPDELIHTAAR----RLTALYDKKYGGFSPAPKFPSVPVLLFLLRYAVIHQD 224
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
M+ TL MA GG+ DH+ GGFHRY+ D W +PHFEKMLYDQ A
Sbjct: 225 RSI-------LDMITTTLNRMAWGGMRDHLDGGFHRYATDTAWKLPHFEKMLYDQAMCAI 277
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y + + +TK Y + R +L+Y+ + G S+EDADS EGA+Y+
Sbjct: 278 IYTEIWQVTKQDRYRRLARSVLEYMTTVLSDAPGGFSSSEDADSP-------GGEGAYYL 330
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W+ E+E I GE A L + + GN +S H G NVL D S
Sbjct: 331 WSYDEIEKIFGEEARLVCTMFGITREGN-----VSGMHGMKPGDNVLFPERDPLEILSAA 385
Query: 380 GM--PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
G+ P + Y +IL L + R +R RP LDDKV+ WN L I + A A + E+
Sbjct: 386 GVRDPEKTYASILN----TLTNARKERERPPLDDKVLTDWNALAIQALAFAGMVFHDESL 441
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
A SAA F+ ++ L H +RNG G DY L
Sbjct: 442 CTR----------------AISAAEFLFSNMVRPDGSVL-HRWRNGQGGIEGTAGDYVHL 484
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LY+ + WL AI L+ + + F D GGYF E + +R+KE DG
Sbjct: 485 AWACVTLYQTTGNSLWLRRAISLEKSASDRFYDSVHGGYFQVPSET-DLPVRMKEMTDGP 543
Query: 558 EPSGNSVSVINLVRLASIVA----GSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
S N + + L L +I G KS RQ E+ R D M
Sbjct: 544 TFSTNGAAYLLLCALFTITGDELYGQKS---RQIEEYQ------RSLDPRMITGCCTFLC 594
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
++ R VL S + + + +SY IHI E + +
Sbjct: 595 GLIEKNLRGTAVLCNTSGSTGDDEIWSLLWSSYLPGMIRIHI-----------RERSDSY 643
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVT 701
+ D +C + C PP+T
Sbjct: 644 FLPLYVHCQGDTPALHICSHQQCYPPIT 671
>gi|453051421|gb|EME98928.1| hypothetical protein H340_19073 [Streptomyces mobaraensis NBRC
13819 = DSM 40847]
Length = 680
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 230/696 (33%), Positives = 333/696 (47%), Gaps = 79/696 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE A LN+ FVS+KVDREERPD+D VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAGESFEDEETAAYLNEHFVSVKVDREERPDIDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPP ++G P F+ +L V AW +R+ + + ++ L+
Sbjct: 108 VFLTPDAEPFYFGTYFPPAPRHGMPSFRQVLEGVAAAWRDRREEVGEVAGRIVQDLARRP 167
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+A + P + L + L++ +D+ GGFG APKFP + ++ +L H +
Sbjct: 168 LTAAVGGQPP---AADELHMALMALTREFDAVRGGFGGAPKFPPSMVLEFLLRHHVR--- 221
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG + MV T + MA+GGIHD +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 222 TGSAA----ALDMVTATCEAMARGGIHDQLGGGFARYSVDNGWVVPHFEKMLYDNALLCR 277
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T + D D+L R+M G SA DADS + +G R +EGA+YV
Sbjct: 278 VYAHLWRATGSGLARRVALDTADFLVREMRTDQGGFASALDADSDDGQG--RHREGAYYV 335
Query: 320 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT ++ ++LGE A L +++ + G + +G +VL +L DS
Sbjct: 336 WTPEQFREVLGEADAELAADYFGVTEEGTFE-----------EGASVL-QLPDS------ 377
Query: 379 LGMPLEKYLNI--LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
E+ ++ + R +L R++RPRP DDKV+ WNGL I++ A
Sbjct: 378 -----ERLVDAERIASVRERLLAARARRPRPGRDDKVVAGWNGLAIAALAETGAYF---- 428
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
DR + ++ A AA + R D + S G L+DYA
Sbjct: 429 ------------DRPDLVQAATDAADLLVRTHMDWNARLFRTSLDGVAGGHAGVLEDYAD 476
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ G L L W+ +A L +T F D E G F+T + +++ R ++ D
Sbjct: 477 VAEGFLALSAVTGEGVWVDFAGLLLDTVLIRFRDEE-GALFDTADDAETLIRRPQDPTDN 535
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMC 610
A PSG S + L+ A++ + S +R+ AE +L V R +AV
Sbjct: 536 ATPSGWSAAAGALLTYAAL---TGSAPHREAAERALGVVRALGPKAPRFIGWGLAV---- 588
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A +L P V +VG + A S V +PA +
Sbjct: 589 -AEALLDGP--YEVAVVGPHDDPATRELHRTALLSQRPGLAVALGEPASATAAEV----- 640
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
+A A + A VC+ F+C P +DP L
Sbjct: 641 ---PLLADRPLLAGRPAAYVCRGFTCDAPTSDPEEL 673
>gi|381163013|ref|ZP_09872243.1| thioredoxin domain-containing protein [Saccharomonospora azurea
NA-128]
gi|379254918|gb|EHY88844.1| thioredoxin domain-containing protein [Saccharomonospora azurea
NA-128]
Length = 667
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 226/702 (32%), Positives = 322/702 (45%), Gaps = 96/702 (13%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF DE VA L+N+ FV+IKVDREERPD+D VYMT QA+ G GGWP++ F
Sbjct: 49 CHWCHVMAHESFSDEDVAALMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD KP GTY+PP +G P F+ +L V AW ++RD L + ++ + E
Sbjct: 109 LTPDGKPFHCGTYYPPVPAHGMPSFRQLLDAVAQAWRERRDELVEGAGRIVDHIVE---- 164
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ P + + +L D GGFG APKFP + ++ +L H E TG
Sbjct: 165 -QTKPLGPHPVTAETVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG 220
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ E +V T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L Y
Sbjct: 221 ----SVEALSIVDMTAEGMARGGIYDQLAGGFSRYSVDAGWVVPHFEKMLYDNALLLRFY 276
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + ++L RD+ P G S+ DAD TEG EG YVWT
Sbjct: 277 AHLARRTGSALAHRVAGETAEFLLRDLRTPQGAFASSLDAD---TEGV----EGLTYVWT 329
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
+++ D+LG + + V +E AS L +
Sbjct: 330 PQQLVDVLGPDDGAWAAATF----------------------GVTVE-GTFERGASTLRL 366
Query: 382 PLE-----KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
P + +++ + L + R+ RP+P DDKVI +WNGL I++ A A L+
Sbjct: 367 PRDPDDPSRWMRVTA----TLLEARNARPQPARDDKVIAAWNGLAITALAEAGVALQ--- 419
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
R E++E A +A +F+ H+ D R S R+G +A G L+DY
Sbjct: 420 -------------RPEWVEAAVAAGAFVLDAHVSDGTVLR---SSRDGVVGEAAGVLEDY 463
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKED 553
A L GLL L++ +WLV A L +T F G F+ T D L+ R +
Sbjct: 464 ACLADGLLSLHQATGEPRWLVEATALLDTAMRRFGVEGAPGAFHDTASDAEELVHRPSDP 523
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----L 608
D A PSG S L+ +++ + YR E ++ +R + VP
Sbjct: 524 TDNASPSGASALADALLTASALAGPEHAGTYRAACEEAV----SRAGALIAQVPRFAGHW 579
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+ A ML+ P + V +VG + E ++ AA + + E
Sbjct: 580 LSVAEAMLAGPVQ--VAVVGEDAQARHELVVEAATRVHGGGVVLGG------------EP 625
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC+ + C PVT P L + L
Sbjct: 626 EAEGVPLLADRPLVDGSPAAYVCRGYVCDRPVTTPEDLAHAL 667
>gi|428777664|ref|YP_007169451.1| hypothetical protein PCC7418_3117 [Halothece sp. PCC 7418]
gi|428691943|gb|AFZ45237.1| hypothetical protein PCC7418_3117 [Halothece sp. PCC 7418]
Length = 677
Score = 315 bits (807), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 232/698 (33%), Positives = 344/698 (49%), Gaps = 94/698 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ LND FV IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDSAIAQYLNDNFVPIKVDREERPDLDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+PD + P GGTYFP E ++GRPGF IL+ ++ +D++++ L F E +
Sbjct: 108 IFLTPDDRVPFYGGTYFPIEPRFGRPGFLDILKAIRRFYDQEKEKL---NTFKSEVMG-L 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG---GFGSAPKFPRPVEIQMMLYHSK 195
L SA+ LP+ L ++ L+K ++ G G+ P FP M+ Y
Sbjct: 164 LQQSAT-------LPETQTNLNSDLLTKGIETGVGITSHRGTPPSFP------MIPYAQL 210
Query: 196 KLEDTGKSGEASEGQKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
L T + E+ K V +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD
Sbjct: 211 ALRGTRFNYESRYDAKDVAQQRGYDLALGGIYDHVGGGFHRYTVDGTWTVPHFEKMLYDN 270
Query: 255 GQLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
GQ+ + +S + + F S I + + ++L+R+M P G ++++DADS T A
Sbjct: 271 GQIVEYLANLWSSGVEEPAFKSAIAQTV-EWLQREMTAPEGYFYASQDADSFTTSEADEP 329
Query: 313 KEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCD----LSRMSDPHNEFKGKNVLI 367
+EGAFYVW+ +E+E +L E + + + GN + L R + + + KN L
Sbjct: 330 EEGAFYVWSDRELETLLTAEELQALQSEFTVTAEGNFEGSNVLQRQNGGNLSNEAKNALK 389
Query: 368 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 427
+L ++ S + N E + ++ R P D K+I +WN L+IS AR
Sbjct: 390 KLFNARYGNSSIATFPPATNN--SEAKTTAWEGRIP---PVTDTKMITAWNSLMISGLAR 444
Query: 428 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSK 486
A + V G K Y + A A +FI + + E + HRL + NG +
Sbjct: 445 A--------------YAVFG--EKTYWDCAVKATNFIWENQWVEGRFHRLNY---NGKAT 485
Query: 487 APGFLDDYAFLISGLLDLYE-FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
+DYA I LLDL+ +WL A++LQ DE E GGYFNT ++ +
Sbjct: 486 VSAQSEDYALFIKALLDLHACHPEQPQWLDQAVQLQAEFDEYLWSVETGGYFNTANDNSN 545
Query: 546 -VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 604
+++R + D A P+ N V+V NLV+L I ++DY +AE +L F + ++
Sbjct: 546 DLIVRERTYIDNATPAANGVAVANLVQLFEIT--EQTDYL-ASAEKTLNAFSSIMEKSPQ 602
Query: 605 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD 664
A P + D H LV S L A Y L ++ +
Sbjct: 603 ACPGLFSGLDWY-----LHGTLVRSTSE-----QLQALMNQY-LPTCTYRVETS------ 645
Query: 665 FWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTD 702
D +ALVC+ +C P TD
Sbjct: 646 -----------------LPDSAIALVCKGLTCLEPATD 666
>gi|418461665|ref|ZP_13032732.1| thioredoxin domain-containing protein [Saccharomonospora azurea
SZMC 14600]
gi|359738246|gb|EHK87140.1| thioredoxin domain-containing protein [Saccharomonospora azurea
SZMC 14600]
Length = 667
Score = 315 bits (807), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 226/702 (32%), Positives = 322/702 (45%), Gaps = 96/702 (13%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF DE VA L+N+ FV+IKVDREERPD+D VYMT QA+ G GGWP++ F
Sbjct: 49 CHWCHVMAHESFSDEDVAALMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD KP GTY+PP +G P F+ +L V AW ++RD L + ++ + E
Sbjct: 109 LTPDGKPFHCGTYYPPVPAHGMPSFRQLLDAVAQAWRERRDELVEGAGRIVDHIVE---- 164
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ P + + +L D GGFG APKFP + ++ +L H E TG
Sbjct: 165 -QTKPLGPHPVTAETVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG 220
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ E +V T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L Y
Sbjct: 221 ----SVEALSIVDMTAEGMARGGIYDQLAGGFSRYSVDAGWVVPHFEKMLYDNALLLRFY 276
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + ++L RD+ P G S+ DAD TEG EG YVWT
Sbjct: 277 AHLARRTGSALAHRVAGETAEFLLRDLRTPQGAFASSLDAD---TEGV----EGLTYVWT 329
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
+++ D+LG + + V +E AS L +
Sbjct: 330 PQQLVDVLGPDDGAWAAATF----------------------GVTVE-GTFERGASTLRL 366
Query: 382 PLE-----KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
P + +++ + L + R+ RP+P DDKVI +WNGL I++ A A L+
Sbjct: 367 PRDPDDPSRWMRVTA----TLLEARNARPQPARDDKVIAAWNGLAITALAEAGVALQ--- 419
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
R E++E A +A +F+ H+ D R S R+G +A G L+DY
Sbjct: 420 -------------RPEWVEAAVAAGAFVLDAHVSDGTVLR---SSRDGVVGEAAGVLEDY 463
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKED 553
A L GLL L++ +WLV A L +T F G F+ T D L+ R +
Sbjct: 464 ACLADGLLSLHQATGEPRWLVEATALLDTAMRRFGVEGAPGAFHDTASDAEELVHRPSDP 523
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----L 608
D A PSG S L+ +++ + YR E ++ +R + VP
Sbjct: 524 TDNASPSGASALAGALLTASALAGPEHAGTYRAACEEAV----SRAGALIAQVPRFAGHW 579
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
+ A ML+ P + V +VG + E ++ AA + + E
Sbjct: 580 LSVAEAMLAGPVQ--VAVVGEDAQARHELVVEAATRVHGGGVVLGG------------EP 625
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC+ + C PVT P L + L
Sbjct: 626 EAEGVPLLADRPLVDGSPAAYVCRGYVCDRPVTTPEDLAHAL 667
>gi|88813137|ref|ZP_01128378.1| hypothetical protein NB231_12691 [Nitrococcus mobilis Nb-231]
gi|88789621|gb|EAR20747.1| hypothetical protein NB231_12691 [Nitrococcus mobilis Nb-231]
Length = 689
Score = 315 bits (807), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 196/551 (35%), Positives = 297/551 (53%), Gaps = 56/551 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPL 78
+ CHWCHVM ESFEDE +A+ +N+ F++IKVDREERPD+D++Y T Q L GGWPL
Sbjct: 54 SACHWCHVMAHESFEDETIARAMNEHFINIKVDREERPDLDRIYQTAHQLLNNRPGGWPL 113
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA---QSGAFAIEQL 135
+VFL+P+ P GTYFPP+ YG PGF IL ++ A+ ++ + + Q+ A+ +L
Sbjct: 114 TVFLTPEQMPFFCGTYFPPKSHYGLPGFHEILLQIAQAYRQQHEAIKKQNQAVLDALNRL 173
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQ-LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
SE A + P+ AL A L++ +DS FGGFG APKFP+P I+ +L H
Sbjct: 174 SEPPPNRAGA-------PKAALFDNARSALAREFDSTFGGFGPAPKFPQPSSIERLLRHY 226
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ + +M TL+ MA GGI+D +GGGF RYSVD W +PHFEKMLYD
Sbjct: 227 AR--TAANDVPDYDALRMAQLTLRKMALGGIYDQIGGGFARYSVDNYWIIPHFEKMLYDN 284
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
GQL +Y DA+ T + + + + ++ R+M P G +++ DADS EG E
Sbjct: 285 GQLLALYADAWRATGEELFQRVANETAEWALREMRHPDGAFYASLDADS---EGG----E 337
Query: 315 GAFYVWTSKEVEDILGE---HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
GAFY+WT +E+ ++L E +L + C L+ + F+G+ L
Sbjct: 338 GAFYLWTPEEIRNVLREDEAEVVLAR----------CGLNNQPN----FEGRWHLYVRLT 383
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+ A+ P ++ + + R +L + R +RPRP D+KV+ SWN L++S ARA +
Sbjct: 384 FTDLANNQHRPRQELIALWRSARERLREAREQRPRPPRDEKVLTSWNALMVSGLARAGRR 443
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
+ A +A + F+ +L+ + RL +++G + P +L
Sbjct: 444 FGNTALTA----------------AGDQTLHFLHSNLW--RNGRLLTVWKDGQADLPAYL 485
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
DD+A+L++ LL+ E WL WA + + F D+ GG+F T + ++ R +
Sbjct: 486 DDHAYLLAALLEQLEARWEPHWLQWARAIADLLLARFEDKTHGGFFFTADDHEPLVQRPR 545
Query: 552 EDHDGAEPSGN 562
D A PSGN
Sbjct: 546 PLGDDACPSGN 556
>gi|340975510|gb|EGS22625.1| hypothetical protein CTHT_0010970 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 785
Score = 315 bits (806), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 215/656 (32%), Positives = 325/656 (49%), Gaps = 102/656 (15%)
Query: 23 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
H+CH+ +SF + VA+ LN F+ I +DREERPD+D ++ Y +A+ GGWPL++FL
Sbjct: 84 HFCHLTTQDSFSNPAVAEFLNQSFIPILIDREERPDLDTIFQNYSEAVNATGGWPLNLFL 143
Query: 83 SPDLKPLMGGTYF-----------------------PPEDKYGRPGFKTILRKVKDAWDK 119
+PDL P+ GGTY+ P ED YG F I +K+ W
Sbjct: 144 TPDLYPIFGGTYWPGPGTEHSTLGSDRASESAIAGEPGEDSYG--DFLAIAKKIHGFWVT 201
Query: 120 KRDM--------------LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLS 165
+ + AQ G F+ S + +++A+ N +L + L +++
Sbjct: 202 QEERCRREAFEMLHKLQDFAQEGTFSTPVGSGSAASAAADNS---DLDLDQLDEALTRIA 258
Query: 166 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAK 222
K +D + GFG+ PKFP P + +L +K ++ D E G M L TL+ +
Sbjct: 259 KMFDPVYHGFGT-PKFPNPARLSFLLRLAKFPTEVSDVIGEREVENGTAMALKTLRRIRD 317
Query: 223 GGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF----------SLTKDVF 272
GG+HDH+G GF R+SV + W +PHFEKM+ + L V+LDA+ SL +
Sbjct: 318 GGLHDHLGAGFMRFSVTKNWGLPHFEKMVCENALLLGVFLDAWLGYTAGPKGPSLQDE-- 375
Query: 273 YSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 330
++ + ++ DYL +I P G ++E ADS G +EGA+Y+WT +E + ++G
Sbjct: 376 FADVVVEVADYLTGPIIRTPQGGFVTSEAADSYYRRGDKHMREGAYYLWTRREFDQVVGG 435
Query: 331 ------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
+HA+ Y+ + ++ + +DP +EF +NVL D + + GMP
Sbjct: 436 SGTSSDDHALAVAAAYW-NVLEDGNVPQENDPFDEFINQNVLCVNRDVVELSRQFGMPQA 494
Query: 385 KYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
+ ++ + R KL R K R RP D+KV+VS NG+VIS+ AR + LK
Sbjct: 495 EIRRVVDDARAKLRAHREKERVRPERDEKVVVSTNGMVISALARTAAALKG--------- 545
Query: 444 PVVGSDR-KEYMEVAESAASFIRRHLYDEQT---HRLQHSFRNGPSKAPGFLDDYAFLIS 499
V +R Y++ AE AASFI+ L+DE+ + L+ + PS F DDYAFLI
Sbjct: 546 --VDDERAARYLKAAEQAASFIKEKLWDEKQTAGNPLRRFWYQRPSDTKAFADDYAFLIE 603
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLD----------------REGGGYFNTTGED 543
GLLDLY KW WA +LQ+ Q LF D GG Y N
Sbjct: 604 GLLDLYTTTLDKKWADWAKQLQDAQIRLFYDPIVPATTGAQPSPRQAYSGGFYSNELAAI 663
Query: 544 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
+LR+K D ++PS N+V+ NL RL ++ A S Y A ++ FE +
Sbjct: 664 SPTILRLKSGMDKSQPSTNAVAAANLFRLGALFA---SKEYTSLARETVNAFEAEV 716
>gi|291447326|ref|ZP_06586716.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
15998]
gi|291350273|gb|EFE77177.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
15998]
Length = 679
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 206/580 (35%), Positives = 292/580 (50%), Gaps = 59/580 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFED+ A LN FV +KVDREERPDVD VYM VQA G GGWP++V
Sbjct: 55 SCHWCHVMAHESFEDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTV 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEAL 139
FL+PD +P GTYFPPE ++G P F+ +L V AW +RD +A+ +G + +L
Sbjct: 115 FLTPDAEPFYFGTYFPPEPRHGSPSFQQVLEGVTAAWTDRRDEVAEVAGRIVADLAGRSL 174
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
E+ Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 175 VHGGDGVPGESEVAQALL-----GLTREYDEQHGGFGGAPKFPPAMVVEFLLRHYAR--- 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 227 TGAEG----ALQMAADTCTAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCR 282
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T I + D++ R++ G SA DADS + +G + EGA+YV
Sbjct: 283 VYAHLWRTTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--KHVEGAYYV 340
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT ++ ++LGE F Y+ +++ +G +VL D+
Sbjct: 341 WTPAQLREVLGEDDGAFAAAYF----------GVTEDGTFEEGASVLRLPGDAG------ 384
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
P++ + G R +L R +RPRP DDKV+ +WNGL I++ A
Sbjct: 385 --PVDA-ARVAG-VRARLLAARDERPRPGRDDKVVAAWNGLAIAALAETGAYF------- 433
Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
DR + +E A AA +R HL + RL + ++G G L+DY +
Sbjct: 434 ---------DRPDLVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDV 482
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L L WL +A L + E F EGG ++T + ++ R ++ D A
Sbjct: 483 AEGFLALAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSA 541
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
PSG + + L+ S A + S+ +R AE +L V +
Sbjct: 542 TPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGVVKA 578
>gi|239990319|ref|ZP_04710983.1| hypothetical protein SrosN1_23633 [Streptomyces roseosporus NRRL
11379]
Length = 673
Score = 314 bits (805), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 206/580 (35%), Positives = 292/580 (50%), Gaps = 59/580 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFED+ A LN FV +KVDREERPDVD VYM VQA G GGWP++V
Sbjct: 49 SCHWCHVMAHESFEDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEAL 139
FL+PD +P GTYFPPE ++G P F+ +L V AW +RD +A+ +G + +L
Sbjct: 109 FLTPDAEPFYFGTYFPPEPRHGSPSFQQVLEGVTAAWTDRRDEVAEVAGRIVADLAGRSL 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
E+ Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 169 VHGGDGVPGESEVAQALL-----GLTREYDEQHGGFGGAPKFPPAMVVEFLLRHYAR--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 TGAEG----ALQMAADTCTAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCR 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T I + D++ R++ G SA DADS + +G + EGA+YV
Sbjct: 277 VYAHLWRTTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--KHVEGAYYV 334
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT ++ ++LGE F Y+ +++ +G +VL D+
Sbjct: 335 WTPAQLREVLGEDDGAFAAAYF----------GVTEDGTFEEGASVLRLPGDAG------ 378
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
P++ + G R +L R +RPRP DDKV+ +WNGL I++ A
Sbjct: 379 --PVDA-ARVAG-VRARLLAARDERPRPGRDDKVVAAWNGLAIAALAETGAYF------- 427
Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
DR + +E A AA +R HL + RL + ++G G L+DY +
Sbjct: 428 ---------DRPDLVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDV 476
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L L WL +A L + E F EGG ++T + ++ R ++ D A
Sbjct: 477 AEGFLALAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
PSG + + L+ S A + S+ +R AE +L V +
Sbjct: 536 TPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGVVKA 572
>gi|428201584|ref|YP_007080173.1| thioredoxin domain-containing protein [Pleurocapsa sp. PCC 7327]
gi|427979016|gb|AFY76616.1| thioredoxin domain protein [Pleurocapsa sp. PCC 7327]
Length = 685
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 238/713 (33%), Positives = 336/713 (47%), Gaps = 110/713 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEREAFSDSAIAEYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL P DL P GGTYFP E +YGRPGF +L+ ++ +D +++ L A++Q E
Sbjct: 108 IFLIPGDLVPFYGGTYFPLEPRYGRPGFLQVLQSIRRFYDVEKEKLD-----ALKQ--EI 160
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKL 197
L S LP + L E L + ++ G A F RP M+ Y S L
Sbjct: 161 LGGLKQSTILPISTSDS---LSKELLYRGVETNTGVISIGASDFGRP-SFPMIPYASLAL 216
Query: 198 EDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ + E+ +G+++ + +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 217 QGSRFQFESRYDGRQLSARRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQ 276
Query: 257 LANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
+ + +S K+ + + +L+R+M P G ++A+DADS + A+ +EG
Sbjct: 277 ILEYLSNLWSAGMKEPAFERAIAGTVAWLKREMTTPEGYFYAAQDADSFTSTEASEPEEG 336
Query: 316 AFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
AFYVW E+E IL + K + + GN F+G NVL
Sbjct: 337 AFYVWRYDELEKILTADELEELKAAFTITEKGN------------FEGSNVL-----QRK 379
Query: 375 SASKLGMPLEKYLNILGECR--RKLFDVRSKRPRPH----------------LDDKVIVS 416
+ KL LE L+ L E R K ++ + P + D K+I +
Sbjct: 380 ESGKLSDSLEAILDKLFEVRYGAKSTEIETFVPARNNQEAKTGNWKGRIPAVTDTKMIAA 439
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHR 475
WN L IS ARA A+F P Y E+A AA FI + + E + HR
Sbjct: 440 WNSLTISGLARA---------YAVFGEP-------SYWELATRAAKFILEYQWIEGRFHR 483
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGG 534
L + G + +DYAF I LLDL + T WL A+E+Q DE F E G
Sbjct: 484 LNY---EGQATVLAQSEDYAFFIKALLDLQAASPTETFWLEKAVEVQQEFDEFFWSLEMG 540
Query: 535 GYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
GYFNT +D +L+R + D A P+ N V++ NL+R+A + + Y AE L
Sbjct: 541 GYFNTAADDSGDLLVRSRSYIDNATPAANGVAIANLIRIALLTENLE---YLDRAEQGLQ 597
Query: 594 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVI 653
F L+ A P + A D H LV K E L Y TV+
Sbjct: 598 AFSAVLQQSPQACPSLFAALDWY-----LHATLVRTK-----EEQLKTLIPQY--FPTVV 645
Query: 654 HIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
+ +D E K V ++C+ SC P L
Sbjct: 646 YRIESDLPE----------------------KAVGIICRGLSCLEPAQSQAQL 676
>gi|375102437|ref|ZP_09748700.1| thioredoxin domain containing protein [Saccharomonospora cyanea
NA-134]
gi|374663169|gb|EHR63047.1| thioredoxin domain containing protein [Saccharomonospora cyanea
NA-134]
Length = 670
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 219/698 (31%), Positives = 325/698 (46%), Gaps = 85/698 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF D+ VA +N+ FV+IKVDREERPD+D VYMT QA+ G GGWP++ F
Sbjct: 49 CHWCHVMAHESFADDDVAAFMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD +P GTY+PP +G P FK +L V AW ++RD L + ++ ++E
Sbjct: 109 LTPDAEPFHCGTYYPPVPAHGIPAFKQLLTAVDQAWRERRDELVEGAGRIVDHIAE---- 164
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ P + + + +L D GGFG APKFP + ++ +L H E TG
Sbjct: 165 -QTGPLSPHPVTGDTVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG 220
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ E +V T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L Y
Sbjct: 221 ----SVEALSIVDMTAEGMARGGIYDQLAGGFARYSVDSGWVVPHFEKMLYDNALLLRFY 276
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + ++L RD+ P G ++ DAD+ EG T YVWT
Sbjct: 277 AHLARRTDSPLAHRVAGETAEFLLRDLRTPQGAFAASLDADTEGVEGLT-------YVWT 329
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+++ ++LG + E + + G F+ ++L AS
Sbjct: 330 PQQLVEVLGPDDGAWAAETFGVTEEGT------------FEHGASTLQLRRDPDDAS--- 374
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+++ + L R+ RP+P DDKVI +WNGL I++ A A L+
Sbjct: 375 ----RWMRVT----SALLQARNARPQPARDDKVIAAWNGLAITALAEAGVALQ------- 419
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLI 498
R E++E A +A +F+ H + L+ + R+G A G L+DY L
Sbjct: 420 ---------RPEWVEAAVAAGAFVLDVHAGGDTAGGLRRTSRDGVVGTAAGVLEDYGCLA 470
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGA 557
GLL L++ + WLV A L +T F G F+ T D L+ R + D A
Sbjct: 471 DGLLALHQATGESVWLVEATTLLDTALRRFGVEGAPGAFHDTAADAEALVHRPSDPTDNA 530
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCA 612
PSG S L+ +++ ++ YR E +L +R + VP + A
Sbjct: 531 SPSGASALAGALLPASALAGPERAGTYRAACEEAL----SRAGALVAQVPRFAGHWLSVA 586
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
+LS P + V +VG ++ E ++ AA + + AD +
Sbjct: 587 EALLSGPVQ--VAVVGTDAADRAELVVEAARRVHGGGVVLGGSPEADGVPL--------- 635
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC+ + C PVT P +L L
Sbjct: 636 ---LADRPLADGAPAAYVCRGYVCDRPVTTPEALARSL 670
>gi|428772641|ref|YP_007164429.1| hypothetical protein Cyast_0808 [Cyanobacterium stanieri PCC 7202]
gi|428686920|gb|AFZ46780.1| protein of unknown function DUF255 [Cyanobacterium stanieri PCC
7202]
Length = 686
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 211/611 (34%), Positives = 314/611 (51%), Gaps = 72/611 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWC VME E+F D +A LN F++IKVDREERPD+D +YM +Q + G GGWPL++
Sbjct: 49 SCHWCTVMEGEAFSDGAIADYLNQNFIAIKVDREERPDIDSIYMQGLQMMTGQGGWPLNI 108
Query: 81 FLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
FL+P DL P GGTYFP E +YGRPGF IL + + + ++ D L + L +
Sbjct: 109 FLTPHDLVPFYGGTYFPLEPRYGRPGFLQILESIHNFYHQQTDKLNALKEEIVSILENNI 168
Query: 140 SASAS-SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + S N L +L L ++ L + + +GG P+FP MM Y + L
Sbjct: 169 NLNPSIENHLNTKLLIQGLEKNSQILGR---NEYGG----PRFP------MMPYSNTTLT 215
Query: 199 --DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
T A + ++ + + GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 216 AIHTLPPETAQKAHQLGIQRGIDLVNGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGL 275
Query: 257 LANVYLDAFSLTK-DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
+ + +S K + Y C L +L R+M+ P G +SA+DAD+ +EG
Sbjct: 276 IMEFLANLWSSGKENPQYHIACEGTLQWLEREMVAPEGYFYSAQDADNFGNIQDEEPEEG 335
Query: 316 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
FYVW +++ IL E I +E + + GN F+GKNVL + D A
Sbjct: 336 EFYVWHYLDLQQILSHEELIALQEVFTISNEGN------------FEGKNVLQKHPD-KA 382
Query: 375 SASKLGMPLEKYLNI-LGECRRKLFDVRSKRPR-------------PHLDDKVIVSWNGL 420
+ L+K + G+ +L R P D K+IV+WN L
Sbjct: 383 ITPMVKNALDKLFTMRYGQTPERLTTFPPARNNHEAKSLEWLGRIPPVTDTKMIVAWNSL 442
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ-THRLQHS 479
+IS ARA + K+E +Y+E+AESA FI ++ ++ Q +RL +
Sbjct: 443 MISGLARAYGVFKNE----------------KYLELAESAVKFILKNQWENQRLYRLNYG 486
Query: 480 FRNGPSKAPGFLDDYAFLISGLLDLYE--FGSGTKWLVWAIELQNTQDELFLDREGGGYF 537
+ +DYAFL+ LLDL + +G WL AI++Q D+ D++ GGY+
Sbjct: 487 NK---VSVLAQSEDYAFLVKALLDLQQNSLNAGNYWLEKAIKVQQEFDDYCYDQKNGGYY 543
Query: 538 NTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
N ++ S +L++ K D A PS N V+V NL+RL + DY+ + AE +L +F
Sbjct: 544 NNAYDNSSDLLIKEKGYIDNATPSPNGVAVANLLRLGLMT--DNLDYFEK-AEQTLKIFA 600
Query: 597 TRLKDMAMAVP 607
++ + ++ P
Sbjct: 601 DKMVNSPVSCP 611
>gi|350269357|ref|YP_004880665.1| hypothetical protein OBV_09610 [Oscillibacter valericigenes
Sjm18-20]
gi|348594199|dbj|BAK98159.1| hypothetical protein OBV_09610 [Oscillibacter valericigenes
Sjm18-20]
Length = 642
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 201/578 (34%), Positives = 291/578 (50%), Gaps = 81/578 (14%)
Query: 3 RRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDV 59
+ +F T+ + FL ++CHWCHVM ESFEDE VA +LN FVS+KVDREERPD+
Sbjct: 47 QEAFKKATRENKPVFLSIGYSSCHWCHVMAKESFEDETVAGVLNKSFVSVKVDREERPDI 106
Query: 60 DKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK 119
D +YM Q GGGGWP SVF++PD KP GTYFP + F +L +++ W +
Sbjct: 107 DNIYMRVCQTFTGGGGWPTSVFMTPDQKPFFAGTYFP------KAPFLDLLEVIREKWAE 160
Query: 120 KRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAP 179
+ L G Q++E L+ S S + P P ++ L +++D+ FGGFG AP
Sbjct: 161 DKQALLNQG----NQITETLTHSTHSPQTPQTAP---IKAAVSALKETFDNEFGGFGRAP 213
Query: 180 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 239
KFP P + ++L + + + TL M KGGI D +G GF RYS D
Sbjct: 214 KFPTPHILYLLLKTAPDMAEK---------------TLIQMYKGGIFDQIGFGFSRYSTD 258
Query: 240 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 299
W VPHFEKMLYD LA YL AF T Y + L Y+ RD+ P G FSA+
Sbjct: 259 RFWLVPHFEKMLYDNALLATAYLMAFEQTGRELYRTVAEKTLLYMERDLGSPEGGFFSAQ 318
Query: 300 DADSAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHN 358
DADS +EG +YV+ +E+ +LGE F ++ + GN
Sbjct: 319 DADS-------DGEEGKYYVFKPEELTALLGEAEGRRFNAYFGITQNGN----------- 360
Query: 359 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 418
F+G ++ +N+SS S ++K+L K+++ R R D KV+ SWN
Sbjct: 361 -FEGYSIPNLINNSSMDDS-----VDKFL-------PKVYEYRKSRTSLRTDQKVLTSWN 407
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 478
L +++ A A +I+ ++ Y++ A F+ R + D T +
Sbjct: 408 ALALAACANAYRII----------------GKRAYLDTALKTFGFMEREVTDGDT--VFC 449
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 538
+G GFLDDYAF I L+ L++ +L+ A +LQ + D + GG+F
Sbjct: 450 GVTDGVRGGVGFLDDYAFYIYALICLHQATQDPAFLIRAQDLQIKAISEYFDDQNGGFFF 509
Query: 539 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
+ + ++ KE +DGA PSGNSV NL RL ++
Sbjct: 510 SGKSNEKLIFNPKETYDGAIPSGNSVMAYNLARLYALT 547
>gi|411002310|ref|ZP_11378639.1| hypothetical protein SgloC_05852 [Streptomyces globisporus C-1027]
Length = 673
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 226/691 (32%), Positives = 326/691 (47%), Gaps = 85/691 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFED+ A LN FV +KVDREERPDVD VYM VQA G GGWP++V
Sbjct: 49 SCHWCHVMAHESFEDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEAL 139
FL+PD +P GTYFPPE ++G P F+ +L V AW +R+ +A+ +G + +L
Sbjct: 109 FLTPDAEPFYFGTYFPPEPRHGSPSFQQVLEGVTTAWTDRREEVAEVAGRIVADLAGRSL 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
E+ Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 169 VHGGDGVPGESEVAQALL-----GLTREYDEQHGGFGGAPKFPPAMAVEFLLRHYAR--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 TGAEG----ALQMAADTCAAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCR 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T I D++ R++ G SA DADS + EG R EGAFYV
Sbjct: 277 VYAHLWRATGSDEARRIALKTADFMVRELRTAEGGFASALDADSEDAEG--RHVEGAFYV 334
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT +++ ++LGE F Y+ +++ +G +VL D+
Sbjct: 335 WTPEQLREVLGEDDAAFAAAYF----------GVTEEGTFEEGASVLRLPGDTG------ 378
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
P++ + G R +L R +RP P DDKV+ +WNGL I++ A
Sbjct: 379 --PVDA-ARVAG-VRARLLAARDERPHPGRDDKVVAAWNGLAIAALAETGAYF------- 427
Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
DR + +E A AA +R HL + RL + ++G G L+DY +
Sbjct: 428 ---------DRPDLVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDV 476
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L L WL +A L + E F EGG ++T + ++ R ++ D A
Sbjct: 477 AEGFLALAAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCA 612
PSG + + L+ S A + S+ +R AE +L V +K + VP + A
Sbjct: 536 TPSGWTAAAGALL---SYAAYTGSEAHRTAAEGALGV----VKALGPRVPRFVGWGLAVA 588
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT-VIHIDPADTEEMDFWEEHNS 671
+L P + V + G +L++T ++ P + +
Sbjct: 589 EALLDGP--REVAVAGPVGG--------------ELHRTALLGRAPGAVVAAGEGPDAGA 632
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTD 702
+ + A VC++F C P TD
Sbjct: 633 EFPLLVDRPLVGGEPTAYVCRHFVCDAPTTD 663
>gi|144899665|emb|CAM76529.1| Protein of unknown function DUF255 [Magnetospirillum
gryphiswaldense MSR-1]
Length = 650
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 222/695 (31%), Positives = 321/695 (46%), Gaps = 104/695 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFE+ +A L+N FV++K+DREERPD+D +Y +Q + GGWPL+
Sbjct: 53 SACHWCHVMAHESFENPEIAALMNRLFVNVKIDREERPDLDAIYQQALQHMGQHGGWPLT 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+F +PD KP GGTYFPP +YGRPGF +L+ + D W + RD + + + L EAL
Sbjct: 113 MFCTPDGKPFWGGTYFPPAPRYGRPGFPEVLQAIHDLWQRDRDRVDHN----VAALVEAL 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ + P L L A+ + D GG G APKFP+P + +K+
Sbjct: 169 AHDGGGDASP--LTLEMLDRGAKAILSHVDMEHGGLGGAPKFPQPGLFDYLWRSAKR--- 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG SG + V TL + +GGI DH+GGGF RYS D+ W PHFEKMLYD GQL +
Sbjct: 224 TGNSGL----HQAVTLTLDRICQGGITDHLGGGFMRYSTDDVWLAPHFEKMLYDNGQLID 279
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+ + T++ + + + ++ R+M+ E + A A++EG EG FY
Sbjct: 280 LLTLVWQDTQNPLFQTRIEECITWVSREML---AEGAAFAAALDADSEG----HEGRFYT 332
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
W ++E+ D+LG E A +F + Y + GN ++G N+ LN S
Sbjct: 333 WKAQEIIDLLGPETARIFAQAYDVSIQGN------------WEGVNI---LNRSKPQG-- 375
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
++ L + R L R+ R RP DDKV+ WNG++I+ ARA +
Sbjct: 376 -----HEHEEQLAQARTILLAARANRIRPGRDDKVLADWNGMMIAGLARAGFVFI----- 425
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFI--RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
R +++++AE A + I + L D+ RL HS + GF DD A
Sbjct: 426 -----------RPDWLDMAERAFAVITDKMTLADD---RLAHSLCQEQASHVGFADDLAH 471
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+ L LY+ +L WA D D+ GGYF V++R K D
Sbjct: 472 MARAALALYQATGKADYLTWAETWVAAADRHHWDKAKGGYFQVAHSASDVIVRTKTVMDA 531
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PS N V L LA I + Y A+ + VF + D
Sbjct: 532 AVPSANGTMVQVLAILAQI---TDKPAYADRAQAVVTVFMDQFND--------------- 573
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN-KTVIHIDPADTEEMDFWEEHNSNNAS 675
F NM +A +DL V+ P + EM H +
Sbjct: 574 -----------------HFANM-SALLTGFDLAVDPVLVTLPRNNAEMIDVVRHAALPNL 615
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ R D+V+A +C+N CS P P L +L
Sbjct: 616 IIR---WTDEVMATLCRNSVCSAPTGSPADLARML 647
>gi|400597948|gb|EJP65672.1| DUF255 domain protein [Beauveria bassiana ARSEF 2860]
Length = 731
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 202/615 (32%), Positives = 321/615 (52%), Gaps = 70/615 (11%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+C +M ESF + A +LND F+ + +DRE RPD+D +YM YVQA+ GG
Sbjct: 75 HIGYKACHYCRLMSTESFANTECAAVLNDAFIPVLIDRESRPDLDTIYMNYVQAVSSVGG 134
Query: 76 WPLSVFLSPDLKPLMGGTYFPPEDKYGRP---------GFKTILRKVKDAWDKKR----- 121
WPL++F++P+L+P+ GGTY+P + R F TI++KV+D W ++
Sbjct: 135 WPLNLFVTPELEPIFGGTYWPGPNAAPRAHDENAEDALDFLTIVKKVRDIWKEQEARCRK 194
Query: 122 ---DMLAQSGAFAIE------QLSEALSASASSNKLP--DELPQNALR-------LCAEQ 163
++LAQ FA E +++A + + S P E Q A++ L +Q
Sbjct: 195 EATEVLAQLREFAAEGTLGTRAIAQAQTIAPSGWAAPAHSEQTQEAVKNVSVSSELDLDQ 254
Query: 164 LSKSY-------DSRFGGFGSAPKFPRPVEIQMMLY---HSKKLEDTGKSGEASEGQKMV 213
+ ++Y D +GGFG APKF P ++Q ++ ++D E + M
Sbjct: 255 VEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLQFLIGLRDSPSAVQDIVGEAECTHALDMA 314
Query: 214 LFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF----SLT 268
+ TL+ + G +HDHVG GF R SV W +P+FEK++ D QL ++YL A+
Sbjct: 315 VDTLRKIRDGALHDHVGNTGFARCSVTPDWTIPNFEKLVVDNAQLLSLYLTAWRRAGGQA 374
Query: 269 KDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
FY+ I ++ YL ++ G + S+E ADS +G KEGAFY+WT +E +
Sbjct: 375 TSEFYN-IVLELATYLTSTPILRSDGLLASSEAADSYARKGDGEMKEGAFYLWTKREFDS 433
Query: 328 IL-----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 382
++ G ++ H+ + GN D DP+ +F +N+L + S + +L +P
Sbjct: 434 VIEAAEKGASPVV-AAHWGILEDGNID--EQHDPNEDFMNQNILRVVKTSEELSKQLNIP 490
Query: 383 LEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
+EK + +++L R S+R RP +DDK + WNGL +S+ A+ S+ +K+ +
Sbjct: 491 VEKVEQTIRTSQKELKARRESERVRPEVDDKAVTGWNGLALSALAKTSRAVKTTS----- 545
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
P + + + VA ASFI++ L+D Q ++ + G GF DDYA++I GL
Sbjct: 546 --PELSA---KCATVASGIASFIQKQLWDAQA-KILYRVWTGERDTEGFADDYAYVIQGL 599
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
LDL++ + +A LQ Q F D GG+F T S +LR+K+ D + PS
Sbjct: 600 LDLFDTNGDESLIEFADALQKAQSSYFYD-PAGGFFTTKAGSSSAILRLKDGMDTSLPST 658
Query: 562 NSVSVINLVRLASIV 576
N+VSV NL RL ++
Sbjct: 659 NAVSVANLYRLGHLL 673
>gi|311746315|ref|ZP_07720100.1| dTMP kinase [Algoriphagus sp. PR1]
gi|126576550|gb|EAZ80828.1| dTMP kinase [Algoriphagus sp. PR1]
Length = 678
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 204/562 (36%), Positives = 281/562 (50%), Gaps = 59/562 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFED+ A L+N+ FV IK+DREERPD+D +YM VQA+ GGWPL+
Sbjct: 51 SACHWCHVMERESFEDKLTADLMNESFVCIKIDREERPDIDNIYMDAVQAMGLQGGWPLN 110
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS----GAFAIEQL 135
VFL P+ KP GGTYFP + +K +L + DA+ D LA+S G
Sbjct: 111 VFLMPNQKPFYGGTYFPNQQ------WKNLLANIADAFANHEDKLAESAEGFGRSIARNE 164
Query: 136 SEALSASASSNKL-PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
+E + +L PDEL + L QLS DS +GG PKFP P +L
Sbjct: 165 TEKYGIRSGKIELDPDELAEAVL-----QLSSQIDSEWGGMNRIPKFPMPAIWNFIL--- 216
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
D ++ + VLFTL+ M GGI+D + GGF RYSVD W PHFEKMLYD
Sbjct: 217 ----DYALLSKSQNLEDKVLFTLKKMGMGGIYDQLKGGFARYSVDGEWFAPHFEKMLYDN 272
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
GQL +Y A+ + D F+ ++ +L +M+ G +A+DADS EG E
Sbjct: 273 GQLLELYAKAYQTSHDDFFLEKIQETYTWLLDEMLQEEGGFHAAQDADS---EGV----E 325
Query: 315 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
G FY WT +E+ I+ E F E Y LKP GN + G N+L + S
Sbjct: 326 GKFYTWTYEELSSIIPEEMPWFAELYNLKPQGNWE-----------DGINILFQTKSYSE 374
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
A+ + E L E + L +R++R P DDKV+ WN L+IS +A
Sbjct: 375 VAAAHNLSEEVLNQKLKEVKATLLSIRNQRIYPGKDDKVLCGWNALMISGLVQAY----- 429
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
SD+K ++++A S FI + + ++ RL S++NG + P FL+DY
Sbjct: 430 ----------FATSDQK-FLDLALSNRDFISKKVTVDR--RLYRSYKNGVAYTPAFLEDY 476
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A LI + L+E S L A L + F D G +F ++ KE
Sbjct: 477 AALIKADIMLFEATSEASHLKSAERLTKIVLDEFYDENDGFFFFNNPSSEKLIANKKELF 536
Query: 555 DGAEPSGNSVSVINLVRLASIV 576
D PS NS+ NL +L+ +
Sbjct: 537 DNVIPSSNSLMARNLHQLSILT 558
>gi|410479889|ref|YP_006767526.1| thioredoxin [Leptospirillum ferriphilum ML-04]
gi|406775141|gb|AFS54566.1| conserved hypothetical protein containing a thioredoxin domain
[Leptospirillum ferriphilum ML-04]
Length = 699
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 216/698 (30%), Positives = 335/698 (47%), Gaps = 64/698 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLS 79
CHWCHVM ESFE +A ++N++FV+IKVDREERPD+D++Y M + GGWPL+
Sbjct: 59 ACHWCHVMAHESFERPDIASVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLT 118
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+P P GGTYFP + ++G PGF +L +++D + R+ L + ++ L +
Sbjct: 119 MFLTPSQVPFAGGTYFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTN 178
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S D P AL L +D FGGFG APKFP +++ + ++ +
Sbjct: 179 PVADSREFELDLSPSEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFQR 232
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G S A M TL M +GGI D VGGGF RYSVDERW +PHFEKMLYD L
Sbjct: 233 KGDSTAA----HMATLTLSSMKRGGIWDQVGGGFARYSVDERWLIPHFEKMLYDNALLLE 288
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
S++K+ YS +++ +L R+M G +S+ DADS EG +EG FYV
Sbjct: 289 ALSLGASVSKNPVYSRTAEELVGWLFREMRSDDGVYYSSLDADS---EG----EEGRFYV 341
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV-LIELNDSSASASK 378
+ ++EV IL + YY +S P N F+G L E + +
Sbjct: 342 FQAEEVRSILSDEEYRVVSKYY----------GLSGPPN-FEGHAWNLYEARSIGELSKE 390
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ + R+KLF RS R RP LDDKV+ SWN L+ A++
Sbjct: 391 FHLSESDIERRIESARQKLFAYRSTRVRPGLDDKVLASWNALM--------------AKA 436
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+F+ ++G ++E++ ++ R ++ + L + P +LDDYAFL+
Sbjct: 437 LLFSGRILG--KQEWISAGRKTIDYMHRKMW--KNGLLMAVYSKKEPFLPAYLDDYAFLL 492
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+L+ + L +A + + F D E GG++ T +++ R K HDGA
Sbjct: 493 LAVLESMRIDFRPEDLSFATTIADVLLAEFYDPESGGFYFTGKNHEALIHRPKNGHDGAL 552
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN+ +V L+ L ++ Y A+ +L ++ ++K+ M A + S
Sbjct: 553 PSGNAAAVQGLLWLGTLTGHLP---YTSAADKTLRLYFAQMKEQPAGYTTMISALETYS- 608
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
+ VV + + D+++ ++ D V+ + A + + E R
Sbjct: 609 -DSQPVVFLAGPQAGDWKDKISCG---VDTEAFVLDLTNAVRDSLPLPEG--------MR 656
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+F +K VC+ C P SL+ L P S
Sbjct: 657 KHFPENKTTGWVCRGTMCLPSADSLESLQEQLRLWPLS 694
>gi|408794723|ref|ZP_11206328.1| PF03190 family protein [Leptospira meyeri serovar Hardjo str. Went
5]
gi|408461958|gb|EKJ85688.1| PF03190 family protein [Leptospira meyeri serovar Hardjo str. Went
5]
Length = 689
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 218/692 (31%), Positives = 341/692 (49%), Gaps = 81/692 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESFED+ A++LN FV IK+DREERPD+DK+YM + A+ GGWPL+
Sbjct: 54 STCHWCHVMERESFEDDSTAEVLNRDFVCIKLDREERPDIDKIYMDALHAMGTQGGWPLN 113
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+P +P++GGTYFPPE++YG+ FK +LR V DAW +R+ L + A + Q
Sbjct: 114 MFLTPTKEPILGGTYFPPENRYGKRSFKEVLRLVSDAWKNQREELI-TAATDLTQYLRDN 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF--GSAPKFPRPVEIQMM--LYHSK 195
+ K+P + + E+ + YD F GF S KFP + + + Y K
Sbjct: 173 ETRPNEGKVP---AKEIIEKNFERYVQVYDKEFFGFKTNSVNKFPPSMALSFLTEFYLLK 229
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
K +M T M GGI+D VGGG RY+ D W VPHFEKMLYD
Sbjct: 230 K---------DPRALEMAFNTAYAMKSGGIYDQVGGGICRYATDHEWLVPHFEKMLYDN- 279
Query: 256 QLANVYLDAFSL----TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
++Y++A +L T++ F+ + R+I+ Y+RRDM G I SAEDADS EG
Sbjct: 280 ---SLYVEALALLYKATEEPFFLEVIREIVTYIRRDMTLGSGGIASAEDADS---EG--- 330
Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+EG FY+W E I+ E I + T + + H +KGKN ++
Sbjct: 331 -EEGKFYIWNHSEFNQIVPEEEI----QGFWNVTEEGNFEHQNILHVYWKGKNPFVD--- 382
Query: 372 SSASASKLGMPLE-KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
G+ + +++N + + + KL RS+R RP DDKV+ SWN L I + A +
Sbjct: 383 --------GIQFKPEFINKIEKTKEKLLAHRSQRIRPLRDDKVLTSWNCLWIRALLSAYE 434
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
+ S EY+ A+ FI + L + L+ FR G +K G
Sbjct: 435 V----------------SGDTEYLNDAKKIYRFITKQLVGDDGSILRR-FREGEAKYFGT 477
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIEL-QNTQDELFLDREG--GGYFNTTGEDPSVL 547
L DY I + L++ + A E+ + + D +F + E G ++ + + ++
Sbjct: 478 LPDYTEFIWVSMKLFQLDEDIE----AYEIGKKSLDYVFANFESKVGPFYESYHGNEDLI 533
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
+R E +DG EPSGNS ++++L L + K D ++ A A F L +++ P
Sbjct: 534 VRTIEGYDGVEPSGNS-TILHLFYLLFSIGYKKVD-LQKKANSIFAYFLPELTQNSLSYP 591
Query: 608 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
M A PS++ +V+ + + + + D N + ++ ++ + +
Sbjct: 592 SMISAFQKFQYPSKEVLVVYKGYDAAEIKEIRKKLSELKDPNLVWLVLEESNAKAL---- 647
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPP 699
+ + + ++ VC+NFSC P
Sbjct: 648 ---APELELLTGRSAGSGILYYVCRNFSCELP 676
>gi|434397636|ref|YP_007131640.1| protein of unknown function DUF255 [Stanieria cyanosphaera PCC
7437]
gi|428268733|gb|AFZ34674.1| protein of unknown function DUF255 [Stanieria cyanosphaera PCC
7437]
Length = 684
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 207/616 (33%), Positives = 304/616 (49%), Gaps = 67/616 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A+ LN FV+IKVDREERPD+D +YM VQ + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIAEYLNVNFVAIKVDREERPDLDSIYMQAVQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP + +Y RPGF +L+ V + + + L F E LS
Sbjct: 108 IFLTPGDLVPFYGGTYFPLQPRYNRPGFLDVLQAVLRFYQEDKAKLEH---FKTEILSHL 164
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
++ + PD L + L E + G S P P ++
Sbjct: 165 QQSTVLPLETPDSLTKQLLFAGIETNTGVISPNDLGRPSFPMIPYATLALQGSRFKQEFR 224
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
+ G+ +VL GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 225 YNPQELSWQRGKDLVL--------GGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIL 276
Query: 259 NVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ +S ++ + + +++L+R+M P G ++A+DADS A +EG+F
Sbjct: 277 EYLANLWSAGCQEPEIALAVTETVNWLKREMTAPNGYFYAAQDADSFVDVDAVEPEEGSF 336
Query: 318 YVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW +E+ D L E + + + GN F+GKNVL + S
Sbjct: 337 YVWNYQELADNLTAEELTELQTEFTVSVEGN------------FEGKNVLQRRQSGNLSD 384
Query: 377 SKLGMPLEKYLNI-LGECRRKLFDVRSKRPR-------------PHLDDKVIVSWNGLVI 422
S L LEK I G+ + L R P D K+IV+WN +VI
Sbjct: 385 S-LTNTLEKLFTIRYGQAKESLAIFTPARNNHEAKTTPWQGRIPPVTDTKMIVAWNSIVI 443
Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFR 481
S AR + ++ Y+++A +A +FI +H + DE+ HRL +
Sbjct: 444 SGLARVYAVFGNQL----------------YLDLAVTATNFILQHQWLDERFHRLNY--- 484
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
+G ++ P +DYA I LLDL ++WL A+ +Q D+L E GGY+N++
Sbjct: 485 DGLAQVPAQSEDYALFIKALLDLQAATPEKSQWLEQAVRIQTEFDQLLWSNEMGGYYNSS 544
Query: 541 GEDPSVLLRVKEDH--DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 598
D + L ++E D A P+ N V+V NLVRL+ + + Y AE +L F +
Sbjct: 545 NTDANQELLIQERSYIDNATPAANGVAVTNLVRLSLLTDNLE---YLDRAEQALQAFSSV 601
Query: 599 LKDMAMAVPLMCCAAD 614
+ A P + A D
Sbjct: 602 MTRSPQACPTLFVALD 617
>gi|408671866|ref|YP_006871614.1| protein of unknown function DUF255 [Emticicia oligotrophica DSM
17448]
gi|387853490|gb|AFK01587.1| protein of unknown function DUF255 [Emticicia oligotrophica DSM
17448]
Length = 679
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 220/700 (31%), Positives = 337/700 (48%), Gaps = 101/700 (14%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE+E +A+++N V IKVDREERPDVD +YM +QA+ GGWPL+VF
Sbjct: 50 CHWCHVMERESFENEQIAQIMNQHLVCIKVDREERPDVDAIYMDALQAMGLRGGWPLNVF 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIEQLSEALS 140
L PD KP GGTYFPP + + ++ + +A+ R+ L +S F L +
Sbjct: 110 LMPDAKPFYGGTYFPPRN------WANLVESIANAFKNDREKLQKSAEGFTQNMLVKESD 163
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S + + L +L + +D GG +PKFP P + ++ + D
Sbjct: 164 KYRMSVEDTLSFSEEELTTIFNRLHQDFDFEKGGMNRSPKFPMPSIWKFLIRYYSITND- 222
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+ ++ TL +A GGI+D +GGG+ RYS DE W VPHFEKMLYD GQL ++
Sbjct: 223 ------KRAYQHLIHTLNRVALGGIYDTIGGGWTRYSTDEDWKVPHFEKMLYDNGQLISL 276
Query: 261 YLDAFSLTK-----DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
Y +A++LTK D FY+ + +++L R+M+ G +SA DADS EG +EG
Sbjct: 277 YAEAYALTKSEGNPDNFYAAKVTETIEWLEREMMSKEGGFYSALDADS---EG----EEG 329
Query: 316 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSA 374
FY+W +E+ LGE A F E + GN + G NV+ +E D
Sbjct: 330 KFYIWKKEEIIAALGEDAGPFIETFDFTEAGNWE-----------HGNNVVHLEERDFME 378
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
+ G PL E ++KLFD R+KR RP LDDK++ SWNGL++ A + L
Sbjct: 379 N----GWPL------TAEIKQKLFDFRAKRVRPGLDDKILCSWNGLMLKGLVDAYRYL-- 426
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-------DEQTHRLQHSFRNGPSKA 487
D ++++++A A FI+ + + L H+++NG +
Sbjct: 427 --------------DNQKFLDLALKNAHFIKDCMSIKVMNEDGSEARGLWHNYKNGKANI 472
Query: 488 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 547
+L+DYA +I L LY+ WL A L F D E ++ T + ++
Sbjct: 473 VAYLEDYASVIDAYLALYQVTFDEVWLHEAEMLAIYTVANFYDDEDEFFYFTDSQGEELI 532
Query: 548 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
R KE D P+ NS+ NL L I+ ++D+ + + +L + ++K + + P
Sbjct: 533 ARKKEIFDNVIPASNSIMATNLYNLGLILG--RNDFIQIS---NLMI--GKMKRIVLTDP 585
Query: 608 ----LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM 663
C A + P+ + V +VG ++ K ID
Sbjct: 586 QWVTQWACLATQHTKPTAE-VAMVGK-----------------EITKIRKQIDEVLILNK 627
Query: 664 DFWEEHNSNNASMARNNFSAD-KVVALVCQNFSCSPPVTD 702
F N++N + +N + D + VC + +C P T+
Sbjct: 628 VFVGTTNTSNLPLLQNRVTKDAQTTIFVCFDKTCQLPTTE 667
>gi|333026825|ref|ZP_08454889.1| hypothetical protein STTU_4329 [Streptomyces sp. Tu6071]
gi|332746677|gb|EGJ77118.1| hypothetical protein STTU_4329 [Streptomyces sp. Tu6071]
Length = 639
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 230/698 (32%), Positives = 330/698 (47%), Gaps = 83/698 (11%)
Query: 18 LINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 77
++ +WCHVM ESFED A +N FV +KVDREERPDVD VYM VQA G GGWP
Sbjct: 1 MLLIIYWCHVMARESFEDAETAAYMNAHFVCVKVDREERPDVDAVYMEAVQAATGHGGWP 60
Query: 78 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 137
++VFL+P +P GTYFPP +G P F+ +L V+ AW +R+ +A A L+
Sbjct: 61 MTVFLTPGGEPFYFGTYFPPRPLHGTPAFRQVLEGVRAAWADRREEVADVAARVTADLTG 120
Query: 138 ---ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
L A AS PD L L L++ YDSR GGFG APKFP + ++ +L H
Sbjct: 121 RGLGLPADASPPG-PDALGAALL-----GLTRDYDSRHGGFGGAPKFPPVMVLEFLLRHH 174
Query: 195 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ TG G +M T + MA+GGI+D +GGGF RY+VD W VPHFEKML D
Sbjct: 175 AR---TGAEG----ALQMAADTAEHMARGGIYDQLGGGFARYAVDREWIVPHFEKMLSDN 227
Query: 255 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
L Y + T + + D+L R++ P G SA DADS +G R E
Sbjct: 228 ALLCRFYAHLWRATGSALARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVE 285
Query: 315 GAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GA YVWT +++ ++LGE A L HY + P G F+ + ++ L +
Sbjct: 286 GASYVWTPEQLREVLGEDDAALAAAHYGVTPEGT------------FEHGSSVLRLPRTD 333
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
S P++ L RR L R +RP P DDKV+ +WNGL I++ A
Sbjct: 334 GFDSP---PVDA--ARLDRIRRALLAARDERPAPGRDDKVVAAWNGLAIAALAETGAYF- 387
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTH-RLQHSFRNGPSKA-PGF 490
DR + +E A AA +R HL TH RL + R+G + + G
Sbjct: 388 ---------------DRPDLVEAALGAADLLVRVHL---DTHGRLSRTSRDGRTGSNTGV 429
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DYA + G L L W +A L + + F D + G ++T + +++ R
Sbjct: 430 LEDYADVAEGFLTLASVTGEGVWTDFAGLLLDHVLDRFRD-DSGALYDTAADAETLIHRP 488
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-- 608
++ D A PSG + + L+ A++ + S +R AE +L+V ++ +A P
Sbjct: 489 QDPTDNATPSGWNAAAGALLTYAAL---TGSTPHRAAAEQALSV----VRALAPRAPRFV 541
Query: 609 ---MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
+ A +L+ P V +VG + A + V P+ E
Sbjct: 542 GHGLAVAEALLAGP--YEVAVVGAPEDPRTRALHRTALLATSPGTVVAAGPPSPDPEFPL 599
Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDP 703
+ + + A A +C+ F C P TDP
Sbjct: 600 LADRPLVDGTPA----------AYLCRGFVCDRPETDP 627
>gi|11499326|ref|NP_070565.1| hypothetical protein AF1737 [Archaeoglobus fulgidus DSM 4304]
gi|2648814|gb|AAB89512.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
Length = 642
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 195/575 (33%), Positives = 294/575 (51%), Gaps = 64/575 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE+E +A+++N FV+IKVDR+ERPD+DK Y +V A G GGWPL+VF
Sbjct: 50 CHWCHVMAKESFENEEIAEMINRNFVAIKVDRDERPDIDKRYQEFVMATTGSGGWPLTVF 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD KP GGTYFPPED+Y PGFKT+LRK+ + W R+ L +S E+L+EA+
Sbjct: 110 LTPDGKPFFGGTYFPPEDRYHLPGFKTVLRKIAEMWRHDRERLLKSA----EELTEAVRR 165
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
A + ++ + L E + D GGFGSAPKF ++++L H D
Sbjct: 166 YAEGS-FKGDVDEKLLDKGIEAVLDQTDYVNGGFGSAPKFHHAKAVELLLTHHFFTGD-- 222
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
E K TL MA+GGI+DH+ GGF RYS D +W PH+EKMLYD +L +Y
Sbjct: 223 -----EEVLKAAEITLDAMARGGIYDHLLGGFFRYSTDAKWVTPHYEKMLYDNAELLYLY 277
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
A++LT Y I I++Y R+ G ++++DAD E + EG +Y+++
Sbjct: 278 SIAYALTGKRLYQKIADGIVEYYRKFGCSNEGGFYASQDADIGELD------EGGYYLFS 331
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK-LG 380
+E+++IL E YY + +G+ L + + SK LG
Sbjct: 332 DRELKEILDEREFRIATLYY-----------------DIQGERKLPRIFLTEEEISKILG 374
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ +E+ + RRK+ + R +R P++D + WNGL+I + K+
Sbjct: 375 VSVEEVERAVNSARRKMLEFREQREMPYIDTTIYAGWNGLMIEALCMHHKVFGDNWS--- 431
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+E+AE A+ + + +D + L H+ G +DY F G
Sbjct: 432 -------------LEMAEKTANRLLKEFWDGR--ELLHT-----HNVEGLSEDYIFFARG 471
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
LL L+E ++L E+ ++ E F D E GG+F++ E + +R+K HD S
Sbjct: 472 LLALFEVTQRHEYLEKCFEIVDSAVEKFWDGEDGGFFDS--ERAVLGIRLKNFHDSPTQS 529
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
N + L+ L++I + Y + A L F
Sbjct: 530 VNGSAPQLLLALSAITGERR---YEELAVEGLRTF 561
>gi|374369685|ref|ZP_09627707.1| hypothetical protein OR16_29084 [Cupriavidus basilensis OR16]
gi|373098764|gb|EHP39863.1| hypothetical protein OR16_29084 [Cupriavidus basilensis OR16]
Length = 683
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 235/690 (34%), Positives = 340/690 (49%), Gaps = 96/690 (13%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE+ +A L+N F+SIKVDR+ERPD+D +Y + GGGWPL+V
Sbjct: 49 ACHWCHVMAHESFENPRIAGLMNARFISIKVDRQERPDIDDIYQKVPLMMGQGGGWPLTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKK----RDMLAQ-SGAFAIEQL 135
FL+P +P GGTYFPP+D+YGRPGF +L + +AW + RDM+ Q F L
Sbjct: 109 FLTPQGEPFFGGTYFPPDDRYGRPGFVRVLLSLSEAWTHRRGELRDMIEQFRLGFRQLDL 168
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 195
+ +A LP + A L++ D GG G APKFP ++L +
Sbjct: 169 VDLGREAAEVEDLPAQ--------TARALAQDTDPTHGGLGGAPKFPNASGYDLVL---R 217
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+ TG+ + ++ TL MA GGIHD +GGGF RYSVDERW VPHFEKMLYD G
Sbjct: 218 ICQRTGEPVLLAALER----TLDGMAAGGIHDQLGGGFARYSVDERWAVPHFEKMLYDNG 273
Query: 256 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
QL +Y DA+ LT + + + + Y+ RDM P G ++ EDADS EG +EG
Sbjct: 274 QLVTLYADAYRLTGKPAWRRVFEEAIAYIVRDMTHPDGCFYAGEDADS---EG----EEG 326
Query: 316 AFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
FYVWT EV +LG E A+ C ++D N +G +VL + +
Sbjct: 327 RFYVWTPAEVRAVLGASEGAL------------ACRAYGVTDGGNFARGTSVL----NRA 370
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A+ P ++ L + R +LF R++R RP DD ++ WNGL+I A +
Sbjct: 371 ATLD----PFDE--ARLEDWRGRLFAARARRARPARDDNILTGWNGLMIQGLCAAYQATG 424
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
P + + R+ + E + D +R ++++G +K PGFL+D
Sbjct: 425 CP--------PHLAAARRAASAIQEKLT------MPDGGVYR---AWKDGTAKVPGFLED 467
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD--REGGGYFNTTGEDPSVLLRVK 551
YA L + L+DLYE ++L A+EL L LD R+ G YF +P ++ R +
Sbjct: 468 YALLANALIDLYESCFDKRYLDRAVELV----ALILDKFRDDGLYFTPRDGEP-LVHRPR 522
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
HD A PSG S SV +RL ++ + D YR AE + +
Sbjct: 523 APHDSAWPSGISTSVFAFLRLHAL---TGRDVYRDLAEDEFRRYRAAAAAAPAGFVHLLA 579
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
A D + ++L G K++ ++ + H +Y L V+ A E++
Sbjct: 580 ARD-FAQRGPFEIILAGDKAAA--AGLVQSVHRAY-LPARVL----AFAEDVPIGHGRRP 631
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVT 701
A A VC++ +C+ PVT
Sbjct: 632 VKGRPA----------AYVCRHRTCAAPVT 651
>gi|414164591|ref|ZP_11420838.1| hypothetical protein HMPREF9697_02739 [Afipia felis ATCC 53690]
gi|410882371|gb|EKS30211.1| hypothetical protein HMPREF9697_02739 [Afipia felis ATCC 53690]
Length = 684
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 224/710 (31%), Positives = 345/710 (48%), Gaps = 97/710 (13%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A ++N+ FV+IKVDREERPD+D++YM + L GGWPL++
Sbjct: 55 ACHWCHVMAHESFEDEATAAVMNEQFVAIKVDREERPDIDQIYMNALHLLGQQGGWPLTM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD P+ GGTYFP + +YGR F ++++ + + D +A + L+E S
Sbjct: 115 FLTPDGAPIWGGTYFPKQAQYGRASFIDVMQQFMRIYRDEPDKIAANKEAIARSLNERHS 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A +S L N L A ++++ D GG APKFP+ LE
Sbjct: 175 ADTASIGL------NELDNAAGSIARATDPDNGGLRGAPKFPQ----------CSMLEFL 218
Query: 201 GKSGEASEGQKMVLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
++G + ++ + T L M++GGI+DH+GGG+ RYSVDERW VPHFEKMLYD Q+
Sbjct: 219 WRAGARTGDERYFITTNLALTRMSQGGIYDHLGGGYARYSVDERWLVPHFEKMLYDNAQI 278
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++ + + Y + + +L+R+M+ G S+ DADS EG +EG F
Sbjct: 279 LDMLALEHARAPNELYLQRAEETVGWLKREMLTKEGGFSSSLDADS---EG----EEGRF 331
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW+ ++ +LG + A F Y + GN F+G N+L L+D S +A
Sbjct: 332 YVWSQSDIAQLLGPDDATFFAAKYGVSAEGN------------FEGHNILNRLDDGSDTA 379
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
++ L R LF R KR P LDDKV+ WNGL+I++
Sbjct: 380 TE--------AEQLAALRAILFRAREKRVHPGLDDKVLADWNGLMIAA---------LAH 422
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+ FN R +++ +A + F+ + + RL HS+R G P D A
Sbjct: 423 AAGAFN-------RPDWLTLACTVFGFVTTTM--SRHDRLGHSWRAGKLLQPALASDNAA 473
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
+I L L+E +L AI Q D + D + GGYF T + ++LR D
Sbjct: 474 MIRAALALHEATGDHLFLDQAILWQADLDTHYGDPQHGGYFLTADDAEGLILRPHSSVDD 533
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA-VFETRLKDMAMAVPLMCCAADM 615
A P+ ++ NL RLA + + +R+ + A + ++M + L+ A D+
Sbjct: 534 AIPNHIGLTAQNLARLAVLTGDER---WRRQLDMLFAHMLSAAARNMFGHLSLL-NALDL 589
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI-DPADTEEMDFWEEHNSNNA 674
+ +V+ G D +L A A N V+H+ DP A
Sbjct: 590 YLAGAE--IVITGQGEEAD--ALLKTARALPHANTIVLHVPDP----------------A 629
Query: 675 SMARNNFSADKV------VALVCQNFSCSPPVTDPISLENLLLEKPSSTA 718
+ ++ +ADK+ A +C+ +CS P+T+P +L +L +S +
Sbjct: 630 KLPPHHPAADKIAPGGEAAAFICRGQTCSLPMTEPHALAAFVLRGEASAS 679
>gi|407975443|ref|ZP_11156348.1| hypothetical protein NA8A_14074 [Nitratireductor indicus C115]
gi|407429071|gb|EKF41750.1| hypothetical protein NA8A_14074 [Nitratireductor indicus C115]
Length = 673
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 206/570 (36%), Positives = 295/570 (51%), Gaps = 68/570 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE++ VA ++N FV+IKVDREERP++D++YM + A GGWPL++
Sbjct: 54 ACHWCHVMAHESFENDQVADVMNRLFVNIKVDREERPEIDQIYMAALSATGEQGGWPLTM 113
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEAL 139
FLSPD KP GGTYFPP+ +YGRPGF +L V AW +K RD+ SG + E+L + +
Sbjct: 114 FLSPDGKPFWGGTYFPPQQRYGRPGFIEVLNAVHTAWLEKNRDL---SG--SAERLHDHV 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A S PQ+A+ AE++ D GG APKFP IQ++ L+
Sbjct: 169 KARLSPPSAEGFDPQSAVTDLAERIHGMIDQDMGGLRGAPKFPNMPFIQILWL--SWLQT 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+S S V+ +L+ M GGI+DHVGGG RYS D W VPHFEKMLYD QL
Sbjct: 227 GNQSHRDS-----VITSLKRMLSGGIYDHVGGGLARYSTDANWLVPHFEKMLYDNAQLLR 281
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+ F T+D + +++++L RDM GG S+ DADS EGA EG Y+
Sbjct: 282 LLSWVFGETEDELFRIRIEEVINFLLRDMRVNGGAFASSLDADS---EGA----EGKAYL 334
Query: 320 WTSKEVEDILGEHAILFKEHYYL-KPT---GNCDLSRMSDPHNEFKGKNVLIEL-NDSSA 374
W+ ++E +LG F + L KP G+ L R++ H EF+G + L ND +A
Sbjct: 335 WSRLQIEAVLGSRTEAFLSTFELTKPDDWHGDPVLHRLA--HPEFQGTDTENALRNDLNA 392
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
L R+ R +P DDKV+V WNGL I++ A ++ +
Sbjct: 393 ----------------------LLSTRAGRIQPGRDDKVLVDWNGLAIAAIANCARQFQ- 429
Query: 435 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 494
R+++++ A++A F+ + ++ RL HS R G P DY
Sbjct: 430 ---------------RQDWLDAAKAAFHFVCESM---ESRRLPHSIRLGKRLFPALSSDY 471
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A +IS LY+ +L A E T D E G++ T+ + V LR++ D
Sbjct: 472 AAMISAATALYQATRKRGFLDQASEWFETLKSWNADEENAGFYLTSSDASDVPLRIRGDV 531
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYY 584
D A PS ++ + + LA++ K + Y
Sbjct: 532 DEAMPSATALIIEAMCGLAALSGDDKVEEY 561
>gi|345001747|ref|YP_004804601.1| hypothetical protein SACTE_4222 [Streptomyces sp. SirexAA-E]
gi|344317373|gb|AEN12061.1| protein of unknown function DUF255 [Streptomyces sp. SirexAA-E]
Length = 673
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 225/693 (32%), Positives = 323/693 (46%), Gaps = 71/693 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED +A LN+ FV +KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SACHWCHVMAHESFEDAALAAYLNEHFVPVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+ D +P GTYFPPE ++G P F+ +L V AW +R +A+ + L+
Sbjct: 108 VFLTADAEPFYFGTYFPPEPRHGMPSFRQVLEGVTAAWTGRRGEVAEVAGRIVTDLA-GR 166
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
S + + +P E P+ A L A LS+ YD + GGFG APKFP + ++ +L H +
Sbjct: 167 SLAHGGDGVPGE-PELAQALLA--LSREYDEKHGGFGGAPKFPPSMAVEFLLRHHAR--- 220
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 TGAEG----ALEMAADTCAAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCR 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + T + + D++ R++ G SA DADS + G R EGA+YV
Sbjct: 277 VYAHLWRATGSDLARRVALETADFMVRELRTTEGGFASALDADSEDARG--RHVEGAYYV 334
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT +++ ++LGE F Y+ +S+ +G +VL ++
Sbjct: 335 WTPEQLREVLGEDDAAFAAAYF----------GVSEEGTFEEGSSVL--------RLART 376
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G P E + + R +L R R RP DDK++ +WNGL +++ A
Sbjct: 377 G-PDEDPARV-ADVRARLLAARGDRVRPERDDKIVAAWNGLAVAALAETGAYF------- 427
Query: 440 MFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
DR + +E A AA +R H+ D T RL + ++G G L+DY +
Sbjct: 428 ---------DRPDLIERATEAADLLVRVHMGD--TARLCRTSKDGRAGDNAGVLEDYGDV 476
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L L WL +A L + E F E G ++T + ++ R ++ D A
Sbjct: 477 AEGFLALASVTGEGAWLDFAGFLLDIVLERFTG-ENGQLYDTADDAEQLIRRPQDPTDSA 535
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
P+G + + L+ S A + S+ +R AE +L V + + A+ L
Sbjct: 536 TPAGWTAAAGALL---SYAAHTGSEAHRTAAEGALGVVKALGPKAPRFIGWGLAVAEALL 592
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
R+ V + +L A + V P E +
Sbjct: 593 DGPREVAVAGPVGGELHRTALLGRAPGAVVAAGEV----PGGAAEFPL----------LV 638
Query: 678 RNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
A VC++F C P TD LE L
Sbjct: 639 DRPLVDGAPTAYVCRHFVCEAPTTDAEELERGL 671
>gi|383830441|ref|ZP_09985530.1| thioredoxin domain containing protein [Saccharomonospora
xinjiangensis XJ-54]
gi|383463094|gb|EID55184.1| thioredoxin domain containing protein [Saccharomonospora
xinjiangensis XJ-54]
Length = 667
Score = 312 bits (800), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 221/697 (31%), Positives = 324/697 (46%), Gaps = 86/697 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF D+ VA +N+ FV+IKVDREERPD+D VYM QA+ G GGWP++ F
Sbjct: 49 CHWCHVMAHESFSDDDVAAFMNEHFVNIKVDREERPDIDAVYMAATQAMTGQGGWPMTCF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+P+ KP GTY+PP +G P F+ +L V AW ++R L + +E ++E +
Sbjct: 109 LTPEGKPFHCGTYYPPVPAHGMPSFRQVLEAVDQAWRERRAELVEGAGRIVEHIAE-RTT 167
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
S++ + ++ +A+ L D GGFG APKFP + ++ +L H E TG
Sbjct: 168 PLSTHPVDEDTVTSAV----ATLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG 220
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+++ +V T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L Y
Sbjct: 221 ----SAQALSIVDLTAEGMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLRFY 276
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + ++L RD+ P G S+ DAD+ EG T YVWT
Sbjct: 277 AHLARRTGSALAHRVAGETAEFLLRDLRTPEGGFASSLDADTDGVEGLT-------YVWT 329
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+++ D+LG + + E + + G + +G + L D A
Sbjct: 330 PQQLVDVLGRDDGVWAAETFGVTREGTFE-----------RGASTLQLRRDPDDPA---- 374
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+++ + L + R+ RP+P DDKVI +WNGL I++ A A L+
Sbjct: 375 ----RWMRVTS----ALVEARNARPQPARDDKVIAAWNGLAITALAEAGLALR------- 419
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLIS 499
R E++E A +A +F+ L S R+G A G L+DY L
Sbjct: 420 ---------RPEWVEAAVAAGAFVLD--VHASGDGLLRSSRDGVAGAAAGVLEDYGCLAD 468
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAE 558
GLL L++ + WLV A L +T F G F+ T ED L+ R + D A
Sbjct: 469 GLLALHQATGESGWLVEATSLIDTALRRFGVEGAPGAFHDTAEDAETLVHRPSDPTDNAS 528
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAA 613
PSG S L+ +++ ++ YR E +L R + P + A
Sbjct: 529 PSGASALAGALLTASALAGPDRAGAYRAACEEAL----RRAGALVAQAPRFAGHWLSVAE 584
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
MLS P + V +VG + + + AA + + AD +
Sbjct: 585 AMLSGPVQ--VAVVGSDAQERADLLTEAARNVHGGGVVLGGSPEADGVPL---------- 632
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC + C PVTD SL LL
Sbjct: 633 --LADRSLVDGAAAAYVCHGYVCDRPVTDTESLARLL 667
>gi|359774323|ref|ZP_09277696.1| hypothetical protein GOEFS_115_01140 [Gordonia effusa NBRC 100432]
gi|359308634|dbj|GAB20474.1| hypothetical protein GOEFS_115_01140 [Gordonia effusa NBRC 100432]
Length = 654
Score = 312 bits (799), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 196/583 (33%), Positives = 293/583 (50%), Gaps = 79/583 (13%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM E FE+E +A +N FV IKVDREERPD+D +YM A+ G GGWP++ F
Sbjct: 49 CHWCHVMAHECFENEQIAAQMNAEFVCIKVDREERPDIDAIYMNATVAMTGQGGWPMTCF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+P +P GTYFPP + G+PGF ++ + D W +RD + + G ++L+ L
Sbjct: 109 LTPAGEPFYCGTYFPPSPRNGQPGFTELMSAITDTWINRRDEVTRVG----KELTGHL-- 162
Query: 142 SASSNKLPDE--LPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
SA+S LPD + +AL + A +L D GGFG APKFP +++ +L H ++
Sbjct: 163 SAASGGLPDAQFVLDDALAIHASNELVAQEDRAHGGFGGAPKFPPSAQLEALLRHYERTG 222
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D E +V T Q MA+GGI+D +GGGF RY+VD W +PHFEKMLYD QL
Sbjct: 223 D-------REALGVVERTAQAMARGGIYDQLGGGFSRYAVDIAWAIPHFEKMLYDNAQLL 275
Query: 259 NVYLDAFSLTKD--VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
VY + D + + + +D+L D+ GG S+ DAD+ EGAT
Sbjct: 276 RVYAHLACVASDASAMAARVTAETVDFLATDLRVEGG-FASSLDADTDGVEGAT------ 328
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELND 371
YVWT +E +++LG + E + + TG + L DP N
Sbjct: 329 -YVWTRREFDELLGSDSDWAAELFTVTETGTFEHGTSTLQLPVDPDN------------- 374
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
++++ ++ R R KRP+P D KV+ +WNG+ I+ A
Sbjct: 375 -----------VQRFAAVVDRLRA----AREKRPQPGRDGKVVTAWNGMTITGLVEAGTA 419
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAE-SAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
L +R E++++A A + RH+ + + R S PG
Sbjct: 420 L----------------NRPEWVDLAAWCADELLSRHIVEGELRRT--SLDGVVGTTPGM 461
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLR 549
LDD+A L++GLL L+ + +WL AI L + LF D + G +F+ ++ R
Sbjct: 462 LDDHAALVTGLLGLFAATAQERWLDAAIALLDKAIGLFGDPDAQGSWFDAPAGATGLITR 521
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
++ DGA PSG S+ L+ + + A K+ Y + A+ +L
Sbjct: 522 PRDPADGATPSGGSLMAEALLTASMLAAPEKAGSYLELADATL 564
>gi|158426331|ref|YP_001527623.1| highly protein [Azorhizobium caulinodans ORS 571]
gi|158333220|dbj|BAF90705.1| highly conserved protein [Azorhizobium caulinodans ORS 571]
Length = 657
Score = 312 bits (799), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 216/616 (35%), Positives = 313/616 (50%), Gaps = 65/616 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED A L+N FV+IKVDREERPDVD++YM + L GGWPL++
Sbjct: 50 ACHWCHVMAHESFEDAETADLMNALFVNIKVDREERPDVDQIYMNALHELGEQGGWPLTM 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+ D P GGTYFP YGRPGFK +L +V A+ + + +A + + +L+ A
Sbjct: 110 FLNADGAPFWGGTYFPKTASYGRPGFKDVLWQVSQAYRETPEKVAHNTDAILSRLAAAAK 169
Query: 141 -ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A + L D L A+Q++ +D GG APKFP+ ++++ + D
Sbjct: 170 PAGGVALTLAD------LDKAAQQIAGLFDRAHGGLRGAPKFPQAGLLELLWRAGDRTGD 223
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
+ + +V FTL M +GGI+DHVGGGF RYSVDERW VPHFEKMLYD QL
Sbjct: 224 -------PQLKAVVAFTLNRMCEGGIYDHVGGGFSRYSVDERWLVPHFEKMLYDNAQLLE 276
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+ A+ T D + R+ + +L+R+M+ G ++ DADS EG EG FYV
Sbjct: 277 LLALAYQETGDELFLLRARETVSWLKREMVTADGAFAASLDADS---EG----HEGKFYV 329
Query: 320 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT+ E+ +LG E A F Y + GN ++G+ +L + S
Sbjct: 330 WTADEIVAVLGKEDAAEFAAFYDVTDEGN------------WEGQTIL-----NRTSFGD 372
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ M E L + E KL R++R RP LDDKV+ WNGL+I++ ARA +
Sbjct: 373 VSMVEEARLRPMKE---KLLAARAQRVRPGLDDKVLADWNGLMIAALARAGAL------- 422
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
D E++++A +A + R + + RL HS+R G PG D A +
Sbjct: 423 ---------LDEPEWVDLAATAFDAVVRLMVKDG--RLGHSYREGRLVLPGLASDLAAMA 471
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+ L+E L A + N + +LD + G YF T + P++++R D A
Sbjct: 472 RAGIALHEAAGDEAPLAHAEDFLNRLEADYLDPQSGAYFLTAADAPALVMRPLSSLDEAL 531
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
P+ NSV+ L+RLA++ + D R A+ + +A P + A D +
Sbjct: 532 PNYNSVAADALIRLAAL---TGQDGLRARADRLIGALTGAAAQNPLAHPSLLNALD--TR 586
Query: 619 PSRKHVVLVGHKSSVD 634
+V VG +S D
Sbjct: 587 LRLAEIVAVGARSVRD 602
>gi|325104043|ref|YP_004273697.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324972891|gb|ADY51875.1| protein of unknown function DUF255 [Pedobacter saltans DSM 12145]
Length = 669
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 193/557 (34%), Positives = 282/557 (50%), Gaps = 54/557 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFEDE VA+++N+ FV IKVDREERPD+D++YM VQ + G GGWPL+
Sbjct: 48 SACHWCHVMEHESFEDEEVAQIMNEHFVCIKVDREERPDIDQIYMNAVQLMTGRGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
F PD +P+ GGTYF ED +K IL + + K L ++ +A+ +L + +
Sbjct: 108 CFCLPDQRPIYGGTYFQKED------WKNILHNLAGFYANK---LQEAEEYAV-RLMDGI 157
Query: 140 SASA--SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ S S K E Q + + +D GG APKFP P ++ + +
Sbjct: 158 NQSERLSFVKEEKEYTQEHIENIVKPWKMHFDFSEGGQNRAPKFPMPDNWAFLMKVAHLM 217
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+D + TL MA GGI+D +GGGF RYSVD WH+PHFEKMLYD GQL
Sbjct: 218 KDDA-------AFVITRLTLDKMAAGGIYDQLGGGFARYSVDHEWHIPHFEKMLYDNGQL 270
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++Y DA+ K+ Y + + D+++R+M P +SA DADS EG EG F
Sbjct: 271 MSLYADAYKYYKNERYKEVVYETYDWIKREMTSPEYGFYSALDADS---EGV----EGKF 323
Query: 318 YVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
Y W +E+E IL E A +F +Y + GN + + N L + A
Sbjct: 324 YTWDKQEIEKILDKEQAAIFNAYYAVTDEGNWEEEEI----------NHLWIRKEKQHIA 373
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ +E+ I+ + +L + R+KR P LDDK++ SWN L++ A K +
Sbjct: 374 EAFHISIERLDEIIQHSKTQLLEYRNKRIHPGLDDKILTSWNALMLKGLCDAYKAFADQ- 432
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
+++ +A A F+ +L E L +++NG + FLDDYA
Sbjct: 433 ---------------QFLTLALDNAKFLLNNLCREDG-MLYRNYKNGKATIEAFLDDYAL 476
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L + LYE W+ A L + + F D + G +F T+ +++ R E D
Sbjct: 477 LAQAFISLYEVTFDEAWIFKAKSLCDYVIKHFSDAQSGMFFYTSDASEALVARKYEIMDN 536
Query: 557 AEPSGNSVSVINLVRLA 573
PS NSV NL +L+
Sbjct: 537 VIPSSNSVMAWNLRKLS 553
>gi|385681202|ref|ZP_10055130.1| highly conserved protein containing a thioredoxin domain-containing
protein [Amycolatopsis sp. ATCC 39116]
Length = 675
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 224/696 (32%), Positives = 328/696 (47%), Gaps = 83/696 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED A+L+N+ FV+IKVDREERPD+D VYMT QA+ G GGWP++
Sbjct: 49 ACHWCHVMAHESFEDAETARLMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTC 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GTY+PPE + G P F+ +L V AW ++RD L + +E L+ L
Sbjct: 109 FLTPDGEPFHCGTYYPPEPRPGMPSFQHLLVAVAQAWQERRDELREGAGKIVEHLAGQLG 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
P + L +L+ D GGFG APKFP + ++ +L H ++ T
Sbjct: 169 PLP-----PAPVDAGVLDAALLKLTGEADRARGGFGGAPKFPPSMVLEFLLRHHER---T 220
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G ++E +V + MA+GGIHD + GGF RYSVD W VPHFEKMLYD L V
Sbjct: 221 G----SAEALSLVESCAEAMARGGIHDQLAGGFARYSVDASWVVPHFEKMLYDNALLLRV 276
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y T + + R ++L + G ++ DAD T +EG YVW
Sbjct: 277 YAHLARRTGSALAAEVARMTGEFLLARLRTEQGGFAASLDAD-------TLGEEGLTYVW 329
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
T ++ ++LG + E + + +G F+ +++L D
Sbjct: 330 TPAQLREVLGDDDGAWAAELFSVTESGT------------FEHGASVLQLRDPDDR---- 373
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
E++ + R L R +RP+P DDKVI +WNGL I++ A L
Sbjct: 374 ----ERFERV----RSALLAARDERPQPGRDDKVIAAWNGLAITALCEAGVAL------- 418
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPS-KAPGFLDDYAFL 497
D ++ A+ AAS + HL D +RL+ S R+G + A G L+DY L
Sbjct: 419 ---------DEPHWVTAAQEAASAVLGIHLRD---NRLRRSSRDGTAGDAAGVLEDYGCL 466
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDG 556
GLL L++ +WL A+ L +T F + G ++ T +D VL+ R + D
Sbjct: 467 AEGLLALHQATGDPRWLTEAVNLLDTALANFAVADTPGAYHDTADDAEVLVHRPSDPTDN 526
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL--KDMAMAVPLMCCAAD 614
A PSG S ++ N + AS++ G + A +L K A + A
Sbjct: 527 ASPSGAS-ALTNALVTASVLVGPDRSARYRAAAEEAVHRTGQLIAKAPRFAGHWLTAAEA 585
Query: 615 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
+L+ P + V + G S+ ++L A A V+ D E +
Sbjct: 586 LLAGPVQ--VAIAGPDSTE--RDLLRAVAARRAHGGAVVLAGEPDAEGVPL--------- 632
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A + A VC+ + C PVT P L + L
Sbjct: 633 -LADRPLVAGQAAAYVCRGYVCDRPVTSPDDLVSAL 667
>gi|288818675|ref|YP_003433023.1| hypothetical protein HTH_1371 [Hydrogenobacter thermophilus TK-6]
gi|384129427|ref|YP_005512040.1| hypothetical protein [Hydrogenobacter thermophilus TK-6]
gi|288788075|dbj|BAI69822.1| conserved hypothetical protein [Hydrogenobacter thermophilus TK-6]
gi|308752264|gb|ADO45747.1| protein of unknown function DUF255 [Hydrogenobacter thermophilus
TK-6]
Length = 648
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 198/585 (33%), Positives = 306/585 (52%), Gaps = 53/585 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED +AK++N+ FV+IKVDR+ERPD+D+ Y V AL G GGWPL+ F
Sbjct: 52 CHWCHVMAKESFEDPEIAKIINENFVAIKVDRDERPDIDRRYQETVIALTGSGGWPLTAF 111
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD K GGTYFPPED++GRPG K++L ++ W ++++ + +S +L
Sbjct: 112 LTPDGKLFFGGTYFPPEDRWGRPGLKSLLLRISQLWREEKERILKSADHIFLELQ----- 166
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ SS D + + L+ L S D GG GSAPKF +++LYH ++
Sbjct: 167 NYSSMTFKDFVDEELLKRGIGALLSSVDYEKGGIGSAPKFHHAKAFELLLYHYYFTKE-- 224
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++ ++ +L MAKGGI+DH+ GGF RYS D+ W++PHFEKMLYD +L +Y
Sbjct: 225 -----EIVKRAIISSLDAMAKGGIYDHLLGGFFRYSTDDTWNIPHFEKMLYDNAELLRLY 279
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
A+ + ++ Y Y+ + I++Y + G ++++DAD + EG Y +T
Sbjct: 280 SLAYQVFENPLYEYVAKGIVNYYKLYGSDQEGGFYASQDADIGVLD------EGGHYTFT 333
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
S E+ +L + + Y+ G RM PH++ KNVL D+ + L +
Sbjct: 334 SDELRLLLDPEELKVVKLYF----GIDTRGRM--PHHQH--KNVLFINMDAQQVSKVLDI 385
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
P EK +L + K+ R+ R P++D + WNGL+I + K+ + E M
Sbjct: 386 PKEKVEELLKSAKEKMLSYRNSREIPYIDKTIYTGWNGLMIDALCVYYKVFQDEWSLLM- 444
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
AE A+ + + Y + + L H+ +G S G+ +DY +L GL
Sbjct: 445 ---------------AEKTANRLIKERYRDGS--LDHT--DGVS---GYSEDYIYLSQGL 482
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPS 560
L L+E +L A EL + ELF D +G G+F+T + +LL + K D S
Sbjct: 483 LSLFEITQNRTYLDMAKELLDKAIELFWDDQGWGFFDTHQKGEGLLLIKHKPIQDTPIQS 542
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
N S L+ + +I +K Y + AE +L F +++M MA
Sbjct: 543 VNGTSPYLLLLMEAITGDTK---YGEYAEKNLMAFSRFMREMPMA 584
>gi|424867573|ref|ZP_18291355.1| hypothetical protein C75L2_00200010 [Leptospirillum sp. Group II
'C75']
gi|124516649|gb|EAY58157.1| protein of unknown function [Leptospirillum rubarum]
gi|387221885|gb|EIJ76392.1| hypothetical protein C75L2_00200010 [Leptospirillum sp. Group II
'C75']
Length = 689
Score = 311 bits (798), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 216/698 (30%), Positives = 335/698 (47%), Gaps = 64/698 (9%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLS 79
CHWCHVM ESFE +A ++N++FV+IKVDREERPD+D++Y M + GGWPL+
Sbjct: 49 ACHWCHVMAHESFERPDIASVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLT 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FL+P P GGTYFP + ++G PGF +L +++D + R+ L + ++ L +
Sbjct: 109 MFLTPSQVPFAGGTYFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTN 168
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S D P AL L +D FGGFG APKFP +++ + ++ +
Sbjct: 169 PVADSREFELDLSPSEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFQR 222
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G S A M TL M +GGI D VGGGF RYSVDERW +PHFEKMLYD L
Sbjct: 223 KGDSTAA----HMATVTLSSMKRGGIWDQVGGGFARYSVDERWLIPHFEKMLYDNALLLE 278
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
S++K+ YS +++ +L R+M G +S+ DADS EG +EG FYV
Sbjct: 279 ALALGASVSKNPVYSRTAEELVGWLFREMRSDDGVYYSSLDADS---EG----EEGRFYV 331
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV-LIELNDSSASASK 378
+ ++EV IL + YY +S P N F+G L E + +
Sbjct: 332 FQAEEVRSILSDEEYRVVSKYY----------GLSGPPN-FEGHAWNLYEARSIGELSKE 380
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ + R+KLF RS R RP LDDKV+ SWN L+ A++
Sbjct: 381 FHLSESDIERRIESARQKLFAYRSTRVRPGLDDKVLASWNALM--------------AKA 426
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+F+ ++G ++E++ ++ R ++ + L + P +LDDYAFL+
Sbjct: 427 LLFSGRILG--KQEWISAGRKTIDYMHRKMW--KNGLLMAVYSKKEPFLPAYLDDYAFLL 482
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+L+ + L +A + + F D E GG++ T +++ R K HDGA
Sbjct: 483 LAVLESMRIDFRPEDLSFATTIADVLLAEFYDPESGGFYFTGKNHEALIHRPKNGHDGAL 542
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSGN+ +V L+ L ++ Y A+ +L ++ ++K+ M A + S
Sbjct: 543 PSGNAAAVQGLLWLGTLTGHLP---YTSAADKTLRLYFAQMKEQPAGYTTMISALETYS- 598
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
+ VV + + D+++ ++ D V+ + A + + E R
Sbjct: 599 -DSQPVVFLAGPQAGDWKDKISCG---VDTEAFVLDLTNAVRDSLPLPEG--------MR 646
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPSS 716
+F +K VC+ C P SL+ L P S
Sbjct: 647 KHFPENKTTGWVCRGTMCLPSADSLESLQEQLRLWPLS 684
>gi|302894519|ref|XP_003046140.1| hypothetical protein NECHADRAFT_33848 [Nectria haematococca mpVI
77-13-4]
gi|256727067|gb|EEU40427.1| hypothetical protein NECHADRAFT_33848 [Nectria haematococca mpVI
77-13-4]
Length = 712
Score = 311 bits (797), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 212/669 (31%), Positives = 326/669 (48%), Gaps = 91/669 (13%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+C +M +ESF + A +LN++FV + VDREERPD+D +YM YVQA+ GG
Sbjct: 75 HIGYKACHFCRLMLLESFSNPDCASVLNEFFVPVIVDREERPDLDTIYMNYVQAVSNAGG 134
Query: 76 WPLSVFLSPDLKPLMGGTYFPPEDKYGRP-----------GFKTILRKVKDAWDKKR--- 121
WPL++FL+P+L+P+ GGTY+P GR F TI++KV+D W +
Sbjct: 135 WPLNLFLTPNLEPVFGGTYWP--GPAGRRHTTDDSADEVLDFLTIVKKVRDIWSDQESRC 192
Query: 122 -----DMLAQSGAFAIEQLSEALSASASSNKLP----------------------DELPQ 154
++L Q FA E + SA+S P +EL
Sbjct: 193 RKEATEVLGQLREFAAEGTLGTRNISATSALAPSGWGAPAPSHTSAPKDKDTSVSEELDL 252
Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKF---PRPVEIQMMLYHSKKLEDTGKSGEASEGQK 211
+ L ++ ++D +GGFG APKF P+ + +L ++++D E +
Sbjct: 253 DQLEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLGFLLGLLNFPREVQDVVGEAECKHATE 312
Query: 212 MVLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT-- 268
M L TL+ + G +HDHVGG GF R SV W +P+FEK++ D QL ++YLDA+ T
Sbjct: 313 MALDTLRHIRDGALHDHVGGTGFSRCSVTPDWSIPNFEKLVVDNAQLLSLYLDAWKSTGG 372
Query: 269 -KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
K + I ++ +YL I P G S+E ADS G +EGA+YVWT +E +
Sbjct: 373 DKPTEFFDIVIELAEYLSSAPIALPEGGFASSEAADSHYRRGDREMREGAYYVWTRREFD 432
Query: 327 DILGE----HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 382
+L E + + H+ + GN D DP+++F +N+L + + +P
Sbjct: 433 SVLDEVNKHMSPVLAAHWAVNEDGNVD--EHHDPNDDFINQNILRIERSVQQLSVQFSIP 490
Query: 383 LEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
+K + E + L R K R RP LDDKV+ WNGLVIS+ A+ + LK
Sbjct: 491 EDKVRQYVQEGKVALKQRRDKERVRPDLDDKVVAGWNGLVISALAKTALALKG------- 543
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
+ +Y+ VAE A FI+ L+D ++ + +G + F DDYA+L GL
Sbjct: 544 ---LRPEQSSKYLAVAEKAVKFIQEKLWDSD-RKVLYRIWSGERETQAFADDYAYLTQGL 599
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
LDL++ +LV+A LQ + P +LR+K+ D + PS
Sbjct: 600 LDLFDATGNEAYLVFADTLQPSS-------------------PHTILRLKDGMDTSVPST 640
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 621
N++SV NL R+A ++A D NA ++ FE + P + + S+
Sbjct: 641 NAISVSNLFRIADLLA---DDKLAVNARQTINAFEAEMLQHPWLFPGLLAGVVTARLGSQ 697
Query: 622 KHVVLVGHK 630
+ V V ++
Sbjct: 698 RRNVNVNYQ 706
>gi|13473777|ref|NP_105345.1| hypothetical protein mlr4484 [Mesorhizobium loti MAFF303099]
gi|14024528|dbj|BAB51131.1| mlr4484 [Mesorhizobium loti MAFF303099]
Length = 671
Score = 311 bits (797), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 197/556 (35%), Positives = 283/556 (50%), Gaps = 56/556 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE++GVA ++N FV+IKVDREERPD+D++YM + ++ GGWPL++
Sbjct: 53 ACHWCHVMAHESFENDGVAAVMNRLFVNIKVDREERPDIDQIYMAALSSMGEQGGWPLTM 112
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD KP GGTYFP E +YGRPGF ++ V AW +KRD L QS + L+ +
Sbjct: 113 FLTPDGKPFWGGTYFPREARYGRPGFIQVMEAVDKAWREKRDSLHQSA----DGLTSHVE 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A S L + AL A ++ D GG APKFP + L+ S
Sbjct: 169 ARLSGTHARQSLDRGALTDLAGRIDGMVDRDLGGLRGAPKFPN-APFMLTLWLSWL---- 223
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+ G A+ + VL +L+ M GGI+DH+GGG RYS D W VPHFEKMLYD +L
Sbjct: 224 -RDGNAAH-RDDVLVSLERMLAGGIYDHIGGGLSRYSTDAEWLVPHFEKMLYDNAELIRF 281
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
AFS + + + + +D+L R+M GG ++ DADS +EG FY W
Sbjct: 282 CNWAFSASGNDLFRIRIEETVDWLLREMRVEGGAFAASLDADS-------DGEEGLFYTW 334
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+E++ +LG+ + LF +++ L S PH ++GK V+ + A
Sbjct: 335 NRQEIKTVLGDDSALFFKYFTL-----------SAPHG-WEGKPVIHQTRTQQAQGVA-- 380
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
EK + + + +L VR +R RP LD K + WNGL+I++ A A + L
Sbjct: 381 -DREKLIPL----KARLLAVREERVRPGLDAKTLTDWNGLMIAALAEAGRSLG------- 428
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
R E++E A+ A + I D RL HS P DYA + +
Sbjct: 429 ---------RPEWIEAADKAFAHISGASRD---GRLPHSMLGTRKLFPALSSDYAAMANA 476
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
+ L+E ++ A + D + D G GY+ T + V +R++ D D A S
Sbjct: 477 GISLFEASGDWSYIDQAKQFIEQLDHWYPDPAGTGYYLTASDSTDVPIRIRGDVDEAISS 536
Query: 561 GNSVSVINLVRLASIV 576
S + LVRLAS+
Sbjct: 537 ATSQIIAALVRLASVT 552
>gi|338213486|ref|YP_004657541.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336307307|gb|AEI50409.1| protein of unknown function DUF255 [Runella slithyformis DSM 19594]
Length = 700
Score = 311 bits (797), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 198/580 (34%), Positives = 290/580 (50%), Gaps = 71/580 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE E VA ++N FV IKVDREERPDVD +YM + A+ GGWPL+
Sbjct: 48 SACHWCHVMERESFEKEQVAAVMNADFVCIKVDREERPDVDAIYMDAIHAMGARGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---- 135
VFL PD KP G TY P ++ + +L VK+A+ + L +S + +
Sbjct: 108 VFLLPDAKPFYGVTYLPAQN------WVQLLGSVKNAFVNHHEELVKSAEGFTDNMLIKE 161
Query: 136 -------------SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFP 182
EA A AS D+L + E++ +D+ GG APKFP
Sbjct: 162 TDKYNLHATSPQGDEADRAEASPAPTLDDLHE-----MFEKIKGHFDTEKGGMDRAPKFP 216
Query: 183 RPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 242
P + +L + ++ E + + +L +A GGI+DHVGGG+ RYSVD+ W
Sbjct: 217 MPSIYKFLLRYYALTQN-------PEALRHIELSLNRIALGGIYDHVGGGWARYSVDDEW 269
Query: 243 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 302
+PHFEKMLYD GQL ++Y +A++LTK+ Y + +D+L R+M G +SA DAD
Sbjct: 270 FIPHFEKMLYDNGQLLSIYSEAYTLTKNELYKSRVYETIDWLEREMTSTEGGFYSALDAD 329
Query: 303 SAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKG 362
S EG EG FYVWT E+ +LG+ F + Y ++ +GN + +N
Sbjct: 330 S---EGV----EGKFYVWTQAELRSVLGDDFEWFSKLYNIRASGNWEHG-----YNHLHL 377
Query: 363 KNVLIELNDSSASASKLGMPLEKYLNILGE-------CRRKLFDVRSKRPRPHLDDKVIV 415
+ S ++G PL + L E +KLF R R RP LDDK++
Sbjct: 378 TTISFVPETVEKSQWRVGPPLNYLMKGLFEKNSTYQAALQKLFVARESRIRPGLDDKILA 437
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 475
SWNGL++ A + E ++ +A +A F++ + H+
Sbjct: 438 SWNGLMLKGLTDAYRAFGEE----------------KFKTLALQSAHFLKDKM-TAPNHQ 480
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 535
L HS++NG + GFL+DYA ++ G L LY+ +WL A++L E D E
Sbjct: 481 LWHSYKNGKASIVGFLEDYAAVVDGYLGLYQATFEEQWLDEALKLTAYAIENLYDPEEEL 540
Query: 536 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASI 575
++ T ++ R KE D P+ NS+ NL L ++
Sbjct: 541 FYFTDANAEELIARKKEIFDNVIPASNSLMAHNLFTLGTL 580
>gi|291437584|ref|ZP_06576974.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
gi|291340479|gb|EFE67435.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
Length = 677
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 230/701 (32%), Positives = 333/701 (47%), Gaps = 83/701 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED A LN FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SSCHWCHVMAHESFEDRTTADYLNGHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+PD +P GTYFPPE ++G P F +L+ + AW ++RD + L+
Sbjct: 108 VFLTPDAEPFYFGTYFPPEPRHGMPSFLQVLQGIHQAWQERRDEVTDVAGKITRDLA-GR 166
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S K+P EL Q L L++ YD + GGFG APKFP + ++ +L H +
Sbjct: 167 EISYGDAKVPGEQELAQALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR- 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 --TGAEG----ALQMAQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALL 274
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY + T + + D++ R++ P G SA DADS +G R EGA+
Sbjct: 275 CRVYAHLWRATGSELARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAY 332
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSAS 375
YVWT ++ ++LGE A L ++ + G + +G +VL + D
Sbjct: 333 YVWTPAQLREVLGEEDADLAARYFGVTEEGTFE-----------EGASVLQLPQRDEVFD 381
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
A++ + R +L R+ RP P DDKV+ +WNGL +++ A
Sbjct: 382 AAR-----------VDGVRERLLAARAARPAPGRDDKVVAAWNGLAVAALAETGAYF--- 427
Query: 436 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDY 494
DR + +E A +A + R +DE R+ + ++G A G L+DY
Sbjct: 428 -------------DRPDLVEAAVAAGDLLVRLHFDEHA-RIARTSKDGHVGANAGVLEDY 473
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A + G L L WL +A L + F D + G ++T + ++ R ++
Sbjct: 474 ADVAEGFLALASVTGEGVWLEFAGLLLDHVLARFTDPDSGALYDTAADAERLIRRPQDPT 533
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
D A PSG S + L+ S A + S+ +R AE +L V +K + VP +
Sbjct: 534 DNAVPSGWSAAAGALL---SYAAHTGSEPHRTAAERALGV----VKALGPRVPRFIGWGL 586
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A +L P + + +VG L V+ + ++E
Sbjct: 587 AVAEAVLDGP--REIAVVGPAPDDPATRTLHRTALLGTAPGAVVAVGTPGSDEFPL---- 640
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A D+ A VC++F+C P TDP L L
Sbjct: 641 ------LADRPLVRDEPAAYVCRDFTCDAPTTDPDRLRAAL 675
>gi|238062793|ref|ZP_04607502.1| hypothetical protein MCAG_03759 [Micromonospora sp. ATCC 39149]
gi|237884604|gb|EEP73432.1| hypothetical protein MCAG_03759 [Micromonospora sp. ATCC 39149]
Length = 703
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 223/707 (31%), Positives = 336/707 (47%), Gaps = 75/707 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED GV KLLND FV+IKVDREERPDVD VYMT QA+ G GGWP++VF
Sbjct: 49 CHWCHVMAHESFEDAGVGKLLNDGFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTVF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
+PD P GTYFP +P F +L V AW ++R+ + + G+ +E + A +
Sbjct: 109 ATPDGTPFFCGTYFP------KPNFVRLLESVGTAWREQREAVLRQGSAVVEAIGGAQAV 162
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ L A +L++ YD GGFG APKFP + + +L H ++ TG
Sbjct: 163 GGPTAP----FTAELLDAAAARLAREYDRDNGGFGGAPKFPPHLNLLFLLRHHQR---TG 215
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++E ++ T + MA+GGIHD + GGF RYSVD W VPHFEKMLYD L VY
Sbjct: 216 ----SAESLEIARHTAEAMARGGIHDQLAGGFARYSVDAHWTVPHFEKMLYDNALLLRVY 271
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
+ LT D + RD +L ++ PG SA DAD+ EG T Y WT
Sbjct: 272 THLWRLTGDPLARRVARDTARFLADELHRPGEGFASALDADTEGVEGLT-------YAWT 324
Query: 322 SKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
++ ++LGE + + + P+G S P + +E S +L
Sbjct: 325 PAQLVEVLGESDGRWAADLFAVTPSGTFAPHSASAPQGGTPDRRKGVE---HGTSVLRLA 381
Query: 381 MPLE--------KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 432
++ ++ +++G +L R RP+P DDKV+ +WNGL I++ A +++
Sbjct: 382 RDVDDADPAIRGRWRDVVG----RLLAARDTRPQPARDDKVVAAWNGLAITALAEFVRLV 437
Query: 433 KS------EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 486
++ +A++ + + +D + AE A+ HL D + R+ G +
Sbjct: 438 EAVGTGDEQADANLLEGVTIVAD-GALRDAAEHLAAV---HLVDGRLRRVSRDRVVG--E 491
Query: 487 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 546
G L+DY + +++ +WL A +L +T F GGG+++T + +
Sbjct: 492 PAGVLEDYGCVAEAFCAMHQLTGEGRWLELAGDLLDTALARFA-APGGGFYDTADDAERL 550
Query: 547 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 606
+ R + D A PSG S V LV A++ S YR+ AE +LA + A
Sbjct: 551 VTRPADPTDNATPSGRSAIVAALVTYAAL---SGQPRYREVAEAALATVAPIVARHARFT 607
Query: 607 PLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
A + +LS P VV + ++AAA+ ++ P
Sbjct: 608 GYAATAGEALLSGPYEIAVV----TDDPAGDPLVAAAYRHAPPGAVLVAGRP-------- 655
Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+A + A VC+ F C PVT ++E+LL +
Sbjct: 656 ---DQPGVPLLADRPMLDGRPTAYVCRGFVCQRPVT---TVEDLLAQ 696
>gi|154245776|ref|YP_001416734.1| hypothetical protein Xaut_1832 [Xanthobacter autotrophicus Py2]
gi|154159861|gb|ABS67077.1| protein of unknown function DUF255 [Xanthobacter autotrophicus Py2]
Length = 669
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 205/573 (35%), Positives = 290/573 (50%), Gaps = 61/573 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE+ VA L+N FV+IKVDREERPDVD++YM+ +Q L GGWPL++
Sbjct: 50 ACHWCHVMAHESFENADVAGLMNALFVNIKVDREERPDVDQIYMSALQQLGQSGGWPLTM 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL P+ KP GGTYFPP YGRPGF +L++V + + +D + ++ A + +L +A +
Sbjct: 110 FLDPEGKPFWGGTYFPPAASYGRPGFTDVLQQVSTVFTQNKDKVEKNTATILARLKKAAT 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A + ++L A RL A +D GG APKFP+ ++ + + +D
Sbjct: 170 PVAGAAIGREDLNDAAARLPA-----MFDPVHGGLKGAPKFPQSGLLEFLWRVGTRRKDD 224
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+ +V TL M +GGI+DH+GGGF RYSVDE W VPHFEKMLYD L +
Sbjct: 225 AL-------KAIVALTLNRMCEGGIYDHLGGGFARYSVDEIWFVPHFEKMLYDNALLLEL 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
A+S T D + R+ + +L+R+M+ P G ++ DAD TEG EG FYVW
Sbjct: 278 LALAYSDTGDALFLTRARETVGWLKREMLTPEGAFAASLDAD---TEG----HEGRFYVW 330
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+ E+ +LG E A F Y + GN ++ N+L SA
Sbjct: 331 SEAEITAVLGAEDAAFFNRLYDVSRAGNWEVG------------NILNRTEAGVVSAEDE 378
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
L R KL R KR RP DDKV+ WNGL+I++ ARA L
Sbjct: 379 AR--------LAPLREKLLLAREKRVRPGRDDKVLADWNGLMIAALARAGGFL------- 423
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
E++ +A+ A + H+ E RL HS+ PG D A +
Sbjct: 424 ---------GEAEWVALAQRAFDAVVSHMVVEG--RLAHSWCGTKIVLPGLASDLAAMAR 472
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+ L+E + L A + D E G YF T + S++LR HD A P
Sbjct: 473 AGIALHEATGAPEPLAQAAHFLEVLETHHRDPETGAYFLTAYDGDSLILRPLATHDEAVP 532
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
+ N+V+ L+RLA++ + +D +R A+ L
Sbjct: 533 NANAVAADALIRLAAL---TGNDAFRTRADRVL 562
>gi|284033485|ref|YP_003383416.1| hypothetical protein Kfla_5611 [Kribbella flavida DSM 17836]
gi|283812778|gb|ADB34617.1| protein of unknown function DUF255 [Kribbella flavida DSM 17836]
Length = 670
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 218/696 (31%), Positives = 327/696 (46%), Gaps = 83/696 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED+ A LN+ FV +KVDREERPDVD +YM A+ G GGWP+S
Sbjct: 49 SACHWCHVMAHESFEDDATAAYLNEHFVCVKVDREERPDVDAIYMEATVAMTGHGGWPMS 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P +P GTYFP + ++G F+ +L + DAW KR+ + GA ++QL
Sbjct: 109 VFLTPAGEPFFCGTYFPLDPRHGMASFRQVLESLVDAWRTKREQIDGIGASVVQQL---- 164
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
A + + + L L +D GGFG APKFP + + +L H ++
Sbjct: 165 --GARQPAVGEAVDAAVLDRAVALLQGDFDPVDGGFGQAPKFPPSMVLDFLLRHHRR--- 219
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG + E MV T + MA+GG++D + GGF RYSVD++W VPHFEKMLYD L +
Sbjct: 220 TG----SEEALAMVTHTCERMARGGMYDQLAGGFARYSVDKQWIVPHFEKMLYDNALLLD 275
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY +++T + + D+L ++ P G SA DAD TEG +EG +YV
Sbjct: 276 VYTHWWTVTGSPLAERVALETADFLLAELRTPEGGFASALDAD---TEG----EEGRYYV 328
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
W+ E+ ++LGE A E CD++ F+ +++L
Sbjct: 329 WSPTELRELLGEDADWVIEL--------CDVT------GTFEHGTSVLQLRSDPDD---- 370
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
L+++ I R L D R++R P DDKV+ +WNGL I++ RA +L
Sbjct: 371 ---LDRWNRI----RSVLRDARARRTYPGRDDKVVAAWNGLAITALTRAGLVL------- 416
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLI 498
DR EY+E A AA + R ++ + + RL + R+G A G L+DYA
Sbjct: 417 ---------DRPEYVEAAVKAAELV-RDVHVDGSGRLHRTSRDGAVGTAHGVLEDYAAYA 466
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L L WL A L + + F+ G +F+T + ++ R ++ D A
Sbjct: 467 QACLTLLAATRDDSWLTLAQRLLDRVLQQFV--ADGTFFDTAADAETLAWRPQDATDNAS 524
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
P+G S++ LAS+ ++ Y + + A + A + A + S
Sbjct: 525 PAGVSLAAEAFSTLASVTGEAR--YEQAADQALAASAAIAARAPRFAGRALAVAETLQSG 582
Query: 619 PSRKHVVLVGHKSSVDFE----NMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNA 674
P V+ ++ D + ++ A AS V+ P S+
Sbjct: 583 PLEIAVIGAEDVAAGDGQEQVTQLVRTALASAPWGTAVVQGKP------------GSDVP 630
Query: 675 SMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VCQ F+C P+ P L L
Sbjct: 631 LLAGRGLVDGRAAAYVCQKFTCRLPIVLPEDLRGEL 666
>gi|305665308|ref|YP_003861595.1| hypothetical protein FB2170_03390 [Maribacter sp. HTCC2170]
gi|88710063|gb|EAR02295.1| hypothetical protein FB2170_03390 [Maribacter sp. HTCC2170]
Length = 703
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 197/595 (33%), Positives = 317/595 (53%), Gaps = 78/595 (13%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVME E+FEDE VA+++N+ F+S+KVDREERPDVD+VYMT VQ + G GWPL+V
Sbjct: 85 SCHWCHVMEEETFEDEKVAEIMNNDFISVKVDREERPDVDQVYMTAVQLMSGNAGWPLNV 144
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD---KKRDMLAQSGAFAIEQLSE 137
+ P+ KPL GGTY + + +L K+ + + K + A + I+ ++
Sbjct: 145 IVLPNGKPLYGGTY------HTNAQWSQVLEKINNLYKDDPTKANEYADMVSKGIQDVNL 198
Query: 138 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ +S E+ + L+ Q ++D GG KF P + +L
Sbjct: 199 IEPSEENS-----EISLDILKEGVTQWKPNWDLERGGNMGPEKFMLPGSLDFLL------ 247
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
D + + + TL MAKGGI+DH+ GGF+RYS D W++PHFEKMLYD QL
Sbjct: 248 -DYAELSNDESVRSYIKTTLDQMAKGGIYDHIAGGFYRYSTDPNWNIPHFEKMLYDNAQL 306
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
++Y A+++ KD Y I + + +L+++M G F+A DADS EG +EG +
Sbjct: 307 ISLYSKAYTIFKDPVYKQIVLETVAFLQKEMKNTTGGYFAALDADS---EG----EEGKY 359
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASA 376
YVWT++E+ + + LF ++Y ++ + +G +++ N + A
Sbjct: 360 YVWTNEELRSTINNNQELFSKYY------------STEISTKMEGDKIVLRKNQNDEVFA 407
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
S+ + +EK + E ++KL +VR+ R +P +DDK+IVSWN L+I+ + A
Sbjct: 408 SENEISIEKLQELNKEWKKKLVEVRADRVKPRIDDKIIVSWNALLINGYVDA-------- 459
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
F G R ++ AES + I + Y + ++L HSF+ G ++ GFL+DY+F
Sbjct: 460 ------FKAFGETR--FLVEAESIFTTIHENAYSD--NQLVHSFKKGSNRTEGFLEDYSF 509
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY-FNTTGEDPSVLLRVKEDHD 555
L + L+LY +L +A +L T + F D + Y FN++ S++ ++ ++ D
Sbjct: 510 LANASLNLYSASMNPDYLNFAQQLIKTTQKRFKDDDSDFYKFNSSN---SLIAKIIKNDD 566
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV-PLM 609
G PS N+V NL+ L I +Y + A HS K+M +++ PL+
Sbjct: 567 GVIPSPNAVMAHNLLTLGHI------EYNKDYAAHS--------KNMLISIQPLL 607
>gi|386383690|ref|ZP_10069151.1| hypothetical protein STSU_12230 [Streptomyces tsukubaensis
NRRL18488]
gi|385668865|gb|EIF92147.1| hypothetical protein STSU_12230 [Streptomyces tsukubaensis
NRRL18488]
Length = 672
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 238/704 (33%), Positives = 333/704 (47%), Gaps = 90/704 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFEDE A LN+ FVS+KVDREERPDVD VYM VQA G GGWP++
Sbjct: 47 SSCHWCHVMAHESFEDEATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+ D +P GTYFPPE ++G F+ +L V AW +R+ + + A L+
Sbjct: 107 VFLNADGEPFYFGTYFPPEPRHGMASFRQVLEGVTAAWRDRREEVGEVAAKITRDLA-GR 165
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+A+ LP DEL Q L L++ YD R+GGF APKFP + ++ +L H +
Sbjct: 166 AAAHGGEGLPGEDELSQALL-----GLTRDYDERYGGFAGAPKFPPSMVLEFLLRHYAR- 219
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G M T + MA+GG++D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 220 --TGARG----ALDMAAGTCEAMARGGLYDQLGGGFARYSVDREWIVPHFEKMLYDNALL 273
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY + I + D+L R++ G SA DADS + G EGAF
Sbjct: 274 CRVYAHLWRADGSPLARRIALETADFLVRELRTAEGGFASALDADSHDPAG--EHGEGAF 331
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
YVWT ++ + LGE D R ++ + + E AS
Sbjct: 332 YVWTPAQLTEALGE----------------ADGRRAAEIYG-------VTEEGTFERGAS 368
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
L +P E + R +LF+ R +RPRP DDKV+ +WNGL I++ A
Sbjct: 369 VLRLPGEDDPAL----RARLFEARERRPRPERDDKVVAAWNGLAIAALAETGAFF----- 419
Query: 438 SAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYA 495
DR + +E A AA +R HL D RL + ++G PG L+DYA
Sbjct: 420 -----------DRPDLVERATEAADLLVRVHLGDGA--RLTRTSKDGVAGHNPGVLEDYA 466
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ G + L WL +A L + +LF E G F+T + ++ R ++ D
Sbjct: 467 DVAEGFIALAGVTGEGVWLDFAGVLLDLVIDLFTG-ENGTLFDTAHDAERLIRRPQDPTD 525
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MC 610
A P+G + + L+ S A + S+ +R AE +L V +K + VP +
Sbjct: 526 NATPAGWTAAAGALL---SYAAHTGSEPHRAAAERALGV----VKALGPRVPRFAGWGLA 578
Query: 611 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHN 670
A +L P + + +VG + A + V +P D +E +
Sbjct: 579 VAEALLDGP--REIAVVGLDGDPAARALHRTALIATAPGAVVASGEP-DGDEFPLLKGRP 635
Query: 671 SNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
N A A VC+ F+C P TDP L + L P
Sbjct: 636 LVNGEAA----------AYVCRGFTCRTPTTDPAELASELAGAP 669
>gi|444721531|gb|ELW62264.1| Spermatogenesis-associated protein 20 [Tupaia chinensis]
Length = 857
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 210/575 (36%), Positives = 289/575 (50%), Gaps = 81/575 (14%)
Query: 178 APKFPRPVEIQMMLYHSKKLED--------TGKSGEASEGQKMVLFTLQCMAKGGIHDHV 229
AP P P + +ML S + + + S Q+M L TL+ MA GGI DHV
Sbjct: 320 APHHPDPPPLSLMLSVSTVILSFLFSYWLGHRLTQDGSRAQQMALHTLKMMANGGIRDHV 379
Query: 230 GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS---------------LTKDVFYS 274
G +WHVPHFEKMLYDQ QLA Y AF ++ D FYS
Sbjct: 380 G----------QWHVPHFEKMLYDQAQLAVAYSQAFQAAPVTSIYSLLSAPQISGDEFYS 429
Query: 275 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV-----EDIL 329
+ + IL Y+ R + G +SAEDADS G R KEGAFYVWT KEV E +L
Sbjct: 430 DVAKGILQYVSRSLSHRSGGFYSAEDADSPPERG-LRPKEGAFYVWTVKEVLQQLPEPVL 488
Query: 330 G-----EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
G L +HY L GN +S DP E +G+NVL +A++ G+ ++
Sbjct: 489 GATEPLTSGQLLMKHYGLTEPGN--ISPNQDPKGELQGQNVLTVRYSLELTAARFGLDVD 546
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
+L KLF R RP+PHLD K++ +WNGL++S +A +L
Sbjct: 547 AVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAVL------------ 594
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAF 496
G DR + A + A F++RH++D + RL + G S P GFL+DYAF
Sbjct: 595 --GVDR--LITYATNGAKFLKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYAF 650
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHD 555
++ GLLDLYE + WL WA+ LQ+TQD+LF D +GGGYF + E + L LR+K+D D
Sbjct: 651 VVRGLLDLYEASQESAWLEWALRLQDTQDKLFWDSQGGGYFCSEAELGAGLPLRLKDDQD 710
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
GAEPS NSVS NL+RL G K + L F R++ + +A+P M A
Sbjct: 711 GAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA 767
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNAS 675
+ K +V+ G + D + +L H+ Y NK +I AD + F ++
Sbjct: 768 -HQQTLKQIVICGDPQAKDTKALLQCVHSIYVPNKVLIL---ADGDPSSFLSRQLPFLST 823
Query: 676 MARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ R D+ A VC+N +CS P+T+P L LL
Sbjct: 824 LRRLE---DRATAYVCENQACSMPITEPSELRKLL 855
Score = 215 bits (547), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 108/237 (45%), Positives = 153/237 (64%), Gaps = 17/237 (7%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G+ +F K + FL TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPD
Sbjct: 83 GQEAFDKARKENKPIFLSVGCATCHWCHMMEEESFQNEEIGRLLSEEFVSVKVDREERPD 142
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VDKVYMT+VQA GGGWP++V+L+PDL+P +GGTYFPPED R GF+T+L +++D W
Sbjct: 143 VDKVYMTFVQATSSGGGWPMNVWLTPDLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWK 202
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGF 175
+ ++ L ++ E+++ AL A + + +LP +A + C +QL + YD +GGF
Sbjct: 203 QNKNTLLENS----ERVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGF 258
Query: 176 GSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
APKFP PV + + + +L G S Q+M L TL+ MA GGI DHVG
Sbjct: 259 AEAPKFPTPVILSFLFSYWLGHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVG 310
>gi|300770884|ref|ZP_07080761.1| thymidylate kinase [Sphingobacterium spiritivorum ATCC 33861]
gi|300762157|gb|EFK58976.1| thymidylate kinase [Sphingobacterium spiritivorum ATCC 33861]
Length = 672
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 192/572 (33%), Positives = 284/572 (49%), Gaps = 67/572 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE++ +A+ +N ++VS+K+DREERPD+D++YMT VQ + GGWPL+
Sbjct: 48 SACHWCHVMERESFENDAIAQTMNKFYVSVKIDREERPDIDQIYMTAVQLMTNAGGWPLN 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIE---QLS 136
PD +P+ GGTYF P D ++ IL ++ W+ Q AIE +L+
Sbjct: 108 CICLPDGRPIYGGTYFKPHD------WQNILLQIAQMWE-------QQPLVAIEYATKLT 154
Query: 137 EALSASA--SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 194
+ + S N +PD+ L +D++ GG+ APKFP P +L
Sbjct: 155 DGIQQSERLPINPIPDQYNTADLSAIITPWVALFDTKDGGYNRAPKFPLPNNWLFLL--- 211
Query: 195 KKLEDTGKSGEASEGQKM---VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
+ G + +K+ V FTLQ MA GGI+D +GGGF RYSVD WH+PHFEKML
Sbjct: 212 -------RYGVLAGDEKIIDHVHFTLQKMACGGIYDQIGGGFARYSVDPYWHIPHFEKML 264
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD GQL +++ +A+ FY + ++ + + R+M+ + A DADS EG
Sbjct: 265 YDNGQLLSLFSEAYQQRPLPFYKRVVQETIHWANREMLAANNGFYCALDADS---EGV-- 319
Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
EG +Y ++ E+E ILGE A LF ++ + GN + N+ I D
Sbjct: 320 --EGKYYSFSKSEIEKILGEDAPLFISYFNITAEGNWTE----------ESTNIPILDPD 367
Query: 372 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 431
+ A + G E++ L E + KL+ R R RP LD K + +WN L++ A ++
Sbjct: 368 ADLMALEAGYSAEEWETCLAEAKEKLYRYRETRIRPGLDHKQLATWNALMLKGLTDAYRV 427
Query: 432 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 491
D Y++ A A FI L + R+ H ++ + GFL
Sbjct: 428 F----------------DNSSYLDTAIKNAHFIIDELI-KSDGRILHQPKDANREIFGFL 470
Query: 492 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 551
DDYAF + LYE KWL A +L + ELF D ++ T ++ R
Sbjct: 471 DDYAFTTEAFIALYEATFDEKWLDLARQLADKALELFYDSHQKTFYYTADSSGELIARKS 530
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
E D P+ S V+ L +L + K DY
Sbjct: 531 EIMDNVIPASTSAIVLQLKKLGLLF--DKEDY 560
>gi|374599798|ref|ZP_09672800.1| hypothetical protein Myrod_2291 [Myroides odoratus DSM 2801]
gi|423324955|ref|ZP_17302796.1| hypothetical protein HMPREF9716_02153 [Myroides odoratimimus CIP
103059]
gi|373911268|gb|EHQ43117.1| hypothetical protein Myrod_2291 [Myroides odoratus DSM 2801]
gi|404606964|gb|EKB06498.1| hypothetical protein HMPREF9716_02153 [Myroides odoratimimus CIP
103059]
Length = 665
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 202/594 (34%), Positives = 296/594 (49%), Gaps = 79/594 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+TCHWCHVME ESF + VA+++N F+SIKVDREE PDVD YM VQ + GGWPL+
Sbjct: 47 STCHWCHVMEEESFTNPAVAEVMNQDFISIKVDREEHPDVDAYYMKAVQLMTKQGGWPLN 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ--------SGAFA 131
V PD +P+ GGTYFP K W LAQ + FA
Sbjct: 107 VVCLPDGRPIWGGTYFP-----------------KQTWVNALTQLAQLHQNKPEATLEFA 149
Query: 132 IEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 191
+L E + + + +E + L + E+ +S+D +GG+ APKF P +L
Sbjct: 150 T-KLQEGVYIMGLA-PVANEESRFNLDIVLEKWKQSFDLEYGGYQRAPKFMMPTN---LL 204
Query: 192 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
Y L+ G + + TL MA GGI D + GGF RYSVD +WH+PHFEKML
Sbjct: 205 Y----LQKVGDLTRDKDLLHYIDLTLTQMAWGGIFDVLEGGFSRYSVDFKWHIPHFEKML 260
Query: 252 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
YD QL +VY DA+ T + Y + + +++R+ + G I+SA DADS +G +
Sbjct: 261 YDNAQLLSVYSDAYKRTANPLYLEVITKTIQFIQRNWLSDWGGIYSALDADSVNDKGIS- 319
Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+EGA+YVWT + ILG+ LF + + + G + +G VLI+ N
Sbjct: 320 -QEGAYYVWTEATLRRILGDDFSLFAQIFNVNAYGYWE-----------EGHFVLIQ-NQ 366
Query: 372 SSASASKLGMPLEKYLNILGECRRK------LFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
AS + L++ RK L + R RP+PHLDDK+I SWN ++I+
Sbjct: 367 PLASIATANQ-----LDVFDLQERKKKWEQLLLEERDHRPKPHLDDKIICSWNAMLITGL 421
Query: 426 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 485
A ++ Y++ AES +I+ +L DE+ L HS N +
Sbjct: 422 LDAYS----------------ATNETSYLQQAESIYHYIQTYLLDEE-RGLFHSSHNQNA 464
Query: 486 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 545
G+LDDYAF I L+ L+E + +L A L + +LFLD + ++ +
Sbjct: 465 HTLGYLDDYAFYIQALIRLFEHTANQDYLWQAKRLMDLTLDLFLDEKSKFFYFNQASQAN 524
Query: 546 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
+LR E D PS N+V ++L++L + +Y Q A+H + V ++ L
Sbjct: 525 HILRSIETEDNVIPSANAVLCMSLLQLG---VAFEHAHYTQLAQHMIEVMQSNL 575
>gi|359145694|ref|ZP_09179393.1| hypothetical protein StrS4_07994 [Streptomyces sp. S4]
Length = 675
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 235/705 (33%), Positives = 336/705 (47%), Gaps = 93/705 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE A ++N FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SACHWCHVMAHESFEDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P GTYFPPE ++G PGF+ +L V+ AW ++R + + + L E
Sbjct: 108 VFLTPEGEPFYFGTYFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERR 167
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A +LP +E Q L L++ YD GGFG APKFP + ++ +L H +
Sbjct: 168 LALGEP-RLPGAEEAAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR- 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 --TGAEG----ALQMAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALL 274
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY+ + T + + +++ RD+ P G SA DADSA+ G R EGA+
Sbjct: 275 CRVYVHLWRATGSEQARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAY 332
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVWT ++ ++LGE + H+ + G F+ ++ L +
Sbjct: 333 YVWTPAQLVEVLGEEDGRVAAAHFGVTEEGT------------FEEGASVLRLPQEDGAV 380
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G + R +L++ R +RP P DDKV+ +WNGL I++ A A
Sbjct: 381 QDAGR--------IASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF---- 428
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
+R + ++ A +AA +R HL D RL + R+G S G L+DY
Sbjct: 429 ------------ERPDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDY 474
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A + G L L WL +A L + + F D E G ++T + ++ R ++
Sbjct: 475 ADVAEGFLALASVTGEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPT 533
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
D A PSG + + L A + S+ +R AE +L V + + VP +
Sbjct: 534 DNATPSGWTAAAGA---LLGYAAQTGSEPHRTAAERALGV----VAALGPKVPRFIGNGL 586
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFW 666
+L P + V +VG S + A H + L+ V+ PAD E
Sbjct: 587 AVTEALLDGP--REVAVVGDPS----DPRTAVLHRTALLSTAPGAVVAAGPADGE----- 635
Query: 667 EEHNSNNASMARNNFSADKV-VALVCQNFSCSPPVTDPISLENLL 710
+ AD A VC+ F C P TDP L L
Sbjct: 636 -------LPLLAGRVPADGAPTAYVCRGFVCDAPTTDPALLAAQL 673
>gi|357028650|ref|ZP_09090680.1| hypothetical protein MEA186_27750 [Mesorhizobium amorphae
CCNWGS0123]
gi|355537917|gb|EHH07167.1| hypothetical protein MEA186_27750 [Mesorhizobium amorphae
CCNWGS0123]
Length = 672
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 227/694 (32%), Positives = 333/694 (47%), Gaps = 83/694 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE++ VA ++N FV+IKVDREERPD+D++YM + A+ GGWPL++F
Sbjct: 54 CHWCHVMAHESFENDTVAAVMNRLFVNIKVDREERPDIDQIYMAALHAMGEQGGWPLTMF 113
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD KP GGTYFP + +YGRPGF ++ V AW +KR+ LAQS A + E A
Sbjct: 114 LTPDGKPFWGGTYFPRDARYGRPGFIQVMEAVDKAWREKRESLAQS-ADGLTSHVETRLA 172
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
A + + D ++ L A ++ D GG APKFP L+ S + T
Sbjct: 173 GAHTKAVLD---RDTLGDLAGRIDGMIDRELGGLRGAPKFPN-APFMHTLWLSWLRDGTA 228
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+A VL +L+ M GGI+DHVGGG RYS D W VPHFEKMLYD QL +
Sbjct: 229 SHRDA------VLLSLEMMLAGGIYDHVGGGLSRYSTDAEWLVPHFEKMLYDNAQLIRMC 282
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
A++ T + D +++L R+M GG ++ DADS +EG FY W+
Sbjct: 283 NWAYAATGSDLFRLRIEDTVEWLLREMRVDGGAFAASLDADS-------DGEEGLFYTWS 335
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
++ +LG+ + LF ++ L S PH ++GK ++ + + + LG+
Sbjct: 336 RDDINSVLGDDSALFFNYFIL-----------STPHG-WEGKPIIHQ----TQAQQSLGI 379
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
L L + KL R +R RP D K + WNGL+I++ A A + L
Sbjct: 380 ADRDQLAPL---KAKLLAAREQRIRPGRDGKALTDWNGLMIAALAEAGRTLT-------- 428
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
R ++++ A A S I ++ RL HS P DYA + +
Sbjct: 429 --------RSDWIDAAAQAFSHIAGASHE---GRLPHSMLGAKKLFPALSSDYAAMTNAA 477
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
+ L+E ++ A D D E GY+ T + V +R++ D D A PS
Sbjct: 478 ISLFEATGDPNYVEQARHFVAQLDLWHRDSESTGYYLTASDSGDVPIRIRGDVDEAIPSA 537
Query: 562 NSVSVINLVRLASIVA----GSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 617
+S + LVRL+S G K+ AEH++ T + A + CA L+
Sbjct: 538 SSQIIEALVRLSSATGDLDLGEKA---WTTAEHAMG--RTAQQAYGQAGIVNACA---LA 589
Query: 618 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMA 677
+ K VV+ S + +++ A+ + D + I + TE +N ++
Sbjct: 590 LEPLKLVVV----DSPENPSLVPVANRNPDPRRVDIVVQ-VGTE---------ANRPTLP 635
Query: 678 RNNF-SADKVVALVCQNFSCSPPVTDPISLENLL 710
DK A +C C P VTDP LE LL
Sbjct: 636 GGVLPPTDKPGAWLCTGQVCLPVVTDPEELEELL 669
>gi|117929090|ref|YP_873641.1| hypothetical protein Acel_1883 [Acidothermus cellulolyticus 11B]
gi|117649553|gb|ABK53655.1| protein of unknown function DUF255 [Acidothermus cellulolyticus
11B]
Length = 658
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 236/702 (33%), Positives = 328/702 (46%), Gaps = 104/702 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHVM ESFED A +N+ FV +KVDREERPD+D VYM QA+ G GGWPL+
Sbjct: 48 SSCHWCHVMAHESFEDPATAAFMNEHFVCVKVDREERPDIDAVYMEATQAMTGRGGWPLT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
FL+PD +P GTYFP E + G P F+ +L V AW + L + + L +
Sbjct: 108 CFLTPDGEPFFTGTYFPKEPRAGMPAFRQVLEAVWTAWQSRSADLVAAARRVVAVLQQ-- 165
Query: 140 SASASSNKLPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
++L D+L + L +L + YD GGFGSAPKFP ++ +L +
Sbjct: 166 -----GSRLTDDLGAIDADLLDAAVGELRRQYDPVHGGFGSAPKFPSATTLEFLLRY--- 217
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
G G +MV T + MA+GGI+D + GGFHRYSVD W VPHFEKMLYD Q
Sbjct: 218 ----GSLG----AMEMVAVTCEHMARGGIYDQLAGGFHRYSVDAAWTVPHFEKMLYDNAQ 269
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L VYL + T+ I ++ ++L RD+ P G +A DAD+ EG T
Sbjct: 270 LLGVYLHWWRRTQHQLARRIVEEVAEFLLRDLCTPAGGFAAALDADAGGVEGGT------ 323
Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
YVWT E+ D LG + A E + + GN + G++VL D+
Sbjct: 324 -YVWTLAELRDALGSDDAAYAAELFGVTEHGNTE-----------DGRSVLQLAVDAP-- 369
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 435
LE++ I R++L VRS+R +P DDK+I SWNGL ++S A A +L
Sbjct: 370 ------DLERWRRI----RQRLLAVRSRRAQPARDDKIIASWNGLAVASLAEAGFLL--- 416
Query: 436 AESAMFNFPVVGSDRKEYMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDD 493
DR ++ A SA I HL D RL S R+G + G LDD
Sbjct: 417 -------------DRDALVDAAVRSAEYLIDVHLRD---GRLCRSSRDGERNPVDGALDD 460
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRV 550
YA + GLL L + S ++L EL E L E GG+++T + ++ R
Sbjct: 461 YANVAQGLLTLAQIRSEARYL----ELAGALLEAILTHFRAEDGGFYDTADDAERLVRRP 516
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL--AVFETRLKDMAMAVPL 608
+ D A PSGNS + L+ A++ + S +R +L V R A+ L
Sbjct: 517 RTFTDDATPSGNSAAAHALLTYAAL---TGSQRHRDAVPGALRPTVRLARRYPHAVGYGL 573
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEE 668
AA L P+ + +VG S L +T +D +
Sbjct: 574 ATIAA-WLDGPA--EIAVVGDGS----------------LWRTAWLVDRPGAVRAARAAD 614
Query: 669 HNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + +A VC+NF C PV L LL
Sbjct: 615 GPPWAPLLEGRTAPPGQSLAYVCRNFECQRPVASEAELRALL 656
>gi|72160855|ref|YP_288512.1| hypothetical protein Tfu_0451 [Thermobifida fusca YX]
gi|71914587|gb|AAZ54489.1| conserved hypothetical protein [Thermobifida fusca YX]
Length = 665
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 229/690 (33%), Positives = 326/690 (47%), Gaps = 96/690 (13%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF DE A+++N FV++KVDREERPDVD VYM QA+ G GGWP++VF
Sbjct: 50 CHWCHVMARESFADEQTAQIMNANFVNVKVDREERPDVDAVYMEATQAMTGHGGWPMTVF 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
+PD +P GTYFP E F+ +L + AW R + G ++++EALSA
Sbjct: 110 ATPDGEPFYCGTYFPREH------FQRLLLGISHAWRTDRTGVVGQG----KRVAEALSA 159
Query: 142 SASSNKLPDELPQNA--LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
LP P +A L +L+ YD+ GG+G+APKFP ++ +L H ++ D
Sbjct: 160 ---PRTLPSGPPPSAQVLEQAVARLAAEYDTVNGGYGTAPKFPPSPVMEFLLRHHARVSD 216
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
G +E +MV T + MA+GGI+D + GGF RY+VD W VPHFEKMLYD L
Sbjct: 217 ----GAETEALRMVRHTAEAMARGGIYDQLAGGFARYAVDATWTVPHFEKMLYDNALLLR 272
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
Y + T D + + D++ ++ G SA DADS EG +EG +YV
Sbjct: 273 CYTHLWRQTGDELARRVAVETADWMVAELRTAEGGFASALDADS---EG----EEGRYYV 325
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT ++ D+LGE + +L +++ +G +VL D
Sbjct: 326 WTPAQLRDVLGEEDGAWA----------AELFGVTEQGTFERGTSVLQLRADPDDR---- 371
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
E+Y + R +L R+ R P DDKV+ WNGL I+ A A +L
Sbjct: 372 ----ERYAYV----RDRLRKARANRVPPARDDKVVTGWNGLAIAGLAEAGALL------- 416
Query: 440 MFNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFL 497
DR + +E A AA + RH D RL R+G P + G L+DYA L
Sbjct: 417 ---------DRPDLVERAREAARLVVERHYAD---GRLVRVSRDGVPGTSAGVLEDYANL 464
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
GLL L+ +W+ EL T F D GG+++T + ++ R +E D A
Sbjct: 465 AEGLLALHAVTGEIRWVGVCGELLETVLTRFTDGS-GGFYDTADDAEALFNRPREFTDDA 523
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMCC 611
PSG S + L+ A++ + S +R+ AE +L V T R MAV
Sbjct: 524 TPSGWSAAAGALLSYAAL---TGSFRHREAAEAALGVVSTLAEKTPRFAGWGMAV----- 575
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
A +L+ P + +VG K E + A + V D + + E
Sbjct: 576 AEALLAGPV--EIAVVGPKGDPVAEELHRTALLATTPGTVVSRGDGVNDGGIGLLEGRTL 633
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ A A VC+NF+C P T
Sbjct: 634 VDGRPA----------AYVCRNFTCRLPAT 653
>gi|372222108|ref|ZP_09500529.1| hypothetical protein MzeaS_07308 [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 701
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 185/555 (33%), Positives = 296/555 (53%), Gaps = 47/555 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME E FE+E VAKL+N+ F++IK+DREERPDVD++YM +Q + G GGWPL++
Sbjct: 78 CHWCHVMEEECFENEEVAKLMNENFINIKIDREERPDVDQIYMDAIQMMTGNGGWPLNIV 137
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS- 140
PD +P G TY P ++ + L+ + D + + + Q A +EQ +A++
Sbjct: 138 ALPDGRPFWGATYLPKDN------WTKSLKSLIDLYHNDPEKV-QEYAGKLEQGIQAINL 190
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
++K+ + L L + S S+D+ GG+ APKF P ++ +L+++
Sbjct: 191 VENKTSKI--HFTKEELDLAVQNWSTSFDTYLGGYKRAPKFMMPNNLEYLLHYA------ 242
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+ + + V TL MA GGI D + GGF RY+VD +WHVPHFEKMLYD GQL ++
Sbjct: 243 -TANKNDTILEYVNTTLTRMAYGGIFDPIDGGFSRYAVDVKWHVPHFEKMLYDNGQLISL 301
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y A+++TK+ Y + + +++ G +S+ DADS G + +EGA+YVW
Sbjct: 302 YSKAYAVTKNSLYKETVEKSVGFATLELLDTNGGFYSSLDADSKNNSG--KLEEGAYYVW 359
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T KE++ ILG + +FK +Y + G + + K VLI + A LG
Sbjct: 360 TEKELDSILGSESSVFKTYYNINSYGYWE-----------EDKYVLIRDASDNELADSLG 408
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+ + + ++L VR +R +P LDDK++ SWNGL++ A + L+++
Sbjct: 409 IATTNLTQQIAKNLKQLKKVRGQREKPRLDDKILTSWNGLMLKGLTDAYRYLQND----- 463
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
+Y+++A A+F+ + + + + + +NG S GFLDDYA LI G
Sbjct: 464 -----------KYLQLALKNANFLEQEIIQDD-FSVYRNHKNGKSSINGFLDDYATLIDG 511
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
+ LYE +WL A L + F D+E ++ T+ D ++ R E +D +
Sbjct: 512 FIGLYEVTFDDRWLTLAKNLTDYAITHFKDQESNMFYYTSDLDDKLIRRSIETNDNVISA 571
Query: 561 GNSVSVINLVRLASI 575
NS+ NL +L +
Sbjct: 572 SNSIMANNLYKLHKV 586
>gi|383775980|ref|YP_005460546.1| hypothetical protein AMIS_8100 [Actinoplanes missouriensis 431]
gi|381369212|dbj|BAL86030.1| hypothetical protein AMIS_8100 [Actinoplanes missouriensis 431]
Length = 688
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 237/720 (32%), Positives = 349/720 (48%), Gaps = 82/720 (11%)
Query: 11 KTRRTHFLIN----TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTY 66
K R LI+ +CHWCHVM ESFED +A +N+ FVS+KVDREERPDVD VYMT
Sbjct: 34 KRRDVPLLISVGYSSCHWCHVMAHESFEDAAIAAQMNEGFVSVKVDREERPDVDAVYMTA 93
Query: 67 VQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ 126
QA+ G GGWP++VF +PD P GTYF P D++GR +L V AW +RD + +
Sbjct: 94 TQAMTGQGGWPMTVFATPDGDPFFCGTYF-PRDQFGR-----LLASVTTAWRDQRDDVLK 147
Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
GA +E + A + P + + L A+ L+K D +GGFG APKFP +
Sbjct: 148 QGAAVVEAVGGAQMIGGP--RAP--ISGDLLAAAAQGLAKEQDQTYGGFGGAPKFPPHMN 203
Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
+ +L H ++ TG +++ ++V + MA+GGI+D + GGF RY+VDE W VPH
Sbjct: 204 LLFLLRHHER---TG----SADALEIVRHACERMARGGIYDQLAGGFARYAVDETWTVPH 256
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
FEKMLYD L VY + LT D+F I + +L RD+ G + SA DAD++
Sbjct: 257 FEKMLYDNALLLRVYTQLWRLTGDLFARRIADETAAFLLRDLGTAQGGLASALDADTSGV 316
Query: 307 EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFK---- 361
EG T Y WT E+ + LG E + + + G + S P +
Sbjct: 317 EGLT-------YAWTPAELAEALGAEDGAWAADLFRVTEPGTFAHNSASAPIDGAADRMK 369
Query: 362 ----GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
GK+VL+ D + + +E++ ++ R++L R+ RP+P DDKV+ SW
Sbjct: 370 GVEHGKSVLVLARDIDEADPAI---VERWRDV----RQRLLTARNGRPQPARDDKVVASW 422
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL I++ A +L A S R + +AE A RHL D RL+
Sbjct: 423 NGLAITALAE-HGVLTGSAGS-----------RDAAVALAEVLAD---RHLVD---GRLR 464
Query: 478 HSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
R+G + P G L+DY + L +++ + +WL A EL + F + GG+
Sbjct: 465 RVSRDGVAGEPAGVLEDYGSVAEAFLAVHQVTASPRWLTLAGELLDVALARFGSGD-GGF 523
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
++T + +L R + D A PSG SV LV A++ S S +R+ A+ +LA
Sbjct: 524 YDTADDAEKLLTRPADPTDNATPSGLSVVCAALVSYAAL---SGSTAHREAADAALATVG 580
Query: 597 TRLKDMA-MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHI 655
+ A A L+ P + + + + ++ AAH S TVI +
Sbjct: 581 PLIGGHPRFAGYAAAVAEAALTGP---YEIAIATTDRTAADPLVEAAHWSAP-GGTVIVV 636
Query: 656 DPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKPS 715
D + +A A VC+ F C PVT P L + L + P+
Sbjct: 637 GEPDRPGVPL----------LADRPLIGGASTAYVCRGFVCDRPVTTPGDLADRLGQSPT 686
>gi|443288943|ref|ZP_21028037.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
08]
gi|385888344|emb|CCH16111.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
08]
Length = 680
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 209/586 (35%), Positives = 285/586 (48%), Gaps = 60/586 (10%)
Query: 10 TKTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 65
K R LI+ CHWCHVM ESFE+E VA LLND FVSIKVDREERPDVD VYMT
Sbjct: 33 AKRRDVPVLISVGYAACHWCHVMAHESFENEQVAALLNDNFVSIKVDREERPDVDAVYMT 92
Query: 66 YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA 125
QA+ G GGWP++VF +PD P GTYFP R F +L+ V AW +R +
Sbjct: 93 ATQAMTGQGGWPMTVFATPDGTPFFCGTYFP------RANFVRLLQSVTTAWADQRAEVL 146
Query: 126 QSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV 185
+ GA +E + A + + L L L A L+ YD+ GGFG APKFP +
Sbjct: 147 RQGAAVVEAIGGAQAVGGPTAPLDGPL----LDAAAGNLASGYDATNGGFGGAPKFPPHM 202
Query: 186 EIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 245
+ +L H ++ D ++V T + MA+GGI+D + GGF RYSVD W VP
Sbjct: 203 NLLFLLRHHQRTGD-------PRSLEIVRHTAEAMARGGIYDQLAGGFARYSVDAHWTVP 255
Query: 246 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 305
HFEKMLYD L VY + LT D + RD +L ++ PG SA DAD+
Sbjct: 256 HFEKMLYDNALLLRVYAQLWRLTGDPLARRVARDTARFLADELHRPGEGFASALDADTEG 315
Query: 306 TEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 365
EG T Y WT ++ + LGE F DL ++D G +V
Sbjct: 316 VEGLT-------YAWTPAQLVEALGEDDGRFA----------ADLFTVTDEGTFEHGMSV 358
Query: 366 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 425
L D A ++ ++ ++G+ L R RP+P DDKV+ +WNGL I++
Sbjct: 359 LRLARDVDDVAPEV---RARWQRVVGQ----LLAARDTRPQPARDDKVVAAWNGLAITAI 411
Query: 426 AR----ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR 481
A A+ E E A V + AE A+ H+ D RL+ R
Sbjct: 412 AEFLQVAALYASPEDEDANLMEGVTIVADGAMRDAAEHLATV---HVVD---GRLRRVSR 465
Query: 482 NGPSKAP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 540
+G AP G L+DY + L++ +WL A +L + E F GG Y++T
Sbjct: 466 DGRVGAPAGVLEDYGCVAEAFCALHQLTGEGRWLTVAGQLLDAALEHFA-APGGAYYDTA 524
Query: 541 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
+ ++ R + D A PSG S V LV A++ ++ YR+
Sbjct: 525 DDAEQLVARPADPTDNATPSGRSALVAGLVSYAALTGETR---YRE 567
>gi|427707072|ref|YP_007049449.1| hypothetical protein Nos7107_1658 [Nostoc sp. PCC 7107]
gi|427359577|gb|AFY42299.1| hypothetical protein Nos7107_1658 [Nostoc sp. PCC 7107]
Length = 685
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 211/623 (33%), Positives = 315/623 (50%), Gaps = 82/623 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDGAIADYMNTNFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
FLSP DL P GTYFP + +YGRPGF +L+ ++ +D +++ L Q A ++ L
Sbjct: 108 TFLSPEDLVPFYAGTYFPVDPRYGRPGFLQVLQALRRYYDTEKEDLRQRKAVILDSL--- 164
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHSK 195
L+++ N P E+ ++ L L K +++ G S FP M+ Y
Sbjct: 165 LTSAVLQNSDPQEVQEHEL------LGKGWETSTGIITSNQYGNSFP------MIPYSEL 212
Query: 196 KLEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
L T + + +G+++ +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD
Sbjct: 213 ALRGTRFNLPSRYDGKQICTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDN 272
Query: 255 GQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
GQ+ + +S ++ ++ + +L+R+MI P G ++A+DADS A +
Sbjct: 273 GQIVEYLANLWSAGIQEPAFARAIAGTVQWLQREMIAPEGYFYAAQDADSFTNSDAVEPE 332
Query: 314 EGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGAFYVW+ ++E +L E ++ + + GN F+ NVL N
Sbjct: 333 EGAFYVWSYSDLEQLLTSEELTQLQQEFTVSSQGN------------FESLNVLQRRN-- 378
Query: 373 SASASKLGMPLEKYLNILGECRR-------KLFDV--RSKRPRPH---------LDDKVI 414
+L +E+ L L R K+F ++ + H D K+I
Sbjct: 379 ---VGQLSAEIERILAKLFTARYGDKAESLKIFPPARNNQEAKTHNWPGRIPSVTDTKMI 435
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQT 473
V+WN L+IS ARA + F P+ Y+E+A AA+FI H + D +
Sbjct: 436 VAWNSLMISGLARAGGV---------FQEPL-------YLELAAQAANFILEHQFVDGRF 479
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDRE 532
HRL + G + +DYAF I LLDL +WL AI +Q DE E
Sbjct: 480 HRLNY---QGEATVLAQSEDYAFFIKALLDLQACSPDDQQWLENAIAIQAEFDEFLWSVE 536
Query: 533 GGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
GGYFNT+ + +++R + D A PS N V++ NLVRL+ + + + +Y AE
Sbjct: 537 LGGYFNTSSDASQDLIIRERSYTDNATPSANGVAIANLVRLSLL---TDNLHYLDLAEQG 593
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F + + A P + A D
Sbjct: 594 LKAFRSVMSSHPQACPSLFTALD 616
>gi|220935906|ref|YP_002514805.1| hypothetical protein Tgr7_2744 [Thioalkalivibrio sulfidophilus
HL-EbGr7]
gi|219997216|gb|ACL73818.1| conserved hypothetical protein [Thioalkalivibrio sulfidophilus
HL-EbGr7]
Length = 676
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 228/686 (33%), Positives = 339/686 (49%), Gaps = 70/686 (10%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT-YVQALYGGGGWPL 78
+ CHWCHVM ESFED A+++N +V+IKVDREERPD+DK+Y T + GGWPL
Sbjct: 52 SACHWCHVMAHESFEDPATAQVMNRLYVNIKVDREERPDLDKIYQTAHFMLSQRSGGWPL 111
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
++FL+PD P GGTYFP ++G P F+ +L ++ + ++RD + + A L A
Sbjct: 112 TMFLTPDQVPFFGGTYFPDAPRHGLPAFRDLLERIAGFYHERRDEIERQNA----SLQGA 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L+ S D L L +++ +D R GGFG+ PKFP P ++ +L H +
Sbjct: 168 LTGLFSPRGH-DPLNSAVLDTVRSAIAQQFDERDGGFGTPPKFPHPSTLERLLRHHAQTH 226
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D + M FTL+ MA+GG++D + GGF RYS D +W +PHFEKMLYD G L
Sbjct: 227 D-------ERARYMACFTLEKMARGGLNDQLAGGFCRYSTDGQWMIPHFEKMLYDNGPLL 279
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+Y A++ T D +++ + + + M P G +SA DADS EG +EG +Y
Sbjct: 280 ALYAQAYAATGDAYFADVAGRTAAWAVQTMQSPEGGFYSALDADS---EG----EEGRYY 332
Query: 319 VWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VW +EV ++ E +F Y L N F+G+ L A
Sbjct: 333 VWQPEEVRKLVPEEVYPVFARVYGLDRGPN------------FEGRWHLHSFVTPEQLAK 380
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ G ++ R L R KR P LDDK++ SWN L+I A A++ L
Sbjct: 381 ESGTDEATIEAMIEAARAPLLAARDKRVPPGLDDKILTSWNALMIRGLAVAARHLG---- 436
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
R E+++ A A FIR L+ + RL +++NG ++ +LDD+A+L
Sbjct: 437 ------------RSEWVDAASRALDFIRAQLW--RDGRLLATYKNGSARLSAYLDDHAYL 482
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
+ LL+L + T+ LV+A E+ F D E GG+F T + +++ R K D A
Sbjct: 483 LDALLELLQVRWRTEDLVFAREIAEILLAHFEDSEHGGFFFTADDHEALIQRPKTFADEA 542
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAADML 616
PSGN V+ + L RL ++ + Y + AE ++ + T + MA L+ + L
Sbjct: 543 MPSGNGVAALALNRLGHLLGEPR---YVEAAERTVRLATTLMDQAPMAHASLISAFEEQL 599
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+P K V+L G + E A Y + V I PAD ++ E + A
Sbjct: 600 YLP--KLVILRGEAQRI--ETWRAELERDYAPRRLVFAI-PADASDL---PEALATKAPK 651
Query: 677 ARNNFSADKVVALVCQNFSCSPPVTD 702
+ VA VC CS PVTD
Sbjct: 652 G-------EAVAYVCTGTRCSAPVTD 670
>gi|443327996|ref|ZP_21056601.1| thioredoxin domain containing protein [Xenococcus sp. PCC 7305]
gi|442792405|gb|ELS01887.1| thioredoxin domain containing protein [Xenococcus sp. PCC 7305]
Length = 682
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 213/623 (34%), Positives = 304/623 (48%), Gaps = 79/623 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN+ FV IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDNAIADYLNNNFVPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP +Y RP F IL+ V+ +D + + L + L +
Sbjct: 108 IFLTPGDLVPFYGGTYFPVTPRYNRPSFIDILKSVRRFYDVETEKLEGFKTEILFNLQRS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
S + + L EL L LS R P FP M+ Y + L+
Sbjct: 168 TSLETTEDALTSELLDQGLETNTAVLSSGDPGR-------PNFP------MIPYATAALQ 214
Query: 199 DTGKS-GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ + + K+ L Q + GGI DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 215 GSRLNFNNRYDADKLCLQRGQDLVLGGICDHVAGGFHRYTVDHTWTVPHFEKMLYDNGQI 274
Query: 258 ANVYLDAFSLTKDVF-YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ +S + I+++L+R+M+ P G ++++DAD+ T A +EG
Sbjct: 275 LEYLANLWSCQRHFLTIEDAIAGIVNWLKREMLAPQGYFYASQDADNFATAEAAEPEEGL 334
Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVW+ E+E++L E + + + P GN F+G NVL N S
Sbjct: 335 FYVWSYNELENLLSAEELAELQAEFSITPQGN------------FEGSNVLQRFNHEELS 382
Query: 376 ASKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHLDDKVIVSW 417
S LE+ L L R + + ++K R P D K+I +W
Sbjct: 383 PS-----LEQTLQKLFAARYGEKQTGIDTFPVAKNNREAKTKPWPGRIPPVTDTKMITAW 437
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRL 476
N L+IS ARA+ +L Y ++AE+ A+FI + + E + HRL
Sbjct: 438 NSLIISGLARAASVLGI----------------TNYQQLAENTANFILQQQWLEGRLHRL 481
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGG 535
+ +G + +DYA I LLDL++ +WL AI LQ D LF GGG
Sbjct: 482 NY---DGQATVLAQSEDYALFIKALLDLHQSSPQNPQWLDSAIALQAEFDRLFWSEMGGG 538
Query: 536 YFNTTGED--PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
Y+N G D ++L+R + D A P+ N V++ NLVRL + + YR AE L
Sbjct: 539 YYN-NGSDVGDNLLIRERSYMDNATPAANGVAMANLVRLFLLTDNLE---YRDRAEQGLQ 594
Query: 594 VFETRLKDMAMAVPLMCCAADML 616
F +K A P + A D L
Sbjct: 595 AFAGIMKSSPQACPSLFVALDWL 617
>gi|378728836|gb|EHY55295.1| hypothetical protein HMPREF1120_03437 [Exophiala dermatitidis
NIH/UT8656]
Length = 842
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 215/630 (34%), Positives = 295/630 (46%), Gaps = 106/630 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
N CHWCHVME ESF VA LN F+ IKVDRE RPD+D +YM YV A G GGWPL+
Sbjct: 57 NACHWCHVMERESFSSPEVASFLNKHFIPIKVDRECRPDLDDIYMNYVTATTGSGGWPLN 116
Query: 80 VFLSPDLKPLMGGTYFPPEDKY-----------GRPGFKTILRKVKDAWDKKR------- 121
VFL+PDL+P+ GGTY+P P F ILRK+++ W +R
Sbjct: 117 VFLTPDLRPVFGGTYWPGPSSTTNLHRKASHDEAAPSFLDILRKMQEVWSTQRERCRRSS 176
Query: 122 -DMLAQSGAFAIEQLSEALSAS-----ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
D+ Q AFA E + + S +S ++ P+ L + L YDS GGF
Sbjct: 177 TDITTQLRAFAAEGIHSQSNGSVRDGGSSGSEEPEPLELDLLDDALNHFIARYDSTNGGF 236
Query: 176 GSAP---KFPRPVEIQMMLYHSKKLEDT-------------GKSGEAS--EGQKMVLFTL 217
++ KFP P + +L + G GE S + M L TL
Sbjct: 237 SASTNGQKFPTPSNLAFLLRIGAAIAQPSTHTRFGFFSPVLGILGEDSCLKAASMALHTL 296
Query: 218 QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYIC 277
+ M++ G+ D +G GFHRYSV W++PHFEKM+ D QL Y DA++L +D
Sbjct: 297 KAMSRSGLRDQLGYGFHRYSVTPDWNLPHFEKMMCDNAQLLGCYCDAWALGRDPEILGTI 356
Query: 278 RDILDYL---RRDMIGPGGEIFSAEDADS--------AETEGA-TRKKEGAFYVWTSKEV 325
++++Y ++ PGG +++EDADS TE A KKEGAFYVWT KE+
Sbjct: 357 YNLVEYFTNPESPIVRPGGGWYASEDADSRPSRTGNGGGTETAHNEKKEGAFYVWTYKEL 416
Query: 326 EDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 384
E +LGE A + H+ +KP GN + D H+EF +NVL S A + G+ +
Sbjct: 417 ESLLGEQDAPIIARHFGVKPHGN--VPAQHDIHDEFLSQNVLHVDATPSTLAKEFGIAED 474
Query: 385 KYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 443
+ + I+ R KL + R ++R P +D VI SWNGL I+S RA+ L +
Sbjct: 475 EVVRIIKRGRTKLLEHRKAEREPPQVDTNVIASWNGLAIASLTRAANTLAT--------- 525
Query: 444 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG-------------- 489
V E AE AA+F+ +YD T RL P
Sbjct: 526 -VDKHRAARCQEAAERAATFVHCAMYDPTTGRLARIANATDKSRPRSRSKSASHASNNDN 584
Query: 490 -------------FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
F+DDYA++ L LY+ +L WA++LQ D F D G
Sbjct: 585 DNSNGGGGGSNIVFVDDYAYMTQAALMLYDLTLSQPYLDWAVQLQEYLDTHFADVTEGSS 644
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 566
+ G D GA +G S+S
Sbjct: 645 TSGAGTD-----------KGASANGASIST 663
>gi|328541699|ref|YP_004301808.1| Thioredoxin domain protein [Polymorphum gilvum SL003B-26A1]
gi|326411451|gb|ADZ68514.1| Thioredoxin domain protein [Polymorphum gilvum SL003B-26A1]
Length = 670
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 227/701 (32%), Positives = 331/701 (47%), Gaps = 95/701 (13%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFED A+++N FV+IKVDREERPD+D++YM + AL GGWPL++
Sbjct: 50 ACHWCHVMAHESFEDPATAEVMNRLFVNIKVDREERPDIDQIYMNALHALGEQGGWPLTM 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD +P GGTYFP E ++GRP F IL V + +R + ++ ++ L +
Sbjct: 110 FLTPDGEPFWGGTYFPKEARWGRPAFVDILEAVAATYRSERSRIDRNRTGLMQVLKQRAQ 169
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+A L L L ++L +D GG APKFP+ + ++ + T
Sbjct: 170 PAAP-------LDSAILVLAGDRLLSLFDPEHGGIRGAPKFPQASILDLVWRAGLR---T 219
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G ++ L TL+ ++ GGI+DH+ GG RYSVDERW VPHFEKMLYD Q
Sbjct: 220 GNPA----ARETFLHTLRQISNGGIYDHLKGGIARYSVDERWLVPHFEKMLYDNAQYLQH 275
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
L A+ T + + + + +L +M P G S+ DADS EG +EG FYVW
Sbjct: 276 LLTAWLATGEDLFRCRIDETVGWLLDEMRLPEGGFASSLDADS---EG----EEGRFYVW 328
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T+ EV ++LG A F Y + GN ++G +L L ++AS
Sbjct: 329 TAAEVAEVLGADAAFFARFYDISAAGN------------WEGVTILNRLTGTAAS----- 371
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
P E+ N L R KL R+ R RP LDDKV+ WNGL+I++ ARA +I+
Sbjct: 372 -PEEE--NRLAALRAKLLSRRASRVRPALDDKVLADWNGLLIAALARAGRIVS------- 421
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
R+ ++ AE A FI + RL H++R G PGF D+A ++
Sbjct: 422 ---------RESWIAAAEQAFRFIAESM--TGGGRLGHAWRAGRLVFPGFASDHAAMMQA 470
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLD-------REGGGYFNTTGEDPSVLLRVKED 553
+ L E W + E F D GGG++ T + ++LR
Sbjct: 471 AIALAEARP------WDAQHYLRIAEGFADALVRHYAAPGGGFYMTADDATDLILRPLSS 524
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D A P+ NSV+ RL + + +R A+ F + A + CA
Sbjct: 525 ADEAVPNANSVAADAFARLYLLTGDRR---HRDVADAVFHAFAGDVPKNLFATASLLCAF 581
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPA----DTEEMDFWEEH 669
D + R VV+ + S D N++ + L++ V DPA TE D +
Sbjct: 582 DT-RINGRLAVVVAPNGS--DPSNLVDS------LDRAV---DPALTRLVTESTDGLPKD 629
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + A + + A VC+ +CS P L+ L
Sbjct: 630 HPAHGKPALDG----RPAAYVCREGACSLPAATTTELQRTL 666
>gi|291451582|ref|ZP_06590972.1| conserved hypothetical protein [Streptomyces albus J1074]
gi|291354531|gb|EFE81433.1| conserved hypothetical protein [Streptomyces albus J1074]
Length = 675
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 234/705 (33%), Positives = 335/705 (47%), Gaps = 93/705 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE A ++N FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SACHWCHVMAHESFEDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P GTYFPPE ++G PGF+ +L V+ AW ++R + + + L E
Sbjct: 108 VFLTPEGEPFYFGTYFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERR 167
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A +LP +E Q L L++ YD GGFG APKFP + ++ +L H +
Sbjct: 168 LALGEP-RLPGAEEAAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR- 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 --TGAEG----ALQMAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALL 274
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY+ + T + + +++ RD+ P G SA DADSA+ G R EGA+
Sbjct: 275 CRVYVHLWRATGSEQARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAY 332
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVWT ++ ++LGE + H+ + G F+ ++ L +
Sbjct: 333 YVWTPAQLVEVLGEEDGRIAAAHFGVTEEGT------------FEEGASVLRLPQEDGAV 380
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G + R +L++ R +RP P DDKV+ +WNGL I++ A A
Sbjct: 381 QDAGR--------IASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF---- 428
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
+R + ++ A +AA +R HL D RL + R+G S G L+DY
Sbjct: 429 ------------ERPDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDY 474
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A + G L L WL +A L + + F D E G ++T + ++ R ++
Sbjct: 475 ADVAEGFLALASVTGEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPT 533
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
D A PSG + + L A + S+ +R AE +L V + + VP +
Sbjct: 534 DNATPSGWTAAAGA---LLGYAAQTGSEPHRTAAERALGV----VAALGPKVPRFIGNGL 586
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFW 666
+L P + V +VG + A H + L+ V+ PAD E
Sbjct: 587 AVTEALLDGP--REVAVVGDPD----DPRTAVLHRTALLSTAPGAVVAAGPADGE----- 635
Query: 667 EEHNSNNASMARNNFSADKV-VALVCQNFSCSPPVTDPISLENLL 710
+ AD A VC+ F C P TDP L L
Sbjct: 636 -------LPLLAGRVPADGAPTAYVCRGFVCDAPTTDPALLAAQL 673
>gi|422304439|ref|ZP_16391784.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9806]
gi|389790409|emb|CCI13705.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9806]
Length = 692
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 214/623 (34%), Positives = 309/623 (49%), Gaps = 80/623 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
L SA + L +L + + + P FP + L S+
Sbjct: 164 LRQSAILPRAETNLAAPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYANLALQGSRFGD 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
ED+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 224 DFEDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S ++ + + +++L+R+M P G ++A+DADS E +E
Sbjct: 276 QIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDREPEE 335
Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ E+ D L + L + ++ + GN F+G+NVL
Sbjct: 336 GAFYVWSHLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378
Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
KLG +E L+ L G + +L R D K+IV
Sbjct: 379 RQGGKLGKDIENMLDKLFIRRYGSSQSQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
+WN L+IS ARA A+F P+ Y ++A AA FI +H + D +
Sbjct: 439 AWNSLMISGLARA---------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQ 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
RL + G + +D+A+ I LLDL T WL AIELQ D F +
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTANPQETGWLEAAIELQGEFDRWFWAEDE 539
Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
GGYFN T D S+ L V+E D A PS N +++ NL+RL+ + + Y AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKA 595
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F T L+ A P + A D
Sbjct: 596 LQSFSTILEQSPTACPSLFVALD 618
>gi|218246233|ref|YP_002371604.1| hypothetical protein PCC8801_1388 [Cyanothece sp. PCC 8801]
gi|218166711|gb|ACK65448.1| protein of unknown function DUF255 [Cyanothece sp. PCC 8801]
Length = 688
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 220/618 (35%), Positives = 309/618 (50%), Gaps = 70/618 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A LND F+ IK+DREERPD+D +YM VQ + GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIAAYLNDNFLPIKLDREERPDLDSLYMQAVQMMGIQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP E +YGRPGF +L+ ++ +D ++D L +F E
Sbjct: 108 IFLTPDDLVPFYGGTYFPIEPRYGRPGFLQVLQSIRRFYDTEKDKL---NSFK----HEI 160
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPK-FPRPVEIQMMLYHSKKL 197
L S LP NA L E + + P+ F RP M+ Y + L
Sbjct: 161 LDTLQKSAILP---VTNAELLNNELFYRGITANTEVIIVNPQDFNRPC-FPMIPYANLAL 216
Query: 198 EDTGKSGEASEGQKMVLFTL-QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ + + ++ E Q V + + +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 217 QGSRFAFQSQENQATVTYQRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQ 276
Query: 257 LANVYLDAFSL--TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
+ + +S + F I R + ++L+R+M P G ++A+DAD+ T +E
Sbjct: 277 IVEYLANLWSQGHQEPAFKRAIARTV-EWLQREMTAPQGYFYAAQDADNFTTPDEKEPEE 335
Query: 315 GAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW +E+ED L E L + + L GN F+G NVL
Sbjct: 336 GAFYVWKYQELEDCLTSEELKLLEATFSLTAEGN------------FEGSNVLQRRMGGE 383
Query: 374 ASASKLGMPLEKYLNI-LGECRRKLF-------------DVRSKRPRPHLDDKVIVSWNG 419
S + L + L+K I G R+ L R P D K+IV+WN
Sbjct: 384 FSEA-LEVILDKLFMIRYGSSRKTLTTFPPAKNNQEAKNQTWPGRIPPVTDTKMIVAWNS 442
Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQH 478
L+IS ARA + F P+ Y E+A +A FI + + + + +RL +
Sbjct: 443 LMISGLARAYGV---------FGDPL-------YWELAINATEFILQEQWVNNRLYRLNY 486
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYF 537
G +DYAF I LLDL + + WL A E+Q DE F EGGGY+
Sbjct: 487 E---GQPSVLAQAEDYAFFIKALLDLQKANPWERQWLEKAKEVQEEFDEFFWSIEGGGYY 543
Query: 538 NTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
N ++ +L+R + D A PS N V++ NLVRL+ + Y AE L F
Sbjct: 544 NNASDNSGDLLIRERSYIDNATPSANGVALSNLVRLSRLTDDLD---YLHRAEQGLQTFS 600
Query: 597 TRLKDMAMAVPLMCCAAD 614
+ L A P + A D
Sbjct: 601 SVLSQSPKACPSLFVALD 618
>gi|428224685|ref|YP_007108782.1| hypothetical protein GEI7407_1235 [Geitlerinema sp. PCC 7407]
gi|427984586|gb|AFY65730.1| hypothetical protein GEI7407_1235 [Geitlerinema sp. PCC 7407]
Length = 682
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 229/715 (32%), Positives = 338/715 (47%), Gaps = 116/715 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F + +A +ND+FV IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSNGAIAAYMNDFFVPIKVDREERPDLDSIYMQSLQLMVGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+P DL P GGTYFP + +YGRPGF +L+ ++ +D ++D ++ +E L EA
Sbjct: 108 VFLAPDDLVPFYGGTYFPVDPRYGRPGFLQVLQAIRRHFDTEKDKVSAVKQEILEHLQEA 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG---GFGSAPKFPRPVEIQMMLYHSK 195
S P L + L+KS + G G P FP M+ Y
Sbjct: 168 GSLE----------PGQGSDLTHDLLAKSLEYSTGILSARGPGPSFP------MIPYGEA 211
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
T S E + + + +A GGI+DHV GGFHRY+VD W VPHFEKMLYD G
Sbjct: 212 AQRATRLSLERYDAGTICQQRGEHLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNG 271
Query: 256 Q----LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
Q LAN + A +T+ F I + +L+R+M G ++A+DAD+ + A
Sbjct: 272 QILEYLANEW--ARGVTEPAFERAIAGTV-TWLKREMTDAQGYFYAAQDADNFTSPEALE 328
Query: 312 KKEGAFYVWTSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EG FYVW E+ +L E A L +E + + P+GN F+G+NVL
Sbjct: 329 PEEGDFYVWRYDELAALLTPAELAAL-QEEFTVTPSGN------------FEGRNVLQRS 375
Query: 370 NDSSAS-----------ASKLGMPLEKYLNILGECRRKLFDVRS--KRPRPHLDDKVIVS 416
+ S S A + G P ++ ++ R P D K+I +
Sbjct: 376 REGSLSEVAEAALAKLFAVRYGAPPVAVPTFPPAPSAQVAKTQTWPGRIPPVTDTKMIAA 435
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHR 475
WN L+IS ARA+ + + R+EY ++A AA F+ H + E + HR
Sbjct: 436 WNSLMISGLARAAAVWQ----------------REEYYQLAAGAARFLLAHQWVEGRFHR 479
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGG 534
L + +G + +DYA I L+DL + G + W+ A+++Q D L EGG
Sbjct: 480 LNY---DGEASVLAQSEDYALFIKALIDLDQARPGAEDWIEQAVKVQREFDALLGAEEGG 536
Query: 535 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
Y +++R + D A P+ NS+++ NLVRLA + ++ Y AE +L
Sbjct: 537 YYNAARDRSQDLVIRERSYADNATPAPNSIAIANLVRLALL---TEDLSYLDRAEKALQS 593
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
F + A P M A D+ R H+++ +++ D LAA + + K
Sbjct: 594 FSAPMARSPQACPSMFGALDLY----RNHLLI---RATPDVLQTLAARYCPTAVYKVADE 646
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
+ + V LVCQ SC P SLE L
Sbjct: 647 L---------------------------PEGAVGLVCQGLSCQEPAR---SLEQL 671
>gi|411116326|ref|ZP_11388814.1| thioredoxin domain-containing protein [Oscillatoriales
cyanobacterium JSC-12]
gi|410713817|gb|EKQ71317.1| thioredoxin domain-containing protein [Oscillatoriales
cyanobacterium JSC-12]
Length = 698
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 235/721 (32%), Positives = 344/721 (47%), Gaps = 122/721 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +AK +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 60 SSCHWCTVMEGEAFSDQEIAKFMNTNFLPIKVDREERPDLDSIYMQALQMMTGQGGWPLN 119
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP E +YGRP F +L V+ +D+++ L A E
Sbjct: 120 IFLTPDDLVPFYGGTYFPVEPRYGRPSFLQVLEGVRRFYDQEKTKLQSVKA-------EI 172
Query: 139 LSASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
LS SS LP + LP++ E + S+ G P FP M+ ++
Sbjct: 173 LSNLQSSTLLPAVEALPRDVFLHGLEYNTGVISSKSVG----PSFP-------MIPYADV 221
Query: 197 LEDTGKSGEASEGQKMVLFTLQC--MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+ + S + + T + +A GGI DHVGGGFHRY+VD W VPHFEKMLYD
Sbjct: 222 AQRAMRFLAKSRYNALEVSTQRGIDLALGGIFDHVGGGFHRYTVDPTWTVPHFEKMLYDN 281
Query: 255 GQLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
GQ+ + +S + + F I + ++L+R+M P G ++A+DADS + AT
Sbjct: 282 GQIMEYLANQWSADVQEPAFKRAIALTV-EWLQREMTAPEGYFYAAQDADSFTSPDATEP 340
Query: 313 KEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+EGAFYVW E+ +L E + + + GN F+G NVL +
Sbjct: 341 EEGAFYVWGYDELTTLLTEKELREMQTQLTITEKGN------------FEGVNVL-QRRH 387
Query: 372 SSASASKLGMPLEKYLNI---LGECRRKLF-DVRSKRPR----------PHLDDKVIVSW 417
S + + L+K I +G R K F R+ R P D K+IV+W
Sbjct: 388 SGQLSEAIETALDKLFQIRYGIGTDRIKPFPPARNNREAQEMPWAGRIPPVTDTKMIVAW 447
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRL 476
N L+IS ARA+ + ++ + ++E+A +A FI R + + HR+
Sbjct: 448 NSLMISGLARAAAVFQNCS----------------WLELAVNATQFILERQWVENRLHRV 491
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYE-------FGSGTKWLVWAIELQNTQDELFL 529
+ NG +DYA I LLDL++ + + +L A+ +Q DE
Sbjct: 492 NY---NGQPSVLAQSEDYALFIKALLDLHQAYQSLDSVAALSSFLDAAVRVQAELDEFLW 548
Query: 530 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
E GGYFN T P +L+R + D A P+ N V+V NLVRLA + ++ Y AE
Sbjct: 549 SVELGGYFN-TDRTPDLLVRERSYMDNATPAANGVAVANLVRLALL---TEDLSYLDRAE 604
Query: 590 HSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLN 649
+L F + ++ A P + D H LV +++ D +LAA + +
Sbjct: 605 QTLKAFGSVMERSPQACPSLFVGMDWF-----LHQTLV--RATPDAIALLAAQYQPTVMY 657
Query: 650 KTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENL 709
KT + + PA V LVCQ SC P S+E L
Sbjct: 658 KTEVDL-PAGA--------------------------VGLVCQGLSCKEPAR---SMEQL 687
Query: 710 L 710
L
Sbjct: 688 L 688
>gi|358396472|gb|EHK45853.1| hypothetical protein TRIATDRAFT_241655 [Trichoderma atroviride IMI
206040]
Length = 726
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 202/638 (31%), Positives = 319/638 (50%), Gaps = 71/638 (11%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+ +M +ESF + A +LN F+ I VDRE RPD+D +YM YVQA+ GG
Sbjct: 67 HIGYRACHFSRLMALESFMNPDCAAVLNHSFIPIIVDREVRPDIDTIYMNYVQAVSNSGG 126
Query: 76 WPLSVFLSPDLKPLMGGTYFP--------PEDKYGRP-GFKTILRKVKDAWDKKR----- 121
WPL++FL+P+L+P+ GGTY+P ED P F I++KV++ W ++
Sbjct: 127 WPLNLFLTPELEPVFGGTYWPGPSVARRAAEDHGDEPLDFLVIVKKVRNIWKDQQARCRK 186
Query: 122 ---DMLAQSGAFAIE--------------QLSEALSASASSNK----------LPDELPQ 154
+++ Q FA E Q++ A A+ SN+ + EL
Sbjct: 187 EATEVIGQLREFAAEGTLGKRSIAAPQQQQIAPAGWAAPVSNQPVAKVSDSTDVSSELDI 246
Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEASEGQK 211
+ L ++ ++D +GGFG APKF P ++ +L ++D E
Sbjct: 247 DQLEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLAFLLNLVNFPAPVQDVVGEAECKHALD 306
Query: 212 MVLFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT-- 268
M L TL+ + G +HDH+G GF R SV W +P+FEK++ D +L +YL+A+ +
Sbjct: 307 MALDTLRKIRDGALHDHIGATGFARCSVTPDWSIPNFEKLVVDNAELLQLYLEAWRKSGA 366
Query: 269 -KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
+D + + ++ DYL I P G S+E ADS G K+EGA+Y+WT +E
Sbjct: 367 REDSEFYNVVIELADYLTSPPIALPDGGFASSEAADSYAKRGDAEKREGAYYLWTRREFA 426
Query: 327 DILG---EHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 382
++ +H E Y+ ++ GN D DP+++F +N+L + + +P
Sbjct: 427 SVVNADDKHISAIAEAYWDVQEDGNVDEDH--DPNDDFINQNILRIRKTPEELSKQFNVP 484
Query: 383 LEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
+ + R L R K RP P +DDK++ WNGLV+S+ R + LK
Sbjct: 485 VATVKRDIETAREALKKRREKERPHPDVDDKIVAGWNGLVVSALIRTAAFLKE------- 537
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
+ ++Y+ A+ + SFI+ L+DE+ L + +G GF DDYA+L GL
Sbjct: 538 ---LQPERSRKYLGAAKKSISFIKEKLWDEKNKILYRIWSDG-RHTEGFADDYAYLTHGL 593
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
LDL++ +L +A LQ +Q+ F D G +++TT P +LR+K+ D + PS
Sbjct: 594 LDLFDATGDESYLEFADNLQKSQNAFFYD-SAGAFYSTTPSSPHTILRLKDGMDTSLPST 652
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
N VSV NL RL ++A K + A ++ FE +
Sbjct: 653 NGVSVSNLFRLGELLADEK---FTGLARETINAFEAEM 687
>gi|427728058|ref|YP_007074295.1| hypothetical protein Nos7524_0793 [Nostoc sp. PCC 7524]
gi|427363977|gb|AFY46698.1| highly conserved protein containing a thioredoxin domain [Nostoc
sp. PCC 7524]
Length = 688
Score = 308 bits (789), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 227/713 (31%), Positives = 341/713 (47%), Gaps = 124/713 (17%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQALAEYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
VFL+P DL P GTYFP E +Y RPGF +L+ ++ +D +++ L Q A +E L S
Sbjct: 108 VFLTPEDLVPFYAGTYFPLEPRYNRPGFLQVLQALRRYYDTEKEELRQRKAVILESLLTS 167
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG-----GFGSAPKFPRPVEIQMML 191
L A+ EL L + +++ G +G++ FP M+
Sbjct: 168 AVLQGDATQEAEAQEL-----------LGRGWETSTGIITPNQYGNS--FP------MIP 208
Query: 192 YHSKKLEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
Y L T + + + Q++ +A GGI+DHV GGFHRY+VD W VPHFEKM
Sbjct: 209 YAELALRGTRFNFPSRYDAQQVCTQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKM 268
Query: 251 LYDQGQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 309
LYD GQ+ + +S ++ ++ +++L+R+M P G ++A+DADS
Sbjct: 269 LYDNGQIVEFLANLWSAGIQEPAFTRAVAGTIEWLQREMTAPEGYFYAAQDADSFTNPAE 328
Query: 310 TRKKEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 368
T +EGAFYVW+ E+ ++L + ++ + + P GN F+GKNVL
Sbjct: 329 TEPEEGAFYVWSYTELAELLSPTELAELQQQFTVTPNGN------------FEGKNVLQR 376
Query: 369 LNDSSASASKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHLD 410
N +L + LE L+ L R R + ++ R D
Sbjct: 377 RN-----PGQLSITLETALDKLFTARYGAAPDALETFPPARDNQEAKTSNWPGRIPSVTD 431
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH-LY 469
K+IV+WN L+IS ARA +A+F P+ G ++A AA FI +H L
Sbjct: 432 TKMIVAWNSLMISGLARA---------AAVFQEPIYG-------DIAARAAKFILQHQLV 475
Query: 470 DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELF 528
+ + HRL + G +DYAF I LLDL + WL AI LQ +E
Sbjct: 476 NGRFHRLNY---QGQPTVLAQSEDYAFFIKALLDLQACSPEQRFWLENAIALQTEFNEFL 532
Query: 529 LDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 587
E GGYFNT + +++R + D A PS N V++ NLVRL + + +Y
Sbjct: 533 WSVELGGYFNTASDASQELIVRERSYADNATPSANGVAIANLVRLTLL---TDDLHYLDL 589
Query: 588 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 647
AE L F + ++ A P + A D ++ L+ +S+ + N+L +
Sbjct: 590 AEQGLKAFNSVMQQAPQACPSLFTALDWY-----RNCTLI--RSTTEQINVLIPKY---- 638
Query: 648 LNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPV 700
L V+++ +N D V LVCQ C P V
Sbjct: 639 LPNVVLNV----------------------VSNLPTDS-VGLVCQGLKCLPSV 668
>gi|17228732|ref|NP_485280.1| hypothetical protein all1237 [Nostoc sp. PCC 7120]
gi|17130584|dbj|BAB73194.1| all1237 [Nostoc sp. PCC 7120]
Length = 685
Score = 308 bits (788), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 216/650 (33%), Positives = 312/650 (48%), Gaps = 87/650 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIADYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
VFLSP DL P GTYFP E KY RPGF IL ++ +D +++ L Q A +E L S
Sbjct: 108 VFLSPEDLVPFYAGTYFPIEPKYNRPGFLQILEALRRYYDTEKEDLRQRKALIVESLLTS 167
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L A+ EL + +++ + +G FP M+ Y
Sbjct: 168 AVLKGEATQEAEESELLKRGWETNTSVITR---NEYGN-----SFP------MIPYAELA 213
Query: 197 LEDTGKS-GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
L T + +GQ++ +A GGI+DHV GGFHRY+VD W VPHFEKMLYD G
Sbjct: 214 LRGTRFNFASRYDGQQVSTQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNG 273
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S K+ ++ + +L+R+M P G ++A+DADS T +E
Sbjct: 274 QIVEYLANLWSAGVKEPAFARAVTGTVVWLQREMTAPAGYFYAAQDADSFTTPTDVEPEE 333
Query: 315 GAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ E+E ++ + ++ + + P GN F+GKNVL
Sbjct: 334 GAFYVWSYAELEQLVTPTELTELQQQFTVSPQGN------------FEGKNVL-----QR 376
Query: 374 ASASKLGMPLEKYLNILGECRR-KLFDVRSKRPRPH-----------------LDDKVIV 415
+LG +E L L R D P D K+IV
Sbjct: 377 RQPGELGATIETALGKLFAARYGSAADTLETFPPAQDNQEAKTTHWPGRIPSVTDTKMIV 436
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTH 474
+WN L+IS ARA+ + F P+ G E+A AA+FI D + H
Sbjct: 437 AWNSLMISGLARAAGV---------FQQPLAG-------ELAAKAANFILENQFVDGRFH 480
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREG 533
RL + G + +DYA I LLDL+ + WL AI LQ+ DE E
Sbjct: 481 RLNY---RGEAAVLAQSEDYALFIKALLDLHTAEPENRFWLEKAIALQHQFDEFLWSIEL 537
Query: 534 GGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
GGYFNT + +++R + D A PS N V++ NLVRL+ + + +Y AE L
Sbjct: 538 GGYFNTASDASQDLIIRERSYMDNATPSANGVAIANLVRLSLL---TDDLHYLDLAEQGL 594
Query: 593 AVFETRLKDMAMAVPLMCCAAD-------MLSVPSRKHVVLVGHKSSVDF 635
F++ + A P + A D + S + H ++ + +V F
Sbjct: 595 KAFKSVMSSAPQACPSLFTALDWYRNSTLIRSTNEQIHTLIPSYLPTVAF 644
>gi|343087024|ref|YP_004776319.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342355558|gb|AEL28088.1| protein of unknown function DUF255 [Cyclobacterium marinum DSM 745]
Length = 682
Score = 308 bits (788), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 215/618 (34%), Positives = 297/618 (48%), Gaps = 61/618 (9%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVME ESFE + VAKL+N F+ IK+DREERPD+D +YM VQ + GGWPL+
Sbjct: 56 SACHWCHVMEGESFEAKDVAKLMNAHFICIKIDREERPDLDNIYMEAVQVMGLQGGWPLN 115
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL P+ KP GGTYF E + +L V A+ ++ D L +S + + ++
Sbjct: 116 VFLLPNQKPFYGGTYFSKEQ------WIQVLSGVAQAFSQQYDDLVKSAEGFGQSIERSV 169
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
K + +R A+ L D +GG PKFP PV I L L+D
Sbjct: 170 IEKYGLKKGKSKFFPETIRQIAKDLIGKIDPVWGGMKRVPKFPMPV-IWSFLLDMAILDD 228
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
GE V FTL+ MA GGI+DH+GGGF RYSVD W PHFEKMLYD GQL +
Sbjct: 229 HEDLGEK------VCFTLEKMAMGGIYDHLGGGFCRYSVDGEWFAPHFEKMLYDNGQLLS 282
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
+Y A+ + + + + + +L DM GP +SA DADS +EG FY
Sbjct: 283 LYSKAYQYSANALFREKITETISWLLNDMCGPEMGFYSALDADS-------DGEEGRFYT 335
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
WT E++D+LG+ F + Y +K GN + GKN+L +
Sbjct: 336 WTFSELKDLLGDDLNWFCQLYGIKEQGNWE-----------AGKNILYQTLPYVEVGENF 384
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G E L+ L E + KL + R R RP LDDK+I WNG VI A L E
Sbjct: 385 GFTQEALLSKLREVKLKLKEKRESRTRPGLDDKIISGWNGWVIKGLCDAYLALGEE---- 440
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
E A +FI H+ E + L S++ G + P FL+DYA +I
Sbjct: 441 ------------EIRNTAVRTGNFIWHHMVIE--NELYRSYKGGQAYTPAFLEDYAAVIQ 486
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+ LY+ + WL A L F D E ++ + ++ KE D P
Sbjct: 487 SFISLYKISFDSFWLRRAELLAQRVLRNFHDEEDEMFYFNDPKIEKLIANKKELFDNVIP 546
Query: 560 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP--LMCCAADML- 616
S NSV NL +L + +D Y A+ L + + DM + P L A+ L
Sbjct: 547 SSNSVMARNLHQLGLYLY---NDTYLAQAKSMLQL----VSDMLIKEPDFLANWASFYLE 599
Query: 617 -SVPSRKHVVLVGHKSSV 633
SVP+ + +V+ G ++S
Sbjct: 600 QSVPTAE-IVIAGKEAST 616
>gi|75906768|ref|YP_321064.1| hypothetical protein Ava_0545 [Anabaena variabilis ATCC 29413]
gi|75700493|gb|ABA20169.1| Protein of unknown function DUF255 [Anabaena variabilis ATCC 29413]
Length = 711
Score = 308 bits (788), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 211/617 (34%), Positives = 307/617 (49%), Gaps = 70/617 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 74 SSCHWCTVMEGEAFSDQAIAEYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 133
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
VFLSP DL P GTYFP E KY RPGF +L ++ +D +++ L Q A +E L S
Sbjct: 134 VFLSPEDLVPFYAGTYFPLEPKYNRPGFLQVLEALRRYYDTEKEDLRQRKALIVESLLTS 193
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L A+ EL ++ +++ + +G FP ++ L ++
Sbjct: 194 AVLKGEATQEAEESELLRSGWETNTGVITR---NEYGN-----SFPMIPYAELALRGTRF 245
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ GE Q+ + +A GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 246 NFASRYEGEQISTQRGL-----DLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQ 300
Query: 257 LANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
+ + +S ++ ++ + +L+R+M P G ++A+DADS T T +EG
Sbjct: 301 IVEYLANLWSAGVQEPSFARAVTGTVAWLQREMTAPAGYFYAAQDADSFTTPTDTEPEEG 360
Query: 316 AFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
AFYVW+ E+E +L + ++ + + P GN F+GKNVL +
Sbjct: 361 AFYVWSYAELEQLLTPTELTELQQQFTVSPQGN------------FEGKNVLQRRHQWEL 408
Query: 375 SA---SKLGM-----------PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 420
SA + LG LE + K + P D K+IV+WN L
Sbjct: 409 SATIETALGKLFVARYGSAADTLETFPPAQDNQEAKTTHWPGRIPSV-TDTKMIVAWNSL 467
Query: 421 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHS 479
+IS ARA +A+F P+ G E+A AA+FI D + +RL +
Sbjct: 468 MISGLARA---------AAVFQQPLAG-------ELAAKAANFILENQFVDGRFYRLNY- 510
Query: 480 FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFN 538
G + +DYA I LLDL+ + WL AI LQ DE E GGYFN
Sbjct: 511 --RGEAAVLAQSEDYALFIKALLDLHAATPENRFWLEKAIALQQQFDEFLWSIELGGYFN 568
Query: 539 TTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
T + +++R + D A PS N V++ NLVRL+ + + +Y AE L F+T
Sbjct: 569 TASDASQDLIIRERSYMDNATPSANGVAIANLVRLSLL---TDDLHYLDLAEAGLKAFKT 625
Query: 598 RLKDMAMAVPLMCCAAD 614
+ A P + A D
Sbjct: 626 VMSSAPQACPSLFTALD 642
>gi|312138733|ref|YP_004006069.1| hypothetical protein REQ_12910 [Rhodococcus equi 103S]
gi|311888072|emb|CBH47384.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length = 674
Score = 308 bits (788), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 198/575 (34%), Positives = 288/575 (50%), Gaps = 63/575 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ A ++N+ FV IKVDREERPD+D VYM A+ G GGWP++ F
Sbjct: 57 CHWCHVMAHESFEDDATAAVMNEHFVCIKVDREERPDLDAVYMNATVAMTGQGGWPMTCF 116
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P GTY+P E + G P F +L V D W +R + + A + +L + S
Sbjct: 117 LTPDGAPFYCGTYYPREPRGGMPSFVQLLHAVTDTWRSRRGDVDDAAASVVAELRRS-SG 175
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ + P ++P L + + D GGFG APKFP + ++ +L ++
Sbjct: 176 ALPAGGAPIDVPL--LSGAVANVLRDEDRDHGGFGGAPKFPPSMLLEGLLRSYERT---- 229
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
A + V T + MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD L Y
Sbjct: 230 ---SAGPTLRAVERTAEAMARGGIYDQLGGGFARYSVDTQWVVPHFEKMLYDNALLVRFY 286
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + +D+L RD+ G SA DAD T +EG Y WT
Sbjct: 287 AHLARRTGSALARRVTEETVDFLLRDLRTAAGAFASALDAD-------TDGEEGLTYAWT 339
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+++ D++G + E + + TG + +G +VL D
Sbjct: 340 PQQIADVVGDDDGRWAAETFAVTDTGTFE-----------RGTSVLQLPAD--------- 379
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
PL+ + L + R +L R++RP+P DDKV+ +WNGL I++ A A L
Sbjct: 380 -PLDA--DRLADVRSRLLAARTRRPQPARDDKVVTAWNGLAITALAEAGAALG------- 429
Query: 441 FNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLI 498
R +++E AE A + HL D RL+ + G P G L+DY L
Sbjct: 430 ---------RADWVEAAEECAHMVLSTHLVD---GRLRRASLGGTVGEPAGILEDYGALA 477
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLLRVKEDHDGA 557
+GL L++ +WL A L +T + F D E G +F+T + +++ R ++ DGA
Sbjct: 478 AGLSTLHQVTGAAEWLEAATGLLDTAIDHFADPDEPGSWFDTADDAETLVARPRDPLDGA 537
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
PSG SV+ L+ +S+VA +S Y A SL
Sbjct: 538 TPSGASVTTEALLTASSLVAADRSARYAVAAADSL 572
>gi|407778219|ref|ZP_11125484.1| hypothetical protein NA2_09603 [Nitratireductor pacificus pht-3B]
gi|407299900|gb|EKF19027.1| hypothetical protein NA2_09603 [Nitratireductor pacificus pht-3B]
Length = 668
Score = 308 bits (788), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 216/697 (30%), Positives = 330/697 (47%), Gaps = 87/697 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFE++ VA ++N F++IKVDREERP++D++YM + A GGWPL++F
Sbjct: 53 CHWCHVMAHESFENDAVAAVMNRLFINIKVDREERPEIDQIYMAALAATGEQGGWPLTMF 112
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P GGTYFPPE ++GRPGF +L+ + AW +KR L +S + +L+
Sbjct: 113 LTPDGSPFWGGTYFPPEPRFGRPGFVQVLQAIDAAWREKRHELTKSAGNLKAHVQASLAP 172
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
PD + LR A ++ D GG APKFP ++++ + D
Sbjct: 173 PPGEPPEPDAM----LRDLAARVHGMIDPALGGLRGAPKFPNAPFMKILWLDGIQHGDRT 228
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ + V +L+ M GGI+DHVGGG RY+VD+RW VPHFEKMLYD QL +
Sbjct: 229 RI-------EAVADSLRHMLSGGIYDHVGGGLARYAVDDRWVVPHFEKMLYDNAQLLQLL 281
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
++ T D + + +D+L R+M GG S+ DAD T +EG YVW+
Sbjct: 282 CWVYARTHDQLFRIRIEETVDWLLREMRVDGGGFASSLDAD-------TDGEEGKTYVWS 334
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
+E+ ++LG A F + + L+ + +D H + +L LN +A+
Sbjct: 335 RQELGEVLGSEAGAFLDVFTLE--------KPADWHRD----PILHRLNHPAATDPASET 382
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
+ L+ +L R RP+P DDK++V WNG+ I++ A A ++L
Sbjct: 383 RMRTLLD-------RLLVARQARPQPGRDDKLLVDWNGMTITALATAGRLL--------- 426
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
DR ++ + A +A F+ + + RL HS R P DYA +IS
Sbjct: 427 -------DRPDWTQAARTAFRFVCESM---ENGRLPHSIRGDKQLFPALSSDYAAMISAA 476
Query: 502 LDLYEFGSGTKWLV----WAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
LY S L WA +LQ D+ G G++ + + V +R++ D D A
Sbjct: 477 TALYGATSDDALLQQARKWAGQLQRWHQ----DKAGSGFYMSASDSGDVPMRIRGDVDEA 532
Query: 558 EPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
PS S + L LA++ + + + A +L + A V A +
Sbjct: 533 IPSATSQVIEALAALATLTGDEEMTGLLHETARTALGRAARQPYGQAGTV-----HAASV 587
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+V +RK +V+V SV F V + +P D D ++
Sbjct: 588 AVSARK-LVMVEPAGSVVF--------------IPVANRNP-DPRRFDSVVSTGGEKVTL 631
Query: 677 ARN-NFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ + A +C +C PP T+P +LE L E
Sbjct: 632 PGDVVVDTTRPAAYLCIGQTCLPPFTEPSALEEALRE 668
>gi|257057143|ref|YP_003134975.1| highly conserved protein containing a thioredoxin domain-containing
protein [Saccharomonospora viridis DSM 43017]
gi|256587015|gb|ACU98148.1| highly conserved protein containing a thioredoxin domain protein
[Saccharomonospora viridis DSM 43017]
Length = 667
Score = 308 bits (788), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 216/694 (31%), Positives = 322/694 (46%), Gaps = 88/694 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESF D VA +N+ FV+IKVDREERPD+D VYMT QA+ G GGWP++ F
Sbjct: 49 CHWCHVMAHESFADADVAAFMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD KP GTY+PP G P FK +L V AWD++RD L + ++ ++E
Sbjct: 109 LTPDGKPFHCGTYYPPVPTQGMPSFKQVLTAVAQAWDERRDELVEGAGRIVDHIAE---- 164
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ P + + + +L D GGFG APKFP + ++ +L H ++
Sbjct: 165 -QTRPLSPQPVTADTIASAVAKLRTEVDPENGGFGGAPKFPPSMVLEFLLRHYERT---- 219
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++ E +V T + MA+GG++D + GGF RYSVD W VPHFEKMLYD L Y
Sbjct: 220 ---DSMEVLSIVDMTAEGMARGGVYDQLAGGFARYSVDAEWVVPHFEKMLYDNALLLRCY 276
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + ++L RD+ P G S+ DAD+ EG T YVWT
Sbjct: 277 AHLARRTGSPLAHRVAGETAEFLLRDLRTPQGGFASSLDADAEGVEGLT-------YVWT 329
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
+++ D+LG + E + + G + +G + L D A
Sbjct: 330 REQLVDVLGPDDGAWAAETFGVTEEGTFE-----------RGASTLRLPQDPDDPA---- 374
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+++ + L D R++RP+P DDKVI +WNGL I++ A A L+
Sbjct: 375 ----RWMRVTS----TLLDARNERPQPARDDKVIAAWNGLAITALAEAGVALQ------- 419
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLI 498
R +++E A +A SF+ H D+ L+ S R+G +A L+DY
Sbjct: 420 ---------RPDWIEAAVAAGSFVLDVHKTDDG---LRRSSRDGVVGEADAVLEDYGCFA 467
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELF-LDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
GLL L++ +WL AI L + F ++ G Y +T + ++ R + D A
Sbjct: 468 DGLLALHQATGEPRWLEEAIALLDIALRRFGVEGMPGAYHDTAVDAEELVHRPSDPTDNA 527
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCA 612
PSG S L+ +++ ++ YR E +LA R + VP + A
Sbjct: 528 SPSGASALAGALLTASALAGPERASAYRAACEEALA----RAGALIAQVPRFAGHWLSVA 583
Query: 613 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSN 672
ML+ P + VV + E + A + V+ P D E +
Sbjct: 584 EAMLAGPVQVAVVGTDARQR---ERFVVEAAQNIHGGGVVLGGVP-DAEGVPL------- 632
Query: 673 NASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
+ + A VC+ + C PVT P +L
Sbjct: 633 ---LTDRPLVDGRPAAYVCRGYVCDRPVTTPEAL 663
>gi|384567356|ref|ZP_10014460.1| thioredoxin domain-containing protein [Saccharomonospora glauca
K62]
gi|384523210|gb|EIF00406.1| thioredoxin domain-containing protein [Saccharomonospora glauca
K62]
Length = 670
Score = 308 bits (788), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 226/701 (32%), Positives = 328/701 (46%), Gaps = 89/701 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESF D+ VA +ND FV+IKVDREERPD+D VYMT QA+ G GGWP++
Sbjct: 48 ACHWCHVMAHESFSDDEVAAFMNDHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTC 107
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD KP GTY+PP +G P FK +L V AW ++RD L + ++ + E
Sbjct: 108 FLTPDGKPFHCGTYYPPVPAHGMPSFKQVLVAVDQAWRERRDELVEGAGRVVDHIVE--- 164
Query: 141 ASASSNKLPDELPQNALRLCA--EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
K P A + A +L + D GGFG APKFP + ++ +L H E
Sbjct: 165 ----QTKPLSLRPVTAETVAAAVSKLRREADPGNGGFGGAPKFPPSMVLEFLLRH---YE 217
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG + E +V T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L
Sbjct: 218 RTG----SVEALSVVDATAEGMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLL 273
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
Y T + + ++L RD+ P G S+ DAD+ EG T Y
Sbjct: 274 RFYAHLARRTGSALAYRVAGETAEFLLRDLRTPQGAFASSLDADTEGVEGLT-------Y 326
Query: 319 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
VWT +++ D+LG E + + + G + +G + L D A
Sbjct: 327 VWTPQQLVDVLGPEDGAWAAKLFGVTEEGTFE-----------RGASTLQLRRDPDDPA- 374
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+++ + R R+ RP+P DDKVI +WNGL I++ A A L+
Sbjct: 375 -------RWMRVTSALSR----ARAARPQPARDDKVIAAWNGLAITALAEAGVALR---- 419
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYA 495
R E++E A +AA+F+ H+ + L+ S R+G A L+DY
Sbjct: 420 ------------RPEWVEAAVAAAAFVLDVHVGGDGAEGLRRSSRDGVVGDAAAVLEDYG 467
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELF-LDREGGGYFNTTGEDPSVLLRVKEDH 554
L GLL L++ WL A L +T F +D G + +T + +++ R +
Sbjct: 468 CLADGLLALHQATGEPVWLTEATALLDTALRRFGVDGAPGAFHDTAADAEALVHRPSDPT 527
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LM 609
D A PSG S L+ +++ ++ YR E +L +R + VP +
Sbjct: 528 DNASPSGASALAGALLTASALAGPERAGAYRAACEEAL----SRAGVLVEQVPRFAGHWL 583
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEH 669
A +LS P + VV G K D ++A A V+ +P + E + +
Sbjct: 584 SVAEALLSGPVQVAVVGAGAK---DRAELVAEAARGVHGGGVVLGGEP-EAEGVPLLADR 639
Query: 670 NSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ + A A VC+ + C PVT P +L L
Sbjct: 640 PLVDGAPA----------AYVCRGYVCDRPVTTPEALARSL 670
>gi|421744678|ref|ZP_16182637.1| thioredoxin domain-containing protein [Streptomyces sp. SM8]
gi|406686908|gb|EKC90970.1| thioredoxin domain-containing protein [Streptomyces sp. SM8]
Length = 675
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 233/704 (33%), Positives = 335/704 (47%), Gaps = 91/704 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFEDE A ++N FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 48 SACHWCHVMAHESFEDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 107
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VFL+P+ +P GTYFPPE ++G PGF+ +L V+ AW ++R + + + L E
Sbjct: 108 VFLTPEGEPFYFGTYFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERR 167
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
A +LP +E Q L L++ YD GGFG APKFP + ++ +L H +
Sbjct: 168 LALGEP-RLPGAEEAAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR- 220
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 221 --TGAEG----ALQMAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALL 274
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY+ + T + + +++ RD+ P G SA DADSA+ G R EGA+
Sbjct: 275 CRVYVHLWRATGSEQARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAY 332
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVWT ++ ++LGE + H+ + G F+ ++ L +
Sbjct: 333 YVWTPAQLVEVLGEEDGRIAAAHFGVTEEGT------------FEEGASVLRLPQEDGAV 380
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
G + R +L++ R +RP P DDKV+ +WNGL I++ A A
Sbjct: 381 QDAGR--------IASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF---- 428
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDY 494
+R + ++ A +AA +R HL D RL + R+G S G L+DY
Sbjct: 429 ------------ERPDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDY 474
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
A + G L L WL +A L + + F D E G ++T + ++ R ++
Sbjct: 475 ADVAEGFLALASVTGEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPT 533
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
D A PSG + + L A + S+ +R AE +L V + + VP +
Sbjct: 534 DNATPSGWTAAAGA---LLGYAAQTGSEPHRTAAERALGV----VAALGPKVPRFIGNGL 586
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK---TVIHIDPADTEEMDFW 666
+L P + V +VG + A H + L+ V+ PAD E
Sbjct: 587 AVTEALLDGP--REVAVVGDPD----DPRTAVLHRTALLSTAPGAVVAAGPADGE----- 635
Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A + A VC+ F C P TDP L L
Sbjct: 636 ------LPLLAGRVPAEGAPTAYVCRGFVCDAPTTDPALLAAQL 673
>gi|346321450|gb|EGX91049.1| DUF255 domain protein [Cordyceps militaris CM01]
Length = 735
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 199/617 (32%), Positives = 316/617 (51%), Gaps = 72/617 (11%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+C +M +ESF + A +LND F+ + +DRE RPD+D +YM YVQA+ GG
Sbjct: 77 HIGYKACHYCRLMSIESFANAECAAVLNDAFIPVLIDRESRPDLDTIYMNYVQAVSSVGG 136
Query: 76 WPLSVFLSPDLKPLMGGTYFPPEDKYGRP---------GFKTILRKVKDAWDKKR----- 121
WPL++F++P+L+P+ GGTY+P + R F TI++KV+D+W ++
Sbjct: 137 WPLNLFVTPELEPVFGGTYWPGPNAARRAHDESTEDALDFLTIIKKVRDSWKEQESRCRK 196
Query: 122 ---DMLAQSGAFAIEQLSEALSASASSNKLP----------------------DELPQNA 156
++LAQ FA E + + N +P EL +
Sbjct: 197 EATEVLAQLREFAAEGTLGTRPVTQTQNFVPSGWAAPISSESSQGMDKTASVSSELDLDQ 256
Query: 157 LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHS--KKLEDTGKSGEASEGQKMV 213
L ++ ++D +GGFG APKF P ++Q +L H+ ++D E + M
Sbjct: 257 LEEAYTHIAGTFDPVYGGFGLAPKFLTPPKLQFLLELHTSPSAVQDIVGEAECAHATDMA 316
Query: 214 LFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF----SLT 268
L TL+ + G +HDHVG GF R SV W +P+FEK++ D QL ++YL A+
Sbjct: 317 LDTLRKIRDGALHDHVGATGFARCSVTPDWTIPNFEKLVVDNAQLLSLYLTAWHRAGGQA 376
Query: 269 KDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
FY I ++++YL ++ G + S+E ADS G KEGAFY+WT +E +
Sbjct: 377 TSEFYD-IVLELVEYLTSTPILRSDGLLASSEAADSYVRNGDRGMKEGAFYLWTKREFDS 435
Query: 328 IL-----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 382
++ G ++ H+ + GN D DP+++F +N+L + S + +
Sbjct: 436 VIEAAEKGASPVV-AAHWGVLEDGNVD--EQHDPNDDFMKQNILRVVKTSEELSKLFSVS 492
Query: 383 LEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
+E+ + R +L R +R RP +DDK + WNGL +S+ A+ AE+ +
Sbjct: 493 VERIEQSIHTARNELKRRREGERVRPEVDDKAVTGWNGLALSALAKT-------AEALVT 545
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
P + + + VA ASFI++HL+D Q+ ++ + G F +DYA++I GL
Sbjct: 546 VNPEISA---KCNTVASGIASFIQKHLWDTQS-KILYRIWTGDRDTEAFAEDYAYVIQGL 601
Query: 502 LDLYEFGSGTKWLVWAIELQNT--QDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
LDL++ + +A +LQ T Q F D GG+F TT E +LR+K+ D + P
Sbjct: 602 LDLFDTNGDESLIAFADQLQRTEAQASYFYD-AAGGFFTTTAESTFAILRLKDGMDTSLP 660
Query: 560 SGNSVSVINLVRLASIV 576
S N+VSV NL RL ++
Sbjct: 661 STNAVSVSNLYRLGQLL 677
>gi|376005318|ref|ZP_09782832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375326245|emb|CCE18585.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 686
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 207/631 (32%), Positives = 312/631 (49%), Gaps = 97/631 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERP++D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+P D P GGTYFP E +YGRPGF +L+ + + + ++ L + QL ++
Sbjct: 108 VFLTPGDRIPFYGGTYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ P EL ++ L+ E + + +GG P+FP + M + +L
Sbjct: 168 MILP------PSELTEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRL 216
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ K +G+ L + + GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 217 ISSPK----VDGKAACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272
Query: 258 ANVYLDAFS-LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
D +S K Y +++L+R+M P G ++A+DADS T +EGA
Sbjct: 273 LEFLADLWSDGEKQPAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGA 332
Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVWT++E+E L + + + +GN F+GK VL N
Sbjct: 333 FYVWTNQELETFLSPAEFGELQAQFTVTKSGN------------FEGKTVLQRWN----- 375
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHL-------------------------D 410
+L +E L KLF VR P + D
Sbjct: 376 CDELDPLIETALT-------KLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTD 428
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY- 469
K+IV+WN L+IS A+A+++L D EY+E+A AA F+ H +
Sbjct: 429 TKMIVAWNALMISGLAKAARVL----------------DNSEYLELATKAAKFVLEHQWV 472
Query: 470 DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-----WLVWAIELQNTQ 524
D++ HR+ + +G +DYA LI L+DL++ WL A+++QN
Sbjct: 473 DDRFHRVNY---DGKVAVLSQSEDYALLIKALIDLHQASLQQPELADFWLTNAVQVQNEF 529
Query: 525 DELFLDREGGGYFNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 583
D+ E GGYFNT +D ++L+R + D A P+ N V++ NLVRL + ++
Sbjct: 530 DQYLWSVELGGYFNTALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLN 586
Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
Y A +L F + ++ A P + A D
Sbjct: 587 YLDRALQALEAFASVMRQSPQACPSLFVAFD 617
>gi|257059286|ref|YP_003137174.1| hypothetical protein Cyan8802_1422 [Cyanothece sp. PCC 8802]
gi|256589452|gb|ACV00339.1| protein of unknown function DUF255 [Cyanothece sp. PCC 8802]
Length = 688
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 233/710 (32%), Positives = 333/710 (46%), Gaps = 114/710 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A LND F+ IK+DREERPD+D +YM VQ + GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIAAYLNDNFLPIKLDREERPDLDSLYMQAVQMMGIQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP E +YGRPGF +L+ ++ +D ++D L +F E
Sbjct: 108 IFLTPDDLVPFYGGTYFPIEPRYGRPGFLQVLQSIRRFYDTEKDKL---NSFK----HEI 160
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPK-FPRPVEIQMMLYHSKKL 197
L S LP NA L E + + P+ F RP M+ Y + L
Sbjct: 161 LDTLQKSAILP---VTNAELLNNELFYRGITANTEVIIVNPQDFNRPC-FPMIPYANLAL 216
Query: 198 EDTGKSGEASEGQKMVLFTL-QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ + + ++ E Q V + + +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 217 QGSRFAFQSQENQATVTYQRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQ 276
Query: 257 ----LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 312
LAN++ + + F I R + ++L+R+M P G ++A+DAD+ T
Sbjct: 277 IVEYLANLWSQGYQ--EPAFKRAIARTV-EWLQREMTAPQGYFYAAQDADNFTTPDEKEP 333
Query: 313 KEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 371
+EGAFYVW +E+E+ L E L + + L GN F+G NVL
Sbjct: 334 EEGAFYVWKFQELEEYLNSEEFKLLEATFSLTAEGN------------FEGSNVLQRRMG 381
Query: 372 SSASASKLGMPLEKYLNILGECRRKLF-------------DVRSKRPRPHLDDKVIVSWN 418
S + + + ++ G R+ L R P D K+IV+WN
Sbjct: 382 GEFSEALEAILDKLFMIRYGSSRKTLTTFPPAKNNQEAKNQTWPGRIPPVTDTKMIVAWN 441
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQ 477
L+IS ARA + F P+ Y E+A +A FI + + + + +RL
Sbjct: 442 SLMISGLARAYGV---------FGDPL-------YWELAINATEFILQEQWVNNRLYRLN 485
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGY 536
+ G +DYAF I LLDL + WL A E+Q DE F EGGGY
Sbjct: 486 YE---GQPSVLAQAEDYAFFIKALLDLQRANPWERQWLEKAKEVQEEFDEFFWSIEGGGY 542
Query: 537 FNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
+N ++ +L+R + D A PS N V++ NLVRL+ + Y AE L F
Sbjct: 543 YNNASDNSGDLLIRERSYIDNATPSANGVALSNLVRLSRLTDDLD---YLHRAEQGLQTF 599
Query: 596 ETRLKDMAMAVPLMCCAADML----SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 651
+ L A P + A D SV + K + L +
Sbjct: 600 SSVLSQSPKACPSLFVALDWYRFGNSVQTTKEI-----------------------LKQF 636
Query: 652 VIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+ P ++ +H +N+ V LVCQ SC P T
Sbjct: 637 ITQYFPVTVYQLT---DHLPDNS------------VGLVCQGLSCLEPAT 671
>gi|258511893|ref|YP_003185327.1| hypothetical protein Aaci_1926 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius DSM 446]
gi|257478619|gb|ACV58938.1| protein of unknown function DUF255 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius DSM 446]
Length = 626
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 209/602 (34%), Positives = 293/602 (48%), Gaps = 54/602 (8%)
Query: 27 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 86
+M ESFEDE VA +LN +V+IKVDREERPD+D +YMTY QAL G GGWPL++ ++PD
Sbjct: 1 MMAHESFEDETVAAILNAHYVAIKVDREERPDIDHIYMTYCQALQGEGGWPLTIIMTPDG 60
Query: 87 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 146
P GTYFP +YGRPG IL+++ W R L ++ E++ A
Sbjct: 61 HPFFAGTYFPKTPRYGRPGLIQILQEIARLWQTDRARLERASRSMAERMQPLFEGQAGEA 120
Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
+ + A E L ++D+ +GGFG APKFP +Q +L ++ +L + ++
Sbjct: 121 R-----GREAADRAYEALEATFDTEYGGFGPAPKFPTFHRVQFLLRYA-RLRPSERAA-- 172
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
M L TL+ + +GGI DHVGGG RYS D W VPHFEKMLYD Y DA++
Sbjct: 173 ----AMALSTLRAIQRGGIVDHVGGGMARYSTDPFWRVPHFEKMLYDNALALAAYADAYA 228
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
KD + R + + R+M P G +SA DADS+ EG FY W ++V
Sbjct: 229 HAKDPAFLRFVRQTVAFFEREMRSPEGLYYSAVDADSS-------GGEGRFYFWRPEDVI 281
Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLE 384
LG E L+ Y + GN F+G NV ++ D +A A+ GM E
Sbjct: 282 AALGPEDGELYNAFYDITEAGN------------FEGANVPNYIDQDPAAFAASRGMTEE 329
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
+ L KL VR R RP +DDK + +WN L+ ARA K A
Sbjct: 330 ELWQKLDALNEKLRAVRDARERPAIDDKCLTAWNALMAYGLARAGLACKETA-------- 381
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
+++ A + I R L RL +R+G + + DD+A+L++ L+L
Sbjct: 382 --------WVDRAREVVAAIERILMRADDGRLLARYRDGEAGIFAYADDHAYLVAAYLEL 433
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV-KEDHDGAEPSGNS 563
Y +L A Q QD LF D+ GGY G D L+ V K +DGA PS NS
Sbjct: 434 YRATLDRAYLDRARHWQAVQDALFWDKAQGGY-TFYGRDAESLIAVPKPVYDGAMPSANS 492
Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
S NL L ++ ++ Y + + F + M + AA M V S +
Sbjct: 493 QSAHNLWILHALTGDAE---YADRLDGLVRAFGGDIASAPMDCLWLVTAAMMSEVGSTEI 549
Query: 624 VV 625
V+
Sbjct: 550 VI 551
>gi|256389916|ref|YP_003111480.1| hypothetical protein Caci_0704 [Catenulispora acidiphila DSM 44928]
gi|256356142|gb|ACU69639.1| protein of unknown function DUF255 [Catenulispora acidiphila DSM
44928]
Length = 710
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 197/577 (34%), Positives = 294/577 (50%), Gaps = 61/577 (10%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFEDE A L+N+ +V +KVDREERPDVD VYM QA+ GGGGWP++VF
Sbjct: 49 CHWCHVMAHESFEDEATAALMNEKYVCVKVDREERPDVDAVYMAATQAMTGGGGWPMTVF 108
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
+P+ KP GTY+PP ++G P F+ +L V AW R+ + ++G + +L+
Sbjct: 109 ATPEGKPFQAGTYYPPVARHGLPSFRQLLVAVDRAWGDIREDVLRAGDGLVAELAHHARV 168
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
A + +PD AL L + +D GGFG APKFP + ++ +L H + D
Sbjct: 169 VAGAEGVPD---AGALATAVGVLRREFDGVRGGFGGAPKFPPSMTLEQLLRHHARTGD-- 223
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++ MV T + MA+GG++D +GGGF RY+VD+ W VPHFEKMLYD L Y
Sbjct: 224 -----ADALAMVRQTCEAMARGGMYDQLGGGFARYAVDDAWVVPHFEKMLYDNALLLRAY 278
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMI--GPGGEIFSAEDADSAETEGATRKKEGAFYV 319
L + T D + + D++ R++ G GG S+ DAD T EG FY
Sbjct: 279 LHLWRATGDALALRVVNETADWMLRELWLDGAGG-FASSLDAD-------TDGVEGKFYA 330
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIELNDSSASASK 378
W ++++ D +GE KE F+ G +VL L D
Sbjct: 331 WDAEQIADAVGE-----KEAGDAGDAAWAAAVFNVTAQGTFEHGLSVLQLLQDPD----- 380
Query: 379 LGMPLEKYLNILGECRRKLFDV-RSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
L+++ I R LF+ R +R P DDK + +WNGL +++ A A +
Sbjct: 381 ---DLDRFQRI----RDSLFEARRDQRTAPGRDDKAVAAWNGLAVAALAEAGAL------ 427
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA--PGFLDDYA 495
+ R+E + A A + R +D +T RL + R+G + A PG L+DYA
Sbjct: 428 ----------TGRQELVSAARQTAEMLERIHWDGKTMRLTRTSRDGVAGAQNPGVLEDYA 477
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
+ GLL LY T+W +A L + + F D + G +++T + +++ R + D
Sbjct: 478 DVAEGLLALYAVTGETRWFAFAGRLLDVVLDNFRD-DSGLFYDTADDAEALIFRPADPTD 536
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
A P G S + L+ A++ + S +R+ AE +L
Sbjct: 537 NATPGGTSAAAGALLTYAAL---TGSGRHREAAEQAL 570
>gi|119488064|ref|ZP_01621508.1| hypothetical protein L8106_11722 [Lyngbya sp. PCC 8106]
gi|119455353|gb|EAW36492.1| hypothetical protein L8106_11722 [Lyngbya sp. PCC 8106]
Length = 688
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 210/638 (32%), Positives = 321/638 (50%), Gaps = 109/638 (17%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D VA+ +N+ F+SIKVDREERP++D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDGAVAQYMNEHFISIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FLSP DL P +GGTYFP + +YG+PGF +LR+V+ ++ ++ L +++ A
Sbjct: 108 IFLSPDDLVPFVGGTYFPVQPRYGQPGFLEVLRRVRGFYNTEKTRLQNLK----QEIRNA 163
Query: 139 LSASA--SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L S S+++L + L Q L +++ + GG P+FP M+ Y
Sbjct: 164 LVQSTVLSASQLNEGLLQQGLTTNTAVITR---NDLGG----PRFP------MIPYADTA 210
Query: 197 LEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
L D E+ + Q+ +A GGI+DHV GGFHRY+VD W VPHFEKMLYD G
Sbjct: 211 LHDVRFDFESPYDSQQACTQRGTDLASGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNG 270
Query: 256 QLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q+ + +S +TK F I + +L+R+M P G ++++DAD+ T +
Sbjct: 271 QIVEYLANLWSAGITKPAFERSISGTV-SWLKREMTAPKGHFYASQDADNFTTPEDVEPE 329
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EG FYVW +++E+I+ E + + + +GN F+GKNVL N
Sbjct: 330 EGEFYVWNWQDLEEIVSPEEFGELQAQFSITKSGN------------FEGKNVLQRWN-- 375
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVR-------------------------SKRPRP 407
L P+E L KLF VR S R P
Sbjct: 376 ---CDALSQPIESAL-------AKLFAVRYGAKPQDLETFPPATNNQEAKSKNWSGRIPP 425
Query: 408 HLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH 467
D K+IV+WN L+IS ARA+ + + + EY+++A +AA FI +
Sbjct: 426 VTDTKMIVAWNSLMISGLARAATVFQ----------------QPEYLKIATTAAQFILEN 469
Query: 468 LY-DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE-------FGSGTKWLVWAIE 519
+ D + HR+ + +G +DYA I L+DL++ F W A++
Sbjct: 470 QWVDGRLHRVNY---DGNPDVLAQSEDYALFIKALIDLHQASLIESSFQLPEYWFEKAVK 526
Query: 520 LQNTQDELFLDREGGGYFNT---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 576
+Q D+ E GGY+N TG++ +L+R + D A P+ N V++ NLVRL +
Sbjct: 527 VQQEFDQFLWSVELGGYYNIGTDTGQE--LLMRERSYTDNATPAANGVAMANLVRL--FL 582
Query: 577 AGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
+ DY + AE + F + ++ A P + A D
Sbjct: 583 LTEQLDYLDK-AEQGIQAFSSIMEKSPQACPSLFVALD 619
>gi|409198348|ref|ZP_11227011.1| thioredoxin domain-containing protein [Marinilabilia salmonicolor
JCM 21150]
Length = 675
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 210/697 (30%), Positives = 323/697 (46%), Gaps = 81/697 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM E FEDE A+L+N+ F+ IKVDREERPDVD ++T VQ + GGWPL+
Sbjct: 53 SACHWCHVMAHECFEDEETARLMNEHFICIKVDREERPDVDNFFITAVQLMGAQGGWPLN 112
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V PD +P GGTYFP + +K IL K+ + R+ L + +
Sbjct: 113 VVTLPDGQPFWGGTYFPKDQ------WKEILIKINKLFHSDREKLTHHAHQLTTGIQQTS 166
Query: 140 SASASSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML----YH 193
S+ +++PD E+ AL E+ S +D + GG PKFP PV ++ +L +H
Sbjct: 167 MISSEQSEVPDLSEVINEAL----ERWSAQWDLQLGGSLGKPKFPMPVNLEFLLHLHFHH 222
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
+K+ + TLQ MA+GGI+D GGGF RYSVDE W VPHFEKMLYD
Sbjct: 223 PQKM-----------FSDFLNTTLQQMARGGIYDQAGGGFARYSVDEFWKVPHFEKMLYD 271
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
QL +Y A++ + Y + ++ + ++ ++ P G FSA DADS EG +
Sbjct: 272 NAQLIELYSHAYAHSGIKEYRDVVKETIAFVENKLMHPSGAFFSALDADS---EG----E 324
Query: 314 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
EG +YVWT +E+ +I G LF +++ + G+ + G +L+
Sbjct: 325 EGKYYVWTEEELLNIFGRDFPLFADYFNVNENGHWE-----------NGNYILLRTGSDE 373
Query: 374 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
A K M LE+ + ++ L + R KR RP LDDK I SWN L+ A K +
Sbjct: 374 EFAHKHKMTLEEVEKRVSVWKKDLVNRRKKRIRPGLDDKTITSWNALMTKGLVEAHKAVS 433
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 493
+ ++A FI L + L ++++G + GF++D
Sbjct: 434 D----------------SHFRKLALKNGEFICHSLISKDG-SLFRTWKDGRASVTGFMED 476
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 553
YA +IS + LYE KW+ + L + ++ F D+ G + + +
Sbjct: 477 YASVISAFIGLYEITGDEKWIEQSSRLADYAEKAFYDKATGQFHYMEKNQTELPANHFDT 536
Query: 554 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 613
D PS NS+ L +LA++ +YR+ AE L + K+
Sbjct: 537 QDNVIPSANSMMGHALFKLAALTG---DQHYRETAEKMLNQMLLQFKNYPWGFAHWGSLM 593
Query: 614 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNN 673
M+ PS + VV+ G K+ + + Y N + P E++
Sbjct: 594 LMIHKPSFE-VVVAGSKTVQALQRL----QKQYRPNVIWAPLKPESPGELN--------- 639
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ +N S +++ VC +C PV ++LL
Sbjct: 640 --ITKNRKSDEEITIYVCAQGACQLPVHSVEEAQHLL 674
>gi|390440171|ref|ZP_10228522.1| Six-hairpin glycosidase-like [Microcystis sp. T1-4]
gi|389836455|emb|CCI32648.1| Six-hairpin glycosidase-like [Microcystis sp. T1-4]
Length = 692
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 214/624 (34%), Positives = 312/624 (50%), Gaps = 82/624 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-- 195
L SA + L + +L E +K +G P FP + L S+
Sbjct: 164 LRQSAILPRAETNLAEPSLLATGIETNTKVIRVNPNNYGR-PSFPMIPYSHLALQGSRFG 222
Query: 196 -KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
+D+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD
Sbjct: 223 DDFDDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDN 274
Query: 255 GQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
GQ+ + +S ++ + + +++L+R+M P G ++A+DADS E +
Sbjct: 275 GQIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPE 334
Query: 314 EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGAFYVW+ + + D L + L + ++ + GN F+G+NVL
Sbjct: 335 EGAFYVWSDRSLRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----Q 377
Query: 373 SASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVI 414
KLG +E L+ L G + +L R D K+I
Sbjct: 378 RRQGGKLGKEIENMLDKLFIRRYGSSQSQLALFPPARDNQEAKTVSWPGRIPAVTDTKMI 437
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQT 473
V+WN L+IS ARA A+F P+ Y ++A AA FI +H + D +
Sbjct: 438 VAWNSLMISGLARA---------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRF 481
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDRE 532
RL + G + +D+A+ I LLDL T WL AI+LQ D F +
Sbjct: 482 QRLNY---QGQASVLAQSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAED 538
Query: 533 GGGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 590
GGYFN T D S+ L V+E D A PS N +++ NL+RL+ + + Y AE
Sbjct: 539 EGGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEK 594
Query: 591 SLAVFETRLKDMAMAVPLMCCAAD 614
+L F T L+ A P + A D
Sbjct: 595 ALQSFTTILEQSPTACPSLFVALD 618
>gi|291569597|dbj|BAI91869.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 686
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 205/620 (33%), Positives = 315/620 (50%), Gaps = 75/620 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERP++D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+P D P GGTYFP E +YGRPGF +L+ + + + ++ L + QL ++
Sbjct: 108 VFLTPGDRIPFYGGTYFPIEPRYGRPGFLDLLKAIHNFYHTDKNKLETVTEEILTQLRQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ P EL ++ L+ E + + +GG P+FP M S+ +
Sbjct: 168 VILP------PSELTEDLLKQGLETNTGVVGRNNYGG----PRFPMIPYADMAWRGSRLI 217
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ G+A+ Q+ + + GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 218 SSSKVDGKAACLQRG-----KDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272
Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
D +S K + +++L+R+M P G ++A+DADS T +EGA
Sbjct: 273 LEFLADLWSEGEKQPAFQRSINGTVEWLKREMTAPQGYFYAAQDADSFVTSQDKEPEEGA 332
Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV---------- 365
FYVWT++E+E L E + + + +GN F+GK V
Sbjct: 333 FYVWTNQELETFLTSEEFGELQAQFTVTKSGN------------FEGKTVLQRWNCDELD 380
Query: 366 -LIELNDSSASASKLGMPLEKYLNI-LGECRR--KLFDVRSKRPRPHLDDKVIVSWNGLV 421
LIE + A + G P E+ + E + K D + P D K+IV+WN L+
Sbjct: 381 PLIETALAKLFAVRYGAPPEEVKTFPVAENNQGAKQRDWPGRIP-AVTDTKMIVAWNALM 439
Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSF 480
IS A+A+++ D EY+E+A +AA FI +H + D++ HR+ +
Sbjct: 440 ISGLAKAARVF----------------DNSEYLELATTAAKFILKHQWVDDRFHRVNY-- 481
Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-----WLVWAIELQNTQDELFLDREGGG 535
+G +DYA + L+DL++ WL A+ +Q+ DE E GG
Sbjct: 482 -DGQVAVLSQAEDYALFVKALIDLHQASLQQPELAEFWLTNAVNVQSELDEYLWSMELGG 540
Query: 536 YFNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
YFNT +D ++L+R + D A P+ N V++ NLVRL + ++ Y A +L
Sbjct: 541 YFNTALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRAGQALEA 597
Query: 595 FETRLKDMAMAVPLMCCAAD 614
F + ++ A P + A D
Sbjct: 598 FASIMRQSPQACPSLFVAFD 617
>gi|354566297|ref|ZP_08985470.1| hypothetical protein FJSC11DRAFT_1676 [Fischerella sp. JSC-11]
gi|353546805|gb|EHC16253.1| hypothetical protein FJSC11DRAFT_1676 [Fischerella sp. JSC-11]
Length = 691
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 236/724 (32%), Positives = 338/724 (46%), Gaps = 123/724 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D G+A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDPGIAEYMNANFIPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
FLSP DL P GTYFP E +YGRPGF +L+ ++ +D ++ L A +E L S
Sbjct: 108 AFLSPDDLVPFYAGTYFPVEPRYGRPGFLQVLQAIRHYYDTEKQDLRDRKAVILESLLTS 167
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L ++ EL ++ +++G FP ++ L +
Sbjct: 168 AVLQQQGTTATQDKELLHKGRETSTGIITP---NQYGN-----SFPMIPYAELAL-RGTR 218
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
E T + +G+++ +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 219 FEVTSE----YDGKQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQ 274
Query: 257 LANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS------AETEG 308
+ + +S + + F I + +L+R+M P G ++A+DADS +G
Sbjct: 275 IVEYLANLWSAGIEEPAFKRAIAGTV-QWLKREMTAPEGYFYAAQDADSFTPPYQGGDKG 333
Query: 309 ATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 367
+ +EGAFYVWT E+E +L E I ++ + + GN F+ KNVL
Sbjct: 334 GSEPEEGAFYVWTFSELEQLLTAEELIELQQQFTVTANGN------------FESKNVLQ 381
Query: 368 ELNDSSASASKLGMPLEKYLNILGECR--------------RKLFDVRSK----RPRPHL 409
SA+ +E L L R R + +S+ R
Sbjct: 382 RRRSGELSAT-----VETALKKLFVARYGATPESLETFPPARNNQEAKSRHWPGRIPAVT 436
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
D K+IV+WN L+IS ARA A+F PV Y+E+A +AA FI H +
Sbjct: 437 DTKMIVAWNSLMISGLARA---------YAVFREPV-------YLELATTAADFIVNHQF 480
Query: 470 -DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDEL 527
D + HRL ++ N P+ +DYAF I LLDL KWL AI LQ DE
Sbjct: 481 VDGRFHRL--NYENQPT-VLAQSEDYAFFIKALLDLQTCSPEQNKWLERAIALQEEFDEY 537
Query: 528 FLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
E GGY+NT+ + +++R + D A PS N V++ NLVRLA + + +Y
Sbjct: 538 LWSVELGGYYNTSSDASQDLIVRERSYVDNATPSANGVAIANLVRLALF---TDNLHYLD 594
Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 646
AE L F + + A P + A D +
Sbjct: 595 LAEQGLNAFRSVMNSTPQACPSLFTALD-------------------------------W 623
Query: 647 DLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
N T+I TE++ + A + D V LVCQ C P + SL
Sbjct: 624 YRNSTLIR---TTTEQLHSLMSQYLPSVVFAIASKLPDNSVGLVCQGLKCLPAAS---SL 677
Query: 707 ENLL 710
E +L
Sbjct: 678 EQML 681
>gi|310797732|gb|EFQ32625.1| hypothetical protein GLRG_07639 [Glomerella graminicola M1.001]
Length = 811
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 205/650 (31%), Positives = 319/650 (49%), Gaps = 81/650 (12%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H CH+ + E F A +LN+ F+ + +DREERP++D +YM YVQA+ G GG
Sbjct: 78 HIGFKACHYSRLTSTECFTHRECAAILNESFIPVIIDREERPELDTIYMNYVQAVSGSGG 137
Query: 76 WPLSVFLSPDLKPLMGGTYFPP-------EDKYGRPGFKTILRKVKDAWDKKRDMLAQSG 128
WPL++FL+P+L+P+ GGTY+P D R F ILRK++ W ++ Q
Sbjct: 138 WPLNLFLTPELEPVFGGTYYPAPGPNNGGSDDEDRLDFLAILRKLQKVWREQEGRCRQEA 197
Query: 129 AFAIEQL--------------------SEALSASASSNKL------------PDELPQNA 156
+ +L S+ ++ S L EL +
Sbjct: 198 KEVVVKLHDFAAEGTLGTATVQPGVAGSQTIAIGRSETGLEHPGTGRTAAAVSSELDLDL 257
Query: 157 LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL---EDTGKSGEASEGQKMV 213
L ++ ++D +GGFG APKFP P ++ +L + L +D E + +M
Sbjct: 258 LEEAYSHIAGTFDPVYGGFGLAPKFPTPPKLSFLLRLPRYLAPVQDVVGESECAHATEMA 317
Query: 214 LFTLQCMAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT---- 268
LFTL+ + + DHVGG GF RYSV W VP FEK++ L +YLDA+ +
Sbjct: 318 LFTLRKIRDSSLRDHVGGCGFARYSVTADWSVPRFEKLIAHNALLLGLYLDAWLIATGGE 377
Query: 269 KDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 327
K + + +++DYL I P G S+E ADS G +EGA+ +WT +E +
Sbjct: 378 KGTEFYDVVVELVDYLSSPPISLPEGGFVSSEAADSYYRRGDRHMREGAYNLWTRREFDT 437
Query: 328 ILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 385
++G+ A L ++ + GN + + DP++EF +N+L + D S + G+ +++
Sbjct: 438 VIGDDHEAALAASYWNVLEHGNVEPDQ--DPNDEFMNENILRVVKDVSEIGRQAGITVDE 495
Query: 386 YLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
++ ++KL R K R RP +D K++ NGLVIS+ RA L +
Sbjct: 496 VKRVISSAKQKLKVHREKERVRPEVDAKIVAGRNGLVISALTRAGLALAT---------- 545
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
V + + + A AA FIR +L+DE+ L + G +A G +DYA+LI GL+ L
Sbjct: 546 VDAAKSQAAIASAGRAAEFIRANLWDEKERILYRIWNEGRGEAKGLAEDYAYLIEGLIGL 605
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLD--------------REGGGYFNTTGED-PSVLLR 549
YE + +W+ +A ELQ Q + F D R G F T E+ P +LR
Sbjct: 606 YEATADERWIEFADELQKVQIDTFYDSPSVGTSVLESPASRSSCGAFYITAENAPHTILR 665
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
+K+ D A PS N+VSV NL RL ++++ + Y A S+ FE +
Sbjct: 666 LKDGMDTALPSTNAVSVSNLFRLGTMLS---DEAYTALARESINAFEAEI 712
>gi|433772248|ref|YP_007302715.1| thioredoxin domain protein [Mesorhizobium australicum WSM2073]
gi|433664263|gb|AGB43339.1| thioredoxin domain protein [Mesorhizobium australicum WSM2073]
Length = 675
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 200/557 (35%), Positives = 281/557 (50%), Gaps = 58/557 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFE++ VA ++N FV+IKVDREERPD+D++YM + ++ GGWPL++
Sbjct: 56 ACHWCHVMAHESFENDDVAAVMNRLFVNIKVDREERPDIDQIYMAALSSMGEQGGWPLTM 115
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+PD KP GGTYFP E +YGRPGF ++ V AW +KR L QS + LS
Sbjct: 116 FLTPDGKPFWGGTYFPREPRYGRPGFIQVMEAVDKAWREKRTSLHQSADGLTSHVEARLS 175
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A+ S L ++ L A ++S D GG APKFP +Q + L D
Sbjct: 176 ATHSKALLDRDM----LSDLAGRVSGMIDRDRGGLAGAPKFPNAPFMQTLWL--SWLRD- 228
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G A+ + VL +L+ M GGI+DH+GGG RYS D W VPHFEKMLYD QL
Sbjct: 229 ---GNAAH-RDDVLVSLEHMLSGGIYDHIGGGLSRYSTDAEWLVPHFEKMLYDNAQLIRF 284
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
A + T + + D + +L R+M GG ++ DADS +EG FY W
Sbjct: 285 CNWALAATGNDLFRVRIEDTVGWLLREMRVEGGAFAASLDADS-------DGEEGLFYTW 337
Query: 321 TSKEVEDILGEHAILFKEHYYL-KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
+ E+E +LG+ + LF +++ L P G ++GK VL + + S
Sbjct: 338 SRGEIESVLGDDSTLFFKYFSLSSPPG-------------WEGKPVLHQ----TLSQQAF 380
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
G+ + L L + +L VR +R RP LD K + WNGL+I++ A A + L
Sbjct: 381 GVADRERLVPL---KTRLLTVREQRVRPGLDAKTLTDWNGLMIAALAEAGRSLA------ 431
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 499
R +++E A A + I + D RL HS P DYA + +
Sbjct: 432 ----------RPDWIEAAAKAFAHIGKAGRD---GRLPHSMLGVRKLFPALSSDYAAMTN 478
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
+ L+E ++ A + D D EG GY+ T + V +R++ D D A P
Sbjct: 479 AAISLFEATEDWSYVEQASQFLGQLDHWHADVEGTGYYLTASDSTDVPIRIRGDVDEAIP 538
Query: 560 SGNSVSVINLVRLASIV 576
S S + VRLASI
Sbjct: 539 SATSQIIEAQVRLASIT 555
>gi|295132488|ref|YP_003583164.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
gi|294980503|gb|ADF50968.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
Length = 678
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 192/584 (32%), Positives = 290/584 (49%), Gaps = 48/584 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFED VA ++N ++SIKVDREERPD+D+VYM VQ + G GGWP+++
Sbjct: 53 CHWCHVMEHESFEDPEVADIMNAHYISIKVDREERPDIDQVYMQAVQLMTGSGGWPMNIV 112
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
PD +P+ GGTYF E +K+ L +++ + K+ L E L +
Sbjct: 113 ALPDGRPVWGGTYFRKEQ------WKSALLQIQQIYKKESTQLTNYANKLKEGLQQLNLI 166
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+N E Q L E D + GG +APKF P + +L ++ + +D
Sbjct: 167 DIGNNSY--EFSQKRLGEFIEIWKPYLDMKLGGTKNAPKFMMPTNLDFLLRYAYQFKD-- 222
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
+ Q+ VL +L ++ GG DH+GGGF RYSVD+RWHVPHFEKMLYD QL ++Y
Sbjct: 223 -----KKLQEYVLHSLDKISFGGTFDHIGGGFARYSVDDRWHVPHFEKMLYDNAQLLSLY 277
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
A+ LT+D +Y + + ++ ++ G +SA DADS +G ++EGAFY W
Sbjct: 278 SKAYKLTQDHWYKEVIKKTARFIETELTDSTGAFYSALDADSENAKG--NQEEGAFYTWK 335
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 381
+E+E++L LF ++ + G + G +L + K +
Sbjct: 336 KEELEELLASEFDLFSAYFNINARGYWE-----------NGNYILYKTEKDDDFTKKHNI 384
Query: 382 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 441
LE+ + L + R KR +P LDDK + SWN L ++ FA A
Sbjct: 385 SLEELYQKKSNWTKILSEARKKRKKPGLDDKTLTSWNALSLNGFAEA------------- 431
Query: 442 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 501
+ + Y+ +A A FI ++ + + L HS++N SK +L+DYAF I
Sbjct: 432 ---YTATGKNHYLNIALKNAEFIIQNQLNPD-YSLFHSYKNKQSKINAYLEDYAFTIEAF 487
Query: 502 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 561
L LYE KW+ + L E F ++E + T+ +D +++ E D P+
Sbjct: 488 LKLYEVTFDKKWIDISSHLTKYCFENFYNQENTLFNFTSKKDDALISTPIELTDNVIPAS 547
Query: 562 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 605
NSV NL RL + S+ Y + +E L V ++ M
Sbjct: 548 NSVMANNLFRLGRLTGTSR---YLEVSEKMLQVISGKIGSYPMG 588
>gi|86606925|ref|YP_475688.1| hypothetical protein CYA_2291 [Synechococcus sp. JA-3-3Ab]
gi|86555467|gb|ABD00425.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 701
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 207/625 (33%), Positives = 301/625 (48%), Gaps = 74/625 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDPEIAAFLNAHFLPIKVDREERPDLDSIYMQALQLMSGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+P DL P GTYFP E ++GRPGF T+L+++ + +++D + + L+
Sbjct: 108 VFLTPDDLVPFYAGTYFPVEPRFGRPGFLTVLQRILQFYRQEKDKIEDMKGQILAALT-T 166
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
LS + +P +L ++ + L+ + G+ +FP Q++L ++
Sbjct: 167 LSDLVPEDHIPPDLLRSGIPKIQPLLANA--------GAVQQFPMMPYAQLVLRSARFDP 218
Query: 199 DTGKSGEAS-------EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 251
G G + G +VL GGI DHV GGFHRY+VD W VPHFEKML
Sbjct: 219 PEGIPGSPTALERAKERGMALVL--------GGIFDHVAGGFHRYTVDPTWTVPHFEKML 270
Query: 252 YDQGQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 310
YD GQ+ + ++ +D R ++++ R+M P G ++A+DADS
Sbjct: 271 YDNGQILEFLSELWAHGIQDAAIERAVRLTVEWVAREMTAPAGYFYAAQDADSFARREDA 330
Query: 311 RKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 369
+EG FYVW +E++D+L E ++ ++L P GN P + EL
Sbjct: 331 EPEEGEFYVWRWQELQDLLDEETFRALQQAFFLLPGGNFP----DRPGCIVLQRRQGGEL 386
Query: 370 NDSSASASKLGMPLEKYLNILGECRRKL-----FDVRSKRPR-------PHLDDKVIVSW 417
+A + +Y G R+ D +S R + P D K+IVSW
Sbjct: 387 PPEVETALTTHLFRARY----GSTERRTPFPLAVDAQSARRQSWPGRIPPVTDTKMIVSW 442
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 477
NGL+IS ARA ++ E +Y+ +A AA FI QT L
Sbjct: 443 NGLMISGLARAYQVFGEE----------------DYLRLALRAAQFILSQQRHPQTGSLL 486
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEF-------GSGTKWLVWAIELQNTQDELFLD 530
+G ++ P +DYA LI LLDL++ S WL AI LQ D D
Sbjct: 487 RLNYDGTAQVPAQSEDYALLIKALLDLHQACLPRTGDPSSQYWLEAAIRLQQEMDTRLWD 546
Query: 531 REGGGYFNTTGED-PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 589
GGYF + + P +L+R KE D A P+ N V+V NLVRLA+I Y + AE
Sbjct: 547 EARGGYFVSDAQSTPELLVREKEFQDNATPAANGVAVANLVRLAAITGDLD---YLERAE 603
Query: 590 HSLAVFETRLKDMAMAVPLMCCAAD 614
+L F + P + D
Sbjct: 604 QALKTFAHIMSTQPRVCPSLFVGLD 628
>gi|297192427|ref|ZP_06909825.1| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
25486]
gi|297151361|gb|EDY61872.2| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
25486]
Length = 678
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 229/710 (32%), Positives = 329/710 (46%), Gaps = 102/710 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWCHV+ ESFED A +N+ FV+IKVDREERPDVD VYM VQA G GGWP+S
Sbjct: 54 SSCHWCHVLAHESFEDAETAAYMNEHFVNIKVDREERPDVDAVYMEAVQAATGQGGWPMS 113
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
V+++ D +P GTYFPP ++G P F+ +L V DAW +RD + + L+ A
Sbjct: 114 VWMTADGEPFYFGTYFPPAPRHGMPSFRQVLEGVSDAWTGRRDEVGEVAQRIASDLA-AR 172
Query: 140 SASASSNKLP--DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
S + +P +EL Q L L++ YD R GGFG APKFP + ++ +L H +
Sbjct: 173 SLVVGGDGVPGEEELAQALL-----GLTRDYDERHGGFGGAPKFPPSMVLEFLLRHHAR- 226
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 227 --TGAEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALL 280
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
VY + T + + D+L R++ G SA DADS +G EGAF
Sbjct: 281 CRVYAHLWRATGSDLARRVALETADFLVRELRTSEGGFASALDADSDTADGG--HAEGAF 338
Query: 318 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVWT ++ ++LGE E + + G F+ + ++ L A A
Sbjct: 339 YVWTPAQLREVLGEEDGARAAELFAVTEEGT------------FEEGSSVLRLPHGEADA 386
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ R++L R +RPRP DDKV+ +WNGL I++ A
Sbjct: 387 ---------------DLRQRLLAAREERPRPGRDDKVVAAWNGLAIAALAETGAFFG--- 428
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAAS-FIRRHL-YDEQTHRLQHSFRNGPSKA-PGFLDD 493
R + +E A AA +R H+ ++ RL + ++G A G L+D
Sbjct: 429 -------------RPDLVERATEAADLLVRVHMDFEAGGVRLHRTSKDGRLGANAGVLED 475
Query: 494 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRV 550
YA + G L L G WL +A L + + +DR EG ++T D L+R
Sbjct: 476 YADVAEGFLALAAVGGEGSWLEFAGFLLD----MVMDRFTGEGCALYDTA-HDAEPLIRR 530
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAG---SKSDYYRQNAEHSLAVFETRLKDMAMAVP 607
+D P+ N+ A+++ + S+ +R AE +L V +K + P
Sbjct: 531 PQD-----PTDNAAPSGWSAAAAALLLYSAHTGSEAHRTAAEGALGV----VKGLGPRAP 581
Query: 608 L-----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE 662
+ A +L P + V +VG + A V +P D++E
Sbjct: 582 RFIGWGLAAAEALLDGP--REVAVVGRPGDPATRELHLTALMGTAPGAAVAVGEP-DSDE 638
Query: 663 MDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+ N S A A VC+ F C P TD L L +
Sbjct: 639 FPLLRDRPLVNGSSA----------AYVCRGFVCDSPTTDATELARKLTD 678
>gi|425470696|ref|ZP_18849556.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9701]
gi|389883513|emb|CCI36064.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9701]
Length = 692
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 211/623 (33%), Positives = 315/623 (50%), Gaps = 80/623 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTDEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
L SA + L + +L + + + P FP + L S+
Sbjct: 164 LRQSAILPRAETNLAEPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
ED+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 224 DFEDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S ++ + + +++L+R+M P G ++A+DADS E +E
Sbjct: 276 QIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335
Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ + + D L + L + ++ + GN F+G+NVL
Sbjct: 336 GAFYVWSDRSLRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378
Query: 374 ASASKLGMPLEKYLNIL-------GECRRKLF-DVRSKRPRPHL----------DDKVIV 415
+LG +E L+ L + + LF R + ++ D K+IV
Sbjct: 379 RQGGELGKEIENILDKLFIRRYGSSQAQLALFPPARDNQEAKNVSWPGRIPAVTDTKMIV 438
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
+WN L+IS ARA A+F+ P+ Y ++A AA FI +H + D +
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMATVAAEFILQHQWLDGRFQ 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
RL + G + +D+A+ I LLDL T WL AI+LQ D F +
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAEDE 539
Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
GGYFN T D S+ L V+E D A PS N +++ NL+RL+ + + Y AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKA 595
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F T L++ A P + A D
Sbjct: 596 LQSFSTILEESPTACPSLFVALD 618
>gi|334119055|ref|ZP_08493142.1| hypothetical protein MicvaDRAFT_2721 [Microcoleus vaginatus FGP-2]
gi|333458526|gb|EGK87143.1| hypothetical protein MicvaDRAFT_2721 [Microcoleus vaginatus FGP-2]
Length = 695
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 213/642 (33%), Positives = 313/642 (48%), Gaps = 110/642 (17%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ +KVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIAEYMNSHFIPVKVDREERPDIDSIYMQTLQMMTGQGGWPLN 107
Query: 80 VFLSPDLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD + P GGTYFP E +YGRPGF +L+ ++ +D ++ + A + L +
Sbjct: 108 VFLTPDERVPFYGGTYFPVEPRYGRPGFLEVLQAIRRFYDTEKGKVEAFKAEILGNLQQT 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ S + +L E+ Q L L ++ G P FP M+ Y L
Sbjct: 168 AALSGVTAELNREIFQKGLELNTGIVA--------GHNPGPSFP------MIPYAELALR 213
Query: 199 DTGKSGEASEGQKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ- 256
T + E+ K V +A GGI+D VGGGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 214 GTRFNFESKYDSKQVCTQRGLDLALGGIYDQVGGGFHRYTVDPTWTVPHFEKMLYDNGQI 273
Query: 257 ---LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
LAN++ + + F + I + ++L+R+M P G ++A+DADS T +
Sbjct: 274 VEYLANLW--GAGIQEPAFETAIAGTV-EWLKREMTAPTGYFYAAQDADSFNTSEEVEPE 330
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGAFYVWT E+E +L E K H+ + +GN F+GKNVL +
Sbjct: 331 EGAFYVWTYAELEQLLTPEELAEIKAHFTVSRSGN------------FEGKNVLQRRHPG 378
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVR-------------------------SKRPRP 407
S + + KLF VR R
Sbjct: 379 KLS------------DTVKTALAKLFQVRYGGNPDSVKTFPPARNNQEAKNESWPGRIPA 426
Query: 408 HLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH 467
D K+I +WN LVIS ARA+ + + EY+E+A AA+FI +
Sbjct: 427 VTDTKMIAAWNSLVISGLARAAAVFGN----------------WEYLELAVKAANFILDN 470
Query: 468 LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE----FGSGTK---------WL 514
+ + R Q +G S +DYA + LLDL++ G+G + WL
Sbjct: 471 QWTD--GRFQRLNYDGHSAVTAQSEDYALFVKALLDLHQASLTLGNGEEAKQLPNSQFWL 528
Query: 515 VWAIELQNTQDELFLDREGGGYFNTTGEDPS--VLLRVKEDHDGAEPSGNSVSVINLVRL 572
A+++Q DE E GGY+N T +D S +L+R + D A P+ N +++ +LVRL
Sbjct: 529 NKAVQVQEEFDEFLWSVELGGYYN-TAKDASGDLLVRERSYIDNATPAANGIAIASLVRL 587
Query: 573 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
A + G +Y + A+ L F + ++D A P + A D
Sbjct: 588 ALL--GPNLEYLDR-AQQGLQAFSSIVQDAPQACPSLLSAID 626
>gi|423065340|ref|ZP_17054130.1| hypothetical protein SPLC1_S240900 [Arthrospira platensis C1]
gi|406713250|gb|EKD08422.1| hypothetical protein SPLC1_S240900 [Arthrospira platensis C1]
Length = 686
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 205/619 (33%), Positives = 312/619 (50%), Gaps = 73/619 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERP++D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+P D P GGTYFP E +YGRPGF +L+ + + + ++ L + QL ++
Sbjct: 108 VFLTPGDRIPFYGGTYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ P EL ++ L+ E + + +GG P+FP + M + +L
Sbjct: 168 MILP------PSELTEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRL 216
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ K +G+ L + + GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 217 ISSPK----VDGKAACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272
Query: 258 ANVYLDAFS-LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
D +S K Y +++L+R+M P G ++A+DADS T +EGA
Sbjct: 273 LEFLADLWSDGEKQPAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGA 332
Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV---------- 365
FYVWT++E+E L + + + +GN F+GK V
Sbjct: 333 FYVWTNQELETFLSPAEFGELQAQFTVTKSGN------------FEGKTVLQRWNCDELE 380
Query: 366 -LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPR-PHL-DDKVIVSWNGLVI 422
LIE + A + G P + + R R P + D K+IV+WN L+I
Sbjct: 381 PLIETALAKLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTDTKMIVAWNALMI 440
Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFR 481
S A+A+++L D EY+E+A AA F+ H + D++ HR+ +
Sbjct: 441 SGLAKAARVL----------------DNSEYLELATKAAKFVLEHQWVDDRFHRVNY--- 481
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFG-----SGTKWLVWAIELQNTQDELFLDREGGGY 536
+G +DYA LI L+DL++ WL A+++QN D+ E GGY
Sbjct: 482 DGKVAVLSQSEDYALLIKALIDLHQASLQHPELADFWLTNAVKVQNEFDQYLWSVELGGY 541
Query: 537 FNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
FNT +D ++L+R + D A P+ N V++ NLVRL + ++ Y A +L F
Sbjct: 542 FNTALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRALQALEAF 598
Query: 596 ETRLKDMAMAVPLMCCAAD 614
+ ++ A P + A D
Sbjct: 599 ASVMRQSPQACPSLFVAFD 617
>gi|254409993|ref|ZP_05023773.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183029|gb|EDX78013.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 695
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 223/714 (31%), Positives = 337/714 (47%), Gaps = 121/714 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDPAIAQYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P D P GGTYFP E +YGRPGF +L+ ++ +D ++ L + L ++
Sbjct: 108 IFLTPEDRVPFYGGTYFPVEPRYGRPGFLQVLQAIRRFYDVEKTKLQNFKDEILGHLQQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ AS +L LR ++ + DS G +G P FP + L + E
Sbjct: 168 VLLPASG-----QLTAELLRQGMDKTIRIVDS--GSYG--PSFPMIPYADLALRGIRFQE 218
Query: 199 DTGKSG-EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
T +AS + + L AKGGI+DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 219 MTEVDAYQASRSRGLDL------AKGGIYDHVAGGFHRYTVDATWTVPHFEKMLYDNGQI 272
Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ +S+ K+ + + +L R+M G ++A+DADS A +EGA
Sbjct: 273 VEYLANLWSVGIKEAAFERAISGTVQWLTREMTASSGYFYAAQDADSFTEPSAAEPEEGA 332
Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----ELN 370
FYVW+ E++ +L E +E + + P GN F+G+NVL +L+
Sbjct: 333 FYVWSYAELQQLLTAEELAELQEQFTVTPEGN------------FEGQNVLQRRYSDQLS 380
Query: 371 DSSASA------SKLGMP---LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 421
D+ +A ++ G P LE + K + + P D K+IV+WN L+
Sbjct: 381 DTLETALAKLFTARYGSPPDSLETFPPAQNNQEAKTKNWSGRIP-AVTDTKMIVAWNSLM 439
Query: 422 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSF 480
IS ARA + + + EY+E+A +AA FI + + D++ HRL +
Sbjct: 440 ISGLARAYGVFR----------------KPEYLELATTAAKFILENQWVDQRFHRLNY-- 481
Query: 481 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGT-------------KWLVWAIELQNTQDEL 527
G + +DYA I LLDL++ G WL AI++Q+ DE
Sbjct: 482 -EGEASILAQSEDYALFIKALLDLHQASLGLATAQESSQSPIPDSWLEEAIKVQDEFDEY 540
Query: 528 FLDREGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
E GY+N + +L+R + D A P+ N V++ NLVRL + +++ Y
Sbjct: 541 LWSVELAGYYNAANDSSGDLLIRERSYTDNATPAANGVAIANLVRLTLL---TENLAYLD 597
Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAADML--SVPSRKHVVLVGHKSSVDFENMLAAAHA 644
AE +L F + + + + P + A D S R +V + + F
Sbjct: 598 RAEVALNAFSSVMNQSSQSCPSLFTALDWFRNSTLIRTNVAQILSLMTQYFP-------- 649
Query: 645 SYDLNKTVIHIDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSP 698
T+ I+P+ E V LVCQ SC P
Sbjct: 650 -----ATMYRIEPSLPE-----------------------NAVGLVCQGLSCKP 675
>gi|425465473|ref|ZP_18844782.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9809]
gi|389832278|emb|CCI24243.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9809]
Length = 692
Score = 305 bits (782), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 208/622 (33%), Positives = 311/622 (50%), Gaps = 78/622 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
L SA + L +L + + + P FP + L S+
Sbjct: 164 LRQSAILPRAETNLAAPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYANLALQGSRFGD 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+D+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 224 DFDDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S ++ + + +++L+R+M P G ++A+DADS E +E
Sbjct: 276 QIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335
Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ E+ D L + L + ++ + GN F+G+NVL
Sbjct: 336 GAFYVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378
Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
+LG +E L+ L G + +L R D K+IV
Sbjct: 379 RQGGELGKEIEDMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
+WN L+IS ARA A+F+ P+ Y ++A AA FI +H + D +
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMATVAAEFILKHQWLDGRFQ 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
RL + G + +D+A+ I LLDL T WL AI+LQ D F +
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAGDE 539
Query: 534 GGYFNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
GGYFNT + ++LR + D A PS N +++ NL+RL+ + + Y AE +L
Sbjct: 540 GGYFNTASDHSLDLILRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKAL 596
Query: 593 AVFETRLKDMAMAVPLMCCAAD 614
F T L++ A P + A D
Sbjct: 597 QSFSTILEESPTACPSLFVALD 618
>gi|425459385|ref|ZP_18838871.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9808]
gi|389822926|emb|CCI29290.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9808]
Length = 692
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 209/623 (33%), Positives = 311/623 (49%), Gaps = 80/623 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ A + L ++
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSKFTAEMLGALRQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
+ L D + L E + +G P FP + L S+
Sbjct: 168 AILPRAETNLADP---SLLATGIETNTAVIQVNPNNYGR-PSFPMIPYSHLALQGSRFGD 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
ED+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 224 DFEDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S ++ + + +++L+R+M P G ++A+DADS E +E
Sbjct: 276 QIVEYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335
Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ + + D L + L + ++ + GN F+G+NVL
Sbjct: 336 GAFYVWSDRSLRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378
Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
+LG +E L+ L G + +L R D K+IV
Sbjct: 379 RQGGELGKEIENLLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
+WN L+IS ARA A+F+ P+ Y +++ AA FI +H + D +
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMSTQAAEFILQHQWLDGRFQ 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
RL + G + +D+A+ I LLDL T+WL AI+LQ D F +
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETRWLEAAIDLQGEFDRWFWAGDE 539
Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
GGYFN T D S+ L V+E D A PS N +++ NLVRL+ + + Y AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKA 595
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F T L+ A P + A D
Sbjct: 596 LQSFSTILEQSPTACPSLFVALD 618
>gi|300789899|ref|YP_003770190.1| hypothetical protein AMED_8085 [Amycolatopsis mediterranei U32]
gi|384153415|ref|YP_005536231.1| hypothetical protein RAM_41535 [Amycolatopsis mediterranei S699]
gi|399541779|ref|YP_006554441.1| hypothetical protein AMES_7963 [Amycolatopsis mediterranei S699]
gi|299799413|gb|ADJ49788.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340531569|gb|AEK46774.1| hypothetical protein RAM_41535 [Amycolatopsis mediterranei S699]
gi|398322549|gb|AFO81496.1| hypothetical protein AMES_7963 [Amycolatopsis mediterranei S699]
Length = 879
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 226/709 (31%), Positives = 326/709 (45%), Gaps = 94/709 (13%)
Query: 10 TKTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 65
K R L++ CHWCHVM ESFED G A L+N FV+IKVDREERPD+D VYM
Sbjct: 257 AKRRNVPILLSVGYAACHWCHVMAHESFEDAGTAALMNANFVTIKVDREERPDIDAVYMA 316
Query: 66 YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA 125
QA+ G GGWP++ FL+PD +P GTY+PP + G P F+ +L V +W ++ D L
Sbjct: 317 ATQAMTGQGGWPMTCFLTPDGEPFHCGTYYPPSPRPGMPSFRQLLVAVVQSWQERPDELV 376
Query: 126 QSGAFAIEQLSEALSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRP 184
+ L+E + L + + A+ A +L + D GGFG APKFP
Sbjct: 377 DGAKQIVAHLAE------QTGPLKESVVDEAVLAGAVGKLQQEADRVNGGFGRAPKFPPS 430
Query: 185 VEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 244
+ ++ +L H E TG + S +V T + MA+GG++D + GGF RYSVD W V
Sbjct: 431 MVLEFLLRHH---ERTGSAVALS----LVDSTAEAMARGGLYDQLAGGFARYSVDAEWIV 483
Query: 245 PHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA 304
PHFEKMLYD L Y + T + ++L + P G S+ DAD+
Sbjct: 484 PHFEKMLYDNALLLRFYAHLWRRTGSATALRVATGTAEFLFESLRTPEGGFASSLDADTE 543
Query: 305 ETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 364
EG T YVWT ++ +++G+ + E + + G + +G +
Sbjct: 544 GVEGLT-------YVWTPAQLREVVGDDSA--AELFGVTKEGTFE-----------EGAS 583
Query: 365 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 424
L D L P+ R KL + R+KRP+P DDKVI SWNGL I++
Sbjct: 584 TLRLFGD-------LPEPM----------RVKLLEARAKRPQPGRDDKVIASWNGLAITA 626
Query: 425 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG 483
A A L DR +++E A AA + R H+ D RL+ S R+G
Sbjct: 627 LAEAGVAL----------------DRPQWIEWAREAAELLLRVHVVD---GRLRRSSRDG 667
Query: 484 -PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTG 541
++ G L+DYA + G L L++ KWL A L + F + G YF+T
Sbjct: 668 VVGESAGVLEDYACVADGFLALHQATGAAKWLTEATRLLDLALAHFASPDVPGAYFDTAD 727
Query: 542 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 601
+ +++ R + D A PSG S L+ +++ + S YR+ AE +L +R
Sbjct: 728 DAETLVQRPADPGDNASPSGASALAGALLTASALAGHADSGRYREAAERAL----SRAGV 783
Query: 602 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTE 661
+A VP A LSV + V + +L AA V+ +P D
Sbjct: 784 LAGRVPRF--AGHWLSVAEARQAGPVQVAVAGASPELLRAAARGIHGGGVVLAGEP-DAP 840
Query: 662 EMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ +A A VC+ + C PVT L L
Sbjct: 841 GVPL----------LADRPLVDGAPAAYVCRGYVCDRPVTSAAELTARL 879
>gi|425435449|ref|ZP_18815900.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9432]
gi|389679973|emb|CCH91261.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9432]
Length = 692
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 212/623 (34%), Positives = 313/623 (50%), Gaps = 80/623 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
L SA + L +L + + + P FP + L S+
Sbjct: 164 LRQSAILPRAETNLADPSLLATGIETNTAVIQVNPNNYGRPSFPMIPYSHLALQGSRFGD 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
ED+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 224 DFEDSLQQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S ++ + + +++L+R+M P G ++A+DADS E +E
Sbjct: 276 QIVEYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335
Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ + + D L + L + ++ + GN F+G+NVL
Sbjct: 336 GAFYVWSDRSLRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378
Query: 374 ASASKLGMPLEKYLNIL-------GECRRKLF-DVRSKRPRPHL----------DDKVIV 415
+LG +E L+ L + + LF R + ++ D K+IV
Sbjct: 379 RQGGELGKEIENILDKLFIRRYGSSQAQLALFPPARDNQEAKNVSWPGRIPAVTDTKMIV 438
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
+WN L+IS ARA A+F+ P+ Y ++A AA FI +H + D +
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMATQAAEFILQHQWLDGRFQ 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
RL + G + +D+A+ I LLDL T WL AI+LQ D F +
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAGDE 539
Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
GGYFN T D S+ L V+E D A PS N +++ NLVRL+ + + Y AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKA 595
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F T L+ A P + A D
Sbjct: 596 LQSFSTILEQSPTACPSLFVALD 618
>gi|217978724|ref|YP_002362871.1| hypothetical protein Msil_2586 [Methylocella silvestris BL2]
gi|217504100|gb|ACK51509.1| protein of unknown function DUF255 [Methylocella silvestris BL2]
Length = 691
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 226/705 (32%), Positives = 340/705 (48%), Gaps = 88/705 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE A ++N+ FV+IKVDREERPD+D +YM + A GGWPL++
Sbjct: 55 ACHWCHVMAHESFEDEATAAVMNELFVNIKVDREERPDIDHIYMQALHAFGERGGWPLTM 114
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P +P GGTYFP ++YGRP F T+LR V A+ ++ +A + L++A +
Sbjct: 115 FLTPKGEPFWGGTYFPKTEQYGRPAFVTVLRTVAHAFHEEPHRIAANVGAVRRNLTKAPT 174
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
AS L + A QL + D+ GG APKFP I ML+ +
Sbjct: 175 ASGGDFSLAQ------MDDIAAQLVTAIDTVDGGLKGAPKFPN-TPILEMLWRAG----- 222
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
++G A+ Q M L L+ M++GGI+DH+GGG+ RYS D+RW VPHFEKMLYD Q+
Sbjct: 223 ARTGTAAYRQAMRL-ALEKMSEGGIYDHLGGGYARYSTDDRWLVPHFEKMLYDNAQILEC 281
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+ KD + R+ + +L R+M PGG ++ DADS EG EG FYVW
Sbjct: 282 LALCYDAFKDDLFLQRARETVAWLEREMTNPGGAFSASLDADS---EGI----EGKFYVW 334
Query: 321 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 379
T E+ + LG + A F + Y GN D H G +L L + +A +
Sbjct: 335 TFDELVEPLGADEARFFGKFYNAARIGN-----WVDAHYP-NGVTILNRLESARPTAEEE 388
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
L R++LFD R R P LDDK++ WNGL+I++ A+ +
Sbjct: 389 AR--------LAPLRQRLFDRREARVHPGLDDKIMADWNGLMIAALVNAATL-------- 432
Query: 440 MFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQT--HRLQHSFRNGPSKAPGFLDDYAF 496
+ ++ +A A +FI LY ++ RL HSFR G PG DY+
Sbjct: 433 --------TGEHRWIALAARAYNFIVATMLYRDEAGLTRLAHSFRAGVLVKPGLALDYST 484
Query: 497 LISGLLDLY------EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
++ L LY EF + +L A T + +D + + V++++
Sbjct: 485 MMRAALALYEVRNLKEFAATRDYLSDARAFAQTLEACHIDPDSRLITMAAKDAADVIVKL 544
Query: 551 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV--PL 608
D A P+ + V + L+RLA V+G + R +A +K M ++ +
Sbjct: 545 APTADDAIPNAHPVYLGALIRLAG-VSGDQGALDRADA---------LIKAMGPSIRGNI 594
Query: 609 MCCAADMLSVPSR---KHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDF 665
+ A + ++ R + +V G + +E L A +++ V+ +D D
Sbjct: 595 VGHAGTLNAIDLRLRVREIVTAGPARAPLYEAALGAPF----IDRIVMDLDRPD------ 644
Query: 666 WEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
E + + + A+ A + A VC +CS P D +L LL
Sbjct: 645 --EIPAAHPARAQAEL-AGEAAAFVCAGGACSLPARDVDALRQLL 686
>gi|320589398|gb|EFX01859.1| duf255 domain containing protein [Grosmannia clavigera kw1407]
Length = 836
Score = 305 bits (780), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 208/634 (32%), Positives = 310/634 (48%), Gaps = 71/634 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CH+CH +SF VA++LN F+ I VDREERPD+D +Y Y+Q + GWP++VF
Sbjct: 97 CHYCHTTTQDSFSSPAVAEILNTSFIPIVVDREERPDIDAIYWNYLQLVNSSAGWPINVF 156
Query: 82 LSPDLKPLMGGTYFPPEDKYGRP-------------GFKTILRKVKDAW--------DKK 120
L+P+L+P+ GGTY+P G GF IL+K++ +W ++
Sbjct: 157 LTPELEPVFGGTYWPGPGSEGSVRDGQEDGGEDEMIGFLGILKKLRQSWTDREAQCREEA 216
Query: 121 RDMLAQSGAFAIEQ-------LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG 173
R+ + Q FA E L ++ A +L + L QL K++D G
Sbjct: 217 RETVVQLRKFAAEGTLGPRGLLRPTVAEGAPYLSRDLDLDIDQLDDAYTQLKKTFDPVNG 276
Query: 174 GFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 230
GFG PKF P + +L ++ EA +M LFTL+ + G+HDH+
Sbjct: 277 GFGVVPKFVTPAKYSFLLKLGSFPNVVQGIIGDAEAKNAVQMALFTLRKLQDSGLHDHLR 336
Query: 231 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF----------SLTKDVFYSYICRDI 280
GGF R S W +PHFEK++ D L ++YLDA+ + D ++ + +
Sbjct: 337 GGFSRASHTINWTLPHFEKLVPDNALLLSLYLDAWLYGLRTSGTGAKGTDAEFADVVYAL 396
Query: 281 LDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH------- 332
DYL I GG S+E ADS G +EGA+YVWT +E + ++G
Sbjct: 397 ADYLSSSPIRLEGGGFASSEAADSYYRRGDNHTREGAYYVWTRREFDAVVGGQRSENDLD 456
Query: 333 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 392
++ + GN D R DP++EF +NVL D+S A + G+ L ++
Sbjct: 457 TRAAAAYWNVLEHGNVD--REDDPNDEFINQNVLYVNKDASEVARQFGISRSDVLRVVKT 514
Query: 393 CRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 451
++KL R K R RP D KV V+ NG+VI++ AR +L F+ P G +
Sbjct: 515 SKKKLAAHREKERVRPAADRKVTVANNGVVIAALARVGAVLVHGG----FD-PANG---E 566
Query: 452 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSG 510
+Y+ A SAA FI+ +L+D Q L ++ G GF +DYA LI GLL+LYE
Sbjct: 567 KYISAARSAARFIKANLWDVQDKCLFRTYSYGQKGTNCGFAEDYAVLIEGLLELYEATGE 626
Query: 511 TKWLVWAIELQNTQDELFLD----------REGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
+WL WA +LQ Q E F D GG++ T+ +P +LR+K+ D P+
Sbjct: 627 LEWLQWADQLQQRQIEQFYDGVDMPPTSSHSASGGFYRTSEHEPFNILRIKDGMDTTLPA 686
Query: 561 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
N V+ NL RL S++ + + + HS V
Sbjct: 687 TNGVAASNLFRLGSLLGDEEYSHLARETIHSFEV 720
>gi|423133250|ref|ZP_17120897.1| hypothetical protein HMPREF9715_00672 [Myroides odoratimimus CIP
101113]
gi|371649306|gb|EHO14787.1| hypothetical protein HMPREF9715_00672 [Myroides odoratimimus CIP
101113]
Length = 667
Score = 305 bits (780), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 192/560 (34%), Positives = 289/560 (51%), Gaps = 50/560 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ VA L+N+ F+SIKVDREE P +D YM +Q + GGWPL+V
Sbjct: 48 TCHWCHVMEKESFENQEVADLMNEHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNV 107
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
PD +P+ GGTYF R + L ++ + +KRD + FA QL E +S
Sbjct: 108 VCLPDGRPIWGGTYFK------RQNWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGIS 157
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S + E + L E KS+D +GG+ PKF P +LY KK
Sbjct: 158 I-LSQAPIAQEDSRFNTELVLENWKKSFDWEYGGYTRTPKFMMPTN---LLYLQKK---- 209
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G + + + TL MA GG+ D V GGF RYSVD +WH+PHFEKMLYD QL +V
Sbjct: 210 GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSV 269
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y D + T + Y + +D++ + G +SA DADS ++ + +EGAFYVW
Sbjct: 270 YADGYKRTHNKLYKEVIDKTIDFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYVW 327
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T +E+++++ + LF + + G+ + S+ VLI+ + A++
Sbjct: 328 TIEELKELVQQDFPLFSTVFNINSFGHWENSQY-----------VLIQTRELIDIANENN 376
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+PLE N + L R+ RP+P LDDK + SWN + I+ A ++ A
Sbjct: 377 IPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA---- 432
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
Y+E A++ FI +L+ E+ L+ ++++G +K FLDDYAF I G
Sbjct: 433 ------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQG 479
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG-GYFNTTGEDPSVLLRVKEDHDGAEP 559
L+ L+E +++ A L + + FLD E YFN ++ ++ + E D P
Sbjct: 480 LIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFNKHNQEDTITPAI-ETEDNVIP 538
Query: 560 SGNSVSVINLVRLASIVAGS 579
S N++ +NL +L + S
Sbjct: 539 SSNAIMAMNLYKLGLLYENS 558
>gi|384135742|ref|YP_005518456.1| hypothetical protein TC41_2025 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
gi|339289827|gb|AEJ43937.1| protein of unknown function DUF255 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
Length = 626
Score = 305 bits (780), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 209/602 (34%), Positives = 289/602 (48%), Gaps = 54/602 (8%)
Query: 27 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 86
+M ESFEDE VA +LN+ +V+IKVDREERPD+D +YMTY QAL G GGWPL++ ++PD
Sbjct: 1 MMAHESFEDEKVAAILNEHYVAIKVDREERPDIDHIYMTYCQALQGEGGWPLTIIMTPDG 60
Query: 87 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 146
P GTYFP +YG PG IL+++ W R L ++ E++ A
Sbjct: 61 YPFFAGTYFPKTPRYGPPGLIQILQEIARLWQTDRARLERASRSMAERMQPLFEGQAGEA 120
Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
+ D Q + L ++D +GGFG APKFP +Q +L +++ +
Sbjct: 121 RGRDAADQ-----AYQALEAAFDHEYGGFGPAPKFPTFHRVQFLLRYARLRPN------- 168
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
M L TL+ + +GGI DHVGGG RYS D W VPHFEKMLYD Y DA+
Sbjct: 169 ERAAAMALSTLRAIQRGGIVDHVGGGMARYSTDPFWRVPHFEKMLYDNALALAAYADAYV 228
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
KD + R + + R+M P G +SA DADSA EG FY+W ++V
Sbjct: 229 HAKDPAFLRFVRQTVAFFDREMQSPEGLYYSAVDADSA-------GGEGRFYLWRPEDVI 281
Query: 327 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLE 384
LG E LF Y + GN F+G NV ++ D +A A+ GM E
Sbjct: 282 AALGPEDGELFNAFYDITEAGN------------FEGANVPNYIDQDPAAFAASRGMTEE 329
Query: 385 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 444
+ L + KL VR R RP +DDK + +WN L+ ARA A
Sbjct: 330 ELWQKLDDLNAKLRAVRDGRERPAIDDKCLTAWNALMAYGLARAGLAFGEMA-------- 381
Query: 445 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 504
++ A + I R L RL +R+G + + DD+A+L++ L+L
Sbjct: 382 --------WVNRATEVVAAIERILVRPDDGRLLARYRDGEAGIFAYADDHAYLVAAYLEL 433
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV-KEDHDGAEPSGNS 563
Y +L A Q QD LF D+ GGY G D L+ V K +DGA PS NS
Sbjct: 434 YRATLDRAYLDRARHWQAVQDALFWDKAQGGY-TFYGRDAESLIAVPKPVYDGAMPSANS 492
Query: 564 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 623
S NL L ++ ++ Y + L F ++ M + AA M V S +
Sbjct: 493 QSAHNLWMLHALTGDAE---YADRLDALLRAFGGDIRSAPMDCLWLVTAAMMSEVGSTEI 549
Query: 624 VV 625
V+
Sbjct: 550 VI 551
>gi|427718285|ref|YP_007066279.1| hypothetical protein Cal7507_3032 [Calothrix sp. PCC 7507]
gi|427350721|gb|AFY33445.1| hypothetical protein Cal7507_3032 [Calothrix sp. PCC 7507]
Length = 690
Score = 305 bits (780), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 214/628 (34%), Positives = 310/628 (49%), Gaps = 87/628 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDLAIAQYMNTNFLPIKVDREERPDLDSIYMQALQMMNGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
VFLSP DL P GTYFP E +YGRPGF +L+ ++ +D + + L Q A +E L S
Sbjct: 108 VFLSPEDLVPFYAGTYFPLEPRYGRPGFLQVLQAIRRYYDTETEDLRQRKAVIVESLLTS 167
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L ++ + +EL + C ++ FP M+ Y
Sbjct: 168 AVLQDGSTQDIQENELLRQGWETCTGVITPHQQGN--------SFP------MIPYAELA 213
Query: 197 LEDTGKS-GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
L T + +G+++ +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 214 LRGTRFNFASHYDGKQICQQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNG 273
Query: 256 QLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q+ + +S + + F I + + ++L+R+M P G ++A+DADS A +
Sbjct: 274 QIVEYLANLWSAGVQEPAFARAIAKTV-EWLQREMTAPAGYFYAAQDADSFINPTAVEPE 332
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGAFYVWT E+ +L E ++ + + P GN F+ KNVL L+
Sbjct: 333 EGAFYVWTYSELAKLLTPEELTELQQQFTVTPHGN------------FESKNVLQRLH-- 378
Query: 373 SASASKLGMPLEKYLNILGECRRKL-------FDVRSK-----------RPRPHLDDKVI 414
+ +L LEK L L + R + F S R D K+I
Sbjct: 379 ---SGELSKTLEKALGKLFKARYGITPESLDTFPPASNNQEAKTNNWPGRIPSVTDTKMI 435
Query: 415 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQT 473
V+WN L+IS ARAS + F P+ Y+++A AA+FI D +
Sbjct: 436 VAWNSLMISGLARASGV---------FQQPL-------YLQIAARAANFIWDNQFVDGRF 479
Query: 474 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG------SGTKWLVWAIELQNTQDEL 527
HRL + G +DYA I LLDL++ S + WL AI LQ+ D
Sbjct: 480 HRLNYV---GQPNVLAQSEDYALFIKALLDLHQATLLIGNESASFWLEKAIALQDEFDAY 536
Query: 528 FLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 586
E GGY+N + + +++R + D A PS N V++ NLVRL + + + +Y
Sbjct: 537 LWSVELGGYYNASIDASQDLIVRERSYADNATPSANGVAIANLVRLTLL---TDNLHYLD 593
Query: 587 NAEHSLAVFETRLKDMAMAVPLMCCAAD 614
AE L F+T + A P + A D
Sbjct: 594 LAEQGLKAFKTVMSRSPQACPSLFTALD 621
>gi|164422571|ref|XP_957963.2| hypothetical protein NCU09980 [Neurospora crassa OR74A]
gi|157069724|gb|EAA28727.2| hypothetical protein NCU09980 [Neurospora crassa OR74A]
Length = 827
Score = 305 bits (780), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 209/660 (31%), Positives = 326/660 (49%), Gaps = 97/660 (14%)
Query: 23 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
H + + SF + VA LN F+ + +DR+ERPD+D +Y Y +A+ GGWPL++FL
Sbjct: 126 HIGFLADHHSFSNNAVAAFLNSSFIPVIIDRDERPDLDTIYQNYSEAVNATGGWPLNLFL 185
Query: 83 SPDLKPLMGGTYFP------------------------PEDKYGRPG-------FKTILR 111
+PDL P+ GGTY+P PE G F I +
Sbjct: 186 TPDLYPIFGGTYWPGPGTEHSLAAARGGASGVGGVAATPEASSINGGGEESYNDFLAIAK 245
Query: 112 KVKDAWDKKRDM--------------LAQSGAFA---IEQLSEALSASASSNKLPDELPQ 154
K+ W ++ + AQ G F+ E + +A+ + +L
Sbjct: 246 KIHKFWVEQEERCRREAFEMLHKLQDFAQEGTFSGTPAEPVPVVAPVAAADVEAGADLDL 305
Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKSGEASEGQK 211
+ L +++ K +D GFG+ PKFP P + +L + +++ D E
Sbjct: 306 DQLDEALDRIFKMFDPVDCGFGT-PKFPNPARLSFLLRLAQFPREVRDVVGDKEVENAAS 364
Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF------ 265
M TL+ + GG+ DHVG GF R+SV W +PHFEKM+ + L VYLDA+
Sbjct: 365 MARSTLRRIRDGGLRDHVGAGFMRFSVTSDWSMPHFEKMVGENALLLGVYLDAWLGRVQS 424
Query: 266 -----SLTKDVFYSYICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYV 319
L+ + ++ + D+ DYL +I GG ++E ADS +G +EGA+Y+
Sbjct: 425 SAAETRLSLEDEFADVVIDLADYLTSPLIQFSGGGFVTSEAADSFYRKGDRHMREGAYYL 484
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL--NDSSASAS 377
WT +E +D++G Y + ++ R DPH+EF +NVL + D+ A +
Sbjct: 485 WTRREFDDVVGPAGSAEVAAAYWNVLEDGNIPRDQDPHDEFINQNVLCSVWGKDTQALSK 544
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ G+P+ I+ + R +L R + RPRP D+KV+V NG+VIS+ AR + +++
Sbjct: 545 QFGIPVNDVKKIIAKARERLRAHREQERPRPARDEKVVVGVNGMVISALARTAAVVRE-- 602
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE---QTHRLQHSFR-NGPSKAPGFLD 492
+ + ++Y+E A+ AA+FI+ +L+ + Q+ ++ F N PS F D
Sbjct: 603 --------LDKTKSQKYLEAAQQAAAFIKENLWVQDGTQSRKVLKRFWFNQPSDTRAFAD 654
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE------------GGGYFNTT 540
DYAFLI GLLDLYE KWLVWA ELQ+ Q ELF D GG+++T
Sbjct: 655 DYAFLIEGLLDLYEATLEVKWLVWAKELQDVQSELFYDTPVVGSTPSLRHSYTGGFYSTE 714
Query: 541 GEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
S +LR+K D ++PS N+VS NL RL +I+ + + RQ E ++ FE +
Sbjct: 715 EATLSHTILRLKSGMDKSQPSTNAVSASNLFRLGTIL--DEKPFIRQAIE-TINAFEAEI 771
>gi|425446506|ref|ZP_18826509.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9443]
gi|389733246|emb|CCI02963.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9443]
Length = 689
Score = 304 bits (779), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 214/621 (34%), Positives = 313/621 (50%), Gaps = 76/621 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
L SA + L +L E+ + +G P FP + L S+
Sbjct: 164 LRQSAILPRAETNLAAPSLLATGIEKNTAVIRVNPNNYGR-PSFPMIPYSHLALQGSRFG 222
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
ED S + Q+ + +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 223 EDFDDSLRQAAYQRG-----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQI 277
Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ +S ++ + + +++L+R+M P G ++A+DADS E +EGA
Sbjct: 278 VEYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGA 337
Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVW+ E+ D L + + + ++ + GN F+G+NVL
Sbjct: 338 FYVWSDLELRDYLSTEELGVLQANFTVTAEGN------------FEGRNVL-----QRRQ 380
Query: 376 ASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSW 417
+LG +E L+ L G + +L R D K+IV+W
Sbjct: 381 GGELGEEIENMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAW 440
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRL 476
N L+IS ARA A+F P+ Y ++A AA FI +H + D + RL
Sbjct: 441 NSLMISGLARA---------FAVFGEPL-------YWQMAAQAAEFILKHQWLDGRFQRL 484
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGG 535
+ G + +D+A+ I LLDL T+WL AI+LQ D F + GG
Sbjct: 485 NY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETRWLEAAIDLQGEFDRWFWAEDEGG 541
Query: 536 YFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 593
YFNT D S+ L V+E D A PS N +++ NL+RL+ + + Y AE +L
Sbjct: 542 YFNTAS-DHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQ 597
Query: 594 VFETRLKDMAMAVPLMCCAAD 614
F T L+ A P + A D
Sbjct: 598 SFSTILEQSPTACPSLFVALD 618
>gi|428211294|ref|YP_007084438.1| thioredoxin domain-containing protein [Oscillatoria acuminata PCC
6304]
gi|427999675|gb|AFY80518.1| thioredoxin domain protein [Oscillatoria acuminata PCC 6304]
Length = 691
Score = 304 bits (779), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 209/622 (33%), Positives = 310/622 (49%), Gaps = 80/622 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F E +A +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSSEAIASYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP E +YGRPGF +L+ ++ +D ++ LA + L +A
Sbjct: 108 IFLTPDDLIPFYGGTYFPVEPRYGRPGFLELLQAIRRYYDLEKGKLAAFKEEIMGHLQQA 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ + + LP+EL L ++ +G P FP MM Y L+
Sbjct: 168 ATLPGTED-LPEELLWKGLETSVTVIAH---REYG-----PSFP------MMPYAQVVLQ 212
Query: 199 DTGKSGEASEGQKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ- 256
T E+ ++ + +A GGI+D V GGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 213 STRFDRESEYDERSAIAQRGIDLASGGIYDAVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272
Query: 257 ---LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
LAN++ + ++ + + + +L+R+M P G ++A+DADS T +
Sbjct: 273 VEFLANLWSEGI---QEPGFEWAVAGTIQWLKREMTAPEGYFYAAQDADSFITPEDKEPE 329
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGAFYVWT +E+E +L E + ++L P GN F+GK VL N
Sbjct: 330 EGAFYVWTYQELERLLTVEEFTALNQEFFLSPEGN------------FEGKIVLKRTNLQ 377
Query: 373 SASAS-----------KLGMPLEKYLNILGECRR---KLFDVRSKRPRPHLDDKVIVSWN 418
+ S + + G E C K + + P P D K+IV+WN
Sbjct: 378 ALSPTVETALAKLFKVRYGALPEAVKTFPPACNNHEAKTHNWPGRIP-PVTDPKMIVAWN 436
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQ 477
L+IS ARA+ + + EY +A +AA+FI H + E + HRL
Sbjct: 437 SLMISGLARAAVVFGN----------------GEYATLATTAANFILDHQWVEGRFHRLN 480
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEF----GSGTKWLVWAIELQNTQDELFLDREG 533
+ +G + +DYA I LLDL + S + WL AI++Q DE E
Sbjct: 481 Y---DGQAAVLAQSEDYALFIKALLDLEQMEQVHPSNSNWLEKAIQVQEEFDEFLWSVEL 537
Query: 534 GGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 592
GGYFNT + S +++R + D A P+ N V++ +L+RL+ ++ Y A ++L
Sbjct: 538 GGYFNTAKDSSSDLIVRERSYTDNATPAANGVAIASLIRLSMF---TEDLSYLDRAFNAL 594
Query: 593 AVFETRLKDMAMAVPLMCCAAD 614
F + A P + A D
Sbjct: 595 KSFGAIMDRAPSACPSLFAALD 616
>gi|209523771|ref|ZP_03272324.1| protein of unknown function DUF255 [Arthrospira maxima CS-328]
gi|209495803|gb|EDZ96105.1| protein of unknown function DUF255 [Arthrospira maxima CS-328]
Length = 686
Score = 304 bits (779), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 204/619 (32%), Positives = 311/619 (50%), Gaps = 73/619 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERP++D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+P D P GGTYFP E +YGRPGF +L+ + + + ++ L + QL ++
Sbjct: 108 VFLTPGDRIPFYGGTYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKL 197
+ P EL ++ L+ E + + +GG P+FP + M + +L
Sbjct: 168 MILP------PSELTEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRL 216
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ K +G+ L + + GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 217 ISSPK----VDGKAACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272
Query: 258 ANVYLDAFS-LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
D +S K Y +++L+R+M P G ++A+DADS T +EGA
Sbjct: 273 LEFLADLWSDGEKQPAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGA 332
Query: 317 FYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV---------- 365
FYVWT++E+E L + + + +GN F+GK V
Sbjct: 333 FYVWTNQELETFLSPAEFGELQAQFTVTKSGN------------FEGKTVLQRWNCDELE 380
Query: 366 -LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPR-PHL-DDKVIVSWNGLVI 422
LIE + A + G P + + R R P + D K+IV+WN L+I
Sbjct: 381 PLIETALAKLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTDTKMIVAWNALMI 440
Query: 423 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFR 481
S A+A+++L D EY+E+A AA F+ H + D++ HR+ +
Sbjct: 441 SGLAKAARVL----------------DNSEYLELATKAAKFVLEHQWVDDRFHRVNY--- 481
Query: 482 NGPSKAPGFLDDYAFLISGLLDLYEFG-----SGTKWLVWAIELQNTQDELFLDREGGGY 536
+G +DYA I L+DL++ WL A+++QN D+ E GGY
Sbjct: 482 DGKVAVLSQSEDYALFIKALIDLHQASLQHPELADFWLTNAVKVQNEFDQYLWSVELGGY 541
Query: 537 FNTTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 595
FNT +D ++L+R + D A P+ N V++ NLVRL + ++ Y A +L F
Sbjct: 542 FNTALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRALQALEAF 598
Query: 596 ETRLKDMAMAVPLMCCAAD 614
+ ++ A P + A D
Sbjct: 599 ASVMRQSPQACPSLFVAFD 617
>gi|186686249|ref|YP_001869445.1| hypothetical protein Npun_R6218 [Nostoc punctiforme PCC 73102]
gi|186468701|gb|ACC84502.1| protein of unknown function DUF255 [Nostoc punctiforme PCC 73102]
Length = 685
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 208/618 (33%), Positives = 308/618 (49%), Gaps = 72/618 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A +N ++ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDSAIADYMNANYLPIKVDREERPDLDSIYMQALQMMSGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FLSP DL P GTYFP + +YGRPGF +L+ ++ +D ++ L Q A IE L
Sbjct: 108 IFLSPEDLVPFYAGTYFPVDPRYGRPGFLQVLQALRRYYDTEKAELQQRKALIIESL--- 164
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG---SAPKFPRPVEIQMMLYHSK 195
L+++ + DEL L L + +++ G S FP M+ Y
Sbjct: 165 LTSAVLQDGTTDELEDREL------LRQGWETSTGVITPGQSGNSFP------MIPYTEL 212
Query: 196 KLEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 254
L T + E+ +G+++ +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD
Sbjct: 213 ALRGTRFNFESRYDGKQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDN 272
Query: 255 GQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
GQ+ + +S ++ + + +L+R+M P G ++++DADS A +
Sbjct: 273 GQIVEYIANLWSAGVQEPAFERAVAVTVQWLKREMTAPEGYFYASQDADSFTEPTAVEPE 332
Query: 314 EGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGAFYVW+ EV+ +L E ++ + + P GN F+G+NVL N
Sbjct: 333 EGAFYVWSYSEVQQLLTPEELTELQQQFTVTPNGN------------FEGRNVLQRRNSG 380
Query: 373 SASAS-----------KLGMPLEKYLNILGECRRKLFDVRSKRPR-PHL-DDKVIVSWNG 419
SA+ + G+ E C + + R P + D K+IV+WN
Sbjct: 381 KLSATLETSLSKLFTARYGVSSELLETFPPACNNQEAKTTNWPGRIPSVTDTKMIVAWNS 440
Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQH 478
L+IS A+A+ + F P+ Y+E+A AA+FI D + RL +
Sbjct: 441 LMISGLAKAAGV---------FQQPL-------YLELAARAANFILENQFVDGRFQRLNY 484
Query: 479 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYF 537
G +DYAF + LLDL K WL AI +Q+ E E GGYF
Sbjct: 485 ---QGEPTVLAQSEDYAFFVKALLDLQASNPEHKQWLENAIAIQDEFTEFLWSVELGGYF 541
Query: 538 NTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 596
NT+ + +++R + D A PS N +++ NLVRLA + Y AE L F+
Sbjct: 542 NTSSDSSQDLIVRERSYADNATPSANGIAIANLVRLALLTDNLD---YLDLAELGLKAFK 598
Query: 597 TRLKDMAMAVPLMCCAAD 614
+ + A P + A D
Sbjct: 599 SVMHRAPQACPSLFTALD 616
>gi|166365023|ref|YP_001657296.1| six-hairpin glycosidase-like [Microcystis aeruginosa NIES-843]
gi|166087396|dbj|BAG02104.1| six-hairpin glycosidase-like [Microcystis aeruginosa NIES-843]
Length = 692
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 211/623 (33%), Positives = 311/623 (49%), Gaps = 80/623 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
L SA + L +L + + + P FP + L S+
Sbjct: 164 LRQSAILPRSETNLAAPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+D+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 224 DFDDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S ++ + + +++L+R+M P G ++A+DADS E +E
Sbjct: 276 QIVEYLANLWSAGNREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335
Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ E+ D L + L + ++ + GN F+G+NVL
Sbjct: 336 GAFYVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378
Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
+LG +E L+ L G + +L R D K+IV
Sbjct: 379 RQGGELGKEIENMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
+WN L+IS ARA A+F P+ Y ++A AA FI +H + D +
Sbjct: 439 AWNSLMISGLARA---------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQ 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
RL + G + +D+A+ I LLDL T WL AI+LQ D F +
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDE 539
Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
GGYFN T D S+ L V+E D A PS N +++ NL+RL+ + + Y AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKA 595
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F T L++ A P + A D
Sbjct: 596 LQSFSTILEESPTACPSLFVALD 618
>gi|336464974|gb|EGO53214.1| hypothetical protein NEUTE1DRAFT_126582 [Neurospora tetrasperma
FGSC 2508]
Length = 827
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 209/660 (31%), Positives = 325/660 (49%), Gaps = 97/660 (14%)
Query: 23 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
H + + SF + VA LN F+ + +DR+ERPD+D +Y Y +A+ GGWPL++FL
Sbjct: 126 HIGFLADHHSFSNNAVAAFLNSSFIPVIIDRDERPDLDTIYQNYSEAVNATGGWPLNLFL 185
Query: 83 SPDLKPLMGGTYFP------------------------PEDKYGRPG-------FKTILR 111
+PDL P+ GGTY+P PE G F I +
Sbjct: 186 TPDLYPIFGGTYWPGPGTEHSLAAARGGASGVVGGAATPEASSINGGGEESYNDFLAIAK 245
Query: 112 KVKDAWDKKRDM--------------LAQSGAFA---IEQLSEALSASASSNKLPDELPQ 154
KV W ++ + AQ G F+ E + +A+ + +L
Sbjct: 246 KVHKFWVEQEERCRREAFEMLHKLQDFAQEGTFSGTPAEPVPVVAPVAAADVEAGADLDL 305
Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKSGEASEGQK 211
+ L +++ K +D GFG+ PKFP P + +L + +++ D E
Sbjct: 306 DQLDEALDRIFKMFDPVDCGFGT-PKFPNPARLSFLLRLAQFPREVRDVVGDKEVENAAS 364
Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF------ 265
M TL+ + GG+ DHVG GF R+SV W +PHFEKM+ + L VYLDA+
Sbjct: 365 MARSTLRRIRDGGLRDHVGAGFMRFSVTSDWSMPHFEKMVGENALLLGVYLDAWLGRVQS 424
Query: 266 -----SLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYV 319
L+ + ++ + D+ DYL +I GG ++E ADS +G +EGA+Y+
Sbjct: 425 SAAETRLSLEDEFANVVIDLADYLTSPLIQSSGGGFITSEAADSFYRKGDRHMREGAYYL 484
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL--NDSSASAS 377
WT +E +D++G Y + ++ R DPH+EF +NVL + D A +
Sbjct: 485 WTRREFDDVVGPAGSAEVAAAYWNVLEDGNIPRDQDPHDEFINQNVLCSVWGKDIQALSK 544
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ G+P+ ++ + R +L R + RPRP D+KV+V NG+VIS+ AR + +++
Sbjct: 545 QFGIPVNDVKKMIAKARERLRAHREQERPRPARDEKVVVGVNGMVISALARTAAVVRD-- 602
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE---QTHRLQHSFR-NGPSKAPGFLD 492
+ + ++Y+E A+ AA+FI+ +L+ + Q+ ++ F N PS F D
Sbjct: 603 --------LDKTKSQKYLEAAQRAATFIKENLWVQDGTQSRKVLKRFWFNQPSDTRAFAD 654
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG------------GGYFNTT 540
DYAFLI GLLDLYE KWLVWA ELQ+ Q ELF D GG+++T
Sbjct: 655 DYAFLIEGLLDLYEATLEVKWLVWAKELQDVQSELFYDTPAVGSTPSLRHSYTGGFYSTE 714
Query: 541 GEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
S +LR+K D ++PS N+VS NL RL +I+ + + RQ E ++ FE +
Sbjct: 715 EATLSHTILRLKSGMDKSQPSTNAVSASNLFRLGTIL--DEKPFIRQAIE-TINAFEAEI 771
>gi|350297081|gb|EGZ78058.1| hypothetical protein NEUTE2DRAFT_101642 [Neurospora tetrasperma
FGSC 2509]
Length = 827
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 208/660 (31%), Positives = 324/660 (49%), Gaps = 97/660 (14%)
Query: 23 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
H + + SF + VA LN F+ + +DR+ERPD+D +Y Y +A+ GGWPL++FL
Sbjct: 126 HIGFLADHHSFANNAVAAFLNSSFIPVIIDRDERPDLDTIYQNYSEAVNATGGWPLNLFL 185
Query: 83 SPDLKPLMGGTYFP------------------------PEDKYGRPG-------FKTILR 111
+PDL P+ GGTY+P PE G F I +
Sbjct: 186 TPDLYPIFGGTYWPGPGTEHSLAAARGGASGVGGGAATPEVSSINGGGEESYNDFLAIAK 245
Query: 112 KVKDAWDKKRDM--------------LAQSGAFA---IEQLSEALSASASSNKLPDELPQ 154
K+ W ++ + AQ G F+ E + +A+ + +L
Sbjct: 246 KIHKFWVEQEERCRREAFEMLHKLQDFAQEGTFSGTPAEPVPVVAPVAAADVEAGADLDL 305
Query: 155 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKSGEASEGQK 211
+ L +++ K +D GFG+ PKFP P + +L + +++ D E
Sbjct: 306 DQLDEALDRIFKMFDPVDCGFGT-PKFPNPARLSFLLRLAQFPREVRDVVGDKEVENAAS 364
Query: 212 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF------ 265
M TL+ + GG+ DHVG GF R+SV W +PHFEKM+ + L VYLDA+
Sbjct: 365 MARSTLRRIRDGGLRDHVGAGFMRFSVTSDWSMPHFEKMVGENALLLGVYLDAWLGRVQS 424
Query: 266 -----SLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYV 319
L+ + ++ + D+ DYL +I GG ++E ADS +G +EGA+Y+
Sbjct: 425 SAAETRLSLEDEFADVVIDLADYLTSPLIQSSGGGFITSEAADSFYRKGDRHMREGAYYL 484
Query: 320 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL--NDSSASAS 377
WT +E +D++G Y + ++ R DPH+EF +NVL + D A +
Sbjct: 485 WTRREFDDVVGPAGSAEVAAAYWNVLEDGNIPRDQDPHDEFINQNVLCSVWGKDIQALSK 544
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
+ G+P+ ++ + R +L R + RPRP D+KV+V NG+VIS+ AR + +++
Sbjct: 545 QFGIPVNDVKKMIAKARERLRAHREQERPRPARDEKVVVGVNGMVISALARTAAVVRD-- 602
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR----LQHSFRNGPSKAPGFLD 492
+ + ++Y+E A+ AA+FI+ +L+ + R L+ + N PS F D
Sbjct: 603 --------LDKTKSQKYLEAAQHAATFIKENLWVQDGTRSRKVLKRFWFNQPSDTRAFAD 654
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG------------GGYFNTT 540
DYAFLI GLLDLYE KWLVWA ELQ+ Q ELF D GG+++T
Sbjct: 655 DYAFLIEGLLDLYEATLEVKWLVWAKELQDVQSELFYDTPAVGSTPSLRHSYTGGFYSTE 714
Query: 541 GEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
S +LR+K D ++PS N+VS NL RL +I+ + + RQ E ++ FE +
Sbjct: 715 EATLSHTILRLKSGMDKSQPSTNAVSASNLFRLGTIL--DEKPFIRQAIE-TINAFEAEI 771
>gi|300864691|ref|ZP_07109547.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300337297|emb|CBN54695.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 694
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 213/633 (33%), Positives = 312/633 (49%), Gaps = 93/633 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F + +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMENEAFSNAAIAEYMNAHFIPIKVDREERPDLDSIYMQALQMMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL P D P GGTYFP +YGRPGF +L ++ +D ++ L AF E L+
Sbjct: 108 IFLDPIDRIPFYGGTYFPVYPRYGRPGFLEVLHAIRRFYDLEKGKLQ---AFKEEILAHF 164
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
++A S ++L LR E + +R G P FP MM Y L
Sbjct: 165 QQSAALSGT--EKLSGKLLRRGLETSTAIISAREYG----PSFP------MMPYSESALR 212
Query: 199 DTGKSGEA-SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ E S+ Q++ +A GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 213 GMRFNLEGKSDSQQVCTQRGLDLALGGIYDHVAGGFHRYTVDGTWTVPHFEKMLYDNGQI 272
Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ +S ++ + +++L+R+MI P G ++A+DAD+ T +EGA
Sbjct: 273 VEYLANLWSAGVREPAFERAVAGTVEWLQREMIAPAGYFYAAQDADNFTNIEETEPEEGA 332
Query: 317 FYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FYVW+ E+E++L + +E + + TGN F+ KNVL
Sbjct: 333 FYVWSYSELENLLEADEFRELQEQFTVTQTGN------------FEAKNVL-----QRRH 375
Query: 376 ASKLGMPLEKYLNILGECR-------------------RKLFDVRSKRPRPHLDDKVIVS 416
KL LE L L + R K +D + P D K+IV+
Sbjct: 376 PGKLSSTLETALAKLFKVRYGAVPESVKVFPPARNNQEAKSYDWPGRIP-AVTDTKMIVA 434
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHR 475
WN L+IS ARA+ + + EY+E+A AA+FI + + D + HR
Sbjct: 435 WNSLMISGLARATAVFH----------------KSEYLELAAKAANFILDNQWIDGRFHR 478
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG---TK----------WLVWAIELQN 522
L + +G S +DYA + LLDL++ G TK WL A+++Q
Sbjct: 479 LNY---DGKSAVMAQSEDYALFLKALLDLHQVSEGWLETKPDSFNLKPEVWLEKAVKIQE 535
Query: 523 TQDELFLDREGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 581
DE E GGY+NT + + +L+R + D A P+ N V++ NLVRL + +
Sbjct: 536 EFDEFLWSIEVGGYYNTASDASADLLVRERSYTDNATPAANGVAIANLVRLTLLTEDLQ- 594
Query: 582 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
Y AE L F + ++D A P + A D
Sbjct: 595 --YLDRAEQGLQAFSSVMQDSPQACPSLFAALD 625
>gi|428770863|ref|YP_007162653.1| hypothetical protein Cyan10605_2528 [Cyanobacterium aponinum PCC
10605]
gi|428685142|gb|AFZ54609.1| protein of unknown function DUF255 [Cyanobacterium aponinum PCC
10605]
Length = 676
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 228/712 (32%), Positives = 343/712 (48%), Gaps = 115/712 (16%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWC VME E+F D +A LND F+SIKVDREERPD+D +YMT +Q + G GGWPL++
Sbjct: 49 SCHWCTVMEGEAFSDGAIASYLNDNFISIKVDREERPDIDSIYMTALQMMTGQGGWPLNI 108
Query: 81 FLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-SEA 138
FLSP DL P GGTYFP E +YGRPGF IL+ ++D + K D ++ L + +
Sbjct: 109 FLSPDDLVPFYGGTYFPIEPRYGRPGFLQILQALRDFYHDKSDKFISLKNEIVKGLETNS 168
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
S N+L EL Q + ++ ++++ +GS P+FP MM Y + L+
Sbjct: 169 NIIFTSENQLTPELLQQGIANNSKVIARN------DYGS-PRFP------MMPYSNITLQ 215
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ-- 256
K + + + + GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 216 GGVKDKNYRD---LAIRRALDLVNGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGLIM 272
Query: 257 --LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
LAN++ + +++ C I D+L+R+M G ++A+DAD+ +E
Sbjct: 273 EFLANLWANGVEISE---IKRACEGIKDWLKREMTSEKGYFYAAQDADNFADIHHIEPEE 329
Query: 315 GAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
G FYVW+ +++++IL E F + + + GN F+ KNVL + D S
Sbjct: 330 GEFYVWSYQQLKEILSAEEFNAFIDTFIISEDGN------------FESKNVLQKREDKS 377
Query: 374 ASASKLGMPLEKYLNI-LGECRRKL--------------FDVRSKRPRPHLDDKVIVSWN 418
+ + L+K + GE R L F + P P D K+I++WN
Sbjct: 378 IN-EIINNALDKLFKVRYGEERNSLEKFSPAKNNQEAKTFQWLGRIP-PVTDTKMILAWN 435
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQ 477
L+IS A A + + + Y+++AE A FI H ++ + HRL
Sbjct: 436 SLMISGLATAYGVFQDVS----------------YLDLAEKATEFILNHQWENGRLHRLN 479
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK--WLVWAIELQNTQDELFLDREGGG 535
+ G +DY+ I LLDL + +L AI++Q ++ D+E GG
Sbjct: 480 YE---GNVAVFAQSEDYSLFIKALLDLAQNHPTNTGFYLDQAIKIQAEFNQFCQDKEQGG 536
Query: 536 YFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
Y+N ++ S +L+R K D A PS N +++ NLVRL K Y AE +L +
Sbjct: 537 YYNNAHDNSSDLLIREKSYIDNATPSPNGIAIANLVRLHLFTDEEK---YLDEAEKTLKL 593
Query: 595 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIH 654
F + + + P + A + ++ K++ D + L + L TVI
Sbjct: 594 FSDIMNKASTSCPSLFTALNW-------YLNRTSVKTTKDTKLQLIQKY----LPNTVIR 642
Query: 655 IDPADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISL 706
D EE SN+ +A+VC+ SC P T L
Sbjct: 643 TD----------EELPSNS-------------IAIVCRGVSCFEPATTITQL 671
>gi|145593487|ref|YP_001157784.1| hypothetical protein Strop_0929 [Salinispora tropica CNB-440]
gi|145302824|gb|ABP53406.1| protein of unknown function DUF255 [Salinispora tropica CNB-440]
Length = 699
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 191/553 (34%), Positives = 271/553 (49%), Gaps = 44/553 (7%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESF DE VA LLN+ FV+IKVDREERPDVD VYMT QA+ G GGWP++V
Sbjct: 48 ACHWCHVMAHESFADEQVAALLNEGFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTV 107
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F +PD P GTYFP +P F +L+ V AW +R + Q GA +E + A +
Sbjct: 108 FAAPDGTPFFCGTYFP------KPNFLRLLQSVTTAWQDQRSAVLQQGAAVVEAIGGAQA 161
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S L +L L A++L + YD GGFG APKFP + + +L ++ D
Sbjct: 162 VGGPSAPLTVDL----LDAAADRLGEEYDEANGGFGGAPKFPPHLNLLFLLRRYQRTGD- 216
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
++V T + MA+GG+HD + GGF RY VD +W VPHFEKMLYD L V
Sbjct: 217 ------QRSLEIVRHTAEAMARGGLHDQLAGGFARYCVDGQWAVPHFEKMLYDNALLLRV 270
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y + LT D + RD +L ++ PG SA DAD+ EG T YVW
Sbjct: 271 YTHLWRLTGDPMARRVARDTARFLADELHRPGEGFASALDADADGVEGLT-------YVW 323
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T ++ + LGE + + + P E + E SAS +L
Sbjct: 324 TPAQLVEALGEEDGRWAADLFAVTEQGSFTPHAASPPGEARSG---AEAAAQSASVLRLA 380
Query: 381 MPLEKYLNIL----GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
++ + E +L VR RP+P DDKV+ +WNGL I++ A ++ A
Sbjct: 381 RDVDDATPEVQARWQEIAHRLLVVRDARPQPARDDKVVAAWNGLAITAIAEFQQVAAGYA 440
Query: 437 ESAMFNFPVVGSDRKEYMEVA------ESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 490
E A P ++ E + + ++A R HL + R R G +A G
Sbjct: 441 EDA----PGPDANLMEGVTIVADGAMRDAAEHLARVHLVAGRLRRTSRDGRVG--EAAGV 494
Query: 491 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 550
L+DY + +++ +WL+ A +L + E F + G +++T + ++ R
Sbjct: 495 LEDYGCVAEAFCAMHQLTGEGRWLILAGQLLDVALERFAAPQ-GSFYDTADDAERLVSRP 553
Query: 551 KEDHDGAEPSGNS 563
+ D A PSG S
Sbjct: 554 ADPTDNATPSGRS 566
>gi|423129587|ref|ZP_17117262.1| hypothetical protein HMPREF9714_00662 [Myroides odoratimimus CCUG
12901]
gi|371648637|gb|EHO14125.1| hypothetical protein HMPREF9714_00662 [Myroides odoratimimus CCUG
12901]
Length = 706
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 187/559 (33%), Positives = 286/559 (51%), Gaps = 48/559 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ VA L+N F+SIKVDREE P +D YM +Q + GGWPL+V
Sbjct: 87 TCHWCHVMEKESFENQEVADLMNQHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNV 146
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
PD +P+ GGTYF R + L ++ + +KRD + QL E +S
Sbjct: 147 VCLPDGRPIWGGTYFK------RQNWIDSLSQLHHLYKEKRDTVLDFAT----QLQEGIS 196
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + +E N L E KS+D +GG+ APKF P +LY KK
Sbjct: 197 ILSQAPIAQEESRFNT-DLVLENWKKSFDWEYGGYTRAPKFMMPTN---LLYLQKK---- 248
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G + + + TL MA GG+ D V GGF RYSVD +WH+PHFEKMLYD QL +V
Sbjct: 249 GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSV 308
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y D + T + Y + ++++ + G +SA DADS ++ + +EGAFY+W
Sbjct: 309 YADGYKRTHNKLYKEVIDKTINFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYIW 366
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T +E+++++ + LF + + G+ + +N++ VLI+ + A++
Sbjct: 367 TIEELKELVQQDFPLFSTVFNINSFGHWE-------NNQY----VLIQTRELIDIANENN 415
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+PLE N + L R+ RP+P LDDK + SWN + I+ A ++ A
Sbjct: 416 IPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA---- 471
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
Y+E A++ FI +L+ E+ L+ ++++G +K FLDDYAF I G
Sbjct: 472 ------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQG 518
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L+ L+E +++ A L + + FLD E ++ + + E D PS
Sbjct: 519 LIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETEDNVIPS 578
Query: 561 GNSVSVINLVRLASIVAGS 579
N++ INL +L + S
Sbjct: 579 SNAIMAINLYKLGLLYENS 597
>gi|386845926|ref|YP_006263939.1| Spermatogenesis-associated protein 20 [Actinoplanes sp. SE50/110]
gi|359833430|gb|AEV81871.1| Spermatogenesis-associated protein 20 [Actinoplanes sp. SE50/110]
Length = 663
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 197/561 (35%), Positives = 283/561 (50%), Gaps = 63/561 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ VA LN FV+IKVDREERPDVD VYMT QA+ G GGWP++VF
Sbjct: 50 CHWCHVMAHESFEDDAVAAQLNADFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTVF 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
+PD P GTYFP + F +L V AW +RD + + GA ++ + A +
Sbjct: 110 ATPDGDPFYCGTYFPKQQ------FTRLLTSVTAAWRDERDGVLKQGAAVVQAVGGAQAV 163
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ E+ A A++ +D +GGFG APKFP + + +L H LE TG
Sbjct: 164 GGPVAAVTAEMLAAAAAGLAQE----HDQTYGGFGGAPKFPPHMNLLFLLRH---LERTG 216
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++E ++V T + MA+GGI+D + GGF RY+VDE W VPHFEKMLYD L VY
Sbjct: 217 ----SAEALELVRHTAERMARGGIYDQLAGGFARYAVDEHWTVPHFEKMLYDNALLLRVY 272
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
+ LT DV + + ++L RD+ P G + SA DAD+ EG T Y WT
Sbjct: 273 TQLWRLTGDVPARRVADETAEFLLRDLATPAGGLASALDADTDGVEGLT-------YAWT 325
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIELNDSSASASKLG 380
E+ ++LG + DL R++ P F+ G++VL+ D A+ L
Sbjct: 326 PAELTEVLGPDDGAWA----------ADLFRVT-PDGTFEHGRSVLVLARDIDAADPAL- 373
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
++++ ++ R +L D R KRP+P DDKV+ SWNGL I++ A + S A
Sbjct: 374 --VDRWRDV----RARLLDARGKRPQPARDDKVVASWNGLAITALAEHGALTGSTASREA 427
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLIS 499
A RHL D RL+ R+G P G L+DY +
Sbjct: 428 AV---------------ALAGVLADRHLID---GRLRRVSRDGVVGDPAGVLEDYGCVAE 469
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 559
L +++ + +W A L + F GG+++T + ++ R + D A P
Sbjct: 470 AFLAVHQITADPRWSRLAGRLLDVALARF-GTGSGGFYDTADDAEKLVTRPADPTDNATP 528
Query: 560 SGNSVSVINLVRLASIVAGSK 580
SG + LV A++ ++
Sbjct: 529 SGLAAVCAALVTYAALTGETR 549
>gi|443651764|ref|ZP_21130697.1| hypothetical protein C789_1237 [Microcystis aeruginosa DIANCHI905]
gi|159027460|emb|CAO89425.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334405|gb|ELS48917.1| hypothetical protein C789_1237 [Microcystis aeruginosa DIANCHI905]
Length = 692
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 212/623 (34%), Positives = 309/623 (49%), Gaps = 80/623 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ ++++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYEEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
L SA + L +L + + + P FP + L S+
Sbjct: 164 LRQSAILPRAETNLADPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
ED+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 224 DFEDSLRQAAHQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S ++ + + +++L+R+M P G ++A+DADS E +E
Sbjct: 276 QIVEYLANLWSAGDQEAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335
Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ E+ D L + L + ++ + GN F+G+NVL
Sbjct: 336 GAFYVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378
Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
+LG +E L+ L G + +L R D K+IV
Sbjct: 379 RQGGELGKEIENILDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
+WN L+IS ARA A+F P+ Y ++A AA FI +H + D +
Sbjct: 439 AWNSLMISGLARA---------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQ 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
RL + G + +D+A+ I LLDL T WL AI+LQ D F +
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAEDE 539
Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
GGYFN T D S+ L V+E D A PS N +++ NLVRL+ + + Y AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKA 595
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F T L+ A P + A D
Sbjct: 596 LQSFSTILEQSPTACPSLFVALD 618
>gi|159036527|ref|YP_001535780.1| hypothetical protein Sare_0871 [Salinispora arenicola CNS-205]
gi|157915362|gb|ABV96789.1| protein of unknown function DUF255 [Salinispora arenicola CNS-205]
Length = 699
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 192/555 (34%), Positives = 276/555 (49%), Gaps = 46/555 (8%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESF DE V LLN+ FV+IKVDREERPDVD VYMT QA+ G GGWP++
Sbjct: 47 SACHWCHVMAHESFADEQVGALLNENFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF +PD P GTYFP +P F +L+ V AW +R + + GA +E + A
Sbjct: 107 VFATPDGTPFFCGTYFP------KPNFLRLLQSVAAAWRDQRAAVLRQGAAVVEAIGGAQ 160
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ S L EL L A++L++ YD GGFG APKFP + + +L ++ +
Sbjct: 161 AVGGPSAPLTAEL----LDAAADRLAEEYDETNGGFGGAPKFPPHLNLLFLL---RQYQR 213
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 259
TG A +++ T + MA+GG+HD + GGF RYSVD RW VPHFEKMLYD L
Sbjct: 214 TG----AQRSLEIIRHTCEAMARGGLHDQLAGGFARYSVDGRWAVPHFEKMLYDNALLLR 269
Query: 260 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 319
VY + LT D + RD +L ++ PG SA DAD+ EG T YV
Sbjct: 270 VYTHLWRLTGDQLARRVARDTARFLADELHRPGEGFASALDADTDGVEGLT-------YV 322
Query: 320 WTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
WT ++ + LGE + + + G+ + P + D S +
Sbjct: 323 WTPAQLVEALGEEDGRWAADLFDVTEEGSFTPHAAAPPGEALTAADA----TDQPTSVLR 378
Query: 379 LGMPLE----KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 434
L ++ + E +L VR RP+P DDKV+ +WNGL I++ A ++
Sbjct: 379 LARDVDDAAPEVRTRWQEVAHRLLVVRDARPQPARDDKVVAAWNGLAITAIAEFQQVAAG 438
Query: 435 EAESAMFNFPVVGSDRKEYMEVA------ESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 488
AE A P ++ E + + ++A + HL D + R R G +A
Sbjct: 439 YAEDA----PGQDANLMEGVTIVADGAMRDAAEHLAQVHLVDGRLRRTSRDGRVG--EAA 492
Query: 489 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 548
G L+DY + +++ +WLV A L + E F + G +++T + ++
Sbjct: 493 GVLEDYGCVAEAFCAMHQVTGEGRWLVLAGRLLDVALERFAAPD-GSFYDTADDAERLVS 551
Query: 549 RVKEDHDGAEPSGNS 563
R + D A PSG S
Sbjct: 552 RPADPTDNATPSGRS 566
>gi|373108743|ref|ZP_09523024.1| hypothetical protein HMPREF9712_00617 [Myroides odoratimimus CCUG
10230]
gi|371645988|gb|EHO11505.1| hypothetical protein HMPREF9712_00617 [Myroides odoratimimus CCUG
10230]
Length = 681
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 189/559 (33%), Positives = 288/559 (51%), Gaps = 48/559 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ VA L+N F+SIKVDREE P +D YM +Q + GGWPL+V
Sbjct: 62 TCHWCHVMEKESFENQEVADLMNQHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNV 121
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
PD +P+ GGTYF R + L ++ + +KRD + FA QL E +S
Sbjct: 122 VCLPDGRPIWGGTYFK------RQNWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGIS 171
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + +E N L E KS+D +GG+ APKF P +LY KK
Sbjct: 172 ILSQAPIAQEESRFNT-DLVLENWKKSFDWEYGGYTRAPKFMMPTN---LLYLQKK---- 223
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G + + + TL MA GG+ D V GGF RYSVD +WH+PHFEKMLYD QL +V
Sbjct: 224 GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSV 283
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y D + T + Y + ++++ + G +SA DADS ++ + +EGAFY+W
Sbjct: 284 YADGYKRTHNKLYKEVIDKTINFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYIW 341
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T +E+++++ + LF + + G+ + +N++ VLI+ + A++
Sbjct: 342 TIEELKELVQQDFPLFSTVFNINSFGHWE-------NNQY----VLIQTRELIDIANENN 390
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+PLE N + L R+ RP+P LDDK + SWN + I+ A ++ A
Sbjct: 391 IPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA---- 446
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
Y+E A++ FI +L+ E+ L+ ++++G +K FLDDYAF I G
Sbjct: 447 ------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQG 493
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L+ L+E +++ A L + + FLD E ++ + + E D PS
Sbjct: 494 LIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETEDNVIPS 553
Query: 561 GNSVSVINLVRLASIVAGS 579
N++ INL +L + S
Sbjct: 554 SNAIMAINLYKLGLLYENS 572
>gi|150026141|ref|YP_001296967.1| hypothetical protein FP2103 [Flavobacterium psychrophilum JIP02/86]
gi|149772682|emb|CAL44165.1| Protein of unknown function YyaL [Flavobacterium psychrophilum
JIP02/86]
Length = 686
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 191/570 (33%), Positives = 284/570 (49%), Gaps = 54/570 (9%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVME ESFE++ VA ++N F+SIKVDREERPDVD +YM VQ + GGWPL+V
Sbjct: 63 CHWCHVMEHESFENQEVASVMNLNFISIKVDREERPDVDAIYMKAVQMMTNRGGWPLNVV 122
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLSEA 138
PD +P+ GGTYF E+ + L+++ + + +K AQ I+ L
Sbjct: 123 CLPDGRPIWGGTYFQKEE------WTNTLQQLHELYVSNPQKIIKYAQKLHQGIQVLGTI 176
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+A ++ N ++ E+ SKS+D +GG+ APKF P L+
Sbjct: 177 QHHTAQ-----EQNHTNNIKPLVEKWSKSFDWEYGGYARAPKFMMPNNYLF-------LQ 224
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G ++ E V TL MA GGI D + GGF RYSVD RWH+PHFEKMLYD GQL
Sbjct: 225 RYGYQTKSQELLNFVDLTLTKMAHGGIFDTIAGGFSRYSVDIRWHIPHFEKMLYDNGQLV 284
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
++Y A+ T++ Y + L ++ R+ + ++A DADS +EGAFY
Sbjct: 285 SLYAQAYKRTQNPLYKEVIEKTLTFVEREFLNSDNGFYAALDADSLNQNNEL--EEGAFY 342
Query: 319 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
VWT E+++IL +F Y + G + D H VLI+ S + ASK
Sbjct: 343 VWTKTELQEILKNDFEIFSHLYNVNDFGFWE----HDNH-------VLIQNQPSKSIASK 391
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
G+ + N + LF R KRP+P LDDK + SWN +++ + A L ++
Sbjct: 392 FGLTENELQNKRKNWEQLLFTKREKRPKPRLDDKSLTSWNAIMLKGYTDAYNALGNQ--- 448
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+Y+ +AE A FI + + L S++ S GFL+DYAF I
Sbjct: 449 -------------KYLAIAEKNAQFITTKQWSAEGF-LYRSYKKNKSTIEGFLEDYAFTI 494
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
+ LY+ K+L A +L + + F + + + + + ++ + E D
Sbjct: 495 DAFISLYQATLNEKYLQQAKQLTDYCFDNFYNEKQHFFAFNSRKSAQLIAQHFETEDNVM 554
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNA 588
P+ NSV NL L + + ++YY + A
Sbjct: 555 PASNSVMANNLYVLGLLFS---NNYYEKIA 581
>gi|367034245|ref|XP_003666405.1| hypothetical protein MYCTH_2311055 [Myceliophthora thermophila ATCC
42464]
gi|347013677|gb|AEO61160.1| hypothetical protein MYCTH_2311055 [Myceliophthora thermophila ATCC
42464]
Length = 827
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 218/681 (32%), Positives = 333/681 (48%), Gaps = 114/681 (16%)
Query: 16 HFLINTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 75
H H+CH+ +SF + VA LLN+ F+ I VDREERPD+D +Y Y +A+ GG
Sbjct: 73 HIGFQADHFCHLTTQDSFSNPSVAALLNNSFIPILVDREERPDLDTIYQNYSEAVNATGG 132
Query: 76 WPLSVFLSPDLKPLMGGTYFP-PEDKY--------------------------GRPG--- 105
WPL++FL+PDL P+ GGTY+P P ++ G G
Sbjct: 133 WPLNLFLTPDLYPIFGGTYWPGPGTEHSSAAASAAGGGGGGGGGGSGTGAISRGSAGEES 192
Query: 106 ---FKTILRKVKDAWDKKRDM--------------LAQSGAF---AIEQLSEALSASASS 145
F I +K+ W ++ + AQ G F A +S ASA +
Sbjct: 193 YSDFLGIAKKIHKFWVEQEERCRREAFEMLHKLQDFAQEGTFGAGATLPVSATPVASAGA 252
Query: 146 NKLP-----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KL 197
P +L + L +++K +D GFG+ PKFP P + +L ++ ++
Sbjct: 253 GPAPVSVDPGDLDLDQLDEALARITKMFDPVDYGFGT-PKFPNPARLSFLLRLAQFPGEV 311
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
D E +M L TL+ + G + DHVG GF R+SV W +PHFEKM+ + L
Sbjct: 312 RDVIGDEEVENAVRMALGTLRRIRDGALRDHVGAGFMRFSVTSNWSMPHFEKMVGENALL 371
Query: 258 ANVYLDAF-SLTKDVF--------YSYICRDILDYLRRDMIGPG-GEIFSAEDADSAETE 307
V+LDA+ L +D ++ + ++ DYL ++ G S+E ADS +
Sbjct: 372 LGVFLDAWLGLPRDAGKGPALDDEFADVVLELADYLTSPIVRVAEGGFVSSEAADSFYRK 431
Query: 308 GATRKKEGAFYVWTSKEVEDILG-----EHAILFKEHYY-LKPTGNCDLSRMSDPHNEFK 361
G +EGAFY WT +E + ++G +HA Y+ ++ GN +++ DP +EF
Sbjct: 432 GDRHMREGAFYTWTRREFDQVVGGGSSDDHASTVAAAYWDVQEDGN--VAQEQDPFDEFI 489
Query: 362 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGL 420
+N+L ++ + +LG+P + +++ R KL R K RPRP D+K++VS NG+
Sbjct: 490 NQNILSVKASAAELSKQLGIPPSEIKHLVSVAREKLRAHREKERPRPPRDEKIVVSTNGM 549
Query: 421 VISSFARASKILKS-EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR---L 476
VIS+ +R + L+S E E A DR Y++ A AA+FI+ +L+D + L
Sbjct: 550 VISALSRTAAALRSLEGERA---------DR--YLQAARDAAAFIKENLWDGANSKGNPL 598
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD------ 530
F PS+ F DDYAFLI GLLDLY +W+ WA +LQ+ Q LF D
Sbjct: 599 HRFFWERPSQVLAFADDYAFLIDGLLDLYNATLEQEWVDWARQLQDAQTNLFYDAPLTGP 658
Query: 531 -----------REGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 578
GG+++T E S +LR+K D ++PS N+VS NL RL +++
Sbjct: 659 VSTDTAPSPRHAHSGGFYSTESETLSPTILRLKSGMDKSQPSTNAVSASNLFRLGTLLG- 717
Query: 579 SKSDYYRQNAEHSLAVFETRL 599
D Y A ++ FE +
Sbjct: 718 --VDAYLIQARETVNAFEAEI 736
>gi|254381981|ref|ZP_04997344.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194340889|gb|EDX21855.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length = 686
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 224/703 (31%), Positives = 324/703 (46%), Gaps = 80/703 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED A +N+ FV++KVDREERPDVD VYM VQA G GGWP++
Sbjct: 47 SACHWCHVMAHESFEDGATAAYMNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMT 106
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EA 138
VFL+ D +P GTYFPPE ++G P F +L V AW + + + + + L+
Sbjct: 107 VFLTADAEPFYFGTYFPPEPRHGMPSFPQVLEGVHTAWTGRPEEVTEVARRIVGDLAGRR 166
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
++ P+EL L L++ YD+ GGFG APKFP + ++ +L H +
Sbjct: 167 PDYGKAAVPGPEELAGALL-----GLTREYDAAHGGFGGAPKFPPSMVLEFLLRHHAR-- 219
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
TG G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L
Sbjct: 220 -TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLC 274
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
VY + T + + D++ R++ G SA DADS E E + EGA+Y
Sbjct: 275 RVYAHLWRATGSELARRVALETADFMVRELRTREGGFASALDADSEEPE-TGKHVEGAYY 333
Query: 319 VWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
WT ++ ++LGE L + + G + G +VL D A
Sbjct: 334 AWTPDQLREVLGEADGELAAGCFGVTEEGTFE-----------HGTSVLRLPQDGPA--- 379
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
+ E++ +I R +L R RP P DDKV+ +WNGL I++ A
Sbjct: 380 ---VDAERFASI----RARLLAARGGRPAPGRDDKVVAAWNGLAIAALAECGAYF----- 427
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKA-PGFLDDY 494
+R + +E A AA + R +D RL + ++G + A G L+DY
Sbjct: 428 -----------ERPDLIERATEAADLLVRVHFDAAAGGPRLARTSKDGRAGANAGVLEDY 476
Query: 495 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 554
+ G L L WL +A L + +LF E G ++T + ++ R ++
Sbjct: 477 GDVAEGFLALAAVTGEGVWLEFAGFLVDLVLDLFT-AEDGSLYDTAHDAERLIRRPQDPT 535
Query: 555 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----M 609
D A PSG + + L+ S A + S +R AE +L V + VP +
Sbjct: 536 DSAAPSGWTAAAGALL---SYAAHTGSQAHRTAAERALGVVHA----LGPRVPRFIGHGL 588
Query: 610 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP--ADTEEMDFWE 667
A +L P + V +VG + + A V P AD +F
Sbjct: 589 AVAEALLDGP--REVAVVGDPDDPQWAALHRTALLGTAPGAVVAAGPPRAADGSGGEF-- 644
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+A A VC++F C+ P TDP+ L L
Sbjct: 645 ------PLLAERAPVRGLPAAYVCRHFVCARPTTDPVELAEQL 681
>gi|334338370|ref|YP_004543522.1| hypothetical protein Isova_2944 [Isoptericola variabilis 225]
gi|334108738|gb|AEG45628.1| protein of unknown function DUF255 [Isoptericola variabilis 225]
Length = 658
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 229/692 (33%), Positives = 326/692 (47%), Gaps = 88/692 (12%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ VA L D FV+IKVDREERPDVD VYM AL G GGWP++ F
Sbjct: 50 CHWCHVMAHESFEDDDVAAALADRFVAIKVDREERPDVDAVYMGATTALTGQGGWPMTCF 109
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD +P GTY+P R F +L V +AW ++RD + + GA L+EA+ A
Sbjct: 110 LTPDGEPFFAGTYYP------REHFLQVLDAVWEAWTERRDAVERQGA----ALTEAI-A 158
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
S+ PD L + AL +++ D GGFG APKFP + ++ +L H + D
Sbjct: 159 RTSARLTPDVLDEAALERSVRLVARDADPEHGGFGGAPKFPPSMTLEHLLRHHARTGD-- 216
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
++V T + MA+GGI+D + GGF RY+VD W VPHFEKMLYD QL VY
Sbjct: 217 -----PSALELVERTCEAMARGGIYDQLAGGFARYAVDAAWVVPHFEKMLYDNAQLLRVY 271
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
L + T + R+ ++LR D+ P G SA DAD+ EG T YVWT
Sbjct: 272 LHWYRATGSPLAERVVRETAEFLRADLRTPEGGFASALDADTDGVEGLT-------YVWT 324
Query: 322 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDS-SASASKL 379
++++ D+LG P + + VL + L + S L
Sbjct: 325 AEQLADVLG-------------------------PADGARAAEVLSVTLEGTFEHGTSTL 359
Query: 380 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 439
+ + R +L + R+ RP+P DDKV+ +WNGL I++ A A ++L
Sbjct: 360 QLREDPDPEWWTGVRARLAEARAGRPQPARDDKVVTAWNGLAIAALAEAGELL------- 412
Query: 440 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLI 498
P D ++ ++ +R H+ D RL+ + R G APG D+ L
Sbjct: 413 --GVPGYVDDARDCADL------LLRLHVVD---GRLRRASRGGVVGTAPGVAADHGDLA 461
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
GLL L++ T+WL A EL E F D GG+++ + ++ R K+ DG E
Sbjct: 462 EGLLALHQATGETRWLDAAGELLEVALERFGD-GAGGFYDVADDAERLVSRPKDPTDGPE 520
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
PSG S L A++ S+ +R+ AE ++A T K + A+ L+
Sbjct: 521 PSGQSSLAGALATYAALTGSSR---HREAAEAAVAAAGTLAKQVPRFAGWTLAVAEALAA 577
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
V +VG AA +S V+ + DT + +A
Sbjct: 578 -GPLQVAVVGPDDGARLALERAARASSS--PGLVLAVGEPDTPGVPL----------LAD 624
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ A VC+ F C PVT LE L
Sbjct: 625 RPLVDGRPAAYVCRGFVCDRPVTTVEELERAL 656
>gi|423328847|ref|ZP_17306654.1| hypothetical protein HMPREF9711_02228 [Myroides odoratimimus CCUG
3837]
gi|404604409|gb|EKB04043.1| hypothetical protein HMPREF9711_02228 [Myroides odoratimimus CCUG
3837]
Length = 667
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 188/559 (33%), Positives = 286/559 (51%), Gaps = 48/559 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
TCHWCHVME ESFE++ VA ++N F+SIKVDREE P +D YM +Q + GGWPL+V
Sbjct: 48 TCHWCHVMEKESFENQEVADIMNQHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNV 107
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
PD +P+ GGTYF E + L ++ + +KRD + FA QL E +S
Sbjct: 108 VCLPDGRPIWGGTYFKKE------AWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGIS 157
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
S + E + L E KS+D +GG+ PKF P +LY KK
Sbjct: 158 I-LSQAPIAQEDSRFNTELVLENWKKSFDWEYGGYTRTPKFMMPTN---LLYLQKK---- 209
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G + + + TL MA GG+ D V GGF RYSVD +WH+PHFEKMLYD QL +V
Sbjct: 210 GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSV 269
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y D + T + Y + +D++ + G +SA DADS ++ + +EGAFY+W
Sbjct: 270 YADGYKRTHNKLYKEVIDKTIDFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYIW 327
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T +E+++++ + LF + + G+ + +N++ VLI+ + A++
Sbjct: 328 TIEELKELVQQDFPLFSTVFNINSFGHWE-------NNQY----VLIQTRELIDIANENN 376
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
+PLE N + L R+ RP+P LDDK + SWN + I+ A ++ A
Sbjct: 377 IPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA---- 432
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 500
Y+E A++ FI +L+ E+ L+ ++++G +K FLDDYAF I G
Sbjct: 433 ------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQG 479
Query: 501 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 560
L+ L+E +++ A L + + FLD E ++ + + E D PS
Sbjct: 480 LIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETEDNVIPS 539
Query: 561 GNSVSVINLVRLASIVAGS 579
N++ INL +L + S
Sbjct: 540 SNAIMAINLYKLGLLYENS 558
>gi|425450832|ref|ZP_18830655.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 7941]
gi|389768138|emb|CCI06653.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 7941]
Length = 692
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 210/620 (33%), Positives = 311/620 (50%), Gaps = 74/620 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ + ++++ L++ A + L ++
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYGEEKEKLSKFTAEMLGALRQS 167
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
+ L D + L E + +G P FP + L S+ +
Sbjct: 168 AILPRAETNLADP---SLLATGIETNTAVIQVNPNNYGR-PSFPMIPYSHLALQGSRFGD 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D S + + Q+ + +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 224 DFDDSLQQAAYQRG-----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIV 278
Query: 259 NVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ +S ++ + + +++L+R+M P G ++A+DADS E +EGAF
Sbjct: 279 EYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKARDREPEEGAF 338
Query: 318 YVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW+ E+ D L + L + ++ + GN F+G+NVL
Sbjct: 339 YVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQG 381
Query: 377 SKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWN 418
+LG +E L+ L G + +L R D K+IV+WN
Sbjct: 382 GELGKEIENILDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWN 441
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQ 477
L+IS ARA A+F+ P+ Y ++A AA FI +H + D + RL
Sbjct: 442 SLMISGLARA---------FAVFSEPL-------YWQMATQAAEFILQHQWLDGRFQRLN 485
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGY 536
+ G + +D+A+ I LLDL T WL AI+LQ D F + GGY
Sbjct: 486 Y---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWSEDEGGY 542
Query: 537 FNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
FN T D S+ L V+E D A PS N +++ NLVRL+ + + Y AE +L
Sbjct: 543 FN-TASDHSLDLIVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKALQS 598
Query: 595 FETRLKDMAMAVPLMCCAAD 614
F T L+ A P + A D
Sbjct: 599 FSTILEQSPTACPSLFVALD 618
>gi|294814700|ref|ZP_06773343.1| DUF255 domain-containing protein [Streptomyces clavuligerus ATCC
27064]
gi|326443082|ref|ZP_08217816.1| hypothetical protein SclaA2_18553 [Streptomyces clavuligerus ATCC
27064]
gi|294327299|gb|EFG08942.1| DUF255 domain-containing protein [Streptomyces clavuligerus ATCC
27064]
Length = 675
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 226/695 (32%), Positives = 327/695 (47%), Gaps = 81/695 (11%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFED A LN+ FVS+KVDREERPDVD VYM VQA G GGWP++V
Sbjct: 49 SCHWCHVMAHESFEDGATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F++ + +P GTYFPPE ++G P F+ +L V AW +RD + + A L+ S
Sbjct: 109 FMTAEGEPFYFGTYFPPEPRHGMPSFRQVLEGVTAAWTGRRDEVDEVAARIRRDLA-GRS 167
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+ + +P Q + LS+ YD R GGFG APKFP + ++ +L H + T
Sbjct: 168 LAHGGDGVPGAEEQARALIG---LSREYDERHGGFGGAPKFPPSMVLEFLLRHHAR---T 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G +M T + MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L V
Sbjct: 222 GSEA----ALQMAAETAEAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRV 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
Y + LT + + D++ R++ G SA DADS +G + EGAFYVW
Sbjct: 278 YARLWRLTGAPLARRVALETADFMVRELRTAEGGFASALDADSTGADGV--RAEGAFYVW 335
Query: 321 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
T ++ ++LGE +L ++D G +VL D
Sbjct: 336 TPAQLTEVLGEE----------DGRRAAELYGVTDEGTFEHGTSVLRLPGDDPGPG---- 381
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
R++L R R RP DDKV+ +WNGL I++ A
Sbjct: 382 ------------IRQRLLASRELRERPERDDKVVAAWNGLAIAALAETGAYF-------- 421
Query: 441 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLIS 499
DR + +E A AA + R L+ + + RL + R+G + G L+DY +
Sbjct: 422 --------DRPDLVERATEAADLLVR-LHLDGSARLTRTSRDGRAGRNAGVLEDYGDVAE 472
Query: 500 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKEDHDG 556
G L L WL +A L + + LDR E G ++T + ++ R ++ D
Sbjct: 473 GFLALASVTGEGVWLEFAGLLLD----IVLDRFTGENGTLYDTAHDAEQLIRRPQDPTDN 528
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSG + + L+ S A + S+ +R AE +L V + + AA+ L
Sbjct: 529 AAPSGWTAAAGALL---SYAAHTGSEAHRTAAERALGVVKALGPRAPRFIGWGLAAAEAL 585
Query: 617 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASM 676
+ + V +VG D E+ A+ +L++T + + +
Sbjct: 586 -LDGPREVAVVG-----DPED-----PAARELHRTALLAPAPGAVVAA--GAPGGDEFPL 632
Query: 677 ARNNFSAD-KVVALVCQNFSCSPPVTDPISLENLL 710
R+ D + A VC+ F C PVT P +L L
Sbjct: 633 LRDRDLVDGRAAAYVCRGFVCRRPVTGPSALAEEL 667
>gi|425439757|ref|ZP_18820072.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9717]
gi|389719932|emb|CCH96294.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9717]
Length = 692
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 211/623 (33%), Positives = 310/623 (49%), Gaps = 80/623 (12%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDRAIADYLNHYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK--- 195
L SA + L L + + + P FP + L S+
Sbjct: 164 LRQSAILPRAETNLAAPYLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
+D+ + G+ + L GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 224 DFDDSLRQAAYQRGEDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
Q+ + +S ++ + + +++L+R+M P G ++A+DADS E +E
Sbjct: 276 QIVEYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEE 335
Query: 315 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW+ E+ D L + L + ++ + GN F+G+NVL
Sbjct: 336 GAFYVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QR 378
Query: 374 ASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIV 415
+LG +E L+ L G + +L R D K+IV
Sbjct: 379 RQGGELGEEIENMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIV 438
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTH 474
+WN L+IS ARA A+F+ P+ Y ++A AA FI +H + D +
Sbjct: 439 AWNSLMISGLARA---------FAVFSEPL-------YWQMATQAAEFILKHQWLDGRFQ 482
Query: 475 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREG 533
RL + G + +D+A+ I LLDL T WL AI+LQ D F +
Sbjct: 483 RLNY---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDE 539
Query: 534 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
GGYFN T D S+ L V+E D A PS N +++ NL+RL+ + + Y AE +
Sbjct: 540 GGYFN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKA 595
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F T L++ A P + A D
Sbjct: 596 LQSFSTILEESPTACPSLFVALD 618
>gi|380805071|gb|AFE74411.1| spermatogenesis-associated protein 20 precursor, partial [Macaca
mulatta]
Length = 397
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 165/420 (39%), Positives = 238/420 (56%), Gaps = 43/420 (10%)
Query: 79 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+V+L+P+L+P +GGTYFPPED R GF+T+L ++++ W + ++ L ++ ++++ A
Sbjct: 1 NVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTA 56
Query: 139 LSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH-- 193
L A + + +LP +A + C +QL + YD +GGF APKFP PV + + +
Sbjct: 57 LLARSEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWL 116
Query: 194 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
S +L G S Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYD
Sbjct: 117 SHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYD 171
Query: 254 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q QLA Y AF ++ D FYS + + IL Y+ R + G +SAEDADS G R K
Sbjct: 172 QAQLAVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPK 230
Query: 314 EGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGK 363
EGA+YVWT KEV+ +L E + L +HY L GN S+ DP E +G+
Sbjct: 231 EGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGNISPSQ--DPKGELQGQ 288
Query: 364 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 423
NVL +A++ G+ +E +L KLF R RP+PHLD K++ +WNGL++S
Sbjct: 289 NVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVS 348
Query: 424 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 483
+A +L G DR + A + A F++RH++D + RL + G
Sbjct: 349 GYAVTGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTG 392
>gi|408395590|gb|EKJ74769.1| hypothetical protein FPSE_05104 [Fusarium pseudograminearum CS3096]
Length = 717
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 196/607 (32%), Positives = 310/607 (51%), Gaps = 67/607 (11%)
Query: 23 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
H C +M +E+F + A +LN+ FV + VDREERPD++ VYM Y QA++ GGWPL+VFL
Sbjct: 84 HSCRLMSIETFSNPESAAVLNESFVPVIVDREERPDIEAVYMNYAQAVHKVGGWPLNVFL 143
Query: 83 SPDLKPLMGGTYFP-PEDKYGRPGFK--------TILRKVKDAWDKKR--------DMLA 125
+P+L+P+ GGTY+ P + G TIL K++D W+ + +++A
Sbjct: 144 TPNLEPVFGGTYWVGPAGRRRHNGDSTDEVLDSLTILNKMRDTWNDQEARCRKEATEIVA 203
Query: 126 QSGAFAIEQLSEALSASASSNKLP-----------------------DELPQNALRLCAE 162
Q FA E S +A S P EL + L +
Sbjct: 204 QLKEFAAEGTLGTRSITAPSALGPLAGWGAPAPSNPSTTENRTMIVSQELDLDQLEVAYR 263
Query: 163 QLSKSYDSRFGGFGSAPKFPRPVEIQM---MLYHSKKLEDTGKSGEASEGQKMVLFTLQC 219
++ ++D GGFG APK+ P ++ +L ++D E K+ L+TL+
Sbjct: 264 NIAGTFDPVHGGFGLAPKYMIPPKLTFLLGLLTAPGPVQDVVGYDECRHATKIALYTLRQ 323
Query: 220 MAKGGIHDHVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT---KDVFYSY 275
+ G +HDH+G GF SV W +P+FEK++ D QL ++Y+DA+ + + +
Sbjct: 324 IRDGALHDHIGATGFSHCSVTADWSIPNFEKLVIDNAQLLSLYIDAWKASGGGEQGEFLD 383
Query: 276 ICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE--- 331
+ ++++YL + P G S+E ADS +G K+EGA+YVWT +E + +L +
Sbjct: 384 VVLELIEYLTTSPVTLPEGGFASSEAADSYYRQGDNEKREGAYYVWTWREFKSVLDDIDH 443
Query: 332 -HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 390
+ + ++ + GN + +DP+++F +N+L +S P+EK +
Sbjct: 444 HMSPILAAYWNVNKDGN--VKETNDPNDDFMNQNILCVKTTVEQLSSHFSTPVEKIREYI 501
Query: 391 GECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 449
+ + L R + R RP LDDK++ WNGLVIS+ ++A+ L++ +
Sbjct: 502 EKGKAALRKKREQERVRPELDDKIVAGWNGLVISALSKAASALRT----------LKPEQ 551
Query: 450 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 509
AE AA+ I+ L+D L ++ G F DDYA+LI GLLDL+
Sbjct: 552 SSRCKSAAERAAACIKERLWDADEKVLYRTW-CGERGHTAFADDYAYLIQGLLDLFGLTE 610
Query: 510 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 569
++L +A LQ TQ LF D + G +F T P V+LR+KE D + PS N+VSV NL
Sbjct: 611 NHQYLEFAETLQQTQISLFFD-DDGAFFTTKAHSPHVILRLKEGMDTSLPSTNAVSVANL 669
Query: 570 VRLASIV 576
RLAS++
Sbjct: 670 FRLASLL 676
>gi|374310263|ref|YP_005056693.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358752273|gb|AEU35663.1| hypothetical protein AciX8_1320 [Granulicella mallensis MP5ACTX8]
Length = 704
Score = 302 bits (773), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 214/697 (30%), Positives = 341/697 (48%), Gaps = 61/697 (8%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM+ ES+E+ A+++N+ F+++KVDR+ERPDVD Y + + G GGWPL+ F
Sbjct: 54 CHWCHVMDRESYENAATAEVINEHFIAVKVDRDERPDVDTRYQAAISTISGQGGWPLTAF 113
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA---FAIEQLSEA 138
L+P+ KP GGTYFPP+D+YGRP F+ +L + D + +RD + +S AIE+ +E+
Sbjct: 114 LTPEGKPYFGGTYFPPDDRYGRPSFQRVLLTMADVFQNRRDEVEESAGGVMLAIEE-NES 172
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
S A + P L + L Q +D + GGFGS PKFP I ++ ++
Sbjct: 173 FSVPAGNPGAP--LLDKLVALTVSQ----FDQKNGGFGSQPKFPNSGAIDLL------ID 220
Query: 199 DTGKSGE-ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ GE A + + + TLQ MA GGIHD + GGFHRYSVDERW VPHFEKM YD +L
Sbjct: 221 AASRGGELAEQARHVATVTLQKMAAGGIHDQLAGGFHRYSVDERWIVPHFEKMAYDNSEL 280
Query: 258 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
Y+ AF + ++ + +DIL ++ + F A ++ + +G +
Sbjct: 281 LKNYVHAFQSFGEPEFARVAKDILRWMDEWLSDREQGGFYA-----SQDADDSLDDDGDY 335
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
+ WT E + +L E Y+ +L + D H+ + KNVL A A
Sbjct: 336 FTWTRAEAKAVLTAEEFAVAELYF-------NLRDVGDMHHNPQ-KNVLHLGEPVEAIAR 387
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
KL L++ L KL+ R +R P++D + WNG+ ++++ A+++L
Sbjct: 388 KLNRALDEVNETLAAATGKLYAARLQRKTPYVDKTIYTGWNGMCLAAYFEAARVLDL--- 444
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
+ +F + DR + VA + H + + ++ G L+DY FL
Sbjct: 445 PEVRSFALRSLDR--VLNVAWDPVEGL--------AHVVAYGEGGSAARVAGVLEDYGFL 494
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT----GEDP--SVLLRVK 551
+ +LD +E ++ A + + F D GGG+F+T P ++ R K
Sbjct: 495 ANAVLDAWESTGELRYFTAAQAIADVMLVRFYDAAGGGFFDTERMEGAPQPIGALSTRRK 554
Query: 552 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 611
D P+GNSV+V L+RLA++ + SD Y + A+ +L F ++ +
Sbjct: 555 PLQDAPTPAGNSVAVTLLLRLAALT--NHSD-YGERAQETLEAFAGVVEHFGLYAASYGL 611
Query: 612 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNS 671
A +V S + +VG + A A + +NK+VI +D + E+
Sbjct: 612 ALRR-AVESSVQICVVGDDARARELEAAAV--AGFAVNKSVIRLDRSRFHELPAALAETL 668
Query: 672 NNASMARNNFSADKVVALVCQNFSCSPPVTDPISLEN 708
N +F A+VC+ +C PP+ L N
Sbjct: 669 PNLPQVEGSF------AVVCKGNTCLPPIQSVEELRN 699
>gi|434393621|ref|YP_007128568.1| hypothetical protein Glo7428_2913 [Gloeocapsa sp. PCC 7428]
gi|428265462|gb|AFZ31408.1| hypothetical protein Glo7428_2913 [Gloeocapsa sp. PCC 7428]
Length = 687
Score = 302 bits (773), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 221/665 (33%), Positives = 320/665 (48%), Gaps = 119/665 (17%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDLAIADYMNAHFLPIKVDREERPDLDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAF--AIEQL 135
+F++PD L P GGTYFP E +YGRPGF +L+ ++ +D +K+D+LA+ A AI+Q
Sbjct: 108 IFIAPDDLVPFYGGTYFPVEPRYGRPGFLQVLQAIRRYYDTEKQDLLARKAAILEAIQQ- 166
Query: 136 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFG-----GFGSAPKFPRPVEIQMM 190
SA + DE + L K ++ G +G+ +FP ++
Sbjct: 167 ----SAVLPKTQQSDE----------DLLKKGIETNTGVITPHDYGT--QFPMIPYAELA 210
Query: 191 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 250
L ++ + Q+ L +A GGI+DHV GGFHRY+VD W VPHFEKM
Sbjct: 211 LRGTRFNYSAWRYDIPQVCQQRGL----DLALGGIYDHVAGGFHRYTVDPTWTVPHFEKM 266
Query: 251 LYDQGQLANVYLDAFSLTKDVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETE 307
LYD GQ+ + +S V I R I + +L+R+M P G ++A+DADS +
Sbjct: 267 LYDNGQIVEYLANLWS--NGVQEPAIERAIALTVQWLKREMTAPEGYFYAAQDADSFTSP 324
Query: 308 GATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
+EGAFYVW+ E++ IL E ++ + + GN F+G+ VL
Sbjct: 325 YEAEPEEGAFYVWSYSELQQILSSEELSALEQQFTITSQGN------------FEGQIVL 372
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVR------------------------- 401
+ S S +I + KLF VR
Sbjct: 373 QRRHPGSLS------------DITEQALSKLFTVRYGATPESLDVFPPARNNQEAKTQNW 420
Query: 402 SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAA 461
S R D K+IV+WN L+IS ARA + K + EY+E+A S+A
Sbjct: 421 SGRIPAVTDTKMIVAWNSLMISGLARAYAVFK----------------KSEYLEIALSSA 464
Query: 462 SFIRRH-LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF----GSGTKWLVW 516
FI H D + HRL + G + +DYA I LLDLY+ + WL
Sbjct: 465 RFILNHQQVDGRFHRLNY---EGQTSVIAQSEDYALFIKALLDLYQVTLKDANSQHWLEQ 521
Query: 517 AIELQNTQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASI 575
AI LQ DE E GGY+NT + +++R + D A P+ N V++ NLVRLA +
Sbjct: 522 AIALQAEFDEYLWSIELGGYYNTASDASRDLIVRERSYADNATPAANGVAIANLVRLALL 581
Query: 576 VAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF 635
++ Y AE +L F + + A P + A D ++ LV +S
Sbjct: 582 ---TEKLSYLDRAEQALQAFTSVMDSAPQACPSLFTALDWY-----RNCTLV-RTTSTTL 632
Query: 636 ENMLA 640
E +LA
Sbjct: 633 ETVLA 637
>gi|288917991|ref|ZP_06412350.1| protein of unknown function DUF255 [Frankia sp. EUN1f]
gi|288350646|gb|EFC84864.1| protein of unknown function DUF255 [Frankia sp. EUN1f]
Length = 669
Score = 301 bits (772), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 197/609 (32%), Positives = 292/609 (47%), Gaps = 54/609 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFED +A +N+ FV+IKVDREERPDVD VYM AL G GGWP++V
Sbjct: 49 SCHWCHVMAHESFEDAQIAAYMNEHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE--A 138
FL+P +P GTYFPP + G+ F +L V DAW ++R+ + ++GA +L+E A
Sbjct: 109 FLTPAAEPFFAGTYFPPRPRQGQTSFPQLLTAVSDAWTQRREEIEEAGADIARRLAEVVA 168
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L + + +L + L L+ +D+R GGFG PKFP + +++L H +
Sbjct: 169 LPGGTAGGEGGPQLGADLLDGAVAGLAGRFDARHGGFGPKPKFPPSMVAELLLRHWARTG 228
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D +MV T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD QL
Sbjct: 229 D-------DRALEMVRVTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLL 281
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAF 317
VYL + T + R+ +++L D+ P G SA DAD+ + +EGA
Sbjct: 282 RVYLHLWRATGSALAERVVRETVEFLLTDLRTPEGGFASALDADAVPAGQPNAHPEEGAS 341
Query: 318 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 377
Y WT ++ D+LG + + +++ G +VL+ D A
Sbjct: 342 YSWTPAQLADVLGPEDGAWA----------AGVLGVTEAGTFEHGTSVLMLPADPDDPAR 391
Query: 378 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 437
R L RS RP+P DDK++ +WN I
Sbjct: 392 ------------FARVRSALAAARSSRPQPARDDKIVAAWN---------GLAIAALAEA 430
Query: 438 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 497
A+ P + E+ HL+D + R R GP+ G L+DY +
Sbjct: 431 GALLAEPAWIAAATRAAELLRDV------HLHDGRLWRTSRDGRRGPNA--GVLEDYGCV 482
Query: 498 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 557
G L L++ + +WL A EL + F + GG+F+T + ++L R +E D A
Sbjct: 483 ADGYLALHQVTADPRWLTLAGELLDVVRARFAAPD-GGFFDTADDAEALLRRPRESSDSA 541
Query: 558 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMCCAADML 616
PSG + ++ A++ ++ +R A ++ + L KD A A +L
Sbjct: 542 TPSGQAAVAGAMLTFAALTGSAE---HRDAAVATVGLLMPLLAKDARYAGWAGAVAEAVL 598
Query: 617 SVPSRKHVV 625
+ P+ VV
Sbjct: 599 AGPAEVAVV 607
>gi|440682478|ref|YP_007157273.1| hypothetical protein Anacy_2941 [Anabaena cylindrica PCC 7122]
gi|428679597|gb|AFZ58363.1| hypothetical protein Anacy_2941 [Anabaena cylindrica PCC 7122]
Length = 693
Score = 301 bits (772), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 212/629 (33%), Positives = 320/629 (50%), Gaps = 86/629 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDLEIAQYMNTNFLPIKVDREERPDLDSIYMQTLQFMSGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+ DL P GTYFP + +YGRPGF +L ++ +D +++ L Q A + EA
Sbjct: 108 VFLAADDLVPFYAGTYFPVDPRYGRPGFLQVLEALRRYYDTEKEELRQRKALIV----EA 163
Query: 139 LSASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHS 194
L SA K+ + E+ N L L K +++ G S FP M+ Y
Sbjct: 164 LLTSAVMQKVTNQEVADNQL------LQKGWETCTGIITSKQVGNSFP------MIPYAE 211
Query: 195 KKLEDTGKSGEAS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 253
L T + + +GQ++ +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD
Sbjct: 212 FALRGTRFNYQFQYDGQQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYD 271
Query: 254 QGQLANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
GQ+ + +S + + F + + +L+R+M GG ++A+DADS A
Sbjct: 272 NGQIIEYLANLWSGGIQEPAFERAVAGTV-KWLQREMTAQGGYFYAAQDADSFINSTAIE 330
Query: 312 KKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---- 366
+EGAFYVW+ +E++ +L E ++ + + GN F+G+ VL
Sbjct: 331 PEEGAFYVWSYRELQQLLTTEELNELQQQFAVTANGN------------FEGQIVLQRSH 378
Query: 367 -------IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPR--PHL-DDKVIVS 416
+E+ S ++ G E N R ++ P P + D K+IV+
Sbjct: 379 PGELSQTLEIALSKLFTARYGATPESLSN-FPPARDNQEAKKTNWPGRIPAVTDTKMIVA 437
Query: 417 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHR 475
WN L+IS ARA+++ + + Y+E+A AA FI H + D + HR
Sbjct: 438 WNSLMISGLARAAEVFQ----------------QPNYLELAAQAARFILDHQFVDGRFHR 481
Query: 476 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG---------TKWLVWAIELQNTQDE 526
L + G + +DYAF I LLDL++ G + WL A+ LQ+ DE
Sbjct: 482 LNYE---GEATVLAQSEDYAFFIKALLDLHQATLGQLDHVSSQNSDWLEKAVSLQDEFDE 538
Query: 527 LFLDREGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 585
E GGYFNT+ ++ +++R + D A PS N +++ NLVRLA + + + +Y
Sbjct: 539 FLWSIELGGYFNTSSDNSQDLIVRERSYIDNATPSANGIAIANLVRLALL---TDNLHYL 595
Query: 586 QNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
AE L F+ + + A P + A D
Sbjct: 596 DLAEQGLTAFKGVMSNSPQACPSLFTALD 624
>gi|171683203|ref|XP_001906544.1| hypothetical protein [Podospora anserina S mat+]
gi|170941561|emb|CAP67213.1| unnamed protein product [Podospora anserina S mat+]
Length = 753
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 215/632 (34%), Positives = 311/632 (49%), Gaps = 71/632 (11%)
Query: 23 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 82
H CH+ ++F + VA LN+ FV I VDREERPD+D +Y Y A+ GWPL +F
Sbjct: 84 HLCHITTRDTFHNPTVAAFLNEHFVPIIVDREERPDLDAIYQNYSVAVNSISGWPLHLFF 143
Query: 83 SPDLKPLMGGTYFPPEDKYGRPG----FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE- 137
+PDL+P Y P G G TIL+ W +K + A +E L +
Sbjct: 144 TPDLEPFFANAYLPAPGTVGEDGEACDLLTILQSNHRLWVEKEQKCREEAAKELEGLEKF 203
Query: 138 --------ALSASASSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSA--PKFPRPVE 186
A + +A++ D E+ + + L +++K +D GGFG PKFP P
Sbjct: 204 VQEGALPLARAPNATATYDSDIEVDLDHVELAVSRIAKLFDPVHGGFGQPGEPKFPNPAR 263
Query: 187 IQMMLYHSKKLEDT-----GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDER 241
+ +L ++ DT G + KM L TL M G+ DH+G GF R S
Sbjct: 264 LSFLL-RLRECPDTVRDVIGGDEDVERATKMALQTLSKMKNSGLRDHIGEGFMRMSSTSD 322
Query: 242 WHVPHFEKMLYDQGQLANVYLDAF-------SLTKDVFYSYICRDILDYLRRDMIGP-GG 293
W++PHFEKM+ D L VYLDA+ LT ++ + + DYL I G
Sbjct: 323 WNMPHFEKMVGDNALLLGVYLDAWLGNRKGTQLTNQDEFADVVLGLADYLISPAIQQENG 382
Query: 294 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSR 352
S+E A S +G G FY+WT +E +++LG A Y+ ++ GN R
Sbjct: 383 GFISSEAAYSYYRKGEQHMTNGTFYLWTHREFDEVLGPEASNIAAAYWNVQEDGNVPQER 442
Query: 353 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDD 411
DP +EF +N+L N +++ G+P+E+ I+ ++KL R K R RP D
Sbjct: 443 --DPSDEFLNQNILSAGNGVHELSTQHGLPVEEIHRIIASSKKKLLAHRDKERVRPPRDT 500
Query: 412 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-- 469
K+I NG+VIS+ +R+ ++ AE+ V S EY++ AE AA FI +L+
Sbjct: 501 KIIAGVNGMVISALSRS----QAAAEA------VGHSKSAEYIKRAEKAAQFIFDNLWLN 550
Query: 470 DEQT-------HRLQHSF-RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 521
D T H++ H + NGPS+ F DDYAFLI GLLDLYE +WL WA +LQ
Sbjct: 551 DINTEGPNGGQHKVLHRYWNNGPSETLAFADDYAFLIEGLLDLYEATLSKRWLNWAQDLQ 610
Query: 522 NTQDELFLDRE-------------GGGYFNTTGED-PSVLLRVKEDHDGAEPSGNSVSVI 567
+ Q+ LF D GG+++T + S + R+K D PS N+VS
Sbjct: 611 DAQNRLFYDSPSAVNGTPSRRAAGSGGFYSTELQTISSNIPRLKSAMDILIPSVNAVSAS 670
Query: 568 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 599
NL RL SI A S+ Y+Q A ++ F+ L
Sbjct: 671 NLYRLGSIFAESR---YKQIALETIKAFDPEL 699
>gi|374850591|dbj|BAL53576.1| hypothetical conserved protein [uncultured Bacteroidetes bacterium]
Length = 676
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 194/569 (34%), Positives = 286/569 (50%), Gaps = 51/569 (8%)
Query: 2 GRRSFCGGTKTRRTHFL---INTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 58
G +F + ++ FL + CHWCHVME ESF D VA LL W++ IKVDREERPD
Sbjct: 27 GEEAFARARREQKLVFLSIGYSACHWCHVMEEESFADPEVAALLERWYIPIKVDREERPD 86
Query: 59 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 118
VD +YM+ QA+ G GGWPL+V L+P+ + + GTYFP R G +L ++ W
Sbjct: 87 VDALYMSICQAMTGQGGWPLTVILTPEREVIFAGTYFPKRSTPYRIGLIELLERIAALWQ 146
Query: 119 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 178
+ ML S +E+++ L ++ S + + + EQL K +D R+GGFG+
Sbjct: 147 QDGQMLRSSAHALMERIAPHLRSAHSGH-----ITAGTITAALEQLDKLFDRRYGGFGTR 201
Query: 179 PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV 238
PKFP + +L + ++ + TL+ M GGI DHVG GFHRYS
Sbjct: 202 PKFPMAAALWFLLIAGPR--------TSTRALDIATATLEAMRWGGIWDHVGFGFHRYST 253
Query: 239 DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA 298
DERW +PHFEKMLYDQ L VY +A +TK + +I YL R ++ G ++
Sbjct: 254 DERWFLPHFEKMLYDQALLLLVYAEAARITKRRLFEITAMEIAAYLDRTLLLEHGAFAAS 313
Query: 299 EDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPH 357
EDAD+ + EGAFY W +++ ++ H + ++L P GN P
Sbjct: 314 EDADTPD-------GEGAFYQWRYEDLRRLIPSHEFERMRAIFHLSPEGNAHDEATGQP- 365
Query: 358 NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 417
G+N+L + + G LE++L R++L VR+ R RP D+KV+ W
Sbjct: 366 ---TGRNILSAGTRTEDVLERFGGTLEEFLAWWEPLRQRLETVRNSRARPARDEKVLCDW 422
Query: 418 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRL 476
N LV+++ ARA ++L+ P + +E A S++ R H++ + T L
Sbjct: 423 NALVVAALARAGRLLRQ---------PTL-------IERARRTWSYLERVHVHADGT--L 464
Query: 477 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 536
H +G GFLDDYAF L+LY +L L ++ E F+D G G
Sbjct: 465 AHCSYSGEPAIDGFLDDYAFAAWAALELYHATGANDFLEHVEHLLHSITERFVD--GDGI 522
Query: 537 FNTTGEDPSVLLRVKEDHDGAEPSGNSVS 565
T + +L + E DGA SG ++
Sbjct: 523 VRTAAS--ADVLPLTEPSDGATVSGIGIT 549
>gi|158312686|ref|YP_001505194.1| hypothetical protein Franean1_0830 [Frankia sp. EAN1pec]
gi|158108091|gb|ABW10288.1| protein of unknown function DUF255 [Frankia sp. EAN1pec]
Length = 669
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 219/637 (34%), Positives = 303/637 (47%), Gaps = 69/637 (10%)
Query: 2 GRRSFCGGTKTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERP 57
G +F T TR L++ CHWCHVM ESFED +A +N FV+IKVDREERP
Sbjct: 27 GPEAFAEAT-TRGVPVLLSVGYAACHWCHVMAHESFEDPEIAAYMNQHFVNIKVDREERP 85
Query: 58 DVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW 117
DVD VYM AL G GGWP++VFL+P +P GTYFPP G F ++ + DAW
Sbjct: 86 DVDSVYMDVTVALTGHGGWPMTVFLTPAAEPFFAGTYFPPRPMRGSASFPQVMAAIVDAW 145
Query: 118 DKKRDMLAQSGAFAIEQLSE--ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGF 175
+R + QSGA QL+E A +AS ++ + L L+ +DS GGF
Sbjct: 146 TARRAEVEQSGADIARQLAEAVAPGGAASGGGATTQITADLLDRAVAGLADRFDSVHGGF 205
Query: 176 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 235
G APKFP + +M+L + D G MV T + MA+GG++D +GGGF R
Sbjct: 206 GGAPKFPPSMVAEMLLRSWARTGDGRALG-------MVRETCERMARGGMYDQLGGGFAR 258
Query: 236 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 295
YSVDE W VPHFEKMLYD QL VYL + T + R+ +L D+ P G
Sbjct: 259 YSVDESWTVPHFEKMLYDNAQLLRVYLHLWRATGLPLAERVVRETAAFLLADLRTPEGGF 318
Query: 296 FSAEDADS--AETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSR 352
SA DAD+ A + G +EGA Y WT ++ D+LG + L + G+ +
Sbjct: 319 ASALDADAVPAGSPGG-HPEEGASYSWTPAQLVDVLGPDDGALAARVLGVTAEGSFE--- 374
Query: 353 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 412
G +VL+ D A R L R+ RP+P DDK
Sbjct: 375 --------HGTSVLMLPADPEDPARFA------------RVRAALAAARATRPQPARDDK 414
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDE 471
++ +WNGLVI + A A +L ++ AE AA +R HL++
Sbjct: 415 IVAAWNGLVIGALAEAGALLGE----------------PSWVGAAERAAELLRDVHLHEG 458
Query: 472 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 531
+ R R GP+ G L+DY + G L L++ WL A EL + F
Sbjct: 459 RLWRTSRDGRRGPNA--GVLEDYGCVAEGFLTLHQVTGAAGWLALAGELLDVVRARFAAP 516
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEH 590
+ GGYF+T + ++L R ++ D A PSG + L+ A++ + D R E
Sbjct: 517 D-GGYFDTADDAEALLRRPRDASDSATPSGQAAVAGALLTYAALTGSADHRDSARATVEQ 575
Query: 591 SLAVF--ETRLKDMAMAVPLMCCAADMLSVPSRKHVV 625
+ + R A AV A +L+ P+ VV
Sbjct: 576 LTPLLSRDARFAGWAGAV-----AEALLAGPAEVAVV 607
>gi|427733870|ref|YP_007053414.1| thioredoxin domain-containing protein [Rivularia sp. PCC 7116]
gi|427368911|gb|AFY52867.1| thioredoxin domain protein [Rivularia sp. PCC 7116]
Length = 691
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 210/631 (33%), Positives = 309/631 (48%), Gaps = 93/631 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D VA+ +N F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDLEVAEYMNANFIPIKVDREERPDIDSIYMQALQMMSGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--S 136
FLSP DL P GTYFPPE++Y RPGF +L+ ++ +D ++ L + A +E L S
Sbjct: 108 AFLSPDDLVPFYAGTYFPPEERYNRPGFLQVLKAIRHYYDTEKQDLQKRKAVILESLLTS 167
Query: 137 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 196
L A++ ++L Q + ++ + FP QM L S+
Sbjct: 168 AVLQTEATAETQDNQLLQKGWEIFTGIIAPNEQGN--------SFPTIPYAQMALQGSRF 219
Query: 197 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 256
+ + Q+ + +A GGI DHV GGFHRY+VD W VPHFEKMLYD GQ
Sbjct: 220 NFTSRYDCKQICTQRGL-----DLALGGIFDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQ 274
Query: 257 LANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 314
+ + +S + + F + I + + +L+R+M P G ++A+DADS T+ +E
Sbjct: 275 IVEYLANLWSAGVKEPAFETAIAKTV-KWLQREMTAPNGYFYAAQDADSFITQEDVEPEE 333
Query: 315 GAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 373
GAFYVW ++E +L + ++++ + P GN F+ +NVL + N
Sbjct: 334 GAFYVWGFSDLEQLLTRAELTELQQNFTVTPNGN------------FENQNVLQKRN--- 378
Query: 374 ASASKLGMPLEKYLNILGECRR-------KLF-----DVRSK------RPRPHLDDKVIV 415
+ +L LE L L R K F + ++K R P D K+IV
Sbjct: 379 --SDRLSNTLEATLEKLFTARYGDDSSTIKTFAPARNNAQAKSHNWQGRIPPVTDTKMIV 436
Query: 416 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTH 474
+WN ++IS ARA + + EY+E+A AA F+ D + +
Sbjct: 437 AWNAIMISGLARAYAVFS----------------QLEYLEMATQAAKFVLENQFVDGRFY 480
Query: 475 RLQHSFRNGPSKAPGFL---DDYAFLISGLLDLYE------FGSGTKWLVWAIELQNTQD 525
RL + + PG L +DYA I LLDL++ G WL A+ LQ +
Sbjct: 481 RLNYEGK------PGVLAQSEDYALFIKALLDLHQACFKADTGKPAFWLEKAVSLQEEFN 534
Query: 526 ELFLDREGGGYFNTTGEDPSVLLRVKEDH--DGAEPSGNSVSVINLVRLASIVAGSKSDY 583
+ E GYFN T D S L V+E + D A PS N +++ NLVRL + +
Sbjct: 535 DYLWSVELHGYFN-TASDASKELIVRERNYIDSATPSANGIALCNLVRLTLVTDNLQ--- 590
Query: 584 YRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
Y AE +L F + D A P + A D
Sbjct: 591 YLNLAEQALTAFRGVMNDATQACPSLFVALD 621
>gi|425456902|ref|ZP_18836608.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9807]
gi|389801878|emb|CCI18996.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9807]
Length = 692
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 210/620 (33%), Positives = 311/620 (50%), Gaps = 74/620 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A LN +F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLN 107
Query: 80 VFLSPD-LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
VFL+PD L P GGTYFP + ++ RPGF +L+ V+ +D++++ L++ F E L A
Sbjct: 108 VFLTPDSLIPFYGGTYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-A 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L SA + L +L + + + P FP + L S+ +
Sbjct: 164 LRQSAILPRSETNLAAPSLLTTGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGD 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D S + + Q+ + +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 224 DFDDSLQQAAYQRG-----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIV 278
Query: 259 NVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
+ +S ++ + + +++L+R+M P G ++A+DADS E +EGAF
Sbjct: 279 EYLANLWSAGDREAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAF 338
Query: 318 YVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW+ E+ D L + L + ++ + GN F+G+NVL
Sbjct: 339 YVWSDLELRDYLSTEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQG 381
Query: 377 SKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWN 418
+LG +E L+ L G + +L R D K+IV+WN
Sbjct: 382 GELGKEIENMLDKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWN 441
Query: 419 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQ 477
L+IS ARA A+F P+ Y ++A A FI ++ + D + RL
Sbjct: 442 SLMISGLARA---------FAVFGEPL-------YWQMATVATEFILKYQWLDGRFQRLN 485
Query: 478 HSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGY 536
+ G + +D+A+ I LLDL T WL AI+LQ D F + GGY
Sbjct: 486 Y---QGQASVLAQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDEGGY 542
Query: 537 FNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 594
FN T D S+ L V+E D A PS N +++ NL+RL+ + + Y AE +L
Sbjct: 543 FN-TASDHSLDLIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQS 598
Query: 595 FETRLKDMAMAVPLMCCAAD 614
F T L+ A P + A D
Sbjct: 599 FSTILEQSPTACPSLFVALD 618
>gi|359457589|ref|ZP_09246152.1| hypothetical protein ACCM5_02608 [Acaryochloris sp. CCMEE 5410]
Length = 695
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 211/633 (33%), Positives = 302/633 (47%), Gaps = 99/633 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F + +AK +N ++ IKVDREERPD+D +YM VQA+ G GGWPL+
Sbjct: 57 SSCHWCTVMEGEAFSNSEIAKYMNAQYIPIKVDREERPDIDSIYMQAVQAMTGQGGWPLN 116
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FLSP DL P GGTYFP E KYGRPGF +L ++ +D +++ L E+LS
Sbjct: 117 MFLSPGDLVPFYGGTYFPEEPKYGRPGFLQVLEAIRSFYDTEKEKLDTQK----EKLSGH 172
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L +S N + D P+ + A+ + + G P FP MM Y + L
Sbjct: 173 LQSSTVLNPIGDLQPELLSKGIAKNTTVLINKMPG-----PSFP------MMPYATIALH 221
Query: 199 DTG-KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
+ + E + Q+ +A GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 222 GSRFSTSEQEQAQQACRQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 281
Query: 258 ANVYLDAFS--LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 315
+ +S + + F I + +L+R+M G ++A+DAD+ T +EG
Sbjct: 282 VEYLANLWSTGVEEPAFKRAIAVTVA-WLQREMTAEAGYFYAAQDADNFVTTADIEPEEG 340
Query: 316 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 374
FY WT E+ +L E E + L GN + G VL
Sbjct: 341 RFYTWTDSELTHLLTPEEYAAMAEIFNLSVQGNFE-----------DGLTVLQRQQPGVI 389
Query: 375 SASKLGMPLEKYLNILGECRRKLFDVR-SKRPR------------------------PHL 409
S + + E +KLF VR RP P
Sbjct: 390 SET------------VEEALQKLFQVRYGDRPESLKTFPPATHNQVAKTHPWPGRIPPVT 437
Query: 410 DDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY 469
D K+IV+WN L+IS ARA+ + + + +Y+ +A AASFI +
Sbjct: 438 DTKMIVAWNSLMISGLARAAAVFQ----------------QPDYLALATKAASFILDQQW 481
Query: 470 DE-QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE------FGSGTKWLVWAIELQN 522
E + HR+ + +G +DYA LI LDL++ G ++WL A Q
Sbjct: 482 SEGRLHRVNY---DGEIAVIAQSEDYALLIKAFLDLHQACQSLAVGQASRWLEAAQTTQA 538
Query: 523 TQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 581
DE EGGGYFNT E +L+R + D A P+ N V++ NL+RL+ ++
Sbjct: 539 EFDEHLWAVEGGGYFNTGSEISEELLIRERSWLDNATPAANGVAIANLIRLSLFC--DRT 596
Query: 582 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
+Y Q AE +L F + A P + A D
Sbjct: 597 EYLSQ-AEQALQTFGQVMDSSTQACPSLFVALD 628
>gi|172036954|ref|YP_001803455.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
ATCC 51142]
gi|354554754|ref|ZP_08974058.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
ATCC 51472]
gi|171698408|gb|ACB51389.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
ATCC 51142]
gi|353553563|gb|EHC22955.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
ATCC 51472]
Length = 686
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 234/715 (32%), Positives = 334/715 (46%), Gaps = 102/715 (14%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A LND F+ IKVDREERPD+D +YM+ +Q + GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFCDLAIATYLNDNFLPIKVDREERPDLDSIYMSSLQMMGIQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP E +YGRPGF +L+ ++ +D +++ L F +++
Sbjct: 108 IFLTPGDLVPFYGGTYFPVEPRYGRPGFLQVLQSIRRFYDVEKEKL---NGFK-QEIVNT 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQL-SKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKK 196
L SA LP+ + + QL + D +A F RP M+ Y +
Sbjct: 164 LQQSAI-------LPKTDINVNNAQLIYRGVDVNTKIIQVTAEDFGRPC-FPMIPYSNLA 215
Query: 197 LEDTG-KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
L+ T GE E +V+ Q +A GGI D VGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 216 LQGTRFLFGEPEERHILVIQRGQDLALGGIFDQVGGGFHRYTVDSTWTVPHFEKMLYDNG 275
Query: 256 QLANVYLDAFSLTKD--VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 313
Q+ + +S + F I + +L+R+M P G ++A+DADS T+ +
Sbjct: 276 QIVEYLANLWSSGQQEPAFERAIALTV-QWLQREMTAPDGYFYAAQDADSFATKEDKEPE 334
Query: 314 EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 372
EGAFYVW +++E +L + + + + P GN F+GKNVL N
Sbjct: 335 EGAFYVWEYEQLEQLLTSTELEALTDVFTITPEGN------------FEGKNVLQRRNKE 382
Query: 373 SASASKLGMPLEKYLNILGECRRKLFDVRSK-------------RPRPHLDDKVIVSWNG 419
S S + + + G R L ++ R P D K+IV+WNG
Sbjct: 383 KLSDSIETILDKLFKERYGTSRNNLDTFQAAKNNQDAKTIHWPGRIPPVTDTKMIVAWNG 442
Query: 420 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 479
L+IS ARA + K P+ Y ++A +A FI + R Q
Sbjct: 443 LMISGLARAYAVFKQ---------PL-------YWQLACNATQFILEKQW--VNGRFQRI 484
Query: 480 FRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFN 538
G +DYAF I LLDL T+WL A+E+Q DE F + GGY+N
Sbjct: 485 NYQGNPSILAQSEDYAFFIKALLDLQAANPQDTQWLDKAMEIQQEFDEYFWSVDTGGYYN 544
Query: 539 TTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
++ + LL R + D A PS N +++ NLVRLA + Y AE +L F
Sbjct: 545 NADDNNNDLLVRERSYIDNATPSANGIAISNLVRLARLTDNLD---YLDKAEQALQAFSY 601
Query: 598 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDP 657
L++ A P + A D LV +V L + P
Sbjct: 602 VLRESPRACPSLLTALDWYHFG-----CLVRTNETV--------------LPTLITRYLP 642
Query: 658 ADTEEMDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLE 712
+D ++ NNA + LVCQ SC P T L + ++E
Sbjct: 643 TTAYRLD---DNLPNNA------------IGLVCQGLSCLEPATTQEQLLSQIIE 682
>gi|325676575|ref|ZP_08156253.1| thymidylate kinase [Rhodococcus equi ATCC 33707]
gi|325552753|gb|EGD22437.1| thymidylate kinase [Rhodococcus equi ATCC 33707]
Length = 674
Score = 299 bits (766), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 192/559 (34%), Positives = 282/559 (50%), Gaps = 63/559 (11%)
Query: 22 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 81
CHWCHVM ESFED+ A ++N+ FV IKVDREERPD+D VYM A+ G GGWP++ F
Sbjct: 57 CHWCHVMAHESFEDDATAAVMNEHFVCIKVDREERPDLDAVYMNATVAMTGQGGWPMTCF 116
Query: 82 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 141
L+PD P GTY+P E + G P F +L V D W +R + + A + +L + S
Sbjct: 117 LTPDGAPFYCGTYYPREPRGGMPSFVQLLHAVTDTWRSRRGDVDDAAASVVAELRRS-SG 175
Query: 142 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 201
+ + P ++P L + + D GGFG APKFP + ++ +L ++
Sbjct: 176 ALPAGGAPIDVPL--LSGAVANVLRDEDRDHGGFGGAPKFPPSMLLEGLLRSYERT---- 229
Query: 202 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 261
A + V T + MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD L Y
Sbjct: 230 ---SAGPTLRAVERTAEAMARGGIYDQLGGGFARYSVDTQWVVPHFEKMLYDNALLVRFY 286
Query: 262 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 321
T + + +D+L RD+ G SA DAD T +EG Y WT
Sbjct: 287 AHLARRTGSALARRVTEETVDFLLRDLRTAAGAFASALDAD-------TDGEEGLTYAWT 339
Query: 322 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 380
++++ D++G + E + + TG + +G +VL D
Sbjct: 340 AQQIADVVGDDDGRWAAETFAVTDTGTFE-----------RGTSVLQLPAD--------- 379
Query: 381 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 440
PL+ + L + R +L R++RP+P DDKV+ +WNGL I++ A A L
Sbjct: 380 -PLDA--DRLADIRSRLLAARTRRPQPARDDKVVTAWNGLAITALAEAGAALG------- 429
Query: 441 FNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLI 498
R +++E AE A + HL D RL+ + G P G L+DY L
Sbjct: 430 ---------RADWVEAAEECAHMVLSTHLVD---GRLRRASLGGTVGEPAGILEDYGALA 477
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLLRVKEDHDGA 557
+GL L++ +WL A L +T + F D E G +F+T + +++ R ++ DGA
Sbjct: 478 TGLSTLHQVTGVAEWLEVATGLLDTAIDHFADPDEPGSWFDTADDAETLVARPRDPLDGA 537
Query: 558 EPSGNSVSVINLVRLASIV 576
PSG SV+ L+ +S+V
Sbjct: 538 TPSGASVTTEALLTASSLV 556
>gi|290957891|ref|YP_003489073.1| hypothetical protein SCAB_34251 [Streptomyces scabiei 87.22]
gi|260647417|emb|CBG70522.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length = 691
Score = 299 bits (765), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 220/707 (31%), Positives = 326/707 (46%), Gaps = 83/707 (11%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
+ CHWCHVM ESFED+G A LN+ FVS+KVDREERPDVD VYM VQA G GGWP+S
Sbjct: 54 SACHWCHVMAKESFEDKGTAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMS 113
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
VF++P +P GTYFPP + G P F+ +L V AW +R +A L+E
Sbjct: 114 VFMTPAAEPFYFGTYFPPGPRQGMPSFRQVLEGVHHAWSSRRQEVADVAVKITRDLAE-R 172
Query: 140 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 199
+ A S+ LP Q L QL++ DS G F + KFP + ++ +L H +
Sbjct: 173 ALGAGSDGLPTGETQAQALL---QLTRDVDSTSGWFKGSTKFPPSMVVEFLLRHHAR--- 226
Query: 200 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSV---DERWHVPHFEKMLYDQGQ 256
TG ++M MA+ ++D VGGGFHRY + + VPHFEKMLYD
Sbjct: 227 TGSVA----AREMAEGLCGAMARSSLYDQVGGGFHRYVLLAHADGPLVPHFEKMLYDNAL 282
Query: 257 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
L VY + T + + D++ R++ G SA DADS + G+ + EGA
Sbjct: 283 LCRVYAHLWRATGSEPARRVALETADFMVRELRTNEGGFASALDADSDDGTGSGKHVEGA 342
Query: 317 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
+YVWT +++ ++LGE Y+ + E A
Sbjct: 343 YYVWTPEQLTEVLGEEDAALAVRYF-----------------------GVTEEGTFEEGA 379
Query: 377 SKLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 433
S L +P ++ + + R +L RS+RP P DDKV+ +WNGL +++ A
Sbjct: 380 SVLQLPQQEGVFDAERIESVRERLLAARSRRPAPGRDDKVVAAWNGLAVAALAETGAYF- 438
Query: 434 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLD 492
DR + ++ A +AA + R DE+ RL + R+G + A G L+
Sbjct: 439 ---------------DRPDLVDAAITAADLLVRLHLDERA-RLTRTSRDGQAGANAGVLE 482
Query: 493 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 552
DYA + G L L WL +A L + F+D G ++T + ++ R ++
Sbjct: 483 DYADVAEGFLALASVTGEGVWLEFAGFLLDHVLARFVDEGSGALYDTASDAEKLIRRPQD 542
Query: 553 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---- 608
D A PSG S + L A + S+ +R+ AE +L V +K + P
Sbjct: 543 PTDNATPSGWSAAAGA---LLGYAAQTGSEPHRRAAERALGV----VKALGPRAPRFIGW 595
Query: 609 -MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWE 667
+ A +L P + V +VG + + AA V+ + AD +E+
Sbjct: 596 GLATAEALLDGP--REVAVVGPEGHPGRRELHRAALLG-TAPGAVVAVGVADGDELPL-- 650
Query: 668 EHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEKP 714
+A + A VC++F+C P TD L +L P
Sbjct: 651 --------LAGRPLVGGEPAAYVCRHFTCDAPTTDADRLREVLGAAP 689
>gi|297626872|ref|YP_003688635.1| thioredoxin [Propionibacterium freudenreichii subsp. shermanii
CIRM-BIA1]
gi|296922637|emb|CBL57214.1| Conserved protein containing thioredoxin domain [Propionibacterium
freudenreichii subsp. shermanii CIRM-BIA1]
Length = 894
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 222/707 (31%), Positives = 328/707 (46%), Gaps = 85/707 (12%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESF D VA+ +ND FV+I VDREERPDVD+V+M QAL G GGWP++V
Sbjct: 49 SCHWCHVMAQESFRDPQVAQFVNDNFVAIAVDREERPDVDQVFMNATQALTGQGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
F +PD +P GTYFP + + G+P F + + + AW ++RD + +SGA QL++ S
Sbjct: 109 FCTPDGEPFFAGTYFPSQARVGQPSFLQVCQTLARAWAERRDEVVESGAHIASQLADQAS 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A+ + E P A L A L+ D GGFG+APKFP+P + ++
Sbjct: 169 AADPAGDQTGE-PPAADELLARALAL-VDPDNGGFGTAPKFPQPASLDALMV-------- 218
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ----GQ 256
+GE + V +L+ + +GGIHD VGGGFHRY+VD W VPHFEKML D G
Sbjct: 219 --TGEPHQ-IGAVQLSLEHIVRGGIHDIVGGGFHRYAVDAAWAVPHFEKMLDDNALLLGT 275
Query: 257 LANVYLDAFSLTKDV--FYSYICRDILDYLRRDM---IGPGGEIFSAEDADSAETEGATR 311
L + T D+ + R I+ +L R+M G S +DADS + +G +
Sbjct: 276 LTRAWRRTGPETGDLREHFELAIRGIVGWLSREMAITTDAGTAFASGQDADSLDADG--Q 333
Query: 312 KKEGAFYVWTSKEVEDILGEHAILFKEH-YYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+ EGAFY+WT +VE + LF + ++L P G +
Sbjct: 334 RVEGAFYLWTPHQVEAVFNRRDALFAQAVFHLTPKGT---------------------MP 372
Query: 371 DSSASASKLGMP-LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 429
D S++ G P ++ ILGE R +VR++RP P DDKV+ WNGL+ S A+
Sbjct: 373 DHSSTLRLHGDPDPDRLKRILGELR----EVRARRPAPARDDKVVAGWNGLLADSLTSAA 428
Query: 430 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 489
+ F P E++ +A S ++ + + H + S AP
Sbjct: 429 MV---------FGEP-------EWLTMARSVLDYLWSVHHFDTDHAARSSLAGVAGPAPA 472
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
L+DYA G L T+ L A+ + ELF + GG+F+ D ++ R
Sbjct: 473 VLEDYAGFALGAARLAGATGDTELLDRAVTVLGRGVELF-GADDGGFFDAQ-HDEALFTR 530
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEHSLAVFETRLKDMAMAVPL 608
++ D PS S+ V L +A + +D R+ V E +
Sbjct: 531 ARQLADEGGPSATSIMVTALQVVAGLTGNRDWADRARRAEPGLWQVLEQTPLASGWGLTQ 590
Query: 609 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMD--FW 666
+ A + R V +V +S +LA A TV + D F
Sbjct: 591 LAIDAQATAGMGRAQVAIVDPESRP--MGLLARAVWRLAPEGTVAALGTPDAPGFGELFA 648
Query: 667 EEHNSNNASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLLLEK 713
+ H+ + A A +C++ +C PVTD L + L +
Sbjct: 649 QRHDIDGAP-----------TAYICRDETCFDPVTDFTRLRDPLWRR 684
>gi|158334352|ref|YP_001515524.1| hypothetical protein AM1_1172 [Acaryochloris marina MBIC11017]
gi|158304593|gb|ABW26210.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 686
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 208/632 (32%), Positives = 303/632 (47%), Gaps = 97/632 (15%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F + +AK +N ++ IKVDREERPD+D +YM VQA+ G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSNSEIAKYMNAQYIPIKVDREERPDIDSIYMQAVQAMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FLSP DL P GGTYFP E +YGRPGF +L ++ +D +++ L E+LS
Sbjct: 108 MFLSPGDLVPFYGGTYFPEEPRYGRPGFLQVLEAIRSFYDTEKEKLDTQK----EKLSGH 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-KL 197
L +S N + D P+ L ++ ++K+ P FP + L+ S+
Sbjct: 164 LQSSTVLNPIGDLQPE----LLSKGIAKNTTVLINKM-PGPSFPMMPYAAIALHGSRFST 218
Query: 198 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 257
D K+ +A + + L A GGI+DHV GGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 219 PDQEKAQQACRQRGLDL------ALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQI 272
Query: 258 ANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 316
+ +S K+ + + +L+R+M G ++A+DAD+ T +EG
Sbjct: 273 VEYLANLWSAGVKEPAFERAIAGTVAWLQREMTAEAGYFYAAQDADNFVTTADIEPEEGR 332
Query: 317 FYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 375
FY WT E+ +L E E + L GN + G VL S
Sbjct: 333 FYTWTDSELTHLLTTEEYAAMAEIFNLSAQGNFE-----------DGLTVLQRQQPGVIS 381
Query: 376 ASKLGMPLEKYLNILGECRRKLFDVR-SKRPR------------------------PHLD 410
+ + E RKLF VR +RP P D
Sbjct: 382 ET------------VEEALRKLFQVRYGERPESLTTFPPATNNQVAKTHPWPGRIPPVTD 429
Query: 411 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 470
K+IV+WN L+IS ARA+ + + + +Y+ +A AA FI +
Sbjct: 430 TKMIVAWNSLMISGLARAAAVFQ----------------QPDYLALATKAARFILDQQWS 473
Query: 471 E-QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE------FGSGTKWLVWAIELQNT 523
E + HR+ + +G +DYA LI LDL++ ++WL A Q
Sbjct: 474 EGRLHRVNY---DGEIAVIAQSEDYALLIKAFLDLHQASQSLAVDQASRWLEAAQTTQAE 530
Query: 524 QDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD 582
DE EGGGYFNT E +L+R + D A P+ N V++ NL+RL+ + +++
Sbjct: 531 FDEHLWAVEGGGYFNTGSEMSEELLIRERSWLDNATPAANGVAIANLIRLSLVC--DRTE 588
Query: 583 YYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 614
Y Q AE +L F + A P + A D
Sbjct: 589 YLSQ-AEQALQTFGQVMGSSTQACPSLFVALD 619
>gi|154251723|ref|YP_001412547.1| hypothetical protein Plav_1270 [Parvibaculum lavamentivorans DS-1]
gi|154155673|gb|ABS62890.1| protein of unknown function DUF255 [Parvibaculum lavamentivorans
DS-1]
Length = 676
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 227/689 (32%), Positives = 339/689 (49%), Gaps = 74/689 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
CHWCHVM ESFEDE VA ++N+ FV+IKVDREERPD+D +YM+ + L GGWPL++
Sbjct: 50 ACHWCHVMAHESFEDESVAAVMNEHFVNIKVDREERPDIDAIYMSALHLLGQQGGWPLTM 109
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P+ +P GGTYFP E YGRPGF +L +V + ++ + ++ ++ L E S
Sbjct: 110 FLTPEGEPFWGGTYFPKEPNYGRPGFVQVLEEVARIFREEPAKVYKNRTALVKALEEQ-S 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
A+A + ++P + AE+L + D GG APKFP+ V + +L+ + T
Sbjct: 169 ATARPGEPTPQVPI----VVAEKLREIMDPVHGGIRGAPKFPQ-VPLLTLLWRAHL--RT 221
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
G+ A+ V L M++GGI+DH+GGG+ RYSVDE W PHFEKMLYD L ++
Sbjct: 222 GREDLAAP----VSRALDHMSEGGIYDHLGGGYARYSVDEFWLAPHFEKMLYDNALLIDL 277
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 320
+ T+ Y R+ +++L R+M+ GG ++ DADS EG EG FYVW
Sbjct: 278 LTLVWQETRKPLYERRIRETVEWLAREMVTEGGGFAASLDADS---EGV----EGKFYVW 330
Query: 321 TSKEVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 378
+ E++++L GE A LFK+ Y + GN ++ N+L L + A +
Sbjct: 331 SEAEIDNLLTPGE-AELFKQVYNVSGEGN------------WEETNILNRLARADAPFTA 377
Query: 379 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 438
+ L + +LF R R P DDKV+ WNGL+I++ ARA
Sbjct: 378 ------EEEAALEPLKARLFLERDLRVHPGFDDKVLADWNGLMIAALARAGAAFGEAG-- 429
Query: 439 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 498
+ E+A +A F+ + + RL H++R G + DD A +
Sbjct: 430 --------------WTEMAAAAFRFVMTEM--RKDGRLHHAWRAGKLQHIAMADDLANMA 473
Query: 499 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 558
L LYE ++L A L + D GGYF T + P++++R + D A
Sbjct: 474 DAALALYEATGEAEYLQAAESLAAELGAHYRDETNGGYFFTADDAPALIVRRRTVADDAT 533
Query: 559 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 618
P+ N L RLA + K DY + A+ + F L+ PL A + +
Sbjct: 534 PAANGTMPGVLARLALMT--GKQDYLAR-ADELIRAFAGELQQNIF--PLGSYIASLDTR 588
Query: 619 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEMDFWEEHNSNNASMAR 678
+VL+G K+ LA A L V+ + AD + E H ++ +
Sbjct: 589 LKPVQIVLIGSKAET---AELARAAFGTSLPARVL-MRVADGSALP--EGHPAHGKTALD 642
Query: 679 NNFSADKVVALVCQNFSCSPPVTDPISLE 707
K A VC +CS PVT+ +LE
Sbjct: 643 G-----KPTAYVCAGETCSLPVTEAAALE 666
>gi|37521713|ref|NP_925090.1| hypothetical protein gll2144 [Gloeobacter violaceus PCC 7421]
gi|35212711|dbj|BAC90085.1| gll2144 [Gloeobacter violaceus PCC 7421]
Length = 650
Score = 298 bits (763), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 226/697 (32%), Positives = 315/697 (45%), Gaps = 114/697 (16%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D +A +N FV+IKVDREERPD+D +YM +Q + GGWPL+
Sbjct: 53 SSCHWCTVMENEAFSDPEIAGFMNAHFVAIKVDREERPDIDAIYMQALQLMNQQGGWPLN 112
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP +D+YGRPGF +L + D + +R+ L E++ A
Sbjct: 113 IFLTPGDLVPFYGGTYFPVQDRYGRPGFLRVLEAIHDYYRGQRERLGDHK----ERMLGA 168
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
L A+ L ELP + LR L + G P FP L + LE
Sbjct: 169 LEAATRLQPL-SELPPDPLRRAVPPLR----ALLARDGMGPSFPMIPHAGFALRMGRFLE 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
G+ +A GGI DHVGGGFHRY+VD W VPHFEKMLYD GQ+
Sbjct: 224 VELAQSACERGED--------LATGGIFDHVGGGFHRYTVDGTWTVPHFEKMLYDNGQIV 275
Query: 259 NVYLDAFSLTKDV-FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 317
D ++ + + +L R+M G ++A+DADS EG +EG F
Sbjct: 276 EFLSDLWASGLHIPAFERAVEFTHRWLLREMTDGRGYFYAAQDADS---EG----EEGKF 328
Query: 318 YVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 376
YVW++ E+++IL GE + ++L GN F+G+ +++ S
Sbjct: 329 YVWSASELQEILSGEELAALESAFFLSAEGN------------FEGRTTVLQRR----SG 372
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
L +E L KLF VRS+R D K+IVSWN L+I+ RA+ +
Sbjct: 373 DVLAPVVETALT-------KLFGVRSRRVPAATDTKLIVSWNALMIAGLNRAADVF---- 421
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGFLDDYA 495
R EY E A AA FI H + +RL + +G P +DYA
Sbjct: 422 ------------GRPEYRETAVGAARFILEHQRAPGEFYRLNY---DGEPAIPAHAEDYA 466
Query: 496 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 555
I L+DLY +WL A LQ DE D E GGYF+ P +L+R K+ D
Sbjct: 467 CFIKALIDLYVSTQQGEWLEAARALQQQMDERLWDLEMGGYFSAPS-GPDLLIREKDFQD 525
Query: 556 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 615
A P+ N ++ NLVRL + + Y + AE L F L ++ A P + D
Sbjct: 526 SATPAANGLAAANLVRLFLL---TDEPAYLEAAEALLRQFARILAEVPRAGPSLLAGYD- 581
Query: 616 LSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEEM--DFWEEHNSNN 673
+ N+ ++ DP E+ +W
Sbjct: 582 ------------------------------WYRNQVLVQSDPERIAELLRGYW------- 604
Query: 674 ASMARNNFSADKVVALVCQNFSCSPPVTDPISLENLL 710
+ VALVC+ C P+ LE L
Sbjct: 605 PTAVFKAVDVKPAVALVCEGLRCLEPIESEAQLEAQL 641
>gi|86742579|ref|YP_482979.1| hypothetical protein Francci3_3900 [Frankia sp. CcI3]
gi|86569441|gb|ABD13250.1| protein of unknown function DUF255 [Frankia sp. CcI3]
Length = 673
Score = 298 bits (763), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 212/617 (34%), Positives = 297/617 (48%), Gaps = 66/617 (10%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 80
+CHWCHVM ESFED A+ +ND FV+IKVDREERPDVD VYM AL G GGWP++V
Sbjct: 49 SCHWCHVMAHESFEDAATAEYMNDHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTV 108
Query: 81 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 140
FL+P +P GTYFPP + G F+ +L V +AW +RD + +SGA +L+EA +
Sbjct: 109 FLTPTAEPFFAGTYFPPRPRPGMGSFRQVLTAVTEAWRTRRDEIEESGADIARRLAEAAT 168
Query: 141 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 200
+S L E+ L LS +D+R GGFG APKFP + +M+L HS + D
Sbjct: 169 RGPASG-LAAEITPALLDTAVAGLSARFDARHGGFGGAPKFPPSMVAEMLLRHSARTGD- 226
Query: 201 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 260
+ +MV T + MA+GGI+D + GGF RYSVD W VPHFEKMLYD L V
Sbjct: 227 ------ARSLEMVAVTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNALLLRV 280
Query: 261 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD---------SAETEGATR 311
YL + T + R+ +L D+ P G SA DAD SA GA
Sbjct: 281 YLHLWRATGSALAERVVRETAAFLLADLRTPQGGFASALDADAVPADAVPASAAPAGA-H 339
Query: 312 KKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EGA Y WT + +LG E + + G+ + +G +VL
Sbjct: 340 PEEGASYAWTPAQFVAVLGPEDGRWAAGVFGVTEQGSFE-----------RGTSVLRLPA 388
Query: 371 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 430
D P + +P DDKV+ +WNGL I++ A A
Sbjct: 389 D----------PDDPARFAAVRAALAAARATRPQP--ARDDKVVAAWNGLAIAALAEAGA 436
Query: 431 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPG 489
+ D +++ AE AA +R HL + + R R G + G
Sbjct: 437 LF----------------DEPDWVRAAEQAAVLLRDVHLVNGRLRRTSRDGRVGVNA--G 478
Query: 490 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 549
L+DY + GLL L++ +WL A L + + F + GG+F+T + +L R
Sbjct: 479 VLEDYGDVAEGLLTLHQVTGDPEWLALAGTLLDIVRDRFAASD-GGFFDTADDAEVLLRR 537
Query: 550 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPL 608
++D D A PSG + LV A++ + S +R AE ++A L +D A
Sbjct: 538 PRDDSDSATPSGQAAVAGALVSYAAL---TGSTEHRSAAETTVARVAPLLARDARFAGWA 594
Query: 609 MCCAADMLSVPSRKHVV 625
A +L+ P+ VV
Sbjct: 595 GAVAEALLAGPAEVAVV 611
>gi|121604944|ref|YP_982273.1| hypothetical protein Pnap_2043 [Polaromonas naphthalenivorans CJ2]
gi|120593913|gb|ABM37352.1| protein of unknown function DUF255 [Polaromonas naphthalenivorans
CJ2]
Length = 610
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 198/602 (32%), Positives = 309/602 (51%), Gaps = 52/602 (8%)
Query: 21 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLS 79
CHWCHVM ESF D +A L+N+ FV+IKVDREERPD+D VY Q L GGGWPL+
Sbjct: 49 ACHWCHVMAAESFSDPAIAALMNEGFVNIKVDREERPDLDAVYQMAHQLLRRTGGGWPLT 108
Query: 80 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 139
+FLSP P GTYFP G+ F+ +L V W ++R LA+ +Q A
Sbjct: 109 IFLSPQGVPFYSGTYFPSAAPEGQATFQAVLGSVSAVWREQRPALARQ-----DQALLAA 163
Query: 140 SASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 198
A+++ + +P A+R A +QL+ ++D GGFG+APKFP P ++ +L +++
Sbjct: 164 LAASAPRRDDAAVPGAAVRAQALQQLATAFDPAQGGFGAAPKFPHPSDLAFLLRRAREEG 223
Query: 199 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 258
D ++ ++M L TL+ MA+GG++D +GGGF RYSVD +W +PHFEKML D G L
Sbjct: 224 D-------AQAREMALLTLRKMAEGGLYDQIGGGFFRYSVDAQWRIPHFEKMLCDNGVLL 276
Query: 259 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 318
+Y DA +LT + + + D + R+M G ++ AD A+ +EG FY
Sbjct: 277 ALYADALALTGEPLFRRVVEDTASWALREMQSSAGGFHASLAADDAQ------GREGRFY 330
Query: 319 VWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-A 376
VW S+ + L + + H+ L + P F+G++ + + ++ A
Sbjct: 331 VWESEPLRLALSPNEWDVCAAHWGL----------VDGPG--FEGRHWHLRVARAAGPLA 378
Query: 377 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 436
L P + ++ R KL R KR RP D K++ W L+++ ARAS + +
Sbjct: 379 VTLRRPEAQVEELIASARPKLLAERDKRERPARDAKLLTGWTALMMTGLARASAVCQ--- 435
Query: 437 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 496
R E++ A SA F++ + + H P +A FLDD+AF
Sbjct: 436 -------------RPEWLLAARSALRFVQAGRWQDDGRTSGHLLAL-PGQA-AFLDDHAF 480
Query: 497 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 556
L+ +L L++ L +A + F DR+ GG+F T + P+++ R+K D
Sbjct: 481 LLEAVLALHDADPQPGDLPFAQAIAKAMLAQFEDRDAGGFFFTRHDAPALIHRLKTGLDA 540
Query: 557 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 616
A PSGN + + L+ L+ + ++ YR AE + VF + + + P + AA++L
Sbjct: 541 ATPSGNGTAALALLALSGKLDAPQAAAYRLAAERCVRVFAATVLNDPASFPRLLQAAELL 600
Query: 617 SV 618
Sbjct: 601 QA 602
>gi|302865439|ref|YP_003834076.1| N-acylglucosamine 2-epimerase [Micromonospora aurantiaca ATCC
27029]
gi|302568298|gb|ADL44500.1| N-acylglucosamine 2-epimerase [Micromonospora aurantiaca ATCC
27029]
Length = 678
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 221/699 (31%), Positives = 321/699 (45%), Gaps = 72/699 (10%)
Query: 11 KTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTY 66
K R LI+ CHWCHVM ESFE+E VA+L+ND FV +KVDREERPDVD VYMT
Sbjct: 34 KRRDVPVLISVGYAACHWCHVMAHESFENEAVARLMNDDFVCVKVDREERPDVDAVYMTA 93
Query: 67 VQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ 126
QA+ G GGWP++VF +PD P GTYFP R F +L V AW +R+ + +
Sbjct: 94 TQAMTGQGGWPMTVFATPDGTPFFCGTYFP------RANFIRLLGSVATAWRDQREAVLR 147
Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
G +E + A + + L EL L A +L+ YD GGFG APKFP +
Sbjct: 148 QGTAVVEAIGGAQAVGGVTAPLTAEL----LDAAASRLAGEYDETNGGFGGAPKFPPHMN 203
Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
+ +L H ++ TG ++ ++V T + MA+GG++D + GGF RYSVD W VPH
Sbjct: 204 LLFLLRHHQR---TG----SARSLEIVRHTCEAMARGGLNDQLAGGFARYSVDGHWTVPH 256
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
FEKMLYD L VY + LT D + RD +L ++ G SA DAD+
Sbjct: 257 FEKMLYDNALLLRVYTQLWRLTGDRLARRVARDTARFLADELHRAGEGFASALDADTEGV 316
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
EG T YVWT ++ ++LGE F DL ++ G +VL
Sbjct: 317 EGLT-------YVWTPDQLVEVLGEDDGRFA----------ADLFEVTADGTFEHGTSVL 359
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
D + ++ ++ +++G +L R RP+P DDKV+ +WNGL I++ A
Sbjct: 360 RLARDVDDADPEV---RARWQDVVG----RLLAARDTRPQPARDDKVVAAWNGLAITAIA 412
Query: 427 R----ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 482
AS ++ + E A V+ + AE A+ HL D + R+
Sbjct: 413 EFQQVASLLVSPDDEDANLMDGVLIVSDGAMRDAAEHLATV---HLVDGRLRRVSRDKVV 469
Query: 483 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 542
G + G L+DY + +++ +WL A EL + F + G +++T +
Sbjct: 470 G--QPAGVLEDYGCVAEAFCAMHQLTGEGRWLTLAGELLDVALARFAGPD-GAFYDTADD 526
Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 602
++ R + D A PSG S V LV A++ ++ YR+ AE +L +
Sbjct: 527 AERLVTRPADPTDNATPSGRSAIVAALVAYAALTGETR---YREAAEKTLTTVAPIVDRH 583
Query: 603 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE 662
A + L + V G + ++AAA V+ P
Sbjct: 584 ARFTGYAATVGEALLSGPYEIAVATGDPEG---DPLVAAARRHAPPGAVVVAGAP----- 635
Query: 663 MDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+A F K A VC+ F C PVT
Sbjct: 636 ------DQPGVPLLAGRPFVDGKPAAYVCRGFVCQRPVT 668
>gi|315501987|ref|YP_004080874.1| n-acylglucosamine 2-epimerase [Micromonospora sp. L5]
gi|315408606|gb|ADU06723.1| N-acylglucosamine 2-epimerase [Micromonospora sp. L5]
Length = 678
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 221/699 (31%), Positives = 321/699 (45%), Gaps = 72/699 (10%)
Query: 11 KTRRTHFLINT----CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTY 66
K R LI+ CHWCHVM ESFE+E VA+L+ND FV +KVDREERPDVD VYMT
Sbjct: 34 KRRDVPVLISVGYAACHWCHVMAHESFENEAVARLMNDDFVCVKVDREERPDVDAVYMTA 93
Query: 67 VQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ 126
QA+ G GGWP++VF +PD P GTYFP R F +L V AW +R+ + +
Sbjct: 94 TQAMTGQGGWPMTVFATPDGTPFFCGTYFP------RANFIRLLGSVATAWRDQREAVLR 147
Query: 127 SGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 186
G +E + A + + L EL L A +L+ YD GGFG APKFP +
Sbjct: 148 QGTAVVEAIGGAQAVGGVTAPLTAEL----LDAAASRLAGEYDETNGGFGGAPKFPPHMN 203
Query: 187 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 246
+ +L H ++ TG ++ ++V T + MA+GG++D + GGF RYSVD W VPH
Sbjct: 204 LLFLLRHHQR---TG----SARSLEIVRHTCEAMARGGLNDQLAGGFARYSVDGHWTVPH 256
Query: 247 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 306
FEKMLYD L VY + LT D + RD +L ++ G SA DAD+
Sbjct: 257 FEKMLYDNALLLRVYTQLWRLTGDRLARRVARDTARFLADELHRAGEGFASALDADTEGV 316
Query: 307 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 366
EG T YVWT ++ ++LGE F DL ++ G +VL
Sbjct: 317 EGLT-------YVWTPGQLVEVLGEDDGRFA----------ADLFEVTADGTFEHGTSVL 359
Query: 367 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 426
D + ++ ++ +++G +L R RP+P DDKV+ +WNGL I++ A
Sbjct: 360 RLARDVDDADPEV---RARWQDVVG----RLLAARDTRPQPARDDKVVAAWNGLAITAIA 412
Query: 427 R----ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 482
AS ++ + E A V+ + AE A+ HL D + R+
Sbjct: 413 EFQQVASLLVSPDDEDANLMDGVLIVSDGAMRDAAEHLATV---HLVDGRLRRVSRDKVV 469
Query: 483 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 542
G + G L+DY + +++ +WL A EL + F + G +++T +
Sbjct: 470 G--QPAGVLEDYGCVAEAFCAMHQLTGEGRWLTLAGELLDVALARFAGPD-GAFYDTADD 526
Query: 543 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 602
++ R + D A PSG S V LV A++ ++ YR+ AE +L +
Sbjct: 527 AERLVTRPADPTDNATPSGRSAIVAALVAYAALTGETR---YREAAEKTLTTVAPIVDRH 583
Query: 603 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVIHIDPADTEE 662
A + L + V G + ++AAA V+ P
Sbjct: 584 ARFTGYAATVGEALLSGPYEIAVATGDPEG---DPLVAAARRHAPPGAVVVAGAP----- 635
Query: 663 MDFWEEHNSNNASMARNNFSADKVVALVCQNFSCSPPVT 701
+A F K A VC+ F C PVT
Sbjct: 636 ------DQPGVPLLAGRPFVDGKPAAYVCRGFVCQRPVT 668
>gi|427723011|ref|YP_007070288.1| hypothetical protein Lepto7376_1084 [Leptolyngbya sp. PCC 7376]
gi|427354731|gb|AFY37454.1| hypothetical protein Lepto7376_1084 [Leptolyngbya sp. PCC 7376]
Length = 681
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 210/623 (33%), Positives = 303/623 (48%), Gaps = 86/623 (13%)
Query: 20 NTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 79
++CHWC VME E+F D+ +A LN F+ IKVDREERPD+D +YM +Q + G GGWPL+
Sbjct: 48 SSCHWCTVMEGEAFSDQAIADYLNANFLPIKVDREERPDIDSIYMQALQLMTGQGGWPLN 107
Query: 80 VFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 138
+FL+P DL P GGTYFP +Y RPGF +L ++ +D + + L + E++
Sbjct: 108 IFLTPDDLIPFYGGTYFPVSPRYNRPGFLDVLSSIRHFYDDEPERLKEIK----EEIFTI 163
Query: 139 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHSK 195
L S + LP L L L KS ++ G G P FP + L S+
Sbjct: 164 LDRSVT-------LPTTELSLDQTLLEKSIEACTGVVGRVSHGPSFPMIPYAAIALQGSR 216
Query: 196 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 255
E+T G A ++ + +A GGI+DHVGGGFHRY+VD W VPHFEKMLYD G
Sbjct: 217 FTENTKHDGSAITKKRGL-----DLALGGIYDHVGGGFHRYTVDPNWTVPHFEKMLYDNG 271
Query: 256 Q----LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATR 311
Q LAN++ + T + + +++L R+M P G ++A+DADS G
Sbjct: 272 QITEFLANLWANG---TTEPSFKTALEGTVEWLSREMTAPQGYFYAAQDADSFLDAGHVE 328
Query: 312 KKEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 370
+EG FYVW E++ + A +E+++++P GN F+GK VL
Sbjct: 329 PEEGTFYVWDFDELQTQFSDTAFQELQENFFIEPDGN------------FEGKIVL---- 372
Query: 371 DSSASASKLGMPLEKYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDK 412
+++++ L+ LN L G R+ L R D K
Sbjct: 373 -KRRASTEIPESLQATLNQLFAERYGGDRQSLETFPPARDNAEAKNTDWAGRIPAVTDTK 431
Query: 413 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 472
+IV+WN L+IS AR +L E + ++A + +FI + E
Sbjct: 432 LIVAWNALMISGLARIYGVLSLE----------------KAWDLAVNCVNFILETQWQE- 474
Query: 473 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDR 531
H + +F P +DYAFLI LLDL + T WL AI LQ+ D F
Sbjct: 475 GHLYRLNFGEEPDGVAQ-SEDYAFLIKALLDLQANNPTETHWLDKAITLQSEFDAKFWSA 533
Query: 532 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 591
E GYFN T E +L++ + D A PS N ++V NL+RL + ++ Y AE +
Sbjct: 534 ETKGYFNNT-EAKELLIKERSYQDNATPSANGIAVTNLIRLFLL---TEDLAYLDKAEQA 589
Query: 592 LAVFETRLKDMAMAVPLMCCAAD 614
L F L + P + A D
Sbjct: 590 LQTFAVVLDKSSQQAPSLIAALD 612
>gi|365866818|ref|ZP_09406418.1| hypothetical protein SPW_6722 [Streptomyces sp. W007]
gi|364003721|gb|EHM24861.1| hypothetical protein SPW_6722 [Streptomyces sp. W007]
Length = 619
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 202/573 (35%), Positives = 289/573 (50%), Gaps = 57/573 (9%)
Query: 27 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 86
+M ESFEDE VA LN FV +KVDREERPDVD VYM VQA G GGWP++VFL+ D
Sbjct: 1 MMAHESFEDETVAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTADA 60
Query: 87 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 146
+P GTYFPPE ++G P F+ +L V AW +R+ +A+ + L+ S +
Sbjct: 61 EPFYFGTYFPPEARHGSPSFQQVLEGVVAAWTDRREEVAEVAGRIVADLA-GRSLVHGGD 119
Query: 147 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 206
+P E + A L L++ YD + GGFG APKFP + ++ +L H + TG G
Sbjct: 120 GVPGE-QETAQALLG--LTREYDEQHGGFGGAPKFPPSMAVEFLLRHYAR---TGSEG-- 171
Query: 207 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 266
+M T MA+GGI+D +GGGF RYSVD W VPHFEKMLYD L VY +
Sbjct: 172 --ALQMAADTCSAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWR 229
Query: 267 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 326
T I + D++ R++ G SA DADS + +G R EGAFYVWT ++
Sbjct: 230 TTGSDEARRIALETADFMVRELRTAEGGFASALDADSEDADG--RHVEGAFYVWTPGQLR 287
Query: 327 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 386
++LGE F Y+ +++ +G +VL D+ P++
Sbjct: 288 EVLGEDDAAFAAAYF----------GVTEEGTFEEGASVLRLPGDTG--------PVDA- 328
Query: 387 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 446
+ + R +L R++RPRP DDKV+ +WNGL I++ A
Sbjct: 329 -ARVADVRARLLAARAERPRPGRDDKVVAAWNGLAIAALAETGAYF-------------- 373
Query: 447 GSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDL 504
DR + +E A AA +R HL + RL + ++G G L+DY + G L L
Sbjct: 374 --DRPDLVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDVAEGFLAL 429
Query: 505 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 564
WL +A L + E F EGG ++T + ++ R ++ D A PSG +
Sbjct: 430 AAVTGEGAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSATPSGWTA 488
Query: 565 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 597
+ L+ S A + S+ +R AE +L V +
Sbjct: 489 AAGALL---SYAAYTGSEAHRTAAEGALGVVKA 518
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.404
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,821,563,226
Number of Sequences: 23463169
Number of extensions: 524598392
Number of successful extensions: 1077751
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1480
Number of HSP's successfully gapped in prelim test: 102
Number of HSP's that attempted gapping in prelim test: 1066747
Number of HSP's gapped (non-prelim): 2237
length of query: 718
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 568
effective length of database: 8,839,720,017
effective search space: 5020960969656
effective search space used: 5020960969656
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)