BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019180
(345 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255555723|ref|XP_002518897.1| Protein MYG1, putative [Ricinus communis]
gi|223541884|gb|EEF43430.1| Protein MYG1, putative [Ricinus communis]
Length = 373
Score = 596 bits (1537), Expect = e-168, Method: Compositional matrix adjust.
Identities = 276/338 (81%), Positives = 307/338 (90%)
Query: 8 SSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
SSPAYST SP P KRVGTHNGSFHCDEALGCFMIRLT+KF NA+IVRSRD +VL+ LD
Sbjct: 36 SSPAYSTGSPCHAPPKRVGTHNGSFHCDEALGCFMIRLTNKFSNAEIVRSRDLQVLESLD 95
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH 127
AVLDVGGVYDP +D YDHHQKGFEEVFGHGF+TKLSSAGLVYKHFGKE+IAKEL +DEGH
Sbjct: 96 AVLDVGGVYDPVHDRYDHHQKGFEEVFGHGFTTKLSSAGLVYKHFGKEIIAKELQLDEGH 155
Query: 128 PDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSA 187
P+VHRLFLA+YK+FMEAIDA+DNGINQYDTDKPPRYVNNT+LSSRVG+LNLDW EPDQS
Sbjct: 156 PNVHRLFLAIYKSFMEAIDAVDNGINQYDTDKPPRYVNNTHLSSRVGRLNLDWIEPDQSP 215
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP 247
E+ENE FQ+ M LAG EFLD++ ++ +SWLPARSIV+EC+ R D DPSGEIMVL FCP
Sbjct: 216 EKENEVFQRAMTLAGSEFLDSLHYHAKSWLPARSIVMECLEARSDVDPSGEIMVLATFCP 275
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
WKLHLFE+EEE+KIEP IKYVLY+DDR K WRVQAVAV+PDRFESR+ LPAQWRGLRDDE
Sbjct: 276 WKLHLFEIEEELKIEPSIKYVLYQDDRSKHWRVQAVAVAPDRFESRRALPAQWRGLRDDE 335
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
LS+E+GIP CVFVHMSGFIGGN+SY GALAMARAALKL
Sbjct: 336 LSRESGIPACVFVHMSGFIGGNKSYEGALAMARAALKL 373
>gi|297805440|ref|XP_002870604.1| hypothetical protein ARALYDRAFT_330340 [Arabidopsis lyrata subsp.
lyrata]
gi|297316440|gb|EFH46863.1| hypothetical protein ARALYDRAFT_330340 [Arabidopsis lyrata subsp.
lyrata]
Length = 372
Score = 582 bits (1501), Expect = e-164, Method: Compositional matrix adjust.
Identities = 271/346 (78%), Positives = 311/346 (89%), Gaps = 1/346 (0%)
Query: 1 MAGTGVSSSPAYSTS-SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRD 59
MA + V + +TS SPS+ +K+VGTHNGSFHCDEALGCFMIRL DKF A IVRSRD
Sbjct: 27 MAPSSVVRVYSTATSPSPSEVSVKKVGTHNGSFHCDEALGCFMIRLVDKFSGADIVRSRD 86
Query: 60 PKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK 119
PK+L +LDAVLDVGGVYDP +D YDHHQKGFEEVFGHGF+TKLSSAGLVYKHFGKE+IAK
Sbjct: 87 PKILAELDAVLDVGGVYDPEHDRYDHHQKGFEEVFGHGFNTKLSSAGLVYKHFGKEIIAK 146
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
ELNVD+ HPDV RLFLAVYK+FMEAIDA+DNGIN+YDTD+PPRYVNNT+LS+RVG+LNLD
Sbjct: 147 ELNVDQDHPDVLRLFLAVYKSFMEAIDAVDNGINRYDTDQPPRYVNNTHLSARVGRLNLD 206
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W +PDQS E+ENEAFQ+ M LAGKEFL++V+F+ RSWLPARSIV++C+ ER+ DPSGEI
Sbjct: 207 WIDPDQSQEKENEAFQRAMALAGKEFLESVQFHARSWLPARSIVMQCLEERFKTDPSGEI 266
Query: 240 MVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQ 299
M+L +FCPWKLHLFELE+EMKIEPLIKYV+Y+D+RGKQWRVQAVAV+PDRFE+RKPLP +
Sbjct: 267 MILNKFCPWKLHLFELEQEMKIEPLIKYVIYQDERGKQWRVQAVAVAPDRFENRKPLPEK 326
Query: 300 WRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
WRGLRD+ELSK A IPGCVFVHMSGFIGGNQSY GAL MAR AL L
Sbjct: 327 WRGLRDEELSKAAEIPGCVFVHMSGFIGGNQSYDGALCMARTALTL 372
>gi|297742425|emb|CBI34574.3| unnamed protein product [Vitis vinifera]
Length = 338
Score = 580 bits (1495), Expect = e-163, Method: Compositional matrix adjust.
Identities = 274/333 (82%), Positives = 304/333 (91%), Gaps = 1/333 (0%)
Query: 14 TSSPSQTP-LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
S+ S P LKRVGTHNGSFHCDEALGCFMIRLTDKF +AQIVR+RD KVLD LDAVLDV
Sbjct: 6 VSAFSTAPGLKRVGTHNGSFHCDEALGCFMIRLTDKFRDAQIVRTRDQKVLDGLDAVLDV 65
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR 132
GGVYDPS D YDHHQKGF EVFGHGFSTKLSSAGLVYKHFG E+IAKEL +D+GHPDV R
Sbjct: 66 GGVYDPSRDRYDHHQKGFGEVFGHGFSTKLSSAGLVYKHFGMEIIAKELQLDQGHPDVLR 125
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
LFLAVYK+FMEA+DA+DNGINQYDTD+P RYVNNT+LSSRVGKLNLDW +PDQS+E+ENE
Sbjct: 126 LFLAVYKSFMEAVDAVDNGINQYDTDQPLRYVNNTHLSSRVGKLNLDWMDPDQSSEKENE 185
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
AF++ M+L+G EFLD VRF+ +SWLPARSIV+EC+A R D DPSGEIMVL RFCPWKLHL
Sbjct: 186 AFERAMNLSGSEFLDNVRFHAKSWLPARSIVMECLAARMDIDPSGEIMVLNRFCPWKLHL 245
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
FELEEEMKI+P IKYVLY+DDR K WRVQAVAV+PD+FESRKPLP+QWRGLRDDELS+EA
Sbjct: 246 FELEEEMKIDPPIKYVLYQDDRSKHWRVQAVAVAPDKFESRKPLPSQWRGLRDDELSREA 305
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GIPGCVFVHMSGFIGGNQ+Y GALAMA+ ALKL
Sbjct: 306 GIPGCVFVHMSGFIGGNQNYEGALAMAKDALKL 338
>gi|225426621|ref|XP_002280827.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Vitis
vinifera]
Length = 361
Score = 580 bits (1494), Expect = e-163, Method: Compositional matrix adjust.
Identities = 274/333 (82%), Positives = 304/333 (91%), Gaps = 1/333 (0%)
Query: 14 TSSPSQTP-LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
S+ S P LKRVGTHNGSFHCDEALGCFMIRLTDKF +AQIVR+RD KVLD LDAVLDV
Sbjct: 29 VSAFSTAPGLKRVGTHNGSFHCDEALGCFMIRLTDKFRDAQIVRTRDQKVLDGLDAVLDV 88
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR 132
GGVYDPS D YDHHQKGF EVFGHGFSTKLSSAGLVYKHFG E+IAKEL +D+GHPDV R
Sbjct: 89 GGVYDPSRDRYDHHQKGFGEVFGHGFSTKLSSAGLVYKHFGMEIIAKELQLDQGHPDVLR 148
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
LFLAVYK+FMEA+DA+DNGINQYDTD+P RYVNNT+LSSRVGKLNLDW +PDQS+E+ENE
Sbjct: 149 LFLAVYKSFMEAVDAVDNGINQYDTDQPLRYVNNTHLSSRVGKLNLDWMDPDQSSEKENE 208
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
AF++ M+L+G EFLD VRF+ +SWLPARSIV+EC+A R D DPSGEIMVL RFCPWKLHL
Sbjct: 209 AFERAMNLSGSEFLDNVRFHAKSWLPARSIVMECLAARMDIDPSGEIMVLNRFCPWKLHL 268
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
FELEEEMKI+P IKYVLY+DDR K WRVQAVAV+PD+FESRKPLP+QWRGLRDDELS+EA
Sbjct: 269 FELEEEMKIDPPIKYVLYQDDRSKHWRVQAVAVAPDKFESRKPLPSQWRGLRDDELSREA 328
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GIPGCVFVHMSGFIGGNQ+Y GALAMA+ ALKL
Sbjct: 329 GIPGCVFVHMSGFIGGNQNYEGALAMAKDALKL 361
>gi|79527307|ref|NP_199012.2| Metal-dependent protein hydrolase [Arabidopsis thaliana]
gi|332007365|gb|AED94748.1| Metal-dependent protein hydrolase [Arabidopsis thaliana]
Length = 373
Score = 578 bits (1490), Expect = e-162, Method: Compositional matrix adjust.
Identities = 269/346 (77%), Positives = 310/346 (89%), Gaps = 1/346 (0%)
Query: 1 MAGTGVSSSPAYSTS-SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRD 59
MA + V + +TS SPS+ +K+VGTHNGSFHCDEALGCFMIRL DKF A IVRSRD
Sbjct: 28 MAPSSVVRVYSTATSPSPSEISVKKVGTHNGSFHCDEALGCFMIRLVDKFSGADIVRSRD 87
Query: 60 PKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK 119
PK+L +LDAVLDVGGVYDP +D YDHHQKGFEEVFGHGF+TKLSSAGLVYKHFGKE+IAK
Sbjct: 88 PKILAELDAVLDVGGVYDPEHDRYDHHQKGFEEVFGHGFNTKLSSAGLVYKHFGKEIIAK 147
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
ELNV++ HPDV RLFLAVYK+FMEAIDA+DNGIN+YDTD+PPRYVNNT+LS RVG+LNLD
Sbjct: 148 ELNVEQDHPDVLRLFLAVYKSFMEAIDAVDNGINRYDTDQPPRYVNNTHLSPRVGRLNLD 207
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W +PDQS E+ENEAFQ+ M LAGKEFL++V+F+ RSWLPARSIV++C+ ER+ DPSGEI
Sbjct: 208 WIDPDQSQEKENEAFQRAMALAGKEFLESVQFHARSWLPARSIVMQCLEERFKTDPSGEI 267
Query: 240 MVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQ 299
M+L RFCPWKLHLFELE+EMKIEPLIKYV+Y+D+R KQWRVQAVAV+PDRFE+RKPLP +
Sbjct: 268 MILDRFCPWKLHLFELEQEMKIEPLIKYVIYQDERAKQWRVQAVAVAPDRFENRKPLPEK 327
Query: 300 WRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
WRGLRD+ELSK A IPGCVFVHMSGFIGGNQSY GAL+MA+ AL L
Sbjct: 328 WRGLRDEELSKAAEIPGCVFVHMSGFIGGNQSYDGALSMAQTALTL 373
>gi|9757942|dbj|BAB08430.1| GAMM1 protein-like [Arabidopsis thaliana]
Length = 346
Score = 577 bits (1486), Expect = e-162, Method: Compositional matrix adjust.
Identities = 269/346 (77%), Positives = 310/346 (89%), Gaps = 1/346 (0%)
Query: 1 MAGTGVSSSPAYSTS-SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRD 59
MA + V + +TS SPS+ +K+VGTHNGSFHCDEALGCFMIRL DKF A IVRSRD
Sbjct: 1 MAPSSVVRVYSTATSPSPSEISVKKVGTHNGSFHCDEALGCFMIRLVDKFSGADIVRSRD 60
Query: 60 PKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK 119
PK+L +LDAVLDVGGVYDP +D YDHHQKGFEEVFGHGF+TKLSSAGLVYKHFGKE+IAK
Sbjct: 61 PKILAELDAVLDVGGVYDPEHDRYDHHQKGFEEVFGHGFNTKLSSAGLVYKHFGKEIIAK 120
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
ELNV++ HPDV RLFLAVYK+FMEAIDA+DNGIN+YDTD+PPRYVNNT+LS RVG+LNLD
Sbjct: 121 ELNVEQDHPDVLRLFLAVYKSFMEAIDAVDNGINRYDTDQPPRYVNNTHLSPRVGRLNLD 180
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W +PDQS E+ENEAFQ+ M LAGKEFL++V+F+ RSWLPARSIV++C+ ER+ DPSGEI
Sbjct: 181 WIDPDQSQEKENEAFQRAMALAGKEFLESVQFHARSWLPARSIVMQCLEERFKTDPSGEI 240
Query: 240 MVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQ 299
M+L RFCPWKLHLFELE+EMKIEPLIKYV+Y+D+R KQWRVQAVAV+PDRFE+RKPLP +
Sbjct: 241 MILDRFCPWKLHLFELEQEMKIEPLIKYVIYQDERAKQWRVQAVAVAPDRFENRKPLPEK 300
Query: 300 WRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
WRGLRD+ELSK A IPGCVFVHMSGFIGGNQSY GAL+MA+ AL L
Sbjct: 301 WRGLRDEELSKAAEIPGCVFVHMSGFIGGNQSYDGALSMAQTALTL 346
>gi|15146292|gb|AAK83629.1| AT5g41970/MJC20_7 [Arabidopsis thaliana]
gi|19699166|gb|AAL90949.1| AT5g41970/MJC20_7 [Arabidopsis thaliana]
Length = 346
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 268/346 (77%), Positives = 310/346 (89%), Gaps = 1/346 (0%)
Query: 1 MAGTGVSSSPAYSTS-SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRD 59
MA + V + +TS SPS+ +K+VGTHNGSFHCDEALGCFMIRL DKF A IVRSRD
Sbjct: 1 MAPSSVVRVYSTATSPSPSEISVKKVGTHNGSFHCDEALGCFMIRLVDKFSGADIVRSRD 60
Query: 60 PKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK 119
PK+L +LDAVLDVGGVYDP +D YDHHQKGFEEVFGHGF+TKLSSAGLVYKHFGKE+IAK
Sbjct: 61 PKILAELDAVLDVGGVYDPEHDRYDHHQKGFEEVFGHGFNTKLSSAGLVYKHFGKEIIAK 120
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
ELNV++ HPDV RLFLAVYK+FMEAIDA+DNGIN+YDTD+PPRYVNNT+LS RVG+LNLD
Sbjct: 121 ELNVEQDHPDVLRLFLAVYKSFMEAIDAVDNGINRYDTDQPPRYVNNTHLSPRVGRLNLD 180
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W +PDQS E+ENEAFQ+ + LAGKEFL++V+F+ RSWLPARSIV++C+ ER+ DPSGEI
Sbjct: 181 WIDPDQSQEKENEAFQRAVALAGKEFLESVQFHARSWLPARSIVMQCLEERFKTDPSGEI 240
Query: 240 MVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQ 299
M+L RFCPWKLHLFELE+EMKIEPLIKYV+Y+D+R KQWRVQAVAV+PDRFE+RKPLP +
Sbjct: 241 MILDRFCPWKLHLFELEQEMKIEPLIKYVIYQDERAKQWRVQAVAVAPDRFENRKPLPEK 300
Query: 300 WRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
WRGLRD+ELSK A IPGCVFVHMSGFIGGNQSY GAL+MA+ AL L
Sbjct: 301 WRGLRDEELSKAAEIPGCVFVHMSGFIGGNQSYDGALSMAQTALTL 346
>gi|449452196|ref|XP_004143846.1| PREDICTED: UPF0160 protein-like [Cucumis sativus]
gi|449509379|ref|XP_004163571.1| PREDICTED: UPF0160 protein-like [Cucumis sativus]
Length = 345
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 268/323 (82%), Positives = 296/323 (91%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTH+GSFHCDEALGCFMIRLTDKF NAQIVR+RDP+VL LDAVLDVGGVYDPS+D
Sbjct: 23 KRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLQGLDAVLDVGGVYDPSHDR 82
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE+IAKEL VDEGHPDVHRLFLAVYK+FM
Sbjct: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFM 142
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EAIDA+DNGINQYDTDKPP+YVNNT+LSSRVG+LNLDW +PDQS E EN+AF++ M LAG
Sbjct: 143 EAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIDPDQSPENENKAFEKAMALAG 202
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EFLD+VRF+ +SWLPARSIV+ +A R+ DPSGEIMV+ FCPWKLHLFELE E+KIE
Sbjct: 203 NEFLDSVRFHAKSWLPARSIVMGSLAARHTIDPSGEIMVMTTFCPWKLHLFELEAELKIE 262
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
IKYVLY+DDR K WRVQAVAVSPDRFESR+PLPAQWRGLRD+ELSKE+GIPGCVFVHM
Sbjct: 263 NSIKYVLYQDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHM 322
Query: 323 SGFIGGNQSYGGALAMARAALKL 345
SGFIGGNQ+Y GAL MA+ ALKL
Sbjct: 323 SGFIGGNQTYDGALTMAKNALKL 345
>gi|356504252|ref|XP_003520911.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Glycine max]
Length = 369
Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust.
Identities = 263/338 (77%), Positives = 306/338 (90%)
Query: 8 SSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
+S ++S+ SP PLKRVGTHNGSFHCDEALGCFMIRLT+KF +A+IVRSRDP+VL+DLD
Sbjct: 32 ASLSFSSGSPVHAPLKRVGTHNGSFHCDEALGCFMIRLTNKFNSAEIVRSRDPQVLEDLD 91
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH 127
AVLDVGGVYDP+ D YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE+IAKEL VDE H
Sbjct: 92 AVLDVGGVYDPARDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELKVDEEH 151
Query: 128 PDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSA 187
DVH ++LAVYK+FMEAIDAIDNGIN+YDTD+PPRYVNNT+LSSRVG+LNLDWT+PDQS
Sbjct: 152 RDVHYIYLAVYKSFMEAIDAIDNGINRYDTDQPPRYVNNTHLSSRVGRLNLDWTDPDQSP 211
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP 247
E+ENEAFQ+ M LAG EFLD+VRF+V SWLPARSIV+E + R+ DPSGEI+VL + CP
Sbjct: 212 EKENEAFQRAMALAGSEFLDSVRFHVNSWLPARSIVMETLEARHTVDPSGEILVLTKLCP 271
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
WKLHLFELE E+K +P +KYVLY+D+R KQWRVQAVAVSPD F+SRK LP+QW+GLRD+E
Sbjct: 272 WKLHLFELEGELKNDPSVKYVLYQDERSKQWRVQAVAVSPDSFQSRKALPSQWQGLRDEE 331
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
LSKE+GIPGCVFVHMSGFIGGN+++ GALAMA+AAL +
Sbjct: 332 LSKESGIPGCVFVHMSGFIGGNKNFDGALAMAKAALTM 369
>gi|147827008|emb|CAN62285.1| hypothetical protein VITISV_034702 [Vitis vinifera]
Length = 325
Score = 553 bits (1425), Expect = e-155, Method: Compositional matrix adjust.
Identities = 264/324 (81%), Positives = 291/324 (89%), Gaps = 3/324 (0%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
LKRVGTHNGSFHCDEALGCFMIRLTDKF +AQIVR+RD KVLD LDAVLDVGGVYDPS D
Sbjct: 5 LKRVGTHNGSFHCDEALGCFMIRLTDKFRDAQIVRTRDQKVLDGLDAVLDVGGVYDPSRD 64
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
YDHHQKGF EVFGHGFSTKLSSAGLVYKHFG E+IAKEL +D+GHPDV RLFLAVYK+F
Sbjct: 65 RYDHHQKGFGEVFGHGFSTKLSSAGLVYKHFGMEIIAKELQLDQGHPDVLRLFLAVYKSF 124
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
MEA+DA+DNGINQYDTD+P RYVNNT+LSSRVGKLNLDW +PDQS+E+ENEAF++ M+L+
Sbjct: 125 MEAVDAVDNGINQYDTDQPXRYVNNTHLSSRVGKLNLDWMDPDQSSEKENEAFERAMNLS 184
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G E F LPARSIV+EC+A R D DPSGEIMVL RFCPWKLHLFELEEEMKI
Sbjct: 185 GSEMF---VFMQSHGLPARSIVMECLAARMDIDPSGEIMVLNRFCPWKLHLFELEEEMKI 241
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+P IKYVLY+DDR K WRVQAVAV+PD+FESRKPLP+QWRGLRDDELS+EAGIPGCVFVH
Sbjct: 242 DPPIKYVLYQDDRSKHWRVQAVAVAPDKFESRKPLPSQWRGLRDDELSREAGIPGCVFVH 301
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
MSGFIGGNQ+Y GALAMA+ ALKL
Sbjct: 302 MSGFIGGNQNYEGALAMAKDALKL 325
>gi|224075220|ref|XP_002304577.1| predicted protein [Populus trichocarpa]
gi|222842009|gb|EEE79556.1| predicted protein [Populus trichocarpa]
Length = 345
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 262/325 (80%), Positives = 291/325 (89%), Gaps = 4/325 (1%)
Query: 11 AYSTSSPSQTPLKR---VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
+ S P Q P R VGTHNGSFHCDEALGCFMIRLT K+ NA+IVRSRDP+VL+ LD
Sbjct: 9 GFLASQPWQ-PYSRHFHVGTHNGSFHCDEALGCFMIRLTSKYSNAEIVRSRDPQVLEGLD 67
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH 127
AVLDVGGVYDPS D YDHHQKGF EVFGHGFSTKLSSAGLVYKHFGKE+IAKEL VDEGH
Sbjct: 68 AVLDVGGVYDPSRDRYDHHQKGFGEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGH 127
Query: 128 PDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSA 187
PDVHRLFLAVYK+FMEAIDAIDNGINQYDTD+PP+YVNNT+LSSRVGK NLDWT+PDQS+
Sbjct: 128 PDVHRLFLAVYKSFMEAIDAIDNGINQYDTDQPPKYVNNTHLSSRVGKFNLDWTDPDQSS 187
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP 247
++ENEAF++ M LAG EFLD VRF+ +SWLPARSIV+EC+A R+D DPSGEIMVLK FCP
Sbjct: 188 KKENEAFERAMALAGSEFLDNVRFHAKSWLPARSIVMECLATRFDVDPSGEIMVLKTFCP 247
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
WKLHLFELE E+KI+PL+KYVLY+DDR KQWRVQAVA SPD FESRK LPAQWRGLRDD+
Sbjct: 248 WKLHLFELEGELKIDPLVKYVLYQDDRSKQWRVQAVAKSPDSFESRKALPAQWRGLRDDD 307
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSY 332
LS+E+ IPG VFVHMSGFIGGNQS+
Sbjct: 308 LSRESQIPGGVFVHMSGFIGGNQSF 332
>gi|359806789|ref|NP_001241305.1| uncharacterized protein LOC100800706 [Glycine max]
gi|255636959|gb|ACU18812.1| unknown [Glycine max]
Length = 369
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 253/318 (79%), Positives = 289/318 (90%)
Query: 17 PSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVY 76
P PLKRVGTHNGSFHCDEALGCFMIRLT+KF +A+IVRSRDP+VL+ LDAVLDVGGVY
Sbjct: 41 PVHAPLKRVGTHNGSFHCDEALGCFMIRLTNKFNSAEIVRSRDPQVLEVLDAVLDVGGVY 100
Query: 77 DPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DP+ D YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE+IAKEL VDE H DVH ++LA
Sbjct: 101 DPARDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELEVDEEHRDVHHIYLA 160
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
VYK+FMEAIDAIDNGINQYDTD+PPRYVNNT+LSSRVG+LNLDWT+PDQS E+ENEAFQ+
Sbjct: 161 VYKSFMEAIDAIDNGINQYDTDQPPRYVNNTHLSSRVGRLNLDWTDPDQSPEKENEAFQR 220
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
M LAG EFLD+VRF+V SWLPARSIV+E +A R+ DPSGEI+VL +FCPWKLHLFELE
Sbjct: 221 AMALAGSEFLDSVRFHVNSWLPARSIVMETLAARHTVDPSGEILVLTKFCPWKLHLFELE 280
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
E+K +P +KYVLY+D+R K WRVQAVAVSPD F+SRK LP+QWRGLRD+ELSK++GIPG
Sbjct: 281 GELKNDPAVKYVLYQDERSKHWRVQAVAVSPDSFQSRKALPSQWRGLRDEELSKKSGIPG 340
Query: 317 CVFVHMSGFIGGNQSYGG 334
CVFVH+SGFIGGNQ++ G
Sbjct: 341 CVFVHISGFIGGNQNFDG 358
>gi|357137062|ref|XP_003570120.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Brachypodium
distachyon]
Length = 385
Score = 539 bits (1388), Expect = e-151, Method: Compositional matrix adjust.
Identities = 249/338 (73%), Positives = 293/338 (86%)
Query: 7 SSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDL 66
SS+ A SS KRVGTHNGSFHCDEALGCF+IRLT +F A +VR+RD ++LD L
Sbjct: 47 SSAAADGDSSNGTGSGKRVGTHNGSFHCDEALGCFLIRLTSQFAGADVVRTRDSQILDSL 106
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEG 126
DAVLDVGGVYDPS YDHHQKGF EVFG+GF TKLSSAGLVYKHFGKE+IAKEL ++E
Sbjct: 107 DAVLDVGGVYDPSRHRYDHHQKGFNEVFGYGFHTKLSSAGLVYKHFGKEIIAKELELNED 166
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQS 186
H DVHR++LA+YK+F+EA+DAIDNGINQYDTD+PP+YVNNT+LSSRVG+ N DWT+PDQS
Sbjct: 167 HEDVHRVYLAIYKSFVEALDAIDNGINQYDTDQPPKYVNNTHLSSRVGRFNPDWTDPDQS 226
Query: 187 AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC 246
E+EN AFQQ M LAG EF+++VRF+V+SWLPARSIV+EC+ R + DPSGEIMVL RFC
Sbjct: 227 PEKENAAFQQAMMLAGSEFMESVRFHVKSWLPARSIVMECLLSRGNVDPSGEIMVLDRFC 286
Query: 247 PWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
PWKLHLFELE+E+K +PL KYVLY+D+R K WRVQAV+V+PDRFESRK LP +WRGLRDD
Sbjct: 287 PWKLHLFELEQELKTDPLTKYVLYQDERSKTWRVQAVSVAPDRFESRKALPERWRGLRDD 346
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
ELSKE GIPGCVF+HMSGFIGGN++Y GAL MARAA++
Sbjct: 347 ELSKETGIPGCVFIHMSGFIGGNKTYEGALEMARAAVQ 384
>gi|224132674|ref|XP_002321381.1| predicted protein [Populus trichocarpa]
gi|222868377|gb|EEF05508.1| predicted protein [Populus trichocarpa]
Length = 361
Score = 537 bits (1383), Expect = e-150, Method: Compositional matrix adjust.
Identities = 246/322 (76%), Positives = 281/322 (87%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTHNG FHCDEALGCF+IRLTDKF NA I+RSRDP+VL+ LDAVLDVGGVYDPS D
Sbjct: 39 RAVGTHNGKFHCDEALGCFLIRLTDKFSNAHIIRSRDPQVLETLDAVLDVGGVYDPSRDR 98
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGF EVFGHGF+TKLSSAGLVYKH+G E+IAKEL ++EGH DVH LFLAVYKNF+
Sbjct: 99 YDHHQKGFHEVFGHGFTTKLSSAGLVYKHYGAEIIAKELQLNEGHQDVHELFLAVYKNFV 158
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EAIDA DNGINQYD D+PP+YVNNT+LS RVG+LNLDW +P QS+ERE+EAFQ M +AG
Sbjct: 159 EAIDAADNGINQYDIDQPPKYVNNTSLSQRVGRLNLDWVDPSQSSEREDEAFQHAMKVAG 218
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF++ + F+ +SWLPARSIV+EC+A R D D SGEIMVL R CPWKLH+FELEE+MKI
Sbjct: 219 TEFMENINFHAKSWLPARSIVMECLASREDIDHSGEIMVLTRSCPWKLHIFELEEKMKIN 278
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
P IKYV+Y+DDR + WR+QAVAVSPD+FESRKPLP WRGL DDELSK GI GCVFVHM
Sbjct: 279 PSIKYVIYQDDRSENWRIQAVAVSPDKFESRKPLPLPWRGLVDDELSKATGIAGCVFVHM 338
Query: 323 SGFIGGNQSYGGALAMARAALK 344
SGFIGGN+SY GALAMARA+LK
Sbjct: 339 SGFIGGNRSYEGALAMARASLK 360
>gi|115447985|ref|NP_001047772.1| Os02g0686600 [Oryza sativa Japonica Group]
gi|41052722|dbj|BAD07579.1| putative GAMM1 protein [Oryza sativa Japonica Group]
gi|113537303|dbj|BAF09686.1| Os02g0686600 [Oryza sativa Japonica Group]
gi|125583288|gb|EAZ24219.1| hypothetical protein OsJ_07968 [Oryza sativa Japonica Group]
Length = 352
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 249/322 (77%), Positives = 287/322 (89%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTHNGSFHCDEALGC++IRLT +F A +VR+RDP++LD LDAVLDVGGVYDPS
Sbjct: 30 KRVGTHNGSFHCDEALGCYLIRLTSQFAGADVVRTRDPQILDTLDAVLDVGGVYDPSRHR 89
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGF EVFGHGF+TKLSSAGLVYKHFGKE+IAKEL V E H DVHRL+LA+YK+F+
Sbjct: 90 YDHHQKGFNEVFGHGFNTKLSSAGLVYKHFGKEIIAKELEVSEDHEDVHRLYLAIYKSFV 149
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+DA+DNGINQYDTD+PP+YVNNT+LSSRVG+LN DWT+PDQS E+EN AFQQ M LAG
Sbjct: 150 EALDAVDNGINQYDTDQPPKYVNNTHLSSRVGRLNPDWTDPDQSPEKENAAFQQAMMLAG 209
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF+++VRF+V+SWLPARSIV+EC+ R DPS EIMVL RFCPWKLHLFELEEE+KI+
Sbjct: 210 SEFMESVRFHVKSWLPARSIVLECLLSRGKVDPSEEIMVLDRFCPWKLHLFELEEELKID 269
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
PL KYVLY+D+R K WRVQAV V+PDRFESRKPLP +WRGLRDDELS + GIPGCVFVHM
Sbjct: 270 PLTKYVLYQDERSKSWRVQAVGVAPDRFESRKPLPEKWRGLRDDELSNDIGIPGCVFVHM 329
Query: 323 SGFIGGNQSYGGALAMARAALK 344
SGFIGGN++Y GAL MARAA+K
Sbjct: 330 SGFIGGNKTYEGALEMARAAIK 351
>gi|226505164|ref|NP_001141129.1| hypothetical protein [Zea mays]
gi|194702788|gb|ACF85478.1| unknown [Zea mays]
gi|413923497|gb|AFW63429.1| hypothetical protein ZEAMMB73_324496 [Zea mays]
Length = 384
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 248/323 (76%), Positives = 287/323 (88%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTHNGSFHCDEALGCF+IRLT +F A +VR+RD ++LD LDAVLDVGGVYDPS
Sbjct: 62 KRVGTHNGSFHCDEALGCFLIRLTSQFAGADVVRTRDSQILDTLDAVLDVGGVYDPSRHR 121
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGF EVFGHGF+TKLSSAGLVYKHFGKE+IAKEL V+E H DVH L+LA+YK+F+
Sbjct: 122 YDHHQKGFSEVFGHGFNTKLSSAGLVYKHFGKEIIAKELGVNEDHEDVHCLYLAIYKSFV 181
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+DAIDNGINQYDTD+PP+YVNNT+LSSRVG+ NLDWT+PDQS E+EN AF+Q M LAG
Sbjct: 182 EALDAIDNGINQYDTDQPPKYVNNTHLSSRVGRFNLDWTDPDQSLEKENAAFEQAMVLAG 241
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF+++VRF+V+SWLPARSIV+EC+ + + DPSGEIMVL RFCPWKLHLFELEEE+K +
Sbjct: 242 SEFMESVRFHVKSWLPARSIVLECLLSKGNIDPSGEIMVLDRFCPWKLHLFELEEELKTD 301
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
PL KYVLY+D+R K WRVQAVAV+PDRFESRK LP +WRG+RDDELS E GIPGCVFVHM
Sbjct: 302 PLTKYVLYQDERSKSWRVQAVAVAPDRFESRKALPEKWRGMRDDELSAETGIPGCVFVHM 361
Query: 323 SGFIGGNQSYGGALAMARAALKL 345
SGFIGGN++Y GAL MARAALK
Sbjct: 362 SGFIGGNKTYEGALEMARAALKF 384
>gi|125540715|gb|EAY87110.1| hypothetical protein OsI_08511 [Oryza sativa Indica Group]
Length = 355
Score = 520 bits (1340), Expect = e-145, Method: Compositional matrix adjust.
Identities = 249/322 (77%), Positives = 287/322 (89%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTHNGSFHCDEALGC++IRLT +F A +VR+RDP++LD LDAVLDVGGVYDPS
Sbjct: 33 KRVGTHNGSFHCDEALGCYLIRLTSQFAGADVVRTRDPQILDTLDAVLDVGGVYDPSRHR 92
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGF EVFGHGF+TKLSSAGLVYKHFGKE+IAKEL V E H DVHRL+LA+YK+F+
Sbjct: 93 YDHHQKGFNEVFGHGFNTKLSSAGLVYKHFGKEIIAKELEVSEDHEDVHRLYLAIYKSFV 152
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+DA+DNGINQYDTD+PP+YVNNT+LSSRVG+LN DWT+PDQS E+EN AFQQ M LAG
Sbjct: 153 EALDAVDNGINQYDTDQPPKYVNNTHLSSRVGRLNPDWTDPDQSPEKENAAFQQAMMLAG 212
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF+++VRF+V+SWLPARSIV+EC+ R DPS EIMVL RFCPWKLHLFELEEE+KI+
Sbjct: 213 SEFMESVRFHVKSWLPARSIVLECLLSRGKVDPSEEIMVLDRFCPWKLHLFELEEELKID 272
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
PL KYVLY+D+R K WRVQAV V+PDRFESRKPLP +WRGLRDDELS + GIPGCVFVHM
Sbjct: 273 PLTKYVLYQDERSKSWRVQAVGVAPDRFESRKPLPEKWRGLRDDELSNDIGIPGCVFVHM 332
Query: 323 SGFIGGNQSYGGALAMARAALK 344
SGFIGGN++Y GAL MARAA+K
Sbjct: 333 SGFIGGNKTYEGALEMARAAIK 354
>gi|413923496|gb|AFW63428.1| hypothetical protein ZEAMMB73_324496 [Zea mays]
Length = 385
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 248/324 (76%), Positives = 287/324 (88%), Gaps = 1/324 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTHNGSFHCDEALGCF+IRLT +F A +VR+RD ++LD LDAVLDVGGVYDPS
Sbjct: 62 KRVGTHNGSFHCDEALGCFLIRLTSQFAGADVVRTRDSQILDTLDAVLDVGGVYDPSRHR 121
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGF EVFGHGF+TKLSSAGLVYKHFGKE+IAKEL V+E H DVH L+LA+YK+F+
Sbjct: 122 YDHHQKGFSEVFGHGFNTKLSSAGLVYKHFGKEIIAKELGVNEDHEDVHCLYLAIYKSFV 181
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+DAIDNGINQYDTD+PP+YVNNT+LSSRVG+ NLDWT+PDQS E+EN AF+Q M LAG
Sbjct: 182 EALDAIDNGINQYDTDQPPKYVNNTHLSSRVGRFNLDWTDPDQSLEKENAAFEQAMVLAG 241
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF+++VRF+V+SWLPARSIV+EC+ + + DPSGEIMVL RFCPWKLHLFELEEE+K +
Sbjct: 242 SEFMESVRFHVKSWLPARSIVLECLLSKGNIDPSGEIMVLDRFCPWKLHLFELEEELKTD 301
Query: 263 PLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
PL KYVLY +D+R K WRVQAVAV+PDRFESRK LP +WRG+RDDELS E GIPGCVFVH
Sbjct: 302 PLTKYVLYQQDERSKSWRVQAVAVAPDRFESRKALPEKWRGMRDDELSAETGIPGCVFVH 361
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
MSGFIGGN++Y GAL MARAALK
Sbjct: 362 MSGFIGGNKTYEGALEMARAALKF 385
>gi|242062898|ref|XP_002452738.1| hypothetical protein SORBIDRAFT_04g031530 [Sorghum bicolor]
gi|241932569|gb|EES05714.1| hypothetical protein SORBIDRAFT_04g031530 [Sorghum bicolor]
Length = 384
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 246/322 (76%), Positives = 286/322 (88%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTHNGSFHCDEALGCF+IRLT +F A +VR+RD ++LD LDAVLDVGGVYDPS
Sbjct: 62 KRVGTHNGSFHCDEALGCFLIRLTSQFAGADVVRTRDSQILDTLDAVLDVGGVYDPSRHR 121
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGF EVFGHGF+TKLSSAGLVYKHFGKE+IAKEL V+E H DVHRL+L++YK+F+
Sbjct: 122 YDHHQKGFSEVFGHGFNTKLSSAGLVYKHFGKEIIAKELGVNEDHEDVHRLYLSIYKSFV 181
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+DAIDNGINQYDTD+PP+YVN+T+LSSRVG+LN DWT+PDQS E+EN AF+Q M LAG
Sbjct: 182 EALDAIDNGINQYDTDQPPKYVNSTHLSSRVGRLNPDWTDPDQSPEKENAAFEQAMILAG 241
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF+++VRF+V+SWLPARSIV+EC+ R + D SGEIMVL RFCPWKLHLFELEEE+K +
Sbjct: 242 SEFMESVRFHVKSWLPARSIVLECLKSRENIDLSGEIMVLDRFCPWKLHLFELEEELKTD 301
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
PL KYVLY+D+R K WRVQAV V+PDRFESRK LP +WRG+RDDELS E GIPGCVFVHM
Sbjct: 302 PLTKYVLYQDERSKSWRVQAVGVAPDRFESRKALPEKWRGMRDDELSAEIGIPGCVFVHM 361
Query: 323 SGFIGGNQSYGGALAMARAALK 344
SGFIGGN++Y GAL MARAALK
Sbjct: 362 SGFIGGNKTYEGALEMARAALK 383
>gi|297819564|ref|XP_002877665.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323503|gb|EFH53924.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 355
Score = 509 bits (1312), Expect = e-142, Method: Compositional matrix adjust.
Identities = 229/321 (71%), Positives = 281/321 (87%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KR+GTHNG+FHCDEAL CF++R + +F +AQI+R+RD +VL+ LDA LDVGGVYDP ++
Sbjct: 32 KRIGTHNGTFHCDEALACFILRRSSRFSDAQIIRTRDHQVLEKLDAALDVGGVYDPQSER 91
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGF EVFG GF+TKLSSAGLVYKH+G E+I+KEL +D+ HPDV RLFLAVYKNF+
Sbjct: 92 YDHHQKGFSEVFGLGFNTKLSSAGLVYKHYGLEIISKELQLDQRHPDVFRLFLAVYKNFI 151
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+DAIDNGI+QYDTD+PPRYVNNT+L R+G+LNLDW EPDQS+E+E+EAF + M+LAG
Sbjct: 152 EAVDAIDNGIHQYDTDQPPRYVNNTSLGHRIGRLNLDWIEPDQSSEKEDEAFHRAMELAG 211
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EFL+ V F+ +SWLPARSIV+EC+A+RYD D SGEIM L + CPWKLH+FELEEEMKI+
Sbjct: 212 SEFLECVHFHAKSWLPARSIVMECLAKRYDIDSSGEIMKLSKQCPWKLHIFELEEEMKID 271
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
P IKYVLY+DDR + WR+QAV+VSP+RFESRK LP WRGL ++LS+E+ IPGCVFVHM
Sbjct: 272 PPIKYVLYQDDRSENWRIQAVSVSPERFESRKALPLSWRGLEMEKLSEESSIPGCVFVHM 331
Query: 323 SGFIGGNQSYGGALAMARAAL 343
SGFIG N++Y GALAMARA+L
Sbjct: 332 SGFIGANRTYEGALAMARASL 352
>gi|15229119|ref|NP_190501.1| Metal-dependent protein hydrolase [Arabidopsis thaliana]
gi|12324441|gb|AAG52181.1|AC012329_8 unknown protein; 11418-9170 [Arabidopsis thaliana]
gi|6723402|emb|CAB66411.1| putative protein [Arabidopsis thaliana]
gi|332645007|gb|AEE78528.1| Metal-dependent protein hydrolase [Arabidopsis thaliana]
Length = 354
Score = 506 bits (1304), Expect = e-141, Method: Compositional matrix adjust.
Identities = 228/321 (71%), Positives = 280/321 (87%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTHNG+FHCDEAL CF++R +++F +AQIVR+RD +VL+ LDA LDVGGVYDP ++
Sbjct: 32 KRVGTHNGTFHCDEALACFILRRSNRFSDAQIVRTRDHQVLEKLDAALDVGGVYDPQSER 91
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQKGF EVFG GF+TKLSSAGLVYKH+G E+I+KEL +++ HPDV RLFLAVYKNF+
Sbjct: 92 YDHHQKGFSEVFGLGFNTKLSSAGLVYKHYGLEIISKELQLEQRHPDVFRLFLAVYKNFI 151
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+DA+DNGI+QYDTD+PPRYVNNT+L R+G+LNLDW EPDQS+ +E+EAF + M+LAG
Sbjct: 152 EAVDALDNGIHQYDTDQPPRYVNNTSLGHRIGRLNLDWIEPDQSSAKEDEAFHRAMELAG 211
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EFL+ V F+ +SWLPARSIV+EC+A+RYD D SGEIM L + CPWKLH+FELEEEMKI+
Sbjct: 212 SEFLECVHFHAKSWLPARSIVMECLAKRYDIDSSGEIMKLSKQCPWKLHIFELEEEMKID 271
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
P IKYVLY+DDR + WR+QAV+VSP+RFESRK LP WRGL ++LS+E+ IP CVFVHM
Sbjct: 272 PPIKYVLYQDDRSENWRIQAVSVSPERFESRKALPLAWRGLEKEKLSEESSIPRCVFVHM 331
Query: 323 SGFIGGNQSYGGALAMARAAL 343
SGFIG NQ+Y GALAMARA+L
Sbjct: 332 SGFIGANQTYEGALAMARASL 352
>gi|116781318|gb|ABK22052.1| unknown [Picea sitchensis]
Length = 383
Score = 503 bits (1294), Expect = e-140, Method: Compositional matrix adjust.
Identities = 231/313 (73%), Positives = 269/313 (85%), Gaps = 1/313 (0%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
LKR+GTHNG+FHCDEALGCFMIRLT KF AQIVR+RDP+VL+ +DAVLDVGGVYDPS D
Sbjct: 61 LKRIGTHNGTFHCDEALGCFMIRLTHKFAGAQIVRTRDPQVLETMDAVLDVGGVYDPSKD 120
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
YDHHQ+GF VFGHGF TKLSSAGL+YKHFG E+I++E+ D GHPDV R++LAVYK+F
Sbjct: 121 RYDHHQRGFGAVFGHGFVTKLSSAGLIYKHFGLEIISREIGRDPGHPDVERMYLAVYKSF 180
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+EAIDAIDNGIN YDTD+PP+Y+NNT+LSSRVGK+N+DW + DQS E+ENE F M LA
Sbjct: 181 VEAIDAIDNGINMYDTDQPPKYINNTHLSSRVGKINIDWIDEDQSPEKENERFHHAMALA 240
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G EFL+++ + +SWLPARSIV E IA R D SGEI VLK FCPWKLHLFE+E+E+K+
Sbjct: 241 GGEFLESIHHHAKSWLPARSIVEESIAAR-QVDSSGEIFVLKGFCPWKLHLFEIEQELKL 299
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
EP IKYVLYED R KQWRVQAVAVSPD+FESRKPLP WRGLRDDELS+E GIP CVFVH
Sbjct: 300 EPSIKYVLYEDSRAKQWRVQAVAVSPDKFESRKPLPESWRGLRDDELSRECGIPDCVFVH 359
Query: 322 MSGFIGGNQSYGG 334
+SGFIGGN+S+ G
Sbjct: 360 ISGFIGGNKSFDG 372
>gi|302816009|ref|XP_002989684.1| hypothetical protein SELMODRAFT_229483 [Selaginella moellendorffii]
gi|300142461|gb|EFJ09161.1| hypothetical protein SELMODRAFT_229483 [Selaginella moellendorffii]
Length = 337
Score = 487 bits (1253), Expect = e-135, Method: Compositional matrix adjust.
Identities = 227/324 (70%), Positives = 269/324 (83%), Gaps = 1/324 (0%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ +VGTHNG+FHCDEALGCF+IRLTDKF NA IVR+R+ +VLD LDAVLDVGGVYDP D
Sbjct: 15 VAKVGTHNGTFHCDEALGCFLIRLTDKFANADIVRTRNQEVLDKLDAVLDVGGVYDPEKD 74
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+GF+ V G+GF+TKLSSAGLVYKHFGKE++AKEL + HPDV +++A+YKNF
Sbjct: 75 RFDHHQRGFDHVLGNGFNTKLSSAGLVYKHFGKEIVAKELALQPDHPDVEAIYVALYKNF 134
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
ME IDAIDNGINQYD D+PP+YV NTNLS+RVGKLN DWTE +QS E EN AF + M+L
Sbjct: 135 MEGIDAIDNGINQYDVDQPPKYVENTNLSARVGKLNPDWTE-EQSVEMENAAFARAMELT 193
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G EF +R+Y +SWLPARSIV++CI R + PSGE+MVL FCPWK HLFELEEE++I
Sbjct: 194 GSEFQQALRYYAKSWLPARSIVLDCINRRKEIHPSGEVMVLDTFCPWKQHLFELEEELQI 253
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
EP IKYVLYED+R QWRVQAVA++P RFESRK LPA WRGLRD+ELSK AGI VFVH
Sbjct: 254 EPSIKYVLYEDERAGQWRVQAVALAPGRFESRKALPAAWRGLRDEELSKVAGIDNGVFVH 313
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
SGFIGGN+S+ GAL MA+ +L L
Sbjct: 314 ASGFIGGNKSFEGALRMAQKSLAL 337
>gi|302820256|ref|XP_002991796.1| hypothetical protein SELMODRAFT_272231 [Selaginella moellendorffii]
gi|300140477|gb|EFJ07200.1| hypothetical protein SELMODRAFT_272231 [Selaginella moellendorffii]
Length = 337
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 225/324 (69%), Positives = 269/324 (83%), Gaps = 1/324 (0%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ +VGTHNG+FHCDEALGCF+IRLTDKF NA IVR+R+ +VLD LDAVLDVGGVYDP D
Sbjct: 15 VAKVGTHNGTFHCDEALGCFLIRLTDKFANADIVRTRNQEVLDKLDAVLDVGGVYDPEKD 74
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+GF+ V G+GF+TKLSSAGLVYKHFGKE++AKEL + HPDV +++A+YKNF
Sbjct: 75 RFDHHQRGFDHVLGNGFNTKLSSAGLVYKHFGKEIVAKELALQPDHPDVEAIYVALYKNF 134
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
ME IDAIDNGINQYD D+PP+YV NTNLS+RVGKLN DWTE +QS E EN AF + M+L
Sbjct: 135 MEGIDAIDNGINQYDVDQPPKYVENTNLSARVGKLNPDWTE-EQSVEMENAAFARAMELT 193
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G EF +R+Y +SWLPARSIV++CI R + PSGE+MVL FCPWK HLFELEEE++I
Sbjct: 194 GSEFQQALRYYAKSWLPARSIVLDCINRRKEIHPSGEVMVLDTFCPWKQHLFELEEELQI 253
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+P IKYVLYED+R QWRVQAVA++P RFESRK LPA WRGLRD+ELS+ AGI VFVH
Sbjct: 254 KPSIKYVLYEDERAGQWRVQAVALAPGRFESRKALPAAWRGLRDEELSQVAGIDNGVFVH 313
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
SGFIGGN+S+ GAL MA+ +L L
Sbjct: 314 ASGFIGGNKSFEGALRMAQKSLAL 337
>gi|168024008|ref|XP_001764529.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684393|gb|EDQ70796.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 331
Score = 473 bits (1218), Expect = e-131, Method: Compositional matrix adjust.
Identities = 220/313 (70%), Positives = 265/313 (84%), Gaps = 2/313 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTHNG+FHCDEALGCFMIRLTDK+ +AQIVR+RD KVLD DAVLDVGG YDP+ D
Sbjct: 3 KRVGTHNGTFHCDEALGCFMIRLTDKYADAQIVRTRDQKVLDTCDAVLDVGGAYDPATDR 62
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQ+GF FGHGF TKLSSAGLVYKH+G+E++AKEL + E HPDV R+FLAVYK+FM
Sbjct: 63 YDHHQRGFNHNFGHGFVTKLSSAGLVYKHYGQEIVAKELGLSEDHPDVQRVFLAVYKSFM 122
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+D IDNGIN YDTDKPPRY N+T+LS+RVG+LN DW + +Q+ E E+EAF++ M LAG
Sbjct: 123 EAVDGIDNGINLYDTDKPPRYSNDTHLSARVGRLNPDWMD-EQTPEAEDEAFRKAMSLAG 181
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EFL++VR+Y RSWLPARSIV EC+A+R D SGEI+++K +CPWK HL +LE+E+ I+
Sbjct: 182 SEFLESVRYYARSWLPARSIVAECLADRLKDDSSGEILIIKHYCPWKGHLADLEKELNID 241
Query: 263 PLIKYVLYEDD-RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
P IKYVLYE D R K WRVQAV+++P FE+R PLPA WRGLRDDELSKE+GI GCVFVH
Sbjct: 242 PTIKYVLYEQDGRSKGWRVQAVSLTPGSFENRLPLPAVWRGLRDDELSKESGIDGCVFVH 301
Query: 322 MSGFIGGNQSYGG 334
MSGFIGG +++ G
Sbjct: 302 MSGFIGGAKTFEG 314
>gi|168012546|ref|XP_001758963.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690100|gb|EDQ76469.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 331
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 219/324 (67%), Positives = 272/324 (83%), Gaps = 2/324 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTHNG+FHCDEALGCFMIRLTDKF +A+IVR+RD +VLD DAVLDVGGVYDP D
Sbjct: 3 KRVGTHNGTFHCDEALGCFMIRLTDKFADAEIVRTRDQQVLDTCDAVLDVGGVYDPVIDR 62
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQ+GF+ FGHGF TKLSSAGL+YKH+G+E++AKEL + E HPDV R+FL +YK+F+
Sbjct: 63 YDHHQRGFDCSFGHGFVTKLSSAGLIYKHYGQEIVAKELGLSEDHPDVQRVFLTMYKSFV 122
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EAID +DNGINQYDTDK PRYVN+T+LS+RVG+LN W + +++ E E+EAF++ M L G
Sbjct: 123 EAIDGVDNGINQYDTDKFPRYVNDTHLSARVGRLNPGWMD-EKTPEAEDEAFRKAMSLTG 181
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EFL ++R+Y RSWLPARSIV +C+A+R + + SGEI+++K++CPWK HL ELE+E+ ++
Sbjct: 182 IEFLQSLRYYARSWLPARSIVADCVADRKESNKSGEILIIKQYCPWKGHLIELEKELNVD 241
Query: 263 PLIKYVLYEDD-RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
P IKYVLYE D R K WRVQAV+VSP FESR PLP+ WRGLRDDELS+E GI G VFVH
Sbjct: 242 PTIKYVLYEQDGRSKGWRVQAVSVSPGSFESRLPLPSTWRGLRDDELSRETGIDGGVFVH 301
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
MSGFIGGN+++ GALAMA AL +
Sbjct: 302 MSGFIGGNKTFEGALAMAEKALAM 325
>gi|356509212|ref|XP_003523345.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Glycine max]
Length = 356
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 213/321 (66%), Positives = 258/321 (80%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
RVGTHNG+FHCDEAL CFM+RL+ +F A IVR+RDP +L L A++DVGG YDP+ +
Sbjct: 35 RVGTHNGTFHCDEALACFMLRLSKRFSAAHIVRTRDPNLLQSLHALVDVGGAYDPTRHRF 94
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQK F +VFGHGF TKLSSAGLVYKHFG E+IA L + E HP VH+L+ +Y+NF+E
Sbjct: 95 DHHQKDFHQVFGHGFHTKLSSAGLVYKHFGLEIIANVLKLHEDHPHVHQLYPVIYRNFVE 154
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
AIDA+DNG+N YD D PP+Y NT L+SR+ +LNL+W + DQS++RENEAF + M LAG
Sbjct: 155 AIDAVDNGVNLYDLDVPPKYEINTTLASRIKRLNLNWMDSDQSSDRENEAFHRAMALAGA 214
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
EFL+ V +Y +SWLPARSIV++ +A R D SGEI+ L R CPWKLH+ ELEEEMKI P
Sbjct: 215 EFLENVNYYAKSWLPARSIVMDSLAARESVDSSGEIVKLNRSCPWKLHIHELEEEMKISP 274
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
IKYVLY DDR + WR+QAVA+SP +FESRKPLP WRGL +D LS+ AGIPGC FVHMS
Sbjct: 275 SIKYVLYPDDRSENWRLQAVAISPAKFESRKPLPYLWRGLENDNLSEVAGIPGCTFVHMS 334
Query: 324 GFIGGNQSYGGALAMARAALK 344
GFIGGN+SYGGALAMARA+LK
Sbjct: 335 GFIGGNRSYGGALAMARASLK 355
>gi|357464057|ref|XP_003602310.1| hypothetical protein MTR_3g092050 [Medicago truncatula]
gi|355491358|gb|AES72561.1| hypothetical protein MTR_3g092050 [Medicago truncatula]
Length = 345
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 211/331 (63%), Positives = 264/331 (79%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVG 73
+SS S + KRVGTHNG+FHCDEAL CFM+RL+ F A IVR+RD +L+ LDAV+DVG
Sbjct: 14 SSSFSTSRAKRVGTHNGTFHCDEALACFMLRLSKLFSGADIVRTRDSNLLEVLDAVVDVG 73
Query: 74 GVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
VYDP YDHHQ+ F++VFG+GF TKLSSAGL+YKHFG E+IA L++DE HP VH+L
Sbjct: 74 RVYDPKRHRYDHHQRDFDQVFGNGFVTKLSSAGLIYKHFGLEIIANVLHLDEDHPHVHQL 133
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ A+Y+NF+EA+DA+DNG++QYD + P+Y+ NT+L+ RV +LN DW + DQSA+ ENEA
Sbjct: 134 YPAIYRNFVEAVDAVDNGVSQYDLKESPKYIINTDLAFRVERLNFDWIDSDQSADAENEA 193
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
F + M LAG EF++ V +Y +SWLPA+SIV+EC+A R D SGEI+ L R CPWKLH+
Sbjct: 194 FHRAMALAGGEFVENVNYYAKSWLPAQSIVMECLAARETIDSSGEIIKLNRSCPWKLHIH 253
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
ELEEEMKI P IKYVLY+DDR + WR+QAVA+SP +E+RKPLP WRGL +D LS+ AG
Sbjct: 254 ELEEEMKINPSIKYVLYQDDRSETWRIQAVAISPATYENRKPLPYLWRGLENDRLSEVAG 313
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
IP C F HMSGFIGGNQSY GALAMARA+LK
Sbjct: 314 IPDCTFAHMSGFIGGNQSYDGALAMARASLK 344
>gi|413923498|gb|AFW63430.1| hypothetical protein ZEAMMB73_324496 [Zea mays]
Length = 293
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 205/268 (76%), Positives = 238/268 (88%)
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
PS YDHHQKGF EVFGHGF+TKLSSAGLVYKHFGKE+IAKEL V+E H DVH L+LA+
Sbjct: 26 PSRHRYDHHQKGFSEVFGHGFNTKLSSAGLVYKHFGKEIIAKELGVNEDHEDVHCLYLAI 85
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
YK+F+EA+DAIDNGINQYDTD+PP+YVNNT+LSSRVG+ NLDWT+PDQS E+EN AF+Q
Sbjct: 86 YKSFVEALDAIDNGINQYDTDQPPKYVNNTHLSSRVGRFNLDWTDPDQSLEKENAAFEQA 145
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
M LAG EF+++VRF+V+SWLPARSIV+EC+ + + DPSGEIMVL RFCPWKLHLFELEE
Sbjct: 146 MVLAGSEFMESVRFHVKSWLPARSIVLECLLSKGNIDPSGEIMVLDRFCPWKLHLFELEE 205
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E+K +PL KYVLY+D+R K WRVQAVAV+PDRFESRK LP +WRG+RDDELS E GIPGC
Sbjct: 206 ELKTDPLTKYVLYQDERSKSWRVQAVAVAPDRFESRKALPEKWRGMRDDELSAETGIPGC 265
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVHMSGFIGGN++Y GAL MARAALK
Sbjct: 266 VFVHMSGFIGGNKTYEGALEMARAALKF 293
>gi|384245283|gb|EIE18778.1| metal-dependent protein hydrolase [Coccomyxa subellipsoidea C-169]
Length = 337
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 198/333 (59%), Positives = 253/333 (75%), Gaps = 1/333 (0%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
S SS + + ++GTHNGSFHCDEALGCF+++ TD F +A IVR+RD +L LD V+DV
Sbjct: 5 SASSEVKRKMVKIGTHNGSFHCDEALGCFLLKRTDHFKDADIVRTRDEDILKQLDIVIDV 64
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR 132
GG YDP +DHHQ+GF EVFGHGF TKLSSAGLVYKHFG+++IAK L + H V +
Sbjct: 65 GGKYDPDACRFDHHQRGFAEVFGHGFQTKLSSAGLVYKHFGRDIIAKLLESRKDHEQVEK 124
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
++LAVYK+FMEAIDAIDNGINQ+D D PP+Y++ TNLSSRVG LN W E DQS+ER +E
Sbjct: 125 VYLAVYKHFMEAIDAIDNGINQWDVDTPPKYMSTTNLSSRVGALNPRWNE-DQSSERTDE 183
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
F + + L G +FLD+V + ++WLPAR+ V E I +R + D SGEI+ L R CPWK HL
Sbjct: 184 QFLKAVQLTGSDFLDSVDYISKAWLPARTYVQEAIEKRSEVDSSGEIIRLPRVCPWKEHL 243
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
++LE+E+ ++ +K+ +YEDDR K WRVQAV+V+ FE+RK LPA W GLRD ELS+ A
Sbjct: 244 YDLEQELALDKPLKFCIYEDDRAKSWRVQAVSVAAGSFENRKSLPAAWMGLRDAELSEAA 303
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GIP CVFVH SGFIGG ++ GA+ MA+A LKL
Sbjct: 304 GIPDCVFVHASGFIGGTKTLEGAIRMAQAGLKL 336
>gi|307105228|gb|EFN53478.1| hypothetical protein CHLNCDRAFT_56260 [Chlorella variabilis]
Length = 359
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 195/329 (59%), Positives = 245/329 (74%), Gaps = 2/329 (0%)
Query: 17 PSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVY 76
PS LK +GTH+GSFHCDEALG F+++ T KF AQ+VR+R+P+V+ LD V+DVGGVY
Sbjct: 32 PSNGGLK-IGTHSGSFHCDEALGAFLLKQTAKFAGAQVVRTREPEVIKTLDVVIDVGGVY 90
Query: 77 DPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
+P +DHHQ+GF EVFGHGF+TKLSSAGLVYKH+G+E++A + + H DV ++LA
Sbjct: 91 EPEAQRFDHHQRGFGEVFGHGFNTKLSSAGLVYKHYGREIVASAMQLPADHADVEAVYLA 150
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
VYKNFMEAIDAIDNG+NQ+D++ PP+YVNNT+LS+RVG+LN DW E D S FQ
Sbjct: 151 VYKNFMEAIDAIDNGVNQWDSEGPPKYVNNTHLSARVGRLNPDWNE-DASEAATMRQFQA 209
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
M+L G+EF + V++Y RSWLPAR+ V E + +R PSG IM L FCPWK HL+ LE
Sbjct: 210 AMELTGREFTEAVQYYARSWLPARTHVKEALEQRAQVHPSGRIMKLNTFCPWKEHLYVLE 269
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
EEM + + Y LYEDDR K+WR+QAV V+P F+SRK LP WRGLRD ELS G+PG
Sbjct: 270 EEMGVAGEVLYCLYEDDREKKWRIQAVGVAPGSFDSRKALPVAWRGLRDAELSAATGVPG 329
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVH SGFIGG + GA+AMA AL +
Sbjct: 330 GVFVHASGFIGGCSTLEGAVAMATLALDM 358
>gi|255080606|ref|XP_002503876.1| predicted protein [Micromonas sp. RCC299]
gi|226519143|gb|ACO65134.1| predicted protein [Micromonas sp. RCC299]
Length = 325
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 2/322 (0%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+GSFHCDEALGC++++ T +F A+IVRSRDP+ L D V+DVG VY+P N+ +
Sbjct: 5 KIGTHDGSFHCDEALGCYLLKQTKQFREAEIVRSRDPETLGACDIVIDVGAVYEPENNRF 64
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+GFEE+FGHGF TKLSSAGLVYKHFG+E++A L + E P V +++L VYK+F+E
Sbjct: 65 DHHQRGFEEIFGHGFVTKLSSAGLVYKHFGREIVATTLALPESDPIVEKIYLKVYKSFIE 124
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
+D IDNG+N YDTD P +Y +NT LS+RVGKLN W E D S + F + + L G
Sbjct: 125 GVDGIDNGVNMYDTDAPAKYSDNTGLSARVGKLNPAWNE-DSSPATQMTQFTKAVALTGS 183
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
EF D V++Y SWLPAR V + + + PSGEI+ L +CPWK HLFELE E P
Sbjct: 184 EFDDAVKYYGLSWLPARKHVEQALDTAKEVHPSGEILKLPCYCPWKDHLFELEAERGTSP 243
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
L KY LYEDD+G WR+QA+ + P FE+RKPLPA WRG+RD L + +GI GC+FVH +
Sbjct: 244 LPKYALYEDDKG-NWRIQAIPLMPSSFENRKPLPAAWRGIRDSALDELSGIEGCIFVHAA 302
Query: 324 GFIGGNQSYGGALAMARAALKL 345
GFIGGN++Y GALAMA AL +
Sbjct: 303 GFIGGNKTYEGALAMATKALTM 324
>gi|393212463|gb|EJC97963.1| GAMM1 protein [Fomitiporia mediterranea MF3/22]
Length = 348
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 180/324 (55%), Positives = 230/324 (70%), Gaps = 3/324 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL CF+++LT + +A++VRSRDPK+LD D V+DVGGVY+ S
Sbjct: 23 KVIGTHNGTFHCDEALACFLLKLTSTYRDARVVRSRDPKILDGCDIVVDVGGVYNDSKKR 82
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQ+GFE+VFGHGF TKLSSAGLVYKH+GKE+IA LNV P V L+L +YK F+
Sbjct: 83 YDHHQRGFEDVFGHGFVTKLSSAGLVYKHYGKEIIASRLNVTPDSPVVDLLWLKMYKEFI 142
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EAID IDNG+ QY D Y N T+LSSRV LN W E + + + F + ++ G
Sbjct: 143 EAIDGIDNGVTQYPKDSQAAYRNRTDLSSRVAWLNPAWNESADTTQMD-ALFAKASNMTG 201
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+EFL+ + +Y +SW+PAR IVV ++ R SG I+V +F PWK HLF++EEE I
Sbjct: 202 EEFLNRLDYYAKSWMPARDIVVAALSNRMTVHSSGRIVVFDQFAPWKEHLFDIEEEQNIP 261
Query: 263 PLIK--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
K Y+LY D+ WR+QAV +SPD FESRK LP WRG+RDD LS+ +GIPGC+FV
Sbjct: 262 DTEKPFYILYPDETASNWRIQAVPISPDSFESRKALPQAWRGVRDDALSQLSGIPGCIFV 321
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H SGFIGGN + G +AMA AAL+
Sbjct: 322 HASGFIGGNATKEGVMAMAIAALQ 345
>gi|302695191|ref|XP_003037274.1| hypothetical protein SCHCODRAFT_64424 [Schizophyllum commune H4-8]
gi|300110971|gb|EFJ02372.1| hypothetical protein SCHCODRAFT_64424 [Schizophyllum commune H4-8]
Length = 338
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/324 (56%), Positives = 229/324 (70%), Gaps = 2/324 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL +M+R T + +A + RSRDP VLD D V+DVGG+YD +
Sbjct: 16 KVIGTHNGTFHCDEALAVYMLRQTATYKDANLKRSRDPAVLDTCDIVVDVGGIYDEAKQR 75
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF EVFGHGFSTKLSSAGLVYKHFGKE+IA + P VH L+L +Y+ F+
Sbjct: 76 FDHHQRGFTEVFGHGFSTKLSSAGLVYKHFGKEIIASRTQLSIDDPKVHTLWLKLYEGFI 135
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
E+IDAIDNGI+QYDTD PRY T+LSSRVG +N W E + F++ + G
Sbjct: 136 ESIDAIDNGISQYDTDVKPRYKIRTDLSSRVGSVNPSWNESVDPVHV-DALFEKASAMTG 194
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAE-RYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+F D +R Y SWLPAR IVVE + + + + DP+G I+VL +F PWK HLFELE E
Sbjct: 195 VDFSDKLRGYANSWLPARDIVVESVKKSKAEVDPTGRIIVLSQFAPWKEHLFELEAEQGT 254
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ YVLYEDD WR+ AV VS DRF RK LP WRG+RD ELSK +GI GC+FVH
Sbjct: 255 TGISLYVLYEDDTSHDWRISAVPVSLDRFGDRKSLPEAWRGVRDQELSKISGIGGCIFVH 314
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
SGF GGN++Y GALAMA+AAL++
Sbjct: 315 ASGFTGGNKTYDGALAMAKAALEI 338
>gi|303271417|ref|XP_003055070.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463044|gb|EEH60322.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 330
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/331 (54%), Positives = 236/331 (71%), Gaps = 3/331 (0%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGG 74
S+ P K +GTH+GSFHCDEALGC ++RLT + +A IVR+RD +L + D V+DVG
Sbjct: 2 STKRMAPTK-IGTHDGSFHCDEALGCHLLRLTKAYADATIVRTRDQDLLSECDIVIDVGA 60
Query: 75 VYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
Y+P +DHHQKGFEEVFGHG +TKLSSAGLVYKHFG+E++A L D P V +++
Sbjct: 61 TYEPEKFRFDHHQKGFEEVFGHGHTTKLSSAGLVYKHFGREIVANVLKKDAADPIVEKIY 120
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
L VYK+F+E +D +DNG+ +++DKP YV+NT LS+RVGK N +W EP + E + F
Sbjct: 121 LKVYKSFVECVDGVDNGVQIFESDKPALYVDNTGLSARVGKFNANWNEP-FTDETQMAGF 179
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
++ M LAG EF D VR+Y W PAR+ V++ + SGEI+ L+ FCPWK HLFE
Sbjct: 180 EKAMALAGGEFEDAVRYYGEVWHPARAHVIDALDAASATHASGEILKLETFCPWKEHLFE 239
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
LEEE + L KYVLY+D +G WRVQA+ +P F SR+ LP WRG+RDD LS+ +GI
Sbjct: 240 LEEERGMTTLPKYVLYDDGKG-SWRVQAIPETPSSFNSRRGLPKAWRGVRDDALSELSGI 298
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GC+FVH +GFIGGN++Y GALAMA AL+L
Sbjct: 299 EGCIFVHAAGFIGGNKTYDGALAMASKALEL 329
>gi|389751382|gb|EIM92455.1| metal-dependent protein hydrolase [Stereum hirsutum FP-91666 SS1]
Length = 331
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/332 (56%), Positives = 234/332 (70%), Gaps = 10/332 (3%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
TP K +GTHNG+FHCDEAL F++RLT + +A++ R+RDPKVLD D V+DVG VYD S
Sbjct: 4 TP-KIIGTHNGTFHCDEALAVFLLRLTGTYRDAELKRTRDPKVLDTCDIVVDVGAVYDES 62
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
+DHHQ+GFEEVFG+GF TKLSSAGLVYKHFG+E+IA L V P V L+L +YK
Sbjct: 63 KQRFDHHQRGFEEVFGYGFVTKLSSAGLVYKHFGREIIANRLQVALDDPKVETLWLKLYK 122
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+EAID +DNG+ QY D P+Y N T+LSSRV LN W E SA +E F +
Sbjct: 123 EFIEAIDGVDNGVTQYPKDVSPKYRNRTDLSSRVSWLNPAWNESVDSASV-DERFAKASK 181
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
L G+EFL + +Y ++WLPAR +VV+ + ER Y PSG+I+V ++F PWK HLFELE E
Sbjct: 182 LTGEEFLGRLDYYGKAWLPARDLVVKALNERTQYHPSGKIIVFEQFAPWKEHLFELEAEQ 241
Query: 260 KI------EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
I +P+ Y+LY D+ WR+QAV VSP+ FESRK LP WRG+RDDELSK G
Sbjct: 242 SITSSTPSQPI--YILYPDETAGNWRIQAVPVSPESFESRKALPEVWRGVRDDELSKVTG 299
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
I G +FVH SGFIGGNQ+ GA+ MA AL++
Sbjct: 300 IEGGIFVHASGFIGGNQTREGAMEMATKALEM 331
>gi|302848163|ref|XP_002955614.1| hypothetical protein VOLCADRAFT_83324 [Volvox carteri f.
nagariensis]
gi|300259023|gb|EFJ43254.1| hypothetical protein VOLCADRAFT_83324 [Volvox carteri f.
nagariensis]
Length = 350
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/343 (53%), Positives = 240/343 (69%), Gaps = 4/343 (1%)
Query: 6 VSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD 65
+ PA + + +K++GTH+GSFHCDEALGC+++R T F A+IVR+RDP+VL D
Sbjct: 8 IPGGPAAQITGNKRPMVKKIGTHSGSFHCDEALGCWLLRRTTLFQEAEIVRTRDPEVLKD 67
Query: 66 LDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGF-STKLSSAGLVYKHFGKELIAKELNVD 124
LD V+DVGGVYDP +DHHQ+GFE FG GF +TKLSSAGLVYKHFG+E+IA L+
Sbjct: 68 LDVVIDVGGVYDPDRMRFDHHQRGFETKFGFGFEATKLSSAGLVYKHFGREIIAGLLSWP 127
Query: 125 EGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDT--DKPPRYVNNTNLSSRVGKLNLDWTE 182
HPD+ ++L VY+NF+E++DAIDNG+ QY+ PRY+NNT LS RVGKLN W E
Sbjct: 128 IDHPDLEMVYLEVYRNFIESVDAIDNGVMQYNLPPGTVPRYLNNTTLSCRVGKLNPRWNE 187
Query: 183 PDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL 242
P A E F + ++L G EF + V + R WLP R V +AER+ PSG+I++L
Sbjct: 188 PSDDATL-YERFLRAVELTGSEFSEAVDWVARGWLPGRRPVEAALAERHSVHPSGKILLL 246
Query: 243 KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRG 302
K FCPWK HL+ LEEE + Y +++D+R K +RVQAV+V P FE+R+ LP WRG
Sbjct: 247 KEFCPWKDHLYNLEEEGGFVGELLYAIFQDERDKTYRVQAVSVGPGSFENRRSLPCAWRG 306
Query: 303 LRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
LRD+ LS +GIP CVFVH GFIGGN++ GAL MA+ +L+L
Sbjct: 307 LRDEALSALSGIPDCVFVHAGGFIGGNKTLEGALEMAKRSLEL 349
>gi|402223017|gb|EJU03082.1| GAMM1 protein [Dacryopinax sp. DJM-731 SS1]
Length = 328
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 180/325 (55%), Positives = 223/325 (68%), Gaps = 3/325 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G+FHCDEAL +++R + NA + R+R+P LD D V+DVG VYDPS
Sbjct: 5 KAIGTHDGTFHCDEALAVYLLRQLPTYANASLTRTREPAKLDSCDIVVDVGSVYDPSKQR 64
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQ+GF EVFGHG+ TKLSSAGLVYKHFGKEL+A + P V L+L +YK F+
Sbjct: 65 YDHHQRGFTEVFGHGYVTKLSSAGLVYKHFGKELVALRIKKPVEDPIVELLWLKLYKEFI 124
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+DA DNGINQY TD RY NT+LSSRV LN W EP +++ + F + +L G
Sbjct: 125 EAVDANDNGINQYPTDIEARYRLNTSLSSRVSYLNPAWNEP-ATSDISDAQFLKASELTG 183
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
KEF + + +Y ++WLPAR I+ + R D DPSG I+V + FCPWK HLFELE EM I
Sbjct: 184 KEFYERLDYYAKAWLPARDIIERAMESRKDVDPSGRILVFETFCPWKEHLFELENEMGIP 243
Query: 263 PL--IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
YVLY D WR+QAV VSP+ F SRK LP WRG+RDD+LS +GIPGC FV
Sbjct: 244 EAENTLYVLYPDTSSHTWRIQAVGVSPNSFASRKALPEPWRGVRDDDLSTLSGIPGCTFV 303
Query: 321 HMSGFIGGNQSYGGALAMARAALKL 345
H SGFIGGN++ GA+ MAR ALK
Sbjct: 304 HASGFIGGNKTKEGAMEMARQALKF 328
>gi|390604328|gb|EIN13719.1| metal-dependent protein hydrolase [Punctularia strigosozonata
HHB-11173 SS5]
Length = 351
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/314 (57%), Positives = 221/314 (70%), Gaps = 4/314 (1%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL FM+R T+ F N+ + RSRDP +L+ D V+DVGGVYD S
Sbjct: 29 KIIGTHNGTFHCDEALAVFMLRQTNDFRNSDVKRSRDPAILNACDIVVDVGGVYDDSKRR 88
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQ+GF EVFGHGF TKLSSAGLVYKHFGKE+IA L + P V L+L VYK F+
Sbjct: 89 YDHHQRGFTEVFGHGFETKLSSAGLVYKHFGKEIIADRLQSPQDDPRVETLWLKVYKEFI 148
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EA+D IDNG++QY + P+Y T+LSSRVG LN W E S + F++ L G
Sbjct: 149 EAVDGIDNGVSQYPAELKPKYRVRTDLSSRVGWLNPAWNEASDSQSYDTR-FERASSLTG 207
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+EFL + +Y +WLPARS+V +A R DPSG +++ + F PWK HLFELEEE +
Sbjct: 208 QEFLSRLDYYANAWLPARSLVAAALAARTSTDPSGNLIIFESFLPWKEHLFELEEEQSVA 267
Query: 263 PLIK--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
P + YV+Y D+ GK WRVQAV VSPD FESRK LP WRGLRD+ELSK +G+ GC+FV
Sbjct: 268 PDAQPIYVIYPDESGK-WRVQAVPVSPDSFESRKALPQAWRGLRDEELSKASGVEGCIFV 326
Query: 321 HMSGFIGGNQSYGG 334
H SGFIGG Q+ G
Sbjct: 327 HASGFIGGAQTQEG 340
>gi|392571518|gb|EIW64690.1| GAMM1 protein [Trametes versicolor FP-101664 SS1]
Length = 339
Score = 369 bits (948), Expect = e-99, Method: Compositional matrix adjust.
Identities = 172/325 (52%), Positives = 230/325 (70%), Gaps = 3/325 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL ++++ T + NA + R+RDP VL+ D V+DVGGVYD
Sbjct: 16 KVIGTHNGTFHCDEALAVYLLKHTKDYGNADLKRTRDPAVLNTCDIVVDVGGVYDEKTQR 75
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF+E+FGHGFSTKLSSAGLVYKHFGK++IA L + E P V L+L +YK ++
Sbjct: 76 FDHHQRGFDEIFGHGFSTKLSSAGLVYKHFGKQIIANRLQLSEDEPKVETLWLKLYKEYI 135
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EAIDAIDNGI+QY D PRY T++S+RVG LN W + A+ + FQQ L G
Sbjct: 136 EAIDAIDNGISQYPADLKPRYRLRTDVSARVGYLNPAWNQ-SVDAQTVDANFQQASALVG 194
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF++ + +Y +WLPAR +V+ + +R + DPSG I++ ++F PWK HLFELEE + I
Sbjct: 195 SEFMNRLDYYANAWLPARDLVIAALNKRKEIDPSGHIILFEQFAPWKEHLFELEESLAIP 254
Query: 263 PLIK--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
K YVLY D+ WR+QAV VSP+ FESRK LP +WRG+RDD LS+ +G+ GC+F+
Sbjct: 255 EAEKPIYVLYPDETANNWRIQAVPVSPESFESRKALPEKWRGIRDDALSQLSGVDGCIFI 314
Query: 321 HMSGFIGGNQSYGGALAMARAALKL 345
H SGFIGGN++ G L +A AL +
Sbjct: 315 HASGFIGGNKTKEGVLKLANLALNI 339
>gi|395334300|gb|EJF66676.1| GAMM1 protein [Dichomitus squalens LYAD-421 SS1]
Length = 340
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 175/325 (53%), Positives = 230/325 (70%), Gaps = 3/325 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL F++R T+ + +A + RSRDP +LD D V+DVGGVYD
Sbjct: 17 KAIGTHNGTFHCDEALAVFLLRKTNAYADAGLKRSRDPAILDTCDIVVDVGGVYDEKIQR 76
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF EVFGHGF TKLSSAGLVYKHFGKE+IA L + P + L+L +Y+ ++
Sbjct: 77 FDHHQRGFTEVFGHGFDTKLSSAGLVYKHFGKEIIANTLQLPLQDPKIETLWLKLYREYI 136
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EAIDAIDNGINQY T+ P+Y T++S+RVG LN W + SA + + F Q L G
Sbjct: 137 EAIDAIDNGINQYPTELKPKYRIRTDISARVGYLNPAWNQSTDSATVDAK-FLQASALVG 195
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EFLD + +Y +WLPAR +VV + +R + +G I++ ++F PWK HLFELEE ++I
Sbjct: 196 NEFLDRLGYYADAWLPARDLVVAALEKRKQVEQNGRIILFEQFAPWKEHLFELEEALQIP 255
Query: 263 PLIK--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
K YVLY D+ G WR+QAV V+P+ FESRK LP QWRG+RDD+LSK +G+ GC+FV
Sbjct: 256 DAEKPIYVLYPDETGGNWRIQAVPVAPESFESRKALPEQWRGVRDDDLSKLSGVEGCIFV 315
Query: 321 HMSGFIGGNQSYGGALAMARAALKL 345
H SGFIGGN++ G L +A AL +
Sbjct: 316 HASGFIGGNKTKEGVLRLANLALDM 340
>gi|409051547|gb|EKM61023.1| hypothetical protein PHACADRAFT_247325 [Phanerochaete carnosa
HHB-10118-sp]
Length = 326
Score = 363 bits (933), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 176/325 (54%), Positives = 230/325 (70%), Gaps = 3/325 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL +++RLT ++ A + R+RDP +LD V+DVGG+YD +
Sbjct: 3 KMIGTHNGTFHCDEALAVYLLRLTQEYAGADLKRTRDPAILDTCTIVVDVGGIYDEARQR 62
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF EVFGHGF+TKLSSAGLVYKHFGKE+IA LN+ P + L+L +YK F+
Sbjct: 63 FDHHQRGFAEVFGHGFNTKLSSAGLVYKHFGKEIIANRLNMTLDDPRIEILWLKLYKEFI 122
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
+AIDAIDNGI+QY D PP+Y T++S+RVG LN W +P A+ + F + L G
Sbjct: 123 DAIDAIDNGISQYPADMPPQYRVRTDISARVGHLNPAWNQP-ADAKTVDTLFLKASALVG 181
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF + +Y +WLPAR IV + +R + DPSG+I++ + PWK HLFELE ++ I
Sbjct: 182 GEFAGRLEYYANAWLPARDIVAAGLKDRGNVDPSGKIILFDAYAPWKEHLFELECDLDIA 241
Query: 263 PLIK--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
K YV+Y D+ QWRVQAV V+P+ FESRK LP WRGLRDDELSK +GI CVFV
Sbjct: 242 DHEKPIYVIYPDETSDQWRVQAVPVAPESFESRKALPEIWRGLRDDELSKVSGIDSCVFV 301
Query: 321 HMSGFIGGNQSYGGALAMARAALKL 345
H SGFIGGN++ GAL +A+ AL+L
Sbjct: 302 HASGFIGGNKTKDGALKLAKFALEL 326
>gi|336369744|gb|EGN98085.1| hypothetical protein SERLA73DRAFT_169152 [Serpula lacrymans var.
lacrymans S7.3]
Length = 391
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 179/356 (50%), Positives = 236/356 (66%), Gaps = 12/356 (3%)
Query: 1 MAGTGVSSSPAYSTSSPSQTPLKR---------VGTHNGSFHCDEALGCFMIRLTDKFFN 51
+A ++S +Y TS + P KR +GTHNG+FHCDEAL F++R T + +
Sbjct: 36 IACNSPAASKSYMTSVSEEQPTKRQKMYQKPKIIGTHNGTFHCDEALAVFLLRQTTAYQD 95
Query: 52 AQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKH 111
A + RSRDP VL D V+DVG VYD +DHHQ+GF EVFGHGFSTKLSSAGL+YKH
Sbjct: 96 AGLKRSRDPAVLGTCDIVVDVGAVYDEGKQLFDHHQRGFTEVFGHGFSTKLSSAGLIYKH 155
Query: 112 FGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSS 171
FGKE+IA + + V L+L +Y+ F+EAIDAIDNGI QY D +Y + T+LSS
Sbjct: 156 FGKEIIAHKAQLALNDEKVTILWLKMYREFIEAIDAIDNGIPQYPNDIAAKYKSRTDLSS 215
Query: 172 RVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERY 231
RVG LN W E +++ + F + L G+EF+ + +Y +WLPARS+VV +++R
Sbjct: 216 RVGWLNPAWNE-SVASDIVDAQFSKASQLTGEEFMGRLGYYADAWLPARSLVVAALSQRK 274
Query: 232 DYDPSGEIMVLKRFCPWKLHLFELEEEMKI--EPLIKYVLYEDDRGKQWRVQAVAVSPDR 289
D D S +I++ ++F PWK HLFELE E I E YV+Y D+ WR+QAV VSP+
Sbjct: 275 DVDNSAKILLFEQFAPWKEHLFELESEEGIAEESQPVYVIYPDETAGNWRIQAVPVSPES 334
Query: 290 FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
FESRK LP WRG RDDELS+ +G+PG +FVH SGFIGGN + GA MA+ AL++
Sbjct: 335 FESRKALPEAWRGFRDDELSRISGVPGGIFVHASGFIGGNHTKEGAFTMAKLALRI 390
>gi|388519995|gb|AFK48059.1| unknown [Lotus japonicus]
Length = 204
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 165/204 (80%), Positives = 187/204 (91%)
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
MEAIDA+DNGINQ+DTD+PP+YVNNT+LSSRVG+LNLDWT+PDQS E+ENEAFQ+ M LA
Sbjct: 1 MEAIDAVDNGINQFDTDQPPKYVNNTHLSSRVGRLNLDWTDPDQSPEKENEAFQRAMALA 60
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G EFLD+VRF+V SWLPARSIV+E IA R DPSGEI+VL +FCPWKLHLFELE E+KI
Sbjct: 61 GSEFLDSVRFHVNSWLPARSIVMETIAARQSIDPSGEIIVLNKFCPWKLHLFELEGELKI 120
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+P IKYVLYED+R KQWRVQAV VSPDRFESRK LP+QWRGLRDD+LSKE+GIPGCVFVH
Sbjct: 121 DPPIKYVLYEDERSKQWRVQAVGVSPDRFESRKALPSQWRGLRDDDLSKESGIPGCVFVH 180
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
MSGFIGGNQS+ GALAMARAALK+
Sbjct: 181 MSGFIGGNQSFDGALAMARAALKM 204
>gi|159488921|ref|XP_001702449.1| MYG1/GAMM1-like protein [Chlamydomonas reinhardtii]
gi|158271117|gb|EDO96944.1| MYG1/GAMM1-like protein [Chlamydomonas reinhardtii]
Length = 324
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 171/327 (52%), Positives = 231/327 (70%), Gaps = 7/327 (2%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++GTH+G+FHCDEALGC+++R T +F +A+IVR+RDP+VL DLD V+DVGG YD
Sbjct: 1 MVKIGTHSGTFHCDEALGCWLLRQTPRFKDAEIVRTRDPEVLKDLDVVIDVGGEYDFPRL 60
Query: 82 CYDHHQKGFEEVFGHGF-STKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
DHHQ+ F E FGHGF +T+LSSAGLVY+HFG+E++A L HP++ ++L VY+N
Sbjct: 61 RLDHHQREFTEKFGHGFETTRLSSAGLVYRHFGREVLAGLLGWPLDHPELETVYLQVYRN 120
Query: 141 FMEAIDAIDNGINQYD--TDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
F+EA+DA+DNG+ QY + P YVNNT L +RV LN W +P A +QQ +
Sbjct: 121 FIEAVDAVDNGVGQYTVPSGTAPLYVNNTTLGARVASLNPRWNQPSDDAT----LYQQAV 176
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+L G+EF + V + R WLP R V E +A R+ PSG+I++L +CPWK HL++LE E
Sbjct: 177 ELTGREFSEAVDWVARGWLPGRRPVAESLAARHSVHPSGKIILLLEYCPWKDHLYDLERE 236
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+ Y +Y+DDR K +RVQ V+V P FE+R+ LPA WRG+RDD LS+ +GIPGCV
Sbjct: 237 SGCVGEVLYCVYQDDRDKSYRVQCVSVGPGSFENRRSLPAAWRGVRDDALSELSGIPGCV 296
Query: 319 FVHMSGFIGGNQSYGGALAMARAALKL 345
FVH GFIGGN++ GALAMA AL+L
Sbjct: 297 FVHAGGFIGGNKTKEGALAMAVKALEL 323
>gi|336382521|gb|EGO23671.1| hypothetical protein SERLADRAFT_370529 [Serpula lacrymans var.
lacrymans S7.9]
Length = 330
Score = 354 bits (908), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 174/329 (52%), Positives = 226/329 (68%), Gaps = 4/329 (1%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P K +GTHNG+FHCDEAL F++R T + +A + RSRDP VL D V+DVG VYD
Sbjct: 3 QKP-KIIGTHNGTFHCDEALAVFLLRQTTAYQDAGLKRSRDPAVLGTCDIVVDVGAVYDE 61
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
+DHHQ+GF EVFGHGFSTKLSSAGL+YKHFGKE+IA + + V L+L +Y
Sbjct: 62 GKQLFDHHQRGFTEVFGHGFSTKLSSAGLIYKHFGKEIIAHKAQLALNDEKVTILWLKMY 121
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+ F+EAIDAIDNGI QY D +Y + T+LSSRVG LN W E +++ + F +
Sbjct: 122 REFIEAIDAIDNGIPQYPNDIAAKYKSRTDLSSRVGWLNPAWNE-SVASDIVDAQFSKAS 180
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
L G+EF+ + +Y +WLPARS+VV +++R D D S +I++ ++F PWK HLFELE E
Sbjct: 181 QLTGEEFMGRLGYYADAWLPARSLVVAALSQRKDVDNSAKILLFEQFAPWKEHLFELESE 240
Query: 259 MKI--EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
I E YV+Y D+ WR+QAV VSP+ FESRK LP WRG RDDELS+ +G+PG
Sbjct: 241 EGIAEESQPVYVIYPDETAGNWRIQAVPVSPESFESRKALPEAWRGFRDDELSRISGVPG 300
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
+FVH SGFIGGN + GA MA+ AL++
Sbjct: 301 GIFVHASGFIGGNHTKEGAFTMAKLALRI 329
>gi|76779908|gb|AAI06266.1| LOC443610 protein, partial [Xenopus laevis]
Length = 371
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 175/332 (52%), Positives = 228/332 (68%), Gaps = 11/332 (3%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q L ++GTHNG+FHCDEAL C+ +R D + +A+I+R+RDP++L D V+DVGG YDP
Sbjct: 44 QAMLPQIGTHNGTFHCDEALACYFLRTLDAYRDAEIIRTRDPQLLAQCDVVVDVGGEYDP 103
Query: 79 SNDCYDHHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
S YDHHQ+ F E + TKLSSAGLVY HFG +++A L +E P + L
Sbjct: 104 SRHRYDHHQRSFCETMHSLYPDKPWVTKLSSAGLVYVHFGSQILATLLGTEEEDPIISVL 163
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ +Y+NF+E IDAIDNGI+Q+D D+ RY T LS+RVG LN W E DQ E+
Sbjct: 164 YDKMYENFVEEIDAIDNGISQFDGDQ--RYSVTTTLSARVGHLNPRWNESDQDTEQ--AG 219
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHL 252
F++ M+LAG EF+ + FY RSWLPAR++V E I ER D SGE++VL + CPWK HL
Sbjct: 220 FKKAMELAGSEFVSRLDFYHRSWLPARALVEEAIRERIQADESGEVIVLAQGGCPWKEHL 279
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
F+LE+E+ +E +KYVLY D K WRVQ V P+ F++R LP WRGLR D+LS +
Sbjct: 280 FQLEKELGLEKQVKYVLYPDQSAK-WRVQCVPTGPNTFQNRLSLPEDWRGLRADDLSSIS 338
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GIPGC+FVH SGFIGGN++ GAL MAR AL+
Sbjct: 339 GIPGCIFVHTSGFIGGNETQEGALEMARKALR 370
>gi|49117035|gb|AAH73028.1| LOC443610 protein, partial [Xenopus laevis]
Length = 367
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 175/332 (52%), Positives = 228/332 (68%), Gaps = 11/332 (3%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q L ++GTHNG+FHCDEAL C+ +R D + +A+I+R+RDP++L D V+DVGG YDP
Sbjct: 40 QAMLPQIGTHNGTFHCDEALACYFLRTLDAYRDAEIIRTRDPQLLAQCDVVVDVGGEYDP 99
Query: 79 SNDCYDHHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
S YDHHQ+ F E + TKLSSAGLVY HFG +++A L +E P + L
Sbjct: 100 SRHRYDHHQRSFCETMHSLYPDKPWVTKLSSAGLVYVHFGSQILATLLGTEEEDPIISVL 159
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ +Y+NF+E IDAIDNGI+Q+D D+ RY T LS+RVG LN W E DQ E+
Sbjct: 160 YDKMYENFVEEIDAIDNGISQFDGDQ--RYSVTTTLSARVGHLNPRWNESDQDTEQ--AG 215
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHL 252
F++ M+LAG EF+ + FY RSWLPAR++V E I ER D SGE++VL + CPWK HL
Sbjct: 216 FKKAMELAGSEFVSRLDFYHRSWLPARALVEEAIRERIQADESGEVIVLAQGGCPWKEHL 275
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
F+LE+E+ +E +KYVLY D K WRVQ V P+ F++R LP WRGLR D+LS +
Sbjct: 276 FQLEKELGLEKQVKYVLYPDQSAK-WRVQCVPTGPNTFQNRLSLPEDWRGLRADDLSSIS 334
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GIPGC+FVH SGFIGGN++ GAL MAR AL+
Sbjct: 335 GIPGCIFVHTSGFIGGNETQEGALEMARKALR 366
>gi|123959684|gb|AAI28922.1| LOC443610 protein [Xenopus laevis]
Length = 376
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 175/332 (52%), Positives = 228/332 (68%), Gaps = 11/332 (3%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q L ++GTHNG+FHCDEAL C+ +R D + +A+I+R+RDP++L D V+DVGG YDP
Sbjct: 49 QAMLPQIGTHNGTFHCDEALACYFLRTLDAYRDAEIIRTRDPQLLAQCDVVVDVGGEYDP 108
Query: 79 SNDCYDHHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
S YDHHQ+ F E + TKLSSAGLVY HFG +++A L +E P + L
Sbjct: 109 SRHRYDHHQRSFCETMHSLYPDKPWVTKLSSAGLVYVHFGSQILATLLGTEEEDPIISVL 168
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ +Y+NF+E IDAIDNGI+Q+D D+ RY T LS+RVG LN W E DQ E+
Sbjct: 169 YDKMYENFVEEIDAIDNGISQFDGDQ--RYSVTTTLSARVGHLNPRWNESDQDTEQ--AG 224
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHL 252
F++ M+LAG EF+ + FY RSWLPAR++V E I ER D SGE++VL + CPWK HL
Sbjct: 225 FKKAMELAGSEFVSRLDFYHRSWLPARALVEEAIRERIQADESGEVIVLAQGGCPWKEHL 284
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
F+LE+E+ +E +KYVLY D K WRVQ V P+ F++R LP WRGLR D+LS +
Sbjct: 285 FQLEKELGLEKQVKYVLYPDQSAK-WRVQCVPTGPNTFQNRLSLPEDWRGLRADDLSSIS 343
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GIPGC+FVH SGFIGGN++ GAL MAR AL+
Sbjct: 344 GIPGCIFVHTSGFIGGNETQEGALEMARKALR 375
>gi|409083500|gb|EKM83857.1| hypothetical protein AGABI1DRAFT_81584 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426201460|gb|EKV51383.1| hypothetical protein AGABI2DRAFT_133062 [Agaricus bisporus var.
bisporus H97]
Length = 335
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 169/326 (51%), Positives = 221/326 (67%), Gaps = 2/326 (0%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q K +GTHNG+FHCDEAL F++R T + +A + R+RD VL D V+DVG +YD
Sbjct: 9 QKSTKIIGTHNGTFHCDEALAVFLLRQTPTYAHADLKRTRDEAVLSTCDIVVDVGAIYDE 68
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
+ +DHHQ+GF E+FGHGF TKLSSAGL+YKHFGKE+IA + V L+L +Y
Sbjct: 69 AKQRFDHHQRGFTEIFGHGFDTKLSSAGLIYKHFGKEIIADRFQLPSEDQKVDLLWLKMY 128
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+ F+EAIDAIDNGI+QY D PRY + T+LSSRVG LN W P A+ + F +
Sbjct: 129 REFVEAIDAIDNGISQYPNDVEPRYRSRTDLSSRVGHLNPPWNVP-TDAQAIDTRFLEAS 187
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAE-RYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+L G EFL + +Y +WLPAR I+V+ I++ + P G I++L +F PWK HLFELE
Sbjct: 188 NLTGTEFLQRLDYYATAWLPARDILVQSISDSEHLTKPPGRIILLNQFIPWKEHLFELES 247
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
K+ YV+Y D+ WR+QA+ V+ FESRK LP +WRGLRDDELS+ +GI GC
Sbjct: 248 TGKVTKDFIYVVYPDELAGSWRIQAIPVASQSFESRKALPEEWRGLRDDELSRVSGIDGC 307
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGGN++ GAL +AR AL
Sbjct: 308 IFVHASGFIGGNKTKEGALKLARIAL 333
>gi|392597360|gb|EIW86682.1| metal-dependent protein hydrolase [Coniophora puteana RWD-64-598
SS2]
Length = 377
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 170/334 (50%), Positives = 222/334 (66%), Gaps = 20/334 (5%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
T +K +GTHNG+FHCDEAL F++RLTD + NA + R+RDP VLD D V+DVG +YD +
Sbjct: 34 THVKIIGTHNGTFHCDEALAVFLLRLTDTYSNAGVKRTRDPIVLDTCDIVVDVGAIYDEA 93
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYK-----------------HFGKELIAKELN 122
+DHHQ+GF EVFG GFSTKLSSAGL+YK HFG+E+IAK
Sbjct: 94 RQRFDHHQRGFTEVFGSGFSTKLSSAGLIYKFFGVFMFHHQPLTSLNRHFGQEIIAKHTK 153
Query: 123 VDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTE 182
P V L+L +YK F+EAIDAIDNGI Q+ T + P Y N T+LSSRVG LN W E
Sbjct: 154 APSDDPKVTALWLKMYKEFVEAIDAIDNGIAQFSTKESPTYKNRTDLSSRVGWLNPAWNE 213
Query: 183 PDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL 242
++++ + F++ L G+EF+ + +Y +WLP R IV + R D DPSG+I++
Sbjct: 214 T-VTSDQVDSLFEKASSLTGEEFIGRLSYYTDAWLPGRDIVQAAFSSRLDIDPSGKILLF 272
Query: 243 KRFCPWKLHLFELEEEMKIEPLIK--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
+ F PWK HLF+LE++ + K YV+Y D+ G WRVQAV VSPD FESRK LP W
Sbjct: 273 ETFAPWKEHLFDLEKDTNVPDDSKPFYVIYPDETGNSWRVQAVPVSPDSFESRKALPEAW 332
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGG 334
RG+RD ELS +G+ GC+F+H SGFIGGN++ G
Sbjct: 333 RGIRDAELSNISGVEGCIFIHASGFIGGNKTRNG 366
>gi|148225454|ref|NP_001088807.1| uncharacterized protein LOC496075 [Xenopus laevis]
gi|76780041|gb|AAI06661.1| LOC496075 protein [Xenopus laevis]
Length = 369
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 174/332 (52%), Positives = 227/332 (68%), Gaps = 12/332 (3%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q ++GTHNG+FHCDEAL C+ +R + + +A+IVR+RDP++L D V+DVGG YD
Sbjct: 43 QAMTPQIGTHNGTFHCDEALACYFLRTLEPYRDAEIVRTRDPQLLAQCDVVVDVGGEYDA 102
Query: 79 SNDCYDHHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
S YDHHQ+ F E + TKLSSAGLVY HFG +++A L +E P + L
Sbjct: 103 SRHRYDHHQRSFCETMNSLYPDKPWVTKLSSAGLVYAHFGSQILATLLGTEEEDPIISVL 162
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ +Y+NF+E IDAIDNGI+Q+D D+ RY T LS+RVG LN W EPDQ E
Sbjct: 163 YDKMYENFVEEIDAIDNGISQFDGDQ--RYSVTTTLSARVGHLNPRWNEPDQDTE---AG 217
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHL 252
F++ M+LAG EF+ + FY RSWLPAR++V E I ER D SGE++VL + CPWK HL
Sbjct: 218 FKKAMELAGSEFVSRLDFYNRSWLPARALVEEAIRERIQVDESGEVIVLAQGGCPWKEHL 277
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
F+LE+E+ +E +KYVLY D K WRVQ V P+ F++R LP WRGLR D+LS +
Sbjct: 278 FQLEKELGLEKQVKYVLYPDQSAK-WRVQCVPTGPNTFQNRLSLPEDWRGLRADDLSSVS 336
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GIPGC+FVH SGFIGGN+++ GAL MAR AL+
Sbjct: 337 GIPGCIFVHASGFIGGNETHEGALEMARKALR 368
>gi|327286322|ref|XP_003227879.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Anolis
carolinensis]
Length = 357
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 171/337 (50%), Positives = 232/337 (68%), Gaps = 12/337 (3%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
S P + R+GTH+G+FHCD+AL CF++RL + +A+I+R+RDP++L D D V+DV
Sbjct: 20 SMPGPKRPHPARIGTHSGTFHCDDALACFLLRLLPAYQDAEIIRTRDPQLLSDCDVVVDV 79
Query: 73 GGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH 127
GG YDP YDHHQ+ F E + TKLSSAGLVY HFG +++A L + E
Sbjct: 80 GGEYDPQRHRYDHHQRSFGESMHSLKPDKPWQTKLSSAGLVYCHFGSQILANRLGLKEQD 139
Query: 128 PDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSA 187
P +H L+ +Y+NF+E IDAIDNGI+Q+D + PRY TN+S+RVG LN W + DQ
Sbjct: 140 PILHVLYDKIYENFVEEIDAIDNGISQWDGE--PRYAMTTNVSARVGYLNPRWNDKDQDT 197
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FC 246
E FQ+ M+L GKEF+D + +Y +W+PARS+V + I +R++ DPSGEI+VL + C
Sbjct: 198 E---AGFQKAMELVGKEFMDRLDYYYHAWIPARSLVEKAIQQRFEVDPSGEILVLAQGGC 254
Query: 247 PWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
PWK HLF LE+EM ++ +K++LY D+G QWRVQ V F++R L +WRGLRDD
Sbjct: 255 PWKEHLFTLEQEMDLKKPLKFILYT-DQGGQWRVQCVPAGLHTFQNRLSLLEEWRGLRDD 313
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+LS+ +GIPGC+FVH SGFIGGN + GAL MAR L
Sbjct: 314 QLSEVSGIPGCIFVHSSGFIGGNHTKEGALEMARKTL 350
>gi|56270488|gb|AAH87491.1| LOC496075 protein, partial [Xenopus laevis]
Length = 356
Score = 349 bits (896), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 174/332 (52%), Positives = 227/332 (68%), Gaps = 12/332 (3%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q ++GTHNG+FHCDEAL C+ +R + + +A+IVR+RDP++L D V+DVGG YD
Sbjct: 30 QAMTPQIGTHNGTFHCDEALACYFLRTLEPYRDAEIVRTRDPQLLAQCDVVVDVGGEYDA 89
Query: 79 SNDCYDHHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
S YDHHQ+ F E + TKLSSAGLVY HFG +++A L +E P + L
Sbjct: 90 SRHRYDHHQRSFCETMNSLYPDKPWVTKLSSAGLVYAHFGSQILATLLGTEEEDPIISVL 149
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ +Y+NF+E IDAIDNGI+Q+D D+ RY T LS+RVG LN W EPDQ E
Sbjct: 150 YDKMYENFVEEIDAIDNGISQFDGDQ--RYSVTTTLSARVGHLNPRWNEPDQDTE---AG 204
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHL 252
F++ M+LAG EF+ + FY RSWLPAR++V E I ER D SGE++VL + CPWK HL
Sbjct: 205 FKKAMELAGSEFVSRLDFYNRSWLPARALVEEAIRERIQVDESGEVIVLAQGGCPWKEHL 264
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
F+LE+E+ +E +KYVLY D K WRVQ V P+ F++R LP WRGLR D+LS +
Sbjct: 265 FQLEKELGLEKQVKYVLYPDQSAK-WRVQCVPTGPNTFQNRLSLPEDWRGLRADDLSSVS 323
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GIPGC+FVH SGFIGGN+++ GAL MAR AL+
Sbjct: 324 GIPGCIFVHASGFIGGNETHEGALEMARKALR 355
>gi|449551067|gb|EMD42031.1| hypothetical protein CERSUDRAFT_79631 [Ceriporiopsis subvermispora
B]
Length = 345
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 179/327 (54%), Positives = 230/327 (70%), Gaps = 7/327 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL F+++ T + NA + R+RDP +LD V+DVGGVYD S
Sbjct: 22 KVIGTHNGTFHCDEALAVFLLKRTSAYANADLTRTRDPAILDTCTVVVDVGGVYDESKQR 81
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF EVFGHGF+TKLSSAGLVYKHFGKE+IA L V P V L+L +Y+ F+
Sbjct: 82 FDHHQRGFTEVFGHGFNTKLSSAGLVYKHFGKEIIASRLRVSVDDPKVETLWLKLYREFI 141
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
E+IDAIDNG+NQY D P Y T++S+RVG LN W EP SA ++ F Q L G
Sbjct: 142 ESIDAIDNGVNQYPADLKPLYRIRTDVSARVGYLNPAWNEPFDSATLDSR-FLQASSLVG 200
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI- 261
+EFL + +Y +WLPAR +V E + +R + DPSG+I++ ++F PWK HLF+LE ++ I
Sbjct: 201 EEFLGRLDYYGNAWLPARDLVAEGLKKRTEVDPSGKIIIFEQFVPWKEHLFDLEADLSIP 260
Query: 262 ---EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+PL Y+LY D+ G WRVQAV +P+ F+SRK LP WRG RDDELSK G+ G V
Sbjct: 261 EDSKPL--YILYRDESGGSWRVQAVPAAPESFQSRKALPEAWRGHRDDELSKITGVEGGV 318
Query: 319 FVHMSGFIGGNQSYGGALAMARAALKL 345
FVH SGFIGGN++ GAL MA+ AL L
Sbjct: 319 FVHASGFIGGNKTQKGALEMAKLALTL 345
>gi|89886163|ref|NP_001034831.1| uncharacterized protein LOC594885 [Xenopus (Silurana) tropicalis]
gi|89266727|emb|CAJ83877.1| melanocyte proliferating gene 1 [Xenopus (Silurana) tropicalis]
Length = 369
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 175/332 (52%), Positives = 227/332 (68%), Gaps = 12/332 (3%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q ++GTHNG+FHCDEAL CF++R + + +A+IVR+RDP++L D V+DVGG YDP
Sbjct: 43 QAMTPQIGTHNGTFHCDEALACFLLRTVEPYRDAEIVRTRDPQLLAQCDVVVDVGGEYDP 102
Query: 79 SNDCYDHHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
S YDHHQ+ F E + TKLSSAGLVY HFG +++A L +E P + L
Sbjct: 103 SRHRYDHHQRSFCETMQSLYPDKPWVTKLSSAGLVYAHFGSQILATLLGKEEEDPIISVL 162
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ +Y+NF+E IDAIDNGI+Q+D D+ RY T LS+RVG LN W EPDQ E
Sbjct: 163 YDKMYENFVEEIDAIDNGISQFDGDQ--RYSVTTTLSARVGHLNPRWNEPDQDTE---AG 217
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHL 252
F++ M+LAG EF+ + FY RSWLPAR++V E I +R D SGE++VL + CPWK HL
Sbjct: 218 FKKAMELAGSEFVSRLDFYHRSWLPARALVEEAIRKRIQVDESGEVIVLAQGGCPWKEHL 277
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
F+LE+E+ +E +KYVLY D K WRVQ V P+ F +R LP WRGLR D+LS +
Sbjct: 278 FQLEKELGLEKQVKYVLYPDQSAK-WRVQCVPTGPNTFLNRLSLPEDWRGLRADDLSSVS 336
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GIPGC+FVH SGFIGGN++ GAL MAR AL+
Sbjct: 337 GIPGCIFVHASGFIGGNETQVGALEMARKALR 368
>gi|60551354|gb|AAH91097.1| hypothetical protein LOC594885 [Xenopus (Silurana) tropicalis]
Length = 373
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 175/332 (52%), Positives = 227/332 (68%), Gaps = 12/332 (3%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q ++GTHNG+FHCDEAL CF++R + + +A+IVR+RDP++L D V+DVGG YDP
Sbjct: 47 QAMTPQIGTHNGTFHCDEALACFLLRTVEPYRDAEIVRTRDPQLLAQCDVVVDVGGEYDP 106
Query: 79 SNDCYDHHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
S YDHHQ+ F E + TKLSSAGLVY HFG +++A L +E P + L
Sbjct: 107 SRHRYDHHQRSFCETMQSLYPDKPWVTKLSSAGLVYAHFGSQILATLLGKEEEDPIISVL 166
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ +Y+NF+E IDAIDNGI+Q+D D+ RY T LS+RVG LN W EPDQ E
Sbjct: 167 YDKMYENFVEEIDAIDNGISQFDGDQ--RYSVTTTLSARVGHLNPRWNEPDQDTE---AG 221
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHL 252
F++ M+LAG EF+ + FY RSWLPAR++V E I +R D SGE++VL + CPWK HL
Sbjct: 222 FKKAMELAGSEFVSRLDFYHRSWLPARALVEEAIRKRIQVDESGEVIVLAQGGCPWKEHL 281
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
F+LE+E+ +E +KYVLY D K WRVQ V P+ F +R LP WRGLR D+LS +
Sbjct: 282 FQLEKELGLEKQVKYVLYPDQSAK-WRVQCVPTGPNTFLNRLSLPEDWRGLRADDLSSVS 340
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GIPGC+FVH SGFIGGN++ GAL MAR AL+
Sbjct: 341 GIPGCIFVHASGFIGGNETQVGALEMARKALR 372
>gi|452824528|gb|EME31530.1| hypothetical protein Gasu_12050 [Galdieria sulphuraria]
Length = 389
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 178/326 (54%), Positives = 226/326 (69%), Gaps = 9/326 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KR+GTHNG FHCDEAL CF++RLTD F +A +VR+R+ VL +D V+DVGGVYDP+
Sbjct: 66 KRIGTHNGKFHCDEALACFLLRLTDNFRDASVVRTREENVLSQMDCVVDVGGVYDPTIFR 125
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL-NVDEGHPDVHRLFLAVYKNF 141
+DHHQ+GF E F S LSSAGLVYKHFG+E++ K L ++D+ + + LF VY +F
Sbjct: 126 FDHHQRGFFETFSAKTSIPLSSAGLVYKHFGREILTKLLPDMDDTNRSI--LFEHVYNSF 183
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+E +DAIDNG+N YDTD P+Y +T+LS+RVG LN W + D E AFQ+ M LA
Sbjct: 184 VEELDAIDNGVNAYDTDVAPKYSIHTHLSARVGSLNPAWNDSDPD---EETAFQKAMKLA 240
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G EF++ V+ WLPAR+IV E R D S EI+VL+ CPWK HL+ELEE +
Sbjct: 241 GDEFIEFVKKSANIWLPARNIVKEAFDRRASIDDSLEIVVLETACPWKSHLYELEEVAQN 300
Query: 262 EPL--IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
+ IKYVLY G W VQAV++S F+SRKPLP+ WRGLRDD LS GIP CVF
Sbjct: 301 DEAKSIKYVLYPRKDG-SWSVQAVSLSEQSFKSRKPLPSNWRGLRDDALSSVVGIPDCVF 359
Query: 320 VHMSGFIGGNQSYGGALAMARAALKL 345
VH +GFIG +++Y GAL MAR ALK+
Sbjct: 360 VHATGFIGVHKNYKGALTMARKALKI 385
>gi|356518655|ref|XP_003527994.1| PREDICTED: LOW QUALITY PROTEIN: UPF0160 protein MYG1,
mitochondrial-like [Glycine max]
Length = 301
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 176/318 (55%), Positives = 218/318 (68%), Gaps = 30/318 (9%)
Query: 23 KRVGTH-NGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+RV H NG+FHCDEAL CFM+RL+ +F A IVR+RDP +L LDAV+DVGG YDP
Sbjct: 11 RRVWAHTNGTFHCDEALACFMLRLSKRFTGAHIVRTRDPNLLQSLDAVVDVGGAYDPX-- 68
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQKGF +VFGHG TKLSSA L +HFG ++IA L +DE HPDVH+L+ +Y+N
Sbjct: 69 -FDHHQKGFHQVFGHGLHTKLSSACL-QRHFGLKIIANVLKLDEDHPDVHQLYPTIYRNV 126
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+EAIDA+DNG+NQYD D PP+Y NT LSSR+ +LNL W + DQS++RENEAF + LA
Sbjct: 127 VEAIDAVDNGVNQYDLDVPPKYEINTILSSRIKRLNLSWMDSDQSSDRENEAFLHAIVLA 186
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV-LKRFCPWKLHLFELEEEMK 260
G EFL+ V +Y +SWLPARS V+E +A R D SGEI+V L R CP
Sbjct: 187 GAEFLENVNYYAKSWLPARSTVMESLAARGSADSSGEIIVKLNRSCP------------- 233
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
DDR + R+QAVA+SP +FESRKPLP WRGL +D+L + AGIPGC FV
Sbjct: 234 -----------DDRSENXRLQAVAISPAKFESRKPLPNLWRGLENDKLYEVAGIPGCTFV 282
Query: 321 HMSGFIGGNQSYGGALAM 338
MSGFI S+ AM
Sbjct: 283 RMSGFIFPPHSFVVWRAM 300
>gi|412992494|emb|CCO18474.1| MYG1 protein [Bathycoccus prasinos]
Length = 415
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 170/346 (49%), Positives = 240/346 (69%), Gaps = 10/346 (2%)
Query: 4 TGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL 63
T +SS + + + ++ +GTH+GSFHCDEALGC++++ T++F N +IVR+RD L
Sbjct: 72 TAATSSKRHQSMTGTEDT---IGTHDGSFHCDEALGCYLLQNTEQFSNCRIVRTRDADAL 128
Query: 64 DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNV 123
VLDVG YD + +DHHQKGF E F + F TKLSSAGLVYKHFGKE+++K++
Sbjct: 129 AKCAVVLDVGAEYDVAKLRFDHHQKGFSETF-NDFKTKLSSAGLVYKHFGKEIVSKKIEK 187
Query: 124 DEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDT--DKPPRYVNNTNLSSRVGKLNLDWT 181
P ++L+L +YK+F+EA+D +DNG++QYD P Y NNTNLSSRVGKLN +W
Sbjct: 188 SVEDPMTNQLYLKMYKSFIEAVDGVDNGVSQYDILEGAEPNYENNTNLSSRVGKLNPNWN 247
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
EP +AER+ E F + + LAGKEF + + + WLP R V E IA+R + D SG+I+
Sbjct: 248 EP-FTAERQMEQFLKAVALAGKEFDEELNYLANVWLPGRKPVEEAIAKREEVDTSGKIIK 306
Query: 242 LKRFCPWKLHLFELEEEMKI---EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
L+++CPWK HL+E+E+E + E +++VL+ED+ K+WRV V ++ F+ RK L
Sbjct: 307 LEQYCPWKDHLYEVEKESGLDSEEKRVEFVLFEDESAKKWRVATVPIAAGSFDKRKGLKK 366
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+W GLRD +LS+++GIPGCVFVH FIGGN +Y GAL MAR +L+
Sbjct: 367 EWAGLRDQDLSEKSGIPGCVFVHNGLFIGGNDTYEGALEMARVSLE 412
>gi|145351144|ref|XP_001419945.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580178|gb|ABO98238.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 337
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 223/326 (68%), Gaps = 8/326 (2%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+G+FHCDEALGC ++R T F A I RSRD + D V+DVG VYD YD
Sbjct: 14 IATHDGAFHCDEALGCHLLRRTRAFAGAAIDRSRDGERWAKADVVIDVGAVYDAEKRLYD 73
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN-VDEGH-PD---VHRLFLAVYK 139
HHQ+ F E FG GF TKLSSAGLVYKH+G+E++ + L G PD V ++++ +Y+
Sbjct: 74 HHQREFAETFGRGFGTKLSSAGLVYKHYGEEIVREALTRAKRGEAPDEKTVEKIYVKMYE 133
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+E +DAIDNG+N YDTD +Y +NT LS+RV +LN W EP+ S E++ E F++ +
Sbjct: 134 EFIEGVDAIDNGVNMYDTDAKAKYKDNTGLSARVKRLNPAWDEPN-SPEKQMEQFEKAVA 192
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
L G EF D + +Y WLPARS V + + SGEI+ L+ FCPWK HL+ELE E
Sbjct: 193 LTGGEFDDVLEYYASKWLPARSHVESALDKAKSVHESGEILYLETFCPWKEHLYELEAER 252
Query: 260 KIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
++ L K+VL++D +G +RV +++SP FE RK LP WRGLRDD+LSK +GIPGCVF
Sbjct: 253 QMTTLPKFVLWQDPKG--FRVSTISLSPSSFEFRKGLPTAWRGLRDDDLSKASGIPGCVF 310
Query: 320 VHMSGFIGGNQSYGGALAMARAALKL 345
+H +GFIGGN +Y GALAMA A L++
Sbjct: 311 IHAAGFIGGNATYDGALAMAVAGLQM 336
>gi|260832468|ref|XP_002611179.1| hypothetical protein BRAFLDRAFT_125216 [Branchiostoma floridae]
gi|229296550|gb|EEN67189.1| hypothetical protein BRAFLDRAFT_125216 [Branchiostoma floridae]
Length = 336
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 171/341 (50%), Positives = 228/341 (66%), Gaps = 16/341 (4%)
Query: 15 SSPSQTP-----LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAV 69
S P++ P K++GTH+G+FHCDEAL C+++R ++ + IVRSR P VL D V
Sbjct: 2 SQPNKKPREDGDWKQIGTHDGTFHCDEALACYLLRKLPRYRDYDIVRSRKPDVLAACDIV 61
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVD 124
+DVGG YDP+ YDHHQ+ F E + TKLSSAGLVY HFG+E+IA L
Sbjct: 62 VDVGGQYDPATHRYDHHQRTFNETMNSLSPEKPWVTKLSSAGLVYLHFGQEIIAHMLGDG 121
Query: 125 EGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD 184
+ ++ VY+N +E IDAIDNG++Q+D PRY+ TNLS+RVG+LN W E D
Sbjct: 122 ADKKLCNVVYNKVYQNLIEEIDAIDNGVSQFDG--TPRYLMTTNLSARVGRLNPAWNEAD 179
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
Q E F + M + G+EF D VR+Y ++W P R +V + + R+ D SG+++V+++
Sbjct: 180 QDT---TERFHKAMAMVGEEFEDRVRYYSQAWWPGRKLVRDALEARHGVDQSGQVIVMEQ 236
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
CPWK HLFELE+EM++EP IKYVLY D G WRVQ V + F+SR PLP WRG+R
Sbjct: 237 VCPWKEHLFELEQEMQVEPSIKYVLYADQNGN-WRVQCVPMETQSFQSRLPLPEDWRGVR 295
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
DDELSK +GI GCVFVH SGFIGGN++ GAL MA+ AL++
Sbjct: 296 DDELSKLSGIDGCVFVHASGFIGGNKTRQGALLMAQRALQM 336
>gi|260832394|ref|XP_002611142.1| hypothetical protein BRAFLDRAFT_88459 [Branchiostoma floridae]
gi|229296513|gb|EEN67152.1| hypothetical protein BRAFLDRAFT_88459 [Branchiostoma floridae]
Length = 342
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 167/325 (51%), Positives = 222/325 (68%), Gaps = 11/325 (3%)
Query: 26 GTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDH 85
GTH+G+FHCDEAL C+++R ++ + +IVRSR P VL D V+DVGG YDP+ YDH
Sbjct: 24 GTHDGTFHCDEALACYLLRKLPRYRDYEIVRSRKPDVLATCDIVVDVGGQYDPATHRYDH 83
Query: 86 HQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
HQ+ F E + TKLSSAGLVY HFG+E+IA L + ++ VY+N
Sbjct: 84 HQRTFNETMNSLAPEKPWVTKLSSAGLVYLHFGQEIIAHMLGDGADKKLCNVVYNKVYQN 143
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+E IDAIDNG++Q+D PRY+ TNLS+RVG+LN W E DQ E F + M +
Sbjct: 144 LIEEIDAIDNGVSQFD--GTPRYLMTTNLSARVGRLNPAWNEADQDT---TERFHKAMAM 198
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G+EF D VR+Y ++W P R +V + + R+ DPSG+++V+++ CPWK HLFELE+EM+
Sbjct: 199 VGEEFEDRVRYYSQAWWPGRKLVRDALEARHGVDPSGQVIVMEQVCPWKEHLFELEQEMQ 258
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+EP IKYVLY D G WRVQ V + F+SR PLP WRG+RDD+LSK +GI GCVFV
Sbjct: 259 VEPSIKYVLYADQNGN-WRVQCVPMETQSFQSRLPLPEDWRGVRDDDLSKLSGIDGCVFV 317
Query: 321 HMSGFIGGNQSYGGALAMARAALKL 345
H SGFIGGN++ GAL MA+ AL++
Sbjct: 318 HASGFIGGNKTRQGALLMAQRALQM 342
>gi|170085173|ref|XP_001873810.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651362|gb|EDR15602.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 326
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 171/326 (52%), Positives = 225/326 (69%), Gaps = 6/326 (1%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG+FHCDEAL F++R T + +A + RSRDP VL+ D V+DVG VYD +D
Sbjct: 2 IGTHNGTFHCDEALAVFLLRQTSTYRDAGLKRSRDPAVLNTCDIVVDVGAVYDEPTQRFD 61
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+GF EVFGHGF TKLSSAGLVYKHFG ++I + + P V L+L +YK F+EA
Sbjct: 62 HHQRGFTEVFGHGFQTKLSSAGLVYKHFGLDVIVNKTGLPVEDPKVTTLWLKLYKEFIEA 121
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAIDNGI+QY TD P Y + T+LSSRVG LN W +P S + + LAG+E
Sbjct: 122 IDAIDNGISQYPTDIKPLYRSRTDLSSRVGALNPAWNQPADSETVDARHTLSSL-LAGEE 180
Query: 205 FLDTVRFYVRSWLPARSIVV-ECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK--- 260
FL + +Y +WLPAR +++ ++ + DPSG+I++ ++F PWK HLFELE +
Sbjct: 181 FLGRLNYYSNAWLPARDLLLASVLSSKSLIDPSGKIILFEKFLPWKEHLFELETDGSNPA 240
Query: 261 IEP-LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
IEP YV+Y D+ G WRVQAV VSP+ F+SR LP WRGLRD+ELSK +GI G +F
Sbjct: 241 IEPNQAIYVVYPDEFGGNWRVQAVPVSPESFDSRNALPEVWRGLRDEELSKVSGIDGGIF 300
Query: 320 VHMSGFIGGNQSYGGALAMARAALKL 345
+H SGFIGGN++ GAL +A+ AL++
Sbjct: 301 IHASGFIGGNKTREGALKIAQMALEM 326
>gi|58266024|ref|XP_570168.1| GAMM1 protein [Cryptococcus neoformans var. neoformans JEC21]
gi|57226401|gb|AAW42861.1| GAMM1 protein, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 336
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 171/341 (50%), Positives = 227/341 (66%), Gaps = 11/341 (3%)
Query: 8 SSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
SSP Y + K +GTH+G+FHCDEAL FM+RLTD++ +A +VRSRDP LD LD
Sbjct: 2 SSPVYKKIKST----KVIGTHSGTFHCDEALAVFMLRLTDEYKDADLVRSRDPAKLDPLD 57
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHG--FSTKLSSAGLVYKHFGKELIAKELNVDE 125
V+DVGGVYDP + YDHHQ+GF EVFGHG TKLSSAGL+YKHFGKE+IAK +
Sbjct: 58 IVVDVGGVYDPKTNRYDHHQRGFTEVFGHGGFNRTKLSSAGLIYKHFGKEIIAKRIGAPV 117
Query: 126 GHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQ 185
V L+L +Y +E++D IDNG+N + P Y ++LSSR+ ++N +W EP
Sbjct: 118 EDEKVEILWLQLYSELIESVDGIDNGVNI--SSSPLAYTQRSDLSSRIKRINPNWNEP-T 174
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF 245
S E ++ F+ G+EFL + ++ +WLPAR IV E + +R + PSG I+V +
Sbjct: 175 SDEIYDQKFEVASKTTGEEFLSQLDYFAFAWLPARDIVEEALKKRTEVHPSGAIVVFDKS 234
Query: 246 CPWKLHLFELEEEM-KIEPLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGL 303
CPWK HLF LE + K I Y+LY E + +WR+Q V SPD F +RK LP WRG+
Sbjct: 235 CPWKDHLFTLEPTLPKTSSPILYILYPESESSSKWRIQCVPESPDSFVNRKSLPEPWRGM 294
Query: 304 RDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RD +LS+ +GIPG VFVH SGFIGGN+++ G LAMA+ AL+
Sbjct: 295 RDSKLSEISGIPGGVFVHASGFIGGNETFEGTLAMAKKALE 335
>gi|134110754|ref|XP_775841.1| hypothetical protein CNBD2510 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258507|gb|EAL21194.1| hypothetical protein CNBD2510 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 336
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 171/341 (50%), Positives = 227/341 (66%), Gaps = 11/341 (3%)
Query: 8 SSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
SSP Y + K +GTH+G+FHCDEAL FM+RLTD++ +A +VRSRDP LD LD
Sbjct: 2 SSPVYKKIKST----KVIGTHSGTFHCDEALAVFMLRLTDEYKDADLVRSRDPAKLDPLD 57
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHG--FSTKLSSAGLVYKHFGKELIAKELNVDE 125
V+DVGGVYDP + YDHHQ+GF EVFGHG TKLSSAGL+YKHFGKE+IAK +
Sbjct: 58 IVVDVGGVYDPKTNRYDHHQRGFTEVFGHGGFNRTKLSSAGLIYKHFGKEIIAKRIGAPV 117
Query: 126 GHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQ 185
V L+L +Y +E++D IDNG+N + P Y ++LSSR+ ++N +W EP
Sbjct: 118 EDEKVEILWLQLYSELIESVDGIDNGVNI--SSSPLAYTQRSDLSSRIKRINPNWNEP-T 174
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF 245
S E ++ F+ G+EFL + ++ +WLPAR IV E + +R + PSG I+V +
Sbjct: 175 SDEIYDQKFEIASKTTGEEFLSQLDYFAFAWLPARDIVEEALKKRTEVHPSGAIVVFDKS 234
Query: 246 CPWKLHLFELEEEM-KIEPLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGL 303
CPWK HLF LE + K I Y+LY E + +WR+Q V SPD F +RK LP WRG+
Sbjct: 235 CPWKDHLFTLEPTLPKTSSPILYILYPESESSSKWRIQCVPESPDSFVNRKSLPEPWRGM 294
Query: 304 RDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RD +LS+ +GIPG VFVH SGFIGGN+++ G LAMA+ AL+
Sbjct: 295 RDSKLSEISGIPGGVFVHASGFIGGNETFEGTLAMAKKALE 335
>gi|405120112|gb|AFR94883.1| GAMM1 protein [Cryptococcus neoformans var. grubii H99]
Length = 336
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 171/341 (50%), Positives = 227/341 (66%), Gaps = 11/341 (3%)
Query: 8 SSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
SSP Y + K +GTH+G+FHCDEAL FM+RLTD++ +A +VRSRDP LD LD
Sbjct: 2 SSPVYKRIKST----KVIGTHSGTFHCDEALAVFMLRLTDEYKDADLVRSRDPAKLDPLD 57
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHG--FSTKLSSAGLVYKHFGKELIAKELNVDE 125
V+DVGGVYDP + YDHHQ+GF EVFGHG TKLSSAGL+YKHFGKE+IAK +
Sbjct: 58 IVVDVGGVYDPKTNRYDHHQRGFTEVFGHGGFNRTKLSSAGLIYKHFGKEIIAKRIGAPV 117
Query: 126 GHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQ 185
V L+L +Y +E++D IDNG+N + P Y ++LSSR+ ++N +W EP
Sbjct: 118 EDEKVEILWLQLYSELIESVDGIDNGVNI--SSSPLAYTQRSDLSSRIKRINPNWNEP-T 174
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF 245
S E ++ F+ G+EFL + ++ +WLPAR IV E + +R + PSG I+V +
Sbjct: 175 SDEIYDQKFEIASKTTGEEFLSQLDYFAFAWLPARDIVEEALKKRTEVHPSGAIVVFDKS 234
Query: 246 CPWKLHLFELEEEM-KIEPLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGL 303
CPWK HLF LE + K I Y+LY E + +WR+Q V SPD F +RK LP WRG+
Sbjct: 235 CPWKDHLFTLEPTLPKTSSPILYILYPESETSSKWRIQCVPESPDSFVNRKSLPEPWRGM 294
Query: 304 RDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RD +LS+ +GIPG VFVH SGFIGGN+++ G LAMA+ AL+
Sbjct: 295 RDSKLSEISGIPGGVFVHASGFIGGNETFEGTLAMAKKALE 335
>gi|321257920|ref|XP_003193750.1| GAMM1 protein [Cryptococcus gattii WM276]
gi|317460220|gb|ADV21963.1| GAMM1 protein, putative [Cryptococcus gattii WM276]
Length = 336
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 170/341 (49%), Positives = 227/341 (66%), Gaps = 11/341 (3%)
Query: 8 SSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
SSP Y + K +GTH+G+FHCDEAL FM+RLTD++ +A +VRSRDP LD LD
Sbjct: 2 SSPVYKKIKST----KVIGTHSGTFHCDEALAVFMLRLTDEYKDADLVRSRDPAKLDPLD 57
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHG--FSTKLSSAGLVYKHFGKELIAKELNVDE 125
V+DVGGVYDP + YDHHQ+GF EV GHG TKLSSAGL+YKHFGKE+IAK +
Sbjct: 58 IVVDVGGVYDPKTNRYDHHQRGFTEVLGHGGFNRTKLSSAGLIYKHFGKEIIAKRIGAPV 117
Query: 126 GHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQ 185
V L+L +Y +E++D IDNG+N T P Y ++LSSR+ ++N +W EP
Sbjct: 118 DDEKVEILWLQLYSELIESVDGIDNGVNISST--PLAYTQRSDLSSRIKRINPNWNEP-T 174
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF 245
S E ++ F+ + G+EFL + ++ +WLPAR IV E + +R + PSG I+V +
Sbjct: 175 SDEIYDQKFEVASKITGEEFLSQLDYFAFAWLPARDIVEEALKKRTEVHPSGAIVVFDKS 234
Query: 246 CPWKLHLFELEEEM-KIEPLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGL 303
CPWK HLF LE + + I Y+LY E + +WR+Q V SPD F +RK LP WRG+
Sbjct: 235 CPWKDHLFTLEPTLPQTSSPILYILYPESETSSKWRIQCVPESPDSFVNRKSLPEPWRGM 294
Query: 304 RDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RD +LS+ +GIPG VFVH SGFIGGN+++ GAL MA+ AL+
Sbjct: 295 RDSKLSEISGIPGGVFVHASGFIGGNETFEGALDMAKKALE 335
>gi|403415265|emb|CCM01965.1| predicted protein [Fibroporia radiculosa]
Length = 513
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 169/332 (50%), Positives = 215/332 (64%), Gaps = 23/332 (6%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL F+++ T + NA + R+R+P VL+ V+DVGGVYD
Sbjct: 23 KVIGTHNGTFHCDEALAVFLLKRTKLYGNADVTRTREPAVLESCSIVVDVGGVYDEGKQR 82
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF EVFGHGF TKLSSAGLVYKHFGKE+IA L+ V L+L VYK F+
Sbjct: 83 FDHHQRGFVEVFGHGFVTKLSSAGLVYKHFGKEIIANRLHTAPDDAKVETLWLKVYKEFI 142
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSA--------ERENEA- 193
EAID IDNG++QY D Y T++SSRVG LN W EP S +R N
Sbjct: 143 EAIDGIDNGVSQYPKDAKQLYRVRTDVSSRVGHLNPAWNEPFDSQTVDVSMFLQRSNTTK 202
Query: 194 --------FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF 245
F Q L G+EFL + +Y +WLPAR++V E + R D SG+I+V ++F
Sbjct: 203 ADVRAQARFLQASSLVGEEFLSRLDYYSNAWLPARNLVAEALKRRDVADASGKIIVFEQF 262
Query: 246 CPWKLHLFELEEEMKI----EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
PWK HLF+LE E++I +P+ YVLY D+ G WRVQAV ++PD FESRK LP WR
Sbjct: 263 LPWKEHLFDLETELQISDPSQPI--YVLYPDESGNNWRVQAVPIAPDSFESRKALPEAWR 320
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYG 333
GLRD+ELS+ +G+ G +FVH SGFIG G
Sbjct: 321 GLRDNELSQVSGVDGGIFVHASGFIGAMAELG 352
>gi|392580385|gb|EIW73512.1| hypothetical protein TREMEDRAFT_67398 [Tremella mesenterica DSM
1558]
Length = 335
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 168/335 (50%), Positives = 220/335 (65%), Gaps = 9/335 (2%)
Query: 15 SSPSQTPLKR---VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLD 71
SSP+ LK +GTH+G+FHCDEAL FM+RLTD+F +A ++R+RDP L L+ V+D
Sbjct: 2 SSPAAKRLKSTKIIGTHSGTFHCDEALAVFMLRLTDEFRDADVLRTRDPAKLAPLNIVVD 61
Query: 72 VGGVYDPSNDCYDHHQKGFEEVFGHGF-STKLSSAGLVYKHFGKELIAKELNVDEGHPDV 130
VGGVYDP YDHHQ+GF E FGHGF KLSSAGLVYKHFGK++IA L +E P V
Sbjct: 62 VGGVYDPKAHRYDHHQRGFNETFGHGFDKIKLSSAGLVYKHFGKKIIADYLGCEESDPRV 121
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE 190
++L +Y +E++D IDNG+N P Y T+LSSRVG++N DW E A+ +
Sbjct: 122 EIIWLRLYSELIESVDGIDNGVNV--ASGPLAYTVRTDLSSRVGRINPDWNEHVDDADYD 179
Query: 191 NEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKL 250
F + G+EF+ +R+Y ++WLPAR IV + R+ D SG I+V ++ PWK
Sbjct: 180 AR-FAIASKVTGEEFMSHLRYYTKAWLPARDIVKAALDTRHKVDESGSIVVFEKSAPWKD 238
Query: 251 HLFELEEEMKIEPLIKYVLY--EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
HLF LE + I YVLY D G +WR+Q V SPD F +RK +P WRG+RD EL
Sbjct: 239 HLFSLESTLSPPQQILYVLYPESDAPGSKWRIQCVPESPDSFLNRKSMPEAWRGVRDAEL 298
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
SK +G+ GCVF H SGF GGN+++ G LAMAR +L
Sbjct: 299 SKVSGVNGCVFCHASGFTGGNETFEGVLAMARKSL 333
>gi|291241232|ref|XP_002740515.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 380
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 169/328 (51%), Positives = 220/328 (67%), Gaps = 12/328 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++GTHNG+FHCDE L CF+++ ++ +A+I+R+RD LD D V+DVGGV+D S
Sbjct: 52 KKIGTHNGTFHCDEVLACFLLKQLPEYEHAEIIRTRDSCKLDTCDIVVDVGGVFDASKHR 111
Query: 83 YDHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
YDHHQ+ FEE ++TKLSSAGLVY HFG +IA+ L +F V
Sbjct: 112 YDHHQRSFEESMNSLTSEKPWTTKLSSAGLVYLHFGHSVIARTLQTLPDDKITKIIFDKV 171
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y++F+E +DAIDNGI+Q+D + PRY+ TNLS RV LN W + E E F++
Sbjct: 172 YESFIEEVDAIDNGISQWD--EKPRYLLTTNLSCRVSNLNPWWNDTKVDVE---ERFEKA 226
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELE 256
M + G EFLD V +Y + W PARS+V E I +R++ D SGEI++ ++ CPWK HLF+LE
Sbjct: 227 MSMVGAEFLDRVLYYHKCWWPARSLVEESIKKRFEVDDSGEIIIFQQGGCPWKDHLFDLE 286
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
EEMKIE IKYVLY D + WRVQ V + FE+R L +WRGLRDDELSK +GIPG
Sbjct: 287 EEMKIERPIKYVLYTD-QNDNWRVQCVPIRKTSFENRVSLLEEWRGLRDDELSKVSGIPG 345
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
C+FVH SGFIGGN++ G L MAR LK
Sbjct: 346 CIFVHASGFIGGNKNKDGVLEMARHTLK 373
>gi|353242980|emb|CCA74573.1| related to Gamm1 protein / Ni-binding urease accessory protein
(UreG) [Piriformospora indica DSM 11827]
Length = 329
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 166/324 (51%), Positives = 219/324 (67%), Gaps = 10/324 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG+FHCDEAL ++++ T + +A+++R+RD VL D V+DVG Y+P N Y
Sbjct: 13 KIGTHNGTFHCDEALAVWLLKRTGAYKDAKVIRTRDESVLATCDIVVDVGAKYEPENHRY 72
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+GF E+FGHGFSTKLSSAGLVYKHFG+E+I+ L L+L +YK F+E
Sbjct: 73 DHHQRGFTEIFGHGFSTKLSSAGLVYKHFGREIISNLLQWQVQDSRTEILWLKMYKEFIE 132
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW---TEPDQSAERENEAFQQGMDL 200
AIDAIDNGI+ Y +D P Y T++S+RVG LN W PD+ + F + +L
Sbjct: 133 AIDAIDNGISAYPSDIKPSYRMRTDVSARVGFLNPRWNVHATPDEV----DSLFVKASNL 188
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G EF + + +Y +WLPAR IV+ + + D ++V + F PWK HLF+LE+E+
Sbjct: 189 VGGEFHERLDYYGTAWLPAREIVMSALQKGSGIDE--RVLVFEDFAPWKEHLFDLEKELG 246
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
IE YVLY D+ GK WR+QAV VS + FESRK LP +WRGLRDD LS+ G+PGCVFV
Sbjct: 247 IEGRSIYVLYPDESGK-WRIQAVPVSSESFESRKALPDEWRGLRDDALSQATGVPGCVFV 305
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H SGFIGG Q+ G A+AR ALK
Sbjct: 306 HASGFIGGAQTREGVEALARLALK 329
>gi|156384260|ref|XP_001633249.1| predicted protein [Nematostella vectensis]
gi|156220316|gb|EDO41186.1| predicted protein [Nematostella vectensis]
Length = 341
Score = 330 bits (847), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 159/329 (48%), Positives = 223/329 (67%), Gaps = 13/329 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++GTHNG+FHCDE L C+M++ ++ +A I+R+RDPK+L+ D V+DV GVYD S
Sbjct: 11 KKIGTHNGTFHCDEVLACYMLKQLPEYSHADIIRTRDPKILEQCDVVVDVSGVYDASKHR 70
Query: 83 YDHHQKGFEEVF------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
YDHHQ+ F + TKLSSAGLVY HFG+ ++++ + + E H + +++
Sbjct: 71 YDHHQRSFTGCMQTLTDSQKPWKTKLSSAGLVYLHFGRRVLSQVMQMPEDHQALDKVYDK 130
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y+N ++ +DAIDNG++Q +D+ PRY+ TNLS+RVG LN W + + E + F +
Sbjct: 131 IYENLIQEVDAIDNGVSQ--SDEKPRYIITTNLSARVGNLNPKWNDKNMDEEAQ---FNK 185
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFEL 255
+ L G EF+D V +Y SWLPARS+V + R++ D SGEI L CPWK HLF+L
Sbjct: 186 ALQLVGGEFMDKVLYYKNSWLPARSLVKGAVLHRFEVDESGEIAELAECGCPWKDHLFDL 245
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
E+E+ IE IK+V+Y D+ GK WRVQ V+V P FE+R L +WRG+RDD LSK +GI
Sbjct: 246 EKELNIEVPIKFVIYSDNSGK-WRVQCVSVRPQSFENRLGLLEEWRGIRDDALSKLSGIQ 304
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAALK 344
GC+FVH +GFIGGNQ+ GAL MAR L+
Sbjct: 305 GCIFVHATGFIGGNQTREGALEMARKTLQ 333
>gi|47229885|emb|CAG07081.1| unnamed protein product [Tetraodon nigroviridis]
Length = 343
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 161/330 (48%), Positives = 220/330 (66%), Gaps = 14/330 (4%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
K++GTHNG+FHCDE L CF +R ++ +A+I+R+RDP L + D V+DVGG +DP
Sbjct: 15 FKKIGTHNGTFHCDEVLACFFLRQLPEYKDAEIIRTRDPAQLAECDIVVDVGGEFDPKRH 74
Query: 82 CYDHHQKGFEEVFGHGFS------TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
YDHHQ+ F E F H TKLSSAGLVY HFG+ L+A+ + E ++ L+
Sbjct: 75 RYDHHQRTFTETF-HSLCPEKPWVTKLSSAGLVYMHFGRRLVAELTKLKEQDRELEVLYD 133
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
+Y+NF+E +DA+DNGI+QYD + RY +T LS+RV LN W Q E E F+
Sbjct: 134 KLYENFVEEVDAVDNGISQYDGE--ARYTVSTTLSARVAHLNPRWNSKSQDTE---EGFR 188
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFE 254
Q M + G+EFLD + FY SWLPAR++V E + +R+ DPSGE+++ + CPWK HLF
Sbjct: 189 QAMQMVGEEFLDRLEFYQSSWLPARAVVEEAVKKRFQVDPSGEVLLFSQGGCPWKEHLFA 248
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
LE+E+ +E IK+VLY D G QWR+Q V + F++R PL +WRG+RD+ LS+ +GI
Sbjct: 249 LEKELNVETPIKFVLYADQNG-QWRIQCVPAGLNTFQNRLPLLEEWRGVRDEALSELSGI 307
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GC+FVH GFIGGN+S GAL MAR L+
Sbjct: 308 QGCIFVHAGGFIGGNKSVEGALEMARRTLQ 337
>gi|308808526|ref|XP_003081573.1| putative GAMM1 protein [Oryza sativa (ISS) [Ostreococcus tauri]
gi|116060038|emb|CAL56097.1| putative GAMM1 protein [Oryza sativa (ISS) [Ostreococcus tauri]
Length = 338
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 161/333 (48%), Positives = 218/333 (65%), Gaps = 6/333 (1%)
Query: 16 SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
S QT + + TH+G+FHCDEALGC ++R T F +A I RSRD D V+DVG
Sbjct: 8 SKRQTMRRTIATHDGAFHCDEALGCHLLRKTTTFRDATIDRSRDAARHALADVVIDVGAT 67
Query: 76 YDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD---VHR 132
YD YDHHQ+ F E FG GF TKLSSAGLVYKH+G+E++ L+ PD V +
Sbjct: 68 YDHEKRLYDHHQREFTETFGRGFETKLSSAGLVYKHYGEEIVRDVLSRGGKVPDTKTVDK 127
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
++L +Y+ F+E++D DNG+N YDTD +Y NT+L RV +LN W EP + E++ E
Sbjct: 128 IYLKMYEEFIESVDGNDNGVNMYDTDAKAKYKENTSLPHRVKRLNPAWDEP-FTPEKQME 186
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
F++ + L G EF D + +Y WLPAR+ V + + SGEI++L+ FCPWK HL
Sbjct: 187 QFEKAVALTGAEFDDMLEYYAHKWLPARAHVESALDKAKSVHESGEILLLETFCPWKEHL 246
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
+E+E E + PL K++L++D +G +RV + V+ FE RK LP WRG+RDD LS+ +
Sbjct: 247 YEIEAERGVTPLPKFILWQDSKG--YRVSTIPVTASSFEFRKGLPTAWRGVRDDALSELS 304
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GIPGCVF+H +GFIGGN +Y GALAMA A L L
Sbjct: 305 GIPGCVFIHAAGFIGGNDTYEGALAMAVAGLNL 337
>gi|342320247|gb|EGU12189.1| GAMM1 protein, putative [Rhodotorula glutinis ATCC 204091]
Length = 337
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 164/330 (49%), Positives = 220/330 (66%), Gaps = 7/330 (2%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
TP+ + +H+G+FH D+AL M+R+ ++ A++ RSRDPK ++ + DVGG YD
Sbjct: 11 TPI--LTSHSGTFHADDALALSMLRVLPQYAQAEVRRSRDPKEWEEATVLFDVGGEYDAE 68
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
+DHHQ+GF EVFGHGF TKLSSAGL+YKHFG+E+++ LN P V L+L +Y
Sbjct: 69 KGKFDHHQRGFSEVFGHGFVTKLSSAGLIYKHFGQEILSTLLNEPVSSPVVQTLYLKMYA 128
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
+F+EA D IDNGI+QY T +PPRY + T++SSRVG LN W EP + E F + +
Sbjct: 129 DFVEAFDGIDNGISQYVTTEPPRYRSRTDISSRVGALNPRWNEPSNE-DILLERFLKASE 187
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
+ G EF + + + ++WLPAR IV + R PSG+I++ F PWK HL LEE++
Sbjct: 188 MCGNEFKERLDYLAKAWLPAREIVQRAVEARKQVHPSGKILIFDEFAPWKEHLHNLEEDL 247
Query: 260 KI--EPLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
KI + L Y LY ++ +WRVQAVAVSP+ FESRK LP WRG+RDD LS+ GI
Sbjct: 248 KIPTDELPLYALYPENEKPDSKWRVQAVAVSPESFESRKALPEAWRGVRDDALSQLTGID 307
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GCVFVH +GFIGGN + GALAMA A +L
Sbjct: 308 GCVFVHAAGFIGGNNTKDGALAMAVKAREL 337
>gi|299755564|ref|XP_002912112.1| GAMM1 protein [Coprinopsis cinerea okayama7#130]
gi|298411282|gb|EFI28618.1| GAMM1 protein [Coprinopsis cinerea okayama7#130]
Length = 327
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 170/332 (51%), Positives = 216/332 (65%), Gaps = 23/332 (6%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q K++GTHNG+FHCDEAL F+++ T + +A +VR+RDP +LD D V+DVG VYD
Sbjct: 14 QRLTKKIGTHNGTFHCDEALAVFLLKQTATYASADVVRTRDPAILDGCDIVVDVGAVYDA 73
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
+DHHQ+GF +V HFG+E+IA LN P V ++L VY
Sbjct: 74 EKLRFDHHQRGFTDV-----------------HFGQEIIANRLNAAPDDPKVKYVWLKVY 116
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
++F+EA+DAIDNGI+QY TD PRY N T+LSSRVG LN W EP S + FQ+
Sbjct: 117 RDFIEALDAIDNGISQYPTDIQPRYRNKTDLSSRVGSLNPRWNEPTDSTILDAR-FQEAS 175
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIA-ERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
LAGKEFL V +Y SWLPA +V E IA + + D SG+I++ ++F PWK HLFELE
Sbjct: 176 SLAGKEFLQVVDYYRDSWLPAGDLVKELIAWSKANVDSSGKIILFQQFAPWKEHLFELES 235
Query: 258 EM---KIEP-LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
+EP YV+Y D+ G WRVQAV V+P FESRKPLP WRG+RD LS+ G
Sbjct: 236 SEGGGALEPNQAIYVVYPDEIGGNWRVQAVPVAPSSFESRKPLPEAWRGVRDGALSQLTG 295
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
I GCVFVH SGFIGGN++ GAL MA+ AL+L
Sbjct: 296 IAGCVFVHASGFIGGNKTKEGALEMAKKALEL 327
>gi|405958240|gb|EKC24386.1| UPF0160 protein MYG1, mitochondrial, partial [Crassostrea gigas]
Length = 346
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 174/340 (51%), Positives = 227/340 (66%), Gaps = 16/340 (4%)
Query: 16 SPSQTPL----KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLD 71
SP++ P ++GTHNGSFHCDE L CFM+R K+ +A+++R+RDP VLD D V+D
Sbjct: 8 SPNKKPRLIDSMKIGTHNGSFHCDEILACFMLRQLPKYADAEVIRTRDPAVLDTCDVVVD 67
Query: 72 VGGVYDPSNDCYDHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVDEG 126
VGGVYDP+ + +DHHQ+ F E ++TKLSSAGLVY HFG+E+++K L +
Sbjct: 68 VGGVYDPAKNRFDHHQRTFNESMSTVNPPKKWTTKLSSAGLVYCHFGREIVSKILELPVE 127
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQS 186
P ++ VY+NF+E IDAIDNGINQYD + PRY +TN+S RV N W EP+
Sbjct: 128 DPITDVVYDKVYENFVEEIDAIDNGINQYDGE--PRYQISTNISIRVSHFNPQWNEPNAD 185
Query: 187 AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-F 245
E F + M + G EFLD V +Y +SWLPAR +V E + R + DPSGEI+V K
Sbjct: 186 ---EMAGFLKAMKMVGAEFLDKVLYYKKSWLPARELVEEAVKGRTEVDPSGEIVVFKTGG 242
Query: 246 CPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRD 305
CPWK HLF LE E+ I P IKYVLY D+ +WR+Q V S F +R LP WRGLRD
Sbjct: 243 CPWKDHLFNLEAELDINPPIKYVLYT-DQANKWRIQCVPESLVSFSNRLSLPEDWRGLRD 301
Query: 306 DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
D L++++GI GC+FVH GFIGGN +Y GAL MA+ +LK+
Sbjct: 302 DVLTEKSGIEGCIFVHAGGFIGGNHTYEGALEMAKKSLKM 341
>gi|393243398|gb|EJD50913.1| putative GAMM1 protein [Auricularia delicata TFB-10046 SS5]
Length = 335
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 159/324 (49%), Positives = 214/324 (66%), Gaps = 3/324 (0%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FHCDEAL +++R T F A ++R+RDP LD D V+DVG VYD +
Sbjct: 11 KTIGTHNGTFHCDEALAVYLLRQTATFKGADVLRTRDPAKLDTCDIVVDVGAVYDAAQLR 70
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+ F FG G+ TKLSSAGLVYKHFG+E++A L+ + L+L +YK F+
Sbjct: 71 FDHHQREFNHTFGEGWHTKLSSAGLVYKHFGREIVANSLDAPLDDASLDLLYLKMYKGFI 130
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
EAIDAIDNG+ QYDT P RY ++T+LSSRVG LN W E SA + F+Q L G
Sbjct: 131 EAIDAIDNGVAQYDTTSPQRYSSSTDLSSRVGFLNPAWNEKVDSAG-VDALFEQASTLTG 189
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI- 261
EF ++ Y SWLPAR IV R+ SG I+V + + PWK HL+++E+ ++
Sbjct: 190 SEFASRLKHYTSSWLPARDIVRAAFELRHKAHASGRIVVFESYAPWKDHLYDVEKAAQVA 249
Query: 262 -EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+ L+ YV+Y D+ WR+QAV + F +RK LP WRGLRDD+LS+ +GIPG +F+
Sbjct: 250 GDDLVLYVVYPDETAGSWRIQAVTKEGEAFTNRKSLPEPWRGLRDDKLSEASGIPGGIFI 309
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H SGFIGGN++ GAL +A AL+
Sbjct: 310 HASGFIGGNKTKEGALQIAVKALE 333
>gi|395835006|ref|XP_003790476.1| PREDICTED: UPF0160 protein MYG1, mitochondrial [Otolemur garnettii]
Length = 374
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 220/339 (64%), Gaps = 17/339 (5%)
Query: 17 PSQTPLK------RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL 70
P + PL+ R+GTHNG+FHCDEAL C +RL ++ +A+IVR+RDP+ L D V+
Sbjct: 31 PPKRPLRNLMAPPRIGTHNGTFHCDEALACAFLRLLPEYRDAEIVRTRDPEKLASCDIVV 90
Query: 71 DVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDE 125
DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E
Sbjct: 91 DVGGEYDPQRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSE 150
Query: 126 GHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQ 185
V L+ +Y+NF+E +DA+DNGI+Q++ + PRY T LS+RV +LN W +P+Q
Sbjct: 151 EDSVVGTLYDKMYENFVEEVDAVDNGISQWEEGE-PRYALTTTLSARVARLNPTWNQPNQ 209
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KR 244
E F++ MDL +EFL + FY SWLPA+++V +A+R+ DPSGEI+ L K
Sbjct: 210 DTE---AGFKRAMDLVKEEFLQRLDFYQHSWLPAQALVEAALAQRFQVDPSGEIIELAKG 266
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
CPWK HL+ LE + I +V++ D G QWRVQ V P F+SR PLP WRGLR
Sbjct: 267 GCPWKEHLYHLESGLSPPVAIIFVIFTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLR 325
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
D L + +GIPGC+FVH SGFIGG+ + GAL MARA L
Sbjct: 326 DKALDQVSGIPGCIFVHASGFIGGHHTREGALNMARATL 364
>gi|346470471|gb|AEO35080.1| hypothetical protein [Amblyomma maculatum]
Length = 374
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 169/348 (48%), Positives = 233/348 (66%), Gaps = 15/348 (4%)
Query: 6 VSSSPAYSTSSPSQTP---LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKV 62
V S+ + P++ P +K +GTHNG+FHCDEAL CF+I+ ++ +A +VRSRDP V
Sbjct: 30 VRSASLMAGEPPTKKPAAMVKTIGTHNGTFHCDEALACFLIKQLAEYKDATVVRSRDPAV 89
Query: 63 LDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELI 117
LD D V+DVGGV+DP+ YDHHQ+GFEE ++TKLSSAGLVY H G+E+I
Sbjct: 90 LDTCDVVVDVGGVFDPAKKRYDHHQRGFEETMHSLDSSKKWTTKLSSAGLVYVHHGREVI 149
Query: 118 AKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN 177
A+ L +P++ +++ VY+NFME IDAIDNGIN YD + PRY NTNLSSRV LN
Sbjct: 150 AETLGWKVTNPNLEKIYDKVYENFMEEIDAIDNGINAYDGE--PRYRINTNLSSRVAHLN 207
Query: 178 LDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSG 237
W EP+ + +E FQ+ M L G+EFLD + +Y SW+PAR +V+ + +R +
Sbjct: 208 PPWNEPN---PKPDEQFQKAMKLTGEEFLDRIHYYATSWIPARELVLSALQQRQKVGEAS 264
Query: 238 EIMVLKR-FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPL 296
+++L+R CPWK HL LEEE+ I +K+VLY+D G WRVQ V V+ FE R L
Sbjct: 265 SVVILERGGCPWKDHLLALEEELSIPGEVKFVLYQDQNG-SWRVQGVPVTLGSFECRIHL 323
Query: 297 PAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
P +WRGLR++ELSK GI GC+F+H SGFIGG+++ G + +A ++
Sbjct: 324 PEKWRGLRNEELSKATGIDGCIFIHSSGFIGGHKTKEGVVEIAMRTIQ 371
>gi|410920035|ref|XP_003973489.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Takifugu
rubripes]
Length = 337
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 158/329 (48%), Positives = 220/329 (66%), Gaps = 14/329 (4%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++GTHNG+FHCDE L CF +R ++ +A+++R+RDP L + D V+DVGG +DP
Sbjct: 4 KKIGTHNGTFHCDEVLACFFLRQIPEYKDAEVIRTRDPTRLAECDIVVDVGGEFDPKRHR 63
Query: 83 YDHHQKGFEEVFGHGFS------TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
YDHHQ+ F E F H TKLSSAGLVY HFG++L+A+ + E ++ LF
Sbjct: 64 YDHHQRTFTETF-HSLCPEKPWVTKLSSAGLVYVHFGRQLLAELTKLKEHDRELEVLFDK 122
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y+NF+E +DA+DNGI+QYD + RY +T LS+RV LN W Q E E F++
Sbjct: 123 LYENFVEEVDAVDNGISQYDGE--ARYTVSTTLSARVAHLNPRWNSKSQDTE---EGFRK 177
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFEL 255
+ + +EFLD + FY SWLPAR++V E + +R+ DPSGE+++ + CPWK HLF L
Sbjct: 178 ALQMVEEEFLDRLEFYQSSWLPARAVVEEAVKKRHQVDPSGEVLLFSQGGCPWKEHLFAL 237
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
E+E+ +E IK+VLY D G QWR+Q V + F++R PL +WRG+RD+ LS+ +GI
Sbjct: 238 EKELNVETPIKFVLYTDQNG-QWRIQCVPAGLNTFQNRLPLLEEWRGVRDEALSELSGIK 296
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAALK 344
GC+FVH GFIGGN+S GAL MAR L+
Sbjct: 297 GCIFVHAGGFIGGNKSMEGALEMARRTLQ 325
>gi|432857620|ref|XP_004068720.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Oryzias
latipes]
Length = 396
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 163/328 (49%), Positives = 217/328 (66%), Gaps = 12/328 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++GTHNG+FHCDE L CF +R ++ +A+IVRSRDP VL + D V+DVGG +DP
Sbjct: 64 KKIGTHNGTFHCDEVLACFFLRQLPEYADAEIVRSRDPAVLAECDVVVDVGGEFDPKRHR 123
Query: 83 YDHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
YDHHQ+GF E F + TKLSSAGL+Y HFG+ L+A+ + + E V L+ +
Sbjct: 124 YDHHQRGFTESFNSLCPEKPWVTKLSSAGLIYLHFGRRLLAQLMQLKEDDRLVELLYNKL 183
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y+NF+E +DAIDNGI+QYD RY ++ LSSRVG LN W Q E E F++
Sbjct: 184 YENFVEEVDAIDNGISQYDGK--ARYKISSTLSSRVGYLNPRWNSKSQDTE---EGFRKA 238
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELE 256
M L G+EF+D V FY SWLPAR +V + ER+ DPSGE+++ + CPWK HLF+LE
Sbjct: 239 MKLVGEEFMDRVDFYKSSWLPAREVVEAAVKERHQVDPSGEVVLFSQGGCPWKEHLFDLE 298
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
E+++E IK+VLY D G WRVQ V F++R L +WRG+RD+ LS+ +GI
Sbjct: 299 NELQVETQIKFVLYPDQNG-HWRVQCVPADLHSFQNRLSLLEEWRGIRDEALSELSGIKD 357
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
C+FVH GFIGGN++ GAL MAR L+
Sbjct: 358 CIFVHAGGFIGGNKNQEGALEMARRTLQ 385
>gi|344266905|ref|XP_003405519.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Loxodonta
africana]
Length = 381
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 173/345 (50%), Positives = 224/345 (64%), Gaps = 11/345 (3%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P Y R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 27 GPVSVPPYKRPRGDFMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 86
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGLVY HFG +L+A+
Sbjct: 87 SCDIVVDVGGEYDPQRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLVYLHFGHKLLAQ 146
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L+ E V L+ +Y+NF+E +DA+DNGI+Q++ + PRY TNLS+RV +LN
Sbjct: 147 LLSTSEEDSMVGTLYDKMYENFVEEVDAVDNGISQWEEGE-PRYAVTTNLSARVARLNPT 205
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W +P+Q E FQ+ MDL +EFL + FY SWLPAR+++ E +A+R+ DPSGEI
Sbjct: 206 WNQPNQDTE---AGFQRAMDLVREEFLQRLDFYQHSWLPARALMEEALAQRFQVDPSGEI 262
Query: 240 MVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ L K CPWK HL+ LE E+ I +V+Y D G QWRVQ V P F+SR PLP
Sbjct: 263 VELAKGGCPWKDHLYNLESELSPPVAIAFVIYTDQAG-QWRVQCVPKEPYSFQSRLPLPE 321
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD L + GIPGC+FVH SGFIGG+ + GAL+MARA L
Sbjct: 322 PWRGLRDKALDQVTGIPGCIFVHASGFIGGHHTREGALSMARATL 366
>gi|317418812|emb|CBN80850.1| Melanocyte proliferating gene 1 [Dicentrarchus labrax]
Length = 348
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 158/330 (47%), Positives = 220/330 (66%), Gaps = 14/330 (4%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
++++GTHNG+FHCDE L CF +R ++ +A+I+R+RDP L D V+DVGG +DP
Sbjct: 15 VQKIGTHNGTFHCDEVLACFFLRQLPEYEDAEIIRTRDPTKLAQCDIVVDVGGEFDPKRH 74
Query: 82 CYDHHQKGFEEVFGHGFS------TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
YDHHQ+ F E F H TKLSSAGLVY HFG++L+A+ + EG + LF
Sbjct: 75 RYDHHQRTFTESF-HSLCPEKPWVTKLSSAGLVYLHFGRQLLAQLTQLKEGDKQLEMLFD 133
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
+Y+NF+E +DAIDNGI+Q+D + RY +T LS+RV LN W Q E E F+
Sbjct: 134 KLYENFVEEVDAIDNGISQFDGE--ARYSISTTLSARVSHLNPCWNSKSQDTE---EGFK 188
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFE 254
+ M + G+EFLD + FY SWLPAR++V + + ER+ DPSGE+++ + CPWK HLF
Sbjct: 189 KAMKMVGEEFLDRLEFYQSSWLPARTVVEDAVKERHQVDPSGEVVLFSQGGCPWKEHLFT 248
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
LE+E+++E IK++LY D G QWRVQ V + F++R L +WRG+RD+ LS+ +GI
Sbjct: 249 LEKELQVETPIKFILYPDQNG-QWRVQCVPAGLNTFQNRLSLLEEWRGVRDEALSELSGI 307
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALK 344
C+FVH GFIGGN++ GAL MAR L+
Sbjct: 308 KDCIFVHAGGFIGGNKNQEGALEMARRTLQ 337
>gi|345791678|ref|XP_849816.2| PREDICTED: UPF0160 protein MYG1, mitochondrial [Canis lupus
familiaris]
Length = 383
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 168/326 (51%), Positives = 221/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTH+G+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 47 RIGTHDGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLAACDVVVDVGGEYDPQRHRY 106
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGLVY HFG +L+A+ L E V ++ +Y
Sbjct: 107 DHHQRSFTETMSSLSPGKRWQTKLSSAGLVYLHFGHKLLAQLLGTSEEDSMVGTIYDKMY 166
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ +P RYV T LS+RV +LN W +P+Q E FQ+ M
Sbjct: 167 ENFVEEVDAVDNGISQWEEGEP-RYVLTTTLSARVARLNPTWNQPNQDTE---AGFQRAM 222
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL V FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 223 DLVREEFLQRVDFYQHSWLPARALVQEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 282
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ + +V+Y D G QWRVQ V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 283 GLSPPVTVAFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 341
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
VFVH SGFIGG+++ GAL+MARA L
Sbjct: 342 VFVHTSGFIGGHRTREGALSMARATL 367
>gi|348528813|ref|XP_003451910.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Oreochromis
niloticus]
Length = 400
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 223/344 (64%), Gaps = 17/344 (4%)
Query: 12 YSTSSPSQ-----TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDL 66
Y +S P + + ++GTHNG+FHCDE L CF +R ++ + +IVR+RDP L +
Sbjct: 52 YMSSEPKRLCTENMTVTKIGTHNGTFHCDEVLACFFLRQLPEYSDGEIVRTRDPAQLAEC 111
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKEL 121
D V+DVGG +DP YDHHQ+ F + F + TKLSSAGLVY HFG+ ++++
Sbjct: 112 DIVVDVGGEFDPKRHRYDHHQRTFSDTFNSLCPEKPWVTKLSSAGLVYLHFGRRVLSQLT 171
Query: 122 NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
+ E + L+ +Y+NF+E +DA+DNGI+QYD + RY +T LS+RV LN W
Sbjct: 172 QLKEDDRQLEVLYDKLYENFVEEVDAVDNGISQYDGES--RYSISTTLSARVSHLNPRWN 229
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
DQ E E F++ + + G+EFLD + FY SWLPAR +V E + +R+ DPSGE+++
Sbjct: 230 STDQDTE---EGFKKALKMVGEEFLDRLDFYKSSWLPAREVVEEAVKKRHQIDPSGEVLL 286
Query: 242 LKR-FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
+ CPWK HLF LE+E+ +E IK+VLY D G QWRVQ V + F++R LP +W
Sbjct: 287 FSQGGCPWKEHLFALEKELHVETPIKFVLYPDQNG-QWRVQCVPAGLNTFQNRLSLPEEW 345
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RG+RD+ LS+ +GI GC+FVH GFIGGN++ GAL MAR L+
Sbjct: 346 RGVRDEALSELSGIKGCIFVHAGGFIGGNKTQEGALEMARRTLQ 389
>gi|388578771|gb|EIM19108.1| metal-dependent protein hydrolase [Wallemia sebi CBS 633.66]
Length = 343
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 161/331 (48%), Positives = 225/331 (67%), Gaps = 11/331 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG FH DEAL F++RL ++ + + R+RD VL+ +D V+DVG VYD
Sbjct: 16 KVIGTHNGVFHADEALAVFLLRLLPEYTGSAVTRTRDINVLNQMDIVVDVGDVYDHGKLR 75
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHH +GF EVFGH +TKLSSAGL+YKH+GK++I+K L E +++L VYK F+
Sbjct: 76 YDHHFRGFNEVFGHNHNTKLSSAGLIYKHYGKQIISKHLGWSEDEDRTLQIWLKVYKEFI 135
Query: 143 EAIDAIDNGINQYDT-DKPPR--YVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
EA+DAIDNG++ Y T + P Y N T+LSSR+G+LN +W E + + + + F++
Sbjct: 136 EALDAIDNGVSLYPTVENLPEAAYRNKTDLSSRIGRLNSNWNE-EFTEQSQMTRFEEASK 194
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE--- 256
LAG EFLD+V Y + WLPAR +VVE +++R ++D SG+I+V K FCPW H FELE
Sbjct: 195 LAGSEFLDSVNSYAKVWLPARDLVVEALSKRKEFDESGKILVFKTFCPWIDHYFELEHLP 254
Query: 257 --EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
E K E + +L + +RG WRV+A+ + F RKPLP WRG+RD++L++ +G+
Sbjct: 255 QFEVKKDEEPLYALLPDSNRG--WRVRAIPPNSTTFALRKPLPEPWRGVRDEKLAEVSGV 312
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
PG +F H SGFIGGN +Y GAL MAR A++L
Sbjct: 313 PGTIFCHASGFIGGNDTYEGALEMARKAVQL 343
>gi|444513884|gb|ELV10469.1| UPF0160 protein MYG1, mitochondrial [Tupaia chinensis]
Length = 379
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 172/349 (49%), Positives = 226/349 (64%), Gaps = 14/349 (4%)
Query: 1 MAGTGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDP 60
M G+ P S P R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP
Sbjct: 25 MLGSETVPPPKRPRSDLMAPP--RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDP 82
Query: 61 KVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKE 115
+ L D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +
Sbjct: 83 EKLASCDVVVDVGGEYDPKRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHK 142
Query: 116 LIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGK 175
L+A+ L+++E V L+ +Y+NF+E +DA+DNGI+Q++ + PRY T LS+RV +
Sbjct: 143 LLAQLLDMNEEDSMVGTLYDKMYENFVEEVDAVDNGISQWEGE--PRYALTTTLSARVAR 200
Query: 176 LNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP 235
LN W +P+Q E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DP
Sbjct: 201 LNPTWNQPNQDTE---AGFRRAMDLVREEFLQRLDFYQHSWLPARALVEEALAQRFQVDP 257
Query: 236 SGEIMVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRK 294
SGEI+ L K CPWK HLF LE E+ I +V+Y D G QWRVQ V P F+SR
Sbjct: 258 SGEILELAKGGCPWKEHLFHLESELCPPVAIAFVIYTDQAG-QWRVQCVPKEPHSFQSRL 316
Query: 295 PLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
PLP WRGLRD L + +GIPGC+FVH SGFIGG+ + G L MARA L
Sbjct: 317 PLPEPWRGLRDKALDQVSGIPGCIFVHASGFIGGHHTREGVLNMARATL 365
>gi|449019325|dbj|BAM82727.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 435
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 161/333 (48%), Positives = 217/333 (65%), Gaps = 6/333 (1%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
S S S + +GTH+G+FHCDEAL C+M+RL +F +A I+R+RDP +L LD V+DV
Sbjct: 101 SLQSFSAMTSRSIGTHSGTFHCDEALACYMLRLLPEFRDASIIRTRDPALLATLDCVVDV 160
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR 132
G YDP +DHHQ+ F + F T+LSSAGLVYKHFG+ELI +++ +
Sbjct: 161 GAEYDPERLRFDHHQRSFSDTFSTSKKTRLSSAGLVYKHFGRELI-RQITGTSDENVLEL 219
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
L+ VY F+EAID DNGI+ T++ PRYV+NT L R+ +LN W D+S + +
Sbjct: 220 LYTRVYDGFIEAIDGNDNGIDA--TNERPRYVDNTTLPMRISRLNTPWN-VDESEQEQLV 276
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
F+Q M++AG EF+ VR WLPAR++V+E +R++ SG ++ L+R+CPWK HL
Sbjct: 277 RFEQAMEMAGTEFVAYVRGLYGQWLPARAVVLEAFHKRFETHASGRMIYLERWCPWKTHL 336
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
+++E E I +VLY D G WRVQAV PD FE+R LP +WRGL+D L K
Sbjct: 337 YDIESENSALGEIYFVLYRDHSG-MWRVQAVN-KPDSFENRISLPIEWRGLQDAALEKVT 394
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GI GC FVH SGFIGGN++Y GALAMA AL +
Sbjct: 395 GIRGCTFVHASGFIGGNETYEGALAMACLALSM 427
>gi|351706153|gb|EHB09072.1| UPF0160 protein MYG1, mitochondrial [Heterocephalus glaber]
Length = 351
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 219/326 (67%), Gaps = 12/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL +F +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 22 RIGTHNGTFHCDEALACALLRLLPEFQDAEIVRTRDPEKLASCDIVVDVGGEYDPRRHRY 81
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V ++ +Y
Sbjct: 82 DHHQRSFTETMNSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDTMVGTIYDKMY 141
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NFME +DA+DNGI+Q++ + PRY TNLS+RV +LN W +P+Q E F++ M
Sbjct: 142 ENFMEEVDAVDNGISQWEGE--PRYALTTNLSARVARLNPTWNQPNQDTE---AGFRRAM 196
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY RSWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 197 DLVREEFLQRLDFYQRSWLPARALVEEALAQRFQVDPSGEIVELSKGGCPWKEHLYHLES 256
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ +V+Y D G QWRVQ V F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 257 GLSPPQTTAFVIYTDQAG-QWRVQCVPKELHSFQSRLPLPELWRGLRDEALDQVSGIPGC 315
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+ + GAL+MARA L
Sbjct: 316 IFVHASGFIGGHHTREGALSMARATL 341
>gi|11096332|ref|NP_068359.1| UPF0160 protein MYG1, mitochondrial precursor [Mus musculus]
gi|14194965|sp|Q9JK81.1|MYG1_MOUSE RecName: Full=UPF0160 protein MYG1, mitochondrial; AltName:
Full=Protein Gamm1; Flags: Precursor
gi|7595964|gb|AAF64518.1|AF252871_1 GAMM1 protein [Mus musculus]
gi|10444287|gb|AAG17846.1| MYG1 [Mus musculus]
gi|12835156|dbj|BAB23171.1| unnamed protein product [Mus musculus]
gi|148672042|gb|EDL03989.1| melanocyte proliferating gene 1, isoform CRA_b [Mus musculus]
Length = 380
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 166/326 (50%), Positives = 219/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ NA+IVR+RDP+ L D V+DVGG Y+P + Y
Sbjct: 45 RIGTHNGTFHCDEALACALLRLLPEYANAEIVRTRDPEKLASCDIVVDVGGEYNPQSHRY 104
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGLVY HFG++L+A+ L E V ++ +Y
Sbjct: 105 DHHQRTFTETMSSLCPGKPWQTKLSSAGLVYLHFGRKLLAQLLGTSEEDSVVDTIYDKMY 164
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W +P+Q E F++ M
Sbjct: 165 ENFVEEVDAVDNGISQW-AEGEPRYAMTTTLSARVARLNPTWNQPNQDTE---AGFRRAM 220
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ D SGEI+ L K CPWK HL+ LE
Sbjct: 221 DLVQEEFLQRLNFYQHSWLPARALVEEALAQRFKVDSSGEIVELAKGGCPWKEHLYHLES 280
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E+ + I +V+Y D G QWRVQ V P F+SR PLP WRGLRD L + +GIPGC
Sbjct: 281 ELSPKVAITFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDKALDQVSGIPGC 339
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+ + GAL MARA L
Sbjct: 340 IFVHASGFIGGHHTREGALNMARATL 365
>gi|149714826|ref|XP_001504562.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Equus
caballus]
Length = 380
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 166/326 (50%), Positives = 219/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 45 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPQRHRY 104
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V ++ +Y
Sbjct: 105 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTIYDKMY 164
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ +P RY T LS+RV +LN W +P+Q E F++ M
Sbjct: 165 ENFVEEVDAVDNGISQWEEGEP-RYALTTTLSARVARLNPTWNQPNQETE---AGFKRAM 220
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 221 DLVREEFLQRLDFYQHSWLPARTLVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 280
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V Y D G QWRVQ V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 281 GLSPPVTIAFVTYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 339
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
VFVH SGFIGG+++ GAL+MARA L
Sbjct: 340 VFVHASGFIGGHRTREGALSMARATL 365
>gi|302565808|ref|NP_001181189.1| UPF0160 protein MYG1, mitochondrial precursor [Macaca mulatta]
gi|355564280|gb|EHH20780.1| UPF0160 protein MYG1, mitochondrial [Macaca mulatta]
Length = 376
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 173/349 (49%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 1 MAGTGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDP 60
M G S P S +P R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP
Sbjct: 25 MLGPESVSPPKRPRSKLMASP--RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDP 82
Query: 61 KVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKE 115
+ L D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +
Sbjct: 83 EKLASCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHK 142
Query: 116 LIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGK 175
L+A+ L E V L+ +Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +
Sbjct: 143 LLAQLLGTSEEDSMVGTLYDKMYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVAR 201
Query: 176 LNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP 235
LN W +PDQ E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DP
Sbjct: 202 LNPTWNQPDQDTE---AGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDP 258
Query: 236 SGEIMVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRK 294
SGEI+ L K CPWK HL+ LE + I +V+Y D G QWRVQ+V P F+SR
Sbjct: 259 SGEIVELAKGGCPWKEHLYHLESGLSPPVAIIFVIYTDQAG-QWRVQSVPKEPHSFQSRL 317
Query: 295 PLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
PLP WRGLRD+ L + +GIPGC+FVH SGFIGG+++ GAL MARA L
Sbjct: 318 PLPEPWRGLRDEALDQVSGIPGCIFVHASGFIGGHRTREGALNMARATL 366
>gi|291389245|ref|XP_002711173.1| PREDICTED: MYG1 protein [Oryctolagus cuniculus]
Length = 378
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 169/326 (51%), Positives = 219/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 43 RIGTHNGTFHCDEALACALLRLLPEYQDAEIVRTRDPEKLASCDIVVDVGGEYDPQRHRY 102
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGLVY HFG +L+A+ L E + V L+ +Y
Sbjct: 103 DHHQRSFTETMSSLCPGKPWQTKLSSAGLVYLHFGHKLLAQLLGTTEDNDVVGVLYDKMY 162
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI Q+D +P RY T LS+RV +LN W +P+Q E FQ+ M
Sbjct: 163 ENFVEEVDAVDNGIAQWDEGEP-RYALTTTLSARVARLNPTWNQPNQDTE---AGFQRAM 218
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R DPSGEI+ L K CPWK HL+ LE
Sbjct: 219 DLVREEFLQRLDFYQHSWLPARALVEEALAQRLQVDPSGEIVELAKGGCPWKEHLYHLES 278
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 279 TLCPPVAIAFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDETLDQVSGIPGC 337
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL+MARA L
Sbjct: 338 IFVHASGFIGGHRTREGALSMARATL 363
>gi|330846600|ref|XP_003295106.1| hypothetical protein DICPUDRAFT_44326 [Dictyostelium purpureum]
gi|325074269|gb|EGC28369.1| hypothetical protein DICPUDRAFT_44326 [Dictyostelium purpureum]
Length = 334
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 216/319 (67%), Gaps = 4/319 (1%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+GSFH DEAL C+M++L + +A+IVRSRD KV+D +DVG VYD +D
Sbjct: 8 ICTHSGSFHADEALACYMLKLVPTYKDAKIVRSRDKKVIDASTVAVDVGAVYDLEKLRFD 67
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ GF E F TKLSSAGL+YK+FGK +I LN +E ++ LF +Y N +E
Sbjct: 68 HHQAGFTETFDDKHLTKLSSAGLIYKNFGKTIIKNRLNTNESVTEL--LFNKIYDNTIEE 125
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG+ +Y +D P+Y +N+++SSRVG+LN W EP Q + F++ M+L G
Sbjct: 126 LDGVDNGVERYPSDVKPKYQSNSSISSRVGRLNPAWNEP-QDDDLVFTQFEKAMELMGSY 184
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
FLD + +Y +SWLP RSIV I R + SGEI+VL+ +CPWK HL+ LE+E +I+
Sbjct: 185 FLDCLDYYGKSWLPCRSIVENSIDNRKNIHSSGEIIVLEMYCPWKDHLYTLEQEKEIKTP 244
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
IK+VL+ED G QWR+ AV+++ F +R LP +WRG RD+ELS+ +GI GCVF H +G
Sbjct: 245 IKFVLFEDTSG-QWRIGAVSINLHSFTNRLSLPEEWRGKRDEELSQISGIEGCVFCHANG 303
Query: 325 FIGGNQSYGGALAMARAAL 343
FIGGN++ GAL MA AL
Sbjct: 304 FIGGNKTREGALLMAIKAL 322
>gi|268559356|ref|XP_002637669.1| Hypothetical protein CBG19425 [Caenorhabditis briggsae]
Length = 341
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 162/328 (49%), Positives = 215/328 (65%), Gaps = 12/328 (3%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G FHCDEA CFM++ +F + IVR+RDP L+ D V+DVGG++D S YD
Sbjct: 5 IGTHSGKFHCDEAFACFMLKQLKQFKDHSIVRTRDPAQLEKCDIVVDVGGIFDHSKQRYD 64
Query: 85 HHQKGFEEVFGH----GFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
HHQ+GF E F TKLSSAGLVY H+GK++I + L + V + +Y+
Sbjct: 65 HHQRGFTETMRTLEKLNFDTKLSSAGLVYAHYGKDVINQILGGNVPPTMVDLFYHRLYEQ 124
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F+E+IDAIDNGI+QYD PRY + NLSSR G+ N W EP ++E FQQ M+
Sbjct: 125 FVESIDAIDNGISQYDG--VPRYHSPGNLSSRTGQFNAHWNEP---GNNDDERFQQAMEF 179
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELEEEM 259
G EF +V++ W PAR I+ + ER++ DPSG I++++ CPWK H F++E E
Sbjct: 180 IGGEFERSVKYLANVWWPAREIIEAAVDERFENDPSGRIILIENGGCPWKEHFFDIEAEK 239
Query: 260 KI-EPLIKYVLYEDDRGKQWRVQAVAVSP-DRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
I + I Y+L+ D WRVQA+ + FE+R PLPA WRGLRDD+LSKE+GIPG
Sbjct: 240 NIADDKITYILFSDSTNSSWRVQAIPIDKMSSFENRMPLPAAWRGLRDDDLSKESGIPGG 299
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVH+SGFIGGN + GALAMAR AL++
Sbjct: 300 VFVHISGFIGGNLTREGALAMARKALEI 327
>gi|301775781|ref|XP_002923299.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Ailuropoda
melanoleuca]
Length = 382
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 220/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 47 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLAACDIVVDVGGEYDPQRHRY 106
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V ++ +Y
Sbjct: 107 DHHQRSFTETMSSLSPGKPWRTKLSSAGLIYLHFGHKLLAQLLGTSEEDSVVGTIYDKMY 166
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ +P RYV T LS+RV +LN W +P+Q E F++ M
Sbjct: 167 ENFVEEVDAVDNGISQWEEGEP-RYVLTTTLSARVARLNPAWNQPNQDTE---AGFKRAM 222
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 223 DLVREEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 282
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 283 GLSPPGTIAFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 341
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
VFVH SGFIGG+ + GAL+MARA L
Sbjct: 342 VFVHTSGFIGGHHTREGALSMARATL 367
>gi|326435856|gb|EGD81426.1| MYG1 protein [Salpingoeca sp. ATCC 50818]
Length = 334
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 164/328 (50%), Positives = 217/328 (66%), Gaps = 17/328 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+GSFHCDEAL CF++RL + + I+R+R+ + LD V+DVGG +DP+ +D
Sbjct: 17 IGTHDGSFHCDEALACFLLRLLPTYKDCAIIRTRNQEKLDTCTVVVDVGGKFDPTKLRFD 76
Query: 85 HHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
HHQ+ F ++TKLSSAGLVY HFG+E+I + D+ +V LF +Y
Sbjct: 77 HHQREFAHTMNSLDNNKRWTTKLSSAGLVYFHFGREIIKEVCKTDDSTTEV--LFDQMYD 134
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTE--PDQSAERENEAFQQG 197
NF+E IDA+DNGI+ ++ P+Y + + +RV +LN WTE PD A+ F++
Sbjct: 135 NFVEEIDAVDNGISPHEGQ--PKYRVASTVGNRVARLNPPWTEKNPDFDAQ-----FKKA 187
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
M+L G+EFL+ + Y WLPAR VV ++ R + D SGEI+V ++ CPWK HLF LEE
Sbjct: 188 MNLVGQEFLEVLNHYNDVWLPARKHVVNALSTRKEVDKSGEIIVFEQACPWKAHLFALEE 247
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E + IKY L+ D GK WRV AV VSP F SRK LP WRGLRD ELS +AGIPGC
Sbjct: 248 EADEKINIKYALFTDTHGK-WRVMAVPVSPSSFTSRKALPEVWRGLRDQELSDKAGIPGC 306
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
+FVH SGFIGGN + GALAMARA+LK+
Sbjct: 307 IFVHASGFIGGNNTKEGALAMARASLKM 334
>gi|380789933|gb|AFE66842.1| UPF0160 protein MYG1, mitochondrial precursor [Macaca mulatta]
gi|383411893|gb|AFH29160.1| MYG1 protein precursor [Macaca mulatta]
gi|384942826|gb|AFI35018.1| MYG1 protein precursor [Macaca mulatta]
Length = 376
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 219/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPRRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W +PDQ E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPTWNQPDQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 222 DLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ+V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 282 GLSPPVAIIFVIYTDQAG-QWRVQSVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL MARA L
Sbjct: 341 IFVHASGFIGGHRTREGALNMARATL 366
>gi|355786139|gb|EHH66322.1| UPF0160 protein MYG1, mitochondrial [Macaca fascicularis]
Length = 376
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 219/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPRRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W +PDQ E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPTWNQPDQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 222 DLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ+V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 282 GLSPPVAIIFVIYTDQAG-QWRVQSVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL MARA L
Sbjct: 341 IFVHASGFIGGHRTREGALNMARATL 366
>gi|281341829|gb|EFB17413.1| hypothetical protein PANDA_012440 [Ailuropoda melanoleuca]
Length = 369
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 220/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 47 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLAACDIVVDVGGEYDPQRHRY 106
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V ++ +Y
Sbjct: 107 DHHQRSFTETMSSLSPGKPWRTKLSSAGLIYLHFGHKLLAQLLGTSEEDSVVGTIYDKMY 166
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ +P RYV T LS+RV +LN W +P+Q E F++ M
Sbjct: 167 ENFVEEVDAVDNGISQWEEGEP-RYVLTTTLSARVARLNPAWNQPNQDTE---AGFKRAM 222
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 223 DLVREEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 282
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 283 GLSPPGTIAFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 341
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
VFVH SGFIGG+ + GAL+MARA L
Sbjct: 342 VFVHTSGFIGGHHTREGALSMARATL 367
>gi|355733010|gb|AES10882.1| MYG1 protein [Mustela putorius furo]
Length = 350
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 218/326 (66%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 18 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPQRHRY 77
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V ++ +Y
Sbjct: 78 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTIYDKMY 137
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ + PRYV T LS+RV +LN W +PDQ F++ M
Sbjct: 138 ENFVEEVDAVDNGISQWEEGE-PRYVLTTTLSARVARLNPTWNQPDQDT---XAGFKRAM 193
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 194 DLVREEFLQRLDFYQHSWLPARALVEEALARRFQVDPSGEIVELAKGGCPWKEHLYHLES 253
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWR+Q V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 254 GLSPPGTIAFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 312
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+ + GAL+MARA L
Sbjct: 313 IFVHTSGFIGGHHTREGALSMARATL 338
>gi|332839255|ref|XP_522402.3| PREDICTED: UPF0160 protein MYG1, mitochondrial isoform 4 [Pan
troglodytes]
Length = 439
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 223/345 (64%), Gaps = 11/345 (3%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 90 GPESVPPPKRSRSKLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 149
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 150 SCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQ 209
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L E V L+ +Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN
Sbjct: 210 LLGTSEEDSMVGTLYDKMYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPT 268
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W PDQ E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI
Sbjct: 269 WNHPDQDTE---AGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEI 325
Query: 240 M-VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ ++K CPWK HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP
Sbjct: 326 VELVKGACPWKEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPE 384
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGFIGG+++ GAL+MARA L
Sbjct: 385 PWRGLRDEALDQVSGIPGCIFVHASGFIGGHRTREGALSMARATL 429
>gi|30704877|gb|AAH51871.1| Chromosome 12 open reading frame 10 [Homo sapiens]
gi|119617094|gb|EAW96688.1| chromosome 12 open reading frame 10, isoform CRA_b [Homo sapiens]
gi|312153118|gb|ADQ33071.1| chromosome 12 open reading frame 10 [synthetic construct]
Length = 376
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 222/345 (64%), Gaps = 11/345 (3%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 27 GPESVPPPKRSRSKLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 86
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 87 SCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQ 146
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L E V L+ +Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN
Sbjct: 147 LLGTSEEDSMVGTLYDKMYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPT 205
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W PDQ E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI
Sbjct: 206 WNHPDQDTE---AGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEI 262
Query: 240 MVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ L K CPWK HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP
Sbjct: 263 VELAKGACPWKEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPE 321
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGFIGG+++ GAL+MARA L
Sbjct: 322 PWRGLRDEALDQVSGIPGCIFVHASGFIGGHRTREGALSMARATL 366
>gi|397522001|ref|XP_003831069.1| PREDICTED: UPF0160 protein MYG1, mitochondrial [Pan paniscus]
gi|410227702|gb|JAA11070.1| chromosome 12 open reading frame 10 [Pan troglodytes]
gi|410255074|gb|JAA15504.1| chromosome 12 open reading frame 10 [Pan troglodytes]
gi|410299334|gb|JAA28267.1| chromosome 12 open reading frame 10 [Pan troglodytes]
gi|410333115|gb|JAA35504.1| chromosome 12 open reading frame 10 [Pan troglodytes]
Length = 376
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 223/345 (64%), Gaps = 11/345 (3%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 27 GPESVPPPKRSRSKLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 86
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 87 SCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQ 146
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L E V L+ +Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN
Sbjct: 147 LLGTSEEDSMVGTLYDKMYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPT 205
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W PDQ E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI
Sbjct: 206 WNHPDQDTE---AGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEI 262
Query: 240 M-VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ ++K CPWK HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP
Sbjct: 263 VELVKGACPWKEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPE 321
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGFIGG+++ GAL+MARA L
Sbjct: 322 PWRGLRDEALDQVSGIPGCIFVHASGFIGGHRTREGALSMARATL 366
>gi|145275185|ref|NP_067653.3| UPF0160 protein MYG1, mitochondrial precursor [Homo sapiens]
Length = 376
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 221/345 (64%), Gaps = 11/345 (3%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 27 GPESVPPPKRSRSKLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 86
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 87 SCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQ 146
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L E V L+ +Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN
Sbjct: 147 LLGTSEEDSMVGTLYDKMYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPT 205
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W PDQ E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI
Sbjct: 206 WNHPDQDTE---AGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEI 262
Query: 240 MVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ L K CPWK HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP
Sbjct: 263 VELAKGACPWKEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPE 321
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGFIGG+ + GAL+MARA L
Sbjct: 322 PWRGLRDEALDQVSGIPGCIFVHASGFIGGHHTREGALSMARATL 366
>gi|426372729|ref|XP_004053270.1| PREDICTED: UPF0160 protein MYG1, mitochondrial [Gorilla gorilla
gorilla]
Length = 376
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 166/326 (50%), Positives = 218/326 (66%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPRRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W PDQ E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPTWNHPDQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 222 DLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGACPWKEHLYHLES 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWR+Q V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 282 GLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL+MARA L
Sbjct: 341 IFVHASGFIGGHRTREGALSMARATL 366
>gi|90076778|dbj|BAE88069.1| unnamed protein product [Macaca fascicularis]
Length = 376
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 166/326 (50%), Positives = 219/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPRRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W +PDQ E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPTWNQPDQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V + +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 222 DLVQEEFLQRLDFYQHSWLPARALVEKALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ+V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 282 GLSPPVAIIFVIYTDQAG-QWRVQSVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL MARA L
Sbjct: 341 IFVHASGFIGGHRTREGALNMARATL 366
>gi|53850618|ref|NP_001005545.1| UPF0160 protein MYG1, mitochondrial precursor [Rattus norvegicus]
gi|81884070|sp|Q641W2.1|MYG1_RAT RecName: Full=UPF0160 protein MYG1, mitochondrial; Flags: Precursor
gi|51980347|gb|AAH82112.1| Melanocyte proliferating gene 1 [Rattus norvegicus]
gi|149031933|gb|EDL86845.1| MYG1 protein, isoform CRA_a [Rattus norvegicus]
Length = 381
Score = 317 bits (811), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 217/326 (66%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ NA+IVR+RDP+ L D V+DVGG Y+P Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRNAEIVRTRDPEKLALCDIVVDVGGEYNPQRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGLVY HFG +L+A+ L E V ++ +Y
Sbjct: 106 DHHQRTFTETMSSLCPGKPWQTKLSSAGLVYLHFGHKLLAQLLGTSEEDSVVDTIYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W +PDQ E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVSRLNPTWNQPDQDTE---AGFRRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ D SGEI+ L K CPWK HL+ LE
Sbjct: 222 DLVQEEFLQRLNFYQHSWLPARALVEEALAQRFKVDSSGEIVELAKGGCPWKEHLYHLES 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E+ I +V+Y D G QWRVQ V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 282 ELSPTVAITFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+ + GAL MARA L
Sbjct: 341 IFVHASGFIGGHHTREGALNMARATL 366
>gi|20306404|gb|AAH28501.1| Melanocyte proliferating gene 1 [Mus musculus]
Length = 380
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 218/326 (66%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ NA+IVR+RDP+ L D V+DVGG Y+P + Y
Sbjct: 45 RIGTHNGTFHCDEALACALLRLLPEYANAEIVRTRDPEKLASCDIVVDVGGEYNPQSHRY 104
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGLVY HFG++L+A+ L E V ++ +Y
Sbjct: 105 DHHQRTFTETMSSLCPGKPWQTKLSSAGLVYLHFGRKLLAQLLGTSEEDSVVDTIYDKMY 164
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + P Y T LS+RV +LN W +P+Q E F++ M
Sbjct: 165 ENFVEEVDAVDNGISQW-AEGEPLYAMTTTLSARVARLNPTWNQPNQDTE---AGFRRAM 220
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ D SGEI+ L K CPWK HL+ LE
Sbjct: 221 DLVQEEFLQRLNFYQHSWLPARALVEEALAQRFKVDSSGEIVELAKGGCPWKEHLYHLES 280
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E+ + I +V+Y D G QWRVQ V P F+SR PLP WRGLRD L + +GIPGC
Sbjct: 281 ELSPKVAITFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDKALDQVSGIPGC 339
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+ + GAL MARA L
Sbjct: 340 IFVHASGFIGGHHTREGALNMARATL 365
>gi|354490209|ref|XP_003507252.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Cricetulus
griseus]
gi|344239269|gb|EGV95372.1| UPF0160 protein MYG1, mitochondrial [Cricetulus griseus]
Length = 381
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 171/349 (48%), Positives = 227/349 (65%), Gaps = 13/349 (3%)
Query: 1 MAGTGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDP 60
M G+ S P S+ P R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP
Sbjct: 25 MLGSESVSPPKRPRSNLMAPP--RIGTHNGTFHCDEALACALLRLLPEYKDAEIVRTRDP 82
Query: 61 KVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKE 115
+ L D V+DVGG Y+P YDHHQ+ F E G + TKLSSAGLVY HFG +
Sbjct: 83 EKLASCDIVVDVGGEYNPLRHRYDHHQRTFTETMSSLCPGKPWQTKLSSAGLVYLHFGHK 142
Query: 116 LIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGK 175
L+A+ L E V L+ +Y+NF+E +DA+DNGI+Q++ +P RY T LS+RV +
Sbjct: 143 LLAQLLGTSEEDSIVDTLYDKMYENFVEEVDAVDNGISQWEEGEP-RYALTTTLSARVSR 201
Query: 176 LNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP 235
LN W +P+Q E F++ MDL +EFL + FY SWLPAR++V E +A+R+ D
Sbjct: 202 LNPTWNQPNQDTE---AGFRRAMDLVQEEFLQRLNFYQHSWLPARALVEEALAQRFQVDS 258
Query: 236 SGEIMVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRK 294
SG+I+ L K CPWK HL+ LE E+ I +V+Y D G QWRVQ V P F+SR
Sbjct: 259 SGKIVELAKGGCPWKEHLYHLESELSPPVAITFVIYTDQAG-QWRVQCVPKEPHSFQSRL 317
Query: 295 PLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
PLP WRGLRD+ L + +GIPGC+FVH SGFIGG+++ GAL MARA L
Sbjct: 318 PLPEPWRGLRDEALDQVSGIPGCIFVHASGFIGGHRTREGALNMARATL 366
>gi|402886158|ref|XP_003906505.1| PREDICTED: UPF0160 protein MYG1, mitochondrial [Papio anubis]
Length = 378
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 166/326 (50%), Positives = 218/326 (66%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG Y+P Y
Sbjct: 48 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYEPRRHRY 107
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 108 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 167
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W +PDQ E F++ M
Sbjct: 168 ENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPTWNQPDQDTE---AGFKRAM 223
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 224 DLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 283
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 284 GLSPPVAIIFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 342
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL MARA L
Sbjct: 343 IFVHASGFIGGHRTREGALNMARATL 368
>gi|10444289|gb|AAG17847.1| MYG1 homolog [Homo sapiens]
Length = 376
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 221/345 (64%), Gaps = 11/345 (3%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 27 GPESVPPPKRSRSKLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 86
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 87 SCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQ 146
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L E V L+ +Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN
Sbjct: 147 LLGPSEEDSMVGTLYDKMYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPT 205
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W PDQ E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI
Sbjct: 206 WNHPDQDTE---AGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEI 262
Query: 240 MVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ L K CPWK HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP
Sbjct: 263 VELAKGACPWKEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPE 321
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGFIGG+ + GAL+MARA L
Sbjct: 322 PWRGLRDEALDQVSGIPGCIFVHASGFIGGHPTREGALSMARATL 366
>gi|296439232|sp|Q9HB07.2|MYG1_HUMAN RecName: Full=UPF0160 protein MYG1, mitochondrial; Flags: Precursor
Length = 376
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 220/345 (63%), Gaps = 11/345 (3%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 27 GPESVPPPKRSRSKLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 86
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 87 SCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQ 146
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L E V L+ +Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN
Sbjct: 147 LLGTSEEDSMVGTLYDKMYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPT 205
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W PDQ E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI
Sbjct: 206 WNHPDQDTE---AGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEI 262
Query: 240 MVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ L K CPWK HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP
Sbjct: 263 VELAKGACPWKEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPE 321
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGF GG+ + GAL+MARA L
Sbjct: 322 PWRGLRDEALDQVSGIPGCIFVHASGFTGGHHTREGALSMARATL 366
>gi|341884346|gb|EGT40281.1| hypothetical protein CAEBREN_09431 [Caenorhabditis brenneri]
Length = 340
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/328 (49%), Positives = 215/328 (65%), Gaps = 12/328 (3%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G FHCDEA CFM++ +F + I+R+RDP L+ D V+DVGG++D S YD
Sbjct: 5 IGTHSGKFHCDEAFACFMLKQLPQFKDHSILRTRDPAQLEKCDIVVDVGGIFDHSKQRYD 64
Query: 85 HHQKGFEEVFGH----GFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
HHQ+GF E F TKLSSAGLVY H+GK++I + L V F +Y+
Sbjct: 65 HHQRGFTETMRTLEKLNFDTKLSSAGLVYAHYGKDVINQILGGKVQSTMVDLFFHRLYEQ 124
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F+E+IDAIDNGI+QYD PRY + NLSSR G+ N W EP+ A+ E FQQ M+
Sbjct: 125 FVESIDAIDNGISQYDG--IPRYHSPGNLSSRTGQFNPHWNEPENDAD---ERFQQAMEF 179
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELEEEM 259
G EF+ +V++ W PAR I+ + +R++ D SG I++++ CPWK H F++E E
Sbjct: 180 IGGEFVRSVKYLANVWWPAREIIESAVEKRFENDASGRIILIENGGCPWKEHFFDIEAEK 239
Query: 260 KI-EPLIKYVLYEDDRGKQWRVQAVAVSPDR-FESRKPLPAQWRGLRDDELSKEAGIPGC 317
I + I Y+L+ D WRVQA+ + FE+R PLPA WRGLRD+ELSKE+GIPG
Sbjct: 240 NIADDNITYILFSDSTNASWRVQAIPIDKTAAFENRMPLPAAWRGLRDEELSKESGIPGG 299
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVH+SGFIGGN + GALAMAR AL++
Sbjct: 300 VFVHISGFIGGNLTREGALAMARKALEI 327
>gi|328855248|gb|EGG04376.1| hypothetical protein MELLADRAFT_37531 [Melampsora larici-populina
98AG31]
Length = 355
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 159/341 (46%), Positives = 219/341 (64%), Gaps = 24/341 (7%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G+FH DEAL ++R +++ ++++VR+RDPKVL+ D V+DVGG Y P + YD
Sbjct: 19 IGTHSGTFHADEALAVNLLRSLEEYKSSRLVRTRDPKVLETCDIVVDVGGEYKPDSHRYD 78
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+GF E + TKLSSAGL+YKHFGK++IA L + + L +Y +F+EA
Sbjct: 79 HHQRGFNETYSSSHLTKLSSAGLIYKHFGKQIIATYLKLKSDDGSLPILMAKMYDDFVEA 138
Query: 145 IDAIDNGINQYDTDKP----------PRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
ID IDNGINQY+ P +Y ++T+LS R+ +LN DW EP S + + F
Sbjct: 139 IDGIDNGINQYEAVNPDGKPAEVEVRKKYRSSTSLSDRISRLNPDWNEPSNS-DILDAKF 197
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYD---YDPSGEIMVLKRFCPWKLH 251
+ LAG EF + + +Y +SWLP R +VVE + +R+ DP G ++V ++FCPWK H
Sbjct: 198 EVASKLAGSEFFERLDYYAKSWLPGRDLVVEALKDRFTNDVQDPQGRVLVFEQFCPWKGH 257
Query: 252 LFELEEEM-------KIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
L LE + K+ PL YVLY D+ G WRVQA+ PD FESRK LP WRG+R
Sbjct: 258 LHTLEATLEASSTGEKVLPL--YVLYADESGG-WRVQAIPKVPDGFESRKALPEAWRGIR 314
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
D+ LS+ GIP CVFVH +GFIGGN++ GA+ M + AL L
Sbjct: 315 DEALSQLTGIPDCVFVHAAGFIGGNKTRQGAVEMVKKALAL 355
>gi|432112577|gb|ELK35293.1| UPF0160 protein MYG1, mitochondrial [Myotis davidii]
Length = 379
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 164/326 (50%), Positives = 218/326 (66%), Gaps = 12/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 45 RIGTHNGTFHCDEALACALLRLLPEYQDAEIVRTRDPEKLASCDVVVDVGGEYDPQRHRY 104
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 105 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 164
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ + PRY T LS+RV +LN W +P+Q E F++ M
Sbjct: 165 ENFVEEVDAVDNGISQWEGE--PRYALTTTLSARVARLNPTWNQPNQDTE---AGFERAM 219
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EF+ + FY SW+PAR++V E + +R DPSGEI+ L K CPWK HL+ LE
Sbjct: 220 DLVREEFVQRLDFYQHSWMPARALVEEALTQRLQVDPSGEIIELAKGGCPWKEHLYHLES 279
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E+ I +V+Y D G QWRVQ V F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 280 ELSPPVTIAFVIYTDQAG-QWRVQCVPKELHSFQSRLPLPESWRGLRDEALDQVSGIPGC 338
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
VFVH SGFIGG+++ GAL+MARA L
Sbjct: 339 VFVHASGFIGGHRTREGALSMARATL 364
>gi|242012107|ref|XP_002426782.1| hypothetical protein Phum_PHUM275760 [Pediculus humanus corporis]
gi|212510964|gb|EEB14044.1| hypothetical protein Phum_PHUM275760 [Pediculus humanus corporis]
Length = 557
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 156/328 (47%), Positives = 213/328 (64%), Gaps = 12/328 (3%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
LKR+GTH+G FHCDE L CFM++ ++ N++I+RSR+P+VL+ D V+DVGG+YDPS
Sbjct: 5 LKRIGTHDGVFHCDEVLACFMLKKLPEYQNSEIIRSRNPEVLNTCDIVVDVGGIYDPSKH 64
Query: 82 CYDHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
YDHHQK F + KLSSAGLVY HFG+ ++ L + V LF
Sbjct: 65 RYDHHQKDFNHTMNSLNPDRPWKIKLSSAGLVYHHFGERILESILGPPKDSGTVSNLFSF 124
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y+NF++ +D IDNG+ +D + PRY +TNLSSRV LN W + E + F
Sbjct: 125 MYENFIQEVDGIDNGVLMFDGE--PRYSIHTNLSSRVKYLNPSWN--SEENEDSQKLFHA 180
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
MDL GKEF+D ++F+ +S+ PAR +V E I R SG I+ LK+ CPWK HLF+LE
Sbjct: 181 AMDLVGKEFIDRIQFFTKSFWPARDLVKEAILNRNSTHQSGSIIELKQCCPWKKHLFQLE 240
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+E+ I IKYV+++ D WR++AV ++PD F R PL WRGLRD++LSK +GI
Sbjct: 241 KELGITGQIKYVIFQSD---SWRIEAVPINPDSFVLRLPLHETWRGLRDEKLSKISGIDE 297
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
C+FVH +GFIGGN++ GAL MA L+
Sbjct: 298 CIFVHSNGFIGGNKTREGALKMALKTLE 325
>gi|198414788|ref|XP_002128622.1| PREDICTED: similar to LOC496075 protein [Ciona intestinalis]
Length = 346
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 166/346 (47%), Positives = 229/346 (66%), Gaps = 21/346 (6%)
Query: 13 STSSPSQTPL------KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDL 66
S S P Q + ++GTHNG+FHCDE L C++++L K+ +A+IVR+RD ++L++
Sbjct: 3 SESEPPQKKMCLNGEKVKIGTHNGTFHCDEVLACYLLKLLPKYKDAEIVRTRDTEILNNC 62
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKEL 121
D V+DVGGVYD + YDHHQ+ F + TKLSSAGLVY H+G+E++ K L
Sbjct: 63 DIVVDVGGVYDHDKNRYDHHQRSFSGTMNSIRPDKPWKTKLSSAGLVYCHYGEEIL-KLL 121
Query: 122 NVDEGHPD--VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
+EG + V ++ VY+ F+ IDAIDNG++Q+D + RY +TN+SSRVG LN
Sbjct: 122 LGEEGSDEKIVSVIYDKVYEKFVHEIDAIDNGVDQFDGE--ARYHISTNISSRVGHLNPS 179
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
W E ++ EN++F+ M + GKEF D + YV SWLPAR IVV+ I +R+ G+I
Sbjct: 180 WNEKRKN---ENKSFEVAMAMVGKEFEDRIHGYVTSWLPARDIVVKAIKQRHQISDQGQI 236
Query: 240 MVLKRFCPWKLHLFELE-EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
++L ++CPWK HLF LE EE IKYVLYED+ ++WR+Q V + FE+R + A
Sbjct: 237 LLLDQYCPWKEHLFTLEDEEFVPNGEIKYVLYEDN-AQKWRIQCVPAGRNTFENRLSILA 295
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+WRGLRD+ELSK +GIPGCVFVH SGFIGGN + G L MA L+
Sbjct: 296 EWRGLRDEELSKLSGIPGCVFVHASGFIGGNATRDGVLQMAMKCLE 341
>gi|426224388|ref|XP_004006353.1| PREDICTED: UPF0160 protein MYG1, mitochondrial [Ovis aries]
Length = 381
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 168/346 (48%), Positives = 227/346 (65%), Gaps = 17/346 (4%)
Query: 10 PAYSTSSPSQTPLK------RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL 63
P+ PS+ P R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 26 PSLEPVPPSKRPRSHLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKL 85
Query: 64 DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIA 118
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A
Sbjct: 86 AACDIVVDVGGEYDPQRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLA 145
Query: 119 KELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNL 178
+ L E V L+ +Y+NF+E +DA+DNGI+Q + + PRY+ T LS+RV +LN
Sbjct: 146 QLLGASEEDGMVGTLYDKMYENFVEEVDAVDNGISQCEEGE-PRYLLTTTLSARVARLNP 204
Query: 179 DWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGE 238
W +P+Q E F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGE
Sbjct: 205 TWNQPNQDTE---AGFKRAMDLVREEFLQRLDFYQNSWLPARALVEEALAKRFQVDPSGE 261
Query: 239 IMVLKR-FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLP 297
I+ L++ CPWK HL++LE + I +V+Y D G QWRVQ V P F+SR PL
Sbjct: 262 IIELEKGGCPWKEHLYQLESGLSPAGTIAFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLL 320
Query: 298 AQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGFIGG+++ GAL+MARA L
Sbjct: 321 EPWRGLRDEALDQISGIPGCIFVHASGFIGGHRTREGALSMARATL 366
>gi|440900681|gb|ELR51760.1| UPF0160 protein MYG1, mitochondrial [Bos grunniens mutus]
Length = 381
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 164/326 (50%), Positives = 221/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYREAEIVRTRDPEKLAACDIVVDVGGEYDPQRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDGMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ + PRY+ T LS+RV +LN W +P+Q E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQWEEGE-PRYLLTTTLSARVARLNPTWNQPNQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L++ CPWK HL++LE
Sbjct: 222 DLVREEFLQRLDFYQNSWLPARTLVEEALAKRFQVDPSGEIIELEKGGCPWKEHLYQLEL 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V P F+SR PL WRGLRD+ L + +GIPGC
Sbjct: 282 GLSPAGTIAFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLLEPWRGLRDEALDQISGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL+MARA L
Sbjct: 341 IFVHASGFIGGHRTREGALSMARATL 366
>gi|75057702|sp|Q58DG1.1|MYG1_BOVIN RecName: Full=UPF0160 protein MYG1, mitochondrial; Flags: Precursor
gi|61553813|gb|AAX46463.1| MYG1 protein [Bos taurus]
gi|61553946|gb|AAX46483.1| MYG1 protein [Bos taurus]
gi|84708745|gb|AAI11154.1| MYG1 protein [Bos taurus]
gi|296487908|tpg|DAA30021.1| TPA: MYG1 protein precursor [Bos taurus]
Length = 381
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 164/326 (50%), Positives = 221/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYREAEIVRTRDPEKLAACDIVVDVGGEYDPQRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDGMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ + PRY+ T LS+RV +LN W +P+Q E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQWEEGE-PRYLLTTTLSARVARLNPTWNQPNQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L++ CPWK HL++LE
Sbjct: 222 DLVREEFLQRLDFYQNSWLPARTLVEEALAKRFQVDPSGEIIELEKGGCPWKEHLYQLEL 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V P F+SR PL WRGLRD+ L + +GIPGC
Sbjct: 282 GLSPAGTIAFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLLEPWRGLRDEALDQISGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL+MARA L
Sbjct: 341 IFVHASGFIGGHRTREGALSMARATL 366
>gi|78042530|ref|NP_001030210.1| UPF0160 protein MYG1, mitochondrial precursor [Bos taurus]
gi|61554766|gb|AAX46611.1| MYG1 protein [Bos taurus]
Length = 381
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 220/326 (67%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYREAEIVRTRDPEKLAACDIVVDVGGEYDPQRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDGMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ + PRY+ T LS+RV +LN W +P+Q E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQWEEGE-PRYLLTTTLSARVARLNPTWNQPNQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL++LE
Sbjct: 222 DLVREEFLQRLDFYQNSWLPARTLVEEALAKRFQVDPSGEIIELGKGGCPWKEHLYQLEL 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V P F+SR PL WRGLRD+ L + +GIPGC
Sbjct: 282 GLSPAGTIAFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLLEPWRGLRDEALDQISGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL+MARA L
Sbjct: 341 IFVHASGFIGGHRTREGALSMARATL 366
>gi|348580619|ref|XP_003476076.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Cavia
porcellus]
Length = 375
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 164/326 (50%), Positives = 220/326 (67%), Gaps = 12/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYQDAEIVRTRDPEKLASCDIVVDVGGEYDPQRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L +E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTNEEDSMVGTLYDKIY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ + PRY TNLS+RV +LN W +P+Q E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQWEGE--PRYALTTNLSARVARLNPTWNQPNQDTE---AGFRRAM 220
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSG+I+ L K CPWK HL+ LE
Sbjct: 221 DLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGKIVELAKGGCPWKEHLYHLES 280
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V F+SR PLP WRGLRD+ L + +GIPG
Sbjct: 281 VLSPPGTIAFVIYTDQAG-QWRVQCVPKELHSFQSRLPLPEPWRGLRDEALDQVSGIPGG 339
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL+MARA L
Sbjct: 340 IFVHASGFIGGHRTREGALSMARATL 365
>gi|324509649|gb|ADY44051.1| Unknown [Ascaris suum]
Length = 361
Score = 312 bits (799), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 158/335 (47%), Positives = 213/335 (63%), Gaps = 12/335 (3%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
S + + +GTHNG FHCDE CFM++ +F + I+R+RDP L + V+DVGGVYD
Sbjct: 23 SASGMPSIGTHNGKFHCDEVFACFMLKSLPEFSHYDIIRTRDPSTLSNCSIVVDVGGVYD 82
Query: 78 PSNDCYDHHQKGFEEVFGH----GFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
YDHHQ+GF + F TKLSSAGL+Y H+GK +IA+ L + + +V L
Sbjct: 83 HDKLRYDHHQRGFNDTMKTLNVLDFETKLSSAGLIYAHYGKRVIAELLALRDDSTEVDIL 142
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ VY+ F+EA+DAIDNGI Q+ D PRY LSSRVG LN W E D E+
Sbjct: 143 YKKVYEAFVEAVDAIDNGIPQF--DGVPRYHLGGTLSSRVGSLNPAWNEEDIDIEKR--- 197
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHL 252
F + M L G EFLD ++++ RSWLPAR +V EC+ R+D D SG+I+ L K PWK H
Sbjct: 198 FHEAMKLVGVEFLDRLKYFHRSWLPARDVVAECVKNRFDVDKSGQILALDKGGVPWKEHF 257
Query: 253 FELEEEMK-IEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSK 310
F LE+E + I Y+++ D WRVQA+ + FE+R PLP WRG R+DELS+
Sbjct: 258 FTLEKEHNLLNAQITYIVFADATSGDWRVQAIPLDEKATFENRLPLPESWRGYRNDELSE 317
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
AGIP C+F HM+GFIGGN++ GA+ MA+ +L++
Sbjct: 318 LAGIPSCIFTHMTGFIGGNRTRDGAVEMAKRSLQI 352
>gi|443723954|gb|ELU12172.1| hypothetical protein CAPTEDRAFT_225005 [Capitella teleta]
Length = 327
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 162/327 (49%), Positives = 211/327 (64%), Gaps = 10/327 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG+FHCDE L CFM++ + +A+I+R+RDPK+L+ D V+DVGGVYD + Y
Sbjct: 2 KIGTHNGAFHCDEVLACFMLKELPIYKDAEIIRTRDPKLLETCDIVVDVGGVYDHAAKRY 61
Query: 84 DHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E ++TKLSSAGLVY H+G++LIA+ L E ++ +Y
Sbjct: 62 DHHQRSFAESMSSLSPEKKWTTKLSSAGLVYLHYGRQLIAQMLGSKEEDKVTGTIYDKIY 121
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+ F+E +DAIDNGI+Q D + PRY ++NLSSRVG LN W + D S E E F M
Sbjct: 122 ETFVEEVDAIDNGISQCDGE--PRYSISSNLSSRVGYLNPRWND-DSSPEMEKHRFNSAM 178
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEE 257
L G EF D V FY SWLPARSIV + R+ D SGEI+ L+ CPWK HL LE
Sbjct: 179 RLVGTEFKDRVDFYCHSWLPARSIVEHALKSRHQVDSSGEIIKLESGGCPWKDHLHALEA 238
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
EM + P +K+ LY D K WRVQ V FE+R L W G+RD +LS+ +GI GC
Sbjct: 239 EMGLNPQVKFTLYADSNNK-WRVQCVPKVLGGFENRLSLLEAWCGIRDAQLSELSGIDGC 297
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALK 344
+FVH +GFIGGN++Y GAL MAR L+
Sbjct: 298 IFVHTNGFIGGNETYKGALEMARKTLE 324
>gi|25150283|ref|NP_741610.1| Protein C27H6.8 [Caenorhabditis elegans]
gi|14195536|sp|O17606.2|YK4P_CAEEL RecName: Full=UPF0160 protein C27H6.8
gi|3874563|emb|CAB02797.1| Protein C27H6.8 [Caenorhabditis elegans]
Length = 340
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 156/328 (47%), Positives = 215/328 (65%), Gaps = 12/328 (3%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G FHCDEA CFM++ +F + I+R+RD L+ D ++DVGG++D S YD
Sbjct: 5 IGTHSGKFHCDEAFACFMLKQLPQFKDHSILRTRDAAQLEKCDIIVDVGGIFDHSKQRYD 64
Query: 85 HHQKGFEEVFGH----GFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
HHQ+GF + F TKLSSAGLVY H+G+E+I + L + + + +Y+
Sbjct: 65 HHQRGFTDTMRTLEKLNFDTKLSSAGLVYAHYGREVINQILGGNVSSSMIDLFYHRLYEQ 124
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F+E+IDAIDNGI+QYD PRY ++ NLSSR G+ N W EP+ A+ E FQQ M
Sbjct: 125 FVESIDAIDNGISQYDG--VPRYHSSGNLSSRTGQFNSHWNEPENDAD---ERFQQAMQF 179
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELEEEM 259
G+EF +V++ W PAR I+ + +R++ D SG I++++ CPWK H F++E E
Sbjct: 180 IGEEFSRSVKYLANVWWPAREIIEAAVDKRFEIDASGRIILIENGGCPWKEHFFDIEVEK 239
Query: 260 KI-EPLIKYVLYEDDRGKQWRVQAVAVSP-DRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
I + I Y+L+ D WRVQA+ V FE+R PLPA WRGLRDD+LSKE+GIPG
Sbjct: 240 NIADDNITYILFSDSTNASWRVQAIPVDKMSSFENRMPLPAAWRGLRDDDLSKESGIPGG 299
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVH+SGFIGGN + GA+AMAR AL++
Sbjct: 300 VFVHISGFIGGNLTREGAIAMARKALEI 327
>gi|340374057|ref|XP_003385555.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Amphimedon
queenslandica]
Length = 341
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 215/330 (65%), Gaps = 14/330 (4%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++GTHNG FHCDE L C+M+++ ++ A IVRSRDP+VLD D V+DVG VYD
Sbjct: 15 IPKIGTHNGKFHCDEVLACYMLKVLPEYQEAAIVRSRDPQVLDKCDIVVDVGAVYDHDKL 74
Query: 82 CYDHHQKGFEEVFGHGFS-------TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
YDHHQ+GF E H S TKLSSAGL+Y H+G+ +++ G ++ L+
Sbjct: 75 RYDHHQRGFNETM-HSLSKSTKPWTTKLSSAGLIYYHYGERVLSALTGFSIGSSELASLY 133
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
AVY NF+E IDA+DNGI D + P+Y + + RV +LN W + D + +E F
Sbjct: 134 DAVYNNFIEEIDAVDNGIQ--DREGIPKYKITSTIGYRVSRLNSRWNDTDH---KPDERF 188
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
+ M+ G+EF D ++ Y SWLPARSIV+ I +R++ D SGE++ L+++CPW HL
Sbjct: 189 KYAMERVGEEFNDIIQHYKDSWLPARSIVLSAIEKRHEVDASGEVINLEQYCPWVEHLLM 248
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
LE +M ++ IK+VLY ++ GK WRVQAV ++ F +R L +WRGLRD ELS +GI
Sbjct: 249 LESQMNLKKTIKFVLYPEESGK-WRVQAVPLATGSFVNRVSLLEEWRGLRDQELSDLSGI 307
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALK 344
PGCVFVH +GFIGGN + GAL+MA L+
Sbjct: 308 PGCVFVHANGFIGGNSTLDGALSMAMITLQ 337
>gi|345567883|gb|EGX50785.1| hypothetical protein AOL_s00054g871 [Arthrobotrys oligospora ATCC
24927]
Length = 343
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 156/323 (48%), Positives = 212/323 (65%), Gaps = 6/323 (1%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTHNG+FH DEAL +++RLT + ++ +VRSRDP L+ D ++DV GV D +
Sbjct: 23 KTIGTHNGAFHADEALAVYLLRLTPTYKDSPVVRSRDPPTLEACDIIVDVTGVCDHTKH- 81
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+ F E F T+LSSAGL+YK+FGKE+I+ + ++D P V L+ +Y F+
Sbjct: 82 FDHHQRTFTETFSDKHVTRLSSAGLIYKYFGKEIISLKTDIDVNDPRVEILYQKLYTEFI 141
Query: 143 EAIDAIDNGINQYDTDK--PPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
EAIDA DNGI+ + T P L S VG LN W E + E N F++ L
Sbjct: 142 EAIDANDNGISAFPTSAGAPAFSEKGITLPSMVGSLNPRWNET-ITDEISNSQFEKASKL 200
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G F+D + +Y ++WLPAR IVV + +R++ DPSG+I+ L CPWK HLF +EEE
Sbjct: 201 MGDVFVDKLDYYSKAWLPAREIVVNAVKKRFETDPSGKIVNLGISCPWKEHLFSVEEEEG 260
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
I+ Y LY D GK WR+QAV++S D FE+RK LP +WRG+RD++LS GI GCVFV
Sbjct: 261 IKGETLYALYSD--GKGWRIQAVSISKDSFENRKGLPEKWRGVRDEDLSTLTGIDGCVFV 318
Query: 321 HMSGFIGGNQSYGGALAMARAAL 343
H SGFIGGN++Y GAL MA+ A+
Sbjct: 319 HASGFIGGNKTYEGALEMAKRAV 341
>gi|403296788|ref|XP_003939277.1| PREDICTED: UPF0160 protein MYG1, mitochondrial [Saimiri boliviensis
boliviensis]
Length = 376
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 164/326 (50%), Positives = 218/326 (66%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPQRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLCPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEDDSVVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W++P+Q E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQW-AEGEPRYAVTTTLSARVARLNPTWSQPNQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
+L +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 222 NLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ+V F+SR PLP WRGLRD L + +GIPGC
Sbjct: 282 GLSPPVAIMFVIYTDQAG-QWRVQSVPKELHSFQSRLPLPEPWRGLRDKALDQVSGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+ + GAL+MARA L
Sbjct: 341 IFVHASGFIGGHCTREGALSMARATL 366
>gi|417399943|gb|JAA46952.1| Putative metal-binding protein [Desmodus rotundus]
Length = 380
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 163/326 (50%), Positives = 217/326 (66%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP +
Sbjct: 45 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPQRQRF 104
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 105 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 164
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q++ + PRY T LS+RV +LN W +P+Q E F++ M
Sbjct: 165 ENFVEEVDAVDNGISQWEEGE-PRYAVTTTLSARVARLNPTWNQPNQDTEAR---FKRAM 220
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 221 DLVREEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIIELAKGGCPWKEHLYHLES 280
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 281 GLSPPVTITFVIYTDQAG-QWRVQCVPKELHSFQSRLPLPESWRGLRDEALDQVSGIPGC 339
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+ + GAL+MA A L
Sbjct: 340 IFVHASGFIGGHHTREGALSMACATL 365
>gi|431921625|gb|ELK18977.1| UPF0160 protein MYG1, mitochondrial [Pteropus alecto]
Length = 383
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 167/344 (48%), Positives = 222/344 (64%), Gaps = 11/344 (3%)
Query: 6 VSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD 65
+ S P + S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 30 LESVPPFKRPRSSLMTPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLAA 89
Query: 66 LDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKE 120
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 90 CDIVVDVGGEYDPQRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQL 149
Query: 121 LNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW 180
L E V L+ +Y+NF+E +DA+DNGI+Q++ K PRY T LS+RV +LN W
Sbjct: 150 LGTSEEDSMVATLYDKMYENFVEEVDAVDNGISQWEEGK-PRYALTTTLSARVARLNPTW 208
Query: 181 TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
+P+Q E F++ MD+ +EFL + FY SWLPAR++V E +A+R DPSGEI+
Sbjct: 209 NQPNQDTE---AGFKRAMDMVREEFLQRLDFYQHSWLPARALVEEALAQRLQVDPSGEII 265
Query: 241 VL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQ 299
L K CPWK HL+ LE + I +V+Y D G QWRVQ V F+SR PLP
Sbjct: 266 ELAKGGCPWKEHLYYLESGLLPPVTIAFVVYTDQAG-QWRVQCVPKELHSFQSRLPLPEP 324
Query: 300 WRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
W GLRD+ L + +GIPGC+FVH SGFIGG+++ GAL+MARA L
Sbjct: 325 WWGLRDEALDQVSGIPGCIFVHASGFIGGHRTREGALSMARATL 368
>gi|343428083|emb|CBQ71607.1| related to Gamm1 protein / Ni-binding urease accessory protein
(UreG) [Sporisorium reilianum SRZ2]
Length = 602
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 157/321 (48%), Positives = 209/321 (65%), Gaps = 10/321 (3%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G+FH DEAL ++R +F A + R+RD +D V+DVG YDP+ YDHH
Sbjct: 21 THSGTFHADEALAVHLLRSLPRFATAPLTRTRDAATIDSGSIVVDVGATYDPAAHRYDHH 80
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVD--EGHPDVHRLFLAVYKNFMEA 144
Q+GFEEVF STKLSSAGLV+KHFG+E++A L +D V L+L +Y +F+EA
Sbjct: 81 QRGFEEVFDANHSTKLSSAGLVWKHFGQEIVATHLQLDATANKETVDLLWLKLYDDFVEA 140
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
ID IDNG++QY +D P+Y + T+LS+RVG +N W E +AE + F++ +AGKE
Sbjct: 141 IDGIDNGVSQYPSDLKPKYKSRTDLSARVGYMNPSWNEKSDNAELDAR-FEKASAMAGKE 199
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F + + + +WLPAR IVV+ + +R +++V F WK HLF LE+ + I P
Sbjct: 200 FFERLDYTYSAWLPARQIVVDALNKR----THPQLLVFDEFASWKDHLFTLEKHLNIAPT 255
Query: 265 IK--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+ YV+Y D+ GK WRVQAV VSP+ F SRK LP WRG+RD LS GIPGC+FVH
Sbjct: 256 ERPIYVVYPDESGK-WRVQAVPVSPESFVSRKALPEPWRGIRDQALSDLTGIPGCIFVHQ 314
Query: 323 SGFIGGNQSYGGALAMARAAL 343
SGFIGGN + GAL MAR L
Sbjct: 315 SGFIGGNATKEGALKMARDGL 335
>gi|388857993|emb|CCF48438.1| related to Gamm1 protein / Ni-binding urease accessory protein
(UreG) [Ustilago hordei]
Length = 607
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 155/320 (48%), Positives = 212/320 (66%), Gaps = 9/320 (2%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G+FH DEAL ++R +F +A + R+RD ++D V+DVG Y+P YDHH
Sbjct: 20 THSGTFHADEALAVNLLRTLPRFSSAPLTRTRDTTIIDSGTIVVDVGATYNPITHRYDHH 79
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+GFE+VF STKLSSAGLV+KHFGKE+++ L V + P V L+ +Y +F+EAID
Sbjct: 80 QRGFEQVFSPEHSTKLSSAGLVWKHFGKEIVSTHLQVQDA-PTVDLLWKKLYDDFIEAID 138
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
IDNGI+QY +D P+Y + T+LS+RV +N W E AE + F++ +AG EF
Sbjct: 139 GIDNGISQYPSDLKPKYKSRTDLSARVAYMNPSWNESWSDAESDAR-FEKASLMAGAEFF 197
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+++ +SWLPAR IVV+ + +R +++V F WK HLF+LE+E+ I P +
Sbjct: 198 QRLQYAYKSWLPARQIVVQALGKR----EVAQVLVFDEFASWKDHLFDLEKELNIAPTER 253
Query: 267 --YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
YV+Y D+ GK WR+QAV VSP+ F SRK LP WRG+RD +LS+ GI GC+FVH SG
Sbjct: 254 PIYVVYPDESGK-WRIQAVPVSPESFVSRKALPEPWRGIRDQQLSELTGIEGCIFVHQSG 312
Query: 325 FIGGNQSYGGALAMARAALK 344
FIGGN + GAL MA ALK
Sbjct: 313 FIGGNATKQGALKMASDALK 332
>gi|308464349|ref|XP_003094442.1| hypothetical protein CRE_06052 [Caenorhabditis remanei]
gi|308247761|gb|EFO91713.1| hypothetical protein CRE_06052 [Caenorhabditis remanei]
Length = 343
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 156/328 (47%), Positives = 216/328 (65%), Gaps = 12/328 (3%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G FHCDEA CFM++ KF I+R+RD L+ D V+DVGGV+D S YD
Sbjct: 5 IGTHSGKFHCDEAFACFMLKQLPKFKEHAILRTRDASQLEKCDIVVDVGGVFDHSKQRYD 64
Query: 85 HHQKGFEEVFGH----GFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
HHQ+GF E F TKLSSAGLVY H+GK++I + L + V + +Y+
Sbjct: 65 HHQRGFTETMRTLEKLNFDTKLSSAGLVYAHYGKDVINQILGGNVTPSMVDLFYHRLYEQ 124
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F+E+IDAIDNGI QYD PRY + NLS+R G+ N W EP+ A+ E F++ M+
Sbjct: 125 FVESIDAIDNGIAQYDG--VPRYHSPGNLSARTGQFNAHWNEPENDAD---ERFEKAMEF 179
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELEEEM 259
G+EF+ +V++ W PAR I+ + + +R++ D SG I++++ CPWK H F++E +
Sbjct: 180 IGEEFVRSVKYLANVWWPAREIIEKAVEKRFENDASGRIILIENGGCPWKEHFFDIEAQK 239
Query: 260 KI-EPLIKYVLYEDDRGKQWRVQAV-AVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
I + + Y+L+ D WRVQA+ + FE+R+PLPA WRGLRDD+LSKE+GIPG
Sbjct: 240 NIRDDNVTYILFSDSTNASWRVQAIPSDKTSSFENRQPLPAAWRGLRDDDLSKESGIPGG 299
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVH+SGFIGGN + GALAMAR AL++
Sbjct: 300 VFVHISGFIGGNLTRDGALAMARKALEI 327
>gi|331212333|ref|XP_003307436.1| urease accessory protein [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|309297839|gb|EFP74430.1| urease accessory protein [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 354
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 159/343 (46%), Positives = 221/343 (64%), Gaps = 23/343 (6%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
++PL +GTH+G+FH DEAL ++R +KF ++++VR+RDP VL+ D V+DVGG YDP
Sbjct: 17 ESPL--IGTHSGTFHADEALAVSLLRSLEKFKSSRLVRTRDPAVLETCDIVVDVGGQYDP 74
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
+ YDHHQ+ F E + TKLSSAGL+YKHFG E++A L + +P V L +Y
Sbjct: 75 ARHRYDHHQREFSETYSGDHRTKLSSAGLIYKHFGSEIVASHLGLPTDNPTVPILVAKLY 134
Query: 139 KNFMEAIDAIDNGINQYDT---DKPP-------RYVNNTNLSSRVGKLNLDWTEPDQSAE 188
F+EAIDAIDNGI +Y+T D P RY ++T+LSSR+ LN W EP +
Sbjct: 135 DGFIEAIDAIDNGIERYETVDQDGKPTNLPVKLRYQSHTDLSSRISHLNPAWNEPVNNTI 194
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYD---YDPSGEIMVLKRF 245
+++ F++ LAG EF + ++ ++WLP R +VV+ + R+ D G +++ F
Sbjct: 195 LDSQ-FEKASSLAGSEFFSRLDYFSKAWLPGRELVVKALENRFTNQAQDKFGRVLIFDSF 253
Query: 246 CPWKLHLFELEEEMKIE----PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
CPWK HL LE+ E PL YVLY D+ + WR+QA+ V+P FESRK LP WR
Sbjct: 254 CPWKDHLHTLEKTTLTESSERPL--YVLYADE-SQNWRIQAIPVTPGGFESRKALPEAWR 310
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
G+RD LS+ GIPG +FVH SGFIGGNQ++ GAL MA+ AL+
Sbjct: 311 GVRDAALSELIGIPGSIFVHASGFIGGNQTFEGALKMAQKALE 353
>gi|335287845|ref|XP_003126234.2| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Sus scrofa]
Length = 384
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 164/328 (50%), Positives = 218/328 (66%), Gaps = 11/328 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 49 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPQRHRY 108
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 109 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSVVGILYDKMY 168
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI Q++ + PRY T LS+RV +LN W +P+Q E F++ M
Sbjct: 169 ENFVEEVDAVDNGIAQWEEGE-PRYALTTTLSARVARLNPTWNQPNQDTE---AGFKRAM 224
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ D SG+I+ L K CPWK HL+ LE
Sbjct: 225 DLVREEFLQRLDFYQHSWLPARALVEEALAQRFQVDASGKIIELAKGGCPWKEHLYHLEL 284
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
++ I +V+Y D G QWRVQ V F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 285 DLSPPVAISFVIYTDQAG-QWRVQCVPKELHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 343
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVH SGFIGG+++ GAL+MAR L L
Sbjct: 344 VFVHASGFIGGHRTREGALSMARTTLAL 371
>gi|296211812|ref|XP_002807154.1| PREDICTED: LOW QUALITY PROTEIN: UPF0160 protein MYG1, mitochondrial
[Callithrix jacchus]
Length = 376
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 164/326 (50%), Positives = 216/326 (66%), Gaps = 11/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPERHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W++P+Q E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQW-AEGEPRYAVTTTLSARVARLNPTWSQPNQDTE---AGFKRAM 221
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
+L +EFL + FY SWLPAR +V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 222 NLVQEEFLQRLDFYQHSWLPARVLVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 281
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V F+SR PLP WRGLRD L + +GIPGC
Sbjct: 282 GLSPPVAIIFVIYTDQAG-QWRVQCVPKELHSFQSRLPLPEPWRGLRDKALDQVSGIPGC 340
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+ + GAL+MARA L
Sbjct: 341 IFVHASGFIGGHCTREGALSMARATL 366
>gi|299472230|emb|CBN77200.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 380
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 162/329 (49%), Positives = 213/329 (64%), Gaps = 7/329 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+GSFHCDEAL C M+++ ++ +A IVR+RD L DAV+DVGGVYDP+
Sbjct: 47 KCIGTHSGSFHCDEALACAMLKILPEWSDAVIVRTRDEAELAKCDAVVDVGGVYDPTTLR 106
Query: 83 YDHHQKGFEEVFG-HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV--HRLFLAVYK 139
+DHHQK F + F T+LSSAGLVY+H+G++++ + + DV +L+ VYK
Sbjct: 107 FDHHQKTFHDTLSEENFQTRLSSAGLVYRHYGRQILKQLVEGTSVPEDVVDKKLYAKVYK 166
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD--QSAERENEAFQQG 197
NF+E ID IDNG++ Y +++LS+RVG+LN W EP + + N F++
Sbjct: 167 NFVEHIDGIDNGVD-VAKGADLSYDVSSHLSARVGQLNPSWNEPSDPEGPDGPNARFKKA 225
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
M L G E V+ +SW P R+IV E +A R + PSGEIMVL FCPWK HL ELEE
Sbjct: 226 MSLTGGELCRYVQGLAKSWWPGRAIVAESLARRKEDHPSGEIMVLTGFCPWKGHLLELEE 285
Query: 258 EMKIEPLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
EM I +KYVLY E D +WR+QAV VS F R PL W GLRD ELS+ +GI G
Sbjct: 286 EMGIAGELKYVLYAEGDPAGKWRIQAVPVSEGSFALRLPLAKPWLGLRDKELSEASGIEG 345
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVH++GFIGGN S GA+AMA +L+L
Sbjct: 346 GVFVHVNGFIGGNASKAGAMAMASKSLEL 374
>gi|66827877|ref|XP_647293.1| metal-dependent protein hydrolase domain-containing protein
[Dictyostelium discoideum AX4]
gi|74859468|sp|Q55G91.1|U160_DICDI RecName: Full=UPF0160 protein
gi|60475402|gb|EAL73337.1| metal-dependent protein hydrolase domain-containing protein
[Dictyostelium discoideum AX4]
Length = 329
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 212/319 (66%), Gaps = 4/319 (1%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+GSFH DEAL C++++L + +++I+RSRD V++ +DVG VY+ +D
Sbjct: 6 ICTHSGSFHADEALACYLLKLLPTYKDSKIIRSRDKSVIEKSTVAVDVGAVYNFEKLRFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ GF E F KLSSAGL+YKH+GK++I + L+ ++ ++ L+ +Y + ++
Sbjct: 66 HHQSGFTETFDDKHDIKLSSAGLIYKHYGKDIIKQRLDTNDSITEL--LYQKLYDSMIQE 123
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG+ +Y +D PRY + +++S+RVG LN W EP Q E N+ F++ M+L G+
Sbjct: 124 LDGVDNGVERYPSDIKPRYQSGSSISARVGHLNQGWNEP-QDDEIVNKQFEKAMELMGQY 182
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
FLD + +Y +SWLP RSIV + R SGEI++L FCPWK HLF LE+E I+
Sbjct: 183 FLDRLDYYGKSWLPCRSIVENALENRKQTHSSGEILILDMFCPWKDHLFSLEQEKDIKTP 242
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
IK+VL+ED G QWRV AV ++ F R PLP +WRG RD+ELS+ +GI GCVF H +G
Sbjct: 243 IKFVLFEDTSG-QWRVSAVGINLHSFTLRLPLPEEWRGKRDEELSQISGIEGCVFAHANG 301
Query: 325 FIGGNQSYGGALAMARAAL 343
FIGGN++ GAL MA L
Sbjct: 302 FIGGNKTREGALLMAIKTL 320
>gi|332206061|ref|XP_003252108.1| PREDICTED: LOW QUALITY PROTEIN: UPF0160 protein MYG1, mitochondrial
[Nomascus leucogenys]
Length = 375
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 216/326 (66%), Gaps = 12/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPRRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMY 165
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E +DA+DNGI+Q+ + PRY T LS+RV + W PDQ E F++ M
Sbjct: 166 ENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVAH-DPTWNHPDQDTE---AGFKRAM 220
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEE 257
DL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE
Sbjct: 221 DLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGGCPWKEHLYHLES 280
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+Y D G QWRVQ V P F+SR PLP WRGLRD+ L + +GIPGC
Sbjct: 281 GLSPPVAIFFVIYTDQAG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGC 339
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+FVH SGFIGG+++ GAL+MARA L
Sbjct: 340 IFVHASGFIGGHRTREGALSMARATL 365
>gi|71024677|ref|XP_762568.1| hypothetical protein UM06421.1 [Ustilago maydis 521]
gi|46101961|gb|EAK87194.1| hypothetical protein UM06421.1 [Ustilago maydis 521]
Length = 605
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 155/321 (48%), Positives = 214/321 (66%), Gaps = 10/321 (3%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G+FH DEAL ++R +F +A++ R+RD ++ V+DVG YD +N YDHH
Sbjct: 21 THSGTFHADEALAVNLLRKLPRFASARLTRTRDAATIESGSIVVDVGATYDAANHRYDHH 80
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD--VHRLFLAVYKNFMEA 144
Q+GFEEVF STKLSSAGLV+KHFG+E+IA L +D V L+L +Y +F+EA
Sbjct: 81 QRGFEEVFDADHSTKLSSAGLVWKHFGQEIIATHLQLDAAAQKEVVDLLWLKLYDDFVEA 140
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
ID IDNGI+QY +D P+Y + T+LS+RVG LN W + +AE + F++ +AG E
Sbjct: 141 IDGIDNGISQYPSDLKPKYKSRTDLSARVGYLNPSWNDKSDNAELDAR-FEKASAMAGNE 199
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F + + + +WLPAR +VV+ + +R +++V ++ WK HLFELE+ + I P
Sbjct: 200 FFERLDYTFNAWLPARQVVVDALNKR----SHPQLLVFDQYASWKDHLFELEKNLNIAPT 255
Query: 265 IK--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+ YV+Y D+ GK WRVQAV VS + F SRK LP WRG+RD +LS+ GIPGC+FVH
Sbjct: 256 ERPIYVVYPDESGK-WRVQAVPVSAESFVSRKALPEPWRGIRDQQLSELTGIPGCIFVHQ 314
Query: 323 SGFIGGNQSYGGALAMARAAL 343
SGFIGGN + GAL MA+ AL
Sbjct: 315 SGFIGGNATQEGALRMAKDAL 335
>gi|358058084|dbj|GAA96063.1| hypothetical protein E5Q_02724 [Mixia osmundae IAM 14324]
Length = 345
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 160/330 (48%), Positives = 207/330 (62%), Gaps = 11/330 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + TH+G+FH DEAL ++R + +A++VR+RDP ++D D VLDVGGVY
Sbjct: 16 KLICTHSGTFHADEALAVHLVRTLPDYKDARLVRTRDPAIIDMADIVLDVGGVYSVERQR 75
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF F TKLSSAGLVYKH+GK++IA+ L + E V L+ VY++F+
Sbjct: 76 FDHHQRGFAATFDSSHRTKLSSAGLVYKHYGKQIIAQRLGLQEADKTVSTLYPKVYEDFV 135
Query: 143 EAIDAIDNGINQYDT----DKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
E +D IDNGINQY+ D Y N T LSSR+ +LN W E + + +E F+
Sbjct: 136 EGLDGIDNGINQYEAVLGGDVRSNYRNFTGLSSRISRLNPSWNETSNN-DILDERFETAS 194
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDY---DPSGEIMVLKRFCPWKLHLFEL 255
LAG EF + WLP RS++VE + ER + DP G IM+L F PWK HLF L
Sbjct: 195 ALAGSEFFQVLDAAAHQWLPGRSLIVEALFERCKFDGADPHGRIMLLGSFAPWKDHLFRL 254
Query: 256 EEEMKIEP--LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
E E+ IE + YV+Y D+ GK WRVQAV S + FESRK LPA WRG+RD EL G
Sbjct: 255 ERELSIEDDRKVLYVIYPDEAGK-WRVQAVPRSSEGFESRKALPAAWRGIRDKELDALVG 313
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
I GCVFVH +GFIGG+ + GAL MA A+
Sbjct: 314 IEGCVFVHAAGFIGGHATREGALIMANKAI 343
>gi|321460865|gb|EFX71903.1| hypothetical protein DAPPUDRAFT_308656 [Daphnia pulex]
Length = 362
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 213/332 (64%), Gaps = 13/332 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G FHCDE L C M++L ++ +A + R+R+ +LD D V+DVGGV+DPS
Sbjct: 34 KVIGTHDGKFHCDEVLACAMLKLLPQYADALVKRTRNATILDTCDIVVDVGGVFDPSVHR 93
Query: 83 YDHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
YDHHQ+ F+E F + T+LSSAGLVY HFG+E+I++ LN+ V + V
Sbjct: 94 YDHHQRSFQESFSTLKPDFPWVTRLSSAGLVYVHFGQEIISQVLNMPVDSKLVQETYKRV 153
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y+ F+E ID IDNGI+ +D + RY TNLSSRVG LN W +P ++ E F QG
Sbjct: 154 YEGFIEEIDGIDNGISTHDGEG--RYKITTNLSSRVGNLNPKWNQP-ETEEDVTTKFNQG 210
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELE 256
M+L +EFLD + ++ + W PAR +VV+ I +R+ D SG+I+ CPWK H F+LE
Sbjct: 211 MELVKEEFLDKINYFGKCWWPARELVVKSIEDRFSVDESGQIIEFNNGGCPWKEHFFQLE 270
Query: 257 EEMKIE---PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
++M ++ I Y +Y D G WR+Q V VS FE RK LP QWRGLRD EL + +G
Sbjct: 271 KKMGLDVGDKKILYSIYPDSNG-SWRIQGVPVSSHSFELRKALPDQWRGLRDKELDQMSG 329
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
I GCVF+H SGFIGG+ + GAL M R AL +
Sbjct: 330 IDGCVFIHASGFIGGHSTRAGALEMGRKALAI 361
>gi|91083497|ref|XP_972332.1| PREDICTED: similar to AGAP002408-PA [Tribolium castaneum]
gi|270010805|gb|EFA07253.1| hypothetical protein TcasGA2_TC013284 [Tribolium castaneum]
Length = 346
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 160/343 (46%), Positives = 218/343 (63%), Gaps = 18/343 (5%)
Query: 15 SSPSQTP--LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
++ S+TP +K++GTH+G FHCDEAL C+M++ + A+I+R+RD VLD D V+DV
Sbjct: 2 ATDSKTPKLMKKIGTHSGVFHCDEALACYMLKQLPDYREAEIIRTRDMPVLDTCDVVVDV 61
Query: 73 GGVYDPSNDCYDHHQKGFEEV-------FGHGFSTKLSSAGLVYKHFGKELIAKELNVDE 125
G VY+P + YDHHQ+GFEE + KLSSAGLVY HFG ++I KE+ +
Sbjct: 62 GAVYNPKINRYDHHQRGFEETLSSVRPDLAKKSTIKLSSAGLVYAHFGLDVI-KEIIEQQ 120
Query: 126 GHPD----VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
G+P + ++FL VY+ F+E IDAIDNG+ Y K PRY NTNLS+R+ +LN +W
Sbjct: 121 GYPIPANCLQKVFLHVYEGFVEEIDAIDNGVPMYAEGK-PRYRINTNLSARIHRLNPEWN 179
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
+ E ++ F + +++AG EF + V V W PAR IV I R SGEI+V
Sbjct: 180 --SEEPESTDQLFMKAVEMAGTEFTERVIEAVTIWWPARQIVRNAIENRKKIHESGEIIV 237
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
L+ CPWK HL LEEEM I+ +K+V++ D+ WRVQ + + PD F R L WR
Sbjct: 238 LEERCPWKEHLLALEEEMGIQDQLKFVIFH-DKSSSWRVQGIPIQPDSFICRVFLHKDWR 296
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
G+RDD+LS AGI GCVF H +GFIGGN++ G L MA +LK
Sbjct: 297 GVRDDQLSDIAGIEGCVFCHATGFIGGNKTKEGVLQMAIKSLK 339
>gi|428173536|gb|EKX42437.1| hypothetical protein GUITHDRAFT_73898, partial [Guillardia theta
CCMP2712]
Length = 313
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 209/317 (65%), Gaps = 17/317 (5%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KR+GTH+G+FHCDE L C+M+ T +F +A+IVR+R+ L ++D V+DVG V+DPS
Sbjct: 5 KRIGTHDGAFHCDEVLACWMLHQTKQFADAEIVRTREENKLSEMDIVVDVGAVFDPSTHR 64
Query: 83 YDHHQKGFEEVFGHG-FSTKLSSAGLVYKHFGKEL---IAKELNVDEGHPDVHRLFLAVY 138
YDHHQK F + F TKLSSAGL+YK+FG+E+ I K ++V + +L+ VY
Sbjct: 65 YDHHQKSFTDTFDEKHVVTKLSSAGLIYKYFGQEILQNIVKSVDV-----NYEKLYHLVY 119
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
NF+E IDA+DNG+ Y D P+Y +T L RV +LN W DQS ++ +E FQ+ M
Sbjct: 120 DNFIEEIDAVDNGVQCYAADAVPKYKVSTMLGQRVARLNPSWN--DQS-KKPDEQFQKAM 176
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+ G E V +Y +++LPARSIV E I+ R D PSG+I+VLK CPWK HLFE+E+E
Sbjct: 177 QIVGDEMESIVNYYAKAFLPARSIVYEAISSRLDDHPSGKIIVLKTACPWKDHLFEIEKE 236
Query: 259 MKIEPL----IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
+++ + Y +YE D+G WRVQAV P+ F RK LP WRG+RD LS G+
Sbjct: 237 LQVSMYTLANVLYAVYE-DQGGSWRVQAVPERPESFHCRKALPEAWRGIRDQALSDLTGV 295
Query: 315 PGCVFVHMSGFIGGNQS 331
GC+FVH +GFIGG Q+
Sbjct: 296 QGCIFVHATGFIGGAQT 312
>gi|443893983|dbj|GAC71171.1| predicted metal-binding protein [Pseudozyma antarctica T-34]
Length = 587
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 157/320 (49%), Positives = 210/320 (65%), Gaps = 9/320 (2%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G+FH DEAL ++R +F +A + R+RD +D V+DVG YD + YDHH
Sbjct: 7 THSGTFHADEALAVNLLRKLPRFASAPLTRTRDAATIDKGTIVVDVGATYDAAAHRYDHH 66
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNV-DEGHPDVHRLFLAVYKNFMEAI 145
Q+GF+EVF +TKLSSAGLV+KHFG+E++A LN+ D V L+ +Y +F+EAI
Sbjct: 67 QRGFDEVFDDKHATKLSSAGLVWKHFGREIVAAHLNLGDRDRETVELLYHKLYDDFVEAI 126
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEF 205
D IDNGI QY + P+Y + T+LS+RVG LN W E AE + F++ +AGKEF
Sbjct: 127 DGIDNGIAQYPNELKPKYKSRTDLSARVGYLNPRWNEKSTDAELDAR-FERASAMAGKEF 185
Query: 206 LDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLI 265
D + + +WLPAR IVV+ +A+R +++V +F WK HLF LE+E+ I P
Sbjct: 186 FDRLDYTFNAWLPARQIVVDALAKR----THPQLLVFDQFASWKDHLFTLEQELNIAPTE 241
Query: 266 K--YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
+ YV+Y D+ GK WRVQAV VS + F SRK LP WRG+RD+ LS+ GIPGC+FVH S
Sbjct: 242 RAIYVVYPDESGK-WRVQAVPVSAESFISRKALPEPWRGIRDEALSQLTGIPGCIFVHQS 300
Query: 324 GFIGGNQSYGGALAMARAAL 343
GFIGGN + GAL MA AL
Sbjct: 301 GFIGGNATREGALKMASDAL 320
>gi|301097828|ref|XP_002898008.1| protein MYG1 [Phytophthora infestans T30-4]
gi|262106453|gb|EEY64505.1| protein MYG1 [Phytophthora infestans T30-4]
Length = 365
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 167/357 (46%), Positives = 219/357 (61%), Gaps = 18/357 (5%)
Query: 2 AGTGVSSSPAYS-------TSSPSQTPLKR-VGTHNGSFHCDEALGCFMIRLTDKFFNAQ 53
A V+SSP ++ TSS ++ K+ +GTHNG+FHCDEAL M++L KF
Sbjct: 7 AAAFVASSPCFAAFRSAQVTSSLAEGATKKYIGTHNGTFHCDEALAVSMLKLLPKFAAHD 66
Query: 54 IVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFG 113
++R+RD L +AV+DVGG+YDP +DHHQ+ F F TKLSSAGLVYKHFG
Sbjct: 67 VLRTRDESKLAQCEAVVDVGGIYDPEALRFDHHQRTFTGTFDQR-DTKLSSAGLVYKHFG 125
Query: 114 KELI---AKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGI---NQYDTDKPPRYVNNT 167
+E+I A +D+ D+ L YKNF+E ID IDNG+ D Y ++
Sbjct: 126 REIIQHLAAPTTLDDATLDI--LHQKAYKNFVEHIDGIDNGVEVATAGDAKLTYNYQVSS 183
Query: 168 NLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECI 227
LS+RVG LN W E DQS +R NE FQQ M + EF D + V SWLPAR IV + +
Sbjct: 184 TLSNRVGYLNPRWNE-DQSEKRVNEQFQQAMYMTISEFTDAIHDLVHSWLPAREIVEKAV 242
Query: 228 AERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSP 287
+ R+ SGEI+ +CPWK HL +LE+++ I IK+VLY D G RVQA+ V P
Sbjct: 243 SNRFQTHKSGEIVHFHEYCPWKSHLHDLEKKLMIPGQIKFVLYNDATGSMTRVQALNVEP 302
Query: 288 DRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
F RK L WRGLRDDELS+ +G+ GC FVH +GFIGGN+++ GAL MA +L+
Sbjct: 303 GSFALRKGLLQAWRGLRDDELSRVSGVEGCTFVHNAGFIGGNRTFEGALEMAAKSLE 359
>gi|167519585|ref|XP_001744132.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777218|gb|EDQ90835.1| predicted protein [Monosiga brevicollis MX1]
Length = 327
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 216/332 (65%), Gaps = 15/332 (4%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++GTH+G+FHCDEAL CF+++ ++ +A+IVR+RD + L D V+DVG V+DP+
Sbjct: 1 MVKIGTHDGTFHCDEALACFLLKRLPEYQDAEIVRTRDLEKLKACDIVVDVGAVFDPATH 60
Query: 82 CYDHHQKGFEEVFG------HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
YDHHQ+ F+E + +ST+LSSAGLVY HFGK +IA L D + LF
Sbjct: 61 RYDHHQREFQETMTTLSGGKYRWSTRLSSAGLVYYHFGKAIIAAILP-DASAEQIDTLFS 119
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
+Y F+E IDA+DNG++QY+ P ++ +T +S+RVG++N W E D F+
Sbjct: 120 KMYGGFVEEIDAVDNGVSQYEEGSPKYHIGST-VSTRVGRINPSWKETDTDMMPR---FE 175
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
+ M L G EF++ ++ Y SWLPAR +V E + R++ DPSGEI+ L +CPWK HL++L
Sbjct: 176 RAMALVGGEFVEVLKSYYESWLPARDVVKEALETRFEVDPSGEIIKLPIWCPWKSHLYDL 235
Query: 256 EEEMKIE--PL-IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
E EM E P+ IKY LY D WRV +++P F RK LP WRGLRD +L ++
Sbjct: 236 EAEMGTEEKPINIKYALYVDSH-NAWRVLCASITPGSFVCRKALPEAWRGLRDADLEAKS 294
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GI G FVH SGF GGN++Y GALAMA+A L+
Sbjct: 295 GIAGATFVHASGFTGGNKTYEGALAMAKAGLE 326
>gi|345496652|ref|XP_001601011.2| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Nasonia
vitripennis]
Length = 370
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 215/332 (64%), Gaps = 22/332 (6%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG+FHCDE L C+M++L ++ +A IVRSRD +LD D V+DVGG YD + Y
Sbjct: 45 KIGTHNGTFHCDEVLACYMLKLLPEYKDATIVRSRDQSILDTCDIVVDVGGKYDAATHRY 104
Query: 84 DHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELI---AKELNVDEGHPDVHRL 133
DHH + F E G+ + KLSSAGL+Y HFG ++I A ELN D D+ R+
Sbjct: 105 DHHMRDFTESISTVIKKPGYDSTIKLSSAGLIYCHFGHKIIKQLAPELNED----DLERI 160
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
F VY+ F++ ID IDNG+ +D + P Y +TNLS+RV +LNL W + + E E
Sbjct: 161 FKKVYETFIKEIDGIDNGVPMFDGE--PLYRISTNLSARVSRLNLQW---NTTHLNEEEQ 215
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
F + M +AG+EF + R+WLPAR++V E I R DPSGEI+ + + PWK HL+
Sbjct: 216 FNKAMVMAGEEFTYFIENAARTWLPARTLVKEAIENRLQIDPSGEIIEMTKAVPWKEHLY 275
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
+E E+ I+P IK+++++D+ +RVQ V + R LP +W GLRD+EL+ EAG
Sbjct: 276 NIETELNIDPTIKFIVFKDN---TYRVQGVPQQLGSYICRIFLPEKWCGLRDEELTAEAG 332
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
I CVFVH +GFIGGN++ GALAMAR ALKL
Sbjct: 333 IKDCVFVHTTGFIGGNKTREGALAMARYALKL 364
>gi|384486384|gb|EIE78564.1| hypothetical protein RO3G_03268 [Rhizopus delemar RA 99-880]
Length = 328
Score = 303 bits (777), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 159/328 (48%), Positives = 209/328 (63%), Gaps = 5/328 (1%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K +GTH+G FHCDEAL ++R TD+F +A +VR+RDP L + D ++DVGGVYDP+
Sbjct: 2 VKTIGTHSGHFHCDEALAIALLRRTDEFKDAVLVRTRDPAKLAECDIIVDVGGVYDPATH 61
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
YDHHQ+GF E F +TKLSSAGLVYKHFGKE++ K L D +++L Y F
Sbjct: 62 RYDHHQRGFTETFDDKHATKLSSAGLVYKHFGKEVVGKLLGNAVSPSDAEKIYLKTYDCF 121
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNN-TNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+EA+DA DNGI+ Y + P + T+L RV K N W E + E + F Q +L
Sbjct: 122 VEALDANDNGISAYPNNITPLFKETPTSLPDRVAKKNPSWNEVVTNDEVDAR-FVQASEL 180
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
AG+E +D V SWLPAR++VVE + + + S I+ L+R CPWK HL +LE E
Sbjct: 181 AGEELVDYVLRLKTSWLPARTLVVEALDKADEVHSSARIIALERSCPWKEHLLDLENERG 240
Query: 261 I--EPLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ + I YVLY E WR+Q V V P+ FE+RK LP WRG RDDELS+ +GI C
Sbjct: 241 LTGDKSILYVLYPESSPEGNWRIQCVPVRPEGFENRKSLPEAWRGFRDDELSRISGIDNC 300
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
+FVH GFIGGN++ GA MAR AL+L
Sbjct: 301 IFVHAGGFIGGNKTRQGACEMARLALEL 328
>gi|19115396|ref|NP_594484.1| conserved eukaryotic protein [Schizosaccharomyces pombe 972h-]
gi|14195571|sp|Q9P7T6.1|YIW4_SCHPO RecName: Full=UPF0160 protein C694.04c
gi|6901199|emb|CAB71842.1| conserved eukaryotic protein [Schizosaccharomyces pombe]
Length = 324
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 208/325 (64%), Gaps = 12/325 (3%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
L ++ TH+G+FH DEAL +M+R D+F AQIVRSRDP+VLD D ++DVGG YD
Sbjct: 4 LVKIATHSGTFHADEALAVYMLRRLDRFSGAQIVRSRDPQVLDSCDIIVDVGGKYD-GIK 62
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELI---AKELNVDEGHPDVHRLFLAVY 138
+DHHQ+ F + F +ST+LSSAGL+YKHFG+E+I +L ++E D+ L+ VY
Sbjct: 63 YFDHHQREFNDTFSPKYSTRLSSAGLIYKHFGREVIHAVLPQLKINE--QDLETLYEKVY 120
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
++F+E +DA DNGI+ Y P + +L V W Q + E FQ+
Sbjct: 121 QSFVEGLDANDNGISAYPAGLKPSFKAAMSLPEMVSSFLPAWNSEKQDDQTYLECFQKAS 180
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
DL G F+ +V Y SWLPA+++ E I + D I+++ +F PWK HLF++E+E
Sbjct: 181 DLMGTWFVRSVEHYALSWLPAKTLAREAILKAKD----SPILIVDQFFPWKGHLFDIEKE 236
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+ IE KY +Y D GK WRVQAV++ P F R PLP WRG+RD++LS+ GIPGC+
Sbjct: 237 LGIENQFKYAIYSD--GKAWRVQAVSIDPTSFTCRLPLPEPWRGIRDEKLSELTGIPGCI 294
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
FVH SGFIGGNQ++ GAL MAR AL
Sbjct: 295 FVHASGFIGGNQTFEGALEMARKAL 319
>gi|406695355|gb|EKC98663.1| GAMM1 protein [Trichosporon asahii var. asahii CBS 8904]
Length = 321
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 166/337 (49%), Positives = 209/337 (62%), Gaps = 27/337 (8%)
Query: 15 SSPSQTPLKR---VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLD 71
SSP +K +GTH+G+FHCD A + T+ +D LD V+D
Sbjct: 2 SSPVAKKMKSTKVIGTHSGTFHCDAATPPSVRAATNAL-----------TAVDPLDIVVD 50
Query: 72 VGGVYDPSNDCYDHHQKGFEEVFGHGF--STKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
VGGVYDP+ YDHHQ+GF EVFG G TKLSSAGLVYKHFGK +IAK+L V E P
Sbjct: 51 VGGVYDPAKQRYDHHQRGFTEVFGFGGYDRTKLSSAGLVYKHFGKHIIAKQLGVPESDPK 110
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAER 189
V L+L +Y +E+ID IDNG+N + Y T+LSSRV +LN W EP S +
Sbjct: 111 VETLWLQLYGELIESIDGIDNGVNIAQGELA--YAQRTDLSSRVRRLNPRWNEP-ASDDD 167
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK 249
+ F G EFL + ++ +WLPAR +V E + +R++ DPSG+I+V K +
Sbjct: 168 YDARFAVASKTTGDEFLQQLDYFANAWLPARDVVKEALEKRHEVDPSGKIVVFK-----Q 222
Query: 250 LHLFELEEEMKIEPLIKYVLYE---DDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
HLF LE M E I YVLY D+ +WR+Q V SPD F +RK LPA WRGLRD+
Sbjct: 223 DHLFSLEPTMAKETKILYVLYPENPDNENSKWRIQCVPESPDSFTNRKSLPAAWRGLRDE 282
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
ELSKE+GIPG VFVH SGFIGGN+++ G LAMARAAL
Sbjct: 283 ELSKESGIPGGVFVHASGFIGGNETFNGVLAMARAAL 319
>gi|296414285|ref|XP_002836833.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295631672|emb|CAZ81024.1| unnamed protein product [Tuber melanosporum]
Length = 351
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 162/345 (46%), Positives = 216/345 (62%), Gaps = 23/345 (6%)
Query: 20 TPLKRVGTHNGSFH------------CDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
TPL R+GTHNG FH DEAL +M+RL ++ + ++RSRDP VL++ D
Sbjct: 11 TPL-RIGTHNGHFHVCLILGRQTDSGADEALAVYMLRLLPEYQGSTLIRSRDPAVLEECD 69
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH 127
++DV YD +DHHQ+ F E F F TKLSSAGL+YKHF ++I+ L++
Sbjct: 70 IIVDVHAQYDGIKH-FDHHQRTFSETFSPDFQTKLSSAGLIYKHFAPQIISHRLSIPIDD 128
Query: 128 PDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQS 186
P V L+ VYK+F+EAIDA DNGI+ Y +D P + + + +L S VG LN +W ++
Sbjct: 129 PSVTLLYNKVYKDFIEAIDANDNGISAYPSDVKPAFNDKSISLPSLVGLLNPEWNSTVKT 188
Query: 187 AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC 246
E E+E F + L G F + +Y +SWLPAR VVE + R+++D G I+V +
Sbjct: 189 QELEDEKFLEASKLMGTVFDQKLDYYGKSWLPARDHVVEAMKGRFEFDEKGRILVFSQSV 248
Query: 247 PWKLHLFELEEEMKIE-----PLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
PWK HLF LE E+ +E PL YVLY E WR+Q V VS D FESRKPLP W
Sbjct: 249 PWKDHLFTLEAEVGVEGEDKRPL--YVLYGEGAEKSSWRIQCVPVSKDSFESRKPLPEAW 306
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
RG RDD LS+ +GIPG VFVH SGFIGGN+++ GAL MA+ +L+L
Sbjct: 307 RGTRDDALSEVSGIPGGVFVHASGFIGGNKTFEGALEMAKTSLEL 351
>gi|260940216|ref|XP_002614408.1| hypothetical protein CLUG_05894 [Clavispora lusitaniae ATCC 42720]
gi|238852302|gb|EEQ41766.1| hypothetical protein CLUG_05894 [Clavispora lusitaniae ATCC 42720]
Length = 335
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 153/322 (47%), Positives = 209/322 (64%), Gaps = 4/322 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+ TH+GSFH DE+L +++RL KF +Q+VRSR+P ++ D V+DVGG YD +
Sbjct: 15 RICTHSGSFHADESLAVYLLRLLPKFQGSQLVRSRNPTDWEESDIVVDVGGKYDGGIKWF 74
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F E F F TKLSSAGL+YKHFGKE+I L++ + D+ L+L VYK F+E
Sbjct: 75 DHHQREFSETFSQNFQTKLSSAGLIYKHFGKEIIQHVLDLKKP-SDIELLYLKVYKEFIE 133
Query: 144 AIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
AIDA DNGIN Y +D ++ + N L + V LN W D + +EAF++ +L G
Sbjct: 134 AIDANDNGINNYPSDVEKKFNDKNLGLPALVSHLNPAWN-VDPVDKDFDEAFEKASELMG 192
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
FL+ V+ Y +SWLPAR IV E R+D SGEI+VL +FCPWK HL+ +E+E +
Sbjct: 193 SAFLNLVKGYGQSWLPARDIVEEAFKSRFDTHKSGEILVLGKFCPWKEHLYAIEKENNAQ 252
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
IK+VL+ D K WR+ V V+ FE R LP WRG+RD+ELSK G+ GC+FVH
Sbjct: 253 GSIKFVLFSDSSSK-WRISTVPVTSTSFEFRFGLPEPWRGVRDEELSKLTGVDGCIFVHA 311
Query: 323 SGFIGGNQSYGGALAMARAALK 344
+GFIGG ++ + +AR AL+
Sbjct: 312 AGFIGGAETQDAVMELARLALE 333
>gi|391326650|ref|XP_003737825.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Metaseiulus
occidentalis]
Length = 334
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 162/328 (49%), Positives = 218/328 (66%), Gaps = 13/328 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++GTHNG+FHCDEAL C +++L ++ A+I+R+RDP +L+ D V+DVG VYD S
Sbjct: 4 KKIGTHNGTFHCDEALACALLKLLPEYSTAEILRTRDPALLETCDVVVDVGAVYDASKFR 63
Query: 83 YDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
+DHHQK F+E + KLSSAGL+Y H+G++++A + H D ++ +
Sbjct: 64 FDHHQKTFQETMHSLRADFPWEIKLSSAGLIYFHYGEKILADLMKRPAEHRDTQTVYRKI 123
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y+ F+ IDAIDNG+N D D RY NT LSSRVG LN W E +Q+ + FQ
Sbjct: 124 YEEFIIEIDAIDNGVNMCDGDT--RYRINTGLSSRVGGLNPKWNE-EQTPAAADRQFQLA 180
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELE 256
M LA EFLD V +Y +WLPA+ +V E I +RY+ D SG I+ L + CPWK HLF++E
Sbjct: 181 MKLAETEFLDKVNYYSNAWLPAKELVREAINKRYEVDKSGRIIELPQTGCPWKDHLFDIE 240
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
EE+ I+ + YV++ D K +RVQ V + F++R LP WRGLRDDEL +E+GIPG
Sbjct: 241 EELGIQGELIYVIHTDT--KSYRVQCVPLKLGAFDNRHSLP--WRGLRDDELCRESGIPG 296
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
C+FVH SGFIGGNQ+ GAL MARAALK
Sbjct: 297 CIFVHASGFIGGNQTRDGALEMARAALK 324
>gi|430811406|emb|CCJ31157.1| unnamed protein product [Pneumocystis jirovecii]
gi|430814556|emb|CCJ28230.1| unnamed protein product [Pneumocystis jirovecii]
Length = 322
Score = 300 bits (767), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 208/322 (64%), Gaps = 4/322 (1%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K +GTH+G+FH D+ L +M+RL +F +++IVRSRDP++L+ D ++DV G YD
Sbjct: 3 VKTIGTHSGTFHADDCLAVYMLRLLPEFKDSKIVRSRDPEILETCDIIVDVHGKYDGVK- 61
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+ F E F F TKLSS+GL++KHFGK +I+ L++D HP V L++ +Y++F
Sbjct: 62 YFDHHQRCFNETFSENFRTKLSSSGLIFKHFGKRIISYRLDLDLDHPYVDVLYVKLYESF 121
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+EAIDA DNGIN Y D P + LSS V N W +P ++ F + +
Sbjct: 122 LEAIDANDNGINAYPYDVRPLFKTQLELSSLVANFNPWWNQPTDDNILFSK-FLKASEFI 180
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G FLD + +Y ++W+ AR +V++ +R++YD G I+V ++F PWK HLF++E E
Sbjct: 181 GAIFLDHLDYYGKAWIVARELVLDAFKKRFEYDSKGRIVVFEQFLPWKSHLFQIESEFNT 240
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
I YV+Y D GK WRVQAV ++P+ F SRK LP WRGLRD L E+ I CVFVH
Sbjct: 241 CGQILYVIYTD--GKDWRVQAVPIAPESFTSRKGLPENWRGLRDKALCLESNIDNCVFVH 298
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
SGF GGN++ GAL MA AL
Sbjct: 299 ASGFTGGNKTKEGALQMAIKAL 320
>gi|406607040|emb|CCH41555.1| hypothetical protein BN7_1096 [Wickerhamomyces ciferrii]
Length = 324
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/322 (47%), Positives = 211/322 (65%), Gaps = 6/322 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+ TH+GSFH DE+L +M+RL ++ N+ +VRSRDP + D V+DVGG YD + +
Sbjct: 7 RICTHSGSFHADESLAVYMLRLLPRYQNSSLVRSRDPAQWEQADIVVDVGGKYDGTK-FF 65
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+GFEE F ++TKLSSAGLVYKHFGKE+I E+ + + L+ +YK F+E
Sbjct: 66 DHHQRGFEETFNEKYATKLSSAGLVYKHFGKEIIG-EVIQSKDESVLELLYDKIYKEFIE 124
Query: 144 AIDAIDNGINQYD--TDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
++DA DNG+++YD T+K N L S V LN W +QS E + F +L
Sbjct: 125 SLDANDNGVSKYDNLTEKAKFNDRNITLPSLVSNLNPHWN-AEQSDEDFDRQFLIASELM 183
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G F++ V+ Y SWLPA+ IV + +R++ D SG+I+VL +F PWK HL+E+E+
Sbjct: 184 GNAFVNVVKNYGLSWLPAKKIVEDSFNQRFEVDKSGKILVLSQFAPWKEHLYEIEKSNDK 243
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
E I YVL+ED GK WR+ V ++ F++RKPLP WRGLRDD LSKE+G+ GCVFVH
Sbjct: 244 EGEILYVLFEDS-GKNWRISTVPITSASFDNRKPLPENWRGLRDDALSKESGVEGCVFVH 302
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
+GFIGG ++ L +AR +L
Sbjct: 303 AAGFIGGAKTKDAVLQLARLSL 324
>gi|320167610|gb|EFW44509.1| UPF0160 protein c [Capsaspora owczarzaki ATCC 30864]
Length = 367
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 158/326 (48%), Positives = 209/326 (64%), Gaps = 10/326 (3%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V H+G+FHCDEAL CFM++LTDKF NA IVR+R P VL + D +DVG V DPS YD
Sbjct: 45 VCAHDGTFHCDEALACFMLKLTDKFANADIVRTRVPAVLAEADCCVDVGAVDDPSTHRYD 104
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN--VDEGHPDVHR----LFLAVY 138
HHQ F+ F + ++LSSAGLVYKH G+E+I K + + HP++ L+ VY
Sbjct: 105 HHQASFKGTFSPAYKSRLSSAGLVYKHHGREVIRKIVAGVISPEHPELEAQTAVLYERVY 164
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
K F+E IDA+DNG + + Y ++++ +RV +LN +W E +A NE F+ +
Sbjct: 165 KQFIEHIDAMDNG-QEVSENGVLMYNLSSHIGTRVSRLNPEWNEEADNATV-NERFRLAI 222
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
L G EF D V R+WLPAR IV +A R+ PSG+I+ L +FCPWK HLF LEEE
Sbjct: 223 YLTGSEFHDYVSGQARAWLPARGIVEAAVAARHTVHPSGQIIKLDQFCPWKEHLFMLEEE 282
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAV-SPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I+ IK+VL++D G +RVQ V+V F SR PL WRG+RD+ LS+ GIPG
Sbjct: 283 LNIQGAIKFVLFQDKSGT-YRVQTVSVDEASFFSSRIPLLEAWRGVRDEALSELLGIPGA 341
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
VFVH SGFIGG+++Y G L MA L
Sbjct: 342 VFVHASGFIGGHKTYEGVLEMAARCL 367
>gi|118117413|ref|XP_423851.2| PREDICTED: UPF0160 protein MYG1, mitochondrial [Gallus gallus]
Length = 370
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 155/301 (51%), Positives = 200/301 (66%), Gaps = 12/301 (3%)
Query: 51 NAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSA 105
+A++VR+RDP+ L D V+DVGG YDP YDHHQ+ F + ++TKLSSA
Sbjct: 68 DAEVVRTRDPQRLAQCDVVVDVGGEYDPERHRYDHHQRSFTQSMQSLQPSKPWTTKLSSA 127
Query: 106 GLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVN 165
GLVY HFG E++A L E P V L+ +Y+NF+E IDAIDNGI Q TD PRY
Sbjct: 128 GLVYCHFGSEILAGLLGQPEDSPAVTALYDKLYENFVEEIDAIDNGIAQ--TDGEPRYAL 185
Query: 166 NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVE 225
TNLS+RVG LN W +PDQ E F++ M+L G EF+D + +Y R+WLPAR++V +
Sbjct: 186 TTNLSARVGHLNPRWNDPDQDTE---AGFKRAMELVGSEFMDRLDYYHRAWLPARALVED 242
Query: 226 CIAERYDYDPSGEIMVLKRF-CPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVA 284
I R++ D SG ++ L + CPWK HLF LE+E+ + ++ VL+ D R QWRVQ+V
Sbjct: 243 AIRRRFEVDTSGVLLELPQGGCPWKEHLFSLEKELVLPDPLQLVLFPD-RSGQWRVQSVP 301
Query: 285 VSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
V P FESR PLP WRG+RD+ LS+ GIPGCVFVH SGFIGGN++ GAL MAR L
Sbjct: 302 VGPRSFESRLPLPEPWRGVRDEALSQLTGIPGCVFVHSSGFIGGNRTREGALEMARRTLA 361
Query: 345 L 345
L
Sbjct: 362 L 362
>gi|195997823|ref|XP_002108780.1| hypothetical protein TRIADDRAFT_52107 [Trichoplax adhaerens]
gi|190589556|gb|EDV29578.1| hypothetical protein TRIADDRAFT_52107 [Trichoplax adhaerens]
Length = 337
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 146/328 (44%), Positives = 212/328 (64%), Gaps = 11/328 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++GTHNG+FHCDEAL C+M++ ++ +A+++R+RD K+LD D V+DVGG+YD +
Sbjct: 15 KKIGTHNGTFHCDEALACYMLKTLPEYTSAEVIRTRDEKLLDTCDIVVDVGGIYDHDKNR 74
Query: 83 YDHHQKGF----EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
YDHHQ+ F + + + TKLSSAGL+Y H+G+ +I + N + ++ ++ VY
Sbjct: 75 YDHHQRTFDGTMQSLGSLNYQTKLSSAGLIYLHYGRRVITEIANRELDSNVLNIIYDKVY 134
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+NF+E IDAIDNGI+Q T++ RY+ T +S+RV LN W ++E F + M
Sbjct: 135 ENFLEEIDAIDNGISQ--TEETARYLITTGVSARVSHLNPAWNCKQPLPDKE---FVKAM 189
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEE 257
DL G E D + ++ W PAR IV + R+ SGEI++L+ CPWK HL+ +E+
Sbjct: 190 DLVGSELRDKILYFCNVWYPAREIVEAAVKNRHQIHESGEIILLENGGCPWKEHLYVIEQ 249
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+++ P +KYVLY D+ WRVQ V RFE+R LP WRGLR+ +LS E+GI C
Sbjct: 250 KLQTNPTLKYVLY-SDQNNNWRVQCVPERHKRFENRLSLPQNWRGLREADLSNESGIKDC 308
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALKL 345
+FVHM GFIGGN++ GAL MA AL+
Sbjct: 309 IFVHMGGFIGGNKTKEGALEMAIKALQF 336
>gi|348679062|gb|EGZ18879.1| hypothetical protein PHYSODRAFT_544405 [Phytophthora sojae]
gi|348679071|gb|EGZ18888.1| hypothetical protein PHYSODRAFT_351392 [Phytophthora sojae]
Length = 367
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 164/358 (45%), Positives = 211/358 (58%), Gaps = 19/358 (5%)
Query: 2 AGTGVSSSPAYSTSSPSQT--------PLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQ 53
A V+SSP ++ +Q K +GTHNG+FHCDEAL M++L KF
Sbjct: 7 AAAFVASSPCFAAFRSAQVTSQLAEGAAKKYIGTHNGTFHCDEALAVSMLKLLPKFAAHD 66
Query: 54 IVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFG 113
++R+RD L +AV+DVGGVYD +DHHQ+ F F TKLSSAGLVY HFG
Sbjct: 67 VLRTRDEAKLAQCEAVVDVGGVYDAQALRFDHHQRSFAGTFDQR-DTKLSSAGLVYNHFG 125
Query: 114 KELI---AKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQY----DTDKPPRYVNN 166
+E+I A + +D+ D+ L YKNF+E ID IDNG+ D Y +
Sbjct: 126 REIIQVLAAPVTLDDATLDI--LHQKAYKNFVEHIDGIDNGVEVASAAGDAKITYNYQVS 183
Query: 167 TNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVEC 226
++LS+RVG LN W E DQS R N FQQ M + EF D + V SWLPAR IV +
Sbjct: 184 SSLSNRVGYLNPRWNE-DQSEARVNAQFQQAMYMTITEFTDAIHDLVHSWLPAREIVEKA 242
Query: 227 IAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVS 286
+++R+ SGEI+ +CPWK HL +LEE++ I IK+VLY D G RVQA+
Sbjct: 243 VSKRFQTHKSGEIVHFPEYCPWKSHLHDLEEKLMISGQIKFVLYNDATGSMTRVQALNTE 302
Query: 287 PDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
P F RK L WRGLRD ELS +GI GC FVH +GFIGGN++Y GAL MA +L+
Sbjct: 303 PGSFALRKGLLPAWRGLRDAELSTVSGIEGCTFVHSAGFIGGNRTYEGALEMAAKSLE 360
>gi|442754435|gb|JAA69377.1| Putative secreted protein [Ixodes ricinus]
Length = 338
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 164/331 (49%), Positives = 213/331 (64%), Gaps = 12/331 (3%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
T K +GTHNG+FHCDEAL CF+++ + +A IVRSRD VL D V+DVGGVYDP+
Sbjct: 11 TAAKTIGTHNGTFHCDEALACFLLKQLPAYEDANIVRSRDSAVLSTCDIVVDVGGVYDPA 70
Query: 80 NDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
+ +DHHQ+ F E ++TKLSSAGL+Y HFG+E+IAK L + P+V +++
Sbjct: 71 TNRFDHHQRTFSETMHSLNESKKWTTKLSSAGLIYAHFGREVIAKVLGWKKDDPNVEKIY 130
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
VY+NFME IDAIDNG+ +D + PRY +TNLS+RV LN ++ +E F
Sbjct: 131 DKVYENFMEEIDAIDNGVKMFDEE--PRYRISTNLSARVAHLN---PPWNEPNPNPDERF 185
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHLF 253
Q+ + L G+EFLD VR Y W+PAR +V I +R D G ++ CPWK HL
Sbjct: 186 QKALKLTGEEFLDRVRSYATVWMPARDLVFNAIQQRQKVDGLGSVVTFSGGCCPWKDHLL 245
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
ELEEE+ I +K+VLYED+ K WRVQ V S F+ R LP +W GLRD ELS +G
Sbjct: 246 ELEEELGIPGEVKFVLYEDEN-KAWRVQGVPPSLGSFDCRVFLPKKWCGLRDKELSDASG 304
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
I GCVFVH +GFIGGN++ GAL MA LK
Sbjct: 305 IEGCVFVHSNGFIGGNKTREGALQMAVHTLK 335
>gi|156846836|ref|XP_001646304.1| hypothetical protein Kpol_1032p40 [Vanderwaltozyma polyspora DSM
70294]
gi|156116979|gb|EDO18446.1| hypothetical protein Kpol_1032p40 [Vanderwaltozyma polyspora DSM
70294]
Length = 339
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 154/343 (44%), Positives = 222/343 (64%), Gaps = 13/343 (3%)
Query: 9 SPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA 68
SP + +K++ TH+ SFH DEAL +M+RL ++ +A++VRSRDP + +D D
Sbjct: 2 SPVSKKIKIANNMVKQICTHSNSFHADEALAVYMLRLLPEYSDAKVVRSRDPAMWEDSDI 61
Query: 69 VLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL----NVD 124
V+DV G YD +DHHQ+ F E F + TKLSSAGLV+KHFG+++I+ L ++D
Sbjct: 62 VVDVSGKYDGIK-FFDHHQREFYETFSENYKTKLSSAGLVFKHFGRDIISSILTGNASMD 120
Query: 125 EGHPDVHRLFLAVYKNFMEAIDAIDNGINQYD---TDKPPRYVNNT-NLSSRVGKLNLDW 180
D+ L+ VYKNF+EAIDA DNGIN YD TD P++++ + + + + +N DW
Sbjct: 121 SNKLDI--LYERVYKNFVEAIDANDNGINCYDVKGTDIEPKFIDKSITIPAVISNMNPDW 178
Query: 181 TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
+ D S E+ +E F G F V+ Y SWLPA+ +V + I R + DPSG+I+
Sbjct: 179 ND-DCSPEKFDENFFMASKFIGDIFFRLVKRYGDSWLPAKDLVEQAIKNRLEVDPSGKII 237
Query: 241 VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
+ +FCPWK HL+ +E+E+ IE I++VL++D GK WRV V VS FE RK LP +
Sbjct: 238 LFDQFCPWKEHLYAVEKELGIENQIQFVLFQD-SGKTWRVSTVPVSSTSFEFRKGLPEEL 296
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
RGLRDDELS+++G+PGCVF+H +GFIGG +S L +A+ +L
Sbjct: 297 RGLRDDELSEKSGVPGCVFIHAAGFIGGAKSKDAVLKLAKMSL 339
>gi|149244590|ref|XP_001526838.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449232|gb|EDK43488.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 340
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 153/321 (47%), Positives = 215/321 (66%), Gaps = 4/321 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+GSFH DE+L ++++L +F NA++VRSRDPK +D D V+DVGG YD + +
Sbjct: 19 KICTHSGSFHADESLAVYLLKLLPQFQNAELVRSRDPKDWEDSDIVVDVGGKYDGTK-FF 77
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F FG + TKLSSAGLVYKHFGK++I + L +++G +V L+ +YK F+E
Sbjct: 78 DHHQREFNTTFGENYETKLSSAGLVYKHFGKDIIREVLGLEDGDKNVELLYNKIYKEFIE 137
Query: 144 AIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
++DA DNGIN Y D ++ + N L S V KLN W E A+ + + F ++ G
Sbjct: 138 SLDANDNGINNYPKDVESKFNDRNITLPSIVSKLNPRWNESCTDADYDRQ-FLIASEMMG 196
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ F+ + Y + WLPA+SIV E R+D DPSGEI+VL +FCPWK HL+ +E++ E
Sbjct: 197 RVFVSLLEGYGKGWLPAKSIVEEAFKSRFDVDPSGEILVLSQFCPWKEHLYAIEKDAGKE 256
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
IK+VL++D GK WRV V+V+ FE R+ LP + RGLRD ELS++AGI GC+FVH
Sbjct: 257 GAIKFVLFQDSSGK-WRVSTVSVTSSSFEFRQGLPERLRGLRDAELSEKAGIEGCIFVHA 315
Query: 323 SGFIGGNQSYGGALAMARAAL 343
+GFIGG S G L +A+ L
Sbjct: 316 AGFIGGANSKEGVLQLAKLGL 336
>gi|410964535|ref|XP_003988809.1| PREDICTED: UPF0160 protein MYG1, mitochondrial, partial [Felis
catus]
Length = 322
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 151/299 (50%), Positives = 199/299 (66%), Gaps = 11/299 (3%)
Query: 51 NAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSA 105
+A+IVR+RDP+ L D V+DVGG YDP YDHHQ+ F E G + TKLSSA
Sbjct: 15 DAEIVRTRDPEKLAACDIVVDVGGEYDPQRHRYDHHQRSFTETMSSLSPGKPWQTKLSSA 74
Query: 106 GLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVN 165
GL+Y HFG +L+A+ L +E V ++ +Y+NF+E +DA+DNGI+Q++ +P RYV
Sbjct: 75 GLIYLHFGHKLLAQLLGTNEEDSMVGTIYDKMYENFVEEVDAVDNGISQWEEGEP-RYVL 133
Query: 166 NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVE 225
T LS+RV +LN W +P+Q E F++ MDL +EFL V FY SWLPAR++V E
Sbjct: 134 TTTLSARVARLNPTWNQPNQDTE---AGFKRAMDLVREEFLQRVGFYQHSWLPARALVEE 190
Query: 226 CIAERYDYDPSGEIMVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVA 284
+A+R+ DPSGEI+ L K CPWK HL+ LE + I +V+Y D G QWRVQ V
Sbjct: 191 ALAQRFQVDPSGEIVELAKGGCPWKEHLYHLESGLSPPVTIAFVIYTDQAG-QWRVQCVP 249
Query: 285 VSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
P F+SR PLP WRGLRD+ L + +GIPGC+FVH SGFIGG+ + GAL+MARA L
Sbjct: 250 KEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGCIFVHTSGFIGGHHTREGALSMARATL 308
>gi|340939493|gb|EGS20115.1| putative GAMM1 protein [Chaetomium thermophilum var. thermophilum
DSM 1495]
Length = 361
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 154/334 (46%), Positives = 216/334 (64%), Gaps = 17/334 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
VGTHNG FH DEAL +M+R ++ A++VRSRDP VL++ V+DVGG YDP+ + +
Sbjct: 15 VGTHNGHFHADEALAVYMLRQHVPQYIGARLVRSRDPAVLNECSIVVDVGGEYDPARNRF 74
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F F G T+LSSAGLVY HFG+ +IA++L E V ++ +Y++F+E
Sbjct: 75 DHHQRSFATTFP-GRQTRLSSAGLVYLHFGRGIIARKLGQPEESEPVGLVWNKLYESFIE 133
Query: 144 AIDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDWTEPD-----QSAEREN 191
A+DA DNGIN YD ++ R+ + + L + VG+LN +W +P ++ +E+
Sbjct: 134 ALDAHDNGINVYDPERLKSAGLEKRFSDGSFTLGAMVGRLNPNWNDPIPEDPVEAQAQED 193
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKL 250
F+Q G+EF ++ +Y SWLPAR +V + RY++D G IMV K + PWK
Sbjct: 194 SRFEQASQRIGEEFDRSLEYYTLSWLPARDLVADAFKARYEWDSEGRIMVFKGQSPPWKD 253
Query: 251 HLFELEEEMKIEPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
HL+ LEE+ + YVLY + G +WRVQ V V+ D FESRKPLP WRG RD+EL
Sbjct: 254 HLYALEEQDPDANQVYYVLYPEKPVEGSKWRVQCVPVTKDSFESRKPLPEHWRGFRDEEL 313
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAA 342
+ +G+ GCVFVH SGFIGGN+++ GALAMA+ A
Sbjct: 314 DRTSGVSGCVFVHASGFIGGNKTFEGALAMAKRA 347
>gi|448104274|ref|XP_004200243.1| Piso0_002821 [Millerozyma farinosa CBS 7064]
gi|359381665|emb|CCE82124.1| Piso0_002821 [Millerozyma farinosa CBS 7064]
Length = 335
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 153/333 (45%), Positives = 211/333 (63%), Gaps = 18/333 (5%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
+ + +V TH+GSFH DE+L FMIRL K+ ++++RSR+P + D V+DVGG YD
Sbjct: 12 STMLKVCTHSGSFHADESLAVFMIRLLPKYKGSEVIRSRNPADWESSDIVIDVGGKYDGV 71
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
+DHHQ+ F E F + TKLSSAGL+YKHFGK++I ++++E +V L+ +YK
Sbjct: 72 K-FFDHHQREFMETFSRDYQTKLSSAGLIYKHFGKDIIRTVVDLEES--EVELLYQKIYK 128
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYV--------NNTNLSSRVGKLNLDWTEPDQSAEREN 191
F+EA+DA DNGIN Y PR V N L + V LN W A+ +
Sbjct: 129 EFIEALDANDNGINNY-----PREVEASKKFNDKNITLPAIVSSLNPSWVTDPNDADF-D 182
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLH 251
AFQ +L GK FL+ V Y +SW+PA+ +V E +R++ D SGEI++L+RFCPWK H
Sbjct: 183 RAFQSSSELMGKVFLNLVENYGKSWIPAKRLVQEAFDKRFEVDKSGEIIILERFCPWKEH 242
Query: 252 LFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKE 311
L+ +E++ + + K+VL+ D GK WRV VAVS F+ R L WRGLRD+ELSK
Sbjct: 243 LYAIEKDANAQGVTKFVLFSDSSGK-WRVSTVAVSSGSFDFRLGLLESWRGLRDEELSKA 301
Query: 312 AGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+G+ GC+FVH SGFIGG QS L +AR +LK
Sbjct: 302 SGVDGCIFVHASGFIGGAQSEKAVLELARISLK 334
>gi|401886943|gb|EJT50954.1| GAMM1 protein [Trichosporon asahii var. asahii CBS 2479]
Length = 305
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 164/337 (48%), Positives = 205/337 (60%), Gaps = 43/337 (12%)
Query: 15 SSPSQTPLKR---VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLD 71
SSP +K +GTH+G+FHCD +D LD V+D
Sbjct: 2 SSPVAKKMKSTKVIGTHSGTFHCD---------------------------VDPLDIVVD 34
Query: 72 VGGVYDPSNDCYDHHQKGFEEVFGHGF--STKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
VGGVYDP+ YDHHQ+GF EVFG G TKLSSAGLVYKHFGK +IAK+L V E P
Sbjct: 35 VGGVYDPAKQRYDHHQRGFTEVFGFGGFDRTKLSSAGLVYKHFGKHIIAKQLGVAESDPK 94
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAER 189
V L+L +Y +E+ID IDNG+N + Y T+LSSRV +LN W EP S +
Sbjct: 95 VETLWLQLYGELIESIDGIDNGVNIAQGELA--YAQRTDLSSRVRRLNPRWNEP-ASDDD 151
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK 249
+ F G EFL + ++ +WLPAR +V E + +R++ DPSG+I+V K +
Sbjct: 152 YDARFAVASKTTGDEFLQQLDYFANAWLPARDVVKEALEKRHEVDPSGKIVVFK-----Q 206
Query: 250 LHLFELEEEMKIEPLIKYVLYE---DDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
HLF LE M E I YVLY D+ +WR+Q V SPD F +RK LPA WRGLRD+
Sbjct: 207 DHLFSLEPTMAKETKILYVLYPENPDNENSKWRIQCVPESPDSFTNRKSLPAAWRGLRDE 266
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
ELSKE+GIPG VFVH SGFIGGN+++ G LAMARAAL
Sbjct: 267 ELSKESGIPGGVFVHASGFIGGNETFNGVLAMARAAL 303
>gi|358378888|gb|EHK16569.1| hypothetical protein TRIVIDRAFT_187423 [Trichoderma virens Gv29-8]
Length = 350
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 158/337 (46%), Positives = 209/337 (62%), Gaps = 19/337 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
VGTHNG FH DEAL M+R+ + ++ +VR+RDPK L+ V+DVGG YD + YD
Sbjct: 17 VGTHNGHFHADEALAVHMLRMLPSYRDSALVRTRDPKTLETCHTVVDVGGEYDAQRNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDE-GHPDVHRLFLAVYKNFME 143
HHQ+GF+ F G +TKLSSAGLVY HFG++LIA+ L D+ PDV L+ +Y+NF+E
Sbjct: 77 HHQRGFDTTFP-GKNTKLSSAGLVYMHFGRQLIAQRLKTDDDASPDVELLYNKIYENFIE 135
Query: 144 AIDAIDNGINQYDTDK------PPRY-VNNTNLSSRVGKLNLDWTEPD-----QSAEREN 191
A+DA DNGI++YD D R+ L + VG+LN W +P + + E+
Sbjct: 136 AVDAHDNGISRYDRDALAAAGIEQRFSTGGFTLGAMVGRLNPAWNDPKPADPVAAQQAED 195
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKL 250
F + G EF T+ F +WLPARSIV +R +DP G I+VL+ + PWK
Sbjct: 196 ALFLKASARIGDEFERTLDFMTAAWLPARSIVQTAFDQRQKHDPKGRILVLEGQSVPWKE 255
Query: 251 HLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
HL+ LE E P + YVLY + G +WR+Q V + D F SRKPLP WRG RD EL
Sbjct: 256 HLYSLEGEGN--PSVLYVLYAESTQPGAKWRIQCVPETQDSFTSRKPLPEAWRGFRDAEL 313
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
AGI GCVFVH +GFIGGN+++ GA+ MA AL+L
Sbjct: 314 DGVAGIDGCVFVHAAGFIGGNKTFEGAMEMAVKALEL 350
>gi|50305523|ref|XP_452721.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641854|emb|CAH01572.1| KLLA0C11671p [Kluyveromyces lactis]
Length = 333
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 149/335 (44%), Positives = 215/335 (64%), Gaps = 7/335 (2%)
Query: 13 STSSP---SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAV 69
S S P ++ LK++ TH+GSFH DEAL +M+RL ++ +++IVRSR+P+ + D V
Sbjct: 2 SVSQPLKKAKMSLKQICTHSGSFHADEALAVYMLRLLPQWKDSKIVRSRNPEDWEASDIV 61
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
+DVGG YD +DHHQ+GF E F + TKLSSAGLVYKHFGKE+I KEL
Sbjct: 62 VDVGGKYDNGVKFFDHHQRGFSETFNDKYKTKLSSAGLVYKHFGKEII-KELGPGLNEDQ 120
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRY-VNNTNLSSRVGKLNLDWTEPDQSAE 188
V L+ VY F+E++DA DNGI+Q+DTD PR+ L S + + N +W + + S E
Sbjct: 121 VELLYDRVYSQFIESLDANDNGIDQFDTDVEPRFSAKAITLPSIISRFNPEWNK-ESSDE 179
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPW 248
F + G F D V Y+ SWLPA+++V++ + R++ D SG+I+ L FCPW
Sbjct: 180 TYYNQFLKASGYIGTVFFDLVSGYINSWLPAKTLVIDAVKNRFNIDASGKIIELAEFCPW 239
Query: 249 KLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
K HLF++E EM IE I++V++ D G +RV V ++ F+ RK LP RGLRD+EL
Sbjct: 240 KEHLFQVEREMNIENTIEFVIFPDTSG-SYRVSTVPITSTSFDFRKGLPEPLRGLRDEEL 298
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
S+++G+ GCVF+H +GFIGG + A+ +A+ +L
Sbjct: 299 SEKSGLSGCVFIHAAGFIGGAKPRDSAIGLAKMSL 333
>gi|254578886|ref|XP_002495429.1| ZYRO0B11176p [Zygosaccharomyces rouxii]
gi|238938319|emb|CAR26496.1| ZYRO0B11176p [Zygosaccharomyces rouxii]
Length = 339
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 152/322 (47%), Positives = 208/322 (64%), Gaps = 4/322 (1%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K V TH+G+FH DEAL +M++L ++ +A++VRSR P+ + D V+DVGG YD +
Sbjct: 22 VKTVCTHSGTFHADEALAVYMLKLLPQYKDAKVVRSRQPEDWEKSDIVVDVGGKYDGTK- 80
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+GF E F + TKLSSAGL YKHFG+E+I LN + D+ L +Y+ F
Sbjct: 81 FFDHHQRGFYENFENR-ETKLSSAGLTYKHFGREIIKTILNKELSQSDLDFLCHKIYQQF 139
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+EA+DA DNGIN YD KP N L + ++N DW E D S ER ++ F + D
Sbjct: 140 IEALDANDNGINCYDEGKPKFIQNGITLPGVISRMNPDWNE-DNSPERFDQQFTKASDFI 198
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
G F+D V+ Y SWLPA+++V E I R+ SG+I+ K+FCPWK HLF++E E+KI
Sbjct: 199 GNVFVDLVKGYGNSWLPAKTLVREAIDNRFKVHQSGQILEFKQFCPWKDHLFDIESELKI 258
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
E +K+VL+ D G WRV V V P F+ R+ + WRGLRD ELS+++GI C+FVH
Sbjct: 259 ENQLKFVLFPDGSGA-WRVTTVPVEPGSFQFRQGILEPWRGLRDQELSEKSGIKDCIFVH 317
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
SGF GG +S GAL MA +L
Sbjct: 318 ASGFTGGVRSREGALEMALRSL 339
>gi|354544066|emb|CCE40788.1| hypothetical protein CPAR2_108260 [Candida parapsilosis]
Length = 332
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 150/324 (46%), Positives = 215/324 (66%), Gaps = 4/324 (1%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++ TH+GSFH DE+L +M++L +F NA++VRSR+P+ + D V+DVGG+YD
Sbjct: 10 MAKICTHSGSFHADESLAVYMLKLLPQFQNAELVRSRNPEDWEASDIVVDVGGIYDEKK- 68
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+ F F + TKLSSAGLVYKHFGK++I + L + +V L+ VYK F
Sbjct: 69 FFDHHQREFNTTFNEKYQTKLSSAGLVYKHFGKDIIKEVLGLSSDDKNVELLYDKVYKEF 128
Query: 142 MEAIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+E++DA DNGIN Y D P++ + N L S V KLN W E A+ + + F +L
Sbjct: 129 IESLDANDNGINNYPKDAEPKFNDRNITLPSIVSKLNPRWNESCTDADYDRQ-FNVSSEL 187
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
GK F++ + Y + WLPA++IV + +R++ DPSGEI+VL +FCPWK HL+ +E+E
Sbjct: 188 MGKVFVNLLEGYGKGWLPAKTIVEDAFNKRFEVDPSGEILVLSQFCPWKEHLYAIEKENN 247
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
E IK+VL++D GK WRV V+V+ FE R LP RGLRD+ELS++AG+ GC+F+
Sbjct: 248 AEGAIKFVLFKDSSGK-WRVSTVSVTSSSFEFRLGLPEPLRGLRDEELSEKAGVEGCIFI 306
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H +GFIGG + G L +AR +LK
Sbjct: 307 HAAGFIGGASTEEGVLKLARISLK 330
>gi|224011824|ref|XP_002294565.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969585|gb|EED87925.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 373
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 161/351 (45%), Positives = 217/351 (61%), Gaps = 29/351 (8%)
Query: 17 PSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVY 76
P+ T K +GTH+GSF DEALG +++R + + +VRSRDP LD D V+DVGGVY
Sbjct: 25 PAST--KTIGTHSGSFQADEALGVWLLRQLPLYRRSPVVRSRDPSTLDKCDIVIDVGGVY 82
Query: 77 DPSNDCYDHHQKGFEEVFGHGF---------STKLSSAGLVYKHFGKELIAKE---LNVD 124
D S YDHHQ+G++E F TKLS++GLVY+H+GK++I + LN D
Sbjct: 83 DHSKLLYDHHQRGYDERFTSKIRADGSVEERCTKLSASGLVYRHYGKDVIREHYPSLNGD 142
Query: 125 EGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPP------RYVNNTNLSSRVGKLNL 178
V + +Y +FME+IDAID G+ + + +YV+ T LS+RVG+ N
Sbjct: 143 L----VELAYEKMYNSFMESIDAIDTGVEPIPSPQEGHEAVKLQYVDRTGLSARVGRCNP 198
Query: 179 DWTEPDQSAERENEA---FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP 235
W E D++ + +A F+ + G +FL + V S LPARS V + R++ DP
Sbjct: 199 RWNEIDETTGKTPDADARFEVASSMCGDDFLSMLTHIVESDLPARSYVETAVTNRHEVDP 258
Query: 236 SGEIMVLKRF-CPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRK 294
SGEI+V PWK HL+ELE + KI PLIKYVLY D G WR+Q V+V FE+R
Sbjct: 259 SGEIIVFPSGGLPWKNHLYELEHQYKINPLIKYVLYTDQAG-MWRIQCVSVEGQAFENRL 317
Query: 295 PLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
LP +WRG+RD++L+K AGI GC F H +GFIGGN+SY GAL MAR ALK+
Sbjct: 318 SLPKEWRGVRDEDLAKVAGIEGCTFCHAAGFIGGNKSYEGALEMARVALKV 368
>gi|367040947|ref|XP_003650854.1| hypothetical protein THITE_2110738 [Thielavia terrestris NRRL 8126]
gi|346998115|gb|AEO64518.1| hypothetical protein THITE_2110738 [Thielavia terrestris NRRL 8126]
Length = 353
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 156/337 (46%), Positives = 213/337 (63%), Gaps = 17/337 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+GTHNG FH DEAL +M+R + A++VR+RDPK+LD+ V+DVGG YDP+ + Y
Sbjct: 18 IGTHNGHFHADEALAVYMLRTHIPAYSGARLVRTRDPKLLDECHTVVDVGGEYDPARNRY 77
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F F TKLSSAGLVY HFG+E++A+ L E P V ++ +Y++F+E
Sbjct: 78 DHHQRTFATSFPDR-PTKLSSAGLVYLHFGREIVARRLAQPEDSPQVGMVWRKLYESFIE 136
Query: 144 AIDAIDNGINQYDTDKPPRYV-------NNTNLSSRVGKLNLDWTEP---DQSA--EREN 191
A+DA DNGI+ YD L + VG+LN +W +P D +A E E+
Sbjct: 137 ALDAHDNGISVYDAAALAAAGLEKQFSDGGFTLGAMVGRLNPNWNDPIPEDPAAAQEAED 196
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKL 250
F+ G+EF + ++ R+WLPAR +V E A R++YD G I+VLKR PWK
Sbjct: 197 RRFELASQRIGEEFDRDLDYFTRAWLPAREVVAEAFAARHEYDSEGRIVVLKRQAAPWKD 256
Query: 251 HLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
HL+ LEE+ + YVLY + G +WR+Q V V+ D F+SR+PLP WRG RD+EL
Sbjct: 257 HLYSLEEQEPQGGKVLYVLYPEKPVPGAKWRIQCVPVAKDSFQSRRPLPEAWRGSRDEEL 316
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+GI GCVFVH SGFIGGN+++ GALAMA+ AL +
Sbjct: 317 DGLSGISGCVFVHASGFIGGNKTFEGALAMAKKALDI 353
>gi|310800141|gb|EFQ35034.1| hypothetical protein GLRG_10178 [Glomerella graminicola M1.001]
Length = 362
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 156/346 (45%), Positives = 217/346 (62%), Gaps = 33/346 (9%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL M+R + ++Q++R+RDPK+L+ V+DVGG YD YD
Sbjct: 25 IGTHNGHFHADEALAVHMLRKLPTYHDSQLIRTRDPKLLETCHTVVDVGGEYDDGKKRYD 84
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN------VDEGHPDVHRLFLAVY 138
HHQ+GF F G +TKLSSAGLV+ HFGK +IA++L+ V E P+V L+ +Y
Sbjct: 85 HHQRGFTTTFP-GKNTKLSSAGLVFMHFGKAIIAQKLSEGAEKPVTEDSPEVELLYNKLY 143
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLDW-----TE 182
++F+EA+DA DNGI+ YD P+ V L R VG+LN +W ++
Sbjct: 144 ESFVEALDAHDNGISVYD----PKAVAAAGLEKRFSEGGFTLGSVVGRLNPNWNDPVPSD 199
Query: 183 PDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL 242
P ++ + E+E F + G+EF + +Y ++WLPAR++V +R YDP G ++V
Sbjct: 200 PAEAQKLEDERFVKASRRIGEEFDADLDYYAKAWLPARAVVQAAFEKRTQYDPEGRVLVF 259
Query: 243 K-RFCPWKLHLFELEEEMKIEPLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQ 299
+ + PWK HL+ LEE +P + YVLY ++ +WRVQAV V+ D FESRKPLP
Sbjct: 260 EGQSAPWKDHLYSLEEG---KPSVVYVLYPENTAPDAKWRVQAVPVTKDSFESRKPLPEA 316
Query: 300 WRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
WRG RD+EL +GIPGCVFVH +GFIGGN+++ GA MA AL L
Sbjct: 317 WRGFRDEELDGISGIPGCVFVHAAGFIGGNKTFDGAKDMAIKALAL 362
>gi|195444116|ref|XP_002069721.1| GK11675 [Drosophila willistoni]
gi|194165806|gb|EDW80707.1| GK11675 [Drosophila willistoni]
Length = 363
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 215/343 (62%), Gaps = 18/343 (5%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL-DDLDAVLDV 72
++ P ++ R+GTHNG+FHCDE + CFM++ ++ NA+I RSRD KVL + D ++DV
Sbjct: 21 STPPKRSTPSRIGTHNGTFHCDEVVACFMLQQLPEYENAEIFRSRDDKVLRETCDIIVDV 80
Query: 73 GGVYDPSNDCYDHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDE 125
GG YD YDHHQK F+E F +LSSAGLVY H+G+ +I + L +
Sbjct: 81 GGEYDHEKKWYDHHQKPFQETLHSLHPEVSKDFDIRLSSAGLVYSHYGERVIQRILEKEH 140
Query: 126 GHP----DVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
++ F+ +Y+NF+ +DAIDNG+ ++ +P RY +T++S+R+G+LN W
Sbjct: 141 QKQLSARNLQLAFVQIYRNFISELDAIDNGVPMFEGGEP-RYKISTHISARIGRLNPSWQ 199
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
E D E+ F+Q +++AGKE + V SWL AR V E + PSGEI++
Sbjct: 200 EKDVDIEQR---FKQAIEVAGKELVQNVLEVCCSWLAARDYVREALENAKSVYPSGEILL 256
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
L FCPWK HL +LE+E IE + K V++ D G+ WRV V V+P F RK LP+ WR
Sbjct: 257 LSTFCPWKAHLADLEKEYNIEGVPKLVIFND--GQSWRVAGVPVTPSSFLGRKFLPSAWR 314
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GLRDDELS++AGI VFVH SGFIGG ++ ALAMA+ +++
Sbjct: 315 GLRDDELSEKAGIKELVFVHHSGFIGGAKTREAALAMAKKSIE 357
>gi|393908398|gb|EFO22734.2| hypothetical protein LOAG_05750 [Loa loa]
Length = 357
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 161/342 (47%), Positives = 206/342 (60%), Gaps = 14/342 (4%)
Query: 11 AYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL 70
AY S S P ++GTH+G FHCDE F+++ + N +IVRSRD +L+ D V+
Sbjct: 17 AYCLSKHSAMP--KIGTHDGKFHCDEVFAIFLLKSLPGYNNYEIVRSRDKDILNLCDIVV 74
Query: 71 DVGGVYDPSNDCYDHHQKGFEEVFG----HGFSTKLSSAGLVYKHFGKELIAKELNVDEG 126
DVG Y+ + YDHHQ+ F F TKLSSAGL+Y HFGK +I L +
Sbjct: 75 DVGDEYNHATMKYDHHQRNFAHTMNTLGVMDFDTKLSSAGLIYAHFGKSVIRALLGLQNH 134
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQS 186
V+ LF +Y+ F+E+IDAIDNGI Q+D + PRY LSSRV LN W E S
Sbjct: 135 DLVVNILFKKIYETFVESIDAIDNGIAQFDGE--PRYYLGGTLSSRVSMLNPAWNEDTIS 192
Query: 187 AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF- 245
+ E F + L GKEF + + +SWLPARS V+ + R+D D SG+I+ L+
Sbjct: 193 VD---ERFMMAIKLVGKEFTELLTHLYKSWLPARSHVINAVTHRFDVDKSGQIVCLEGGG 249
Query: 246 CPWKLHLFELEEEMKIE-PLIKYVLYEDDRGKQWRVQAVAVSPDR-FESRKPLPAQWRGL 303
PWK H F +EEE I+ I YV+YED QWRVQA+ +S + FE+R PLP WRGL
Sbjct: 250 MPWKDHFFLIEEEFHIKNDDIIYVIYEDKAKIQWRVQAIPISERQPFENRLPLPEAWRGL 309
Query: 304 RDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
RD EL K A IPGC FVH SGFIGGN+S G + MAR +L L
Sbjct: 310 RDAELIKVADIPGCTFVHPSGFIGGNESMKGVIEMARKSLSL 351
>gi|50418549|ref|XP_457791.1| DEHA2C02574p [Debaryomyces hansenii CBS767]
gi|49653457|emb|CAG85831.1| DEHA2C02574p [Debaryomyces hansenii CBS767]
Length = 335
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 144/327 (44%), Positives = 212/327 (64%), Gaps = 8/327 (2%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
+ + ++ TH+GSFH DE+L FMI+L K+ N+Q++RSR+P + D V+DVGG YD
Sbjct: 14 STMLKICTHSGSFHADESLAVFMIKLLPKYQNSQLIRSRNPSDWESSDVVIDVGGKYDGI 73
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
+DHHQ+ F E F + TKLSSAGL+YKHFGKE+I+ L++++ + D+ L+ VYK
Sbjct: 74 K-WFDHHQREFNETFSAEYKTKLSSAGLIYKHFGKEIISHVLSINDSNVDL--LYNKVYK 130
Query: 140 NFMEAIDAIDNGINQYDTD-KPPRYVNNTNLS--SRVGKLNLDWTEPDQSAERENEAFQQ 196
F+EA+DA DNGIN Y D + + N+ N++ S V +LN W A+ + AF
Sbjct: 131 EFIEALDANDNGINNYSKDVELNKKFNDRNITLPSIVSRLNPSWNTDPTDADFD-AAFST 189
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
L G+ F++ + Y +SWLPA+ IV R+D D SGEI++L FCPWK HL+ +E
Sbjct: 190 SSQLMGQVFVNLLTGYGKSWLPAKQIVERAFNSRFDVDKSGEIIILDNFCPWKEHLYSIE 249
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
++ + IK+VL++D K WR+ V+V+ FE R LP +WRG+RD+ELS+ G+PG
Sbjct: 250 KDASKQGEIKFVLFQDSSSK-WRISTVSVTSTSFEFRLGLPEKWRGIRDEELSELTGVPG 308
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAAL 343
C+FVH +GFIGG + L +A+ +L
Sbjct: 309 CIFVHAAGFIGGANTKDAVLQLAKLSL 335
>gi|328874110|gb|EGG22476.1| metal-dependent protein hydrolase domain-containing protein
[Dictyostelium fasciculatum]
Length = 327
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 201/319 (63%), Gaps = 12/319 (3%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+GSFH DEAL CF+++LTD+F +A+I+RSRD +V+ D +DVG Y+ S YD
Sbjct: 21 ICTHSGSFHADEALACFLLKLTDQFKDAKIIRSRDTEVVKAADVAVDVGAEYNQSKHRYD 80
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ GF E+FG GF TKLSSAGL+YKHFGKE+I L +DE V+ ++ VY N +E
Sbjct: 81 HHQAGFTEIFGDGFKTKLSSAGLIYKHFGKEIIKNRLQLDE--RKVNLIYKKVYANAIEE 138
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNGI +Y D P Y + + +RV LN W EP Q + + F++ M + G+
Sbjct: 139 LDGMDNGIERYPIDVKPLYAVTSTIGNRVASLNPSWNEP-QDDDILFKQFEKAMTMMGEY 197
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
FLD V +Y +SW+PA IV + R SGEI++L ++CPWK HL+ +E+ + I+
Sbjct: 198 FLDKVDYYGKSWVPAYDIVETAVKNRSSVHSSGEIIILDQYCPWKDHLYHVEDVLSIQTK 257
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
IK+VLY+D G WR+QAV + F RK L WRG RD ELS + +G
Sbjct: 258 IKFVLYQDTLG-SWRIQAVNLDSHSFALRKALLEAWRGKRDQELSD--------IIDANG 308
Query: 325 FIGGNQSYGGALAMARAAL 343
FIGG+++ GAL MA +L
Sbjct: 309 FIGGHKTKEGALTMAIKSL 327
>gi|448100575|ref|XP_004199384.1| Piso0_002821 [Millerozyma farinosa CBS 7064]
gi|359380806|emb|CCE83047.1| Piso0_002821 [Millerozyma farinosa CBS 7064]
Length = 335
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 154/339 (45%), Positives = 218/339 (64%), Gaps = 9/339 (2%)
Query: 9 SPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA 68
S A + + T LK V TH+GSFH DE+L FMIRL K+ ++++RSR+P + D
Sbjct: 2 SEAIKRAKLASTMLK-VCTHSGSFHADESLAVFMIRLLPKYKGSEVIRSRNPGDWESSDI 60
Query: 69 VLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHP 128
V+DVGG YD +DHHQ+ F E F + TKLSSAGL+YKHFGKE+I ++++E
Sbjct: 61 VIDVGGKYDGVK-FFDHHQREFMETFSKDYQTKLSSAGLIYKHFGKEIIRTVVDLEES-- 117
Query: 129 DVHRLFLAVYKNFMEAIDAIDNGINQY--DTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQ 185
+V L+ +YK ++EA+DA DNGIN Y + + ++ + + L + V LN W
Sbjct: 118 EVELLYQKIYKEYIEALDANDNGINNYPREVEASKKFNDKSITLPAIVSSLNPSWVTDPT 177
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF 245
A+ ++ AFQ +L GK FL+ V Y +SW+PA+ +V E +R++ D SGEI++L+RF
Sbjct: 178 DADFDS-AFQSSSELMGKVFLNLVENYGKSWIPAKRLVQEAFDKRFEVDKSGEIIILERF 236
Query: 246 CPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRD 305
CPWK HL+ +E++ + IK+VL+ D GK WRV VAVS F+ R L WRGLRD
Sbjct: 237 CPWKEHLYAIEKDANAQGSIKFVLFCDSSGK-WRVSTVAVSSGSFDFRLGLLENWRGLRD 295
Query: 306 DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+ELSK +G+ GC+FVH SGFIGG QS L +A+ +LK
Sbjct: 296 EELSKVSGVDGCIFVHASGFIGGAQSKEAVLELAKISLK 334
>gi|343470452|emb|CCD16851.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 388
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 170/379 (44%), Positives = 216/379 (56%), Gaps = 47/379 (12%)
Query: 4 TGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL 63
TGVS P+++ + P+ +GTHNGSFHCDEAL C M+R + +F A I+R+RD +L
Sbjct: 15 TGVSVLPSFTDAGSGVFPV--IGTHNGSFHCDEALACGMLRCSAEFSTANILRTRDASLL 72
Query: 64 DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHG---FSTKLSSAGLVYKHFGKELIA-- 118
+ + V+DVGG Y+ +DHHQ F+ + T+LSSAGLVYKH+G+E+I
Sbjct: 73 EKCNIVVDVGGEYNAETLRFDHHQPSFQTTMTTSKSVYRTRLSSAGLVYKHYGREIIQRY 132
Query: 119 ---------------------KELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDT 157
K N+ + D +F VYKNF+E ID IDNG+N Y
Sbjct: 133 VEAALSSSYRSELITMTSWDLKRKNLTDSELDT--IFDIVYKNFVEHIDGIDNGVNSYGP 190
Query: 158 DKPP-----------RYVNNTNLSSRVGKLNLDWTEPDQ-SAERENEAFQQGMDLAGKEF 205
YV T LS RVG+L W E D + E EN AF Q ++L EF
Sbjct: 191 AAQVDVADAGSLCVRNYVVKTALSDRVGQLMPWWNENDSGNIESENAAFLQAVELTLLEF 250
Query: 206 LDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHLFELEEEMKIEPL 264
+ V FYV +WLPAR +V PSG I+VLK R+CPWK HL E+E E E
Sbjct: 251 ITAVHFYVFTWLPARGLVESAFLGAEKVHPSGRIIVLKERYCPWKDHLLEIETERGKEGH 310
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+ YVL+ D G WRVQAV FESRK LP WRGLRD ELS+ +G+ G VFVH+SG
Sbjct: 311 VLYVLFADKSG--WRVQAVPKDAVGFESRKALP--WRGLRDAELSEASGVDGGVFVHVSG 366
Query: 325 FIGGNQSYGGALAMARAAL 343
FIGGN++Y GALAMA AL
Sbjct: 367 FIGGNKTYEGALAMATKAL 385
>gi|156043107|ref|XP_001588110.1| hypothetical protein SS1G_10556 [Sclerotinia sclerotiorum 1980]
gi|154694944|gb|EDN94682.1| hypothetical protein SS1G_10556 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 350
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 150/336 (44%), Positives = 214/336 (63%), Gaps = 16/336 (4%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +M+RL + +++++R+RDPK+L+ V+DVGG Y+ YD
Sbjct: 16 IGTHNGHFHADEALAVYMLRLLPTYQSSELIRTRDPKILETCHTVVDVGGEYNNETKRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F TKLSSAGLVY H+GK +IA+ L V E V ++ +Y++F+EA
Sbjct: 76 HHQRTFDTTFPDR-PTKLSSAGLVYMHYGKAIIAQRLGVSEDAEQVEIIWKKIYQSFIEA 134
Query: 145 IDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDW-----TEPDQSAERENE 192
+DA DNGI+ YD ++ N + L + VG+LN +W ++P ++ + E+E
Sbjct: 135 LDAHDNGISVYDPKAISAAGLEKKFSNGSFTLGAMVGRLNPNWNDPAPSDPVEAQKAEDE 194
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLH 251
F G+EF + +Y +SWLPAR+IV + A+R YD G I+V + + PWK H
Sbjct: 195 KFLIASTRIGEEFSRDLDYYTKSWLPARAIVQQAYAKRLQYDSKGRILVFEGQSVPWKDH 254
Query: 252 LFELEEEMKIEPLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
L+ +E+E K E I YVLY ++ +WR+Q V S D F+SRKPLP WRG RD+ELS
Sbjct: 255 LYTIEDEEKSENEILYVLYPENTAPDAKWRIQCVPASKDSFQSRKPLPEAWRGFRDEELS 314
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+ GIPG VFVH +GFIGGN+++ GA MA AA+ L
Sbjct: 315 ELTGIPGGVFVHAAGFIGGNKTFEGASKMAAAAVDL 350
>gi|238879325|gb|EEQ42963.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 334
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 214/321 (66%), Gaps = 4/321 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+GSFH DE+L ++++L K+ A++VRSR+P+ + D V+DV G YD +
Sbjct: 14 KICTHSGSFHADESLAVYLLKLLPKYSQAELVRSRNPEDWESSDIVVDVSGKYDGVK-YF 72
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F+ F + TKLSSAGLVYKHFGKE+I K L +++G+P++ L+ +YK F+E
Sbjct: 73 DHHQREFDTTFNENYKTKLSSAGLVYKHFGKEIIKKVLKLEDGNPNIELLYDKIYKEFVE 132
Query: 144 AIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
++DA DNGI+ Y D P++ + N L S V +LN W E A+ + + F + +L G
Sbjct: 133 SLDANDNGISNYPKDVSPKFTDKNITLPSIVSRLNPSWNESCTDADYDRQ-FLKSSELMG 191
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
F+ + Y +SWLPA+SIV + R++ D SGEI+VL +FCPWK HL+ +E+E E
Sbjct: 192 TVFVSLLEGYGKSWLPAKSIVAKAFDNRFEVDKSGEILVLDQFCPWKEHLYNIEKEKNAE 251
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
LIK+VL++D GK WRV V+V+ FE R LP + RGLRD ELS+++GI GC+F+H
Sbjct: 252 GLIKFVLFKDSSGK-WRVSTVSVTSTSFEFRLGLPQELRGLRDQELSEKSGIDGCIFIHA 310
Query: 323 SGFIGGNQSYGGALAMARAAL 343
+GFIGG + L ++R +L
Sbjct: 311 AGFIGGANTKEAVLELSRLSL 331
>gi|68486508|ref|XP_712863.1| hypothetical protein CaO19.12007 [Candida albicans SC5314]
gi|46434280|gb|EAK93694.1| hypothetical protein CaO19.12007 [Candida albicans SC5314]
Length = 354
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 215/323 (66%), Gaps = 4/323 (1%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++ TH+GSFH DE+L ++++L K+ A++VRSR+P+ + D V+DV G YD
Sbjct: 32 MAKICTHSGSFHADESLAVYLLKLLPKYSQAELVRSRNPEDWESSDIVVDVSGKYDGVK- 90
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+ F+ F ++TKLSSAGLVYKHFGKE+I L +++G+P++ L+ +YK F
Sbjct: 91 YFDHHQREFDTTFNENYTTKLSSAGLVYKHFGKEIIKGVLKLEDGNPNIELLYDKIYKEF 150
Query: 142 MEAIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+E++DA DNGI+ Y D P++ + N L S V +LN W E A+ + + F + +L
Sbjct: 151 VESLDANDNGISNYPKDVSPKFTDKNITLPSIVSRLNPSWNESCTDADYDRQ-FLKSSEL 209
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G F+ + Y +SWLPA+SIV + R++ D SGEI+VL +FCPWK HL+ +E+E
Sbjct: 210 MGTVFVSLLEGYGKSWLPAKSIVAKAFDNRFEVDKSGEILVLDQFCPWKEHLYNIEKENN 269
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
E LIK+VL++D GK WRV V+V+ FE R LP + RGLRD ELS+++GI GC+F+
Sbjct: 270 AEGLIKFVLFKDSSGK-WRVSTVSVTSTSFEFRLGLPQELRGLRDQELSEKSGIDGCIFI 328
Query: 321 HMSGFIGGNQSYGGALAMARAAL 343
H +GFIGG + L +AR +L
Sbjct: 329 HAAGFIGGANTKEAVLELARLSL 351
>gi|302914331|ref|XP_003051114.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256732052|gb|EEU45401.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 346
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 158/335 (47%), Positives = 208/335 (62%), Gaps = 19/335 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL M+R + +A +VR+RDP VL+ V+DVGG YDP +D
Sbjct: 16 IGTHNGHFHADEALAVHMLRRLPAYRDAGLVRTRDPAVLETCHTVVDVGGEYDPERRRFD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+GF F G TKLSSAGLV+ HFG+ ++A LN E PDV L+ +Y+NF+EA
Sbjct: 76 HHQRGFATTF-PGRPTKLSSAGLVFMHFGRAIVADRLNQPEDSPDVELLYNKLYENFVEA 134
Query: 145 IDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEA---- 193
+DA DNGI+ YD + R+ N L + VG+LN +W +P + E +A
Sbjct: 135 LDAHDNGISLYDPEAIAAAGIEKRFSNGGFGLGAVVGRLNPNWNDPAPADRAEAQAAEDA 194
Query: 194 -FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLH 251
F + G+EF + Y SWLPARSIV + A+R YD G I+VL+ + PWK H
Sbjct: 195 KFNEASRRIGEEFDRDLDGYASSWLPARSIVEQAFAKRSQYDEQGRILVLEGQSVPWKDH 254
Query: 252 LFELEEEMKIEPLIKYVLYED--DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
L+ LE E+ P + YVLY + + G +WR+Q V S D F SRKPLP WRG RD+EL
Sbjct: 255 LYALENEV---PSVLYVLYAEKPEPGAKWRIQCVPESKDSFVSRKPLPEAWRGFRDEELD 311
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+GIPGCVFVH +GFIGGN+++ GA MA AL+
Sbjct: 312 GISGIPGCVFVHAAGFIGGNKTFEGAKQMATKALE 346
>gi|145538722|ref|XP_001455061.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422849|emb|CAK87664.1| unnamed protein product [Paramecium tetraurelia]
Length = 337
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 211/342 (61%), Gaps = 22/342 (6%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMI-RLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
Q K++GTHNG+FH DE L C M+ + T++F N I RSRDP V D ++DVGGVYD
Sbjct: 2 QQVFKKIGTHNGAFHVDEVLACAMLTKYTNEFKNGIITRSRDPAVWAQQDILVDVGGVYD 61
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--------------NV 123
P YDHHQK F++ F + F+ +LSSAGL+YKHFG E+I + V
Sbjct: 62 PQTHRYDHHQKEFQQSFSNDFNIRLSSAGLIYKHFGLEIIQNVIAHINATTETTIEIQKV 121
Query: 124 DEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP 183
DE ++ +++ +YKNF+++IDAIDNGINQY + +Y NT+LS+ + + N W E
Sbjct: 122 DE--KTLNLIYIKLYKNFIQSIDAIDNGINQYPNQEQLKYQINTHLSAVINRFNPTWCEK 179
Query: 184 DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK 243
+Q EN F Q ++ E + V+ W P R VV+ R++ D SG+I+ L
Sbjct: 180 NQD---ENAKFHQAVEFVTTELISQVKSIYLGWYPGRCYVVQAFDSRFEQDQSGQIIKLP 236
Query: 244 RFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGL 303
PWK H+F++E+E LIK+VLY DR + WRVQAV+ + D FE+RK L ++WRG+
Sbjct: 237 LALPWKSHVFDIEQERGTVGLIKFVLYP-DRNEGWRVQAVSANEDSFENRKSLKSEWRGV 295
Query: 304 RD-DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+D ++L +GI VFVH SGFIGG +SY L +A+ +L+
Sbjct: 296 KDIEQLKAISGIDDIVFVHASGFIGGAKSYENTLKIAKISLE 337
>gi|448519457|ref|XP_003868081.1| hypothetical protein CORT_0B09420 [Candida orthopsilosis Co 90-125]
gi|380352420|emb|CCG22646.1| hypothetical protein CORT_0B09420 [Candida orthopsilosis]
Length = 332
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 214/322 (66%), Gaps = 4/322 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+GSFH DE+L +M++L +F NA++VRSR+P+ + D V+DVGG YD +
Sbjct: 12 KICTHSGSFHADESLAVYMLKLLPQFQNAELVRSRNPQDWESSDIVVDVGGKYD-GKKFF 70
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F+ F + TKLSSAGLVYKHFG+++I + L + ++ L+ VYK F+E
Sbjct: 71 DHHQREFDTTFNEKYQTKLSSAGLVYKHFGRDIIKEVLGLSLEDKNIELLYDKVYKEFIE 130
Query: 144 AIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
++DA DNGIN Y D P++ + N L S V KLN W E A+ + + F +L G
Sbjct: 131 SLDANDNGINNYPKDVEPKFNDRNITLPSIVSKLNPRWNESCTDADYDRQ-FNISSELMG 189
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
K F++ + Y + WLPA+SIV E +R++ DPSGEI+VL +FCPWK HL+ +E++ E
Sbjct: 190 KVFVNLLEGYGKGWLPAKSIVEEAFNKRFEVDPSGEILVLSQFCPWKEHLYVIEKDNNAE 249
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
IK+VL++D GK WRV V+V+ FE R LP RGLRD+ELS++AGI GC+F+H
Sbjct: 250 GAIKFVLFKDSSGK-WRVSTVSVTSSSFEFRLGLPEPLRGLRDEELSEKAGIEGCIFIHA 308
Query: 323 SGFIGGNQSYGGALAMARAALK 344
+GFIGG S G L +AR +LK
Sbjct: 309 AGFIGGAASEEGVLKLARMSLK 330
>gi|241948255|ref|XP_002416850.1| conserved hypothetical protein [Candida dubliniensis CD36]
gi|223640188|emb|CAX44437.1| conserved hypothetical protein [Candida dubliniensis CD36]
Length = 334
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 214/321 (66%), Gaps = 4/321 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+GSFH DE+L ++++L K+ A++VRSR P+ + D V+DV G YD +
Sbjct: 14 KICTHSGSFHADESLAVYLLKLLPKYSQAELVRSRSPEDWESSDIVVDVSGKYDGVK-YF 72
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F+ F + TKLSSAGLVYKHFG+E+I + L +++G+P++ L+ +YK F+E
Sbjct: 73 DHHQREFDTTFNEHYKTKLSSAGLVYKHFGREIIKEVLKLEDGNPNIELLYDKIYKEFIE 132
Query: 144 AIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
++DA DNGIN Y D P++ + N L S V +LN W E A+ + + F + +L G
Sbjct: 133 SLDANDNGINNYPKDVSPKFTDKNITLPSIVSRLNPSWNESCTDADYDRQ-FLKSSELMG 191
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
F+ + Y +SWLPA+SIV + +R++ D SGEI+VL +FCPWK HL+ +E+E E
Sbjct: 192 TVFVSLLEGYGKSWLPAKSIVAKAFDDRFEVDKSGEILVLDQFCPWKEHLYNIEKENNAE 251
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
LIK+VL++D GK WRV V+V+ FE R LP + RGLRD ELS+++G+ GC+F+H
Sbjct: 252 GLIKFVLFKDSSGK-WRVSTVSVTSTSFEFRLGLPQELRGLRDQELSEKSGVDGCIFIHA 310
Query: 323 SGFIGGNQSYGGALAMARAAL 343
+GFIGG + L +AR +L
Sbjct: 311 AGFIGGANTKEAVLQLARLSL 331
>gi|388492028|gb|AFK34080.1| unknown [Medicago truncatula]
Length = 234
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 130/215 (60%), Positives = 172/215 (80%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVG 73
+SS S + KRVGTHNG+FHCDEAL CFM+RL+ F A IVR+RD +L+ LDAV+DVG
Sbjct: 14 SSSFSTSRAKRVGTHNGTFHCDEALACFMLRLSKLFSGADIVRTRDSNLLEVLDAVVDVG 73
Query: 74 GVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
VYDP YDHHQ+ F++VFG+GF TKLSSAGL+YKHFG E+IA L++DE HP VH+L
Sbjct: 74 RVYDPKRHRYDHHQRDFDQVFGNGFVTKLSSAGLIYKHFGLEIIANVLHLDEDHPHVHQL 133
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ A+Y+NF+EA+DA+DNG++QYD + P+Y+ NT+L+ RV +LN DW + DQSA+ ENEA
Sbjct: 134 YPAIYRNFVEAVDAVDNGVSQYDLKESPKYIINTDLAFRVERLNFDWIDSDQSADAENEA 193
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIA 228
F + M LAG EF++ V +Y +SWLPA+SIV+EC+A
Sbjct: 194 FHRAMALAGGEFVENVNYYAKSWLPAQSIVMECLA 228
>gi|380485069|emb|CCF39601.1| hypothetical protein CH063_10384 [Colletotrichum higginsianum]
Length = 343
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 157/346 (45%), Positives = 217/346 (62%), Gaps = 33/346 (9%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL M+R + ++Q++R+RDPK+L+ V+DVGG YD S YD
Sbjct: 6 IGTHNGHFHADEALAVHMLRKLPTYHDSQLIRTRDPKLLETCHTVVDVGGEYDDSKKRYD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN------VDEGHPDVHRLFLAVY 138
HHQ+GF F G +TKLSSAGLV+ HFGK +IA++L+ V E +V L+ +Y
Sbjct: 66 HHQRGFTTTFP-GKNTKLSSAGLVFMHFGKAIIAQKLSEGAEQAVAEDSAEVELLYKKLY 124
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLDW-----TE 182
++F+EA+DA DNGI+ YD P+ V L R VG+LN +W ++
Sbjct: 125 ESFVEALDAHDNGISVYD----PKAVAAAGLEKRFSEGGFTLGSVVGRLNPNWNDPVPSD 180
Query: 183 PDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL 242
P ++ + E+E F + G+EF + +Y ++WLPAR++V +R YDP G ++VL
Sbjct: 181 PAEAQKLEDERFVKASRRIGEEFDADLDYYAKAWLPARAVVQAAFEKRTQYDPEGRVLVL 240
Query: 243 K-RFCPWKLHLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQ 299
+ + PWK HL+ LEE +P + YVLY + G +WRVQAV V+ D FESRKPLP
Sbjct: 241 EGQSAPWKDHLYSLEEG---KPSVVYVLYPEKPAPGAKWRVQAVPVTKDSFESRKPLPEA 297
Query: 300 WRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
WRG RD+EL +GIPG VFVH +GFIGGN+++ GA MA AL L
Sbjct: 298 WRGFRDEELDGISGIPGGVFVHAAGFIGGNKTFEGARDMAVKALSL 343
>gi|68486317|ref|XP_712957.1| hypothetical protein CaO19.4532 [Candida albicans SC5314]
gi|46434381|gb|EAK93792.1| hypothetical protein CaO19.4532 [Candida albicans SC5314]
Length = 354
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 214/323 (66%), Gaps = 4/323 (1%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++ TH+GSFH DE+L ++++L K+ A++VRSR+P+ + D V+DV G YD
Sbjct: 32 MAKICTHSGSFHADESLAVYLLKLLPKYSQAELVRSRNPEDWESSDIVVDVSGKYDGVK- 90
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+ F+ F ++TKLSSAGLVYKHFGKE+I L +++G+P++ L+ +YK F
Sbjct: 91 YFDHHQREFDTTFNENYTTKLSSAGLVYKHFGKEIIKGVLKLEDGNPNIELLYDKIYKEF 150
Query: 142 MEAIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+E++DA DNGI+ Y D P++ + N L S V +LN W E A+ + + F + +L
Sbjct: 151 VESLDANDNGISNYPKDVSPKFTDKNITLPSIVSRLNPSWNESCTDADYDRQ-FLKSSEL 209
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G F+ + Y +SWLPA+SIV + R++ D SGEI+VL +FCPWK HL+ +E+E
Sbjct: 210 MGTVFVSLLEGYGKSWLPAKSIVAKAFDNRFEVDKSGEILVLDQFCPWKEHLYNIEKENN 269
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
E LIK+VL++D GK WRV V+V+ FE R LP + RGLRD ELS+++GI C+F+
Sbjct: 270 AEGLIKFVLFKDSSGK-WRVSTVSVTSTSFEFRLGLPQELRGLRDQELSEKSGIDSCIFI 328
Query: 321 HMSGFIGGNQSYGGALAMARAAL 343
H +GFIGG + L +AR +L
Sbjct: 329 HAAGFIGGANTKEAVLELARLSL 351
>gi|302421964|ref|XP_003008812.1| MYG1 [Verticillium albo-atrum VaMs.102]
gi|261351958|gb|EEY14386.1| MYG1 [Verticillium albo-atrum VaMs.102]
Length = 337
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 154/338 (45%), Positives = 205/338 (60%), Gaps = 27/338 (7%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL M+R + A ++R+RDPK+L+ V+DVGG YD + YD
Sbjct: 6 IGTHNGHFHADEALAVHMLRQLPAYEGASLIRTRDPKLLETCHTVVDVGGEYDAEKNRYD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+GF F G STKLSSAGLV+ HFG+ +IA+++ E PDV L Y++F+EA
Sbjct: 66 HHQRGFTTTFP-GRSTKLSSAGLVFLHFGRAIIAQKMGTAEDSPDVALLHNKFYESFIEA 124
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLDW-----TEPDQSAE 188
+DA DNGI+ YD P V+ L R VG+ N +W +P+Q+
Sbjct: 125 LDAHDNGISVYD----PAAVSAAGLEKRFSEGGFTLGAMVGRFNGNWNDPVIADPEQAQA 180
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CP 247
E++ F+ G+EF +++Y ++WLPAR++V R +DP G I+V P
Sbjct: 181 AEDKRFEAASARIGEEFDRDLQYYAKAWLPARAVVKAAFDARAQHDPEGRILVFDGVSAP 240
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQWRGLRD 305
WK HL+ LEE +P + YVLY + G +WRVQ V V+ D F+SRKPLP WRG RD
Sbjct: 241 WKDHLYSLEEG---KPAVLYVLYPEKPVPGAKWRVQTVPVTKDSFQSRKPLPEAWRGFRD 297
Query: 306 DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
EL AGI GCVFVH +GFIGGN ++ GALAMA AL
Sbjct: 298 QELDGIAGIDGCVFVHAAGFIGGNATFEGALAMATKAL 335
>gi|401625984|gb|EJS43954.1| YER156C [Saccharomyces arboricola H-6]
Length = 338
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 145/327 (44%), Positives = 212/327 (64%), Gaps = 8/327 (2%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K++ TH+GSFH DE+L +M+RL +F +A++VRSR+PK + D ++DVG YD
Sbjct: 14 VKQICTHSGSFHADESLAVYMLRLLPEFKDAKLVRSRNPKDWEASDILVDVGAKYDGVK- 72
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+GF E F + TKLSSAGL++KH+G+E+I LN D+ L+ VYK F
Sbjct: 73 FFDHHQRGFFETFNENYKTKLSSAGLIFKHYGREIIKTILNGKASSADLELLYDKVYKQF 132
Query: 142 MEAIDAIDNGINQY----DTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+EA+DA DNGI++Y D+D P + +N + V +N +W E D S E ++ F +
Sbjct: 133 VEALDANDNGISKYTIPRDSDLAPNFRDNAIGIPGIVSGMNPNWNE-DSSDESFDKCFVR 191
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
G+ F+ V+ Y SWLPA+++V + I ER D D SG+I+VL +FCPWK HL+ELE
Sbjct: 192 ASAFIGEVFVTLVKGYGESWLPAKTLVAQAIDERMDVDKSGKIIVLPQFCPWKEHLYELE 251
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
E IE I++VL+ D G WRV V ++ F+ R+ LP RGLRD+ELS ++G+PG
Sbjct: 252 REKNIEKQIEFVLFADSSG-AWRVSTVPINSTSFQFRRGLPEPLRGLRDEELSTKSGVPG 310
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAAL 343
C+F+H +GFIGG +S +A+ +L
Sbjct: 311 CIFIHAAGFIGGAKSKEAVYELAKMSL 337
>gi|213403962|ref|XP_002172753.1| UPF0160 protein c [Schizosaccharomyces japonicus yFS275]
gi|212000800|gb|EEB06460.1| UPF0160 protein c [Schizosaccharomyces japonicus yFS275]
Length = 322
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 146/326 (44%), Positives = 202/326 (61%), Gaps = 12/326 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KR+ TH+G FH DE+L +M+R +++ +A+++R+RD +++D D +DVGG +D
Sbjct: 5 KRIATHSGKFHADESLAVYMLRCLEEYRDAKVIRTRDLELIDSCDIAVDVGGKFD-GVKY 63
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN---VDEGHPDVHRLFLAVYK 139
+DHHQ+ F + F + TKLSSAGLVYK+FGK +I L + E D+ L + +Y+
Sbjct: 64 FDHHQREFNDTFSPDYKTKLSSAGLVYKYFGKRVITSILPSAPITESQLDL--LHVKIYR 121
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+E +DA DNGI+ Y + P + ++ +L V L +W Q + E F +
Sbjct: 122 EFIEGLDADDNGISPYPAELKPAFRSSLSLPGMVSMLFPEWNSDKQDDDAIYEQFMKASR 181
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
+ G F V++Y SWLPA+SIV + E S I+V ++ CPWK HLFE+EEE
Sbjct: 182 MMGHWFEAAVKYYTLSWLPAKSIVESAVNEA----GSSPIIVFQKSCPWKSHLFEIEEEK 237
Query: 260 KIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
I KY LY D GK WR+QAV++SPD F SR PLP WRG+RDD LSK GIPGC+F
Sbjct: 238 NIVGQFKYALYSD--GKNWRIQAVSISPDSFVSRLPLPEPWRGVRDDALSKLTGIPGCIF 295
Query: 320 VHMSGFIGGNQSYGGALAMARAALKL 345
VH SGFIGGN +Y G L MA+ AL
Sbjct: 296 VHASGFIGGNATYEGVLEMAKKALNF 321
>gi|340520590|gb|EGR50826.1| predicted protein [Trichoderma reesei QM6a]
Length = 377
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 160/359 (44%), Positives = 210/359 (58%), Gaps = 41/359 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
VGTHNG FH DEAL M+R+ + +++VR+RDPK+L+ V+DVGG YD + YD
Sbjct: 19 VGTHNGHFHADEALAVHMLRMLPTYRESRLVRTRDPKLLETCHTVVDVGGEYDAQRNRYD 78
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN---------------------- 122
HHQ+GFE F G TKLSSAGLVY HFG+ LIA+ L
Sbjct: 79 HHQRGFETTFP-GKQTKLSSAGLVYMHFGRALIAQRLAEERNKPASAAATSEEQKEEEGK 137
Query: 123 --VDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDK------PPRYVN-NTNLSSRV 173
V E PDV L+ +Y++F+EA+DA DNGI++Y D PR+ + L + V
Sbjct: 138 EVVSEESPDVELLYNKIYESFIEAVDAHDNGISKYPRDALAAASIEPRFSSAGFTLGAMV 197
Query: 174 GKLNLDW-----TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIA 228
G+LN +W +PD + E+ F G+EF T+ F R+WLPARSIV
Sbjct: 198 GRLNPNWNDPKPADPDAAQAHEDALFLTASRRIGEEFERTLDFMARAWLPARSIVQTAFD 257
Query: 229 ERYDYDPSGEIMVLK-RFCPWKLHLFELEEEMKI-EPLIKYVLYEDDR--GKQWRVQAVA 284
R +DP G I+VL + PWK HL+ LEE+ + +P + YVLY + G +WRVQ V
Sbjct: 258 ARRKHDPEGRILVLGGQSVPWKDHLYSLEEDKEDGKPSVLYVLYAESTQPGAKWRVQCVP 317
Query: 285 VSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+ D F SR+PLP WRG RD EL AG+ GCVFVH +GFIGGN+++ GAL MA AL
Sbjct: 318 ETADSFTSRRPLPEAWRGFRDAELDAVAGVDGCVFVHAAGFIGGNKTFEGALEMATKAL 376
>gi|241695280|ref|XP_002413037.1| secreted protein, putative [Ixodes scapularis]
gi|215506851|gb|EEC16345.1| secreted protein, putative [Ixodes scapularis]
Length = 351
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 211/332 (63%), Gaps = 13/332 (3%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
T K +GTHNG+FHCDEAL CF+++ + +A IVRSRD VL D V+DVGGVYDP+
Sbjct: 23 TAAKTIGTHNGTFHCDEALACFLLKQLPAYEDANIVRSRDSAVLSTCDIVVDVGGVYDPA 82
Query: 80 NDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
+ +DHHQ+ F E ++TKLSSAGLVY HFG+E+IA+ L P+V +++
Sbjct: 83 TNRFDHHQRTFLETMHSLNESKKWTTKLSSAGLVYAHFGREVIARVLGWKLDDPNVEKIY 142
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
VY+NFME IDAIDNG+ +D + PRY TNLS+RV LN ++ +E F
Sbjct: 143 DKVYENFMEEIDAIDNGVKMFDEE--PRYRICTNLSARVAHLN---PPWNEPNPNPDERF 197
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLF 253
Q+ + L G+EFLD VR Y W+PAR +V I +R D G ++ CPWK HL
Sbjct: 198 QKALKLTGEEFLDRVRSYATIWMPARDLVFNAIQQRQKVDGLGSVVTFSSGCCPWKDHLL 257
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPL-PAQWRGLRDDELSKEA 312
ELEEE+ I +K+VLY+D+ K WRVQ V S F+ R+ P+ W GLRD ELS +
Sbjct: 258 ELEEELGIPGEVKFVLYQDE-NKAWRVQGVPPSLGSFDCRQVYSPSPWCGLRDKELSDAS 316
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GI GCVFVH +GFIGGN++ GAL MA LK
Sbjct: 317 GIEGCVFVHSNGFIGGNKTREGALQMAVHTLK 348
>gi|195499291|ref|XP_002096886.1| GE25920 [Drosophila yakuba]
gi|194182987|gb|EDW96598.1| GE25920 [Drosophila yakuba]
Length = 358
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 221/343 (64%), Gaps = 20/343 (5%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD-LDAVLDVG 73
+ P ++PL +GTH+G+FHCDE + CFM++ +++ NA+I RSRD K L + D ++DVG
Sbjct: 18 TPPKRSPL-WIGTHSGTFHCDEVVACFMLKQLEEYENAEIFRSRDDKALQEKCDIIVDVG 76
Query: 74 GVYDPSNDCYDHHQKGFEEVFG-------HGFST-KLSSAGLVYKHFGKELIAKELNVDE 125
GVYD + YDHHQ+ F+E F F+ +LSSAGLVY H+G+ +I L ++
Sbjct: 77 GVYDHAKKLYDHHQRTFKETFSSLRPEVSEDFNVIRLSSAGLVYSHYGERVIQSILQREK 136
Query: 126 G---HPDVHRL-FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
G P+ +L F+ +Y+NF+ +DAIDNG++ ++ +P Y +T+LS+R+GKLN W
Sbjct: 137 GIKLSPENLQLAFIQIYRNFISELDAIDNGVSMFEGAEPI-YKISTHLSARIGKLNPSWQ 195
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
E E + F+Q MD AG+E +D V SW+ AR V E + P+GEI+V
Sbjct: 196 ETGVDIE---DRFRQAMDTAGRELVDNVVEVCCSWIAARDHVREALNNAKSVHPTGEILV 252
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
LK FCPWK HL +LE+E K+E + K +++ D G WRV V V+P F+ RK LP WR
Sbjct: 253 LKNFCPWKSHLADLEKEYKVEGVPKLIVFND--GSSWRVAGVPVTPGSFQGRKFLPTPWR 310
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GLRDDELS++AGI +FVH SGFIGG ++ A+ +A+ +++
Sbjct: 311 GLRDDELSQKAGIKDLIFVHHSGFIGGAKTEDAAMLLAKKSVE 353
>gi|195572403|ref|XP_002104185.1| GD18587 [Drosophila simulans]
gi|194200112|gb|EDX13688.1| GD18587 [Drosophila simulans]
Length = 362
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 153/343 (44%), Positives = 219/343 (63%), Gaps = 20/343 (5%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD-LDAVLDVG 73
+ P ++P+ +GTHNG+FHCDE + CFM++ D++ NA+I RSRD K L++ D +LDVG
Sbjct: 19 TPPKRSPI-WIGTHNGTFHCDEVVACFMLKQLDEYKNAEIFRSRDNKALEEKCDIILDVG 77
Query: 74 GVYDPSNDCYDHHQKGFEEVFG-------HGFST-KLSSAGLVYKHFGKELIAKELNVDE 125
GVYD + YDHHQ+ F+E F F+ +LSSAGLVY H+G+ +I L ++
Sbjct: 78 GVYDHAKKLYDHHQQTFKETFSSVRPEVSEDFNVIRLSSAGLVYCHYGERVIQSILQREK 137
Query: 126 G---HPDVHRL-FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
G P+ +L F+ +Y+NF+ +DAIDNG+ ++ +P Y T+LS+R+GKLN W
Sbjct: 138 GIQLSPENLQLAFIQIYRNFINELDAIDNGVPMFEGVEPI-YKITTHLSARIGKLNPSWQ 196
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
E E + F+Q MD AG+EF+D V SW+ AR V E + P+GEI+V
Sbjct: 197 ETGVDIE---DRFRQAMDTAGREFVDNVVEVSCSWIAARDHVREALENAKSVHPTGEILV 253
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
LK FCPWK HLF+LE+E K+E + K V++ D G WRV V V+P F RK LP WR
Sbjct: 254 LKNFCPWKSHLFDLEKEYKVEGVPKLVVFND--GTSWRVAGVPVTPSSFLGRKFLPTPWR 311
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GLRDDEL ++A I VF+H +GFIGG ++ A+ +A+ +++
Sbjct: 312 GLRDDELCEKASIKDLVFIHHNGFIGGAKTEEAAMLLAKKSIE 354
>gi|74025572|ref|XP_829352.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834738|gb|EAN80240.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 394
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 166/385 (43%), Positives = 215/385 (55%), Gaps = 53/385 (13%)
Query: 4 TGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL 63
+G+ P+ + S P+ +GTHNGSFHCDEAL C M+R + +F A ++R+RD K L
Sbjct: 15 SGLRVLPSLADVDTSSFPV--IGTHNGSFHCDEALACGMLRCSGQFRAANVLRTRDAKTL 72
Query: 64 DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF---GHGFSTKLSSAGLVYKHFGKELIAKE 120
D V+DVGGVYD +DHHQ F + + T+LSSAGLVYKHFG+E+I +
Sbjct: 73 DRCSIVVDVGGVYDADALRFDHHQPTFHDTMKTPKSMYRTRLSSAGLVYKHFGREIIQRY 132
Query: 121 L-----------------------NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDT 157
+ N+ E D +F VYKNF+E ID IDNG+N +
Sbjct: 133 VEAALSSSYRAELLKMGSWSESRKNLSEAELDT--VFDIVYKNFVEHIDGIDNGVNSFGP 190
Query: 158 DKPP-----------------RYVNNTNLSSRVGKLNLDWTE-PDQSAERENEAFQQGMD 199
Y T LS R+G+L W E D + E EN AF Q ++
Sbjct: 191 ATETVTASGCSADAQSASCVRNYAVTTTLSDRIGRLMPWWNEGGDGNPESENTAFLQAVE 250
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHLFELEEE 258
LA EF+ V F+ SW+PAR +V + PSG I+VLK RFCPWK HL E+E E
Sbjct: 251 LALSEFVAAVHFFTFSWMPARGLVEDAFHAAESVHPSGRIIVLKERFCPWKDHLLEIEAE 310
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+ YVL+ D G WRVQAV FESRKPLP W+G+RD +LS+ +G+ GCV
Sbjct: 311 HGKVGHVLYVLFSDKSG--WRVQAVPKDAVSFESRKPLP--WKGIRDADLSEASGVEGCV 366
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
FVH+SGFIGGN++Y GAL MA AL
Sbjct: 367 FVHVSGFIGGNKTYEGALQMATKAL 391
>gi|324508491|gb|ADY43583.1| Unknown [Ascaris suum]
Length = 338
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 201/323 (62%), Gaps = 12/323 (3%)
Query: 30 GSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKG 89
G + CFM++ +F + I+R+RDP L + V+DVGGVYD YDHHQ+G
Sbjct: 12 GILKVVQVFACFMLKSLPEFSHYDIIRTRDPSTLSNCSIVVDVGGVYDHDKLRYDHHQRG 71
Query: 90 FEEVFGH----GFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAI 145
F + F TKLSSAGL+Y H+GK +IA+ L + + +V L+ VY+ F+EA+
Sbjct: 72 FNDTMKTLNVLDFETKLSSAGLIYAHYGKRVIAELLALRDDSTEVDILYKKVYEAFVEAV 131
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEF 205
DAIDNGI Q+D PRY LSSRVG LN W E D E+ F + M L G EF
Sbjct: 132 DAIDNGIPQFDG--VPRYHLGGTLSSRVGNLNPAWNEEDIDIEKR---FHEAMKLVGVEF 186
Query: 206 LDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEEEMKI-EP 263
LD ++++ RSWLPAR +V EC+ R+D D SG+I+ L K PWK H F LE+E +
Sbjct: 187 LDRLKYFHRSWLPARDVVAECVKNRFDVDKSGQILALDKGAVPWKEHFFTLEKEHNLLNA 246
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
I Y+++ D WRVQA+ + FE+R PLP WRG R+DELS+ AGIP C+F HM
Sbjct: 247 QITYIVFADATSGDWRVQAIPLDEKATFENRLPLPESWRGYRNDELSELAGIPSCIFTHM 306
Query: 323 SGFIGGNQSYGGALAMARAALKL 345
+GFIGGN++ GA+ MA+ +L++
Sbjct: 307 TGFIGGNRTRDGAVEMAKRSLQI 329
>gi|325186990|emb|CCA21534.1| protein MYG1 putative [Albugo laibachii Nc14]
Length = 376
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 149/348 (42%), Positives = 208/348 (59%), Gaps = 19/348 (5%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD--LDAVLDV 72
+S Q + +GTH+GSFHCDEAL +++L K+ + I+R+R+P L DAV DV
Sbjct: 26 TSSVQEGKRYIGTHHGSFHCDEALAVSLLKLLPKYKDHDILRTREPTKLGTGTCDAVADV 85
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELI------AKELNVDEG 126
GGVY+P+ + +DHHQ+ F F KLSSAGLVYKH+G+++I + + +
Sbjct: 86 GGVYEPNTNRFDHHQQEFNTTFSDKHRIKLSSAGLVYKHYGRDIIQLLSQLSIQPTTEAT 145
Query: 127 HP--------DVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNL 178
+P V ++ VY +F+E ID IDNG+ +T +P Y +T LSSRVG LN
Sbjct: 146 NPLPAVLPSETVDLVYEKVYTSFIEHIDGIDNGVGVAETSQPLNYQVSTTLSSRVGYLNP 205
Query: 179 DWTEP-DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSG 237
W +P N FQ M L EF++TV + WLPARS+V ER + PSG
Sbjct: 206 SWNDPRSNEVNYVNSRFQDAMYLTVTEFIETVIHCIHVWLPARSLVEAAFQERLQHHPSG 265
Query: 238 EIMVLKRFCPWKLHLFELEEEMKIEPL--IKYVLYEDDRGKQWRVQAVAVSPDRFESRKP 295
I+ ++CPWK HL+E+EE K+ + +V+Y ++ RVQAV +P F RK
Sbjct: 266 RILYFPQYCPWKSHLYEVEEAHKLSAAQQVLFVIYHENTENTLRVQAVNTAPGSFALRKS 325
Query: 296 LPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
L WRGLR +ELS +GIP CVFVH +GFIGGN+++GGAL+MA +L
Sbjct: 326 LNEAWRGLRGEELSTISGIPNCVFVHNAGFIGGNKNFGGALSMAVQSL 373
>gi|195157486|ref|XP_002019627.1| GL12112 [Drosophila persimilis]
gi|194116218|gb|EDW38261.1| GL12112 [Drosophila persimilis]
Length = 344
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 214/343 (62%), Gaps = 19/343 (5%)
Query: 15 SSPSQTPLKR-VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL-DDLDAVLDV 72
+SPS+ P +GTHNG+FHCDE + CFM++ ++ NA+I RSRD K L + D ++DV
Sbjct: 4 ASPSKRPTPLWIGTHNGTFHCDEVVACFMLKQLPEYENAEIFRSRDDKELREKCDVIVDV 63
Query: 73 GGVYDPSNDCYDHHQKGFEEVFG-------HGFSTKLSSAGLVYKHFGKELI----AKEL 121
G VYD YDHHQ F+E F F+ +LSSAGL+Y H+G+ +I +E
Sbjct: 64 GSVYDHDKKWYDHHQTSFKETFNILRPEVSKDFNIRLSSAGLIYTHYGERVIQSILKRER 123
Query: 122 NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
NV ++ F+ +Y NF+ +DAIDNG++ YD +P RY NT+LS+RVG+LN W
Sbjct: 124 NVQLSPDNLQLAFVQIYGNFISELDAIDNGVSMYDGGEP-RYKINTHLSARVGRLNTSWQ 182
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
+ D E+ F+Q MD+AG+EF+D V SW+ AR V + E P+GEI++
Sbjct: 183 DTDVDNEQR---FKQAMDVAGREFVDNVLEVACSWITARDHVRTALKEAKTIYPTGEIIL 239
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
L FCPWK HL +LE+E +E + K V++ D G WRV V +SP F RK LP WR
Sbjct: 240 LSTFCPWKAHLADLEKEYHVEGVPKLVIFSD--GSGWRVAGVPLSPGSFVGRKFLPTPWR 297
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
G RD++LS+ AGI +FVH +GFIGG ++ ALA+A+ +++
Sbjct: 298 GARDEKLSELAGIKDLIFVHHNGFIGGAKTMEAALALAKKSVE 340
>gi|312077509|ref|XP_003141335.1| hypothetical protein LOAG_05750 [Loa loa]
Length = 332
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 156/331 (47%), Positives = 202/331 (61%), Gaps = 12/331 (3%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++GTH+G FHCDE F+++ + N +IVRSRD +L+ D V+DVG Y+ +
Sbjct: 1 MPKIGTHDGKFHCDEVFAIFLLKSLPGYNNYEIVRSRDKDILNLCDIVVDVGDEYNHATM 60
Query: 82 CYDHHQKGFEEVFG----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
YDHHQ+ F F TKLSSAGL+Y HFGK +I L + V+ LF +
Sbjct: 61 KYDHHQRNFAHTMNTLGVMDFDTKLSSAGLIYAHFGKSVIRALLGLQNHDLVVNILFKKI 120
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y+ F+E+IDAIDNGI Q+D + PRY LSSRV LN W E S + E F
Sbjct: 121 YETFVESIDAIDNGIAQFDGE--PRYYLGGTLSSRVSMLNPAWNEDTISVD---ERFMMA 175
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELE 256
+ L GKEF + + +SWLPARS V+ + R+D D SG+I+ L+ PWK H F +E
Sbjct: 176 IKLVGKEFTELLTHLYKSWLPARSHVINAVTHRFDVDKSGQIVCLEGGGMPWKDHFFLIE 235
Query: 257 EEMKIE-PLIKYVLYEDDRGKQWRVQAVAVSPDR-FESRKPLPAQWRGLRDDELSKEAGI 314
EE I+ I YV+YED QWRVQA+ +S + FE+R PLP WRGLRD EL K A I
Sbjct: 236 EEFHIKNDDIIYVIYEDKAKIQWRVQAIPISERQPFENRLPLPEAWRGLRDAELIKVADI 295
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
PGC FVH SGFIGGN+S G + MAR +L L
Sbjct: 296 PGCTFVHPSGFIGGNESMKGVIEMARKSLSL 326
>gi|380018939|ref|XP_003693376.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Apis florea]
Length = 329
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 146/328 (44%), Positives = 210/328 (64%), Gaps = 16/328 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G FHCDEAL CFM++ ++ +A I+RSR+ +L+ D V+DVGG Y+P Y
Sbjct: 6 KIGTHDGCFHCDEALACFMLKTLPRYKDAIIIRSRNMNILNTCDIVVDVGGEYNPCKHRY 65
Query: 84 DHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DHH + F E G+ + TKLSSAGL+Y HFG E+I KEL + D+ +F
Sbjct: 66 DHHMRDFNESVSTIIKKSGYDWKTKLSSAGLIYCHFGHEII-KELVPEANDNDIELIFKY 124
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y F++ ID+IDNGI+ ++ P+Y T+LSSR+ LN W + D + N F +
Sbjct: 125 IYNTFIQEIDSIDNGISI--CNESPKYQIVTDLSSRIKFLNPPWNDKDLNP---NTQFLK 179
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
++L G++F+ V + WLPARSIV E IA+R++ DP+GEI+ L + PW HLFE+E
Sbjct: 180 AVELTGQDFVQHVNYTANVWLPARSIVEEAIAKRFEVDPNGEIIELSQCVPWFQHLFEIE 239
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+E I+PL+KYV+++DD +R++ V V P+ F+ R LP W GL++D L K GI G
Sbjct: 240 KEQNIKPLLKYVIFKDD---TYRIRCVPVEPNSFKCRLFLPESWAGLQNDTLVKVCGIEG 296
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
FVH FIGGN++ G L MAR AL+
Sbjct: 297 ATFVHSVRFIGGNRTREGILMMARKALE 324
>gi|195330534|ref|XP_002031958.1| GM23777 [Drosophila sechellia]
gi|194120901|gb|EDW42944.1| GM23777 [Drosophila sechellia]
Length = 362
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 221/343 (64%), Gaps = 20/343 (5%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD-LDAVLDVG 73
+ P ++P+ +GTHNG+FHCDE + CFM++ D++ NA+I RSRD K L++ D +LDVG
Sbjct: 19 TPPKRSPI-WIGTHNGTFHCDEVVACFMLKQLDEYKNAEIFRSRDNKALEEKCDIILDVG 77
Query: 74 GVYDPSNDCYDHHQKGFEEVFG-------HGFST-KLSSAGLVYKHFGKELIAKELNVDE 125
GVYD + YDHHQ+ F+E F F+ +LSSAGLVY H+G+ +I L ++
Sbjct: 78 GVYDHAKKLYDHHQQTFKETFSSVRPEVSEDFNVIRLSSAGLVYCHYGERVIQSILQREK 137
Query: 126 G---HPDVHRL-FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
G P+ +L F+ +Y+NF+ +DAIDNG+ ++ +P Y +T+LS+R+GKLN W
Sbjct: 138 GIQLSPENLKLAFIQIYRNFINELDAIDNGVPMFEGVEPI-YKISTHLSARIGKLNPSWQ 196
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
E E + F+Q MD AG+EF+D V SW+ AR V E + P+GEI+V
Sbjct: 197 ETGVDIE---DRFRQAMDTAGREFVDNVVEVSCSWIAARDHVREALKNAKSVHPTGEILV 253
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
LK FCPWK HLF+LE+E K+E + K V++ + G WRV V+V+P F RK LP WR
Sbjct: 254 LKNFCPWKSHLFDLEKEYKVEGVPKLVVF--NNGTSWRVAGVSVTPSSFLGRKFLPTPWR 311
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GLRDDEL ++A I +F+H +GFIGG ++ A+ +A+ +++
Sbjct: 312 GLRDDELCEKASIKDLLFIHHNGFIGGAKTEEAAMLLAKKSIE 354
>gi|367008396|ref|XP_003678698.1| hypothetical protein TDEL_0A01550 [Torulaspora delbrueckii]
gi|359746355|emb|CCE89487.1| hypothetical protein TDEL_0A01550 [Torulaspora delbrueckii]
Length = 334
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 215/329 (65%), Gaps = 14/329 (4%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K++ TH+GSFH DEAL +M+RL +F +A+I+RSR P D D V+DV G YD
Sbjct: 13 VKQICTHSGSFHADEALAVYMLRLLPEFKDAKILRSRVPTDWDQSDIVVDVSGKYDGVK- 71
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHP---DVHRLFLAVY 138
+DHHQ+ F E F TKLSSAGLVYKHFG+++I L G P DV L+L VY
Sbjct: 72 FFDHHQREFFETFDGEHKTKLSSAGLVYKHFGRDIIKSIL----GGPSKDDVEILYLKVY 127
Query: 139 KNFMEAIDAIDNGINQYDTD----KPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
K F+EA+DA DNGI+++D + KP + + +N +W + D SAE+ +E F
Sbjct: 128 KQFIEALDANDNGISKFDAEEANIKPKFSDTAITIPGIISGMNPNWND-DCSAEKFDEQF 186
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
+ G F++ V Y +SWLPA+ +V E + R + D SG+I++LK+FCPWK HLFE
Sbjct: 187 FKASSFIGGVFVNLVDGYGKSWLPAKELVQEAVKNRKNVDKSGKIIILKQFCPWKEHLFE 246
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
+E+++ I+ I++VL+E D K WRV V +S ++ RK LP +RGLRD+ELSK++GI
Sbjct: 247 VEKQLGIKDEIEFVLFE-DSSKTWRVSTVPISASSYKFRKGLPEPFRGLRDEELSKKSGI 305
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
P CVF+H +GFIGG ++ GAL MA+ +L
Sbjct: 306 PDCVFIHAAGFIGGTKTKDGALKMAQMSL 334
>gi|255730749|ref|XP_002550299.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240132256|gb|EER31814.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 335
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 210/321 (65%), Gaps = 4/321 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+GSFH DE+L ++++L KF A++VRSR+P+ + D V+DV G YD +
Sbjct: 14 KICTHSGSFHADESLAVYLLKLLPKFSEAELVRSRNPEDWESSDIVVDVSGKYDGIK-YF 72
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F F + TKLSSAGLVYKHFGK++I L D+ +P+V L+ +YK F+E
Sbjct: 73 DHHQREFNTTFNDKYQTKLSSAGLVYKHFGKDIIKHVLKWDDSNPNVELLYDKIYKEFIE 132
Query: 144 AIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
++DA DNG+N Y D P++ + N L + V KLN W E A+ + + F + +L G
Sbjct: 133 SLDANDNGVNNYPKDISPKFTDKNITLPAIVSKLNPSWNESCTDADFDRQ-FLKSSELMG 191
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
FL+ + Y SW+PA++IV + R++ D SGEI++L +FCPWK HL+ +E+E E
Sbjct: 192 TVFLNLLEGYGNSWIPAKTIVAKAFENRFNVDKSGEILILDQFCPWKEHLYNIEKENNAE 251
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
IK+VL++D GK WRV V+V+ FE R LP + RGLRD+ELSK AG+ GC+F+H
Sbjct: 252 GEIKFVLFKDSTGK-WRVSTVSVTSTSFEFRLGLPEELRGLRDEELSKAAGVEGCIFIHA 310
Query: 323 SGFIGGNQSYGGALAMARAAL 343
+GFIGG Q+ L +AR +L
Sbjct: 311 AGFIGGAQTEEAVLKLARMSL 331
>gi|261335327|emb|CBH18321.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 394
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 166/385 (43%), Positives = 215/385 (55%), Gaps = 53/385 (13%)
Query: 4 TGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL 63
+G+ P+ + S P+ +GTHNGSFHCDEAL C M+R + +F A ++R+RD K L
Sbjct: 15 SGLRVLPSLADVDTSSFPV--IGTHNGSFHCDEALACGMLRCSGQFRAANVLRTRDAKTL 72
Query: 64 DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF---GHGFSTKLSSAGLVYKHFGKELIAKE 120
D V+DVGGVYD +DHHQ F + + T+LSSAGLVYKHFG+E+I +
Sbjct: 73 DRCSIVVDVGGVYDADALRFDHHQPTFHDTMKTPKSMYRTRLSSAGLVYKHFGREIIQRY 132
Query: 121 L-----------------------NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDT 157
+ N+ E D +F VYKNF+E ID IDNG+N +
Sbjct: 133 VEAALSSSYRAELLKMGSWSESRKNLSEAELDT--VFDIVYKNFVEHIDGIDNGVNSFGP 190
Query: 158 DKPP-----------------RYVNNTNLSSRVGKLNLDWTE-PDQSAERENEAFQQGMD 199
Y T LS R+G+L W E D + E EN AF Q ++
Sbjct: 191 ATETVTASGCSADAQSASCVRNYAVTTTLSDRIGRLMPWWNEGGDGNPEGENTAFLQAVE 250
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHLFELEEE 258
LA EF+ V F+ SW+PAR +V + PSG I+VLK RFCPWK HL E+E E
Sbjct: 251 LALSEFVAAVHFFTFSWMPARGLVEDAFHAAESVHPSGRIIVLKERFCPWKDHLLEIEAE 310
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+ YVL+ D G WRVQAV FESRKPLP W+G+RD +LS+ +G+ GCV
Sbjct: 311 HGKVGHVLYVLFSDKSG--WRVQAVPKDAVSFESRKPLP--WKGIRDADLSEASGVEGCV 366
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
FVH+SGFIGGN++Y GAL MA AL
Sbjct: 367 FVHVSGFIGGNKTYEGALQMATKAL 391
>gi|290981552|ref|XP_002673494.1| metal binding protein [Naegleria gruberi]
gi|284087078|gb|EFC40750.1| metal binding protein [Naegleria gruberi]
Length = 355
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 141/324 (43%), Positives = 203/324 (62%), Gaps = 6/324 (1%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++GTH+GSFHCDE+L ++ L ++ + ++R+RD +L D ++DVG +YD
Sbjct: 22 KKIGTHSGSFHCDESLAIGLLSLLKEYQDGIVIRTRDETILSQCDIIVDVGAIYDAQKHR 81
Query: 83 YDHHQKGFEEVFGHGFSTK--LSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
YDHHQ F++ F + K LSSAGL+YKHFG+++I + + ++L +Y N
Sbjct: 82 YDHHQASFKDTFDNDKFNKIRLSSAGLIYKHFGRQIIEELVGEKATIDQKDDIYLRMYAN 141
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F+E IDA DNGI +D +Y T L +RV + N W +P + E N F + ++L
Sbjct: 142 FIEHIDANDNGIEV--SDGELKYKITTTLPNRVSRFNPKWNQP-STDESLNLGFSKAIEL 198
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
EFL+++ FYV WLPA IV R+ D SGEI++ +FCPWK HL+ LEE+
Sbjct: 199 TRSEFLESLSFYVDDWLPAYQIVERAFNSRFQVDSSGEIVLFDQFCPWKAHLYVLEEKTN 258
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
IK+ L++D +G WRVQAV S F SR PL WRG+RD+ELS+++GIPGC+FV
Sbjct: 259 SIGSIKFALFQDVKG-DWRVQAVPQSESSFTSRVPLHKDWRGIRDEELSQKSGIPGCIFV 317
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H SGFIGG +SY AL +A+ +L+
Sbjct: 318 HASGFIGGAKSYESALQLAKLSLE 341
>gi|171678021|ref|XP_001903961.1| hypothetical protein [Podospora anserina S mat+]
gi|170937079|emb|CAP61738.1| unnamed protein product [Podospora anserina S mat+]
Length = 352
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 209/335 (62%), Gaps = 17/335 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+GTHNG FH DEAL +M+R + NA++VR+RDPK+LD+ D V+DVGG Y+P+ Y
Sbjct: 18 IGTHNGHFHADEALAVYMLRKHIPTYANAKLVRTRDPKLLDECDIVVDVGGEYEPARHRY 77
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F F +TKLSSAGLVY HFGK++IA+ L+ E V ++ +Y++F+E
Sbjct: 78 DHHQRSFSTTFPER-ATKLSSAGLVYMHFGKQIIARRLSQPEESEQVGLVWNKIYQSFVE 136
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLS-------SRVGKLNLDWTEP---DQSAER--EN 191
A+DA DNGI+ YD S + VG+LN +W EP D A + E+
Sbjct: 137 ALDAHDNGISAYDAAGLAAAGLEKKFSDGGFTLGAMVGRLNPNWNEPIPEDPVAAQAAED 196
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKL 250
+ F+ G+EF + ++ +WLPAR +V E A R ++D G IMVLK+ PWK
Sbjct: 197 QRFELASQRIGEEFDRGLDYFTSAWLPAREVVAEAFAARNEFDAGGRIMVLKKQSAPWKD 256
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
HL+ LEE+ + YVLY + +WR+Q V + D F+SRKPLP WRG RD+EL
Sbjct: 257 HLYTLEEQNPEAGKVLYVLYPEKPTPDAKWRIQCVPETKDSFQSRKPLPEAWRGFRDEEL 316
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+G+PG VFVH +GFIGGN ++ GALAMA+ AL
Sbjct: 317 DGISGVPGSVFVHAAGFIGGNNTFDGALAMAQKAL 351
>gi|170594023|ref|XP_001901763.1| Hypothetical UPF0160 protein C27H6.8 in chromosome V [Brugia
malayi]
gi|158590707|gb|EDP29322.1| Hypothetical UPF0160 protein C27H6.8 in chromosome V, putative
[Brugia malayi]
Length = 331
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 154/331 (46%), Positives = 205/331 (61%), Gaps = 13/331 (3%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++GTH+GSFHCDE F+++ ++ N +IVRSRD VL D V+DVGG Y+ +
Sbjct: 1 MPKIGTHDGSFHCDEVFAIFLLKSLPEYNNYEIVRSRDKDVLSLCDIVVDVGGEYNHAAM 60
Query: 82 CYDHHQKGFEEVFG----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
YDHHQ+ F F TKLSSAGL+Y HFGK +I+ L + + LF V
Sbjct: 61 KYDHHQRDFAHTMNTLGVMNFHTKLSSAGLIYAHFGKNVISSLLGLQHD-SIIDVLFKKV 119
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y+ F+E+IDAIDNGI Q+D PRY LSSR+ LN W E + +E F
Sbjct: 120 YQTFVESIDAIDNGIAQFDG--KPRYYLGGTLSSRISMLNPAWNEDTVNV---HERFMMA 174
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELE 256
+ L KEF + + + +SWLPARS ++ + RYD D SG+I L+ PWK H F +E
Sbjct: 175 IKLVDKEFNELLTYLHKSWLPARSHIINAVTHRYDVDKSGQIFCLEGGGMPWKDHFFLIE 234
Query: 257 EEMKIE-PLIKYVLYEDDRGKQWRVQAVAVSPDR-FESRKPLPAQWRGLRDDELSKEAGI 314
E+ ++ I YV+YED+ QWRVQA+ V+ + FE+R PLP WRGLRD EL+K A I
Sbjct: 235 EQFHLKNDDIIYVIYEDNVNAQWRVQAIPVNERQPFENRLPLPEAWRGLRDAELTKVADI 294
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
PGC+FVH SGFIGGN+S G + MAR +L L
Sbjct: 295 PGCIFVHPSGFIGGNKSMQGVIEMARKSLSL 325
>gi|194744580|ref|XP_001954771.1| GF16574 [Drosophila ananassae]
gi|190627808|gb|EDV43332.1| GF16574 [Drosophila ananassae]
Length = 359
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 156/344 (45%), Positives = 214/344 (62%), Gaps = 20/344 (5%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL-DDLDAVLDV 72
++ P ++PL +GTHNG+FHCDE + CFM++ ++ NA+I RSRD K L + D ++DV
Sbjct: 19 STPPKRSPL-WIGTHNGTFHCDEVVACFMLKQLPEYENAEIFRSRDDKQLREKCDIIVDV 77
Query: 73 GGVYDPSNDCYDHHQKGFEEVFG-------HGFST-KLSSAGLVYKHFGKELIAKELNVD 124
GG YD YDHHQK F E F+ +LSSAGLVY H+G+ +I L +
Sbjct: 78 GGEYDHGKKWYDHHQKTFGETLSSIRPEVSEEFNVIRLSSAGLVYSHYGERVIQSILQKE 137
Query: 125 EG---HPDVHRL-FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW 180
G P +L F+ +Y+N + +DAIDNG+ ++ +P Y +T+LS+R+GKLN W
Sbjct: 138 RGIQLSPQNLKLAFIQIYRNLISELDAIDNGVPMFEGGEPV-YKISTHLSARIGKLNPSW 196
Query: 181 TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
E E+ F Q M++AG+EF+ V SW+ AR V E + PSGEI+
Sbjct: 197 QETSVDIEQR---FSQAMEVAGREFVQNVIEVSCSWIAARDYVREALENAKSVYPSGEIL 253
Query: 241 VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
VLK FCPWK HL +LE+E K+E + K V++ D G WRV V VSP F RK LP W
Sbjct: 254 VLKTFCPWKAHLADLEKEYKVEGVPKLVVFND--GTSWRVAGVPVSPSSFVGRKFLPTPW 311
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RGLRD+ELSK+AG+ G VFVH SGFIGG +S AL+MA+ +++
Sbjct: 312 RGLRDEELSKKAGVEGLVFVHHSGFIGGAKSEEAALSMAKKSIE 355
>gi|71418464|ref|XP_810858.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70875455|gb|EAN89007.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 394
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 161/373 (43%), Positives = 211/373 (56%), Gaps = 47/373 (12%)
Query: 16 SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
+ + T +GTHNGSFHCDEA+ C ++R + F + I+RSRDP L+ + V+DVG V
Sbjct: 24 AETSTATGVIGTHNGSFHCDEAMACGLLRCSQNFGQSNILRSRDPNALERCNIVVDVGSV 83
Query: 76 YDPSNDCYDHHQKGFEEVFGHG---FSTKLSSAGLVYKHFGKELIAKELNVDEGHP---- 128
YD + +DHHQ F + + T+LSSAGLVYKHFG+E+I + P
Sbjct: 84 YDEATLRFDHHQPSFHDTMKTPKAVYQTRLSSAGLVYKHFGREIIQGYVESALASPYRVK 143
Query: 129 -----------------DVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPP---------- 161
++ LF VYKNF+E ID IDNG+N Y +
Sbjct: 144 LLDATKWGTDKKKLSEQELDTLFDIVYKNFVEHIDGIDNGVNAYGPAETEGEEGAVPVSS 203
Query: 162 -------RYVNNTNLSSRVGKLNLDWTEP-DQSAERENEAFQQGMDLAGKEFLDTVRFYV 213
+Y +T LS+R+G L W E + E EN AF Q M+LA EF D+V ++V
Sbjct: 204 SSPSCVRKYNVSTTLSARIGNLMPWWNEEGNGKVENENAAFLQAMELATSEFFDSVHYHV 263
Query: 214 RSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEEEMKIEPLIKYVLYED 272
+W+PAR IV E + PSG I+V K FCPWK HL ELE E + YVL+ D
Sbjct: 264 FAWMPARGIVRAAFLEAMNVHPSGRIVVFKDCFCPWKEHLLELEGEHGKVGHVLYVLFAD 323
Query: 273 DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSY 332
+G WRVQAV FE+RK LP WRGLRD+ELS+ +GI G +FVH+SGFIGGN++Y
Sbjct: 324 KKG--WRVQAVPKEASGFENRKSLP--WRGLRDEELSQASGIEGGIFVHVSGFIGGNKTY 379
Query: 333 GGALAMARAALKL 345
GAL MA AL +
Sbjct: 380 EGALQMAVKALTV 392
>gi|198455056|ref|XP_001359839.2| GA11307 [Drosophila pseudoobscura pseudoobscura]
gi|198133073|gb|EAL28991.2| GA11307 [Drosophila pseudoobscura pseudoobscura]
Length = 344
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 213/343 (62%), Gaps = 19/343 (5%)
Query: 15 SSPSQTPLKR-VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL-DDLDAVLDV 72
+SPS+ P +GTHNG+FHCDE + CFM++ ++ NA+I RSRD K L + D ++DV
Sbjct: 4 ASPSKRPTPLWIGTHNGTFHCDEVVACFMLKQLPEYENAEIFRSRDDKELREKCDVIVDV 63
Query: 73 GGVYDPSNDCYDHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELI----AKEL 121
G VYD YDHHQ F+E F F+ +LSSAGL+Y H+G+ +I +E
Sbjct: 64 GSVYDHDKKWYDHHQTSFKETFNILRPEVSKDFNIRLSSAGLIYTHYGERVIQSILKRER 123
Query: 122 NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
N+ ++ F+ +Y NF+ +DAIDNG N YD +P RY NT+LS+RVG+LN W
Sbjct: 124 NIQLSPDNLQLAFVQIYGNFISELDAIDNGENMYDGGEP-RYKINTHLSARVGRLNPSWQ 182
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
+ D E+ F+Q MD+AG+EF+D V SW+ AR V + E P+GEI++
Sbjct: 183 DTDVDIEQR---FKQAMDVAGREFVDNVLEVACSWIAARDHVRTALKEAKTIYPTGEIIL 239
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
L FCPWK HL +LE+E +E + K V++ D G WRV V +SP F RK LP WR
Sbjct: 240 LSTFCPWKAHLADLEKEYHVEGVPKLVIFSD--GSGWRVAGVPLSPGSFVGRKFLPTPWR 297
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
G RD++LS+ AGI +FVH +GFIGG ++ ALA+A+ +++
Sbjct: 298 GARDEKLSELAGIKDLIFVHHTGFIGGAKTKEAALALAKKSVE 340
>gi|398364965|ref|NP_011083.3| hypothetical protein YER156C [Saccharomyces cerevisiae S288c]
gi|731528|sp|P40093.1|YEY6_YEAST RecName: Full=UPF0160 protein YER156C
gi|603396|gb|AAB64683.1| Yer156cp [Saccharomyces cerevisiae]
gi|45269463|gb|AAS56112.1| YER156C [Saccharomyces cerevisiae]
gi|285811789|tpg|DAA07817.1| TPA: hypothetical protein YER156C [Saccharomyces cerevisiae S288c]
Length = 338
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 143/326 (43%), Positives = 212/326 (65%), Gaps = 8/326 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++ TH+GSFH DE+L +M+RL +F +A++VRSR+PK + D ++DVG YD
Sbjct: 15 KQICTHSGSFHADESLAVYMLRLLPEFKDAKLVRSRNPKDWEASDILVDVGAQYDGVK-F 73
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF E F + TKLSSAGL++KH+G+++I LN D+ L+ VYK F+
Sbjct: 74 FDHHQRGFFETFNEKYKTKLSSAGLIFKHYGRDIIKTILNNKVSSSDLDLLYDKVYKQFV 133
Query: 143 EAIDAIDNGINQY----DTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
EA+DA DNGI++Y D++ P + +N ++ + +N +W E D S E + F +
Sbjct: 134 EALDANDNGISKYTIPKDSNLEPNFRDNAISIPGIISGMNPNWNE-DTSDESFDRCFARA 192
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ G F+ VR Y +SWLPA+++V + I ER D D SG+I+VL +FCPWK HL+ELE
Sbjct: 193 SEFIGGVFVTLVRGYGQSWLPAKALVAQAIDERMDVDKSGKIIVLPQFCPWKEHLYELER 252
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E IE I++VL+ D G WRV V ++ F+ R+ LP RGLRD+ELS ++G+PGC
Sbjct: 253 EKNIEKQIEFVLFTDSSG-AWRVSTVPINSTSFQFRRGLPEPLRGLRDEELSTKSGVPGC 311
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+F+H +GFIGG +S +A+ +L
Sbjct: 312 IFIHAAGFIGGAKSKEAVYELAKMSL 337
>gi|392299860|gb|EIW10952.1| hypothetical protein CENPK1137D_3570 [Saccharomyces cerevisiae
CEN.PK113-7D]
Length = 326
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 143/326 (43%), Positives = 212/326 (65%), Gaps = 8/326 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++ TH+GSFH DE+L +M+RL +F +A++VRSR+PK + D ++DVG YD
Sbjct: 3 KQICTHSGSFHADESLAVYMLRLLPEFKDAKLVRSRNPKDWEASDILVDVGAQYDGVK-F 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF E F + TKLSSAGL++KH+G+++I LN D+ L+ VYK F+
Sbjct: 62 FDHHQRGFFETFNEKYKTKLSSAGLIFKHYGRDIIKTILNNKVSSSDLDLLYDKVYKQFV 121
Query: 143 EAIDAIDNGINQY----DTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
EA+DA DNGI++Y D++ P + +N ++ + +N +W E D S E + F +
Sbjct: 122 EALDANDNGISKYTIPKDSNLEPNFRDNAISIPGIISGMNPNWNE-DTSDESFDRCFARA 180
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ G F+ VR Y +SWLPA+++V + I ER D D SG+I+VL +FCPWK HL+ELE
Sbjct: 181 SEFIGGVFVTLVRGYGQSWLPAKALVAQAIDERMDVDKSGKIIVLPQFCPWKEHLYELER 240
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E IE I++VL+ D G WRV V ++ F+ R+ LP RGLRD+ELS ++G+PGC
Sbjct: 241 EKNIEKQIEFVLFTDSSG-AWRVSTVPINSTSFQFRRGLPEPLRGLRDEELSTKSGVPGC 299
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+F+H +GFIGG +S +A+ +L
Sbjct: 300 IFIHAAGFIGGAKSKEAVYELAKMSL 325
>gi|383854472|ref|XP_003702745.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Megachile
rotundata]
Length = 329
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 146/329 (44%), Positives = 207/329 (62%), Gaps = 16/329 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G FHCDEAL CFM++ ++ +A IVRSRD +LD D V+DVGG Y+PS Y
Sbjct: 6 KIGTHDGCFHCDEALACFMLKTLPQYKDAVIVRSRDKSILDTCDIVIDVGGEYNPSKHRY 65
Query: 84 DHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DHH + F+E G+ KLSSAGL+Y HFG E+I K L + DV +F
Sbjct: 66 DHHMRDFKESLSTVVKKPGYNSDIKLSSAGLIYCHFGHEII-KHLIPQLSNNDVEAVFKQ 124
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y F++ ID IDNGI + + P Y T+LSSRV LN+ W + +++ +E F +
Sbjct: 125 IYNTFIKEIDGIDNGIPMFKEE--PVYGIVTDLSSRVQFLNVPWNDKTRTS---DEQFPK 179
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
++L G+EFL + + WLPA+ IV + IA+R++ DPSGEI+ L + PW HLFELE
Sbjct: 180 AVELTGQEFLQHLNYAAHVWLPAKIIVQDAIAKRFEVDPSGEIIELSQPVPWSEHLFELE 239
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+EM +EPL+KY+++++ +R++ V V P F +R LP W GLR++ L G+PG
Sbjct: 240 KEMNVEPLLKYMIFKES---NYRIRCVPVKPKSFVARLFLPEAWAGLRNEALETACGVPG 296
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
FVH F GG+ + GAL MAR AL+L
Sbjct: 297 AEFVHTIRFTGGHNTREGALMMARKALEL 325
>gi|361128123|gb|EHL00076.1| putative UPF0160 protein C27H6.8 [Glarea lozoyensis 74030]
Length = 350
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 208/340 (61%), Gaps = 24/340 (7%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +M+RL + +++++R+RDP +LD V+DVGG YD + + YD
Sbjct: 16 IGTHNGHFHADEALAVYMLRLLPTYNDSKLIRTRDPALLDTCHTVVDVGGEYDAAKERYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F + TKLSSAGLVY HFGK +IA++L V E +V ++ +Y +F+EA
Sbjct: 76 HHQRTFNTTFPNR-PTKLSSAGLVYMHFGKAIIAQKLGVAEDAEEVSVIWEKIYTSFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLDWTEPD-----QSAE 188
+DA DNGI+ YD P+ ++ L R V +LN +W +P Q+
Sbjct: 135 LDAHDNGISAYD----PKAISAAGLEKRFSDGGFSLGAMVSRLNPNWNDPTPSDPVQAQA 190
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCP 247
E+E F G+EF + +Y +SWLPAR IV + +R ++DP G I+V + P
Sbjct: 191 AEDEKFLVASARMGEEFSRDLDYYTKSWLPAREIVHKAYQKRLEHDPQGRILVFDGQSVP 250
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQWRGLRD 305
WK HL+ LEE+ E I YVLY + +WR+Q V V+ D FESRKPLP WRG RD
Sbjct: 251 WKDHLYTLEEKEGGEAKILYVLYPEKPVPDAKWRIQCVPVTKDSFESRKPLPEAWRGFRD 310
Query: 306 DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
DELS G+ G VFVH +GFIGGN+++ GA MA AL +
Sbjct: 311 DELSGITGVDGGVFVHAAGFIGGNKTFEGAKTMAVKALDI 350
>gi|340059510|emb|CCC53897.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 391
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 161/371 (43%), Positives = 207/371 (55%), Gaps = 44/371 (11%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
S + + L +GTHNGSFHCDE L C M+R T +F A IVR+RD +++D + V+DV
Sbjct: 22 SLADATAAALPVIGTHNGSFHCDEVLSCGMLRCTTQFSTASIVRTRDARIVDGCNIVVDV 81
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGH---GFSTKLSSAGLVYKHFGKELIAKELNVDEGHP- 128
GGVYD DHHQ F++ + T+LSSAGLVYKHFG+E+I + P
Sbjct: 82 GGVYDAEALRLDHHQPSFQDTMTTQKATYKTRLSSAGLVYKHFGREIIQGYVEAALTSPY 141
Query: 129 --------------------DVHRLFLAVYKNFMEAIDAIDNGINQY--------DTDKP 160
++ +F AVY+NF+E ID IDNG+N Y D
Sbjct: 142 RPELLRMGDWDSSRKKLTERELETVFDAVYRNFVEHIDGIDNGVNAYGPSTEALRGMDAD 201
Query: 161 P------RYVNNTNLSSRVGKLNLDWTEP-DQSAERENEAFQQGMDLAGKEFLDTVRFYV 213
P Y T LS R+G + W E + E AF + +D+A EF+ V +YV
Sbjct: 202 PSLLCVRNYAVTTTLSDRIGAIMPWWNEEGNGDLGSETAAFLKAVDVALLEFIAVVHYYV 261
Query: 214 RSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHLFELEEEMKIEPLIKYVLYED 272
+WLPAR +V + PSG I+ L+ RFCPWK HL E+E + + YVL+ D
Sbjct: 262 FAWLPARVLVEDAFLSSDTVHPSGRIVALRERFCPWKDHLLEMEAAHNKKGHVLYVLFAD 321
Query: 273 DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSY 332
G WRVQAV F+SRKPLP WRGLRD ELS+ +GI GCVFVH SGFIGGN++Y
Sbjct: 322 KSG--WRVQAVPKDTSSFDSRKPLP--WRGLRDAELSEASGIDGCVFVHASGFIGGNRTY 377
Query: 333 GGALAMARAAL 343
GAL MA AL
Sbjct: 378 EGALQMAVKAL 388
>gi|406863530|gb|EKD16577.1| MYG1 protein [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
Length = 350
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 209/340 (61%), Gaps = 24/340 (7%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +M+RL + +Q+VR+RDP +L V+DVGG YD + + YD
Sbjct: 16 IGTHNGHFHADEALAVYMLRLLPTYNCSQLVRTRDPALLQTCHTVVDVGGEYDAAANRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F + TKLSSAGLVY HFGK +IA++L V E +V ++ +Y++F+EA
Sbjct: 76 HHQRTFATTFPNR-PTKLSSAGLVYMHFGKAIIAQKLGVSEDVEEVSVIWNKIYESFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLDWTEPDQSAERENEA 193
+DA DNGI+ YD P+ + L R V +LN +W +P S E +A
Sbjct: 135 LDAHDNGISAYD----PKAITAAGLEKRFSDGGFSLGAMVSRLNPNWNDPIPSDPVEAQA 190
Query: 194 -----FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCP 247
F + G+EF + +YV+ WLPAR IV + A+R +YDP G I+V + P
Sbjct: 191 AEDAKFLVASERMGEEFSRDLDYYVKCWLPAREIVHKAYAKRLEYDPQGRILVFDGQSVP 250
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRD 305
WK HL+ LE + + E + YVLY + +WR+Q V V+ D FESRKPLP WRG RD
Sbjct: 251 WKDHLYTLEGQDEGEAKVLYVLYPEKPTSDAKWRIQCVPVTKDSFESRKPLPEAWRGFRD 310
Query: 306 DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+EL K +G+ G VFVH +GFIGGN+S+ GA AMA AL L
Sbjct: 311 EELDKVSGVSGGVFVHAAGFIGGNKSFEGAKAMAVKALDL 350
>gi|154300952|ref|XP_001550890.1| hypothetical protein BC1G_10614 [Botryotinia fuckeliana B05.10]
gi|347831167|emb|CCD46864.1| similar to MYG1 protein [Botryotinia fuckeliana]
Length = 350
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 212/340 (62%), Gaps = 24/340 (7%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +M+RL + +++++R+RDPK+L+ V+DVGG Y+ YD
Sbjct: 16 IGTHNGHFHADEALAVYMLRLLPTYQSSELIRTRDPKLLETCHTVVDVGGEYNDETKRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F + TKLSSAGLVY H+GK +IA+ L V E +V ++ +Y++F+EA
Sbjct: 76 HHQRTFDTTFPNR-PTKLSSAGLVYMHYGKAIIAQHLGVAEDAEEVAVIWRKIYESFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLDWTEPD-----QSAE 188
+DA DNGI+ YD P+ ++ L + V +LN +W +P ++ +
Sbjct: 135 LDAHDNGISVYD----PKAISAAGLEKKFSDGGFSLGAMVSRLNPNWNDPTPSDPVEAQK 190
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCP 247
E+E F G+EF + +Y +SWLPARSIV + A+R YD G I+V + P
Sbjct: 191 AEDEKFLVASTRMGEEFSRDLDYYTKSWLPARSIVQQAYAKRLQYDSKGRILVFDGQSVP 250
Query: 248 WKLHLFELEEEMKIEPLIKYVLY-EDDR-GKQWRVQAVAVSPDRFESRKPLPAQWRGLRD 305
WK HL+ LE++ E + YVLY E R +WR+Q V V+ D F+SRKPLP WRG RD
Sbjct: 251 WKDHLYTLEDQENSENKVLYVLYPESPRPDAKWRIQCVPVTKDSFQSRKPLPEAWRGFRD 310
Query: 306 DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+ELS+ GIPG VFVH +GFIGGN+++ GA MA A+ L
Sbjct: 311 EELSQITGIPGGVFVHAAGFIGGNKTFDGASKMAATAVDL 350
>gi|151944874|gb|EDN63133.1| conserved protein [Saccharomyces cerevisiae YJM789]
gi|190405715|gb|EDV08982.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|256272060|gb|EEU07072.1| YER156C-like protein [Saccharomyces cerevisiae JAY291]
gi|349577818|dbj|GAA22986.1| K7_Yer156cp [Saccharomyces cerevisiae Kyokai no. 7]
Length = 338
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 142/326 (43%), Positives = 212/326 (65%), Gaps = 8/326 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++ TH+GSFH DE+L +M+RL +F +A++VRSR+PK + D ++DVG YD
Sbjct: 15 KQICTHSGSFHADESLAVYMLRLLPEFKDAKLVRSRNPKDWEASDILVDVGAQYDGVK-F 73
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF E F + TKLSSAGL++KH+G+++I LN D+ L+ VYK F+
Sbjct: 74 FDHHQRGFFETFNEKYKTKLSSAGLIFKHYGRDIIKTILNDKVSSSDLDLLYDKVYKQFV 133
Query: 143 EAIDAIDNGINQY----DTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
EA+DA DNGI++Y D++ P + +N ++ + +N +W E D S E + F +
Sbjct: 134 EALDANDNGISKYTIPKDSNLEPNFRDNAISIPGIISGMNPNWNE-DTSDESFDRCFARA 192
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ G F+ V+ Y +SWLPA+++V + I ER D D SG+I+VL +FCPWK HL+ELE
Sbjct: 193 SEFIGGVFVTLVKGYGQSWLPAKALVAQAIDERMDVDKSGKIIVLPQFCPWKEHLYELER 252
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E IE I++VL+ D G WRV V ++ F+ R+ LP RGLRD+ELS ++G+PGC
Sbjct: 253 EKNIEKQIEFVLFTDSSG-AWRVSTVPINSTSFQFRRGLPEPLRGLRDEELSTKSGVPGC 311
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+F+H +GFIGG +S +A+ +L
Sbjct: 312 IFIHAAGFIGGAKSKEAVYELAKMSL 337
>gi|322802932|gb|EFZ23073.1| hypothetical protein SINV_09604 [Solenopsis invicta]
Length = 336
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 207/335 (61%), Gaps = 15/335 (4%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
S+ R+GTH+G+FHCDE L C +++L ++ +A IVRSR+ VLD D V+DVGG YD
Sbjct: 2 SRKSAVRIGTHDGTFHCDEVLACALLKLLPQYKDASIVRSRNQNVLDTCDIVVDVGGEYD 61
Query: 78 PSNDCYDHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV 130
PS YDHH + F+E G+ ++ KLSSAGL+Y HFG E++ L +
Sbjct: 62 PSRHRYDHHMRDFQESVSTVMKRSGYDWTIKLSSAGLIYCHFGHEILRNVLPEVTEDRVI 121
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE 190
+F +Y ++ ID IDNGI YD + P Y T+LS+RV +LN W D + E++
Sbjct: 122 DDIFKKIYDTLIKEIDGIDNGIPMYDAE--PLYRIVTDLSARVSRLNPQWNSQDVNIEKQ 179
Query: 191 NEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKL 250
F++ M L +EFL+ V++ WLPAR +V + R++ DPSGEI+VL + PWK
Sbjct: 180 ---FEKAMALTLEEFLEFVQYAKNVWLPARDVVRHALESRFEVDPSGEIIVLSQAVPWKE 236
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSK 310
HLF+LE+EM + PLIKY ++E D +RVQ + V+ F R LP W GLR D L +
Sbjct: 237 HLFQLEKEMNVSPLIKYAIFESD---TYRVQCMPVALGSFVCRMFLPKAWGGLRTDALVE 293
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GI G VFVH FIGG+++ GA+AMA+ AL++
Sbjct: 294 ACGIEGAVFVHSVLFIGGHKTKDGAIAMAQKALEI 328
>gi|259146085|emb|CAY79345.1| EC1118_1E8_3026p [Saccharomyces cerevisiae EC1118]
Length = 338
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 142/326 (43%), Positives = 212/326 (65%), Gaps = 8/326 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++ TH+GSFH DE+L +M+RL +F +A++VRSR+PK + D ++DVG YD
Sbjct: 15 KQICTHSGSFHADESLAVYMLRLLPEFKDAKLVRSRNPKDWEASDILVDVGAQYD-GVIF 73
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+GF E F + TKLSSAGL++KH+G+++I LN D+ L+ VYK F+
Sbjct: 74 FDHHQRGFFETFNEKYKTKLSSAGLIFKHYGRDIIKTILNDKVSSSDLDLLYDKVYKQFV 133
Query: 143 EAIDAIDNGINQY----DTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
EA+DA DNGI++Y D++ P + +N ++ + +N +W E D S E + F +
Sbjct: 134 EALDANDNGISKYTIPKDSNLEPNFRDNAISIPGIISGMNPNWNE-DTSDESFDRCFARA 192
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ G F+ V+ Y +SWLPA+++V + I ER D D SG+I+VL +FCPWK HL+ELE
Sbjct: 193 SEFIGGVFVTLVKGYGQSWLPAKALVAQAIDERMDVDKSGKIIVLPQFCPWKEHLYELER 252
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
E IE I++VL+ D G WRV V ++ F+ R+ LP RGLRD+ELS ++G+PGC
Sbjct: 253 EKNIEKQIEFVLFTDSSG-AWRVSTVPINSTSFQFRRGLPEPLRGLRDEELSTKSGVPGC 311
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+F+H +GFIGG +S +A+ +L
Sbjct: 312 IFIHAAGFIGGAKSKEAVYELAKMSL 337
>gi|328772774|gb|EGF82812.1| hypothetical protein BATDEDRAFT_9327 [Batrachochytrium
dendrobatidis JAM81]
Length = 315
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 198/307 (64%), Gaps = 12/307 (3%)
Query: 31 SFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGF 90
SFH DE+L +M+RL ++ +A++VR+RD V++ D V+DVGG+YDPS YDHHQ+ F
Sbjct: 9 SFHADESLAVYMLRLLPEYKDAKLVRTRDASVIESADIVVDVGGIYDPSKHRYDHHQREF 68
Query: 91 EEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKNFMEAIDAI 148
+ F +LSSAGLVYKHFG +I + L + DE VH L++ VY + ++ D +
Sbjct: 69 VDTFDSDHKIRLSSAGLVYKHFGHRIIREVLGWHQDEQEDIVHMLYMKVYDDLIQEYDGV 128
Query: 149 DNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDT 208
DNG+++Y ++ P Y +T +S RV LN W +QS + +E F + + L G EF D
Sbjct: 129 DNGVSRYPSNLDPAYKESTTISHRVSALNPWW---NQSVDDMDERFAKAVALTGMEFTDK 185
Query: 209 VRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI----EPL 264
V + +W+PAR +V + + +R PSG IMV ++CPWK +++ LE+E KI +PL
Sbjct: 186 VLYLGNAWIPARKLVQDALNDRKAIHPSGRIMVFDQYCPWKEYVYLLEKENKIPASEQPL 245
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
YVLY D QWR+QAV+ +P FESRK LP WRGLRD ELS GI GCVFVH SG
Sbjct: 246 --YVLYP-DTSSQWRIQAVSCNPSSFESRKALPESWRGLRDQELSTLCGIDGCVFVHASG 302
Query: 325 FIGGNQS 331
FIGGN +
Sbjct: 303 FIGGNMT 309
>gi|313231126|emb|CBY19124.1| unnamed protein product [Oikopleura dioica]
Length = 325
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 142/328 (43%), Positives = 204/328 (62%), Gaps = 9/328 (2%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K++GTHNG FH DE L C ++R +F +A+IVRSR+P L+ D V+DVG V+DP
Sbjct: 2 VKKIGTHNGHFHADEVLACVLLRQLPEFKDAEIVRSRNPADLEPCDIVVDVGAVFDPERQ 61
Query: 82 CYDHHQKGFEEVFGH----GFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
+DHHQ+ F E + TKLSSAGLVY ++GK +I+ + + D+ L+ +
Sbjct: 62 RFDHHQREFTETMKSLKILDYETKLSSAGLVYAYYGKAVISSITGIPKESQDMTVLYEKM 121
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y F+E D IDNG+NQ+D + PRY + +S+RVG+ N W E DQ+ E F++
Sbjct: 122 YAKFVEEYDGIDNGVNQFDGE--PRYHITSTVSARVGRCNPRWNE-DQTPAAERAGFEKA 178
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELE 256
M + E D + YV+SW+PAR+IV E I ER +D G ++ K PWK HLFE+E
Sbjct: 179 MQIVEIELRDAIEGYVKSWMPARAIVKEAIEERKKHDTEGRLLKFPKGGLPWKSHLFEME 238
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+EM+IE I YV+Y+ + + WR+Q V+ S F++RK LPA W G+RD L++ G+
Sbjct: 239 KEMQIEGQILYVVYQ-SKPEDWRIQCVSESEGSFKNRKTLPAAWLGVRDQALNELTGLTD 297
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
C FVH +GFIGG +S + MA AL+
Sbjct: 298 CTFVHANGFIGGAKSEASVMKMAEMALQ 325
>gi|346969967|gb|EGY13419.1| MYG1 protein [Verticillium dahliae VdLs.17]
Length = 337
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 152/336 (45%), Positives = 203/336 (60%), Gaps = 19/336 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL M+R + A ++R+RDPK+L+ V+DVGG YD + YD
Sbjct: 6 IGTHNGHFHADEALAVHMLRQLPAYQGASLIRTRDPKLLETCHTVVDVGGEYDAEKNRYD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F G STKLSSAGLV+ HFG+ +IA+++ E PDV L Y++F+EA
Sbjct: 66 HHQRDFTTTFP-GRSTKLSSAGLVFLHFGRAIIAQKMGTAEDSPDVALLHNKFYESFIEA 124
Query: 145 IDAIDNGINQYD------TDKPPRYVNNT-NLSSRVGKLNLDW-----TEPDQSAERENE 192
+DA DNGI+ YD R+ L + VG+ N +W +P+Q+ E++
Sbjct: 125 LDAHDNGISVYDHLAVLAAGLEKRFSEGGFTLGAMVGRFNGNWNDPVIADPEQAQAAEDK 184
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLH 251
F+ G+EF +++YV +WLPAR++V R +DP G I+V PWK H
Sbjct: 185 RFEAASARIGEEFDRDLQYYVTAWLPARAVVKAAFDARAQHDPEGRILVFDGVSAPWKDH 244
Query: 252 LFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
L+ LEE +P + YVLY + G +WRVQ V V+ D F+SRKPLP WRG RD EL
Sbjct: 245 LYSLEEG---KPTVLYVLYPEKPVPGAKWRVQTVPVAKDSFQSRKPLPEIWRGFRDQELD 301
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GI GCVFVH +GFIGGN ++ GALAMA AL L
Sbjct: 302 GITGIDGCVFVHAAGFIGGNATFEGALAMATKALAL 337
>gi|365760974|gb|EHN02652.1| YER156C-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
gi|401841405|gb|EJT43799.1| YER156C-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 338
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 140/327 (42%), Positives = 211/327 (64%), Gaps = 8/327 (2%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K++ TH+GSFH DE+L +M+RL +F +A++VRSR+PK + D ++DVG YD
Sbjct: 14 VKQICTHSGSFHADESLAVYMLRLLPEFKDAKLVRSRNPKDWEASDILVDVGAQYDGVK- 72
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+GF E F + TKLSSAGL++KH+G+++I LN D+ L+ VYK F
Sbjct: 73 FFDHHQRGFFETFNENYKTKLSSAGLIFKHYGRDIIKTILNSSTSSADLDLLYNKVYKQF 132
Query: 142 MEAIDAIDNGINQYD----TDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+EA+DA DNGI++Y +D P + +N ++ + +N +W E D S E ++ F +
Sbjct: 133 VEALDANDNGISKYTIPRGSDLEPNFRDNAISIPGIISGMNPNWNE-DTSDESFDKCFAR 191
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
G+ F+ V+ Y SWLPA+++V + I ER D D SG+I+VL +FCPWK HL+ELE
Sbjct: 192 ASAFIGEVFVTLVKGYGESWLPAKTLVAQAIDERLDVDKSGKIIVLPQFCPWKEHLYELE 251
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
E I+ I++VL+ D WRV V ++ F+ R+ LP RGLRD+ELS ++GIPG
Sbjct: 252 REKNIQKQIEFVLFTDS-SNAWRVSTVPINSTSFQFRRGLPEPLRGLRDEELSTKSGIPG 310
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAAL 343
C+F+H +GFIGG ++ +A+ +L
Sbjct: 311 CIFIHAAGFIGGAKTKEAVYQLAKMSL 337
>gi|219111881|ref|XP_002177692.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410577|gb|EEC50506.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 351
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 146/333 (43%), Positives = 210/333 (63%), Gaps = 21/333 (6%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G+F DEA+G FM+R K+ N+++VRSRD +VLD LD V+DVGG+YD + YD
Sbjct: 18 IGTHSGTFQADEAMGVFMLRQVSKYRNSKVVRSRDLEVLDKLDIVIDVGGIYDHTKLRYD 77
Query: 85 HHQKGFEEVFGHGFS------TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL--- 135
HHQ+ ++E F G TKLS++GLVY+H+GK+++ +PD+ +L
Sbjct: 78 HHQRNYDERFDAGKEGTAARCTKLSASGLVYRHYGKQVLKAY------YPDLSDDYLQLA 131
Query: 136 --AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAEREN-- 191
+Y + +EA+DAID G+ Y + T L+SRVG+LN W E D+S N
Sbjct: 132 YDKLYNSLLEALDAIDTGVEMAPDGTELVYKDTTGLASRVGRLNPRWNEVDESGNTPNHD 191
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKL 250
E F++ +D+ G++FL + V S +PAR V + +R++ DPSGEI+ L+ PW+
Sbjct: 192 ERFEKAVDICGQDFLSVMTVIVESDIPARDFVERALLKRHETDPSGEILCLESGGLPWRN 251
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSK 310
HL+ELE+ +EP++K+VLY D G WRVQAV V FE+R LPA+WRG+RD +L
Sbjct: 252 HLYELEKIHAVEPIVKFVLYTDVSG-MWRVQAVTVQGKAFENRLSLPAEWRGVRDQDLES 310
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
I G FVH +GFIGG ++Y G L MA+ AL
Sbjct: 311 VTKISGSRFVHAAGFIGGAETYEGVLKMAQVAL 343
>gi|154332888|ref|XP_001562706.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059709|emb|CAM41831.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 156/375 (41%), Positives = 213/375 (56%), Gaps = 41/375 (10%)
Query: 6 VSSSPAYSTSSPSQTPLKR--VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL 63
+S P +T S + + + THNGSFHCDEA+ C ++R ++ A I+R+RDPK +
Sbjct: 24 LSDRPTTTTESCGSSAAAKPVICTHNGSFHCDEAMACGLLRHVPEYREAVILRTRDPKEI 83
Query: 64 DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFG---HGFSTKLSSAGLVYKHFGKELIAK- 119
D D V+DVG VYD + YDHHQ F + T+LSSAGLVY+HFG+++I +
Sbjct: 84 DACDIVVDVGAVYDADRNRYDHHQASFHGTMTTPRKMYKTRLSSAGLVYRHFGRQIIRQY 143
Query: 120 ----------------------ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDT 157
E ++ L A+Y NF+E +D IDNG+ +
Sbjct: 144 VEAALQPLSSVRETVLSMARWSESRTSLSDAELDALEDALYANFVEEVDGIDNGVECWGL 203
Query: 158 DKP------PRYVNNTNLSSRVGKLNLDWTEPDQ-SAERENEAFQQGMDLAGKEFLDTVR 210
P P Y +TNLS R+GKL W EP+ + EN F +++A EF + +
Sbjct: 204 ADPAVGTLVPNYKQDTNLSRRIGKLQAFWNEPENGNVVAENANFAVAVEMAVTEFFEALT 263
Query: 211 FYVRSWLPARSIVVECIAERYDYDPSGEIMVLK--RFCPWKLHLFELEEEMKIEPLIKYV 268
+ SWLPAR+IV R+++D SG+IMV K + CPWK HL ELE E + YV
Sbjct: 264 YLAFSWLPARTIVEAAFQRRHEFDESGKIMVFKDTKMCPWKDHLLELEAETHCVGAVLYV 323
Query: 269 LYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGG 328
++ D GK WRVQAV + FE+RKPLP +RGLRDDELS GI G VFVH+SGFIGG
Sbjct: 324 VFSD--GKGWRVQAVPKTSTSFENRKPLP--YRGLRDDELSAACGIEGGVFVHVSGFIGG 379
Query: 329 NQSYGGALAMARAAL 343
++ GA+A+A+ AL
Sbjct: 380 MKTLDGAMALAKQAL 394
>gi|126134369|ref|XP_001383709.1| hypothetical protein PICST_43180 [Scheffersomyces stipitis CBS
6054]
gi|126095858|gb|ABN65680.1| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 339
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 151/338 (44%), Positives = 216/338 (63%), Gaps = 12/338 (3%)
Query: 15 SSPSQTPLK----RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL 70
S+P + L R+ TH+GSFH DE+L +M+RL KF +A+++RSR+PK ++ D V+
Sbjct: 2 SAPKKIKLDNKMFRICTHSGSFHADESLAVYMLRLLPKFADAELIRSRNPKDWEESDIVV 61
Query: 71 DVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK--ELNVDEGHP 128
DV G YD +DHHQ+ F F ++TKLSSAGLVYKHFGKE+I + E ++
Sbjct: 62 DVSGKYDGVK-FFDHHQREFNSTFSEKYNTKLSSAGLVYKHFGKEIIQQVLEYTPEKDSK 120
Query: 129 DVHRLFLAVYKNFMEAIDAIDNGINQYDTD--KPPRYVN-NTNLSSRVGKLNLDWTEPDQ 185
++ L+ VYK F+E++DA DNGIN Y T+ ++ + N L S V KLN W E
Sbjct: 121 NIDLLYEKVYKEFIESLDANDNGINNYSTEIEATKKFSDKNITLPSLVSKLNPKWNESCT 180
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF 245
A+ + + F + +L G F+ + Y + WLPA+SIV E R+D D SGEI+VL +F
Sbjct: 181 DADFDRQ-FLKSSELMGMAFISVLEGYGKGWLPAKSIVEESFDARFDVDKSGEILVLNQF 239
Query: 246 CPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRD 305
CPWK HL+ +E+E E IK+VL++D K WRV V+V+ FE R LP + RGLRD
Sbjct: 240 CPWKEHLYSIEKENNAEGQIKFVLFQDSSDK-WRVSTVSVTSSSFEFRLGLPEELRGLRD 298
Query: 306 DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
DELS+++G+ GC+F+H +GFIGG Q+ L +A+ +L
Sbjct: 299 DELSQKSGVDGCIFIHAAGFIGGAQTKEAVLQLAKLSL 336
>gi|365985339|ref|XP_003669502.1| hypothetical protein NDAI_0C06000 [Naumovozyma dairenensis CBS 421]
gi|343768270|emb|CCD24259.1| hypothetical protein NDAI_0C06000 [Naumovozyma dairenensis CBS 421]
Length = 331
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 215/324 (66%), Gaps = 8/324 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++ TH+GSFH D+AL +++RL ++ +A+++RSR+P+ D D V+DVG YD S
Sbjct: 13 KQICTHSGSFHADDALAVYLLRLLPEYKDAKVIRSRNPEDWDASDIVVDVGAKYD-SVKF 71
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKN 140
+DHHQ+GF+E F + TKLSSAGLV+KHFGK +I + N+ +V L++ VY+
Sbjct: 72 FDHHQRGFDETFSDKYKTKLSSAGLVFKHFGKRIIELLVTKNITLTPENVDLLYIRVYER 131
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+EA+DA DNGI+QYDTD PR+ N + + V LN +W + D S +E F + +
Sbjct: 132 FVEALDADDNGISQYDTDLEPRFKNKSITIPGIVAGLNPNWND-DCSPAIFDENFLKASE 190
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
G F++ V+ Y SWLPA+S+V E IA+ D I+VL +FCPWK HL+++E+E+
Sbjct: 191 FVGSTFVNLVKGYGESWLPAKSLVKEAIAKNETNDK--RIIVLDQFCPWKEHLYDIEKEL 248
Query: 260 KIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
+E I++V+++D WRV V VS F+ RK LP RGLRD+ELS+++G+P C+F
Sbjct: 249 NLENKIQFVIFKDS-SNSWRVSTVPVSSTSFQFRKGLPEHLRGLRDEELSEKSGVPDCIF 307
Query: 320 VHMSGFIGGNQSYGGALAMARAAL 343
+H +GFIGG ++ AL +A+ +L
Sbjct: 308 IHSAGFIGGAKTKESALKLAKMSL 331
>gi|255719830|ref|XP_002556195.1| KLTH0H07260p [Lachancea thermotolerans]
gi|238942161|emb|CAR30333.1| KLTH0H07260p [Lachancea thermotolerans CBS 6340]
Length = 334
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 143/325 (44%), Positives = 209/325 (64%), Gaps = 7/325 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++ TH+GSFH DEAL +M+R+ +F +A++VRSRDP ++ D V+DV G YD
Sbjct: 13 KQICTHSGSFHADEALAVYMLRVLPEFKDAKVVRSRDPAKWEESDIVVDVSGKYDVEKRL 72
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+ F E F + TKLSSAGLVYKHFG+++I K L DV L+ VYK+F+
Sbjct: 73 FDHHQREFFETFSEKYKTKLSSAGLVYKHFGQDII-KTLQPQLDESDVEFLYEKVYKDFV 131
Query: 143 EAIDAIDNGINQYDTD----KPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
E++DA DNGI+ +D + KP N + + +N +W D S ++ F +
Sbjct: 132 ESLDANDNGISNFDAEDLGVKPKFSDKNITIPGIISGMNPNWN-GDCSDASFDKCFFKAS 190
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
D G+ F++ V+ Y SW+PA+++V + + R DPSG+I+VLK+FCPWK HL+ +E+E
Sbjct: 191 DFIGEVFVNLVKGYGESWMPAKALVRDAVQNRLQEDPSGKIIVLKQFCPWKEHLYNIEKE 250
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+ I I +VL+ED K WRV V VS F+ R+ LP RGLRD+ELS+++G+PGC+
Sbjct: 251 LGIVGEILFVLFEDSSSK-WRVSTVPVSATSFKFREGLPEPLRGLRDNELSEKSGLPGCI 309
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
F+H +GFIGG S L +A+ +L
Sbjct: 310 FIHAAGFIGGADSKDTVLKLAQMSL 334
>gi|390349106|ref|XP_798510.3| PREDICTED: UPF0160 protein MYG1, mitochondrial-like
[Strongylocentrotus purpuratus]
Length = 341
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 149/336 (44%), Positives = 204/336 (60%), Gaps = 53/336 (15%)
Query: 16 SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
S S+ R+GTHNG+FHCDE L C+M++ ++ +A+IVR+RDP VL+ D V+DVGGV
Sbjct: 48 SGSKRTCVRIGTHNGTFHCDETLACYMLQRLPQYKDAEIVRTRDPAVLETCDIVVDVGGV 107
Query: 76 YDPSNDCYDHHQKGFEEVFGH-----GFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV 130
+DP YDHHQ+ F++ ++ KLSSAGLVY HFGKE+I + L++ PDV
Sbjct: 108 FDPKRHRYDHHQRTFKDTMNSLSAEMPWTIKLSSAGLVYFHFGKEVIWRTLDLSPDDPDV 167
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE 190
++ VY NFM+ +DAIDNG++Q TD+ PRYV +TNLS+RVG LN W + Q
Sbjct: 168 TSVYNKVYDNFMQEVDAIDNGVDQ--TDEKPRYVVSTNLSARVGHLNPKWNDEVQD---- 221
Query: 191 NEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWK 249
Y+ D GEI+ + CPWK
Sbjct: 222 ----------------------------------------YEVDACGEIITFPQGGCPWK 241
Query: 250 LHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
HLFELE+ + +E IKYVLY D+ G WR+Q V ++ F++R LP +WRG+RD+ LS
Sbjct: 242 EHLFELEKILDLELPIKYVLYPDNSGA-WRIQCVPINRHSFDNRLSLPEKWRGVRDEALS 300
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+ +GI GC+FVH SGFIGGN+S GAL MAR +L++
Sbjct: 301 ELSGIYGCIFVHASGFIGGNKSKEGALQMARKSLQM 336
>gi|66514740|ref|XP_623698.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Apis
mellifera]
Length = 329
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 146/328 (44%), Positives = 201/328 (61%), Gaps = 16/328 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G FHCDEAL CFM++ ++ +A IVRSRD +L+ D V+DVG Y+P Y
Sbjct: 6 KIGTHDGCFHCDEALACFMLKTLPRYKDAIIVRSRDMSILNTCDIVVDVGEEYNPCKHRY 65
Query: 84 DHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DHH + F E GH + TKLSSAGL+Y HFG E+I KEL D+ +F
Sbjct: 66 DHHMRDFNESVSTIIKKPGHDWKTKLSSAGLIYCHFGHEII-KELVPQASDADIEIIFKH 124
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y F++ ID+IDNGI+ + +Y T+LSSRV LN W D + N F +
Sbjct: 125 IYNTFIQEIDSIDNGISICSENT--KYQIVTDLSSRVKFLNPPWNSKDLNP---NTQFLK 179
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
M+L G++F+ + + WLPARSIV E +A+R++ DPSGEI+ L + PW HLF +E
Sbjct: 180 AMELTGQDFVQHINYAANVWLPARSIVEEAVAKRFEVDPSGEIIELSQCVPWFQHLFAIE 239
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+E I+PL+KYV+++DD +R++ V V PD F R LP W GL+++ L GI G
Sbjct: 240 KEQNIKPLLKYVIFKDD---TYRIRCVPVKPDSFICRLFLPESWEGLKNEALVNVCGIEG 296
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
FVH FIGGN+ G L MAR AL+
Sbjct: 297 ATFVHSGRFIGGNRMREGILMMARKALE 324
>gi|440468230|gb|ELQ37402.1| hypothetical protein OOU_Y34scaffold00597g28 [Magnaporthe oryzae
Y34]
gi|440486275|gb|ELQ66157.1| hypothetical protein OOW_P131scaffold00421g9 [Magnaporthe oryzae
P131]
Length = 366
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 158/356 (44%), Positives = 213/356 (59%), Gaps = 34/356 (9%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVY 76
S+ P+ +GTHNG FH DEAL +M+R + A +VR+RDP L V+DVGG Y
Sbjct: 16 SEGPVT-IGTHNGHFHADEALAVYMLRQHIPTYAGASLVRTRDPAALARCHTVVDVGGEY 74
Query: 77 DPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD------V 130
D S +DHHQ+ F F G TKLSSAGLVY HFGK++IA+ L PD V
Sbjct: 75 DASTLRFDHHQRTFSTTFP-GRQTKLSSAGLVYMHFGKQIIARRL----ADPDCDNEATV 129
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLD 179
L+ +Y+NF+EA+DA DNGI+ YD N L R VG+LN +
Sbjct: 130 DMLYAKLYENFVEALDAHDNGISAYDPAVLATASNGQPLQKRFSDGGFTLGAVVGRLNPN 189
Query: 180 WTEP-----DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYD 234
W +P ++ E++ F+ G+EF + +Y RSWLPARS+V + + R ++D
Sbjct: 190 WNDPIPEDPAEAQAAEDQRFEAASARIGEEFDRDLDYYTRSWLPARSVVADAFSHRAEHD 249
Query: 235 PSGE--IMVLK-RFCPWKLHLFELEEEMKIEPLIKYVLYEDDRG--KQWRVQAVAVSPDR 289
P+G+ I+V K + PWK HL+ LE+E + YVLY + +WR+Q V VS D
Sbjct: 250 PAGKGRILVFKGQSVPWKDHLYTLEQENPSAGEVVYVLYPEKPTPDAKWRIQCVPVSKDS 309
Query: 290 FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
F+SR PLP WRG RD+EL K GIPGCVFVH +GFIGGN+++ GA+AMA+ AL+L
Sbjct: 310 FQSRLPLPEAWRGFRDEELDKITGIPGCVFVHAAGFIGGNKTFEGAMAMAQKALEL 365
>gi|389635673|ref|XP_003715489.1| MYG1 protein [Magnaporthe oryzae 70-15]
gi|351647822|gb|EHA55682.1| MYG1 protein [Magnaporthe oryzae 70-15]
Length = 377
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 158/356 (44%), Positives = 213/356 (59%), Gaps = 34/356 (9%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVY 76
S+ P+ +GTHNG FH DEAL +M+R + A +VR+RDP L V+DVGG Y
Sbjct: 27 SEGPVT-IGTHNGHFHADEALAVYMLRQHIPTYAGASLVRTRDPAALARCHTVVDVGGEY 85
Query: 77 DPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD------V 130
D S +DHHQ+ F F G TKLSSAGLVY HFGK++IA+ L PD V
Sbjct: 86 DASTLRFDHHQRTFSTTFP-GRQTKLSSAGLVYMHFGKQIIARRL----ADPDCDNEATV 140
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLD 179
L+ +Y+NF+EA+DA DNGI+ YD N L R VG+LN +
Sbjct: 141 DMLYAKLYENFVEALDAHDNGISAYDPAVLATASNGQPLQKRFSDGGFTLGAVVGRLNPN 200
Query: 180 WTEP-----DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYD 234
W +P ++ E++ F+ G+EF + +Y RSWLPARS+V + + R ++D
Sbjct: 201 WNDPIPEDPAEAQAAEDQRFEAASARIGEEFDRDLDYYTRSWLPARSVVADAFSHRAEHD 260
Query: 235 PSGE--IMVLK-RFCPWKLHLFELEEEMKIEPLIKYVLYEDDRG--KQWRVQAVAVSPDR 289
P+G+ I+V K + PWK HL+ LE+E + YVLY + +WR+Q V VS D
Sbjct: 261 PAGKGRILVFKGQSVPWKDHLYTLEQENPSAGEVVYVLYPEKPTPDAKWRIQCVPVSKDS 320
Query: 290 FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
F+SR PLP WRG RD+EL K GIPGCVFVH +GFIGGN+++ GA+AMA+ AL+L
Sbjct: 321 FQSRLPLPEAWRGFRDEELDKITGIPGCVFVHAAGFIGGNKTFEGAMAMAQKALEL 376
>gi|366997973|ref|XP_003683723.1| hypothetical protein TPHA_0A02070 [Tetrapisispora phaffii CBS 4417]
gi|357522018|emb|CCE61289.1| hypothetical protein TPHA_0A02070 [Tetrapisispora phaffii CBS 4417]
Length = 338
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 213/329 (64%), Gaps = 13/329 (3%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K++ TH+ SFH DEAL +M+RL ++ +A + RSRDP + D V+DV G YD
Sbjct: 14 VKKICTHSNSFHADEALAVYMLRLLPEYRDASVTRSRDPADWEASDIVVDVSGKYDGVK- 72
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL----NVDEGHPDVHRLFLAV 137
+DHHQ+ F E F + TKLSSAGLV+KHFG+++I+ L N+ E D+ L+ V
Sbjct: 73 FFDHHQREFSETFNESYKTKLSSAGLVFKHFGRDIISSVLTGNVNIKENELDI--LYDKV 130
Query: 138 YKNFMEAIDAIDNGINQYDTDK---PPRYVNN-TNLSSRVGKLNLDWTEPDQSAERENEA 193
YKNF+EA+DA DNGIN +D D+ ++++ L + +N DW + D SA + +E
Sbjct: 131 YKNFIEALDANDNGINNFDVDELKVKEKFIDKGITLPGVISNMNPDWND-DCSAAKFDEM 189
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
F G F+ V+ Y SWLPA+++V + +++R+ DPSG+I++ ++FCPWK HL+
Sbjct: 190 FFVASKFIGDIFVRLVKRYGESWLPAKALVADAVSKRFQIDPSGKIILFEQFCPWKEHLY 249
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
+E+E+ IE I++VL++D G WRV V VS F+ R+ LP RGLRD+ELS+++G
Sbjct: 250 AVEKELNIENKIEFVLFKDS-GNTWRVSTVPVSSTSFKFRRGLPEPLRGLRDEELSEKSG 308
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAA 342
+P CVF+H +GFIGG +S L +A+ +
Sbjct: 309 VPDCVFIHAAGFIGGAKSKDSVLKLAKMS 337
>gi|367030379|ref|XP_003664473.1| hypothetical protein MYCTH_2307334 [Myceliophthora thermophila ATCC
42464]
gi|347011743|gb|AEO59228.1| hypothetical protein MYCTH_2307334 [Myceliophthora thermophila ATCC
42464]
Length = 353
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 209/337 (62%), Gaps = 17/337 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+GTHNG FH DEAL +M+R + A++VR+RDPK+LD+ V+DVGG YD + + +
Sbjct: 18 IGTHNGHFHADEALAVYMLRTHVPTYSGAKLVRTRDPKLLDECHTVVDVGGEYDAARNRF 77
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F F G TKLSSAGLVY HFG+E+IA+ L E V ++ +Y++F+E
Sbjct: 78 DHHQRSFGTSFP-GRQTKLSSAGLVYMHFGREVIARRLGQAEDSEQVDLVWRKIYESFIE 136
Query: 144 AIDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDWTEP---DQSA--EREN 191
A+DA DNGI+ YD ++ + L + V +LN +W +P D +A E E+
Sbjct: 137 ALDAHDNGISVYDPAALAAAGLQKKFSDGGFTLGAMVSRLNPNWNDPVPEDPAAAQEAED 196
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKL 250
+ F+ G+EF + ++ ++WLPAR +V E A R ++D G IMVLKR PWK
Sbjct: 197 KRFELASQRIGEEFDRDLDYFTKAWLPAREVVAEAFAARQEHDSQGRIMVLKRQSAPWKD 256
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
HL+ LEE + YVLY + +WR+Q V V+ D FESRKPLP WRG RD+EL
Sbjct: 257 HLYSLEEGQPEGGKVLYVLYPEKPTPDSKWRIQCVPVAKDSFESRKPLPEAWRGFRDEEL 316
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
G+ G +FVH SGFIGGN++Y G LAMA+ AL L
Sbjct: 317 DGICGVSGSIFVHASGFIGGNKTYEGVLAMAKKALDL 353
>gi|194903505|ref|XP_001980881.1| GG14318 [Drosophila erecta]
gi|190652584|gb|EDV49839.1| GG14318 [Drosophila erecta]
Length = 358
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 150/343 (43%), Positives = 219/343 (63%), Gaps = 20/343 (5%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD-LDAVLDVG 73
+ P ++PL +GTH+G+FHCDE + CFM++ D++ +A+I RSRD K L + D ++DVG
Sbjct: 18 TPPKRSPL-WIGTHSGTFHCDEVVACFMLKQLDEYKSAEIFRSRDNKALQEKCDIIVDVG 76
Query: 74 GVYDPSNDCYDHHQKGFEEVFG-------HGFST-KLSSAGLVYKHFGKELIAKELNVDE 125
GVYD + YDHHQ+ F+E F F+ +LSSAGLVY H+G+ +I L ++
Sbjct: 77 GVYDHAKKLYDHHQQTFKETFSSIRPEVSEDFNVIRLSSAGLVYSHYGERVIQSILQREK 136
Query: 126 G---HPDVHRL-FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
G P+ +L FL +Y+NF+ +DAIDNG++ ++ +P Y +++LSSR+GKLN W
Sbjct: 137 GIKLSPENLQLAFLQIYRNFICELDAIDNGVSMFEGAEPI-YKISSHLSSRIGKLNPSWQ 195
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
E E + F+ MD AG+EF+D V SW+ AR V E + P+GEI+
Sbjct: 196 ETGVDIE---DRFRLAMDTAGREFVDNVLEVCCSWMAARDHVREALNNAKSVYPTGEILE 252
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
LK FCPWK HL +LE+E K+E + K V++ D G WRV AV V+P F+ RK LP W
Sbjct: 253 LKNFCPWKSHLADLEKEYKVEGVPKLVVFND--GSSWRVAAVPVAPSSFKCRKFLPMPWW 310
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GLRDDELS +AGI +F+H +GFIGG ++ A+ +A+ +++
Sbjct: 311 GLRDDELSHKAGINDLIFIHHTGFIGGAKTKAAAMLLAKKSIE 353
>gi|281202305|gb|EFA76510.1| metal-dependent protein hydrolase domain-containing protein
[Polysphondylium pallidum PN500]
Length = 315
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 202/321 (62%), Gaps = 13/321 (4%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+G+FH DEAL C++++ T+KF +A+I+RSRD V++ D +DVG YD S +D
Sbjct: 6 ICTHSGTFHADEALACYLLKQTNKFKDAKIIRSRDTAVINAADVAVDVGATYDFSKLRFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY-KNFME 143
HHQ GF E F KLSSAGL+YKH+GKE+I L V++ D+ +Y K+F+E
Sbjct: 66 HHQAGFTETFDDHHEIKLSSAGLIYKHYGKEIIKNRLKVNDEITDI------IYKKDFIE 119
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
+D +DNGI +Y + +Y NT +S RV +N W E DQS + E F++ M L G+
Sbjct: 120 ELDGVDNGIERYPAEIKAKYKMNTTISQRVASMNPYWNE-DQSEQVLYERFEKAMALMGE 178
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
F D V +Y +SW+PAR+IV E + R + SG+++VL++FCPWK HL+ LE+ I
Sbjct: 179 TFNDKVDYYGKSWMPARAIVEEALRTRKEVHSSGQLIVLRQFCPWKEHLYHLEKLNNINK 238
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
I + L+ED+ G WR+QAV + F RK LP WRG RD ELS GI G + +
Sbjct: 239 SILFCLFEDNLG-SWRIQAVNLDNSSFTLRKALPEAWRGKRDQELSDIIGIDG----YAN 293
Query: 324 GFIGGNQSYGGALAMARAALK 344
GFIGG++ GA+ MA AL+
Sbjct: 294 GFIGGHKHKDGAMHMAIKALE 314
>gi|401416543|ref|XP_003872766.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488991|emb|CBZ24240.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 398
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 154/372 (41%), Positives = 212/372 (56%), Gaps = 41/372 (11%)
Query: 7 SSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDL 66
+++ S +S + P+ + THNGSFHCDEA+ C ++R + A I+R+RDPK ++
Sbjct: 29 AATAGSSGNSAAAKPV--ICTHNGSFHCDEAMACGLLRHVLAYREAVILRTRDPKQIEAC 86
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVFG---HGFSTKLSSAGLVYKHFGKELIAKELNV 123
D V+DVG +YD + YDHHQ F + T+LSSAGLVYKHFG+++I + +
Sbjct: 87 DIVVDVGAIYDAGTNRYDHHQASFHGTMTTPKKAYKTRLSSAGLVYKHFGRQIIRQYVEA 146
Query: 124 ---------------------DEGHPDVHRLFL--AVYKNFMEAIDAIDNGINQYDTDKP 160
G D L A+Y NF+E +D IDNG+ + P
Sbjct: 147 VLQPSSPARAAVLSMTSWSESRTGLSDTELDVLEDALYANFVEQVDGIDNGVECWGLADP 206
Query: 161 ------PRYVNNTNLSSRVGKLNLDWTEPDQS-AERENEAFQQGMDLAGKEFLDTVRFYV 213
P Y +TNLS R+G+L W EP+ EN F +++A EF + + +
Sbjct: 207 AVGTLVPNYKQSTNLSQRIGQLQAYWNEPENGDVVAENAKFAVAVEMAVTEFFEALTYLA 266
Query: 214 RSWLPARSIVVECIAERYDYDPSGEIMVLK--RFCPWKLHLFELEEEMKIEPLIKYVLYE 271
SWLPARSIV R+++D SG+IMV K + CPWK HL ELE E + YV++
Sbjct: 267 FSWLPARSIVEAAFQRRHEFDESGKIMVFKDVKVCPWKDHLLELEAESNCLGAVLYVVFS 326
Query: 272 DDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQS 331
D GK WRVQAV + FE+RKPLP +RGLRDD LS GI G VFVH+SGFIGG ++
Sbjct: 327 D--GKGWRVQAVPKTSTSFENRKPLP--YRGLRDDTLSTACGIEGGVFVHVSGFIGGMKT 382
Query: 332 YGGALAMARAAL 343
+ GA+A+A+ AL
Sbjct: 383 FDGAMALAKQAL 394
>gi|366987115|ref|XP_003673324.1| hypothetical protein NCAS_0A03780 [Naumovozyma castellii CBS 4309]
gi|342299187|emb|CCC66936.1| hypothetical protein NCAS_0A03780 [Naumovozyma castellii CBS 4309]
Length = 339
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 144/334 (43%), Positives = 220/334 (65%), Gaps = 9/334 (2%)
Query: 16 SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
+P +K++ TH+GSFH DE+L +M+RL ++ +A++VRSR+P ++ D V+DVG
Sbjct: 9 NPLTNMVKQICTHSGSFHADESLAVYMLRLLPEYKDAKVVRSREPAQWEESDIVVDVGAK 68
Query: 76 YDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL-NVDEGHPDVHRLF 134
YD +DHH + F E F TKLSSAGLV+KHFG+++I L D +++ L+
Sbjct: 69 YDGVK-FFDHHHREFMETFSDEHKTKLSSAGLVFKHFGRDIIRNVLAKSDLTEENLNILY 127
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKP----PRYVNNT-NLSSRVGKLNLDWTEPDQSAER 189
VY++F+EA+DA DNGIN+YD K P++ + ++ + +N +W + D SAE
Sbjct: 128 KRVYEHFIEALDANDNGINKYDGLKELHVEPKFNDKAISIPGIISGMNPNWND-DCSAEA 186
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK 249
++ F + D G F++ V+ Y SWLPA+SIV+E + +R + D SG+I++L +FCPWK
Sbjct: 187 FDKHFFKASDFIGATFVNLVKGYGESWLPAKSIVLEAVKDRLNADASGKIVILPQFCPWK 246
Query: 250 LHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
HL+E+E+E+ IE I++VL+ED GK WRV V VS F+ RK LP RGLRDD LS
Sbjct: 247 EHLYEIEKELGIENQIEFVLFEDS-GKSWRVSTVPVSSTSFQFRKGLPEVLRGLRDDVLS 305
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+++G+P C+F+H +GFIGG ++ A +A+ +L
Sbjct: 306 EKSGVPDCIFIHAAGFIGGAKTKESAYRLAKMSL 339
>gi|46136947|ref|XP_390165.1| hypothetical protein FG09989.1 [Gibberella zeae PH-1]
Length = 349
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 155/358 (43%), Positives = 208/358 (58%), Gaps = 29/358 (8%)
Query: 6 VSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD 65
++ S A + PL +GTH+G FH DEAL M+R + +A +VR+RDP VL
Sbjct: 1 MAESAAKRVKTSGNGPL--IGTHSGHFHADEALAVHMLRRLPTYRDADLVRTRDPAVLAT 58
Query: 66 LDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDE 125
V+DVGG YD +DHHQ+GF F G TKLSSAGLV+ HFG+ ++A+ L + E
Sbjct: 59 CHTVVDVGGEYDAEKRRFDHHQRGFNTTF-PGRPTKLSSAGLVFLHFGRAIVAERLGLSE 117
Query: 126 GHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VG 174
PDV ++ +Y+NF+EA+DA DNGI+ YD P + + R VG
Sbjct: 118 DSPDVDLIYKKLYENFVEALDAHDNGISVYD----PAAIAAAGIEKRFSEGAFGLGAVVG 173
Query: 175 KLNLDWTEPDQSAERENEA-----FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAE 229
+LN W +P S E +A F + G+EF + Y SWLPAR+IV E +
Sbjct: 174 RLNPKWNDPTPSDPAEAQAAEDAKFNEASSRIGQEFDRDLDNYAASWLPARTIVQEAFNK 233
Query: 230 RYDYDPSGEIMVLK-RFCPWKLHLFELEEEMKIEPLIKYVLYED--DRGKQWRVQAVAVS 286
R YD G I++L+ + PWK HL+ LE+ P + YVLY + + G +WR+Q V S
Sbjct: 234 RKQYDEQGRILILEGQSVPWKDHLYTLEDGT---PSVIYVLYAEKPEPGAKWRIQCVPES 290
Query: 287 PDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
D F SRKPLP WRG RD EL +GIPGCVFVH +GFIGGN+++ GA MA AL+
Sbjct: 291 KDSFTSRKPLPEAWRGFRDAELDGISGIPGCVFVHAAGFIGGNKTFEGAKEMATKALE 348
>gi|444316974|ref|XP_004179144.1| hypothetical protein TBLA_0B08090 [Tetrapisispora blattae CBS 6284]
gi|387512184|emb|CCH59625.1| hypothetical protein TBLA_0B08090 [Tetrapisispora blattae CBS 6284]
Length = 341
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 146/334 (43%), Positives = 209/334 (62%), Gaps = 8/334 (2%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGG 74
S+ +Q +K + H+GSFH DE+L +M+RL ++ +A ++RSR + + D V+DVG
Sbjct: 11 SNITQKMVKTICVHSGSFHADESLAVYMLRLLPEYKDANVIRSRKAEDWEKSDIVVDVGA 70
Query: 75 VYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN-VDEGHPDVHRL 133
YD +DHHQ+GFEE F + TKLS AGL+YKHFGKE+I + D+ L
Sbjct: 71 TYDGIKH-FDHHQRGFEETFNANYKTKLSGAGLIYKHFGKEIIKNAVKPTVVKDNDLEIL 129
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTD----KPPRYVNNTNLSSRVGKLNLDWTEPDQSAER 189
+ VY NF+EAIDA DNGI D + KP + +L VG +N W E D + +
Sbjct: 130 YDKVYSNFIEAIDANDNGIKILDYEALNVKPKFRDSAVSLPGIVGGMNPSWNE-DCTPAK 188
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK 249
+E F + + G F++ V+ Y SWLPA+S+V +++R D D +GEI+VL++FCPWK
Sbjct: 189 FDENFFKASEFIGTIFVNLVKGYSNSWLPAKSLVRAAVSKRMDVDSTGEIIVLEQFCPWK 248
Query: 250 LHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
HL+E+E+E E IK+VL++D WRV V VS F R+ + + RGLRD+ELS
Sbjct: 249 EHLYEVEKEFDCEGKIKFVLFKDS-SNTWRVSTVPVSSGSFAFRQGILEKLRGLRDEELS 307
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
KE+G+P CVFVH +GFIGG Q+ GA +A+ +L
Sbjct: 308 KESGVPDCVFVHAAGFIGGTQTQEGAYKLAKMSL 341
>gi|403216734|emb|CCK71230.1| hypothetical protein KNAG_0G01720 [Kazachstania naganishii CBS
8797]
Length = 336
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 216/331 (65%), Gaps = 8/331 (2%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
+T +K++ THNGSFH DE+L +M+RL +F +A++VRSR P ++ D V+DV YD
Sbjct: 9 ETMVKQICTHNGSFHADESLAVYMLRLLPEFKDAKVVRSRTPSDWEESDIVVDVSAQYDG 68
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN-VDEGHPDVHRLFLAV 137
+DHHQ+ F E F + TKLSSAGLVYKH+G+++I L+ D+ L+ V
Sbjct: 69 VK-YFDHHQREFTETFSDKYKTKLSSAGLVYKHYGRDIIKCILDGAVTSDADLEVLYQRV 127
Query: 138 YKNFMEAIDAIDNGINQYDTDK---PPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEA 193
Y+ F+EA+DA DNGIN+YD ++ P++ +N ++ + N +W E D SAE +
Sbjct: 128 YEKFVEALDANDNGINKYDVEELGVQPKFNDNAISIPGIISGFNPNWNE-DSSAEAFDRG 186
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
F + D GK F+D V Y ++WLPA+++V + I R D D SG+I++ +FCPWK HL+
Sbjct: 187 FFKASDFIGKIFVDLVTGYGKAWLPAKTLVRKAIDGRMDVDKSGKIILFDQFCPWKEHLY 246
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
+E+E+ +E +I++V+++D G WRV V VS F+ R+ LP RGLRD ELS+++G
Sbjct: 247 NVEKELNLEGVIEFVIFQDSMG-SWRVATVPVSSTSFKFRRGLPEPLRGLRDAELSEKSG 305
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+P C+F+H +GFIGG + AL +A+ +L+
Sbjct: 306 VPDCIFIHAAGFIGGAKQKESALKLAQMSLE 336
>gi|408396785|gb|EKJ75939.1| hypothetical protein FPSE_03887 [Fusarium pseudograminearum CS3096]
Length = 349
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 155/358 (43%), Positives = 208/358 (58%), Gaps = 29/358 (8%)
Query: 6 VSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD 65
++ S A + PL +GTH+G FH DEAL M+R + +A +VR+RDP VL
Sbjct: 1 MAESAAKRVKTSGNGPL--IGTHSGHFHADEALAVHMLRRLPTYRDADLVRTRDPAVLAT 58
Query: 66 LDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDE 125
V+DVGG YD +DHHQ+GF F G TKLSSAGLV+ HFG+ ++A+ L + +
Sbjct: 59 CHTVVDVGGEYDAEKRRFDHHQRGFNTTF-PGRPTKLSSAGLVFLHFGRAIVAERLGLSD 117
Query: 126 GHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VG 174
PDV ++ +Y+NF+EA+DA DNGI+ YD P + L R VG
Sbjct: 118 DSPDVDLIYKKLYENFVEALDAHDNGISVYD----PAAIAAAGLEKRFSEGAFGLGAVVG 173
Query: 175 KLNLDWTEPDQSAERENEA-----FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAE 229
+LN W +P S E +A F + G+EF + Y SWLPAR+IV E +
Sbjct: 174 RLNPKWNDPTPSDPAEAQAAEDAKFNEASSRIGQEFDRDLDNYAASWLPARTIVQEAFNK 233
Query: 230 RYDYDPSGEIMVLK-RFCPWKLHLFELEEEMKIEPLIKYVLYED--DRGKQWRVQAVAVS 286
R YD G I++L+ + PWK HL+ LE+ P + YVLY + + G +WR+Q V S
Sbjct: 234 RKQYDEQGRILILEGQSVPWKDHLYTLEDGT---PSVIYVLYAEKPEPGAKWRIQCVPES 290
Query: 287 PDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
D F SRKPLP WRG RD EL +GIPGCVFVH +GFIGGN+++ GA MA AL+
Sbjct: 291 KDSFTSRKPLPEAWRGFRDAELDGISGIPGCVFVHAAGFIGGNKTFEGAKEMATKALE 348
>gi|146078451|ref|XP_001463546.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398011146|ref|XP_003858769.1| hypothetical protein, conserved [Leishmania donovani]
gi|134067632|emb|CAM65911.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322496979|emb|CBZ32049.1| hypothetical protein, conserved [Leishmania donovani]
Length = 398
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 153/356 (42%), Positives = 205/356 (57%), Gaps = 43/356 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNGSFHCDEA+ C ++R ++ A I+R+RDPK +D D V+DVG +YD + YD
Sbjct: 45 ICTHNGSFHCDEAMACGLLRHVPEYREAVILRTRDPKQIDACDIVVDVGAIYDVDTNRYD 104
Query: 85 HHQKGFEEVFG---HGFSTKLSSAGLVYKHFGKELIAKEL-------------------- 121
HHQ F + T+LSSAGLVYKHFG+++I + +
Sbjct: 105 HHQASFHGTMTTPKKAYKTRLSSAGLVYKHFGRQIIRQYVEAVLQPSSPARAAVLSMTSW 164
Query: 122 -----NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKP------PRYVNNTNLS 170
V + DV L A+Y NF+E +D IDNG+ + P P Y +TNLS
Sbjct: 165 SESRTGVSDAELDV--LEDALYANFVEQVDGIDNGVECWGLADPAVGTLVPNYKQSTNLS 222
Query: 171 SRVGKLNLDWTEPDQS-AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAE 229
R+G+L W EP+ EN F +++A EF + + SWLPARSIV
Sbjct: 223 QRIGQLQAYWNEPENGDVVAENANFAVAVEMAVTEFFGALTYLAFSWLPARSIVEAAFQR 282
Query: 230 RYDYDPSGEIMVLK--RFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSP 287
R+++D SG+IMV K + CPWK HL ELE E + YV++ D GK WRVQAV +
Sbjct: 283 RHEFDESGKIMVFKDVKVCPWKDHLLELEAESNCLGAVLYVVFSD--GKGWRVQAVPKTS 340
Query: 288 DRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
FE+RKPLP +RGLRDD LS GI G VFVH+SGFIGG +++ GA+A+A+ AL
Sbjct: 341 TSFENRKPLP--YRGLRDDALSAACGIEGGVFVHVSGFIGGMKTFDGAMALAKQAL 394
>gi|332027751|gb|EGI67818.1| UPF0160 protein MYG1, mitochondrial [Acromyrmex echinatior]
Length = 333
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 203/329 (61%), Gaps = 14/329 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G+FHCDE L C +++L ++ +A IVRSR VLD D V+DVGG YDPS Y
Sbjct: 8 KIGTHDGTFHCDEVLACALLKLLPQYKDASIVRSRSQSVLDTCDIVVDVGGEYDPSRHRY 67
Query: 84 DHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DHH + F+E G+ ++ KLSSAGL+Y HFG E++ L+ + +F
Sbjct: 68 DHHMREFQESMSTVMKKPGYDWTIKLSSAGLIYCHFGHEILRNILSNITEDRIIDEIFKK 127
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y ++ ID IDNGI YD + P Y T+LS+RV +LN W D E++ F++
Sbjct: 128 IYDTLIKEIDGIDNGIPMYDAE--PLYRIVTDLSARVSRLNPQWNSQDIDIEKQ---FEK 182
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
M L +EFL+ V++ WLPAR +V + R++ DPSGEI+ L + PWK HLF+LE
Sbjct: 183 AMALVLEEFLEFVQYSKNVWLPARDLVRRAVENRFEVDPSGEIIALSQAVPWKEHLFQLE 242
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
++M + P IKY ++E D +RVQ + V+ F R LPA W GLR + L + GI G
Sbjct: 243 KDMNVSPSIKYAIFEADNA--YRVQCMPVALGSFVCRMFLPAAWGGLRANALVEACGIEG 300
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
+FVH FIGG+++ GA+AMAR AL++
Sbjct: 301 AIFVHSVRFIGGHKTKDGAVAMARKALEI 329
>gi|407406714|gb|EKF30895.1| hypothetical protein MOQ_005279 [Trypanosoma cruzi marinkellei]
Length = 394
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 165/364 (45%), Positives = 209/364 (57%), Gaps = 47/364 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNGSFHCDEA+ C ++R + F + I+R+RDPK L+ + V+DVG VYD + +D
Sbjct: 33 IGTHNGSFHCDEAMACGLLRCSKNFGQSNILRTRDPKALERCNIVVDVGSVYDEATLRFD 92
Query: 85 HHQKGFEEVFGH---GFSTKLSSAGLVYKHFGKELIAKELNVDEGHP------------- 128
HHQ F + + T+LSSAGLVYKHFG+E+I + P
Sbjct: 93 HHQPSFHDTMKTPKAAYRTRLSSAGLVYKHFGREIIQGYVESALASPYRVQLLDATKWRT 152
Query: 129 --------DVHRLFLAVYKNFMEAIDAIDNGINQYD-------------TDKPPRYVNN- 166
++ LF VYKNF+E ID IDNG+N Y P V N
Sbjct: 153 DKKKLSEQELDTLFDIVYKNFVEHIDGIDNGVNAYGPTETEGEEGTVPVLSSSPSCVKNY 212
Query: 167 ---TNLSSRVGKLNLDWTEP-DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSI 222
T LS+R+G L W E + E EN AF Q M+LA EF +V ++ +W+PAR I
Sbjct: 213 NVSTTLSARIGNLMPWWNEEGNGKVENENAAFLQAMELATSEFFHSVHYHAFAWMPARGI 272
Query: 223 VVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQ 281
V E + PSG I+V K FCPWK HL ELEEE + YVL+ D +G WRVQ
Sbjct: 273 VQAAFLEAMNVHPSGRIVVFKDCFCPWKEHLLELEEEQGKVGHVLYVLFTDKKG--WRVQ 330
Query: 282 AVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARA 341
AV FESRKPLP WRGLRD+ELSKE+GI G +FVH+SGFIGGN++Y GAL MA
Sbjct: 331 AVPKEASGFESRKPLP--WRGLRDEELSKESGIDGGIFVHVSGFIGGNKTYEGALQMAVK 388
Query: 342 ALKL 345
AL +
Sbjct: 389 ALTV 392
>gi|410076186|ref|XP_003955675.1| hypothetical protein KAFR_0B02420 [Kazachstania africana CBS 2517]
gi|372462258|emb|CCF56540.1| hypothetical protein KAFR_0B02420 [Kazachstania africana CBS 2517]
Length = 327
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 145/328 (44%), Positives = 208/328 (63%), Gaps = 9/328 (2%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K++ TH+GSFH DE+L +M+RL ++ +A++VRSRDPK ++ D V+DVG YD
Sbjct: 2 VKQICTHSGSFHADESLAVYMLRLLPEYKDAKLVRSRDPKDWEESDIVVDVGAQYDGVK- 60
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+ F E F F TKLSSAGLVYKHFGK +I K + D D+ L+L VYK F
Sbjct: 61 YFDHHQREFMETFSSDFHTKLSSAGLVYKHFGKRII-KSILGDVSEEDLDVLYLRVYKQF 119
Query: 142 MEAIDAIDNGINQYDTDK----PPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+EA+DA DNGI++YD R+ +N ++ + +N W D SAE ++ F +
Sbjct: 120 VEALDANDNGISKYDIKDDMGIKARFNDNAISIPGIISGMNPSWN-GDSSAENFDKCFLK 178
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
G F+ V Y SWLPA+S+V E + R SGEI++L++FCPWK HL+E+E
Sbjct: 179 ASAFIGSAFVTLVEGYGNSWLPAKSLVKEAVLNRKSCYESGEIVILEQFCPWKEHLYEIE 238
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+E+ I+ K++L+ D WRV V VS FE R+ LP RGLRD++LS+ +G+PG
Sbjct: 239 KELGIQGQTKFILFVDS-SNSWRVSTVPVSSGSFEFRRGLPKALRGLRDEQLSEASGVPG 297
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
CVF+H +GFIGG ++ L +A+ L+
Sbjct: 298 CVFIHAAGFIGGAKTKEAVLQLAKMGLE 325
>gi|116201613|ref|XP_001226618.1| hypothetical protein CHGG_08691 [Chaetomium globosum CBS 148.51]
gi|88177209|gb|EAQ84677.1| hypothetical protein CHGG_08691 [Chaetomium globosum CBS 148.51]
Length = 353
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 211/337 (62%), Gaps = 17/337 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+GTHNG FH DEAL +M+R + A++VR+RDPK+L++ V+DVGG YD + + +
Sbjct: 18 IGTHNGHFHADEALAVYMLRTHVPTYSGAKLVRTRDPKLLEECHTVVDVGGEYDAARNRF 77
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F F TKLSSAGLVY HFG+E+IA+ L E V L+ +Y++F+E
Sbjct: 78 DHHQRTFSTTFPDR-QTKLSSAGLVYMHFGREIIARRLAQAEDSEQVGMLWRKIYESFVE 136
Query: 144 AIDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDWTEPD-----QSAEREN 191
A+DA DNGI+ YD+ ++ N L + VG+LN +W +P ++ E+
Sbjct: 137 ALDAHDNGISVYDSAALAAAGLEKKFSNGGFTLGAMVGRLNPNWNDPTPADPAEAQAAED 196
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKL 250
+ F+ G+EF + ++ ++WLPAR +V E R ++D G IMVL+ + PWK
Sbjct: 197 KRFELASQRIGEEFDRDLDYFTKAWLPAREVVAEAFNGRQEFDSQGRIMVLQGQSAPWKD 256
Query: 251 HLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
HL+ LEE+ + YVLY++ +WR+Q V + D FESRKPLP WRG RD+EL
Sbjct: 257 HLYTLEEQHPEGGKVLYVLYKEKPVPDSKWRIQCVPSTKDSFESRKPLPEAWRGFRDEEL 316
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+GI G VFVH +GFIGGN+++ GALAMA+ AL++
Sbjct: 317 DGISGISGSVFVHAAGFIGGNKTFEGALAMAQKALEI 353
>gi|307189446|gb|EFN73856.1| UPF0160 protein MYG1 [Camponotus floridanus]
Length = 337
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/329 (44%), Positives = 202/329 (61%), Gaps = 14/329 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G+FHCDE L C +++L ++ +A IVRSR VL+ D V+DVGG YDPSN Y
Sbjct: 8 KIGTHDGTFHCDEVLACALLKLLPQYKDASIVRSRTQSVLETCDIVVDVGGEYDPSNHRY 67
Query: 84 DHHQKGFEEVFG-------HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DHH + F+E + ++ KLSSAGL+Y HFG E++ L + +F
Sbjct: 68 DHHMREFQETVSSVMKKSEYKWTVKLSSAGLIYCHFGHEILRSVLPEVTEDRVIEEIFKK 127
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y ++ IDAIDNGI YD + P Y T+LS+RVG+LN W D + E E F++
Sbjct: 128 IYDTLIKEIDAIDNGIPMYDAE--PLYRIVTDLSARVGRLNPQWNSKDVNIE---ERFEK 182
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
M L +EFL+ V + WLPAR IV + +R+ D SGEI++L + PWK HLF+LE
Sbjct: 183 AMALVLEEFLEFVHYTKNVWLPARDIVRHAVEDRFKVDSSGEIIMLSQTVPWKEHLFQLE 242
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+EM + P IKY ++E + G +RVQ + VS F R LP W GLR+D L + I
Sbjct: 243 KEMNVSPSIKYAVFEAEDG--YRVQCMPVSQGSFICRMFLPEAWGGLRNDALVEACEIED 300
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
+FVH FIGG+++ GALAMAR AL++
Sbjct: 301 AMFVHSVRFIGGHKTKDGALAMARKALEI 329
>gi|344304596|gb|EGW34828.1| hypothetical protein SPAPADRAFT_57919 [Spathaspora passalidarum
NRRL Y-27907]
Length = 322
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 210/322 (65%), Gaps = 5/322 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+GSFH DE+L +++RL +F ++ +VRSR+P + D V+DV G YD +
Sbjct: 3 QICTHSGSFHADESLAVYLLRLLPQFSDSTLVRSRNPSDWEASDIVVDVSGKYDGVK-YF 61
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F EVF + TKLSSAGL+YKH+G+++I L + E DV L+ VYK F+E
Sbjct: 62 DHHQREFNEVFNGNYKTKLSSAGLIYKHYGRDIIKHVLKLSED-KDVEFLYDKVYKEFIE 120
Query: 144 AIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
++DA DNGIN Y D P++ + N L S V +LN W E A+ + + F + +L G
Sbjct: 121 SLDANDNGINNYPKDATPKFHDKNITLPSMVSRLNPSWNESCTDADFDRQ-FLKSTELMG 179
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
FL + Y +SWLPA+SIV + +R++ D SGEI+VL +FCPWK HL+ +E+E E
Sbjct: 180 SAFLALLEGYGKSWLPAKSIVQDAFDQRFEVDKSGEILVLSQFCPWKEHLYAIEQENNAE 239
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
IK+VL++D K WRV V+V+ FE R LP + RGLRD+ELS+++G+ GC+F+H
Sbjct: 240 GEIKFVLFKDSTDK-WRVSTVSVTSSSFEFRLGLPEELRGLRDEELSEKSGVEGCIFIHA 298
Query: 323 SGFIGGNQSYGGALAMARAALK 344
+GFIGG S L +AR +L+
Sbjct: 299 AGFIGGANSEESVLKLARLSLQ 320
>gi|358391275|gb|EHK40679.1| hypothetical protein TRIATDRAFT_258813 [Trichoderma atroviride IMI
206040]
Length = 364
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 148/348 (42%), Positives = 208/348 (59%), Gaps = 30/348 (8%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
VGTH+G FH DEAL M+R+ + ++ +VR+RDPK+L+ V+DVGG YD + YD
Sbjct: 20 VGTHSGHFHADEALAVHMLRMLPTYRDSSLVRTRDPKILETCHTVVDVGGEYDAQRNRYD 79
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR------------ 132
HHQ+GF F G +TKLSSAGLVY HFG++LIA+ + +
Sbjct: 80 HHQRGFTTTF-PGKNTKLSSAGLVYMHFGRDLIAQSIQQASSKANADADAAVDSSSDVEL 138
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDWTEP-- 183
L+ +Y+NF+EA+DA DNGI++YD D R+ + L + VG+LN W +P
Sbjct: 139 LYNKIYENFIEAVDAHDNGISRYDRDALQAAGIEQRFSSGGFTLGAMVGRLNPAWNDPKP 198
Query: 184 ---DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
D++ + E++ F G+EF T+ F +WLPAR+IV + +R +D G I+
Sbjct: 199 ADADEAQQAEDQLFLTASRRIGEEFERTLDFMTGAWLPARTIVQQAFDQRTKHDAEGRIL 258
Query: 241 VLK-RFCPWKLHLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLP 297
V++ + PWK HL+ LE E P + YVLY + G +WR+Q V + D F SRKPLP
Sbjct: 259 VIEGQSVPWKEHLYTLEGEGN--PSVLYVLYAEGTQPGAKWRIQCVPETQDSFTSRKPLP 316
Query: 298 AQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
WRG RD EL +G+ GCVFVH +GFIGGN+++ GA+ MA AL L
Sbjct: 317 EAWRGFRDAELDGVSGVDGCVFVHAAGFIGGNKTFEGAMEMATKALAL 364
>gi|157865100|ref|XP_001681258.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68124553|emb|CAJ02738.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 388
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 205/356 (57%), Gaps = 43/356 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNGSFHCDEA+ C ++R ++ A I+R+RDPK +D D V+DVG +YD + YD
Sbjct: 35 ICTHNGSFHCDEAMACGLLRHVPEYREAVILRTRDPKQIDACDIVVDVGAIYDADTNRYD 94
Query: 85 HHQKGFEEVFG---HGFSTKLSSAGLVYKHFGKELIAKEL-------------------- 121
HHQ F + T+LSSAGLVYKHFG+++I + +
Sbjct: 95 HHQASFHGTMTTPKKAYKTRLSSAGLVYKHFGRQIICEYVEAVLQPSSPARAAVLSMTSW 154
Query: 122 -----NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKP------PRYVNNTNLS 170
+ E DV L A+Y +F+E +D IDNG+ + P P Y +T+LS
Sbjct: 155 SESRTGLSEAELDV--LEDALYAHFVEQVDGIDNGVECWGLADPAVGTLVPNYTQSTSLS 212
Query: 171 SRVGKLNLDWTEPDQS-AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAE 229
R+G+L W EP+ EN F +++A EF + + SWLPARSIV
Sbjct: 213 QRIGQLQAYWNEPENGDVVAENANFAVAVEMAVTEFFKALTYLSFSWLPARSIVEAAFQR 272
Query: 230 RYDYDPSGEIMVLK--RFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSP 287
R+++D SG+IMV K + CPWK HL ELE E + YV++ D GK WRVQAV +
Sbjct: 273 RHEFDESGKIMVFKGVKACPWKDHLLELEAESNCLGAVLYVVFSD--GKGWRVQAVPKTS 330
Query: 288 DRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
FE+RKPLP +RGLRDD LS GI G VFVH+SGFIGG +++ GA+A+A+ AL
Sbjct: 331 TSFENRKPLP--YRGLRDDALSAACGIEGGVFVHVSGFIGGMKTFDGAMALAKQAL 384
>gi|322701147|gb|EFY92898.1| MYG1 protein [Metarhizium acridum CQMa 102]
Length = 347
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 149/337 (44%), Positives = 209/337 (62%), Gaps = 21/337 (6%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG FH DEAL M+R+ + ++ +VR+RDP++L V+DVGG YD + Y
Sbjct: 14 KIGTHNGHFHADEALAVHMLRMLPTYRDSPLVRTRDPELLAACHTVVDVGGEYDAQRNRY 73
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN--VDEGHPDVHRLFLAVYKNF 141
DHHQ+GF F G STKLSSAGLV+KHFG+ ++A+ L V E P+V L +Y++F
Sbjct: 74 DHHQRGFTTTF-PGRSTKLSSAGLVFKHFGRAIVAQRLGAGVAEDAPEVELLHNKLYESF 132
Query: 142 MEAIDAIDNGINQYD------TDKPPRYVNNT-NLSSRVGKLNLDW-----TEPDQSAER 189
+EA+DA DNGI+ YD RY L + VG+LN +W ++P ++
Sbjct: 133 VEALDAHDNGISVYDPAGLSAAGLEKRYSEGGFTLGAVVGRLNPNWNDEKPSDPVEAQAA 192
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPW 248
E++ F + + G+EF + +Y +WLPARSIV E +R ++DP G ++V + + PW
Sbjct: 193 EDDRFNKASERIGEEFERDLDYYATAWLPARSIVQEAFNKRSEFDPEGRVLVFEGQSVPW 252
Query: 249 KLHLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
K HL+ LE+ +P + YVLY + G +WR+Q V S D F SRKPLP WRG RD
Sbjct: 253 KDHLYTLEDG---KPTVLYVLYPESTAPGAKWRIQCVPESKDSFLSRKPLPEAWRGFRDQ 309
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
EL +GI GCVFVH +GFIGGN+++ G MA AL
Sbjct: 310 ELDGISGIEGCVFVHAAGFIGGNKTFEGVKEMAVKAL 346
>gi|50290377|ref|XP_447620.1| hypothetical protein [Candida glabrata CBS 138]
gi|49526930|emb|CAG60557.1| unnamed protein product [Candida glabrata]
Length = 334
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 143/328 (43%), Positives = 209/328 (63%), Gaps = 13/328 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+G FH DE+L +M+RL KF +A++VRSR+P+ + D V+DVG YD +
Sbjct: 12 QICTHSGPFHADESLAVYMLRLLPKFKDAKVVRSRNPEDWEASDIVVDVGAKYDGVK-FF 70
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIA-----KELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E F ++TKLSSAGLVYKHFG+E+I K ++ G D+ L+ VY
Sbjct: 71 DHHQREFFETFNGEYATKLSSAGLVYKHFGREIITAIVENKGASISSGDLDI--LYDKVY 128
Query: 139 KNFMEAIDAIDNGINQYDTDK--PPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQ 195
K F+EA+DA DNGI++Y+ K P++ +N ++ + +N W D SAE + FQ
Sbjct: 129 KQFIEALDANDNGIDKYENSKELTPKFKDNAISIPGVISGMNPSWN-ADTSAEAYDTNFQ 187
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
+ G+ F+D V Y SW+PA+S+V+E + R + D SG+I+VL +FCPWK HL+E+
Sbjct: 188 RASKFIGEIFVDLVEGYALSWIPAKSLVIEAVNRRSEVDDSGKIIVLNQFCPWKEHLYEV 247
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
E+ IE I++VL++D WRV V VS F+ RK LP RGLRD ELS ++G+P
Sbjct: 248 EKAKGIEGQIEFVLFKDS-SNTWRVSTVPVSSTSFQFRKGLPEHLRGLRDQELSDKSGVP 306
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAAL 343
CVF+H +GFIGG ++ +A+ +L
Sbjct: 307 DCVFIHAAGFIGGAKTQDSVYKLAKMSL 334
>gi|164661934|ref|XP_001732089.1| hypothetical protein MGL_0682 [Malassezia globosa CBS 7966]
gi|159105991|gb|EDP44875.1| hypothetical protein MGL_0682 [Malassezia globosa CBS 7966]
Length = 340
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 201/328 (61%), Gaps = 12/328 (3%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+G+FH DEAL ++ +A+++R+RDP + V+DVG Y P YD
Sbjct: 17 IVTHSGTFHADEALAVHLLSKLPVLHDAELIRTRDPAEIAKGTIVVDVGAEYVPEKHRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+GF E F TKLSSAGLV+KHFG++++ LN + L+ +Y +F+EA
Sbjct: 77 HHQRGFSETFDAEHKTKLSSAGLVWKHFGRDILTAHLNCSADDKRIPVLYKKMYDDFVEA 136
Query: 145 IDAIDNGINQY-DTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
ID DNGI Y +PP Y + T+LSSRV +N W E A+ F++ +AG
Sbjct: 137 IDGHDNGIPLYPGVSEPPAYRSRTDLSSRVSYMNPRWNETWDDADLLAR-FRRASSMAGS 195
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPS---GEIMVLKRFCPWKLHLFELEEEMK 260
EF D V V W PAR +VV+ + R +D + G I++ +R WK H+F+LEE M
Sbjct: 196 EFFDRVNDAVDGWFPARQLVVDALHARKSFDGADEKGRIVLFQRAVAWKSHIFDLEESMN 255
Query: 261 I----EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
I +PL Y++Y D+ K WRVQAV V+ + FESRK LP WRG+RD++LS +GIP
Sbjct: 256 IPDDEKPL--YIVYPDESNK-WRVQAVPVNLESFESRKALPEAWRGIRDEQLSGLSGIPS 312
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
CVFVH SGFIGGNQ+ GAL MA+ AL+
Sbjct: 313 CVFVHQSGFIGGNQTKEGALTMAKKALE 340
>gi|190349159|gb|EDK41758.2| hypothetical protein PGUG_05856 [Meyerozyma guilliermondii ATCC
6260]
Length = 357
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 216/327 (66%), Gaps = 8/327 (2%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
+ + ++ TH+GSFH DE+L +M++L + + ++VRSR+P+ + D V+DVGG YD +
Sbjct: 36 SSIMQICTHSGSFHADESLAVYMVKLLPAYKDCKLVRSRNPEDWEASDLVIDVGGKYDGT 95
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
+DHHQ+ F E F + TKLSSAGL+YKHFGK++I + + + ++ L+ VYK
Sbjct: 96 K-YFDHHQREFSETFSANYKTKLSSAGLIYKHFGKQIIEEVCQSTKENTEL--LYEKVYK 152
Query: 140 NFMEAIDAIDNGINQYDTD-KPPRYVNNTNLS--SRVGKLNLDWTEPDQSAERENEAFQQ 196
F+EA+DA DNGIN Y + + + N+ N++ + V KLN W E A+ + AFQ+
Sbjct: 153 EFIEALDANDNGINNYSKEAEAEKNFNDRNITLPAIVSKLNPSWNENPTDADFD-AAFQR 211
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
L G+ F++ ++ Y SWLPAR+IV R+D D SG+I+ L++FCPWK HL+ +E
Sbjct: 212 SSALMGEVFVNLLKGYGNSWLPARTIVESGFDSRFDVDKSGKILKLEQFCPWKEHLYAIE 271
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+E + I++VL++D K WRV V+V+ FE RK L WRGLRD+ELS++AG+PG
Sbjct: 272 KEKSAQGQIEFVLFQDSSSK-WRVSTVSVTSTSFEFRKGLLEPWRGLRDEELSEKAGVPG 330
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAAL 343
C+FVH +GFIGG +Y G L +A+ +L
Sbjct: 331 CIFVHAAGFIGGANTYEGVLKLAQLSL 357
>gi|45551849|ref|NP_731303.2| CG11980, isoform A [Drosophila melanogaster]
gi|45446428|gb|AAF54319.2| CG11980, isoform A [Drosophila melanogaster]
Length = 345
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 210/342 (61%), Gaps = 20/342 (5%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD-LDAVLDV 72
+ P ++PL +GTH+G+FHCDE + CFM++ D++ NA+I RSRD K L + D ++DV
Sbjct: 4 VTPPKRSPL-WIGTHSGTFHCDELVACFMLKQLDEYKNAEIFRSRDNKALKEKCDIIVDV 62
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFS--------TKLSSAGLVYKHFGKELIAKELNVD 124
GGVYD + YDHHQ F+E F +LSSAGLVY H+G+ +I L +
Sbjct: 63 GGVYDHAKKLYDHHQITFKETFSSVRPDVSEDYNVVRLSSAGLVYCHYGERVIQSILQRE 122
Query: 125 EG----HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW 180
+G ++ F+ +Y+NF+ +DAIDNG+ ++ +P Y +T+LS+R+ KLN W
Sbjct: 123 KGIKLSPENLQTAFIQIYRNFINELDAIDNGVPMFEGVEPI-YKISTHLSARIAKLNPSW 181
Query: 181 TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
E E + F+Q MD AG+EF+D V SW+PAR V E + + P+GEI+
Sbjct: 182 QETGVDIE---DRFRQAMDTAGREFVDNVVEVSCSWIPARDHVREALKNAKNVHPTGEIL 238
Query: 241 VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
VLK FCPWK HLF+LE+E K+E + K V++ + G WRV V V+P + RK LP W
Sbjct: 239 VLKNFCPWKSHLFDLEKEYKVEGVPKLVVF--NSGNSWRVAGVPVTPGSYLGRKFLPTPW 296
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAA 342
RGL DDEL +A I F+H +GFIGG ++ A+ +A+ +
Sbjct: 297 RGLMDDELCDKASIKDLEFIHHTGFIGGAKTEEAAMLLAKKS 338
>gi|21711707|gb|AAM75044.1| LD44814p [Drosophila melanogaster]
Length = 360
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 210/342 (61%), Gaps = 20/342 (5%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD-LDAVLDV 72
+ P ++PL +GTH+G+FHCDE + CFM++ D++ NA+I RSRD K L + D ++DV
Sbjct: 19 VTPPKRSPL-WIGTHSGTFHCDELVACFMLKQLDEYKNAEIFRSRDNKALKEKCDIIVDV 77
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFS--------TKLSSAGLVYKHFGKELIAKELNVD 124
GGVYD + YDHHQ F+E F +LSSAGLVY H+G+ +I L +
Sbjct: 78 GGVYDHAKKLYDHHQITFKETFSSVRPDVSEDYNVVRLSSAGLVYCHYGERVIQSILQRE 137
Query: 125 EG----HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW 180
+G ++ F+ +Y+NF+ +DAIDNG+ ++ +P Y +T+LS+R+ KLN W
Sbjct: 138 KGIKLSPENLQTAFIQIYRNFINELDAIDNGVPMFEGVEPI-YKISTHLSARIAKLNPSW 196
Query: 181 TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
E E + F+Q MD AG+EF+D V SW+PAR V E + + P+GEI+
Sbjct: 197 QETGVDIE---DRFRQAMDTAGREFVDNVVEVSCSWIPARDHVREALKNAKNVHPTGEIL 253
Query: 241 VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
VLK FCPWK HLF+LE+E K+E + K V++ + G WRV V V+P + RK LP W
Sbjct: 254 VLKNFCPWKSHLFDLEKEYKVEGVPKLVVF--NSGNSWRVAGVPVTPGSYLGRKFLPTPW 311
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAA 342
RGL DDEL +A I F+H +GFIGG ++ A+ +A+ +
Sbjct: 312 RGLMDDELCDKASIKDLEFIHHTGFIGGAKTEEAAMLLAKKS 353
>gi|24645226|ref|NP_731302.1| CG11980, isoform B [Drosophila melanogaster]
gi|23170755|gb|AAN13405.1| CG11980, isoform B [Drosophila melanogaster]
gi|220946342|gb|ACL85714.1| CG11980-PA [synthetic construct]
gi|220956086|gb|ACL90586.1| CG11980-PA [synthetic construct]
Length = 358
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 210/342 (61%), Gaps = 20/342 (5%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD-LDAVLDV 72
+ P ++PL +GTH+G+FHCDE + CFM++ D++ NA+I RSRD K L + D ++DV
Sbjct: 17 VTPPKRSPL-WIGTHSGTFHCDELVACFMLKQLDEYKNAEIFRSRDNKALKEKCDIIVDV 75
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFS--------TKLSSAGLVYKHFGKELIAKELNVD 124
GGVYD + YDHHQ F+E F +LSSAGLVY H+G+ +I L +
Sbjct: 76 GGVYDHAKKLYDHHQITFKETFSSVRPDVSEDYNVVRLSSAGLVYCHYGERVIQSILQRE 135
Query: 125 EG----HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW 180
+G ++ F+ +Y+NF+ +DAIDNG+ ++ +P Y +T+LS+R+ KLN W
Sbjct: 136 KGIKLSPENLQTAFIQIYRNFINELDAIDNGVPMFEGVEPI-YKISTHLSARIAKLNPSW 194
Query: 181 TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
E E + F+Q MD AG+EF+D V SW+PAR V E + + P+GEI+
Sbjct: 195 QETGVDIE---DRFRQAMDTAGREFVDNVVEVSCSWIPARDHVREALKNAKNVHPTGEIL 251
Query: 241 VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
VLK FCPWK HLF+LE+E K+E + K V++ + G WRV V V+P + RK LP W
Sbjct: 252 VLKNFCPWKSHLFDLEKEYKVEGVPKLVVF--NSGNSWRVAGVPVTPGSYLGRKFLPTPW 309
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAA 342
RGL DDEL +A I F+H +GFIGG ++ A+ +A+ +
Sbjct: 310 RGLMDDELCDKASIKDLEFIHHTGFIGGAKTEEAAMLLAKKS 351
>gi|24645224|ref|NP_649857.1| CG11980, isoform C [Drosophila melanogaster]
gi|23170754|gb|AAN13404.1| CG11980, isoform C [Drosophila melanogaster]
gi|46409148|gb|AAS93731.1| RE51612p [Drosophila melanogaster]
gi|220952150|gb|ACL88618.1| CG11980-PA [synthetic construct]
Length = 359
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 210/342 (61%), Gaps = 20/342 (5%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDD-LDAVLDV 72
+ P ++PL +GTH+G+FHCDE + CFM++ D++ NA+I RSRD K L + D ++DV
Sbjct: 18 VTPPKRSPL-WIGTHSGTFHCDELVACFMLKQLDEYKNAEIFRSRDNKALKEKCDIIVDV 76
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFS--------TKLSSAGLVYKHFGKELIAKELNVD 124
GGVYD + YDHHQ F+E F +LSSAGLVY H+G+ +I L +
Sbjct: 77 GGVYDHAKKLYDHHQITFKETFSSVRPDVSEDYNVVRLSSAGLVYCHYGERVIQSILQRE 136
Query: 125 EG----HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW 180
+G ++ F+ +Y+NF+ +DAIDNG+ ++ +P Y +T+LS+R+ KLN W
Sbjct: 137 KGIKLSPENLQTAFIQIYRNFINELDAIDNGVPMFEGVEPI-YKISTHLSARIAKLNPSW 195
Query: 181 TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
E E + F+Q MD AG+EF+D V SW+PAR V E + + P+GEI+
Sbjct: 196 QETGVDIE---DRFRQAMDTAGREFVDNVVEVSCSWIPARDHVREALKNAKNVHPTGEIL 252
Query: 241 VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
VLK FCPWK HLF+LE+E K+E + K V++ + G WRV V V+P + RK LP W
Sbjct: 253 VLKNFCPWKSHLFDLEKEYKVEGVPKLVVF--NSGNSWRVAGVPVTPGSYLGRKFLPTPW 310
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAA 342
RGL DDEL +A I F+H +GFIGG ++ A+ +A+ +
Sbjct: 311 RGLMDDELCDKASIKDLEFIHHTGFIGGAKTEEAAMLLAKKS 352
>gi|254570010|ref|XP_002492115.1| Putative protein of unknown function [Komagataella pastoris GS115]
gi|238031912|emb|CAY69835.1| Putative protein of unknown function [Komagataella pastoris GS115]
Length = 337
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/334 (42%), Positives = 209/334 (62%), Gaps = 6/334 (1%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
+ S + + ++ TH+GSFH DEAL +++R K+ N+ ++RSRDP ++ D V+DV
Sbjct: 5 TESKKRKLDMLKICTHSGSFHADEALAVYLLRSLKKWTNSTLLRSRDPAKWEEADIVVDV 64
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVD--EGHPDV 130
GG YD +DHHQ+ F F + TKLSSAGLV+K FGKE+IA++LN D E ++
Sbjct: 65 GGKYDDVK-FFDHHQREFSTTFSDKYQTKLSSAGLVFKKFGKEVIAEKLNWDLNEESTNI 123
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAER 189
+ L+ +YK+F+E++DA DNG++ Y+ ++ + N L+S V LN W + A+
Sbjct: 124 NILYEKLYKDFIESVDANDNGVSNYENTNERKFNDKNFTLASVVSNLNPLWVDEPTDADF 183
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK 249
+ + F++ ++ G+ F + + Y RSWLPA+ V + ER+ PSG+++V RF PWK
Sbjct: 184 DKQ-FEKASEIMGQVFDNLLLSYGRSWLPAKQFVEKAFDERFQAHPSGKVLVFDRFVPWK 242
Query: 250 LHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
HL+ +E K I YVL+ D G WR+ AV VS FESRK LP WRGLRD++LS
Sbjct: 243 EHLYGVEIANKCHGEILYVLFPDS-GNNWRIAAVPVSSSSFESRKALPEPWRGLRDEKLS 301
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+ A + GCVF+H +GFIGG S G L +A A+
Sbjct: 302 EAANVEGCVFIHAAGFIGGANSKEGVLKLAALAV 335
>gi|71410337|ref|XP_807467.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871477|gb|EAN85616.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 394
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 164/373 (43%), Positives = 214/373 (57%), Gaps = 47/373 (12%)
Query: 16 SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
+ + T +GTHNGSFHCDEA+ C ++R + F + I+RSRDP L+ + V+DVG V
Sbjct: 24 AETSTATGVIGTHNGSFHCDEAMACGLLRCSQNFGQSNILRSRDPNALERCNIVVDVGSV 83
Query: 76 YDPSNDCYDHHQKGFEEVFGH---GFSTKLSSAGLVYKHFGKELI-------------AK 119
YD + +DHHQ F + + T+LSSAGLVYKHFG+E+I K
Sbjct: 84 YDEATLRFDHHQPSFHDTMKTPKAAYQTRLSSAGLVYKHFGREIIQGYVESALASPYRVK 143
Query: 120 ELNVDEGHPDVHR--------LFLAVYKNFMEAIDAIDNGINQYDTDKP----------- 160
L+ + D + LF VYKNF+E ID IDNG+N Y +
Sbjct: 144 LLDATKWGTDKKKLSEQELDILFDIVYKNFVEHIDGIDNGVNAYGPAETEGEEGAVPVSS 203
Query: 161 ------PRYVNNTNLSSRVGKLNLDWTEP-DQSAERENEAFQQGMDLAGKEFLDTVRFYV 213
+Y +T LS+R+G L W E + E EN AF Q M+LA EF D+V ++V
Sbjct: 204 SSPSCVKKYNVSTTLSARIGNLMPWWNEEGNGKVENENAAFLQAMELATSEFFDSVHYHV 263
Query: 214 RSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEEEMKIEPLIKYVLYED 272
+W+PAR IV E + PSG I+V K FCPWK HL ELEEE + YVL+ D
Sbjct: 264 FAWMPARGIVRAAFLEAMNVHPSGRIVVFKDCFCPWKEHLLELEEEHGKVGHVLYVLFAD 323
Query: 273 DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSY 332
+G WRVQAV FE+RK LP WRGLRD+ELS+ +GI G +FVH+SGFIGGN++Y
Sbjct: 324 KKG--WRVQAVPKEASGFENRKSLP--WRGLRDEELSQASGIEGGIFVHVSGFIGGNKTY 379
Query: 333 GGALAMARAALKL 345
GAL MA AL +
Sbjct: 380 EGALQMAVKALTV 392
>gi|443922998|gb|ELU42329.1| GAMM1 protein [Rhizoctonia solani AG-1 IA]
Length = 347
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 184/294 (62%), Gaps = 18/294 (6%)
Query: 7 SSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDL 66
S+S TS P++ +GTHNG+FHCDEAL F++R T ++ +A + R+RDPK+L+
Sbjct: 46 SASKRLRTSGPTRV----IGTHNGTFHCDEALAVFLLRHTKEYADASVKRTRDPKILETC 101
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEG 126
D V+DVG YDP +DHHQ+GF EVFG+GF TKLSSAGL+YKHFG E+IA +L ++
Sbjct: 102 DIVVDVGAEYDPERLLFDHHQRGFTEVFGNGFKTKLSSAGLIYKHFGPEIIANQLQLNPT 161
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQS 186
V L+L +Y +F+EAID IDNG+ QY ++ P Y + T++SSRVG LN W E S
Sbjct: 162 DGKVQELWLKLYGDFIEAIDGIDNGVLQYPSEISPAYRSRTDISSRVGHLNPRWNESVDS 221
Query: 187 AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC 246
+ +L G EFL + + +W PAR +V+ + R DPSG I+ ++F
Sbjct: 222 DGVDVSYMTS--ELTGTEFLGRLDYLANAWWPARDLVLHALEARTQVDPSGRIIAFEQFA 279
Query: 247 PWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
PWK E+ I YVLY D+ G WR+QAV VSPD FESRK LP W
Sbjct: 280 PWK-------EQKPI-----YVLYPDETGGNWRIQAVPVSPDSFESRKALPEMW 321
>gi|195395372|ref|XP_002056310.1| GJ10302 [Drosophila virilis]
gi|194143019|gb|EDW59422.1| GJ10302 [Drosophila virilis]
Length = 351
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 143/352 (40%), Positives = 213/352 (60%), Gaps = 19/352 (5%)
Query: 6 VSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL-D 64
+SS+ + TPL +GTHNG+FHCDE + CFM++ ++ NA+I RSRD K L +
Sbjct: 1 MSSANGEPKPAKRPTPLT-IGTHNGTFHCDEVVACFMLKQLPEYENAEIFRSRDEKALRE 59
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELI 117
D ++DVG ++D YDHHQ F+E F F +LSSAGL+Y +G+ +I
Sbjct: 60 KCDIIVDVGSIFDHEKKWYDHHQLTFKETFSTVLPELADEFDIRLSSAGLIYCFYGERVI 119
Query: 118 A----KELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRV 173
+E NV ++ FL +Y+NF+ +DAIDNG+ ++ +P RY +T+LS+R+
Sbjct: 120 QSILQRERNVQLSTKNLKLAFLQIYRNFISELDAIDNGVPMFEGGEP-RYKISTHLSARI 178
Query: 174 GKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDY 233
KLN W E D ++ F M +AGKEF+D V SW+ AR V + + +
Sbjct: 179 NKLNPSWQETDVDIDQR---FYTAMAVAGKEFVDNVLEVACSWIAAREYVRQALEQAKSV 235
Query: 234 DPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESR 293
SGEI++L+RFCPWK+HL +LE+E +E + K V++ + G WRV V V+P F R
Sbjct: 236 HASGEILLLERFCPWKVHLSDLEKEYSVEGVPKLVIF--NEGANWRVAGVPVTPSSFLGR 293
Query: 294 KPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
K LP+ WRGL D+EL ++AG +FVH +GF+GG ++ ALAMA+ +++
Sbjct: 294 KFLPSPWRGLHDEELCQKAGTNDLIFVHHTGFVGGAKTKEAALAMAQKSIEF 345
>gi|322706880|gb|EFY98459.1| MYG1 protein [Metarhizium anisopliae ARSEF 23]
Length = 347
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 147/341 (43%), Positives = 209/341 (61%), Gaps = 29/341 (8%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG FH DEAL M+R+ + ++ ++R+RDP +L V+DVGG YD + Y
Sbjct: 14 KIGTHNGHFHADEALAVHMLRMLPTYHDSPLIRTRDPDLLAACHTVVDVGGEYDAQRNRY 73
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN--VDEGHPDVHRLFLAVYKNF 141
DHHQ+GF F G TKLSSAGLV+KHFGK +IA++L V + P+V L +Y++F
Sbjct: 74 DHHQRGFTTTFP-GRRTKLSSAGLVFKHFGKAIIAQKLGAGVPQDAPEVDLLHDKLYESF 132
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLDW-----TEPDQ 185
+EA+DA DNGI+ YD P ++ L R VG+LN +W ++PD+
Sbjct: 133 VEALDAHDNGISVYD----PAGISAAGLEKRYSEGGFTLGAVVGRLNPNWNDERPSDPDE 188
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-R 244
+ + E+ F + G+EF + +Y +WLPARS+V + +R ++DP G ++V + +
Sbjct: 189 AQKAEDARFYKASLRIGEEFERDLDYYATAWLPARSVVQKAFDKRSEFDPEGRVLVFEGQ 248
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVAVSPDRFESRKPLPAQWRG 302
PWK HL+ LE+ +P + YVLY + G +WR+Q V S D F SRKPLP WRG
Sbjct: 249 SVPWKDHLYTLEDG---KPTVLYVLYPESTAPGAKWRIQCVPESKDSFVSRKPLPEAWRG 305
Query: 303 LRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
RD EL +GI GCVFVH +GFIGGN+++ G MA AL
Sbjct: 306 FRDQELDGISGIEGCVFVHAAGFIGGNKTFEGVKKMAVKAL 346
>gi|342879293|gb|EGU80547.1| hypothetical protein FOXB_08925 [Fusarium oxysporum Fo5176]
Length = 349
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 150/334 (44%), Positives = 200/334 (59%), Gaps = 19/334 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL M+R + +A +VR+RDP VL V+DVGG YD +D
Sbjct: 18 IGTHNGHFHADEALAVHMLRRLPAYRDASLVRTRDPAVLATCHTVVDVGGEYDAEKRRFD 77
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+GF F G TKLSSAGLV+ HFG+ ++A+ L E DV ++ +Y+NF+EA
Sbjct: 78 HHQRGFTTTF-PGRPTKLSSAGLVFLHFGRAIVAERLGQPEDSADVELIYEKLYENFVEA 136
Query: 145 IDAIDNGINQYD------TDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEA---- 193
+DA DNGI+ +D R+ + L + VG+LN W +P S E +A
Sbjct: 137 LDAHDNGISVFDPAGIAAAGLEKRFSDGAFGLGAMVGRLNPKWNDPTPSDPAEAQAAEDA 196
Query: 194 -FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLH 251
F + + G+EF + Y SWLPAR+IV E +R YD G ++VL+ + PWK H
Sbjct: 197 KFNEASNRIGEEFDRDLDGYAGSWLPARTIVQEAFTKRTQYDEQGRLLVLEGQSVPWKDH 256
Query: 252 LFELEEEMKIEPLIKYVLYED--DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
L+ LE+ P + YVLY + + G +WR+Q V S D F SRKPLP WRG RD EL
Sbjct: 257 LYTLEDGT---PSVLYVLYAEKPEPGAKWRIQCVPESKDSFVSRKPLPEAWRGFRDAELD 313
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+GIPGCVFVH +GFIGGN+++ GA MA AL
Sbjct: 314 GISGIPGCVFVHAAGFIGGNKTFEGAKQMALKAL 347
>gi|340726532|ref|XP_003401610.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like isoform 1
[Bombus terrestris]
gi|340726534|ref|XP_003401611.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like isoform 2
[Bombus terrestris]
Length = 329
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 204/329 (62%), Gaps = 16/329 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G+FHCDEAL CFM++ ++ +A IVRSRD +LD D V+DVGG Y+PS Y
Sbjct: 6 KIGTHDGTFHCDEALACFMLKTLPRYKDAVIVRSRDMNILDTCDIVVDVGGEYNPSKHRY 65
Query: 84 DHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DHH + F E G+ ++ KLSSAGL+Y HFG E+I K L DV +F
Sbjct: 66 DHHMRDFNESVSTVIKKPGYDWTMKLSSAGLIYCHFGHEII-KHLIPQANDSDVELIFKY 124
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y ++ +D IDNG+ ++ + P Y T+LSSRV LN +W D + + + F +
Sbjct: 125 IYDTLIKEVDGIDNGVLMFNEE--PVYRIVTDLSSRVKYLNPEWNSKDINVDSQ---FLK 179
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
++L G+E + + + WLPARSIV + I +R++ DPSGEI+ L +F PW HL+ +E
Sbjct: 180 AVELTGQELVQHINYAAYVWLPARSIVQKAIDKRFEVDPSGEIIELLQFVPWSQHLYAIE 239
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+E ++P +K+V++ D +RV+ + V P+ F R LP W GLRD++L +GI
Sbjct: 240 KEQNVQPPLKFVIFSSD---NYRVRGIPVEPNSFVCRLSLPEPWGGLRDEKLVSVSGIKD 296
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
VFVH F+GG+ + GA+ MAR AL+L
Sbjct: 297 AVFVHSKRFVGGHLTREGAITMARKALEL 325
>gi|407846082|gb|EKG02417.1| hypothetical protein TCSYLVIO_006555 [Trypanosoma cruzi]
Length = 394
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 162/373 (43%), Positives = 212/373 (56%), Gaps = 47/373 (12%)
Query: 16 SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
+ + T +GTHNGSFHCDEA+ C ++R + F + I+RSRDP L+ + V+DVG V
Sbjct: 24 AETSTATGVIGTHNGSFHCDEAMACGLLRCSQNFGQSNILRSRDPNALERCNIVVDVGSV 83
Query: 76 YDPSNDCYDHHQKGFEEVFGHG---FSTKLSSAGLVYKHFGKELIAKELNVDEGHP---- 128
YD + +DHHQ F + + T+LSSAGLVYKHFG+E+I + P
Sbjct: 84 YDEATLRFDHHQPSFHDTMKTPKAVYQTRLSSAGLVYKHFGREIIQGYVESALASPYRVK 143
Query: 129 -----------------DVHRLFLAVYKNFMEAIDAIDNGINQYDTDKP----------- 160
++ LF VYKNF+E ID IDNG+N Y +
Sbjct: 144 LLDATKWGTDKKKLSEQELDTLFDIVYKNFVEHIDGIDNGVNAYGPAETEGEEGAVPVSS 203
Query: 161 ------PRYVNNTNLSSRVGKLNLDWTEP-DQSAERENEAFQQGMDLAGKEFLDTVRFYV 213
+Y +T LS+R+G L W E + E EN AF Q M+LA EF D+V ++V
Sbjct: 204 SSPSCVKKYNVSTTLSARIGNLMPWWNEEGNGKVENENAAFLQAMELATSEFFDSVHYHV 263
Query: 214 RSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEEEMKIEPLIKYVLYED 272
+W+PAR IV E + PSG I+V K FCPWK HL ELEEE + YVL+ D
Sbjct: 264 FAWMPARGIVRAAFLEAMNVHPSGRIVVFKDCFCPWKEHLLELEEEHGKVGHVLYVLFAD 323
Query: 273 DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSY 332
+G WRVQAV FE+RK LP WRGLRD+ELS+ +GI G +FVH+SGFIGGN++Y
Sbjct: 324 KKG--WRVQAVPKEASGFENRKSLP--WRGLRDEELSQASGIEGGIFVHVSGFIGGNKTY 379
Query: 333 GGALAMARAALKL 345
GAL MA AL +
Sbjct: 380 EGALQMAVKALTV 392
>gi|328351397|emb|CCA37796.1| UPF0160 protein MYG1 [Komagataella pastoris CBS 7435]
Length = 324
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 205/323 (63%), Gaps = 6/323 (1%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+GSFH DEAL +++R K+ N+ ++RSRDP ++ D V+DVGG YD +
Sbjct: 3 KICTHSGSFHADEALAVYLLRSLKKWTNSTLLRSRDPAKWEEADIVVDVGGKYDDVK-FF 61
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVD--EGHPDVHRLFLAVYKNF 141
DHHQ+ F F + TKLSSAGLV+K FGKE+IA++LN D E +++ L+ +YK+F
Sbjct: 62 DHHQREFSTTFSDKYQTKLSSAGLVFKKFGKEVIAEKLNWDLNEESTNINILYEKLYKDF 121
Query: 142 MEAIDAIDNGINQYDTDKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+E++DA DNG++ Y+ ++ + N L+S V LN W + A+ + + F++ ++
Sbjct: 122 IESVDANDNGVSNYENTNERKFNDKNFTLASVVSNLNPLWVDEPTDADFDKQ-FEKASEI 180
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G+ F + + Y RSWLPA+ V + ER+ PSG+++V RF PWK HL+ +E K
Sbjct: 181 MGQVFDNLLLSYGRSWLPAKQFVEKAFDERFQAHPSGKVLVFDRFVPWKEHLYGVEIANK 240
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
I YVL+ D G WR+ AV VS FESRK LP WRGLRD++LS+ A + GCVF+
Sbjct: 241 CHGEILYVLFPDS-GNNWRIAAVPVSSSSFESRKALPEPWRGLRDEKLSEAANVEGCVFI 299
Query: 321 HMSGFIGGNQSYGGALAMARAAL 343
H +GFIGG S G L +A A+
Sbjct: 300 HAAGFIGGANSKEGVLKLAALAV 322
>gi|407929038|gb|EKG21877.1| Metal-dependent protein hydrolase [Macrophomina phaseolina MS6]
Length = 353
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 147/352 (41%), Positives = 215/352 (61%), Gaps = 20/352 (5%)
Query: 13 STSSPSQTPLKR----VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA 68
+++SP LK +GTHNG FH DEAL +++R + ++ +VR+RDP VL+
Sbjct: 2 TSASPQAKRLKMTGPLIGTHNGHFHADEALAVYLLRTLPTYGSSPLVRTRDPAVLETCHT 61
Query: 69 VLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHP 128
V+DVGGVYD + + YDHHQ+ F+ F +TKLSSAGLVYKHFGK++IA++ +
Sbjct: 62 VVDVGGVYDHAKNRYDHHQREFDAYFPGKNATKLSSAGLVYKHFGKDIIAQQTGLRIDSE 121
Query: 129 DVHRLFLAVYKNFMEAIDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDWT 181
DV L+ +Y+ +EA DA D G++ YDT R+ + +++S V + N +
Sbjct: 122 DVEILYQKLYEGLIEAFDANDTGVSAYDTKALKTAGVERRFEDRGFSIASVVNRYNYHFE 181
Query: 182 E-----PDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPS 236
+ +Q + E+E F + G++F+ + R+WLPAR+ V R YDP
Sbjct: 182 DDTGKSKEQLQQEEDERFLKASAFVGEQFIMELTDKHRNWLPARAAVKAAFDGRKQYDPQ 241
Query: 237 GEIMVLKRFCPWKLHLFELEEEMKI--EPLIKYVLY--EDDRGKQWRVQAVAVSPDRFES 292
G IMVLK+ PW HL+ LE E E + YVL+ D+ +WR++AV+V FE+
Sbjct: 242 GRIMVLKQGMPWADHLYTLESESNTPAESRVLYVLFPESDEADSKWRIRAVSVEGGGFEN 301
Query: 293 RKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RK LP W+G+RD++L + +GIPGCVFVH SGFIGGN++Y GAL MA+ AL+
Sbjct: 302 RKDLPDAWKGVRDEKLDEVSGIPGCVFVHASGFIGGNKTYDGALKMAQKALE 353
>gi|340507915|gb|EGR33761.1| hypothetical protein IMG5_039660 [Ichthyophthirius multifiliis]
Length = 342
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 200/338 (59%), Gaps = 18/338 (5%)
Query: 23 KRVGTHNGSFHCDEALGCFMI-RLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
K+V TH+G FHCDE L C M+ R T++F N I R+RD K++D+ D V+DVGG+YDPS
Sbjct: 4 KKVATHSGCFHCDEVLACTMLTRYTNEFKNCSITRTRDQKIIDEHDIVVDVGGLYDPSKH 63
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK-------------ELNVDEGHP 128
YDHHQ F + F F +LSSAGL+YKHFG E++ + +
Sbjct: 64 RYDHHQNSFTDTFSEEFQIRLSSAGLIYKHFGMEIVKNICDKILEQNKQYLQTEIQLNET 123
Query: 129 DVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAE 188
+ ++ +YK F+ ++DAIDNGI+QY D P Y N T+L SR+ +L WTE + E
Sbjct: 124 ILQEIYYRIYKGFIMSVDAIDNGIDQYPKDIKPLYYNKTSLWSRISRLQPHWTE--KYIE 181
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPW 248
+ + F + MD+A +E V+ + S +PAR V + I R++ SG+I+ + PW
Sbjct: 182 DDIQRFNKAMDMADEELFSQVKVLLLSTIPARECVKQAINGRFNIHQSGQIIAFETPLPW 241
Query: 249 KLHLFELEEEMKIEPLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRD-D 306
K HL + E EM I+ +IK+VL+ E + K WRVQ V V+ F+ R L +WRG++D D
Sbjct: 242 KDHLEDFENEMDIKGVIKFVLFPESEEKKAWRVQGVPVNQGSFDLRIGLKKEWRGIKDMD 301
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
L GI VFVH SGFIGG +S+ L MA +++
Sbjct: 302 ILKNVTGIQDIVFVHNSGFIGGAKSFQSTLKMALESIE 339
>gi|45201241|ref|NP_986811.1| AGR145Cp [Ashbya gossypii ATCC 10895]
gi|44986095|gb|AAS54635.1| AGR145Cp [Ashbya gossypii ATCC 10895]
gi|374110060|gb|AEY98965.1| FAGR145Cp [Ashbya gossypii FDAG1]
Length = 410
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 197/316 (62%), Gaps = 8/316 (2%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
TP K++ TH+GSFH DEAL +M+RL + +A++VRSRDP ++ V+DV G YD
Sbjct: 85 TPFKQICTHSGSFHADEALAVYMLRLLPEGKDAKLVRSRDPAKWEESCIVVDVSGKYD-G 143
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
+DHHQ+ F E F + TKLSSAGLVYKHFG++++ + L + D L+ VY+
Sbjct: 144 VKYFDHHQREFFETFSDQYKTKLSSAGLVYKHFGRQIV-RALCPEISDEDTELLYEKVYR 202
Query: 140 NFMEAIDAIDNGINQYDTD----KPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
+F+EA+DA DNGI+ +D + +P + N ++ V K+N DW E + S R +E F
Sbjct: 203 DFVEALDANDNGISNFDAEELGVRPRFHDKNISIPGIVAKMNPDWNE-ETSDARFDECFL 261
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
G F VR Y R+WLPA+ IV + +R DPSG I+VL+RFCPWK HL+++
Sbjct: 262 TASAFVGDCFARVVRGYGRAWLPAKDIVRAGVRDRAAVDPSGRIVVLERFCPWKEHLYDV 321
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
E E+ + + +VL+ D G WRV V S F R LP WRGLRDD LS+ G+P
Sbjct: 322 ERELGLVGEVLFVLFADSSG-SWRVSTVPQSATSFRFRHGLPEPWRGLRDDALSEATGVP 380
Query: 316 GCVFVHMSGFIGGNQS 331
GC+FVH +GFIGG ++
Sbjct: 381 GCIFVHAAGFIGGART 396
>gi|353228574|emb|CCD74745.1| putative chromosome transmission fidelity factor [Schistosoma
mansoni]
Length = 1386
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/342 (40%), Positives = 201/342 (58%), Gaps = 27/342 (7%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTH+G FH DE L C M++ ++ NA+I+R+RD +L D V+DVGGV++P N Y
Sbjct: 1048 RIGTHDGRFHADEILACAMLKHLPEYSNAEIIRTRDSSILSTCDIVVDVGGVFNPENHLY 1107
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH--PDVHRLFLA 136
DHHQ+ F + + KLSSAGL+Y HFG+++++ L +DE P V LF
Sbjct: 1108 DHHQREFNLTYKDFYPNSDWDIKLSSAGLIYVHFGRKILSCILGIDENTMDPLVTALFDK 1167
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y +F+ IDAIDNG+ T P RY NT+LSSRV ++N W + D E F
Sbjct: 1168 MYSSFIVEIDAIDNGVPMATT--PLRYSMNTSLSSRVNRMNPAWNQLDTD---ETVCFHN 1222
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHLFEL 255
+ L KEF V FY +W PAR IV+ + RY D SG I+ ++ CPW H FE+
Sbjct: 1223 ALQLVDKEFTTLVHFYADTWYPAREIVLNAVKNRYSVDSSGSIIYIEGTGCPWSTHFFEI 1282
Query: 256 EEEM-----------KIEPLIKYVLYEDDRGKQWRVQAVAVSP-DRFESRKPLPAQWRGL 303
E+ + K +PL+ + +Y+ + W +QA+ ++ + F R PLP WRGL
Sbjct: 1283 EKSLLLNNKNINEIEKNDPLL-FAIYQ-RKDSTWTIQAIPLNEHNNFSQRLPLPESWRGL 1340
Query: 304 RDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
RD++LS G+P CVFVH +GF+G +++ G L MAR +K+
Sbjct: 1341 RDEQLSNIVGLPDCVFVHSTGFLGVHKTRDGVLQMARLTIKM 1382
>gi|146412243|ref|XP_001482093.1| hypothetical protein PGUG_05856 [Meyerozyma guilliermondii ATCC
6260]
Length = 357
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/325 (43%), Positives = 214/325 (65%), Gaps = 8/325 (2%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++ TH+GSFH DE+L +M++L + + ++VR R+P+ + D V+DVGG YD +
Sbjct: 38 IMQICTHSGSFHADESLAVYMVKLLPAYKDCKLVRLRNPEDWEASDLVIDVGGKYDGTK- 96
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+ F E F + TKLSSAGL+YKHFGK++I + + + ++ L+ VYK F
Sbjct: 97 YFDHHQREFSETFSANYKTKLSSAGLIYKHFGKQIIEEVCQSTKENTEL--LYEKVYKEF 154
Query: 142 MEAIDAIDNGINQYDTD-KPPRYVNNTNLS--SRVGKLNLDWTEPDQSAERENEAFQQGM 198
+EA+DA DNGIN Y + + + N+ N++ + V KLN W E A+ + AFQ+
Sbjct: 155 IEALDANDNGINNYSKEAEAEKNFNDRNITLPAIVLKLNPSWNENPTDADFD-AAFQRSS 213
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
L G+ F++ ++ Y SWLPAR+IV R+D D SG+I+ L++FCPWK HL+ +E+E
Sbjct: 214 ALMGEVFVNLLKGYGNSWLPARTIVESGFDSRFDVDKSGKILKLEQFCPWKEHLYAIEKE 273
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+ I++VL++D K WRV V+V+ FE RK L WRGLRD+ELS++AG+PGC+
Sbjct: 274 KSAQGQIEFVLFQDSSSK-WRVSTVSVTSTSFEFRKGLLEPWRGLRDEELSEKAGVPGCI 332
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
FVH +GFIGG +Y G L +A+ +L
Sbjct: 333 FVHAAGFIGGANTYEGVLKLAQLSL 357
>gi|195038065|ref|XP_001990481.1| GH19378 [Drosophila grimshawi]
gi|193894677|gb|EDV93543.1| GH19378 [Drosophila grimshawi]
Length = 352
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/356 (40%), Positives = 211/356 (59%), Gaps = 20/356 (5%)
Query: 1 MAGTGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDP 60
M+ T + S A + P ++P +GTHNG FHCDE + CFM++ ++ NA+I RSRD
Sbjct: 1 MSKTTTTDSGA--ATPPKRSPPLTIGTHNGIFHCDEVVACFMLKQLPEYENAEIFRSRDI 58
Query: 61 KVL-DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGH-------GFSTKLSSAGLVYKHF 112
L D D ++DVGG +D YDHHQ F++ F F +LSSAGL+Y +
Sbjct: 59 NALRDKCDIIMDVGGEFDHKKKWYDHHQLTFKDTFSTVCPDFSDDFDIRLSSAGLIYCFY 118
Query: 113 GKELIA----KELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTN 168
G+ +I +E ++ P++ FL +Y+NF+ +DAIDNG+ + +P RY +T+
Sbjct: 119 GERVIQSILLRERKMELSQPNLKLAFLQIYRNFINELDAIDNGVPMLEGGEP-RYKISTH 177
Query: 169 LSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIA 228
LS+R+ LN W + + + FQ M AGKEF+D V SW+ AR V +
Sbjct: 178 LSARIAMLNPSWQDTNVDVDAR---FQLAMATAGKEFVDNVLEVACSWIAARDHVRHALE 234
Query: 229 ERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD 288
+ SGEI++L+ FCPWK HLF+LE+E +E + K V++ D G WRV V ++P
Sbjct: 235 QAKSVLASGEILLLETFCPWKAHLFDLEKEYGLEGVPKLVIFND--GSSWRVAGVPITPS 292
Query: 289 RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
F+ RK LP WRGLRD+ELS+ A + +FVH +GFIGG ++ ALAMA+ +++
Sbjct: 293 SFQGRKFLPRPWRGLRDEELSQLAAVKDLIFVHHTGFIGGAKTKEAALAMAKMSME 348
>gi|350418413|ref|XP_003491849.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Bombus
impatiens]
Length = 330
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 203/329 (61%), Gaps = 15/329 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G+FHCDE L CFM++ ++ +A IVRSR+ +L+ D V+DVGG Y+PS Y
Sbjct: 6 KIGTHDGTFHCDETLACFMLKTLPRYKDAVIVRSRNMNILNTCDIVIDVGGEYNPSKHRY 65
Query: 84 DHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DHH + F E G+ + KLSSAGL+Y HFG E+I K L DV +F
Sbjct: 66 DHHMRDFNESMSTVIKKPGYDSTIKLSSAGLIYCHFGHEII-KHLIPQANDSDVELIFKY 124
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y F++ +D IDNG+ ++ + P Y +T+LSSRV LN W D + + + F +
Sbjct: 125 IYNTFVKEVDGIDNGVPMFNEE--PVYRISTDLSSRVKFLNPAWNSKDINVDSQ---FLK 179
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
++L G+E + + + WLPARSIV E I +R++ DPSGEI+VL +F PW HL+ +E
Sbjct: 180 AVELTGQELVQRINYAANVWLPARSIVQEAIDKRFEVDPSGEIIVLLQFVPWSQHLYAIE 239
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
E ++P +K+V++ ++ +RVQ V + P+ F R LP W GL ++EL+ +GI
Sbjct: 240 REQNVQPPLKFVIFRNN--GNYRVQGVPIRPNSFVCRLFLPKPWGGLCNEELANVSGIKD 297
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
+FVH FIG + + GA+ MAR AL+L
Sbjct: 298 VIFVHSERFIGSHLTREGAITMARKALEL 326
>gi|336262197|ref|XP_003345883.1| hypothetical protein SMAC_06284 [Sordaria macrospora k-hell]
gi|380088954|emb|CCC13066.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 355
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 213/340 (62%), Gaps = 22/340 (6%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+GTHNG FH DEAL +M+R K+ A++VR+RDP +L+ D V+DVGG Y+PS D Y
Sbjct: 18 IGTHNGHFHADEALAVYMLRQHVPKYTGARLVRTRDPALLETCDIVVDVGGEYEPSRDRY 77
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKNF 141
DHHQ+ F F TKLSSAGLVY HFGK +I++ + E V ++ +Y +F
Sbjct: 78 DHHQRTFTTTFPD-HQTKLSSAGLVYMHFGKGIISRRVPDTPAEDSDKVGLIWNKLYDSF 136
Query: 142 MEAIDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDWTEP----DQSAERE 190
+EA+DA DNGI+ YD+D R+ N L + VG+ N +W +P +A+ E
Sbjct: 137 VEALDAHDNGISVYDSDALKAAGIEKRFSNGGFTLGAMVGRFNGNWNDPVPQDPVAAQAE 196
Query: 191 NEA-FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPW 248
+A F++ + G+EF + +Y ++WLPAR +V R +YD G I+VLK + PW
Sbjct: 197 EDARFEKASNRIGEEFDRALDYYTKAWLPARDVVETAYKARKEYDAEGRILVLKGQSAPW 256
Query: 249 KLHLFELEEEMK--IEPLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
K HL+ LEEE ++ +I YVLY + +WRVQ V + D F+SRKPLP WRG R
Sbjct: 257 KDHLYTLEEEEGEGVQRVI-YVLYPEKPAPDAKWRVQCVPETGDSFQSRKPLPEAWRGFR 315
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
D++L + GI G VF+H +GFIGGN+++ GALAMA+ AL+
Sbjct: 316 DEKLDEITGIQGGVFIHAAGFIGGNKTFDGALAMAKKALE 355
>gi|358336124|dbj|GAA54694.1| UPF0160 protein MYG1 mitochondrial [Clonorchis sinensis]
Length = 354
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 200/343 (58%), Gaps = 28/343 (8%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DE + C M++ ++ A+IVR+RD L V+DVGG++DP+ +D
Sbjct: 12 IGTHNGKFHADELVACAMLKQLPEYNKAKIVRTRDASQLATCTIVVDVGGLFDPNTHRFD 71
Query: 85 HHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
HHQ+GF+ F F KLSSAGL+Y H+G +IA +V E P + LF +Y
Sbjct: 72 HHQRGFDLTFKDFFKESTWDIKLSSAGLIYVHYGHRVIAGVTDVQESDPMIPILFHKIYS 131
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+ IDAIDNG+ D + RY NT LSSRV +LN W +PD E F + +
Sbjct: 132 AFVAEIDAIDNGVAISDNET--RYSINTGLSSRVARLNPKWNDPDAD---ETACFMKALQ 186
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHL------ 252
+ +EF+ V Y +SW PAR++V + +++R+ DPSG I+ L+ CPW H
Sbjct: 187 MVEEEFVTLVVHYAKSWYPARALVSKALSQRHQVDPSGRIISLEDEPCPWTDHFHELEKL 246
Query: 253 ---------FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVS-PDRFESRKPLPAQWRG 302
F+ + + LY D G QW VQA++ S + F+SR PLP WRG
Sbjct: 247 ELEKLPNGHFDASDLSTATSRPVFCLYRRDDG-QWSVQAISTSEKEHFKSRVPLPEAWRG 305
Query: 303 LRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
LRD+ELS+ G+PGCVFVH +GF+G +++ GAL MAR +LKL
Sbjct: 306 LRDEELSRVVGLPGCVFVHATGFLGIHKTRDGALYMARTSLKL 348
>gi|395541038|ref|XP_003772455.1| PREDICTED: UPF0160 protein MYG1, mitochondrial, partial
[Sarcophilus harrisii]
Length = 292
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/283 (49%), Positives = 183/283 (64%), Gaps = 11/283 (3%)
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKEL 121
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGLVY HFG+ L+A+ L
Sbjct: 2 DVVVDVGGEYDPQRHRYDHHQRSFTESMSSLRPGKPWQTKLSSAGLVYLHFGQRLLAQLL 61
Query: 122 NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
V L+ +Y+ F+E +DA+DNGI+Q++ KP RY T LS+RV +LN W
Sbjct: 62 GTSIDDRIVDTLYDKMYEYFVEEVDAVDNGISQWEGGKP-RYALTTTLSARVARLNPTWN 120
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
+P+Q E F M+L +EF+ + FY RSWLPAR++V E +A R+ DPSGEI+
Sbjct: 121 QPNQDTE---AGFHCAMNLVREEFVQRLDFYKRSWLPARALVEEALANRFQVDPSGEIIE 177
Query: 242 LKRF-CPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
L + CPWK HLF+LE + I +VLY D G QWRVQ V FESR PLP QW
Sbjct: 178 LSQGGCPWKDHLFDLEPGLTPLVPIAFVLYPDQNG-QWRVQCVPKELHSFESRLPLPEQW 236
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+GLRD+ L + +GIPGC+FVH +GFIGG+++ GAL+MARA L
Sbjct: 237 QGLRDEALDQMSGIPGCIFVHANGFIGGHRTKEGALSMARATL 279
>gi|256075166|ref|XP_002573891.1| metal dependent hydrolase-related [Schistosoma mansoni]
gi|353228990|emb|CCD75161.1| metal dependent hydrolase-related [Schistosoma mansoni]
Length = 382
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/346 (41%), Positives = 198/346 (57%), Gaps = 34/346 (9%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
T KR+GTHNG FHCDE L +++ ++ NA +VRSRDP +L D V+DVGGVYDP
Sbjct: 6 TSEKRIGTHNGCFHCDEVLAVVLLKYLPEYRNATVVRSRDPDILSACDIVVDVGGVYDPE 65
Query: 80 NDCYDHHQKGFEEVFGHGFS-----TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
+DHHQ+ F + F KLSSAGLVY HFGK +++ ++ GH + +++
Sbjct: 66 TLRFDHHQRDFSLTWSQYFGVKMWDVKLSSAGLVYVHFGKRVLSVLTGLEIGHEVLEKIY 125
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
+ +Y++F+ ID DNGI Q + P +Y T+L RV +LN W + +E AF
Sbjct: 126 MKIYESFILEIDGQDNGIAQ--SKVPLKYNIGTSLYCRVRRLNPWW---NNESEESETAF 180
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
Q+ ++L +EFLDTV ++ W PAR+IV + + R D D S I+VL R CPWK HLF+
Sbjct: 181 QRAINLVSREFLDTVDYFTNCWWPARNIVAKAMNCRGDVDSSKTIIVLDRSCPWKSHLFD 240
Query: 255 LEEEMKIE--------------------PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRK 294
LE E + E P I +V+ D W VQ V D+F+ R
Sbjct: 241 LEREERAETVVYPEPLHQASYRPIPKFPPQILFVILPSD--GNWVVQGVP--KDKFDIRL 296
Query: 295 PLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMAR 340
P P WR L+DD+L GI GC+FVH SG +G N++ GA+ MAR
Sbjct: 297 PFPNDWRSLQDDQLCAMTGISGCIFVHNSGHLGSNKTLDGAIEMAR 342
>gi|240849553|ref|NP_001155657.1| MYG1 protein-like [Acyrthosiphon pisum]
gi|239788361|dbj|BAH70867.1| ACYPI006340 [Acyrthosiphon pisum]
Length = 332
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 136/328 (41%), Positives = 204/328 (62%), Gaps = 13/328 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K++GTH+G FHCDEA G +M++L F + +I+R+RD K+L + D V+DVGGVYD S
Sbjct: 10 KKLGTHSGIFHCDEAFGSYMLQLL--FPDLEIIRTRDEKLLAECDIVIDVGGVYDHSKRR 67
Query: 83 YDHHQKGFEE-----VFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
YDHHQ+ F+ + ++TK SSAGLVY H+G +++ + D ++ ++ V
Sbjct: 68 YDHHQRSFDHSMSTLIPNAKWTTKFSSAGLVYLHYGHDVLKTIIASDVPEDKLNTIYSKV 127
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y++F++ IDAIDNGI + P+Y +T+LSSRVG LN W + + E AF++
Sbjct: 128 YESFVQEIDAIDNGIPI--CEGVPKYNIHTHLSSRVGNLNKKWNHVGKFDDME--AFKKA 183
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF--EL 255
M L EF +TV + ++WLPARS+V++ + +RY SG ++ CPWK H F E
Sbjct: 184 MQLIKCEFEETVLYAYQTWLPARSLVIDALEKRYQTHKSGRVIEFSTPCPWKEHYFVLEE 243
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
E + P I YV++ED + WR+Q + ++P F R+PLP W G+RDD+LS +G+
Sbjct: 244 ELGEDLLPKIDYVIFEDSSNETWRIQCIPITPHSFTLRRPLPKSWWGIRDDQLSTISGVE 303
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAAL 343
C+F H +GFIGGN++ G L M A+
Sbjct: 304 DCIFCHANGFIGGNKTRLGVLGMCEKAI 331
>gi|170072174|ref|XP_001870113.1| MYG1 [Culex quinquefasciatus]
gi|167868279|gb|EDS31662.1| MYG1 [Culex quinquefasciatus]
Length = 353
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 196/331 (59%), Gaps = 14/331 (4%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G FHCDE L CFM++ K+ A + R+RD KVLD D V+DVG V+DP + YD
Sbjct: 22 IGTHDGVFHCDEVLACFMLQQLPKYERATVRRTRDLKVLDQCDIVVDVGAVFDPDTNRYD 81
Query: 85 HHQKGFEEVFGH-------GFSTKLSSAGLVYKHFGKELIAKELNVDE-GHPD---VHRL 133
HHQ F+E +LSSAGL+Y +FG+E+I K L + +P+ V +
Sbjct: 82 HHQASFQETLNSLRPEIKVKREIRLSSAGLIYTYFGEEVIRKVLERNSIANPEEELVRGV 141
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ +Y + +D IDNG+ ++ + P+Y NT+LS+RV N W E AE +
Sbjct: 142 YRKLYDTLIAELDGIDNGVPMFEGE--PKYTINTHLSARVSHFNPAWNEAADDAEDVAKR 199
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
F++ G EF+D V +Y W PAR+IV + + R + SGEI+ L+ FCPWK HL+
Sbjct: 200 FEKAKAYVGAEFIDKVLYYATRWWPARAIVEKAVRNRLEVHASGEILELENFCPWKEHLY 259
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
ELE E I L KYV+Y +R WRV V + P F RK L +WRG RDD+L + +G
Sbjct: 260 ELEGEHGIAGLPKYVIY-CNRPNDWRVICVPLEPASFVCRKFLARKWRGERDDKLEEISG 318
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
I G F H +GFIGGN++ GAL MA +L+
Sbjct: 319 IEGANFCHQTGFIGGNRTREGALRMAVVSLE 349
>gi|402085401|gb|EJT80299.1| hypothetical protein GGTG_00301 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 380
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 159/369 (43%), Positives = 215/369 (58%), Gaps = 36/369 (9%)
Query: 7 SSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDD 65
S++ T++ P +GTHNG FH DEAL +M+R + A +VR+RDP VL
Sbjct: 15 SAAKRLKTAAADGGPDVAIGTHNGHFHADEALAVYMLRQHIPTYAGAALVRTRDPAVLSG 74
Query: 66 LDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDE 125
V+DVGG YDP +DHHQ+ F F G TKLSSAGLVY HFG+++IA+ L
Sbjct: 75 CHTVVDVGGEYDPDRLRFDHHQRTFNTTFP-GRQTKLSSAGLVYMHFGRQIIARRL---- 129
Query: 126 GHPD------VHRLFLAVYKNFMEAIDAIDNGINQYD------TDKPPRY-VNNTNLSSR 172
G PD V L+ +Y++F+EA+DA DNGI+ YD R+ L +
Sbjct: 130 GDPDSDNEATVDMLYAKLYESFVEALDAHDNGIHSYDPALLAAAGAEKRFSTGGFTLGAV 189
Query: 173 VGKLNLDWTEP-----DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECI 227
VG+LN +W +P ++ E++ F+ G EF + +Y RSWLPARS+V +
Sbjct: 190 VGRLNPNWNDPVPEDPAEAQAAEDKRFEAASARIGMEFDLDLDYYTRSWLPARSVVADAF 249
Query: 228 AERYDYDP----SGEIMVLK-RFCPWKLHLFELEEEMKI-----EPLIKYVLYEDDRG-- 275
A R ++D G I+V K + PWK HL+ LE E + + + YVLY +
Sbjct: 250 AHRDEHDAPGRGGGRILVFKGQSVPWKDHLYNLEAEAEAKGAAGDNKVVYVLYPEKPAPD 309
Query: 276 KQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGA 335
+WRVQAV S D F+SR PLP WRG RD+EL K GIPGCVFVH +GFIGGN+++ GA
Sbjct: 310 AKWRVQAVPESKDSFQSRLPLPEAWRGFRDEELDKITGIPGCVFVHAAGFIGGNKTFEGA 369
Query: 336 LAMARAALK 344
LAMA+ AL+
Sbjct: 370 LAMAKKALE 378
>gi|452984901|gb|EME84658.1| hypothetical protein MYCFIDRAFT_163456 [Pseudocercospora fijiensis
CIRAD86]
Length = 367
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 149/361 (41%), Positives = 212/361 (58%), Gaps = 37/361 (10%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGG 74
S+ S P+ +GTH+G FH DEAL ++RL ++ NA I R+RDP+VL + + V+DVGG
Sbjct: 14 SNGSAAPV--IGTHSGHFHADEALAVHLLRLLPEYQNATITRTRDPEVLKNCNIVVDVGG 71
Query: 75 VYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
V+D S YDHHQ+ F F +TKLSSAGLV+ H+GK +I+ + +PDV L+
Sbjct: 72 VHDDSAFRYDHHQREFNATFPRK-NTKLSSAGLVWMHYGKRIISHLTSAGIDNPDVDLLY 130
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNT-----------NLSSRVGKLN------ 177
+Y++F+EA DA DNGI+ YD P+ + N +++S V + N
Sbjct: 131 QKLYEDFIEAFDANDNGISVYD----PQELRNAGIEEKFSDKGFSIASVVSRYNHMPITP 186
Query: 178 --------LDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAE 229
L T + E+ F + G++F + + SWLPAR+IV +
Sbjct: 187 AITNKLSALMTTSKAKDQTEEDARFVRASSFVGEQFELEISDKITSWLPARAIVKKAFES 246
Query: 230 RYDYDPSGEIMVLK---RFCPWKLHLFELEEEMKIEPLIKYVLYEDD--RGKQWRVQAVA 284
R DP G I+V+ PW HL+ LEEE + + Y L+ ++ +WR++AV+
Sbjct: 247 RQSVDPEGRIIVIPYSPEGVPWSDHLYALEEETGSQGNVLYALFAENGEENSKWRIRAVS 306
Query: 285 VSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+ P FESRK LP WRG+RD+ELSK +GIPGC+FVH SGFIGGNQ+Y GAL MA+ A+K
Sbjct: 307 LEPGSFESRKGLPEDWRGVRDEELSKLSGIPGCIFVHASGFIGGNQTYEGALEMAKKAVK 366
Query: 345 L 345
L
Sbjct: 367 L 367
>gi|51849613|dbj|BAD42335.1| GAMM1 protein-like protein [Nannochloris bacillaris]
Length = 225
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 125/225 (55%), Positives = 160/225 (71%), Gaps = 1/225 (0%)
Query: 121 LNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW 180
+++D P+V ++L +YKNFMEAIDAIDNG+NQ+D D PP+Y+NNT+LS+RVG LN W
Sbjct: 1 MSLDPTAPEVDVVYLTLYKNFMEAIDAIDNGVNQWDGDAPPKYLNNTHLSARVGNLNPSW 60
Query: 181 TEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
E D S FQ + L G EF D + + +SWLPAR++V+ + D DPSGEIM
Sbjct: 61 NE-DSSDATLAAGFQAAVALTGSEFSDALGYLAKSWLPARALVLADLQACKDIDPSGEIM 119
Query: 241 VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
L +CPWK HL +LE+E+ I +KYVLYED+R K+WR+QAV +P F+SRK LP W
Sbjct: 120 KLNSYCPWKEHLHQLEKELGISGQVKYVLYEDEREKKWRIQAVGTAPGSFDSRKALPVPW 179
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
RGLRDD+LS GIPG VFVH SGFIGGN +Y GAL MA+ AL +
Sbjct: 180 RGLRDDDLSGVTGIPGGVFVHASGFIGGNATYEGALEMAKKALTM 224
>gi|164426565|ref|XP_955852.2| MYG1 protein [Neurospora crassa OR74A]
gi|157071388|gb|EAA26616.2| MYG1 protein [Neurospora crassa OR74A]
Length = 366
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 210/340 (61%), Gaps = 22/340 (6%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+GTHNG FH DEAL M+R K+ A++VR+RDP +L+ D V+DVGG Y+P+ D Y
Sbjct: 29 IGTHNGHFHADEALAVHMLRQHVPKYTGARLVRTRDPALLETCDIVVDVGGEYEPARDRY 88
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKNF 141
DHHQ+ F F TKLSSAGLVY HFGK +I++ L E + V ++ +Y +F
Sbjct: 89 DHHQRTFTTTFPD-HQTKLSSAGLVYMHFGKGIISRSLPDAPAEDNDKVGLIWNKLYDSF 147
Query: 142 MEAIDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDW-----TEPDQSAER 189
+EA+DA DNGI+ YD+D R+ N L + VG+ N +W ++P +
Sbjct: 148 VEALDAHDNGISVYDSDALKAAGIEKRFSNGGFTLGAMVGRFNGNWNDTAPSDPVAAQAE 207
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPW 248
E+ F+Q G+EF + +Y ++WLPAR +V R +YD G I+VLK + PW
Sbjct: 208 EDGRFEQASQRIGEEFDRALDYYTKAWLPARDVVETAYKARKEYDAEGRILVLKGQSAPW 267
Query: 249 KLHLFELEEEM--KIEPLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
K HL+ LEEE +++ ++ YVLY + +WRVQ V + D F+SRKPLP WRG R
Sbjct: 268 KDHLYTLEEEEGERVQKVL-YVLYPEKPTPDAKWRVQCVPETGDSFQSRKPLPEAWRGFR 326
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
D++L + I G VFVH +GFIGGN+++ GALAMA+ AL+
Sbjct: 327 DEKLDEITQIQGGVFVHAAGFIGGNKTFDGALAMAKKALE 366
>gi|336466463|gb|EGO54628.1| MYG1 protein [Neurospora tetrasperma FGSC 2508]
gi|350286672|gb|EGZ67919.1| MYG1 protein [Neurospora tetrasperma FGSC 2509]
Length = 355
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 209/340 (61%), Gaps = 22/340 (6%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+GTHNG FH DEAL M+R K+ A++VR+RDP +L+ D V+DVGG Y+P+ D Y
Sbjct: 18 IGTHNGHFHADEALAVHMLRQHVPKYTGARLVRTRDPALLETCDIVVDVGGEYEPARDRY 77
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKNF 141
DHHQ+ F F TKLSSAGLVY HFGK +I++ L E + V ++ +Y +F
Sbjct: 78 DHHQRTFTTTFPD-HQTKLSSAGLVYMHFGKGIISRSLPDAPAEDNDKVGLIWNKLYDSF 136
Query: 142 MEAIDAIDNGINQYDTDK------PPRYVNNT-NLSSRVGKLNLDWTEPDQS-----AER 189
+EA+DA DNGI+ YD+D R+ N L + VG+ N +W +P S
Sbjct: 137 VEALDAHDNGISVYDSDALKAAGIEKRFSNGGFTLGAMVGRFNGNWNDPVPSDPVAAQAE 196
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPW 248
E+ F+Q G+EF + +Y ++WLPAR +V R +YD G I+VLK + PW
Sbjct: 197 EDGRFEQASQRIGEEFDRALDYYTKAWLPARDVVETAYKARKEYDAEGRILVLKGQSAPW 256
Query: 249 KLHLFELEEEM--KIEPLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
K HL+ LEEE +++ ++ YVLY + +WRVQ V + D F+SRKPLP WRG R
Sbjct: 257 KDHLYTLEEEEGERVQKVL-YVLYPEKPTPDAKWRVQCVPETGDSFQSRKPLPEAWRGFR 315
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
D++L + I G +FVH +GFIGGN+++ GALAMA+ AL+
Sbjct: 316 DEKLDEITQIQGGIFVHAAGFIGGNKTFDGALAMAKKALE 355
>gi|363753758|ref|XP_003647095.1| hypothetical protein Ecym_5538 [Eremothecium cymbalariae
DBVPG#7215]
gi|356890731|gb|AET40278.1| hypothetical protein Ecym_5538 [Eremothecium cymbalariae
DBVPG#7215]
Length = 334
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 147/331 (44%), Positives = 207/331 (62%), Gaps = 12/331 (3%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
P+ ++ TH+GSFH DEAL +M+RL + +A +VRSR+P+ ++ D V+DV G YD
Sbjct: 8 APINKICTHSGSFHADEALAVYMLRLLPEAKDAVVVRSREPEKWEEADIVVDVSGKYD-G 66
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNV-DEGHPDVHRLFLAVY 138
+DHHQ+ F E F F TKLSSAGL+YKHFG+E+I V E D+ L+ VY
Sbjct: 67 IKYFDHHQREFSETFNDVFKTKLSSAGLIYKHFGQEIIRIICPVLSEDSYDI--LYNKVY 124
Query: 139 KNFMEAIDAIDNGINQYDTDK---PPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAF 194
K F+E++DA DNGIN +D ++ R+ + N + + + ++N W E D S + +E F
Sbjct: 125 KEFIESLDANDNGINNFDAEELGVTRRFSDKNITIPAIISRMNPSWNE-DCSPGKFDEQF 183
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSG--EIMVLKRFCPWKLHL 252
+ G+ F++ V Y +SWLPA+ IV + R D SG I+VL +FCPWK HL
Sbjct: 184 FKASKFIGECFVNLVESYGKSWLPAKDIVRRAVLNRDTADKSGGSSIIVLDQFCPWKEHL 243
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
+E+E+E+ IE I +VL+ED G WRV V VS F+ R+ L RGLRD+ELSK+A
Sbjct: 244 YEVEKELNIENTILFVLFEDSSG-SWRVSTVPVSSTSFKFRQGLLEPLRGLRDEELSKKA 302
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+ GC+FVH SGFIGG +S AL +A +L
Sbjct: 303 QVEGCIFVHASGFIGGAKSKEAALQLAYMSL 333
>gi|396461687|ref|XP_003835455.1| similar to MYG1 protein [Leptosphaeria maculans JN3]
gi|312212006|emb|CBX92090.1| similar to MYG1 protein [Leptosphaeria maculans JN3]
Length = 354
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 149/357 (41%), Positives = 210/357 (58%), Gaps = 28/357 (7%)
Query: 13 STSSPSQ---TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAV 69
+T +P++ T +GTHNG FH DE+L M+RL + ++ ++R+RDP VL V
Sbjct: 2 ATENPAKKLKTDAPVIGTHNGHFHADESLAVAMLRLLPTYLDSSLIRTRDPAVLSTCHTV 61
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
+DVGG YD S YDHHQ+GFE VF G +TKLSSAGLVY HFGK++I + D
Sbjct: 62 VDVGGEYDDSTKRYDHHQRGFETVFP-GHNTKLSSAGLVYMHFGKDIITTVTGLQGEDRD 120
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYD---------TDK-------PPRYVNNTNLSSRV 173
+ L+ +Y +F+EA DA DNG+N ++ T K VN N R
Sbjct: 121 I--LYEKIYADFIEAFDANDNGVNVFEPKDLESAGLTKKFENRGFSIASVVNRYNYGPRS 178
Query: 174 GKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDY 233
+ T + AE E+ F + G++FL + SWLPAR V + R Y
Sbjct: 179 ATSDEAKTAEAKQAE-EDMRFLKASQFVGEQFLIELTDRASSWLPARHQVKQAYDARLQY 237
Query: 234 DPSGEIMVLKRFCPWKLHLFELEEEMK----IEPLIKYVLYEDDRGK-QWRVQAVAVSPD 288
DP G I+VL PW HL+ LE+E + + P + YVL+ +D+ + +WR++AV+
Sbjct: 238 DPQGRILVLPEGMPWADHLYTLEKESQLPPGVSPHVLYVLFPEDQPEGKWRIRAVSKENS 297
Query: 289 RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
FE+RK LP W+G+RD++L + +GIPGCVFVH +GFIGGN+S+ GALAMA+ AL+L
Sbjct: 298 GFENRKDLPDAWKGVRDEQLDQVSGIPGCVFVHAAGFIGGNKSFEGALAMAKKALEL 354
>gi|294875958|ref|XP_002767483.1| Protein MYG1, putative [Perkinsus marinus ATCC 50983]
gi|239869143|gb|EER00201.1| Protein MYG1, putative [Perkinsus marinus ATCC 50983]
Length = 351
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 145/349 (41%), Positives = 212/349 (60%), Gaps = 31/349 (8%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFF----NAQIVRSRDPKVLDDLDA 68
S + PS P+ + TH+G FHCDE LG M+ DK N +VR+R+P+V+ D
Sbjct: 3 SATLPSLRPVT-IATHSGKFHCDEVLGTVML---DKILGGSKNYNLVRTRNPEVISKADI 58
Query: 69 VLDVGGVYDPSNDCYDHHQKGFEEVFG-HGFSTKLSSAGLVYKHFGKELIAKELNV-DEG 126
V+DVG +DP +DHHQ+ F+E F ++LSSAGLVYK+FG+E++ NV D+
Sbjct: 59 VIDVGAEFDPPRRRFDHHQRTFDEKFDDENRVSRLSSAGLVYKYFGREMLRNVYNVTDDR 118
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQS 186
D+ L+ +Y +F+E++DAIDNG+ D+ P+Y T+L+SRV + N W EP+ +
Sbjct: 119 KLDI--LYKKLYNDFIESVDAIDNGVPI--ADEAPKYRVFTDLASRVSRKNPRWNEPEVT 174
Query: 187 AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC 246
+E E E F+Q M + +E + + S++PAR IV E I++R++ PSG ++ L R C
Sbjct: 175 SEMEEERFRQAMAICEEELSAQIETILSSFIPAREIVEEAISKRFEVHPSGRVIHLARGC 234
Query: 247 PWKLHLFELEEEMKI----------------EPLIKYVLYEDDRGKQWRVQAVAVSPDRF 290
P+ H+FE+E E + I YV+Y D G +RVQAV V F
Sbjct: 235 PFAEHIFEIEREKGLATEASKNGDATKRKADTSSILYVIYSDATG-GYRVQAVGVEGHNF 293
Query: 291 ESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMA 339
SRKPLP+++RG+RD++LSK AGI G +FVH SGFIGG ++Y A +A
Sbjct: 294 LSRKPLPSRFRGVRDEDLSKLAGIDGLIFVHASGFIGGAKTYESAKKLA 342
>gi|169603672|ref|XP_001795257.1| hypothetical protein SNOG_04844 [Phaeosphaeria nodorum SN15]
gi|160706433|gb|EAT87235.2| hypothetical protein SNOG_04844 [Phaeosphaeria nodorum SN15]
Length = 354
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 150/359 (41%), Positives = 207/359 (57%), Gaps = 32/359 (8%)
Query: 13 STSSPS-----QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD 67
+T SPS + P+ +GTHNG FH DE+L M+RL + ++Q+VR+RDP+VL
Sbjct: 2 ATESPSKKLKMEAPV--IGTHNGHFHADESLAVAMLRLLPTYLDSQLVRTRDPEVLAKCH 59
Query: 68 AVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH 127
V+DVGG YD + YDHHQ+GFE VF G +TKLSSAGLVY HFGK++I +
Sbjct: 60 TVVDVGGEYDDNAKRYDHHQRGFETVFP-GHNTKLSSAGLVYLHFGKDIITSVTGLQGQD 118
Query: 128 PDVHRLFLAVYKNFMEAIDAIDNGIN---QYDTDKP-------------PRYVNNTNLSS 171
D+ LF +Y +F+EA DA DNG+N D +K VN N
Sbjct: 119 RDI--LFEKIYADFIEAFDANDNGVNVIPAKDLEKAGLAKQFEDRGFSIASVVNRYNYGP 176
Query: 172 RVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERY 231
R D P+ E+ F + G++FL + SWLPAR V + R
Sbjct: 177 RAANAE-DTKTPEAKQAEEDVRFLKASQFVGEQFLIELTDRANSWLPARHSVKQAYDARL 235
Query: 232 DYDPSGEIMVLKRFCPWKLHLFELEEEMKI----EPLIKYVLYEDDRGK-QWRVQAVAVS 286
YDP G I+VL PW HL+ LE+E I P + YVL+ +D+ + +WR++AV+
Sbjct: 236 QYDPQGRILVLPEGMPWADHLYNLEKESPIPEGVAPQVLYVLFPEDKPEGKWRIRAVSKE 295
Query: 287 PDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
F +RK LP W+G+RD++L + +G+ GCVFVH +GFIGGN+S+ GALAMA+ AL+L
Sbjct: 296 NGGFVNRKDLPDAWKGVRDEQLDQISGVQGCVFVHAAGFIGGNKSFDGALAMAKKALEL 354
>gi|86171634|ref|XP_966250.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
gi|46361219|emb|CAG25080.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
Length = 372
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 133/334 (39%), Positives = 202/334 (60%), Gaps = 9/334 (2%)
Query: 12 YSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLD 71
YS+ T K +GTH+G FH DE L M++ ++ +A+I+R+RD LD D V+D
Sbjct: 41 YSSFFLYSTMNKVIGTHSGRFHTDEILATVMLKFLPEYKDAKIIRTRDQTKLDTCDIVVD 100
Query: 72 VGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNV-DEGHPDV 130
VGGVYD N YDHHQK FE + +LSSAGL+YKH+GKE++ K ++ DE +V
Sbjct: 101 VGGVYDHENKRYDHHQKEFEGTLDDKHTIRLSSAGLIYKHYGKEVLRKGFSITDEEKINV 160
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE 190
L+ +Y +F+E++DA+DNGINQY+ P+Y NT + RV + N + E D E
Sbjct: 161 --LYEKLYTSFIESVDALDNGINQYEGQ--PKYQINTTIQCRVNRFNPTFLEDDVD---E 213
Query: 191 NEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKL 250
NE F + + +EF+ V +Y W A+SIV E I +R+++ SG ++ L+++CP+
Sbjct: 214 NERFMEAAKIVKQEFVHFVTYYSDVWYMAKSIVRESILDRFNFHKSGRVIYLQKYCPYTE 273
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSK 310
HL++LE+E+ I+ I Y +Y DR +R A++ + F R P PA +RGL+D++L
Sbjct: 274 HLYDLEQELNIQDEILYCIY-SDRYNNFRCTAISKKNEPFVLRLPFPASFRGLKDEQLQT 332
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+ IPG FVH SGF ++ + + A+LK
Sbjct: 333 VSKIPGLTFVHYSGFTSAGENIESLVKLVEASLK 366
>gi|118782861|ref|XP_312544.3| AGAP002408-PA [Anopheles gambiae str. PEST]
gi|116129768|gb|EAA08054.3| AGAP002408-PA [Anopheles gambiae str. PEST]
Length = 358
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 196/331 (59%), Gaps = 14/331 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G FHCDE L CFM++ ++ +A+I+R+RD LD+ D V+DVG +D + Y
Sbjct: 25 KIGTHDGIFHCDEVLACFMLQQLPQYASAEIIRTRDTSKLDECDIVVDVGATFDRARHRY 84
Query: 84 DHHQKGFEEVFGH-------GFSTKLSSAGLVYKHFGKELIAK----ELNVDEGHPDVHR 132
DHHQ F + + +LSSAGLVY +FG+E+I + LN++ +
Sbjct: 85 DHHQASFNDTLRSLRPELNVKWDIRLSSAGLVYTYFGEEVIKRVLKQTLNLEPSAECLRA 144
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
++ VY + IDAIDNG+ ++ + PRY +T+LS RVG N W EP A E
Sbjct: 145 VYTKVYDGLISEIDAIDNGVPMFEGGE-PRYNISTHLSGRVGAFNSRWNEP-TPAPGCLE 202
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
F++ G EF+D V +Y W PAR IV + +A R SGEI+ L++ CPWK HL
Sbjct: 203 RFEKAKAYVGLEFVDKVTYYASCWWPARDIVSKGLANRMALHESGEILELEQPCPWKEHL 262
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
++LE+E + KYV+Y ++ WRV V + P F RK L WRG+RD+EL K +
Sbjct: 263 YQLEQEQNLVGQAKYVIY-CNKENDWRVICVPLQPASFVCRKFLAKSWRGVRDEELEKVS 321
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
GI G F H +GFIGGN++ GAL MA A+L
Sbjct: 322 GIAGSNFCHQTGFIGGNKTREGALKMAIASL 352
>gi|378727570|gb|EHY54029.1| urease accessory protein [Exophiala dermatitidis NIH/UT8656]
Length = 369
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 206/349 (59%), Gaps = 25/349 (7%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
+T + +GTHNG FH DEAL +M+RL ++ +A ++R+RDP +LD V+DVGG YD
Sbjct: 20 KTAPRLIGTHNGHFHADEALAVYMLRLLPEYASASLIRTRDPGLLDTCHTVVDVGGEYDV 79
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
S + YDHHQ+ F+ F TKLSSAGLVY HFG+ +IA+ + HPDV L+ +Y
Sbjct: 80 SLNRYDHHQRTFDTAFP-DHKTKLSSAGLVYMHFGEAIIAQHTGLPIDHPDVDLLYRKLY 138
Query: 139 KNFMEAIDAIDNGINQYD------TDKPPRYVN-NTNLSSRVGKLN----LDWTEPDQSA 187
+F+EAIDA DNGI++YD R+ + L S V +N L P ++
Sbjct: 139 DDFVEAIDANDNGISKYDDALLEKAGIEKRFKDGGITLPSLVNDMNHEDPLALGTPSRNT 198
Query: 188 ERENEA-----FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL 242
E E +A F Q L GK FL + +WLPAR+IV E A R PSG+++VL
Sbjct: 199 EEEPQAEEDYRFSQASALMGKAFLRKLHGAATAWLPARAIVKEAFAARESAHPSGQLLVL 258
Query: 243 KRF-CPWKLHLFELEEE--MKIEPLIKYVLY--EDDRGKQWRVQAVAVSPDRFESRKPLP 297
R PWK HL+ +EEE + E I YV+Y +++ G +WR+QAV+ FE+RK LP
Sbjct: 259 PRAGIPWKEHLYNIEEEAGLPAEKKILYVIYPEKEEPGSKWRIQAVSKDLSSFENRKSLP 318
Query: 298 AQWRGLRDDELSKEAG---IPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRG+RD EL G G VFVH SGFIGG+++ G +MA AL
Sbjct: 319 ESWRGVRDAELDALLGDNVEDGAVFVHASGFIGGHKTEAGVRSMAALAL 367
>gi|307196182|gb|EFN77839.1| UPF0160 protein MYG1 [Harpegnathos saltator]
Length = 333
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 192/329 (58%), Gaps = 14/329 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG+FHCDE C +++L ++ +A IVRSRD +LD D V+DVGGVYD Y
Sbjct: 6 KIGTHNGTFHCDEVFACVLLKLLPQYKDAIIVRSRDKNILDKCDIVVDVGGVYDHYIRRY 65
Query: 84 DHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
DHH + F E + KLSSAGLVY HFG E++ + + + F
Sbjct: 66 DHHMRDFCETAKSVLKKSNYNNKVKLSSAGLVYCHFGHEILRNLCPDIQEDKTIEKFFKR 125
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y + +DAIDNG Q ++D P Y NT+LSSRV LN W S E E F++
Sbjct: 126 IYDTLIVEVDAIDNG--QNESDCQPLYRINTDLSSRVKNLNPFWN----SNMDEEEQFKK 179
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
M L F+D+V + + WLPA IV + R++ D SGEI+ L + PW+ +LF +E
Sbjct: 180 AMTLVHSVFMDSVSYTEKVWLPAEQIVYNAVNRRFEVDSSGEIIELSQRVPWQSYLFHME 239
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
EM I P IKY+++ R+Q V +S +F+ R P P +W GLR+D L K I G
Sbjct: 240 REMNISPPIKYIIFFSSDNDH-RIQCVPISAGQFKCRLPFPKKWCGLRNDALVKACQIEG 298
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALKL 345
FVH++GFIGG+ + GA+AMA+ +LK+
Sbjct: 299 ADFVHVNGFIGGHATRDGAVAMAQKSLKI 327
>gi|67527855|ref|XP_661782.1| hypothetical protein AN4178.2 [Aspergillus nidulans FGSC A4]
gi|40740087|gb|EAA59277.1| hypothetical protein AN4178.2 [Aspergillus nidulans FGSC A4]
gi|259481224|tpe|CBF74548.1| TPA: UPF0160 domain protein MYG1, putative (AFU_orthologue;
AFUA_1G08980) [Aspergillus nidulans FGSC A4]
Length = 359
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 153/368 (41%), Positives = 208/368 (56%), Gaps = 33/368 (8%)
Query: 1 MAGTGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDP 60
MA VS P SP+ +GTHNG FH DEAL +++RL + + ++R+RDP
Sbjct: 1 MATESVSKKP--RIDSPT------IGTHNGHFHADEALAVYLLRLLPAYSQSPLIRTRDP 52
Query: 61 KVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKE 120
+L V+DVGG YD + + YDHHQ+ F F G+ TKLSSAGLVY HFG+ +IA
Sbjct: 53 ALLQTCHTVVDVGGEYDAATNRYDHHQRTFNSTFP-GYKTKLSSAGLVYLHFGQSIIANH 111
Query: 121 LNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDK------PPRYVN-NTNLSSRV 173
++ H +V +F +Y +F+EAIDA DNGI YD +K R+ + ++S V
Sbjct: 112 ASLPADHSNVSLIFNKLYADFIEAIDANDNGIGVYDPEKLSEAGIEKRFKDGGVTIASIV 171
Query: 174 GKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDY 233
+N E + + E+ F Q G F +R V SWLPAR V +R +
Sbjct: 172 NDMNTPSPEDEAAGIDEDGLFNQASQFVGSVFSRKLRHAVNSWLPARETVQAAYTDRKNV 231
Query: 234 DPSGEIMVLKR-FCPWKLHLFELEEEMKIEPL--------IKYVLYEDD--RGKQWRVQA 282
SG+IMVL + PWK HL+ E+E K + + YVLY + G +WRVQA
Sbjct: 232 HASGKIMVLPQGGVPWKEHLYNFEKEAKAANINGNAEDAEVVYVLYPESAAEGSKWRVQA 291
Query: 283 VAVSPDRFESRKPLPAQWRGLRDDEL-----SKEAGIP-GCVFVHMSGFIGGNQSYGGAL 336
V+V+ F SRKPLP +WRG+RD +L + AGIP G VFVH SGFIGG+++ GA
Sbjct: 292 VSVNEGSFVSRKPLPEKWRGVRDADLDGVLAGEGAGIPEGAVFVHASGFIGGHKTREGAF 351
Query: 337 AMARAALK 344
AMA AL+
Sbjct: 352 AMAAKALE 359
>gi|346326287|gb|EGX95883.1| MYG1 protein [Cordyceps militaris CM01]
Length = 378
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 197/339 (58%), Gaps = 22/339 (6%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL M+R + N+ +VR+RDP VL V+DVGG YD + YD
Sbjct: 40 IGTHNGHFHADEALAVHMLRRLPAYANSTLVRTRDPAVLATCHTVVDVGGEYDAARHRYD 99
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL-----NVDEGHPDVHRLFLAVYK 139
HHQ+GF F TKLSSAGLV+ HFG+ +IA+ L + E V L +Y+
Sbjct: 100 HHQRGFATTFPDR-PTKLSSAGLVFLHFGRAIIAQRLAGAGASATEDDAQVRLLHAKIYQ 158
Query: 140 NFMEAIDAIDNGINQYD------TDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSA----- 187
+F+EA+DA DNGI YD R+ + L + VG+LN W EP +
Sbjct: 159 SFVEALDAHDNGIAVYDPAAVAAAGLAKRFSDGGFGLGAMVGRLNPSWNEPAAADPAEAQ 218
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFC 246
RE+ F G+EF V +Y +WLPAR++V + A R ++DP G ++VL +
Sbjct: 219 RREDARFLTASQRIGEEFDREVDYYASAWLPARAVVQDAFARRAEHDPDGRVLVLDGQSV 278
Query: 247 PWKLHLFELEEEMKIEPLIKYVLYED--DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
PWK HL+ LE++ ++ YVLY++ + G +WR+Q V S D F SRKPLP WRG R
Sbjct: 279 PWKDHLYTLEQDEGRSSVL-YVLYQEKPEPGAKWRIQCVPESKDSFVSRKPLPEAWRGFR 337
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
D EL AG+ GCVFVH +GFIGGN ++ GA M AL
Sbjct: 338 DAELDAIAGVEGCVFVHAAGFIGGNATFDGAKVMVAKAL 376
>gi|256071071|ref|XP_002571865.1| metal dependent hydrolase-related [Schistosoma mansoni]
Length = 372
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 135/333 (40%), Positives = 195/333 (58%), Gaps = 27/333 (8%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTH+G FH DE L C M++ ++ NA+I+R+RD +L D V+DVGGV++P N Y
Sbjct: 43 RIGTHDGRFHADEILACAMLKHLPEYSNAEIIRTRDSSILSTCDIVVDVGGVFNPENHLY 102
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH--PDVHRLFLA 136
DHHQ+ F + + KLSSAGL+Y HFG+++++ L +DE P V LF
Sbjct: 103 DHHQREFNLTYKDFYPNSDWDIKLSSAGLIYVHFGRKILSCILGIDENTMDPLVTALFDK 162
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y +F+ IDAIDNG+ T P RY NT+LSSRV ++N W + D E F
Sbjct: 163 MYSSFIVEIDAIDNGVPMATT--PLRYSMNTSLSSRVNRMNPAWNQLDTD---ETVCFHN 217
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHLFEL 255
+ L KEF V FY +W PAR IV+ + RY D SG I+ ++ CPW H FE+
Sbjct: 218 ALQLVDKEFTTLVHFYADTWYPAREIVLNAVKNRYSVDSSGSIIYIEGTGCPWSTHFFEI 277
Query: 256 EEEM-----------KIEPLIKYVLYEDDRGKQWRVQAVAVSP-DRFESRKPLPAQWRGL 303
E+ + K +PL+ + +Y+ + W +QA+ ++ + F R PLP WRGL
Sbjct: 278 EKSLLLNNKNINEIEKNDPLL-FAIYQ-RKDSTWTIQAIPLNEHNNFSQRLPLPESWRGL 335
Query: 304 RDDELSKEAGIPGCVFVHMSGFIGGNQSYGGAL 336
RD++LS G+P CVFVH +GF+G +++ G L
Sbjct: 336 RDEQLSNIVGLPDCVFVHSTGFLGVHKTRDGVL 368
>gi|344230127|gb|EGV62012.1| metal-dependent protein hydrolase [Candida tenuis ATCC 10573]
Length = 326
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 135/330 (40%), Positives = 209/330 (63%), Gaps = 12/330 (3%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++ TH+GSFH DE+L +M++ K+ ++RSRDPK ++ + V+DV G YD
Sbjct: 1 MSKICTHSGSFHADESLAVYMLQQLPKYSQYDLIRSRDPKDWEESEIVIDVSGKYD-GEK 59
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD---VHRLFLAVY 138
+DHHQ+ F E F GFSTKLSSAGL+YKHFG+++I +LN+ + D + ++ +Y
Sbjct: 60 YFDHHQREFFETFP-GFSTKLSSAGLIYKHFGQDIIKHKLNLTDSVQDSEIIKGIWEKIY 118
Query: 139 KNFMEAIDAIDNGINQYDTDKP----PRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEA 193
K F+E+IDA DNGI++YD P++V+ + L S + LN W D + E ++
Sbjct: 119 KEFIESIDANDNGISKYDESATASLEPKFVDRSLMLPSIIANLNPQWYN-DPTPEDFDKQ 177
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
F + L G+ F + + ++ SW+ +++IV + I R+D D SG I+ L+++C WK HL+
Sbjct: 178 FLKSSALMGQVFENVLEYHGVSWVKSKAIVEDAIKGRFDVDKSGAIIKLEKYCQWKTHLY 237
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
E+E+ IE IK+VLY+D WR+ V+V+ FE R + +WRG+RD+ELSK G
Sbjct: 238 NTEKELGIEEAIKFVLYKDS-SNSWRISTVSVNSGSFEFRLGIKEKWRGIRDEELSKMVG 296
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
I +FVH +GFIGG +S+ AL +AR +L
Sbjct: 297 IEDGIFVHANGFIGGAKSFESALKIARESL 326
>gi|195111550|ref|XP_002000341.1| GI22581 [Drosophila mojavensis]
gi|193916935|gb|EDW15802.1| GI22581 [Drosophila mojavensis]
Length = 349
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 144/350 (41%), Positives = 207/350 (59%), Gaps = 21/350 (6%)
Query: 8 SSPAYSTSSPSQ--TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD- 64
SS A SP + TPL +GTHNG+FHCDE + CFM++ ++ NA+I RSRD + L
Sbjct: 2 SSAANGLPSPPKRPTPLT-IGTHNGTFHCDEVVACFMLKQLPEYENAEIFRSRDDEALRA 60
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELI 117
D ++DVGGV+D YDHHQ F+E F G F +LSSAGLVY +G+ +I
Sbjct: 61 KCDIIVDVGGVFDHEKKWYDHHQLTFKETFSTVHPELGDEFDIRLSSAGLVYSFYGERVI 120
Query: 118 A----KELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRV 173
+E N+ ++ FL +Y+NF+ +DAIDNG+ ++ +P RY +T+LS+R+
Sbjct: 121 QSILQRERNIQLSEANMKLAFLQIYRNFICELDAIDNGVPMFEGGEP-RYKISTHLSARI 179
Query: 174 GKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDY 233
GKLN W + + ++ F M +AGKEF++ V SW+ AR V + +
Sbjct: 180 GKLNPSWQDMNLDTDQR---FHMAMSVAGKEFVENVLEVACSWIAARDHVRLALEKAASI 236
Query: 234 DPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESR 293
S +I++L+ FCPWK+HL LE+E ++ + K V++ D G WRV V V+P + R
Sbjct: 237 HESRQILLLETFCPWKVHLDSLEKEYDVKGVPKLVIFND--GNSWRVAGVPVTPTSYVGR 294
Query: 294 KPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
K LP WRGLRD EL + A I FVH +GFIGG ++ ALAMA ++
Sbjct: 295 KFLPKPWRGLRDKELCQIAEIEDLTFVHHTGFIGGAKTKEAALAMALKSI 344
>gi|295674501|ref|XP_002797796.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226280446|gb|EEH36012.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 369
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 202/347 (58%), Gaps = 31/347 (8%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + ++ ++R+RDP +L ++DVGG YDP+ + YD
Sbjct: 26 IGTHNGHFHADEALAVYLLRLLPTYTSSPLIRTRDPSLLATCHTIVDVGGEYDPARNRYD 85
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F ST+LSSAGLVY HFGK +IA+ ++ HPDV L+ +Y +F+EA
Sbjct: 86 HHQRTFTTTFP-SHSTRLSSAGLVYLHFGKAIIAQHTSLPIDHPDVSTLYEKLYADFIEA 144
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRV--GKLNL-----DWTEPDQSA---ERENEAF 194
+DA DNGI+ YD P V L R G +NL D PD S+ + E+ F
Sbjct: 145 LDAHDNGISVYD----PSRVAAAGLEKRFRDGGINLGSLVGDLNGPDPSSTDPQDEDSLF 200
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLF 253
++ G FL +R SWLPAR+ V E R+D SG I+VL + PWK HL+
Sbjct: 201 EKASKFIGDVFLRKLRLASGSWLPARATVREAYESRFDTHESGRILVLPQPGVPWKEHLY 260
Query: 254 ELEEEMKI---------EPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPLPAQWRG 302
LEE+ E + YVLY + G +WRV V V+ FESR+PLP WRG
Sbjct: 261 TLEEKENAGMDGVADAEEGKVYYVLYPESTAEGARWRVHCVPVTDASFESRRPLPEAWRG 320
Query: 303 LRDDELS---KEAGIP-GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+RD++L K +P G +FVH SGF GG+ + GALAMA +L+L
Sbjct: 321 MRDEDLDGVLKGEEVPSGAIFVHASGFTGGHATREGALAMAVRSLRL 367
>gi|255939612|ref|XP_002560575.1| Pc16g02020 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211585198|emb|CAP92872.1| Pc16g02020 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 363
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 159/360 (44%), Positives = 214/360 (59%), Gaps = 35/360 (9%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVG 73
TSSPS +GTHNG FH DEAL +++RL + ++ ++R+RDP L+ V+DVG
Sbjct: 11 TSSPS------IGTHNGHFHADEALAVYLLRLLPTYASSPLIRTRDPAELEKCHTVVDVG 64
Query: 74 GVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
GVYDP+ YDHHQ+ F F +TKLSSAGLVY HFGK +IA++L++ H DV L
Sbjct: 65 GVYDPAIHRYDHHQRTFSTTFPQ-RATKLSSAGLVYMHFGKAIIAQKLSLPVEHADVDLL 123
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDK------PPRYVN-NTNLSSRVGKL-NLDWTEPDQ 185
+ +Y +F+EAIDA DNGI+ YD R+ N L+S VG + N D T P
Sbjct: 124 YEKLYTDFIEAIDANDNGISAYDQAALTAAGIEKRFKNGGITLASMVGDMNNPDPTSPPG 183
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR- 244
+ E+ F + L G F + SWLPAR+ V A R D PSG I+VL +
Sbjct: 184 EPQDEDSLFGRASTLIGNAFARKMHHACTSWLPARTTVGSAYASRKDVHPSGRIIVLPQG 243
Query: 245 FCPWKLHLFELEEEM----KIEPLIK--YVLYEDD--RGKQWRVQAVAVSPDRFESRKPL 296
PWK HL+ E E +I+P ++ YVLY ++ G +WRVQ V+VS FESRKPL
Sbjct: 244 GVPWKEHLYNFEAEASGTKEIDPTVQAYYVLYPENATEGAKWRVQCVSVSESSFESRKPL 303
Query: 297 PAQWRGLRDDEL----------SKEAGIP-GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
P WRG+RD +L + ++ IP G VFVH SGFIGG+++ GA+AMA +L+L
Sbjct: 304 PEAWRGVRDQDLDGVMAAEAEKNGQSKIPEGAVFVHASGFIGGHKTREGAMAMAERSLEL 363
>gi|323348852|gb|EGA83090.1| YER156C-like protein [Saccharomyces cerevisiae Lalvin QA23]
gi|365765936|gb|EHN07439.1| YER156C-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 305
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 132/307 (42%), Positives = 197/307 (64%), Gaps = 8/307 (2%)
Query: 42 MIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTK 101
M+RL +F +A++VRSR+PK + D ++DVG YD +DHHQ+GF E F + TK
Sbjct: 1 MLRLLPEFKDAKLVRSRNPKDWEASDILVDVGAQYD-GVXFFDHHQRGFFETFNEKYKTK 59
Query: 102 LSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQY----DT 157
LSSAGL++KH+G+++I LN D+ L+ VYK F+EA+DA DNGI++Y D+
Sbjct: 60 LSSAGLIFKHYGRDIIKTILNDKVSSSDLDLLYDKVYKQFVEALDANDNGISKYTIPKDS 119
Query: 158 DKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSW 216
+ P + +N ++ + +N +W E D S E + F + + G F+ V+ Y +SW
Sbjct: 120 NLEPNFRDNAISIPGIISGMNPNWNE-DTSDESFDRCFARASEFIGGVFVTLVKGYGQSW 178
Query: 217 LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGK 276
LPA+++V + I ER D D SG+I+VL +FCPWK HL+ELE E IE I++VL+ D G
Sbjct: 179 LPAKALVAQAIDERMDVDKSGKIIVLPQFCPWKEHLYELEREKNIEKQIEFVLFTDSSG- 237
Query: 277 QWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGAL 336
WRV V ++ F+ R+ LP RGLRD+ELS ++G+PGC+F+H +GFIGG +S
Sbjct: 238 AWRVSTVPINSTSFQFRRGLPEPLRGLRDEELSTKSGVPGCIFIHAAGFIGGAKSKEAVY 297
Query: 337 AMARAAL 343
+A+ +L
Sbjct: 298 ELAKMSL 304
>gi|323305302|gb|EGA59049.1| YER156C-like protein [Saccharomyces cerevisiae FostersB]
gi|323333813|gb|EGA75204.1| YER156C-like protein [Saccharomyces cerevisiae AWRI796]
gi|323355338|gb|EGA87163.1| YER156C-like protein [Saccharomyces cerevisiae VL3]
Length = 305
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 132/307 (42%), Positives = 197/307 (64%), Gaps = 8/307 (2%)
Query: 42 MIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTK 101
M+RL +F +A++VRSR+PK + D ++DVG YD +DHHQ+GF E F + TK
Sbjct: 1 MLRLLPEFKDAKLVRSRNPKDWEASDILVDVGAQYDGVK-FFDHHQRGFFETFNEKYKTK 59
Query: 102 LSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQY----DT 157
LSSAGL++KH+G+++I LN D+ L+ VYK F+EA+DA DNGI++Y D+
Sbjct: 60 LSSAGLIFKHYGRDIIKTILNDKVSSSDLDLLYDKVYKQFVEALDANDNGISKYTIPKDS 119
Query: 158 DKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSW 216
+ P + +N ++ + +N +W E D S E + F + + G F+ V+ Y +SW
Sbjct: 120 NLEPNFRDNAISIPGIISGMNPNWNE-DTSDESFDRCFARASEFIGGVFVTLVKGYGQSW 178
Query: 217 LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGK 276
LPA+++V + I ER D D SG+I+VL +FCPWK HL+ELE E IE I++VL+ D G
Sbjct: 179 LPAKALVAQAIDERMDVDKSGKIIVLPQFCPWKEHLYELEREKNIEKQIEFVLFTDSSG- 237
Query: 277 QWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGAL 336
WRV V ++ F+ R+ LP RGLRD+ELS ++G+PGC+F+H +GFIGG +S
Sbjct: 238 AWRVSTVPINSTSFQFRRGLPEPLRGLRDEELSTKSGVPGCIFIHAAGFIGGAKSKEAVY 297
Query: 337 AMARAAL 343
+A+ +L
Sbjct: 298 ELAKMSL 304
>gi|400597139|gb|EJP64874.1| Metal-dependent protein hydrolase [Beauveria bassiana ARSEF 2860]
Length = 379
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 148/363 (40%), Positives = 200/363 (55%), Gaps = 50/363 (13%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G FH DEAL M+R + +A +VR+RDP VL V+DVGG YD + YD
Sbjct: 21 IGTHSGHFHADEALAVHMLRQLPAYADASLVRTRDPAVLATCHTVVDVGGEYDAARHRYD 80
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNV--------------------- 123
HHQ+GF F G TKLSSAGLV+ HFG+ +IA+ L
Sbjct: 81 HHQRGFATTFP-GRPTKLSSAGLVFLHFGRAIIARRLAALENNRNNNNNNNNNNNAAAVV 139
Query: 124 ----DEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR------- 172
E P + L +Y++F+EA+DA DNGI YD P+ + L+ R
Sbjct: 140 IEPKPEDSPHIALLHDKIYESFVEALDAHDNGIAVYD----PQALAAAGLTKRFSDGGFG 195
Query: 173 ----VGKLNLDWTEP-----DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIV 223
VG+LN +W +P ++ RE+ F G+EF V +Y +WLPAR++V
Sbjct: 196 LGALVGRLNPNWNDPPAATAAEAQSREDARFAAASARIGEEFDREVDYYASAWLPARAVV 255
Query: 224 VECIAERYDYDPSGEIMVLK-RFCPWKLHLFELEEEMKIEPLIKYVLYED--DRGKQWRV 280
++R +D G ++VL+ + PWK HL+ LE+E ++ YVLY++ + G +WRV
Sbjct: 256 QAAFSQRAAHDADGRVLVLEGQSVPWKDHLYTLEQEEGRSSVL-YVLYQEKPEPGAKWRV 314
Query: 281 QAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMAR 340
Q V S D F SRKPLP WRG RD EL GI GCVFVH +GFIGGN ++ GA AM
Sbjct: 315 QCVPESKDSFVSRKPLPEAWRGFRDAELDGITGIEGCVFVHAAGFIGGNATFEGAKAMVA 374
Query: 341 AAL 343
AL
Sbjct: 375 KAL 377
>gi|189195820|ref|XP_001934248.1| hypothetical protein PTRG_03915 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980127|gb|EDU46753.1| hypothetical protein PTRG_03915 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 354
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 145/357 (40%), Positives = 209/357 (58%), Gaps = 28/357 (7%)
Query: 13 STSSPSQ---TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAV 69
+T +P++ T + +GTHNG FH DEAL M++L + +AQ+VR+RDP +L + V
Sbjct: 2 ATENPAKRLKTDVPTIGTHNGHFHADEALAVSMLQLLPTYQDAQLVRTRDPALLAECHTV 61
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
+DVGG Y+ YDHHQ+GF+ VF G +TKLSSAGLVY HFGK++I + D
Sbjct: 62 VDVGGEYEDGKKRYDHHQRGFDVVFP-GHTTKLSSAGLVYMHFGKDIITTVTGLQGADCD 120
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPR----------------YVNNTNLSSRV 173
+ L+ +Y +F+EA DA DNGI+ D + VN N +
Sbjct: 121 L--LYEKIYSDFIEAFDANDNGISALDPKDLEKAGLEKKFATSGFTIASVVNRYNYGPKS 178
Query: 174 GKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDY 233
K + T Q AE E+ F G++F + +SWLPAR V + R Y
Sbjct: 179 PKSDETKTPETQQAE-EDMRFMSASRFVGEQFRWELIDRAQSWLPARHQVKQAYDARLQY 237
Query: 234 DPSGEIMVLKRFCPWKLHLFELEEEMK----IEPLIKYVLYEDDRGK-QWRVQAVAVSPD 288
DP G I+VL PW HL+ E+E + + P + YVL+ +D+ + +WR++AV+
Sbjct: 238 DPQGRILVLPEGMPWADHLYNFEKETQQPQGVAPQVLYVLFPEDKPEGKWRIRAVSKENS 297
Query: 289 RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
FE+RK LP W+G+RD++L + +GIPGCVFVH +GFIGGN+++ GALAMA+ AL+L
Sbjct: 298 GFENRKDLPDAWKGVRDEQLDQVSGIPGCVFVHAAGFIGGNKTFDGALAMAKKALEL 354
>gi|157115876|ref|XP_001658325.1| hypothetical protein AaeL_AAEL007356 [Aedes aegypti]
gi|108876735|gb|EAT40960.1| AAEL007356-PA [Aedes aegypti]
Length = 401
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 192/331 (58%), Gaps = 14/331 (4%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G FHCDE L CFM++ K+ NA + R+R ++LD D V+DVG V+D + +D
Sbjct: 71 IGTHDGIFHCDEVLACFMLQQLPKYANATVTRTRKQELLDQCDIVVDVGAVFDRDQNRFD 130
Query: 85 HHQKGFEEVFGH-------GFSTKLSSAGLVYKHFGKELIAKELN---VDEGHPDVHR-L 133
HHQ F + +LSSAGL+Y +FG+++I + L ++ D+ R +
Sbjct: 131 HHQASFNDTLNSLRPELKVKREIRLSSAGLIYTYFGEDVIRQILKANGIESASDDLVRGV 190
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
F +Y + IDAIDNG+ +D + P+Y NT+LS+RV N W E +
Sbjct: 191 FRKLYDTLIAEIDAIDNGVPMFDGE--PKYSINTHLSARVSHFNPAWNEDAGDDTDAMKR 248
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
F++ G+EF+D V +Y W PAR +V + +R + SGEI+ L+ CPWK HL+
Sbjct: 249 FEKAKAYVGQEFIDKVLYYAVRWWPARELVENAVKKRMEVHASGEILELEMLCPWKEHLY 308
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
ELE++ I + KYV+Y ++ WRV V + P F RK L A WRG+RD EL +G
Sbjct: 309 ELEDQYDIAGVPKYVIYF-NKENDWRVICVPLQPASFVCRKFLAAPWRGVRDKELESVSG 367
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
I G F H +GFIGGN + GAL MA A+L+
Sbjct: 368 IEGITFCHQTGFIGGNTTREGALKMAVASLE 398
>gi|326473694|gb|EGD97703.1| hypothetical protein TESG_05105 [Trichophyton tonsurans CBS 112818]
Length = 358
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 202/350 (57%), Gaps = 39/350 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + + ++R+RDP L V+DVGGVYDPS + YD
Sbjct: 16 IGTHNGHFHADEALAVYLLRLLPAYSTSPVIRTRDPGQLATCHTVVDVGGVYDPSQNRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F TKLSSAGLVY HFGK +IA+ + + H DV ++ +Y +F+EA
Sbjct: 76 HHQRTFDTTFPQ-HQTKLSSAGLVYMHFGKAIIAQHMGLSTEHQDVSTIYEKLYADFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNT-----------NLSSRVGKLNLDWTEPDQSAERENEA 193
+DA DNGI+ Y P+ ++N NL S +G +NL P ++ + E+
Sbjct: 135 LDAHDNGISVY----CPKAISNAGLEKRFRDGGINLGSLIGDMNL--CGPSENLD-EDAL 187
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHL 252
F + G+ F + WLPAR+ V + R D PSG+IM+L+ PWK HL
Sbjct: 188 FARASTFIGEAFSRKLHAASSKWLPARATVAQAHQSRMDVHPSGKIMLLENSGIPWKEHL 247
Query: 253 FELEEEM------KIEPLIKYVLYEDDRGKQ--WRVQAVAVSPDRFESRKPLPAQWRGLR 304
+ LEEE + + YV+Y + ++ WRVQ V VS + FESRKPLP WRG R
Sbjct: 248 YRLEEEAGNVTAGTADNKVFYVIYPESTAEKSNWRVQCVPVSENSFESRKPLPETWRGAR 307
Query: 305 DDEL----SKEAG------IP-GCVFVHMSGFIGGNQSYGGALAMARAAL 343
D EL SKE IP G +FVH SGFIGG+++ GALAMA +L
Sbjct: 308 DSELDSIISKEMEEKQRERIPEGAIFVHASGFIGGHKNKAGALAMAAESL 357
>gi|327309698|ref|XP_003239540.1| hypothetical protein TERG_01525 [Trichophyton rubrum CBS 118892]
gi|326459796|gb|EGD85249.1| hypothetical protein TERG_01525 [Trichophyton rubrum CBS 118892]
Length = 367
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 202/350 (57%), Gaps = 39/350 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + + ++R+RDP L V+DVGGVYDPS + YD
Sbjct: 25 IGTHNGHFHADEALAVYLLRLLPAYSTSPVIRTRDPGQLATCHTVVDVGGVYDPSQNRYD 84
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F TKLSSAGLVY HFGK +IA+ + + H DV ++ +Y +F+EA
Sbjct: 85 HHQRTFDTTFPQ-HQTKLSSAGLVYMHFGKAIIAQHMGLSTEHQDVSTIYEKLYADFIEA 143
Query: 145 IDAIDNGINQYDTDKPPRYVNNT-----------NLSSRVGKLNLDWTEPDQSAERENEA 193
+DA DNGI+ Y P+ ++N NL S +G +NL P ++ + E+
Sbjct: 144 LDAHDNGISVY----CPKAISNAGMEKRFRDGGINLGSLIGDMNL--CGPGENLD-EDAL 196
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHL 252
F + G+ F + WLPAR+ V + R D PSG+IM+L+ PWK HL
Sbjct: 197 FARASTFIGEAFSRKLHAASSKWLPARATVAQAHQSRMDVHPSGKIMLLENSGIPWKEHL 256
Query: 253 FELEEEM------KIEPLIKYVLYEDDRGKQ--WRVQAVAVSPDRFESRKPLPAQWRGLR 304
+ LEEE + + YV+Y + ++ WRVQ V VS + FESRKPLP WRG R
Sbjct: 257 YRLEEEAGNVTAGTADNKVFYVIYPESTAEKSNWRVQCVPVSENSFESRKPLPESWRGAR 316
Query: 305 DDEL----SKEAG------IP-GCVFVHMSGFIGGNQSYGGALAMARAAL 343
D EL SKE IP G +FVH SGFIGG+++ GALAMA +L
Sbjct: 317 DSELDSIISKEMEEKQRERIPEGAIFVHASGFIGGHKNKAGALAMAAESL 366
>gi|157115874|ref|XP_001658324.1| hypothetical protein AaeL_AAEL007356 [Aedes aegypti]
gi|108876734|gb|EAT40959.1| AAEL007356-PB [Aedes aegypti]
Length = 408
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 194/338 (57%), Gaps = 21/338 (6%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH+G FHCDE L CFM++ K+ NA + R+R ++LD D V+DVG V+D + +D
Sbjct: 71 IGTHDGIFHCDEVLACFMLQQLPKYANATVTRTRKQELLDQCDIVVDVGAVFDRDQNRFD 130
Query: 85 HHQKGF--------------EEVFGHGFSTKLSSAGLVYKHFGKELIAKELN---VDEGH 127
HHQ F E+ + +LSSAGL+Y +FG+++I + L ++
Sbjct: 131 HHQASFNDTLNSLRPELKVKREIRNYCMIYRLSSAGLIYTYFGEDVIRQILKANGIESAS 190
Query: 128 PDVHR-LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQS 186
D+ R +F +Y + IDAIDNG+ +D + P+Y NT+LS+RV N W E
Sbjct: 191 DDLVRGVFRKLYDTLIAEIDAIDNGVPMFDGE--PKYSINTHLSARVSHFNPAWNEDAGD 248
Query: 187 AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC 246
+ F++ G+EF+D V +Y W PAR +V + +R + SGEI+ L+ C
Sbjct: 249 DTDAMKRFEKAKAYVGQEFIDKVLYYAVRWWPARELVENAVKKRMEVHASGEILELEMLC 308
Query: 247 PWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
PWK HL+ELE++ I + KYV+Y ++ WRV V + P F RK L A WRG+RD
Sbjct: 309 PWKEHLYELEDQYDIAGVPKYVIYF-NKENDWRVICVPLQPASFVCRKFLAAPWRGVRDK 367
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
EL +GI G F H +GFIGGN + GAL MA A+L+
Sbjct: 368 ELESVSGIEGITFCHQTGFIGGNTTREGALKMAVASLE 405
>gi|452004196|gb|EMD96652.1| hypothetical protein COCHEDRAFT_1199549 [Cochliobolus
heterostrophus C5]
Length = 354
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 147/356 (41%), Positives = 205/356 (57%), Gaps = 32/356 (8%)
Query: 18 SQTPLKR-------VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL 70
S+ P KR +GTHNG FH DEAL M++L + ++Q+VR+RDP VL + V+
Sbjct: 3 SENPTKRLKTDAPLIGTHNGHFHADEALAVSMLQLLPAYRDSQLVRTRDPAVLAECHTVV 62
Query: 71 DVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV 130
DVGG YD ++ YDHHQ+GF+ VF G TKLSSAGLVY HFGK++I + PD
Sbjct: 63 DVGGEYDDNSRRYDHHQRGFDVVFP-GHKTKLSSAGLVYMHFGKDIITAVTGLQ--GPDR 119
Query: 131 HRLFLAVYKNFMEAIDAIDNGIN---QYDTDKP-------------PRYVNNTNLSSRVG 174
L+ +Y +F+EA DA DNGIN D +K VN N +
Sbjct: 120 DLLYEKIYTDFIEAFDANDNGINVIAPTDLEKAGLAKKFEDRGFSIASVVNRYNYGPK-S 178
Query: 175 KLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYD 234
L+ + P+ E+ F G++F + SWLPAR V + R YD
Sbjct: 179 PLSDEAKTPEAKQAEEDSRFMTASRFVGEQFRYELVDRAESWLPARHQVKQAYDSRLKYD 238
Query: 235 PSGEIMVLKRFCPWKLHLFELEEEMK----IEPLIKYVLYEDDRGK-QWRVQAVAVSPDR 289
G I+VL PW HL+ LE+E + P + YVL+ +D+ + +WR++AV+
Sbjct: 239 AQGRILVLPEGMPWADHLYMLEKETPQPEGVAPQVLYVLFPEDKPEGKWRIRAVSKENGG 298
Query: 290 FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
FE+RK LP W+G+RD++L + +GIPGCVFVH +GFIGGN+++ GALAMA+ AL+L
Sbjct: 299 FENRKDLPDAWKGVRDEQLDQVSGIPGCVFVHAAGFIGGNKTFDGALAMAQKALEL 354
>gi|326482902|gb|EGE06912.1| hypothetical protein TEQG_05965 [Trichophyton equinum CBS 127.97]
Length = 358
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 202/350 (57%), Gaps = 39/350 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + + ++R+RDP L V+DVGGVYDPS + YD
Sbjct: 16 IGTHNGHFHADEALAVYLLRLLPAYSTSPVIRTRDPGQLATCHTVVDVGGVYDPSQNRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F TKLSSAGLVY HFGK +IA+ + + H DV ++ +Y +F+EA
Sbjct: 76 HHQRTFDTTFPQ-HQTKLSSAGLVYMHFGKAIIAQHMGLSTEHQDVSTIYEKLYADFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNT-----------NLSSRVGKLNLDWTEPDQSAERENEA 193
+DA DNGI+ Y P+ ++N NL S +G +NL P ++ + E+
Sbjct: 135 LDAHDNGISVY----CPKAISNAGLEKRFRDGGINLGSLIGDMNL--CGPGENLD-EDAL 187
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHL 252
F + G+ F + WLPAR+ V + R D PSG+IM+L+ PWK HL
Sbjct: 188 FARASTFIGEAFSRKLHAASSKWLPARATVAQAHQSRMDVHPSGKIMLLENSGIPWKEHL 247
Query: 253 FELEEEM------KIEPLIKYVLYEDDRGKQ--WRVQAVAVSPDRFESRKPLPAQWRGLR 304
+ LEEE + + YV+Y + ++ WRVQ V VS + FESRKPLP WRG R
Sbjct: 248 YRLEEEAGNVTAGTADNKVFYVIYPESTAEKSNWRVQCVPVSENSFESRKPLPETWRGAR 307
Query: 305 DDEL----SKEAG------IP-GCVFVHMSGFIGGNQSYGGALAMARAAL 343
D EL SKE IP G +FVH SGFIGG+++ GALAMA +L
Sbjct: 308 DSELDSIISKEMEEKQRERIPEGAIFVHASGFIGGHKNKAGALAMAVESL 357
>gi|440636693|gb|ELR06612.1| hypothetical protein GMDG_08085 [Geomyces destructans 20631-21]
Length = 332
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 143/339 (42%), Positives = 196/339 (57%), Gaps = 41/339 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +M+RL + +A ++R+RDP +L V+DVGG YD + + YD
Sbjct: 16 IGTHNGHFHADEALAVYMLRLLPAYQSASLIRTRDPALLATCHTVVDVGGEYDATRNLYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F + TKLSSAGLVY HFGK +IA+ L V+E +V L+ +Y++F+EA
Sbjct: 76 HHQRTFTTTFPNR-PTKLSSAGLVYMHFGKAIIAQHLGVEESADEVMILWNKIYEDFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKLNLDWTEPDQSAER---- 189
+DA DNGI+ Y P+ + + R VG+LN +W +P S
Sbjct: 135 LDAHDNGISAY----CPKALQAAGIEKRFSDGGFTLGAMVGRLNPNWNDPRPSQPAAWQA 190
Query: 190 -ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCP 247
E+E F + G EF T+ + +SWLPAR IV A+R DYD G I+V + P
Sbjct: 191 IEDEKFAKASTKIGAEFSSTLDYLTKSWLPAREIVRGAFAKRLDYDAQGRILVFDGQSVP 250
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRD 305
WK HL++LE E + + YVLY + G +WR+QAV+V+ D FESRKPLP WRG RD
Sbjct: 251 WKDHLYKLESEQGGDEKVVYVLYPESPVAGAKWRIQAVSVTTDSFESRKPLPENWRGFRD 310
Query: 306 DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+ L GI G ++ GA+AMAR AL+
Sbjct: 311 EALD---GITG--------------TFEGAIAMARKALE 332
>gi|300122707|emb|CBK23273.2| unnamed protein product [Blastocystis hominis]
Length = 370
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 194/322 (60%), Gaps = 4/322 (1%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G+FH DEAL ++R ++ + I+R+R+ ++L D V+DVGG YD
Sbjct: 53 KVIGTHSGTFHSDEALAVSLLRTLPEYRDHVIMRTRNQELLKKCDIVVDVGGEYDFEKKR 112
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQKGFE F TKLSSAGL+YKHFG+E+I LN D+ ++ VY +F+
Sbjct: 113 FDHHQKGFECTFDSQHKTKLSSAGLIYKHFGREIIGAILNRSLPPADLDYVYHKVYDHFV 172
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
E ID DNG+N D Y T +SSRV +L + W E + + F+ M L
Sbjct: 173 EEIDGTDNGVNSSSGD--SNYKVTTTISSRVARLGISWREEWSEEKEQER-FRFAMGLMI 229
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF V LPAR IV E + + + PSGEI+V+K CP+ H+FELE+E E
Sbjct: 230 GEFWQRVHIEADEILPARGIVREAMEKAKEVHPSGEILVMKESCPYMEHVFELEKERGEE 289
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
K+V+ E+ G WRV+A+ P FE RK + A+ GLRD+ELS+ +GI GC+FVH+
Sbjct: 290 GKTKFVVVENTDG-SWRVRAMNAGPGTFEVRKKILAKCLGLRDEELSRASGIEGCIFVHI 348
Query: 323 SGFIGGNQSYGGALAMARAALK 344
+GFIG N++ GAL MA +L+
Sbjct: 349 NGFIGSNKTKEGALKMAIQSLE 370
>gi|330921529|ref|XP_003299457.1| hypothetical protein PTT_10456 [Pyrenophora teres f. teres 0-1]
gi|311326832|gb|EFQ92426.1| hypothetical protein PTT_10456 [Pyrenophora teres f. teres 0-1]
Length = 354
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 141/356 (39%), Positives = 208/356 (58%), Gaps = 26/356 (7%)
Query: 13 STSSPSQ---TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAV 69
+T +P++ T + +GTHNG FH DEAL M++L + +AQ+VR+RDP +L + V
Sbjct: 2 ATENPAKRLKTNVPTIGTHNGHFHADEALAVSMLQLLPTYQDAQLVRTRDPALLAECHTV 61
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
+DVGG Y+ YDHHQ+GF+ VF G +TKLSSAGLVY HFGK++I + D
Sbjct: 62 VDVGGEYEDGKKRYDHHQRGFDVVFP-GHTTKLSSAGLVYMHFGKDIITTVTGLQGADCD 120
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRY-------VNNTNLSSRVGKLNL---- 178
+ L+ +Y +F+EA DA DNGI+ D + + ++S V + N
Sbjct: 121 L--LYEKIYSDFIEAFDANDNGISALDPKDLEKAGLEKKFATSGFTIASVVNRYNYGPKP 178
Query: 179 ----DWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYD 234
D P+ E+ F G++F + +SWLPAR V + R YD
Sbjct: 179 PKSDDTKTPEAQQAEEDMRFMSASRFVGEQFRWELIDRAQSWLPARHQVKQAYDARLQYD 238
Query: 235 PSGEIMVLKRFCPWKLHLFELEEEMK----IEPLIKYVLYEDDRGK-QWRVQAVAVSPDR 289
P G I+VL PW HL+ E+E + + P + YVL+ +D+ + +WR++AV+
Sbjct: 239 PQGRILVLLEGMPWADHLYNFEKETQQPQGVAPQVLYVLFPEDKPEGKWRIRAVSKENSG 298
Query: 290 FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
FE+RK LP W+G+RD++L + +GI GCVFVH +GFIGGN+++ GALAMA+ AL+L
Sbjct: 299 FENRKDLPDAWKGVRDEQLDQVSGISGCVFVHAAGFIGGNKTFDGALAMAKKALEL 354
>gi|312385728|gb|EFR30154.1| hypothetical protein AND_00412 [Anopheles darlingi]
Length = 351
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 139/332 (41%), Positives = 193/332 (58%), Gaps = 14/332 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG FHCDE L CFM++ ++ AQIVRSRD KVLD D V+DVGG +D Y
Sbjct: 19 KIGTHNGVFHCDELLACFMLQQLPQYATAQIVRSRDNKVLDQCDIVVDVGGTFDRERHRY 78
Query: 84 DHHQKGFEEV-------FGHGFSTKLSSAGLVYKHFG----KELIAKELNVDEGHPDVHR 132
DHHQ F E ++ +LSSAGLVY +FG +E++ ++L ++ +
Sbjct: 79 DHHQGSFNETLNSLRPELNSPWNIRLSSAGLVYTYFGEEVLREVVKRKLGLELDADCLRA 138
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
++ VY + IDAIDNG+ ++ + P+Y ++LS+R GK N W E + + E
Sbjct: 139 VYRKVYDGMISEIDAIDNGVPMFEGGE-PQYNITSHLSARAGKFNPVWNE-PEPPPEDME 196
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
F++ G+EF D V +YV+SW PAR IV + R + SG I+ L +FCPWK HL
Sbjct: 197 RFERAKAYVGQEFEDKVLYYVKSWWPARDIVRRALVNRLNVHESGAILELDQFCPWKEHL 256
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
+ELE E KYV+Y ++ WR+ V F RK L WRG+RD+ L+ +
Sbjct: 257 YELENEHDAVGQAKYVIY-FNKENDWRIICVPKQSTSFVCRKFLAKPWRGVRDEALATIS 315
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GI G F H +GFIGGN S GAL MA A+L+
Sbjct: 316 GIEGATFCHQTGFIGGNVSRDGALRMAIASLE 347
>gi|226290709|gb|EEH46193.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 371
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 148/355 (41%), Positives = 203/355 (57%), Gaps = 33/355 (9%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
+T +GTHNG FH DEAL +++RL + ++ ++R+RDP +L V+DVGG YDP
Sbjct: 20 KTSAPVIGTHNGHFHADEALAVYLLRLLPTYTSSPLIRTRDPSLLATCHTVVDVGGEYDP 79
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
+ + YDHHQ+ F F ST+LSSAGLVY HFG+ +IA+ ++ HPDV L+ +Y
Sbjct: 80 ARNRYDHHQRTFTATFP-SHSTRLSSAGLVYLHFGRAIIAQHTSLPIDHPDVSTLYEKLY 138
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRV--GKLNL-----DWTEPDQSA---E 188
+F+EA+DA DNGI+ YD P V L R G +NL D PD S+ +
Sbjct: 139 ADFIEALDAHDNGISVYD----PSRVAAAGLEKRFRDGGINLGSLVGDLNGPDPSSKDPQ 194
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCP 247
E+ F++ G FL +R SWLPAR+ V E R+D SG I+VL +
Sbjct: 195 DEDSLFEKASKFIGDVFLRKLRLASASWLPARATVREAYESRFDTHESGRILVLPQPGVL 254
Query: 248 WKLHLFELEEEMKI-----------EPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRK 294
WK HL+ LE++ E + YVLY + G +WRV V V+ FESR+
Sbjct: 255 WKEHLYTLEKKQNAGLDGVAEVDAEEGKVYYVLYPESTAEGARWRVHCVPVTDASFESRR 314
Query: 295 PLPAQWRGLRDDELS---KEAGIP-GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
PLP WRGLRD++L K +P G +FVH SGF GG+ + GALAMA +L+L
Sbjct: 315 PLPEAWRGLRDEDLDGVLKGEDVPSGAIFVHASGFTGGHATREGALAMAVRSLRL 369
>gi|118398020|ref|XP_001031340.1| uncharacterized protein family (UPF0160) [Tetrahymena thermophila]
gi|89285667|gb|EAR83677.1| uncharacterized protein family (UPF0160) [Tetrahymena thermophila
SB210]
Length = 351
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 135/344 (39%), Positives = 201/344 (58%), Gaps = 21/344 (6%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMI-RLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVY 76
++ LK++GTH+G FHCDE L C M+ + T +F + I R+R+ ++LD + ++DVGG+Y
Sbjct: 2 AEVALKKIGTHSGVFHCDEVLACVMLSKYTSEFKDGIITRTREQEILDQQNIIVDVGGIY 61
Query: 77 DPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELI---AKELNVDEGHPDVH-- 131
DPS YDHHQ+ F + F + +LSSAGLVYKHFG+E+I A+ L +DE +++
Sbjct: 62 DPSKHRYDHHQRSFVDTFSSQHNIRLSSAGLVYKHFGQEIIKNVAQSL-IDENKDNLNIE 120
Query: 132 ---------RLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTE 182
L+ +Y F++ +D DNG+ QY + Y N T L R+G+LN WTE
Sbjct: 121 ITLNQETLDSLYQRIYDGFIQGVDGSDNGVEQYPVEVKSAYSNPTQLQQRIGRLNPLWTE 180
Query: 183 PDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL 242
+ EN F+ M++A E V+ + S LPA IV + + R++ PSGEI++L
Sbjct: 181 KNTD---ENVRFRSAMEIADMELRWQVKIQLLSVLPAYDIVKQSVLNRFNVHPSGEIVIL 237
Query: 243 KRFCPWKLHLFELEEEMKIEPLIKYVLY-EDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
+ PWK HL +LE+ + + IK+VL+ E K WRV V + ++ R L +WR
Sbjct: 238 ETVVPWKSHLEDLEKSLNLGKQIKFVLFPESSAKKAWRVSTVPENWGTYDLRIGLKEEWR 297
Query: 302 GLRD-DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
G++D EL I VFVH SGFIGG +SY L MA +++
Sbjct: 298 GIKDMTELKNVTKIDDIVFVHNSGFIGGAKSYESVLRMALESIE 341
>gi|289741551|gb|ADD19523.1| putative metal-binding protein [Glossina morsitans morsitans]
Length = 363
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 131/351 (37%), Positives = 209/351 (59%), Gaps = 15/351 (4%)
Query: 7 SSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSR-DPKVLDD 65
S + + S + +GTH+G+FHCDEAL CFM++ ++ NA+I+RSR + ++
Sbjct: 3 SKAESLSGNDCGNNSTPTIGTHSGTFHCDEALACFMLKQLPEYENAEILRSRINEELRKK 62
Query: 66 LDAVLDVGGVYDPSNDCYDHHQKGFEEVF-------GHGFSTKLSSAGLVYKHFGKELI- 117
++DVG V+D YDHHQ F+E G + +LSSAGL++ ++G+ +I
Sbjct: 63 CSIIVDVGNVFDHERKLYDHHQPSFQETLSSLRPELGDKYKIRLSSAGLIFNYYGERVID 122
Query: 118 --AKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQY--DTDKPPRYVNNTNLSSRV 173
K+ +D ++ F+ VY+ F++ IDAIDNG+ + + P Y T++S+RV
Sbjct: 123 CILKKHGIDLSDENLRLTFIQVYQKFIKEIDAIDNGLPMFLCPEGQEPLYTIGTDVSARV 182
Query: 174 GKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDY 233
G++NL W D++ + +++ F++ M G EF++ V + SW+ AR V +
Sbjct: 183 GRINLSWD--DETGDCQDDRFREAMCFVGNEFVEEVLYTGGSWIKARECVRTALQNAAKV 240
Query: 234 DPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESR 293
+GEI++L+R PWK HLF+LEEE K+E K V++ED WRV V V+P F R
Sbjct: 241 YETGEILLLERALPWKEHLFDLEEECKVEGRSKLVIFEDPLDNSWRVAGVPVTPQSFLGR 300
Query: 294 KPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+ LP +WRGLR+D+L + AG+ +FVH +GFIGG ++ ALAMA +++
Sbjct: 301 QFLPIEWRGLRNDDLFQAAGVKDLLFVHNNGFIGGAKNKEAALAMAIKSVQ 351
>gi|449301468|gb|EMC97479.1| hypothetical protein BAUCODRAFT_68098 [Baudoinia compniacensis UAMH
10762]
Length = 372
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/361 (39%), Positives = 209/361 (57%), Gaps = 36/361 (9%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
S+ P+ +GTHNG FH DEAL F+++L + +A +VR+RDP +L V+DVGGV+D
Sbjct: 15 SEAPV--IGTHNGHFHADEALAVFLLKLLPDYRHATLVRTRDPDLLKTCTIVVDVGGVHD 72
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
YDHHQ+ F F G TKLSSAGLV+ H+GK++I+ +D PD L+ +
Sbjct: 73 DDLLRYDHHQREFNATF-PGKQTKLSSAGLVWMHYGKQIISVVTQLDAQSPDCELLYQKI 131
Query: 138 YKNFMEAIDAIDNGINQYD-------------TDKP---PRYVNNTNL--SSRVG----K 175
Y +F+EA DA DNGI+ YD +DK VN N S+R G
Sbjct: 132 YDDFVEAFDANDNGISVYDPTALRKAGIEKKFSDKGFSIASVVNRYNYAPSAREGAEGPT 191
Query: 176 LNLDWT------EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAE 229
N T + +S + E+ F + G++F + SWLPAR++V + +E
Sbjct: 192 ANGTTTSGAPTAQSGKSQDGEDARFLRASAFVGEQFSLEIDDRFASWLPARAVVKQAFSE 251
Query: 230 RYDYDPSGEIMVL---KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDR--GKQWRVQAVA 284
R Y+ G I+V+ PW HL+ LE E E + Y L+ ++ G +WR++AV+
Sbjct: 252 RTKYEAKGRIIVIPYRSEGVPWSDHLYALESESDAEGQVLYALFAENGEPGSKWRIRAVS 311
Query: 285 VSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+ P FE+RK LP WRG+RD+ELS+ +G+PGC+FVH SGFIGGN ++ GAL MA+ A++
Sbjct: 312 LDPGSFENRKGLPEAWRGVRDEELSRVSGVPGCIFVHASGFIGGNATFEGALEMAKKAVE 371
Query: 345 L 345
+
Sbjct: 372 M 372
>gi|453080261|gb|EMF08312.1| metal-dependent protein hydrolase [Mycosphaerella populorum SO2202]
Length = 368
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 141/357 (39%), Positives = 207/357 (57%), Gaps = 29/357 (8%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVG 73
++SP+ +P+ +GTHNG FH DEAL + +RL F A ++R+RDP L+ V+DVG
Sbjct: 16 STSPAASPV--IGTHNGHFHADEALAVYFLRLLPDFSAASLIRTRDPGTLEKCTIVVDVG 73
Query: 74 GVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
GV+D S YDHHQ+ F F G +TKLSSAGLV+ H+GK +++ +D + L
Sbjct: 74 GVHDHSQLRYDHHQREFCATF-PGKNTKLSSAGLVWMHYGKSIVSAVTGLDVESAENELL 132
Query: 134 FLAVYKNFMEAIDAIDNGINQYD-------------TDKP-------PRYVNNTNLSSRV 173
+ +Y++F++A DA DNGI+ YD +DK RY N+ L + V
Sbjct: 133 YQKLYEDFVQAFDANDNGISVYDPAEIRKAGIEKKTSDKGFSIANVVGRY-NHAPLLNGV 191
Query: 174 GKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDY 233
K + T + E+ F + G++F +WLPAR+IV + R
Sbjct: 192 PKNGVIQTSKAKDQAEEDARFLRASAFVGEQFSIEFNDKFSAWLPARAIVAQAFHSRTAV 251
Query: 234 DPSGEIMVLKRF---CPWKLHLFELEEEMKIEPLIKYVLYED--DRGKQWRVQAVAVSPD 288
DP G I+V+ PW HL+ LEEE + + Y L+ + ++ +WR++AV++ P
Sbjct: 252 DPQGRILVVPHNDGGVPWMDHLYALEEENGAQGSVLYALFAESGEKDSKWRIRAVSLEPG 311
Query: 289 RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
FE+RK LP WRG+RD+ELSK +GIPGC+FVH GFIGGN ++ GAL MA+ A++L
Sbjct: 312 SFENRKGLPEAWRGVRDEELSKLSGIPGCIFVHAGGFIGGNTTFEGALEMAKKAVEL 368
>gi|425774396|gb|EKV12704.1| hypothetical protein PDIG_43050 [Penicillium digitatum PHI26]
gi|425776839|gb|EKV15038.1| hypothetical protein PDIP_41630 [Penicillium digitatum Pd1]
Length = 363
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 208/349 (59%), Gaps = 29/349 (8%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + ++ +VR+RDP L+ V+DVGGVYDP+ YD
Sbjct: 16 IGTHNGHFHADEALAVYLLRLLPIYASSPLVRTRDPAELEKCHTVVDVGGVYDPAIHRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F +TKLSSAGLVY HFGK ++A++L++ H DV L+ +Y +F+EA
Sbjct: 76 HHQRTFSTTFPQ-HATKLSSAGLVYMHFGKAILAQKLSLPVEHADVDLLYEKLYTDFIEA 134
Query: 145 IDAIDNGINQYDTDKP------PRYVN-NTNLSSRVGKL-NLDWTEPDQSAERENEAFQQ 196
IDA DNGI+ YD R+ N L+S VG + N D T P + E+ F +
Sbjct: 135 IDANDNGISAYDQAALAAAGVEKRFKNGGITLASMVGDMNNPDPTSPPGEPQDEDSLFGR 194
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFEL 255
L G F + SW+PAR+ V A R D PSG I+VL + PWK HL+
Sbjct: 195 ASTLIGNAFARKMHHACTSWMPARTTVGSAYASRKDTHPSGRIIVLPQSGVPWKEHLYNF 254
Query: 256 EEEM----KIEPLIK--YVLYEDD--RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
E E +++P + YVLY ++ G +WRVQ V+VS FESRKPLP WRG+RD +
Sbjct: 255 EAEASGTKEVDPAFQVYYVLYPENATEGAKWRVQCVSVSESSFESRKPLPEAWRGVRDQD 314
Query: 308 L----------SKEAGIP-GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
L + ++ IP G VFVH SGFIGG+++ GA+AMA +L+L
Sbjct: 315 LDGVMAAEAEKNSQSKIPEGAVFVHASGFIGGHKTREGAMAMAERSLEL 363
>gi|451855168|gb|EMD68460.1| hypothetical protein COCSADRAFT_156884 [Cochliobolus sativus
ND90Pr]
Length = 354
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 146/356 (41%), Positives = 203/356 (57%), Gaps = 32/356 (8%)
Query: 18 SQTPLKR-------VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL 70
S+ P KR +GTHNG FH DEAL M++L + ++Q+VR+RDP L + V+
Sbjct: 3 SENPTKRLKTDAPLIGTHNGHFHADEALAVSMLQLLPAYRDSQLVRTRDPAALAECHTVV 62
Query: 71 DVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV 130
DVGG YD ++ YDHHQ+GF+ VF G TKLSSAGLVY HFGK++I + PD
Sbjct: 63 DVGGEYDDNSRRYDHHQRGFDVVFP-GHKTKLSSAGLVYMHFGKDIITAVTGLQ--GPDR 119
Query: 131 HRLFLAVYKNFMEAIDAIDNGIN---QYDTDKP-------------PRYVNNTNLSSRVG 174
L+ +Y +F+EA DA DNGIN D +K VN N R
Sbjct: 120 DLLYEKIYNDFIEAFDANDNGINVIAPTDLEKAGLEKKFEDRGFSIASVVNRYNYGPR-S 178
Query: 175 KLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYD 234
L+ + P+ E+ F G++F + SWLPAR V + R YD
Sbjct: 179 PLSDETKTPEAKQAEEDSRFMTASRFVGEQFRYELVDRAESWLPARHQVKQAYDSRLKYD 238
Query: 235 PSGEIMVLKRFCPWKLHLFELEEEMK----IEPLIKYVLYEDDRGK-QWRVQAVAVSPDR 289
G I+VL PW HL+ LE+E + P + YVL+ +D+ + +WR++AV+
Sbjct: 239 AQGRILVLPEGMPWADHLYTLEKETPQPEGVAPHVLYVLFPEDKPEGKWRIRAVSKENGG 298
Query: 290 FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
FE+RK LP W+G+RD++L + +GI GCVFVH +GFIGGN+++ GALAMA+ AL+L
Sbjct: 299 FENRKDLPDAWKGVRDEQLDQVSGISGCVFVHTAGFIGGNKTFDGALAMAQKALEL 354
>gi|90652825|ref|NP_001035070.1| UPF0160 protein MYG1, mitochondrial [Danio rerio]
gi|89130760|gb|AAI14296.1| Zgc:136866 [Danio rerio]
Length = 273
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 122/276 (44%), Positives = 182/276 (65%), Gaps = 12/276 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG+FHCDE L CF++R ++ +A+IVRSRD VL + D V+DVGG YD S Y
Sbjct: 4 KIGTHNGTFHCDEVLACFLLRQLPEYKDAEIVRSRDASVLAECDVVVDVGGEYDHSRQRY 63
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
DHHQ+ F E F + TKLSSAGLVY H+G+ ++ + ++ E P + L+ +Y
Sbjct: 64 DHHQRAFAESFHSVCAQKPWVTKLSSAGLVYVHYGRRVLQQLTHLQEDEPQLEVLYDKMY 123
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+ F+E +DA+DNGI+Q D ++ RY ++ +SSRV LN W +Q E E F++ +
Sbjct: 124 EGFVEEVDAVDNGISQSDGEQ--RYTISSTISSRVSYLNPQWNSKEQDTE---EGFRKAL 178
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELEE 257
L G EF D + ++ +WLPAR +V++ I R+ D SG++++L++ CPWK HLF LE+
Sbjct: 179 ALVGSEFQDRLLYFTNAWLPARDVVLQAIKSRHQVDVSGQVLLLQQGGCPWKEHLFALEK 238
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESR 293
E++++ LIK+VLY D G WRVQ V P+ F++R
Sbjct: 239 ELQLQELIKFVLYCDQNG-HWRVQCVPAGPNTFQNR 273
>gi|320591149|gb|EFX03588.1| upf0160 domain containing protein [Grosmannia clavigera kw1407]
Length = 363
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 206/360 (57%), Gaps = 41/360 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRL-TDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+ THNG FH DEAL M+R ++ NAQ+VR+RDP L V+DVGGVYD + +
Sbjct: 6 IATHNGHFHADEALAVHMLRRHVPEYENAQLVRTRDPAKLAVAHTVVDVGGVYDAAARRF 65
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK------------------ELNVDE 125
DHHQ+GF+ F STKLSSAGLVY HFG++++A+ E D
Sbjct: 66 DHHQRGFDTTFPDR-STKLSSAGLVYMHFGRDIVARRIQQRKAKLKLKQQQSEAEAETDA 124
Query: 126 GHPDVHRLFLAVYKNFMEAIDAIDNGINQYD------TDKPPRYVNNT-NLSSRVGKLNL 178
DV +L+ +Y F+EA+DA DNGI+QYD R+ + +L + VG+ N
Sbjct: 125 TEADVDQLYRKLYSGFVEALDAHDNGIDQYDPVELASAGIQKRFSDGGFSLGAVVGRYNA 184
Query: 179 DWTEPDQSAE---RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP 235
W + + A+ E+ F Q G+EF ++ + V +WLPARS+V A R D+DP
Sbjct: 185 AWNDVEAEAQGQAGEDVRFVQASARIGEEFDSSLNYLVGAWLPARSLVRSAFAARLDFDP 244
Query: 236 SGEIMVLK-RFCPWKLHLFELEEEMKIEPL------IKYVLYEDDRGKQWRVQAVAVSPD 288
G ++V + + CPWK HL+ EE + YVLY D+ +WRVQ V S D
Sbjct: 245 KGRLLVFRNQSCPWKDHLYAEEEAASQAAADDADVKVLYVLYP-DKEDRWRVQCVPASKD 303
Query: 289 RFESRKPLPAQWRGLRD---DELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
F SRKPLP WRG RD D+L + +PG VFVH +GFIGG++++ GAL MA+ AL++
Sbjct: 304 SFTSRKPLPEAWRGHRDGDLDKLLEGEVVPGAVFVHAAGFIGGHRTFEGALQMAKKALEV 363
>gi|225557735|gb|EEH06020.1| MYG1 protein [Ajellomyces capsulatus G186AR]
Length = 371
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 150/365 (41%), Positives = 200/365 (54%), Gaps = 45/365 (12%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
+T +GTHNG FH DEAL +++RL + ++ +VR+RDP +L V+DVGG YDP
Sbjct: 11 KTSPPTIGTHNGHFHADEALAVYLLRLLPTYTSSPLVRTRDPSLLATCHTVVDVGGEYDP 70
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
S + YDHHQ+ F F STKLSSAGL+Y HFGK +IA+ ++ HPDV L+ +Y
Sbjct: 71 SRNRYDHHQRTFTATFPQ-HSTKLSSAGLIYLHFGKAIIAQHTSLPPDHPDVATLYEKLY 129
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSA---E 188
+F+EA+DA DNGI+ YD +N + R +G L D PD +A +
Sbjct: 130 SDFIEALDANDNGISVYDNAA----LNAAGVRKRFRDSGISLGALVGDLNHPDPTATGPQ 185
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCP 247
E+ F+ G FL +R SWLPAR+ V E R+D SG I+VL K P
Sbjct: 186 DEDALFENASTFIGDAFLRKLRVASASWLPARATVSEAYKARFDTHASGRILVLPKAGIP 245
Query: 248 WKLHLFELEEEMKIEPL-------------------IKYVLYEDDRGKQ--WRVQAVAVS 286
WK HLF LEEE + YVLY + + WRVQ V V+
Sbjct: 246 WKEHLFVLEEEEAAAAAAGAEAEAGKIGGAHVDRRQVYYVLYPESTAPEAKWRVQCVPVT 305
Query: 287 PDRFESRKPLPAQWRGLRDDEL-----SKEAGI---PGCVFVHMSGFIGGNQSYGGALAM 338
FESR+PLP W G+RD++L S+E + PG VFVH +GFIGG+ + GA AM
Sbjct: 306 DTSFESRRPLPEAWCGIRDEDLDKLLASEETSVVVPPGAVFVHATGFIGGHATREGAFAM 365
Query: 339 ARAAL 343
A +L
Sbjct: 366 AVRSL 370
>gi|302663342|ref|XP_003023314.1| hypothetical protein TRV_02540 [Trichophyton verrucosum HKI 0517]
gi|291187306|gb|EFE42696.1| hypothetical protein TRV_02540 [Trichophyton verrucosum HKI 0517]
Length = 354
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 203/356 (57%), Gaps = 39/356 (10%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q+ L VG +G FH DEAL +++RL + + ++R+RDP L V+DVGGVYDP
Sbjct: 6 QSKLTTVGLASGHFHADEALAVYLLRLLPAYSTSPVIRTRDPGQLATCHTVVDVGGVYDP 65
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
S + YDHHQ+ F+ F TKLSSAGLVY HFGK +IA+ + + H DV ++ +Y
Sbjct: 66 SQNRYDHHQRTFDTTFPQ-HQTKLSSAGLVYMHFGKAIIAQHMGLSTEHQDVSTIYEKLY 124
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNT-----------NLSSRVGKLNLDWTEPDQSA 187
+F+EA+DA DNGI+ Y P+ ++N NL S +G +NL P ++
Sbjct: 125 ADFIEALDAHDNGISVY----CPKAISNAGLEKRFRDGGINLGSLIGDMNL--CGPGENL 178
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-C 246
+ E+ F + G+ F + WLPAR+ V + R D PSG+IM+L+
Sbjct: 179 D-EDALFARASTFIGEAFSRKLHAASSRWLPARATVSQAHQSRMDVHPSGKIMLLENSGI 237
Query: 247 PWKLHLFELEEEM------KIEPLIKYVLYEDDRGKQ--WRVQAVAVSPDRFESRKPLPA 298
PWK HL+ LEEE + + YV+Y + ++ WRVQ V VS + FESRKPLP
Sbjct: 238 PWKEHLYRLEEEADNVTAGTADNKVFYVIYPESTAEKSNWRVQCVPVSENSFESRKPLPE 297
Query: 299 QWRGLRDDEL----SKEAG------IP-GCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRG RD EL SKE IP G +FVH SGFIGG+++ GALAMA +L
Sbjct: 298 SWRGARDSELDSIISKEMEEKQRERIPEGAIFVHASGFIGGHKNKAGALAMAAESL 353
>gi|302500613|ref|XP_003012300.1| hypothetical protein ARB_01560 [Arthroderma benhamiae CBS 112371]
gi|291175857|gb|EFE31660.1| hypothetical protein ARB_01560 [Arthroderma benhamiae CBS 112371]
Length = 354
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 203/356 (57%), Gaps = 39/356 (10%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q+ L VG +G FH DEAL +++RL + + ++R+RDP L V+DVGGVYDP
Sbjct: 6 QSKLTTVGLASGHFHADEALAVYLLRLLPAYSTSPVIRTRDPGQLATCHTVVDVGGVYDP 65
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
S + YDHHQ+ F+ F TKLSSAGLVY HFGK +IA+ + + H DV ++ +Y
Sbjct: 66 SQNRYDHHQRTFDTTFPQ-HQTKLSSAGLVYMHFGKAIIAQHMGLSTEHQDVSTIYEKLY 124
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNT-----------NLSSRVGKLNLDWTEPDQSA 187
+F+EA+DA DNGI+ Y P+ ++N NL S +G +NL P ++
Sbjct: 125 ADFIEALDAHDNGISVY----CPKAISNAGLEKRFRDGGINLGSLIGDMNL--CGPGENL 178
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-C 246
+ E+ F + G+ F + WLPAR+ V + R D PSG+IM+L+
Sbjct: 179 D-EDALFARASTFIGEAFSRKLHAASSKWLPARATVSQAHQSRMDVHPSGKIMLLENSGI 237
Query: 247 PWKLHLFELEEEM------KIEPLIKYVLYEDDRGKQ--WRVQAVAVSPDRFESRKPLPA 298
PWK HL+ LEEE + + YV+Y + ++ WRVQ V VS + FESRKPLP
Sbjct: 238 PWKEHLYRLEEEAGNVTAGTADNKVFYVIYPESTAEKSNWRVQCVPVSENSFESRKPLPE 297
Query: 299 QWRGLRDDEL----SKEAG------IP-GCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRG RD EL SKE IP G +FVH SGFIGG+++ GALAMA +L
Sbjct: 298 SWRGARDSELDSIISKEMEEKQRERIPEGAIFVHASGFIGGHKNKAGALAMAVESL 353
>gi|148672041|gb|EDL03988.1| melanocyte proliferating gene 1, isoform CRA_a [Mus musculus]
Length = 282
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 126/250 (50%), Positives = 166/250 (66%), Gaps = 6/250 (2%)
Query: 95 GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQ 154
G + TKLSSAGLVY HFG++L+A+ L E V ++ +Y+NF+E +DA+DNGI+Q
Sbjct: 23 GKPWQTKLSSAGLVYLHFGRKLLAQLLGTSEEDSVVDTIYDKMYENFVEEVDAVDNGISQ 82
Query: 155 YDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVR 214
+ + PRY T LS+RV +LN W +P+Q E F++ MDL +EFL + FY
Sbjct: 83 W-AEGEPRYAMTTTLSARVARLNPTWNQPNQDTE---AGFRRAMDLVQEEFLQRLNFYQH 138
Query: 215 SWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDD 273
SWLPAR++V E +A+R+ D SGEI+ L K CPWK HL+ LE E+ + I +V+Y D
Sbjct: 139 SWLPARALVEEALAQRFKVDSSGEIVELAKGGCPWKEHLYHLESELSPKVAITFVIYTDQ 198
Query: 274 RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYG 333
G QWRVQ V P F+SR PLP WRGLRD L + +GIPGC+FVH SGFIGG+ +
Sbjct: 199 AG-QWRVQCVPKEPHSFQSRLPLPEPWRGLRDKALDQVSGIPGCIFVHASGFIGGHHTRE 257
Query: 334 GALAMARAAL 343
GAL MARA L
Sbjct: 258 GALNMARATL 267
>gi|261198959|ref|XP_002625881.1| MYG1 protein [Ajellomyces dermatitidis SLH14081]
gi|239595033|gb|EEQ77614.1| MYG1 protein [Ajellomyces dermatitidis SLH14081]
Length = 361
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 200/349 (57%), Gaps = 35/349 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + ++ ++R+RDP +L V+DVGG YDP+ + YD
Sbjct: 17 IGTHNGHFHADEALAVYLLRLLPTYTSSPLIRTRDPSLLATCHTVVDVGGEYDPARNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F +TKLSSAGL+Y HFGK +IA+ + HPDV L+ +Y +F+EA
Sbjct: 77 HHQRSFTTTFPQ-HTTKLSSAGLIYLHFGKAIIAQHTALPVHHPDVATLYEKLYTDFIEA 135
Query: 145 IDAIDNGINQYDTDK------PPRYVN-NTNLSSRVGKLNLDWTEPDQSA---ERENEAF 194
+DA DNGI+ YD+ R+ + NL + VG LN PD +A + E+ F
Sbjct: 136 LDANDNGISVYDSAALAAAGVQKRFRDGGINLGALVGDLN----HPDPTAAEPQDEDALF 191
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLF 253
+ G FL +R SWLPAR+ V E R+D SG+I+VL PWK HLF
Sbjct: 192 ENASRFIGGAFLRKLRVASSSWLPARATVSEAYKARFDTHASGQILVLPTAGVPWKEHLF 251
Query: 254 ELEEEMKIEPL---------IKYVLYEDDRGKQ--WRVQAVAVSPDRFESRKPLPAQWRG 302
LEE + YVLY + + WRVQ V V+ FESRKPLP W G
Sbjct: 252 ALEEAAAAAAAEAGADLLRQVYYVLYPESTAPEAKWRVQCVPVTDVSFESRKPLPEAWCG 311
Query: 303 LRDDEL-----SKEAGI---PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+RD++L S+EA + PG VFVH +GFIGG+ + GA AMA +L
Sbjct: 312 VRDEDLDRVLASEEASVVVPPGAVFVHANGFIGGHATREGAFAMAVRSL 360
>gi|325095466|gb|EGC48776.1| MYG1 protein [Ajellomyces capsulatus H88]
Length = 372
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 150/366 (40%), Positives = 200/366 (54%), Gaps = 46/366 (12%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
+T +GTHNG FH DEAL +++RL + ++ +VR+RDP +L V+DVGG YDP
Sbjct: 11 KTSPPTIGTHNGHFHADEALAVYLLRLLPTYTSSPLVRTRDPSLLATCHTVVDVGGEYDP 70
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
S + YDHHQ+ F F STKLSSAGL+Y HFGK +IA+ ++ HPDV L+ +Y
Sbjct: 71 SRNRYDHHQRTFTATFPQ-HSTKLSSAGLIYLHFGKAIIAQHTSLPPDHPDVATLYEKLY 129
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSA---E 188
+F+EA+DA DNGI+ YD +N + R +G L D PD +A +
Sbjct: 130 SDFIEALDANDNGISVYDNAA----LNAAGVRKRFRDSGISLGALVGDLNHPDPTATGPQ 185
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCP 247
E+ F+ G FL +R SWLPAR+ V E R+D SG I+VL K P
Sbjct: 186 DEDALFENASKFIGDAFLRKLRVASASWLPARATVSEAYKARFDTHASGRILVLPKAGIP 245
Query: 248 WKLHLFELEEEMKIEPL--------------------IKYVLYEDDRG--KQWRVQAVAV 285
WK HLF LEEE + YVLY + +WRVQ V V
Sbjct: 246 WKEHLFVLEEEAAAAAAAARAEAEAGKIGGAHADRRQVYYVLYPESTSPDAKWRVQCVPV 305
Query: 286 SPDRFESRKPLPAQWRGLRDDEL-----SKEAGI---PGCVFVHMSGFIGGNQSYGGALA 337
+ FESR+PLP W G+RD++L S+E + PG VFVH +GFIGG+ + GA A
Sbjct: 306 TDTSFESRRPLPEAWCGIRDEDLDKLLASEETSVVVPPGAVFVHATGFIGGHATREGAFA 365
Query: 338 MARAAL 343
MA +L
Sbjct: 366 MAVRSL 371
>gi|119617096|gb|EAW96690.1| chromosome 12 open reading frame 10, isoform CRA_d [Homo sapiens]
Length = 261
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 126/250 (50%), Positives = 166/250 (66%), Gaps = 6/250 (2%)
Query: 95 GHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQ 154
G + TKLSSAGL+Y HFG +L+A+ L E V L+ +Y+NF+E +DA+DNGI+Q
Sbjct: 7 GKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGTLYDKMYENFVEEVDAVDNGISQ 66
Query: 155 YDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVR 214
+ + PRY T LS+RV +LN W PDQ E F++ MDL +EFL + FY
Sbjct: 67 W-AEGEPRYALTTTLSARVARLNPTWNHPDQDTE---AGFKRAMDLVQEEFLQRLDFYQH 122
Query: 215 SWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDD 273
SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ LE + I +V+Y D
Sbjct: 123 SWLPARALVEEALAQRFQVDPSGEIVELAKGACPWKEHLYHLESGLSPPVAIFFVIYTDQ 182
Query: 274 RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYG 333
G QWR+Q V P F+SR PLP WRGLRD+ L + +GIPGC+FVH SGFIGG+++
Sbjct: 183 AG-QWRIQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPGCIFVHASGFIGGHRTRE 241
Query: 334 GALAMARAAL 343
GAL+MARA L
Sbjct: 242 GALSMARATL 251
>gi|239609855|gb|EEQ86842.1| MYG1 protein [Ajellomyces dermatitidis ER-3]
gi|327350780|gb|EGE79637.1| MYG1 protein [Ajellomyces dermatitidis ATCC 18188]
Length = 362
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 200/350 (57%), Gaps = 36/350 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + ++ ++R+RDP +L V+DVGG YDP+ + YD
Sbjct: 17 IGTHNGHFHADEALAVYLLRLLPTYTSSPLIRTRDPSLLATCHTVVDVGGEYDPARNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F +TKLSSAGL+Y HFGK +IA+ + HPDV L+ +Y +F+EA
Sbjct: 77 HHQRSFTTTFPQ-HTTKLSSAGLIYLHFGKAIIAQHTALPVDHPDVATLYEKLYTDFIEA 135
Query: 145 IDAIDNGINQYDTDK------PPRYVN-NTNLSSRVGKLNLDWTEPDQSA---ERENEAF 194
+DA DNGI+ YD+ R+ + NL + VG LN PD +A + E+ F
Sbjct: 136 LDANDNGISVYDSAALAAAGVQKRFRDGGINLGALVGDLN----HPDPTAAEPQDEDALF 191
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLF 253
+ G FL +R SWLPAR+ V E R+D SG+I+VL PWK HLF
Sbjct: 192 ENASRFIGGAFLRKLRVASSSWLPARATVSEAYKARFDTHASGQILVLPTAGVPWKEHLF 251
Query: 254 ELEEEMKIEPL----------IKYVLYEDDRGKQ--WRVQAVAVSPDRFESRKPLPAQWR 301
LEE + YVLY + + WRVQ V V+ FESRKPLP W
Sbjct: 252 ALEEAAAAAAAAEAGADLLRQVYYVLYPESTAPEAKWRVQCVPVTDVSFESRKPLPEAWC 311
Query: 302 GLRDDEL-----SKEAGI---PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
G+RD++L S+EA + PG VFVH +GFIGG+ + GA AMA +L
Sbjct: 312 GVRDEDLDRVLASEEASVVVPPGAVFVHANGFIGGHATREGAFAMAVRSL 361
>gi|315047831|ref|XP_003173290.1| hypothetical protein MGYG_03462 [Arthroderma gypseum CBS 118893]
gi|311341257|gb|EFR00460.1| hypothetical protein MGYG_03462 [Arthroderma gypseum CBS 118893]
Length = 358
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 142/350 (40%), Positives = 197/350 (56%), Gaps = 39/350 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + + ++R+RDP L V+DVGGVYDPS + YD
Sbjct: 16 IGTHNGHFHADEALAVYLLRLLPAYSTSPLIRTRDPAQLATCHTVVDVGGVYDPSQNRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F TKLSSAGLVY HFGK ++A+ + + H DV+ ++ +Y +F+EA
Sbjct: 76 HHQRSFDTTFPQ-HQTKLSSAGLVYMHFGKAIVAQHMGLSTDHQDVNTIYEKLYADFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNT-----------NLSSRVGKLNLDWTEPDQSAERENEA 193
+DA DNGI+ Y P+ ++N NL S +G +NL P + + E+
Sbjct: 135 LDAHDNGISVY----CPKTISNAGLEKRFRDGGINLGSLIGDMNL--CGPGEKLD-EDTL 187
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHL 252
F + G+ F + WLPAR+ V + R + PSG+I++L+ PWK HL
Sbjct: 188 FARASTFIGETFSRKLHAASSKWLPARATVSQSYHSRLEVHPSGKILLLESSGIPWKEHL 247
Query: 253 FELEEEM------KIEPLIKYVLYEDDRGK--QWRVQAVAVSPDRFESRKPLPAQWRGLR 304
+ LEEE + + YV+Y + + + WRVQ V VS FESRK LP WRG R
Sbjct: 248 YRLEEEASNSATENVNNEVFYVIYPESKAEDTNWRVQCVPVSESSFESRKALPETWRGAR 307
Query: 305 DDELSK----------EAGIP-GCVFVHMSGFIGGNQSYGGALAMARAAL 343
D EL K IP G +FVH SGFIGG+++ GALAMA L
Sbjct: 308 DSELDKIIAKEMEEKQREEIPEGAIFVHASGFIGGHKNKTGALAMALEGL 357
>gi|452836622|gb|EME38566.1| hypothetical protein DOTSEDRAFT_75920 [Dothistroma septosporum
NZE10]
Length = 369
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 140/369 (37%), Positives = 210/369 (56%), Gaps = 32/369 (8%)
Query: 8 SSPAYSTSSPSQ-TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDL 66
S+PA S T +GTHNG FH DEAL +++RL ++ NA + R+RD +VL+D
Sbjct: 2 SAPASKRVKMSNGTAAPVIGTHNGHFHADEALAVYLLRLLPEYHNASLTRTRDAEVLNDC 61
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEG 126
D V+DVGG++D S YDHHQ+ F F G TKLSSAGLV+ H+GK +I+ +
Sbjct: 62 DIVVDVGGIHDHSAKRYDHHQREFNATFP-GKQTKLSSAGLVWMHYGKHIISAVTALGVE 120
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDK-----PPRYVNNT--NLSSRVGKLNLD 179
D L+ VY++F+EA D DNGI+ YD + + N+ ++S V + N
Sbjct: 121 DADNELLYQEVYEDFVEAFDGNDNGISTYDPQELRKAGIEKKFNDGGFTIASVVSRYNRA 180
Query: 180 ------------------WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARS 221
+P ++ + E+ F + G++F + SWLPAR+
Sbjct: 181 PVSQLADGKPAGNGAPPAQGKPGKTQDEEDNRFLRASAFVGEQFSLKLTDMFTSWLPARA 240
Query: 222 IVVECIAERYDYDPSGEIMVLK---RFCPWKLHLFELEEEMKIEPLIKYVLYEDDR--GK 276
+V + R+ D SG I+V+ PW HL+ LEEE ++ + Y L+ ++
Sbjct: 241 VVAQAFNSRHAVDASGSIIVIPYKPEGVPWSDHLYSLEEEHNVKGNVLYALFAENGEPDS 300
Query: 277 QWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGAL 336
+WR++AV++ FE+RK LP WRG+RDDELSK +GI GC+FVH +GFIGGN+++ GAL
Sbjct: 301 KWRIRAVSLELGSFENRKGLPEAWRGVRDDELSKLSGISGCIFVHAAGFIGGNKTFEGAL 360
Query: 337 AMARAALKL 345
MA+ A+++
Sbjct: 361 EMAKKAVQM 369
>gi|342186344|emb|CCC95830.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 338
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 189/341 (55%), Gaps = 45/341 (13%)
Query: 42 MIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHG---F 98
M+R + +F A I+R+RD +L+ + V+DVGG Y+ +DHHQ F+ +
Sbjct: 1 MLRCSAEFSTANILRTRDASLLEKCNIVVDVGGEYNAETLRFDHHQPSFQTTMTTSKSVY 60
Query: 99 STKLSSAGLVYKHFGKELIA-----------------------KELNVDEGHPDVHRLFL 135
T+LSSAGLVYKH+G+E+I K N+ + D +F
Sbjct: 61 RTRLSSAGLVYKHYGREIIQRYVEAALSSSYRSELITMTSWDLKRKNLTDSELDT--IFD 118
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPP-----------RYVNNTNLSSRVGKLNLDWTEPD 184
VYKNF+E ID IDNG+N Y YV T LS RVG+L W E
Sbjct: 119 IVYKNFVEHIDGIDNGVNSYGPAAQVDVADAGSLCVRNYVVKTALSDRVGQLMPWWNENG 178
Query: 185 Q-SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK 243
+ E EN AF Q ++L EF+ V FYV +WLPAR +V PSG I+VLK
Sbjct: 179 SGNIESENAAFLQAVELTLLEFITAVHFYVFTWLPARGLVESAFLGAEKVHPSGRIIVLK 238
Query: 244 -RFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRG 302
R+CPWK HL E+E E E + YVL+ D G WRVQAV FESRK LP WRG
Sbjct: 239 ERYCPWKDHLLEIETERGKEGHVLYVLFADKSG--WRVQAVPKDAVGFESRKALP--WRG 294
Query: 303 LRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
LRD ELS+ +G+ G VFVH+SGFIGGN++Y GALAMA AL
Sbjct: 295 LRDAELSEASGVDGGVFVHVSGFIGGNKTYEGALAMATKAL 335
>gi|212532299|ref|XP_002146306.1| UPF0160 domain protein MYG1, putative [Talaromyces marneffei ATCC
18224]
gi|210071670|gb|EEA25759.1| UPF0160 domain protein MYG1, putative [Talaromyces marneffei ATCC
18224]
Length = 360
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 201/356 (56%), Gaps = 35/356 (9%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
++ PL +GTHNG FH DEAL +M+RL ++ + +VR+RDP+ L V+DVGG YD
Sbjct: 12 TEGPL--IGTHNGHFHADEALAVYMLRLLPEYHASPLVRTRDPEQLAQCHTVVDVGGEYD 69
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
S + YDHHQ+ F F + STKLSSAGLV+ HFG+ +IA+ ++ HPDV L+ +
Sbjct: 70 ASKNRYDHHQRTFNTTFPN-HSTKLSSAGLVFMHFGRAIIAQHTSLPLDHPDVELLYEKL 128
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRV--GKLNL-----DWTEPDQ----- 185
Y +F+EA+DA DNGI+ YD P V+ L R G +NL D PD
Sbjct: 129 YTDFVEALDAHDNGISAYD----PAAVSKAGLEKRFKDGAINLGSLVGDLNYPDPVIAGG 184
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-R 244
+ E+ F + G FL +R SWLPAR+ V E R + PSG I++L
Sbjct: 185 EPQDEDSLFARASTFIGDVFLRKLRLAASSWLPARATVGEAYRNRREIHPSGRIVMLSGG 244
Query: 245 FCPWKLHLFELEEEMKIEPL--IKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQW 300
PWK HL+ E E + + YVLY ++ +WRVQ V S F SRKPLP W
Sbjct: 245 GVPWKEHLYNFESETQAGEAEEVYYVLYPENSAPDAKWRVQCVPESEGSFISRKPLPDTW 304
Query: 301 RGLRDDEL----------SKEAGIP-GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
RG+RD +L + + GIP G +F H SGFIGG+ + GALAMA +L++
Sbjct: 305 RGVRDQDLDGVIAVESQKAGKEGIPAGAIFTHASGFIGGHSTKEGALAMALRSLEM 360
>gi|134076669|emb|CAK45200.1| unnamed protein product [Aspergillus niger]
gi|350629360|gb|EHA17733.1| hypothetical protein ASPNIDRAFT_47753 [Aspergillus niger ATCC 1015]
Length = 363
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 146/359 (40%), Positives = 203/359 (56%), Gaps = 39/359 (10%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
+ +PL +GTHNG FH DEAL +++RL + ++ ++R+RDP L V+DVGG YD
Sbjct: 11 TSSPL--IGTHNGHFHADEALAVYLLRLLPTYSSSPLLRTRDPAQLATCHTVVDVGGEYD 68
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
+++ YDHHQ+ F F + +TKLSSAGLVY HFG+ +IA+ ++ H DV L+ +
Sbjct: 69 AASNRYDHHQRTFATTFPN-HNTKLSSAGLVYMHFGRAIIAQHTSLPLDHEDVTLLYEKL 127
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKL-NLDWTEPDQ 185
Y +F+EAIDA DNG++ YD P ++ N+ R VG + N D T P
Sbjct: 128 YTDFIEAIDANDNGVSAYD----PAAISAANIQKRFKDGGITITSVVGDMNNPDPTCPPG 183
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR- 244
+ E+ F + G F + V SWLPAR+ V R D PSG I++L +
Sbjct: 184 EPQDEDSLFARASTFIGNVFARKLHHAVSSWLPARTTVGAAYRSRRDVHPSGRIIILPQG 243
Query: 245 FCPWKLHLFELE------EEMKIEPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPL 296
PWK HL+ E EE + YVLY + G +WRVQ V+V+ F SRKPL
Sbjct: 244 GVPWKEHLYNFEKEASGSEETNPDEEAYYVLYPESATEGSKWRVQCVSVNEGSFVSRKPL 303
Query: 297 PAQWRGLRDDEL-------SKEAGIP----GCVFVHMSGFIGGNQSYGGALAMARAALK 344
P WRG+RD +L ++++G P G VFVH SGFIGG+Q+ GALAMA L+
Sbjct: 304 PEAWRGVRDADLDGVMAAEAEKSGKPKVPEGAVFVHASGFIGGHQTKEGALAMATRGLE 362
>gi|317030222|ref|XP_001392166.2| hypothetical protein ANI_1_50074 [Aspergillus niger CBS 513.88]
Length = 372
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 146/359 (40%), Positives = 203/359 (56%), Gaps = 39/359 (10%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
+ +PL +GTHNG FH DEAL +++RL + ++ ++R+RDP L V+DVGG YD
Sbjct: 20 TSSPL--IGTHNGHFHADEALAVYLLRLLPTYSSSPLLRTRDPAQLATCHTVVDVGGEYD 77
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
+++ YDHHQ+ F F + +TKLSSAGLVY HFG+ +IA+ ++ H DV L+ +
Sbjct: 78 AASNRYDHHQRTFATTFPN-HNTKLSSAGLVYMHFGRAIIAQHTSLPLDHEDVTLLYEKL 136
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKL-NLDWTEPDQ 185
Y +F+EAIDA DNG++ YD P ++ N+ R VG + N D T P
Sbjct: 137 YTDFIEAIDANDNGVSAYD----PAAISAANIQKRFKDGGITITSVVGDMNNPDPTCPPG 192
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR- 244
+ E+ F + G F + V SWLPAR+ V R D PSG I++L +
Sbjct: 193 EPQDEDSLFARASTFIGNVFARKLHHAVSSWLPARTTVGAAYRSRRDVHPSGRIIILPQG 252
Query: 245 FCPWKLHLFELE------EEMKIEPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPL 296
PWK HL+ E EE + YVLY + G +WRVQ V+V+ F SRKPL
Sbjct: 253 GVPWKEHLYNFEKEASGSEETNPDEEAYYVLYPESATEGSKWRVQCVSVNEGSFVSRKPL 312
Query: 297 PAQWRGLRDDEL-------SKEAGIP----GCVFVHMSGFIGGNQSYGGALAMARAALK 344
P WRG+RD +L ++++G P G VFVH SGFIGG+Q+ GALAMA L+
Sbjct: 313 PEAWRGVRDADLDGVMAAEAEKSGKPKVPEGAVFVHASGFIGGHQTKEGALAMATRGLE 371
>gi|242775267|ref|XP_002478609.1| UPF0160 domain protein MYG1, putative [Talaromyces stipitatus ATCC
10500]
gi|218722228|gb|EED21646.1| UPF0160 domain protein MYG1, putative [Talaromyces stipitatus ATCC
10500]
Length = 361
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 143/350 (40%), Positives = 199/350 (56%), Gaps = 34/350 (9%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +M+RL ++ ++ +VR+RD + L + V+DVGG YDP+ + YD
Sbjct: 17 IGTHNGHFHADEALAVYMLRLLPEYHSSPLVRTRDSERLAECHTVVDVGGEYDPAKNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F + TKLSSAGLV+ HFG+ ++++ ++ HPDV L+ +Y +F+EA
Sbjct: 77 HHQRTFNTTFPN-HKTKLSSAGLVFMHFGRAIVSQHTSLPIDHPDVELLYEKLYTDFVEA 135
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQ-----SAERENE 192
+DA DNGI+ YD P V+ + L R +G L D PD + E+
Sbjct: 136 LDAHDNGISVYD----PAAVSASGLEKRFKDGAISLGSLVGDLNYPDPVIAGGEPQDEDG 191
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLH 251
F + G FL +R +WLPAR+ V E R PSG I+VL PWK H
Sbjct: 192 LFARASTFIGDVFLRKLRLAASTWLPARATVSEAYRNRKQIHPSGRIIVLSGGGVPWKEH 251
Query: 252 LFELEEEMKIE---PLIKYVLYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
L+ E E + E + YVLY ++ +WRVQ V S F SRKPLP WRG+RD
Sbjct: 252 LYNFETESQCEGESEQVYYVLYPENSAPDAKWRVQCVPESEGSFVSRKPLPDTWRGVRDQ 311
Query: 307 EL----------SKEAGIP-GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+L + + GIP G +F H SGFIGG+Q+ GALAMA +L++
Sbjct: 312 DLDGVIAAESQKAGKEGIPSGAIFTHASGFIGGHQTKEGALAMAIRSLEM 361
>gi|296809697|ref|XP_002845187.1| MYG1 [Arthroderma otae CBS 113480]
gi|238844670|gb|EEQ34332.1| MYG1 [Arthroderma otae CBS 113480]
Length = 358
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/348 (41%), Positives = 198/348 (56%), Gaps = 35/348 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + + ++R+RDP L V+DVGGVYDP+ + YD
Sbjct: 16 IGTHNGHFHADEALAVYLLRLLPAYSTSPLIRTRDPARLATCHTVVDVGGVYDPALNRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F TKLSSAGLVY HFGK +IA+ + + H DV ++ +Y +F+EA
Sbjct: 76 HHQRTFDTTFPQ-HQTKLSSAGLVYMHFGKAIIAQHMGLSSDHQDVDTIYEKLYGDFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRV--GKLNL-------DWTEPDQSAERENEAFQ 195
+DA DNGI+ Y P+ ++N L R G +NL + + P +S + E+ F
Sbjct: 135 LDAHDNGISVY----CPKDISNAGLEKRFRDGGINLGSLISDMNLSGPGESLD-EDALFA 189
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFE 254
+ G+ F + WLPAR+ V + R+D PSG+I++L PWK HL+
Sbjct: 190 RASTFIGETFSRKLHAASSKWLPARATVTQAYRSRFDVHPSGKILLLSNSGIPWKEHLYR 249
Query: 255 LEEEM------KIEPLIKYVLYEDDRGK--QWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
LEEE E + YV+Y + + WRVQ V VS FESR+ LP WRG RD
Sbjct: 250 LEEEAGNATTGGTENNVIYVIYPESSTEDANWRVQCVPVSESSFESRRALPEAWRGARDS 309
Query: 307 EL----SKEAG------IP-GCVFVHMSGFIGGNQSYGGALAMARAAL 343
EL SKE IP G +FVH SGFIGG+++ GALAMA L
Sbjct: 310 ELDRIISKEMDAKKRDRIPDGAIFVHASGFIGGHKNKTGALAMAIEGL 357
>gi|403374276|gb|EJY87079.1| Protein MYG1, putative [Oxytricha trifallax]
Length = 390
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/337 (41%), Positives = 196/337 (58%), Gaps = 18/337 (5%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
QT + +GTH+ FHCDE L M+ T KF NA IVR+R+ VLD LD V DVG V+D
Sbjct: 36 QTGVPLIGTHSEVFHCDEVLATTMLLYTHKFQNAAIVRTRNDAVLDTLDIVCDVGAVFDV 95
Query: 79 SNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKEL---IAKEL-NVDEGHPD 129
+ +DHHQK F + KLSSAGL+YK+FGKE+ I KE+ N + P+
Sbjct: 96 EKNRFDHHQKSFNTYWYDSDTAENGGIKLSSAGLIYKYFGKEVLTNILKEVWNSEFSEPN 155
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAER 189
+ +++ +Y F + IDAIDNG+ K RY TNLS R+ + N +W P ++
Sbjct: 156 LEKIYQKLYSGFFQEIDAIDNGVT---LAKDMRYRIVTNLSDRISRFNKEWNAPKDKCQQ 212
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK 249
F++ M + +E + ++ + ++PAR IV E A++ ++ PSGEI+ + CPWK
Sbjct: 213 MQ--FKKAMRVVEEELMYQIKSISQVFMPARMIVEEAWAKKNEFHPSGEILFMSTPCPWK 270
Query: 250 LHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
HL+ELE + K+ LIK+VLY+D+R K RVQ VA + F R L +W GLR EL
Sbjct: 271 DHLYELENDNKVGELIKFVLYQDER-KMTRVQTVAPKGNFFAQRVTLCKKWHGLRAQELQ 329
Query: 310 ---KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
E+G+ FVH SGFIGG S A+ MA ++
Sbjct: 330 LMEAESGLKDLEFVHHSGFIGGAWSLETAIKMAEQSI 366
>gi|83766387|dbj|BAE56530.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 364
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 146/359 (40%), Positives = 202/359 (56%), Gaps = 39/359 (10%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
+ +PL +GTH+G FH DEAL +++R + + ++R+RDP L V+DVGG YD
Sbjct: 12 TSSPL--IGTHSGHFHADEALAVYLLRQLPTYSASPLIRTRDPVQLATCHTVVDVGGEYD 69
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
P+N+ YDHHQ+ F F + +TKLSSAGLVY HFG+ +IA+ ++ H DV+ L+ +
Sbjct: 70 PANNRYDHHQRSFSTTFPN-HTTKLSSAGLVYMHFGRAIIAEHTSLPVDHHDVNLLYEKL 128
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKL-NLDWTEPDQ 185
Y +F+EAIDA DNGI+ YD P ++ NL R VG + N D T P
Sbjct: 129 YTDFIEAIDANDNGISAYD----PAEISAANLEKRFKDGAITITSVVGDMNNPDPTCPPG 184
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR- 244
+ E+ F + G F + SWLPAR+ V R D PSG I+VL +
Sbjct: 185 EPQDEDSLFGRASTFIGNVFTRKLHHASTSWLPARTTVGNSYRSRRDVHPSGRIIVLPQG 244
Query: 245 FCPWKLHLFELEEEM------KIEPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPL 296
PWK HL+ E+E K E + YVLY + G +WRVQ V+V+ F SRKPL
Sbjct: 245 GVPWKEHLYNFEKEASESDKPKSEEEVYYVLYPESATEGSKWRVQCVSVNETSFVSRKPL 304
Query: 297 PAQWRGLRDDEL-------SKEAGIP----GCVFVHMSGFIGGNQSYGGALAMARAALK 344
P WRG+RD +L +++ G P G VFVH SGFIGG+++ GA +MA L+
Sbjct: 305 PETWRGVRDADLDGVMAAEAEKTGKPKIPEGAVFVHASGFIGGHKTKEGAFSMAVRGLE 363
>gi|238485073|ref|XP_002373775.1| UPF0160 domain protein MYG1, putative [Aspergillus flavus NRRL3357]
gi|317140996|ref|XP_001818532.2| hypothetical protein AOR_1_2854174 [Aspergillus oryzae RIB40]
gi|220701825|gb|EED58163.1| UPF0160 domain protein MYG1, putative [Aspergillus flavus NRRL3357]
gi|391869888|gb|EIT79078.1| putative metal-binding protein [Aspergillus oryzae 3.042]
Length = 373
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 146/359 (40%), Positives = 202/359 (56%), Gaps = 39/359 (10%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
+ +PL +GTH+G FH DEAL +++R + + ++R+RDP L V+DVGG YD
Sbjct: 21 TSSPL--IGTHSGHFHADEALAVYLLRQLPTYSASPLIRTRDPVQLATCHTVVDVGGEYD 78
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
P+N+ YDHHQ+ F F + +TKLSSAGLVY HFG+ +IA+ ++ H DV+ L+ +
Sbjct: 79 PANNRYDHHQRSFSTTFPN-HTTKLSSAGLVYMHFGRAIIAEHTSLPVDHHDVNLLYEKL 137
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKL-NLDWTEPDQ 185
Y +F+EAIDA DNGI+ YD P ++ NL R VG + N D T P
Sbjct: 138 YTDFIEAIDANDNGISAYD----PAEISAANLEKRFKDGAITITSVVGDMNNPDPTCPPG 193
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR- 244
+ E+ F + G F + SWLPAR+ V R D PSG I+VL +
Sbjct: 194 EPQDEDSLFGRASTFIGNVFTRKLHHASTSWLPARTTVGNSYRSRRDVHPSGRIIVLPQG 253
Query: 245 FCPWKLHLFELEEEM------KIEPLIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPL 296
PWK HL+ E+E K E + YVLY + G +WRVQ V+V+ F SRKPL
Sbjct: 254 GVPWKEHLYNFEKEASESDKPKSEEEVYYVLYPESATEGSKWRVQCVSVNETSFVSRKPL 313
Query: 297 PAQWRGLRDDEL-------SKEAGIP----GCVFVHMSGFIGGNQSYGGALAMARAALK 344
P WRG+RD +L +++ G P G VFVH SGFIGG+++ GA +MA L+
Sbjct: 314 PETWRGVRDADLDGVMAAEAEKTGKPKIPEGAVFVHASGFIGGHKTKEGAFSMAVRGLE 372
>gi|121702187|ref|XP_001269358.1| UPF0160 domain protein MYG1, putative [Aspergillus clavatus NRRL 1]
gi|119397501|gb|EAW07932.1| UPF0160 domain protein MYG1, putative [Aspergillus clavatus NRRL 1]
Length = 362
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 200/351 (56%), Gaps = 36/351 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + ++ +VR+RDP L V+DVGG YDP+++ YD
Sbjct: 16 IGTHNGHFHADEALAVYLLRLLPTYASSPLVRTRDPAQLATCHTVVDVGGEYDPASNRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F + +TKLSSAGLVY HFG+ +IA+ ++ H DV L+ +Y +F+EA
Sbjct: 76 HHQRTFSTTFPN-HNTKLSSAGLVYMHFGRVIIAQHTSLPLDHEDVTLLYEKLYTDFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSA----ERENEA 193
+DA DNG++ YD P + + N+ R + + D PD + + E+
Sbjct: 135 VDANDNGVSVYD----PAALASANVEKRFRDGGITIASVVNDMNNPDPTCAGEPQDEDSL 190
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHL 252
F + G F + SWLPAR+ V + + R + PSG IMVL PWK HL
Sbjct: 191 FNRASTFIGHVFTRKLHRACTSWLPARATVGKAYSSRREIHPSGRIMVLPEGGVPWKEHL 250
Query: 253 FELEEEMKI------EPLIKYVLYEDDRGK--QWRVQAVAVSPDRFESRKPLPAQWRGLR 304
+ E+E E + YVLY + + +WRVQ V+++ FESRKPLP WRG+R
Sbjct: 251 YNFEKEASATKNGSSEEQVYYVLYPESASEDSKWRVQCVSLNESSFESRKPLPESWRGVR 310
Query: 305 DDEL-------SKEAGIP----GCVFVHMSGFIGGNQSYGGALAMARAALK 344
D +L +++ G P G VFVH SGFIGG+++ GA AMA +L+
Sbjct: 311 DADLDGVLAAEAEKTGKPKIPEGAVFVHASGFIGGHKTKEGAFAMANQSLE 361
>gi|119617093|gb|EAW96687.1| chromosome 12 open reading frame 10, isoform CRA_a [Homo sapiens]
Length = 325
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 191/345 (55%), Gaps = 62/345 (17%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 27 GPESVPPPKRSRSKLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 86
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 87 SCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQ 146
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L G ++ D S VG L
Sbjct: 147 LL-----------------------------GTSEED--------------SMVGTLY-- 161
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
+ F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI
Sbjct: 162 ----------DKAGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEI 211
Query: 240 MVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ L K CPWK HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP
Sbjct: 212 VELAKGACPWKEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPE 270
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGFIGG+++ GAL+MARA L
Sbjct: 271 PWRGLRDEALDQVSGIPGCIFVHASGFIGGHRTREGALSMARATL 315
>gi|38014044|gb|AAH13956.2| C12orf10 protein, partial [Homo sapiens]
Length = 322
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 191/345 (55%), Gaps = 62/345 (17%)
Query: 5 GVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
G S P S R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L
Sbjct: 24 GPESVPPPKRSRSKLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLA 83
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAK 119
D V+DVGG YDP YDHHQ+ F E G + TKLSSAGL+Y HFG +L+A+
Sbjct: 84 SCDIVVDVGGEYDPRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQ 143
Query: 120 ELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLD 179
L G ++ D S VG L
Sbjct: 144 LL-----------------------------GTSEED--------------SMVGTLY-- 158
Query: 180 WTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEI 239
+ F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI
Sbjct: 159 ----------DKAGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEI 208
Query: 240 MVL-KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
+ L K CPWK HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP
Sbjct: 209 VELAKGACPWKEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPE 267
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
WRGLRD+ L + +GIPGC+FVH SGFIGG+++ GAL+MARA L
Sbjct: 268 PWRGLRDEALDQVSGIPGCIFVHASGFIGGHRTREGALSMARATL 312
>gi|358370958|dbj|GAA87568.1| UPF0160 domain protein MYG1 [Aspergillus kawachii IFO 4308]
Length = 363
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 146/359 (40%), Positives = 202/359 (56%), Gaps = 39/359 (10%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
+ +PL +GTHNG FH DEAL +++RL + ++ ++R+RDP L V+DVGG YD
Sbjct: 11 TSSPL--IGTHNGHFHADEALAVYLLRLLPTYSSSPLLRTRDPAQLATCHTVVDVGGEYD 68
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
+++ YDHHQ+ F F + +TKLSSAGLVY HFG+ +IA+ ++ H DV L+ +
Sbjct: 69 AASNRYDHHQRTFATTFPN-HNTKLSSAGLVYMHFGRAIIAQHTSLPVDHEDVTLLYEKL 127
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR-----------VGKL-NLDWTEPDQ 185
Y +F+EAIDA DNG++ YD P ++ N+ R VG + N D T P
Sbjct: 128 YTDFIEAIDANDNGVSVYD----PAAISAANIQKRFKDGGITITSVVGDMNNPDPTCPPG 183
Query: 186 SAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR- 244
+ E+ F + G F + V SWLPAR+ V R D PSG I+VL +
Sbjct: 184 EPQDEDSLFARASTFIGNVFARKLHHAVSSWLPARTTVGAAYRSRRDVHPSGRIIVLPQG 243
Query: 245 FCPWKLHLFELE------EEMKIEPLIKYVLYEDDRGK--QWRVQAVAVSPDRFESRKPL 296
PWK HL+ E EE + YVLY + + +WRVQ V+V+ F SRKPL
Sbjct: 244 GVPWKEHLYNFEKEASGSEETNPDEEAYYVLYPESAAEDSKWRVQCVSVNEGSFVSRKPL 303
Query: 297 PAQWRGLRDDEL-------SKEAGIP----GCVFVHMSGFIGGNQSYGGALAMARAALK 344
P WRG+RD +L ++ +G P G VFVH SGFIGG+Q+ GALAMA L+
Sbjct: 304 PEAWRGVRDADLDGVMAAEAERSGKPKVPEGAVFVHASGFIGGHQTKEGALAMATRGLE 362
>gi|398389534|ref|XP_003848228.1| hypothetical protein MYCGRDRAFT_106268 [Zymoseptoria tritici
IPO323]
gi|339468102|gb|EGP83204.1| hypothetical protein MYCGRDRAFT_106268 [Zymoseptoria tritici
IPO323]
Length = 368
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/368 (38%), Positives = 208/368 (56%), Gaps = 44/368 (11%)
Query: 11 AYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL 70
A TSSP +GTH+G FH DEAL F++RL ++ +A +VR+RDP++L V+
Sbjct: 12 ADGTSSPV------IGTHSGHFHADEALAVFILRLLPEYSSASLVRTRDPELLKTCSIVV 65
Query: 71 DVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV 130
DVGGV+D S YDHHQ+ F F G TKLSSAGLV+ H+GK +I+ N+D D
Sbjct: 66 DVGGVHDDSAKRYDHHQREFNVTF-PGKHTKLSSAGLVWMHYGKRIISAVTNLDVDSADN 124
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNT-----------NLSSRVGKLN-- 177
L+ +Y++F+E DA DNGI+ YD P + + +++S VG+ N
Sbjct: 125 ELLYQKLYEDFVEPFDANDNGISAYD----PAALRSAGIEKKHNDKGFSIASVVGRYNHA 180
Query: 178 ----------LDWTEP-----DQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSI 222
+ P +++ + E+ F + G++F + SWLPAR+I
Sbjct: 181 PISQLGDGAGVKGQPPAAGKTEKTQDEEDLRFLRASAFVGEQFSLELSDRFTSWLPARAI 240
Query: 223 VVECIAERYDYDPSGEIMVLK---RFCPWKLHLFELEEEMKIEPLIKYVLYED--DRGKQ 277
V + R DP G I+V+ PW HL+ LEEE + + Y L+ + D +
Sbjct: 241 VKQAFNSRTALDPQGRIIVIPYKPEGVPWMDHLYSLEEEAGQQGSVLYALFAESGDESSK 300
Query: 278 WRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALA 337
WR++AV++ P FE+RK LP WRG+RD+ LS+ +G+PG +FVH GFIGGN ++ GAL
Sbjct: 301 WRIRAVSLEPGSFENRKGLPEAWRGVRDEALSEVSGVPGGIFVHAGGFIGGNATFEGALE 360
Query: 338 MARAALKL 345
MA+ A++L
Sbjct: 361 MAKKAVEL 368
>gi|50551559|ref|XP_503254.1| YALI0D24970p [Yarrowia lipolytica]
gi|49649122|emb|CAG81458.1| YALI0D24970p [Yarrowia lipolytica CLIB122]
Length = 324
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 145/325 (44%), Positives = 189/325 (58%), Gaps = 15/325 (4%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G+FH DE+L FM++ +F +A +VRSRD + LD D V+DV G YD +
Sbjct: 3 KTIGTHSGAFHADESLAVFMLKQLPEFKDADLVRSRDMETLDKCDIVVDVSGQYDGTK-Y 61
Query: 83 YDHHQKGFEEVFGHG--FSTKLSSAGLVYKHFGKELIAKEL-NVDEGHPDVHRLFLAVYK 139
+DHHQ+GFEE+F F TKLSSAGLVYKHFGK++I L + D+ L+ +YK
Sbjct: 62 FDHHQRGFEEIFDLDGEFVTKLSSAGLVYKHFGKDVIRAILKDASVSDADIDLLYRKIYK 121
Query: 140 NFMEAIDAIDNGINQYD---TDKPPRYVNNTNLSSRVGKLN-LDWTEPDQSAERENEAFQ 195
+F+EAIDA DNGI Y +KP L + V LN L E ++ A+ F
Sbjct: 122 DFVEAIDANDNGIEPYSEPIAEKPKFKQFGITLPALVSTLNGLVKEESERDAQ-----FN 176
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
L G F + V WLPAR+ V+ + + C WK HLF +
Sbjct: 177 VASQLMGTAFHNLVYTAGTVWLPARAKVLMAVEAALKNGDEQRYLEFDESCQWKDHLFTI 236
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
EEE IE YVLY RVQAV F+SRKPLP +WRGLRD+ELS+++GIP
Sbjct: 237 EEEKGIEGRFLYVLYP--TPDSVRVQAVNEKDSAFKSRKPLPEEWRGLRDEELSEKSGIP 294
Query: 316 GCVFVHMSGFIGGNQSYGGALAMAR 340
G VFVH +GFIGGN++ GAL MA+
Sbjct: 295 GGVFVHAAGFIGGNKTLEGALEMAK 319
>gi|119496187|ref|XP_001264867.1| UPF0160 domain protein MYG1, putative [Neosartorya fischeri NRRL
181]
gi|119413029|gb|EAW22970.1| UPF0160 domain protein MYG1, putative [Neosartorya fischeri NRRL
181]
Length = 364
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 140/353 (39%), Positives = 201/353 (56%), Gaps = 37/353 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++R + ++ ++R+RDP L V+DVGG YDP+++ YD
Sbjct: 17 IGTHNGHFHADEALAVYLLRQLPTYASSPLIRTRDPAQLATCHTVVDVGGEYDPASNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F + +T+LSSAGLVY HFG+ +IA+ ++ H DV L+ +Y +F+EA
Sbjct: 77 HHQRTFSTTFPN-HNTRLSSAGLVYMHFGRAIIAQHTSLPLDHEDVTLLYEKLYTDFIEA 135
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRV--GKL----------NLDWTEPDQSAERENE 192
+DA DNG++ YD P + + N+ R G + N D T P + E+
Sbjct: 136 VDANDNGVSVYD----PAALASANIEKRFRDGGITITSVVNDMNNPDPTCPPGEPQDEDS 191
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLH 251
F + G F + SWLPAR+ V R + PSG I++L + PWK H
Sbjct: 192 LFSRASTFMGHVFTRKLHHAFTSWLPARATVGAAYRSRREVHPSGRIIILPQGGVPWKEH 251
Query: 252 LFELEEEMK----IEP--LIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPLPAQWRGL 303
L+ E+E + P + YVLY + G +WRVQ V+V+ FESRKPLP WRG+
Sbjct: 252 LYNFEKEASGAGDVNPDERVYYVLYPESATEGSKWRVQCVSVNEGSFESRKPLPESWRGV 311
Query: 304 RDDEL-------SKEAGIP----GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
RD +L +++AG P G VFVH SGFIGG+++ GA AMA +L+L
Sbjct: 312 RDADLDGCLAAEAEKAGKPKIPEGAVFVHASGFIGGHKTKEGAFAMAVRSLEL 364
>gi|146322542|ref|XP_752265.2| UPF0160 domain protein MYG1 [Aspergillus fumigatus Af293]
gi|129557719|gb|EAL90227.2| UPF0160 domain protein MYG1, putative [Aspergillus fumigatus Af293]
gi|159131021|gb|EDP56134.1| UPF0160 domain protein MYG1, putative [Aspergillus fumigatus A1163]
Length = 364
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/353 (39%), Positives = 201/353 (56%), Gaps = 37/353 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++R + ++ ++R+RDP L V+DVGG YDP+++ YD
Sbjct: 17 IGTHNGHFHADEALAVYLLRQLPTYASSPLIRTRDPAQLATCHTVVDVGGEYDPASNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F + +T+LSSAGLVY HFG+ +IA+ ++ H DV L+ +Y +F+EA
Sbjct: 77 HHQRTFSTTFPN-HNTRLSSAGLVYMHFGRAIIAQHTSLPLDHEDVTLLYEKLYTDFIEA 135
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSA-----ERENE 192
+DA DNG++ YD P + + N+ R + + D PD + + E+
Sbjct: 136 VDANDNGVSVYD----PAALASANIEKRFRDGGITIASVVNDMNNPDPTCPPGEPQDEDS 191
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLH 251
F + G F + SWLPAR+ V R + PSG I++L + PWK H
Sbjct: 192 LFNRASTFMGHVFTRKLHHAFTSWLPARATVGAAYQSRREVHPSGRIIILPQGGVPWKEH 251
Query: 252 LFELEEEMK----IEP--LIKYVLYEDD--RGKQWRVQAVAVSPDRFESRKPLPAQWRGL 303
L+ E+E I P + YVLY + G +WRVQ V+V+ FESRKPLP WRG+
Sbjct: 252 LYNFEKEASGAGDINPDEQVYYVLYPESATEGSKWRVQCVSVNEGSFESRKPLPESWRGV 311
Query: 304 RDDEL-------SKEAGIP----GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
RD +L ++++G P G VFVH SGFIGG+++ GA AMA +L+L
Sbjct: 312 RDADLDGCLAAEAEKSGKPKIPEGAVFVHASGFIGGHKTKEGAFAMAVRSLEL 364
>gi|357620597|gb|EHJ72744.1| hypothetical protein KGM_13668 [Danaus plexippus]
Length = 312
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 199/315 (63%), Gaps = 20/315 (6%)
Query: 42 MIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF------- 94
M+RL ++ +A+I+R+RDP+ L + D V+DVG +D + YDHHQ+ F E
Sbjct: 1 MLRLLPEYKDAEIIRTRDPEKLKECDIVVDVGAEFDHAKKRYDHHQREFNETLSSLRPEL 60
Query: 95 GHGFSTKLSSAGLVYKHFG----KELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDN 150
G F KLSSAGL+Y ++G +EL K ++D+ +V ++ VY+ +E +DAIDN
Sbjct: 61 GDKFKIKLSSAGLIYTYYGERIIQELAPKGFSLDKD--NVRLIYKKVYEFLIEEMDAIDN 118
Query: 151 GINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE-NEAFQQGMDLAGKEFLDTV 209
G+ T++ P+Y +T+LS+R+ +LN +W + + E+ +E F + L +EF+ V
Sbjct: 119 GVPM--TEEEPKYKIHTHLSARIHRLNPEW---NSTLEKNVDEKFHTALTLVSEEFMYMV 173
Query: 210 RFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVL 269
+++ WLPAR V I ER+D SG+I+ K PWK HL ++EE++ I+ IKYV+
Sbjct: 174 NYFMSIWLPARQCVQSAIEERFDVHESGQIVEFKDRFPWKSHLHDIEEDLGIKNEIKYVV 233
Query: 270 YEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGN 329
++D + +RVQAV V+P F +RKPL +W G+RD+ LS AGIP C+F H +GFIGGN
Sbjct: 234 FQD-KPDSYRVQAVPVTPSSFITRKPLLKEWWGVRDELLSDVAGIPNCIFCHSTGFIGGN 292
Query: 330 QSYGGALAMARAALK 344
++ GAL MA +LK
Sbjct: 293 KTREGALKMALISLK 307
>gi|392866168|gb|EAS28778.2| hypothetical protein CIMG_07176 [Coccidioides immitis RS]
Length = 378
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 140/354 (39%), Positives = 201/354 (56%), Gaps = 39/354 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++R+ + ++ ++R+RD + L V+DVGG YDP+ + YD
Sbjct: 27 IGTHNGHFHADEALAVYLLRMLPTYSSSPLIRTRDTEQLAACHTVVDVGGEYDPARNRYD 86
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F + T+LSSAGLVY HFGK ++A+ ++ H DV ++ +Y +F+EA
Sbjct: 87 HHQRTFQNTFPN-HQTRLSSAGLVYLHFGKAIVAQHMSKPIDHEDVQLVYEKLYTDFIEA 145
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSAER-ENEAFQQ 196
+DA DNGI+ Y+ P+ + + L R +G L D PD E+ F +
Sbjct: 146 LDANDNGISVYE----PQALAASGLQKRFRDGGINIGSLIGDMNLPDPIENLDEDGLFAK 201
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFEL 255
G+ F+ +R SWLPAR V R+D PSG I++L K PWK HL+ L
Sbjct: 202 ASKFIGETFVRKLRAASGSWLPARETVRAAYEARFDVHPSGRIILLPKGGVPWKEHLYNL 261
Query: 256 E------------EEMKIEPLIKYVLYED--DRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
E ++ ++ + YVLY + D G +WR+Q V V FESRKPLPA WR
Sbjct: 262 EVNAPGASTIAPSDDPRLGKEVFYVLYPESTDPGAKWRIQCVPVDESSFESRKPLPAPWR 321
Query: 302 GLRDDEL---------SKE-AGIP-GCVFVHMSGFIGGNQSYGGALAMARAALK 344
G+RD +L SK A IP G +F H SGFIGG+++ G LAMA +L+
Sbjct: 322 GVRDSDLDGVIAAETQSKNLAPIPEGAIFTHASGFIGGHKTREGVLAMAVRSLQ 375
>gi|119184842|ref|XP_001243280.1| hypothetical protein CIMG_07176 [Coccidioides immitis RS]
Length = 368
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 140/354 (39%), Positives = 201/354 (56%), Gaps = 39/354 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++R+ + ++ ++R+RD + L V+DVGG YDP+ + YD
Sbjct: 17 IGTHNGHFHADEALAVYLLRMLPTYSSSPLIRTRDTEQLAACHTVVDVGGEYDPARNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F + T+LSSAGLVY HFGK ++A+ ++ H DV ++ +Y +F+EA
Sbjct: 77 HHQRTFQNTFPN-HQTRLSSAGLVYLHFGKAIVAQHMSKPIDHEDVQLVYEKLYTDFIEA 135
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSAE-RENEAFQQ 196
+DA DNGI+ Y+ P+ + + L R +G L D PD E+ F +
Sbjct: 136 LDANDNGISVYE----PQALAASGLQKRFRDGGINIGSLIGDMNLPDPIENLDEDGLFAK 191
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFEL 255
G+ F+ +R SWLPAR V R+D PSG I++L K PWK HL+ L
Sbjct: 192 ASKFIGETFVRKLRAASGSWLPARETVRAAYEARFDVHPSGRIILLPKGGVPWKEHLYNL 251
Query: 256 E------------EEMKIEPLIKYVLYED--DRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
E ++ ++ + YVLY + D G +WR+Q V V FESRKPLPA WR
Sbjct: 252 EVNAPGASTIAPSDDPRLGKEVFYVLYPESTDPGAKWRIQCVPVDESSFESRKPLPAPWR 311
Query: 302 GLRDDEL---------SKE-AGIP-GCVFVHMSGFIGGNQSYGGALAMARAALK 344
G+RD +L SK A IP G +F H SGFIGG+++ G LAMA +L+
Sbjct: 312 GVRDSDLDGVIAAETQSKNLAPIPEGAIFTHASGFIGGHKTREGVLAMAVRSLQ 365
>gi|399218616|emb|CCF75503.1| unnamed protein product [Babesia microti strain RI]
Length = 323
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 123/326 (37%), Positives = 189/326 (57%), Gaps = 13/326 (3%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K++ +HNGSFHCD+AL ++R+ ++ +Q++R+RDP++LD D V+DVG +YDP N
Sbjct: 5 VKKLASHNGSFHCDDALAISLLRMLPEYAESQLIRTRDPELLDKCDVVVDVGSIYDPQNC 64
Query: 82 CYDHHQKGFEEVFGHGFS-TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
+DHHQ+GF F TKLS++GL+Y+HFGK ++ ++ V H D+ +++AVY +
Sbjct: 65 RFDHHQRGFNTFFSDKHHVTKLSASGLIYRHFGKRILTEKFAVKPEHLDI--VYVAVYDS 122
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F+EAIDAIDNG+ D +Y +NT LSSRV LN W + + + + E F++ + L
Sbjct: 123 FLEAIDAIDNGVEI--CDGKLKYRDNTGLSSRVALLNPTWLDENPNYD---EQFEKAVTL 177
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G EF+ V W PAR IV+ R + SG+++ L R CP+ HL+ +EE +
Sbjct: 178 TGNEFVSAVNRVYNIWFPAREIVLNAYNNRLNVHKSGKVIELSRHCPYYEHLYAIEESNR 237
Query: 261 IEP---LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+I +V+++ KQWR + F R P P RGLR +EL ++ I
Sbjct: 238 DSSNNNIIYFVIFKT--LKQWRCTCMRNKDSSFICRLPFPEHLRGLRGEELVNKSEIENL 295
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
FVH SGF G + + + L
Sbjct: 296 EFVHHSGFTCGGSTMESIIQLIDMTL 321
>gi|68064019|ref|XP_674004.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56492259|emb|CAH99344.1| conserved hypothetical protein [Plasmodium berghei]
Length = 368
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 117/322 (36%), Positives = 189/322 (58%), Gaps = 7/322 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G FH DE L M++ ++ +A+I+R+RD ++LD D V+DVGG+YD
Sbjct: 48 KVIGTHSGRFHTDEILASVMLKFLPEYKDAKIIRTRDQELLDKCDIVVDVGGIYDHEKKR 107
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQ+ F + + +LSSAGL+YKH+GK+++ K N+ + + V+ L+ +Y +
Sbjct: 108 YDHHQREFNDGLDENHNIRLSSAGLIYKHYGKDVLRKGFNITDENK-VNILYDKIYTVLI 166
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
E+IDA+DNGINQY+ + +Y NT L RV + N ++ E D ENE F ++
Sbjct: 167 ESIDALDNGINQYEGE--AKYQINTTLQHRVNRFNPNFLEDDID---ENERFMLASNVVK 221
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF V +Y W A++I +E + RY++ SG ++ L++ CP+ H++ +E+E+ I+
Sbjct: 222 DEFSSFVNYYSNVWYEAKTITLEAVKNRYNFHKSGRVIFLQKHCPYSDHIYNIEKELNIK 281
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
I + +Y+ DR R ++ + F R P P +RGL+++ L KE+ I G FVH
Sbjct: 282 DQILFCIYK-DRYNNCRCGTISKENEGFNIRLPFPPSFRGLKNEALVKESNIEGLTFVHY 340
Query: 323 SGFIGGNQSYGGALAMARAALK 344
SGF + + + A LK
Sbjct: 341 SGFTSAGDNIECLVKLVEATLK 362
>gi|303320633|ref|XP_003070316.1| hypothetical protein CPC735_035070 [Coccidioides posadasii C735
delta SOWgp]
gi|240110002|gb|EER28171.1| hypothetical protein CPC735_035070 [Coccidioides posadasii C735
delta SOWgp]
Length = 378
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 141/354 (39%), Positives = 197/354 (55%), Gaps = 39/354 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++R+ + ++ ++R+RD + L V+DVGG YDP+ + YD
Sbjct: 27 IGTHNGHFHADEALAVYLLRMLPTYSSSPLIRTRDAEQLAACHTVVDVGGEYDPARNRYD 86
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F + T+LSSAGLVY HFGK ++A+ ++ H DV ++ +Y +F+EA
Sbjct: 87 HHQRTFQNTFPN-HQTRLSSAGLVYLHFGKAIVAQHMSKPIDHEDVQLVYEKLYTDFIEA 145
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSAE-RENEAFQQ 196
+DA DNGI+ Y+ P+ + + L R +G L D PD E+ F +
Sbjct: 146 LDANDNGISVYE----PQALAASGLQKRFRDGGINIGSLIGDMNLPDPIENLDEDGLFAK 201
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFEL 255
G+ F+ +R SWLPAR V R+D PSG I++L K PWK HL+ L
Sbjct: 202 ASKFIGETFVRKLRAASGSWLPARETVRAAYEARFDVHPSGRIILLPKGGVPWKEHLYNL 261
Query: 256 EEEMKIEPLIK------------YVLYED--DRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
E I YVLY + D G +WR+Q V V FESRKPLPA WR
Sbjct: 262 EVNAPGASTIAPSDDPRLGKEAFYVLYPESTDPGAKWRIQCVPVDESSFESRKPLPAPWR 321
Query: 302 GLRDDEL---------SKE-AGIP-GCVFVHMSGFIGGNQSYGGALAMARAALK 344
G+RD +L SK A IP G +F H SGFIGG+++ G LAMA +L+
Sbjct: 322 GVRDSDLDGVIAAETKSKNLAPIPEGAIFTHASGFIGGHKTREGVLAMAVRSLQ 375
>gi|82794484|ref|XP_728455.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23484819|gb|EAA20020.1| Uncharacterised protein family [Plasmodium yoelii yoelii]
Length = 368
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 117/322 (36%), Positives = 189/322 (58%), Gaps = 7/322 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G FH DE L M++ ++ +A+I+R+RD ++LD D V+DVGG+YD
Sbjct: 48 KVIGTHSGRFHTDEILASVMLKFLPEYKDAKIIRTRDQELLDKCDIVVDVGGIYDHEKKR 107
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQ+ F + + +LSSAGL+YKH+GK+++ K N+ + + V+ L+ +Y +
Sbjct: 108 YDHHQREFNDGLDENHNIRLSSAGLIYKHYGKDVLRKGFNITDENK-VNILYDKIYTVLI 166
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
E+IDA+DNGINQY+ + +Y NT L RV + N ++ E D ENE F ++
Sbjct: 167 ESIDALDNGINQYEGE--AKYQINTTLQHRVNRFNPNFLEDDVD---ENERFMLASNVVK 221
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF V +Y W A++I +E + RY++ SG ++ L++ CP+ H++ +E+E+ I+
Sbjct: 222 DEFSSFVNYYSNVWYEAKTITLEAVKNRYNFHKSGRVIFLQKHCPYSDHIYNIEKELNIK 281
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
I + +Y+ DR R ++ + F R P P +RGL+++ L KE+ I G FVH
Sbjct: 282 DQILFCIYK-DRYNNCRCGTISKENEGFNLRLPFPPSFRGLKNEALVKESNIEGLTFVHY 340
Query: 323 SGFIGGNQSYGGALAMARAALK 344
SGF + + + A LK
Sbjct: 341 SGFTSAGDNIECLVKLVEATLK 362
>gi|320041419|gb|EFW23352.1| UPF0160 domain-containing protein MYG1 [Coccidioides posadasii str.
Silveira]
Length = 368
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 141/354 (39%), Positives = 197/354 (55%), Gaps = 39/354 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++R+ + ++ ++R+RD + L V+DVGG YDP+ + YD
Sbjct: 17 IGTHNGHFHADEALAVYLLRMLPTYSSSPLIRTRDAEQLAACHTVVDVGGEYDPARNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F+ F + T+LSSAGLVY HFGK ++A+ ++ H DV ++ +Y +F+EA
Sbjct: 77 HHQRTFQNTFPN-HQTRLSSAGLVYLHFGKAIVAQHMSKPIDHEDVQLVYEKLYTDFIEA 135
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSAE-RENEAFQQ 196
+DA DNGI+ Y+ P+ + + L R +G L D PD E+ F +
Sbjct: 136 LDANDNGISVYE----PQALAASGLQKRFRDGGINIGSLIGDMNLPDPIENLDEDGLFAK 191
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFEL 255
G+ F+ +R SWLPAR V R+D PSG I++L K PWK HL+ L
Sbjct: 192 ASKFIGETFVRKLRAASGSWLPARETVRAAYEARFDVHPSGRIILLPKGGVPWKEHLYNL 251
Query: 256 EEEMKIEPLIK------------YVLYED--DRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
E I YVLY + D G +WR+Q V V FESRKPLPA WR
Sbjct: 252 EVNAPGASTIAPSDDPRLGKEAFYVLYPESTDPGAKWRIQCVPVDESSFESRKPLPAPWR 311
Query: 302 GLRDDEL---------SKE-AGIP-GCVFVHMSGFIGGNQSYGGALAMARAALK 344
G+RD +L SK A IP G +F H SGFIGG+++ G LAMA +L+
Sbjct: 312 GVRDSDLDGVIAAETKSKNLAPIPEGAIFTHASGFIGGHKTREGVLAMAVRSLQ 365
>gi|115402059|ref|XP_001217106.1| MYG1 protein [Aspergillus terreus NIH2624]
gi|114188952|gb|EAU30652.1| MYG1 protein [Aspergillus terreus NIH2624]
Length = 358
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 192/343 (55%), Gaps = 24/343 (6%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++R + + ++R+RDP L V+DVGG YD +++ YD
Sbjct: 16 IGTHNGHFHADEALAVYLLRQLPTYAASPLLRTRDPAQLATCHTVVDVGGEYDAASNRYD 75
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F + +TKLSSAGLVY HFG+ +IA+ + PDV L+ +Y +F+EA
Sbjct: 76 HHQRTFATTFPN-HNTKLSSAGLVYMHFGRAIIAQHTGLPLDDPDVTLLYEKLYTDFIEA 134
Query: 145 IDAIDNGINQYDTDKPPRYV-------NNTNLSSRVGKL-NLDWTEPDQSAERENEAFQQ 196
IDA DNG++ YD + ++S VG + N D T P + E+ F +
Sbjct: 135 IDANDNGVSAYDPAALAAHGIEKRFRDGGVTIASVVGDMNNPDPTCPPGEPQDEDSLFAR 194
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFEL 255
G F + SWLPAR+ V A R D PSG ++VL + PWK HL+
Sbjct: 195 ASTFIGTIFSRKLHHAATSWLPARTTVGNAYATRRDVHPSGRVLVLPQGGVPWKEHLYNF 254
Query: 256 EEEM-KIEPLIKYVLYEDDRGK--QWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL---- 308
E+E + E + YVLY + + +WRVQ V ++ F SRKPLP WRG+RD +L
Sbjct: 255 EKEAGRPEDEVYYVLYPESAAEDAKWRVQCVPMNEGSFVSRKPLPESWRGVRDADLDGVM 314
Query: 309 ------SKEAGIP-GCVFVHMSGFIGGNQSYGGALAMARAALK 344
S + IP G VFVH SGFIGG+++ GA AMA L+
Sbjct: 315 AAEAEKSGKDKIPEGAVFVHASGFIGGHKTREGAFAMAVRGLE 357
>gi|429860731|gb|ELA35455.1| myg1 protein [Colletotrichum gloeosporioides Nara gc5]
Length = 296
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 186/319 (58%), Gaps = 38/319 (11%)
Query: 42 MIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTK 101
M+RL + ++ +VR+RDPK+L V+DVGG YD YDHHQ+GF F G +TK
Sbjct: 1 MLRLLPTYKDSNLVRTRDPKLLGTCHTVVDVGGEYDDGKKRYDHHQRGFTTTF-PGKATK 59
Query: 102 LSSAGLVYKHFGKELIAKELN------VDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQY 155
LSSAGLV+ HFGK +IA++L+ V E P+V L+ +Y++F
Sbjct: 60 LSSAGLVFMHFGKAIIAQKLSEGAETPVSEDSPEVELLYNKLYESFK------------- 106
Query: 156 DTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQS----AER-ENEAFQQGMDLAGKEFLDTV 209
R+ L S VG+LN +W +P S A+R E+E F + G+EF +
Sbjct: 107 ------RFSEGAFTLGSVVGRLNPNWNDPVPSDPVEAQRLEDERFAKASRRIGEEFDADL 160
Query: 210 RFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK-RFCPWKLHLFELEEEMKIEPLIKYV 268
+Y ++WLPAR++V +R YDP G ++VL+ + PWK HL+ LE P + YV
Sbjct: 161 DYYAKAWLPARAVVQAAFEKRTQYDPQGRLLVLEGQSAPWKDHLYSLEGG---NPSVVYV 217
Query: 269 LYEDDRG--KQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
LY + +WRVQAV V+ D FESRKPLP WRG RD+EL GIPG VFVH +GFI
Sbjct: 218 LYPEKPAPDAKWRVQAVPVTKDSFESRKPLPEAWRGFRDEELDGITGIPGGVFVHAAGFI 277
Query: 327 GGNQSYGGALAMARAALKL 345
GGN+++ GA MA AL L
Sbjct: 278 GGNKTFDGAKDMAIKALDL 296
>gi|258568586|ref|XP_002585037.1| MYG1 protein [Uncinocarpus reesii 1704]
gi|237906483|gb|EEP80884.1| MYG1 protein [Uncinocarpus reesii 1704]
Length = 355
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 134/349 (38%), Positives = 193/349 (55%), Gaps = 37/349 (10%)
Query: 29 NGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQK 88
G FH DEAL +++R+ + ++ +VR+RDP+ L V+DVGG YDP+ + YDHHQ+
Sbjct: 9 TGHFHADEALAVYLLRMLPAYSSSPLVRTRDPEQLASCHTVVDVGGEYDPARNRYDHHQR 68
Query: 89 GFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAI 148
F++ F + +T+LSSAGLVY HFGK +IA+ ++ H DV ++ +Y +F+EA+DA
Sbjct: 69 TFQDTFPN-HTTRLSSAGLVYLHFGKAIIAQHMSKPIDHEDVQTIYEKLYADFIEALDAH 127
Query: 149 DNGINQYDTDK------PPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
DNGI+ Y+ R+ + NL S +G +NL +P+ + E+ F
Sbjct: 128 DNGISVYEPQALAASGLEKRFRDGGINLGSLIGDMNL--PDPNDRLD-EDSLFANASTFI 184
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFELEEEMK 260
G+ F +R V SWLPAR V R+D SG I++L K PWK HL+ LE+
Sbjct: 185 GETFARKLRAAVSSWLPARETVRAAYESRFDVHSSGRIIILSKGGVPWKEHLYNLEQAAP 244
Query: 261 IEPL------------IKYVLYED--DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
P + YV+Y + D WR+Q V FESRKPLPA WRG+RD
Sbjct: 245 GMPTTTPSDVPMLGKEVYYVIYPESTDPAANWRIQCVPTDASSFESRKPLPAPWRGVRDQ 304
Query: 307 EL----------SKEAGIP-GCVFVHMSGFIGGNQSYGGALAMARAALK 344
+L + IP G +F H SGFIGG+++ G LAMA +L+
Sbjct: 305 DLDGVLAAETETKNLSPIPAGAIFAHASGFIGGHKTKEGVLAMADRSLQ 353
>gi|320582212|gb|EFW96430.1| hypothetical protein HPODL_2087 [Ogataea parapolymorpha DL-1]
Length = 908
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 120/277 (43%), Positives = 179/277 (64%), Gaps = 7/277 (2%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+G+FH DE+L +M+R +F ++++VRSR + D D V+DV G YD +
Sbjct: 2 KICTHSGTFHADESLAVYMLRTLKQFKDSELVRSRKQEDWDASDIVVDVSGQYD-GKKYF 60
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ F +VF GF TKLSSAGLVYKHFGK++I + L +D+ D+ L+ VY++F+E
Sbjct: 61 DHHQREFNDVFAPGFRTKLSSAGLVYKHFGKQIIRENLELDQ-EKDIDFLYERVYRDFIE 119
Query: 144 AIDAIDNGINQY---DTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
A+DA DNGIN+Y D KP N LSS V LN W A+ + +AF++ ++
Sbjct: 120 AVDANDNGINKYENQDELKPLFKDRNFQLSSVVANLNPSWITDPTDADYD-KAFEKASEI 178
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G F++ + + +S++PA+ V + I R++ DPSG+I+++ RF PWK HL+ +E+E+
Sbjct: 179 MGFAFMNYLEYMGKSFMPAKQYVEKAIGSRFEADPSGKIIIMDRFVPWKEHLYNVEKELG 238
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLP 297
IE I+YVL+ D G WR+ V VS F+SRK LP
Sbjct: 239 IEGQIQYVLFADTSGA-WRIACVPVSASSFDSRKKLP 274
>gi|221057800|ref|XP_002261408.1| hypothetical protein, conserved [Plasmodium knowlesi strain H]
gi|194247413|emb|CAQ40813.1| hypothetical protein, conserved [Plasmodium knowlesi strain H]
Length = 371
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 125/329 (37%), Positives = 193/329 (58%), Gaps = 7/329 (2%)
Query: 16 SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
SPS K +GTH+G FH DE L M++ ++ +A+I+R+RD + LD D V+DVGGV
Sbjct: 44 SPSME--KVIGTHSGRFHTDEILATVMLKFLPEYKDAKIIRTRDQQKLDKCDVVVDVGGV 101
Query: 76 YDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
Y+ YDHHQK F +LSSAGL+YKH+GKE+ K + + V+ L+
Sbjct: 102 YNHEKKRYDHHQKEFSGTLDDKHDIRLSSAGLIYKHYGKEVFRKGFGITD-EEKVNTLYD 160
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
VY F+E++DA+DNGINQY+ +Y NTNL RV + N ++ E + + +E F
Sbjct: 161 KVYSAFIESVDALDNGINQYEG--VAKYQINTNLQHRVNRFNPNFLEEETDVD-ADERFM 217
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
+ +EF+ V +Y W A+SI +E + +R+++ PSG ++ L R CP+ H++++
Sbjct: 218 SAAKIVKEEFVHFVDYYSNVWYAAKSITLEAVKDRFNFHPSGRVIFLNRHCPYNDHVYDI 277
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
EE++ I+ I + +Y DR ++ R ++ + F R P P +RGL+ +EL K + IP
Sbjct: 278 EEQLNIKDEIFFCIYH-DRYQECRCGTISKKNESFAIRLPFPKSFRGLQGEELEKVSNIP 336
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAALK 344
G FVH SGF G ++ L + A LK
Sbjct: 337 GLSFVHYSGFTSGGKNVDCLLKLVEATLK 365
>gi|389584560|dbj|GAB67292.1| hypothetical protein PCYB_113130, partial [Plasmodium cynomolgi
strain B]
Length = 369
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 123/329 (37%), Positives = 192/329 (58%), Gaps = 9/329 (2%)
Query: 16 SPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
SPS K +GTH+G FH DE L M++ ++ +A+I+R+RD + LD D V+DVGGV
Sbjct: 44 SPSME--KVIGTHSGRFHTDEILATVMLKFLPEYKDAKIIRTRDQEKLDQCDVVVDVGGV 101
Query: 76 YDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
YD YDHHQK F +LSSAGL+YKH+ K++ K + + V+ L+
Sbjct: 102 YDHEKKRYDHHQKEFNGTLDEKHDIRLSSAGLIYKHYAKDVFRKGFGITD-EEKVNTLYD 160
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
+Y F+E++DA+DNGINQ++ +Y NT L RV + N ++ E + A+ E F
Sbjct: 161 KIYTAFIESVDALDNGINQHEG--VAKYQINTTLQHRVNRFNPNFLEDEADAD---ERFM 215
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
+ +EF+ V +Y W A++I +E + R+++ PSG ++ LK+ CP+ H++++
Sbjct: 216 SAAQIVKEEFVSFVNYYSNVWYAAKTITLEAVKNRFNFHPSGRVIFLKKHCPYYDHVYDI 275
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
EEE+ I+ I + +Y +DR ++ R ++ + F R P P +RGL+ +EL K + IP
Sbjct: 276 EEELNIKDEILFCIY-NDRYQECRCGTISKKNESFTIRLPFPKSFRGLQKEELEKVSNIP 334
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAALK 344
G FVH SGF G +S L + A LK
Sbjct: 335 GLSFVHYSGFTSGGKSVDCLLKLVEATLK 363
>gi|156101165|ref|XP_001616276.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805150|gb|EDL46549.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 369
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 121/322 (37%), Positives = 190/322 (59%), Gaps = 7/322 (2%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G FH DE L M++ ++ +A+I+R+RD + LD D V+DVGGVYD
Sbjct: 49 KVIGTHSGRFHTDEILATVMLKFLPEYKDAKIIRTRDQEKLDQCDVVVDVGGVYDHEKKR 108
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
YDHHQK F +LSSAGL+YKH+ K++ K + + V+ L+ VY F+
Sbjct: 109 YDHHQKEFNGTLDAEHDIRLSSAGLIYKHYAKDVFRKGFGITD-EEKVNTLYDKVYTAFI 167
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
E++DA+DNGINQ++ +Y NT L RV + N ++ E + A+ E F +
Sbjct: 168 ESVDALDNGINQHEG--IAKYQINTTLQHRVNRFNPNFLESEADAD---ERFMSAAQIVK 222
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+EF+ V +Y W A+SI +E + +R+++ SG ++ LKR CP+ H++++EEE+ I+
Sbjct: 223 EEFVSFVNYYSNVWYAAKSITLEAVKDRFNFHKSGRVIFLKRHCPYYDHVYDIEEELNIK 282
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
I + +Y +DR ++ R ++ + F R P P +RGL+++EL K + IPG FVH
Sbjct: 283 DEILFCIY-NDRYQECRCGTISKKNESFTIRLPFPKSFRGLQNEELEKVSKIPGLSFVHY 341
Query: 323 SGFIGGNQSYGGALAMARAALK 344
SGF G ++ L + A LK
Sbjct: 342 SGFTSGGKNVDCLLKLVEATLK 363
>gi|167394080|ref|XP_001740838.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165894929|gb|EDR22761.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 335
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 121/327 (37%), Positives = 178/327 (54%), Gaps = 16/327 (4%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH +FHCD+ GC ++ TD + +++R+ + + L+ D V DVGGVYDPS + +D
Sbjct: 5 IGTHASNFHCDDVTGCVFLKFTDNYKQHRVIRTLNQEELNKCDLVFDVGGVYDPSINRFD 64
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR-----LFLAVYK 139
HHQ+GF E + + LS+ GL++KHFG+E++ + + + + L L +Y
Sbjct: 65 HHQRGFSETYSSQHTITLSACGLLFKHFGEEIVKNVIALIDPFGSITEEQICWLKLKIYN 124
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+ IDA DNGI+ T+ R ++TNLSSRV KLN E F + +
Sbjct: 125 TFVLPIDAGDNGIDPSPTELLFR--DSTNLSSRVAKLN-----AKGYGMMRLEQFLKAQE 177
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
L KEF++ V LP+ ++R++YD G IMVL C W+ L ELE+E
Sbjct: 178 LVKKEFIECVSSIYFRILPSFEPTRVAFSKRFEYDSEGRIMVLNEKCSWRDCLTELEDEE 237
Query: 260 K----IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
I+ + YV+ DD QW A ++SP F+ RKP P WRGLRD+ L I
Sbjct: 238 ASKNGIKKEVLYVISHDDTRDQWGCIATSLSPGSFKMRKPFPQPWRGLRDNALEDVCKIK 297
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAA 342
G +F H SGF+ N + G + MA+ A
Sbjct: 298 GAIFTHNSGFLACNATQEGMIKMAQLA 324
>gi|414879042|tpg|DAA56173.1| TPA: hypothetical protein ZEAMMB73_721076 [Zea mays]
Length = 352
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 122/264 (46%), Positives = 147/264 (55%), Gaps = 84/264 (31%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K VGTHN +FHC E +LD LD VLDVG P N
Sbjct: 31 KHVGTHNDNFHCVE-------------------------ILDTLDVVLDVGVSMIPVNIA 65
Query: 83 -------------YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
+DHHQKGF EVF HGF TKLSS GLVYKHFGKE+IAKEL V+E H D
Sbjct: 66 IIITRRASVSQHRHDHHQKGFSEVFEHGFDTKLSSVGLVYKHFGKEIIAKELGVNEDHED 125
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAER 189
VHRL+L++YK+F+EA+ AIDNGINQYD D+PP+Y
Sbjct: 126 VHRLYLSIYKSFVEALYAIDNGINQYDIDQPPKY-------------------------- 159
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPW- 248
+VRF+V+SWLPARSI++EC+ R + DPS EIMVL RFCP
Sbjct: 160 ------------------SVRFHVKSWLPARSIILECLLSRGNIDPSREIMVLDRFCPCY 201
Query: 249 -KLHLFELEEEMKIEPLIKYVLYE 271
+ H+ LEEE+K++PL KYVLY
Sbjct: 202 DQQHIENLEEELKVDPLTKYVLYH 225
>gi|449705296|gb|EMD45371.1| melanocyte prolifeating protein 1, putative [Entamoeba histolytica
KU27]
Length = 488
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 123/328 (37%), Positives = 181/328 (55%), Gaps = 18/328 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH +FHCD+ GC ++ TD++ + +++R+ + + L+ D V DVGGVYDPS + +D
Sbjct: 5 IGTHASNFHCDDVTGCVFLKFTDEYKHHRVIRTLNQEELNKCDLVFDVGGVYDPSINRFD 64
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR-----LFLAVYK 139
HHQ+GF E + + LS+ GL++KHFG+E++ + + + V L L +Y
Sbjct: 65 HHQRGFNETYSPQHTITLSACGLLFKHFGEEIVKNVIALIDPFGSVTEEQICWLKLKIYN 124
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+ IDA DNGI+ T+ R ++TNL SRV KLN E F + +
Sbjct: 125 TFVLPIDAGDNGIDPSPTELLFR--DSTNLPSRVAKLN-----AKGYGMMRLEQFLKAQE 177
Query: 200 LAGKEFLDTV-RFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
L KEF++ V Y R+ LP+ +R++YD G +MVL C W+ L ELE+E
Sbjct: 178 LVKKEFIECVSSIYFRT-LPSFEPTRLAFCKRFEYDSEGRVMVLNEKCSWRDCLTELEDE 236
Query: 259 MK----IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
I+ + YV+ DD QW A ++SP F+ RKP P WRGLRD EL I
Sbjct: 237 EASKNGIKKEVLYVISHDDTRGQWGCIATSLSPGSFKMRKPFPQPWRGLRDKELEDVCKI 296
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAA 342
G +F H SGF+ N + G + MA+ A
Sbjct: 297 KGAIFTHNSGFLACNATQEGMIKMAQLA 324
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/137 (39%), Positives = 76/137 (55%), Gaps = 6/137 (4%)
Query: 210 RFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK----IEPLI 265
++Y R+ LP+ + C +R++YD G +MVL C W+ L ELE+E I+ +
Sbjct: 343 KYYFRT-LPSLN-QPSCFCKRFEYDSEGRVMVLNEKCSWRDCLTELEDEEASKNGIKKEV 400
Query: 266 KYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGF 325
YV+ DD QW A ++SP F+ RKP P WRGLRD EL I G +F H SGF
Sbjct: 401 LYVISHDDTRGQWGCIATSLSPGSFKMRKPFPQPWRGLRDKELEDVCKIKGAIFTHNSGF 460
Query: 326 IGGNQSYGGALAMARAA 342
+ N + G + MA+ A
Sbjct: 461 LACNATQEGMIKMAQLA 477
>gi|17432235|gb|AAL39007.1|AF111805_1 MSTP024 [Homo sapiens]
gi|20810376|gb|AAH28904.1| C12orf10 protein [Homo sapiens]
gi|325464093|gb|ADZ15817.1| chromosome 12 open reading frame 10 [synthetic construct]
Length = 221
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 109/215 (50%), Positives = 144/215 (66%), Gaps = 6/215 (2%)
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAER 189
V L+ +Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W PDQ E
Sbjct: 2 VGTLYDKMYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPTWNHPDQDTE- 59
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPW 248
F++ MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPW
Sbjct: 60 --AGFKRAMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGACPW 117
Query: 249 KLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL 308
K HL+ LE + I +V+Y D G QWR+Q V P F+SR PLP WRGLRD+ L
Sbjct: 118 KEHLYHLESGLSPPVAIFFVIYTDQAG-QWRIQCVPKEPHSFQSRLPLPEPWRGLRDEAL 176
Query: 309 SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+ +GIPGC+FVH SGFIGG+++ GAL+MARA L
Sbjct: 177 DQVSGIPGCIFVHASGFIGGHRTREGALSMARATL 211
>gi|67468902|ref|XP_650444.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56467075|gb|EAL45062.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 335
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 123/328 (37%), Positives = 181/328 (55%), Gaps = 18/328 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTH +FHCD+ GC ++ TD++ + +++R+ + + L+ D V DVGGVYDPS + +D
Sbjct: 5 IGTHASNFHCDDVTGCVFLKFTDEYKHHRVIRTLNQEELNKCDLVFDVGGVYDPSINRFD 64
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR-----LFLAVYK 139
HHQ+GF E + + LS+ GL++KHFG+E++ + + + V L L +Y
Sbjct: 65 HHQRGFNETYSPQHTITLSACGLLFKHFGEEIVKNVIALIDPFGSVTEEQICWLKLKIYN 124
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+ IDA DNGI+ T+ R ++TNL SRV KLN E F + +
Sbjct: 125 TFVLPIDAGDNGIDPSPTELLFR--DSTNLPSRVAKLN-----AKGYGMMRLEQFLKAQE 177
Query: 200 LAGKEFLDTV-RFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
L KEF++ V Y R+ LP+ +R++YD G +MVL C W+ L ELE+E
Sbjct: 178 LVKKEFIECVSSIYFRT-LPSFEPTRLAFCKRFEYDSEGRVMVLNEKCSWRDCLTELEDE 236
Query: 259 MK----IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
I+ + YV+ DD QW A ++SP F+ RKP P WRGLRD EL I
Sbjct: 237 EASKNGIKKEVLYVISHDDTRGQWGCIATSLSPGSFKMRKPFPQPWRGLRDKELEDVCKI 296
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAA 342
G +F H SGF+ N + G + MA+ A
Sbjct: 297 KGAIFTHNSGFLACNATQEGMIKMAQLA 324
>gi|123473905|ref|XP_001320138.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121902937|gb|EAY07915.1| hypothetical protein TVAG_064620 [Trichomonas vaginalis G3]
Length = 358
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 133/329 (40%), Positives = 194/329 (58%), Gaps = 20/329 (6%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + H+ +FH D+A CF+ T++F A+I+R+R+ +++D DAV DVGG+YDP
Sbjct: 3 KTIVVHDQTFHADDAFACFVFLHTEEFRGAKIIRTRNQEIIDKADAVADVGGIYDPEKRR 62
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK---ELNVDEG-HPDVHRLFLAVY 138
+DHHQ F+ F G ++AG VY HFGKE I E N D G H D L+ +Y
Sbjct: 63 FDHHQLSFDFKFP-GCEIPCAAAGAVYYHFGKEAIKNILAENNRDAGEHLDY--LWNEMY 119
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW--TEPDQSAERENEAFQQ 196
F++ +DAIDNG+N Y TD Y NT +SSR+ LN W +PD + F +
Sbjct: 120 FVFLKEVDAIDNGVNMYPTDVEVTYSINTGISSRIALLNPHWKTKDPDPYGQ-----FMK 174
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL--FE 254
++L GKEF D + F + S +PA +V +R++ D SG+IM+L C ++ HL FE
Sbjct: 175 AIELIGKEFTDRLLFMLDSQVPAMDVVKRAYYDRFNVDESGKIMILSESCGFEKHLKRFE 234
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
EEE + +P + YV+ + + G + ++AV FE RK LP + GLRD+ELS ++GI
Sbjct: 235 EEEENEDKPRVLYVISQRNDG-SYNIKAVGTGKG-FELRKALP--FPGLRDEELSTKSGI 290
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
PG +FVH SGF+G + A+ A+ AL
Sbjct: 291 PGAIFVHKSGFLGAYKELDQAVQFAKLAL 319
>gi|401411913|ref|XP_003885404.1| hypothetical protein NCLIV_057990 [Neospora caninum Liverpool]
gi|325119823|emb|CBZ55376.1| hypothetical protein NCLIV_057990 [Neospora caninum Liverpool]
Length = 409
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 142/404 (35%), Positives = 200/404 (49%), Gaps = 75/404 (18%)
Query: 15 SSPSQTPLKRV-GTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVG 73
++ S TP + V GTH+G FH DE L M+ ++ NA+IVRSRDP VL D V+DVG
Sbjct: 2 TAASDTPRRPVIGTHSGKFHEDEVLATVMLLSLPEYQNARIVRSRDPAVLATCDIVVDVG 61
Query: 74 GVYDPSNDCYDHHQKGF----------------------EEVFGHGFS------------ 99
Y P +DHHQK F +EV G S
Sbjct: 62 AEYAPEKLRFDHHQKSFTATFYDAIPGVNTSDRVDTNGGKEVSGEASSKSSRAAERKTAV 121
Query: 100 TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDK 159
TKLSSAGL+YKHFGKE++ V + +F +Y +F+EA+DAIDNG++
Sbjct: 122 TKLSSAGLIYKHFGKEILRHRFGV-SCPALLDCVFHRLYTSFVEAVDAIDNGVSIAADGS 180
Query: 160 PPRYVNNTNLSSRVGKLNLDW-------TEPDQSAERENEA-----------------FQ 195
Y ++TNLS RV + W T P R ++A F+
Sbjct: 181 SLLYKDSTNLSCRVSRCYPPWNLEDLRTTRPHLKRLRTDDANLYPADVEGGSVEEVLGFR 240
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
+ M L EF D V V WLPAR +V++ + ER PSG I+ LK++CP++ HL E+
Sbjct: 241 KAMKLVDSEFEDAVSSIVDVWLPARQVVLDAVQERTKVHPSGRIIKLKQWCPFQEHLHEI 300
Query: 256 EEEMKIEPLIKYVLYEDDRG---KQWRVQAVAVSPDRFESRKPLPAQWRGLRDDEL---- 308
E+E++++ + + LY D + + WRV AV F+SR PL + RGLRD EL
Sbjct: 301 EDELQVDAPVLFCLYPDIKDGIVEGWRVYAVTEKDQLFKSRLPLLEKLRGLRDSELGEAV 360
Query: 309 --------SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
S++ + H +GFIGG ++ A AMA+ +K
Sbjct: 361 LTDERTDKSEDMKSSDFIHCHATGFIGGAKTLPAATAMAKLTMK 404
>gi|403223574|dbj|BAM41704.1| uncharacterized protein TOT_040000085 [Theileria orientalis strain
Shintoku]
Length = 328
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 120/329 (36%), Positives = 189/329 (57%), Gaps = 26/329 (7%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVY------- 76
++GTHNG FH DEAL +M+RL ++ NA++VR+RDP VLD+ + ++DVGGVY
Sbjct: 2 KIGTHNGYFHADEALAIYMLRLLPEYKNAEVVRTRDPAVLDNCEIIVDVGGVYERDNEDS 61
Query: 77 DPSNDCYDHHQKGFEEVFGHGFS-TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
DPS YDHHQ+ F + F TKLSSAGLVYKHF K + + NV + V L+
Sbjct: 62 DPSKHRYDHHQREFNDHFDEEHKVTKLSSAGLVYKHFAKRIFKEVYNVSDDET-VEYLYT 120
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
+Y F+E++DA DNG+ +D +Y +T+L +RV +LN W D+ + +E F
Sbjct: 121 TIYDRFIESMDANDNGVAL--SDGELKYKISTDLPNRVSRLNPSW--KDKEVKDVDERFM 176
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
+ ++L G++F V + LPA++ + + + +R++ SG+++ + + CP+ L++
Sbjct: 177 KAVELTGQDFDYFVSNELNVILPAKTHLEKALEKRFETHKSGKVIEMIKSCPFSGFLYKH 236
Query: 256 EEE--MKIEPLIKYVLYEDDRGKQW-----------RVQAVAVSPDRFESRKPLPAQWRG 302
E+E + + + Y L D+ QW R + +F+SR P P RG
Sbjct: 237 EDEHGLNDKDRVLYYLTFDESSNQWLEGRGNDQVCRRSTCIKEKGTQFKSRLPFPEHLRG 296
Query: 303 LRDDELSKEAGIPGCVFVHMSGFIGGNQS 331
LRD+ L K +GIPG F+H +GF G ++
Sbjct: 297 LRDENLEKASGIPGLTFIHATGFTCGGKT 325
>gi|226471472|emb|CAX70817.1| hypothetical protein [Schistosoma japonicum]
Length = 335
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 163/293 (55%), Gaps = 34/293 (11%)
Query: 76 YDPSNDCYDHHQKGFEEVFGHGFSTK-----LSSAGLVYKHFGKELIAKELNVDEGHPDV 130
YDP +DHHQK F + F K LSSAGLVY HFGK +++ ++ H +
Sbjct: 15 YDPQTYRFDHHQKDFSLTWSKYFDVKMWDVKLSSAGLVYVHFGKRVLSLLTGLEINHEVL 74
Query: 131 HRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE 190
++F+ VY++F+ ID DNG Q + P +Y NT L RV +LN W + +E
Sbjct: 75 EKIFMRVYESFILEIDGQDNGTPQ--SKMPLKYNINTGLYCRVRRLNPWW---NSGSEES 129
Query: 191 NEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKL 250
AFQ+ ++L +EFLDTV ++ W PAR IV + ++ R D DPS I+VL R CPWK
Sbjct: 130 ESAFQRAINLVSREFLDTVDYFANCWWPARHIVAQAMSCREDVDPSKMIIVLDRSCPWKS 189
Query: 251 HLFELEEEMKIE--------------------PLIKYVLYEDDRGKQWRVQAVAVSPDRF 290
HLF+LE E ++E P I +V+ D W VQAV ++F
Sbjct: 190 HLFDLEREERMETVVYPEPLHRASYRPVPKFPPQILFVVLPSD--GNWVVQAVP--KEKF 245
Query: 291 ESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
E R P P WR LRDD+L GIPGC+FVH SG +G N++ GA+ MAR+ +
Sbjct: 246 EIRLPFPIDWRSLRDDQLCAITGIPGCIFVHNSGHLGSNKTRDGAIEMARSVI 298
>gi|440292422|gb|ELP85627.1| hypothetical protein EIN_409160 [Entamoeba invadens IP1]
Length = 330
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/328 (36%), Positives = 179/328 (54%), Gaps = 17/328 (5%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V TH +FHCD+ ++ T+ + + +++R+ DP L+ D V D+GGVYDP + YD
Sbjct: 5 VATHASNFHCDDVSASVLLGYTNLYRHHRLIRTLDPIELNRCDLVFDIGGVYDPDTNRYD 64
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD------VHRLFLAVY 138
HHQKGF E + + +LSS GL+YKHFGKE+IA ++ + + + L L +Y
Sbjct: 65 HHQKGFVETYSSQHTIRLSSCGLIYKHFGKEIIANVIDFIDPFGEKLTADQLSWLELKIY 124
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+F+ AIDA DNGI+ Y D P + ++T L +RV KLN + E F +
Sbjct: 125 NSFIIAIDANDNGIDPY--DGTPLFRDSTTLPARVAKLN-----SKGNGMLRMEQFMKAQ 177
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK--LHLFELE 256
+L EF++ +P+ V + R YD G I+V K C W+ L E E
Sbjct: 178 ELVRSEFIEFFSAIYFKVIPSLPPVSDVFKTREKYDAEGRILVFKNKCNWRDFLEDLETE 237
Query: 257 EEMKIEPL--IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
E +K L + YV+ DD QW AV ++ F+ RKP P WRGLRD+++S+ G+
Sbjct: 238 EALKTGKLRQVLYVVSFDDTRGQWGCIAVPIAQGSFKMRKPFPLPWRGLRDEKMSEVTGV 297
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAA 342
G VF H SGF+ +++Y G L + + A
Sbjct: 298 EGSVFTHNSGFLCCHKTYDGMLKLTQLA 325
>gi|237830813|ref|XP_002364704.1| melanocyte proliferating protein 1, putative [Toxoplasma gondii
ME49]
gi|211962368|gb|EEA97563.1| melanocyte proliferating protein 1, putative [Toxoplasma gondii
ME49]
gi|221507585|gb|EEE33189.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 419
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 142/417 (34%), Positives = 201/417 (48%), Gaps = 89/417 (21%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
+T+ + P+ +GTH+G FH DE L M+ +F NA++VRSRDP VL D V+DV
Sbjct: 2 TTAEAPRRPV--IGTHSGKFHEDEVLATVMLLSLPEFQNARVVRSRDPAVLATCDIVVDV 59
Query: 73 GGVYDPSNDCYDHHQKGFEEVF-----------------------------------GHG 97
G YDP +DHHQK F F G+G
Sbjct: 60 GAEYDPEKRRFDHHQKSFTLTFYGKSPTASDTLDSGNKDAGKDDGGQVSSERRRDAAGNG 119
Query: 98 ----------FSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV-HRLFLAVYKNFMEAID 146
TKLSSAGL+YKHFGK+++ + P++ +F +Y +F+EA+D
Sbjct: 120 KVEEKACKKAAVTKLSSAGLIYKHFGKDILRHRFGLTS--PELLDCVFERLYTSFVEAVD 177
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW-------TEPDQSAERENEA------ 193
AIDNG++ P Y ++TNLSSRV + W T P R EA
Sbjct: 178 AIDNGVSIASNGSPLLYKDSTNLSSRVSRCYPPWNLEELRATRPQVKRLRTAEADLYPAE 237
Query: 194 -----------FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL 242
F++ M L EF D V V WLPAR +V++ + ER SG I+ L
Sbjct: 238 VEGGSVEEVHGFRKAMKLVDAEFEDAVSSIVDVWLPARQVVLDAVQERTKVHSSGRIVKL 297
Query: 243 KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRG---KQWRVQAVAVSPDRFESRKPLPAQ 299
K++CP++ HL E+E E++++ I + LY D + + WRV AV F+SR PL +
Sbjct: 298 KQWCPFQEHLHEIEAELQLDYPILFCLYPDLKNGTIEGWRVYAVNEKDQLFKSRLPLLEK 357
Query: 300 WRGLRDDEL------------SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RGLRD EL S++ + H +GFIGG ++ A+AMA +K
Sbjct: 358 LRGLRDSELGKAVLTDERTQKSEDMSDSDFIHCHATGFIGGTKTLSAAMAMATLTMK 414
>gi|221487794|gb|EEE26026.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 419
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 142/417 (34%), Positives = 201/417 (48%), Gaps = 89/417 (21%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
+T+ + P+ +GTH+G FH DE L M+ +F NA++VRSRDP VL D V+DV
Sbjct: 2 TTAEAPRRPV--IGTHSGKFHEDEVLATVMLLSLPEFQNARVVRSRDPAVLATCDIVVDV 59
Query: 73 GGVYDPSNDCYDHHQKGFEEVF-----------------------------------GHG 97
G YDP +DHHQK F F G+G
Sbjct: 60 GAEYDPEKRRFDHHQKSFTLTFYGESPTASDTLDSGNKDAGKDDGGQVSSERRRDAAGNG 119
Query: 98 ----------FSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV-HRLFLAVYKNFMEAID 146
TKLSSAGL+YKHFGK+++ + P++ +F +Y +F+EA+D
Sbjct: 120 KVEEKACKKAAVTKLSSAGLIYKHFGKDILRHRFGLTS--PELLDCVFERLYTSFVEAVD 177
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW-------TEPDQSAERENEA------ 193
AIDNG++ P Y ++TNLSSRV + W T P R EA
Sbjct: 178 AIDNGVSIASNGSPLLYKDSTNLSSRVSRCYPPWNLEELRATRPQVKRLRTAEADLYPAE 237
Query: 194 -----------FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL 242
F++ M L EF D V V WLPAR +V++ + ER SG I+ L
Sbjct: 238 VEGGSVEEVHGFRKAMKLVDAEFEDAVSSIVDVWLPARQVVLDAVQERTKVHSSGRIVKL 297
Query: 243 KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRG---KQWRVQAVAVSPDRFESRKPLPAQ 299
K++CP++ HL E+E E++++ I + LY D + + WRV AV F+SR PL +
Sbjct: 298 KQWCPFQEHLHEIEAELQLDYPILFCLYPDLKNGTIEGWRVYAVNEKDQLFKSRLPLLEK 357
Query: 300 WRGLRDDEL------------SKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
RGLRD EL S++ + H +GFIGG ++ A+AMA +K
Sbjct: 358 LRGLRDSELGKAVLTDERTQKSEDMSDSDFIHCHATGFIGGTKTLSAAMAMATLTMK 414
>gi|156089317|ref|XP_001612065.1| MYG1 protein [Babesia bovis T2Bo]
gi|154799319|gb|EDO08497.1| MYG1 protein, putative [Babesia bovis]
Length = 321
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 120/326 (36%), Positives = 184/326 (56%), Gaps = 13/326 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG FHCDEAL +++L +F +A++VR+RD L DAV+DVGG +DP+ +
Sbjct: 2 KIGTHNGCFHCDEALAVSLLKLLPEFKDAEVVRTRDESTLSQCDAVVDVGGKFDPAKLRF 61
Query: 84 DHHQKGFEEVFGHGFS-TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
DHHQ F+E F + T+LSSAGLV+K+FGK +I + + + D+ ++ VY + +
Sbjct: 62 DHHQNEFDEYFDENHTVTRLSSAGLVHKYFGKRIIREVYGITD-ETDIEEVYQRVYSSLI 120
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
E++DAIDNG+ D P +Y +T+LS+RV +LN W + F+ M L
Sbjct: 121 ESLDAIDNGVAV--ADGPIKYEISTHLSARVARLNPSWV---DVDVDVDSRFRDAMSLTL 175
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
EF VR + L A+S E R++ SG ++ R P+ +HL EE ++
Sbjct: 176 SEFDYYVRNTIDVHLAAKSKFEEVYRNRFNVHESGLVIETPRGMPF-VHLLHSLEEKELT 234
Query: 263 PLIK----YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
P+ K Y+ Y DD KQ+R + + +F SR+P P + GLRD++L + +GI G
Sbjct: 235 PVEKRVAFYITY-DDATKQFRCSCIREADQQFTSRRPFPKRLCGLRDEDLVEASGIDGLT 293
Query: 319 FVHMSGFIGGNQSYGGALAMARAALK 344
F+H +GF G + L + LK
Sbjct: 294 FIHRAGFTCGGLTKQSILELINLTLK 319
>gi|240274150|gb|EER37668.1| MYG1 protein [Ajellomyces capsulatus H143]
Length = 371
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 168/316 (53%), Gaps = 39/316 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + ++ +VR+RDP +L V+DVGG YDPS + YD
Sbjct: 17 IGTHNGHFHADEALAVYLLRLLPTYTSSPLVRTRDPSLLATCHTVVDVGGEYDPSRNRYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F STKLSSAGL+Y HFGK +IA+ ++ HPDV L+ +Y +F+EA
Sbjct: 77 HHQRTFTATFPQ-HSTKLSSAGLIYLHFGKAIIAQHTSLPPDHPDVATLYEKLYSDFIEA 135
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSR-------VGKLNLDWTEPDQSA---ERENEAF 194
+DA DNGI+ YD +N + R +G L D PD +A + E+ F
Sbjct: 136 LDANDNGISVYDNAA----LNAAGVRKRFRDSGISLGALVGDLNHPDPTATGPQDEDALF 191
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR--FCPWKLHL 252
+ G FL +R SWLPAR+ V E R+D SG I+V + K HL
Sbjct: 192 ENASKFIGDAFLRKLRVASASWLPARATVSEAYKARFDTHASGRILVFAQGGGSRGKKHL 251
Query: 253 FELEEEMKIEPL--------------------IKYVLYEDDRG--KQWRVQAVAVSPDRF 290
F LEEE + YVLY + +WRVQ V V+ F
Sbjct: 252 FVLEEEAAAAAAAAGAEAEAGKIGGAHADRRQVYYVLYPESTSPDAKWRVQCVPVTDTSF 311
Query: 291 ESRKPLPAQWRGLRDD 306
ESR+PLP W G+RD+
Sbjct: 312 ESRRPLPEAWCGIRDE 327
>gi|123409181|ref|XP_001303342.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884714|gb|EAX90412.1| hypothetical protein TVAG_488660 [Trichomonas vaginalis G3]
Length = 313
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 123/324 (37%), Positives = 186/324 (57%), Gaps = 19/324 (5%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ H+G FH ++ L F+++ D++ NA+++R+RD +V+++ D V DVGGVYD Y
Sbjct: 2 KICVHSGKFHANDVLSVFLLKSLDEYKNAEVIRTRDMEVINNCDIVCDVGGVYDHDKKRY 61
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEL---IAKELNVDEGHPDVHRLFLAVYKN 140
DHHQ F + + LSS GL+Y H+G+ I K+ N D G + L ++Y N
Sbjct: 62 DHHQTNFFMTYPNR-KVPLSSCGLIYLHYGERAIREILKKNNRDAGKY-IQFLIDSMYDN 119
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F++ IDAIDNG +Q + + +YV +++SSR+ LN+ +E + F Q +DL
Sbjct: 120 FVQEIDAIDNGFSQVEG-RTSKYVITSDISSRIDYLNI-------RSEENMKEFNQAIDL 171
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
G+EF + Y +PA + + R+D DPSG IMVL FC + HL +E+
Sbjct: 172 IGEEFTFRLLRYCDCDVPAAEVTQKAYDSRFDVDPSGHIMVLSEFCDPRPHLKYIEKPD- 230
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+PLI + L +D WR+Q V P F+ R LP + GL +++LSK GIPG F
Sbjct: 231 -QPLIYFYLRQDG-NLPWRIQTVEY-PGSFKPRVRLP--YGGLMEEDLSKACGIPGGKFC 285
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H +GF+G +Y GALA AR AL+
Sbjct: 286 HKTGFLGVFYTYEGALAFARLALE 309
>gi|71029688|ref|XP_764487.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351441|gb|EAN32204.1| hypothetical protein, conserved [Theileria parva]
Length = 318
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/329 (37%), Positives = 182/329 (55%), Gaps = 34/329 (10%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
S + +KR+GTHNG FH DEAL +M++L ++ +A++VR+RDP VL+ D V+DVG VYD
Sbjct: 7 SYSRMKRIGTHNGFFHADEALAVYMLKLLPEYRDAEVVRTRDPSVLETCDVVVDVGAVYD 66
Query: 78 PSNDCYDHHQKGFEEVFGHGFS-TKLSSAGLVYKHFGKELIAKELNV-DEGHPDVHRLFL 135
PS YDHHQ+ F++ F TKLSSAGLVYKHF K L + V DE D L+
Sbjct: 67 PSRHRYDHHQRDFKDFFDDEHEVTKLSSAGLVYKHFSKRLFREVYKVMDEETVDY--LYK 124
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
++Y F+EA+DAIDNG+ D D +Y NT+LS+RV +LN W + D + +E F
Sbjct: 125 SIYDKFIEAMDAIDNGVPMCDGD--LKYKTNTDLSTRVSRLNPSWIDSDVAD--VDERFM 180
Query: 196 QGMDLAGKEF----------LDTVRFYVRSWLPARSIVVECI---AERYDYDPSGEIMVL 242
+ ++L G+EF L + RS+ ++ ++ + + + S E+ +
Sbjct: 181 KAVELTGQEFEHFVHNALNVLFMLLNLFRSFYQQKAYFIKHLTIGSHAHSRLASVEVNYI 240
Query: 243 KRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRG 302
+ FC ++ +MKI+ I R + +F+SR P P RG
Sbjct: 241 RNFCTNMKT--KITSQMKIDSSI-----------TRRSTCIREKGSQFKSRLPFPEHLRG 287
Query: 303 LRDDELSKEAGIPGCVFVHMSGFIGGNQS 331
LRD EL K +GIP F+H +GF G ++
Sbjct: 288 LRDQELEKASGIPDLTFIHATGFTCGGKT 316
>gi|167393772|ref|XP_001740702.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165895072|gb|EDR22876.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 318
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 174/335 (51%), Gaps = 35/335 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+G H ++HCD+ G M++ +F N Q++R+ D +L+ V D+GGVYD YD
Sbjct: 2 IGVHASNYHCDDVTGTIMLKFVKEFKNYQLIRTLDMDILNKCTLVFDIGGVYDHKIKRYD 61
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV-----HRLFLAVYK 139
HHQ+GF+E F + L GL++KH+G E I K + + H + +L + VY
Sbjct: 62 HHQRGFKETFSPAHNILLCGCGLLFKHYGNE-IVKNIIEEYQHETITEEVAEKLKILVYN 120
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+ IDA DNGI+ + Y +NT LS+RV LN + F++ +
Sbjct: 121 YFVMPIDANDNGIDV--SCGELLYRDNTTLSARVAHLN-----------ETKQPFEKAQE 167
Query: 200 LAGKEFLDTVRF---YVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK--LHLFE 254
L EF+D V V LP+ + C RY PSG+I+V + PWK L+L E
Sbjct: 168 LVQPEFIDAVLTCYQIVSQHLPS---LQHCFETRYSVHPSGKILVNTQAVPWKESLNLLE 224
Query: 255 LEEE----MKIEPLIKYVLYED-DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
EE K E L YVL D RG QW+ AV+ P + K P QWRGLR+ EL
Sbjct: 225 TIEESTNGKKCEVL--YVLQPDLGRGTQWKCTAVS-KPFSYGCLKNFPEQWRGLRNKELE 281
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
K GI G +F H SGF+ N++Y G L MA +L+
Sbjct: 282 KVCGIKGSIFCHSSGFLACNETYEGMLQMAITSLE 316
>gi|159117897|ref|XP_001709168.1| MYG1 protein [Giardia lamblia ATCC 50803]
gi|157437283|gb|EDO81494.1| MYG1 protein [Giardia lamblia ATCC 50803]
Length = 361
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/333 (32%), Positives = 182/333 (54%), Gaps = 20/333 (6%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
S TP K + TH+G FH DE L + I+ + +A+I+R+RDP+ L+ +DVG VYD
Sbjct: 26 SGTPFKIIATHDGKFHWDECLAVWFIKQLPDYIDARIMRTRDPEELEFATITVDVGDVYD 85
Query: 78 PSNDCYDHHQKGFEEVFGHGFS-TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
C+DHH KGF+ F + LSSAGL+Y H+G+ ++ + +G ++ L+
Sbjct: 86 SEKLCFDHHMKGFQVFFSDAYKDICLSSAGLIYVHYGRSILKQLFPRLDGPTELEFLYHY 145
Query: 137 VYKNFMEAIDAIDNGI-NQYDTDKPP---RYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
VY N++ +DA+DNG+ + Y+TD R+ + T++S+R+ ++ +E
Sbjct: 146 VYDNYIRVVDAVDNGVESHYNTDGSEPICRWTDPTSMSARITRIY---------EIAGDE 196
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
F +AG++F++ ++ V ++LPAR +V I + + PSG I+ L P++ +
Sbjct: 197 GFGTAYKMAGQDFMEWIQTVVITYLPARRELVRAIEQAHKLHPSGRIIELSSPGPYQQFI 256
Query: 253 FELEEEMKI------EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
+E ++ + I ++++ DD +R++ V+ F R LP WRGLRD+
Sbjct: 257 HSVEHDLSLCKDGPGGDTILFLIHPDDYQAGYRIRTVSTQKGSFAFRLGLPEAWRGLRDE 316
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMA 339
L +G G FVH SGF+GG + ALA+
Sbjct: 317 ALQNVSGYQGMSFVHKSGFLGGAKDRDTALAVV 349
>gi|70925391|ref|XP_735396.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56509018|emb|CAH81038.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 244
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 149/251 (59%), Gaps = 7/251 (2%)
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
+DVGGVYD YDHHQ+ F E + +LSSAGL+YKH+GKE++ K N+ + H
Sbjct: 1 VDVGGVYDHEKKRYDHHQREFNEGLDENHNIRLSSAGLIYKHYGKEVLRKGFNITDEHK- 59
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAER 189
V+ L+ +Y +E+IDA+DNGINQY+ + +Y NT L RV + N ++ E +
Sbjct: 60 VNILYDKIYTTLIESIDALDNGINQYEGE--AKYQINTTLQHRVNRFNPNFLEDNVD--- 114
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK 249
ENE F ++ EF D V +Y W A++I +E + RY++ SG ++ LK+ CP+
Sbjct: 115 ENERFMLASNIVKDEFCDFVNYYSNVWYEAKTITLEAVKNRYNFHKSGRVIFLKKHCPYS 174
Query: 250 LHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
H++ +E E+ I+ I + +Y+ DR R ++ + F R P PA +RGLR+++L
Sbjct: 175 DHIYNIERELNIKDQILFCIYK-DRYNNCRCGTISKENEGFNLRLPFPASFRGLRNEDLV 233
Query: 310 KEAGIPGCVFV 320
KE+ I G FV
Sbjct: 234 KESNIEGLTFV 244
>gi|407036561|gb|EKE38226.1| melanocyte prolifeating protein 1, putative [Entamoeba nuttalli
P19]
Length = 318
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 121/332 (36%), Positives = 176/332 (53%), Gaps = 29/332 (8%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+G H ++HCD+ G M++ +F + Q++R+ D +L+ V D+GGVYD YD
Sbjct: 2 IGVHASNYHCDDVTGTIMLKFVKEFKDCQLIRTLDMDILNKCTLVFDIGGVYDHKIKRYD 61
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEL---IAKELNVDEGHPDV-HRLFLAVYKN 140
HHQ+GF+E F + L GL++KH+G E+ I +E + DV +L + VY
Sbjct: 62 HHQRGFKETFSPAHNILLCGCGLLFKHYGNEIVKNIIEEYQHEIITEDVAEKLKVLVYNY 121
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F+ IDA DNGI+ + Y +NT LS+RV LN + +P F++ +L
Sbjct: 122 FVMPIDANDNGIDV--SYGELLYKDNTTLSARVAHLN-EIKQP----------FEKAQEL 168
Query: 201 AGKEFLDTVRF---YVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK--LHLFEL 255
EF++ V V LP+ + +C RY PSG+I+V PWK L+L E
Sbjct: 169 VQPEFIEAVLTCYQIVSQHLPS---LQQCFETRYSVHPSGKILVNTLAVPWKESLNLLET 225
Query: 256 EEEMKI--EPLIKYVLYED-DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
EE K + + YVL D RG QW+ AV+ P + K P QWRGLR+ EL
Sbjct: 226 IEESKTGKKCEVLYVLQPDLGRGTQWKCTAVS-KPFSYGCLKNFPEQWRGLRNKELENVC 284
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GI G +F H SGF+ N++Y G L MA +L+
Sbjct: 285 GIKGSIFCHSSGFLACNETYEGMLQMAITSLE 316
>gi|253747148|gb|EET02012.1| MYG1 protein [Giardia intestinalis ATCC 50581]
Length = 361
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 110/347 (31%), Positives = 188/347 (54%), Gaps = 23/347 (6%)
Query: 7 SSSPAYSTSSPSQ---TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL 63
S+S ST SP Q T LK + TH+G FH DE L + I+ + +A+I+R+R P+ L
Sbjct: 12 STSSILSTDSPKQGAGTLLKIIATHDGKFHWDECLAVWFIKQIPDYIDARIMRTRSPEEL 71
Query: 64 DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGF-STKLSSAGLVYKHFGKELIAKELN 122
+ +DVG VYD C+DHH KGF+ F LSSAGL+Y H+G++++ +
Sbjct: 72 EFATITVDVGDVYDLEKLCFDHHMKGFQVFFSDAHRDISLSSAGLIYMHYGRDILKQLFP 131
Query: 123 VDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQY---DTDKPP-RYVNNTNLSSRVGKLNL 178
+G ++ L+ VY+N++ +DA+DNG+ + D +P R+ + T++S+R+ ++
Sbjct: 132 RLDGPTELEFLYQYVYENYVRVVDAVDNGVESHYNEDGSEPICRWTDPTSMSARITRIY- 190
Query: 179 DWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGE 238
++ F +AG++F++ ++ V ++LPAR + + + + SG
Sbjct: 191 --------EIAGDDGFSTAYKMAGQDFMEWIQNVVITYLPARRELAKAMEQAQQIHQSGR 242
Query: 239 IMVLKRFCPWKLHLFELEEEMKIEP------LIKYVLYEDDRGKQWRVQAVAVSPDRFES 292
I+ L P++ + +E ++ + I ++++ DD +R++ V+V F
Sbjct: 243 IIELAAPGPYQQLIHSVEHDLSLCTDGPGGNTILFIIHPDDYQTGYRIRTVSVQKGSFAF 302
Query: 293 RKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMA 339
R LP WRGLRD+ L + +G G FVH SGF+GG Q ALA+
Sbjct: 303 RLGLPEAWRGLRDEALQRVSGYQGMSFVHKSGFLGGAQDRDTALAVV 349
>gi|67481709|ref|XP_656204.1| melanocyte prolifeating gene 1 [Entamoeba histolytica HM-1:IMSS]
gi|56473391|gb|EAL50820.1| melanocyte prolifeating gene 1, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449709650|gb|EMD48876.1| melanocyte proliferating protein 1, putative [Entamoeba histolytica
KU27]
Length = 318
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 119/332 (35%), Positives = 178/332 (53%), Gaps = 29/332 (8%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+G H ++HCD+ G M++ +F + +++R+ D +L+ V D+GGVY+ YD
Sbjct: 2 IGVHASNYHCDDVTGTIMLKFVKEFKDCKLIRTLDMDILNKCTLVFDIGGVYNHKLKRYD 61
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEL---IAKELNVDEGHPDV-HRLFLAVYKN 140
HHQ+GF+E F + L GL++KH+G E+ I +E + DV +L + VY
Sbjct: 62 HHQRGFKETFSSAHNILLCGCGLLFKHYGNEIVKNIIEEYKHEIITEDVAEKLKVLVYNY 121
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
F+ IDA DNGI+ + Y +NT LS+RV LN + +P F++ +L
Sbjct: 122 FVMPIDANDNGIDV--SYGELLYKDNTTLSARVAHLN-EIKQP----------FEKAQEL 168
Query: 201 AGKEFLDTVRF---YVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK--LHLFEL 255
EF++ V V LP+ + +C RY PSG+I+V + PWK L+L E+
Sbjct: 169 VQPEFIEAVLTCYQIVSQHLPS---LQQCFETRYSVHPSGKILVNTQAVPWKESLNLLEI 225
Query: 256 EEEMKI--EPLIKYVLYED-DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
EE K + + YVL D RG QW+ AV+ P + K P QWRGLR+ EL
Sbjct: 226 IEESKTGKKCEVLYVLQPDLGRGTQWKCTAVS-KPFSYGCLKNFPEQWRGLRNKELEDVC 284
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GI G +F H SGF+ N++Y G L MA +L+
Sbjct: 285 GIKGSIFCHSSGFLACNETYEGMLQMAITSLE 316
>gi|308158311|gb|EFO61057.1| MYG1 protein [Giardia lamblia P15]
Length = 361
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 105/333 (31%), Positives = 179/333 (53%), Gaps = 20/333 (6%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
S TP K + TH+G FH DE L + I+ + +A+I+R+RDP+ L+ +DVG VYD
Sbjct: 26 SGTPFKIIATHDGKFHWDECLAVWFIKQLPDYIDARIMRTRDPEELEFATITVDVGDVYD 85
Query: 78 PSNDCYDHHQKGFEEVFGHGFS-TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
C+DHH KGF+ F + LSSAGL+Y H+G+ ++ + +G ++ L+
Sbjct: 86 SEKLCFDHHMKGFQVFFSDAYKDVCLSSAGLIYMHYGRNILKQLFPRLDGPTELEFLYHY 145
Query: 137 VYKNFMEAIDAIDNGI-NQYDTDKPP---RYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
VY N++ +DA+DNG+ + Y+ D R+ + T++S+R+ ++ +E
Sbjct: 146 VYDNYIRVVDAVDNGVESHYNADGSEPVCRWTDPTSMSARITRIY---------EIAGDE 196
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
F +AG++F++ ++ V ++LPAR + I + + SG I+ L P++ +
Sbjct: 197 GFGTAYKMAGQDFMEWIQTVVITYLPARKELTRAIEQAHKLHSSGRIIELSSSGPYQQFI 256
Query: 253 FELEEEMKI------EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
+E ++ + I ++++ DD +R++ V+ F R LP WRGLRD+
Sbjct: 257 HSVERDLSLCKDGPGGDTILFLIHPDDYQTGYRIRTVSTQKGSFAFRLGLPEAWRGLRDE 316
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMA 339
L +G G FVH SGF+GG + ALA+
Sbjct: 317 ALQSISGYQGMSFVHKSGFLGGAKDRKTALAVV 349
>gi|84997517|ref|XP_953480.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304476|emb|CAI76855.1| hypothetical protein, conserved [Theileria annulata]
Length = 344
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 111/304 (36%), Positives = 172/304 (56%), Gaps = 27/304 (8%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP--- 78
+K++GTHNG FH DEAL +M++L ++ +A++VR+RDP+VL+ D V+DVG VY
Sbjct: 1 MKKIGTHNGFFHSDEALAVYMLKLLPEYRDAEVVRTRDPEVLETCDVVVDVGAVYGNYLC 60
Query: 79 ----------------SNDCYDHHQKGFEEVFGHGFS-TKLSSAGLVYKHFGKELIAKEL 121
N YDHHQ+ F++ F TKLSSAGLVYKHF K L KE+
Sbjct: 61 ITDFQTQRDIGKVEALKNFRYDHHQRDFKDFFDEEHEVTKLSSAGLVYKHFSKRLF-KEV 119
Query: 122 NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
V L+ ++Y F+E +DAIDNG+ D + +Y NT+LS RV +LN W
Sbjct: 120 YKVADEETVEYLYKSIYDKFIEGMDAIDNGVPMCDGE--LKYKTNTDLSCRVSRLNPSWI 177
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
+ D + +E F + ++L G+EF V + LPA++++ + + +R+ SG+++
Sbjct: 178 DSDVAD--VDERFMKAVELTGQEFEYFVNNALNVILPAKALLHKALDQRFQTHRSGKVIE 235
Query: 242 LKRFCPWKLHLFELEEEMKI--EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQ 299
+++ CP+ L++ EEE + E + + L DD KQWR + +F+SR P P
Sbjct: 236 VRKSCPFSEFLYKHEEENNVPDEDRVLFYLTFDDASKQWRSTCIREKGSQFKSRLPFPEH 295
Query: 300 WRGL 303
RG+
Sbjct: 296 LRGI 299
>gi|56756328|gb|AAW26337.1| SJCHGC01215 protein [Schistosoma japonicum]
Length = 324
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/248 (44%), Positives = 153/248 (61%), Gaps = 10/248 (4%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPS 79
T +KR+GTH+G FHCDE L +++ ++ NA +VRSRDP VL D V+DVGGVYDP
Sbjct: 2 TSIKRIGTHDGCFHCDEVLAVVLLKHLPEYKNASVVRSRDPDVLSVCDVVVDVGGVYDPQ 61
Query: 80 NDCYDHHQKGFEEVFGHGF-----STKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
+DHHQK F + F KLSSAGLVY HFGK +++ ++ H + ++F
Sbjct: 62 TYRFDHHQKDFSLTWSKYFDVKMWDVKLSSAGLVYVHFGKRVLSLLTGLEINHEVLEKIF 121
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
+ VY++F+ ID DNG Q + P +Y NT L RV +LN W + +E AF
Sbjct: 122 MRVYESFILEIDGQDNGTPQ--SKMPLKYNINTGLYCRVRRLNPWW---NSGSEESESAF 176
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
Q+ ++L +EFLDTV ++ W PAR IV + ++ R D DPS I+VL R CPWK HLF+
Sbjct: 177 QRAINLVSREFLDTVDYFANCWWPARHIVAQAMSCREDVDPSKMIIVLDRSCPWKSHLFD 236
Query: 255 LEEEMKIE 262
LE E ++E
Sbjct: 237 LEREERME 244
>gi|429329930|gb|AFZ81689.1| uncharacterized protein family UPF0160 domain-containing protein
[Babesia equi]
Length = 272
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 98/228 (42%), Positives = 145/228 (63%), Gaps = 7/228 (3%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG FHCDEAL +M++L ++ +A+IVR+RD +LD D V+DVGGVYDPS Y
Sbjct: 2 KIGTHNGFFHCDEALAIYMLKLLPEYKDAEIVRTRDQGILDSCDIVVDVGGVYDPSKHRY 61
Query: 84 DHHQKGFEEVFGHGFS-TKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
DHHQK F F TKLSSAGL+YKHF K +I + V + V ++ +VY +F+
Sbjct: 62 DHHQKEFNVHFDDNHKVTKLSSAGLIYKHFSKRIINEIYGVKDNET-VDYIYNSVYDSFI 120
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
E++DAIDNG+ +D +Y NT+L SRVG+LN W + D + + E F + +++AG
Sbjct: 121 ESVDAIDNGVPI--SDGVLKYKWNTDLGSRVGRLNPAWMDVDGNPD---ERFMKAIEVAG 175
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKL 250
KEF V + LPA++ E + R++ SG+++ L + CP+ +
Sbjct: 176 KEFEHFVTNILNVILPAKTCFEEAFSRRFETHKSGKVIELTKSCPFSV 223
>gi|358336125|dbj|GAA54695.1| UPF0160 protein MYG1 mitochondrial, partial [Clonorchis sinensis]
Length = 1803
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 106/276 (38%), Positives = 151/276 (54%), Gaps = 39/276 (14%)
Query: 98 FSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV-----------YKNFMEAID 146
++ K SSAGLVY FGK++I+ D G + RLF V Y++F+ ID
Sbjct: 1418 WTVKFSSAGLVYATFGKDVISLLTGWDVGSDKLQRLFTKVSYPICVIYAKVYESFIMEID 1477
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DNG+ Q + +Y T + RV +LN W E + AF + +DL G+EF+
Sbjct: 1478 GTDNGVPQ--STHKLKYNIRTGICLRVSRLNPWWNE---DCKDTTPAFLEAVDLVGREFV 1532
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE---- 262
+TV+++ W PAR V +A+R D S I+VL++ CPWK HL++LE E + E
Sbjct: 1533 ETVKYFANCWWPAREFVARAMAKRELIDDSMAIIVLEQSCPWKCHLYDLEREERAESVVY 1592
Query: 263 ----------------PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
P + +V++ D G W VQAV S D FE+R PLP WR L+D
Sbjct: 1593 PQPLRLLTERPEPKFPPKVLFVVHPCDDGT-WIVQAV--SGDNFENRLPLPDSWRALQDS 1649
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAA 342
ELS+ AGIPGC+FVH +G +G N++ GA+ MAR+
Sbjct: 1650 ELSRVAGIPGCIFVHGTGHLGMNKTREGAIEMARSV 1685
>gi|440298962|gb|ELP91577.1| hypothetical protein EIN_129170 [Entamoeba invadens IP1]
Length = 317
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 110/333 (33%), Positives = 175/333 (52%), Gaps = 32/333 (9%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
RVG H G++H D+ G ++R F + ++VR+ D VL D V D+G VYD S Y
Sbjct: 2 RVGVHTGTYHSDDVTGVVLLRYVLAFRDFELVRTFDLAVLSTCDLVFDIGCVYDHSRKRY 61
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELI----AKELNVDEGHPDVHRLFLAVYK 139
DHHQ+GF E F + KL GL++KH+G E++ + E + ++ + VY
Sbjct: 62 DHHQRGFNETFSPAQTVKLCGCGLLFKHYGNEIVKNVLSTEFMITATDSEIESIKNNVYD 121
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F++AIDA DNG++ ++ Y + T L++R+ + N ++F+ +
Sbjct: 122 TFLKAIDANDNGVDV--SNGELLYRDTTTLTARIKRYN-----------TIGKSFEDAVA 168
Query: 200 LAGKEFLDTV---RFYVRSWLPARSIVVECIAERYDYDPSGEIMVL--KRFCPWKLHLFE 254
LA EFL V V + LP IV R++ +PSG ++ + +R + + E
Sbjct: 169 LAQPEFLAAVYDSYQIVHNKLP---IVRTAFTHRFEINPSGNVIYITDERDFTDAIQMLE 225
Query: 255 LEEEMKIEPL-IKYVLYEDD-RGKQWRVQAVAVSPDRFE--SRKPLPAQWRGLRDDELSK 310
EE + + YV+ +D RG+QWR A+ P ++ +K P WRGLRD+ LS
Sbjct: 226 NEERNAGNHVSLLYVIQKDSKRGEQWRATAL---PKQYSYIPQKCFPEAWRGLRDETLSS 282
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+ I GCVF H SGF+ N+++ G + M +A+L
Sbjct: 283 VSHIAGCVFCHASGFLACNKTFEGIIQMVQASL 315
>gi|12003126|gb|AAG43462.1|AF195007_1 Yer156p [Eremothecium gossypii]
Length = 223
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 90/207 (43%), Positives = 125/207 (60%), Gaps = 6/207 (2%)
Query: 129 DVHRLFLAVYKNFMEAIDAIDNGINQYDTD----KPPRYVNNTNLSSRVGKLNLDWTEPD 184
D L+ VY++F+EA+DA DNGI+ +D + +P + N ++ V K+N DW E +
Sbjct: 5 DTELLYEKVYRDFVEALDANDNGISNFDAEELGVRPRFHDKNISIPGIVAKMNPDWNE-E 63
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
S R +E F G F VR Y R+WLPA+ IV + +R DPSG I+VL+R
Sbjct: 64 TSDARFDECFLTASAFVGDCFARVVRGYGRAWLPAKDIVRAGVRDRAAVDPSGRIVVLER 123
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
FCPWK HL+++E E+ + + +VL+ D G WRV V S F R LP WRGLR
Sbjct: 124 FCPWKEHLYDVERELGLVGEVLFVLFADSSG-SWRVSTVPQSATSFRFRHGLPEPWRGLR 182
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQS 331
DD LS+ G+PGC+FVH +GFIGG ++
Sbjct: 183 DDALSEATGVPGCIFVHAAGFIGGART 209
>gi|326936039|ref|XP_003214066.1| PREDICTED: UPF0160 protein MYG1, mitochondrial-like, partial
[Meleagris gallopavo]
Length = 176
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 88/171 (51%), Positives = 116/171 (67%), Gaps = 5/171 (2%)
Query: 176 LNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP 235
LN W +PDQ E F++ M+L G EF+D + +Y R+WLPAR++V + I R++ D
Sbjct: 2 LNPRWNDPDQDTE---AGFKRAMELVGSEFMDRLDYYHRAWLPARALVEDAIRRRFEVDT 58
Query: 236 SGEIMVLKRF-CPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRK 294
SG ++ L + CPWK HLF LE+E+ + ++ VL+ DR QWRVQ+V V P FESR
Sbjct: 59 SGVMLELPQGGCPWKEHLFSLEKELALPNPLQLVLF-PDRSGQWRVQSVPVGPRSFESRL 117
Query: 295 PLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
PLP WRG+RD+ LS+ GIPGCVFVH SGFIGGN++ GAL MAR L L
Sbjct: 118 PLPEPWRGVRDEALSQLTGIPGCVFVHSSGFIGGNRTREGALEMARRTLAL 168
>gi|89179340|gb|ABD63078.1| MYG1 protein, related [Asparagus officinalis]
Length = 139
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 75/104 (72%), Positives = 90/104 (86%)
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
AIDAIDNG+NQYDTD+PPRYV+ TNLSSRVG+LNLDW +PDQS+E+EN AF + M LAG
Sbjct: 13 AIDAIDNGVNQYDTDQPPRYVSKTNLSSRVGRLNLDWMDPDQSSEKENAAFHKAMALAGT 72
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP 247
EFL++VR+Y +SWLPARSIV+EC+A R D DPSGEIM L + CP
Sbjct: 73 EFLESVRYYAKSWLPARSIVMECLASRGDIDPSGEIMALNKDCP 116
>gi|403745625|ref|ZP_10954420.1| metal-dependent protein hydrolase [Alicyclobacillus hesperidum
URH17-3-68]
gi|403121343|gb|EJY55657.1| metal-dependent protein hydrolase [Alicyclobacillus hesperidum
URH17-3-68]
Length = 321
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 115/320 (35%), Positives = 166/320 (51%), Gaps = 33/320 (10%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G FH DE ++R + +A++VRSR P+ L D V+DV G Y
Sbjct: 19 KIGTHHGKFHADEVFAVAILR--QLYPDAEVVRSRSPQALATCDIVVDVNG------SPY 70
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHH E+V+ +SAGL+++ FG LI+ DEG+ ++ + + + ++
Sbjct: 71 DHHT--VEKVY-RANGLPFASAGLIWRDFGAALISHFGVEDEGNRNI--VHAHIDEKLIQ 125
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
AIDAIDNGI D D+ R +S VG N W +A+ ENEAF++ + A
Sbjct: 126 AIDAIDNGI---DLDRDTRI---KGISELVGSFNPPW----NAADDENEAFERAVRFA-T 174
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
LD Y+R + +R VE + + +MVL RFCPW L EL+ + +
Sbjct: 175 VILDN---YIRHEI-SRIDAVEIVKSAFQARTEPSLMVLPRFCPWTETLMELDTQASV-- 228
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
YV++ D G Q+R+Q V SP FE+RKPLPA W G EL++ G F H +
Sbjct: 229 --LYVVFPDKTG-QYRLQVVPKSPRSFEARKPLPASWAGKEGAELAEACGTIDATFCHPA 285
Query: 324 GFIGGNQSYGGALAMARAAL 343
FI G S G L MAR A+
Sbjct: 286 RFIAGAHSLEGILQMARQAI 305
>gi|19074600|ref|NP_586106.1| similarity to HYPOTHETICAL PROTEIN YEY6_yeast [Encephalitozoon
cuniculi GB-M1]
gi|19069242|emb|CAD25710.1| similarity to HYPOTHETICAL PROTEIN YEY6_yeast [Encephalitozoon
cuniculi GB-M1]
Length = 311
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 175/323 (54%), Gaps = 32/323 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH DE + + L + +++IVR+R V+ D V DVG +DP + YDHH
Sbjct: 5 THDGKFHLDEVMATAV--LLKIYPDSEIVRTRSSAVVRSGDIVYDVGRSFDPEANRYDHH 62
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKE-LN-VDEGHPDVHRLFLAVYKNFMEA 144
Q+ F E F KLSS+GL+YK++G++ + K LN DE P R+ VY + +
Sbjct: 63 QESFNETFSPKHKIKLSSSGLIYKYYGEKFLEKYGLNRTDECFP---RVLEEVYTAYFMS 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN-LDWTEPDQSAERENEAFQQGMDLAGK 203
DAIDNG + P +LS V N LD+++ D R+N+ F + + K
Sbjct: 120 ADAIDNGYEIFGEIVP------RSLSHVVESFNALDFSDSD----RQNKRFLEAVQFVSK 169
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
+ + + V W+P+ + + IAE +G+I+ + R+C + + E+E++ K +
Sbjct: 170 DLDNFISAIVNRWVPSYKYLNKLIAE-----VNGDILYVDRYCFVDV-IPEIEKKYKKD- 222
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPD--RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+K+VL K+ V + V D RFES+ PL +WRGL +L +GI GC FVH
Sbjct: 223 -VKFVL----NRKESSVSILTVPKDRKRFESKVPLKKEWRGLAGGKLETVSGIDGCNFVH 277
Query: 322 MSGFIGGNQSYGGALAMARAALK 344
SGF+G N++ GA+ M R +++
Sbjct: 278 ASGFVGSNKTIEGAMEMCRISIE 300
>gi|449329563|gb|AGE95834.1| hypothetical protein ECU07_1790 [Encephalitozoon cuniculi]
Length = 311
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 175/323 (54%), Gaps = 32/323 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH DE + + L + +++IVR+R V+ D V DVG +DP + YDHH
Sbjct: 5 THDGKFHLDEVMATAV--LLKIYPDSEIVRTRSSAVVRSGDIVYDVGRSFDPEANRYDHH 62
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKE-LN-VDEGHPDVHRLFLAVYKNFMEA 144
Q+ F E F KLSS+GL+YK++G++ + K LN DE P R+ VY + +
Sbjct: 63 QESFNETFSPKHKIKLSSSGLIYKYYGEKFLEKYGLNRTDECFP---RVLEEVYTAYFMS 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN-LDWTEPDQSAERENEAFQQGMDLAGK 203
DAIDNG + P +LS V N LD+++ D R+N+ F + + K
Sbjct: 120 ADAIDNGYEIFGEIVP------RSLSHVVESFNALDFSDSD----RQNKRFLEAVQFVSK 169
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
+ + + V W+P+ + + IAE +G+I+ + R+C + + E+E++ K +
Sbjct: 170 DLDNFISAIVNRWVPSYKYLDKLIAE-----VNGDILCVDRYCFVDV-IPEIEKKYKKD- 222
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPD--RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+K+VL K+ V + V D RFES+ PL +WRGL +L +GI GC FVH
Sbjct: 223 -VKFVL----NRKESSVSILTVPKDRERFESKVPLKKEWRGLAGGKLETVSGIDGCNFVH 277
Query: 322 MSGFIGGNQSYGGALAMARAALK 344
SGF+G N++ GA+ M R +++
Sbjct: 278 ASGFVGSNKTIEGAMEMCRISIE 300
>gi|303388789|ref|XP_003072628.1| hypothetical MYG1-like protein [Encephalitozoon intestinalis ATCC
50506]
gi|303301769|gb|ADM11268.1| hypothetical MYG1-like protein [Encephalitozoon intestinalis ATCC
50506]
Length = 305
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 172/321 (53%), Gaps = 26/321 (8%)
Query: 24 RVGTHNGSFHCDEALG-CFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
++ THN FH DE L C ++R+ + +A+++R+RD ++ D V DVG V+DP +
Sbjct: 2 KLVTHNERFHYDEILASCVLLRI---YPDAEVIRTRDDSIISQGDIVYDVGRVFDPQSGR 58
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+ F E F + TKLSS+GL++K+F K+L+A V+E + +Y F
Sbjct: 59 FDHHQRTFSETFSPKYRTKLSSSGLIFKYFHKKLLAL-YGVEESCRIYELVVDKIYSEFF 117
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
DAIDNGI+ Y +P +++ V N D EP++ + EN+ F + + +
Sbjct: 118 LYADAIDNGIDIYGEIRP------RSIADLVSLFNSD--EPNE--DLENQRFLEALKIVD 167
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
K+ LD +R+W+ + V I E +G I+VL + L LE E +
Sbjct: 168 KD-LDNYLKRIRTWIDSYEYVESKIKE-----TNGPILVLDKHYSTDL---VLEIEGSHQ 218
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
K++++ +RV A+ FE++ PL +WRGL +++L + +GI GC FVH
Sbjct: 219 KDFKFMVFP--LRNAYRVMAIPKCKGSFETKNPLKKEWRGLVNEDLVRVSGIEGCTFVHS 276
Query: 323 SGFIGGNQSYGGALAMARAAL 343
SGF+G N S AL M +L
Sbjct: 277 SGFMGINSSLENALKMCSESL 297
>gi|449686830|ref|XP_002163350.2| PREDICTED: UPF0160 protein MYG1, mitochondrial-like [Hydra
magnipapillata]
Length = 146
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 80/146 (54%), Positives = 100/146 (68%), Gaps = 2/146 (1%)
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELEEE 258
+ G EFLD + +Y +WLPAR IV + I ERY D SGE++V + CPWK HLF++E E
Sbjct: 1 MVGLEFLDRIHYYKDAWLPARKIVEDAIDERYSIDESGEVIVFHQSGCPWKEHLFDIEIE 60
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
I P IK+VLY+D + WRVQ+V FE+R L WRGLRDD LS +GIPGC+
Sbjct: 61 KSIVPNIKFVLYQD-QNHNWRVQSVPERLGSFENRLSLCESWRGLRDDILSSTSGIPGCI 119
Query: 319 FVHMSGFIGGNQSYGGALAMARAALK 344
FVH SGFIGGN++Y GAL MA+ LK
Sbjct: 120 FVHASGFIGGNKTYEGALQMAKHTLK 145
>gi|385305986|gb|EIF49926.1| yer156c-like protein [Dekkera bruxellensis AWRI1499]
Length = 219
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 88/214 (41%), Positives = 130/214 (60%), Gaps = 7/214 (3%)
Query: 102 LSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQY--DT 157
SAGLV+KHFGKE+I L + ++ D+ ++ VYK+F+EAIDA DNGIN+Y
Sbjct: 8 FXSAGLVFKHFGKEIICSVLGFSTEKNAKDIDFVYBRVYKDFVEAIDANDNGINKYANQN 67
Query: 158 DKPPRYVN-NTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSW 216
D P++ + N +L+ V LN W A+ + + FQ L GK F+ + + +S+
Sbjct: 68 DLIPKFHDRNFSLAGTVANLNPSWDSDPTDADFDAQ-FQVASQLMGKAFMQFLNYIGKSF 126
Query: 217 LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGK 276
LPA+ V + ER+ D SG+I+++ R+ PWK H++ +E+ +E I YVL+ D
Sbjct: 127 LPAKQYVQKAFDERFSVDKSGKIILMNRYVPWKEHIYNIEKSNNVEGQILYVLFPDS-NX 185
Query: 277 QWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSK 310
WR+ AV VS F+SRK LPA+WRGLRD LS+
Sbjct: 186 NWRITAVPVSASSFDSRKKLPAEWRGLRDQALSE 219
>gi|357469229|ref|XP_003604899.1| GAMM1 protein-like protein [Medicago truncatula]
gi|355505954|gb|AES87096.1| GAMM1 protein-like protein [Medicago truncatula]
Length = 100
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 74/87 (85%), Positives = 81/87 (93%)
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
WKLHLFELE EMKI+P IKYVLYED+R KQWRVQAV+VSPDRFESRK LP+QWRGLRDD
Sbjct: 3 WKLHLFELEGEMKIDPPIKYVLYEDERSKQWRVQAVSVSPDRFESRKALPSQWRGLRDDI 62
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGG 334
LSKE+GIPGCVFVHMSGFIGGNQ++ G
Sbjct: 63 LSKESGIPGCVFVHMSGFIGGNQTFEG 89
>gi|303390127|ref|XP_003073295.1| hypothetical protein Eint_071760 [Encephalitozoon intestinalis ATCC
50506]
gi|303302440|gb|ADM11935.1| hypothetical protein Eint_071760 [Encephalitozoon intestinalis ATCC
50506]
Length = 318
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 106/321 (33%), Positives = 173/321 (53%), Gaps = 26/321 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH DE + + L + +++I+R+R+ V+ D V DVGG ++P + YDHH
Sbjct: 5 THDGKFHLDEVMATAV--LLKIYPDSEIIRTRNMSVMRSGDIVYDVGGTFNPETNRYDHH 62
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+ F E F + KLSS+GL+YK++G++L+ K V + RL +Y + + D
Sbjct: 63 QESFNETFSSKYKIKLSSSGLIYKYYGEKLLEK-YGVTKTDEHFQRLLEEIYATYFLSAD 121
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ-QGMDLAGKEF 205
AIDNG + + PR +++ S V LN ++ R EA + MDL F
Sbjct: 122 AIDNGYEIFG-EIVPRSLSHIVESFNV--LNFSGGSNNEQDRRFLEAVRFVSMDL--DNF 176
Query: 206 LDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLI 265
+ T+ + W+P + E I++ G+I+ + R+C + + E+E + K + I
Sbjct: 177 MHTI---INGWMPNYKYLSELISK-----VDGDILCVDRYCFIDV-IPEIERKYKKD--I 225
Query: 266 KYVLYEDDRGKQWRVQAVAVSPDR--FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
+VL + + V +AV R FES+ PL +WRGL +L +GI GC FVH+S
Sbjct: 226 NFVL----KKGESSVSILAVPKKRKHFESKIPLKKEWRGLSGGKLDAISGIEGCNFVHLS 281
Query: 324 GFIGGNQSYGGALAMARAALK 344
GF+G N++ GA+ M + ++K
Sbjct: 282 GFVGSNKTIEGAIEMCKESIK 302
>gi|396081807|gb|AFN83422.1| hypothetical protein EROM_071710 [Encephalitozoon romaleae SJ-2008]
Length = 315
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 98/316 (31%), Positives = 167/316 (52%), Gaps = 20/316 (6%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+ FH DE L + L + +++I+R+R+P V+ D + DVGGV+DP + YDHH
Sbjct: 5 THDRKFHLDEVLATAV--LLKIYPDSEIIRTRNPAVVQGGDIIYDVGGVFDPKTNRYDHH 62
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+ F E F KLSS+GL+YK++G+ + + + H+ +Y+ + + D
Sbjct: 63 QESFNETFSSNHKIKLSSSGLIYKYYGERFL-EVYGITRTDEYFHKALEEIYETYFMSAD 121
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
AIDNG + P +LS V N+ +++ E +N F + + + ++
Sbjct: 122 AIDNGYEIFGEIVP------RSLSHIVESFNILSFSGNENDE-QNRRFLEAVRIVSRDLD 174
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ + + SW+P + + I+ G+I+ + R+C + + E+E++ K + IK
Sbjct: 175 NFMHTMIGSWIPNYKYLDKLIS-----GVVGDILCVDRYCFIDV-VPEIEKKYKKD--IK 226
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
+VL ++R + AV F+S+ PL +WRGL +L +GI GC FVH SGF+
Sbjct: 227 FVL--NERENSVTILAVPKERKHFKSKIPLKKEWRGLTGSKLETISGIEGCNFVHASGFV 284
Query: 327 GGNQSYGGALAMARAA 342
G N++ GAL M R +
Sbjct: 285 GSNKTIEGALEMCRVS 300
>gi|401827175|ref|XP_003887680.1| Myg1-like metal binding protein [Encephalitozoon hellem ATCC 50504]
gi|392998686|gb|AFM98699.1| Myg1-like metal binding protein [Encephalitozoon hellem ATCC 50504]
Length = 307
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 170/318 (53%), Gaps = 20/318 (6%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+ FH DE L + L + +++I+R+R+P V++ D V DVGG++DP YDHH
Sbjct: 5 THDKKFHLDEVLATAV--LLKIYPDSEIIRTRNPAVIETGDIVYDVGGLFDPKTSRYDHH 62
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+ F E F KLSS+GL+YK++ ++ + + + + + R+ VY + + D
Sbjct: 63 QESFGETFNSNHKIKLSSSGLIYKYYAEKFL-EIYGITKTNEYFQRVLEEVYTAYFMSAD 121
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
AIDNG + + PR + +++ L+ +E D+ +N+ F + + K+
Sbjct: 122 AIDNGYEIFG-EIVPRSL--SHIVESFNALSFSSSEDDE----QNKRFLEAVRFVSKDLD 174
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ + V W+P + + IA D G+I+ + ++C + + E+E++ K + I+
Sbjct: 175 NFMHTIVNGWIPNYKYLDKLIA-----DSDGDILCVDKYCFIDV-IPEIEKKYKKD--IR 226
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
+VL + R + AV F S+ PL +WRGL +L +GI GC FVH SGF+
Sbjct: 227 FVL--NKRESSVTILAVPRERKHFRSKVPLKKEWRGLAGSKLEAISGIEGCNFVHASGFV 284
Query: 327 GGNQSYGGALAMARAALK 344
G N++ GA+ M R +++
Sbjct: 285 GSNRTVEGAMEMCRVSVE 302
>gi|19173093|ref|NP_597644.1| similarity to HYPOTHETICAL PROTEIN YEY6_yeast [Encephalitozoon
cuniculi GB-M1]
gi|19168760|emb|CAD26279.1| similarity to HYPOTHETICAL PROTEIN YEY6_yeast [Encephalitozoon
cuniculi GB-M1]
gi|449330147|gb|AGE96410.1| hypothetical protein ECU03_1360 [Encephalitozoon cuniculi]
Length = 305
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/322 (34%), Positives = 168/322 (52%), Gaps = 28/322 (8%)
Query: 24 RVGTHNGSFHCDEALG-CFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
++ THN FH DE L C ++R+ + +A+IVR+RD ++D D V DVGGV+DP
Sbjct: 2 KLITHNERFHYDEVLASCILLRI---YPDAEIVRTRDKTLIDSGDIVYDVGGVFDPGLGR 58
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL-AVYKNF 141
+DHHQ+ F E F +S KLSS+GL++K+F ++L++ L E + L + +Y F
Sbjct: 59 FDHHQRTFFETFSPKYSVKLSSSGLIFKYFHRKLLS--LYGIESSSRTYDLVVDKIYSEF 116
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
DAIDNG + Y KP + NL N+D + D E E F + + +
Sbjct: 117 FLYADAIDNGQDIYGEIKPRSIADLVNL------FNVDGSGRDA----ETEGFYEALRIV 166
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
K+ LD ++ W + I E SG I+VL + L LE E +
Sbjct: 167 DKD-LDNYMKRIKLWTDNYEHMENKIKE-----TSGPILVLDKHYSTDL---VLEIEAQN 217
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
K++++ RG +RV A+ FE++ PL +WRGL ++EL + +GI GC FVH
Sbjct: 218 GKDFKFMVFP-HRG-AYRVIAIPKHKGTFETKNPLRKEWRGLVNEELVEASGIEGCTFVH 275
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
SGF+G N + AL M +L
Sbjct: 276 SSGFMGINSTLENALKMCTESL 297
>gi|396081126|gb|AFN82745.1| putative MYG1-like protein [Encephalitozoon romaleae SJ-2008]
Length = 305
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 175/322 (54%), Gaps = 28/322 (8%)
Query: 24 RVGTHNGSFHCDEALG-CFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
++ TH+ FH DE L C ++R+ + +A+I+R+RD +++ D V DVGGV++P+
Sbjct: 2 KLVTHDERFHYDEVLASCVLLRI---YPDAEIMRTRDNAIIEQGDIVYDVGGVFNPATRR 58
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL-AVYKNF 141
+DHHQ+ F E F ++ KLSS+GL++K+F K+L++ L E ++ + + +Y F
Sbjct: 59 FDHHQRTFSETFSSKYNVKLSSSGLIFKYFHKQLLS--LYGIEDTCGIYNMVVDKIYSEF 116
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
DAIDNG + Y +P ++ VG N D PD+ EN F + +++
Sbjct: 117 FLYADAIDNGQDIYGEIRP------RTMADLVGLFNTD--TPDEGL--ENRGFYKVLEIV 166
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ LD ++ W+ V E+ + +G I+VL + L + E+E +
Sbjct: 167 STD-LDNYMKRIKIWVNNYEHV-----EKKARETNGPILVLDKHYTTDL-ILEIESQNGK 219
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ K++++ +RV A+ FE++ PL +WRGL ++EL + +GI GC+FVH
Sbjct: 220 D--FKFMVFPHRNA--YRVIAIPKRRGSFETKNPLKKEWRGLVNEELVRASGIEGCIFVH 275
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
SGF+G N ++ AL M +L
Sbjct: 276 SSGFMGINSTFENALKMCEESL 297
>gi|323309319|gb|EGA62537.1| YER156C-like protein [Saccharomyces cerevisiae FostersO]
Length = 167
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 75/168 (44%), Positives = 109/168 (64%), Gaps = 2/168 (1%)
Query: 176 LNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP 235
+N +W E D S E + F + + G F+ V+ Y +SWLPA+++V + I ER D D
Sbjct: 1 MNPNWNE-DTSDESFDRCFARASEFIGGVFVTLVKGYGQSWLPAKALVAQAIDERMDVDK 59
Query: 236 SGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKP 295
SG+I+VL +FCPWK HL+ELE E IE I++VL+ D G WRV V ++ F+ R+
Sbjct: 60 SGKIIVLPQFCPWKEHLYELEREKNIEKQIEFVLFTDSSG-AWRVSTVPINSTSFQFRRG 118
Query: 296 LPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
LP RGLRD+ELS ++G+PGC+F+H +GFIGG +S +A+ +L
Sbjct: 119 LPEPLRGLRDEELSTKSGVPGCIFIHAAGFIGGAKSKEAVYELAKMSL 166
>gi|375006372|ref|YP_004975156.1| hypothetical protein AZOLI_p40179 [Azospirillum lipoferum 4B]
gi|357427630|emb|CBS90575.1| conserved protein of unknown function; metal-dependent hydrolase
domain [Azospirillum lipoferum 4B]
Length = 332
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/321 (31%), Positives = 155/321 (48%), Gaps = 24/321 (7%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLT---DKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSN 80
R+ TH GSFHCDE LG ++R + + ++R+RDP+ ++ D V DVGGV+DP+
Sbjct: 6 RLVTHGGSFHCDEVLGYAILRRALPPEALAASTLLRTRDPRAIEAADIVWDVGGVFDPAR 65
Query: 81 DCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
+DHHQ+G + SSAGL++ FG + + L G V +++ + +
Sbjct: 66 RRFDHHQRGAP---ARADGSPYSSAGLLWSAFGHDAVRTVLAGRGGEEVVGQIWAEMDEQ 122
Query: 141 FMEAIDAIDNGINQYD------TDKPPRYVNNTNLSSRVGKLNLDW-TEPDQSAERENEA 193
+ +D DNG+ D+ R + L S V LNL W A E+E
Sbjct: 123 LIRLVDLADNGLRPVPGFGDEALDRAARIADGLALPSLVEVLNLPWDAATADRAPAEDER 182
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
F + ++AG LD +R+ + AR IV+E A D I+ L R PW+
Sbjct: 183 FARAAEIAGA-VLDGRIEQIRARIAARDIVLEAHARSADP----RILELGRGMPWQGPAH 237
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
E + + + +Y D G+ W V + P F + PLPA W GLRD EL++ G
Sbjct: 238 EADLP------VLFAVYPDKGGEAWMVGCMPPEPGSFAQKLPLPAAWAGLRDAELARVCG 291
Query: 314 IPGCVFVHMSGFIGGNQSYGG 334
+ +F H++ F+ +S G
Sbjct: 292 VEDAIFCHLNRFVAAARSRDG 312
>gi|89179336|gb|ABD63074.1| hypothetical protein 17.t00001 [Asparagus officinalis]
Length = 261
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 68/87 (78%), Positives = 79/87 (90%)
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
WKLH+FELEEE+KI+P IKY LY+DDR K WRVQAVA+S D+FESR+PLP WRGLRD+E
Sbjct: 164 WKLHIFELEEELKIDPSIKYALYQDDRSKNWRVQAVAISADKFESRRPLPLPWRGLRDEE 223
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGG 334
LSKE+GIPGCVFVHMSGFIGGN++Y G
Sbjct: 224 LSKESGIPGCVFVHMSGFIGGNRTYDG 250
>gi|412991138|emb|CCO15983.1| MYG1 protein [Bathycoccus prasinos]
Length = 243
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 104/149 (69%), Gaps = 4/149 (2%)
Query: 4 TGVSSSPAYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL 63
T +SS + + + ++ +GTH+GSFHCDEALGC++++ T++F N +IVR+RD L
Sbjct: 72 TAATSSKRHQSMTGTEDT---IGTHDGSFHCDEALGCYLLQNTEQFSNCRIVRTRDADAL 128
Query: 64 DDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNV 123
VLDVG YD + +DHHQKGF E F F TKLSSAGLVYKHFGKE+++K++
Sbjct: 129 AKCAVVLDVGAEYDVAKLRFDHHQKGFSETFN-DFKTKLSSAGLVYKHFGKEIVSKKIEK 187
Query: 124 DEGHPDVHRLFLAVYKNFMEAIDAIDNGI 152
P ++L+L +YK+F+EA+D +DNG+
Sbjct: 188 SVEDPVTNQLYLKMYKSFIEAVDGVDNGV 216
>gi|401825819|ref|XP_003887004.1| Myg1-like protein [Encephalitozoon hellem ATCC 50504]
gi|392998161|gb|AFM98023.1| Myg1-like protein [Encephalitozoon hellem ATCC 50504]
Length = 302
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 174/322 (54%), Gaps = 28/322 (8%)
Query: 24 RVGTHNGSFHCDEALG-CFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
++ TH+ FH DE L C ++R+ + +A++VR+RD +++ D V DVG V+DP
Sbjct: 2 KLVTHSERFHYDEILATCILLRI---YPDAEVVRTRDDALIEQGDIVYDVGKVFDPKIGR 58
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL-AVYKNF 141
+DHHQ+ F E F + KLSS+GL++K+F K+ ++ L E +++ + + +Y F
Sbjct: 59 FDHHQRTFSETFSPKYDVKLSSSGLIFKYFHKKFLS--LYGIEDSCEIYGIVVDKIYSEF 116
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
DAIDNG + + + +P + VG N+D PD+ E E++ F++ +++
Sbjct: 117 FLYADAIDNGQDIHGSIRP------RTVPDLVGLFNVDV--PDE--ELESKGFRKALEIV 166
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ LD + W+ V ER + +G I++L + L LE E +
Sbjct: 167 SMD-LDNYMKRAKMWIDNYEYV-----ERKVRETNGPILMLDKHYSTDL---VLEVESRN 217
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ K++++ +R+ A+ FE++ PL +WRGL ++EL K +GI GC+FVH
Sbjct: 218 DKDFKFMVFP--HRNAYRIIAIPKCKGSFETKNPLKEEWRGLVNEELVKVSGIEGCIFVH 275
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
SGF+G N++ AL M +L
Sbjct: 276 SSGFLGINKTLENALKMCEESL 297
>gi|300707987|ref|XP_002996183.1| hypothetical protein NCER_100767 [Nosema ceranae BRL01]
gi|239605461|gb|EEQ82512.1| hypothetical protein NCER_100767 [Nosema ceranae BRL01]
Length = 300
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 164/320 (51%), Gaps = 24/320 (7%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+G FH DE L ++ + NA+++R+RD + D V DVG ++PS +
Sbjct: 2 KLVTHDGKFHYDEVLASSILLFI--YPNAELIRTRDLVEISKGDIVYDVGSEFNPSTKRF 59
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ F+E + + ++ KLSSAGL+YK+F KEL + + + HP RL VY F
Sbjct: 60 DHHQHSFKETYSNKYNFKLSSAGLIYKYFQKELF-QYFEIYDTHPLYERLTDKVYNEFFL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
D +DNGI+ ++ KP + V LN+D D +E + + F + + +
Sbjct: 119 GADCVDNGIDIWNIVKP------RTIYDVVNDLNVD---NDCFSEAQTKNFYEAVKIVSD 169
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
+F+ + WL S E I + SG I+ ++ L + ++E++ +
Sbjct: 170 DFIRYMYKVKDYWLKYYSECEEKIK-----NLSGHILQSDKYYDVDL-ILDIEKQYNKD- 222
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
+ Y++Y + Q+R++A+ V S+ PL +WRG R EL+ G +FVH S
Sbjct: 223 -VYYLVYP--QNNQYRIRAINVESKTMASKLPLKEEWRGKRGKELND--LFEGAIFVHYS 277
Query: 324 GFIGGNQSYGGALAMARAAL 343
GFIG +++ GAL M +
Sbjct: 278 GFIGIHKTKEGALKMCENTI 297
>gi|315126112|ref|YP_004068115.1| metal-dependent protein hydrolase [Pseudoalteromonas sp. SM9913]
gi|315014626|gb|ADT67964.1| putative metal-dependent protein hydrolase [Pseudoalteromonas sp.
SM9913]
Length = 289
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 151/322 (46%), Gaps = 37/322 (11%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG+FH D+ +++ F + ++R+RD +++D D V+DVGG YDP D
Sbjct: 4 KTIVTHNGNFHADDVFSIAALKII--FPSFTLIRTRDNAIINDADVVIDVGGQYDPETDR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GLV+K +G L + + + DV AV +
Sbjct: 62 FDHHQRGGAGARENGI--PFSSFGLVWKKYG-------LAICDDNQDVAN---AVDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + AF Q ++ A
Sbjct: 110 STIDAIDCG-------HVEGVATGISLSQTISMFNPTWEEESNF----DAAFDQAVEFAH 158
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
L V + L A+ IV + I + D ++VL+R+ PWK + L EE
Sbjct: 159 TLLLRFVA-SAQGGLNAKKIVAQAIKQADD----PRLIVLERYTPWKKTVHSLSEEAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
YV+Y GK W VQ V FE RKPLPA W GL D E GI +F H
Sbjct: 212 ----YVVYPSHSGK-WIVQTVPAELGSFEDRKPLPAPWAGLSDSEFQAVTGIDDAMFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAALK 344
FI G +S+ +++A AL+
Sbjct: 267 GLFIAGAESFESVMSLATMALE 288
>gi|195373981|ref|XP_002046044.1| GM19272 [Drosophila sechellia]
gi|194123231|gb|EDW45274.1| GM19272 [Drosophila sechellia]
Length = 172
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/169 (44%), Positives = 105/169 (62%), Gaps = 5/169 (2%)
Query: 176 LNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP 235
LN W E E + F+Q MD AG+EF+D V SW+ AR V E + P
Sbjct: 1 LNPSWQETGVDIE---DRFRQAMDTAGREFVDNVVEVSCSWIAARDHVREALKNAKSVHP 57
Query: 236 SGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKP 295
+GEI+VLK FCPWK HLF+LE+E K+E + K V++ + G WRV V+V+P F RK
Sbjct: 58 TGEILVLKNFCPWKSHLFDLEKEYKVEGVPKLVVF--NNGTSWRVAGVSVTPSSFLGRKF 115
Query: 296 LPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
LP WRGLRDDEL ++A I +F+H +GFIGG ++ A+ +A+ +++
Sbjct: 116 LPTPWRGLRDDELCEKASIKDLLFIHHNGFIGGAKTEEAAMLLAKKSIE 164
>gi|394988244|ref|ZP_10381082.1| hypothetical protein SCD_00646 [Sulfuricella denitrificans skB26]
gi|393792702|dbj|GAB70721.1| hypothetical protein SCD_00646 [Sulfuricella denitrificans skB26]
Length = 305
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 108/329 (32%), Positives = 164/329 (49%), Gaps = 36/329 (10%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P TH+GSFH D+ L +RL + I+R+RD LD D + DVG V+D
Sbjct: 9 QRPGAVAATHSGSFHADDVLAAATLRLVNPAL--PILRTRDQGQLDAADVIFDVGRVFDS 66
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
+ +DHHQ ++E +G SS GLV+ G +L + V
Sbjct: 67 ATCRFDHHQLEYKEARENGI--PYSSFGLVWCELGAQLCESAAAAAK-----------VD 113
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+ ++ +DA+D GIN P V ++S+ +G N W + ++ R NEAF++ +
Sbjct: 114 RWLVQGVDAMDCGINLSKETLP---VTLMSISTVLGGFNPGWQDVTSASAR-NEAFERAV 169
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+A +T+R + AR+ V + I + +G ++VL+ PWK + E
Sbjct: 170 SMATTVLQNTIR-DANGFEKARAAVAQGILQE-----AGRLLVLEHDVPWKDAVLGSSE- 222
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDR---FESRKPLPAQWRGLRDDELSKEAGIP 315
E L+ YV+ D + K W V AV PDR F +RK LP+ W GL +EL G+
Sbjct: 223 --YEHLL-YVITPDAQAK-WHVTAV---PDRAGSFNNRKSLPSAWAGLDGEELDAVIGME 275
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAALK 344
GCVF H F+ G+++ GGA+ MAR AL+
Sbjct: 276 GCVFCHRGRFVAGHKTKGGAVEMARLALR 304
>gi|359439024|ref|ZP_09229004.1| metal-dependent protein hydrolase [Pseudoalteromonas sp. BSi20311]
gi|358026258|dbj|GAA65253.1| metal-dependent protein hydrolase [Pseudoalteromonas sp. BSi20311]
Length = 289
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 152/324 (46%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG+FH D+ +++ F + ++R+RD +++D D V+DVGG YDP D
Sbjct: 4 KTIVTHNGNFHADDVFSIAALKII--FPSFTLIRTRDNAIINDADVVIDVGGQYDPQTDR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GLV+K +G L + + + DV AV +
Sbjct: 62 FDHHQRGGAGARENGI--PFSSFGLVWKKYG-------LAICDDNQDVAN---AVDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + AF Q ++ A
Sbjct: 110 STIDAIDCG-------HVEGVATGISLSQTISMFNPTWEEESNF----DTAFDQAVEFAH 158
Query: 203 KEFLDTVRFYV--RSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+RF R L A+ IV + I + D ++VL+R+ PWK + L EE
Sbjct: 159 TLL---IRFVASARGGLNAKKIVAQAIKQADD----PRLIVLERYTPWKKTVHSLSEEAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+V+Y GK W VQ V FE RK LPA W GL DDE GI +F
Sbjct: 212 ------FVVYPSHSGK-WIVQTVPAELGSFEDRKSLPAPWAGLSDDEFQAVTGIDDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G +S+ +++A AL+
Sbjct: 265 HNGLFIAGAESFESVMSLAAMALE 288
>gi|410630697|ref|ZP_11341384.1| hypothetical protein GARC_1271 [Glaciecola arctica BSs20135]
gi|410149663|dbj|GAC18251.1| hypothetical protein GARC_1271 [Glaciecola arctica BSs20135]
Length = 288
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 155/324 (47%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG+FH D+ ++ F + ++R+RD KV+ D V+DVGG YDP
Sbjct: 4 KTIVTHNGNFHADDVFSIAALKTI--FPSFTLIRTRDLKVIAKADIVIDVGGEYDPEAGR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G L++ EG+ DV AV +
Sbjct: 62 FDHHQRGGAGARQNGI--PYSSFGLIWQKYG-------LDICEGNQDVAN---AVDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G + +LS + N W E + F + +D A
Sbjct: 110 STIDAIDCG-------HVEGIYDGISLSQTISMFNPTWQEDSHF----DSCFDEAVDFAL 158
Query: 203 KEFLDTVRFYVRS--WLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+ RF + + A+SIV + I D ++VL+++ PWK + L ++
Sbjct: 159 RVL---TRFIAAANGGISAKSIVAKAIENAQD----PRVIVLEKYIPWKRTVHALSDKAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
Y++Y G QWR+Q V V P FE RKPLP +W GL D +L + GI +F
Sbjct: 212 ------YMIYPSQTG-QWRIQTVPVEPGSFEDRKPLPKEWAGLSDKDLQEVTGIDDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G +S+ + MA AL+
Sbjct: 265 HNGLFIAGAESFESTMKMATIALQ 288
>gi|89093281|ref|ZP_01166231.1| hypothetical protein MED92_04352 [Neptuniibacter caesariensis]
gi|89082577|gb|EAR61799.1| hypothetical protein MED92_04352 [Neptuniibacter caesariensis]
Length = 289
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 153/319 (47%), Gaps = 37/319 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V TH+G+FH D+ +R F + +++R+RD +++ D V+DVGG YDP+ +D
Sbjct: 6 VVTHSGNFHADDVFSIAALRTV--FPSLKLIRTRDQELISKADIVVDVGGEYDPATGRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GL+++ +G E+ +G+ DV + AV + + A
Sbjct: 64 HHQRGGAGERDNGI--PYSSFGLIWQKYGVEIC-------QGNTDVAK---AVDADLVSA 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W E +EAF + + A
Sbjct: 112 IDAIDCGYGD-------GVAKGVSLSQTISMFNPTWQEETHF----DEAFNEAIAFA-SH 159
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
L+ + AR IV + I + D ++VL+++ PWK + L +E
Sbjct: 160 VLNRFIASADGGISARDIVAQAIKKAED----PRVVVLEQYTPWKRTVHALSKEAL---- 211
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
Y++Y D G QWR+QAV V FE +K LPA W GL L E GI +F H
Sbjct: 212 --YIVYPSDTG-QWRIQAVPVKLGSFEDKKSLPAAWAGLSGKALQDETGISDAMFCHNGL 268
Query: 325 FIGGNQSYGGALAMARAAL 343
FI G SY + MA+ AL
Sbjct: 269 FIAGCDSYESTMKMAKLAL 287
>gi|225678253|gb|EEH16537.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 372
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/184 (42%), Positives = 112/184 (60%), Gaps = 15/184 (8%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+GTHNG FH DEAL +++RL + ++ ++R+RDP +L V+DVGG YDP+ + YD
Sbjct: 169 IGTHNGHFHADEALAVYLLRLLPTYTSSPLIRTRDPSLLATCHTVVDVGGEYDPARNRYD 228
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ F F ST+LSSAGLVY HFG+ +IA+ ++ HPDV L+ +Y +F+EA
Sbjct: 229 HHQRTFTATFP-SHSTRLSSAGLVYLHFGRAIIAQHTSLPIDHPDVSTLYEKLYADFIEA 287
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRV--GKLNL-----DWTEPDQSA---ERENEAF 194
+DA DNGI+ YD P V L R G +NL D PD S+ + E+ F
Sbjct: 288 LDAHDNGISVYD----PSRVAAAGLEKRFRDGGINLGSLVGDLNGPDPSSNDPQDEDSLF 343
Query: 195 QQGM 198
++ +
Sbjct: 344 EKAI 347
>gi|359444703|ref|ZP_09234474.1| hypothetical protein P20439_0789 [Pseudoalteromonas sp. BSi20439]
gi|358041539|dbj|GAA70723.1| hypothetical protein P20439_0789 [Pseudoalteromonas sp. BSi20439]
Length = 289
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/324 (32%), Positives = 152/324 (46%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG+FH D+ +++ F + ++R+RD +++D D V+DVGG YDP +
Sbjct: 4 KTIVTHNGNFHADDVFSIAALKII--FPSFTLIRTRDNAIINDADVVIDVGGQYDPETNR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GLV+K +G L + + + DV AV +
Sbjct: 62 FDHHQRGGAGARENGI--PFSSFGLVWKKYG-------LAICDDNQDVAN---AVDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + AF Q ++ A
Sbjct: 110 STIDAIDCG-------HVEGVATGISLSQTISMFNPTWEEESDF----DAAFDQAVEFAH 158
Query: 203 KEFLDTVRFYV--RSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+RF R L A+ IV + I + D ++VL+R+ PWK + L EE
Sbjct: 159 TLL---IRFVASARGGLNAKKIVAQAIKQADD----PRLIVLERYTPWKKTVHSLSEEAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+V+Y GK W VQ V FE RK LPA W GL DDE GI +F
Sbjct: 212 ------FVVYPSHSGK-WIVQTVPAELGSFEDRKSLPAPWAGLSDDEFQAVTGIDDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G +S+ +++A AL+
Sbjct: 265 HNGLFIAGAESFESVMSLAAMALE 288
>gi|365857223|ref|ZP_09397218.1| hypothetical protein HMPREF9946_02837 [Acetobacteraceae bacterium
AT-5844]
gi|363716528|gb|EHL99929.1| hypothetical protein HMPREF9946_02837 [Acetobacteraceae bacterium
AT-5844]
Length = 312
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 108/331 (32%), Positives = 164/331 (49%), Gaps = 28/331 (8%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIRLT----DKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
T L + TH+GSFHCDEA ++RL + + ++VR+RD +++ D V DVG
Sbjct: 2 TDLPLLVTHSGSFHCDEAFAYVVLRLALGLREPGKDHRLVRTRDAEIIAQGDYVWDVGLT 61
Query: 76 YDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
YDP+ +DHHQ+G V G T S+AGL+++H G+ + L + +
Sbjct: 62 YDPATHRFDHHQRG-APVREDG--TPFSAAGLIWQHHGEAALRALLRPEGAEGMAPAIAA 118
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAER-ENEAF 194
+ + ID +DNG R +L+S VG NL W P + + E+ AF
Sbjct: 119 ELDGGLIRRIDEVDNGTA--------RSREPMDLASLVGDCNLTWDTPAEGRQAAEDAAF 170
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
+ + L + L +RS + A ++VV A DP ++ L+R PWK +F
Sbjct: 171 LEAVALL-EGVLRRRVSVMRSRMAADALVVA--AHGASADP--RVLELERGMPWKNVVFS 225
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
E + + +Y G W V A+ P F R PLP W GL++ L+ E+G+
Sbjct: 226 HELP------VLFAIYPVPNGN-WMVDAMPPEPGSFAQRLPLPESWAGLQEAALAAESGV 278
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
P VFVH+ F+GG +S GA+ MAR +KL
Sbjct: 279 PDAVFVHVRRFVGGARSRAGAVEMARRTIKL 309
>gi|429962865|gb|ELA42409.1| hypothetical protein VICG_00508 [Vittaforma corneae ATCC 50505]
Length = 308
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 97/321 (30%), Positives = 163/321 (50%), Gaps = 20/321 (6%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ TH+G+FH DE L + L + +A+++R+R +V++ D V DVG YDPS Y
Sbjct: 2 KLVTHDGNFHYDEILATAV--LLKIYPDAEVIRTRVREVINTGDIVYDVGQTYDPSKYRY 59
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ F + + ++ +LSSAGL++KHF +L + + VY F
Sbjct: 60 DHHQNTFHDTYSPQYNIRLSSAGLIFKHFHDKLFER-YGFTRQSTIFEEIVEKVYFEFFL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
DAIDNG D + ++ V N+ ++E + + EN F+ +D
Sbjct: 119 PADAIDNGY-----DSVFGAIRARTVADVVKNFNV-YSENTMTRD-ENLRFETALDFVSM 171
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
+ + +++ + + + + D +G+I V + L L+EL E K++
Sbjct: 172 DLDNYLKYVLCEYALSYEHFYNVLK-----DFAGDIFVTDKKVATDL-LYELNE--KLQK 223
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
IK+++ ++D +R+ + V +F + PL +WRGL L + +GI GCVFVH S
Sbjct: 224 NIKFIIVKND--NDFRIITIPVERGKFAIKYPLHPKWRGLSGTNLDEVSGINGCVFVHAS 281
Query: 324 GFIGGNQSYGGALAMARAALK 344
GF GGN + GA+ M R +L+
Sbjct: 282 GFTGGNSTLEGAVEMCRKSLE 302
>gi|392556420|ref|ZP_10303557.1| metal-dependent protein hydrolase [Pseudoalteromonas undina NCIMB
2128]
Length = 289
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 104/324 (32%), Positives = 152/324 (46%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG+FH D+ +++ F + ++R+RD +++D D V+DVGG YDP +
Sbjct: 4 KTIVTHNGNFHADDVFSIAALKII--FPSFTLIRTRDNAIINDADVVIDVGGQYDPETNR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GLV+K +G L + + + DV AV +
Sbjct: 62 FDHHQRGGAGARENGI--PFSSFGLVWKKYG-------LAICDDNQDVAN---AVDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + AF Q ++ A
Sbjct: 110 STIDAIDCG-------HVEGVATGISLSQTISMFNPTWEEESDF----DAAFDQAVEFAH 158
Query: 203 KEFLDTVRFYV--RSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+RF R L A+ IV + I + D ++VL+R+ PWK + L +E
Sbjct: 159 TLL---IRFVASARGGLNAKKIVAQAIKQADD----PRLIVLERYTPWKKTVHSLSDEAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+V+Y GK W VQ V FE RK LPA W GL DDE GI +F
Sbjct: 212 ------FVVYPSHSGK-WIVQTVPAELGSFEDRKSLPAPWAGLSDDEFQAVTGIDDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G +S+ +++A AL+
Sbjct: 265 HNGLFIAGAESFDSIMSLAAMALE 288
>gi|357469187|ref|XP_003604878.1| hypothetical protein MTR_4g020980 [Medicago truncatula]
gi|355505933|gb|AES87075.1| hypothetical protein MTR_4g020980 [Medicago truncatula]
Length = 117
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 72/109 (66%), Positives = 85/109 (77%), Gaps = 1/109 (0%)
Query: 61 KVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKE 120
+VL+ LDAVLDVGGVYDPS D YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE+IA E
Sbjct: 4 EVLEGLDAVLDVGGVYDPSRDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIANE 63
Query: 121 LNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNL 169
L VDE H +VH ++LAVY++FME + A + T+ Y++ T L
Sbjct: 64 LKVDEEHQNVHYIYLAVYRSFMEYL-AWSKKFCAFKTESLAWYIDITIL 111
>gi|218288284|ref|ZP_03492583.1| metal-dependent protein hydrolase [Alicyclobacillus acidocaldarius
LAA1]
gi|218241643|gb|EED08816.1| metal-dependent protein hydrolase [Alicyclobacillus acidocaldarius
LAA1]
Length = 295
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 106/320 (33%), Positives = 158/320 (49%), Gaps = 33/320 (10%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G FH DE ++R + +A+IVR+R+ VL D V+DVGG Y
Sbjct: 2 KIGTHHGKFHADEVFAVAILRKI--YPDARIVRTRNKSVLAQCDLVVDVGG------GPY 53
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHH + ++AGL+++ FG + + L V E D ++ + +
Sbjct: 54 DHHSV---QKVHRPNGIPYAAAGLIWRDFGDRFL-EALGV-EREEDRAQVCSNIDDKLFQ 108
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
AIDAIDNGI+ R + +S VG N W S E EN AF++ +D A +
Sbjct: 109 AIDAIDNGIDL------ERDMRIKGISELVGSFNPPWN----SQEDENRAFERALDFATQ 158
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
++ + S L A IV A R + P+ ++VL CPW L E++ ++
Sbjct: 159 ILMNYANHEI-SRLQATDIVKAAYAARKE--PA--LLVLPTCCPWTETLLEIDPTGEV-- 211
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
YV + D G Q+R+Q V P FE+RKPLP++W G +EL G+ VF H +
Sbjct: 212 --LYVAFPDKTG-QYRLQVVPKGPGTFEARKPLPSEWAGKEGEELVSICGVEDAVFCHPA 268
Query: 324 GFIGGNQSYGGALAMARAAL 343
FI G ++ G L MA AL
Sbjct: 269 RFIAGAETLDGILQMAEEAL 288
>gi|258511362|ref|YP_003184796.1| metal-dependent protein hydrolase [Alicyclobacillus acidocaldarius
subsp. acidocaldarius DSM 446]
gi|257478088|gb|ACV58407.1| metal-dependent protein hydrolase [Alicyclobacillus acidocaldarius
subsp. acidocaldarius DSM 446]
Length = 295
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 157/320 (49%), Gaps = 33/320 (10%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTH+G FH DE ++R + +A+IVR+R+ VL D V+DVGG Y
Sbjct: 2 KIGTHHGKFHADEVFAVAILR--KLYPDARIVRTRNKSVLAQCDLVVDVGG------GPY 53
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHH + ++AGL+++ FG + + L V E D + + +
Sbjct: 54 DHHSV---QKVHRPNGIPYAAAGLIWRDFGDRFL-EALGV-EREEDRALVSSNIDDKLFQ 108
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
AIDAIDNGI+ R + +S VG N W S E EN AF++ +D A +
Sbjct: 109 AIDAIDNGIDLE------RDMRIKGISELVGSFNPPWN----SQEDENRAFERALDFATQ 158
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
++ + S L A IV A R + P+ ++VL CPW L E++ ++
Sbjct: 159 ILMNYANHEI-SRLQATEIVKAAYAARKE--PA--LLVLPTCCPWTETLLEIDPAGEV-- 211
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
YV + D G Q+R+Q V P FE+RKPLP++W G +EL G+ VF H +
Sbjct: 212 --LYVAFPDKTG-QYRLQVVPKGPGTFEARKPLPSEWAGKEGEELVSICGVEDAVFCHPA 268
Query: 324 GFIGGNQSYGGALAMARAAL 343
FI G +S G L MA AL
Sbjct: 269 RFIAGAESLDGILQMAEEAL 288
>gi|152996754|ref|YP_001341589.1| metal-dependent protein hydrolase [Marinomonas sp. MWYL1]
gi|150837678|gb|ABR71654.1| metal-dependent protein hydrolase [Marinomonas sp. MWYL1]
Length = 291
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 104/329 (31%), Positives = 156/329 (47%), Gaps = 43/329 (13%)
Query: 17 PSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVY 76
P QTP+ + THNG+FH D+ + L F + +++R+RD V+ D VLDVGG+Y
Sbjct: 2 PEQTPV--IATHNGNFHADDVFA--VAALKHIFSSIELIRTRDLDVIAKADMVLDVGGIY 57
Query: 77 DPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLA 136
D + +DHHQKG +G SS GLV++ +G E+ G+ ++ +
Sbjct: 58 DADTNRFDHHQKGGAGARENGI--PFSSFGLVWQKYGVEICG-------GNEEIAS---S 105
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+ KN + AIDAID G +LS + N W E + F +
Sbjct: 106 LDKNLVSAIDAIDCG-------HVEGVQTGISLSQTISMFNPTWQEESDF----DACFDE 154
Query: 197 GMDLAGKEFLDTVRFYVRS--WLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
+ A + RF + + A++IV + I + DP ++VL+++ PWK +
Sbjct: 155 AVAFASRIL---TRFIAAASGGVNAKAIVADAI--KKAADP--RVIVLEQYTPWKTTVLR 207
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
L +E +++Y G QWR+Q V V FE RK LPA W GL D EL G+
Sbjct: 208 LSKEAL------FMVYPSQTG-QWRIQTVPVELGSFEDRKKLPAPWAGLSDKELQDVTGL 260
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+F H FI G S+ + MA AL
Sbjct: 261 DDAMFCHNGLFIAGCASFENTMKMAEMAL 289
>gi|71282394|ref|YP_268888.1| hypothetical protein CPS_2163 [Colwellia psychrerythraea 34H]
gi|71148134|gb|AAZ28607.1| conserved hypothetical protein [Colwellia psychrerythraea 34H]
Length = 289
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 153/324 (47%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG+FH D+ + L + F + +++R+RD +V+ D V+DVGG YD
Sbjct: 4 KVIATHNGNFHADDVFS--IAALKNIFPSFKLIRTRDLEVIGKADIVIDVGGEYDADAGR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G L + +G+ +V +V +
Sbjct: 62 FDHHQRGGAGERENGI--PYSSFGLIWQKYG-------LAICQGNQEVAN---SVDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + F + +D A
Sbjct: 110 STIDAIDCG-------HVEGVAQGISLSQTISMFNPTWQEDSHF----DTCFDEAVDFAS 158
Query: 203 KEFLDTVRFYVRS--WLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+ RF + + A+ IV + I D ++VL+++ PWK + L EE
Sbjct: 159 RVL---TRFIASASGGINAKDIVAKAI----DNAEDSRVIVLEKYTPWKRTVHALSEEAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
Y++Y G QWR+Q V V P FE RKPLP QW GL DD L + GI +F
Sbjct: 212 ------YMVYPSPSG-QWRIQTVPVEPGSFEDRKPLPKQWAGLSDDALKEVTGIDDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G +S+ + MA AL+
Sbjct: 265 HNGLFIAGAESFESTMKMASIALQ 288
>gi|410612258|ref|ZP_11323338.1| metal-dependent protein hydrolase [Glaciecola psychrophila 170]
gi|410168259|dbj|GAC37227.1| metal-dependent protein hydrolase [Glaciecola psychrophila 170]
Length = 289
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 151/324 (46%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG+FH D+ + L F + ++R+RD +++ D V+DVGG YDP N
Sbjct: 4 KTIVTHNGNFHADDVFS--VAALKSIFSSFTLIRTRDLEIIAKADIVIDVGGEYDPENGR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G E+ G+ DV A+ +
Sbjct: 62 FDHHQRGGAGERDNGI--PYSSLGLIWQKYGLEIC-------NGNQDVAN---ALDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + F + +D A
Sbjct: 110 STIDAIDCG-------HVKGVSEGISLSQTISMFNPTWQEDSHF----DTCFDEAVDFAS 158
Query: 203 KEFLDTVRFYVRS--WLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+ RF + + A++IV I D ++VL+++ PWK + L ++
Sbjct: 159 RVL---TRFIAAANGGISAKTIVATAIENAQD----PRVIVLEKYTPWKRTVHALSDKAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
Y++Y G QWR+Q V V P FE R+PLP +W GL D L GI +F
Sbjct: 212 ------YMIYPSQTG-QWRIQTVPVEPGSFEDRRPLPKEWAGLSDKALQDVTGIDDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G++S+ + MA AL+
Sbjct: 265 HNGLFIAGSESFESTMKMAAMALQ 288
>gi|333907716|ref|YP_004481302.1| metal-dependent protein hydrolase [Marinomonas posidonica
IVIA-Po-181]
gi|333477722|gb|AEF54383.1| metal-dependent protein hydrolase [Marinomonas posidonica
IVIA-Po-181]
Length = 291
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 152/325 (46%), Gaps = 43/325 (13%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNA-QIVRSRDPKVLDDLDAVLDVGGVYDPSN 80
+K + THNG+FH D+ F + K F A +++R+RD +++ D VLDVGG+YD
Sbjct: 5 VKVIATHNGNFHADDV---FAVAALKKIFPAVELIRTRDLEIIAKADIVLDVGGIYDAEK 61
Query: 81 DCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN 140
D +DHHQKG +G SS GL+++ +G E+ A V + ++ KN
Sbjct: 62 DRFDHHQKGGAGARENGIP--FSSFGLIWQKYGLEICAANQEVAD----------SLDKN 109
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+ IDAID G +LS + N W E N F + +
Sbjct: 110 LVSTIDAIDCG-------HVEGVQTGISLSQTISMFNPTWQEEGDF----NACFDEAVAF 158
Query: 201 AGKEFLDTVRFYVRS--WLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
A + RF + + A++IV I + D ++VL+++ PWK + L ++
Sbjct: 159 ASRIL---TRFIAAASGGVNAKAIVANAIEKAED----PRVIVLEQYTPWKTTVLRLSQD 211
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+++Y G QWR+Q V V FE RK LPA W GL D EL + G+ +
Sbjct: 212 AL------FMVYPSQTG-QWRIQTVPVELGSFEDRKKLPAPWAGLSDKELQEVTGLEDAM 264
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
F H FI G S+ + MA AL
Sbjct: 265 FCHNGLFIAGCASFENTIKMAEMAL 289
>gi|87122071|ref|ZP_01077955.1| hypothetical protein MED121_03978 [Marinomonas sp. MED121]
gi|86162618|gb|EAQ63899.1| hypothetical protein MED121_03978 [Marinomonas sp. MED121]
Length = 289
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 153/322 (47%), Gaps = 41/322 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNG+FH D+ ++ FN +VR+RD +V+ D VLDVGG+YD +D
Sbjct: 6 IATHNGNFHADDVFSVAALKTIFSSFN--LVRTRDLEVIKQADIVLDVGGIYDADAGRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GL++K +G E+ G+ +V +V +
Sbjct: 64 HHQRGGAGERENGI--PYSSFGLIWKKYGVEICG-------GNKEVAH---SVDTGLVSV 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDA+D G +LS + N W E + F++ + A +
Sbjct: 112 IDAVDCG-------HVEGVSKGISLSQTISMFNPTWQEESDY----DACFEEAVSFASRV 160
Query: 205 FLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
RF + + A+ IV + I E+ + DP ++VL+++ PWK + L EE
Sbjct: 161 L---TRFIAAATGGISAKEIVAKAI-EKAE-DP--RLIVLEQYTPWKTTVHRLSEEAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
YV+Y D G +WR+Q V V FE RK LP+ W GL EL + GI +F H
Sbjct: 212 ----YVVYPSDTG-EWRIQTVPVELGSFEDRKSLPSPWAGLAGKELQEVTGIEDAMFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAALK 344
FIGG QS+ + MA ALK
Sbjct: 267 GCFIGGAQSFESVMKMADMALK 288
>gi|402468339|gb|EJW03508.1| hypothetical protein EDEG_02164 [Edhazardia aedis USNM 41457]
Length = 308
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 96/331 (29%), Positives = 166/331 (50%), Gaps = 41/331 (12%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFF-NAQIVRSRDPKVLDDLD------AVLDVGGVYDPS 79
THNG FH DE L C ++ +K + N+ ++R+RD K + L AV DV +D S
Sbjct: 5 THNGKFHLDEILACVIL---EKLYPNSTLLRTRDRKEIKRLVDENKHVAVFDVYDQFDHS 61
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL-NVDEGHPDVHRLFLAVY 138
YDHHQ+ F + F + KLSSAGL++K++GK+ I +++ + L++ +Y
Sbjct: 62 LRLYDHHQRCFNDTFSSDYDVKLSSAGLIFKYYGKQFILAFFSDIELSSEILEYLYIKIY 121
Query: 139 KNFMEAIDAIDNGIN---QYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
+ + DAIDNGI+ +Y P V+N S+E+ E F
Sbjct: 122 EEYFLYEDAIDNGIDVGQKYKIRSLPDMVDNM---------------YKGSSEKSEEIFF 166
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVE--CIAERYDYDPSGEIMVLKRFCPWKLHLF 253
+F Y+ S +++ E I ++ D + + + C +H
Sbjct: 167 NAKKFVRNDFY----LYLNSKRKDIALISELRAIVKKTDKNDFFIYVGENKNCSSIIHFL 222
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
E K + K+++ +++ +Q+RV A+A++ F+++ PL +WRGLR++EL + +G
Sbjct: 223 E----KKYDRDFKFIIQKEE--EQFRVYAIAITALSFKTKCPLQEKWRGLRNEELQRISG 276
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
I +FVH +GF+G +SY A+ M + +
Sbjct: 277 IDDALFVHSTGFLGITKSYDNAVKMCELSYR 307
>gi|397582277|gb|EJK52230.1| hypothetical protein THAOC_28524 [Thalassiosira oceanica]
Length = 150
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 71/147 (48%), Positives = 96/147 (65%), Gaps = 2/147 (1%)
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELEEE 258
+ G++F+ + V S LPAR IV E + R++ D SGEI+ L PWK ++ELE +
Sbjct: 1 MCGEDFMSVLTKIVESDLPARGIVEEAVNTRFECDKSGEIIKLPSGGLPWKSSVYELEHQ 60
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
K++ IKY LY D G WR+Q V+V FE+R LPA WRG+RD++LS +GI GC
Sbjct: 61 YKVDVPIKYCLYTDQAG-MWRIQCVSVEGKAFENRLSLPAAWRGVRDEDLSSISGIEGCT 119
Query: 319 FVHMSGFIGGNQSYGGALAMARAALKL 345
F H +GFIGGN+ Y G L MA+AALK+
Sbjct: 120 FCHAAGFIGGNKCYEGVLKMAQAALKI 146
>gi|429963886|gb|ELA45884.1| hypothetical protein VCUG_02630 [Vavraia culicis 'floridensis']
Length = 344
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 173/364 (47%), Gaps = 68/364 (18%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++ THN FH DE +++ F +A + R+RDPK ++ D V DV GVYDP+ Y
Sbjct: 2 KLATHNQRFHLDEVTSTAILKKI--FPDATLKRTRDPKDFEEADIVYDVSGVYDPTRGRY 59
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE------LIAKELNVDEGHPDVHRLFLAV 137
DHHQ+GF F F KLSSAGL+YKH+ K+ L+A++ VDE V
Sbjct: 60 DHHQRGFTHTFSEAFPIKLSSAGLIYKHYHKQLFKYYGLVAEDWIVDE-----------V 108
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
Y+ + + +DA DNG++ T P V +L S E ++ + +NE G
Sbjct: 109 YEEYFKYVDACDNGVDLQCTIVPRTMV---DLVSCFNVQETGVKEDTKNGQMKNEV--GG 163
Query: 198 MDLAGKE----------FLDTVRFYVRS-------WLPARSIV--------------VEC 226
AG DT++ +V S +L A +++ VE
Sbjct: 164 ATTAGSSARVHSSCTVPLSDTLKDFVLSDNYNDYNFLCALNLISIDMDNYFRQKKQLVEM 223
Query: 227 IAERYDY--DPSGEIMVLKRFCP----WKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRV 280
++ +D +G+I+V+ P + LF LE+ K + L +++Y D + R+
Sbjct: 224 VSANHDLIRSATGDILVIG---PDRDLSREALFILEKIWKKDFL--FIVY--DSSEHVRM 276
Query: 281 QAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMAR 340
+ + + PL +W GLR++EL +GI G FVH SGF GG + AL M R
Sbjct: 277 YGTIKERNSYAIKVPLCLEWGGLREEELRARSGIEGSTFVHSSGFTGGAKDLKSALEMCR 336
Query: 341 AALK 344
++K
Sbjct: 337 RSIK 340
>gi|114776504|ref|ZP_01451549.1| hypothetical protein SPV1_02252 [Mariprofundus ferrooxydans PV-1]
gi|114553334|gb|EAU55732.1| hypothetical protein SPV1_02252 [Mariprofundus ferrooxydans PV-1]
Length = 290
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 100/324 (30%), Positives = 148/324 (45%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG+FH D+ I + F +VR+RD ++ D V+DVG YDP
Sbjct: 4 KTIATHNGNFHADDVFSVAAIGMILPSFT--LVRTRDSALIAKADIVIDVGLEYDPERGR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G ++ +G DV AV +
Sbjct: 62 FDHHQRGGAGERENGI--PYSSFGLIWQKYGADIC-------QGDQDVAN---AVDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDA+D G +LS +G N W E + F + ++ A
Sbjct: 110 STIDAVDCG-------HVEGVQQGISLSHTIGMFNPTWQEDSHV----DACFDEAVEFAS 158
Query: 203 KEFLDTVRFYV--RSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+ RF R + A++IV + I + D ++VL+++ PWK + L EE
Sbjct: 159 RVL---TRFIAAARGGISAKAIVAKAIDDAAD----PRVIVLEKYVPWKRTVHALSEEAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
YV+Y G +WR+Q V V P FE RK LP W GL EL + GI +F
Sbjct: 212 ------YVVYPSQTG-EWRIQTVPVEPGSFEDRKSLPQAWAGLSGKELQELTGIDDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G +S+ + MA AL+
Sbjct: 265 HNGLFIAGAESFESTMNMASMALE 288
>gi|410643593|ref|ZP_11354089.1| hypothetical protein GCHA_4358 [Glaciecola chathamensis S18K6]
gi|410137003|dbj|GAC12276.1| hypothetical protein GCHA_4358 [Glaciecola chathamensis S18K6]
Length = 289
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 147/324 (45%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K V TH+GSFH D+ ++ F ++R+RD + D V+DVGG +DP+ D
Sbjct: 4 KIVVTHSGSFHADDVFSVAALKQILPSFT--LIRTRDKDTIAKADVVIDVGGEHDPATDR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G + + V A+ +
Sbjct: 62 FDHHQRGGAGERDNGI--PYSSFGLIWQKYGLAICENDQGVAN----------ALDSGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + F + ++ A
Sbjct: 110 STIDAIDCG-------HVKGVSEGISLSQTISMFNPTWQEDSDF----DSGFNEAVEFAS 158
Query: 203 KEFLDTVRFYV--RSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+ RF + + A++IV + I D ++VL+++ PWK + L E+
Sbjct: 159 RVL---TRFIAGAKGSINAKAIVADAIKNAQDE----RVIVLEKYTPWKKTVHTLSEKAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
YV+Y + +QW +QAV V P FE ++PLP W GL D+ L E GI +F
Sbjct: 212 ------YVIYPS-QSEQWMIQAVPVEPGSFEDKRPLPKAWAGLSDEALQDETGIEDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G S+ + MA ALK
Sbjct: 265 HNGLFIAGTASFESTMKMAALALK 288
>gi|332306016|ref|YP_004433867.1| metal-dependent protein hydrolase [Glaciecola sp. 4H-3-7+YE-5]
gi|332173345|gb|AEE22599.1| metal-dependent protein hydrolase [Glaciecola sp. 4H-3-7+YE-5]
Length = 289
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 147/324 (45%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K V TH+GSFH D+ ++ F ++R+RD + D V+DVGG +DP+ D
Sbjct: 4 KIVVTHSGSFHADDVFSVAALKQILPSFT--LIRTRDKDTIAKADVVIDVGGEHDPATDR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G + + V A+ +
Sbjct: 62 FDHHQRGGAGERENGI--PYSSFGLIWQKYGLAICENDQGVAN----------ALDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + F + ++ A
Sbjct: 110 STIDAIDCG-------HVKGVSEGISLSQTISMFNPTWQEDSDF----DSGFNEAVEFAS 158
Query: 203 KEFLDTVRFYV--RSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+ RF + + A++IV + I D ++VL+++ PWK + L E+
Sbjct: 159 RVL---TRFIAGAKGSINAKAIVADAIKNAQDE----RVIVLEKYTPWKKTVHALSEKAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
YV+Y + +QW +QAV V P FE ++PLP W GL D+ L E GI +F
Sbjct: 212 ------YVIYPS-QSEQWMIQAVPVEPGSFEDKRPLPKAWAGLSDEALQDETGIEDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G S+ + MA ALK
Sbjct: 265 HNGLFIAGTASFESTMKMAALALK 288
>gi|449679924|ref|XP_004209453.1| PREDICTED: UPF0160 protein C27H6.8-like, partial [Hydra
magnipapillata]
Length = 155
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 99/154 (64%), Gaps = 6/154 (3%)
Query: 13 STSSPSQ--TPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL 70
S++SP + +GTH+G+FHCDE C+M++ +F NA I+R+RD L V+
Sbjct: 2 SSASPKKQCADFLSIGTHDGTFHCDEVFACWMLKQLPRFKNASIIRTRDNAKLSQCSIVV 61
Query: 71 DVGGVYDPSNDCYDHHQKGF----EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEG 126
DVGGVYDP + +DHHQ+ F + + + TKLSSAGL+Y H GKE+I++ ++ +
Sbjct: 62 DVGGVYDPEKNKFDHHQRTFSGTMKSLGNMKWETKLSSAGLIYLHMGKEVISEITSLPQN 121
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKP 160
V RL+ +Y+ F+E IDA+DNGI+QY+ P
Sbjct: 122 DVIVSRLYEKLYEKFVEEIDAVDNGIDQYEGSHP 155
>gi|421056595|ref|ZP_15519512.1| metal-dependent protein hydrolase [Pelosinus fermentans B4]
gi|421059786|ref|ZP_15522344.1| metal-dependent protein hydrolase [Pelosinus fermentans B3]
gi|421064053|ref|ZP_15525972.1| metal-dependent protein hydrolase [Pelosinus fermentans A12]
gi|421069610|ref|ZP_15530771.1| metal-dependent protein hydrolase [Pelosinus fermentans A11]
gi|392437775|gb|EIW15637.1| metal-dependent protein hydrolase [Pelosinus fermentans B4]
gi|392449575|gb|EIW26673.1| metal-dependent protein hydrolase [Pelosinus fermentans A11]
gi|392458341|gb|EIW34886.1| metal-dependent protein hydrolase [Pelosinus fermentans B3]
gi|392461833|gb|EIW37983.1| metal-dependent protein hydrolase [Pelosinus fermentans A12]
Length = 299
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 159/326 (48%), Gaps = 31/326 (9%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
++ +K +GTH+G FH D+ + M RL + ++ R+RD +L LD V YD
Sbjct: 3 NEKNIKTIGTHSGKFHADDVMATAMFRLL--LGDIKVTRTRDENILRTLDLV------YD 54
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
S +DHHQ +E+ + ++ GL+++ FG +I K + D+ +F +V
Sbjct: 55 ISLGEFDHHQLN-KEIRENNIP--YAACGLIWREFGSRIIQK-FDSQLEENDIISIFDSV 110
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
KN ++ IDA DNGI+ + T++S + N W D +EAF++
Sbjct: 111 DKNLVQGIDATDNGID------IKSDIKVTSISDIIQNFNPTWDSNDSI----DEAFEEA 160
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ A E + + S + AR IV E R EIMVLK CPW L +++
Sbjct: 161 VQYAT-EVIKRIISRQVSVIKARIIVNEAFENR----TINEIMVLKNGCPWLQQLLKID- 214
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+ D+ ++++Q+V + D FE+RK + RG +E++ I
Sbjct: 215 ---VNSEILFVISPDETNAEYKIQSVKKNADTFEARKDILESIRGKSSEEINSIIKIDDA 271
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+F H +GFI +S AL +A+ ++
Sbjct: 272 IFCHKAGFIASTKSMESALKIAKLSV 297
>gi|392963033|ref|ZP_10328461.1| metal-dependent protein hydrolase [Pelosinus fermentans DSM 17108]
gi|392451708|gb|EIW28694.1| metal-dependent protein hydrolase [Pelosinus fermentans DSM 17108]
Length = 299
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 159/326 (48%), Gaps = 31/326 (9%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
++ +K +GTH+G FH D+ + M RL + ++ R+RD +L LD V YD
Sbjct: 3 NEKNIKTIGTHSGKFHADDVMATAMFRLL--LGDIKVTRTRDENILRTLDLV------YD 54
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
S +DHHQ +E+ + ++ GL+++ FG +I K + D+ +F +V
Sbjct: 55 ISLGEFDHHQLN-KEIRENNIP--YAACGLIWREFGSRIIQK-FDSQLEENDIISIFDSV 110
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
KN ++ IDA DNGI+ + T++S + N W D +EAF++
Sbjct: 111 DKNLVQGIDATDNGID------IKSDIKVTSISDIIQNFNPTWDSNDSI----DEAFEKA 160
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ A E + + S + AR IV E R EIMVLK CPW L +++
Sbjct: 161 VQYAT-EVIKRIISRQVSVIKARIIVNEAFENR----TINEIMVLKNGCPWLQQLLKID- 214
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ I +V+ D+ ++++Q+V + D FE+RK + RG +E++ I
Sbjct: 215 ---VNSEILFVISPDETNAEYKIQSVKKNADTFEARKDILESIRGKSSEEINSIIKIDDA 271
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
+F H +GFI +S AL +A+ ++
Sbjct: 272 IFCHKAGFIASTKSMESALKIAKLSV 297
>gi|256831811|ref|YP_003160538.1| metal-dependent protein hydrolase [Jonesia denitrificans DSM 20603]
gi|256685342|gb|ACV08235.1| metal-dependent protein hydrolase [Jonesia denitrificans DSM 20603]
Length = 287
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 155/322 (48%), Gaps = 41/322 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNG FH D+ G + LTD + +A IVR+RDP++LD D VLDVGGVYDP+ YD
Sbjct: 3 IATHNGKFHADDVFGVAL--LTDLYPDATIVRTRDPQMLDTADIVLDVGGVYDPTTHRYD 60
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ S+ +G++Y FG L+ ++ + D ++ + + A
Sbjct: 61 HHQQ----------SSGARPSGILYSAFG--LLWQDYGRTWCNND--DIWQKIDTRLVTA 106
Query: 145 IDAIDNGINQYD-TDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
IDA+DNG + Y TD R + LS +G N P E + F ++LA
Sbjct: 107 IDAVDNGQDLYTLTDYKIRPFD---LSEYLGLFN-----PITEDEDFDTQFTGAVELA-- 156
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
TV +R+ A E +Y P ++L+R+ P + +P
Sbjct: 157 ---RTVLHRLRAKYGAVIAAEEYFTTQYAASPDRRYVILERYVPHGAIATK-------QP 206
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
+ Y +Y G W +QAV +F +RK LPA WRGL +L+ G+ VF H +
Sbjct: 207 ELLYTVYPGATGN-WTIQAVRPDLAQFANRKDLPAAWRGLNGSDLAAVTGVTDAVFCHKA 265
Query: 324 GFIGGNQSYGGA---LAMARAA 342
GFI +S GA LA A AA
Sbjct: 266 GFICAAESREGAEQLLAQALAA 287
>gi|392536249|ref|ZP_10283386.1| hypothetical protein ParcA3_19742 [Pseudoalteromonas arctica A
37-1-2]
Length = 289
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 145/321 (45%), Gaps = 41/321 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V THNG+FH D+ +++ F ++VR+RD +++ D V+DVGG YDP + +D
Sbjct: 6 VVTHNGNFHADDVFSIAVLKHVLPTF--KLVRTRDKALIESADFVIDVGGEYDPETNRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GLV+K +G L D+ R V +
Sbjct: 64 HHQRGGAGERENGI--PFSSFGLVWKKYGLAL------CDDNQAVADR----VDSGLVST 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W E + F + ++ A +
Sbjct: 112 IDAIDCG-------HVEGVSKGISLSQTISMFNPTWEEESNF----DTCFDEAVEFAARM 160
Query: 205 FLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+RF + + A++IV + I D ++VLK++ PWK + L +
Sbjct: 161 L---IRFIASAHGSVNAKAIVAKAIENAED----ARVIVLKKYTPWKKTVHILSSDAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y++Y G QW +Q V V P FE RKPLP W GL D E GI VF H
Sbjct: 212 ----YMVYPSHSG-QWILQTVPVEPGSFEDRKPLPKAWSGLSDQAFVDETGIDDAVFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAAL 343
FI G +S+ + +A AL
Sbjct: 267 GLFIAGTKSFESTMKLATMAL 287
>gi|109898935|ref|YP_662190.1| metal-dependent protein hydrolase [Pseudoalteromonas atlantica T6c]
gi|109701216|gb|ABG41136.1| metal-dependent protein hydrolase [Pseudoalteromonas atlantica T6c]
Length = 293
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/322 (30%), Positives = 143/322 (44%), Gaps = 37/322 (11%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K V TH+G+FH D+ + L F ++R+RD + D V+DVGG +D + D
Sbjct: 8 KIVVTHSGNFHADDVFS--VAALKSVFPTFTLIRTRDKDTIAKADVVIDVGGEHDAATDR 65
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G + D L A+ +
Sbjct: 66 FDHHQRGGAGERDNG--VPYSSFGLIWQKYGLTICG----------DNQELANAIDAGLV 113
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + F + +D A
Sbjct: 114 STIDAIDCG-------HVKGVAEGISLSQTISMFNPTWQEEGDF----DAGFNEAVDFAA 162
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ + L A++IV E I + DP ++VL+++ PWK + L E+
Sbjct: 163 RVLARAIA-AASGGLNAKAIVAEAI--KNAQDP--RVIVLEKYTPWKKTVHALSEQAL-- 215
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y++Y G QW +Q V V P FE RK LP W GL D EL E GI +F H
Sbjct: 216 ----YMVYPSQSG-QWMLQTVPVEPGSFEDRKSLPKPWAGLSDAELQTETGIKDAMFCHN 270
Query: 323 SGFIGGNQSYGGALAMARAALK 344
FI G S+ + MA AL+
Sbjct: 271 GLFIAGTASFESTMKMAELALQ 292
>gi|410619613|ref|ZP_11330507.1| metal-dependent protein hydrolase [Glaciecola polaris LMG 21857]
gi|410160745|dbj|GAC34645.1| metal-dependent protein hydrolase [Glaciecola polaris LMG 21857]
Length = 289
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 149/324 (45%), Gaps = 41/324 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K V TH+G+FH D+ ++ F + ++R+RD +++ D V+DVGG YDP D
Sbjct: 4 KIVVTHSGNFHADDVFSVAALKRI--FPSFTLIRTRDLEIIAKADIVIDVGGQYDPETDR 61
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G + N L A+ +
Sbjct: 62 FDHHQRGGAGERENGI--PYSSFGLIWQKYGLAICENNQN----------LANALDAGLV 109
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDA+D G +LS + N W E + F + ++ A
Sbjct: 110 STIDAVDCG-------HVKGVAEGISLSQTISMFNPTWQEKSDF----DSGFDEAVEFAA 158
Query: 203 KEFLDTVRFYVRS--WLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+ RF + L A++IV + I + DP ++VL+++ PWK + L ++
Sbjct: 159 RVL---TRFIAAASGGLNAKAIVADAI--KNAQDP--RVIVLEKYTPWKKTVHALSDKAL 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
Y++Y G QW +Q V V P FE R+PLP +W GL D L E GI +F
Sbjct: 212 ------YMIYPSQSG-QWMIQTVPVEPGSFEDRRPLPKEWAGLSDVALQDETGIDDAMFC 264
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI G S+ + MA AL+
Sbjct: 265 HNGLFIAGAASFASTMKMAALALQ 288
>gi|332532493|ref|ZP_08408371.1| hypothetical protein PH505_af01080 [Pseudoalteromonas haloplanktis
ANT/505]
gi|332038136|gb|EGI74583.1| hypothetical protein PH505_af01080 [Pseudoalteromonas haloplanktis
ANT/505]
Length = 289
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 144/321 (44%), Gaps = 41/321 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V THNG+FH D+ +++ F ++VR+RD +++ D V+DVGG YDP + +D
Sbjct: 6 VVTHNGNFHADDVFSIAVLKHVLPTF--KLVRTRDKALIESADYVIDVGGEYDPETNRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GLV+K +G L D+ R V +
Sbjct: 64 HHQRGGAGERENGI--PFSSFGLVWKKYGLAL------CDDNQAVADR----VDSGLVST 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W E + F + ++ A +
Sbjct: 112 IDAIDCG-------HVEGVSKGISLSQTISMFNPTWEEESNF----DTCFDEAVEFAARM 160
Query: 205 FLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+RF + + A+ IV + I D ++VL+++ PWK + L E
Sbjct: 161 L---IRFIASAHGSVNAKEIVAKAIENAED----ARVIVLEKYTPWKKTVHILSSEAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y++Y G QW +Q V V P FE RKPLP W GL D E GI VF H
Sbjct: 212 ----YMVYPSHSG-QWILQTVPVEPGSFEDRKPLPKAWSGLSDQAFVDETGIADAVFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAAL 343
FI G +S+ + +A AL
Sbjct: 267 GLFIAGTKSFESTMKLAAMAL 287
>gi|449512690|ref|XP_002189426.2| PREDICTED: uncharacterized protein LOC100230912 [Taeniopygia
guttata]
Length = 323
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 71/143 (49%), Positives = 91/143 (63%), Gaps = 7/143 (4%)
Query: 51 NAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVF-----GHGFSTKLSSA 105
+A+IVR+RDP++L D ++DVGG YDP YDHHQ+ F E +STKLSSA
Sbjct: 149 DAEIVRTRDPQLLAACDVLVDVGGEYDPGRHRYDHHQRSFAESMRSLRPDKPWSTKLSSA 208
Query: 106 GLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVN 165
GLVY HFG +++A L E P V LF +Y+NF+E IDA+DNGI + PRY
Sbjct: 209 GLVYGHFGPQILAALLGQPEHGPVVTALFDKLYENFVEEIDAMDNGIAPAAGE--PRYAL 266
Query: 166 NTNLSSRVGKLNLDWTEPDQSAE 188
+T LS+RVG LN W +PDQ E
Sbjct: 267 STTLSARVGHLNPRWNDPDQDTE 289
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 25/37 (67%)
Query: 98 FSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
+STKLSSAGLVY HFG +++A L E P V LF
Sbjct: 10 WSTKLSSAGLVYGHFGPQILAALLGQPEHGPVVTALF 46
>gi|269796281|ref|YP_003315736.1| hypothetical protein Sked_30010 [Sanguibacter keddieii DSM 10542]
gi|269098466|gb|ACZ22902.1| uncharacterized conserved protein [Sanguibacter keddieii DSM 10542]
Length = 287
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 152/323 (47%), Gaps = 44/323 (13%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNG FH D+ G + LT F +A++VR+RDP+VL D VLDVGGVYD +D
Sbjct: 3 IATHNGKFHADDVFGVAL--LTQLFPDAEVVRTRDPEVLATADVVLDVGGVYDVETRRFD 60
Query: 85 HHQKGFEEVFGHGFSTKLSSA-GLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
HHQ G + L SA GL+++H+GKE + ++ + ++
Sbjct: 61 HHQLSS----GARDNGILYSAFGLLWQHYGKEYCGDD-----------AVWQKIDSRLVQ 105
Query: 144 AIDAIDNGINQY---DTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
AIDA+DNG + Y D P ++S +G N P E + FQ + L
Sbjct: 106 AIDAVDNGQDLYAVSDFGAAP-----LDVSEVLGLFN-----PLGDDEDFDSQFQVAVAL 155
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
A L R + + A ++ Y P +VL++F P H ++ K
Sbjct: 156 A-TTLLQRYRLKYETSIAAERTFLDA----YAASPKKTYVVLEKFVP---HGGIATKQSK 207
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+ + ++ G W +Q V +F SRK LP WRGL EL++ G+P VF
Sbjct: 208 L----LFTVFPGATG-HWTIQTVRPESSQFGSRKELPESWRGLNGAELAEVTGVPDAVFC 262
Query: 321 HMSGFIGGNQSYGGALAMARAAL 343
H +GFI ++ GAL M R AL
Sbjct: 263 HKAGFIAAAETREGALEMLRQAL 285
>gi|359442072|ref|ZP_09231952.1| hypothetical protein P20429_2326 [Pseudoalteromonas sp. BSi20429]
gi|358036084|dbj|GAA68201.1| hypothetical protein P20429_2326 [Pseudoalteromonas sp. BSi20429]
Length = 289
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 145/321 (45%), Gaps = 41/321 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V THNG+FH D+ +++ F ++VR+RD +++ D V+DVGG YDP + +D
Sbjct: 6 VVTHNGNFHADDVFSIAVLKHVLPTF--KLVRTRDKALIESADFVIDVGGEYDPETNRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GLV+K +G L D+ R V +
Sbjct: 64 HHQRGGAGERENGI--PFSSFGLVWKKYGLAL------CDDNQAVADR----VDSGLVST 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W E + F + ++ A +
Sbjct: 112 IDAIDCG-------HVEGVSKGISLSQTISMFNPTWEEESNF----DTCFDEAVEFAARM 160
Query: 205 FLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+RF + + A++IV + I D ++VL+++ PWK + L +
Sbjct: 161 L---IRFIASAHGSVNAKAIVAKAIENAED----ARVIVLEKYTPWKKTVHILSSDAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y++Y G QW +Q V V P FE RKPLP W GL D E GI VF H
Sbjct: 212 ----YMVYPSHSG-QWILQTVPVEPGSFEDRKPLPKAWSGLSDQAFVDETGIDDAVFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAAL 343
FI G +S+ + +A AL
Sbjct: 267 GLFIAGTKSFESTMKLATMAL 287
>gi|359435103|ref|ZP_09225332.1| hypothetical protein P20652_3462 [Pseudoalteromonas sp. BSi20652]
gi|357918240|dbj|GAA61581.1| hypothetical protein P20652_3462 [Pseudoalteromonas sp. BSi20652]
Length = 289
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 144/321 (44%), Gaps = 41/321 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V THNG+FH D+ +++ F ++VR+RD +++ D V+DVGG YDP + +D
Sbjct: 6 VVTHNGNFHADDVFSIAVLKHVLPSF--KLVRTRDKALIESADFVIDVGGEYDPQTNRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GLV+K +G L D+ R V +
Sbjct: 64 HHQRGGAGERENGI--PFSSFGLVWKKYGLAL------CDDNQAVADR----VDSGLVST 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W E + F + ++ A +
Sbjct: 112 IDAIDCG-------HVEGVSKGISLSQTISMFNPTWEEESNF----DTCFDEAVEFAARM 160
Query: 205 FLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
VRF + + A+ IV + I D ++VL+++ PWK + L +
Sbjct: 161 L---VRFIASAHGSVNAKEIVAKAIENAED----ARVIVLEKYTPWKKTVHILSSDAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y++Y G QW +Q V V P FE RKPLP W GL D E GI VF H
Sbjct: 212 ----YMVYPSHSG-QWILQTVPVEPGSFEDRKPLPKAWSGLSDQAFVDETGIDDAVFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAAL 343
FI G +S+ + +A AL
Sbjct: 267 GLFIAGTKSFESTMKLATMAL 287
>gi|414071904|ref|ZP_11407862.1| hypothetical protein D172_3094 [Pseudoalteromonas sp. Bsw20308]
gi|410805667|gb|EKS11675.1| hypothetical protein D172_3094 [Pseudoalteromonas sp. Bsw20308]
Length = 289
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/321 (30%), Positives = 145/321 (45%), Gaps = 41/321 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V THNG+FH D+ +++ F ++VR+RD ++++ D V+DVGG YDP + +D
Sbjct: 6 VVTHNGNFHADDVFSIAVLKHVLPAF--KLVRTRDKALIENADFVIDVGGEYDPETNRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GL++K +G L D+ R V +
Sbjct: 64 HHQRGGAGERENGI--PFSSFGLIWKKYGLAL------CDDNQAVADR----VDSGLVST 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W E + F + ++ A +
Sbjct: 112 IDAIDCG-------HVEGVSKGISLSQTISMFNPTWEEESNF----DTCFDEAVEFAARM 160
Query: 205 FLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+RF + + A+ IV + I D ++VL+++ PWK + L +
Sbjct: 161 L---IRFIASAHGSVNAKEIVAKAIENAED----ARVIVLEKYTPWKKTVHILSSDAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y++Y G QW +Q V V P FE RKPLP W GL D E GI VF H
Sbjct: 212 ----YMVYPSHSG-QWILQTVPVEPGSFEDRKPLPKAWSGLSDQAFVDETGIDDAVFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAAL 343
FI G +S+ + +A AL
Sbjct: 267 GLFIAGTKSFESTMKLATMAL 287
>gi|421078531|ref|ZP_15539484.1| metal-dependent protein hydrolase [Pelosinus fermentans JBW45]
gi|392523382|gb|EIW46555.1| metal-dependent protein hydrolase [Pelosinus fermentans JBW45]
Length = 301
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 158/327 (48%), Gaps = 29/327 (8%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYD 77
++ +K +GTH+G FH D+ + ++RL + ++ R+RD +L LD V YD
Sbjct: 3 NEKNIKTLGTHSGKFHADDVMATAILRLL--LGDIKVTRTRDENILRKLDFV------YD 54
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
S +DHHQ +E+ + ++ GLV++ FG +I K N D+ +F V
Sbjct: 55 ISLGEFDHHQLN-KEIRENNIP--YAACGLVWREFGSRIIQK-FNSQLEENDIISIFDYV 110
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
KN ++ IDA DNGI+ + T++S + N W D +EAF++
Sbjct: 111 DKNLVQGIDATDNGID------IKSEIKVTSISDIIQSFNPTWDSNDSI----DEAFEEA 160
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ A E + + S + AR IV E R EIMVLK CPW L +++
Sbjct: 161 VQYAT-EVIKRIISRQVSVIKARIIVNEAFQNR----NINEIMVLKTGCPWLQQLLKID- 214
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ E L ++D ++++Q V + D FE+RK + RG +E++ I
Sbjct: 215 -LNNEVLFVISPDDNDNNTEYKIQTVKKTVDTFEARKDILESIRGKSSEEINSIIKIDDA 273
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALK 344
+F H +GFI +S AL +A+ ++K
Sbjct: 274 IFCHKAGFIASTKSMESALKIAKLSVK 300
>gi|338733423|ref|YP_004671896.1| hypothetical protein SNE_A15280 [Simkania negevensis Z]
gi|336482806|emb|CCB89405.1| UPF0160 protein TC_0665 [Simkania negevensis Z]
Length = 294
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 98/325 (30%), Positives = 156/325 (48%), Gaps = 45/325 (13%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ GTHNGSFH DE C ++ L D +++R+RD VL D V DVGG+Y+P+
Sbjct: 8 RSFGTHNGSFHADEVTACALLILFDHIDLDKVIRTRDLHVLRTCDYVCDVGGMYEPTIRR 67
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ + HG LSSAG++ K+ E + K+ +LF + ++ +
Sbjct: 68 FDHHQLDY-----HG---PLSSAGMILKYLKDEKVIKD-----------KLFQYLNRSLV 108
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
+DAIDNG K V + + S+ + D++ +EAF Q +D
Sbjct: 109 MGVDAIDNG-------KTTTMVGHCSFSAVIANFVPIRHNVDENV--MDEAFFQAVDFTL 159
Query: 203 KEFLDTV-RF-YVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
V +F Y++ R ++ + + D + +M+ + PW F+L+ E
Sbjct: 160 GHLSRLVDKFHYIQE---CREVI------KREMDKNQTVMIFEESMPWMETFFDLKGEK- 209
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
P ++ GKQW+++ + S + R + R PLP +W GL DDEL ++ GIPG VF
Sbjct: 210 -HPAAFLIM---PSGKQWKLRGIPPSYEKRMQVRIPLPKEWAGLIDDELKEKTGIPGAVF 265
Query: 320 VHMSGFIGGNQSYGGALAMARAALK 344
H FI ++ AL L+
Sbjct: 266 CHKGRFISIWETKEDALKALEITLQ 290
>gi|378754522|gb|EHY64553.1| hypothetical protein NERG_02363 [Nematocida sp. 1 ERTm2]
Length = 338
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 168/338 (49%), Gaps = 17/338 (5%)
Query: 15 SSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGG 74
S+ S + + TH+G+FH D+ L CF++R+ + +A+I R+RD + + D V+DVG
Sbjct: 3 SNGSGRNISHIVTHDGAFHLDDVLACFILRVI--YPHAKITRTRDLEKIKTGDIVVDVGA 60
Query: 75 VYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK---ELNVDEGHPDVH 131
++ YDHHQ+GF+E + + LSSAGLVYK+ G E I K +L +D +
Sbjct: 61 EFNEKTLRYDHHQRGFKETYNESNNIILSSAGLVYKYHGLEFIKKLGLDLPIDFNYS--- 117
Query: 132 RLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAEREN 191
L +Y + ++DA DNG++ D K + + S V + T ++++
Sbjct: 118 MLMEILYDTYFVSVDANDNGVDIADEVKYNERSLDNVIRSFVPCDIPEGTSFERASTMRY 177
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLK--RFCPWK 249
+AF+ M+ G++ L + + + EC DPS +V++ +F P +
Sbjct: 178 QAFESAMEYIGQDLLRHCKNMIFQIIKNSRATQECFNSM--KDPSARYLVMEGGKF-PVR 234
Query: 250 LHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD--RFESRKPLPAQWRGLRDDE 307
L+ ++ + V+Y+ + + + + ++ PL +WRG+RD+E
Sbjct: 235 EILYYYNGLLQRN--VSIVIYQVSGRNSYTYKIICIPKKGVKYTPEIPLCEEWRGIRDEE 292
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
L K + G +FVH +GF G + A+ M ++++
Sbjct: 293 LQKYPNMMGAMFVHGTGFCGEANNLKTAIYMVERSIEV 330
>gi|46201072|ref|ZP_00055876.2| COG4286: Uncharacterized conserved protein related to MYG1 family
[Magnetospirillum magnetotacticum MS-1]
Length = 297
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 147/321 (45%), Gaps = 29/321 (9%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ +V THNG+FH D+ ++R + ++VRSRD + LD + V DVGG YDP N
Sbjct: 1 MLKVATHNGTFHADDVFAFAILRAASAG-HIELVRSRDQQALDAAEVVFDVGGTYDPVNR 59
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
YDHH + + + SSAGLV++ FG+ IA L V R+ V
Sbjct: 60 RYDHHMR---DKPLRPNAEPYSSAGLVWRDFGEAAIAHLLPGISPQA-VLRVLEMVDCGL 115
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+ +D +DNG T P + S+ + N ++E + E F Q D+A
Sbjct: 116 VRDVDLMDNGAM---TPTPGHF------STVIEAFNSTFSEIGRD---ETATFMQAADMA 163
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
R R+W ++ A R DP I+VL PW+ +++L + +
Sbjct: 164 SSVLQ---RTCARAWASVQAEATVAEAARNSSDP--RILVLDSRVPWEDAVYDLGLDQTL 218
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
YV+ G W AV F R PLP W GLRD+ ++ G+P F H
Sbjct: 219 -----YVVRP--AGAAWTCSAVPPERGSFAQRHPLPEAWGGLRDEAIAALTGVPDATFCH 271
Query: 322 MSGFIGGNQSYGGALAMARAA 342
+ F+ G +S GA+A+AR A
Sbjct: 272 PARFVCGARSKDGAVALARIA 292
>gi|387593396|gb|EIJ88420.1| hypothetical protein NEQG_01110 [Nematocida parisii ERTm3]
gi|387597053|gb|EIJ94673.1| hypothetical protein NEPG_00196 [Nematocida parisii ERTm1]
Length = 336
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 91/328 (27%), Positives = 163/328 (49%), Gaps = 13/328 (3%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ + TH+G+FH D+ L CFM++ + +A IVR+R+ ++ D V+DVGGV+DP+N
Sbjct: 10 ISTIITHDGAFHLDDVLACFMLKKI--YPHANIVRTRNEDIIKTGDIVVDVGGVFDPANF 67
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDV--HRLFLAVYK 139
YDHHQ+GF + + + K+SSAGLVYK+ G + I K L +D HPD L +Y+
Sbjct: 68 KYDHHQRGFNQTYNDNYDIKMSSAGLVYKYHGMQFI-KALGLDV-HPDFDYLLLLGLLYE 125
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNT--NLSSRVGKLNLDWTEPDQSAER-ENEAFQQ 196
+ ++DA DNG+ D RY T N+ ++ E + ++ EAF++
Sbjct: 126 TYFVSVDANDNGV---DISDDVRYNERTLDNVIRSFVPFDIPEGESIEYGDKVRYEAFEK 182
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
M+ G + + ++ + + +++ + D IM + +L +
Sbjct: 183 AMEYIGSDLVRHCKYLMHQINKDKMPILQSFNQMKDPRSRYIIMGSGAYPAKELIQYYNT 242
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+ +I Y + G +++ + R+ PL +WRGLR++++ + +
Sbjct: 243 SLNRNVSIIIYKI-RSREGTIYKLLCIPKKGIRYTPEIPLCEEWRGLRNEQMRRFPKLKK 301
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
FVH +GF G A MA+ +++
Sbjct: 302 ASFVHATGFCGSAMDLETAEYMAQKSIE 329
>gi|410629226|ref|ZP_11339933.1| hypothetical protein GMES_4436 [Glaciecola mesophila KMM 241]
gi|410151250|dbj|GAC26702.1| hypothetical protein GMES_4436 [Glaciecola mesophila KMM 241]
Length = 290
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 100/322 (31%), Positives = 144/322 (44%), Gaps = 37/322 (11%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K V TH+G+FH D+ + L F ++R+RD +V+ D V+DVGG +D + D
Sbjct: 5 KIVVTHSGNFHADDVFS--VAALKSVFPTFTLIRTRDKEVIAKADVVIDVGGEHDAATDR 62
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ+G +G SS GL+++ +G + D L A+ +
Sbjct: 63 FDHHQRGGAGERDNG--VPYSSFGLIWQKYGLAIC----------DDNQELANAIDAGLV 110
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDAID G +LS + N W E + F + +D A
Sbjct: 111 STIDAIDCG-------HVKGVAEGISLSQTISMFNPTWQEEGDF----DAGFNEAVDFA- 158
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
L L A++IV E I + DP ++VL+++ PWK + L E+
Sbjct: 159 ARILARAIAAASGGLNAKAIVAEAI--KNAQDP--RVIVLEKYTPWKKTVHALSEQAL-- 212
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y++Y G QW +Q V V P FE RK LP W GL D EL E GI +F H
Sbjct: 213 ----YMVYPSQSG-QWMIQTVPVEPGSFEDRKSLPKPWAGLSDAELQAETGIEDAMFCHN 267
Query: 323 SGFIGGNQSYGGALAMARAALK 344
FI G S+ + MA AL+
Sbjct: 268 GLFIAGTTSFESTMKMAALALQ 289
>gi|46446216|ref|YP_007581.1| hypothetical protein pc0582 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46399857|emb|CAF23306.1| conserved hypothetical protein [Candidatus Protochlamydia
amoebophila UWE25]
Length = 290
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 157/323 (48%), Gaps = 41/323 (12%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ GTH+G+FH DE C ++ L D +I+R+RD ++L+ + + DVGG+YDPS
Sbjct: 6 RSCGTHDGTFHADEVTACALLMLFDLIDENKIIRTRDLQILNTCEYICDVGGIYDPSQKI 65
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ ++ +SSAG++ K+ +++ P+ + LF A + +
Sbjct: 66 FDHHQVDYQ--------GPMSSAGMILKYLK--------HLERLKPNEYELFNA---SLV 106
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDA DNG + P Y + +++ S ++ D + E +++AF + +
Sbjct: 107 MGIDAHDNGRDPL----IPGYCSISHIVSNFTPIHYDC-----AHEEQDQAFHKALKFV- 156
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
E LD + + R I+ EC+ + S E ++ + PW FEL+ E
Sbjct: 157 YEHLDRLWERFKYTQSCREIIAECM------NKSKECLMFDQNLPWLEIFFELKGEE--H 208
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVS-PDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
P + ++ G W+++ + S DR + R P P +W GL +++L + +GIPG VF H
Sbjct: 209 PALFVIM---PSGPHWKLRGIPPSYQDRMKVRLPQPKEWAGLLEEDLKRVSGIPGAVFCH 265
Query: 322 MSGFIGGNQSYGGALAMARAALK 344
FI ++ AL LK
Sbjct: 266 KGRFISVWETKEDALKALDYTLK 288
>gi|163752074|ref|ZP_02159281.1| hypothetical protein KT99_10668 [Shewanella benthica KT99]
gi|161328020|gb|EDP99191.1| hypothetical protein KT99_10668 [Shewanella benthica KT99]
Length = 289
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 143/319 (44%), Gaps = 41/319 (12%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G+FH D+ ++ FN +VR+R+ ++ D VLDVGG YDPS +DHH
Sbjct: 8 THDGNFHADDVFSIAALKCVFPAFN--LVRTRNADIITKADVVLDVGGEYDPSLGRFDHH 65
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+G G SS GL++K +G E+ +G+ L +V + ID
Sbjct: 66 QRGGAGEREDGI--PYSSFGLIWKKYGLEIC-------QGN---QELATSVDAGLVSTID 113
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
AID G +LS + N W E + +F + +D A +
Sbjct: 114 AIDCG-------HVTGVSTGISLSHTISMFNPTWQEEGDF----DASFNEAVDFASRVL- 161
Query: 207 DTVRFYVRS--WLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
RF + + A++IV + I + D ++VL+++ PWK + L E
Sbjct: 162 --ARFIASANGGISAKAIVAKAIDDAED----PRVIVLEKYTPWKRTVHALSEAAL---- 211
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
YV+Y G QWRVQ V FE RK LP W GL L + GI +F H
Sbjct: 212 --YVVYPSQSG-QWRVQTVPAELGSFEDRKSLPKAWAGLNGTALQEVTGIDDAMFCHNGL 268
Query: 325 FIGGNQSYGGALAMARAAL 343
FI G +S+ + MA AL
Sbjct: 269 FIAGAESFASTMKMASIAL 287
>gi|255608166|ref|XP_002538853.1| Protein MYG1, putative [Ricinus communis]
gi|223510110|gb|EEF23530.1| Protein MYG1, putative [Ricinus communis]
Length = 311
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/330 (29%), Positives = 159/330 (48%), Gaps = 36/330 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH D+A ++ L + +A++VR+RDP ++ D +DVGG ++P+ +DHH
Sbjct: 5 THSGKFHADDAWAVAVLMLL--YPDAELVRTRDPAIIARADVAIDVGGEWNPAAGRFDHH 62
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHP----DVHRLFLAVYKNFM 142
QKGF+ G +SAGLV++ +G +A GH ++ A+ + +
Sbjct: 63 QKGFDGARLSG--VPYASAGLVWREYGARCVALLAERHTGHKLSEEGAQQMAYAIDADVV 120
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTE-------PDQSAERENEAFQ 195
+ +D D G + + P Y LS+ V N +W + P A R +E F+
Sbjct: 121 QYLDLSDVGAAR---NAPGSY----GLSAIVSGFNPNWLDEQRLGYGPAAEAYR-DEQFR 172
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC-PWKLHLFE 254
+ + L ++ V++ V + L + + E G ++ LK PW
Sbjct: 173 RVLGLLTDLMINAVKYRVGAVLAVEQVRHAEVLE------GGRVLYLKNAALPWTA--IV 224
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
+E K+ +I Y L E ++ + V VS + F++R LPA W GLRD +L+ GI
Sbjct: 225 RKEMPKVLFVISYSLAE----QRHMIHTVPVSAESFDARADLPAAWAGLRDADLAAVTGI 280
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALK 344
P F H FI +S+ G L MAR AL+
Sbjct: 281 PDAGFCHNGRFIAAAKSFEGILEMARQALR 310
>gi|359455470|ref|ZP_09244689.1| hypothetical protein P20495_3460 [Pseudoalteromonas sp. BSi20495]
gi|358047467|dbj|GAA80938.1| hypothetical protein P20495_3460 [Pseudoalteromonas sp. BSi20495]
Length = 289
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 144/321 (44%), Gaps = 41/321 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V THNG+FH D+ +++ F ++VR+RD ++++ D V+DVGG YDP + +D
Sbjct: 6 VVTHNGNFHADDVFSIAVLKHVLPAF--KLVRTRDKALIENADFVIDVGGEYDPETNRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GL++K +G L D+ R V +
Sbjct: 64 HHQRGGAGERENGI--PFSSFGLIWKKYGLAL------CDDNQAVADR----VDSGLVST 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W E + F + ++ A +
Sbjct: 112 IDAIDCG-------HVEGVSKGISLSQTISMFNPTWEEESNF----DTCFDEAVEFAARM 160
Query: 205 FLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+RF + + A+ IV + I D ++VL+++ PWK + L +
Sbjct: 161 L---IRFIASAHGSVNAKEIVAKAIENAED----ARVIVLEKYTPWKKTVHILSSDAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y++Y G QW +Q V V P FE RK LP W GL D E GI VF H
Sbjct: 212 ----YMVYPSHSG-QWILQTVPVEPGSFEDRKSLPKAWSGLSDQAFVDETGIDDAVFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAAL 343
FI G +S+ + +A AL
Sbjct: 267 GLFIAGTKSFESTMKLATMAL 287
>gi|150388301|ref|YP_001318350.1| metal-dependent protein hydrolase [Alkaliphilus metalliredigens
QYMF]
gi|149948163|gb|ABR46691.1| metal-dependent protein hydrolase [Alkaliphilus metalliredigens
QYMF]
Length = 297
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 165/329 (50%), Gaps = 43/329 (13%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
+ P KRVGTH+G FH DE + ++ + F ++ R+RDPK+L LD V DVGG
Sbjct: 2 EKPYKRVGTHHGRFHADEVMATAILM---ELFEIEVTRTRDPKILSKLDIVYDVGG---- 54
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIA-KELNVDEGHPDVHRLFLAV 137
+DHH G E+V+ ++ GL++ FG+++I+ KE ++ E ++ +F +V
Sbjct: 55 --GVFDHH--GIEKVYRDD-GIPFAACGLIWNEFGRKVISMKESSLVES--EIELVFESV 107
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
+ M+ IDAIDNG+ + + V+ ++SS V N W + E E F +
Sbjct: 108 DRALMKGIDAIDNGVRIGE-----QIVDLMDISSIVSMFNPPW----DLEKSEKECFDRA 158
Query: 198 MDLAGKEFLDTVRF---YVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
+ +A +T+ +R+ +P +++ Y + +I+VL++ CPW+ L E
Sbjct: 159 VAVASSVLNNTIDHKLAVLRTRIP--------VSKAYKRRENPKILVLEKSCPWQKVLSE 210
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
++E + + +V+Y D + +Q V + E +K LP W G + EL++ G+
Sbjct: 211 IDERNE----VLFVVYPDK--DNYAIQ--TVRGEDGEDKKYLPKSWVGKEEKELAEVTGV 262
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
VF H FI + + MA A+
Sbjct: 263 ADAVFCHTGRFIAVARRLESIVKMAELAI 291
>gi|326203981|ref|ZP_08193842.1| metal-dependent protein hydrolase [Clostridium papyrosolvens DSM
2782]
gi|325985748|gb|EGD46583.1| metal-dependent protein hydrolase [Clostridium papyrosolvens DSM
2782]
Length = 331
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 157/323 (48%), Gaps = 35/323 (10%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
K+VGTH+G FH DE + +++ + F ++ R+RDP++L+ D + D+G N
Sbjct: 7 FKKVGTHSGRFHADEVMATAILK---QVFEIELTRTRDPEILEKQDLIYDIG------NG 57
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE-LIAKELNVDEGHPDVHRLFLAVYKN 140
+DHHQ E+ + T ++ GL+++ FG++ ++ K V E ++ +F V
Sbjct: 58 EFDHHQ--LEKEYRDN-GTPYAACGLIWRQFGRQAILTKHSEVSENEVEI--IFRYVDAV 112
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+E IDA+DNGI + P +SS +G N W P E + AF +
Sbjct: 113 LIEGIDAVDNGIRTTENIIP-----TMCISSIIGGYNPTWDSP----ESVDAAFNDAVGF 163
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
A ++ L + S L AR+ V+E Y+ E+++L PW+ L +E+
Sbjct: 164 A-EDILKNLIDQKVSTLKARTFVIEA----YNNRKRPELLILDNSYPWERTL----KEID 214
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
I + +V+Y + G + +Q V + RK LP +W G R++EL GI +F
Sbjct: 215 INKDVLFVIYPKEEG--FYIQTVREYGEVRRDRKRLPEEWAGKREEELGSIIGIKDAIFC 272
Query: 321 HMSGFIGGNQSYGGALAMARAAL 343
H S FI S+ L MA A+
Sbjct: 273 HSSRFIAKASSFESILKMADIAI 295
>gi|376260656|ref|YP_005147376.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944650|gb|AEY65571.1| hypothetical protein Clo1100_1330 [Clostridium sp. BNL1100]
Length = 331
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 159/323 (49%), Gaps = 35/323 (10%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
K+VGTH+G FH DE + +++ + F ++ R+RDP++L+ D + D+G N
Sbjct: 7 FKKVGTHSGKFHADEVMATAILK---QIFEIELTRTRDPEILEKQDLIYDIG------NG 57
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE-LIAKELNVDEGHPDVHRLFLAVYKN 140
+DHHQ E+ + T ++ GL+++ FG++ +++K V E ++ +F V
Sbjct: 58 EFDHHQ--LEKEYRDN-GTPYAACGLIWRKFGRQGILSKHPEVSENEVEI--IFRYVDAV 112
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+E IDA DNGI + P +SS +G N W P E + AF +
Sbjct: 113 LIEGIDAADNGIRTTENIIP-----TMCISSIIGGYNPTWDSP----ESVDAAFNDAVGF 163
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
A ++ L+ + S L A++ V+E Y+ E+++L PW+ L +E+
Sbjct: 164 A-EDILENLIDQKVSTLKAKTFVIEA----YNNRTRPELLILDNSYPWERTL----KEID 214
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
I + +V+Y + G + +Q V + RK LP +W G R++EL + GI VF
Sbjct: 215 INKEVLFVIYPKEEG--FYIQTVREYGEVRRDRKSLPEEWAGKREEELGRIIGIKDAVFC 272
Query: 321 HMSGFIGGNQSYGGALAMARAAL 343
H S FI S+ L MA A+
Sbjct: 273 HTSRFIAKAGSFDSILRMADIAI 295
>gi|119472546|ref|ZP_01614594.1| hypothetical protein ATW7_06128 [Alteromonadales bacterium TW-7]
gi|392537170|ref|ZP_10284307.1| hypothetical protein Pmarm_03485 [Pseudoalteromonas marina mano4]
gi|119444870|gb|EAW26170.1| hypothetical protein ATW7_06128 [Alteromonadales bacterium TW-7]
Length = 289
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 49/325 (15%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V THNG+FH D+ +++ FN +VR+RD ++D D V+DVGG YDP +D
Sbjct: 6 VVTHNGNFHADDVFSIAVLKHVYPSFN--LVRTRDKALMDSADIVIDVGGQYDPDAGRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GLV++ +G L V + V +
Sbjct: 64 HHQRGGAGERENGI--PYSSFGLVWQKYGLALCDNNQAVAD----------RVDAGLVST 111
Query: 145 IDAID----NGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
IDAID G+ Q +LS + N W E + ++ F + ++
Sbjct: 112 IDAIDCGHVEGVQQ-----------GISLSQTISMYNPTWEESPEF----DKCFDEAVEF 156
Query: 201 AGKEFLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
A + RF + + A++IV + I D ++VL+++ PWK + L E
Sbjct: 157 ASRML---TRFIASASGSVNAKAIVAKAIESAED----PRVIVLEKYTPWKKTVHILSSE 209
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
Y++Y G QW +Q V V P FE RK LP W GL D++ + G+ V
Sbjct: 210 AL------YMVYPSHSG-QWILQTVPVEPGSFEDRKSLPKAWAGLSDEDFQAKTGLDDAV 262
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
F H FI G +S+ + +A AL
Sbjct: 263 FCHNGLFIAGTKSFESTMKLAEVAL 287
>gi|119946055|ref|YP_943735.1| metal-dependent protein hydrolase [Psychromonas ingrahamii 37]
gi|119864659|gb|ABM04136.1| metal-dependent protein hydrolase [Psychromonas ingrahamii 37]
Length = 290
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 97/322 (30%), Positives = 144/322 (44%), Gaps = 41/322 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNG+FH D+ + L F +++R+RD ++ D VLDVGG YDP +D
Sbjct: 6 IATHNGNFHADDVFS--IAALKSIFPTFKLIRTRDLALIGQADIVLDVGGEYDPDAGRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GL++K +G E+ +G+ DV A+ +
Sbjct: 64 HHQRGGAGERENGI--PYSSFGLIWKKYGLEIC-------QGNQDVAN---ALDAGLVSN 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W E + F +D A +
Sbjct: 112 IDAIDCG-------HVEGVSKGISLSQTISMFNPTWQEESHF----DACFDDAVDFAVRV 160
Query: 205 FLDTVRFYVRS--WLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
RF + + A+ IV I D ++VL+++ PWK + L +E
Sbjct: 161 L---ARFIASANGGISAKVIVAGAIENALD----PRVIVLEKYTPWKRTVHALSKEAL-- 211
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
Y+++ G +WR+Q V V P FE RK LP +W GL L GI +F H
Sbjct: 212 ----YMVFPSQTG-EWRIQTVPVEPGSFEDRKSLPKEWAGLSGKALVDVTGIDDAMFCHN 266
Query: 323 SGFIGGNQSYGGALAMARAALK 344
FI G +S+ + MA AL+
Sbjct: 267 GLFIAGAESFSSVMKMAAMALQ 288
>gi|220927874|ref|YP_002504783.1| metal-dependent protein hydrolase [Clostridium cellulolyticum H10]
gi|219998202|gb|ACL74803.1| metal-dependent protein hydrolase [Clostridium cellulolyticum H10]
Length = 331
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 95/325 (29%), Positives = 159/325 (48%), Gaps = 35/325 (10%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
K+VGTH+G FH DE + +++ + F ++ R+RDP++L+ D + D+G N
Sbjct: 7 FKKVGTHSGRFHADEVMATAILK---QVFEIKLTRTRDPEILEKQDLIYDIG------NG 57
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE-LIAKELNVDEGHPDVHRLFLAVYKN 140
+DHHQ E+ + T ++ GL+++ FG++ +++K V E ++ +F V
Sbjct: 58 EFDHHQ--LEKEYRDN-GTPYAACGLIWRKFGRQAILSKHPEVSEN--EIESIFRYVDAV 112
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+E IDA DNGI + P +S+ +G N W P E + AF +
Sbjct: 113 LIEGIDAADNGIRTTENIIP-----TMCISAIIGGYNPTWDSP----ESVDAAFSDAVGF 163
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
A ++ L+ + S L AR+ V++ Y+ E+++L PW+ L +E+
Sbjct: 164 A-EDILENLIDQKVSTLKARTFVIQA----YNNRKRPELLILDNSYPWERTL----KEID 214
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
I + +V+Y + G + +Q V + RK LP QW G R++EL GI VF
Sbjct: 215 INKEVLFVIYPKEEG--FYIQTVREYGEVRRDRKSLPRQWAGKREEELCNIIGIKDAVFC 272
Query: 321 HMSGFIGGNQSYGGALAMARAALKL 345
H S FI S+ L MA A+ +
Sbjct: 273 HSSRFIAKAGSFESILKMADIAISM 297
>gi|445494664|ref|ZP_21461708.1| putative metal-dependent hydrolase [Janthinobacterium sp. HH01]
gi|444790825|gb|ELX12372.1| putative metal-dependent hydrolase [Janthinobacterium sp. HH01]
Length = 318
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 94/331 (28%), Positives = 158/331 (47%), Gaps = 34/331 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+G FH D+A +++ F +A I+R+RDP ++ D +DVGG++DP++ +D
Sbjct: 3 IATHSGKFHADDAWAVAALKVL--FPDADILRTRDPAAIEAADFAVDVGGIWDPASGRFD 60
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH--PD--VHRLFLAVYKN 140
HHQKGF+ G +SAGLV++ +G +A GH PD + + +
Sbjct: 61 HHQKGFDGARQSG--VPYASAGLVWREYGARCVAALALAHGGHRLPDDKAREIAYGIDAD 118
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE------AF 194
++ +D D G + P Y LS+ + N +W + + E F
Sbjct: 119 VVQYLDLSDVGAAK---SAPGGY----GLSAVISGYNTNWLDEQRLGYGEAADAYRMAQF 171
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC-PWKLHLF 253
++ M++ L+ VR+ + + + + E G+++ LK PW
Sbjct: 172 RRAMEVLTDVMLNAVRYRLGALFALEQVRQGELLE------GGQVLFLKNGALPWGQ--- 222
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
+ +EM P + +V+ + ++ + V+VS + FE+R LP W GLRD EL+ G
Sbjct: 223 VVRKEM---PKVLFVISHNLAEQRHMLHTVSVSAESFEARADLPEAWAGLRDAELAAVTG 279
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+P F H FI +SY G AMA AL+
Sbjct: 280 VPDAGFCHNGRFIASAKSYEGIRAMAALALQ 310
>gi|114328773|ref|YP_745930.1| MYG1 protein [Granulibacter bethesdensis CGDNIH1]
gi|114316947|gb|ABI63007.1| MYG1 protein [Granulibacter bethesdensis CGDNIH1]
Length = 316
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 159/331 (48%), Gaps = 31/331 (9%)
Query: 20 TPLKRVGTHNGSFHCDEALGCFMIR----LTDKFFNAQIVRSRDPKVLDDLDAVLDVGGV 75
TPL + TH+G FHCDE ++R L+ + ++R+R P++++ D V DVG +
Sbjct: 6 TPL--LITHSGKFHCDEVFAYAVLRFALGLSRSGEDHVLLRTRKPELIETGDIVFDVGLI 63
Query: 76 YDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
DPSN+ +DHHQ G T SSAGLV++ +G+ +A L + +
Sbjct: 64 SDPSNNRFDHHQIGAPT---REDGTPFSSAGLVWQIYGERAVASLL-APQDAAFAPAIAT 119
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE-NEAF 194
A+ ++ ID IDNG++ P N+ +L++ VG N W PD + ++AF
Sbjct: 120 ALDGKLVKRIDEIDNGVSASG----PVVRNSLDLAALVGDFNPPWDSPDANGPTAGDDAF 175
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF- 253
Q + V RS L A ++V+ A D ++VL+ PWK +F
Sbjct: 176 QHATAMVAGVLARQVDIQ-RSKLQAEALVLAAHAAADDK----RLLVLETGMPWKNVVFS 230
Query: 254 -ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
+L + + P W V V P F R PLP W GL+ +L+ +
Sbjct: 231 HDLPVLLAVSPA---------SNGNWMVDTVPPEPGSFAQRLPLPESWAGLQGADLAAVS 281
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
G+ VFVH+ F+GG ++ GA+A+A AL
Sbjct: 282 GVADAVFVHVRRFVGGAKTREGAIALAHKAL 312
>gi|359451778|ref|ZP_09241167.1| hypothetical protein P20480_3915 [Pseudoalteromonas sp. BSi20480]
gi|358042404|dbj|GAA77416.1| hypothetical protein P20480_3915 [Pseudoalteromonas sp. BSi20480]
Length = 289
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 96/325 (29%), Positives = 146/325 (44%), Gaps = 49/325 (15%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
V THNG+FH D+ +++ FN +VR+RD ++D D V+DVGG YDP +D
Sbjct: 6 VVTHNGNFHADDVFSIAVLKHVYPSFN--LVRTRDKALMDSADIVIDVGGQYDPDAGRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GLV++ +G L V + V +
Sbjct: 64 HHQRGGAGERENGI--PYSSFGLVWQKYGLALCDNNQAVAD----------RVDAGLVST 111
Query: 145 IDAID----NGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
IDAID G+ Q +LS + N W E + ++ F + ++
Sbjct: 112 IDAIDCGHVEGVQQ-----------GISLSQTISMYNPTWEESPEF----DKCFDEAVEF 156
Query: 201 AGKEFLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
A + RF + + A++IV + I D ++VL+++ PWK + L E
Sbjct: 157 ASRML---TRFIASASGSVNAKAIVAKAIESAED----PRVIVLEKYTPWKKTVHILSSE 209
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
Y++Y QW +Q V V P FE RK LP W GL D++ + G+ V
Sbjct: 210 AL------YMVYPS-HSSQWILQTVPVEPGSFEDRKSLPKAWAGLSDEDFQAQTGLDDAV 262
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
F H FI G +S+ + +A AL
Sbjct: 263 FCHNGLFIAGTKSFESTMKLAEVAL 287
>gi|119617095|gb|EAW96689.1| chromosome 12 open reading frame 10, isoform CRA_c [Homo sapiens]
Length = 132
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 64/135 (47%), Positives = 88/135 (65%), Gaps = 5/135 (3%)
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+Y+NF+E +DA+DNGI+Q+ + PRY T LS+RV +LN W PDQ E F++
Sbjct: 1 MYENFVEEVDAVDNGISQW-AEGEPRYALTTTLSARVARLNPTWNHPDQDTEA---GFKR 56
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVL-KRFCPWKLHLFEL 255
MDL +EFL + FY SWLPAR++V E +A+R+ DPSGEI+ L K CPWK HL+ L
Sbjct: 57 AMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGACPWKEHLYHL 116
Query: 256 EEEMKIEPLIKYVLY 270
E + I +V+Y
Sbjct: 117 ESGLSPPVAIFFVIY 131
>gi|409400957|ref|ZP_11250887.1| MYG1 protein [Acidocella sp. MX-AZ02]
gi|409130160|gb|EKM99950.1| MYG1 protein [Acidocella sp. MX-AZ02]
Length = 322
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 94/323 (29%), Positives = 147/323 (45%), Gaps = 25/323 (7%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQ----IVRSRDPKVLDDLDAVLDVGGVYD 77
+ R+ TH+G FHCDE G ++RL ++R+R P++++ D V DVG VYD
Sbjct: 1 MPRLVTHSGKFHCDEVFGYAVLRLALGLSRPGEDHVLLRTRKPELIESGDIVFDVGSVYD 60
Query: 78 PSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
P +DHHQ G + T S+AGL+++ +G +A L D + +
Sbjct: 61 PQAQRFDHHQIGAPQ---REDGTPYSAAGLLWQVYGARAVAALLP-DAAQGFAAAIAEEL 116
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW-TEPDQSAERENEAFQQ 196
+ ++ ID IDNG++ P ++ L++ +G N W ++ A + AF +
Sbjct: 117 NEGLVKRIDEIDNGVSMSG----PVLRDSLGLAALIGDYNPSWDSDEANGATAGDAAFLR 172
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
D A V RS + A + V+ Y P ++VL+ PWK F E
Sbjct: 173 AADFAAGVLALKVEGK-RSRMEADARVLAA----YQAGPEPRLLVLETGMPWKSAAFAHE 227
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
I + + G W + + P F R P P W GL+ + L+ G+
Sbjct: 228 LP------ILFCISPASNGN-WMIDTMPPEPGSFAQRLPFPEAWAGLQGEALAAVCGVED 280
Query: 317 CVFVHMSGFIGGNQSYGGALAMA 339
VFVH+ F+G +S GALAMA
Sbjct: 281 AVFVHVRRFVGAAKSREGALAMA 303
>gi|397635213|gb|EJK71768.1| hypothetical protein THAOC_06762, partial [Thalassiosira oceanica]
Length = 164
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/168 (41%), Positives = 100/168 (59%), Gaps = 21/168 (12%)
Query: 6 VSSSPAYSTSSPSQTP-LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD 64
+++SP Y + S P K +GTH+G+F DEALG +++R ++ N+ +VRSRDP L
Sbjct: 1 MAASPMYKRAKLSIPPSTKTIGTHSGTFQADEALGVWILRQLPEYRNSAVVRSRDPDTLV 60
Query: 65 DLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFS---------TKLSSAGLVYKHFGKE 115
D V+DVGGVYD + YDHHQ+G++E F TKLS++GLVY+H+GKE
Sbjct: 61 KCDIVIDVGGVYDHATLRYDHHQRGYDERFAKKAKPDGTEVERCTKLSASGLVYRHYGKE 120
Query: 116 LIAKELNVDEGHPD-----VHRLFLAVYKNFMEAIDAIDNGINQYDTD 158
LI+ +P+ V + +Y FMEAIDAID G+ +D
Sbjct: 121 LISTY------YPNLSSELVELAYTKMYNEFMEAIDAIDTGVEPIPSD 162
>gi|389695471|ref|ZP_10183113.1| hypothetical protein MicloDRAFT_00052870 [Microvirga sp. WSM3557]
gi|388584277|gb|EIM24572.1| hypothetical protein MicloDRAFT_00052870 [Microvirga sp. WSM3557]
Length = 302
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 155/321 (48%), Gaps = 31/321 (9%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
+V TH+G+FH D+ ++R F + R+RD +++ D V DVGG YD + Y
Sbjct: 8 KVVTHSGTFHADDVFAFSILREALGPF--EFARTRDSALIESADLVFDVGGTYDVARGRY 65
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELN-VDEGHPDVHRLFLAVYKNFM 142
DHH + T SS GL+++ FG+ + + +DE D ++ + F+
Sbjct: 66 DHHMRDLPR---RPDGTPYSSVGLIWRDFGRNALPNFIQGIDEDLLDA--IWQDIDTGFI 120
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
AID DNG+ ++ +LS + N W DQS ++AF + D A
Sbjct: 121 LAIDQADNGVAS---------ISQGHLSLLIEAFNPTWAS-DQS---YDDAFLEAADFAR 167
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ R + A+S+V+ A R DP ++VL R PW+ +FE +
Sbjct: 168 DILVRACR-QAHAEAQAQSLVLA--AARKARDP--RVIVLNRKLPWEKAVFE----GGLR 218
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
L+ +++Y ++ W + + P+ F R LP WRGL+++E S+ AGI VF H
Sbjct: 219 DLL-FIIYPNEDATAWYCRTIPPEPNSFGQRLSLPEAWRGLQEEEFSRVAGIDDGVFCHP 277
Query: 323 SGFIGGNQSYGGALAMARAAL 343
SGFI G +S A+ +A A+
Sbjct: 278 SGFICGARSQESAVRLAEKAI 298
>gi|226182915|dbj|BAH31019.1| hypothetical protein RER_03110 [Rhodococcus erythropolis PR4]
Length = 289
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 99/316 (31%), Positives = 143/316 (45%), Gaps = 46/316 (14%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNG FH D+ G +++ + +A +VRSRD VLD D VLDVGG Y+P+ +D
Sbjct: 3 IATHNGKFHADDVFGVSLLK--QLYPDATVVRSRDESVLDSADIVLDVGGRYNPAEGRFD 60
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ E S G++Y FG L EG V R + +E
Sbjct: 61 HHQRDAGE----------RSNGILYSAFGLLWQEYGLQFCEGDASVFR---RIDSRLVEG 107
Query: 145 IDAIDNG-----INQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE-NEAFQQGM 198
IDA+DNG +N Y T KP +LSS LD P S + E + F+ +
Sbjct: 108 IDAVDNGQEIYTLNDYGT-KP------FDLSSV-----LDLFNPISSTDEEFDTQFELAV 155
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
LA + V +R+ + E Y+ +VL+RF P
Sbjct: 156 VLATQ-----VLIRLRAKYAGDAAAEREFTETYEKATDPRFVVLERFVPHG-------RA 203
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+P + + ++ + G W +Q V + +F SRK LP WRGL +L+ E G+ V
Sbjct: 204 ASAQPELLFTIFPNTNGG-WSIQTVKPADSKFGSRKLLPEPWRGLNGSDLAAETGVDSSV 262
Query: 319 FVHMSGFIGGNQSYGG 334
F H +GFI Q+ G
Sbjct: 263 FCHKAGFIAAAQTRDG 278
>gi|427401566|ref|ZP_18892638.1| hypothetical protein HMPREF9710_02234 [Massilia timonae CCUG 45783]
gi|425719675|gb|EKU82607.1| hypothetical protein HMPREF9710_02234 [Massilia timonae CCUG 45783]
Length = 316
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 158/330 (47%), Gaps = 34/330 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH G FH D+A ++++ F A +VR+R+ +D D +DVGGV+DP+ +D
Sbjct: 3 IATHGGKFHADDAWAVAVLKVL--FPEADVVRTREQARIDAADFAIDVGGVWDPATGRFD 60
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK--ELNVDEGHPD--VHRLFLAVYKN 140
HHQK F+ G +SAGLV++ +G +A E + E D ++ A+ +
Sbjct: 61 HHQKEFDGARASG--VPYASAGLVWREYGARCVAALAERHTGERLADDTAQQIAYAIDAD 118
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAEREN-EAFQQGMD 199
++ +D D G+ + + P Y LS+ V N W + + E E ++ G
Sbjct: 119 IVQYLDLSDVGVAK---NAPGSY----GLSAVVSGFNPGWLDEQRLGYGEAVEVYRMGQF 171
Query: 200 LAGKEFL-----DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC-PWKLHLF 253
+ EFL + VR+ V + L + + E ++ LK PW
Sbjct: 172 MRAVEFLTDIMGNAVRYRVGAMLAVTQVRQAEVLE------DCRLLFLKNAALPWS---S 222
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
+ +EM P + +V+ +++ + V+V + F++R LP W GLR+ EL+ G
Sbjct: 223 VVRKEM---PKVLFVISHSLTEQRYMLHTVSVDTESFDARADLPEAWAGLREAELAAVTG 279
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+ VF H FI ++Y GAL+MAR AL
Sbjct: 280 VEDAVFCHTGRFIAAARTYAGALSMARQAL 309
>gi|269861342|ref|XP_002650382.1| MYG1 protein [Enterocytozoon bieneusi H348]
gi|220066185|gb|EED43680.1| MYG1 protein [Enterocytozoon bieneusi H348]
Length = 321
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 153/321 (47%), Gaps = 19/321 (5%)
Query: 27 THNGSFHCDEALG-CFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDH 85
TH+GSFH DE L F++ L D + ++R+RDPK++ V DVG +DP+N +DH
Sbjct: 16 THSGSFHYDELLATAFLMELFD---DVILLRTRDPKIIKTGTIVYDVGFEFDPANKRFDH 72
Query: 86 HQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAI 145
H K F EVF ++ KLSSAGL+YK++ + + K + + +YK A
Sbjct: 73 HMKWFSEVFSEDYNVKLSSAGLIYKYYHEAVFKKYGLHSNDILIFNYIKNKMYKELFLAT 132
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP-DQSAERENEAFQQGMDLAGKE 204
DAIDNGI + KP + + N + + D+ ++ F + ++ +
Sbjct: 133 DAIDNGIEITYSIKP------RTIQDIIKLFNTSYIDDMDEYNVAQDSQFHKALEFVKMD 186
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
+ + ++LP ++ + D D I++ + +E E
Sbjct: 187 LKNYLNNLFNNFLPGFRKAIDLLQNNKDPDI---IVITDNYISISA---IVEAERFTSRD 240
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+KY++++ + ++R+ + F+ + PL +WRG +EL + IPG FVH +G
Sbjct: 241 LKYMIFK--KNNEYRIYCFNIPEHDFQPKVPLKQEWRGKSQEELKTISQIPGIRFVHATG 298
Query: 325 FIGGNQSYGGALAMARAALKL 345
F G + A+ M +LK+
Sbjct: 299 FTGSVDTLDSAIMMCHESLKV 319
>gi|229492642|ref|ZP_04386445.1| metal-dependent protein hydrolase [Rhodococcus erythropolis SK121]
gi|453070270|ref|ZP_21973522.1| hypothetical protein G418_16516 [Rhodococcus qingshengii BKS 20-40]
gi|229320628|gb|EEN86446.1| metal-dependent protein hydrolase [Rhodococcus erythropolis SK121]
gi|452761916|gb|EME20215.1| hypothetical protein G418_16516 [Rhodococcus qingshengii BKS 20-40]
Length = 289
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 98/316 (31%), Positives = 143/316 (45%), Gaps = 46/316 (14%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNG FH D+ G +++ + +A +VRSRD VLD D VLDVGG Y+P+ +D
Sbjct: 3 IATHNGKFHADDVFGVSLLK--QLYPDATVVRSRDESVLDSADIVLDVGGRYNPAEGRFD 60
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ E S G++Y FG L EG V R + +E
Sbjct: 61 HHQRDAGE----------RSNGILYSAFGLLWQEYGLQFCEGDASVFR---RIDSRLVEG 107
Query: 145 IDAIDNG-----INQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERE-NEAFQQGM 198
IDA+DNG +N Y T KP +LSS LD P S + E + F+ +
Sbjct: 108 IDAVDNGQEIYTLNDYGT-KP------FDLSSV-----LDLFNPISSTDEEFDTQFELAV 155
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
LA + V +R+ + E Y+ +VL+RF P
Sbjct: 156 VLATQ-----VLIRLRAKYAGDAAAEREFTETYEKATDPRFVVLERFVPHG-------RA 203
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+P + + ++ + G W +Q V + +F SRK LP WRGL +L+ + G+ V
Sbjct: 204 ASAQPELLFTIFPNTNGG-WSIQTVKPADSKFGSRKLLPEPWRGLNGSDLAAQTGVDSSV 262
Query: 319 FVHMSGFIGGNQSYGG 334
F H +GFI Q+ G
Sbjct: 263 FCHKAGFIAAAQTRDG 278
>gi|149197836|ref|ZP_01874885.1| putative Metal-dependent protein hydrolase [Lentisphaera araneosa
HTCC2155]
gi|149139057|gb|EDM27461.1| putative Metal-dependent protein hydrolase [Lentisphaera araneosa
HTCC2155]
Length = 306
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 159/331 (48%), Gaps = 33/331 (9%)
Query: 14 TSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVG 73
++ PS +GTHNG FH D+ L + LT + +I+RSRD ++L D ++DVG
Sbjct: 2 STDPSPEQEVTIGTHNGFFHADDCLA--VAALTMIYPKHKIIRSRDKQILSTCDFLVDVG 59
Query: 74 GVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIA-KELNVDEGHPDVHR 132
G+YD ++ +DHH F + +SS GLV++ FG+++ +E+ +
Sbjct: 60 GIYDEESNRFDHH---FSNGPAYNDGLLMSSFGLVWQKFGEQICGLREIKEN-------- 108
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
+ + + +DA DNG+ + + VN +LS+ + +N DQ+ + E
Sbjct: 109 ----IQSSLVRPVDAADNGVAIHCRQRGAPEVNMLSLSAVLAVMNPG--SIDQADDVFLE 162
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL 252
L G+ F+D R + S R +V A Y + M L W+ L
Sbjct: 163 QVTWCRRLIGR-FIDNSRQRIES----REMVRHAFA--YAEKKNTNFMELPGSMKWEEAL 215
Query: 253 FELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEA 312
+ L+ KI +V++ + QW ++ V+ +P + RK LPA+W GLRDDE S+
Sbjct: 216 YSLDRTHKI----YFVVFPHN--NQWYLRCVSRTPHSYTPRKRLPAEWAGLRDDEFSRVL 269
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
GI VF H + F+ +S L +A AL
Sbjct: 270 GISDGVFCHHAAFVCAAKSRQSILKVAEMAL 300
>gi|149911864|ref|ZP_01900465.1| hypothetical protein PE36_08276 [Moritella sp. PE36]
gi|149805069|gb|EDM65094.1| hypothetical protein PE36_08276 [Moritella sp. PE36]
Length = 289
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/325 (29%), Positives = 144/325 (44%), Gaps = 43/325 (13%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNA-QIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
K + THNG FH D+ F I F A ++VR+RD ++ + D V+DVGG YDP
Sbjct: 4 KTIATHNGKFHADD---VFSIAALKNIFPAFKLVRTRDLDIIGEADIVIDVGGEYDPETG 60
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHHQ+G +G SS GL+++ +G E+ +G+ ++ AV
Sbjct: 61 RFDHHQRGGAGERENGI--PYSSFGLIWQKYGLEIC-------QGNQEIAD---AVDAGL 108
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+ IDA+D G +LS + N W E + F + + A
Sbjct: 109 VSTIDAVDCG-------HVEGVAQGISLSQTISMFNPTWEEDSDL----DACFDEAVAFA 157
Query: 202 GKEFLDTVRFYVRSW--LPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
+ RF + + A+ IV + I D ++VL+++ PWK + L E
Sbjct: 158 SRIL---TRFIASATGGINAKDIVAKAI----DNAEDPRVIVLEQYTPWKTTVLNLAAEA 210
Query: 260 KIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
+++Y GK W +QAV V FE RK LP W GL + E G+ +F
Sbjct: 211 L------FMVYPSQSGK-WIIQAVPVELGSFEDRKSLPKPWAGLSEQAFKDETGLDDAMF 263
Query: 320 VHMSGFIGGNQSYGGALAMARAALK 344
H FI G S+ + MA AL+
Sbjct: 264 CHNGLFIAGAASFESTMKMAAMALQ 288
>gi|78485201|ref|YP_391126.1| metal-dependent protein hydrolase [Thiomicrospira crunogena XCL-2]
gi|78363487|gb|ABB41452.1| MYG1 family protein [Thiomicrospira crunogena XCL-2]
Length = 280
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/319 (32%), Positives = 158/319 (49%), Gaps = 47/319 (14%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH DE MI++ ++ IVRSRD V+D + VLDVGG YDP +DHH
Sbjct: 4 THSGRFHADEVFAIAMIQMIEE---VDIVRSRDQDVIDQAEMVLDVGGEYDPERLRFDHH 60
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFG-KELIAKELNVDEGHPDVHRLFLAVYKNFMEAI 145
Q F G T ++AGLV++HFG K L AK L EG + V K + I
Sbjct: 61 QNSFTRAREDG--TPYATAGLVWEHFGAKILAAKGL---EGEYETQFALEWVDKKIIRDI 115
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN---LDWTEPDQSAERENEAFQQGMDLAG 202
DA+DNG+ D PR ++S +G +N D E ++A ++ AF G+
Sbjct: 116 DAVDNGMFTED----PR----PSVSMLIGMMNASSTDELEQQETAFKDAIAFTSGI---- 163
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ ++++ + ++VE A + D I+VL P+K + ++
Sbjct: 164 ------LNNFIQAAIKEAEVIVELEACAKNVDEG--ILVLAENLPFK-------DFIRSH 208
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
P I V+Y +G + V + E+ LP ++RGLR++EL G+ VF H
Sbjct: 209 PEITRVVY--PKGS----EGYGVFCNGKENH--LPERFRGLREEELKAVTGLEDAVFCHK 260
Query: 323 SGFIGGNQSYGGALAMARA 341
SGF+ +S+ ALAMA++
Sbjct: 261 SGFMSVCRSFESALAMAKS 279
>gi|77361947|ref|YP_341521.1| metal-dependent protein hydrolase [Pseudoalteromonas haloplanktis
TAC125]
gi|76876858|emb|CAI89075.1| putative Metal-dependent protein hydrolase [Pseudoalteromonas
haloplanktis TAC125]
Length = 289
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 138/308 (44%), Gaps = 37/308 (12%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ THNG+FH D+ ++ F +++R+RD +++ D V+DVGG ++P +D
Sbjct: 6 IATHNGTFHADDVFSVAALKYIYPSF--KLIRTRDMELISKADVVIDVGGEHNPETGRFD 63
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G +G SS GL+++ +G E+ +G + A+ +
Sbjct: 64 HHQRGGAGARENGI--PFSSFGLIWQKYGLEIC-------QGSQETAN---AIDSGLVST 111
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
IDAID G +LS + N W Q + F + ++ A +
Sbjct: 112 IDAIDCG-------HVEGVAKGISLSQTISMFNPTW----QEESNFDACFDEAVEFASR- 159
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
L + A++IV + I D ++VL+++ PWK + L ++
Sbjct: 160 ILTRFIAAANGGINAKAIVAKAIENAAD----PRVIVLEQYTPWKKTVHALSDKAL---- 211
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
Y++Y G QW VQ V V P FE RK LP W GL D E E G+ VF H
Sbjct: 212 --YMVYPSHSG-QWIVQTVPVEPGSFEDRKSLPKPWAGLSDKEFQDETGLEDAVFCHNGL 268
Query: 325 FIGGNQSY 332
FI G +S+
Sbjct: 269 FIAGAKSF 276
>gi|260221600|emb|CBA30327.1| hypothetical protein Csp_C22990 [Curvibacter putative symbiont of
Hydra magnipapillata]
Length = 328
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 101/339 (29%), Positives = 161/339 (47%), Gaps = 55/339 (16%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+G+FH D+ G + L F + +++R+R +++D D V+DVGGV+D + +D
Sbjct: 9 IATHSGTFHADDVFGVGI--LMGVFPSHRLIRTRKQELIDTADFVVDVGGVWDAAKGRFD 66
Query: 85 HHQKGFE------EVFGH-GFSTKLSSAGLVYKHFGKELIAKELNVDEGH----PDVHRL 133
HHQ+GF+ EV G +SAGLV+ FG + +GH V +
Sbjct: 67 HHQRGFDGARPATEVDGAIEPGVGYASAGLVWSAFGTAYVQAWCK-GQGHALDEAAVAEV 125
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
++ + ++ +D +D G D P LSS + +LN W E
Sbjct: 126 VRSIDHSLVQYLDIVDTG----QGDVSPGIF---GLSSLIAQLNTHWLE----------- 167
Query: 194 FQQGMDLAGKEFLDTVRF-----YVRSWLPARSIVVECIAERYDYDP--------SGEIM 240
++G+D A K L RF R +L + + +A+ D G ++
Sbjct: 168 -EKGLDHAAKAQLLETRFREAIAITRKFLD--HAISKKVAQLRAMDTVRQAPRLLGGRVL 224
Query: 241 VLKR-FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQ 299
L+ PW H+ + EM P + +V+Y D G Q++++ V V F +R LP
Sbjct: 225 HLQEGGMPWT-HV--VLNEM---PEVMFVIYPDSDGDQYQIKTVPVEAGSFTARMDLPKS 278
Query: 300 WRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAM 338
W GLRD EL+ G+ VF H++ FIGG +S+ GAL M
Sbjct: 279 WAGLRDGELAAVNGVLDSVFCHLNLFIGGARSFEGALKM 317
>gi|354594150|ref|ZP_09012193.1| hypothetical protein CIN_08890 [Commensalibacter intestini A911]
gi|353673261|gb|EHD14957.1| hypothetical protein CIN_08890 [Commensalibacter intestini A911]
Length = 348
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 146/330 (44%), Gaps = 39/330 (11%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNA--QIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ THNG FH D+ + + N ++ R+RD ++++ D V DVGG++D
Sbjct: 45 IVTHNGRFHIDDVFAFTTLAIALDLDNRPFKVERTRDTEIIEKADIVFDVGGIFD-GKRR 103
Query: 83 YDHHQKGFEE------VFGHGFSTKLSSAGLVYKHFGKELIAK---ELNVDEGHPDVHRL 133
+DHHQ G E G + SSAGL+++ FG ++I K +L+ D+ H +
Sbjct: 104 FDHHQIGAPERDIKQTPKGIEGTIPYSSAGLIWRAFGLDVIQKLAPDLD-DKSRKIAHNV 162
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA 193
+ K+ + IDAIDNG + N N SS + N W D S + E
Sbjct: 163 ---IEKSLVIPIDAIDNGKMHPE--------NGLNFSSIINVFNPPWDTDDSSTQLERF- 210
Query: 194 FQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF 253
+ + TV Y + AR + C E Y P I++L R+ P H++
Sbjct: 211 ------FEASQIVRTVLMYQLNIEFARLRAIGCTQEAYKSSPDQRILMLPRWMP---HIY 261
Query: 254 ELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAG 313
+ L+ Y +WR+ V + + RK PA W GL + L+ E G
Sbjct: 262 PIFANGWATQLVIY-----PAENEWRIGTVPIRMHGNDRRKLFPASWGGLEGEALAAETG 316
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
IPG FVH FI ++ A+A+A L
Sbjct: 317 IPGSKFVHKGLFIAVTETKEAAIALAEKTL 346
>gi|162146419|ref|YP_001600878.1| hypothetical protein GDI_0594 [Gluconacetobacter diazotrophicus PAl
5]
gi|161784994|emb|CAP54537.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus
PAl 5]
Length = 340
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 150/341 (43%), Gaps = 43/341 (12%)
Query: 24 RVGTHNGSFHCDEALGCFMIR--------LTDKFFNAQ------IVRSRDPKVLDDLDAV 69
R TH+G+FH DE LG ++ L + AQ R+R P+ + D V
Sbjct: 17 RALTHSGNFHADETLGYVILHYALAPQGDLRARVLEAQPGDRLIFARTRAPERIQASDIV 76
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVGG+Y P+ YDHH K + + G T S+AGL++K +G I L
Sbjct: 77 FDVGGLYAPAKGRYDHHMKD-KPLREDG--TPYSAAGLLWKDYGVAAIRNILETPLDDST 133
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTE-----PD 184
V ++ A+ K+ + ID DNG+ + + +L+ V W P+
Sbjct: 134 VAAIWQAIDKSLILPIDQDDNGVAK---------MGKLSLADIVSACGPAWDTAELYGPE 184
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
++ RE F ++ V R+ L A V+ YD I+V++
Sbjct: 185 EARAREARGFANAAVAVASHLVNVVD-RARASLKATDRVLTA----YDTAEDKRILVMET 239
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
P + +FE + + YV+ G QW V+AV F R PLP WRGL
Sbjct: 240 GMPTEKVIFEHDLP------VVYVVSPTGTG-QWNVKAVPPVRGDFGQRVPLPEAWRGLE 292
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+ L+ +G+P VF H + FI G S GA+ MAR AL++
Sbjct: 293 REALAGISGVPDAVFAHPARFICGAGSRDGAIRMARLALEI 333
>gi|209543572|ref|YP_002275801.1| metal-dependent protein hydrolase [Gluconacetobacter diazotrophicus
PAl 5]
gi|209531249|gb|ACI51186.1| metal-dependent protein hydrolase [Gluconacetobacter diazotrophicus
PAl 5]
Length = 341
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 150/341 (43%), Gaps = 43/341 (12%)
Query: 24 RVGTHNGSFHCDEALGCFMIR--------LTDKFFNAQ------IVRSRDPKVLDDLDAV 69
R TH+G+FH DE LG ++ L + AQ R+R P+ + D V
Sbjct: 18 RALTHSGNFHADETLGYVILHYALAPQGDLRARVLEAQPGDRLIFARTRAPERIQASDIV 77
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVGG+Y P+ YDHH K + + G T S+AGL++K +G I L
Sbjct: 78 FDVGGLYAPAKGRYDHHMKD-KPLREDG--TPYSAAGLLWKDYGVAAIRNILETPLDDST 134
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTE-----PD 184
V ++ A+ K+ + ID DNG+ + + +L+ V W P+
Sbjct: 135 VAAIWQAIDKSLILPIDQDDNGVAK---------MGKLSLADIVSACGPAWDTAELYGPE 185
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
++ RE F ++ V R+ L A V+ YD I+V++
Sbjct: 186 EARAREARGFANAAVAVASHLVNVVD-RARASLKATDRVLTA----YDTAEDKRILVMET 240
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
P + +FE + + YV+ G QW V+AV F R PLP WRGL
Sbjct: 241 GMPTEKVIFEHDLP------VVYVVSPTGTG-QWNVKAVPPVRGDFGQRVPLPEAWRGLE 293
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+ L+ +G+P VF H + FI G S GA+ MAR AL++
Sbjct: 294 REALAGISGVPDAVFAHPARFICGAGSRDGAIRMARLALEI 334
>gi|86358226|ref|YP_470118.1| metal-dependent hydrolase [Rhizobium etli CFN 42]
gi|86282328|gb|ABC91391.1| putative metal-dependent hydrolase protein [Rhizobium etli CFN 42]
Length = 307
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 98/322 (30%), Positives = 149/322 (46%), Gaps = 33/322 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P+ + + DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARLIRSRAPEWITPAPGRIIYDVGGAYDPAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVH--RLFLAVYKNFM 142
HHQ+G + SS GL++KH+G+E +A G PD H L A F+
Sbjct: 66 HHQRGAPQ---RDDGQPYSSFGLIWKHYGREYLAAS-----GLPDHHVEALHAAFDTGFV 117
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG P + L + + L + E + +A+ + AF + +A
Sbjct: 118 LPIDLTDNG-----ALSPSGPLAGLMLPALLETLKPVFDEAEPAAD--DRAFHAALAIA- 169
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ F+D + L A +IV I + G I+ L R P++ + + + +
Sbjct: 170 RSFVDAGIAQRAAKLRAEAIVYRAIGDAG----QGRILELPRGMPFRPAIVKAGADHLL- 224
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V++ R K W V + + D FE R LPA W GL + EL GI G F H
Sbjct: 225 ----FVVHP--REKDWCVTGIRRAEDGFELRADLPAAWAGLTNGELEAVCGIEGASFCHN 278
Query: 323 SGFIGGNQSYGGALAMARAALK 344
F+ ++ ALAMA A++
Sbjct: 279 GRFVAAAKTRDAALAMAELAVQ 300
>gi|114770299|ref|ZP_01447837.1| hypothetical protein OM2255_11700 [Rhodobacterales bacterium
HTCC2255]
gi|114549136|gb|EAU52019.1| hypothetical protein OM2255_11700 [alpha proteobacterium HTCC2255]
Length = 310
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 157/320 (49%), Gaps = 27/320 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL--DDLDAVLDVGGVYDPSNDCYD 84
TH+G FH DE L +++ F A+I+RSR+ + ++ + DVGG YD +N +D
Sbjct: 8 THSGGFHADELLSTAILKFI--FPKAKIIRSREKDWITPNNRKIIYDVGGEYDLNNQIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++KHFGK+ + DE +HR F F+
Sbjct: 66 HHQRP-NPLRADG--QPYSSFGLIWKHFGKKFLKINSVSDEDIEHIHREF---DTKFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D IDNG + D + N +L + + + P SA ++AF + + +A ++
Sbjct: 120 VDLIDNG--EIDLSLSGA-IANLSLPALLENFKPAFDNPSASAN--DDAFMKALSIA-QD 173
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F+ ++ + S A IV + I + S EI+ L P++ L E
Sbjct: 174 FIQSIMQNLSSKHRANKIVNDLINDL----GSSEILELPTGMPFQSAL-----ENSNAKH 224
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+ +V+Y RG +W + V +S + F+ R LP W GL + +L +GI G +F H +
Sbjct: 225 VLFVIYP--RGSEWTLSTVKMSENTFDQRAKLPISWAGLTNKDLEVASGIDGALFCHNAR 282
Query: 325 FIGGNQSYGGALAMARAALK 344
FI ++ L+MA+ A++
Sbjct: 283 FIAIAETRDAILSMAKIAVE 302
>gi|406988680|gb|EKE08600.1| hypothetical protein ACD_17C00085G0005 [uncultured bacterium]
Length = 288
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/315 (30%), Positives = 149/315 (47%), Gaps = 41/315 (13%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ +GTH+GSFH DE C ++ L + +I R+RDP VLD D V DVGG+Y S
Sbjct: 4 RSLGTHDGSFHADEVTACSLLLLVNLIDRDKIYRTRDPGVLDQCDFVCDVGGIYSSSKRR 63
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ ++ LSSAG+V + KE N+ E H LF K +
Sbjct: 64 FDHHQVEYQ--------GPLSSAGMVLFYL------KEQNLLEPH-----LFDYFNKALI 104
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
IDA DNG+++ + T+ S + N E + S E+ + AF + +D A
Sbjct: 105 MGIDAHDNGLSKLEP-------GVTSFSQVIS--NFMPIEYNVSREKMDVAFLRAVDFAL 155
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
L ++ + RS V + +A++ +++ PW + FEL E
Sbjct: 156 GH-LSRLKERHAYTMRCRSQVQQAMADQR------RVLMFDESIPWLENFFELGGETHPA 208
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVS-PDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
++V+ + W+++ V S +R + RK P++W GL +E K +GIPG +F H
Sbjct: 209 ---EFVIMPTEH--HWKLRGVPPSLSERMKVRKAHPSEWAGLHGEEFKKISGIPGAIFCH 263
Query: 322 MSGFIGGNQSYGGAL 336
FI ++ AL
Sbjct: 264 KGRFISVWETKQDAL 278
>gi|340779473|ref|ZP_08699416.1| metal-dependent protein hydrolase [Acetobacter aceti NBRC 14818]
Length = 347
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/345 (28%), Positives = 157/345 (45%), Gaps = 47/345 (13%)
Query: 24 RVGTHNGSFHCDEALGCFMIR--------LTDKFFNA------QIVRSRDPKVLDDLDAV 69
R TH+G+FH DE LG ++ L + N VRSR+P+ + D V
Sbjct: 17 RALTHSGNFHTDETLGYVILHYALEPEGDLRGRVLNGGEGGRLTFVRSRNPEDIQAADIV 76
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHP- 128
DVGG Y P YDHH + + + G T S+AGL++K +GK I+ L P
Sbjct: 77 FDVGGEYAPPKGRYDHHMR-VKPLREDG--TPYSAAGLLWKDYGKAAISNILGAVLKKPV 133
Query: 129 ---DVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD- 184
DV ++ + K+ + +D DNG+ + + +L+ V N W +
Sbjct: 134 SEADVASIWQTLDKSLILPVDLDDNGVAK---------MGKLSLADIVSACNPPWDTTEL 184
Query: 185 ----QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIM 240
+ +E F ++++ VR+ L A S V+E Y+ I+
Sbjct: 185 YGVEDAKTKETTGFANAASAVAAHLVNSMD-RVRASLKAASRVMEA----YEKAEDKRIL 239
Query: 241 VLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
++ P + +FE + P++ YV+ + G QW V+A+ + F R LP W
Sbjct: 240 LMDTGMPTEKMIFENDL-----PVV-YVVSPTNNG-QWNVKAIPPTRGDFGQRVSLPEAW 292
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GL EL+K +G+P VF H + FI G S GAL MA+ AL++
Sbjct: 293 GGLEKSELAKVSGVPDAVFAHPARFICGAGSREGALKMAQLALQI 337
>gi|402580253|gb|EJW74203.1| hypothetical protein WUBG_14890, partial [Wuchereria bancrofti]
Length = 150
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 86/154 (55%), Gaps = 10/154 (6%)
Query: 83 YDHHQKGFEEVFG----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
YDHHQ+ F F TKLSSAGL+Y HFGK +I+ L + + LF +Y
Sbjct: 3 YDHHQRDFAHTMNTLGVMNFHTKLSSAGLIYAHFGKNVISALLGLQHD-SIIDVLFKKIY 61
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+ F+E+IDAIDNGI Q+D PRY LSSR+ LN W E + + E F +
Sbjct: 62 ETFVESIDAIDNGIAQFD--GKPRYYLGGTLSSRISMLNPSWNEDTVNVD---ERFMMAI 116
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYD 232
L KEF + + + +SWLPARS ++ + RYD
Sbjct: 117 KLVDKEFNELLTYLHKSWLPARSHIINAVTHRYD 150
>gi|323143978|ref|ZP_08078632.1| hypothetical protein HMPREF9444_01270 [Succinatimonas hippei YIT
12066]
gi|322416227|gb|EFY06907.1| hypothetical protein HMPREF9444_01270 [Succinatimonas hippei YIT
12066]
Length = 329
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/337 (27%), Positives = 161/337 (47%), Gaps = 35/337 (10%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+ TH+G+FH DE + C + L+ F N ++R+R+P L+ D ++DV G D N +D
Sbjct: 3 IATHDGTFHADETVACAI--LSYIFDNTSVIRTRNPLELEKADLIIDVSGKND--NRHFD 58
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK----ELNVDEGHPDVHRLFLAVYKN 140
HH K F +G + ++AGL++ FGKE ++K L + + F + +
Sbjct: 59 HHSKEFTLSRENGI--RYATAGLMWCKFGKEFLSKIAEDLLKTHYENDIIDAAFNRIDRE 116
Query: 141 FMEAIDAIDNG-INQYDTDK-PPRYVNNTNLSSRVGKL-NLD---------WTEPDQSAE 188
M +D DNG +N++ +K P+ N+ + + + +D P S +
Sbjct: 117 IMCMVDLNDNGQLNEFLENKIAPQTTEARNVFNALNEFYQIDPGIPYIVAMQNLPAVSGQ 176
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPW 248
+++AF Q + + K+ L + ++ + + YD G+I+++ PW
Sbjct: 177 EQDKAFMQTVKML-KQILQNASINALN----TEFGIKEVLKVYD---GGKILIMHTRLPW 228
Query: 249 KLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVS-PDRFESRKPLPAQWRGLRDDE 307
+ + K L Y DR + WRVQ++ +S RF +R P WRGL +
Sbjct: 229 TQAVLSHFDIFKNCILAVY----PDRKRGWRVQSLPLSKASRFANRCGAPLAWRGLDGQD 284
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
L K +G+ G +FVH +GF GG ++ L MA LK
Sbjct: 285 LDKVSGLNGTIFVHKAGFTGGALTFETNLEMANLWLK 321
>gi|402581518|gb|EJW75466.1| hypothetical protein WUBG_13627 [Wuchereria bancrofti]
Length = 108
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 57/101 (56%), Positives = 72/101 (71%), Gaps = 2/101 (1%)
Query: 247 PWKLHLFELEEEMKIE-PLIKYVLYEDDRGKQWRVQAVAVSPDR-FESRKPLPAQWRGLR 304
PWK H F +EE+ ++ I YV+YED+ QWRVQA+ V+ + FE+R PLP WRGLR
Sbjct: 2 PWKDHFFLIEEQFHLKNDDIIYVIYEDNVNSQWRVQAIPVNERQPFENRLPLPEAWRGLR 61
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
D EL+K A IPGC+FVH SGFIGGN+S G + MAR +L L
Sbjct: 62 DAELTKVADIPGCIFVHPSGFIGGNKSMQGVIEMARKSLSL 102
>gi|349802765|gb|AEQ16855.1| hypothetical protein [Pipa carvalhoi]
Length = 132
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/137 (46%), Positives = 92/137 (67%), Gaps = 9/137 (6%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
THNG FHCDEAL C+++R + + + +IVR+RDP++L D V+DVG YDP DHH
Sbjct: 1 THNG-FHCDEALACYLLRTLEPYRDTEIVRTRDPQLLAQCDVVVDVGE-YDPCRH-RDHH 57
Query: 87 QKGFEEVFG----HGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
Q+ E + + TKLSSAGLVY HFG++++A L E P++ L+ +Y+ F+
Sbjct: 58 QRFCETMNSLYPDKPWVTKLSSAGLVYAHFGRQILAT-LGTVEEEPNITVLYDKMYE-FV 115
Query: 143 EAIDAIDNGINQYDTDK 159
E IDAIDNGI+Q+D ++
Sbjct: 116 EEIDAIDNGISQFDGEQ 132
>gi|413935792|gb|AFW70343.1| hypothetical protein ZEAMMB73_937446 [Zea mays]
Length = 291
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 52/79 (65%), Positives = 61/79 (77%), Gaps = 1/79 (1%)
Query: 230 RYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDR 289
R + DPSGEIMVL RFCPWKLHLFELEE + +PL KYV Y+D+ K WRVQAV V+PDR
Sbjct: 208 RGNIDPSGEIMVLHRFCPWKLHLFELEEVLTTDPLTKYVFYQDE-SKSWRVQAVVVAPDR 266
Query: 290 FESRKPLPAQWRGLRDDEL 308
F S K L +WRG+RD+ L
Sbjct: 267 FNSWKALLEKWRGMRDERL 285
>gi|284045104|ref|YP_003395444.1| metal-dependent protein hydrolase [Conexibacter woesei DSM 14684]
gi|283949325|gb|ADB52069.1| metal-dependent protein hydrolase [Conexibacter woesei DSM 14684]
Length = 299
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 146/324 (45%), Gaps = 37/324 (11%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
RV TH+GSFH D+ + L D ++VR+RDP++L D +DVG DP+ +
Sbjct: 2 RVATHSGSFHADDVFAIAALSLLDDAAPLEVVRTRDPQLLAAADVRVDVGQRDDPAGGDF 61
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+G +G + +S GLV++ G + + + + + ++
Sbjct: 62 DHHQRGGAGERPNGI--RYASFGLVWREHGARICGGDEIAER-----------IDQVLVQ 108
Query: 144 AIDAIDNG--INQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+DA D G I++ D V +S + LN +W + +A++ +AF ++LA
Sbjct: 109 GVDANDTGQTISRSLVDG----VAPFTVSHAIAALNPNWDDAPTAADK-RQAFDAAVELA 163
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ A ++V IA D ++ L R PW L +
Sbjct: 164 AGILRREIAAATSQ-ARAAALVRSAIARAEDP----RLIELDRGMPWHREL------IPG 212
Query: 262 EPLIKYVLY--EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
P +VLY EDD W +QAV F +RK LP W GL D EL+ G+ F
Sbjct: 213 APDALFVLYPREDD----WGLQAVPRQLGEFANRKDLPESWAGLSDAELAAATGVADARF 268
Query: 320 VHMSGFIGGNQSYGGALAMARAAL 343
H+ FI S GALA+AR AL
Sbjct: 269 CHIGRFIAVAGSREGALALARQAL 292
>gi|418409819|ref|ZP_12983130.1| hypothetical protein AT5A_21416 [Agrobacterium tumefaciens 5A]
gi|358003868|gb|EHJ96198.1| hypothetical protein AT5A_21416 [Agrobacterium tumefaciens 5A]
Length = 311
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 146/326 (44%), Gaps = 39/326 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR P+ + +D ++ DVGG YD ++ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPGVDRIIYDVGGAYDATSRIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KHFG++ +A DE VH F +F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHFGRDYLAAFGIPDEHVETVHTSF---DGSFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG P + L + K D T+P E ++ +F + +A +
Sbjct: 120 VDLVDNGAVSPSVAGPLVALTLPVLLETL-KPVFDETDP----EADDRSFHAALVIA-RS 173
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A ++V++ I + K H+ EL M P
Sbjct: 174 FVEAKIAKSAAKLRAEALVMQAIVDA-----------------GKGHILELPMGMPFRPA 216
Query: 265 I------KYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
I + R K W + + + + FE R LPA W GL + +L GI G
Sbjct: 217 IVKAGADNLLFVVHPRDKDWCLTGIRRADEGFELRADLPAAWAGLTNGDLEAVCGIEGAS 276
Query: 319 FVHMSGFIGGNQSYGGALAMARAALK 344
F H FI ++ ALAMA A+K
Sbjct: 277 FCHNGRFIAAAKTRDAALAMAELAVK 302
>gi|282891717|ref|ZP_06300198.1| hypothetical protein pah_c197o010 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|338176065|ref|YP_004652875.1| hypothetical protein PUV_20710 [Parachlamydia acanthamoebae UV-7]
gi|281498301|gb|EFB40639.1| hypothetical protein pah_c197o010 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|336480423|emb|CCB87021.1| UPF0160 protein CT_386 [Parachlamydia acanthamoebae UV-7]
Length = 296
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 147/320 (45%), Gaps = 43/320 (13%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q + +GTH+G+FH DE C ++ L D + +I+R+R L + V DVGG+YDP
Sbjct: 4 QKAPRSLGTHDGTFHADEVTACALLLLFDLIDSDKILRTRSLDQLSRCEYVCDVGGIYDP 63
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
+ +DHHQ ++ + +SSAG++ + + I L +E H L L V
Sbjct: 64 AQKLFDHHQVQYQGM--------MSSAGMILLYLKDQGI---LKTNEYQFFNHALILGV- 111
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
DA DNG P+ S V N E + S + +N+AF + +
Sbjct: 112 -------DASDNG-------NDPQIQGLCTYSHVVS--NFTPIEHNASPDVQNKAFFEAL 155
Query: 199 DLAGKEFLDTVRFYVR-SWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ A F R + R + + VVE +Y E ++ ++ PW FEL+
Sbjct: 156 EFA---FGHLKRLWERYCYTQSCRQVVEETMNKY-----SECLIFEKGIPWLEIFFELD- 206
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVS-PDRFESRKPLPAQWRGLRDDELSKEAGIPG 316
+ K+V+ G W+++ + + DR R PLP +W GL D+EL K +GI G
Sbjct: 207 --GVNHPAKFVIMPS--GSHWKLRGIPPNYEDRMNVRFPLPQEWAGLIDEELKKISGIDG 262
Query: 317 CVFVHMSGFIGGNQSYGGAL 336
+F H FI ++ AL
Sbjct: 263 AIFCHKGRFISVWETREDAL 282
>gi|296114815|ref|ZP_06833464.1| metal-dependent protein hydrolase [Gluconacetobacter hansenii ATCC
23769]
gi|295978629|gb|EFG85358.1| metal-dependent protein hydrolase [Gluconacetobacter hansenii ATCC
23769]
Length = 341
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/341 (29%), Positives = 154/341 (45%), Gaps = 49/341 (14%)
Query: 27 THNGSFHCDEALGCFMIRL-----------------TDKFFNAQIVRSRDPKVLDDLDAV 69
TH+G+FH DE LG ++ TD+ R+R P+ + D V
Sbjct: 20 THSGNFHVDETLGYVILHYALAPHGDLRGRVLGTVPTDRL---DFGRTRSPERIRAADIV 76
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVGG+YDP YDHH K + + G T S+AGL++K +G I L
Sbjct: 77 FDVGGIYDPQKGRYDHHMKD-KPLREDG--TPYSAAGLLWKDYGIAAIRNILKAPVDDAA 133
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTE-----PD 184
+ ++ A+ K+ + ID DNG+ + + +L+ V W P+
Sbjct: 134 LPIIWQAIDKSLVLPIDLDDNGVVK---------LEKLSLADIVSSCRPVWDTAELYGPE 184
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
++A RE+ F ++TV VR+ L A V+E A D I+V++
Sbjct: 185 EAAMRESVGFADAACAVAGYLVNTVD-RVRASLKATDRVLEAYANAQDK----RILVMET 239
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
P + +FE + P++ YV+ G+ W V+AV F R LP WRGL
Sbjct: 240 GMPTEKVIFEHDL-----PVV-YVVSPAAPGR-WNVKAVPPVRGDFGQRVSLPEAWRGLD 292
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+ L+K +G+ VF H + FI G S GAL MAR AL++
Sbjct: 293 GEALAKVSGVADAVFAHPARFICGAGSREGALEMARLALEI 333
>gi|384485099|gb|EIE77279.1| hypothetical protein RO3G_01983 [Rhizopus delemar RA 99-880]
Length = 152
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 85/155 (54%), Gaps = 6/155 (3%)
Query: 190 ENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWK 249
E E F++ L G+EF + + SW+P+R +V + R+ + SG+++VL FCPW
Sbjct: 2 ELEQFEKASVLVGQEFHQHLDEIMMSWIPSRQLVQKAFQNRFKHHKSGKMIVLDTFCPWS 61
Query: 250 LHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
HL ++ + YV+Y WRVQ V FESR PLP +WRGL + L
Sbjct: 62 DHLSVIDTNN-----VLYVIYPATE-HDWRVQCVKKRGSLFESRHPLPRRWRGLSTERLR 115
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+ + +FVH SGFIGG ++Y A+ MA ALK
Sbjct: 116 QITNLETAMFVHASGFIGGCKNYSDAMKMAEMALK 150
>gi|168334523|ref|ZP_02692684.1| hypothetical protein Epulo_06018 [Epulopiscium sp. 'N.t. morphotype
B']
Length = 295
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 160/334 (47%), Gaps = 51/334 (15%)
Query: 22 LKRVGTHNGSFHCDEALG-CFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSN 80
+K++ H+G+FH D+ M + D + ++VR+RD L D + DVGG
Sbjct: 1 MKKMAVHSGTFHADDVFAVALMYGIYD---DLEVVRTRDEAELSTCDIIADVGG------ 51
Query: 81 DCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK- 139
YDHH + KL + G+ Y FG L+ ++ +D + L Y+
Sbjct: 52 GQYDHHY----------VNKKLRADGIPYCAFG--LLWQDFGIDYIKNNFEPLPQEQYEE 99
Query: 140 -------NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP-DQSAEREN 191
+F+ IDA DNG++ +D V+ +D P D +
Sbjct: 100 IKDRIAIDFITVIDANDNGLDIVRSDYKIMTVSGI----------IDVFMPFDATQAAAT 149
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPS-GEIMVLKRFCPWKL 250
+ F + ++LA K V VR + + V E +A + +P I+VL++ +K
Sbjct: 150 KGFFEAVELAKKILYHVVAKEVR-YFGDFNYVKEQLAIQ---NPKESHILVLEKRVSFKK 205
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSK 310
L +L+ +M + +V+Y+D K W VQ + ++ D F++RK LP W GL ++ L K
Sbjct: 206 PLIKLDVDMDV----LFVVYKDLSDK-WMVQNIQLTEDSFDARKNLPDSWAGLNEEALDK 260
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
GI GCVF H + F+ GN++ GALAMAR A++
Sbjct: 261 VTGIDGCVFCHPAKFLCGNKTKEGALAMARLAVE 294
>gi|330992440|ref|ZP_08316388.1| UPF0160 protein [Gluconacetobacter sp. SXCC-1]
gi|329760639|gb|EGG77135.1| UPF0160 protein [Gluconacetobacter sp. SXCC-1]
Length = 349
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 162/365 (44%), Gaps = 51/365 (13%)
Query: 8 SSPAYSTSSPSQTPLK--------RVGTHNGSFHCDEALGCFMI------------RLTD 47
S+PA + TP+ TH+G+FH DE LG ++ R+
Sbjct: 2 STPAAGSPMSQHTPIGLDNGHGTITAVTHSGNFHLDETLGYVILHYALAPQGDLAGRVVG 61
Query: 48 KFFNAQI--VRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSA 105
+ ++ R+R P+ + D V DVGG +DP+ YDHH K + + G T S+A
Sbjct: 62 NVPDDRLHFTRTRAPERIAAADIVFDVGGRHDPATGRYDHHMKD-KPLREDG--TPYSAA 118
Query: 106 GLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVN 165
GL++K +G I L D+ ++ A+ K+ + ID DNG+ + +
Sbjct: 119 GLLWKDYGIAAIRNMLATPVDEADLPAIWQAIDKSLVLPIDQDDNGVAK---------MG 169
Query: 166 NTNLSSRVGKLNLDWTE-----PDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPAR 220
+L+ V W P+Q+ RE F + ++TV VR+ L A
Sbjct: 170 KLSLADIVSACRPTWDTAELYGPEQARLREAAGFAEAATTVAGYLVNTVD-RVRASLKAA 228
Query: 221 SIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRV 280
V+ D ++V+ P + +FE + P++ YV+ + +W V
Sbjct: 229 GRVLAAFEAAQDK----RVLVMDTGMPTEKVIFEHDL-----PVV-YVVSPAGQ-DRWNV 277
Query: 281 QAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMAR 340
+AV + F R LP WRGL + L++ +G+P VF H + FI G S GA+ MA
Sbjct: 278 KAVPPTRGDFGQRVSLPEAWRGLDGETLARVSGVPDAVFAHPARFICGAASKAGAIRMAT 337
Query: 341 AALKL 345
AL++
Sbjct: 338 LALEI 342
>gi|349700685|ref|ZP_08902314.1| hypothetical protein GeurL1_07763 [Gluconacetobacter europaeus LMG
18494]
Length = 384
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 151/341 (44%), Gaps = 43/341 (12%)
Query: 24 RVGTHNGSFHCDEALGCFMIR--------LTDKFFN------AQIVRSRDPKVLDDLDAV 69
R TH+G+FH DE LG ++ L + + +RSR P + D V
Sbjct: 59 RALTHSGNFHVDETLGYVILHYALAPKGDLRARVLGDGPADRLEFIRSRSPDRIAASDIV 118
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVGG YDP+ YDHH K + + G T S+AGL++K +G I + D
Sbjct: 119 FDVGGQYDPAKGRYDHHMKD-KPLRDDG--TPYSAAGLLWKDYGVAAIRNIVQTQVDEAD 175
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW--TE---PD 184
+ ++ A+ ++ + ID DNG+ + + +L+ V W TE P+
Sbjct: 176 IAAIWQALDRSLVLPIDEDDNGVVK---------MGRLSLADIVSSCRPAWDTTELYGPE 226
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
++ RE F ++ V VR+ L A V+ Y+ I+++
Sbjct: 227 EARRRETLGFANAATTVASHLVNAVD-RVRASLKAAHRVMAA----YEAAEDKRILLMDT 281
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
P + +FE + P++ V +W V+AV F R LP WRGL
Sbjct: 282 GMPTEKVIFENDL-----PVVYVV--SPAAPDRWNVKAVPPVRGDFGQRVSLPEAWRGLD 334
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
++L++ G+P VF H + FI G S GAL MAR ALK+
Sbjct: 335 GEKLARVCGVPDAVFAHPARFICGAGSKEGALQMARLALKI 375
>gi|114761192|ref|ZP_01441107.1| hypothetical protein 1100011001310_R2601_02678 [Pelagibaca
bermudensis HTCC2601]
gi|114545440|gb|EAU48442.1| hypothetical protein R2601_02678 [Roseovarius sp. HTCC2601]
Length = 322
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 145/322 (45%), Gaps = 32/322 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT + A++VRSR + + D + DVGG YDP +D
Sbjct: 8 THSGGFHADEVLSTVI--LTRVYPEAEVVRSRSSEWITPADGRLIYDVGGAYDPDAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD-VHRLFLAVYKNFME 143
HHQ+ E T SS GLV+KHFG + + K + E H + VH F ++F+
Sbjct: 66 HHQR---ESPLREDDTPYSSFGLVWKHFGIDFL-KSFEIPEAHLETVHASF---DRSFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAFQQGMDLA 201
+D +DNG P +++ L + L D T+P E E AF + +A
Sbjct: 119 PVDQVDNGTVSLSEAGP---LSSMTLPGLIETLKPVFDDTDP----ESETRAFHAAVGIA 171
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
++F++ + L A + E I D I+ L P++ + E + +
Sbjct: 172 -RQFVEARIGRSAAKLRAEVLAAEAIRAAGD----SPILELPTGMPFRPAVIEAGADHLL 226
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V+ G +W + + FE R LPA W GL EL + +G+PG F H
Sbjct: 227 -----FVVTPRSAG-EWTLGGIRKHEQGFEQRADLPAAWAGLSGAELEQVSGVPGAKFCH 280
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
+ FI S + MA A+
Sbjct: 281 KARFIAAASSREAIIKMAELAV 302
>gi|126727005|ref|ZP_01742843.1| hypothetical protein RB2150_17997 [Rhodobacterales bacterium
HTCC2150]
gi|126703677|gb|EBA02772.1| hypothetical protein RB2150_17997 [Rhodobacterales bacterium
HTCC2150]
Length = 305
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 154/326 (47%), Gaps = 30/326 (9%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPS 79
+ ++ TH+G FH DE L + LT F +A+++R+R+ +L+ + DVGG YD +
Sbjct: 3 ITQLITHSGGFHADELLSSVI--LTQLFPDARLIRTRNKSLLEPASDKIIYDVGGAYDAA 60
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE-LIAKELNVDEGHPDVHRLFLAVY 138
+DHHQ+ SS GL+++H+G + L+A + D D+ +
Sbjct: 61 AQIFDHHQR--PGPLREEDEKPYSSFGLIWRHYGFDYLVAMNVPAD----DIEAIHHKFD 114
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
NF+ ID +DNG + P ++ L S +G L + D S +++AF +
Sbjct: 115 TNFVTPIDLLDNGAIEPSVAGP---LSILTLPSLLGSLKPSFD--DASPTADDDAFLVAL 169
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+A + F + + + A+ IV E IA+ + I+ L P++ L + E +
Sbjct: 170 PIA-RAFTEAEIRALAANARAKGIVAEAIAKTG----ASAILELPMGMPYRSSLAKAEAD 224
Query: 259 MKIEPLIKYVLYED-DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+V++ RG+ W + + +S D F+ R LPA W GL D L + +G+ G
Sbjct: 225 --------HVMFMVCPRGEDWTLNGIKLSGDTFDQRADLPASWAGLSDAALEEASGVTGA 276
Query: 318 VFVHMSGFIGGNQSYGGALAMARAAL 343
F H + FI S + MA A+
Sbjct: 277 KFCHNARFIAVASSREAIMKMAEIAV 302
>gi|198416587|ref|XP_002121538.1| PREDICTED: similar to LOC496075 protein, partial [Ciona
intestinalis]
Length = 136
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 71/99 (71%), Gaps = 5/99 (5%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
++GTHNG+FHCDE L C++++L K+ +A+IVR+RD ++L++ D V+DVGGVYD Y
Sbjct: 18 KIGTHNGTFHCDEVLACYLLKLLPKYKDAEIVRTRDTEILNNCDIVVDVGGVYDHDKSRY 77
Query: 84 DHHQKGFEEVFG-----HGFSTKLSSAGLVYKHFGKELI 117
DHHQ+ F + TKLSSAGLVY H+G+E++
Sbjct: 78 DHHQRSFSGTMNSIRPDKPWKTKLSSAGLVYCHYGEEIL 116
>gi|332715797|ref|YP_004443263.1| hypothetical protein AGROH133_10312 [Agrobacterium sp. H13-3]
gi|325062482|gb|ADY66172.1| hypothetical protein AGROH133_10312 [Agrobacterium sp. H13-3]
Length = 311
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 147/326 (45%), Gaps = 39/326 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR P+ + +D ++ DVGG YD ++ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPGVDRIIYDVGGAYDATSRIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KHFG++ +A DE VH F +F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHFGRDYLAAFGIPDEHVETVHTSF---DGSFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG P + L + K D +P E ++ +F + +A +
Sbjct: 120 VDQVDNGAVSPSVAGPLVALTLPVLLETL-KPVFDEIDP----EADDRSFHAALVIA-RS 173
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A ++V++ I + +GE H+ EL M P
Sbjct: 174 FVEAKIAKSAAKLRAEALVMQAIVD------AGE-----------GHILELPMGMPFRPA 216
Query: 265 I------KYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
I + R K W + + + + FE R LPA W GL + +L GI G
Sbjct: 217 IVKAGADNLLFVVHPRDKDWCLTGIRRADEGFELRADLPAAWAGLTNGDLEAVCGIEGAS 276
Query: 319 FVHMSGFIGGNQSYGGALAMARAALK 344
F H FI ++ ALAMA A+K
Sbjct: 277 FCHNGRFIAAAKTRDAALAMAELAVK 302
>gi|424882217|ref|ZP_18305849.1| hypothetical protein Rleg8DRAFT_3816 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392518580|gb|EIW43312.1| hypothetical protein Rleg8DRAFT_3816 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 307
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 145/320 (45%), Gaps = 29/320 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR P+ + + + DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPGEDRIIYDVGGAYDPAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KH+G++ + + E H V + + +F+
Sbjct: 66 HHQRGAPL---RDDGQPFSSFGLIWKHYGRDYLTA-FGLPEAH--VEAMHGSFDASFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D DNG P + L + + L + E D E +N AF + +A +
Sbjct: 120 VDLTDNG-----ALSPSGPLAGLTLPALLETLKPVFDEADP--EADNRAFHAALAIA-RS 171
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A +IV I + G ++ L R P++ + + + +
Sbjct: 172 FVEARIAQSAAKLRAEAIVHRAI----EAAGQGRVLELPRGMPFRPAIIKAGADHLL--- 224
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+V++ R K W V + D FE R LPA W GL + L GI G F H
Sbjct: 225 --FVVHP--REKDWCVTGIRRGEDGFELRADLPAAWAGLTNGALEAVCGIEGASFCHNGR 280
Query: 325 FIGGNQSYGGALAMARAALK 344
FI ++ LAMA A+K
Sbjct: 281 FIAAAKTREATLAMAELAVK 300
>gi|384135056|ref|YP_005517770.1| metal-dependent protein hydrolase [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
gi|339289141|gb|AEJ43251.1| metal-dependent protein hydrolase [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
Length = 258
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 131/282 (46%), Gaps = 31/282 (10%)
Query: 62 VLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL 121
+L D V+DVGG YDHH + +SAGL+++ FG + + L
Sbjct: 1 MLAQCDLVVDVGG------GPYDHHSV---QKVHRPNGIPYASAGLIWRDFGDRFL-ESL 50
Query: 122 NVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWT 181
V E D + + +AIDAIDNGI+ R + +S V N W
Sbjct: 51 GV-EREEDRALISSNIDDKLFQAIDAIDNGIDLE------RDMRIKGISELVSSFNPPWN 103
Query: 182 EPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
S E EN AF++ +D A + ++ + S + A IV A R + P+ ++V
Sbjct: 104 ----SQEDENRAFERALDFATQILMNYANHEI-SRIQATEIVKAAYAARKE--PA--LLV 154
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
L CPW L E++ ++ YV + D G Q+R+Q V P FE+RKPLP +W
Sbjct: 155 LPTCCPWTETLLEMDPAGEV----LYVAFPDKTG-QYRLQVVPKGPGTFEARKPLPHEWA 209
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
G +EL G+ VF H + FI G ++ G L MA AL
Sbjct: 210 GKEGEELVSICGVEDAVFCHPARFIAGAETLDGILQMAEEAL 251
>gi|209549857|ref|YP_002281774.1| metal-dependent protein hydrolase [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209535613|gb|ACI55548.1| metal-dependent protein hydrolase [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length = 307
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/323 (30%), Positives = 149/323 (46%), Gaps = 35/323 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR P+ + + + DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPGEGRIIYDVGGAYDPAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD-VHRLFLAVYKNFME 143
HHQ+G SS GL++KH+G++ +A + E H + VH F A F+
Sbjct: 66 HHQRGAPT---REDGQPYSSFGLIWKHYGRDYLAAS-GLPEAHVEAVHGSFDA---GFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAFQQGMDLA 201
ID DNG P + L + + L D EP E ++ AF + +A
Sbjct: 119 PIDLTDNG-----ALSPSGPLAGLTLPALLETLKPVFDEAEP----EADDRAFHAALAIA 169
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + L A ++V I G ++ L R P++ + + + +
Sbjct: 170 -RSFVEARIAQSAAKLRAEAMVHLAI----QAAGQGRVLELPRGMPFRPAIVKAGADHLL 224
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V++ R K W V + + + FE R LPA W GL + EL GI G F H
Sbjct: 225 -----FVVHP--REKDWCVTGIRRAEEGFELRADLPAAWAGLANGELEAVCGIKGASFCH 277
Query: 322 MSGFIGGNQSYGGALAMARAALK 344
FI ++ ALAMA A+K
Sbjct: 278 NGRFIAAARTREAALAMADLAVK 300
>gi|310830299|ref|YP_003965399.1| hypothetical protein EIO_3293 [Ketogulonicigenium vulgare Y25]
gi|385235193|ref|YP_005796534.1| Metal-dependent hydrolase protein [Ketogulonicigenium vulgare
WSH-001]
gi|308753205|gb|ADO44348.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25]
gi|343464348|gb|AEM42781.1| Metal-dependent hydrolase protein [Ketogulonicigenium vulgare
WSH-001]
Length = 308
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 152/320 (47%), Gaps = 27/320 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPSNDCYD 84
TH+GSFH DE + + LT F +A I+R+RD + + D + DVG YD +D
Sbjct: 8 THSGSFHADELMSSVV--LTRLFPDATILRTRDAQAITPADDRIIYDVGRAYDADQRIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ + SS GL++KHFG + + + + V EG D+ + L++ ++F
Sbjct: 66 HHQP---DAPRREDDQPYSSFGLIWKHFGADYL-RAMAVPEG--DIEAIHLSMDRHFALP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG D V T L + L + D +AE ++AF + +A +
Sbjct: 120 VDLVDNG--ALDPATAGALVGLT-LPVLLESLKPVFDNDDPAAE--DQAFHAALAIA-RA 173
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F + + A S+V++ IA + G+++ L R P++ + ++ + L
Sbjct: 174 FFEASVGRKAAKARAESMVLDAIATAGE----GKVLELPRGMPFRSAI----DQAGADHL 225
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+ +V+ RG W + + PD FE R LPA W GL D L +G+ G F H
Sbjct: 226 L-FVITP--RGTDWSLAGIRKKPDGFEQRADLPAAWAGLNDAALEAASGVKGAKFCHNGR 282
Query: 325 FIGGNQSYGGALAMARAALK 344
FI S +AMA A++
Sbjct: 283 FIAVADSREAIVAMAEIAVR 302
>gi|424919161|ref|ZP_18342525.1| hypothetical protein Rleg9DRAFT_6879 [Rhizobium leguminosarum bv.
trifolii WSM597]
gi|392855337|gb|EJB07858.1| hypothetical protein Rleg9DRAFT_6879 [Rhizobium leguminosarum bv.
trifolii WSM597]
Length = 307
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/322 (30%), Positives = 149/322 (46%), Gaps = 35/322 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR P+ + + + DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPGEGRIIYDVGGAYDPAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD-VHRLFLAVYKNFME 143
HHQ+G SS GL++KH+G++ +A + E H + VH F A F+
Sbjct: 66 HHQRGAPT---REDGQPYSSFGLIWKHYGRDYLAAS-GLPEAHVEAVHGSFDA---GFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAFQQGMDLA 201
ID DNG P + L + + L D EP E ++ AF + +A
Sbjct: 119 PIDLTDNG-----ALSPSGPLAGLTLPALLETLKPVFDEAEP----EADDRAFHAALAIA 169
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + L A ++V I + G ++ L R P++ + + + +
Sbjct: 170 -RSFVEARIAQSAAKLRAEAMVHRAI----EAAGQGRVLELPRGMPFRPAIVKAGADHLL 224
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V++ R K W V + + + FE R LPA W GL + EL GI G F H
Sbjct: 225 -----FVVHP--REKDWCVTGIRRAEEGFELRADLPAAWAGLANGELEAVCGIEGASFCH 277
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
FI ++ ALAMA A+
Sbjct: 278 NGRFIAAARTREAALAMADLAV 299
>gi|347759581|ref|YP_004867142.1| hypothetical protein GLX_03600 [Gluconacetobacter xylinus NBRC
3288]
gi|347578551|dbj|BAK82772.1| hypothetical protein GLX_03600 [Gluconacetobacter xylinus NBRC
3288]
Length = 357
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 95/338 (28%), Positives = 152/338 (44%), Gaps = 43/338 (12%)
Query: 27 THNGSFHCDEALGCFMIR--------LTDKFF------NAQIVRSRDPKVLDDLDAVLDV 72
TH+G+FH DE LG ++ L + R+R P+ + + V DV
Sbjct: 36 THSGNFHLDETLGYVILHYALAPQGDLAGRVMANTPGDRLHFTRTRTPERIAAANIVFDV 95
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR 132
GG +DP+ YDHH K + + G T S+AGL++K +G I L ++
Sbjct: 96 GGRHDPAAGRYDHHMKD-KPLREDG--TPYSAAGLLWKDYGIAAIRNMLATPVDEAELPA 152
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTE-----PDQSA 187
++ A+ K+ + ID DNG+ + + +L+ V W P+Q+
Sbjct: 153 IWQAIDKSLVLPIDQDDNGVAK---------MGKLSLADIVSACRPAWDTAELYGPEQAR 203
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP 247
RE+ F Q ++ V VR+ L A S V+ Y+ I+++ P
Sbjct: 204 ARESAGFSQAATTIAGYLVNMVD-RVRASLKAASRVLAA----YEAAQDKRILIMDTGMP 258
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
+ +FE + P++ YV+ R + W V+AV + F R LP WRGL +
Sbjct: 259 TEKVIFEHDL-----PVV-YVVSPAGRDR-WNVKAVPPTRGDFGQRVSLPDAWRGLEGEA 311
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
L+K +G+ VF H + FI G S GA+ MA AL++
Sbjct: 312 LAKVSGVSDAVFAHPARFICGAASKAGAVRMATLALEI 349
>gi|440224137|ref|YP_007337533.1| metal-dependent protein hydrolase [Rhizobium tropici CIAT 899]
gi|440043009|gb|AGB74987.1| metal-dependent protein hydrolase [Rhizobium tropici CIAT 899]
Length = 313
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 147/326 (45%), Gaps = 39/326 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A++VRSR P+ + D ++ DVGG YD + +D
Sbjct: 8 THSGGFHADEVLSSVI--LTQLFPEARLVRSRAPEWITPGADRIVYDVGGAYDVAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G + SS GL++KHFG++ + L+V V + A +F+
Sbjct: 66 HHQRGAPQ---RNDGQPFSSFGLIWKHFGRDYL-TALSVPGAF--VETMHAAFDASFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
ID +DNG D+ + L S + L + +PD +E AF + +A +
Sbjct: 120 IDLVDNGALSPDSAG---LLAGLTLPSLLETLKPVFDDPD--SEATERAFHAAVAIA-RS 173
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + + L A ++V++ IA +GE VL EL M P
Sbjct: 174 FVEAKIAGIHAKLRAETLVLKAIAS------AGEARVL-----------ELPMGMPFRPA 216
Query: 265 I------KYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
I + R W + + + + FE R LPA W GL +L G+ G
Sbjct: 217 IMKAGADHLLFVVHPRETDWCLTGIRRAEEGFELRADLPAAWAGLSGRDLEVATGVEGAS 276
Query: 319 FVHMSGFIGGNQSYGGALAMARAALK 344
F H FI ++ LA+AR A++
Sbjct: 277 FCHNGRFIAAAKTREAILALARLAVE 302
>gi|304392851|ref|ZP_07374783.1| metal-dependent protein hydrolase [Ahrensia sp. R2A130]
gi|303295019|gb|EFL89387.1| metal-dependent protein hydrolase [Ahrensia sp. R2A130]
Length = 310
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 147/324 (45%), Gaps = 35/324 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPSNDCYD 84
TH+G FH DE + + LT F +A++VRSRD + + D + DVGG YD + +D
Sbjct: 8 THSGGFHADELMSSVI--LTKLFPDAELVRSRDARWITPADDRIIYDVGGAYDATAQIFD 65
Query: 85 HHQK-GFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
HHQ+ G G +S S GL++ H+GK+ +A L V H D+ + A ++
Sbjct: 66 HHQRPGPLRQDGQPYS----SFGLIWLHYGKDYLAA-LAV-PAH-DIDAIHSAFDTKYVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP---DQSAERENEAFQQGMDL 200
ID +DNG + P + L LD +P D S ++AF + +
Sbjct: 119 PIDLLDNGAIEPSVAGPLSVLTLPAL--------LDGLKPVFDDTSPTANDDAFFSALPI 170
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
+ F++ + + + AR I IA D I+ L P++ L + + E
Sbjct: 171 V-RSFIEALVRDLTATARARGIAQSAIASAGD----SAILELPMGMPYRAALKDAQAE-- 223
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+ RG+ W + + +S D F+ R LPA W GL D L +G+ G F
Sbjct: 224 -----HMLFMVAPRGEDWTLNGIKLSDDTFDQRADLPAAWAGLTDAALEVASGVSGAKFC 278
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H + FI S + MA+ A++
Sbjct: 279 HNARFIAVAGSREAIMQMAQIAVR 302
>gi|349686062|ref|ZP_08897204.1| hypothetical protein Gobo1_02543 [Gluconacetobacter oboediens
174Bp2]
Length = 342
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 149/341 (43%), Gaps = 43/341 (12%)
Query: 24 RVGTHNGSFHCDEALGCFMIR--------LTDKFFN------AQIVRSRDPKVLDDLDAV 69
R TH+G+FH DE LG ++ L + RSR P+ + D V
Sbjct: 17 RALTHSGNFHVDETLGYVILHYALAPEGDLRGRVLGDAPADRLAFTRSRSPERIAASDIV 76
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVGG Y P YDHH K + + G T S+AGL++K +G I L V D
Sbjct: 77 FDVGGQYAPDRGRYDHHMKD-KPLRDDG--TPYSAAGLLWKDYGTAAIRNILRVPAEDTD 133
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDW--TE---PD 184
V ++ A+ ++ + ID DNG+ + + +L+ V W TE P+
Sbjct: 134 VAAIWQAIDRSLVLPIDEDDNGVVK---------MGKLSLADIVSSCRPAWDTTELYGPE 184
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
++ RE F ++ V VR+ L A + V+ Y+ I+++
Sbjct: 185 EAKRRETLGFANAATTVAGHLVNVVD-RVRASLKAANRVMAA----YEAAEDKRILLMDT 239
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
P + +FE + P++ V +W V+AV + F R LP WRGL
Sbjct: 240 GMPTEKVIFENDL-----PVVYVV--SPAAPDRWNVKAVPPTRGDFGQRVSLPEAWRGLD 292
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
L+K +G+ VF H + FI G S GAL MAR ALK+
Sbjct: 293 GPALAKVSGVTDAVFAHPARFICGAGSREGALQMARLALKI 333
>gi|384919787|ref|ZP_10019824.1| metal-dependent protein hydrolase [Citreicella sp. 357]
gi|384466389|gb|EIE50897.1| metal-dependent protein hydrolase [Citreicella sp. 357]
Length = 316
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 151/320 (47%), Gaps = 28/320 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F +AQ+VRSRDP+ + + DVG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPDAQLVRSRDPEWIAPARTRIIYDVGQQYDPAQGVFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD-VHRLFLAVYKNFME 143
HHQ+ + SS GLV+ FG++ +A L + G D VH F + F+
Sbjct: 66 HHQRPTPQ---RPDGQPFSSFGLVWNRFGRDYLAA-LGLPAGDIDAVHAQF---DERFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
ID +DNG + P + L + L + +PD +A+ + AF + +A +
Sbjct: 119 PIDLMDNGAVDPGSAGP--LLARLTLPVLLESLKPVFDDPDPNAD--DRAFAAALPIA-R 173
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
F+++ + + A IV + I D G++M L R P++ + EE +
Sbjct: 174 AFVESNLNREAAKVRAAGIVADAIRAAGD----GKVMELPRGMPFRPAI----EEAGADH 225
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
L+ +V++ R W + + + + FE R LP W GL D L +G+ G F H
Sbjct: 226 LL-FVVHP--RDTDWALTGIRIGENTFEQRADLPKAWAGLTDKALEAASGVEGAKFCHNG 282
Query: 324 GFIGGNQSYGGALAMARAAL 343
F+ +S AL MA A+
Sbjct: 283 RFLAIARSREAALRMAALAV 302
>gi|418939249|ref|ZP_13492651.1| metal-dependent protein hydrolase [Rhizobium sp. PDO1-076]
gi|375054037|gb|EHS50430.1| metal-dependent protein hydrolase [Rhizobium sp. PDO1-076]
Length = 341
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 97/329 (29%), Positives = 153/329 (46%), Gaps = 28/329 (8%)
Query: 17 PSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGG 74
P TP V TH+G FH DE L + LT F A+IVRSR P+ + + DVGG
Sbjct: 23 PGMTPDFLV-THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPASDRIIYDVGG 79
Query: 75 VYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
YD +DHHQ+G + G SS GL++KH+GK+ +A +V E H V ++
Sbjct: 80 AYDAQAQIFDHHQRG-APLRDDG--QPYSSFGLIWKHYGKDYLAAS-DVPEAH--VEKIH 133
Query: 135 LAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAF 194
+ +F+ ID +DNG P + L + K D T+P E + F
Sbjct: 134 ASFDASFVLPIDLVDNGALSPSIAGPLAGLTLPALLETL-KPVFDETDP----EAIDRCF 188
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
+ +A + F++ + L A ++V + I + +G ++ L P++ + +
Sbjct: 189 HAALAVA-RSFVEARIAGSAAKLRAEALVHQAIVDTG----TGRVLELPMGMPFRPAILK 243
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
+ + +V++ R K W + + + + FE R LPA W GL + +L GI
Sbjct: 244 AGADH-----LLFVVHP--REKDWCLTTIRRADEGFEVRADLPAAWAGLTNGDLEAACGI 296
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
G F H F+ +S ALAMA A+
Sbjct: 297 AGASFCHNGRFVAAARSREAALAMADLAV 325
>gi|358067449|ref|ZP_09153928.1| hypothetical protein HMPREF9333_00809 [Johnsonella ignava ATCC
51276]
gi|356694365|gb|EHI56027.1| hypothetical protein HMPREF9333_00809 [Johnsonella ignava ATCC
51276]
Length = 317
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 96/329 (29%), Positives = 153/329 (46%), Gaps = 37/329 (11%)
Query: 17 PSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL-DVGGV 75
P + ++ TH H DE ++ ++ +VR+R+ +L+ ++ DVGG
Sbjct: 10 PEDSSEAKLYTHTPPHHADEVFATAILSFIEE---PVVVRTRNINILEKSQGIVYDVGGR 66
Query: 76 YDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFL 135
YD N +DHHQ F+ K SSAGL+++++G+E++ K L DE +
Sbjct: 67 YDAQNGFFDHHQIDFKR--ERSDRIKYSSAGLIWEYYGREVL-KVLGCDEKF--LQEAQA 121
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
V ++ IDA DNG R V ++S+ + N +W D++ ++EAF
Sbjct: 122 MVDYELIKGIDANDNG--------QSREVGEMSVSAMISLYNPNW---DENKLSKDEAFI 170
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP-WKLHLFE 254
+ + A +E L V S L ++V + I +R G I+++ RF W
Sbjct: 171 KACN-AAREILVLEIKKVLSKLKGGAVVRDAITKR-----QGPILMMSRFIGGW------ 218
Query: 255 LEEEMKIE-PLIKYVLYEDDRG--KQWRVQAVAVSP-DRFESRKPLPAQWRGLRDDELSK 310
+EE + P + +LY G K W ++AV S + E RKP P WRGL + L K
Sbjct: 219 IEEVLSSTLPSAREILYGIYVGHDKNWCIRAVPPSAGNLMEQRKPFPDSWRGLSGEALEK 278
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMA 339
GI VF H +GF + A+ MA
Sbjct: 279 ACGIKNVVFCHKAGFFAVAATKEAAVEMA 307
>gi|440301611|gb|ELP93997.1| hypothetical protein EIN_181740 [Entamoeba invadens IP1]
Length = 303
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 102/208 (49%), Gaps = 29/208 (13%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
KRVGTH+G FH DE G +++LT ++ N + RSRD +L D VLDVG VY+
Sbjct: 3 KRVGTHDGQFHIDETTGVALLQLTTEYNNLTVFRSRDMTLLSSCDLVLDVGRVYNHKLRR 62
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEL---IAKELNVD--EGHPDVHRLFLAV 137
YDHHQ+GF E + KLSS+GL+ KH+GKE+ I+ E + D + +V
Sbjct: 63 YDHHQRGFSETWDETSVVKLSSSGLILKHYGKEVISHISSEPSFDPLKDDKEVDWFLNKW 122
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNT------NLSSRVGKLNLDWTEPDQSAEREN 191
Y F +ID DNGI P+ VN+ +L S + +N +
Sbjct: 123 YYFFFVSIDGEDNGI--------PQTVNSLYCFEFGSLRSTISTMN----------SSRS 164
Query: 192 EAFQQGMDLAGKEFLDTVRFYVRSWLPA 219
+ F + +A LD+ SW P
Sbjct: 165 KGFFNAVKVASCALLDSFYRLYFSWRPT 192
>gi|424912530|ref|ZP_18335907.1| hypothetical protein Rleg13DRAFT_04796 [Rhizobium leguminosarum bv.
viciae USDA 2370]
gi|392848561|gb|EJB01084.1| hypothetical protein Rleg13DRAFT_04796 [Rhizobium leguminosarum bv.
viciae USDA 2370]
Length = 321
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/330 (29%), Positives = 145/330 (43%), Gaps = 47/330 (14%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLD-DLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P+ L D ++ DVGG YD + +D
Sbjct: 8 THSGGFHADELLSS--VVLTRLFPQARLIRSRAPEWLAAGADRIIYDVGGAYDAARGMFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD-VHRLFLAVYKNFME 143
HHQ+G SS GL++KHFG++ +A V E H + +H F A F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHFGRDYLAA-FGVPEDHIETIHASFDA---GFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP---DQSAERENEAFQQGMDL 200
+D +DNG T P L+ + L+ +P D E +N F + +
Sbjct: 119 PVDLVDNGALSPSTAGP--------LAGLTLPVLLETLKPVFDDTDPEADNRNFHAALGI 170
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
A + F++ + L A ++V + I + +GE H+ EL M
Sbjct: 171 A-RSFVEAKIANSAAKLRAEALVKQAIVD------AGEG-----------HILELPMGMP 212
Query: 261 IEPLI------KYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
P I + R W + + + + F R LPA W GL D +L G+
Sbjct: 213 FRPAIVKAGADHLLFVVHPRNNDWCLTGIRRADEGFALRADLPAAWAGLTDKDLEAVCGV 272
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALK 344
G F H F+ ++ ALAMA A+K
Sbjct: 273 EGATFCHNGRFVAAAKTRDAALAMAELAVK 302
>gi|163759672|ref|ZP_02166757.1| metal-dependent protein hydrolase [Hoeflea phototrophica DFL-43]
gi|162283269|gb|EDQ33555.1| metal-dependent protein hydrolase [Hoeflea phototrophica DFL-43]
Length = 316
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 148/320 (46%), Gaps = 27/320 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A ++R+R+ + L+ + + DVGG YD S YD
Sbjct: 19 THSGGFHADELLSS--VVLTRLFPAAALLRTREREWLEPAEGRIIFDVGGDYDASRQIYD 76
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++ H+G++ +A +D D+ + A + F+
Sbjct: 77 HHQRP-SPLRDDG--QPFSSFGLIWAHYGRDYLAA---LDVPGADLDAIHSAFDEEFVLP 130
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG Q P + L + +G L + D S +++AF + +A +
Sbjct: 131 VDLLDNGAIQPSVAGP---LAELTLPALLGSLKPVFD--DTSPTADDDAFLAALPVA-RS 184
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + + AR +V+E IA+ + I+ L P+ L + + + +
Sbjct: 185 FVEASVRNLAAKARARGVVLEAIAKAG----ASPILELPMGMPY-LSVLNAADAGHMMFM 239
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+ RG+ W + + +S D F+ R LPA W GL D L G+ G F H
Sbjct: 240 VA------PRGEDWTLNGIKLSNDTFDQRADLPAAWAGLSDAALEAACGVKGAKFCHNGR 293
Query: 325 FIGGNQSYGGALAMARAALK 344
FI S + MA A+K
Sbjct: 294 FIAVASSREAIMEMAELAVK 313
>gi|194373705|dbj|BAG56948.1| unnamed protein product [Homo sapiens]
Length = 166
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/107 (53%), Positives = 74/107 (69%), Gaps = 5/107 (4%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ +A+IVR+RDP+ L D V+DVGG YDP Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYDPRRHRY 105
Query: 84 DHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKELIAKELNVDE 125
DHHQ+ F E G + TKLSSAGL+Y HFG +L+A+ L E
Sbjct: 106 DHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSE 152
>gi|408379877|ref|ZP_11177468.1| hypothetical protein QWE_19828 [Agrobacterium albertimagni AOL15]
gi|407746254|gb|EKF57779.1| hypothetical protein QWE_19828 [Agrobacterium albertimagni AOL15]
Length = 314
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 148/326 (45%), Gaps = 39/326 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT + +A IVRSR P + + DVGG YD +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLYPDATIVRSRAPDWITPASDRIIYDVGGAYDAETQIFD 65
Query: 85 HHQKGF---EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD-VHRLFLAVYKN 140
HHQ+G E+ SS GL++KH+G+ +A + + H + +H+ F A
Sbjct: 66 HHQRGAPLRED------GQPYSSFGLIWKHYGRAYLAA-MGLPADHVEAIHQGFDA---K 115
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAFQQGM 198
F+ ID +DNG T P + + L S + L D EP ++ FQ +
Sbjct: 116 FVLPIDLMDNGALSPATAGP---LADLTLPSLLETLKPVFDNKEPGA----DDRGFQSAL 168
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+A + ++ + L A ++V+E IA D ++ L P++ + + +
Sbjct: 169 SIA-RMIVEAAIGQRAAKLRAETMVLEAIAATGD----ARVLELPMGMPFRPAIVKAGAD 223
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+ +V++ R K W + + + D FE R LPA W GL + +L G+ G
Sbjct: 224 HLL-----FVVHP--RDKDWCITGIRKAEDSFEQRADLPAAWAGLSNSDLEAACGVKGAT 276
Query: 319 FVHMSGFIGGNQSYGGALAMARAALK 344
F H FI S LAMA A++
Sbjct: 277 FCHNGRFIAAANSRDAILAMADIAVE 302
>gi|453331590|dbj|GAC86504.1| hypothetical protein NBRC3255_0165 [Gluconobacter thailandicus NBRC
3255]
Length = 342
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 156/344 (45%), Gaps = 49/344 (14%)
Query: 24 RVGTHNGSFHCDEALGCFMIRL-----------------TDKFFNAQIVRSRDPKVLDDL 66
R TH+G+FH DE LG ++ TD+ VR+R P + +
Sbjct: 17 RALTHSGNFHADETLGYAILHYALAPEGDLRGRVLGEPGTDRL---TFVRTRSPDRIAEA 73
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEG 126
D V DVGGV+DP+ YDHH K + + G T S+AGL++K +G + +
Sbjct: 74 DIVFDVGGVFDPTKGRYDHHMKD-KPLREDG--TPYSAAGLLWKDYGLAAVRNIVKTPVD 130
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD-- 184
+ ++ ++ K+ + +D DNG+ + + +L+ V + W +
Sbjct: 131 ETTLTAIWQSLDKSLIIPVDQDDNGVAK---------MGKLSLADIVSACSPPWDTAELY 181
Query: 185 ---QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
++ +RE+ F ++ V VR+ L A + VVE Y+ I++
Sbjct: 182 GAEEAKKRESLGFANAATAVAAHLVNMVD-RVRASLKAANRVVEA----YEAAEDKRILL 236
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
++ P + +FE + P++ YV+ +QW V+A+ F R LP W
Sbjct: 237 METGMPTEKVIFERDL-----PVV-YVVSPAGP-EQWNVKAIPPVRGDFGQRVSLPEAWG 289
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GL L++ +G+P VF H + FI G S GAL MAR AL++
Sbjct: 290 GLEKKALAEISGVPDAVFAHPARFICGAGSREGALKMARLALEI 333
>gi|357031310|ref|ZP_09093254.1| hypothetical protein GMO_09550 [Gluconobacter morbifer G707]
gi|356416004|gb|EHH69647.1| hypothetical protein GMO_09550 [Gluconobacter morbifer G707]
Length = 352
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 152/341 (44%), Gaps = 48/341 (14%)
Query: 27 THNGSFHCDEALGCFMIR--------LTDKFFNA-------QIVRSRDPKVLDDLDAVLD 71
TH+G+FH DE +G ++ L + N + VR+R P+ + D V D
Sbjct: 31 THSGNFHLDETMGYVILHYALSPEGDLRGRVLNGKADGDRLEFVRTRAPERIAAADLVFD 90
Query: 72 VGGVYDPSNDCYDHHQ--KGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
VG V++P YDHH K E T S+AGL++K +G+ I L
Sbjct: 91 VGAVHEPVRGRYDHHMRDKPLRE-----DGTPYSAAGLLWKDYGQAAIRNLLTTPVDDTT 145
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD----- 184
V ++ A+ K+ + ID DNG+ + + +L+ V + W +
Sbjct: 146 VTAIWRAIDKSLILPIDQDDNGVAK---------MGKLSLADIVSACSPPWDTVELYGRQ 196
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
++ RE + F +TV VR+ L A S V++ Y+ I+V++
Sbjct: 197 EATRREAQGFANAAFAVASHLTNTVD-RVRASLKAASRVLQA----YEQAQDKRILVMET 251
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
P + +FE + P++ YV+ +QW V+AV F R LP W GL
Sbjct: 252 GMPTEKVIFEHDL-----PVV-YVVSPAG-PEQWNVKAVPPVRGDFGQRVSLPEAWGGLE 304
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
L++ +G+P VF H + FI G S GA+ MAR AL++
Sbjct: 305 KKALAEISGVPDAVFAHPARFICGAGSRQGAIDMARLALEI 345
>gi|367470143|ref|ZP_09469861.1| hypothetical protein PAI11_31880 [Patulibacter sp. I11]
gi|365814847|gb|EHN10027.1| hypothetical protein PAI11_31880 [Patulibacter sp. I11]
Length = 305
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 95/333 (28%), Positives = 141/333 (42%), Gaps = 36/333 (10%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDV 72
S +P P RVGTH+GSFH DE + L + +VR+RD + L +DV
Sbjct: 2 SLDAPPTAPPLRVGTHSGSFHADEVFAIAALGLARGPLD--VVRTRDREQLAACALRIDV 59
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR 132
G YDP+ +DHHQ G E + +S GL+++ G++L E +V
Sbjct: 60 GRGYDPATGDFDHHQGGVGE---RANGIRFASFGLIWREVGEQLTGSA----EVAAEVDA 112
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRY--VNNTNLSSRVGKLNLDWTEPDQSAERE 190
L +A IDA DNG YD P+ V +S + LN W
Sbjct: 113 LLVA-------PIDAGDNGQELYD----PKIDGVAPYAVSGLIAALNPPWDA--DGGAAA 159
Query: 191 NEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKL 250
A ++ L + A IV + + R D ++ L R PW+
Sbjct: 160 ERAAFDAAIAIAEQALKGEIARATARARAADIVRQALERRGDP----RLLELDRGLPWR- 214
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSK 310
+ + P + +V+Y R W +QAV + F +RK LP W GL + L +
Sbjct: 215 -----DVVIGEAPEVLFVVYP--RTGDWGLQAVPAAAHGFANRKSLPEPWAGLEGEPLQQ 267
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
G+ VF H++ F+ S+ G L +AR AL
Sbjct: 268 VTGVADAVFCHVARFMAVAGSHDGVLELARQAL 300
>gi|406876064|gb|EKD25743.1| hypothetical protein ACD_79C01527G0001 [uncultured bacterium]
Length = 307
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 137/304 (45%), Gaps = 42/304 (13%)
Query: 26 GTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDH 85
TH+G FH DE + +I+L + +I+RSRDPK+L D +LDVGG+YDPS +DH
Sbjct: 16 ATHDGDFHADEVVAISLIKLA--YQKIKIIRSRDPKILGTADFMLDVGGIYDPSIRKFDH 73
Query: 86 HQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKN-FMEA 144
HQK +S L++AG+V E +EG D F KN F++
Sbjct: 74 HQK--------NYSGTLATAGMVL----------EWLRNEGVLDTK--FANYLKNIFIKG 113
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D DNG D + +++ L P + E ++ F + ++
Sbjct: 114 VDMQDNG----DFSSQRGFCTFSDVIRYFNPL------PPATEEEYHKCFHKALEFTSFF 163
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F Y+ + + + ++ P I++L PWK ++FE +
Sbjct: 164 FERLKSRYIHNRKHKKKFLFYLHKKK---QPIPGILILNENIPWKEYIFEENDCYN---- 216
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDR-FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
IK V++ ++ K W + V+ S ++ F SR LP W GL D + + GI +F H
Sbjct: 217 IKLVIFPVNKNK-WILHTVSKSLEKPFSSRLKLPNTWAGLLDQDFGQTLGIKEAIFCHKR 275
Query: 324 GFIG 327
F+
Sbjct: 276 RFMA 279
>gi|254440573|ref|ZP_05054067.1| conserved hypothetical protein [Octadecabacter antarcticus 307]
gi|198256019|gb|EDY80333.1| conserved hypothetical protein [Octadecabacter antarcticus 307]
Length = 304
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 148/321 (46%), Gaps = 29/321 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A ++RSRD + + D ++ DVGG YD +D
Sbjct: 8 THSGGFHADELLSS--VVLTRLFPQADLLRSRDRQWITPTADKIIYDVGGDYDSEAQIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE-LIAKELNVDEGHPDVHRLFLAVYKNFME 143
HHQ+ + G SS GL++ H+G+ L+A ++ D D+ + F+
Sbjct: 66 HHQRP-SPLRSDG--QPFSSFGLIWAHYGRAYLVAMDVPTD----DIEAIHTKFDTKFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
ID +DNG + P ++ L + +G L + D S +++AF + +A +
Sbjct: 119 PIDLLDNGAIEPSVAGP---LSILTLPALLGSLKPVFD--DTSPTADDDAFFAALPIA-R 172
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
F++ + + A+ IV+E I + I+ L P++ L + E +
Sbjct: 173 SFVEAQIRNLAAKARAQGIVLEAITNAG----TSPILELPMGMPYRSALDQAGAEHML-- 226
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
+V++ RG W + + +S D FE R LP W GL D L +GI G F H +
Sbjct: 227 ---FVIHP--RGDDWTLGGIKLSSDTFEQRADLPVAWAGLTDTALEDASGIKGAKFCHNA 281
Query: 324 GFIGGNQSYGGALAMARAALK 344
FI S L MA A++
Sbjct: 282 RFIAVADSREAILKMAEIAVR 302
>gi|58040579|ref|YP_192543.1| hypothetical protein GOX2151 [Gluconobacter oxydans 621H]
gi|58002993|gb|AAW61887.1| Hypothetical protein GOX2151 [Gluconobacter oxydans 621H]
Length = 336
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 93/338 (27%), Positives = 153/338 (45%), Gaps = 43/338 (12%)
Query: 27 THNGSFHCDEALGCFMIR--------LTDKFFNAQ------IVRSRDPKVLDDLDAVLDV 72
TH+G+FH DE +G ++ L + N + +R+R+P V+ D V DV
Sbjct: 20 THSGNFHVDETMGYVILHYALAPQGDLRARVLNEKSADRLTFIRTRNPDVIKSADIVFDV 79
Query: 73 GGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHR 132
GG+YDP++ YDHH K + + G T S+AGL++K +G I L V
Sbjct: 80 GGLYDPTHGRYDHHMKD-KPLREDG--TPYSAAGLLWKDYGHAAIRNILKTPVDDATVDL 136
Query: 133 LFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD-----QSA 187
++ ++ K+ + ID DNG+ + + +L+ V + W + ++
Sbjct: 137 IWKSLDKSLILPIDQDDNGVVK---------MGKLSLADIVSACSPPWDTAELYGRQEAR 187
Query: 188 ERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP 247
RE+ F + V VR+ L A V+ Y+ I+V+ P
Sbjct: 188 ARESLGFANAATAVASHLTNVVD-RVRASLKATDRVLAA----YENAEDKRILVMGTGMP 242
Query: 248 WKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
+ +FE + P++ YV+ +QW V+AV F R LP W GL ++
Sbjct: 243 TEKVIFEHDL-----PVV-YVVSPAGP-EQWNVKAVPPVRGDFGQRVSLPEAWGGLEREK 295
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
L+ +G+ VF H + FI G S GA+ MA+ AL++
Sbjct: 296 LAAVSGVSDAVFAHPARFICGAGSREGAIRMAQLALQI 333
>gi|414342384|ref|YP_006983905.1| hypothetical protein B932_1392 [Gluconobacter oxydans H24]
gi|411027719|gb|AFW00974.1| hypothetical protein B932_1392 [Gluconobacter oxydans H24]
Length = 341
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 92/344 (26%), Positives = 155/344 (45%), Gaps = 49/344 (14%)
Query: 24 RVGTHNGSFHCDEALGCFMIRL-----------------TDKFFNAQIVRSRDPKVLDDL 66
R TH+G+FH DE LG ++ TD+ VR+R P + +
Sbjct: 17 RALTHSGNFHADETLGYAILHYALAPEGDLRGRVLGEPGTDRL---TFVRTRSPDRIAEA 73
Query: 67 DAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEG 126
D V DVGGV+DP+ YDHH K + + G T S+AGL++K +G + +
Sbjct: 74 DIVFDVGGVFDPTKGRYDHHMKD-KPLREDG--TPYSAAGLLWKDYGLAAVRNIVKTPVD 130
Query: 127 HPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD-- 184
+ ++ ++ K+ + +D DNG+ + + +L+ V + W +
Sbjct: 131 EATLTAIWQSLDKSLIIPVDQDDNGVAK---------MGKLSLADIVSACSSPWDTAELY 181
Query: 185 ---QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMV 241
++ ++E+ F ++ V VR+ L A + VV+ D I++
Sbjct: 182 GAEEAKKQESLGFANAATTVAAHLVNMVD-RVRASLKAANRVVQAFEAAEDK----RILL 236
Query: 242 LKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWR 301
++ P + +FE + P++ YV+ +QW V+A+ F R LP W
Sbjct: 237 METGMPTEKVIFERDL-----PVV-YVVSPAGP-EQWNVKAIPPVRGDFGQRVSLPEAWG 289
Query: 302 GLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
GL L++ +G+P VF H + FI G S GAL MAR AL++
Sbjct: 290 GLEKKALAEISGVPDAVFAHPARFICGAGSREGALKMARLALEI 333
>gi|297620811|ref|YP_003708948.1| conserved hypothetical protein, MYG1 family [Waddlia chondrophila
WSU 86-1044]
gi|297376112|gb|ADI37942.1| conserved hypothetical protein, MYG1 family [Waddlia chondrophila
WSU 86-1044]
gi|337293327|emb|CCB91317.1| UPF0160 protein CPn_0489/CP_0265/CPj0489/CpB0509 [Waddlia
chondrophila 2032/99]
Length = 296
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/325 (28%), Positives = 143/325 (44%), Gaps = 47/325 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K GTH+GSFH DE C ++ L + +I R+R+ VL+ + V DVGG+YDPS
Sbjct: 8 KSFGTHDGSFHADEVTACALLLLFELIEEEKIHRTRERSVLEKCEYVCDVGGIYDPSKKL 67
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ ++ LSSAG+ + + E +++ + +
Sbjct: 68 FDHHQVDYQ--------GPLSSAGMTLLYLKDSGVISE-----------KMYHFYNETLI 108
Query: 143 EAIDAIDNGINQYDTDKPPR-YVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+D DNG D PR + + + + S + + TE E +N AF++ + A
Sbjct: 109 IGVDDHDNG-----KDMQPRGHSSYSYVVSNFAPIPYNPTE-----EEQNAAFREALHFA 158
Query: 202 GKEF--LDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
L YV+S R +V E + S E+++ PW F+L
Sbjct: 159 VGHLGRLRKRYEYVQS---CRQVVEETMKS------SDEVLIFDEGIPWLQLFFDLGG-- 207
Query: 260 KIEPLIKYVLYEDDRGKQWRVQAVAVS-PDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
K P K+V+ G W ++ + S +R + R PLP +W GL + EL K IPG V
Sbjct: 208 KNHP-AKFVIMPS--GPHWNLRGIPPSYEERMDVRIPLPKEWAGLLEGELKKVCPIPGGV 264
Query: 319 FVHMSGFIGGNQSYGGALAMARAAL 343
F H FI ++ AL L
Sbjct: 265 FCHKGRFISVWETKESALQALNYVL 289
>gi|56755639|gb|AAW25998.1| SJCHGC02195 protein [Schistosoma japonicum]
Length = 180
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 91/152 (59%), Gaps = 5/152 (3%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
T +KR+GTH+G FHCDE L +++ ++ NA +VRSRDP VL D V+DVGGVYDP
Sbjct: 1 MTSIKRIGTHDGCFHCDEVLAVVLLKHLPEYKNASVVRSRDPDVLSVCDVVVDVGGVYDP 60
Query: 79 SNDCYDHHQKGFEEVFGHGF-----STKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRL 133
+DHHQK F + F KLSSAGLVY HFGK +++ ++ H + ++
Sbjct: 61 QTYRFDHHQKDFSLTWSKYFDVKMWDVKLSSAGLVYVHFGKRVLSLLTGLEINHEVLEKI 120
Query: 134 FLAVYKNFMEAIDAIDNGINQYDTDKPPRYVN 165
F+ VY++F+ ID DNG Q R +N
Sbjct: 121 FMRVYESFILEIDGQDNGTPQSKMPLKVRRLN 152
>gi|402488676|ref|ZP_10835484.1| metal-dependent protein hydrolase [Rhizobium sp. CCGE 510]
gi|401812389|gb|EJT04743.1| metal-dependent protein hydrolase [Rhizobium sp. CCGE 510]
Length = 307
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 146/324 (45%), Gaps = 37/324 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR P+ + D ++ DVGG YDP +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPGKDRIIYDVGGAYDPDAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVH--RLFLAVYKNFM 142
HHQ+ SS GL++KH+G++ + V G P++H + + F+
Sbjct: 66 HHQRAAPL---RDDGQPYSSFGLIWKHYGRDYL-----VASGLPELHVEAVLSSFDAGFV 117
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAFQQGMDL 200
ID DNG T P + L + + L D EP E ++ F + +
Sbjct: 118 LPIDLTDNG-----TLSPSGPLAGLTLPALLETLKPVFDEAEP----EADDRDFHAALAI 168
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
A + F++ + L A ++V I + G ++ L R P++ + + +
Sbjct: 169 A-RSFVEARIAQSAAKLRAEAMVHRAI----EAAGQGRVLELPRGMPFRPAIVKAGADHL 223
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+ +V++ R K W V + + + FE R LP W GL + EL G+ G F
Sbjct: 224 L-----FVVHP--REKDWCVTGIRRAEEGFELRADLPVAWAGLANGELEAVCGVEGATFC 276
Query: 321 HMSGFIGGNQSYGGALAMARAALK 344
H FI ++ LAMA A+K
Sbjct: 277 HNGRFIAAARTREATLAMAELAVK 300
>gi|116252792|ref|YP_768630.1| hypothetical protein RL3048 [Rhizobium leguminosarum bv. viciae
3841]
gi|115257440|emb|CAK08536.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 313
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 140/307 (45%), Gaps = 29/307 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR P+ + + + DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPGEDRIIYDVGGAYDPAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KH+G++ +A ++ E H V + + F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHYGRDYLAAS-DLPEAH--VEAMHGSFDAGFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D DNG P + L + + L + E D E ++ AF + +A +
Sbjct: 120 VDLTDNG-----ALSPSGPLAGLTLPALLETLKPVFDETDP--EADDRAFHAALAIA-RS 171
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A +IV I + G ++ L R P++ + + + +
Sbjct: 172 FIEARIAQSAAKLRAEAIVHRAI----EAAGQGRVLELPRGMPFRPAIVKAGADQLL--- 224
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+V++ R K W V + + D FE R LPA W GL + L GI G F H
Sbjct: 225 --FVVHP--REKDWCVTGIRRAEDGFELRADLPAAWAGLTNGALEAVCGIEGASFCHNGR 280
Query: 325 FIGGNQS 331
FI ++
Sbjct: 281 FIAAART 287
>gi|254451839|ref|ZP_05065276.1| conserved hypothetical protein [Octadecabacter arcticus 238]
gi|198266245|gb|EDY90515.1| conserved hypothetical protein [Octadecabacter arcticus 238]
Length = 304
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 149/320 (46%), Gaps = 27/320 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSRD K + D ++ DVGG YD +D
Sbjct: 8 THSGGFHADELLSSAV--LTRVFPQAELLRSRDRKWITPATDKIIYDVGGDYDGEAQIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++ H+G+ +A +D D+ + F+
Sbjct: 66 HHQRP-SPLRDDG--QPFSSFGLIWAHYGRAYLAA---MDVPTDDIEAIHTKFDTKFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
ID +DNG + P ++ L + +G L + D S +++AF + +A +
Sbjct: 120 IDLLDNGAIEPSVAGP---LSILTLPALLGSLKPVFD--DMSPTADDDAFFAALPIA-RS 173
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + + A IV+ IA+ + I+ L P++ L ++ +
Sbjct: 174 FVEAQIRGLAAKARASGIVLNAIAQAG----TSPILELPMGMPYRAALDQMGSDH----- 224
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
I +V++ RG W + + +S D FE R LPA W GL D L +G+ G F H +
Sbjct: 225 ILFVVHP--RGGDWTLGGIKLSQDTFEQRADLPAAWAGLTDAALEDASGVKGAKFCHNAR 282
Query: 325 FIGGNQSYGGALAMARAALK 344
FI + + MA A++
Sbjct: 283 FIAVAYTREAIMKMAEIAVR 302
>gi|421592990|ref|ZP_16037621.1| metal-dependent protein hydrolase [Rhizobium sp. Pop5]
gi|403701195|gb|EJZ18115.1| metal-dependent protein hydrolase [Rhizobium sp. Pop5]
Length = 308
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/323 (29%), Positives = 151/323 (46%), Gaps = 35/323 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR P+ + D ++ DVGG YD +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRAPEWITPGGDRIIYDVGGAYDAEAGIFD 65
Query: 85 HHQKGF---EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
HHQ+G E+ SS GL++KH+G++ +A +E VH F A +F
Sbjct: 66 HHQRGAPLRED------GQPYSSFGLIWKHYGRDYLAAAGLPEEHIGLVHASFDA---SF 116
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+ ID DNG P + L + + L + D+ E ++ AF + +A
Sbjct: 117 VLPIDLTDNG-----ALSPSGPLAGLTLPALLETLKPVFD--DRGPEADDRAFHAALTVA 169
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + L A ++V I + + G ++ L P++ + + + +
Sbjct: 170 -RSFVEAKIAQSAAKLRAEALVHRAILDNGE----GRVLELPTGMPFRPAIVKAGADHLL 224
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V++ R K W V + + + FE R LPA W GL + EL G+ G F H
Sbjct: 225 -----FVVHP--REKDWCVTGIRRTDEGFELRADLPAAWAGLTNGELEAVCGVGGASFCH 277
Query: 322 MSGFIGGNQSYGGALAMARAALK 344
FI ++ ALAMA+ A++
Sbjct: 278 NGRFIAAARTREAALAMAKLAVE 300
>gi|15891145|ref|NP_356817.1| hypothetical protein Atu3805 [Agrobacterium fabrum str. C58]
gi|15159493|gb|AAK89602.1| conserved hypothetical protein [Agrobacterium fabrum str. C58]
Length = 315
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 148/323 (45%), Gaps = 33/323 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A++VRSR P+ + D D ++ DVGG YD +D
Sbjct: 8 THSGGFHADELLSS--VVLTRLFPQARLVRSRAPEWITPDADRIIYDVGGAYDAEKCIFD 65
Query: 85 HHQKGF---EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
HHQ+G E+ SS GL++KHFG++ +A + E H V L + F
Sbjct: 66 HHQRGAPLRED------GQPYSSFGLIWKHFGRDYLATS-GIPEDH--VETLHASFDAGF 116
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+ +D +DNG P + L + K D T+P E ++ F + +A
Sbjct: 117 VLPVDLVDNGALSPSIAGPLATLTLPVLLETL-KPVFDDTDP----EADDRGFIAALAVA 171
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + L A ++V + IA+ + G I+ L P++ + + + +
Sbjct: 172 -RSFVEAKLATGAAKLRAEALVQKAIADTGE----GHILELPMGMPFRSAIVKAGADHLL 226
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V++ R W + + + + FE R LPA W GL EL G+ G F H
Sbjct: 227 -----FVVHP--RDNDWCLTGIRRADEGFELRADLPAAWAGLTGKELEAVCGVEGASFCH 279
Query: 322 MSGFIGGNQSYGGALAMARAALK 344
FI ++ L MA A+K
Sbjct: 280 NGRFIAAAKTREAILTMAELAVK 302
>gi|410944395|ref|ZP_11376136.1| hypothetical protein GfraN1_08117 [Gluconobacter frateurii NBRC
101659]
Length = 341
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 155/341 (45%), Gaps = 49/341 (14%)
Query: 27 THNGSFHCDEALGCFMIRL-----------------TDKFFNAQIVRSRDPKVLDDLDAV 69
TH+G+FH DE LG ++ TD+ VR+R P + + D V
Sbjct: 20 THSGNFHADETLGYAILHYALAPEGDLRGRVLGEPGTDRL---TFVRTRSPDRIAEADIV 76
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVGGV+DP+ YDHH K + + G T S+AGL++K +G + +
Sbjct: 77 FDVGGVFDPTKGRYDHHMKD-KPLREDG--TPYSAAGLLWKDYGLAAVRNIVKTPVDEAT 133
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD----- 184
+ ++ ++ K+ + +D DNG+ + + +L+ V + W +
Sbjct: 134 LTAIWQSLDKSLIIPVDQDDNGVAK---------MGKLSLADIVSACSPPWDTAELYGAE 184
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
++ ++E+ F ++ V VR+ L A + VV+ Y+ I++++
Sbjct: 185 EAKKQESLGFANAATAVAAHLVNMVD-RVRASLKAANRVVQA----YEAAEDKRILLMET 239
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
P + +FE + P++ YV+ +QW V+A+ F R LP W GL
Sbjct: 240 GMPTEKVIFERDL-----PVV-YVVSPAGP-EQWNVKAIPPVRGDFGQRVSLPEAWGGLE 292
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
L++ +G+P VF H + FI G S GAL MAR AL++
Sbjct: 293 KKALAEVSGVPDAVFAHPARFICGAGSREGALKMARLALEI 333
>gi|452963039|gb|EME68128.1| hypothetical protein H261_20033 [Magnetospirillum sp. SO-1]
Length = 294
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 96/322 (29%), Positives = 137/322 (42%), Gaps = 31/322 (9%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ +V THNG+FH D+ F I ++ RSRD + D V DVGG+YD
Sbjct: 1 MLKVATHNGTFHADDVF-AFAILRAAAGGRIELARSRDRQDWDAAAVVFDVGGIYDREAR 59
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD-VHRLFLAVYKN 140
YDHH + + + +G SSAGLV++ FG +I L E D + R+ V
Sbjct: 60 RYDHHMRD-KPLRPNG--EPYSSAGLVWRDFGAAVIGHMLP--EAPADAIARMVERVDAG 114
Query: 141 FMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
+ +D +DNG N + S+ + N + E + EN AF Q D+
Sbjct: 115 LVRDVDLMDNGAMTP---------NPGHFSTVIEAFNATFVEDGRD---ENAAFLQAADI 162
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
A V + A +AE I+VL PW+ + +L +
Sbjct: 163 AA-----LVLERACARAYAAVRAEAVVAEAARCAEDARIVVLDSRIPWEDAIHDLGLDAA 217
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+ YV+ G W AV F R PLP W GLRD + + GI F
Sbjct: 218 L-----YVVRP--AGAAWTCSAVPPERGSFAQRHPLPEAWGGLRDADFAALTGISDATFC 270
Query: 321 HMSGFIGGNQSYGGALAMARAA 342
H + F+ G QS GA+A+AR A
Sbjct: 271 HPALFVCGAQSREGAVALARLA 292
>gi|424895560|ref|ZP_18319134.1| hypothetical protein Rleg4DRAFT_1428 [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|393179787|gb|EJC79826.1| hypothetical protein Rleg4DRAFT_1428 [Rhizobium leguminosarum bv.
trifolii WSM2297]
Length = 307
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 143/309 (46%), Gaps = 33/309 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+IVRSR + + D ++ DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIVRSRASEWITPGKDRIIYDVGGAYDPAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KH+G++ +A + EGH +V L+ + +F+
Sbjct: 66 HHQRGAPM---REDGQPYSSFGLIWKHYGRDYLAAS-GLPEGHIEV--LYGSFDASFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG P + L + L D EP E ++ AF + +A
Sbjct: 120 IDLTDNG-----ALSPSGPLAGLTLPVLLETLKPVFDEAEP----EADDRAFHAALAIA- 169
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ F++ + L A ++V I + G ++ L R P++ + + + +
Sbjct: 170 RIFVEARIAQSAAKLRAEAMVHRAI----EATGQGRVLELPRGMPFRPAIIKAGADHLL- 224
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V++ R K W V + + D FE R LPA W GL + EL GI G F H
Sbjct: 225 ----FVVHP--REKDWCVTGIRRAEDGFELRADLPAAWAGLANGELEAVCGIEGASFCHN 278
Query: 323 SGFIGGNQS 331
FI ++
Sbjct: 279 GRFIAAART 287
>gi|418297419|ref|ZP_12909260.1| hypothetical protein ATCR1_07859 [Agrobacterium tumefaciens
CCNWGS0286]
gi|355537605|gb|EHH06860.1| hypothetical protein ATCR1_07859 [Agrobacterium tumefaciens
CCNWGS0286]
Length = 309
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 139/326 (42%), Gaps = 39/326 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR + + + DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSS--VVLTRLFPQARLIRSRASEWITPGANRIIYDVGGAYDPTAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KHFG++ +A V E H + + + +F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHFGRDYLAA-FGVPEDH--IEAIHASFDTSFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG P L+ V L + E +N +F + +A +
Sbjct: 120 VDLVDNG-----ALSPSIAGQLAGLTLPVLLETLKPVFDETDPEADNRSFHAALAVA-RS 173
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A ++V I + +GE H+ EL M P
Sbjct: 174 FVEAKIANSAAKLRAEALVNRAIVD------AGEG-----------HILELPVGMPFRPA 216
Query: 265 I------KYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
I + R W + + + + FE R LPA W GL D +L G+ G
Sbjct: 217 IMKAGADHLLFVVHPRNNDWCLTGIRRADEGFELRADLPAAWAGLTDKDLEAVCGVEGAT 276
Query: 319 FVHMSGFIGGNQSYGGALAMARAALK 344
F H F+ ++ ALAMA A+K
Sbjct: 277 FCHNGRFVAAAKTRDAALAMAELAVK 302
>gi|144901334|emb|CAM78198.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
MSR-1]
Length = 295
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 145/321 (45%), Gaps = 31/321 (9%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
TH+GSFH D+ F I + RSR+ D V DVGGVYDP YD
Sbjct: 4 AATHSGSFHADDVFA-FAILKAATDGAVSLTRSREAMDWDRAQVVFDVGGVYDPDQGRYD 62
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNV-DEGHPDVHRLFLAVYKNFME 143
HH + + + +G SSAGL+++ +G+ IA N G D+ ++ + +
Sbjct: 63 HHMRD-KPLRPNG--EAFSSAGLIWRDYGRAAIAHLQNTATPGQIDI--IWAKLDAGLLR 117
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
ID +DNG D+ P + S+ + N + E ++ EN + Q + +A
Sbjct: 118 DIDLMDNGA----MDRHPGH-----FSALLETWNPTFAE---NSADENACYHQAVMVADN 165
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
+V + +I + +A+ + I+VL PW+ +F+L+ +
Sbjct: 166 VLARSVAHAFAA-----AIAQDAVADAAERAADPRIIVLDSRLPWEEAVFDLDLAQAL-- 218
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
YV+ GK W V AV F +KPLP W GLR+ L+ G+ F H +
Sbjct: 219 ---YVIRP--AGKDWTVSAVPPERGSFAQKKPLPDAWGGLREAALAAVCGVSDATFCHSA 273
Query: 324 GFIGGNQSYGGALAMARAALK 344
F+ G ++ GA+AMA A++
Sbjct: 274 RFVCGAKTREGAMAMAALAVE 294
>gi|427799103|gb|JAA65003.1| putative secreted peptide precursor, partial [Rhipicephalus
pulchellus]
Length = 126
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 58/81 (71%), Gaps = 3/81 (3%)
Query: 17 PSQTPL---KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVG 73
P++ P K +GTHNG+FHCD+AL CF+++L ++ +A +VRSRDP VLD D V+DVG
Sbjct: 41 PTKKPAAMGKTIGTHNGNFHCDDALACFLLKLLPEYKDATVVRSRDPAVLDTCDVVVDVG 100
Query: 74 GVYDPSNDCYDHHQKGFEEVF 94
VYDP+ YDHHQK F E
Sbjct: 101 AVYDPATRRYDHHQKSFNETM 121
>gi|405383204|ref|ZP_11036974.1| hypothetical protein PMI11_07006 [Rhizobium sp. CF142]
gi|397320302|gb|EJJ24740.1| hypothetical protein PMI11_07006 [Rhizobium sp. CF142]
Length = 306
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 151/320 (47%), Gaps = 29/320 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+I+RSR+ + +D ++ DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPEARIIRSRNVEWTTPAVDRIIYDVGGQYDPAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH-PDVHRLFLAVYKNFME 143
HHQ+G V G SS GLV+KH+G + +A L V E H VH F A F+
Sbjct: 66 HHQRG-APVRQDG--RPYSSFGLVWKHYGADYLAA-LGVPEAHISPVHASFDA---KFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
+D DNG + L + K D +P E ++ AF + +A +
Sbjct: 119 PVDLTDNGALDPSIAGELAGLTLPALLETL-KPVFDAKDP----EADDRAFHAALSIA-R 172
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
F++ + L A ++V++ I E +GE VL+ P + K +
Sbjct: 173 SFVEAGIAAQDAKLRAEALVLKAINE------TGEARVLE--LPVGMPFRAAIVRAKADH 224
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
L+ +V++ R W + + + + FE R LPA W GL EL + G+ G F H
Sbjct: 225 LL-FVVHP--RENDWCLTGIRRADEGFELRADLPASWAGLTGQELERVCGVEGATFCHKG 281
Query: 324 GFIGGNQSYGGALAMARAAL 343
F+ ++ ALAMA+ A+
Sbjct: 282 LFVAAAKTREAALAMAQIAV 301
>gi|167391633|ref|XP_001739869.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165896287|gb|EDR23746.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 302
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 79/133 (59%), Gaps = 5/133 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G FHCDE C ++ LT +F ++I R+RD +L + D V+DVG ++ +
Sbjct: 4 KIIGTHDGLFHCDELTSCVILLLTKEFMGSKIRRTRDSSILKECDVVVDVGKEFNVEHHL 63
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL----NVDEGHPDVHRLFLAVY 138
+DHHQ+GF E + G T SSAGL+YK++G+E+I K V E ++ Y
Sbjct: 64 FDHHQQGFNERW-EGSPTLFSSAGLIYKYYGREIIIKLCKGPHTVFEDEEEIKWFMNKWY 122
Query: 139 KNFMEAIDAIDNG 151
+ +IDA DNG
Sbjct: 123 FFYFVSIDAEDNG 135
>gi|335035696|ref|ZP_08529030.1| hypothetical protein AGRO_3029 [Agrobacterium sp. ATCC 31749]
gi|333792877|gb|EGL64240.1| hypothetical protein AGRO_3029 [Agrobacterium sp. ATCC 31749]
Length = 315
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 148/325 (45%), Gaps = 37/325 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A++VRSR P+ + D ++ DVGG YD +D
Sbjct: 8 THSGGFHADELLSS--VVLTRLFPQARLVRSRAPEWITPGADRIIYDVGGAYDAEKCIFD 65
Query: 85 HHQKGF---EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVH--RLFLAVYK 139
HHQ+G E+ SS GL++KHFG++ +A G PD H L +
Sbjct: 66 HHQRGAPLRED------GQPYSSFGLIWKHFGRDYLATS-----GIPDDHIETLHASFDA 114
Query: 140 NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
F+ +D +DNG P + L + K D T+P E ++ F +
Sbjct: 115 GFVLPVDLVDNGALSPSIAGPLAGLTLPVLLETL-KPVFDDTDP----EADDRGFIAALA 169
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
+A + F++ + L A ++V + IA+ + G I+ L P++ + + +
Sbjct: 170 VA-RSFVEAKLATGAAKLRAEALVQKAIADTGE----GHILELPMGMPFRSAIVKAGADH 224
Query: 260 KIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
+ +V++ R W + + + + FE R LPA W GL EL G+ G F
Sbjct: 225 LL-----FVVHP--RDNDWCLTGIRRADEGFELRADLPAAWAGLTGKELEAVCGVEGASF 277
Query: 320 VHMSGFIGGNQSYGGALAMARAALK 344
H FI ++ LAMA A+K
Sbjct: 278 CHNGRFIAAAKTRDAILAMAELAVK 302
>gi|83309527|ref|YP_419791.1| hypothetical protein amb0428 [Magnetospirillum magneticum AMB-1]
gi|82944368|dbj|BAE49232.1| Uncharacterized conserved protein [Magnetospirillum magneticum
AMB-1]
Length = 294
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 97/322 (30%), Positives = 143/322 (44%), Gaps = 31/322 (9%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ +V THNG+FH D+ ++R + ++ RSRD + D V DVGG+YDP
Sbjct: 1 MLKVATHNGTFHADDVFAFAILRASCGG-RIELARSRDQQDWDAAAVVFDVGGLYDPGTR 59
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
YDHH + + + +G SSAGLV++ FG I L V R+ V
Sbjct: 60 RYDHHMRD-KPLRPNG--EPYSSAGLVWRDFGAAAIGTLLP-GLAADAVARVVAKVDTGL 115
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+ +D +DNG T P + S+ + N + E + EN AF Q D+A
Sbjct: 116 VRDVDLMDNGAM---TPTPGHF------STVIEAFNATFVEDGRD---ENAAFLQAADIA 163
Query: 202 GKEFLDTVRFYVRSWLPARSIVVEC-IAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
L R R+ + S+ E +A+ I+VL PW+ + +L +
Sbjct: 164 A---LVLERACARA---SASVQAETTVAQAAAGAGDARIIVLDTRVPWEDAIHDLGLDRA 217
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
+ YV+ G W AV F R PLP W GLRD+ + GI F
Sbjct: 218 L-----YVVRP--AGAAWTCSAVPPERGSFAQRHPLPESWGGLRDEAFAALTGIADATFC 270
Query: 321 HMSGFIGGNQSYGGALAMARAA 342
H + F+ G +S GA+A+AR A
Sbjct: 271 HPARFVCGARSREGAVALARLA 292
>gi|218463737|ref|ZP_03503828.1| putative hydrolase protein [Rhizobium etli Kim 5]
Length = 307
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 141/310 (45%), Gaps = 35/310 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P+ + D ++ DVGGVYD +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARLIRSRAPEWITPGQDRIIYDVGGVYDADAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE-LIAKELNVDEGHPDVHRLFLAVYKNFME 143
HHQ+G SS GL++KH+G++ L A L D V L + +F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHYGRDYLAACGLPADH----VEALHGSFDASFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAFQQGMDLA 201
ID DNG P + L + + L D EPD ++ AF + +A
Sbjct: 119 PIDLTDNG-----ALSPSGPLAGLTLPALLETLKPVFDEAEPDA----DDRAFHAALAIA 169
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + L A +IV E I G I+ L R P++ + + + +
Sbjct: 170 -RSFVEAKIAQSAAKLRAEAIVHEAIKAAG----QGHILELPRGMPFRPAIVKAGADHLL 224
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V++ R K W V + + + FE R LPA W GL + EL+ GI G F H
Sbjct: 225 -----FVVHP--REKDWCVTGIRRAEEGFELRADLPAAWAGLANGELAAVCGIEGASFCH 277
Query: 322 MSGFIGGNQS 331
FI +S
Sbjct: 278 NGRFIAAARS 287
>gi|260428592|ref|ZP_05782571.1| metal-dependent protein hydrolase [Citreicella sp. SE45]
gi|260423084|gb|EEX16335.1| metal-dependent protein hydrolase [Citreicella sp. SE45]
Length = 322
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 90/322 (27%), Positives = 144/322 (44%), Gaps = 30/322 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT + A++VRSR + D + DVGG YDP +D
Sbjct: 8 THSGGFHADEVLSTVI--LTRLYPEAEVVRSRSSAWITPADGRLIYDVGGAYDPEAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ E T SS GLV+KHFG + + ++ E ++ + + ++F+
Sbjct: 66 HHQR---ESPLREDETPYSSFGLVWKHFGMGFL-RSFDIPE--TELETIHASFDRSFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAFQQGMDLAG 202
+D +DNG P +++ L + L D T+P E E AF + +A
Sbjct: 120 VDQVDNGTVSVSEAGP---LSSMTLPGLIETLKPVFDDTDP----ESETRAFHAAVGIA- 171
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
++F++ RS R+ V+ AE I+ L P++ + E + +
Sbjct: 172 RQFVEAR--VNRSAAKRRAEVLA--AEAVRAAGKSPILELPMGMPFRPAVIEAGADHLL- 226
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V+ G +W + + FE R LPA W GL EL + +G+ G F H
Sbjct: 227 ----FVVTPRGDG-EWTLGGIRKHEKGFEQRADLPAAWAGLSGSELEQVSGVTGAKFCHK 281
Query: 323 SGFIGGNQSYGGALAMARAALK 344
+ FI + L MA A++
Sbjct: 282 ARFIAAASNRDAILKMAELAVQ 303
>gi|440292431|gb|ELP85636.1| hypothetical protein EIN_409350 [Entamoeba invadens IP1]
Length = 342
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 143/311 (45%), Gaps = 27/311 (8%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
VG + S++ D L ++R T F ++I + + ++ D VL GG+Y+ +D
Sbjct: 28 VGINGMSYNFDTILAVSLLRRTRDFAKSEIRFIKAKEDMNGCDMVLGYGGMYNADTLRFD 87
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK---------ELNVDEGHPDVHRLFL 135
+HQ F+EVF + +SSAG+V+K FGKE++ ++NV + +V +
Sbjct: 88 YHQHDFKEVFSNKSKYPMSSAGMVFKRFGKEIVKSVLVSLSEKFDMNVSDELLNVAK--D 145
Query: 136 AVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
+Y++ +E +DA+ NG +++ D+ P Y+N T+ + + + E F
Sbjct: 146 VIYQSLIEPVDAMTNGFSKF--DETPLYLNPTDSFRLMSAQEMKNENFVEEVEETASTFI 203
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
MD + ++ Y ++ A+ + +R SG I V ++ +
Sbjct: 204 SEMD----AHVQSIECYQQTLYNAKKALTFNGEKRRILLVSGFIQV--------AYIQSI 251
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
E ++ + L ++ Y + + ++ V F +R P WRGL ++EL G
Sbjct: 252 ENKLGV--LGNFLFYIREGKARVTIRTVTSPESVFVNRAAFPESWRGLENEELENAVGEE 309
Query: 316 GCVFVHMSGFI 326
G F H SGF+
Sbjct: 310 GVYFCHHSGFM 320
>gi|339022053|ref|ZP_08646025.1| hypothetical protein ATPR_2333 [Acetobacter tropicalis NBRC 101654]
gi|338750933|dbj|GAA09329.1| hypothetical protein ATPR_2333 [Acetobacter tropicalis NBRC 101654]
Length = 340
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 153/341 (44%), Gaps = 43/341 (12%)
Query: 24 RVGTHNGSFHCDEALGCFMI------------RLTDKFFN--AQIVRSRDPKVLDDLDAV 69
R TH+G+FH DE +G ++ R+ D+ + ++VR+R P + D V
Sbjct: 17 RALTHSGNFHLDETMGYVILHYALAPKGDLRARVIDRTASDGLELVRTRAPDQIARADIV 76
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVGG +DP+ YDHH K E S+AGL++K +G+ + L
Sbjct: 77 FDVGGRHDPAQGRYDHHMK---EKPLREDGVPYSAAGLLWKDYGRAALRNILTSPVDDDT 133
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPD----- 184
+ ++ + K+ + ID DNG+ + + +L+ V W +
Sbjct: 134 LEAMWRTIDKSLILPIDQDDNGVAK---------MGKLSLADIVSACGPAWDTTELYGRQ 184
Query: 185 QSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR 244
++ RE F A ++TV VR+ L A V++ Y+ I++++
Sbjct: 185 EALAREALGFANAATAAAAHLVNTVD-RVRASLKATDRVLQA----YEKAEDKRILLMET 239
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLR 304
P + +FE + P++ YV+ QW V+A+ F R LP W GL
Sbjct: 240 GMPTEKVIFEHDL-----PVV-YVVSPAG-PDQWNVKAIPPVRGDFGQRVSLPEAWGGLE 292
Query: 305 DDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
+ L++ +G+ VF H + FI G +S GAL MA+ AL++
Sbjct: 293 KETLAQISGVEDAVFAHPARFICGAKSREGALKMAQLALEI 333
>gi|405956473|gb|EKC23059.1| UPF0160 protein MYG1, mitochondrial [Crassostrea gigas]
Length = 123
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 49/90 (54%), Positives = 60/90 (66%), Gaps = 2/90 (2%)
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR-FCPWKLHLFELE 256
M + G EFLD V +Y +SWLPAR +V E + R + DPSGEI+V K CPWK HLF LE
Sbjct: 1 MKMVGAEFLDKVLYYKKSWLPARELVEEAVKGRTEVDPSGEIVVFKTGGCPWKDHLFNLE 60
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVS 286
E+ I P IKYVLY D+ +WR+Q V S
Sbjct: 61 AELDINPPIKYVLY-TDQANKWRIQCVPES 89
>gi|407035217|gb|EKE37605.1| metal dependent hydrolase, putative [Entamoeba nuttalli P19]
Length = 302
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 102/202 (50%), Gaps = 20/202 (9%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G FHCDE C ++ LT +F ++I R+RD + L + D V+DVG ++
Sbjct: 4 KLIGTHDGLFHCDELTSCVILLLTKEFMGSKIRRTRDNEKLKECDVVVDVGKEFNVERHL 63
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL----NVDEGHPDVHRLFLAVY 138
+DHHQ+GF+E + G T SSAGL+YK++G+E+I K E ++ Y
Sbjct: 64 FDHHQQGFDERW-EGSPTLFSSAGLIYKYYGREIIIKLCKGPHTTFEDEEEIKWFMDKWY 122
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNT-NLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
+ +IDA DNG + + Y+ +L + +G LN D F+ G
Sbjct: 123 FFYFVSIDAEDNGCPR---GQKQNYIEGIGSLKNVIGNLN------DLRG-----GFEIG 168
Query: 198 MDLAGKEFLDTVRFYVRSWLPA 219
+ + E + + + +W P
Sbjct: 169 LTICKLELVKSFYRLLLNWRPT 190
>gi|222086640|ref|YP_002545174.1| metal-dependent hydrolase [Agrobacterium radiobacter K84]
gi|221724088|gb|ACM27244.1| metal-dependent hydrolase protein [Agrobacterium radiobacter K84]
Length = 318
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 85/309 (27%), Positives = 141/309 (45%), Gaps = 31/309 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+I+RSR P+ + D ++ DVGG YD + YD
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIIRSRAPEWITPGADRIIYDVGGKYDAAERIYD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVH--RLFLAVYKNFM 142
HHQ+G SS GL++KH+G++ ++ G P+ H L + F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHYGRDYLSAS-----GLPEAHIEALHTSFDNGFV 117
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
+D +DNG + L + + L + E D E E+ FQ + +A
Sbjct: 118 LPVDLVDNGALSPSVAG---QLAGLTLPALLETLKPVFDETDP--EAEDRGFQNALGIA- 171
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ F++ + L A ++V + I E + G I+ L P++ + + + +
Sbjct: 172 RSFVEARIGQSAAKLRAEALVHQAIVETGE----GRILELPMGMPFRPAIVKAGADHLL- 226
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V++ D K W + + + FE R LPA W GL + +L +G+ G F H
Sbjct: 227 ----FVVHPRD--KDWCLTGIRRADQGFELRADLPAAWAGLTNGDLEAASGVEGASFCHN 280
Query: 323 SGFIGGNQS 331
FI ++
Sbjct: 281 GRFIAAAKT 289
>gi|67472230|ref|XP_651975.1| metal dependent hydrolase [Entamoeba histolytica HM-1:IMSS]
gi|56468771|gb|EAL46587.1| metal dependent hydrolase, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449706298|gb|EMD46173.1| metal dependent hydrolase, putative [Entamoeba histolytica KU27]
Length = 302
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 52/133 (39%), Positives = 77/133 (57%), Gaps = 5/133 (3%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K +GTH+G FHCDE C ++ LT +F ++I R+RD + L + D V+DVG ++
Sbjct: 4 KLIGTHDGIFHCDELTSCVILLLTKEFMGSKIRRTRDNEKLKECDVVVDVGKEFNVERHL 63
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL----NVDEGHPDVHRLFLAVY 138
+DHHQ+GF E + G T SSAGL+YK++G+E+I K E ++ Y
Sbjct: 64 FDHHQQGFNERW-EGSPTLFSSAGLIYKYYGREIIIKLCKGPHTTFEDEEEIKWFMDKWY 122
Query: 139 KNFMEAIDAIDNG 151
+ +IDA DNG
Sbjct: 123 FFYFVSIDAEDNG 135
>gi|420239940|ref|ZP_14744214.1| hypothetical protein PMI07_01976 [Rhizobium sp. CF080]
gi|398078110|gb|EJL69038.1| hypothetical protein PMI07_01976 [Rhizobium sp. CF080]
Length = 310
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 140/307 (45%), Gaps = 27/307 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A++VRSR + + D ++ DVG YD + +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPGARVVRSRALEWITPGADRIIYDVGAAYDAAAQIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GLV+KH+G++ +A L + E H + L + +F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLVWKHYGRDYLAA-LGLPEAH--IEALHASFDGSFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
ID +DNG P + L + K D T+P E EN F + +AG
Sbjct: 120 IDLMDNGALSPSVAGPLAGLTLPALLETL-KPVFDETDP----EAENRGFGAALAIAGS- 173
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A +V + I + + G I+ L P++ + + + +
Sbjct: 174 FVEARIGQSAAKLRAEGLVHQAIVDTGE----GRILELPAGMPFRPAIVKAGADHLL--- 226
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+V++ R K W + + + + FE R LPA W GL D +L GI G F H
Sbjct: 227 --FVVHP--REKDWCLTTIRRADEGFEVRADLPADWAGLTDRDLEAVCGIEGASFCHNGR 282
Query: 325 FIGGNQS 331
FI ++
Sbjct: 283 FIAAART 289
>gi|424884895|ref|ZP_18308506.1| hypothetical protein Rleg10DRAFT_5381 [Rhizobium leguminosarum bv.
trifolii WSM2012]
gi|393176657|gb|EJC76698.1| hypothetical protein Rleg10DRAFT_5381 [Rhizobium leguminosarum bv.
trifolii WSM2012]
Length = 307
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/317 (29%), Positives = 140/317 (44%), Gaps = 49/317 (15%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA-----VLDVGGVYDPSND 81
TH+G FH DE L + LT F A+IVRSR LD + + DVGG YDP+
Sbjct: 8 THSGGFHADELLSSVI--LTGLFPQARIVRSR---ALDWITPGQDRIIYDVGGTYDPAAR 62
Query: 82 CYDHHQKGF---EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHP--DVHRLFLA 136
+DHHQ+G E+ SS GL++KHFG++ +A G P DV + +
Sbjct: 63 IFDHHQRGAPLRED------GQPYSSFGLIWKHFGRDYLAAS-----GLPEADVETVHAS 111
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLN--LDWTEPDQSAERENEAF 194
F+ ID DNG P + L + L D EP E ++ AF
Sbjct: 112 FDAGFVLPIDLTDNG-----ALSPSGPLAGLTLPVLLETLKPVFDEAEP----EADDRAF 162
Query: 195 QQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFE 254
+ +A + F++ + L A +IV + I G ++ L R P++ + +
Sbjct: 163 HAALAIA-RSFVEARIGQSAAKLRAETIVHQAIRAAG----QGRVLELPRGMPFRPAIVK 217
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
+ + +V++ R K W V + + + FE R LPA W GL + EL GI
Sbjct: 218 AGADHLL-----FVVHP--REKDWCVTGIRRAEEGFELRADLPAAWAGLTNGELEAVCGI 270
Query: 315 PGCVFVHMSGFIGGNQS 331
G F H FI ++
Sbjct: 271 EGASFCHNGRFIAAART 287
>gi|372278822|ref|ZP_09514858.1| hypothetical protein OS124_04109 [Oceanicola sp. S124]
Length = 313
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 144/320 (45%), Gaps = 28/320 (8%)
Query: 27 THNGSFHCDEALGCFMI-RLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCY 83
TH+G FH DE L ++ RL A++VRSRDP + + V DVGG YDP +
Sbjct: 8 THSGGFHADEVLSTVILGRLHPA---AEVVRSRDPDWICPAEGRLVYDVGGRYDPEAGIF 64
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFME 143
DHHQ+ E T SS GLV+KHFG + + + ++ E D+ + A ++F+
Sbjct: 65 DHHQR---ESPLREDGTPYSSFGLVWKHFGLDYL-RSFDIPE--EDLAGIHAAFDRSFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
+D +DNG P + L + K D T+P EAF + +A +
Sbjct: 119 PVDQVDNGTVSVSEAGPLASLTLPGLIETL-KPVFDDTDPASL----EEAFASAVAIA-R 172
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
F++ RS R+ ++ A R D + I+ L P++ + E +
Sbjct: 173 SFVEA--RVARSAAKRRAELLAAEAVRAAGDSA--ILELPMGMPFRPAVIEAGADH---- 224
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
+ +V+ + G W + + + FE R LPA W GL EL +G+ G F H +
Sbjct: 225 -LLFVITPRE-GGDWSLGGIRKHDEGFEQRADLPAAWGGLSGAELEAVSGVAGAKFCHKA 282
Query: 324 GFIGGNQSYGGALAMARAAL 343
F+ S L MA A+
Sbjct: 283 LFVAAAASREAILQMAELAV 302
>gi|126730952|ref|ZP_01746761.1| hypothetical protein SSE37_14103 [Sagittula stellata E-37]
gi|126708668|gb|EBA07725.1| hypothetical protein SSE37_14103 [Sagittula stellata E-37]
Length = 325
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/322 (27%), Positives = 144/322 (44%), Gaps = 33/322 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F NA++VRSRD + + DVG +YD +D
Sbjct: 8 THSGGFHADELLST--VVLTRLFPNAKLVRSRDVDWISPAPGRVIYDVGRIYDAEAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + SS GL+++HFG + + + L V E DV + + F+
Sbjct: 66 HHQRPAPQ---REDGAPYSSFGLIWRHFGHDYL-RALGVPE--VDVSGIHACFDREFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP---DQSAERENEAFQQGMDLA 201
+D +DNG + P L+ + L+ +P D+ ++ AF + +A
Sbjct: 120 VDLLDNGAMEPSVAGP--------LAGMTLPVLLETLKPVFDDRGPGADDRAFMDSLTVA 171
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + A ++V+ IAE +G VL+ P + E+
Sbjct: 172 -QAFVEAAVKGQAAKRRAEAMVMTAIAE------AGASRVLE--LPMGMPFRAAIEQAGA 222
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ L+ +V++ D W + + + FESR LP W GL D L +G+ G F H
Sbjct: 223 DHLL-FVVHPRD--GDWALNTIRKGSNTFESRADLPESWAGLTDTALEAASGVTGAKFCH 279
Query: 322 MSGFIGGNQSYGGALAMARAAL 343
+ FI S L MAR A+
Sbjct: 280 NARFIAVATSREAVLEMARLAV 301
>gi|15836020|ref|NP_300544.1| hypothetical protein CPj0489 [Chlamydophila pneumoniae J138]
gi|33241840|ref|NP_876781.1| hypothetical protein CpB0509 [Chlamydophila pneumoniae TW-183]
gi|384449249|ref|YP_005661851.1| hypothetical protein CPK_ORF01006 [Chlamydophila pneumoniae LPCoLN]
gi|14195495|sp|Q9Z862.2|Y489_CHLPN RecName: Full=UPF0160 protein CPn_0489/CP_0265/CPj0489/CpB0509
gi|8978859|dbj|BAA98695.1| CT386 hypothetical protein [Chlamydophila pneumoniae J138]
gi|33236349|gb|AAP98438.1| putative metal dependent hydrolase [Chlamydophila pneumoniae
TW-183]
gi|269303367|gb|ACZ33467.1| conserved hypothetical protein [Chlamydophila pneumoniae LPCoLN]
Length = 290
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 133/310 (42%), Gaps = 46/310 (14%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + +GTH+GSFH DE C ++ + D +I+RSRDP VL + V DVGGVY
Sbjct: 2 QIP-RSIGTHDGSFHADEVTACALLIIFDLVDENKIIRSRDPVVLSKCEYVCDVGGVYSI 60
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
N +DHHQ ++ SSAG++ H+ KE ++ +E H FL
Sbjct: 61 ENKRFDHHQVSYD--------GSWSSAGMIL-HYLKEF--GYMDCEEYH------FLN-- 101
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA-FQQG 197
+ +D DNG R+ + S + + P + E ++A F
Sbjct: 102 NTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI--YNPREEEETNSDADFSCA 149
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ +FL +R + R IV E + + + R W+ + F L
Sbjct: 150 LHFT-IDFLCRLRKKFQYDRVCRGIVREAM------ETEDMCLYFDRPLAWQENFFFLGG 202
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPG 316
E + + QW ++ + + D R E R P P W GL ELSK +GIPG
Sbjct: 203 EKHPAAFVCF-----PSCDQWILRGIPPNLDRRMEVRVPFPENWAGLLGKELSKVSGIPG 257
Query: 317 CVFVHMSGFI 326
VF H F+
Sbjct: 258 AVFCHKGLFL 267
>gi|241205307|ref|YP_002976403.1| metal-dependent protein hydrolase [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240859197|gb|ACS56864.1| metal-dependent protein hydrolase [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 307
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 138/303 (45%), Gaps = 31/303 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPSNDCYD 84
TH+G FH DE L ++ T F A+IVRSR P+ + + + DVGG Y+P+ +D
Sbjct: 8 THSGGFHADELLSSVIV--TRLFPQARIVRSRAPEWITPGEDRIIYDVGGAYNPTIGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE-LIAKELNVDEGHPDVHRLFLAVYKNFME 143
HHQ+G SS GL++KH+G++ LIA + E H + L + +F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHYGRDYLIA--FGLPEAH--IETLHGSFDASFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
+D DNG P + L + + L + E D E +N AF + +A +
Sbjct: 119 PVDLTDNG-----ALSPSGPLAGLTLPALLETLKPVFDEADP--EADNRAFHAALAIA-R 170
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
F++ + L A +IV I + G ++ L R P++ + + + +
Sbjct: 171 SFVEARIAQSAAKLRAEAIVHRAI----EAAGQGRVLELPRGMPFRPAILKAGADHLL-- 224
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
+V++ R K W V + + + FE R LP W GL + L GI G F H
Sbjct: 225 ---FVVHP--REKDWCVTGIRRAEEGFELRADLPGAWAGLTNGALEAVCGIEGASFCHNG 279
Query: 324 GFI 326
FI
Sbjct: 280 RFI 282
>gi|398378209|ref|ZP_10536375.1| hypothetical protein PMI03_01988 [Rhizobium sp. AP16]
gi|397725422|gb|EJK85873.1| hypothetical protein PMI03_01988 [Rhizobium sp. AP16]
Length = 318
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/309 (27%), Positives = 141/309 (45%), Gaps = 31/309 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+I+RSR P+ + D ++ DVGG YD + YD
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARIIRSRAPEWITPGADRIIYDVGGQYDAAERIYD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVH--RLFLAVYKNFM 142
HHQ+G SS GL++KH+G++ ++ G P+ H L + F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHYGRDYLSAS-----GLPEAHIEALHTSFDNGFV 117
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
+D +DNG + L + K D T+P E E+ FQ + +A
Sbjct: 118 LPVDLVDNGALSPSVAGQLAGLTLPALLETL-KPVFDETDP----EAEDRGFQNALGIA- 171
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ F++ + L A ++V + I + + G I+ L P++ + + + +
Sbjct: 172 RSFVEARIGQSAAKLRAEALVHQAIVDTGE----GRILELPMGMPFRPAIVKAGADHLL- 226
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V++ D K W + + + FE R LPA W GL + +L +G+ G F H
Sbjct: 227 ----FVVHPRD--KDWCLTGIRRADQGFELRADLPAAWAGLTNGDLEAASGVEGASFCHN 280
Query: 323 SGFIGGNQS 331
FI ++
Sbjct: 281 GRFIAAAKT 289
>gi|16752554|ref|NP_444816.1| hypothetical protein CP0265 [Chlamydophila pneumoniae AR39]
gi|7189191|gb|AAF38127.1| conserved hypothetical protein [Chlamydophila pneumoniae AR39]
Length = 300
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 133/310 (42%), Gaps = 46/310 (14%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + +GTH+GSFH DE C ++ + D +I+RSRDP VL + V DVGGVY
Sbjct: 12 QIP-RSIGTHDGSFHADEVTACALLIIFDLVDENKIIRSRDPVVLSKCEYVCDVGGVYSI 70
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
N +DHHQ ++ SSAG++ H+ KE ++ +E H FL
Sbjct: 71 ENKRFDHHQVSYD--------GSWSSAGMIL-HYLKEF--GYMDCEEYH------FLN-- 111
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA-FQQG 197
+ +D DNG R+ + S + + P + E ++A F
Sbjct: 112 NTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI--YNPREEEETNSDADFSCA 159
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ +FL +R + R IV E + + + R W+ + F L
Sbjct: 160 LHFT-IDFLCRLRKKFQYDRVCRGIVREAM------ETEDMCLYFDRPLAWQENFFFLGG 212
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPG 316
E + + QW ++ + + D R E R P P W GL ELSK +GIPG
Sbjct: 213 EKHPAAFVCF-----PSCDQWILRGIPPNLDRRMEVRVPFPENWAGLLGKELSKVSGIPG 267
Query: 317 CVFVHMSGFI 326
VF H F+
Sbjct: 268 AVFCHKGLFL 277
>gi|424871294|ref|ZP_18294956.1| hypothetical protein Rleg5DRAFT_2771 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393166995|gb|EJC67042.1| hypothetical protein Rleg5DRAFT_2771 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 313
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/307 (27%), Positives = 136/307 (44%), Gaps = 29/307 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPSNDCYD 84
TH+G FH DE L + L F A+IVRSR P+ + + + DVGG YDP+ +D
Sbjct: 8 THSGGFHADELLSSVI--LARLFPQARIVRSRAPEWITPGEDRIIYDVGGAYDPAAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KH+G++ +A + E H V + + F+
Sbjct: 66 HHQRGAPL---RDDGQPYSSFGLIWKHYGRDYLAAS-ELPEAH--VEAMHGSFDAGFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D DNG P + L + + L + E D E ++ F + +A +
Sbjct: 120 VDLTDNG-----ALSPSGPLAGLTLPALLETLKPVFDETDP--EADDRTFHAALAIA-RS 171
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A +IV I + G ++ L R P++ + + + +
Sbjct: 172 FVEARIAQSAAKLRAEAIVHRAI----EAAGQGRVLELPRGMPFRPAIVKAGADQLL--- 224
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+V++ R K W V + + D FE R LP W GL + L GI G F H
Sbjct: 225 --FVVHP--REKDWCVTGIRRAEDGFELRADLPEAWAGLTNGALEAVCGIEGASFCHNGR 280
Query: 325 FIGGNQS 331
FI ++
Sbjct: 281 FIAAART 287
>gi|15618400|ref|NP_224685.1| hypothetical protein CPn0489 [Chlamydophila pneumoniae CWL029]
gi|4376775|gb|AAD18629.1| CT386 hypothetical protein [Chlamydophila pneumoniae CWL029]
Length = 290
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 133/310 (42%), Gaps = 46/310 (14%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + +GTH+GSFH DE C ++ + D +I+RSRDP VL + V DVGGVY
Sbjct: 2 QIP-RSIGTHDGSFHADEVTACALLIIFDLVDENKIIRSRDPVVLSKCEYVCDVGGVYSI 60
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
N +DHHQ ++ SSAG++ H+ KE ++ +E H FL
Sbjct: 61 ENKRFDHHQVSYD--------GSWSSAGMIL-HYLKEF--GYMDCEEYH------FLN-- 101
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEA-FQQG 197
+ +D DNG R+ + S + + P + E ++A F
Sbjct: 102 NTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI--YNPREEEETNSDADFSCA 149
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ +FL +R + R IV E + + + R W+ + F L
Sbjct: 150 LHFT-IDFLCRLRKKFQYDRVCRGIVREAM------ETEDMCLYFDRPLAWQENFFFLGG 202
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPG 316
E + + QW ++ + + D R + R P P W GL ELSK +GIPG
Sbjct: 203 EKHPAAFVCF-----PSCDQWILRGIPPNLDRRMDVRVPFPENWAGLLGKELSKVSGIPG 257
Query: 317 CVFVHMSGFI 326
VF H F+
Sbjct: 258 AVFCHKGLFL 267
>gi|84683843|ref|ZP_01011746.1| hypothetical protein 1099457000264_RB2654_20758 [Maritimibacter
alkaliphilus HTCC2654]
gi|84668586|gb|EAQ15053.1| hypothetical protein RB2654_20758 [Maritimibacter alkaliphilus
HTCC2654]
Length = 307
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 86/306 (28%), Positives = 136/306 (44%), Gaps = 33/306 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A +VRSRD + + + DVG YD +D
Sbjct: 8 THSGGFHADELLAS--VVLTRLFPEAALVRSRDAAWITPMAGRIIYDVGRAYDAEAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ SS+GL++ FG + +A L V E D +F + + F+
Sbjct: 66 HHQRPAPL---RPDGRPYSSSGLIWARFGADYLAA-LGVPEA--DRAGVFAGMDEGFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP---DQSAERENEAFQQGMDLA 201
+D +DNG P L+ + L+ +P D+S E ++ AF + +A
Sbjct: 120 VDLLDNGAIAPSVAGP--------LAGLTLPVLLESLKPAFDDRSPEADDAAFHAALPVA 171
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + + A +V+ IA+ +G VL+ P + E+
Sbjct: 172 -RAFVEAEVRRLAAKARAEGMVLAAIAK------AGASRVLE--LPMGMPFRSAVEKAGA 222
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ ++ +V++ D W + + V D FESR LP W GL D L G+PG F H
Sbjct: 223 DHML-FVIHPRD--GDWALNTIRVGADTFESRADLPESWAGLTDAALEAACGVPGAKFCH 279
Query: 322 MSGFIG 327
+ FI
Sbjct: 280 NARFIA 285
>gi|406592080|ref|YP_006739260.1| hypothetical protein B711_0290 [Chlamydia psittaci CP3]
gi|406593190|ref|YP_006740369.1| hypothetical protein B712_0283 [Chlamydia psittaci NJ1]
gi|405787952|gb|AFS26695.1| hypothetical protein B711_0290 [Chlamydia psittaci CP3]
gi|405789062|gb|AFS27804.1| hypothetical protein B712_0283 [Chlamydia psittaci NJ1]
Length = 292
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 88/324 (27%), Positives = 138/324 (42%), Gaps = 48/324 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +++R+R+P+ L + + V DVGGVY P
Sbjct: 8 RSVGTHDGSFHADEVTACALLILFDLVDEEKVIRTRNPEKLAECEYVCDVGGVYSPDQKR 67
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V H+ KE + ++++E H H L +
Sbjct: 68 FDHHQVSYE--------GPWSSAGMVL-HYLKE--QRLIDLEEYHFLNHTL--------I 108
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG R+ + S + + E ++A + F +
Sbjct: 109 HGIDEQDNG----------RFFSKEGFCSFSDIIKIYNPEEGRNASDADFFFSLKFTI-- 156
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ L +R R R +V R + + R W+ + F L E
Sbjct: 157 -DLLKRLRNKFRYDRMCRDVV------RSAMEKDEFCLFFDRPLAWQENFFFLGGEQHPA 209
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ + + QW ++ + + D R E R P P W GL EL + +GIPG +F H
Sbjct: 210 AFVCFPACD-----QWILRGIPPTLDRRMEVRVPFPESWAGLLGKELEEISGIPGAIFCH 264
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
F+ S + R AL+L
Sbjct: 265 KGLFL----SVWDSKEHCRRALQL 284
>gi|332287205|ref|YP_004422106.1| hypothetical protein CPSIT_0280 [Chlamydophila psittaci 6BC]
gi|384450359|ref|YP_005662959.1| hypothetical protein G5O_0285 [Chlamydophila psittaci 6BC]
gi|384451358|ref|YP_005663956.1| hypothetical protein CPS0A_0286 [Chlamydophila psittaci 01DC11]
gi|384452334|ref|YP_005664931.1| hypothetical protein CPS0D_0285 [Chlamydophila psittaci 08DC60]
gi|384453308|ref|YP_005665904.1| hypothetical protein CPS0C_0285 [Chlamydophila psittaci C19/98]
gi|392376458|ref|YP_004064236.1| conserved hypothetical protein [Chlamydophila psittaci RD1]
gi|406594674|ref|YP_006741409.1| hypothetical protein B599_0282 [Chlamydia psittaci MN]
gi|407460429|ref|YP_006738204.1| hypothetical protein B603_0282 [Chlamydia psittaci WC]
gi|410858235|ref|YP_006974175.1| conserved hypothetical protein [Chlamydia psittaci 01DC12]
gi|449070910|ref|YP_007437990.1| hypothetical protein AO9_01365 [Chlamydophila psittaci Mat116]
gi|313847801|emb|CBY16791.1| conserved hypothetical protein [Chlamydophila psittaci RD1]
gi|325507255|gb|ADZ18893.1| conserved hypothetical protein [Chlamydophila psittaci 6BC]
gi|328914453|gb|AEB55286.1| conserved hypothetical protein [Chlamydophila psittaci 6BC]
gi|334692089|gb|AEG85308.1| conserved hypothetical protein [Chlamydophila psittaci C19/98]
gi|334693068|gb|AEG86286.1| conserved hypothetical protein [Chlamydophila psittaci 01DC11]
gi|334695023|gb|AEG88239.1| conserved hypothetical protein [Chlamydophila psittaci 08DC60]
gi|405783106|gb|AFS21854.1| hypothetical protein B599_0282 [Chlamydia psittaci MN]
gi|405786809|gb|AFS25553.1| hypothetical protein B603_0282 [Chlamydia psittaci WC]
gi|410811130|emb|CCO01773.1| conserved hypothetical protein [Chlamydia psittaci 01DC12]
gi|449039418|gb|AGE74842.1| hypothetical protein AO9_01365 [Chlamydophila psittaci Mat116]
Length = 289
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 88/324 (27%), Positives = 138/324 (42%), Gaps = 48/324 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +++R+R+P+ L + + V DVGGVY P
Sbjct: 5 RSVGTHDGSFHADEVTACALLILFDLVDEEKVIRTRNPEKLAECEYVCDVGGVYSPDQKR 64
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V H+ KE + ++++E H H L +
Sbjct: 65 FDHHQVSYE--------GPWSSAGMVL-HYLKE--QRLIDLEEYHFLNHTL--------I 105
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG R+ + S + + E ++A + F +
Sbjct: 106 HGIDEQDNG----------RFFSKEGFCSFSDIIKIYNPEEGRNASDADFFFSLKFTI-- 153
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ L +R R R +V R + + R W+ + F L E
Sbjct: 154 -DLLKRLRNKFRYDRMCRDVV------RSAMEKDEFCLFFDRPLAWQENFFFLGGEQHPA 206
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ + + QW ++ + + D R E R P P W GL EL + +GIPG +F H
Sbjct: 207 AFVCFPACD-----QWILRGIPPTLDRRMEVRVPFPESWAGLLGKELEEISGIPGAIFCH 261
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
F+ S + R AL+L
Sbjct: 262 KGLFL----SVWDSKEHCRRALQL 281
>gi|424824943|ref|ZP_18249930.1| hypothetical protein CAB1_0263 [Chlamydophila abortus LLG]
gi|333410042|gb|EGK69029.1| hypothetical protein CAB1_0263 [Chlamydophila abortus LLG]
Length = 289
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 82/305 (26%), Positives = 130/305 (42%), Gaps = 44/305 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +IVR+R+P+ L + + V DVGGVY P
Sbjct: 5 RSVGTHDGSFHADEVTACALLILFDLVDEDKIVRTRNPEKLAECEYVCDVGGVYSPEQKR 64
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V + ++ + ++++E H H L +
Sbjct: 65 FDHHQVAYE--------GPWSSAGMVLDYLKEQRL---IDLEEYHFLNHML--------I 105
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG R+ + S + + E ++A + F +
Sbjct: 106 HGIDEQDNG----------RFFSKEGFCSFSDIIKIYNPEEGRNASDADFFFSLKFTI-- 153
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ L +R R R +V R + + R W+ + F L E
Sbjct: 154 -DLLKRLRNKFRYDRMCRDVV------RSAMEKDEFCLFFDRPLAWQENFFFLGGEQHPA 206
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ + + QW ++ + + D R E R P P W GL EL + +GIPG +F H
Sbjct: 207 AFVCFPACD-----QWILRGIPPTLDRRMEVRVPFPESWAGLLGKELEEISGIPGAIFCH 261
Query: 322 MSGFI 326
F+
Sbjct: 262 KGLFL 266
>gi|407457816|ref|YP_006736121.1| hypothetical protein B601_0283 [Chlamydia psittaci WS/RT/E30]
gi|405785320|gb|AFS24066.1| hypothetical protein B601_0283 [Chlamydia psittaci WS/RT/E30]
Length = 289
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 88/324 (27%), Positives = 139/324 (42%), Gaps = 48/324 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +++R+R+P+ L + + V DVGGVY P
Sbjct: 5 RSVGTHDGSFHADEVTACALLILFDLVDEEKVIRTRNPEKLAECEYVCDVGGVYSPDQKR 64
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V H+ KE + ++++E H FL + +
Sbjct: 65 FDHHQVSYE--------GPWSSAGMVL-HYLKE--QRLIDLEEYH------FLN--RTLI 105
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG R+ + S + + E ++A + F +
Sbjct: 106 HGIDEQDNG----------RFFSKEGFCSFSDIIKIYNPEEGRNASDADFFFSLKFTI-- 153
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ L +R R R +V R + + R W+ + F L E
Sbjct: 154 -DLLKRLRNKFRYDRMCRDVV------RSAMEKDEFCLFFDRPLAWQENFFFLGGEQHPA 206
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ + + QW ++ + + D R E R P P W GL EL + +GIPG +F H
Sbjct: 207 AFVCFPACD-----QWILRGIPPTLDRRMEVRVPFPESWAGLLGKELEEISGIPGAIFCH 261
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
F+ S + R AL+L
Sbjct: 262 KGLFL----SVWDSKEHCRRALQL 281
>gi|62184891|ref|YP_219676.1| hypothetical protein CAB249 [Chlamydophila abortus S26/3]
gi|62147958|emb|CAH63705.1| conserved hypothetical protein [Chlamydophila abortus S26/3]
Length = 289
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 137/324 (42%), Gaps = 48/324 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +IVR+R+P+ L + + V DVGGVY P
Sbjct: 5 RSVGTHDGSFHADEVTACALLILFDLVDEDKIVRTRNPEKLAECEYVCDVGGVYSPEQKR 64
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V + ++ + ++++E H H L +
Sbjct: 65 FDHHQVAYE--------GPWSSAGMVLDYLREQRL---IDLEEYHFLNHTL--------I 105
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG R+ + S + + E ++A + F +
Sbjct: 106 HGIDEQDNG----------RFFSKEGFCSFSDIIKIYNPEEGRNASDADFFFSLKFTI-- 153
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ L +R R R +V R + + R W+ + F L E
Sbjct: 154 -DLLKRLRNKFRYDRMCRDVV------RSAMEKDEFCLFFDRPLAWQENFFFLGGEQHPA 206
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ + + QW ++ + + D R E R P P W GL EL + +GIPG +F H
Sbjct: 207 AFVCFPACD-----QWILRGIPPTLDRRMEVRVPFPESWAGLLGKELEEISGIPGAIFCH 261
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
F+ S + R AL+L
Sbjct: 262 KGLFL----SVWDSKEHCRRALQL 281
>gi|407459060|ref|YP_006737163.1| hypothetical protein B602_0282 [Chlamydia psittaci M56]
gi|405786508|gb|AFS25253.1| hypothetical protein B602_0282 [Chlamydia psittaci M56]
Length = 292
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/324 (26%), Positives = 137/324 (42%), Gaps = 48/324 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +I+R+R+P+ L + + V DVGG+Y P
Sbjct: 8 RSVGTHDGSFHADEVTACALLILFDLVDEEKIIRTRNPEKLAECEYVCDVGGIYSPDQKR 67
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V + ++ + ++++E H H L +
Sbjct: 68 FDHHQVSYE--------GPWSSAGMVLDYLKEQ---RFIDLEEYHFLNHTL--------I 108
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG R+ + S + + E ++A + F +
Sbjct: 109 HGIDEQDNG----------RFFSKEGFCSFSDIIKIYNPEEGRNASDTDFFFSLKFTI-- 156
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ L +R R R +V R + + R W+ + F L E
Sbjct: 157 -DLLKRLRNKFRYDRMCRDVV------RSAMEKDEFCLFFDRPLAWQENFFFLGGEQHPA 209
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ + + QW ++ + + D R E R P P W GL EL + +GIPG +F H
Sbjct: 210 AFVCFPACD-----QWILRGIPPTLDRRMEVRVPFPESWAGLLGKELEEISGIPGAIFCH 264
Query: 322 MSGFIGGNQSYGGALAMARAALKL 345
F+ S + R AL+L
Sbjct: 265 KGLFL----SVWDSKEHCRRALQL 284
>gi|400755270|ref|YP_006563638.1| hypothetical protein PGA2_c24110 [Phaeobacter gallaeciensis 2.10]
gi|398654423|gb|AFO88393.1| hypothetical protein PGA2_c24110 [Phaeobacter gallaeciensis 2.10]
Length = 309
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 154/323 (47%), Gaps = 33/323 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F +A+++RSRD + + DVG YD +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPDAELIRSRDAAWITPAAGRIIYDVGQAYDGEAQIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++ HFG++ + + L+V E DV + ++F+
Sbjct: 66 HHQRP-NPLREDG--QPYSSFGLIWHHFGRDYL-RGLDVPEA--DVDAIHHGFDQSFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP---DQSAERENEAFQQGMDLA 201
+D +DNG P L+ + L+ +P D+ +++AF + +A
Sbjct: 120 VDLLDNGAIDPGVAGP--------LAGMTLPVLLETLKPVFDDRRDGADDQAFLAALPVA 171
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + A ++V+E I + +G+++ L P++ + E+
Sbjct: 172 -RAFVEASIRGKAAKRRAETMVLEAI----EGAGAGKVLELPMGMPFRSAV----EKAGA 222
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ L+ +V++ RG W + + S D FESR LPA W GL D+ L +G+ G F H
Sbjct: 223 DHLL-FVVHP--RGSDWTLTTIRKSGDSFESRADLPAAWAGLTDEALEAASGVAGAKFCH 279
Query: 322 MSGFIGGNQSYGGALAMARAALK 344
+ FI ++ + +A A++
Sbjct: 280 NARFIAVARTREAIMDLANRAVQ 302
>gi|163744364|ref|ZP_02151724.1| metal-dependent protein hydrolase [Oceanibulbus indolifex HEL-45]
gi|161381182|gb|EDQ05591.1| metal-dependent protein hydrolase [Oceanibulbus indolifex HEL-45]
Length = 305
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 153/321 (47%), Gaps = 29/321 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT + +A ++RSRD + + DVG YD +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLYPDATLLRSRDADWITPGAGRIIYDVGREYDADALIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGH-PDVHRLFLAVYKNFME 143
HHQ+ + G SS GL+++H+G++ + + +V E D+HR F + F+
Sbjct: 66 HHQRP-NPLREDG--QPFSSFGLIWQHYGRDYL-RSFDVPEADVEDIHRSF---DQGFVL 118
Query: 144 AIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
+D IDNG + P + L + L + E + A+ + AF + +A +
Sbjct: 119 PVDLIDNGALEPSVAGP---LAGMTLPVLLETLKPVFDERGEDAD--DRAFMAALPVA-R 172
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
F++ + A ++V++ I + SG ++ L R P++ + E+ +
Sbjct: 173 AFVEAAIKGKAAKRRAEAMVMQAI----EAAGSGRVLELPRGMPFRSAV----EKAGADH 224
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
L+ +V++ RG+ W + + S D FE+R LP W GL D +L +G+ G F H +
Sbjct: 225 LL-FVVHP--RGEDWALTTIRKSGDSFEARADLPEAWAGLTDADLEAASGVQGAKFCHNA 281
Query: 324 GFIGGNQSYGGALAMARAALK 344
FI + AL MA A++
Sbjct: 282 RFIAVAANREAALRMADLAVQ 302
>gi|399993755|ref|YP_006573995.1| hypothetical protein PGA1_c26080 [Phaeobacter gallaeciensis DSM
17395 = CIP 105210]
gi|398658310|gb|AFO92276.1| hypothetical protein PGA1_c26080 [Phaeobacter gallaeciensis DSM
17395 = CIP 105210]
Length = 309
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 154/323 (47%), Gaps = 33/323 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F +A+++RSRD + + DVG YD +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPDAELIRSRDAAWITPAAGRIIYDVGQAYDGEAQIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++ HFG++ + + L+V E DV + ++F+
Sbjct: 66 HHQRP-NPLREDG--QPYSSFGLIWHHFGRDYL-RGLDVPEA--DVDAIHHGFDQSFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP---DQSAERENEAFQQGMDLA 201
+D +DNG P L+ + L+ +P D+ +++AF + +A
Sbjct: 120 VDLLDNGAIDPGVAGP--------LAGMTLPVLLETLKPVFDDRRDGADDQAFLAALPVA 171
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + A ++V+E I + +G+++ L P++ + E+
Sbjct: 172 -RAFVEASIRGKAAKRRAETMVLEAI----EGAGAGKVLELPMGMPFRSAV----EKAGA 222
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ L+ +V++ RG W + + S D FESR LPA W GL D+ L +G+ G F H
Sbjct: 223 DHLL-FVVHP--RGSDWTLTTIRKSGDSFESRADLPAAWAGLMDEALEAASGVAGAKFCH 279
Query: 322 MSGFIGGNQSYGGALAMARAALK 344
+ FI ++ + +A A++
Sbjct: 280 NARFIAVARTREAIMDLANRAVQ 302
>gi|417105785|ref|ZP_11961926.1| putative hydrolase protein [Rhizobium etli CNPAF512]
gi|327190296|gb|EGE57394.1| putative hydrolase protein [Rhizobium etli CNPAF512]
Length = 307
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 133/302 (44%), Gaps = 29/302 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P + D ++ DVGGVYDP +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARLIRSRAPAWITPGQDRIIYDVGGVYDPDAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KH+G++ +A + VH F A +F+
Sbjct: 66 HHQRGAPL---REDGQPYSSFGLIWKHYGRDYLAACGLPADHVEAVHSSFDA---SFVLP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
ID DNG + P K D EP E ++ AF + +A +
Sbjct: 120 IDLTDNGAL---SPSGPLANLTLPTLLETLKPVFDEAEP----EADDRAFHAALAIA-RS 171
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A +IV + I D ++ L R P++ + + + +
Sbjct: 172 FVEARIAQSAAKLRAEAIVRQAI----DIAGQARVLELPRGMPFRPAIVKAGADHLL--- 224
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+V++ R K W V + + + F R LP W GL + EL G+ G F H
Sbjct: 225 --FVVHP--REKDWCVTGIRRAEEGFALRADLPKAWAGLANGELEAVCGVEGASFCHNGR 280
Query: 325 FI 326
FI
Sbjct: 281 FI 282
>gi|409438272|ref|ZP_11265359.1| Metal-dependent hydrolase protein [Rhizobium mesoamericanum
STM3625]
gi|408750138|emb|CCM76528.1| Metal-dependent hydrolase protein [Rhizobium mesoamericanum
STM3625]
Length = 330
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 144/331 (43%), Gaps = 46/331 (13%)
Query: 11 AYSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAV 69
A + + P TP V TH+G FH DE L + LT F A++ RSR P+ + D +
Sbjct: 10 ANNKAHPGMTPNFLV-THSGGFHADEVLSSVI--LTRLFPAARLTRSRAPEWIKPGADRI 66
Query: 70 L-DVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHP 128
+ DVGG YD S +DHHQ+G SS GL++KH+G++ + V G P
Sbjct: 67 IYDVGGEYDASRRIFDHHQRGAPL---REDGQPYSSFGLIWKHYGRDYL-----VATGVP 118
Query: 129 DVH--RLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQS 186
+VH L + F+ AID DNG P + L + L + + D +
Sbjct: 119 EVHVEALHASFDAAFVLAIDLTDNG-----ALSPSGPLAGLTLPVLLETLKPVFDDTDPN 173
Query: 187 AERENEAFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC 246
A+ + FQ + +A + F++ + L A ++V++ I ER +G VL
Sbjct: 174 AD--DRRFQDAVAIA-RSFVEAKIAQGAAKLRAEALVLQAI-ER-----TGPARVL---- 220
Query: 247 PWKLHLFELEEEMKIEPLI------KYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQW 300
EL M P I + R + W V + + FE R LPA W
Sbjct: 221 -------ELPMGMPFRPAIVKAGADHLLFVVHPRERDWCVTGIRRGDEGFELRADLPAAW 273
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFIGGNQS 331
GL + +L G+ G F H F+ ++
Sbjct: 274 AGLTNHDLEAVCGVEGATFCHNGRFVAAAKT 304
>gi|399036731|ref|ZP_10733695.1| hypothetical protein PMI09_01196 [Rhizobium sp. CF122]
gi|398065558|gb|EJL57179.1| hypothetical protein PMI09_01196 [Rhizobium sp. CF122]
Length = 305
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 141/310 (45%), Gaps = 35/310 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P+ + D ++ DVGG YD S +D
Sbjct: 8 THSGGFHADEVLSSVI--LTRLFLQARVIRSRAPEWITPGADRIIYDVGGAYDASKGIFD 65
Query: 85 HHQKGF---EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
HHQ+G E+ SS GL++KH+G++ + V E H V L + F
Sbjct: 66 HHQRGAPLRED------GQPYSSFGLIWKHYGRDYLVAT-GVPEAH--VEALHTSFDLAF 116
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+ ID DNG P + L + + L + + D A+ FQ + +A
Sbjct: 117 VLPIDLTDNG-----ALSPSGPIAGLTLPALLETLKPVFDDTDPDADDRK--FQDAVAIA 169
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + L A ++V++ I ER G ++ L P++ + + + +
Sbjct: 170 -RSFVEAKIAQSAAKLRAEALVLQAI-ERTG---QGRVLELPTGMPFRPAIVKARADHLL 224
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V++ R K W V + + FE R LPA W GL + +L G+ G F H
Sbjct: 225 -----FVVHP--REKDWCVTGIRRGDEGFELRADLPAAWAGLTNHDLEAVCGVQGATFCH 277
Query: 322 MSGFIGGNQS 331
F+ ++
Sbjct: 278 NGRFVAAAKT 287
>gi|83944409|ref|ZP_00956863.1| hypothetical protein EE36_09200 [Sulfitobacter sp. EE-36]
gi|83953450|ref|ZP_00962172.1| hypothetical protein NAS141_14116 [Sulfitobacter sp. NAS-14.1]
gi|83842418|gb|EAP81586.1| hypothetical protein NAS141_14116 [Sulfitobacter sp. NAS-14.1]
gi|83844732|gb|EAP82615.1| hypothetical protein EE36_09200 [Sulfitobacter sp. EE-36]
Length = 308
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 85/326 (26%), Positives = 156/326 (47%), Gaps = 28/326 (8%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPS 79
+ ++ TH+G FH DE L + LT F +A ++RSRD ++ + + DVGG +D
Sbjct: 3 ITQLVTHSGGFHADELLSSVI--LTRLFPDATVLRSRDKALITPAEGRIIYDVGGDFDAE 60
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
+DHHQ+ + G SS GL++ H+G + + + L+V E DV+ + + +
Sbjct: 61 AGIFDHHQRP-NPLREDG--QPYSSFGLIWAHYGHDYL-RALDVPEA--DVNAIHDSFDR 114
Query: 140 NFMEAIDAIDNG-INQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
F+ ID +DNG +N ++ P + T L + + L + + ++ ++ AF +
Sbjct: 115 GFVLPIDLLDNGAVNA--SEAGPLFAGLT-LPALLESLKPVFDDREEGV--DDRAFAAAL 169
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+A + F++ + A ++V+ I D G ++ L P++ + + +
Sbjct: 170 PIA-RAFVEASIRRKAAKFRAEAMVMTAI----DAAGEGHVLDLPMGMPFRGAVVKAGAD 224
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCV 318
+ +V++ RG++W + + D FE+R LP W G +D L + +G+ G
Sbjct: 225 H-----LLFVIHP--RGEEWTLTTIRSGDDTFENRADLPVAWAGQKDAALEEASGVKGAK 277
Query: 319 FVHMSGFIGGNQSYGGALAMARAALK 344
F H FI S L MA A+K
Sbjct: 278 FCHNGRFIAVADSREAILQMADIAVK 303
>gi|218509787|ref|ZP_03507665.1| putative hydrolase protein [Rhizobium etli Brasil 5]
Length = 327
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 136/309 (44%), Gaps = 33/309 (10%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P + D ++ DVGGVYD +D
Sbjct: 46 THSGGFHADELLSSVI--LTRLFPQARLIRSRAPAWITPGQDRIIYDVGGVYDADAGIFD 103
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKE-LIAKELNVDEGHPD-VHRLFLAVYKNFM 142
HHQ+G SS GL++KH+G++ L A L D H D VH F A +F+
Sbjct: 104 HHQRGAPL---RDDGQPYSSFGLIWKHYGRDYLTACGLPAD--HVDAVHSSFDA---SFV 155
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
ID DNG + P K D EP E ++ AF + +A
Sbjct: 156 LPIDLTDNGAL---SPSGPLANLTLPTLLETLKPVFDEAEP----EADDRAFHAALAIA- 207
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ F++ + L A +IV + I D ++ L R P++ + + + +
Sbjct: 208 RSFVEARIAQSAAKLRAEAIVRQAI----DIAGQARVLELPRGMPFRPAIVKAGADHLL- 262
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V++ R K W V + + + F R LP W GL + EL G+ G F H
Sbjct: 263 ----FVVHP--REKDWCVTGIRRAEEGFALRADLPKAWAGLANGELEAVCGVEGASFCHN 316
Query: 323 SGFIGGNQS 331
FI +
Sbjct: 317 GRFIAAAAT 325
>gi|67481547|ref|XP_656123.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56473303|gb|EAL50737.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449703049|gb|EMD43563.1| Hypothetical protein EHI5A_067400 [Entamoeba histolytica KU27]
Length = 348
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 149/329 (45%), Gaps = 30/329 (9%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+G F D L ++R +F + I + + + D VL GG YDPS + +D
Sbjct: 35 IGVSGFGFEFDVILSLTLLRNVSQFKRSSIKLLHSKEEMKECDIVLGYGGQYDPSLNLFD 94
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIA--------KELNVDEGHPDVHRLFLA 136
+HQKGF + + G + ++ + +V+ FGKE++ + N+ + ++ RL
Sbjct: 95 YHQKGFNQKYP-GSTYPMTCSSMVFLKFGKEIVRSCCLKYVDSKGNLIKVTDEIIRLACK 153
Query: 137 VYK-NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
Y + +E+IDAI NG+ Y++ P +Y T+ + E + F
Sbjct: 154 FYYLSRLESIDAILNGVLCYNS--PTKYEPLTDGIYITSDYCNSIDDIKLMVINEYKIFN 211
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLF-E 254
M+LA ++T Y A+S + E+ D I+ + P K +L +
Sbjct: 212 STMNLA----IETAVLYHPVMRNAKSSL-----EKKGLDK--RILCFDK--PVKSNLIVQ 258
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
+E+E+K E + + +G +++ +F +RK LP WRG ++EL + G
Sbjct: 259 VEKELKCEGNFLFFIRHSPKG----YFIFSINETQFVNRKSLPKSWRGKENEELEEACGE 314
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
G +F H SGF+ S+ A +A A+
Sbjct: 315 KGALFCHHSGFMLSCSSFSVATKLAIKAM 343
>gi|190892336|ref|YP_001978878.1| hydrolase [Rhizobium etli CIAT 652]
gi|190697615|gb|ACE91700.1| putative hydrolase protein [Rhizobium etli CIAT 652]
Length = 307
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 134/306 (43%), Gaps = 35/306 (11%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P + D ++ DVGGVYD +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARLIRSRAPAWITPGQDRIIYDVGGVYDADAGIFD 65
Query: 85 HHQKGF---EEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
HHQ+G E+ SS GL++KH+G++ +A + VH F A +F
Sbjct: 66 HHQRGAPLRED------GQPYSSFGLIWKHYGRDYLAACGLPADHVEAVHSSFDA---SF 116
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
+ ID DNG + P K D EP E ++ AF + +A
Sbjct: 117 VLPIDLTDNGAL---SPSGPLANLTLPTLLETLKPVFDEAEP----EADDRAFHAALAIA 169
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKI 261
+ F++ + L A +IV + I D ++ L R P++ + + + +
Sbjct: 170 -RSFVEARIAQSAAKLRAEAIVRQAI----DIAGQARVLELPRGMPFRPAIVKAGADHLL 224
Query: 262 EPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V++ R K W V + + + F R LP W GL + EL G+ G F H
Sbjct: 225 -----FVVHP--REKDWCVTGIRRAEEGFALRADLPKAWAGLANGELEAVCGVEGASFCH 277
Query: 322 MSGFIG 327
FI
Sbjct: 278 NGRFIA 283
>gi|407040775|gb|EKE40320.1| hypothetical protein ENU1_094600 [Entamoeba nuttalli P19]
Length = 346
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 148/329 (44%), Gaps = 30/329 (9%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+G F D L +++ +F + I + + + D VL GG YDPS + +D
Sbjct: 35 IGVSGFGFEFDVILSLTLLKNVSQFKRSSIKLLHSKEEMKECDIVLGYGGQYDPSLNLFD 94
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAK--------ELNVDEGHPDVHRLFLA 136
+HQKGF + + G S ++ + +V+ FGKE++ + N + ++ RL
Sbjct: 95 YHQKGFNKKYP-GSSYLMTCSSMVFLKFGKEIVHSCCLKYIDPKGNSIKVTDEIIRLACK 153
Query: 137 VYK-NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQ 195
Y N +E+IDAI NG+ Y++ P +Y T+ + E + F
Sbjct: 154 FYYLNRLESIDAILNGVLCYNS--PTKYEPLTDGIYITSDYCNSIDDIKLIVTNEYKTFN 211
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHL-FE 254
M+LA ++T Y A+S + E+ D I++ + P K +L +
Sbjct: 212 STMNLA----IETAVLYHPVMRNAKSSL-----EKQGLDK--RILLFDK--PVKSNLIIQ 258
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGI 314
+E+E+K E + + +G +++ +F +RK LP WRG ++EL + G
Sbjct: 259 VEKELKCEGNFLFFIRHSPKG----YFIFSINETQFVNRKSLPKSWRGKENEELEEACGE 314
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAAL 343
G +F H SGF+ S A +A A+
Sbjct: 315 KGALFCHHSGFMLSCSSLSVATKLAIKAM 343
>gi|218515217|ref|ZP_03512057.1| putative hydrolase protein [Rhizobium etli 8C-3]
Length = 331
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/302 (28%), Positives = 132/302 (43%), Gaps = 29/302 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPK-VLDDLDAVL-DVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P + D ++ DVGGVYD +D
Sbjct: 32 THSGGFHADELLSSVI--LTRLFPQARLIRSRAPAWITPGQDRIIYDVGGVYDADAGIFD 89
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+G SS GL++KH+G++ +A + VH F A +F+
Sbjct: 90 HHQRGAPL---RDDGQPYSSFGLIWKHYGRDYLAACGLPADHVEAVHSSFDA---SFVLP 143
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
ID DNG + P K D EP E ++ AF + +A +
Sbjct: 144 IDLTDNGAL---SPSGPLANLTLPTLLETLKPVFDEAEP----EADDRAFHAALAIA-RS 195
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + L A +IV + I D ++ L R P++ + + + +
Sbjct: 196 FVEARIAQSAAKLRAEAIVRQAI----DIAGQARVLELPRGMPFRPAIVKAGADHLL--- 248
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+V++ R K W V + + + F R +P W GL + EL G+ G F H
Sbjct: 249 --FVVHP--REKDWCVTGIRRAEEGFALRADMPKAWAGLANGELEAVCGVEGASFCHNGR 304
Query: 325 FI 326
FI
Sbjct: 305 FI 306
>gi|29840021|ref|NP_829127.1| hypothetical protein CCA00254 [Chlamydophila caviae GPIC]
gi|29834368|gb|AAP05005.1| conserved hypothetical protein [Chlamydophila caviae GPIC]
Length = 292
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 129/305 (42%), Gaps = 44/305 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +I+R+R+P+ L + + V DVGG+Y
Sbjct: 8 RSVGTHDGSFHADEVTACALLILFDLVDEGKIIRTRNPEKLAECEYVCDVGGIYSLEEKR 67
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG++ + ++ + ++++E H FL + +
Sbjct: 68 FDHHQVSYE--------GPWSSAGMILNYLKEQRL---MSLEEYH------FLN--QTLI 108
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
+D DNG R+ + S + + P++ + F +
Sbjct: 109 HGVDEQDNG----------RFFSKEGFCSFSDIIKI--YNPEEGKNSSDADFFFALKFT- 155
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ L +R R R +V R + + R W+ + F L E
Sbjct: 156 MDLLKRLRSKFRYDRMCRDVV------RSAMEKDEFCLFFDRPLAWQENFFFLGGEQHTA 209
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ + + QW ++ + + D R E R P P W GL +L + +GIPG +F H
Sbjct: 210 AFVCFPACD-----QWILRGIPPTLDRRMEVRVPFPESWAGLLGKDLEEVSGIPGAIFCH 264
Query: 322 MSGFI 326
F+
Sbjct: 265 KGLFL 269
>gi|167395479|ref|XP_001741544.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165893880|gb|EDR21998.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 346
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 146/336 (43%), Gaps = 44/336 (13%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYD 84
+G F D L ++R ++F + I + + D VL GG YDPS + +D
Sbjct: 35 IGVSGFGFEFDVVLSIVLLRSVNQFKRSSIKFLHSKEEMKGCDVVLGYGGQYDPSLNLFD 94
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELI-----------AKELNVDEGHPDVHRL 133
+HQKGF + G + ++ + +V+ FGKE++ K + V + ++ RL
Sbjct: 95 YHQKGFNNKYP-GSTYLMTCSSMVFLKFGKEIVNSYCFKYIDPKGKSIKVTD---EIIRL 150
Query: 134 FLAVYK-NFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
Y N +E+IDAI NG+ Y + P +Y + T+ + + E +
Sbjct: 151 VCKFYYLNRLESIDAILNGVLCY--NDPIKYESLTDGIYVTSDYCTTIDDIELIVANEYK 208
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKR--FCPWKL 250
F M+L ++T Y P E + ++ Y KR FC +
Sbjct: 209 NFHSTMNLT----IETATLY----HPVIRKAKESLQQQGLY---------KRILFCDQSV 251
Query: 251 H---LFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDE 307
+ + E+E+E+K E + + + +G +++ +F +RK LP WRG ++E
Sbjct: 252 NNSLVIEVEKELKCEGNFLFFIRQSPKG----CYIFSINETQFVNRKSLPKSWRGKENEE 307
Query: 308 LSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
L + G G F H SGF+ + A +A A+
Sbjct: 308 LEEACGEKGAFFCHHSGFMLSCSNLNVAKKLAIKAM 343
>gi|156384258|ref|XP_001633248.1| predicted protein [Nematostella vectensis]
gi|156220315|gb|EDO41185.1| predicted protein [Nematostella vectensis]
Length = 170
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 64/91 (70%), Gaps = 2/91 (2%)
Query: 98 FSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDT 157
+ TKLSSAGLVY HFG+ ++++ + + E H + +++ +Y+N ++ +DAIDNG++Q +
Sbjct: 11 WKTKLSSAGLVYLHFGRRVLSQVMQMPEDHQALDKVYDKIYENLIQEVDAIDNGVSQ--S 68
Query: 158 DKPPRYVNNTNLSSRVGKLNLDWTEPDQSAE 188
D+ PRY+ TNLS+RVG LN W + + E
Sbjct: 69 DEKPRYIITTNLSARVGNLNPKWNDKNMDEE 99
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 37/64 (57%), Positives = 47/64 (73%)
Query: 281 QAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMAR 340
+ V+V P FE+R L +WRG+RDD LSK +GI GC+FVH +GFIGGNQ+ GAL MAR
Sbjct: 99 ECVSVRPQSFENRLGLLEEWRGIRDDALSKLSGIQGCIFVHATGFIGGNQTRKGALEMAR 158
Query: 341 AALK 344
L+
Sbjct: 159 KTLQ 162
>gi|254486819|ref|ZP_05100024.1| metal-dependent protein hydrolase [Roseobacter sp. GAI101]
gi|214043688|gb|EEB84326.1| metal-dependent protein hydrolase [Roseobacter sp. GAI101]
Length = 308
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 86/334 (25%), Positives = 140/334 (41%), Gaps = 46/334 (13%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD--AVLDVGGVYDPS 79
++ + TH+G FH DE L + LT F +AQ+VR+RD + + D+GG +D
Sbjct: 3 IQHLVTHSGGFHADELLSSVI--LTRLFPDAQLVRTRDKDWITPAQDRIIYDLGGQFDAG 60
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
+DHHQ+ + +G S+ GL++ H+G++ + + ++V + D+ + + +
Sbjct: 61 LQIFDHHQRP-NPLRENG--QPFSAFGLIWAHYGRDYL-RAMDVPD--QDIEAIHTSFDR 114
Query: 140 NFMEAIDAIDNG-INQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
F+ +D IDNG IN ++ P + L+ V L D+ ++ AF
Sbjct: 115 GFVVPVDLIDNGAINA--SEAGPLF---AGLTLPVLLETLKPVFDDREDGADDRAFAAA- 168
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRF-CPWKLHLFELEE 257
LP VE R E MV+ H+ EL
Sbjct: 169 ------------------LPIARAFVEAQINRKAAKNRAEAMVMAAITAAGDRHVLELPM 210
Query: 258 EMKIEPLIK--------YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
M ++ +V++ RG W + + D F++R LPA W GL D L
Sbjct: 211 GMPFRAGVENAGADHLLFVVHP--RGADWALTTIRTGDDTFDTRADLPAAWAGLTDTALE 268
Query: 310 KEAGIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+G+ G F H FI S L MA A+
Sbjct: 269 DASGVKGAKFCHNGRFIAVASSRDAVLQMAELAV 302
>gi|330444271|ref|YP_004377257.1| hypothetical protein G5S_0582 [Chlamydophila pecorum E58]
gi|328807381|gb|AEB41554.1| conserved hypothetical protein [Chlamydophila pecorum E58]
Length = 288
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/327 (26%), Positives = 134/327 (40%), Gaps = 44/327 (13%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + VGTH+GSFH DE C ++ + D +IVRSR+ + L + V DVGGVY
Sbjct: 2 QIP-RSVGTHDGSFHADEVTACALLIIFDLVDEEKIVRSRNFEELKKCEYVCDVGGVYSL 60
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVY 138
+ +DHHQ +E SSAG+V + L + L+ +E H
Sbjct: 61 KDKRFDHHQLSYE--------GPWSSAGMVLHYL---LQSGHLDFEEYH--------FFN 101
Query: 139 KNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGM 198
+ ID DNG R+ + S + + + ++ E ++ F +
Sbjct: 102 NTLVRGIDDQDNG----------RFFSEKGFCSFSDIIKI-YNPREEQEEFSDKDFFLAL 150
Query: 199 DLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+FL +R R R +V E + + S + R W+ + F L
Sbjct: 151 RFT-VDFLRRLREKFRYDRACREVVKEAM------EGSDLCLFFDRPLAWQENFFFLGGA 203
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ + QW ++ + + + R E R P P W GL +ELSK + IPG
Sbjct: 204 THPAAFVCF-----PSCDQWILRGIPPNLERRMEVRIPFPESWAGLLGEELSKVSQIPGA 258
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALK 344
VF H F+ S G + L+
Sbjct: 259 VFCHKGRFLSVWSSREGCQRALQVTLE 285
>gi|241896307|ref|ZP_04783603.1| metal-dependent protein hydrolase [Weissella paramesenteroides ATCC
33313]
gi|241870468|gb|EER74219.1| metal-dependent protein hydrolase [Weissella paramesenteroides ATCC
33313]
Length = 276
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 143/323 (44%), Gaps = 56/323 (17%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
K + THNG FH D+ + +T F I+R+RD V DD + VYD
Sbjct: 3 KNLVTHNGKFHADDVFASVI--MTQLFPELTIIRTRDEIVTDDHQNL-----VYDVGGGE 55
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHH G ++ H +++ GL+++ FG++ I K L D + + V +F+
Sbjct: 56 FDHH--GIDDTRQHQNGVPMAAFGLIWQTFGQQYIEK-LVPDLSEEIIKAVHRHVDSSFV 112
Query: 143 EAIDAIDNGINQYDTD--KPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDL 200
IDA+DNG+++Y ++ P +N D+ E D E +F +
Sbjct: 113 IGIDALDNGVSEYQSEVFTVPDVIN-------------DFYEEDD----ETNSFNHAVAC 155
Query: 201 AGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMK 260
A + V+ V A+ I + I E+ + S +I++L PWK + + ++
Sbjct: 156 ATLILENRVKKTV-----AKQIAQQAIIEQATF-VSNQILLLPISGPWKDYAEQFDQ--- 206
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFV 320
L+ VL D W +Q V V+ FE + +P +++D G VF+
Sbjct: 207 ---LVFVVLPRKD--GNWMIQGVPVTHGSFEVKHAIPQ----VKED---------GIVFI 248
Query: 321 HMSGFIGGNQSYGGALAMARAAL 343
H +GF+ + AL +A+ L
Sbjct: 249 HRTGFMAVVDTQEHALQLAQQVL 271
>gi|15605111|ref|NP_219896.1| metal dependent hydrolase [Chlamydia trachomatis D/UW-3/CX]
gi|166154597|ref|YP_001654715.1| hypothetical protein CTL0642 [Chlamydia trachomatis 434/Bu]
gi|166155472|ref|YP_001653727.1| hypothetical protein CTLon_0639 [Chlamydia trachomatis
L2b/UCH-1/proctitis]
gi|255311192|ref|ZP_05353762.1| MYG1 protein [Chlamydia trachomatis 6276]
gi|255317493|ref|ZP_05358739.1| MYG1 protein [Chlamydia trachomatis 6276s]
gi|255348751|ref|ZP_05380758.1| MYG1 protein [Chlamydia trachomatis 70]
gi|255503291|ref|ZP_05381681.1| MYG1 protein [Chlamydia trachomatis 70s]
gi|255506970|ref|ZP_05382609.1| MYG1 protein [Chlamydia trachomatis D(s)2923]
gi|301335864|ref|ZP_07224108.1| MYG1 protein [Chlamydia trachomatis L2tet1]
gi|339626059|ref|YP_004717538.1| hypothetical protein CTL2C_394 [Chlamydia trachomatis L2c]
gi|385239902|ref|YP_005807744.1| MYG1 protein [Chlamydia trachomatis G/9768]
gi|385240825|ref|YP_005808666.1| MYG1 protein [Chlamydia trachomatis G/11222]
gi|385241757|ref|YP_005809597.1| MYG1 protein [Chlamydia trachomatis E/11023]
gi|385242679|ref|YP_005810518.1| MYG1 protein [Chlamydia trachomatis G/9301]
gi|385245364|ref|YP_005814187.1| MYG1 protein [Chlamydia trachomatis E/150]
gi|385246288|ref|YP_005815110.1| MYG1 protein [Chlamydia trachomatis G/11074]
gi|386262740|ref|YP_005816019.1| hypothetical protein SW2_3921 [Chlamydia trachomatis Sweden2]
gi|389858079|ref|YP_006360321.1| hypothetical protein FSW4_3921 [Chlamydia trachomatis F/SW4]
gi|389858955|ref|YP_006361196.1| hypothetical protein ESW3_3921 [Chlamydia trachomatis E/SW3]
gi|389859831|ref|YP_006362071.1| hypothetical protein FSW5_3921 [Chlamydia trachomatis F/SW5]
gi|14195435|sp|O84391.2|Y386_CHLTR RecName: Full=UPF0160 protein CT_386
gi|6578103|gb|AAC67982.2| predicted metal dependent hydrolase [Chlamydia trachomatis
D/UW-3/CX]
gi|165930585|emb|CAP04082.1| conserved hypothetical protein [Chlamydia trachomatis 434/Bu]
gi|165931460|emb|CAP07036.1| conserved hypothetical protein [Chlamydia trachomatis
L2b/UCH-1/proctitis]
gi|289525428|emb|CBJ14905.1| conserved hypothetical protein [Chlamydia trachomatis Sweden2]
gi|296434980|gb|ADH17158.1| MYG1 protein [Chlamydia trachomatis E/150]
gi|296435907|gb|ADH18081.1| MYG1 protein [Chlamydia trachomatis G/9768]
gi|296436833|gb|ADH19003.1| MYG1 protein [Chlamydia trachomatis G/11222]
gi|296437767|gb|ADH19928.1| MYG1 protein [Chlamydia trachomatis G/11074]
gi|296438700|gb|ADH20853.1| MYG1 protein [Chlamydia trachomatis E/11023]
gi|297140267|gb|ADH97025.1| MYG1 protein [Chlamydia trachomatis G/9301]
gi|339460811|gb|AEJ77314.1| uncharacterized protein family (UPF0160) family protein [Chlamydia
trachomatis L2c]
gi|380249151|emb|CCE14443.1| conserved hypothetical protein [Chlamydia trachomatis F/SW5]
gi|380250026|emb|CCE13554.1| conserved hypothetical protein [Chlamydia trachomatis F/SW4]
gi|380250904|emb|CCE12665.1| conserved hypothetical protein [Chlamydia trachomatis E/SW3]
gi|440525305|emb|CCP50556.1| hypothetical protein SOTONK1_00405 [Chlamydia trachomatis
K/SotonK1]
gi|440526192|emb|CCP51676.1| hypothetical protein L2B8200_00400 [Chlamydia trachomatis
L2b/8200/07]
gi|440527090|emb|CCP52574.1| hypothetical protein SOTOND1_00408 [Chlamydia trachomatis
D/SotonD1]
gi|440527981|emb|CCP53465.1| hypothetical protein SOTOND5_00405 [Chlamydia trachomatis
D/SotonD5]
gi|440528872|emb|CCP54356.1| hypothetical protein SOTOND6_00405 [Chlamydia trachomatis
D/SotonD6]
gi|440529764|emb|CCP55248.1| hypothetical protein SOTONE4_00406 [Chlamydia trachomatis
E/SotonE4]
gi|440530663|emb|CCP56147.1| hypothetical protein SOTONE8_00412 [Chlamydia trachomatis
E/SotonE8]
gi|440531554|emb|CCP57064.1| hypothetical protein SOTONF3_00406 [Chlamydia trachomatis
F/SotonF3]
gi|440532446|emb|CCP57956.1| hypothetical protein SOTONG1_00406 [Chlamydia trachomatis
G/SotonG1]
gi|440533339|emb|CCP58849.1| hypothetical protein SOTONIA1_00407 [Chlamydia trachomatis
Ia/SotonIa1]
gi|440534233|emb|CCP59743.1| hypothetical protein SOTONIA3_00407 [Chlamydia trachomatis
Ia/SotonIa3]
gi|440535131|emb|CCP60641.1| hypothetical protein BOUR_00411 [Chlamydia trachomatis E/Bour]
gi|440536017|emb|CCP61530.1| hypothetical protein L2B795_00401 [Chlamydia trachomatis L2b/795]
gi|440536909|emb|CCP62423.1| hypothetical protein L1440_00403 [Chlamydia trachomatis L1/440/LN]
gi|440537799|emb|CCP63313.1| hypothetical protein L11322_00401 [Chlamydia trachomatis
L1/1322/p2]
gi|440538689|emb|CCP64203.1| hypothetical protein L1115_00401 [Chlamydia trachomatis L1/115]
gi|440539578|emb|CCP65092.1| hypothetical protein L1224_00401 [Chlamydia trachomatis L1/224]
gi|440540469|emb|CCP65983.1| hypothetical protein L225667R_00402 [Chlamydia trachomatis
L2/25667R]
gi|440541358|emb|CCP66872.1| hypothetical protein L3404_00401 [Chlamydia trachomatis L3/404/LN]
gi|440542245|emb|CCP67759.1| hypothetical protein L2BUCH2_00400 [Chlamydia trachomatis
L2b/UCH-2]
gi|440543136|emb|CCP68650.1| hypothetical protein L2BCAN2_00401 [Chlamydia trachomatis
L2b/Canada2]
gi|440544027|emb|CCP69541.1| hypothetical protein L2BLST_00400 [Chlamydia trachomatis L2b/LST]
gi|440544917|emb|CCP70431.1| hypothetical protein L2BAMS1_00400 [Chlamydia trachomatis L2b/Ams1]
gi|440545807|emb|CCP71321.1| hypothetical protein L2BCV204_00400 [Chlamydia trachomatis
L2b/CV204]
gi|440914069|emb|CCP90486.1| hypothetical protein L2BAMS2_00400 [Chlamydia trachomatis L2b/Ams2]
gi|440914959|emb|CCP91376.1| hypothetical protein L2BAMS3_00400 [Chlamydia trachomatis L2b/Ams3]
gi|440915851|emb|CCP92268.1| hypothetical protein L2BCAN1_00402 [Chlamydia trachomatis
L2b/Canada1]
gi|440916745|emb|CCP93162.1| hypothetical protein L2BAMS4_00401 [Chlamydia trachomatis L2b/Ams4]
gi|440917635|emb|CCP94052.1| hypothetical protein L2BAMS5_00401 [Chlamydia trachomatis L2b/Ams5]
Length = 289
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 128/317 (40%), Gaps = 61/317 (19%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + VGTH+GSFH DE C ++ + D +IVR+RDP+ L + V DVGG Y
Sbjct: 2 QIP-RSVGTHDGSFHADEVTACALLIMFDLVDENKIVRTRDPQKLAQCEYVCDVGGRYST 60
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHF-GKELIAKELNVDEGHPDVHRLFLAV 137
+ +DHHQ ++ SSAG+V + G ++ H + +
Sbjct: 61 EHKRFDHHQV--------SYTGSWSSAGMVLDYLHGLGFLS------------HDEYEYL 100
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
+ +D DNG R+ + S + + ++G
Sbjct: 101 NNTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI------------YNPLEEG 138
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC-------PWKL 250
+ KEF +RF + R + +R D ++M + C W+
Sbjct: 139 GN-TDKEFFFALRFAIDLLTRLRE---KFCYDRVCRDIVKQVMEKESVCLRFDRPLAWQE 194
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELS 309
+ F L E + + + QW ++ + + D R E R P P +W GL D+L
Sbjct: 195 NFFSLGGESHPAAFVSFPCSD-----QWILRGIPPTLDRRMEVRIPFPEEWAGLLGDQLV 249
Query: 310 KEAGIPGCVFVHMSGFI 326
+ GIPG +F H F+
Sbjct: 250 QATGIPGAIFCHKGLFL 266
>gi|89179348|gb|ABD63086.1| hypothetical protein 17.t00013 [Asparagus officinalis]
Length = 117
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/50 (82%), Positives = 43/50 (86%)
Query: 61 KVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYK 110
K+LD LDAVLDVGGVYDP D YDHHQKGF EV HGF+TKLSSAGLVYK
Sbjct: 9 KILDTLDAVLDVGGVYDPDRDRYDHHQKGFTEVLEHGFNTKLSSAGLVYK 58
>gi|89898561|ref|YP_515671.1| hypothetical protein CF0754 [Chlamydophila felis Fe/C-56]
gi|89331933|dbj|BAE81526.1| hypothetical protein [Chlamydophila felis Fe/C-56]
Length = 292
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 128/305 (41%), Gaps = 44/305 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L +I+R+R+P+ L + + V DVGGVY
Sbjct: 8 RSVGTHDGSFHADEVTACALLILFGLVDEGKIIRTRNPEKLAECEYVCDVGGVYSVEQKR 67
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG++ + ++ + ++++E H H L +
Sbjct: 68 FDHHQVSYE--------GPWSSAGMILDYLKEQRL---IDLEEYHFLNHTL--------I 108
Query: 143 EAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAG 202
+D DNG R+ + S + + E ++A + F +
Sbjct: 109 HGVDEQDNG----------RFFSKEGFCSFSDIIKIYNPEEGRNATDADFFFALKFTI-- 156
Query: 203 KEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIE 262
+ L +R R R VV E+ D+ + R W+ + F L E
Sbjct: 157 -DLLKRLRNKFRYDRMCRD-VVRSAMEKDDF-----CLFFDRPLAWQENFFFLGGERHPA 209
Query: 263 PLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+ + + QW ++ + + D R E R P P W GL +L GIPG +F H
Sbjct: 210 AFVCFPACD-----QWILRGIPPTLDRRMEVRVPFPENWAGLLGKDLEDVCGIPGAIFCH 264
Query: 322 MSGFI 326
F+
Sbjct: 265 KGLFL 269
>gi|385243593|ref|YP_005811439.1| MYG1 protein [Chlamydia trachomatis D-EC]
gi|385244473|ref|YP_005812317.1| MYG1 protein [Chlamydia trachomatis D-LC]
gi|297748516|gb|ADI51062.1| MYG1 protein [Chlamydia trachomatis D-EC]
gi|297749396|gb|ADI52074.1| MYG1 protein [Chlamydia trachomatis D-LC]
Length = 304
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 128/317 (40%), Gaps = 61/317 (19%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + VGTH+GSFH DE C ++ + D +IVR+RDP+ L + V DVGG Y
Sbjct: 17 QIP-RSVGTHDGSFHADEVTACALLIMFDLVDENKIVRTRDPQKLAQCEYVCDVGGRYST 75
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHF-GKELIAKELNVDEGHPDVHRLFLAV 137
+ +DHHQ ++ SSAG+V + G ++ H + +
Sbjct: 76 EHKRFDHHQV--------SYTGSWSSAGMVLDYLHGLGFLS------------HDEYEYL 115
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
+ +D DNG R+ + S + + ++G
Sbjct: 116 NNTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI------------YNPLEEG 153
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC-------PWKL 250
+ KEF +RF + R + +R D ++M + C W+
Sbjct: 154 GN-TDKEFFFALRFAIDLLTRLRE---KFCYDRVCRDIVKQVMEKESVCLRFDRPLAWQE 209
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELS 309
+ F L E + + + QW ++ + + D R E R P P +W GL D+L
Sbjct: 210 NFFSLGGESHPAAFVSFPCSD-----QWILRGIPPTLDRRMEVRIPFPEEWAGLLGDQLV 264
Query: 310 KEAGIPGCVFVHMSGFI 326
+ GIPG +F H F+
Sbjct: 265 QATGIPGAIFCHKGLFL 281
>gi|76789117|ref|YP_328203.1| MYG1 protein [Chlamydia trachomatis A/HAR-13]
gi|76167647|gb|AAX50655.1| MYG1 protein [Chlamydia trachomatis A/HAR-13]
Length = 304
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 128/317 (40%), Gaps = 61/317 (19%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + VGTH+GSFH DE C ++ + D +IVR+RDP+ L + V DVGG Y
Sbjct: 17 QIP-RSVGTHDGSFHADEVTACALLIMFDLVDENKIVRTRDPQKLAQCEYVCDVGGRYST 75
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHF-GKELIAKELNVDEGHPDVHRLFLAV 137
+ +DHHQ ++ SSAG+V + G ++ H + +
Sbjct: 76 EHKRFDHHQV--------SYTGSWSSAGMVLDYLHGLGFLS------------HDEYEYL 115
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
+ +D DNG R+ + S + + ++G
Sbjct: 116 NNTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI------------YNPLEEG 153
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFC-------PWKL 250
+ KEF +RF + R + +R D ++M + C W+
Sbjct: 154 GN-TDKEFFFALRFAIDLLTRLRE---KFCYDRVCRDIVKQVMEKESVCLRFDRPLAWQE 209
Query: 251 HLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELS 309
+ F L E + + + QW ++ + + D R E R P P +W GL D+L
Sbjct: 210 NFFSLGGESHPAAFVSFPCSD-----QWILRGIPPTLDRRMEVRIPFPEEWAGLLGDQLV 264
Query: 310 KEAGIPGCVFVHMSGFI 326
+ GIPG +F H F+
Sbjct: 265 QATGIPGAIFCHKGLFL 281
>gi|237802811|ref|YP_002888005.1| hypothetical protein JALI_3851 [Chlamydia trachomatis B/Jali20/OT]
gi|237804733|ref|YP_002888887.1| hypothetical protein CTB_3851 [Chlamydia trachomatis B/TZ1A828/OT]
gi|376282392|ref|YP_005156218.1| hypothetical protein CTR_3851 [Chlamydia trachomatis A2497]
gi|231273033|emb|CAX09946.1| conserved hypothetical protein [Chlamydia trachomatis B/TZ1A828/OT]
gi|231274045|emb|CAX10839.1| conserved hypothetical protein [Chlamydia trachomatis B/Jali20/OT]
gi|371908422|emb|CAX09052.1| conserved hypothetical protein [Chlamydia trachomatis A2497]
gi|438690316|emb|CCP49573.1| hypothetical protein A7249_00413 [Chlamydia trachomatis A/7249]
gi|438691400|emb|CCP48674.1| hypothetical protein A5291_00413 [Chlamydia trachomatis A/5291]
gi|438692773|emb|CCP47775.1| hypothetical protein A363_00414 [Chlamydia trachomatis A/363]
Length = 289
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 131/322 (40%), Gaps = 71/322 (22%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + VGTH+GSFH DE C ++ + D +IVR+RDP+ L + V DVGG Y
Sbjct: 2 QIP-RSVGTHDGSFHADEVTACALLIMFDLVDENKIVRTRDPQKLAQCEYVCDVGGRYST 60
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHF-GKELIAKELNVDEGHPDVHRLFLAV 137
+ +DHHQ ++ SSAG+V + G ++ H + +
Sbjct: 61 EHKRFDHHQV--------SYTGSWSSAGMVLDYLHGLGFLS------------HDEYEYL 100
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
+ +D DNG R+ + S + + ++G
Sbjct: 101 NNTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI------------YNPLEEG 138
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP-----SGEIMVLKRFC------ 246
+ KEF +RF + ++ + E++ YD ++M + C
Sbjct: 139 GN-TDKEFFFALRFAID--------LLTRLREKFCYDRVCRHIVKQVMEKESVCLRFDRP 189
Query: 247 -PWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLR 304
W+ + F L E + + + QW ++ + + D R E R P P +W GL
Sbjct: 190 LAWQENFFSLGGESHPAAFVSFPCSD-----QWILRGIPPTLDRRMEVRIPFPEEWAGLL 244
Query: 305 DDELSKEAGIPGCVFVHMSGFI 326
D+L + GIPG +F H F+
Sbjct: 245 GDQLVQATGIPGAIFCHKGLFL 266
>gi|99082108|ref|YP_614262.1| metal-dependent protein hydrolase [Ruegeria sp. TM1040]
gi|99038388|gb|ABF65000.1| metal-dependent protein hydrolase [Ruegeria sp. TM1040]
Length = 309
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 85/319 (26%), Positives = 148/319 (46%), Gaps = 26/319 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + L+ + +A++VR+RD + + + DVGG +D + +D
Sbjct: 8 THSGGFHADELLSTVI--LSRLYPHAELVRTRDKAWITPAEGRIIYDVGGQFDAAARIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++ FG++ + + L V E D+ R+ A +F+
Sbjct: 66 HHQRP-NPLRPDG--QPYSSFGLIWAAFGQDYL-RALEVPE--KDLDRVHAAFDADFVRP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG + P + L+ V +L D+ ++ AF + +A +
Sbjct: 120 VDLLDNGAVD-SAEAGPLF---AGLTLPVLLESLKPVFDDRRDGADDAAFVDALAIA-RA 174
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
F++ + + L A +V+ I + +GE VL+ P + E+ + L
Sbjct: 175 FVEARLRHKAAKLRATDVVLRAIED------AGERRVLE--LPMGMPFRGAVEQAGADHL 226
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+ +V++ R W + + D FE+R LP W GL D +G+PG F H
Sbjct: 227 L-FVIHP--RDSDWALTTIRTGDDTFETRADLPLAWAGLTDAAFEAASGVPGAAFCHNGR 283
Query: 325 FIGGNQSYGGALAMARAAL 343
F+ +S LA+A A+
Sbjct: 284 FLAIARSREAVLALAELAV 302
>gi|270285454|ref|ZP_06194848.1| hypothetical protein CmurN_03373 [Chlamydia muridarum Nigg]
gi|270289465|ref|ZP_06195767.1| hypothetical protein CmurW_03463 [Chlamydia muridarum Weiss]
gi|301336851|ref|ZP_07225053.1| hypothetical protein CmurM_03430 [Chlamydia muridarum MopnTet14]
gi|14195469|sp|Q9PK08.2|Y665_CHLMU RecName: Full=UPF0160 protein TC_0665
Length = 289
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 127/311 (40%), Gaps = 49/311 (15%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + VGTH+GSFH DE C ++ + D +I+R+RD + L + V DVGG Y
Sbjct: 2 QIP-RSVGTHDGSFHADEVTACALLIMFDLVDKDKIIRTRDSEKLAQCEYVCDVGGRYSI 60
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLV--YKHFGKELIAKELNVDEGHPDVHRLFLA 136
++ +DHHQ ++ SSAG+V Y H L H +
Sbjct: 61 ADKRFDHHQV--------SYTGSWSSAGMVLDYLHHLGSL-------------PHEEYEY 99
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+ + +D DNG R+ + S + + P + ++ F
Sbjct: 100 LNSTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI--YNPLEEGGNTDKEFFF 147
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
+ A + L +R R R +V + + E+ D + R W+ + F L
Sbjct: 148 ALHFA-IDLLTRLREKFRYDRICRDVVKQ-VMEKEDV-----CLRFDRPLAWQENFFSLG 200
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIP 315
E + + + QW ++ + + D R E R P P W GL D+L K GIP
Sbjct: 201 GENHPAAFVSFPCSD-----QWILRGIPPTLDRRMEVRIPFPEDWAGLLGDQLVKATGIP 255
Query: 316 GCVFVHMSGFI 326
G +F H F+
Sbjct: 256 GAIFCHKGLFL 266
>gi|385270077|ref|YP_005813237.1| MYG1 protein [Chlamydia trachomatis A2497]
gi|347975217|gb|AEP35238.1| MYG1 protein [Chlamydia trachomatis A2497]
Length = 304
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 131/322 (40%), Gaps = 71/322 (22%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + VGTH+GSFH DE C ++ + D +IVR+RDP+ L + V DVGG Y
Sbjct: 17 QIP-RSVGTHDGSFHADEVTACALLIMFDLVDENKIVRTRDPQKLAQCEYVCDVGGRYST 75
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHF-GKELIAKELNVDEGHPDVHRLFLAV 137
+ +DHHQ ++ SSAG+V + G ++ H + +
Sbjct: 76 EHKRFDHHQV--------SYTGSWSSAGMVLDYLHGLGFLS------------HDEYEYL 115
Query: 138 YKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQG 197
+ +D DNG R+ + S + + ++G
Sbjct: 116 NNTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI------------YNPLEEG 153
Query: 198 MDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDP-----SGEIMVLKRFC------ 246
+ KEF +RF + ++ + E++ YD ++M + C
Sbjct: 154 GN-TDKEFFFALRFAID--------LLTRLREKFCYDRVCRHIVKQVMEKESVCLRFDRP 204
Query: 247 -PWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLR 304
W+ + F L E + + + QW ++ + + D R E R P P +W GL
Sbjct: 205 LAWQENFFSLGGESHPAAFVSFPCSD-----QWILRGIPPTLDRRMEVRIPFPEEWAGLL 259
Query: 305 DDELSKEAGIPGCVFVHMSGFI 326
D+L + GIPG +F H F+
Sbjct: 260 GDQLVQATGIPGAIFCHKGLFL 281
>gi|15835280|ref|NP_297039.1| hypothetical protein TC0665 [Chlamydia muridarum Nigg]
gi|7190700|gb|AAF39487.1| conserved hypothetical protein [Chlamydia muridarum Nigg]
Length = 291
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 127/311 (40%), Gaps = 49/311 (15%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P + VGTH+GSFH DE C ++ + D +I+R+RD + L + V DVGG Y
Sbjct: 4 QIP-RSVGTHDGSFHADEVTACALLIMFDLVDKDKIIRTRDSEKLAQCEYVCDVGGRYSI 62
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLV--YKHFGKELIAKELNVDEGHPDVHRLFLA 136
++ +DHHQ ++ SSAG+V Y H L H +
Sbjct: 63 ADKRFDHHQV--------SYTGSWSSAGMVLDYLHHLGSL-------------PHEEYEY 101
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+ + +D DNG R+ + S + + P + ++ F
Sbjct: 102 LNSTLVHGVDEQDNG----------RFFSKEGFCSFSDIIKI--YNPLEEGGNTDKEFFF 149
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
+ A + L +R R R +V + + E+ D + R W+ + F L
Sbjct: 150 ALHFA-IDLLTRLREKFRYDRICRDVVKQ-VMEKEDV-----CLRFDRPLAWQENFFSLG 202
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPD-RFESRKPLPAQWRGLRDDELSKEAGIP 315
E + + + QW ++ + + D R E R P P W GL D+L K GIP
Sbjct: 203 GENHPAAFVSFPCSD-----QWILRGIPPTLDRRMEVRIPFPEDWAGLLGDQLVKATGIP 257
Query: 316 GCVFVHMSGFI 326
G +F H F+
Sbjct: 258 GAIFCHKGLFL 268
>gi|357469189|ref|XP_003604879.1| hypothetical protein MTR_4g020990, partial [Medicago truncatula]
gi|355505934|gb|AES87076.1| hypothetical protein MTR_4g020990, partial [Medicago truncatula]
Length = 207
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 36/41 (87%), Positives = 39/41 (95%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKV 62
LKRVGTHNGSFHCDEALGCFMIRLT FFNAQIVR+RDP++
Sbjct: 55 LKRVGTHNGSFHCDEALGCFMIRLTRNFFNAQIVRTRDPQI 95
>gi|341821277|emb|CCC57634.1| metal-dependent protein hydrolase [Weissella thailandensis fsh4-2]
Length = 276
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 80/325 (24%), Positives = 141/325 (43%), Gaps = 58/325 (17%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVL-DDLDAVLDVGGVYDPSND 81
K + THNG FH D+ + +T F IVR+RD V + + V DVGG
Sbjct: 3 KNLVTHNGKFHADDVFASVI--MTRLFPTLTIVRTRDETVTGEQSNFVYDVGG------G 54
Query: 82 CYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNF 141
+DHH G + H +++ GL+++ FGK+ IA +LN D ++ + +F
Sbjct: 55 AFDHH--GINDTRQHANGVPMAAFGLIWQQFGKQYIA-DLNPDLAPAITQAVYQKIDSHF 111
Query: 142 MEAIDAIDNGINQYDTD--KPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMD 199
+ IDA+DNG++ Y ++ P +N D+ + D A +F + +
Sbjct: 112 IVGIDALDNGVSAYQSEVFTVPDIIN-------------DFYDEDDEA----NSFDRALA 154
Query: 200 LAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEM 259
A + V+ + + + ++ + S +I+ + PWK + + ++
Sbjct: 155 FAKTILENRVKKTIAKEIAQQDVIAQATF------VSDQILFVPISGPWKDYAEQFDQ-- 206
Query: 260 KIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVF 319
LI VL D W +Q V V+ FE + +P ++ +E G VF
Sbjct: 207 ----LIFAVLPRKD--GNWMIQGVPVAHGSFEVKHAIP---------QIKEE----GIVF 247
Query: 320 VHMSGFIGGNQSYGGALAMARAALK 344
+H +GF+ + AL +A+ L
Sbjct: 248 IHRTGFMAVVDTAEHALQLAKDVLN 272
>gi|259416630|ref|ZP_05740550.1| metal-dependent protein hydrolase [Silicibacter sp. TrichCH4B]
gi|259348069|gb|EEW59846.1| metal-dependent protein hydrolase [Silicibacter sp. TrichCH4B]
Length = 307
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 146/319 (45%), Gaps = 26/319 (8%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + L+ + +A++VR+RD + + + DVGG +D +D
Sbjct: 8 THSGGFHADELLSTVI--LSCLYPDAELVRTRDKSWITPAEGRIIYDVGGEFDAGAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++ FG++ + + ++ VH F ++F+
Sbjct: 66 HHQRP-NPLRPDG--QPYSSFGLIWAEFGRDYLRAQAVPEKDLEQVHAAF---DEDFVRP 119
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D +DNG DT + L + L + D+S ++ AF + +A +
Sbjct: 120 VDLLDNG--AVDTAEAGPLFAGLTLPVLLESLKPVFD--DRSEGADDTAFHAALTVA-RA 174
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPL 264
++ + + L A +V+ I ++ +G ++ L P++ + E + L
Sbjct: 175 IVEARIRHKAAKLRAADVVLTAI----EHAGTGCVLELPMGMPFRGAV----ERAGADHL 226
Query: 265 IKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSG 324
+ +V++ R W + + D FE+R LPA W GL D L + +G+ G F H +
Sbjct: 227 L-FVVHP--RNADWALTTIRTGDDTFETRADLPAAWAGLTDAALEQASGVEGAKFCHNAR 283
Query: 325 FIGGNQSYGGALAMARAAL 343
F+ S L +A A+
Sbjct: 284 FLAIADSRDAILKLAELAV 302
>gi|238924786|ref|YP_002938302.1| hypothetical protein EUBREC_2437 [Eubacterium rectale ATCC 33656]
gi|238876461|gb|ACR76168.1| Hypothetical protein EUBREC_2437 [Eubacterium rectale ATCC 33656]
Length = 414
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 134/323 (41%), Gaps = 62/323 (19%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++ + N +I +R V DD G V+D +DHH
Sbjct: 17 THGGKFHADDVFSSALLL----YINPEISITRGNSVPDDF-----TGIVFDIGRGEFDHH 67
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
QK + +G ++ GL+++ G +++ KEL V ++F++ +D
Sbjct: 68 QKD-SRIRENG--VPYAAFGLLWEAVGADILGKELAVK------------FDESFVQPLD 112
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN + L++ +G N W S +EAF Q + +AG
Sbjct: 113 NNDNTGEK------------NELATLIGNFNPSWDYEGGS----DEAFFQAVSVAGMILE 156
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ Y R A V E +A+ +DP+ I+VL F P + L E + I
Sbjct: 157 NKFERY-RGNERADKRVEEVLAK---HDPASRILVLPEFIPCQKALSETD--------IA 204
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V++ +RG + P + E + PA+W GL +EL GIPG +F H
Sbjct: 205 FVIFPSNRG------GFCIQPQKREYSMNYKCSFPAEWLGLEGEELVNATGIPGAIFCHK 258
Query: 323 SGFIGGNQSYGGALAMARAALKL 345
GFI + A+ AL L
Sbjct: 259 GGFIMTVKEQDEAVKACEKALSL 281
>gi|291526064|emb|CBK91651.1| Uncharacterized conserved protein related to MYG1 family
[Eubacterium rectale DSM 17629]
Length = 414
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 133/323 (41%), Gaps = 62/323 (19%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++ + N +I +R V DD G V+D +DHH
Sbjct: 17 THGGKFHADDVFSSALLL----YINPEISITRGNSVPDDF-----TGIVFDIGRGEFDHH 67
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
QK + +G ++ GL+++ G +++ KEL V ++F++ +D
Sbjct: 68 QKD-SRIRENG--VPYAAFGLLWEAVGADILGKELAVK------------FDESFVQPLD 112
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN L++ +G N W S +EAF Q + +AG
Sbjct: 113 NNDN------------IGEKNELATLIGNFNPSWDYEGGS----DEAFFQAVSVAGMILE 156
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ Y R A V E +A+ +DP+ I+VL F P + L E + I
Sbjct: 157 NKFERY-RGNERADKRVEEVLAK---HDPASRILVLPEFIPCQKALSETD--------IA 204
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V++ +RG + P + E + PA+W GL +EL GIPG +F H
Sbjct: 205 FVIFPSNRG------GFCIQPQKREYSMNYKCSFPAEWLGLEGEELVNATGIPGAIFCHK 258
Query: 323 SGFIGGNQSYGGALAMARAALKL 345
GFI + A+ AL L
Sbjct: 259 GGFIMTVKEQDEAVKACEKALSL 281
>gi|384485100|gb|EIE77280.1| hypothetical protein RO3G_01984 [Rhizopus delemar RA 99-880]
Length = 87
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 54/83 (65%), Gaps = 1/83 (1%)
Query: 28 HNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQ 87
H + DEAL +++R T +F AQ+VRSRDP +L D ++DVGG +DP +DHHQ
Sbjct: 6 HFFIYKSDEALAVYLLRQTKEFKEAQLVRSRDPNLLSKCDVLVDVGGEFDPDRLRFDHHQ 65
Query: 88 KGFEEVFGHGFSTKLSSAGLVYK 110
K F E F G KLSSAGLVYK
Sbjct: 66 KDFSESFIGG-DIKLSSAGLVYK 87
>gi|328859013|gb|EGG08123.1| hypothetical protein MELLADRAFT_84868 [Melampsora larici-populina
98AG31]
Length = 197
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 54/91 (59%)
Query: 70 LDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVG Y +DHHQ+GF E + TKLSS GL+YK+ GK++IA L ++
Sbjct: 98 FDVGAEYKTDAHRHDHHQRGFNETYPTSHFTKLSSTGLIYKYSGKQIIATHLKLESDDKS 157
Query: 130 VHRLFLAVYKNFMEAIDAIDNGINQYDTDKP 160
+ L +Y +F+EAID +DNGI QY+ P
Sbjct: 158 LPILMAKMYDDFVEAIDGVDNGITQYEAVNP 188
>gi|291527196|emb|CBK92782.1| Uncharacterized conserved protein related to MYG1 family
[Eubacterium rectale M104/1]
Length = 414
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 133/323 (41%), Gaps = 62/323 (19%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++ + N +I +R V DD G V+D +DHH
Sbjct: 17 THGGKFHADDVFSSALLL----YINPEISITRGNSVPDDF-----TGIVFDIGRGEFDHH 67
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
QK + +G ++ GL+++ G +++ KEL V ++F++ +D
Sbjct: 68 QKD-SRIRENG--VPYAAFGLLWEAVGADILGKELAVK------------FDESFVQPLD 112
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN + L++ +G N W S +EAF Q + +AG
Sbjct: 113 NNDNTGEK------------NELATLIGNFNPSWDYEGGS----DEAFFQAVSVAGMILE 156
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ Y R A V E +A+ +DP+ I+VL F P + L E + I
Sbjct: 157 NKFERY-RGNERADKRVEEVLAK---HDPASRILVLPEFIPCQKALSETD--------IA 204
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEAGIPGCVFVHM 322
+V++ +RG + P + E + PA+W GL +EL GI G +F H
Sbjct: 205 FVIFPSNRG------GFCIQPQKREYSMNYKCSFPAEWLGLEGEELVNATGISGAIFCHK 258
Query: 323 SGFIGGNQSYGGALAMARAALKL 345
GFI + A+ AL L
Sbjct: 259 GGFIMTVKEQDEAVKACEKALSL 281
>gi|83951498|ref|ZP_00960230.1| hypothetical protein ISM_13085 [Roseovarius nubinhibens ISM]
gi|83836504|gb|EAP75801.1| hypothetical protein ISM_13085 [Roseovarius nubinhibens ISM]
Length = 305
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 147/320 (45%), Gaps = 29/320 (9%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + L+ + A IVRSR+ + + + DVG YD + +D
Sbjct: 8 THSGGFHADELLSS--VVLSRLYPEATIVRSREAEWITPGAGRIIYDVGRGYDAAARLFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++ H+G++ + + L V E D+ + A + F+
Sbjct: 66 HHQRP-NPLREDG--QPYSSFGLIWHHYGRDYL-RALAVPEA--DIEEVHAAFDRGFVLP 119
Query: 145 IDAIDNG-INQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGK 203
+D +DNG I+ + + L + L + + ++A ++ AF + + +A +
Sbjct: 120 VDLLDNGAIDPSEAGA----LAGMTLPELLETLKPVFDDRGEAA--DDRAFAEALPVA-R 172
Query: 204 EFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
F++ + A ++V E I +G VL+ P + E+ +
Sbjct: 173 AFVEAGVRARAAKRRAEAMVSEAIVA------AGASRVLE--LPMGMPFRSAVEKAGADH 224
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
L+ +V++ RG+ W + + S FE+R LP W GL D L + +G+ G F H
Sbjct: 225 LL-FVVHP--RGEDWALTTIRKSGGSFENRADLPEAWAGLTDVALEEASGVAGAKFCHNG 281
Query: 324 GFIGGNQSYGGALAMARAAL 343
FI + +A+A A+
Sbjct: 282 RFIAVAATREAVMALADLAV 301
>gi|226324092|ref|ZP_03799610.1| hypothetical protein COPCOM_01870 [Coprococcus comes ATCC 27758]
gi|225207641|gb|EEG89995.1| hypothetical protein COPCOM_01870 [Coprococcus comes ATCC 27758]
Length = 485
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 129/305 (42%), Gaps = 62/305 (20%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH D+ ++ + N +I R KV +D D + ++D YDHH
Sbjct: 100 THSGKFHADDVFSSALLL----YLNPEITILRGNKVPEDFDGI-----IFDIGRGRYDHH 150
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA-I 145
QK + +G ++ GL+++ G E++ +EL F EA +
Sbjct: 151 QKD-SRIRENGIP--YAAFGLLWEELGTEILGEELAA----------------KFDEAFV 191
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEF 205
+DN N + ++ L+S +G N W E + +EAF + + +AG
Sbjct: 192 QPLDNNDNTGEKNE---------LASLIGSFNPSWDEDGGT----DEAFFRAVSVAGMIL 238
Query: 206 LDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLI 265
+ Y+ + + I E I E + + I+VL F P + L E + I
Sbjct: 239 DNKFARYLGNERADKRI--EEILETQNPEADSRILVLPEFIPCQKRLSETD--------I 288
Query: 266 KYVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEAGIPGCVFVH 321
+V++ +RG + P + E + P++W GL ++EL KE G+ F H
Sbjct: 289 AFVIFPSNRG------GYCIQPQKKEYSLNYKCSFPSEWLGLENEELQKETGLSSASFCH 342
Query: 322 MSGFI 326
GF+
Sbjct: 343 KGGFL 347
>gi|374587973|ref|ZP_09661063.1| metal-dependent protein hydrolase [Leptonema illini DSM 21528]
gi|373872661|gb|EHQ04657.1| metal-dependent protein hydrolase [Leptonema illini DSM 21528]
Length = 304
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 145/338 (42%), Gaps = 64/338 (18%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVL---------------- 70
TH+G+ H D+ L + RL + + +++R+RD ++ D+ VL
Sbjct: 9 THSGTGHADDLLAFSLARLL--WPDIRLIRTRDFRITDEGIDVLAGEGKGMHRDGTFVVA 66
Query: 71 DVGGVYDPSNDCYDHHQKGFE-EVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPD 129
DVGG+Y P YDHHQ G GH + S+AGL +KH G++++ + LN D
Sbjct: 67 DVGGIYAPERRIYDHHQPGSPVRPDGHPY----SAAGLFFKHHGRDIL-RNLNADASDEV 121
Query: 130 VHRLFLAVYKNFMEAIDAIDN-GINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAE 188
++ F + + + +D DN G + Y+ D T++ + + P A+
Sbjct: 122 INDAFRDIDEAILLPVDLSDNTGCSFYEPD--------TDVKGFMFSDYIALARPGLIAD 173
Query: 189 RENEAFQQGMDLAGKEFLDTVRFYVRSWLPA--RSIVVECIAERYDYDPSGEIMVLKRFC 246
E A M A +R ++ +L A R ++ + ER E++ + F
Sbjct: 174 FEETA----MGCAA-----ILRIGMQQYLDANYRGRQLKKLGERC----GEEVLAVDAFY 220
Query: 247 PWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDD 306
P E ++ +++ + G+ W R+ P WRG D
Sbjct: 221 PSARRYLETTP-------VRILVHPNPEGR-WNA--------RWVKDGLFPHAWRGRIDG 264
Query: 307 ELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
EL K +GI G VF H SG + +S A+ +A AL+
Sbjct: 265 ELEKASGISGAVFCHRSGHLAVAKSRDEAVLLAELALR 302
>gi|149031934|gb|EDL86846.1| MYG1 protein, isoform CRA_b [Rattus norvegicus]
Length = 110
Score = 74.3 bits (181), Expect = 8e-11, Method: Composition-based stats.
Identities = 38/65 (58%), Positives = 50/65 (76%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R+GTHNG+FHCDEAL C ++RL ++ NA+IVR+RDP+ L D V+DVGG Y+P Y
Sbjct: 46 RIGTHNGTFHCDEALACALLRLLPEYRNAEIVRTRDPEKLALCDIVVDVGGEYNPQRHRY 105
Query: 84 DHHQK 88
DHHQ+
Sbjct: 106 DHHQR 110
>gi|384454286|ref|YP_005666881.1| hypothetical protein CPS0B_0281, partial [Chlamydophila psittaci
02DC15]
gi|334694043|gb|AEG87260.1| conserved hypothetical protein [Chlamydophila psittaci 02DC15]
Length = 200
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 69/129 (53%), Gaps = 19/129 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +++R+R+P+ L + + V DVGGVY P
Sbjct: 5 RSVGTHDGSFHADEVTACALLILFDLVDEEKVIRTRNPEKLAECEYVCDVGGVYSPDQKR 64
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V H+ KE + ++++E H H L +
Sbjct: 65 FDHHQVSYE--------GPWSSAGMVL-HYLKE--QRLIDLEEYHFLNHTL--------I 105
Query: 143 EAIDAIDNG 151
ID DNG
Sbjct: 106 HGIDEQDNG 114
>gi|345894489|gb|AEO20105.1| hypothetical protein VASRS_30 [Variovorax sp. SRS16]
Length = 154
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 72/139 (51%), Gaps = 13/139 (9%)
Query: 19 QTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDP 78
Q P+ + TH+GSFH D+ C + L F A +VR+R+P+ + +DVGG++DP
Sbjct: 4 QNPV--IATHSGSFHADDVAACAV--LAKLFPAATLVRTRNPEFIRRAQFAVDVGGIWDP 59
Query: 79 SNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGK---ELIAKELNVDEGHPDVHRLFL 135
N +DHHQKGF G +SAGLV+ G+ + +A +L + R+
Sbjct: 60 VNGRFDHHQKGFVGARSSG--VVYASAGLVWAAHGQAYVQAVAPKLTPLQA----ARVAS 113
Query: 136 AVYKNFMEAIDAIDNGINQ 154
++ M+ +D D G Q
Sbjct: 114 SIDDELMQHLDMADTGAAQ 132
>gi|329942574|ref|ZP_08291384.1| hypothetical protein G5Q_0271 [Chlamydophila psittaci Cal10]
gi|407456453|ref|YP_006735026.1| hypothetical protein B600_0298 [Chlamydia psittaci VS225]
gi|328815484|gb|EGF85472.1| hypothetical protein G5Q_0271 [Chlamydophila psittaci Cal10]
gi|405783714|gb|AFS22461.1| hypothetical protein B600_0298 [Chlamydia psittaci VS225]
Length = 198
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 69/129 (53%), Gaps = 19/129 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +++R+R+P+ L + + V DVGGVY P
Sbjct: 5 RSVGTHDGSFHADEVTACALLILFDLVDEEKVIRTRNPEKLAECEYVCDVGGVYSPDQKR 64
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V H+ KE + ++++E H H L +
Sbjct: 65 FDHHQVSYE--------GPWSSAGMVL-HYLKE--QRLIDLEEYHFLNHTL--------I 105
Query: 143 EAIDAIDNG 151
ID DNG
Sbjct: 106 HGIDEQDNG 114
>gi|407453751|ref|YP_006732859.1| hypothetical protein B595_0289 [Chlamydia psittaci 84/55]
gi|405780510|gb|AFS19260.1| hypothetical protein B595_0289 [Chlamydia psittaci 84/55]
Length = 201
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 69/129 (53%), Gaps = 19/129 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +++R+R+P+ L + + V DVGGVY P
Sbjct: 8 RSVGTHDGSFHADEVTACALLILFDLVDEEKVIRTRNPEKLAECEYVCDVGGVYSPDQKR 67
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V H+ KE + ++++E H H L +
Sbjct: 68 FDHHQVSYE--------GPWSSAGMVL-HYLKE--QRLIDLEEYHFLNHTL--------I 108
Query: 143 EAIDAIDNG 151
ID DNG
Sbjct: 109 HGIDEQDNG 117
>gi|112253647|gb|ABI14408.1| uncharacterized protein UPF0160 [Prorocentrum minimum]
Length = 119
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 45/66 (68%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+K +GTH+G FHCDEALGC M+++ + + +VR+R+ K LD D V+DVG VYD S
Sbjct: 53 VKLIGTHDGVFHCDEALGCAMLQMMPAWAGSTVVRTRNEKELDKCDIVIDVGAVYDHSKM 112
Query: 82 CYDHHQ 87
Y H Q
Sbjct: 113 RYYHTQ 118
>gi|407455081|ref|YP_006733972.1| hypothetical protein B598_0283 [Chlamydia psittaci GR9]
gi|405781624|gb|AFS20373.1| hypothetical protein B598_0283 [Chlamydia psittaci GR9]
Length = 201
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 19/129 (14%)
Query: 23 KRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDC 82
+ VGTH+GSFH DE C ++ L D +++R+R+P+ L + + V DVGGVY P
Sbjct: 8 RSVGTHDGSFHADEVTACALLILFDLVDEEKVIRTRNPEKLAECEYVCDVGGVYSPDQKR 67
Query: 83 YDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFM 142
+DHHQ +E SSAG+V H+ KE + ++++E H FL + +
Sbjct: 68 FDHHQVSYE--------GPWSSAGMVL-HYLKE--QRLIDLEEYH------FLN--RTLI 108
Query: 143 EAIDAIDNG 151
ID DNG
Sbjct: 109 HGIDEQDNG 117
>gi|357469219|ref|XP_003604894.1| Resistance protein, partial [Medicago truncatula]
gi|355505949|gb|AES87091.1| Resistance protein, partial [Medicago truncatula]
Length = 194
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/38 (78%), Positives = 34/38 (89%)
Query: 297 PAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGG 334
P+QWRGLRDD LSKE+GIPGCV HMSGFIGGNQ++ G
Sbjct: 107 PSQWRGLRDDILSKESGIPGCVLSHMSGFIGGNQTFEG 144
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 21/24 (87%), Positives = 22/24 (91%)
Query: 248 WKLHLFELEEEMKIEPLIKYVLYE 271
WKLHLFELE EMKI+P IKYVLYE
Sbjct: 3 WKLHLFELEGEMKIDPPIKYVLYE 26
>gi|153813495|ref|ZP_01966163.1| hypothetical protein RUMOBE_03915 [Ruminococcus obeum ATCC 29174]
gi|149830439|gb|EDM85531.1| hypothetical protein RUMOBE_03915 [Ruminococcus obeum ATCC 29174]
Length = 410
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 138/331 (41%), Gaps = 72/331 (21%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH D+ ++ + N +I +R +V +D D + V+D YDHH
Sbjct: 17 THSGKFHADDVFSAALLL----YLNPEITITRGNRVPEDFDGI-----VFDIGRGRYDHH 67
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA-I 145
QK V +G + ++ GL+++ G E++ +EL + F EA +
Sbjct: 68 QKD-SRVRENGIA--YAAFGLLWEELGSEILGEELA----------------QKFDEAFV 108
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAEREN-EAFQQGMDLAG-- 202
+DN N + ++ L+S +G N W AE N EAF Q + +AG
Sbjct: 109 QPLDNNDNTGEKNE---------LASLIGNFNPTW-----DAEGSNDEAFFQAVSVAGMI 154
Query: 203 -KEFLDTVRFYVRSWLPARSIV---VECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+ D R R+ I ++ +R + +I++L F P + L E
Sbjct: 155 LENKFDRYRGNERADKRVEEIYERHMQAFHDREKHSEDAKILILPEFVPCQKFLSETP-- 212
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEAGI 314
+ +V++ +RG + P + E + P++W GL ++EL + G+
Sbjct: 213 ------VAFVIFPSNRG------GYCIQPQKKEYSMNYKCSFPSEWLGLENEELQEVTGL 260
Query: 315 PGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
F H GF+ G LA A A K+
Sbjct: 261 KTAGFCHKGGFLMTT----GELADAVQACKI 287
>gi|225027441|ref|ZP_03716633.1| hypothetical protein EUBHAL_01697 [Eubacterium hallii DSM 3353]
gi|224955176|gb|EEG36385.1| hypothetical protein EUBHAL_01697 [Eubacterium hallii DSM 3353]
Length = 402
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 78/327 (23%), Positives = 131/327 (40%), Gaps = 62/327 (18%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH D+ ++ + N I +R KV D D + V+D YDHH
Sbjct: 17 THSGKFHADDVFSYALLL----YLNPAITITRGNKVPKDFDGI-----VFDIGRGKYDHH 67
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKNFMEA 144
Q+ + +G ++ GL+++ G E++ +EL DE +F++
Sbjct: 68 QRD-SRIRENG--VPYAAFGLLWEELGAEILGEELAAKFDE--------------SFIQP 110
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D DN + L++ +G N W + ENEAF + + AG
Sbjct: 111 LDINDNTGEK------------NELATLIGNFNPSWDVENG----ENEAFSRAVQTAGMI 154
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGE-------IMVLKRFCPWKLHLFELEE 257
++ Y + A V E +A SGE I+VL F P + L E +
Sbjct: 155 LVNMFEKY-KGNERAEKRVEEILAAHNSSVLSGEKSEIEAKILVLPEFVPCQKQLRETD- 212
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGC 317
I ++++ +RG + +Q + + P W GL DEL + G+
Sbjct: 213 -------IAFIIFPSNRGG-YCIQPLK-KEHSLNYKCSFPENWLGLERDELKQATGLTSA 263
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALK 344
F H GFI A++ + +L+
Sbjct: 264 NFCHKGGFIMTVDDVNDAISACKISLE 290
>gi|224541280|ref|ZP_03681819.1| hypothetical protein CATMIT_00440 [Catenibacterium mitsuokai DSM
15897]
gi|224525799|gb|EEF94904.1| hypothetical protein CATMIT_00440 [Catenibacterium mitsuokai DSM
15897]
Length = 427
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 59/106 (55%), Gaps = 3/106 (2%)
Query: 239 IMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPA 298
IM+L++F PWK L + E K+ I ++ RG W Q V +SP F++R +P
Sbjct: 317 IMILEQFVPWKGTLLK-SESNKVND-IYLTVFPSQRGG-WNFQGVPLSPASFDTRIKVPE 373
Query: 299 QWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARAALK 344
+W G RD EL G+ G F+H GFIGG +++ + +A+ ++
Sbjct: 374 EWCGKRDQELVDLTGVEGARFIHPGGFIGGAENFESIMELAQRIVE 419
Score = 41.6 bits (96), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 15/125 (12%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH GSFH D+ + ++ + + V D K D DA VYD YDHH
Sbjct: 14 THAGSFHADDVMATVLLEILYEDIPLARVAEIDEK---DTDAF-----VYDIGLGKYDHH 65
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q ++ SS GL+++ +G +++ K L + + D F + + ID
Sbjct: 66 Q---DDKVRRDNGIAYSSVGLIWRDYGIQILEK-LGIQDYLED---YFYDIDDQIIMPID 118
Query: 147 AIDNG 151
A+DNG
Sbjct: 119 ALDNG 123
>gi|295109384|emb|CBL23337.1| Uncharacterized conserved protein related to MYG1 family
[Ruminococcus obeum A2-162]
Length = 410
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 133/326 (40%), Gaps = 70/326 (21%)
Query: 12 YSTSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLD 71
Y PS + TH+G FH D+ ++ + N +I+ +R +V +D D +
Sbjct: 6 YQIHRPSASAF----THSGKFHADDVFSAALLL----YLNPEIMITRGNRVPEDFDGI-- 55
Query: 72 VGGVYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVH 131
V+D YDHHQK +++ G+ Y FG L+ +EL D ++
Sbjct: 56 ---VFDIGRGQYDHHQK----------DSRIRENGIPYAAFG--LLWEELGADILGEELA 100
Query: 132 RLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAEREN 191
+ F ++F++ +D DN + L++ +G N W AE N
Sbjct: 101 QKF---DESFVQPLDNNDNTGEK------------NELATLIGNFNPTW-----DAEGSN 140
Query: 192 -EAFQQGMDLAG---KEFLDTVRFYVRSWLPARSIVV---ECIAERYDYDPSGEIMVLKR 244
EAF Q + +AG + + R R+ I + + +R + I++L
Sbjct: 141 DEAFFQAVSVAGMILENKFERFRGNERADKRVEEIYAHHEQAVHDREKHRDDARILILPE 200
Query: 245 FCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQW 300
F P + L E E I +V++ +RG + P + E + P +W
Sbjct: 201 FVPCQKFLSETE--------IAFVIFPSNRG------GYCIQPQKKEYSMNYKCSFPVEW 246
Query: 301 RGLRDDELSKEAGIPGCVFVHMSGFI 326
GL ++EL+ G+ F H GF+
Sbjct: 247 LGLENEELAAVTGLKSAGFCHKGGFL 272
>gi|124004066|ref|ZP_01688913.1| metal-dependent protein hydrolase [Microscilla marina ATCC 23134]
gi|123990645|gb|EAY30125.1| metal-dependent protein hydrolase [Microscilla marina ATCC 23134]
Length = 289
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 132/303 (43%), Gaps = 59/303 (19%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNA--QIVRSRDPKVLDDLDA-----VLDVGGVYDPS 79
THNGSFH DE F + + K A I R+R P +L + V+DVG VYDP+
Sbjct: 10 THNGSFHADE---VFAVAILQKLKGAPLHITRTRHPDLLGKVVKNSNVLVVDVGLVYDPA 66
Query: 80 NDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYK 139
++ +DHHQ + S+AGLV++HFG +L NVD + L V +
Sbjct: 67 HNNFDHHQDRHLK----------SAAGLVWEHFGSQLCGN--NVD--------VALLVQE 106
Query: 140 NFMEAIDAID---NGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQ 196
+ ++ +D D N I + + P T +S +G N EPD +++ E F +
Sbjct: 107 SLIDLVDQTDTNQNNILKTIDEHLPGGTAGT-VSGLIGAFN---REPDND-DKQMEQFDK 161
Query: 197 GMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP-WKLHLFEL 255
+ +A FL+ + + + I R P +++ L++ C WK
Sbjct: 162 AVKMAAT-FLENTLYEAEKIVAQKPI----WEARQMLTP--QVVRLEQHCKGWK------ 208
Query: 256 EEEMKIEPLIKYVLY------EDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELS 309
+EP I++ L E G+QW++ + + + + A+ D E +
Sbjct: 209 -RWAALEPGIRFCLIPRASKNELIEGQQWQLTTIDAEAHPLPTLETMQAEVSNANDIEFA 267
Query: 310 KEA 312
+A
Sbjct: 268 HKA 270
>gi|225374631|ref|ZP_03751852.1| hypothetical protein ROSEINA2194_00247 [Roseburia inulinivorans DSM
16841]
gi|225213524|gb|EEG95878.1| hypothetical protein ROSEINA2194_00247 [Roseburia inulinivorans DSM
16841]
Length = 410
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 81/335 (24%), Positives = 133/335 (39%), Gaps = 70/335 (20%)
Query: 24 RVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCY 83
R TH G FH D+ ++ + N +I R +V DD D + V+D Y
Sbjct: 14 RAFTHGGKFHADDVFSAALLF----YINPEITILRGNRVPDDFDGI-----VFDIGRGAY 64
Query: 84 DHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKNF 141
DHHQ+ V +G ++ GL+++ G E++ +EL DE F
Sbjct: 65 DHHQRD-SRVRENG--VPYAAFGLLWEAVGAEILGEELAEEFDEA--------------F 107
Query: 142 MEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
++ +D DN + L++ +G N W + +EAF Q + +A
Sbjct: 108 VQPLDHNDNTGEK------------NELANLIGNFNPTW----DAQGGNDEAFFQAVSVA 151
Query: 202 GKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGE-------IMVLKRFCPWKLHLFE 254
G + Y R A V E + E SG+ I++L F P + L E
Sbjct: 152 GMILENKFERY-RGNERADRRVEEILEEHRQAVTSGKRDSEDAKILILPEFVPCQKRLSE 210
Query: 255 LEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSK 310
E I +V++ +RG + P + E + PA+W GL ++EL +
Sbjct: 211 TE--------IAFVIFPSNRG------GYCIQPQKKEYSMNYKCSFPAEWLGLENEELEQ 256
Query: 311 EAGIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
G+ F H GF+ A+ R +++L
Sbjct: 257 VTGLQSAGFCHKGGFLMTVGMLEDAVKACRISMEL 291
>gi|365131410|ref|ZP_09341822.1| hypothetical protein HMPREF1032_02901 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363618779|gb|EHL70120.1| hypothetical protein HMPREF1032_02901 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 274
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 125/319 (39%), Gaps = 55/319 (17%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++++ N I R V G V+D + +DHH
Sbjct: 9 THGGKFHADDVFSAALLKIC----NPNIAIQRGFAVPQGF-----AGLVFDIGDGPFDHH 59
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
K V +G ++ GL+++ G +L+ VD G D ++F++ +D
Sbjct: 60 AKN-SPVRENG--VPYAAFGLLWRELGPQLVGP---VDAGRFD---------ESFVQPLD 104
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN L++ + N W D R ++ F Q + LA ++ L
Sbjct: 105 LDDN------------TGCGNQLANIIAAYNPRWDGED----RPDDCFAQAVALA-QDML 147
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+RS A + V E + I+ L RF PWK L I +
Sbjct: 148 AHKLEAIRSVQRAAAEVNEALGRMKR-----RIVRLSRFAPWKQQL--------IPSKAR 194
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
+V+Y RG W QAV S + P PA W G + EL +GI F H F+
Sbjct: 195 FVVYPSQRGG-WAAQAVPASFGSPALKVPFPAHWAGASEQELPGLSGIETLRFCHAGRFL 253
Query: 327 GGNQSYGGALAMARAALKL 345
+ A+A AA++L
Sbjct: 254 VTAGTEEDAVAACEAAMEL 272
>gi|170040639|ref|XP_001848100.1| MYG1 [Culex quinquefasciatus]
gi|167864239|gb|EDS27622.1| MYG1 [Culex quinquefasciatus]
Length = 344
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/129 (37%), Positives = 59/129 (45%), Gaps = 13/129 (10%)
Query: 220 RSIVVECIAERYDYDPSGEIM-------VLKRFCPWKLHLFELEEEMKIEPLIKYVLYED 272
RS C R G I+ V FCP K HL+ELE E I L KYV+Y
Sbjct: 76 RSSSTRCFTTRTVGGRQGAIVEKAVRNRVEVHFCPCKEHLYELEGEHGIGGLPKYVIY-F 134
Query: 273 DRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSY 332
R WRV V + F K WRG RD++L + +G G F H +GF GGN++
Sbjct: 135 KRPNDWRVICVPLETASFVCHK-----WRGKRDNKLEEVSGTEGANFCHQTGFNGGNRTR 189
Query: 333 GGALAMARA 341
G A R
Sbjct: 190 EGCAADCRG 198
>gi|402585641|gb|EJW79580.1| hypothetical protein WUBG_09509, partial [Wuchereria bancrofti]
Length = 66
Score = 69.7 bits (169), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 45/66 (68%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSND 81
+ ++GTH+G+FHCDE F+++ ++ N ++VRSRD VL + V+DVGG Y+ +
Sbjct: 1 MPKIGTHDGNFHCDEVFAIFLLKSLPEYNNYEVVRSRDKDVLSLCNIVVDVGGEYNHTAM 60
Query: 82 CYDHHQ 87
YDHHQ
Sbjct: 61 KYDHHQ 66
>gi|160945000|ref|ZP_02092226.1| hypothetical protein FAEPRAM212_02515 [Faecalibacterium prausnitzii
M21/2]
gi|158442731|gb|EDP19736.1| hypothetical protein FAEPRAM212_02515 [Faecalibacterium prausnitzii
M21/2]
Length = 276
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 127/318 (39%), Gaps = 56/318 (17%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++++ I +R V DD D + VYD +DHH
Sbjct: 9 THAGKFHADDVFATALLQI----LRPDIKITRGFTVPDDFDGI-----VYDIGFGMFDHH 59
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+ E ++ GL+++ G L+ + R + +NF++ +D
Sbjct: 60 QEPREY---RANGVPYAAFGLLWRVLGPGLVGE------------RQARLIDENFIQPLD 104
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN Q +L +G N W S E ++ F + + +A +
Sbjct: 105 LNDNTGEQ------------NSLCDAIGFFNPVW----DSKEDQDSCFFKAVAVAKQILE 148
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ + A + E + + Y G I+VL + PWK L++ +
Sbjct: 149 NQI-----DSANAVNRADEKVQQAYRNSRDG-IVVLPCYLPWKNGLYKTDA--------L 194
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
+V+Y RG W Q V + +S+ P P W G + + +++GIPG F H S F+
Sbjct: 195 FVVYPSQRGG-WSAQCVTDHKTK-KSKLPFPQSWAGQPQEVIEQKSGIPGISFCHASRFL 252
Query: 327 GGNQSYGGALAMARAALK 344
+ ALA R LK
Sbjct: 253 ITAKDKETALAACRQVLK 270
>gi|167771096|ref|ZP_02443149.1| hypothetical protein ANACOL_02450 [Anaerotruncus colihominis DSM
17241]
gi|167666766|gb|EDS10896.1| hypothetical protein ANACOL_02450 [Anaerotruncus colihominis DSM
17241]
Length = 288
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 132/322 (40%), Gaps = 60/322 (18%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAV-LDVGGVYDPSNDCYDH 85
TH G FH D+ ++RL N +I R ++ ++ D + D+GG +DH
Sbjct: 10 THGGRFHADDVFSSALLRL----LNPKIKIRRVMRLPENFDGLAFDIGG------GDFDH 59
Query: 86 HQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAI 145
HQ G V +G ++ GL+++ FG LI PD ++F++ +
Sbjct: 60 HQNG-APVRENG--VPYAAFGLLWRAFGDRLI----------PDAEEA-ARFDQHFIQPL 105
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEF 205
D DN +T + + N W S E + F + + A +
Sbjct: 106 DLDDNTGCGSET------------AGLIALFNPGW----DSGEPADRCFWEAVSFA-QVI 148
Query: 206 LDTVRFYVRSWLPARSIVVECI--AERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEP 263
L+ VRS ARS V E + ER ++ L+RF PWK L + +
Sbjct: 149 LEKRLESVRSMCRARSYVEEALRGMER-------GVVTLERFAPWKAVLSASDAQ----- 196
Query: 264 LIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMS 323
+V+Y RG + Q V + + P PA W G ++EL K +GI F H +
Sbjct: 197 ---FVVYPSQRGG-FSAQGVPADDETHALKIPFPADWAGRSEEELPKLSGIATLRFCHNN 252
Query: 324 GFIGGNQSYGGALAMARAALKL 345
F+ + A+A AA K+
Sbjct: 253 RFLIAAGTRSDAVAACHAAQKV 274
>gi|295104529|emb|CBL02073.1| Uncharacterized conserved protein related to MYG1 family
[Faecalibacterium prausnitzii SL3/3]
Length = 276
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 127/318 (39%), Gaps = 56/318 (17%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++++ I +R V DD D + VYD +DHH
Sbjct: 9 THAGKFHADDVFATALLQI----LRPDIKITRGFTVPDDFDGI-----VYDIGFGMFDHH 59
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+ E ++ GL+++ G L+ + R + +NF++ +D
Sbjct: 60 QEPREY---RPNGVPYAAFGLLWRVLGPGLVGE------------RQARLIDENFIQPLD 104
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN Q +L +G N W S E ++ F + + +A +
Sbjct: 105 LNDNTGEQ------------NSLCDAIGFFNPVW----DSKEDQDSCFFKAVAVAKQILE 148
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ + A + E + + Y G I+VL + PWK L++ +
Sbjct: 149 NQI-----DSANAVNRADEKVQQAYRNSRDG-IVVLPCYLPWKNGLYKTDA--------L 194
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
+V+Y RG W Q V + +S+ P P W G + + +++GIPG F H S F+
Sbjct: 195 FVVYPSQRGG-WSAQCVTDHKTK-KSKLPFPQSWAGQPQEVIEQKSGIPGISFCHASRFL 252
Query: 327 GGNQSYGGALAMARAALK 344
+ ALA R LK
Sbjct: 253 ITAKDKETALAACRQVLK 270
>gi|313113049|ref|ZP_07798686.1| conserved hypothetical protein [Faecalibacterium cf. prausnitzii
KLE1255]
gi|310624641|gb|EFQ07959.1| conserved hypothetical protein [Faecalibacterium cf. prausnitzii
KLE1255]
Length = 275
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 128/318 (40%), Gaps = 56/318 (17%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++++ I +R V DD D + VYD +DHH
Sbjct: 9 THAGKFHADDVFATALLQI----LRPDIKITRGFVVPDDFDGI-----VYDVGFGMFDHH 59
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+ E +G ++ GL+++ G L+ + R + +NF++ +D
Sbjct: 60 QEP-RECRPNG--VPYAALGLLWRVLGPGLVGE------------RQARLIDENFIQPLD 104
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN Q +L +G N W S E ++ F + + +A +
Sbjct: 105 LNDNTGEQ------------NSLCDAIGFFNPVW----DSKEDQDTCFFKAVAVAKQILE 148
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ + + A + E + + Y G I+VL + PWK L++ +
Sbjct: 149 NQI-----ASANAVNRADEKVQQAYKNSRDG-IVVLPCYLPWKNGLYKTDA--------L 194
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
+V+Y RG W Q V + + P P W G + + +++GIPG F H S F+
Sbjct: 195 FVVYPSQRGG-WSAQCVTDHKTK-RPKLPFPQSWAGQPQEVIEQKSGIPGISFCHASRFL 252
Query: 327 GGNQSYGGALAMARAALK 344
+ ALA R LK
Sbjct: 253 ITAKDKETALAACRQVLK 270
>gi|375337559|ref|ZP_09778903.1| metal-dependent protein hydrolase [Succinivibrionaceae bacterium
WG-1]
Length = 207
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 10/130 (7%)
Query: 222 IVVECIAERYDYDP------SGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRG 275
I++ C+ D + GE+++L PW ++ E K K +Y D
Sbjct: 79 ILINCLKNARDEEKVIASYKGGELLILSEKLPWFDTVYNNWELFKD---CKLAIYPDHNK 135
Query: 276 KQWRVQAVAVS-PDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGG 334
+ WR+Q++ S RFE+R P W G +EL+ AGI FVH +GF GG +++
Sbjct: 136 EAWRIQSLPASLSARFENRCSAPKSWHGKEGEELNALAGIKSATFVHKAGFTGGAKTFED 195
Query: 335 ALAMARAALK 344
MA+ +K
Sbjct: 196 VQIMAQNWIK 205
>gi|153854623|ref|ZP_01995873.1| hypothetical protein DORLON_01868 [Dorea longicatena DSM 13814]
gi|149752727|gb|EDM62658.1| hypothetical protein DORLON_01868 [Dorea longicatena DSM 13814]
Length = 410
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 76/328 (23%), Positives = 130/328 (39%), Gaps = 67/328 (20%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++ + N +I +R +V ++ D + V+D YDHH
Sbjct: 19 THGGKFHADDIFSSALLL----YLNPEIQITRGNQVPEEYDGI-----VFDIGRGAYDHH 69
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL--NVDEGHPDVHRLFLAVYKNFMEA 144
QK V +G ++ GL+++ G E++ +EL DE +F++
Sbjct: 70 QKD-SRVRENGIP--YAAFGLLWEELGTEILGEELAEKFDE--------------SFVQP 112
Query: 145 IDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKE 204
+D DN + L+S +G N W + ++AF + +AG
Sbjct: 113 LDQNDNTGEK------------NELASLIGSFNPAW----DAKGNHDDAFFHAVSMAGM- 155
Query: 205 FLDTVRFYVRSWLPARSIVVECIAERYDYDPSGE----IMVLKRFCPWKLHLFELEEEMK 260
L++ R A + E + D G+ I++L F P + L E E
Sbjct: 156 ILESKFERFRGNERADRKIEEILEAHDDAVEEGKCDERILILPEFVPCQKRLSETE---- 211
Query: 261 IEPLIKYVLYEDDRGKQWRVQAVAVSPDRFE----SRKPLPAQWRGLRDDELSKEAGIPG 316
I +V++ +RG + P + E + P W GL + L K G+ G
Sbjct: 212 ----IAFVIFPSNRG------GYCIQPQKKEFSMNYKCAFPEAWLGLEGEALQKATGLSG 261
Query: 317 CVFVHMSGFIGGNQSYGGALAMARAALK 344
F H GF+ ++ A+ +LK
Sbjct: 262 AGFCHKGGFLMSTENLEDAVKACEISLK 289
>gi|240144809|ref|ZP_04743410.1| putative metal-dependent protein hydrolase [Roseburia intestinalis
L1-82]
gi|257203145|gb|EEV01430.1| putative metal-dependent protein hydrolase [Roseburia intestinalis
L1-82]
Length = 411
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/331 (22%), Positives = 135/331 (40%), Gaps = 72/331 (21%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH D+ ++ + N +I +R KV +D D + V+D YDHH
Sbjct: 18 THSGKFHADDVFSSALLL----YLNPEITITRGSKVPEDYDGI-----VFDIGRGEYDHH 68
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA-I 145
QK + +G ++ GL+++ G ++ +EL + F EA +
Sbjct: 69 QKD-SRIRENG--VPYAAFGLLWEQLGAGILGEELA----------------QTFDEAFV 109
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEF 205
+DN N + ++ L++ +G N W +A ++AF + + +AG
Sbjct: 110 QPLDNNDNTGEKNE---------LATLIGNFNPTW----DAAGSSDDAFFKAVGVAGMIL 156
Query: 206 LDTVRFY---------VRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
+ Y + L A+ +E + D +I+VL F P + L E +
Sbjct: 157 ENKFERYLGNERADKRIEEVLEAQQKALEAGEKPED---EAKILVLPEFIPCQKRLSETD 213
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEA 312
I +V++ +RG + P + E + P +W GL ++EL +
Sbjct: 214 --------IAFVIFPSNRG------GYCIQPQKREYSMNYKCSFPEKWLGLENEELVQAT 259
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
G+ F H GF+ + A+A + +L
Sbjct: 260 GLFSAGFCHKGGFLMTAGTLEDAVAACKISL 290
>gi|295102545|emb|CBL00090.1| Uncharacterized conserved protein related to MYG1 family
[Faecalibacterium prausnitzii L2-6]
Length = 275
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 130/318 (40%), Gaps = 56/318 (17%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++++ I +R V DD D + VYD +DHH
Sbjct: 9 THAGKFHADDVFATALLQI----LRPDIKITRGFVVPDDFDGI-----VYDIGFGMFDHH 59
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+ E +G ++ GL+++ G L+ + R + +NF++ +D
Sbjct: 60 QEP-RETRPNGIP--YAAFGLLWRVLGPGLVGE------------RQARLIDENFIQPLD 104
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN Q +L +G N W S E ++ F + + +A K+ L
Sbjct: 105 LNDNTGEQ------------NSLCDAIGFFNPVW----DSKEDQDACFFKAVAVA-KQIL 147
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ + S A + E + + Y G I+VL + PWK L++ +
Sbjct: 148 EN---QIES-ANAVNRADEKVQQAYKNSRDG-IVVLPCYLPWKNGLYKTDA--------L 194
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
+V+Y RG W Q V + + + P P W G + + +++GI G F H S F+
Sbjct: 195 FVIYPSQRGG-WSAQCVTDHKTK-KPKLPFPQSWAGQPQEVIEQKSGIEGISFCHASRFL 252
Query: 327 GGNQSYGGALAMARAALK 344
+ ALA R LK
Sbjct: 253 ITAKDKETALAACRQVLK 270
>gi|257437655|ref|ZP_05613410.1| putative Metal-dependent protein hydrolase [Faecalibacterium
prausnitzii A2-165]
gi|257199962|gb|EEU98246.1| hypothetical protein FAEPRAA2165_00175 [Faecalibacterium
prausnitzii A2-165]
Length = 275
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 130/318 (40%), Gaps = 56/318 (17%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++++ I +R V DD D + VYD +DHH
Sbjct: 9 THAGKFHADDVFATALLQI----IRPDIRITRGFVVPDDFDGI-----VYDIGFGMFDHH 59
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
Q+ E +G ++ GL+++ G L+ + R + +NF++ +D
Sbjct: 60 QEP-REYRANGIP--YAAFGLLWRVLGPGLVGE------------RQARLIDENFIQPLD 104
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN Q +L +G N W S E ++ F + + +A K+ L
Sbjct: 105 LNDNTGEQ------------NSLCDAIGFFNPVW----DSKEDQDTCFFKAVAVA-KQIL 147
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ + S A + E + + Y G I++L + PWK L++ +
Sbjct: 148 EN---QIES-ANAVNRADEKVQQAYKNSRDG-IVILPCYLPWKNGLYKTDA--------L 194
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
+V+Y RG W Q V + + + P P W G + + +++GI G F H S F+
Sbjct: 195 FVIYPSQRGG-WSAQCVTDHKTK-KPKLPFPQSWAGQPQEVIEQKSGIEGISFCHASRFL 252
Query: 327 GGNQSYGGALAMARAALK 344
+ ALA R LK
Sbjct: 253 ITAKDKETALAACRQVLK 270
>gi|347532859|ref|YP_004839622.1| hypothetical protein RHOM_12910 [Roseburia hominis A2-183]
gi|345503007|gb|AEN97690.1| hypothetical protein RHOM_12910 [Roseburia hominis A2-183]
Length = 411
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 75/312 (24%), Positives = 125/312 (40%), Gaps = 68/312 (21%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++ + N +IV +R +V ++ G V+D YDHH
Sbjct: 18 THGGKFHADDVFSAALLL----YLNPEIVITRGNRVPENF-----AGIVFDIGRGRYDHH 68
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA-I 145
QK V +G ++ GL+++ G ++ +EL + F EA +
Sbjct: 69 QKD-SRVRENG--VPYAAFGLLWEELGGAILGEELAAE----------------FDEAFV 109
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEF 205
+DN N + ++ L++ +G N W S + AF Q + +AG
Sbjct: 110 QPLDNNDNTGEKNE---------LATLIGNFNPAWDAQGGS----DAAFFQAVSVAGMIL 156
Query: 206 LDTVRFYVRSWLPARSIVVECIAERYDYDPSGE-------IMVLKRFCPWKLHLFELEEE 258
+ Y R A + E +A SGE I++L F P + L E
Sbjct: 157 ENKFERY-RGNERADRQIEEVLAAHERAVVSGETAERDAAILILPYFIPCQKRLSETG-- 213
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEAGI 314
I +V++ +RG + P + E + P W GL +EL++ +G+
Sbjct: 214 ------IAFVIFPSNRG------GYCIQPQKREYSLHYKCSFPECWLGLEKEELAEASGL 261
Query: 315 PGCVFVHMSGFI 326
VF H GF+
Sbjct: 262 ESAVFCHKGGFL 273
>gi|291534568|emb|CBL07680.1| Uncharacterized conserved protein related to MYG1 family [Roseburia
intestinalis M50/1]
Length = 411
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 71/330 (21%), Positives = 132/330 (40%), Gaps = 70/330 (21%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH D+ ++ + N +I +R KV + D + V+D YDHH
Sbjct: 18 THSGKFHADDVFSSALLL----YLNPEITITRGSKVPEGYDGI-----VFDIGRGEYDHH 68
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
QK +++ G++Y FG L+ + L ++ + F + +
Sbjct: 69 QK----------DSRIRENGVLYAAFG--LLWERLGAGILGEELAQAFDEAF------VQ 110
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
+DN N + ++ L++ +G N W +A ++AF + + +AG
Sbjct: 111 PLDNNDNTGEKNE---------LATLIGNFNPTW----DAAGSSDDAFFRAVGVAGMILE 157
Query: 207 DTVRFY---------VRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEE 257
+ Y + L A+ +E + D +I+VL F P + L E +
Sbjct: 158 NKFERYLGNERADKRIEEVLEAQQKALEAGEKPED---EAKILVLPEFIPCQKRLSETD- 213
Query: 258 EMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEAG 313
I +V++ +RG + P + E + P +W GL ++EL + G
Sbjct: 214 -------IAFVIFPSNRG------GYCIQPQKREYSMNYKCSFPEKWLGLENEELVQATG 260
Query: 314 IPGCVFVHMSGFIGGNQSYGGALAMARAAL 343
+ F H GF+ + A+A + +L
Sbjct: 261 LFSAGFCHKGGFLMTAGTLEDAVAACKISL 290
>gi|218675443|ref|ZP_03525112.1| putative hydrolase protein [Rhizobium etli GR56]
Length = 139
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 65/127 (51%), Gaps = 10/127 (7%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDA--VLDVGGVYDPSNDCYD 84
TH+G FH DE L + LT F A+++RSR P+ + + DVGG YD +D
Sbjct: 8 THSGGFHADELLSSVI--LTRLFPQARLIRSRAPEWITPGQGRIIYDVGGAYDADAGIFD 65
Query: 85 HHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA 144
HHQ+ + G SS GL++KH+G++ +A + VH F A +F+
Sbjct: 66 HHQR-CAPLRDDG--QPYSSFGLIWKHYGRDYLAARGLAADHVEAVHSSFDA---SFVLP 119
Query: 145 IDAIDNG 151
ID DNG
Sbjct: 120 IDLTDNG 126
>gi|291540585|emb|CBL13696.1| Uncharacterized conserved protein related to MYG1 family [Roseburia
intestinalis XB6B4]
Length = 411
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 134/333 (40%), Gaps = 76/333 (22%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH D+ ++ + N +I +R KV + D + V+D YDHH
Sbjct: 18 THSGKFHADDVFSSALLL----YLNPEITITRGSKVPEGYDGI-----VFDIGRGEYDHH 68
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEA-I 145
QK + +G ++ GL+++ G ++ +EL + F EA +
Sbjct: 69 QKD-SRIRENG--VPYAAFGLLWERLGAGILGEELA----------------QAFDEAFV 109
Query: 146 DAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEF 205
+DN N + ++ L++ +G N W +A ++AF + + +AG
Sbjct: 110 QPLDNNDNTGEKNE---------LATLIGNFNPTW----DAAGSSDDAFFRAVGVAGMIL 156
Query: 206 LDTVRFY---------VRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELE 256
+ Y + L A+ +E + D +I+VL F P + L E +
Sbjct: 157 ENKFERYLGNGRADKRIEEVLEAQQKALEAGEKPED---EAKILVLPEFIPCQKRLSETD 213
Query: 257 EEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFES----RKPLPAQWRGLRDDELSKEA 312
I +V++ +RG + P + E + P +W GL ++EL +
Sbjct: 214 --------IAFVIFPSNRG------GYCIQPQKREYSMNYKCSFPEKWLGLENEELVQAT 259
Query: 313 GIPGCVFVHMSGFIGGNQSYGGALAMARAALKL 345
G+ F H GF+ G L A AA K+
Sbjct: 260 GLFSAGFCHKGGFL----MTAGTLEDAVAACKI 288
>gi|170058231|ref|XP_001864830.1| MYG1 [Culex quinquefasciatus]
gi|167877410|gb|EDS40793.1| MYG1 [Culex quinquefasciatus]
Length = 319
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 57/134 (42%), Gaps = 16/134 (11%)
Query: 161 PRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPAR 220
P+Y S+RV N W E + F++ EF++ V + W AR
Sbjct: 176 PKYT-----STRVRHFNPAWNATADDVEDVAKRFEKAKAYVAAEFINKVLYNANRWWSAR 230
Query: 221 SIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWRV 280
+IV + + R + FCP K HL+ELE E I L KYV+Y R WRV
Sbjct: 231 AIVEKAVRNRVEV----------HFCPCKEHLYELENEHGIGGLPKYVIY-FKRPNDWRV 279
Query: 281 QAVAVSPDRFESRK 294
V + F RK
Sbjct: 280 ICVPLETASFVCRK 293
>gi|406978297|gb|EKE00291.1| hypothetical protein ACD_22C00056G0001 [uncultured bacterium]
Length = 187
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/207 (23%), Positives = 86/207 (41%), Gaps = 30/207 (14%)
Query: 144 AIDAIDNGINQYDTDKPPRY--VNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLA 201
++DA+DNG+N PP+ +N +L + +N + E ++ A AF + +++A
Sbjct: 2 SVDALDNGVNI----NPPQISGINTYSLHHILSAINSAYGEENRDA-----AFSKALEIA 52
Query: 202 GKEFLDTVRFYVRSWLPARSIVVE---CIAERYDYDPSGEIMVLKRFCPWKLHLFELEEE 258
+ + +E + E+ + + +VL ++ W E
Sbjct: 53 S--------LVISGEIKKAEAKIEGERFVNEQIELQGKPDFLVLDKYTAW-------ESA 97
Query: 259 MKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRF-ESRKPLPAQWRGLRDDELSKEAGIPGC 317
+ +K V++ D W +Q + F R P WRGL D EL+ IP
Sbjct: 98 VSKNKKVKLVIFPDSFSTNWCLQVARDDLEVFGNDRINFPHNWRGLSDHELALTTQIPDA 157
Query: 318 VFVHMSGFIGGNQSYGGALAMARAALK 344
VF H SGF ++ A+ MA ++
Sbjct: 158 VFCHASGFYAVVKTKQAAITMASKVIR 184
>gi|123469349|ref|XP_001317887.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121900632|gb|EAY05664.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 349
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 71/321 (22%), Positives = 125/321 (38%), Gaps = 48/321 (14%)
Query: 37 ALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQK-GFEEVFG 95
A+ +++ ++ N Q++R+ D ++ + ++ YDP Y F +
Sbjct: 60 AVALLILKWVPEYRNFQVLRTDDKEIYEQCGFQFNITDEYDPKRHIYSKQSNLSFPD--- 116
Query: 96 HGFSTKLSSAGLVYKHFGKELIAKELNVD--EGHPDVHRLFLAVYKNFMEAIDAIDNGIN 153
+ L+ AGL+Y FG IA N+ E D L +YK + +D
Sbjct: 117 --YPDTLTIAGLIYHEFGGRAIANHYNLPGFEFQEDFDFLIQKLYKTMILPLDT------ 168
Query: 154 QYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYV 213
+ D D SR+ +PD ++E + M+L ++F V +
Sbjct: 169 KQDCD-----------ISRLASTLDPSDDPDPVVKQE--VLESLMNLIEEQFNQRVSWIT 215
Query: 214 RSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDD 273
++ +P RS + + +R Y +GEI+ L RF P + ++ + + IK+V+ +
Sbjct: 216 KTMIPDRSYIRRAMEDRKRYYQTGEILCLNRFVPVHQYHDIIDPDETKKQSIKFVVV--N 273
Query: 274 RGKQWRVQAVAVSPDRFESRKPLPAQWR------GLRDDELSKEAG----IPGCVFVHMS 323
RG A L WR GLR + G I G +VH +
Sbjct: 274 RGNAGANSGPA---------NVLAFSWRNNYRRLGLRGKSGEQLTGLLQNITGTGWVHPN 324
Query: 324 GFIGGNQSYGGALAMARAALK 344
G ++ AL + LK
Sbjct: 325 GSCAEWNNFANALEYTKQILK 345
>gi|429329931|gb|AFZ81690.1| hypothetical protein BEWA_011080 [Babesia equi]
Length = 66
Score = 58.2 bits (139), Expect = 6e-06, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 37/63 (58%)
Query: 282 AVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQSYGGALAMARA 341
AV +F+SR P P RGLRD+EL E+GIPG VFVH +GF ++ L +
Sbjct: 3 AVKEKGSQFQSRLPFPKHLRGLRDEELEAESGIPGLVFVHATGFTCAGKTRDSVLRLFDL 62
Query: 342 ALK 344
AL+
Sbjct: 63 ALE 65
>gi|255039235|ref|YP_003089856.1| hypothetical protein Dfer_5500 [Dyadobacter fermentans DSM 18053]
gi|254951991|gb|ACT96691.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 280
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 126/328 (38%), Gaps = 79/328 (24%)
Query: 22 LKRVGTHNGSFHCDE--ALGCFMIRLTDKFFNAQIVRSRDPKVLDDL-----DAVLDVGG 74
R+ TH+ +FH D+ A+ ++ D F + R+RD +++ +LD G
Sbjct: 5 FNRLITHDTTFHADDVIAVALLIVAGYDGF---SLSRTRDKATVEEALRSSETLILDTGM 61
Query: 75 VYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLF 134
YDP +DHHQ LS+AGL+Y EL+A +E P
Sbjct: 62 AYDPVMLNFDHHQD----------KNLLSAAGLIYNEIKDELLA-----EEAQP------ 100
Query: 135 LAVYKNFMEAIDAIDNGINQ--YDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENE 192
++ ++ +IDAID + + D+ P +T SS +G N D +AE ++
Sbjct: 101 --YFEKWISSIDAIDTNRDHIYHLWDQLPSGFRHT--SSILGGFNRD----PSNAEEQDR 152
Query: 193 AFQQGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCP-WKLH 251
F Q +++A L+ + + R + E + V + WK
Sbjct: 153 QFGQAVEMAKSIILNEIYSATKKAESERQYASRVVLE-------NNVAVFDDYSTVWK-- 203
Query: 252 LFELEEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKE 311
E + M +V+ G W++Q + + D L
Sbjct: 204 --EKRDHM-------FVVMPHANG--WQIQTIDTALDLIPESISLA-------------- 238
Query: 312 AGIPGCVFVHMSGFIGGNQSYGGALAMA 339
PG F HMSGF+ + A++ A
Sbjct: 239 ---PGFSFRHMSGFMATLDTKENAVSFA 263
>gi|269860741|ref|XP_002650089.1| MYG1 protein [Enterocytozoon bieneusi H348]
gi|220066461|gb|EED43941.1| MYG1 protein [Enterocytozoon bieneusi H348]
Length = 198
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/210 (21%), Positives = 90/210 (42%), Gaps = 15/210 (7%)
Query: 137 VYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEP-DQSAERENEAFQ 195
+YK A DAIDNGI + KP + + N + + D+ ++ F
Sbjct: 1 MYKELFLATDAIDNGIEITYSIKP------RTIQDIIKLFNTSYIDDMDEYNAAQDSQFH 54
Query: 196 QGMDLAGKEFLDTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFEL 255
+ ++ + + + ++LP ++ + D D +++ + +
Sbjct: 55 KALEFVKMDLKNYLNNLFNNFLPGFRKAIDLLQNNKDPDI---VVITDNYISISAIV--- 108
Query: 256 EEEMKIEPLIKYVLYEDDRGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIP 315
E E +KY++++ + ++R+ + F+ + PL +WRG +EL + IP
Sbjct: 109 EAERFTSRDLKYMIFK--KNNEYRIYCFNIPEHDFQPKVPLKQEWRGKSQEELKTISQIP 166
Query: 316 GCVFVHMSGFIGGNQSYGGALAMARAALKL 345
G FVH +GF G + A+ M +LK+
Sbjct: 167 GIRFVHATGFTGSVDTLDSAIMMCHESLKV 196
>gi|261368757|ref|ZP_05981640.1| metal-dependent protein hydrolase [Subdoligranulum variabile DSM
15176]
gi|282569179|gb|EFB74714.1| hypothetical protein SUBVAR_07017 [Subdoligranulum variabile DSM
15176]
Length = 277
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/319 (23%), Positives = 128/319 (40%), Gaps = 59/319 (18%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH G FH D+ ++++ I +R V DD + + VYD +DHH
Sbjct: 9 THGGKFHADDVFSTALLQIV----RPDIRITRGFTVPDDFEGI-----VYDVGGGMFDHH 59
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKELNVDEGHPDVHRLFLAVYKNFMEAID 146
+ E ++ GL+++ G +L+ + H+ L + +NF++ +D
Sbjct: 60 SEPRET---RPNGVPYAAFGLLWRVLGAQLVGE-----------HQARL-LDENFIQPLD 104
Query: 147 AIDNGINQYDTDKPPRYVNNTNLSSRVGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
DN Q +L+ +G N W DQ A+ +E F + + +A K+ L
Sbjct: 105 LNDNTGEQ------------NSLADAIGSFNPVW---DQKAD-PDECFWRAVPVA-KQIL 147
Query: 207 DTVRFYVRSWLPARSIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIK 266
+ + A V A D I+VL + PWK L++ +
Sbjct: 148 ENEIAAANAVNRADETVRRAYANMRD-----GIVVLPAYMPWKNGLYKTDA--------L 194
Query: 267 YVLYEDDRGKQWRVQAVAVSPDRFE-SRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGF 325
+V+Y RG A V+ R + S+ P P W G + +L + +G+ G F H S F
Sbjct: 195 FVVYPSQRGG---YSAQCVNDHRTKRSKLPFPPAWAGKPEAQLREISGL-GLRFCHPSRF 250
Query: 326 IGGNQSYGGALAMARAALK 344
+ A+ R L+
Sbjct: 251 LITADDKATAIEACRRTLR 269
>gi|294882701|ref|XP_002769804.1| hypothetical protein Pmar_PMAR028566 [Perkinsus marinus ATCC
50983]
gi|239873563|gb|EER02522.1| hypothetical protein Pmar_PMAR028566 [Perkinsus marinus ATCC
50983]
Length = 66
Score = 54.3 bits (129), Expect = 8e-05, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 40/68 (58%), Gaps = 8/68 (11%)
Query: 13 STSSPSQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFF----NAQIVRSRDPKVLDDLDA 68
S + PS P+ + TH+G FHCDE LG M+ DK N +VR+R+P+V+ D
Sbjct: 3 SATLPSLRPVT-IATHSGKFHCDEVLGTVML---DKILGGSKNYNLVRTRNPEVISKADI 58
Query: 69 VLDVGGVY 76
V+DVG +
Sbjct: 59 VIDVGAEF 66
>gi|375337558|ref|ZP_09778902.1| hypothetical protein SbacW_11581 [Succinivibrionaceae bacterium
WG-1]
Length = 80
Score = 50.4 bits (119), Expect = 0.001, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 2/56 (3%)
Query: 25 VGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSN 80
V THNGSFH DE C ++ L + + +I+RSR+ + L+ D V+DV G +D N
Sbjct: 4 VATHNGSFHADELTACVILSLL--YEDIKIIRSRNLETLEQGDIVIDVSGQFDLKN 57
>gi|238586821|ref|XP_002391287.1| hypothetical protein MPER_09306 [Moniliophthora perniciosa FA553]
gi|215455749|gb|EEB92217.1| hypothetical protein MPER_09306 [Moniliophthora perniciosa FA553]
Length = 81
Score = 48.5 bits (114), Expect = 0.005, Method: Composition-based stats.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 6 VSSSPAYSTSSP---SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKF 49
V P ST +P Q K +GTHNG+FHCDEAL F++R T +
Sbjct: 20 VLQRPKMSTEAPIAKKQKVQKVIGTHNGTFHCDEALAVFLLRQTSTY 66
>gi|218672874|ref|ZP_03522543.1| metal-dependent protein hydrolase [Rhizobium etli GR56]
Length = 90
Score = 45.8 bits (107), Expect = 0.028, Method: Composition-based stats.
Identities = 23/58 (39%), Positives = 29/58 (50%)
Query: 274 RGKQWRVQAVAVSPDRFESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFIGGNQS 331
R K W V + + + FE R LPA W GL + EL+ GI G F H FI +S
Sbjct: 13 REKDWCVTGIRRAEEGFELRADLPAAWAGLANGELAAVCGIEGASFCHNGRFIAAARS 70
>gi|449134030|ref|ZP_21769538.1| hypothetical protein RE6C_01593 [Rhodopirellula europaea 6C]
gi|448887350|gb|EMB17731.1| hypothetical protein RE6C_01593 [Rhodopirellula europaea 6C]
Length = 325
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 7/69 (10%)
Query: 22 LKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD---AVLDVGGVYDP 78
++ + TH G H DE L C ++ + + ++ RDP D D V+DVGGV+DP
Sbjct: 37 IQLIVTHAGGAHKDEFLACSLL----AYLHGVPIQRRDPTEEDLADPSVCVVDVGGVHDP 92
Query: 79 SNDCYDHHQ 87
+DHHQ
Sbjct: 93 RMKNFDHHQ 101
>gi|78356983|ref|YP_388432.1| hypothetical protein [Desulfovibrio alaskensis G20]
gi|78219388|gb|ABB38737.1| hypothetical protein Dde_1940 [Desulfovibrio alaskensis G20]
Length = 194
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 50/97 (51%), Gaps = 8/97 (8%)
Query: 42 MIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHHQKGFEEVFGHGFSTK 101
+ RLTD+ + + DP V L+A+ + GG P +++ ++ F G +
Sbjct: 40 LARLTDRISLSAV----DPAVRRQLEAIAEQGGSLAP---VLENNISQLQDAFIDGLRSA 92
Query: 102 LSSAGL-VYKHFGKELIAKELNVDEGHPDVHRLFLAV 137
L+SAGL V + +L EL V GHPD H+L +AV
Sbjct: 93 LTSAGLSVDEKVTLQLQDGELAVLGGHPDKHKLDMAV 129
>gi|329942575|ref|ZP_08291385.1| hypothetical protein G5Q_0272 [Chlamydophila psittaci Cal10]
gi|407453752|ref|YP_006732860.1| hypothetical protein B595_0290 [Chlamydia psittaci 84/55]
gi|407455082|ref|YP_006733973.1| hypothetical protein B598_0284 [Chlamydia psittaci GR9]
gi|407456454|ref|YP_006735027.1| hypothetical protein B600_0299 [Chlamydia psittaci VS225]
gi|328815485|gb|EGF85473.1| hypothetical protein G5Q_0272 [Chlamydophila psittaci Cal10]
gi|405780511|gb|AFS19261.1| hypothetical protein B595_0290 [Chlamydia psittaci 84/55]
gi|405781625|gb|AFS20374.1| hypothetical protein B598_0284 [Chlamydia psittaci GR9]
gi|405783715|gb|AFS22462.1| hypothetical protein B600_0299 [Chlamydia psittaci VS225]
Length = 98
Score = 43.9 bits (102), Expect = 0.10, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 277 QWRVQAVAVSPDR-FESRKPLPAQWRGLRDDELSKEAGIPGCVFVHMSGFI 326
QW ++ + + DR E R P P W GL EL + +GIPG +F H F+
Sbjct: 25 QWILRGIPPTLDRRMEVRVPFPESWAGLLGKELEEISGIPGAIFCHKGLFL 75
>gi|170066829|ref|XP_001868239.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167862982|gb|EDS26365.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 369
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 11/89 (12%)
Query: 75 VYDPSNDCYDHHQKGFEEVFGH-------GFSTKLSSAGLVYKHFGKELIAKELNVDE-G 126
V+DP+ + Y +Q F+E +LSSAGL++ +FG+E+ K L +
Sbjct: 129 VFDPAKNRYGTYQASFQETLNSMRPEVKVKREIRLSSAGLIHTYFGEEVNRKILERNSIA 188
Query: 127 HPD---VHRLFLAVYKNFMEAIDAIDNGI 152
+P+ V ++ +Y + +D IDNG+
Sbjct: 189 NPEKELVRGVYRKLYDTLIAELDGIDNGV 217
>gi|254445044|ref|ZP_05058520.1| hypothetical protein VDG1235_3287 [Verrucomicrobiae bacterium
DG1235]
gi|198259352|gb|EDY83660.1| hypothetical protein VDG1235_3287 [Verrucomicrobiae bacterium
DG1235]
Length = 292
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 43/99 (43%), Gaps = 14/99 (14%)
Query: 18 SQTPLKRVGTHNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLD---AVLDVGG 74
S T + TH GS H D+ L C ++ N + R+P D + AV+DVG
Sbjct: 2 SDTTYTTLLTHPGSAHKDDLLACCVLLAA----NPVPIERREPTPADIANPKIAVIDVGD 57
Query: 75 VYDPSNDCYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFG 113
+DP +DHHQ + + + LV KH G
Sbjct: 58 SHDPERGNFDHHQFPRD-------TPPTCALSLVLKHLG 89
>gi|452003741|gb|EMD96198.1| hypothetical protein COCHEDRAFT_1075113, partial [Cochliobolus
heterostrophus C5]
Length = 592
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 62/139 (44%), Gaps = 27/139 (19%)
Query: 178 LDWTEPDQSAERENEAFQQGMDLAGKEFLDTVRFYVRSWLPAR----------------- 220
LDW P A ++N+ +Q +GK LD+ F +SWL +
Sbjct: 48 LDWLTPTDYAPQQNDFLKQWQAGSGKWLLDSAEF--KSWLDTKKQKLFCPGIPGAGKTIL 105
Query: 221 -SIVVECIAERYDYDPSGEIMVLKRFCPWKLHLFELEEEMKIEPLIKYVLYEDDRGKQWR 279
SIV+E ++ R+ + + I + +C F+ ++E +E L+ VL + + +
Sbjct: 106 TSIVIEELSSRFQDESNYGIAYI--YCN-----FKRQDEQTLEDLLASVLKQLAQARFSL 158
Query: 280 VQAVAVSPDRFESRKPLPA 298
Q V DR+ES K P+
Sbjct: 159 PQTVRSLHDRYESMKARPS 177
>gi|325969926|ref|YP_004246117.1| AraC family transcriptional regulator [Sphaerochaeta globus str.
Buddy]
gi|324025164|gb|ADY11923.1| two component transcriptional regulator, AraC family [Sphaerochaeta
globus str. Buddy]
Length = 535
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 4/94 (4%)
Query: 113 GKELIAKELNVDEGHPDVHRLFLAVYKNFMEAIDAIDNGINQYDTDKPPRYVNNTNLSSR 172
G ELI K + HP + L+ Y F A A+ +N+Y KP + NL R
Sbjct: 64 GIELIQK---IRLSHPTTTLVILSGYDEFTYAQQAMRYDVNEY-VLKPVSVEDLCNLLKR 119
Query: 173 VGKLNLDWTEPDQSAERENEAFQQGMDLAGKEFL 206
+GK + + Q +R N+A+QQ + L ++FL
Sbjct: 120 LGKHLDEEIKRIQDQDRLNQAYQQALPLIREKFL 153
>gi|291546156|emb|CBL19264.1| Uncharacterized conserved protein related to MYG1 family
[Ruminococcus sp. SR1/5]
Length = 125
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 48/95 (50%), Gaps = 12/95 (12%)
Query: 27 THNGSFHCDEALGCFMIRLTDKFFNAQIVRSRDPKVLDDLDAVLDVGGVYDPSNDCYDHH 86
TH+G FH D+ ++ + N +I +R +V ++ + G V+D YDHH
Sbjct: 17 THSGKFHADDVFSAALLL----YLNPEITITRGNQVPENYE-----GLVFDIGRGQYDHH 67
Query: 87 QKGFEEVFGHGFSTKLSSAGLVYKHFGKELIAKEL 121
QK + +G ++ GL+++ G E++ +EL
Sbjct: 68 QKD-SRIRDNG--VPYAAFGLLWEKLGPEILGEEL 99
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.137 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,895,152,956
Number of Sequences: 23463169
Number of extensions: 259725009
Number of successful extensions: 562081
Number of sequences better than 100.0: 671
Number of HSP's better than 100.0 without gapping: 637
Number of HSP's successfully gapped in prelim test: 34
Number of HSP's that attempted gapping in prelim test: 559176
Number of HSP's gapped (non-prelim): 730
length of query: 345
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 202
effective length of database: 9,003,962,200
effective search space: 1818800364400
effective search space used: 1818800364400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)