BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017050
(378 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255552315|ref|XP_002517202.1| Protein C9orf114, putative [Ricinus communis]
gi|223543837|gb|EEF45365.1| Protein C9orf114, putative [Ricinus communis]
Length = 369
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 267/371 (71%), Positives = 312/371 (84%), Gaps = 6/371 (1%)
Query: 1 MGNKKKRGGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPI-- 58
MG KKKR E + E T ENHE N++ + NGDS KKKK+++R++ +
Sbjct: 1 MGKKKKRAEAEAQ-TETETVENHEPVNDV-VVNGDSDRKKKKKKKEKERNERKKEENESK 58
Query: 59 EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
E T+S+AVPGSIIDN QSLELATRLAGQIARA TIFRIDEVVVFDN+SSS R++
Sbjct: 59 ETATISIAVPGSIIDNAQSLELATRLAGQIARAATIFRIDEVVVFDNESSSVKEDRTTMI 118
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
N SDENESGAAFL+R+L+YLETPQYLRKALF +SLRFVG+LPPLDAPHHLRKHEW
Sbjct: 119 T-GNNSDENESGAAFLIRILRYLETPQYLRKALFPRLNSLRFVGLLPPLDAPHHLRKHEW 177
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
APFREGVTLKE APNS+GTLVDVGL+K+VV+DQV++PG+RVTV MGT+RNLD++ PRQVV
Sbjct: 178 APFREGVTLKEKAPNSIGTLVDVGLSKNVVIDQVVEPGIRVTVEMGTDRNLDSELPRQVV 237
Query: 239 PPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
SKP+E +GMYWGY+VRYA NIS+VF +C YKGGYDHL+GTSEHG I+N+S L+LPTFR
Sbjct: 238 SLSKPREEAGMYWGYRVRYASNISTVFNDCPYKGGYDHLVGTSEHGQIINASKLSLPTFR 297
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
HLLIAFGGLAGLEESIEED+ LK KN REVF+SY NTCPHQGSRTIRTEEAIFISLQYFQ
Sbjct: 298 HLLIAFGGLAGLEESIEEDNSLKGKNVREVFNSYLNTCPHQGSRTIRTEEAIFISLQYFQ 357
Query: 358 EPISRALRRVE 368
EPI+RAL+R++
Sbjct: 358 EPINRALQRIQ 368
>gi|224099761|ref|XP_002311607.1| predicted protein [Populus trichocarpa]
gi|222851427|gb|EEE88974.1| predicted protein [Populus trichocarpa]
Length = 366
Score = 525 bits (1351), Expect = e-146, Method: Compositional matrix adjust.
Identities = 268/372 (72%), Positives = 303/372 (81%), Gaps = 11/372 (2%)
Query: 1 MGNKKKRGGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPI-- 58
MG K+K+ E +A EN +++NEL L NGDS KKK K + + +D I
Sbjct: 1 MGKKQKKAEAE---TDAERVENDKAENELELTNGDSHKKKKKKKNKERNE--SDKEVIKA 55
Query: 59 -EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSA 117
E+PTVSVA+ GSII+N QSLELATRLAGQIARA TIFRIDEVVVFDNKS+ + R+
Sbjct: 56 KEIPTVSVAISGSIINNAQSLELATRLAGQIARAATIFRIDEVVVFDNKSNYEKEDRTLT 115
Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
+ N SDENESGAAF VR+L+YLETPQYLRKALF H +LRFVGMLPPLDAPHHLRKHE
Sbjct: 116 TD--NYSDENESGAAFFVRILRYLETPQYLRKALFPKHCNLRFVGMLPPLDAPHHLRKHE 173
Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
WAPFREGVTL E PNS TLVDVGL+K+V ++QVL+PG+RVTVAMGTNRNLD+DSPRQV
Sbjct: 174 WAPFREGVTLNEKVPNSGETLVDVGLSKNVSINQVLEPGIRVTVAMGTNRNLDSDSPRQV 233
Query: 238 VPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
V KP+E +GMYWGY+VRYA NISSVFK+C YKGGYDHLIGTSEHG I+NSS+L+LP F
Sbjct: 234 VSLLKPREEAGMYWGYRVRYASNISSVFKDCPYKGGYDHLIGTSEHGLIINSSELSLPAF 293
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
RHLLIAFGGLAGLEE IEED LK KN REVF SY NTCPHQGSRTIRTEEAIFISLQYF
Sbjct: 294 RHLLIAFGGLAGLEECIEEDSNLKGKNVREVFDSYLNTCPHQGSRTIRTEEAIFISLQYF 353
Query: 357 QEPISRALRRVE 368
QEPI+RAL RV+
Sbjct: 354 QEPINRALHRVQ 365
>gi|147790065|emb|CAN75987.1| hypothetical protein VITISV_012192 [Vitis vinifera]
Length = 363
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 262/374 (70%), Positives = 300/374 (80%), Gaps = 16/374 (4%)
Query: 1 MGNKKKR-GGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIE 59
MG KKKR E E EN+ ++NE L NGDS S K K + + Q D
Sbjct: 1 MGKKKKRRSDFEAET------ENNTAENETELINGDSRSKKKKNKTHKDKYQATD----- 49
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
+PT+++AVPGSII N QSLELATRLAGQ+ARA TIFRIDEVVVFD KS+S + +S
Sbjct: 50 IPTLTIAVPGSIIHNAQSLELATRLAGQVARAATIFRIDEVVVFDCKSTSGD---NSTVA 106
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
+ SDENE+GAAFL+R+L+YLETPQYLRK LF H+SL+FVGMLPPLDAPHHLRKHEW
Sbjct: 107 TPDASDENETGAAFLIRILRYLETPQYLRKILFPKHNSLKFVGMLPPLDAPHHLRKHEWG 166
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
P+REGVTLKE AP+SVGTLVDVGLNK+VV+DQVL+PG+RVTVAMGTNRNLDAD QVV
Sbjct: 167 PYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDADFVHQVVS 226
Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
SKP+E G YWGYKVRYA NISSVFK C +KGGYDHLIGTSEHG IV SS+L +P+FRH
Sbjct: 227 SSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSELDIPSFRH 286
Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
LLIAFGGLAGLEES+EED LK KN RE+F SY NTCP+QGSRTIRTEEAI ISLQYFQE
Sbjct: 287 LLIAFGGLAGLEESVEEDHSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAILISLQYFQE 346
Query: 359 PISRALRRVEYKSK 372
PI+RAL+R EY+S+
Sbjct: 347 PINRALQRFEYQSQ 360
>gi|225432580|ref|XP_002277845.1| PREDICTED: uncharacterized protein C9orf114 isoform 1 [Vitis
vinifera]
gi|297737011|emb|CBI26212.3| unnamed protein product [Vitis vinifera]
Length = 363
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 261/374 (69%), Positives = 299/374 (79%), Gaps = 16/374 (4%)
Query: 1 MGNKKKR-GGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIE 59
MG KKKR E E EN+ ++NE L NGDS S K K + + Q D
Sbjct: 1 MGKKKKRRSDFEAET------ENNTAENETELINGDSRSKKKKNKTHKDKYQATD----- 49
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
+PT+++AVPGSII N QSLELATRLAGQIARA TIFRIDEVVVFD KS+S + S
Sbjct: 50 IPTLTIAVPGSIIHNAQSLELATRLAGQIARAATIFRIDEVVVFDCKSTSGD---DSTVA 106
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
+ SDENE+G AFL+R+L+YLETPQYLRK LF H+SL+FVGMLPP+DAPHHLRKHEW
Sbjct: 107 TPDASDENETGPAFLIRILRYLETPQYLRKTLFPKHNSLKFVGMLPPVDAPHHLRKHEWG 166
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
P+REGVTLKE AP+SVGTLVDVGLNK+VV+DQVL+PG+RVTVAMGTNRNLDAD QVV
Sbjct: 167 PYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDADFVHQVVS 226
Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
SKP+E G YWGYKVRYA NISSVFK C +KGGYDHLIGTSEHG IV SS+L +P+FRH
Sbjct: 227 SSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSELDIPSFRH 286
Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
LLIAFGGLAGLEES+EED+ LK KN RE+F SY NTCP+QGSRTIRTEEAI ISLQYFQE
Sbjct: 287 LLIAFGGLAGLEESVEEDNSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAILISLQYFQE 346
Query: 359 PISRALRRVEYKSK 372
PI+RAL+R EY+S+
Sbjct: 347 PINRALQRFEYRSQ 360
>gi|359477498|ref|XP_003631986.1| PREDICTED: uncharacterized protein C9orf114 isoform 2 [Vitis
vinifera]
Length = 359
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 257/374 (68%), Positives = 293/374 (78%), Gaps = 20/374 (5%)
Query: 1 MGNKKKR-GGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIE 59
MG KKKR E E EN+ ++NE L NGDS S K K + + Q D
Sbjct: 1 MGKKKKRRSDFEAET------ENNTAENETELINGDSRSKKKKNKTHKDKYQATD----- 49
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
+PT+++AVPGSII N QSLELATRLAGQIARA TIFRIDEV + N S S
Sbjct: 50 IPTLTIAVPGSIIHNAQSLELATRLAGQIARAATIFRIDEVCIVMNSSRSVEL------- 102
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
N SDENE+G AFL+R+L+YLETPQYLRK LF H+SL+FVGMLPP+DAPHHLRKHEW
Sbjct: 103 LINASDENETGPAFLIRILRYLETPQYLRKTLFPKHNSLKFVGMLPPVDAPHHLRKHEWG 162
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
P+REGVTLKE AP+SVGTLVDVGLNK+VV+DQVL+PG+RVTVAMGTNRNLDAD QVV
Sbjct: 163 PYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDADFVHQVVS 222
Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
SKP+E G YWGYKVRYA NISSVFK C +KGGYDHLIGTSEHG IV SS+L +P+FRH
Sbjct: 223 SSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSELDIPSFRH 282
Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
LLIAFGGLAGLEES+EED+ LK KN RE+F SY NTCP+QGSRTIRTEEAI ISLQYFQE
Sbjct: 283 LLIAFGGLAGLEESVEEDNSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAILISLQYFQE 342
Query: 359 PISRALRRVEYKSK 372
PI+RAL+R EY+S+
Sbjct: 343 PINRALQRFEYRSQ 356
>gi|449432714|ref|XP_004134144.1| PREDICTED: uncharacterized protein C9orf114 homolog [Cucumis
sativus]
Length = 367
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 249/374 (66%), Positives = 292/374 (78%), Gaps = 21/374 (5%)
Query: 1 MGNKKKRGGLEPELK-EAATGENHESQNELSLANGDS--------SSCDNKKKKKRKRDQ 51
MG KK + PE + EA ++ E+ +E L NG S +N + KR
Sbjct: 1 MGKKKHK---RPEQESEALDRDDFEANHETELINGCSPEKKKKKKKKVENGSIEAEKRK- 56
Query: 52 LNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDN 111
PI PTVS+AV GSIIDN QSLELATRLAGQIARA TIFRI+EVVVFD+ SS
Sbjct: 57 -----PISKPTVSIAVSGSIIDNAQSLELATRLAGQIARAATIFRINEVVVFDSGRSSTT 111
Query: 112 YSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
S +AAN S DE+ESGAAFL+R+L+YLETPQYLRKALF H++LRFVGMLPPLDAPH
Sbjct: 112 GSEVAAANNS---DEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 168
Query: 172 HLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDA 231
HLRKHEW P+REGVTLKE AP++ GT VDVGL+K+VVVD++L+PG RVTVAMGT+RNL +
Sbjct: 169 HLRKHEWGPYREGVTLKERAPDAKGTSVDVGLSKNVVVDEILEPGTRVTVAMGTDRNLFS 228
Query: 232 DSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
D PRQVV SKP E G+YWGY+VRYA ++S+VFK SY+GGYDHLIGTSEHG ++ SS+L
Sbjct: 229 DLPRQVVSSSKPVEEGLYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGMVIKSSEL 288
Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
TLP FRHLLIAFGGLAGLEESIEED+ K KNA E+F SY NTCP QGSRTIRTEEAIFI
Sbjct: 289 TLPPFRHLLIAFGGLAGLEESIEEDNNFKSKNAHEIFSSYLNTCPLQGSRTIRTEEAIFI 348
Query: 352 SLQYFQEPISRALR 365
SLQYFQEPI++A++
Sbjct: 349 SLQYFQEPINKAMQ 362
>gi|449520559|ref|XP_004167301.1| PREDICTED: uncharacterized protein C9orf114 homolog [Cucumis
sativus]
Length = 368
Score = 489 bits (1260), Expect = e-136, Method: Compositional matrix adjust.
Identities = 249/375 (66%), Positives = 292/375 (77%), Gaps = 22/375 (5%)
Query: 1 MGNKKKRGGLEPELK-EAATGENHESQNELSLANGDS---------SSCDNKKKKKRKRD 50
MG KK + PE + EA ++ E+ +E L NG S +N + KR
Sbjct: 1 MGKKKHK---RPEQESEALDRDDFEANHETELINGCSPEKKKKKKKKKVENGSIEAEKRK 57
Query: 51 QLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD 110
PI PTVS+AV GSIIDN QSLELATRLAGQIARA TIFRI+EVVVFD+ SS
Sbjct: 58 ------PISKPTVSIAVSGSIIDNAQSLELATRLAGQIARAATIFRINEVVVFDSGRSST 111
Query: 111 NYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAP 170
S +AAN S DE+ESGAAFL+R+L+YLETPQYLRKALF H++LRFVGMLPPLDAP
Sbjct: 112 TGSEVAAANNS---DEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAP 168
Query: 171 HHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
HHLRKHEW P+REGVTLKE AP++ GT VDVGL+K+VVVD++L+PG RVTVAMGT+RNL
Sbjct: 169 HHLRKHEWGPYREGVTLKERAPDAKGTSVDVGLSKNVVVDEILEPGTRVTVAMGTDRNLF 228
Query: 231 ADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+D PRQVV SKP E G+YWGY+VRYA ++S+VFK SY+GGYDHLIGTSEHG ++ SS+
Sbjct: 229 SDLPRQVVSSSKPVEEGLYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGMVIKSSE 288
Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
LTLP FRHLLIAFGGLAGLEESIEED+ K KNA E+F SY NTCP QGSRTIRTEEAIF
Sbjct: 289 LTLPPFRHLLIAFGGLAGLEESIEEDNNFKSKNAHEIFSSYLNTCPLQGSRTIRTEEAIF 348
Query: 351 ISLQYFQEPISRALR 365
ISLQYFQEPI++A++
Sbjct: 349 ISLQYFQEPINKAMQ 363
>gi|42567956|ref|NP_197431.2| uncharacterized protein [Arabidopsis thaliana]
gi|45825145|gb|AAS77480.1| At5g19300 [Arabidopsis thaliana]
gi|110741745|dbj|BAE98818.1| hypothetical protein [Arabidopsis thaliana]
gi|332005299|gb|AED92682.1| uncharacterized protein [Arabidopsis thaliana]
Length = 398
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 230/307 (74%), Positives = 265/307 (86%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+ GSII NTQSLELATRLAGQIARA TIFRIDE+VVFDNKSSS+ SAA +
Sbjct: 99 TVSIAIAGSIIHNTQSLELATRLAGQIARAATIFRIDEIVVFDNKSSSEI---ESAA--T 153
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
N SD NESGA+FLVR+L+YLETPQYLRK+LF + LR+VGMLPPLDAPHHLRKHEW +
Sbjct: 154 NASDSNESGASFLVRILKYLETPQYLRKSLFPKQNDLRYVGMLPPLDAPHHLRKHEWEQY 213
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGVTL E APNS GTLVDVGL+K VVVDQVL PG+RVTVAMGT+ +LD RQ+VPPS
Sbjct: 214 REGVTLSEKAPNSEGTLVDVGLSKSVVVDQVLGPGIRVTVAMGTDHDLDL--VRQIVPPS 271
Query: 242 KPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
KP+E +GMYWGYKVRYA +SSVFK C ++GGYD+LIGTSEHG +++SS+L +PTFRHLL
Sbjct: 272 KPREEAGMYWGYKVRYASQLSSVFKECPFEGGYDYLIGTSEHGLVISSSELKIPTFRHLL 331
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
IAFGGLAGLEESIE+D+ K KN R+VF+ Y NTCPHQGSRTIR EEA+FISLQYFQEPI
Sbjct: 332 IAFGGLAGLEESIEDDNQYKGKNVRDVFNVYLNTCPHQGSRTIRAEEAMFISLQYFQEPI 391
Query: 361 SRALRRV 367
SRA+RR+
Sbjct: 392 SRAVRRL 398
>gi|297807989|ref|XP_002871878.1| hypothetical protein ARALYDRAFT_909964 [Arabidopsis lyrata subsp.
lyrata]
gi|297317715|gb|EFH48137.1| hypothetical protein ARALYDRAFT_909964 [Arabidopsis lyrata subsp.
lyrata]
Length = 381
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 230/307 (74%), Positives = 264/307 (85%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+ GSII NTQSLELATRLAGQIARA TIFRIDE+VVFDNKSSS+ SAA
Sbjct: 82 TVSIAIAGSIIHNTQSLELATRLAGQIARAATIFRIDEIVVFDNKSSSEI---ESAA--M 136
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
N SD NESGA+FLVR+L+YLETPQYLRK+LF + LR+VGMLPPLDAPHHLRKHEW +
Sbjct: 137 NPSDSNESGASFLVRILKYLETPQYLRKSLFPKQNDLRYVGMLPPLDAPHHLRKHEWEQY 196
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGVTL E APNS GT+VDVGL+K VVVDQVL PGVRVTVAMGT+ +LD RQ+VPPS
Sbjct: 197 REGVTLNEKAPNSEGTMVDVGLSKSVVVDQVLSPGVRVTVAMGTDHDLDL--VRQIVPPS 254
Query: 242 KPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
KP+E +GMYWGYKVRYA +SSVFK C ++GGYD+LIGTSEHG +++SS+L +PTFRHLL
Sbjct: 255 KPREEAGMYWGYKVRYASQLSSVFKECPFQGGYDYLIGTSEHGVVISSSELKIPTFRHLL 314
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
IAFGGLAGLEESIE+D+ K KN R+VF+ Y NTCPHQGSRTIR EEA+FISLQYFQEPI
Sbjct: 315 IAFGGLAGLEESIEDDNQYKGKNVRDVFNIYLNTCPHQGSRTIRAEEAMFISLQYFQEPI 374
Query: 361 SRALRRV 367
SRA+RR+
Sbjct: 375 SRAVRRL 381
>gi|224111238|ref|XP_002315789.1| predicted protein [Populus trichocarpa]
gi|222864829|gb|EEF01960.1| predicted protein [Populus trichocarpa]
Length = 338
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 230/318 (72%), Positives = 262/318 (82%), Gaps = 9/318 (2%)
Query: 54 DDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEV-VVFDNKSSSDNY 112
D E+PTVS+AV GSII+N QSLELATRLAGQIARA TIFRIDEV ++ N +S +
Sbjct: 28 DKEAKEIPTVSIAVSGSIINNAQSLELATRLAGQIARAATIFRIDEVSILIFNGFTSSSE 87
Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH 172
+N SDENESG AF +R+L+YLETPQYLRKALF H++LRFVGMLPPLDAPHH
Sbjct: 88 KEDPTVTTNNNSDENESGGAFFIRILRYLETPQYLRKALFPKHNNLRFVGMLPPLDAPHH 147
Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
LRKHEWAPFREGVTL E NS TLVDVGL+K+VVV+Q L+PG+RVTVAMGTNRNLD D
Sbjct: 148 LRKHEWAPFREGVTLNEKVSNSGETLVDVGLSKNVVVNQALEPGIRVTVAMGTNRNLDDD 207
Query: 233 SPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
SPRQVV SKP+ E+G+YWGY+VRYA NISSVFK+C Y+GGYD LIGTSEHG I+NSS+L
Sbjct: 208 SPRQVVSLSKPREEAGLYWGYRVRYASNISSVFKDCPYRGGYDLLIGTSEHGLIINSSEL 267
Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
+LPTFRHLLIAFGGLAGLEESIEED LK K+ REVF SY NTCPHQGSRTIRTE
Sbjct: 268 SLPTFRHLLIAFGGLAGLEESIEEDSNLKGKDVREVFDSYLNTCPHQGSRTIRTE----- 322
Query: 352 SLQYFQEPISRALRRVEY 369
YFQEPISRA++RV++
Sbjct: 323 --VYFQEPISRAMQRVQH 338
>gi|356575809|ref|XP_003556029.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
C9orf114-like [Glycine max]
Length = 353
Score = 450 bits (1158), Expect = e-124, Method: Compositional matrix adjust.
Identities = 216/293 (73%), Positives = 247/293 (84%), Gaps = 7/293 (2%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
+PTVS+AVP SIIDN +LELATRLAGQIARA TIFRI+EVVVFDNKS+ DN S
Sbjct: 43 IPTVSIAVPASIIDNVPTLELATRLAGQIARAATIFRINEVVVFDNKSNPDNDSVLDNVG 102
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
+ESGAAFL+R+LQYLETPQYLRKALF MH+SLRFVG+LPPLDAPHHLRKHEW
Sbjct: 103 -------DESGAAFLMRILQYLETPQYLRKALFPMHNSLRFVGLLPPLDAPHHLRKHEWF 155
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
P+REGVT+KE NS TLVDVGL K+V+VDQ+ +PG RVTVAMGT+RNLD+D PRQV+
Sbjct: 156 PYREGVTVKERDSNSGATLVDVGLVKNVIVDQIFEPGRRVTVAMGTDRNLDSDLPRQVIS 215
Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
SKP+E G YWGY+VRYA NIS+VFK+C+YK GYD +IGTSEHG I+ SSDL +P+FRHL
Sbjct: 216 SSKPREEGTYWGYQVRYAHNISAVFKDCAYKRGYDFIIGTSEHGQIIKSSDLEIPSFRHL 275
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
LIAFGGLAGLEESIEEDD LK K A++ F+ Y NTCPHQGSRTIRTEEAIFIS
Sbjct: 276 LIAFGGLAGLEESIEEDDNLKGKKAQDAFNLYLNTCPHQGSRTIRTEEAIFIS 328
>gi|363807614|ref|NP_001242411.1| uncharacterized protein LOC100786821 [Glycine max]
gi|255639023|gb|ACU19812.1| unknown [Glycine max]
Length = 352
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 217/294 (73%), Positives = 248/294 (84%), Gaps = 8/294 (2%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
+PTVS+AVPGSIIDN +LELATRLAGQIARA TIFRI+EVVVFDNKS+ DN S +
Sbjct: 43 IPTVSIAVPGSIIDNVPTLELATRLAGQIARAATIFRINEVVVFDNKSNPDN---DSVLD 99
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
SN +ESGA+FL+++LQYLETPQYLRKALF MH+SLRFVG+LPPLDAPHHLRKHEW
Sbjct: 100 NSN----DESGASFLMQILQYLETPQYLRKALFPMHNSLRFVGLLPPLDAPHHLRKHEWG 155
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
P+REGVT KE NS TLVDVGL K+V+VDQ+ PG RVTVAMGT+RNLD+D PRQVV
Sbjct: 156 PYREGVTAKERHLNSGATLVDVGLVKNVIVDQIFKPGRRVTVAMGTDRNLDSDLPRQVVS 215
Query: 240 PSKP-KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
SKP +E G YWGY+VRYA NIS+VFK+C+YK GYD +IGTSEHG I+ SSDL +P+FRH
Sbjct: 216 SSKPTREEGTYWGYQVRYAHNISAVFKDCAYKRGYDFIIGTSEHGQIIKSSDLEIPSFRH 275
Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
LLIAFGGLAGLEESIEEDD LK K A++ F+ Y NTCPHQGSRTIRTEEAI IS
Sbjct: 276 LLIAFGGLAGLEESIEEDDNLKGKKAQDAFNLYLNTCPHQGSRTIRTEEAILIS 329
>gi|226510311|ref|NP_001149839.1| conserved protein [Zea mays]
gi|195634967|gb|ACG36952.1| conserved protein [Zea mays]
Length = 365
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 215/308 (69%), Positives = 250/308 (81%), Gaps = 12/308 (3%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
PTVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDEVVVFD+ +++N S
Sbjct: 69 PTVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEVVVFDSTPAAENGSAG----- 123
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
D ESGA FLVR+L+YLETPQYLR+ LF MH +L+FVG+LPPLDAPHH+RKHEW+
Sbjct: 124 ----DGEESGARFLVRILEYLETPQYLRRRLFPMHKNLKFVGLLPPLDAPHHVRKHEWSE 179
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
FREGVTL+ + S GTLVDVGL+K+V+V+Q L+PG RVTVAMGTNR+L R+VVPP
Sbjct: 180 FREGVTLESDP--SKGTLVDVGLSKNVLVEQKLEPGKRVTVAMGTNRDLTTACIRKVVPP 237
Query: 241 SKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
S P+E G YWGYKVRY N+S VFKN +K YDH+IGTSEHG IVNSS+LTLPTFRHL
Sbjct: 238 STPREQMGSYWGYKVRYTSNLSGVFKNSPFKEEYDHIIGTSEHGQIVNSSELTLPTFRHL 297
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
LIAFGGLAGLEESIEED LK K A +VF SYFNTCP+QGSRTIRTEEA+ ISLQYFQ+P
Sbjct: 298 LIAFGGLAGLEESIEEDTNLKGKRADDVFTSYFNTCPNQGSRTIRTEEALLISLQYFQDP 357
Query: 360 ISRALRRV 367
I RA +++
Sbjct: 358 IRRAEQKL 365
>gi|242092392|ref|XP_002436686.1| hypothetical protein SORBIDRAFT_10g007030 [Sorghum bicolor]
gi|241914909|gb|EER88053.1| hypothetical protein SORBIDRAFT_10g007030 [Sorghum bicolor]
Length = 367
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 211/308 (68%), Positives = 251/308 (81%), Gaps = 14/308 (4%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
PTVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDEVVVFD+ +++N
Sbjct: 73 PTVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEVVVFDSTPAAEN--------- 123
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+D+ ESGA FLVR+L+YLETPQYLR+ LF MH +L+FVG+LPPLDAPHH+RKHEW+
Sbjct: 124 GGAADDEESGARFLVRILEYLETPQYLRRRLFPMHKNLKFVGLLPPLDAPHHVRKHEWSE 183
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
FREGVTL+ + S GTLVDVGL+K+V+V+Q L+PG RVTVAMGTNR+L + R+VVPP
Sbjct: 184 FREGVTLESDP--SKGTLVDVGLSKNVLVEQTLEPGKRVTVAMGTNRDLT--TARKVVPP 239
Query: 241 SKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
S P+E G YWGYKVRY N+S V KN +K YDH+IGTSEHG I+NSS+LTLPTFRHL
Sbjct: 240 STPREQMGSYWGYKVRYTSNLSGVLKNSPFKEEYDHIIGTSEHGQIINSSELTLPTFRHL 299
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
LIAFGGLAGLEESIEED LK K+A +VF SY NTCP+QGSRTIRTEEA+ ISLQYFQ+P
Sbjct: 300 LIAFGGLAGLEESIEEDTNLKGKHADDVFTSYLNTCPNQGSRTIRTEEALLISLQYFQDP 359
Query: 360 ISRALRRV 367
I RA +++
Sbjct: 360 IRRAEQKM 367
>gi|357132223|ref|XP_003567731.1| PREDICTED: uncharacterized protein C9orf114-like [Brachypodium
distachyon]
Length = 362
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 213/324 (65%), Positives = 249/324 (76%), Gaps = 13/324 (4%)
Query: 43 KKKKRKRDQLNDDAPIE--VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEV 100
KKKKR +D + PTVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDE+
Sbjct: 48 KKKKRNKDTEEGKKTDQNRKPTVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEI 107
Query: 101 VVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF 160
VVFD+ +N S + A E ESGA FL+R+LQYLETPQYLR+ LF MH + +F
Sbjct: 108 VVFDSCPPVENASGAEA--------EEESGAQFLIRILQYLETPQYLRRRLFPMHKNFKF 159
Query: 161 VGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVT 220
VG+LPPLDAPHH+R+HEW+ FREGVTL S GTLVDVGL+K ++V+Q+L+PG RVT
Sbjct: 160 VGLLPPLDAPHHVRRHEWSEFREGVTL--GGDRSKGTLVDVGLSKDILVEQILEPGKRVT 217
Query: 221 VAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGT 279
VAMGTNR++ R+VVPPS P E G YWGYKVRYA N+S VF YK GYDH+IGT
Sbjct: 218 VAMGTNRDITTACVRKVVPPSSPSDEMGSYWGYKVRYASNLSGVFNGSPYKEGYDHIIGT 277
Query: 280 SEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQG 339
SEHG+ + SS+LTLP FRHLLIAFGGLAGLEESIEED LK K A++VF SY NTCP+QG
Sbjct: 278 SEHGEAIGSSELTLPAFRHLLIAFGGLAGLEESIEEDINLKGKGAKDVFTSYLNTCPNQG 337
Query: 340 SRTIRTEEAIFISLQYFQEPISRA 363
SRTIRTEEA+ ISLQYFQ+PI RA
Sbjct: 338 SRTIRTEEALLISLQYFQDPIRRA 361
>gi|115457406|ref|NP_001052303.1| Os04g0244500 [Oryza sativa Japonica Group]
gi|113563874|dbj|BAF14217.1| Os04g0244500 [Oryza sativa Japonica Group]
gi|116310997|emb|CAH67931.1| H0211F06-OSIGBa0153M17.3 [Oryza sativa Indica Group]
gi|215686888|dbj|BAG89738.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 358
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 202/303 (66%), Positives = 242/303 (79%), Gaps = 14/303 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDEVVVFD+ SS +N
Sbjct: 68 TVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEVVVFDSNSSVEN---------- 117
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
D+ ESGA FLVR+LQYLETPQYLR+ LF MH++L+FVG+LPPLDAPHHLRKHEW+ F
Sbjct: 118 -SGDDVESGARFLVRILQYLETPQYLRRRLFPMHNNLKFVGLLPPLDAPHHLRKHEWSEF 176
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGVTL + S+GT VDVGL+K+V+V+Q+L+PG RVT+AMGTNR++ R++V PS
Sbjct: 177 REGVTL--DGDRSMGTFVDVGLSKNVLVEQMLEPGKRVTIAMGTNRDITTACKRKIVSPS 234
Query: 242 KPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
P+ E +YWGYKVRYA N+ VF + YK GYD++IGTSEHG I++SS+L LP+FRHLL
Sbjct: 235 SPRDEMELYWGYKVRYASNLGGVFSDSPYKEGYDYIIGTSEHGKIISSSELILPSFRHLL 294
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
IAFGGLAGLEE IEED LK K +VF++Y NTCP QGSRTIRTEEA+ ISLQYFQ+PI
Sbjct: 295 IAFGGLAGLEECIEEDRNLKGKGVDDVFNTYLNTCPSQGSRTIRTEEALLISLQYFQDPI 354
Query: 361 SRA 363
RA
Sbjct: 355 RRA 357
>gi|255648220|gb|ACU24563.1| unknown [Glycine max]
Length = 307
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 196/273 (71%), Positives = 225/273 (82%), Gaps = 9/273 (3%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
+PTVS+AVP SIIDN +LELATRLAGQIARA TIFRI+EVVVFDNKS+ DN S
Sbjct: 43 IPTVSIAVPASIIDNVPTLELATRLAGQIARAATIFRINEVVVFDNKSNPDNDSVLDNVG 102
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
+ESGAAFL+R+LQYLETPQYLRKALF MH+SLRFVG+LPPLDAPHHLRKHEW
Sbjct: 103 -------DESGAAFLMRILQYLETPQYLRKALFPMHNSLRFVGLLPPLDAPHHLRKHEWF 155
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
P+REGVT+KE NS TLVDVGL K+V+VDQ+ +PG RVTVAMGT+RNLD+D PRQV+
Sbjct: 156 PYREGVTVKERDSNSGATLVDVGLVKNVIVDQIFEPGRRVTVAMGTDRNLDSDLPRQVIS 215
Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
SKP+E G YWGY+VRYA NIS+VFK+C+YK GYD +IGTSEHG I+ SSDL +P+FRHL
Sbjct: 216 SSKPREEGTYWGYQVRYAHNISAVFKDCAYKRGYDFIIGTSEHGQIIKSSDLEIPSFRHL 275
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYF 332
LIAFGGLAGLEESIEEDD LK K R HS +
Sbjct: 276 LIAFGGLAGLEESIEEDDNLKGKRHR--VHSIY 306
>gi|294461137|gb|ADE76133.1| unknown [Picea sitchensis]
Length = 364
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 195/307 (63%), Positives = 233/307 (75%), Gaps = 9/307 (2%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN--KSSSDNYSRSSA 117
V TVS+AV GSIIDN QS ELATRLAGQIARA TIFRIDEV+VFD+ K+++ N S A
Sbjct: 62 VQTVSIAVAGSIIDNAQSSELATRLAGQIARAATIFRIDEVIVFDDQAKATTSNISIWDA 121
Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
++N+SGA F+ R+L YL+ PQYLR+ L H+ LR VG+LPPLDAPHH RKHE
Sbjct: 122 ------DEKNKSGALFMARILNYLDIPQYLRRTLVPRHNDLRHVGLLPPLDAPHHARKHE 175
Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
W P+REGVTL + GT VDVGL+K V + Q +PG+RVTVAMG+ R+L+ D RQ
Sbjct: 176 WVPYREGVTLDRTSVAGGGTFVDVGLHKAVCIQQTFEPGLRVTVAMGSKRSLEDDCIRQA 235
Query: 238 VPPSKP-KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
V S+P K+ G+YWGY VRY+ NIS VFK C Y+GGYD+ IGTSEHGD ++S DL +P F
Sbjct: 236 VDRSEPNKKMGLYWGYNVRYSSNISCVFKECPYQGGYDYSIGTSEHGDKISSGDLEIPKF 295
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
RHLLIAFGGLAGLEE+IE+D L KN ++F Y NTCP QGSRTIRTEEAIFISLQYF
Sbjct: 296 RHLLIAFGGLAGLEENIEQDGDLMGKNVTQLFSRYLNTCPKQGSRTIRTEEAIFISLQYF 355
Query: 357 QEPISRA 363
Q+PI RA
Sbjct: 356 QDPIQRA 362
>gi|38346841|emb|CAD39923.2| OSJNBa0091C12.1 [Oryza sativa Japonica Group]
Length = 361
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 172/266 (64%), Positives = 209/266 (78%), Gaps = 14/266 (5%)
Query: 82 TRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYL 141
T LAGQIARA T+FRIDEVVVFD+ SS +N D+ ESGA FLVR+LQYL
Sbjct: 67 TTLAGQIARAATVFRIDEVVVFDSNSSVEN-----------SGDDVESGARFLVRILQYL 115
Query: 142 ETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDV 201
ETPQYLR+ LF MH++L+FVG+LPPLDAPHHLRKHEW+ FREGVTL + S+GT VDV
Sbjct: 116 ETPQYLRRRLFPMHNNLKFVGLLPPLDAPHHLRKHEWSEFREGVTL--DGDRSMGTFVDV 173
Query: 202 GLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNI 260
GL+K+V+V+Q+L+PG RVT+AMGTNR++ R++V PS P+ E +YWGYKVRYA N+
Sbjct: 174 GLSKNVLVEQMLEPGKRVTIAMGTNRDITTACKRKIVSPSSPRDEMELYWGYKVRYASNL 233
Query: 261 SSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLK 320
VF + YK GYD++IGTSEHG I++SS+L LP+FRHLLIAFGGLAGLEE IEED LK
Sbjct: 234 GGVFSDSPYKEGYDYIIGTSEHGKIISSSELILPSFRHLLIAFGGLAGLEECIEEDRNLK 293
Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTE 346
K +VF++Y NTCP QGSRTIRTE
Sbjct: 294 GKGVDDVFNTYLNTCPSQGSRTIRTE 319
>gi|302756383|ref|XP_002961615.1| hypothetical protein SELMODRAFT_76235 [Selaginella moellendorffii]
gi|300170274|gb|EFJ36875.1| hypothetical protein SELMODRAFT_76235 [Selaginella moellendorffii]
Length = 329
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 175/308 (56%), Positives = 221/308 (71%), Gaps = 10/308 (3%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN-KSSSDNYSRSSAANR 120
TVS+AV GSII+N QSLELATRLAGQIARA TIFRIDE+VVFD+ +SS+ ++ R+
Sbjct: 25 TVSLAVSGSIINNAQSLELATRLAGQIARAATIFRIDEIVVFDDGESSASSFQRTGWGRE 84
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+ + SG FL R+L+YLE PQYLR+AL MH SLR+ G LPPLD+PHHLRK W P
Sbjct: 85 TGEA----SGGEFLARVLRYLEVPQYLRRALVPMHKSLRYAGQLPPLDSPHHLRKQAWLP 140
Query: 181 FREGVTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPRQV 237
FREGV L ++ S G DVGL + V++ + ++ G R+TVAMG T++NL V
Sbjct: 141 FREGVALDKSFEKSRGNGCYADVGLGQDVLIQETVNAGTRITVAMGSTDKNLK-RGKLSV 199
Query: 238 VPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
VP S P+ES G+YWGY VR ++ F + S++ GYD+ IGTSEHG+ V +SDL +P+F
Sbjct: 200 VPSSTPRESAGLYWGYTVRVVDRLTKAFSDSSFEEGYDYTIGTSEHGEKVLTSDLRVPSF 259
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
RH LI FGG AGLEES+E D ++ RE+F Y NTCP QGSRTIRTEEAIFISLQ+
Sbjct: 260 RHALIVFGGPAGLEESLELDGSSAVEDVRELFTRYLNTCPLQGSRTIRTEEAIFISLQFL 319
Query: 357 QEPISRAL 364
QEP+ RA+
Sbjct: 320 QEPLLRAV 327
>gi|302775540|ref|XP_002971187.1| hypothetical protein SELMODRAFT_95276 [Selaginella moellendorffii]
gi|300161169|gb|EFJ27785.1| hypothetical protein SELMODRAFT_95276 [Selaginella moellendorffii]
Length = 324
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 168/306 (54%), Positives = 211/306 (68%), Gaps = 15/306 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN-KSSSDNYSRSSAANR 120
TVS+AV GSII+N QSLELATRLAGQIARA TIFRIDE+VVFD+ +SS+ ++ R+
Sbjct: 31 TVSLAVSGSIINNAQSLELATRLAGQIARAATIFRIDEIVVFDDGESSASSFQRTGWGRE 90
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+ + SG FL R+L+YLE PQYLR+AL MH SLR+ G LPPLD+PHHLRK
Sbjct: 91 TGEA----SGGEFLARVLRYLEVPQYLRRALVPMHKSLRYAGQLPPLDSPHHLRK----- 141
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPRQVVP 239
+ E G DVGL + V++ + + G R+TVAMG T++NL VVP
Sbjct: 142 --QACCHSEKVARGNGCYADVGLGQDVLIQETVKAGTRITVAMGSTDKNLK-RGKLSVVP 198
Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
S P+ES G+YWGY VR ++ F + S++ GYD+ IGTSEHG+ V +SDL +P+FRH
Sbjct: 199 SSTPRESAGLYWGYTVRVVDRLTKAFSDSSFEEGYDYTIGTSEHGEKVLTSDLRVPSFRH 258
Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
LI FGG AGLEES+E D K+ RE+F Y NTCP QGSRTIRTEEAIFISLQ+ QE
Sbjct: 259 ALIVFGGPAGLEESLELDGSSTVKDVRELFTRYLNTCPLQGSRTIRTEEAIFISLQFLQE 318
Query: 359 PISRAL 364
P+ RA+
Sbjct: 319 PLLRAV 324
>gi|167998478|ref|XP_001751945.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697043|gb|EDQ83380.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 296
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 159/303 (52%), Positives = 199/303 (65%), Gaps = 10/303 (3%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AV GSI+DN QSL LA+RLAGQIARA IFRIDE+VVFD+ + S
Sbjct: 4 TISMAVAGSIVDNAQSLVLASRLAGQIARAAAIFRIDEIVVFDDGDEGSRVPKWSEG--- 60
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
E+ +FL RLL+Y+ETPQYLR L H SL+ G LPPLD PH RKHEW P+
Sbjct: 61 ----ATETSGSFLCRLLKYMETPQYLRLTLCPKHRSLQCAGSLPPLDVPHQARKHEWIPY 116
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGV G V L + VV+ Q L PG RVTV MG +R +VVP S
Sbjct: 117 REGVNPTRMYCIEKGISVSSILVQDVVIRQNLQPGTRVTVKMGESRE---QRILEVVPKS 173
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
+P++ G YWGY VR+A ++S+ + GYD++IGTSEHG V+SS+L LP F+HLL+
Sbjct: 174 EPRDKGHYWGYAVRHASSLSTAISESPFPNGYDYVIGTSEHGVKVSSSELVLPKFKHLLV 233
Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPIS 361
FGG+AGLEES E D ++ + R++F+ Y N CP QGSRTIRTEEAI ISLQY Q+P+
Sbjct: 234 VFGGVAGLEESKELDKTIEVSDVRDLFNIYLNVCPGQGSRTIRTEEAILISLQYLQDPLM 293
Query: 362 RAL 364
RAL
Sbjct: 294 RAL 296
>gi|260800799|ref|XP_002595284.1| hypothetical protein BRAFLDRAFT_232338 [Branchiostoma floridae]
gi|229280529|gb|EEN51296.1| hypothetical protein BRAFLDRAFT_232338 [Branchiostoma floridae]
Length = 327
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 153/307 (49%), Positives = 200/307 (65%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGSI+DN QS EL T LAGQIARA IF +DE+V+FD S+ + +
Sbjct: 15 TVSVALPGSILDNAQSPELRTYLAGQIARACVIFNVDEIVIFDESGSTQPSTEGTFTGVG 74
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + N + R+LQYLE PQYLRK F H L++ G+L PLD PHH+R+HE + +
Sbjct: 75 KKGNANVQ----IARILQYLECPQYLRKVFFPKHQDLQYAGLLNPLDCPHHVREHELSEY 130
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRN-LDADSPRQVVPP 240
REGV L G+ V+VG+ + V +D+VL PG+RVTV + N + +VV P
Sbjct: 131 REGVVLDRPTKEGKGSFVNVGMRREVRIDKVLQPGLRVTVQLDEPGNTVSRVLTGKVVSP 190
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
S P+ E+G+YWGY VR AP++ +VF +KGGYD IGTSE G+ ++ ++ LP+F HL
Sbjct: 191 SVPRTEAGLYWGYTVRLAPSLGAVFTESPFKGGYDLTIGTSERGE--STDEVFLPSFSHL 248
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
LI FGGL GLE S+E D+ L +AR +FH Y NTCP QGSRTIRTEEAI ISL +
Sbjct: 249 LIVFGGLKGLEYSVECDEALDIADARVMFHYYLNTCPTQGSRTIRTEEAILISLSGLRPK 308
Query: 360 ISRALRR 366
I AL R
Sbjct: 309 IHEALIR 315
>gi|193718507|ref|XP_001944777.1| PREDICTED: uncharacterized protein C9orf114 homolog [Acyrthosiphon
pisum]
Length = 383
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 163/353 (46%), Positives = 218/353 (61%), Gaps = 20/353 (5%)
Query: 12 PELKEAATGENHESQNELSLANGDS------SSCDNKKKKKRKRDQLNDDAPIEVPTVSV 65
PE+ E G E +N++ + + + S D K + D++ ++PTV++
Sbjct: 30 PEVIEINEGLETEHENKIKITDPEKVEVIIDDSIDTSKTTEIS-DKIKPSKVRQLPTVTI 88
Query: 66 AVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSD 125
AVPGSI+DN QS E T LAGQIARA I++IDE+VVFD+ N S+ +++
Sbjct: 89 AVPGSILDNAQSPEFRTYLAGQIARAACIYKIDEIVVFDDVGEQTNVMVSN-----KKTE 143
Query: 126 ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGV 185
L +LQYLE PQYLR+ LF +H+ L++ G+L PLDAPHH R+ + PFREGV
Sbjct: 144 GGSKTCKQLAIILQYLECPQYLRRILFPIHNFLKYTGVLNPLDAPHHFRQEDNVPFREGV 203
Query: 186 TLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ---VVPPSK 242
G VDVGL KHV+VD+ L P VRVTV + N + DS RQ VV PS
Sbjct: 204 VSTLPPKKGKGCNVDVGLRKHVLVDKCLQPYVRVTVKLIPNSD---DSKRQKGIVVAPST 260
Query: 243 PK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT-LPTFRHLL 300
PK ESG+YWGY VR A +I+ VF C Y GGYD IGTS+ G ++ +D T L +F+H L
Sbjct: 261 PKNESGIYWGYTVRNAESINEVFTKCPYPGGYDLKIGTSDKGVDIDQTDHTQLSSFKHAL 320
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
I FGG+ GLE ++E D + +N +F+ Y NTCP+QGSRTIRTEEA+FI+L
Sbjct: 321 ICFGGVHGLEAALEADQSIDEENPSTLFNVYLNTCPNQGSRTIRTEEAVFITL 373
>gi|239790536|dbj|BAH71823.1| ACYPI008697 [Acyrthosiphon pisum]
Length = 375
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 163/353 (46%), Positives = 218/353 (61%), Gaps = 20/353 (5%)
Query: 12 PELKEAATGENHESQNELSLANGDS------SSCDNKKKKKRKRDQLNDDAPIEVPTVSV 65
PE+ E G E +N++ + + + S D K + D++ ++PTV++
Sbjct: 22 PEVMEINEGLETEHENKIKITDPEKVKVIIDDSIDTSKTTEIS-DKVKPSKVRQLPTVTI 80
Query: 66 AVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSD 125
AVPGSI+DN QS E T LAGQIARA I++IDE+VVFD+ N S+ +++
Sbjct: 81 AVPGSILDNAQSPEFRTYLAGQIARAACIYKIDEIVVFDDVGEQTNVMISN-----KKTE 135
Query: 126 ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGV 185
L +LQYLE PQYLR+ LF +H+ L++ G+L PLDAPHH R+ + PFREGV
Sbjct: 136 GGSKTCKQLAIILQYLECPQYLRRILFPIHNFLKYTGVLNPLDAPHHFRQEDNVPFREGV 195
Query: 186 TLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ---VVPPSK 242
G VDVGL KHV+VD+ L P VRVTV + N + DS RQ VV PS
Sbjct: 196 VSTLPPKKGKGCNVDVGLRKHVLVDKCLQPYVRVTVKLIPNSD---DSKRQKGIVVAPST 252
Query: 243 PK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT-LPTFRHLL 300
PK ESG+YWGY VR A +I+ VF C Y GGYD IGTS+ G ++ +D T L +F+H L
Sbjct: 253 PKNESGIYWGYTVRNAESINEVFTKCPYPGGYDLKIGTSDKGVDIDQTDHTQLSSFKHAL 312
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
I FGG+ GLE ++E D + +N +F+ Y NTCP+QGSRTIRTEEA+FI+L
Sbjct: 313 ICFGGVHGLEAALEADQSIDEENPFTLFNVYLNTCPNQGSRTIRTEEAVFITL 365
>gi|384245829|gb|EIE19321.1| DUF171-domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 318
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 152/314 (48%), Positives = 193/314 (61%), Gaps = 24/314 (7%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AV S+IDN QS ELAT +AGQIAR IF +DEV+V D+ +D+ +
Sbjct: 4 TLSIAVAASVIDNVQSAELATLVAGQIARTAAIFNVDEVMVIDD---ADDICVGCSGRVG 60
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
GAAFL R+LQ++ETPQYLRKAL MH LR G LPPLDAPHH+R EW PF
Sbjct: 61 K-------GAAFLARVLQFMETPQYLRKALCPMHRDLRLAGQLPPLDAPHHMRASEWRPF 113
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG------------TNRNL 229
REGV ++ G+L+DVGL+K V + L VRVTV MG + +
Sbjct: 114 REGVVVRSAV--GAGSLLDVGLDKVAFVPEALKQSVRVTVEMGEQPRPHRVEGGPLGKGV 171
Query: 230 DADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
D V+P ESG YWGY VR A +S+VF C Y+ GYD +GTSE G + +++
Sbjct: 172 DTLEGSVVLPSDPRAESGTYWGYTVRIASGLSAVFSECPYQEGYDLKVGTSERGSVQSAA 231
Query: 290 DLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
+L LP +H LIAFGG GLEE +++D G +A E+F + NTCP QGSRTIRTEEA+
Sbjct: 232 ELELPPHQHALIAFGGPQGLEECLKKDPGKAAGDAEELFDLWLNTCPGQGSRTIRTEEAL 291
Query: 350 FISLQYFQEPISRA 363
ISL YFQ + A
Sbjct: 292 LISLAYFQPALQSA 305
>gi|299470201|emb|CBN79505.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 577
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 146/319 (45%), Positives = 197/319 (61%), Gaps = 24/319 (7%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS--------DNYS 113
T+S+A+PGS++DN QS EL T L G+IARA +F +DE+VV+D+ S S D+ +
Sbjct: 241 TLSIAIPGSVVDNAQSRELKTYLVGEIARAAAVFEVDEIVVYDDLSVSRKGQGGVRDDSA 300
Query: 114 RSSAANRSNRS-----------DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
+AA NR D+N F+ RLLQY ETPQYLRK LF MHSSLR G
Sbjct: 301 SVAAAGGFNRGGGSANGSGGGGDKNNPNV-FMARLLQYAETPQYLRKQLFPMHSSLRLAG 359
Query: 163 MLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVA 222
+L PLDAPHH+R + PFREGV L G+ VD+G K +D+ LDPGVRVTV
Sbjct: 360 LLNPLDAPHHMRADDACPFREGVVLDRTIKKGAGSFVDIGKRKEARIDRPLDPGVRVTVK 419
Query: 223 MGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSE 281
+ + + VPPS P+E G+YWGY+ R A ++ + C YKGGYD +GTSE
Sbjct: 420 LDKGDPCKKNHKGRAVPPSAPREELGLYWGYQTRLASGMADLLAGCPYKGGYDLTVGTSE 479
Query: 282 HGDI-VNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNARE--VFHSYFNTCPHQ 338
G++ V+S D LP +RH L+ FGG+ G+E +++ D+ L+ + +F + NTCP Q
Sbjct: 480 RGNVTVDSPDFALPEYRHALVVFGGVQGIEATVDADESLQLPGSESSALFDMWVNTCPGQ 539
Query: 339 GSRTIRTEEAIFISLQYFQ 357
GSRTIRTEEA+ I+L +
Sbjct: 540 GSRTIRTEEAVLITLARLR 558
>gi|156368817|ref|XP_001627888.1| predicted protein [Nematostella vectensis]
gi|156214850|gb|EDO35825.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 152/309 (49%), Positives = 197/309 (63%), Gaps = 15/309 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+PGSI+DN QS EL T LAGQIARAV +F +DEV+VF+ S + S+ +N
Sbjct: 5 TVSLALPGSILDNAQSAELRTYLAGQIARAVVVFNVDEVIVFNESRKSLDKSKDGVSNFH 64
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+++D N L R+LQYLE PQYLRKA F H L++ G+L PLD PHH+R + P+
Sbjct: 65 SKADANLQ----LARILQYLECPQYLRKAFFPRHKDLQYAGLLNPLDCPHHMRIDDDMPY 120
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM-----GTNRNLDADSPRQ 236
REGV L S G+LV+ G+ K V +D+ + PG+RVTV + N+ + +
Sbjct: 121 REGVVLDRPVGQSGGSLVNAGMRKEVKIDKCIKPGIRVTVKLKPTSKSGNKQIFFGN--- 177
Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
VV PS P+ ++G+YWGY VR AP+ S VF Y GYD +IGTSE G V+ L L
Sbjct: 178 VVSPSAPRVDAGLYWGYSVRLAPSFSDVFTESPYPQGYDVMIGTSERGAPVDC--LELQP 235
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
F H+LI FGGL GLE S+E D+ L + E+FH Y NTCP QGSRTIRTEEAI IS+
Sbjct: 236 FSHMLIVFGGLKGLEASLESDESLDVSDPSELFHHYVNTCPGQGSRTIRTEEAILISMAA 295
Query: 356 FQEPISRAL 364
+ I R L
Sbjct: 296 LKPVIERTL 304
>gi|340377485|ref|XP_003387260.1| PREDICTED: uncharacterized protein C9orf114 homolog [Amphimedon
queenslandica]
Length = 445
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 146/293 (49%), Positives = 194/293 (66%), Gaps = 16/293 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+PGSI++N QS +L T LAGQIAR IF ++E+V+F + + +
Sbjct: 157 TVSIALPGSILNNVQSFQLKTYLAGQIARIAAIFCVNEIVIFKEEGTD--------TRKG 208
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ +D + F+ R+LQY+ETPQYLRKAL+ MH L++ G+L PLDAPHH+R HE +PF
Sbjct: 209 SETDPD----VFMCRILQYMETPQYLRKALYGMHPDLKYAGVLNPLDAPHHMRAHEDSPF 264
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG+ L S ++VD GLN +V V++ L+ GVRVTV M + ++V PS
Sbjct: 265 REGIVLSSTDDGS--SMVDCGLNNNVTVNKKLETGVRVTVKMKKRSESGNNLRGKIVAPS 322
Query: 242 KPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
P S G+YWGY+VRYA + VF + SYK YD +IGTSE GD ++S L P ++HLL
Sbjct: 323 IPLTSKGIYWGYQVRYARSFGEVFTSSSYKESYDVIIGTSERGDPIDSV-LPFPQYKHLL 381
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
I FGG GLE S+E+D L+ + R +FH Y NTC HQGSRTIRTEEAI ISL
Sbjct: 382 IVFGGQHGLEYSLEKDRTLEVDDVRSLFHFYLNTCTHQGSRTIRTEEAITISL 434
>gi|242018127|ref|XP_002429532.1| protein C9orf114, putative [Pediculus humanus corporis]
gi|212514480|gb|EEB16794.1| protein C9orf114, putative [Pediculus humanus corporis]
Length = 371
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 152/305 (49%), Positives = 195/305 (63%), Gaps = 12/305 (3%)
Query: 53 NDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNY 112
+D P E+ T+S+AVPGSI+ N QS EL +AGQIARA I++I+EV+VFD D
Sbjct: 62 SDCKPCELITMSIAVPGSILQNAQSNELRAYVAGQIARAACIYKINEVIVFD-----DAE 116
Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH 172
A+N + ++ F RLLQYLE PQYLRK F +HS L++ G++ PLDAPHH
Sbjct: 117 EEPLASNEKSLVKCSKYCLQF-GRLLQYLECPQYLRKYFFPLHSDLQYAGIMNPLDAPHH 175
Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
LR+ E FREGVTL + N GT V+VGL K V+V + L PG+RVTV + +
Sbjct: 176 LRQDENNDFREGVTLDKPVKNDKGTFVEVGLTKQVLVYKHLQPGLRVTVKLNPPKENSKK 235
Query: 233 SPRQVVPPSKP-KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG---DIVNS 288
++V P+ P +S +YWGY VR A +IS VF NC YKGGYD IGTS+ G D +
Sbjct: 236 LKGEIVSPNFPLLDSNLYWGYTVRIAHSISEVFLNCPYKGGYDLSIGTSDKGKNIDEIKK 295
Query: 289 SDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
DL F+H LI FGGL GLE +E+D L +N +F Y NTCP+QGSRTIRTEEA
Sbjct: 296 KDLR--NFKHGLIVFGGLQGLENIVEKDTTLNVENVSHLFDFYINTCPNQGSRTIRTEEA 353
Query: 349 IFISL 353
+FI+L
Sbjct: 354 VFITL 358
>gi|52219102|ref|NP_001004627.1| uncharacterized protein LOC447888 [Danio rerio]
gi|51858842|gb|AAH81398.1| Zgc:101748 [Danio rerio]
Length = 381
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 147/302 (48%), Positives = 190/302 (62%), Gaps = 8/302 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGS++DN QS EL T LAGQIARA +F +DEVV+FD ++S
Sbjct: 77 TVSVALPGSVMDNAQSPELRTYLAGQIARACAVFCVDEVVIFDELGED---AKSVEGEFK 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ +S L R+LQYLE PQYLRK F H L+F G+L PLD+PHH+R E A F
Sbjct: 134 GVGKKGQSCVQ-LARILQYLECPQYLRKFFFPKHQDLQFAGLLNPLDSPHHMRIDEEAEF 192
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
REGV L G+ V+ G+ K V +D+ L G+RVTV + ++N D + VV P
Sbjct: 193 REGVVLDRPVKAGKGSFVNCGMRKEVQIDKQLKTGLRVTVQLNESQNKDGRVYKGVVVAP 252
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ E+G+YWGY VR A +S+VF C YK GYD IGTSE G N+ +TLP FRH+
Sbjct: 253 HIPRTEAGLYWGYSVRLASCLSAVFSECPYKEGYDLTIGTSEKGQ--NTDHVTLPAFRHM 310
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE S++ D+ L + +F Y NTCP QGSRTIRTEEA+ ISL ++
Sbjct: 311 LVVFGGLQGLEASVDADENLTISDPSVLFDLYLNTCPAQGSRTIRTEEALLISLSGLRQK 370
Query: 360 IS 361
I+
Sbjct: 371 IA 372
>gi|322784360|gb|EFZ11331.1| hypothetical protein SINV_00732 [Solenopsis invicta]
Length = 369
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 142/300 (47%), Positives = 189/300 (63%), Gaps = 10/300 (3%)
Query: 59 EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
+V T+S+AVPGSI+DN QS EL T LAGQIARA I+R+DE+VVFD++ +
Sbjct: 62 DVCTISIAVPGSILDNAQSPELRTYLAGQIARAACIYRVDEIVVFDDRGEV-----TEDE 116
Query: 119 NRSNRSD----ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
R R D E G L R+LQYLE PQYLRK F +H L++ G+L PLDAPHHLR
Sbjct: 117 KRKVRKDDVLGEGRVGCLQLARILQYLECPQYLRKYFFPIHKDLQYAGLLNPLDAPHHLR 176
Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
+ + + +REG+ + G+ V+VGL + VD+VL PG+RVTV + +
Sbjct: 177 QEDESLYREGIVTNKPIKVGKGSQVNVGLLNEIHVDKVLMPGIRVTVKIPAGQPNPKKLK 236
Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
VVPP+ P+ E+G+YWGY V+ N++ V NC YKGGYD IGTS+ G +++ +
Sbjct: 237 GIVVPPNVPRAETGIYWGYTVKLVKNLTEVLTNCQYKGGYDLTIGTSDKGTLIDEIEARS 296
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+RH LI FGGL GLE +++ D L + VFH Y N CP QGSRTIRTEEAI ++L
Sbjct: 297 LKYRHCLIVFGGLRGLEAAVDVDPHLNIDDPSLVFHKYVNMCPQQGSRTIRTEEAILLTL 356
>gi|307181843|gb|EFN69283.1| Uncharacterized protein C9orf114-like protein [Camponotus
floridanus]
Length = 368
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 138/296 (46%), Positives = 189/296 (63%), Gaps = 2/296 (0%)
Query: 59 EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
++ TVS+AVPGSI+DN QS EL T LAGQIARA I+R+DE++VFD+K R
Sbjct: 60 DICTVSIAVPGSILDNAQSPELRTYLAGQIARAACIYRVDEIIVFDDKGDVTESERKKI- 118
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
+ + E G L R+LQYLE PQYLRK F +H L++ G+L PLDAPHHLR+ +
Sbjct: 119 RKDDALGEARLGCLQLARILQYLECPQYLRKYFFPIHKDLQYAGVLNPLDAPHHLRQMDE 178
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
+ +REG+ + G+ V+VGL V VD+VL PG+RVTV + + +V
Sbjct: 179 SLYREGIVTNKPIKVGKGSQVNVGLLNEVHVDKVLMPGLRVTVKIPMEQPNSKKLKGIIV 238
Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
PP+ P+ E+G+YWGY V+ +++ V NC YKGGYD IGTS+ G +++ + +R
Sbjct: 239 PPNVPRAEAGIYWGYTVKLVKHLTEVLTNCQYKGGYDLTIGTSDKGTLIDEIEAKSLEYR 298
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
H LI FGG+ GLE +++ D L + +FH Y NTCP QGSRTIRTEEAI ++L
Sbjct: 299 HCLIVFGGVTGLEAAVDVDPNLDVDDPSLLFHKYVNTCPQQGSRTIRTEEAILLTL 354
>gi|157128149|ref|XP_001661329.1| hypothetical protein AaeL_AAEL011052 [Aedes aegypti]
gi|108872678|gb|EAT36903.1| AAEL011052-PA [Aedes aegypti]
Length = 438
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 197/315 (62%), Gaps = 3/315 (0%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-RSSAA 118
V T+S+AVPGSI+DN QS EL T LAGQIARA IF++DEVVVFD+ + + + +
Sbjct: 69 VSTLSIAVPGSILDNAQSPELRTYLAGQIARAACIFQVDEVVVFDDCGTGNQATEKLGTV 128
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
+ S S L R+LQYLE PQYLRK F +H+ L+F G+L PLD+ HHLR+
Sbjct: 129 DTSEGSTSARRCCVQLARILQYLECPQYLRKFFFPIHNDLKFCGLLSPLDSQHHLRQASE 188
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QV 237
FREG+ + N + V+VGL V+VD LDPGVRVTV M +L + R +V
Sbjct: 189 FQFREGIVTNKPTKNDKSSFVNVGLLNDVLVDTHLDPGVRVTVKMPPGVDLKSKKIRAKV 248
Query: 238 VPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
VPP++P+ E+G+YWGY VR A ++S VF YKGGYD IGTS+ G V + +
Sbjct: 249 VPPAQPRQETGIYWGYTVRIANSLSQVFTKSPYKGGYDMTIGTSDTGTNVQELEPQSLKY 308
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
+H L+ FGG+ GLE +++ D+ L + ++F Y NT P QGSRT+RTEEAI ISL
Sbjct: 309 KHALVVFGGVLGLEPALDNDNKLNVDSVEQLFDQYLNTVPRQGSRTVRTEEAILISLAAL 368
Query: 357 QEPISRALRRVEYKS 371
E + ++ E+ S
Sbjct: 369 GEKLDPTVKPKEFTS 383
>gi|148236641|ref|NP_001084926.1| uncharacterized protein LOC431981 [Xenopus laevis]
gi|47123001|gb|AAH70663.1| MGC82267 protein [Xenopus laevis]
Length = 375
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 147/306 (48%), Positives = 189/306 (61%), Gaps = 7/306 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGSI+DN QS EL T LAGQIARA IF IDE+V+FD S+S +
Sbjct: 72 TVSVALPGSIMDNAQSPELRTYLAGQIARACAIFCIDEIVIFDETGEG---SKSVEGDFE 128
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ ++ L R+LQYLE PQYLRK+ F H L+F G+L PLD+PHH+R E + +
Sbjct: 129 GVGKKGQACVQ-LARILQYLECPQYLRKSFFPKHPDLQFAGLLNPLDSPHHVRMDEESEY 187
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGV L +S G+ V+ G+ K V +D+ L GVRVTV + VV P
Sbjct: 188 REGVVLDRPTKSSKGSFVNCGMRKEVQIDKQLQAGVRVTVHLQQENVEQRVQKGIVVSPQ 247
Query: 242 KPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
P+ E G+YWGY+VR A +S+VF C +K GYD +GTSEHG N ++LP FRH +
Sbjct: 248 HPRTEGGLYWGYRVRLASCLSAVFTECPFKDGYDLTMGTSEHGS--NVETVSLPDFRHAV 305
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
I FGGL GLE S+E D L +F+ Y NTCP QGSRTIRTEEAI ISL + +
Sbjct: 306 IIFGGLKGLEASVESDQNLDIAEPSILFNHYLNTCPDQGSRTIRTEEAILISLAALRPRL 365
Query: 361 SRALRR 366
A+++
Sbjct: 366 QAAIKK 371
>gi|156545293|ref|XP_001605386.1| PREDICTED: uncharacterized protein C9orf114 homolog [Nasonia
vitripennis]
Length = 404
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 146/298 (48%), Positives = 192/298 (64%), Gaps = 6/298 (2%)
Query: 59 EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
+V TVS+AVPGSI+DN QS EL T +AGQIARA I+++DEV+VFD+K +
Sbjct: 60 KVSTVSIAVPGSILDNAQSHELRTYVAGQIARAACIYKVDEVIVFDDKGEITEDEKKKL- 118
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
R E G L R+LQYLE PQYLRK F +H L++ G+L PLDAPHHLR+ +
Sbjct: 119 RRDEVLGEARPGCLQLARILQYLECPQYLRKYFFPIHKDLQYAGLLNPLDAPHHLRQQDA 178
Query: 179 APFREGVTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
+ +REGV N P VG +L++VGL V VD+VL G+RVTV + ++
Sbjct: 179 SLYREGVV--SNKPLKVGKGSLINVGLLTDVRVDKVLTSGLRVTVKIPESQENPKKPKGI 236
Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
+VPPS P+ ++G+YWGY V+ A ++S +F C Y GGYD IGTS+ G V+
Sbjct: 237 IVPPSLPRADTGIYWGYSVKLASSLSDIFAKCPYPGGYDVSIGTSDKGTSVDDIAPKSLK 296
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
++H LI FGGLAG+EE+IE D L +A VF+ Y NTCP QGSRTIRTEEAI ++L
Sbjct: 297 YKHALIVFGGLAGIEEAIEADSNLDIDDASLVFNMYLNTCPQQGSRTIRTEEAILLTL 354
>gi|259155393|ref|NP_001158758.1| CI114 protein [Salmo salar]
gi|223647122|gb|ACN10319.1| C9orf114 [Salmo salar]
gi|223672995|gb|ACN12679.1| C9orf114 [Salmo salar]
Length = 380
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 142/305 (46%), Positives = 195/305 (63%), Gaps = 8/305 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGS++DN QS EL T LAGQIARA +F +DE++VFD + + +
Sbjct: 75 TVSVALPGSVLDNAQSTELRTYLAGQIARACVVFCVDEIIVFDEQ----DEDVKTVEGEF 130
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
N + L R+LQ+LE PQYLRK+ F MH L++ G+L PLD+PHH+RK + +
Sbjct: 131 NGVGKKGQACIQLARILQFLECPQYLRKSFFPMHKDLQYAGLLNPLDSPHHMRKDDECEY 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
REGV L + G+LV+ G+ K V +D+ L G+RVTV M +N D + VV P
Sbjct: 191 REGVVLDRPSKPGKGSLVNCGMWKEVQIDKQLQSGLRVTVHMNKIQNKDGRLHKGVVVAP 250
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
KP+ E G+YWGY VR A ++S+VF C +K GYD IGTSE G V+++ L+ F+H+
Sbjct: 251 HKPRTEGGLYWGYSVRLASSLSNVFTECPHKEGYDLTIGTSEKGVDVDTTSLS--PFKHM 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE S++ D L+ + +F Y NTCP+QGSRTIRTEEAI IS+ ++
Sbjct: 309 LVVFGGLQGLEASVDADQNLEVTDPGVLFDLYLNTCPNQGSRTIRTEEAILISMSSLRQK 368
Query: 360 ISRAL 364
I+ A
Sbjct: 369 ITAAF 373
>gi|348664786|gb|EGZ04626.1| hypothetical protein PHYSODRAFT_551704 [Phytophthora sojae]
Length = 301
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 204/306 (66%), Gaps = 12/306 (3%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
+A+PGSI+DN Q+ EL T LAGQ+ARA IF++DE+VVFD++ + S S+ A +
Sbjct: 1 MAIPGSILDNAQTKELRTYLAGQVARAAVIFQVDEIVVFDDQLGN---SVSTGAPKKRAH 57
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREG 184
D + F+ R+LQYLETPQYLR+ALF MHS L G+L PLD PHHLR EW+ +REG
Sbjct: 58 DCH----VFMARILQYLETPQYLRRALFPMHSDLSCAGLLNPLDCPHHLRAQEWSVYREG 113
Query: 185 VTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPRQVVPPSKP 243
V + G+ V VGL + VVVD+ + PG+RVTV + +++ ++V ++P
Sbjct: 114 VVTDRPLKDKEGSHVYVGLQREVVVDKRITPGIRVTVKIDEASKDFKKKMVGKLVSSAEP 173
Query: 244 KES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI-VNSSDLTLPTFRHLLI 301
+E G+YWGY R+AP++S+V+ +C + GGYD +GTSE G++ V+ +LP FRH LI
Sbjct: 174 REHLGVYWGYTTRFAPSLSNVWSDCPFPGGYDLKVGTSERGNVSVDDPSFSLPKFRHALI 233
Query: 302 AFGGLAGLEESIEEDDG--LKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
FGG+AG+EE ++ D+ + +++ +F + NTCP QGSRTIR+EEA+ ISL +
Sbjct: 234 VFGGVAGIEECVDADEKVPVSGEDSNTLFDMWVNTCPEQGSRTIRSEEALLISLAALRPH 293
Query: 360 ISRALR 365
+ R+ R
Sbjct: 294 VKRSGR 299
>gi|158285430|ref|XP_001687890.1| AGAP007570-PA [Anopheles gambiae str. PEST]
gi|157019988|gb|EDO64539.1| AGAP007570-PA [Anopheles gambiae str. PEST]
Length = 439
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 147/298 (49%), Positives = 190/298 (63%), Gaps = 4/298 (1%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
V TVS+AVPGSI+DN QS EL T LAGQIARA IF++DEV+VFD+ SS+ +
Sbjct: 61 VSTVSIAVPGSIMDNAQSPELRTYLAGQIARAACIFQVDEVIVFDDCGSSNQVTDKLGTI 120
Query: 120 RSNRSDENESGAAF-LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
++N + L R+LQYLE PQYLRK F +H+ L+F G+L PLD+ HHLR+
Sbjct: 121 QTNEGSTSARRCCIQLARILQYLECPQYLRKYFFPLHNDLKFTGLLNPLDSQHHLRQQSE 180
Query: 179 APFREG-VTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-Q 236
FREG VT K N G V+VGL V+VD LDP +RVTV + +L + R +
Sbjct: 181 FVFREGIVTTKPTKGNKPGAFVNVGLLNDVLVDTKLDPNLRVTVKLPAGADLKSKKLRGK 240
Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
VVPPS+P+ E+G+YWGY VR A ++S VF YKGGYD IGTS+ G V+ +
Sbjct: 241 VVPPSQPRQETGIYWGYTVRIANSLSQVFTKSPYKGGYDMTIGTSDRGKNVHELQPASLS 300
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
++H LI FGG+ GLE +++ D L ++F Y NT P QGSRTIRTEEAI IS+
Sbjct: 301 YQHALIVFGGVLGLEPALDNDQKLNVDTVEDLFEEYLNTVPTQGSRTIRTEEAILISM 358
>gi|62859635|ref|NP_001016729.1| uncharacterized protein LOC549483 [Xenopus (Silurana) tropicalis]
gi|115530318|emb|CAL49394.1| novel protein [Xenopus (Silurana) tropicalis]
Length = 318
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 148/307 (48%), Positives = 192/307 (62%), Gaps = 9/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGSI+DN QS EL T LAGQIARA IF IDE+V+FD S+S N
Sbjct: 15 TVSVALPGSIMDNAQSPELRTYLAGQIARACAIFCIDEIVIFDETGEG---SKSVEGNFE 71
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ ++ L R+LQYLE PQYLRK+ F H L+F G+L PLD+PHH+R E + +
Sbjct: 72 GVGKKGQA-CVQLARILQYLECPQYLRKSFFPKHPDLQFAGLLNPLDSPHHVRIDEESEY 130
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L + G+ V+ G+ K V +D+ L GVRVTV + N++ + VV P
Sbjct: 131 REGVVLDRPTKSGKGSFVNCGMRKEVQIDRQLQTGVRVTVKL-QQENVELKVQKGTVVSP 189
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ SG+YWGY+VR A +S+VF C +K GYD IGTSE G V + ++LP FRH
Sbjct: 190 QHPRTASGLYWGYRVRLASCLSAVFTECPFKDGYDLTIGTSERGSSVET--VSLPDFRHA 247
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
+I FGGL GLE S+E D L +F+ Y NTCP QGSRTIRTEEAI ISL +
Sbjct: 248 MIVFGGLKGLEASVESDQNLDIAEPSILFNHYLNTCPDQGSRTIRTEEAILISLAALKAR 307
Query: 360 ISRALRR 366
+ A+++
Sbjct: 308 LQAAIKK 314
>gi|427779265|gb|JAA55084.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 376
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 145/299 (48%), Positives = 192/299 (64%), Gaps = 8/299 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN-R 120
T+S+AVPGSI+DN Q+ EL T LAGQIARA +F +DE+VV+D+ S+ + +A +
Sbjct: 70 TLSIAVPGSIMDNAQNAELRTYLAGQIARAAVVFCVDEIVVYDDDGSTIRGANGTAVDGE 129
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+ ++ G L R+LQYLE PQYLRK LF +H L+F G+L PLD PHHLR E +P
Sbjct: 130 FSGLAKSGQGVVQLARVLQYLECPQYLRKHLFPLHRDLQFAGLLNPLDCPHHLRATEESP 189
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVP 239
+REG+ + G+ V+ GLNK V +D+ L PGVRVTV M + R V
Sbjct: 190 YREGIVARLPVKEGSGSYVNCGLNKEVKIDRCLQPGVRVTVKM--KEGVPGKKLRGTAVS 247
Query: 240 PSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP-TFR 297
PS P+ E+G+YWGY+VR A ++ +V C + GGYD +GTSE G V+ +L LP F
Sbjct: 248 PSCPRTEAGLYWGYEVRVARSLGAVLAECPFPGGYDITLGTSERGTPVD--ELELPHRFE 305
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
H L+ FGGL GLE ++E D+ L + +F+ Y NTCP QGSRTIRTEEAI ISL F
Sbjct: 306 HALVVFGGLKGLEAALEADEALDVDDPEYLFNHYLNTCPSQGSRTIRTEEAILISLSXF 364
>gi|427784287|gb|JAA57595.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 381
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 146/306 (47%), Positives = 193/306 (63%), Gaps = 8/306 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AVPGSI+DN Q+ EL T LAGQIARA +F +DE+VV+D+ S+ + +A +
Sbjct: 70 TLSIAVPGSIMDNAQNAELRTYLAGQIARAAVVFCVDEIVVYDDDGSTIRGANGTAVDGE 129
Query: 122 NRS-DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
++ G L R+LQYLE PQYLRK LF +H L+F G+L PLD PHHLR E +P
Sbjct: 130 FSGLAKSGQGVVQLARVLQYLECPQYLRKHLFPLHRDLQFAGLLNPLDCPHHLRATEESP 189
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVP 239
+REG+ + G+ V+ GLNK V +D+ L PGVRVTV M + R V
Sbjct: 190 YREGIVARLPVKEGSGSYVNCGLNKEVKIDRCLQPGVRVTVKM--KEGVPGKKLRGTAVS 247
Query: 240 PSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP-TFR 297
PS P+ E+G+YWGY+VR A ++ +V C + GGYD +GTSE G V+ +L LP F
Sbjct: 248 PSCPRTEAGLYWGYEVRVARSLGAVLAECPFPGGYDITLGTSERGTPVD--ELELPHRFE 305
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
H L+ FGGL GLE ++E D+ L + +F+ Y NTCP QGSRTIRTEEAI ISL +
Sbjct: 306 HALVVFGGLKGLEAALEADEALDVDDPEYLFNHYLNTCPSQGSRTIRTEEAILISLSALR 365
Query: 358 EPISRA 363
I A
Sbjct: 366 PKIVAA 371
>gi|410979354|ref|XP_003996050.1| PREDICTED: uncharacterized protein C9orf114 homolog [Felis catus]
Length = 398
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 194/318 (61%), Gaps = 8/318 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRALRRVEYKSKPEDGK 377
+++A R Y SK GK
Sbjct: 368 LTQAGARQAYDSKAAAGK 385
>gi|328784069|ref|XP_003250387.1| PREDICTED: uncharacterized protein C9orf114 homolog [Apis
mellifera]
Length = 378
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 143/319 (44%), Positives = 198/319 (62%), Gaps = 9/319 (2%)
Query: 43 KKKKRKRDQLNDDAPIE-------VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIF 95
K++ +K+ Q PIE + T+S+AVPGSI+DN QS EL T LAGQIARA ++
Sbjct: 36 KEETKKKLQKEVVKPIEKHFEKKDISTISIAVPGSILDNAQSPELRTYLAGQIARAACVY 95
Query: 96 RIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMH 155
+I+E++VFD+K + N E+E G L R+LQYLE PQYLRK F +H
Sbjct: 96 KINEIIVFDDKGEIIESEKKKIKNDEILG-EDELGCLQLARILQYLECPQYLRKYFFPIH 154
Query: 156 SSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDP 215
L++ G+L P+DAPHHLR+ + + +REGV + + G+ V+VGL + VD+VL
Sbjct: 155 KDLQYAGILNPVDAPHHLRQQDISLYREGVVTNKPVKSGKGSHVNVGLLNDICVDKVLTA 214
Query: 216 GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYD 274
G+RVTV + ++ +VPP P+ ++G+YWGY VR A N++ V C YK GYD
Sbjct: 215 GLRVTVKIPEEQSNLKKLKGFIVPPDIPRSDTGIYWGYTVRLANNLTEVLTQCPYKSGYD 274
Query: 275 HLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNT 334
IGTS+ G ++ + + H LI FGGL+GLE +++ D L +A VFH Y NT
Sbjct: 275 LTIGTSDKGKSIDDIESKGIEYHHALIVFGGLSGLEATVDCDPNLDVDDASLVFHQYLNT 334
Query: 335 CPHQGSRTIRTEEAIFISL 353
CP QGSRTIRTEEAI I+L
Sbjct: 335 CPQQGSRTIRTEEAILITL 353
>gi|112419371|gb|AAI21999.1| hypothetical protein LOC549483 [Xenopus (Silurana) tropicalis]
Length = 318
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 147/307 (47%), Positives = 191/307 (62%), Gaps = 9/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV +PGSI+DN QS EL T LAGQIARA IF IDE+V+FD S+S N
Sbjct: 15 TVSVVLPGSIMDNAQSPELRTYLAGQIARACAIFCIDEIVIFDETGEG---SKSVEGNFE 71
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ ++ L R+LQYLE PQYLRK+ F H L+F G+L PLD+PHH+R E + +
Sbjct: 72 GVGKKGQA-CVQLARILQYLECPQYLRKSFFPKHPDLQFAGLLNPLDSPHHVRIDEESEY 130
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L + G+ V+ G+ K V +D+ L GVRVTV + N++ + VV P
Sbjct: 131 REGVVLDRPTKSGKGSFVNCGMRKEVQIDRQLQTGVRVTVKL-QQENVELKVQKGTVVSP 189
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ SG+YWGY+VR A +S+VF C +K GYD IGTSE G V + ++LP FRH
Sbjct: 190 QHPRTASGLYWGYRVRLASCLSAVFTECPFKDGYDLTIGTSERGSSVET--VSLPDFRHA 247
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
+I FGGL GLE S+E D L +F+ Y NTCP QGSRTIRTEEAI ISL +
Sbjct: 248 MIVFGGLKGLEASVESDQNLDIAEPSILFNHYLNTCPDQGSRTIRTEEAILISLAALKPR 307
Query: 360 ISRALRR 366
+ A+++
Sbjct: 308 LQAAIKK 314
>gi|312381995|gb|EFR27594.1| hypothetical protein AND_05627 [Anopheles darlingi]
Length = 430
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 193/307 (62%), Gaps = 3/307 (0%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-RSSAA 118
V T+S+AVPGSI+DN QS EL T LAGQIARA IF+IDEV+VFD+ S+ + R S
Sbjct: 61 VSTISLAVPGSIMDNAQSPELRTYLAGQIARAACIFQIDEVIVFDDCGPSNQVTDRLSTI 120
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
+ S L R+LQYLE PQYLRK F +H+ L+F G+L PLD+ HHLR+
Sbjct: 121 ETAEGSTSARRCCIQLARILQYLECPQYLRKYFFPIHNDLKFCGLLNPLDSQHHLRQQSE 180
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QV 237
FREG+ + + G+ V+VGL V +D L+P VRVTV + +L + R +V
Sbjct: 181 FVFREGIVTSKPTKHGKGSYVNVGLLNDVRIDTQLEPNVRVTVKLPDGADLKSKKIRAKV 240
Query: 238 VPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
VPPS+P+ E+G+YWGY VR A ++S VF YKGGYD IGTS+ G V+ ++
Sbjct: 241 VPPSQPRQETGIYWGYSVRIANSLSQVFTKSPYKGGYDLTIGTSDRGTNVHEVAAKSLSY 300
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
RH L+ FGG+ GLE +++ D L + ++F Y NT P QGSRT+RTEEAI IS+
Sbjct: 301 RHALVVFGGVLGLEPALDNDTKLTVETVEDLFDEYLNTVPAQGSRTVRTEEAILISMAAL 360
Query: 357 QEPISRA 363
E +S A
Sbjct: 361 GEKVSPA 367
>gi|344271297|ref|XP_003407476.1| PREDICTED: uncharacterized protein C9orf114-like [Loxodonta
africana]
Length = 376
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 189/304 (62%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDPKTVEGEFMGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L G+ V+ G+ K V +D+ L+PG+RVTV + + L++ + R +VV
Sbjct: 190 REGVVLDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQKQLLESKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF +K GYD IGTSE G V S+ LP+FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFKDGYDLTIGTSERGSDVASAQ--LPSFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP+QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLEGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+++A
Sbjct: 368 LTQA 371
>gi|291415402|ref|XP_002723940.1| PREDICTED: CG12128-like [Oryctolagus cuniculus]
Length = 374
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 187/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DEVVVFD +
Sbjct: 72 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEVVVFDEEGQDAKTVEGEFKGVG 131
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E A F
Sbjct: 132 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEEAEF 187
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + + + R +VV
Sbjct: 188 REGVVVDRPTRPGHGSFVNCGMKKEVKIDRNLEPGLRVTVQLSQKQLPECKTYRGKVVSA 247
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 248 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSEVASAQ--LPSFRHA 305
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE +++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 306 LVVFGGLQGLEAAVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 365
Query: 360 ISRA 363
++RA
Sbjct: 366 LARA 369
>gi|339242005|ref|XP_003376928.1| conserved hypothetical protein [Trichinella spiralis]
gi|316974332|gb|EFV57827.1| conserved hypothetical protein [Trichinella spiralis]
Length = 770
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 134/294 (45%), Positives = 187/294 (63%), Gaps = 7/294 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+PGSI++N QS EL T LAGQIARA +IF +DE+++FD ++ + N
Sbjct: 184 TVSIAIPGSIVENAQSFELRTYLAGQIARAASIFEVDEIIIFDETATMTSKEVEMFENGE 243
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW-AP 180
R+ + FLVR+L+Y+E PQYLRK LF + S ++VG+L PLD HH R E P
Sbjct: 244 LRAF---NCNCFLVRILRYIECPQYLRKHLFPLCSDFKYVGLLNPLDCFHHFRMDEVEIP 300
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
+RE V + + + G+ + GL K V +D+ L PG+RVTV + +VV P
Sbjct: 301 YREAVVVNKPVKDGRGSFCNAGLAKDVEIDRALKPGIRVTVKFENSDQKQKRLVAKVVNP 360
Query: 241 SKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
++P+ + G YWGY VR A ++ SVF +C Y GYD ++GTS+ G N ++ LP+F+HL
Sbjct: 361 TEPRTAMGYYWGYNVRLAKSLKSVFSDCPYVEGYDLIVGTSDKG--TNVDNVQLPSFKHL 418
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
LI FGG+ G+E IE D L+ + +F Y NTCP+QGSRTIRTEEAI I+L
Sbjct: 419 LIVFGGVKGIEACIENDQHLQENDPESLFQFYLNTCPNQGSRTIRTEEAILITL 472
>gi|325182840|emb|CCA17295.1| conserved unknown protein putative [Albugo laibachii Nc14]
Length = 314
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 193/306 (63%), Gaps = 6/306 (1%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TV++A+PGSIIDN Q+ EL T LAGQIARA IF++DE+V++D+ + S A+
Sbjct: 3 TVTIALPGSIIDNAQTKELKTYLAGQIARAAVIFQVDEIVIYDDHLGQKTKNASQASETL 62
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
R + FL+R+LQYLETPQYLRK LF H+ L G+L PLD PHH+R E + +
Sbjct: 63 QR--DAYDCHVFLMRILQYLETPQYLRKQLFPKHTDLSCAGLLNPLDTPHHVRSSESSLY 120
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGV + G+ V VGL K V VD+ + PG+RVTV + N +V PS
Sbjct: 121 REGVVSDRMLKDKEGSYVYVGLPKEVRVDKRIHPGIRVTVKLDQTTNNSKKLLGTLVSPS 180
Query: 242 KPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD-IVNSSDLTLPTFRHL 299
+P++ +G+YWGY R+AP++ V++ C Y+ GYD IGTSE G+ N +LP F+H
Sbjct: 181 EPRQKTGIYWGYTCRFAPSLRHVWQECPYESGYDLKIGTSERGNQSTNDPAFSLPAFKHG 240
Query: 300 LIAFGGLAGLEESIEEDD--GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
LI FGG+AG+EE + D+ L + + ++F + NTCP QGSRTIRTEEA+ ISL
Sbjct: 241 LIVFGGVAGIEECVGNDETISLSSEESNKLFDMWVNTCPSQGSRTIRTEEAVLISLSALM 300
Query: 358 EPISRA 363
+S +
Sbjct: 301 PHVSMS 306
>gi|326930329|ref|XP_003211300.1| PREDICTED: uncharacterized protein C9orf114 homolog [Meleagris
gallopavo]
Length = 385
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 143/306 (46%), Positives = 187/306 (61%), Gaps = 8/306 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI++N QSLEL T LAGQIARA IF +DE+VVFD
Sbjct: 75 TLSVALPGSILNNAQSLELRTYLAGQIARACAIFCVDEIVVFDESGEDAKSVEGEFEGIG 134
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
R L R+LQYLE PQYLRK+ F H L+F G+L PLD+PHH+R E + +
Sbjct: 135 KRG----KACVQLARILQYLECPQYLRKSFFPKHGDLQFAGLLNPLDSPHHMRVDEDSEY 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L G+ V+ G+ K V +D+ L PG+RVTV + +N + + VV
Sbjct: 191 REGVVLDRPTKPGRGSFVNCGMKKEVQIDKQLKPGLRVTVRLQEPQNPETKVRKGTVVSS 250
Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ SG+YWGY VR A +S+VF C +K GYD IGTSE G+ V+ +TLP+FRH+
Sbjct: 251 HHPRTVSGLYWGYSVRLASCLSAVFSECPFKEGYDLSIGTSERGNSVDQ--VTLPSFRHI 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ + +F Y NTCP QGSRTIRTEEA+ ISL +
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVTDPSVLFDFYLNTCPSQGSRTIRTEEALLISLSALRPH 368
Query: 360 ISRALR 365
I A++
Sbjct: 369 IDEAVK 374
>gi|363740343|ref|XP_003642310.1| PREDICTED: uncharacterized protein C9orf114 homolog [Gallus gallus]
gi|363740439|ref|XP_003642329.1| PREDICTED: uncharacterized protein C9orf114 homolog [Gallus gallus]
Length = 379
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 143/306 (46%), Positives = 188/306 (61%), Gaps = 8/306 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI++N QSLEL T LAGQIARA IF +DE+VVFD
Sbjct: 75 TLSVALPGSILNNAQSLELRTYLAGQIARACAIFCVDEIVVFDEHGEDAKSVEGEFEGIG 134
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
R L R+LQYLE PQYLRK+ F H L+F G+L PLD+PHH+R E + +
Sbjct: 135 KRG----KACVQLARILQYLECPQYLRKSFFPKHGDLQFAGLLNPLDSPHHMRVDEDSEY 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L + G+ V+ G+ K V +D+ L PG+RVTV + +N + + VV
Sbjct: 191 REGVVLDRPSKPGRGSFVNCGMKKEVQIDKQLKPGLRVTVRLQEPQNPETKVRKGTVVSS 250
Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ SG+YWGY VR A +S+VF C +K GYD IGTSE G+ V+ +TLP+FRH+
Sbjct: 251 HHPRTVSGLYWGYSVRLASCLSAVFSECPFKEGYDVSIGTSERGNSVDQ--VTLPSFRHM 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ + +F Y NTCP QGSRTIRTEEA+ ISL +
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVTDPSVLFDFYLNTCPSQGSRTIRTEEALLISLSALRPH 368
Query: 360 ISRALR 365
I A++
Sbjct: 369 IDEAVK 374
>gi|345805985|ref|XP_851147.2| PREDICTED: uncharacterized protein C9orf114 [Canis lupus
familiaris]
Length = 376
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDVKTVEGEFRGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVTSAQ--LPSFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPSQGSRTIRTEEAILISLATLQAG 367
Query: 360 ISRALRR 366
+S+A R
Sbjct: 368 LSQAGAR 374
>gi|301758806|ref|XP_002915276.1| PREDICTED: uncharacterized protein C9orf114-like [Ailuropoda
melanoleuca]
Length = 376
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 190/308 (61%), Gaps = 8/308 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVTSAQ--LPSFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPSQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRALRRV 367
+++A R+
Sbjct: 368 LTQAGARL 375
>gi|432884800|ref|XP_004074593.1| PREDICTED: uncharacterized protein C9orf114 homolog [Oryzias
latipes]
Length = 370
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 191/305 (62%), Gaps = 8/305 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGS++DN QS EL T LAGQIARA +F +DE++VFD + +S +
Sbjct: 72 TVSVALPGSVLDNAQSPELRTYLAGQIARACAVFCVDEIIVFDEQGED---VKSVEGDFK 128
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ ++ L R+LQY+E PQYLRK F +H L++ G+L PLD+PHH+R E + +
Sbjct: 129 GVGKKGQA-CIQLARILQYVECPQYLRKCFFPVHKDLQYAGLLNPLDSPHHMRMDEESDY 187
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
REGV L G+LV+ G+ K V +D+ L G+RVTV + +N D+ + VVPP
Sbjct: 188 REGVVLNRPPKQGKGSLVNCGMRKEVRIDKQLQSGLRVTVKLNKMQNQDSKLYKGVVVPP 247
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ E+ +YWGY VR A +S+VF +KGGYD +GTSE G+ + + TL F+HL
Sbjct: 248 HVPRTEAALYWGYTVRLASCLSAVFTEGPHKGGYDLTVGTSEKGNTLEQT--TLSPFKHL 305
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE S++ D L + +F Y NTCP QGSRTIRTEEAI IS+ Q
Sbjct: 306 LVVFGGLHGLEASVDADQNLNVTDPSLLFDLYLNTCPGQGSRTIRTEEAILISMAALQPK 365
Query: 360 ISRAL 364
I+ A
Sbjct: 366 IAAAF 370
>gi|297480783|ref|XP_002691684.1| PREDICTED: uncharacterized protein C9orf114 [Bos taurus]
gi|296482120|tpg|DAA24235.1| TPA: CG12128-like [Bos taurus]
Length = 376
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 188/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + N+ ++ + R +VV
Sbjct: 190 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQNQLPESKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP+QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+++A
Sbjct: 368 LTQA 371
>gi|194671747|ref|XP_001789343.1| PREDICTED: uncharacterized protein C9orf114 [Bos taurus]
Length = 397
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 95 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 154
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 155 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 210
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + N+ ++ + R +VV
Sbjct: 211 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQNQLPESKTYRGKVVSS 270
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 271 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 328
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP+QGSRTIRTEEAI ISL Q
Sbjct: 329 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 388
Query: 360 ISRALRR 366
+++A R
Sbjct: 389 LTQAGAR 395
>gi|405974466|gb|EKC39109.1| hypothetical protein CGI_10020686 [Crassostrea gigas]
Length = 371
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 203/315 (64%), Gaps = 8/315 (2%)
Query: 43 KKKKRKRDQLNDDAPIE--VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEV 100
K++K ++++ +++P + + TVS+A+PGSI+DN QS EL T LAGQIARA IF IDE+
Sbjct: 48 KRQKEGQEEITEESPKQGRLYTVSIALPGSILDNAQSPELRTYLAGQIARAAVIFNIDEI 107
Query: 101 VVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF 160
++FD S D+ + N + R+LQYLE PQYLRK+ F H L++
Sbjct: 108 IIFDEIFSKDSTIEGEFKGVGKKGQAN----VQMARILQYLECPQYLRKSFFPHHKDLQY 163
Query: 161 VGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKH-VVVDQVLDPGVRV 219
G+L PLD+PHH+R + +P+REGV + + G+LV+ G+ K V +D++L PG+RV
Sbjct: 164 AGILNPLDSPHHMRGEDESPYREGVVTNKPVKSGRGSLVNCGIGKQDVQIDKLLTPGIRV 223
Query: 220 TVAMGTNRNLDADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIG 278
TV + + +VV P+ P+E G+YWGY +R A ++ +VF C YK GYD IG
Sbjct: 224 TVKINQYDSSRRILKGRVVSPAAPREEGGLYWGYSIRLAKSLGAVFTECPYKEGYDLAIG 283
Query: 279 TSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQ 338
TSE GD V+S ++ FRH LI FGG+ GLE S+E D+ L ++ +F Y NTCP+Q
Sbjct: 284 TSEKGDNVDSVEMDTSNFRHALIVFGGVQGLEASLESDEALDIEDPSLLFQYYLNTCPNQ 343
Query: 339 GSRTIRTEEAIFISL 353
GSRTIRTEEAI ISL
Sbjct: 344 GSRTIRTEEAILISL 358
>gi|91091892|ref|XP_970427.1| PREDICTED: similar to CG12128 CG12128-PA [Tribolium castaneum]
gi|270001126|gb|EEZ97573.1| hypothetical protein TcasGA2_TC011435 [Tribolium castaneum]
Length = 360
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 141/302 (46%), Positives = 189/302 (62%), Gaps = 7/302 (2%)
Query: 53 NDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNY 112
N AP PTVS+AVPGSI++ QS EL L+GQIARA IF++DE+VVFD+
Sbjct: 50 NKKAP---PTVSIAVPGSILETAQSPELRAYLSGQIARAACIFQVDEIVVFDDYGDESTA 106
Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH 172
++ + ES L R+LQY+E PQYLRK F +H+ L+F G+L PL+APHH
Sbjct: 107 KKAELEDNYGLKTMRESCVQ-LGRILQYMECPQYLRKHFFPIHNHLKFCGILNPLNAPHH 165
Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
L K E FREGV L + G++V+VGL K V VD++L PG+R TV + +
Sbjct: 166 LTKDEEFEFREGVVLNKPVKPGRGSIVNVGLLKEVHVDKLLTPGIRCTVKLSAQQESSKK 225
Query: 233 SPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
VV PS PK E+G+YWGY +R A ++S VF YK GYD IGTS+ G V+ +
Sbjct: 226 LKGTVVAPSTPKSETGVYWGYTIRLANSLSKVFSQSPYKDGYDLTIGTSDKGTSVD--EF 283
Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
PTF+H+L+ FGG+ GLE ++E D L + + +F Y NT P+QGS+TIRTEEAI +
Sbjct: 284 KCPTFKHVLVVFGGVQGLELALESDSVLNVDDPKFLFDHYLNTLPNQGSKTIRTEEAILV 343
Query: 352 SL 353
+L
Sbjct: 344 TL 345
>gi|297685497|ref|XP_002820324.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C9orf114
homolog [Pongo abelii]
Length = 473
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 143/304 (47%), Positives = 191/304 (62%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD + +++ N
Sbjct: 171 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQD---AKTVEGNSQ 227
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ ++ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 228 GVGKKGQA-CVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 286
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + D + R +VV
Sbjct: 287 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPDCKTYRGKVVSS 346
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 347 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 404
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 405 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 464
Query: 360 ISRA 363
++RA
Sbjct: 465 LTRA 468
>gi|225706296|gb|ACO08994.1| C9orf114 [Osmerus mordax]
Length = 378
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 142/305 (46%), Positives = 190/305 (62%), Gaps = 8/305 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TV+VA+PGS++DN QS EL T LAGQIARA +F IDE+VVFD RS +
Sbjct: 76 TVTVALPGSVMDNAQSTELRTYLAGQIARACGVFCIDEIVVFDEHGED---VRSVEGEFT 132
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ ++ L R+LQ+LE PQYLRK+ F H L++ G+L PLD+PHH+R E + +
Sbjct: 133 GVKKKGQA-CIQLARILQFLECPQYLRKSFFPKHQDLQYAGLLNPLDSPHHMRIDEESEY 191
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
REGV L + G+ V+ G+ K V +D+ L PG+RVTV + N+N ++ + VV P
Sbjct: 192 REGVVLDRPSRPGKGSFVNCGMWKEVQIDKQLQPGLRVTVHLNKNQNNESRIYKGVVVAP 251
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ + G+YWGY VR A +S+VF C YK GYD IGTSE G N ++ F+HL
Sbjct: 252 HVPRSQGGLYWGYSVRLASGLSTVFTECPYKEGYDLTIGTSEKG--CNIDQASIKPFKHL 309
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE S++ D L + +F Y NTCP QGSRTIRTEEAI IS+ ++
Sbjct: 310 LVVFGGLQGLEASVDADQNLDVTDPSLLFDLYLNTCPSQGSRTIRTEEAILISMAGLRQK 369
Query: 360 ISRAL 364
IS A
Sbjct: 370 ISIAF 374
>gi|431898882|gb|ELK07252.1| hypothetical protein PAL_GLEAN10012500 [Pteropus alecto]
Length = 412
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 189/304 (62%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +
Sbjct: 110 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGEFKGIG 169
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 170 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 225
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R +VV
Sbjct: 226 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 285
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 286 QDPRTKAGLYWGYTVRLASCLSAVFAETPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 343
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP+QGSRTIRTEEAI ISL Q
Sbjct: 344 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 403
Query: 360 ISRA 363
+++A
Sbjct: 404 LTQA 407
>gi|432095359|gb|ELK26558.1| hypothetical protein MDA_GLEAN10019279, partial [Myotis davidii]
Length = 372
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 187/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD D +
Sbjct: 62 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDE----DGQEAKTVEGEF 117
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 118 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 177
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R +VV
Sbjct: 178 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLSQKQLPESKTYRGKVVSS 237
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 238 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 295
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 296 LVVFGGLQGLEAGVNADPNLEVTEPSVLFDLYVNTCPSQGSRTIRTEEAILISLAALQPG 355
Query: 360 ISRA 363
+++A
Sbjct: 356 LTKA 359
>gi|354505303|ref|XP_003514710.1| PREDICTED: uncharacterized protein C9orf114 homolog, partial
[Cricetulus griseus]
Length = 303
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 187/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +
Sbjct: 4 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTKTVEGEFKGVG 63
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 64 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 119
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+LV+ GL K V +D+ L+PG+RVTV + + + + + VV
Sbjct: 120 REGIVVDRPTKAGHGSLVNCGLKKEVKIDKKLEPGLRVTVRLNQKQLPECKTYKGTVVSS 179
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 180 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSNVASAQ--LPSFRHA 237
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 238 LVVFGGLQGLEAGVDADPNLEVSEPSVLFDFYVNTCPSQGSRTIRTEEAILISLAALQPG 297
Query: 360 ISRA 363
+++A
Sbjct: 298 LTQA 301
>gi|332024865|gb|EGI65053.1| Uncharacterized protein C9orf114-like protein [Acromyrmex
echinatior]
Length = 370
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 137/296 (46%), Positives = 185/296 (62%), Gaps = 3/296 (1%)
Query: 59 EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
++ T+S+AVPGSI+DN QS EL T LAGQIARA I+++DE+VVFD+K
Sbjct: 62 DICTISIAVPGSILDNAQSPELRTYLAGQIARAACIYKVDEIVVFDDKGEITE-DEKRKV 120
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
+ + E G L R+LQYLE PQYLRK F +H L++ G+L PLDAPHHLR+
Sbjct: 121 KKDDMLGEARLGCLQLARILQYLECPQYLRKYFFPIHKDLQYAGVLNPLDAPHHLRQENE 180
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
+REG+ + G+ V+VGL V VD+VL G+RVTV + ++ +V
Sbjct: 181 FLYREGIVTNKPIKAGRGSQVNVGLLNEVHVDKVLMTGLRVTVKIPPDQPNRKKMKGIIV 240
Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
PP+ P+ E+G+YWGY V+ A N++ V NC YK GYD +IGTS+ G ++ +
Sbjct: 241 PPNMPRAETGIYWGYNVKLAKNLTEVLTNCQYKEGYDLMIGTSDKGTSIDEIKAKSLEYH 300
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
H LI FGGLAGLE +I+ D L ++ +F Y NTCP QGSRTIRTEEAI ++L
Sbjct: 301 HCLIVFGGLAGLEAAIDVDPHL-NDDSSVIFDKYINTCPQQGSRTIRTEEAILLTL 355
>gi|149039118|gb|EDL93338.1| rCG45649, isoform CRA_a [Rattus norvegicus]
Length = 377
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 186/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD + S
Sbjct: 75 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDTK----SVEGEF 130
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 131 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+LV+ G+ K V +D+ LDPG+RVTV + + + + + VV
Sbjct: 191 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 250
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 251 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSSVASTQ--LPNFRHA 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ + +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVADPSVLFDFYVNTCPSQGSRTIRTEEAILISLAALQPG 368
Query: 360 ISRA 363
+++
Sbjct: 369 LTQG 372
>gi|338720320|ref|XP_001499881.3| PREDICTED: uncharacterized protein C9orf114-like [Equus caballus]
Length = 376
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 188/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R +VV
Sbjct: 190 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQRQLPESKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP+QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+++A
Sbjct: 368 LTQA 371
>gi|307205236|gb|EFN83616.1| Uncharacterized protein C9orf114-like protein [Harpegnathos
saltator]
Length = 331
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 191/308 (62%), Gaps = 13/308 (4%)
Query: 54 DDAPIEVP---TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD 110
+DA E P TVS+AVPGSI+DN QS EL T LAGQIARA I+ +DE++VFD+K
Sbjct: 15 EDAKYERPDICTVSIAVPGSILDNAQSPELRTYLAGQIARAACIYNVDEIIVFDDKGDV- 73
Query: 111 NYSRSSAANRSNRSD----ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP 166
+ R R D E G L R+LQY+E PQYLRK F +H L++ G+L P
Sbjct: 74 ----TEDEKRKIRKDDVLGEARIGCLQLARILQYMECPQYLRKYFFPIHKDLQYAGVLNP 129
Query: 167 LDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTN 226
L+APHHL++ + + +REG+ + G+ V+VGL V VD+VL PG+RVTV + N
Sbjct: 130 LNAPHHLKQDDISLYREGIVTNKPIKVGKGSQVNVGLVNDVHVDKVLIPGLRVTVKIPPN 189
Query: 227 RNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI 285
+ VVPP+ P+ E+G YWGY V+ A N++ V NC K GYD +IGTS+ G +
Sbjct: 190 QPNSKKLNGIVVPPNVPRAETGTYWGYTVKLAKNLTEVITNCQCKDGYDLIIGTSDKGTL 249
Query: 286 VNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
++ + + H LI FGGLAGLE ++ D L + +FH Y NTCP QGS+TIRT
Sbjct: 250 IDKIEAKSLEYHHCLIVFGGLAGLEAIVDVDPYLDIDDPSLMFHKYLNTCPQQGSQTIRT 309
Query: 346 EEAIFISL 353
EEA+ I+L
Sbjct: 310 EEAVLITL 317
>gi|357611423|gb|EHJ67480.1| hypothetical protein KGM_03544 [Danaus plexippus]
Length = 405
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 139/300 (46%), Positives = 186/300 (62%), Gaps = 2/300 (0%)
Query: 59 EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
E+ TVS+AVPGSI++N QS EL T LAGQIARA +F +DE++VFD+ + +S
Sbjct: 64 EISTVSIAVPGSILENAQSAELRTYLAGQIARAACVFCVDEIIVFDDIGDKVDTKKSKLE 123
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
+ S +S L R+LQYLE PQYLRK F +H L F G+L PLDAPHHLR
Sbjct: 124 DNSGTKIARKSCVQ-LARILQYLECPQYLRKHFFPLHKDLEFAGLLNPLDAPHHLRMSND 182
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
FREG+T+ +N G+ V+VGL + V D++L+PG+RVTV + ++V
Sbjct: 183 FQFREGITMNKNVKPGKGSQVNVGLLQDVSTDKLLNPGIRVTVKLLPLTEGKKKLKGKIV 242
Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
S P+ E+G+YWGY VR A N+S VF C YK GYD IGTS+ G ++ +
Sbjct: 243 SLSTPRAETGVYWGYTVRIANNLSQVFTQCPYKDGYDLTIGTSDKGTPIDDLPNKEVRYN 302
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
H LI FGGL G+E ++E D+ L+ A +F+ Y N P+QGSRTIRTEEAI I++ Q
Sbjct: 303 HALIVFGGLHGIEAALESDEQLQVDEASLLFNHYVNVLPNQGSRTIRTEEAILIAMSCLQ 362
>gi|351697028|gb|EHA99946.1| hypothetical protein GW7_01906 [Heterocephalus glaber]
Length = 377
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 143/304 (47%), Positives = 185/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD + S
Sbjct: 75 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQEAK----SVEGEF 130
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E F
Sbjct: 131 RGVGKKGQACVQLARVLQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQEEECEF 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R V S
Sbjct: 191 REGVVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLKQQQLPESKTYRGTVVSS 250
Query: 242 K-PK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
K P+ +G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 251 KEPRTRAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPGFRHA 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVSEPSVLFDLYINTCPGQGSRTIRTEEAILISLAALQPS 368
Query: 360 ISRA 363
+++A
Sbjct: 369 LTQA 372
>gi|348524370|ref|XP_003449696.1| PREDICTED: uncharacterized protein C9orf114 homolog [Oreochromis
niloticus]
Length = 379
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 140/302 (46%), Positives = 182/302 (60%), Gaps = 8/302 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+PGSI+DN QS EL T LAGQIARA +F +DE+VVFD + +
Sbjct: 72 TVSLALPGSILDNAQSPELRTYLAGQIARACVVFSVDEIVVFDERGEDVKSVEGEFSGVG 131
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
R G L R+LQYLE PQYLRKA F H L++ G+L PLD+PHH+R E + +
Sbjct: 132 KRG----HGCIQLARILQYLECPQYLRKAFFPKHQDLQYAGLLNPLDSPHHMRIDEESDY 187
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
REGV + + G+ VD G+ K + +D+ L PG+RVTV + +N + + VV P
Sbjct: 188 REGVVVDRPTKQNKGSFVDCGMRKEIQIDKQLQPGLRVTVRLSKTQNPEGKVYKGVVVAP 247
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ E G YWGY VR A +S+VF +K GYD IGTSE G N+ L F+HL
Sbjct: 248 HVPRTEGGYYWGYSVRLASCLSAVFTESPFKEGYDLTIGTSERGS--NADQALLTPFKHL 305
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE S++ D L + +F Y NTCP QGSRTIRTEEAI IS+ +
Sbjct: 306 LVVFGGLQGLEASLDADQKLDVTDPSVLFDLYLNTCPGQGSRTIRTEEAILISMAVLRPK 365
Query: 360 IS 361
I+
Sbjct: 366 IT 367
>gi|170042367|ref|XP_001848900.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167865860|gb|EDS29243.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 433
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 187/305 (61%), Gaps = 3/305 (0%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-RSSAA 118
+ T+S+AVPGSI+DN QS EL LAGQIARA IF++DEVVVFD+ S + + +
Sbjct: 68 ISTLSIAVPGSILDNAQSPELRACLAGQIARAACIFQVDEVVVFDDCGSGNQVTDKVGTI 127
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
+ + + L R+LQYLE PQYLRK F +H+ L+F G+L PLD+ HHLR+
Sbjct: 128 DTAEGTTSARRCCVQLARILQYLECPQYLRKFFFPIHNDLKFCGLLAPLDSQHHLRQASE 187
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-V 237
FREG+ + N + V+VGL V+VD LD GVRVTV M +L + R V
Sbjct: 188 FEFREGIVTNKPTKNDKSSFVNVGLLNDVLVDTKLDVGVRVTVKMPAGVDLKSKKIRAIV 247
Query: 238 VPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
VPPS+P++ +G+YWGY VR A ++S VF YKGGYD IGTS+ G V + +
Sbjct: 248 VPPSQPRQDTGIYWGYTVRIANSLSQVFTKSPYKGGYDLTIGTSDTGTNVQELEPKSLKY 307
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
+H L+ FGG+ GLE +++ D L +F Y NT P QGSRT+RTEEAI ISL
Sbjct: 308 KHALVVFGGVLGLEPALDNDAKLTVDKVDMLFDEYLNTVPRQGSRTVRTEEAILISLAAL 367
Query: 357 QEPIS 361
E ++
Sbjct: 368 GEKLA 372
>gi|305682577|ref|NP_766248.3| uncharacterized protein C9orf114 homolog [Mus musculus]
gi|109861106|sp|Q3UHX9.1|CI114_MOUSE RecName: Full=Uncharacterized protein C9orf114 homolog
gi|74151214|dbj|BAE27727.1| unnamed protein product [Mus musculus]
gi|148676498|gb|EDL08445.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
CRA_d [Mus musculus]
Length = 385
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD + S
Sbjct: 75 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 130
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 131 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+LV+ G+ K V +D+ LDPG+RVTV + + + + + VV
Sbjct: 191 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 250
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 251 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE +++ D L+ + +F Y NTC QGSRTIRTEEAI ISL Q
Sbjct: 309 LVVFGGLQGLEAAVDADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPG 368
Query: 360 ISRALRR 366
+++ R
Sbjct: 369 LTQVGSR 375
>gi|148676495|gb|EDL08442.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
CRA_b [Mus musculus]
gi|148676497|gb|EDL08444.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
CRA_b [Mus musculus]
Length = 379
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD + S
Sbjct: 69 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 124
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 125 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 184
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+LV+ G+ K V +D+ LDPG+RVTV + + + + + VV
Sbjct: 185 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 244
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 245 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 302
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE +++ D L+ + +F Y NTC QGSRTIRTEEAI ISL Q
Sbjct: 303 LVVFGGLQGLEAAVDADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPG 362
Query: 360 ISRALRR 366
+++ R
Sbjct: 363 LTQVGSR 369
>gi|417410219|gb|JAA51586.1| Hypothetical protein, partial [Desmodus rotundus]
Length = 379
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 186/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF ++E+VVFD +
Sbjct: 77 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVEEIVVFDEEGQDAKTVEGEFRGVG 136
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 137 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQEEESEF 192
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + + + R +VV
Sbjct: 193 REGVVVDRPTRPGRGSFVNCGMKKEVQIDKNLEPGLRVTVRLNQEQLPECKTYRGKVVSS 252
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 253 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 310
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 311 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYLNTCPGQGSRTIRTEEAILISLAALQPS 370
Query: 360 ISRA 363
+++A
Sbjct: 371 LTQA 374
>gi|296190958|ref|XP_002743410.1| PREDICTED: uncharacterized protein C9orf114 [Callithrix jacchus]
gi|166183804|gb|ABY84165.1| hypothetical protein [Callithrix jacchus]
Length = 376
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 185/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGQFIGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + + + R +VV
Sbjct: 190 REGVVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPECKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYINTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+ +A
Sbjct: 368 LIQA 371
>gi|148676496|gb|EDL08443.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
CRA_c [Mus musculus]
Length = 323
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD + S
Sbjct: 13 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 68
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 69 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 128
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+LV+ G+ K V +D+ LDPG+RVTV + + + + + VV
Sbjct: 129 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 188
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 189 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 246
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE +++ D L+ + +F Y NTC QGSRTIRTEEAI ISL Q
Sbjct: 247 LVVFGGLQGLEAAVDADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPG 306
Query: 360 ISRALRR 366
+++ R
Sbjct: 307 LTQVGSR 313
>gi|311246579|ref|XP_003122255.1| PREDICTED: uncharacterized protein C9orf114-like [Sus scrofa]
Length = 377
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 187/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 75 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 134
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 135 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEDSEF 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R +VV
Sbjct: 191 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVWLNQKQLPESKTYRGKVVSS 250
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 251 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE +++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 309 LVVFGGLQGLEAAVDADPNLEVAEPSVLFDLYVNTCPSQGSRTIRTEEAILISLAALQPG 368
Query: 360 ISRA 363
+++A
Sbjct: 369 LTQA 372
>gi|449477941|ref|XP_002194797.2| PREDICTED: uncharacterized protein C9orf114 homolog [Taeniopygia
guttata]
Length = 379
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 187/306 (61%), Gaps = 8/306 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI++N QS L T LAGQIARA TIF +DE+VVFD
Sbjct: 75 TLSVALPGSILNNAQSPALRTYLAGQIARACTIFCVDEIVVFDEHGEDVKTVEGDFEGIG 134
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRK+ F H L+F G+L PLD+PHH+R E + +
Sbjct: 135 KKG----KACVQLARILQYLECPQYLRKSFFPKHEDLQFAGLLNPLDSPHHMRMDEDSEY 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L A G+ V+ G+ K V +D+ L+PG+RVTV + +N +A + VV
Sbjct: 191 REGVVLDRPAKPGRGSFVNCGMRKEVQIDKQLNPGLRVTVRLNEPQNPEAKVRKGTVVSS 250
Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ SG+YWGY VR A +S+VF C +K GYD IGTSE G V+ +TLP+FRH
Sbjct: 251 HHPRTVSGLYWGYSVRLASCLSAVFSECPFKEGYDLSIGTSERGSSVDQ--VTLPSFRHA 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ + +F Y NTCP QGSRTIRTEEA+ ISL +
Sbjct: 309 LVVFGGLEGLEAGVDVDPNLEVTDPSVLFDFYLNTCPSQGSRTIRTEEALLISLSALRPH 368
Query: 360 ISRALR 365
+ A++
Sbjct: 369 MEEAVK 374
>gi|332230194|ref|XP_003264272.1| PREDICTED: uncharacterized protein C9orf114 homolog [Nomascus
leucogenys]
Length = 376
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 185/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACIQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ LDPG+RVTV + + + + R +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLDPGLRVTVRLNQQQLPECKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+++A
Sbjct: 368 LTQA 371
>gi|395506105|ref|XP_003757376.1| PREDICTED: uncharacterized protein C9orf114 homolog [Sarcophilus
harrisii]
Length = 374
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 188/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI++N QSLEL T LAGQIARA IF +DE+VVFD +
Sbjct: 71 TLSVALPGSILNNAQSLELRTYLAGQIARACAIFCVDEIVVFDEEGEDVKSVEGEFVGIG 130
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E +
Sbjct: 131 KKGQ----ASVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQEEVLEY 186
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ L G+ V+ G+ K V +D+ L+PG+RVTV + + ++ S + +VV
Sbjct: 187 REGIVLHRPTRPGRGSFVNCGMKKEVQIDKNLEPGLRVTVRLDQQQIPESKSLKGRVVSS 246
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
+P+ +G+YWGY VR A +S+VF +K GYD IGTSE G+ + + + LP F H
Sbjct: 247 QEPRTHAGLYWGYSVRLASCLSAVFAEGPFKDGYDLTIGTSERGE--DLASVQLPNFNHA 304
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
++ FGGL GLE S+E D L+ + +F Y NTCP QGSRTIRTEEAI ISL + P
Sbjct: 305 IVVFGGLQGLEASVEADPNLEVTDPSVLFDFYLNTCPGQGSRTIRTEEAILISLATLRPP 364
Query: 360 ISRA 363
I A
Sbjct: 365 IIAA 368
>gi|410903718|ref|XP_003965340.1| PREDICTED: uncharacterized protein C9orf114 homolog [Takifugu
rubripes]
Length = 381
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 183/305 (60%), Gaps = 8/305 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVAVPGS++DN QS EL T LAGQIARA +F +DE++VFD D S
Sbjct: 72 TVSVAVPGSVLDNAQSPELRTYLAGQIARACVVFCVDEIIVFDE----DGEELKSVEGEF 127
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + L R+LQYLE PQYLRK F H L++ G+L PLD+PHH+R + + +
Sbjct: 128 KGVGKKGNACIQLARILQYLECPQYLRKWFFPKHHDLQYAGLLNPLDSPHHMRTDDESEY 187
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
REG+ L G+LV+ G+ K V +D+ L G+RVTV + + ++ + VV P
Sbjct: 188 REGIVLDRPTKQGQGSLVNCGMRKDVRIDKQLQSGLRVTVQLSQTHSSESKHCKGVVVAP 247
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ E G+YWGY VR A +S+VF +K GYD IGTSE G+ N T+P F+HL
Sbjct: 248 HVPRTEGGLYWGYSVRLASCLSAVFTESQFKEGYDVTIGTSERGN--NLDKATVPPFKHL 305
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE S++ D L + +F Y NTCP QGSRTIRTEEAI +S+ +
Sbjct: 306 LVVFGGLQGLEASVDADQNLDVTDPSVLFDLYLNTCPSQGSRTIRTEEAILVSMSGLRPK 365
Query: 360 ISRAL 364
I A
Sbjct: 366 IEAAF 370
>gi|403298554|ref|XP_003940082.1| PREDICTED: uncharacterized protein C9orf114 homolog [Saimiri
boliviensis boliviensis]
Length = 376
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 185/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSIALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGQFTGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + + + R +VV
Sbjct: 190 REGVVVDRPTRPGHGSFVNCGMKKEVKIDRNLEPGLRVTVRLNQQQLPECKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+++A
Sbjct: 368 LTQA 371
>gi|395844433|ref|XP_003794966.1| PREDICTED: uncharacterized protein C9orf114 homolog [Otolemur
garnettii]
Length = 376
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 139/304 (45%), Positives = 186/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + + + R +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQKQLPECKTYRGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ +P+FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--IPSFRHG 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+++A
Sbjct: 368 LTQA 371
>gi|198436996|ref|XP_002131245.1| PREDICTED: similar to Uncharacterized protein C9orf114 [Ciona
intestinalis]
Length = 361
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 141/298 (47%), Positives = 188/298 (63%), Gaps = 14/298 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS--DNYSRSSAAN 119
TVSVA+PGSI+DN QS EL T LAGQIARA +F++DE++VFD +SS D+ ++
Sbjct: 60 TVSVALPGSILDNAQSPELRTYLAGQIARACVVFQVDEIIVFDESASSGKDSSTQGDFTG 119
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
+ + N L R+LQYLE PQYLRK+ F H L++ G+L PLD PHH+R+ + A
Sbjct: 120 VGKKGNCNIQ----LARILQYLECPQYLRKSFFPKHKDLQYAGVLNPLDCPHHMREQDNA 175
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
FREGV + G+ VD G K V +D+VL P +RVTV + + +VVP
Sbjct: 176 EFREGVVVDRPIKPGKGSFVDCGTLKTVQIDKVLQPNIRVTVKIKEQKKKYILG--EVVP 233
Query: 240 PSKPK-ESGMYWGYKVRYAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLPTF 296
P P+ + G+YWGY VR A ++SS +C Y G YD IGTS+ G+ V +D +PTF
Sbjct: 234 PHYPRTKGGLYWGYNVRLANSLSSAIVDCPYSDTGKYDITIGTSDKGEPV--ADFKMPTF 291
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRK-NAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+HLLI FGGL GLE +E D+ L +F Y NTCP+QGSRTIRTEEAI +++
Sbjct: 292 KHLLIVFGGLQGLEAGLEADEALGGDIELSLLFDKYLNTCPNQGSRTIRTEEAILVTM 349
>gi|348569715|ref|XP_003470643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
C9orf114-like [Cavia porcellus]
Length = 377
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 185/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DEVVVFD + S
Sbjct: 75 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEVVVFDEEGQEAK----SVEGEF 130
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 131 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQDEESEF 190
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + + + + R VV
Sbjct: 191 REGVVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPECKTYRGTVVSS 250
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
+P+ +G+YWGY R A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 251 QEPRTRAGLYWGYTXRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 308
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVSEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 368
Query: 360 ISRA 363
+++A
Sbjct: 369 LTQA 372
>gi|46255718|gb|AAH10579.1| Chromosome 9 open reading frame 114 [Homo sapiens]
gi|55777177|gb|AAH46133.1| Chromosome 9 open reading frame 114 [Homo sapiens]
Length = 376
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 184/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + ++ D + +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+++A
Sbjct: 368 LTQA 371
>gi|26337855|dbj|BAC32613.1| unnamed protein product [Mus musculus]
Length = 345
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 188/307 (61%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD + S
Sbjct: 35 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 90
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 91 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 150
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+LV+ G+ K V +D+ LDPG+RVTV + + + + + VV
Sbjct: 151 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 210
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++ +YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 211 QDPRTKACLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 268
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE +++ D L+ + +F Y NTC QGSRTIRTEEAI ISL Q
Sbjct: 269 LVVFGGLQGLEAAVDADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPG 328
Query: 360 ISRALRR 366
+++ R
Sbjct: 329 LTQVGSR 335
>gi|196000250|ref|XP_002109993.1| hypothetical protein TRIADDRAFT_53475 [Trichoplax adhaerens]
gi|190588117|gb|EDV28159.1| hypothetical protein TRIADDRAFT_53475 [Trichoplax adhaerens]
Length = 373
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 140/296 (47%), Positives = 196/296 (66%), Gaps = 12/296 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD--NKSSSDNYSRSSAAN 119
TVSVAVPGSI++N QS EL T LAGQIARA+ IF +DEVV+FD K+ +++ +
Sbjct: 69 TVSVAVPGSILNNAQSPELRTYLAGQIARALVIFEVDEVVIFDESGKTVNEDVGEFTGVT 128
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
R + D N+ L +LQYLE PQYLRK F H LR+ G+L PL++ HH+ E
Sbjct: 129 RKDY-DPNK----VLANILQYLECPQYLRKFFFPKHPDLRYAGLLNPLNSTHHMLIDEDV 183
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVV 238
P+REGVTL + G+LV+ G+ K V +D+ L G+RVTV + + ++ D+ + + VV
Sbjct: 184 PYREGVTL-DRPVGKKGSLVNCGMRKEVRIDRQLQSGLRVTVKLDSEQSSDSKTYKGTVV 242
Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
P+ ++G+YWGY+VR A ++ + C Y+GGYD IGTS+ GD + +L L +F+
Sbjct: 243 SADAPRMDAGIYWGYRVRVARSLGAAITECPYEGGYDLSIGTSDGGD--SMEELNLSSFK 300
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
H+LIAFGGL+GLE ++E D+ L+ + R +F Y NTCP QGSR IRTEEA+ ISL
Sbjct: 301 HILIAFGGLSGLEAALESDESLRLSDIRLLFDYYLNTCPGQGSRIIRTEEAMLISL 356
>gi|119608245|gb|EAW87839.1| chromosome 9 open reading frame 114, isoform CRA_b [Homo sapiens]
Length = 437
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 141/307 (45%), Positives = 185/307 (60%), Gaps = 8/307 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 135 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 194
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 195 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 250
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + ++ D + +VV
Sbjct: 251 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 310
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 311 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 368
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 369 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 428
Query: 360 ISRALRR 366
+++A R
Sbjct: 429 LTQAGAR 435
>gi|402896354|ref|XP_003911267.1| PREDICTED: uncharacterized protein C9orf114 homolog [Papio anubis]
Length = 514
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 186/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 212 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 271
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 272 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 327
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + + + R +VV
Sbjct: 328 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPECKTYRGKVVSS 387
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 388 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAH--LPSFRHA 445
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 446 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 505
Query: 360 ISRA 363
+++A
Sbjct: 506 LTQA 509
>gi|390337040|ref|XP_792707.2| PREDICTED: uncharacterized protein C9orf114-like
[Strongylocentrotus purpuratus]
Length = 537
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 191/312 (61%), Gaps = 22/312 (7%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSI+DN QS EL T LAGQIARA+ IF IDE+++FD S AA R
Sbjct: 233 TLSIALPGSILDNAQSPELRTYLAGQIARALVIFNIDEIIIFD---------ESGAAARE 283
Query: 122 NRSDENESGAA-------FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
N +D SG L R+LQYLE PQYLRK+ F H+ L++ G+L PLD PHH+R
Sbjct: 284 N-TDGVYSGIGKKGNANVQLARILQYLECPQYLRKSFFPKHNDLQYAGLLNPLDCPHHVR 342
Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD-S 233
E +REGV + G+ V+VGL K V +D+ L +RVTV M T + + +
Sbjct: 343 ADEEIAYREGVVVDRPVAKGRGSHVNVGLYKEVEIDKKLQAQLRVTVKMNTKKGISPKHA 402
Query: 234 PRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYK-GGYDHLIGTSEHGDIVNSSDL 291
P +VV PS P+ E+G+YWGY VR A ++ +VF C + GYD IGTSE G + D+
Sbjct: 403 PCKVVSPSAPRLEAGLYWGYSVRLASSLGAVFTGCPFSDNGYDLTIGTSERG--TKADDV 460
Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
LP F+H LI FGG+ GLE S+E D+ L + +F Y N CP QGSRTIRTEEA+ I
Sbjct: 461 DLPHFKHALIIFGGVHGLELSLETDEELLVSDPAVLFDHYINVCPDQGSRTIRTEEAVLI 520
Query: 352 SLQYFQEPISRA 363
++ + I A
Sbjct: 521 TMATLRPKILSA 532
>gi|397503802|ref|XP_003822507.1| PREDICTED: uncharacterized protein C9orf114 homolog [Pan paniscus]
Length = 438
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 136 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 195
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 196 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 251
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + D + +VV
Sbjct: 252 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPDCKTYHGKVVSS 311
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 312 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 369
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 370 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 429
Query: 360 ISRA 363
+++A
Sbjct: 430 LTQA 433
>gi|383850844|ref|XP_003700984.1| PREDICTED: uncharacterized protein C9orf114 homolog [Megachile
rotundata]
Length = 376
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 190/308 (61%), Gaps = 10/308 (3%)
Query: 59 EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
+V T+S+AVPGSI+DN QS EL T LAGQIARA +++IDE+VVFD+K +
Sbjct: 57 DVSTISIAVPGSILDNAQSPELRTYLAGQIARAACVYQIDEIVVFDDKGEITEDEKKKVR 116
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
N E G L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR+ +
Sbjct: 117 N-DEVLGERRIGCLQLARILQYLECPQYLRKYFFPLHKDLQYAGVLNPLDTPHHLRQSDV 175
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV- 237
+ +REGV + G+ V+VGL +V VD+VL G+RVTV + D +P+++
Sbjct: 176 SLYREGVITNKPVKGDKGSYVNVGLLNNVHVDKVLTAGLRVTVKIPP----DQPNPKKIK 231
Query: 238 ---VPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
VPP+ P+ ESG+YWGY VR A N++ Y+ GYD IGTS+ G V+ ++
Sbjct: 232 GFIVPPNIPRAESGIYWGYNVRLANNLTEALTQGPYEDGYDLTIGTSDKGKSVDDIEVKS 291
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ H LI FGGL+GLE +++ D L + +F Y NTCP QGSRTIRTEEAI I+L
Sbjct: 292 IKYHHALIVFGGLSGLEAALDGDPSLDVDDPSLIFDKYLNTCPQQGSRTIRTEEAILITL 351
Query: 354 QYFQEPIS 361
+ +S
Sbjct: 352 AELRTKLS 359
>gi|38679912|ref|NP_057474.2| uncharacterized protein C9orf114 [Homo sapiens]
gi|126302532|sp|Q5T280.3|CI114_HUMAN RecName: Full=Uncharacterized protein C9orf114
Length = 376
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + ++ D + +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+ +A
Sbjct: 368 LIQA 371
>gi|332832960|ref|XP_520300.3| PREDICTED: uncharacterized protein C9orf114 homolog [Pan
troglodytes]
gi|410221478|gb|JAA07958.1| chromosome 9 open reading frame 114 [Pan troglodytes]
gi|410221480|gb|JAA07959.1| chromosome 9 open reading frame 114 [Pan troglodytes]
gi|410251074|gb|JAA13504.1| chromosome 9 open reading frame 114 [Pan troglodytes]
gi|410251076|gb|JAA13505.1| chromosome 9 open reading frame 114 [Pan troglodytes]
gi|410297534|gb|JAA27367.1| chromosome 9 open reading frame 114 [Pan troglodytes]
gi|410297536|gb|JAA27368.1| chromosome 9 open reading frame 114 [Pan troglodytes]
gi|410337681|gb|JAA37787.1| chromosome 9 open reading frame 114 [Pan troglodytes]
gi|410337683|gb|JAA37788.1| chromosome 9 open reading frame 114 [Pan troglodytes]
Length = 376
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + D + +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPDCKTYHGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+++A
Sbjct: 368 LTQA 371
>gi|224000970|ref|XP_002290157.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973579|gb|EED91909.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
Length = 328
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 197/323 (60%), Gaps = 36/323 (11%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
PTVS+A+PGS++ N Q+ EL T+LAGQIARA ++R+DE+VVFD+ S S +
Sbjct: 12 PTVSIAIPGSVVSNAQTRELQTQLAGQIARAAAVYRVDEIVVFDDGLGS------SLKTQ 65
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+ AFL R+LQY E PQYLR+ F MH L+F G+LPPLDAPHHLR+ ++A
Sbjct: 66 PPHMQPSTDPHAFLARILQYCECPQYLRRKFFPMHPDLQFAGLLPPLDAPHHLRRGDFAA 125
Query: 181 FREGVTLKENA-------PNSVGTLVDVGL-NKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
FREGV ++ + G+ VD G+ N+ V +D+ + PG+R TV + T L
Sbjct: 126 FREGVVVERKSGDGGKGGDEDGGSFVDCGVPNRLVQIDRDITPGIRCTVKLDTKSYLTKK 185
Query: 233 SPRQ--VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI-VNS 288
+ V P+ P+ E GMYWGY R A +I+ +F+ C Y+GGYD +GTSE GDI ++
Sbjct: 186 GVMEGKAVSPTTPRDEEGMYWGYTTRLASSINGIFEECPYEGGYDLKVGTSERGDISIDD 245
Query: 289 SDLTLPT----------------FRHLLIAFGGLAGLEESIEEDDG--LKRKNAREVFHS 330
+ L F HLLI FGG+AG+EE ++ D+ L ++++++F
Sbjct: 246 INFGLKKKRQYTKSKNKSGGDERFSHLLIVFGGVAGIEECVDADESMTLPGEDSKKMFDV 305
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ N CP+QGSRTIRTEEA+ I+L
Sbjct: 306 WVNICPYQGSRTIRTEEAVLITL 328
>gi|47219853|emb|CAF97123.1| unnamed protein product [Tetraodon nigroviridis]
Length = 394
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 141/306 (46%), Positives = 181/306 (59%), Gaps = 8/306 (2%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
PTVSVAVPGS++DN QS EL + LAGQIARA +F +DE+VVFD D S
Sbjct: 71 PTVSVAVPGSVLDNAQSPELRSYLAGQIARACVVFCVDEIVVFDE----DGEDLKSVEGE 126
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+ + L R+LQYLE PQYLRK F H L++ G+L PLD+PHH+R +
Sbjct: 127 FKGVGKKGNACIQLARILQYLECPQYLRKWFFPKHQDLQYAGLLNPLDSPHHMRTDDECE 186
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPRQVVP 239
+REGV L G+LV+ G+ K V +D+ L G+RVTV +G T+ + VV
Sbjct: 187 YREGVVLDRPTKAGQGSLVNCGMRKDVRIDKQLQAGLRVTVRLGPTHSPENKHCKGVVVA 246
Query: 240 PSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
P P+ E G+YWGY VR A +S+VF +K GYD IGTSE G N +P F+H
Sbjct: 247 PHVPRTEGGLYWGYSVRLASCLSAVFTESQFKDGYDVTIGTSERG--TNLDQTAVPPFKH 304
Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
LL+ FGGL GLE S++ D L + +F Y NTCP QGSRTIRTEEAI +S+ +
Sbjct: 305 LLVVFGGLQGLEASVDADPNLDVTDPSVLFDLYLNTCPGQGSRTIRTEEAILVSMSGLRP 364
Query: 359 PISRAL 364
I A
Sbjct: 365 KIEAAF 370
>gi|327291021|ref|XP_003230220.1| PREDICTED: uncharacterized protein C9orf114-like [Anolis
carolinensis]
Length = 326
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 185/306 (60%), Gaps = 8/306 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVAVPGSI+DN QS EL T LAGQIAR+ +F +DEVVVFD +
Sbjct: 19 TLSVAVPGSILDNAQSPELRTYLAGQIARSCAVFSVDEVVVFDERGEDAKTVEGEFGGLK 78
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
R L R+LQYLE PQYLRK+ F H L+F G+L PLD+PHH+R E + +
Sbjct: 79 ARGQ----ACVQLARILQYLECPQYLRKSFFPKHQDLQFAGLLNPLDSPHHVRIDEVSQY 134
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L G+ V+ G+ K V +D+ L+ G+RVTV + +N ++ + + VV
Sbjct: 135 REGVVLARPVKPGRGSFVNCGMRKEVQIDKQLEAGLRVTVKLNEQQNPESKTQKGTVVSS 194
Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ SG+YWGY VR A +S+VF + GYD +GTSE G V+ + TLP FRH
Sbjct: 195 HHPRTVSGLYWGYTVRLASCLSAVFAESPFTDGYDLSVGTSERGTPVDQA--TLPAFRHA 252
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
LI FGGL GLE +++ D L+ + +F Y NTCP QGSRTIRTEEA+ ISL +
Sbjct: 253 LIVFGGLQGLEAAVDADPHLEVSDPSTLFDLYLNTCPRQGSRTIRTEEAMLISLSALRPR 312
Query: 360 ISRALR 365
I+ A +
Sbjct: 313 IAEAAK 318
>gi|18204644|gb|AAH21273.1| Chromosome 9 open reading frame 114 [Homo sapiens]
gi|39794258|gb|AAH63644.1| Chromosome 9 open reading frame 114 [Homo sapiens]
gi|55249540|gb|AAH39590.1| Chromosome 9 open reading frame 114 [Homo sapiens]
gi|312152042|gb|ADQ32533.1| chromosome 9 open reading frame 114 [synthetic construct]
Length = 376
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + ++ D + +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+ +A
Sbjct: 368 LIQA 371
>gi|7512808|pir||T14751 hypothetical protein DKFZp566D143.1 - human (fragment)
Length = 389
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 87 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 146
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 147 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 202
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + ++ D + +VV
Sbjct: 203 REGVVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 262
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 263 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 320
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NTCP QGSRTIRTEEAI ISL Q
Sbjct: 321 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 380
Query: 360 ISRA 363
+ +A
Sbjct: 381 LIQA 384
>gi|291232700|ref|XP_002736285.1| PREDICTED: CG12128-like [Saccoglossus kowalevskii]
Length = 377
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 184/305 (60%), Gaps = 8/305 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSI+DN QS EL T LAGQIARA IF++DE+++FD S
Sbjct: 77 TISIALPGSILDNAQSPELRTYLAGQIARACVIFKVDEIIIFDEAGFSSESITGVYDGVG 136
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
R + N + R+LQYLE PQYLRK F HS L++ G+L PLD PHH+R + F
Sbjct: 137 KRGNANVQ----MARILQYLECPQYLRKHFFPKHSDLQYAGVLNPLDTPHHMRIDDECEF 192
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD-SPRQVVPP 240
REGV L G+ V+ GL K + +D+ L G+RVTV + + + + +VV P
Sbjct: 193 REGVVLDRPVKGGKGSFVNAGLRKEIRIDRQLRDGLRVTVRLDNKQTDEGEYYTGKVVSP 252
Query: 241 SKP-KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P + G+YWGY VR A ++S VF SYK GYD IGTS+ GD ++ + T F H
Sbjct: 253 RCPCTDRGLYWGYTVRLASSLSKVFTESSYKNGYDLTIGTSDSGDDIDHA--TFSKFNHA 310
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE S++ D+ LK ++F Y NTCP QGSRTIRTEEAI I++ +
Sbjct: 311 LVVFGGLKGLEYSLDCDEKLKIHEVNDLFDMYINTCPKQGSRTIRTEEAILITMAALRPR 370
Query: 360 ISRAL 364
I +AL
Sbjct: 371 IIQAL 375
>gi|334311915|ref|XP_001367954.2| PREDICTED: uncharacterized protein C9orf114-like [Monodelphis
domestica]
Length = 422
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 139/304 (45%), Positives = 186/304 (61%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI++N QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 118 TLSVALPGSILNNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGEDVKSIEGEFVGIG 177
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E +
Sbjct: 178 KKGQ----ASVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQEEVLEY 233
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ L G+ V+ G+ K V +D+ L+PG+RVTV + + ++ S + +VV
Sbjct: 234 REGIVLDRPTRPGRGSFVNCGMKKEVQIDKNLEPGLRVTVRLDQQQIPESKSLKGRVVSS 293
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
+P+ +G+YWGY VR A +S+VF YK GYD IGTSE G+ + S + LP F H
Sbjct: 294 QEPRTHAGLYWGYSVRLASCLSAVFAEGPYKDGYDLTIGTSERGEGLAS--IQLPNFNHA 351
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
++ FGGL GLE +E D L+ + +F Y NTCP QGSRTIRTEEAI ISL +
Sbjct: 352 IVVFGGLQGLEAGVEADPNLEVTDPSVLFDFYLNTCPGQGSRTIRTEEAILISLATLRPH 411
Query: 360 ISRA 363
I+ A
Sbjct: 412 ITGA 415
>gi|449266744|gb|EMC77760.1| Putative protein C9orf114 like protein, partial [Columba livia]
Length = 354
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 140/294 (47%), Positives = 180/294 (61%), Gaps = 8/294 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI++N QS EL T LAGQIARA IF +DE+VVFD
Sbjct: 62 TLSVALPGSILNNAQSPELRTYLAGQIARACAIFCVDEIVVFDEHGEDVKTVEGDFEGIG 121
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
R L R+LQYLE PQYLRK+ F H L+F G+L PLD+PHH+R E + +
Sbjct: 122 RRG----KACVQLARILQYLECPQYLRKSFFPKHEDLQFAGLLNPLDSPHHMRVDEDSEY 177
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L G+ V+ G+ K V +D+ L+PG+RVTV + + +A + VV
Sbjct: 178 REGVVLDRPTKPGRGSFVNCGMRKEVQIDRQLNPGLRVTVRLEEPQKPEAKVRKGTVVSS 237
Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ SG+YWGY VR A +S+VF C +K GYD IGTSE G V+ + TLP+FRH
Sbjct: 238 HHPRTVSGLYWGYSVRLASCLSAVFSECPFKEGYDLSIGTSERGSSVDQA--TLPSFRHA 295
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L+ FGGL GLE ++ D L+ + +F Y NTCP QGSRTIRTEEA+ ISL
Sbjct: 296 LVVFGGLEGLEAGVDVDPNLEVTDPSVLFDFYLNTCPGQGSRTIRTEEALLISL 349
>gi|390366072|ref|XP_003730960.1| PREDICTED: uncharacterized protein C9orf114-like
[Strongylocentrotus purpuratus]
Length = 492
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 189/309 (61%), Gaps = 22/309 (7%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSI+DN QS EL T LAGQIARA+ IF IDE+++FD S AA R
Sbjct: 188 TLSIALPGSILDNAQSPELRTYLAGQIARALVIFNIDEIIIFD---------ESGAAARD 238
Query: 122 NRSDENESGAA-------FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
N +D SG L R+LQYLE PQYLRK+ F H+ L++ G+L PLD PHH+R
Sbjct: 239 N-TDGVYSGIGKKGNANVQLARILQYLECPQYLRKSFFPKHNDLQYAGLLNPLDCPHHVR 297
Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD-S 233
E +REGV + G+ V+VGL K V +D+ L +RVTV M T + +
Sbjct: 298 ADEEIAYREGVVVDRPVAKGRGSHVNVGLYKEVEIDKKLQAQLRVTVKMNTKKGTPTKHA 357
Query: 234 PRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYK-GGYDHLIGTSEHGDIVNSSDL 291
P +VV PS P+ E+G+YWGY VR A ++ +VF C + GYD IGTSE G + D+
Sbjct: 358 PCKVVSPSAPRLEAGLYWGYSVRLASSLGAVFTGCPFSDNGYDLTIGTSERG--TKADDV 415
Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
LP F+H LI FGG+ GLE S+E D+ L + +F Y N CP QGSRTIRTEEA+ I
Sbjct: 416 DLPHFKHALIIFGGVQGLELSLEMDEELLVSDPGVLFDHYINVCPDQGSRTIRTEEAVLI 475
Query: 352 SLQYFQEPI 360
++ + I
Sbjct: 476 TMATLRPKI 484
>gi|355736575|gb|AES12043.1| hypothetical protein [Mustela putorius furo]
Length = 356
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 134/287 (46%), Positives = 177/287 (61%), Gaps = 8/287 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +
Sbjct: 76 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGEFRGVG 135
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 136 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 191
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + R +VV
Sbjct: 192 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 251
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP+FRH
Sbjct: 252 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVTSAQ--LPSFRHA 309
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
L+ FGGL GLE +E D L+ +F Y NTCP QGSRTIRTE
Sbjct: 310 LVVFGGLQGLEAGVEADPNLEVAEPSVLFDLYVNTCPSQGSRTIRTE 356
>gi|46255825|gb|AAH33677.1| Chromosome 9 open reading frame 114 [Homo sapiens]
Length = 376
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 139/304 (45%), Positives = 182/304 (59%), Gaps = 8/304 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REG+ + G+ V+ G+ K V +D+ L+PG+RVTV + ++ D + +VV
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 249
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ LP FRH
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
L+ FGGL GLE + D L+ +F Y NT P QGSRTIRTEEAI ISL Q
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTYPGQGSRTIRTEEAILISLAALQPG 367
Query: 360 ISRA 363
+ +A
Sbjct: 368 LIQA 371
>gi|195150735|ref|XP_002016306.1| GL10562 [Drosophila persimilis]
gi|194110153|gb|EDW32196.1| GL10562 [Drosophila persimilis]
Length = 501
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 189/314 (60%), Gaps = 11/314 (3%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSDNYSRS 115
T+S+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ + + +Y +
Sbjct: 74 TLSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDVGVATARETKRSYEQD 133
Query: 116 SAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRK 175
+ N + S + L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR+
Sbjct: 134 AEGNATGTV---RSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQ 190
Query: 176 HEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
+REGV + A + + +VGL V+VD+ ++PGVRVTV M +
Sbjct: 191 QSKFRYREGVISDKKAKDG-QSYANVGLLNDVLVDKAIEPGVRVTVKMDPASDTSRKQRG 249
Query: 236 QVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP 294
+V P +P+ E+G+YWGY+VR A ++S +F Y+ YD +GTS+ G V+
Sbjct: 250 TLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYESSYDVTLGTSDRGTNVHEVPNRSI 309
Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
++HLLI FGGL GLEE++ DD L + +F Y N P QGSRTIRTEEA+ I+L
Sbjct: 310 QYKHLLIVFGGLQGLEEALANDDKLTVDDPEMLFDHYVNVLPRQGSRTIRTEEALLIALA 369
Query: 355 YFQEPISRALRRVE 368
QE + L VE
Sbjct: 370 SLQEKLQPELADVE 383
>gi|195120365|ref|XP_002004699.1| GI20065 [Drosophila mojavensis]
gi|193909767|gb|EDW08634.1| GI20065 [Drosophila mojavensis]
Length = 491
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 134/310 (43%), Positives = 188/310 (60%), Gaps = 12/310 (3%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSDNYSRS 115
TVS+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ + + Y +
Sbjct: 69 TVSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDVGVTTARETKRIYEQD 128
Query: 116 SAANRSNRSDEN----ESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
A N N + + S + L R+LQYLE PQYLRK F +H L++ G+L PLD PH
Sbjct: 129 DADNNGNGAGKATSTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPH 188
Query: 172 HLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDA 231
HLR+ +REGV + A + +VGL V++D+ L+PGVRVTV M + +
Sbjct: 189 HLRQQSKFRYREGVVCDKQAKEG-HSYANVGLLNDVLIDKALEPGVRVTVKMEPSSDGSR 247
Query: 232 DSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+V P +P+ E+G+YWGY+VR A ++S +F Y GGYD ++GTS+ G V +
Sbjct: 248 KQRGTLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYDGGYDVMVGTSDRGTSVQNVP 307
Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
+ H LI FGGL GLEE++ D+ L + + +F Y N P QGSRTIRTEEA+
Sbjct: 308 NQSIGYNHALIVFGGLQGLEEALANDEKLTVDDPQLLFDHYINVLPKQGSRTIRTEEALL 367
Query: 351 ISLQYFQEPI 360
I++ QE +
Sbjct: 368 IAMAALQEKL 377
>gi|125808292|ref|XP_001360698.1| GA11422 [Drosophila pseudoobscura pseudoobscura]
gi|54635870|gb|EAL25273.1| GA11422 [Drosophila pseudoobscura pseudoobscura]
Length = 501
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 190/314 (60%), Gaps = 11/314 (3%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSDNYSRS 115
T+S+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ + + +Y +
Sbjct: 74 TLSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDVGVATARETKRSYEQD 133
Query: 116 SAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRK 175
+ N + S + L R+LQYLE PQYLRK F +H+ L++ G+L PLD PHHLR+
Sbjct: 134 AEGNATGTV---RSSSLQLARILQYLECPQYLRKYFFPLHNDLKYSGLLNPLDTPHHLRQ 190
Query: 176 HEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
+REGV + A + + +VGL V+VD+ ++PGVRVTV M +
Sbjct: 191 QSKFRYREGVISDKKAKDG-QSYANVGLLNDVLVDKAIEPGVRVTVKMDPASDTSRKQRG 249
Query: 236 QVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP 294
+V P +P+ E+G+YWGY+VR A ++S +F Y+ YD +GTS+ G V+
Sbjct: 250 TLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYESSYDVTLGTSDRGTNVHEVPNRSI 309
Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
++HLLI FGGL GLEE++ DD L + +F Y N P QGSRTIRTEEA+ I+L
Sbjct: 310 QYKHLLIVFGGLQGLEEALANDDKLTVDDPEMLFDHYVNVLPRQGSRTIRTEEALLIALA 369
Query: 355 YFQEPISRALRRVE 368
QE + L VE
Sbjct: 370 SLQEKLQPELADVE 383
>gi|149418281|ref|XP_001505833.1| PREDICTED: uncharacterized protein C9orf114-like [Ornithorhynchus
anatinus]
Length = 405
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 133/287 (46%), Positives = 182/287 (63%), Gaps = 8/287 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI++N QS EL T LAGQIARA IF +DE+VVFD + S +
Sbjct: 82 TLSVALPGSILNNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGEDSK----SVEGKF 137
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + L R+LQYLE PQYLRK+ F H L+F G+L PLD PHH+R+ E + +
Sbjct: 138 DGIGKKGRATVQLARILQYLECPQYLRKSFFPKHQDLQFAGLLNPLDGPHHVRQDEDSEY 197
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV L + G+ V+ G+ K V +D+ L+PG+RVTV + + ++ + + +VV
Sbjct: 198 REGVVLARPFKPARGSFVNCGMRKEVQIDKQLEPGLRVTVRLDRQQIPESKNHKGRVVSS 257
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
+P+ E+G+YWGY VR A +S+VF +K GYD IGTSE G ++S+ LP FRH
Sbjct: 258 QEPRTEAGLYWGYSVRLASCLSAVFAEVPFKDGYDLTIGTSERGGDLDSA--RLPDFRHA 315
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
LI FGGL GLE +++ D L+ +F Y NTCP+QGSRTIRTE
Sbjct: 316 LIVFGGLQGLEAAVDADPNLEVVEPSVLFDLYLNTCPNQGSRTIRTE 362
>gi|307111555|gb|EFN59789.1| hypothetical protein CHLNCDRAFT_133484 [Chlorella variabilis]
Length = 319
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 178/312 (57%), Gaps = 45/312 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+AV S I+N Q+LELAT LAGQIARA IF +DEVVV D+ + SSAA
Sbjct: 38 TVSIAVAASCIENAQNLELATLLAGQIARAAAIFNVDEVVVLDDSPAKQPGHVSSAA--- 94
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
A R+LQ++ETPQYL+KAL MH L++ G+LPPLDAPHHLR EW P+
Sbjct: 95 ----------ALFARVLQFMETPQYLKKALIPMHPDLKYAGVLPPLDAPHHLRSTEWGPY 144
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQ----VLDPGVRVTVAMGTNRNLDADSPRQV 237
REGV + G+ +DVGL+K + Q GVR+T++MG +QV
Sbjct: 145 REGVV--RRSAKGEGSFIDVGLDKDAHIPQASLMAARQGVRLTLSMGEQPLPATVQGQQV 202
Query: 238 V-----PPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
+ P+ P+E G+YWGY R AP++ ++ + C + GGY +GTSE G+ + L
Sbjct: 203 LRGQLALPTDPRERGGLYWGYITRIAPSLQAMLEQCPFYGGYSLTVGTSERGERTPACQL 262
Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
L FRHLL+ FGG GLE Y NTC QGSRTIR+EEAI I
Sbjct: 263 DLGAFRHLLVVFGGPQGLE--------------------YLNTCFDQGSRTIRSEEAILI 302
Query: 352 SLQYFQEPISRA 363
SL + Q ++ A
Sbjct: 303 SLAFLQPAVTAA 314
>gi|21428350|gb|AAM49835.1| GM02932p [Drosophila melanogaster]
Length = 485
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 186/311 (59%), Gaps = 5/311 (1%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK--SSSDNYSRSSAAN 119
T+S+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ +++ RS A+
Sbjct: 73 TLSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDVGIATARETKRSYEAD 132
Query: 120 RSNRSDEN-ESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
S S + L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR+
Sbjct: 133 GEGSSTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSK 192
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
+REGV + A + +VGL V+VD+ ++PGVRVTV M +V
Sbjct: 193 FRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLV 251
Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
P +P+ E+G+YWGY+VR A ++S +F Y GYD +GTS+ G V+ F
Sbjct: 252 SPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYNFN 311
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
H+LI FGGL GLE ++ D+ L + +F Y N P QGSRTIRTEEA+ I+L FQ
Sbjct: 312 HMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAAFQ 371
Query: 358 EPISRALRRVE 368
E + + VE
Sbjct: 372 EKLQPQVADVE 382
>gi|290491270|gb|ADD31614.1| GM22547p [Drosophila melanogaster]
Length = 503
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 186/311 (59%), Gaps = 5/311 (1%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK--SSSDNYSRSSAAN 119
T+S+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ +++ RS A+
Sbjct: 91 TLSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDVGIATARETKRSYEAD 150
Query: 120 RSNRSDEN-ESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
S S + L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR+
Sbjct: 151 GEGSSTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSK 210
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
+REGV + A + +VGL V+VD+ ++PGVRVTV M +V
Sbjct: 211 FRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLV 269
Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
P +P+ E+G+YWGY+VR A ++S +F Y GYD +GTS+ G V+ F
Sbjct: 270 SPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYNFN 329
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
H+LI FGGL GLE ++ D+ L + +F Y N P QGSRTIRTEEA+ I+L FQ
Sbjct: 330 HMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAAFQ 389
Query: 358 EPISRALRRVE 368
E + + VE
Sbjct: 390 EKLQPQVADVE 400
>gi|219122472|ref|XP_002181568.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406844|gb|EEC46782.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 330
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 194/319 (60%), Gaps = 21/319 (6%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+PGS++ N Q+ EL T++ GQ+ARA TI+ +DE++VFD+K + + RS
Sbjct: 14 TVSIAIPGSVVSNCQTRELRTQMVGQLARAATIYHVDEIIVFDDKLAKEMKPDRGYYQRS 73
Query: 122 N----RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
N RSD +E F+ R+ QY E PQYLR+ F MH L++ G+L P+DAPHH+R ++
Sbjct: 74 NHHGGRSDPHE----FMARVFQYCECPQYLRRDFFPMHGDLQYAGLLAPMDAPHHVRVND 129
Query: 178 WAPFREGVTLKENAPNSVGTLVDVGL-NKHVVVDQVLDPGVRVTVAMGTNRNLDADSP-- 234
A FREG+ L++ + + +LV G+ + V +D L PG+R TV + + +
Sbjct: 130 RARFREGIVLEKTSSTNGNSLVHCGIRGRPVEIDVKLTPGIRCTVQLDPKASYETGGKPN 189
Query: 235 -----RQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI-VN 287
+VV PS P++ G YWGY R A +I +VF C + G YD +GTSE G ++
Sbjct: 190 SIIRGGKVVSPSAPRKFDGTYWGYTTRLASSIKAVFDECPF-GVYDLKVGTSERGSTSLD 248
Query: 288 SSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR--KNAREVFHSYFNTCPHQGSRTIRT 345
LP+++H LI FGG+AG+EE ++ D+ L +R++F + N CP QGSRTIRT
Sbjct: 249 DGKFRLPSYQHALIVFGGVAGIEECVDADESLSLPGSQSRKLFDLWVNICPFQGSRTIRT 308
Query: 346 EEAIFISLQYFQEPISRAL 364
EEA+ I+L +S A+
Sbjct: 309 EEAVLIALAKLSPLLSTAV 327
>gi|195379965|ref|XP_002048741.1| GJ21159 [Drosophila virilis]
gi|194143538|gb|EDW59934.1| GJ21159 [Drosophila virilis]
Length = 483
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/307 (44%), Positives = 182/307 (59%), Gaps = 13/307 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ + + RS
Sbjct: 71 TVSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDV----GVATARETKRS 126
Query: 122 NRSDENESGAAF-------LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
D +G A L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR
Sbjct: 127 YEQDAEGNGTATVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLR 186
Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
+ +REGV + A + + +VGL V++D+ LDPGVRVTV M
Sbjct: 187 QQSKFQYREGVVCDKQAKDG-HSYANVGLLNDVLIDKALDPGVRVTVKMDPPNETSRKQR 245
Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+V P +P+ E+G+YWGY+VR A ++S +F Y GGYD IGTS+ G V+
Sbjct: 246 GTLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYAGGYDVTIGTSDRGASVHDVPSKS 305
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ H+LI FGGL GLEE++ D+ L +F Y N P QGSRTIRTEEA+ I+L
Sbjct: 306 IEYNHILIVFGGLQGLEEALANDEKLTVDEPELLFDHYVNVLPKQGSRTIRTEEALLIAL 365
Query: 354 QYFQEPI 360
QE +
Sbjct: 366 AALQEKL 372
>gi|195028012|ref|XP_001986876.1| GH21612 [Drosophila grimshawi]
gi|193902876|gb|EDW01743.1| GH21612 [Drosophila grimshawi]
Length = 490
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 183/305 (60%), Gaps = 9/305 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+AVPGSI++N QS EL +AGQIARA IFR+ EV+VFD+ + +R + +
Sbjct: 69 TVSIAVPGSILENAQSAELRAYVAGQIARAACIFRVHEVIVFDDVGVAT--ARETKRSYE 126
Query: 122 NRSDENESGAAF-----LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
+D +G L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR+
Sbjct: 127 QDADGGTTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQ 186
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
+REGV + A N + +VGL V+VD+ L+PGVRVTV M +
Sbjct: 187 SKFRYREGVVCDKQAKNG-HSYANVGLLNDVLVDKALEPGVRVTVKMEPTSDASRKQRGT 245
Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
+V P +P+ E+G+YWGY+VR A ++S +F YKGGYD +GTS+ G V+
Sbjct: 246 LVSPEEPRRETGVYWGYQVRIAHSLSEIFTRSPYKGGYDVTVGTSDRGTSVHDVPSQSIQ 305
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
+ H+LI FGGL GLEE++ D L +F Y N P QGSRTIRTEEA+ I+L
Sbjct: 306 YNHMLIVFGGLQGLEEALANDSKLTVDEPELLFDHYVNVLPKQGSRTIRTEEALLIALAA 365
Query: 356 FQEPI 360
QE +
Sbjct: 366 LQEKL 370
>gi|453080110|gb|EMF08162.1| DUF171-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 347
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 186/314 (59%), Gaps = 24/314 (7%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSII N + + T LAGQIARA +F +DEVVVFD+ + +
Sbjct: 37 TLSLALPGSIIANALTHDQKTSLAGQIARACAVFCVDEVVVFDDGQAETRAPEH--GGYT 94
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+D N FL +L YLETP LRKALF MH LR G LP LD PHHLR EW +
Sbjct: 95 AFADPN----FFLFHVLTYLETPPNLRKALFPMHPDLRTAGALPSLDMPHHLRAEEWCEY 150
Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPR 235
REG+T K P T VD GL++ V + +DP RVTV +G T+ + S
Sbjct: 151 REGITTHSSSGKGKMPQ---TFVDCGLSQQVAIPGAIDPNTRVTVKLGHTSSDGYTISGL 207
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDL 291
V P + +E+G YWGY VR A ++S++F C + GGYD +GTSE G I++ S
Sbjct: 208 PVSPETPREEAGYYWGYSVRQASSLSAIFTECPFDGGYDVSVGTSERGKPLKSIIDQSSP 267
Query: 292 TL--PTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCPHQGSRTIRTE 346
T PT++HL++ FGG+AGLE +++ D L+ A EVF S+ N P+QGSRTIRTE
Sbjct: 268 TYVEPTWKHLIVVFGGVAGLEAALKADSELQSAGVSQAEEVFDSWVNLVPNQGSRTIRTE 327
Query: 347 EAIFISLQYFQEPI 360
EA+++ L +E +
Sbjct: 328 EAVWVGLSGLREAV 341
>gi|167517265|ref|XP_001742973.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778072|gb|EDQ91687.1| predicted protein [Monosiga brevicollis MX1]
Length = 289
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 126/291 (43%), Positives = 175/291 (60%), Gaps = 9/291 (3%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+PGS++ N QS EL L GQIARA IFR DE+V+F +K + +R + S
Sbjct: 1 TVSLAIPGSLVANAQSAELRAYLVGQIARAAVIFRADEIVIFKDKCTQPGLARGTTQGHS 60
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
E E F+ R+L+YL+ PQYLRK LF +H L+F G+LPPLDAPHH R + + F
Sbjct: 61 KTMKEPE---LFMARVLEYLDCPQYLRKHLFPIHRDLKFAGLLPPLDAPHHTRSSDISVF 117
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGT----NRNLDADSPRQV 237
REGVTL+E GTLV+VGL K ++ +VL P +RVT+ + + L +
Sbjct: 118 REGVTLEERPRKGTGTLVEVGLRKAAIISKVLPPKMRVTIKLNEAQLHKQKLKTLYRGEA 177
Query: 238 VPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL-TLPT 295
V P++P+++ +YWGY+VR A +I VFK Y GGYD IG S+ G +N L
Sbjct: 178 VAPTEPRQAMDLYWGYQVRLAEDIEQVFKQSPYPGGYDLTIGISDKGSPLNKLHLHKHAA 237
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
+ HLL+ FGG LE + D L+ ++ E+F +Y N GSR++RTE
Sbjct: 238 YNHLLVVFGGTVSLEAIVAADSELEARDPTELFDAYLNPLSGHGSRSLRTE 288
>gi|194754479|ref|XP_001959522.1| GF12917 [Drosophila ananassae]
gi|190620820|gb|EDV36344.1| GF12917 [Drosophila ananassae]
Length = 485
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 186/315 (59%), Gaps = 13/315 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ + + RS
Sbjct: 71 TLSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDV----GIATARETKRS 126
Query: 122 NRSDENESGAAF-------LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
DEN G L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR
Sbjct: 127 YEQDENGGGTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLR 186
Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
+ +REGV + A + +VGL V+VD+ ++PGVRVTV M
Sbjct: 187 QQSKFRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAVEPGVRVTVKMDPPSETTRKQR 245
Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+V P +P+ E+G+YWGY+VR A ++S +F Y+ GYD +GTS+ G V+
Sbjct: 246 GSLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYETGYDVTLGTSDRGTNVHEVPSRS 305
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+++H+LI FGGL GLE ++ D+ L + +F Y N P QGSRTIRTEEA+ I+L
Sbjct: 306 YSYKHMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIAL 365
Query: 354 QYFQEPISRALRRVE 368
QE + + VE
Sbjct: 366 AALQEKLQPEVADVE 380
>gi|452987275|gb|EME87031.1| hypothetical protein MYCFIDRAFT_187007 [Pseudocercospora fijiensis
CIRAD86]
Length = 327
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 178/308 (57%), Gaps = 30/308 (9%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS------DNYSRS 115
T+S+A+PGSII N + + T LAGQIARA +F IDE+VVFD+ + D Y+
Sbjct: 21 TLSIALPGSIIANAVTPDQKTSLAGQIARACAVFCIDEIVVFDDGQAETRPPERDGYTAF 80
Query: 116 SAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRK 175
+ N FL +L YLETP LRKALF MH LR G LP LD PHHLR
Sbjct: 81 ADPNF------------FLYHVLTYLETPPNLRKALFPMHPDLRTAGALPSLDMPHHLRS 128
Query: 176 HEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
EW FREGV + S GTLVD GL + V + LD RVTV + ++ D
Sbjct: 129 DEWCEFREGVA--TSPARSGGTLVDCGLPQKVSIPVELDSNTRVTVQLESDVAQDGLFMG 186
Query: 236 QVVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV------NS 288
V P P+ES G YWGY VR A ++ SVF CS+ GGYD IGTSE G + +S
Sbjct: 187 TAVSPEMPRESAGYYWGYNVRQAASLGSVFTECSFDGGYDVSIGTSERGKPLSAIIQNDS 246
Query: 289 SDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCPHQGSRTIRT 345
D PT+ HLL+ FGG+AGLE +++ D L+ + A ++F + N P QGSRTIRT
Sbjct: 247 PDHVEPTWNHLLVVFGGVAGLEAALKADKELQAAGVQKAADLFDCWVNLVPGQGSRTIRT 306
Query: 346 EEAIFISL 353
EEA+++ L
Sbjct: 307 EEAVWVGL 314
>gi|406864343|gb|EKD17388.1| deoxyribose-phosphate aldolase 2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 387
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 195/317 (61%), Gaps = 27/317 (8%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD-----NKSSSDNYSRS- 115
TVSVA+PGSII N Q+ + T LAG IARA+ +F IDE+ +F NKS+ +++R+
Sbjct: 60 TVSVALPGSIIANAQTHDQRTSLAGHIARALAVFCIDEIAIFSDGHGLNKSNKRHHNRTH 119
Query: 116 SAANRSNRSDENESGAA----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
S A++++ + + +G + FLV LL YLETP +LRK LF +H +LR G L LD PH
Sbjct: 120 SFAHQADPAQDEYTGTSDPDHFLVHLLSYLETPPHLRKHLFPLHGNLRTAGTLSSLDLPH 179
Query: 172 HLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTV-----AMGTN 226
HLR EW P+REGVTL + GT V+ GL V V + + RVT+ A N
Sbjct: 180 HLRSDEWCPYREGVTLP--GADEHGTYVEAGLRIPVTVKEQIPEKTRVTLKFSEGAEAAN 237
Query: 227 RNLDADS-PRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD 284
++L+A + + V P+ P+ E+G YWG+ VR A ++S VF C Y+GGYD IG SE G
Sbjct: 238 KDLNAPTIDAEPVNPNDPREEAGYYWGFNVRKAGSLSDVFTECPYEGGYDVTIGLSERGI 297
Query: 285 IV-----NSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCP 336
V S L F+HLLI GG+AGLE +++ D L++ A +VF + N CP
Sbjct: 298 DVEKLYSGSEGQKLGKFKHLLIVLGGVAGLEVAVKNDKELQKLGVMKAEDVFDRWVNVCP 357
Query: 337 HQGSRTIRTEEAIFISL 353
+QGSRTIRTEEA++I L
Sbjct: 358 NQGSRTIRTEEAVWIGL 374
>gi|195582066|ref|XP_002080849.1| GD10707 [Drosophila simulans]
gi|194192858|gb|EDX06434.1| GD10707 [Drosophila simulans]
Length = 482
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 141/345 (40%), Positives = 199/345 (57%), Gaps = 11/345 (3%)
Query: 28 ELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQ 87
EL A S + + + +D+LN T+S+AVPGSI++N QS EL +AGQ
Sbjct: 42 ELQQAKEAESQAEKAANEVQAKDKLNPS------TLSIAVPGSILENAQSNELRAYVAGQ 95
Query: 88 IARAVTIFRIDEVVVFDNK--SSSDNYSRSSAANRSNRSDEN-ESGAAFLVRLLQYLETP 144
IARA IFR++EV+VFD+ +++ RS A+ S S + L R+LQYLE P
Sbjct: 96 IARAACIFRVNEVIVFDDVGIATARETKRSYEADAEGSSTGTVRSSSLQLARILQYLECP 155
Query: 145 QYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLN 204
QYLRK F +H L++ G+L PLD PHHLR+ +REGV + A + +VGL
Sbjct: 156 QYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSKFRYREGVICDKKAKEG-HSYANVGLL 214
Query: 205 KHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSV 263
V+VD+ ++PGVRVTV M +V P +P+ E+G+YWGY+VR A ++S +
Sbjct: 215 NDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLVSPDEPRRETGVYWGYQVRIAHSMSEI 274
Query: 264 FKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN 323
F Y GYD +GTS+ G V+ +F+H+LI FGGL GLE ++ D+ L +
Sbjct: 275 FTKSPYANGYDVTVGTSDRGTNVHEVPNRSYSFKHMLIVFGGLQGLESALANDEKLTVDD 334
Query: 324 AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVE 368
+F Y N P QGSRTIRT+EA+ I+L QE + + VE
Sbjct: 335 PELLFDHYLNVLPRQGSRTIRTDEALLIALAALQEKLQPQVADVE 379
>gi|195475216|ref|XP_002089880.1| GE19326 [Drosophila yakuba]
gi|194175981|gb|EDW89592.1| GE19326 [Drosophila yakuba]
Length = 482
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 142/349 (40%), Positives = 197/349 (56%), Gaps = 19/349 (5%)
Query: 28 ELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQ 87
EL LA S + + + ++Q N T+S+AVPGSI++N QS EL +AGQ
Sbjct: 42 ELQLAKEAESQAEQVANEVQAKEQANPS------TLSIAVPGSILENAQSNELRAYVAGQ 95
Query: 88 IARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAF-------LVRLLQY 140
IARA IFR++EV+VFD+ + + RS +D SG L R+LQY
Sbjct: 96 IARAACIFRVNEVIVFDDV----GIATARETKRSYEADAEGSGTGTVRSSSLQLARILQY 151
Query: 141 LETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVD 200
LE PQYLRK F +H L++ G+L PLD PHHLR+ +REGV + A + +
Sbjct: 152 LECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSKFRYREGVICDKKAKEG-HSYAN 210
Query: 201 VGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPN 259
VGL V+VD+ ++PGVRVTV M +V P +P+ E+G+YWGY+VR A +
Sbjct: 211 VGLLNDVLVDKAIEPGVRVTVKMEHQSENCRKQRGTLVSPDEPRRETGVYWGYQVRIAHS 270
Query: 260 ISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGL 319
+S +F Y GYD +GTS+ G V+ +F+H+LI FGGL GLE ++ D+ L
Sbjct: 271 LSEIFTKSPYASGYDVTLGTSDRGTNVHEVPNRSYSFKHMLIVFGGLQGLESALANDEKL 330
Query: 320 KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVE 368
+ +F Y N P QGSRTIRTEEA+ I+L QE + + VE
Sbjct: 331 TVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAALQEKLQPQVADVE 379
>gi|194858153|ref|XP_001969113.1| GG24128 [Drosophila erecta]
gi|190660980|gb|EDV58172.1| GG24128 [Drosophila erecta]
Length = 482
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 185/315 (58%), Gaps = 13/315 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ + + RS
Sbjct: 70 TLSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDV----GIATARETKRS 125
Query: 122 NRSDENESGAAF-------LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
+D SG L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR
Sbjct: 126 YEADAEGSGTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLR 185
Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
+ +REGV + A + +VGL V+VD+ ++PGVRVTV M
Sbjct: 186 QQSKFRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAIEPGVRVTVKMEPQSENCRKQR 244
Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+V P +P+ E+G+YWGY+VR A ++S +F Y GYD +GTS+ G V+
Sbjct: 245 GTLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYASGYDVTLGTSDRGTNVHEVPNRS 304
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+F+H+LI FGGL GLE ++ D+ L + +F Y N P QGSRTIRTEEA+ I+L
Sbjct: 305 YSFKHMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIAL 364
Query: 354 QYFQEPISRALRRVE 368
QE + + VE
Sbjct: 365 AALQEKLQPQVAEVE 379
>gi|195430968|ref|XP_002063520.1| GK21364 [Drosophila willistoni]
gi|194159605|gb|EDW74506.1| GK21364 [Drosophila willistoni]
Length = 516
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/310 (43%), Positives = 183/310 (59%), Gaps = 11/310 (3%)
Query: 56 APIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSS 109
A + TVS+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ + +
Sbjct: 68 ATVNPSTVSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDVGVATARETK 127
Query: 110 DNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDA 169
+Y + N S S + L R+LQYLE PQYLRK F +H L++ G+L PLD
Sbjct: 128 KSYEEDAEGNASGTV---RSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDT 184
Query: 170 PHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNL 229
PHHLR+ +REGV + A + +VGL V+VD+ ++PGVRVTV M
Sbjct: 185 PHHLRQQSKFRYREGVICDKKAKEG-QSYANVGLLNDVLVDKAIEPGVRVTVKMDQPNEA 243
Query: 230 DADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS 288
+V P +P+ E+G+YWGY+VR A ++S +F Y GYD +GTS+ G V+
Sbjct: 244 QRKQRGTLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYDSGYDVTLGTSDRGKNVHE 303
Query: 289 SDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
++ HLLI FGGL GLE ++ D+ L + +F Y N P QGSRTIRTEEA
Sbjct: 304 VPAKSFSYNHLLIVFGGLQGLESALSNDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEA 363
Query: 349 IFISLQYFQE 358
+ I+L QE
Sbjct: 364 LLIALAALQE 373
>gi|154323528|ref|XP_001561078.1| hypothetical protein BC1G_00163 [Botryotinia fuckeliana B05.10]
Length = 385
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 136/316 (43%), Positives = 182/316 (57%), Gaps = 26/316 (8%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGSII N QS +L T +AGQIARA+ +F +DEV+VFD+ ++ +S
Sbjct: 61 TVSVALPGSIIANAQSHDLKTSMAGQIARALAVFCVDEVIVFDDGNARPAKKPKHPPTQS 120
Query: 122 NRSD-----ENESGAA-----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
+ S E+E A FL LL +LETP LRK LF +H +LR G LP +D PH
Sbjct: 121 HNSQQKPPLEDEYTAKSDPDHFLTHLLSFLETPPNLRKHLFPIHENLRTAGALPSIDLPH 180
Query: 172 HLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDA 231
HLR EW +REG TL + GT V+ GL V V + + RVT+ A
Sbjct: 181 HLRADEWCMYREGTTLP--GADEHGTFVEAGLRIPVTVVEQIPANNRVTLKFSVEGEKAA 238
Query: 232 DS------PRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG- 283
P + V P +P+ E G YWG+ VR A +S VF C Y GGYD I TSE G
Sbjct: 239 KDKSCEIIPAEAVSPEEPREEGGYYWGWNVRRAGCLSDVFTECGYDGGYDMTIATSERGV 298
Query: 284 ---DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN---AREVFHSYFNTCPH 337
D+ + + + F+HLL+ FGG+AGLE +++ D+ L++ A++VF + N CP
Sbjct: 299 DVQDLYDDEEQKVGKFKHLLVVFGGVAGLEVAVKNDEELQKLGVVEAKDVFDRWINVCPG 358
Query: 338 QGSRTIRTEEAIFISL 353
QGSRTIRTEEA++I L
Sbjct: 359 QGSRTIRTEEAVWIGL 374
>gi|328863664|gb|EGG12763.1| hypothetical protein MELLADRAFT_32413 [Melampsora larici-populina
98AG31]
Length = 316
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 181/308 (58%), Gaps = 22/308 (7%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD------NYSRS 115
T+S+A+P SI++N Q+ EL T L GQIARA IF +DE+V+F+ +D NY R
Sbjct: 5 TISIALPSSIVNNAQTWELKTALVGQIARACAIFSVDEIVIFEETLPADSAIAPSNYVRG 64
Query: 116 SA--ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL 173
+ +S+E + F+ R+L+YLE PQYLRK+LF +H LR G+LPPLD PHH
Sbjct: 65 KYRDVDEGQQSEEPFQPSQFMARILEYLECPQYLRKSLFPLHPDLRLAGLLPPLDLPHHF 124
Query: 174 RKHEWAPFREGVTL-KENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
R+ P+REG L +E A N L NK Q P V V V + + D
Sbjct: 125 RRDHQTPWREGCVLPREKADN---YLAGNKPNKRKKDGQ--SPTVWVDVGLAEPVEVKLD 179
Query: 233 SPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSY--KGGYDHLIGTSEHGDIVNSS 289
P ++V P +P E SG YWGY VR A +IS VF Y GGYD IGTSE G + +
Sbjct: 180 -PARIVSPRQPTETSGTYWGYSVRLATSISKVFTESPYLKTGGYDLTIGTSERGQNITEA 238
Query: 290 DLTLPTFRHLLIAFGGLAGLEESIEEDD----GLKRKNAREVFHSYFNTCPHQGSRTIRT 345
+ +F+HLL+AFGGL+GLE I D+ L ++A +F + NT PHQGSRT+RT
Sbjct: 239 LQDMGSFKHLLLAFGGLSGLELCIASDETLTPNLSAEDASLLFDHWVNTLPHQGSRTVRT 298
Query: 346 EEAIFISL 353
EEA+ +SL
Sbjct: 299 EEALLVSL 306
>gi|398388217|ref|XP_003847570.1| hypothetical protein MYCGRDRAFT_15728, partial [Zymoseptoria
tritici IPO323]
gi|339467443|gb|EGP82546.1| hypothetical protein MYCGRDRAFT_15728 [Zymoseptoria tritici IPO323]
Length = 313
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 180/307 (58%), Gaps = 28/307 (9%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSII N + + T LAGQIARA +F +DE+VVF++ + S +
Sbjct: 16 TLSIALPGSIIANAVTPDQKTSLAGQIARACAVFCVDEIVVFNDGQAETRAPEPS--GYT 73
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+D N FL +L YLETP LRK+LF MH LR G LP LD PHHLR EW F
Sbjct: 74 AFTDPN----FFLYHVLTYLETPPNLRKSLFPMHPDLRTAGALPSLDMPHHLRSDEWCEF 129
Query: 182 REGVTLKENAPNSVG-TLVDVGLNKHVVVDQVLDPGVRVTV----AMGTNRNLDADSPRQ 236
REG+T P G TLVD GL + V + +DP RVTV ++ T + D+
Sbjct: 130 REGIT---TGPGKRGTTLVDCGLGRKVSIPVEIDPSTRVTVQLPQSLPTGDTISGDA--- 183
Query: 237 VVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--- 292
V P P+E +G YWGY+VR AP++ S+F C Y GGYD IGTSE G ++S T
Sbjct: 184 -VAPDTPREQAGYYWGYQVRQAPSLGSIFTECPYDGGYDISIGTSERGTSLDSLTDTSSP 242
Query: 293 ---LPTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCPHQGSRTIRTE 346
PT+ HL++ FGG+AGLE +++ D L+ A +VF + N QGSRTIRTE
Sbjct: 243 SHVAPTWNHLVVVFGGVAGLESALKADAELQAAGVTQAADVFDCWINLVSGQGSRTIRTE 302
Query: 347 EAIFISL 353
EA+++ L
Sbjct: 303 EAVWVGL 309
>gi|426226131|ref|XP_004023663.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C9orf114
homolog, partial [Ovis aries]
Length = 350
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 176/306 (57%), Gaps = 17/306 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 53 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 112
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 113 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 168
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
REGV + G+ V+ G+ K V +D+ L+PG+RVTV + N+ ++ + R +VV
Sbjct: 169 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQNQLPESKTYRGKVVSS 228
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ P FRH
Sbjct: 229 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--FPNFRHA 286
Query: 300 LIAFGGLAGLE--ESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
L+ FGGL GLE D G A P Q RTIRTEEAI ISL Q
Sbjct: 287 LVVFGGLQGLEAVAGAXGDPGTLSPWAA-------TPSPTQAGRTIRTEEAILISLAALQ 339
Query: 358 EPISRA 363
+++A
Sbjct: 340 PGLTQA 345
>gi|20129839|ref|NP_610543.1| CG12128, isoform A [Drosophila melanogaster]
gi|7303814|gb|AAF58861.1| CG12128, isoform A [Drosophila melanogaster]
Length = 485
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 185/311 (59%), Gaps = 5/311 (1%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK--SSSDNYSRSSAAN 119
T+S+AVPGSI++N QS EL +AGQIARA IFR++EV+VFD+ +++ RS A+
Sbjct: 73 TLSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDVGIATARETKRSYEAD 132
Query: 120 RSNRSDEN-ESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
S S + L R+LQYLE PQYLRK F +H L++ G+L PLD PHHLR+
Sbjct: 133 GEGSSTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSK 192
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
+REGV + A + +VGL V+VD+ ++PGVRVTV M +V
Sbjct: 193 FRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLV 251
Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
P +P+ E+G+YWGY+VR A ++S +F Y GYD +GTS+ G V+ F
Sbjct: 252 SPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYNFN 311
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
H+LI FGGL GLE ++ D+ L + +F Y N P QGSRTIRTEEA+ I+L Q
Sbjct: 312 HMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAALQ 371
Query: 358 EPISRALRRVE 368
E + + VE
Sbjct: 372 EKLQPQVADVE 382
>gi|71021553|ref|XP_761007.1| hypothetical protein UM04860.1 [Ustilago maydis 521]
gi|46100927|gb|EAK86160.1| hypothetical protein UM04860.1 [Ustilago maydis 521]
Length = 491
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 151/364 (41%), Positives = 203/364 (55%), Gaps = 83/364 (22%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD-----NKSSSDNYSRSS 116
T+SVA+PGSI+ N QS EL +RLA IAR IF +DE+VVFD ++ +D Y
Sbjct: 123 TISVAIPGSIVLNAQSPELQSRLAAHIARTCAIFNVDEIVVFDEGQVRTEAGADLYQHKH 182
Query: 117 AANRSNR--------------SDENES------------GA------AFLVRLLQYLETP 144
N+ NR +DE E GA FL R+LQYLETP
Sbjct: 183 --NQQNRGTKRRFNQPQGNPNADEQEESTEQRDARAEAEGARGFDPHTFLARVLQYLETP 240
Query: 145 QYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL------KENAPNSVGTL 198
QYLRKALF MH LR G++PPLD PHHLR + + +REGVT+ + NSV
Sbjct: 241 QYLRKALFPMHRDLRLAGLMPPLDCPHHLRFEDQSEYREGVTVDPPHWARRGGQNSV--Y 298
Query: 199 VDVGLNKHVVVDQV-----LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGY 252
V+VGL + V+ ++ ++ G RVTV+M +DA ++V P +P E G YWGY
Sbjct: 299 VNVGL-RDVIEAKLPEGARVEAGTRVTVSM----PMDAYGAGEIVSPREPVEQLGWYWGY 353
Query: 253 KVRYAPNISSVFKNCSYK-GGYDHLIGTSEHGDIVNSSDLTLPT---------------- 295
VR A ++S+V C + GGYD +IGTSE G V+ +DLTL T
Sbjct: 354 SVRLANSLSAVLTGCPFAGGGYDLVIGTSERG--VSLTDLTLATSNNTPVALASGETVQP 411
Query: 296 ----FRHLLIAFGGLAGLEESIEEDDGLK--RKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
F H L+ FGGL+GLE ++E+D +K R A+++F ++ N +QGSRT+RTEEAI
Sbjct: 412 MSEQFNHALLVFGGLSGLEVAVEQDAAIKLDRHTAKDLFDAWINVVENQGSRTVRTEEAI 471
Query: 350 FISL 353
I+L
Sbjct: 472 MIAL 475
>gi|443900037|dbj|GAC77364.1| uncharacterized conserved protein [Pseudozyma antarctica T-34]
Length = 462
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 147/352 (41%), Positives = 190/352 (53%), Gaps = 68/352 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSDNYSRS 115
TVSVA+PGSI+ N QS EL +RLA IARA IF +DE+VVFD + + R
Sbjct: 105 TVSVAIPGSIVVNAQSPELQSRLAAHIARACAIFNVDEIVVFDEGDVRSIDPDAHSQQRP 164
Query: 116 SAANRSNRSDENESGAA---------------------FLVRLLQYLETPQYLRKALFSM 154
A R E +S AA FL R+LQYLETPQYLRKALF M
Sbjct: 165 PARGSKRRWSEPQSAAANEGEGEEQQEESDNKGFDPQTFLARVLQYLETPQYLRKALFPM 224
Query: 155 HSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSV-----GTLVDVGLNKHVV- 208
H LR G++PPLD PHHLR + +REGVT+ + P+ G V+VGL +
Sbjct: 225 HRDLRLAGLMPPLDCPHHLRFEDECEYREGVTV--DPPHWARGRGQGVYVNVGLRDPIQA 282
Query: 209 -----VDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISS 262
V ++ G RVT+ M A ++V P +P E G YWGY VR AP++S+
Sbjct: 283 TLPGGVPARVETGTRVTIKM----PYSAHEAGEIVSPREPVEKLGWYWGYTVRLAPSLSA 338
Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF-------------------RHLLIAF 303
V +C + YD +IGTSE G V+ +DL L T +H LI F
Sbjct: 339 VLTDCPFGSEYDLVIGTSERG--VSLTDLALATSQATPVATPEGQTVQPLAQPQHALIVF 396
Query: 304 GGLAGLEESIEEDDG--LKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
GGL+GLE ++E D G L R NA E+F ++ N +QGSRT+R+EEAI I+L
Sbjct: 397 GGLSGLELAVEADAGIQLDRNNAAELFDAWVNVVDNQGSRTVRSEEAIMIAL 448
>gi|322696041|gb|EFY87839.1| deoxyribose-phosphate aldolase [Metarhizium acridum CQMa 102]
Length = 340
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 179/298 (60%), Gaps = 13/298 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+P SI+ N + + + G+IARA+ +F +DEVVVFD+ +S ++ A +
Sbjct: 40 TVSVAIPTSILTNLATADQRMSIPGRIARALAVFAVDEVVVFDDSPASSRPRQTDTAGYT 99
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+D A FL +L +LE P ++RKALF +H +LR +LP LD PHH EW P+
Sbjct: 100 GDTDP----AHFLTHILSFLEAPPFMRKALFPLHPNLRLTALLPSLDMPHHPNPKEWIPY 155
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGVT+ GTLV+VGL++ V +++ + P R+T+ ++ + P+ V P +
Sbjct: 156 REGVTVPGKTSTGSGTLVEVGLDEPVEIEEDIPPKTRLTLLFPDDQ---SQYPQCVDPQA 212
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP-TFRHLL 300
E G YWGY VR P++SSVF Y+ GYD IGTSE G ++ + + P F+HLL
Sbjct: 213 PRTEGGYYWGYTVRKCPSLSSVFTESPYEDGYDFSIGTSERGTPISKAFSSKPLNFKHLL 272
Query: 301 IAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
I FGG GLE + D+ L + + RE+F + N P QGSRTIRTEEA+FI+L
Sbjct: 273 IVFGGPRGLEFASMNDEELVKMDIQGGRTRELFDQWVNVLPGQGSRTIRTEEALFIAL 330
>gi|221484612|gb|EEE22906.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 833
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 139/347 (40%), Positives = 189/347 (54%), Gaps = 60/347 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+P SIIDN Q+ EL L GQIAR +T+F +DE++V+++ ++ A +R
Sbjct: 484 TLSIALPASIIDNAQTAELRAALVGQIARTLTVFGVDEIIVYEDVAA--------AISRG 535
Query: 122 NRSDENESGA-AFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
DE S A F VR L+YLETPQ+LRK+LF +H LRF G+ PLDAPHHLR++EW P
Sbjct: 536 ANKDEGHSPALEFFVRNLRYLETPQFLRKSLFPIHPDLRFAGLQNPLDAPHHLRRNEWLP 595
Query: 181 FREGVTL-----KENAP--------------------------NSVGTLVDVGLNKHVVV 209
+REGV + +E P N G VD GL HV V
Sbjct: 596 YREGVVIASSKTREETPQGKSESRAPAKGEEVKLTAAEKKRLKNKKGAWVDCGLPAHVWV 655
Query: 210 -DQVLDPGVRVTVAMGTN-RNL--------DADSP----RQVVPPSKPKESGMYWGYKVR 255
D L G+RVT+ + + R L DA+ P V P P ++G+YWGY+VR
Sbjct: 656 PDTRLQDGMRVTIRLDPSVRQLQRQPRERGDAEPPLMKGAAVSPDEPPVKAGLYWGYRVR 715
Query: 256 YAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLP-TFRHLLIAFGGLAGLEES 312
A + VF NC + G YD +GTSE G V D P ++H+L+ FGGL GLE
Sbjct: 716 IAQHFQDVFSNCPFSSDGRYDLTVGTSERGTCVG-RDFAFPMNYKHMLLVFGGLQGLEAV 774
Query: 313 I--EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
+ + + ++ +F Y NTCP Q SRTIR EEA+ I+L +
Sbjct: 775 LLDRQSNCAPCRDPSTLFDMYLNTCPFQRSRTIRAEEAVPITLALLR 821
>gi|237839871|ref|XP_002369233.1| hypothetical protein TGME49_085950 [Toxoplasma gondii ME49]
gi|211966897|gb|EEB02093.1| hypothetical protein TGME49_085950 [Toxoplasma gondii ME49]
Length = 833
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 139/347 (40%), Positives = 189/347 (54%), Gaps = 60/347 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+P SIIDN Q+ EL L GQIAR +T+F +DE++V+++ ++ A +R
Sbjct: 484 TLSIALPASIIDNAQTAELRAALVGQIARTLTVFGVDEIIVYEDVAA--------AISRG 535
Query: 122 NRSDENESGA-AFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
DE S A F VR L+YLETPQ+LRK+LF +H LRF G+ PLDAPHHLR++EW P
Sbjct: 536 ANKDEGHSPALEFFVRNLRYLETPQFLRKSLFPIHPDLRFAGLQNPLDAPHHLRRNEWLP 595
Query: 181 FREGVTL-----KENAP--------------------------NSVGTLVDVGLNKHVVV 209
+REGV + +E P N G VD GL HV V
Sbjct: 596 YREGVVIASSKTREETPQGKSESRAPAKGEEVKLTAAEKKRLKNKKGAWVDCGLPAHVWV 655
Query: 210 -DQVLDPGVRVTVAMGTN-RNL--------DADSP----RQVVPPSKPKESGMYWGYKVR 255
D L G+RVT+ + + R L DA+ P V P P ++G+YWGY+VR
Sbjct: 656 PDTRLQDGMRVTIRLDPSVRQLQRQPRERGDAEPPLMKGAAVSPDEPPVKAGLYWGYRVR 715
Query: 256 YAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLP-TFRHLLIAFGGLAGLEES 312
A + VF NC + G YD +GTSE G V D P ++H+L+ FGGL GLE
Sbjct: 716 IAQHFQDVFSNCPFSSDGRYDLTVGTSERGTCVG-RDFAFPMNYKHMLLVFGGLQGLEAV 774
Query: 313 I--EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
+ + + ++ +F Y NTCP Q SRTIR EEA+ I+L +
Sbjct: 775 LLDRQSNCAPCRDPSTLFDMYLNTCPFQRSRTIRAEEAVPITLALLR 821
>gi|221504806|gb|EEE30471.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 833
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 139/347 (40%), Positives = 189/347 (54%), Gaps = 60/347 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+P SIIDN Q+ EL L GQIAR +T+F +DE++V+++ ++ A +R
Sbjct: 484 TLSIALPASIIDNAQTAELRAALVGQIARTLTVFGVDEIIVYEDVAA--------AISRG 535
Query: 122 NRSDENESGA-AFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
DE S A F VR L+YLETPQ+LRK+LF +H LRF G+ PLDAPHHLR++EW P
Sbjct: 536 ANKDEGHSPALEFFVRNLRYLETPQFLRKSLFPIHPDLRFAGLQNPLDAPHHLRRNEWLP 595
Query: 181 FREGVTL-----KENAP--------------------------NSVGTLVDVGLNKHVVV 209
+REGV + +E P N G VD GL HV V
Sbjct: 596 YREGVVIASSKTREETPQGKSESRAPAKGEEVKLTAAEKKRLKNKKGAWVDCGLPAHVWV 655
Query: 210 -DQVLDPGVRVTVAMGTN-RNL--------DADSP----RQVVPPSKPKESGMYWGYKVR 255
D L G+RVT+ + + R L DA+ P V P P ++G+YWGY+VR
Sbjct: 656 PDTRLQDGMRVTIRLDPSVRQLQRQPRERGDAEPPLMKGAAVSPDEPPVKAGLYWGYRVR 715
Query: 256 YAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLPT-FRHLLIAFGGLAGLEES 312
A + VF NC + G YD +GTSE G V D P ++H+L+ FGGL GLE
Sbjct: 716 IAQHFQDVFSNCPFSSDGRYDLTVGTSERGTCVG-RDFAFPVNYKHMLLVFGGLQGLEAV 774
Query: 313 I--EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
+ + + ++ +F Y NTCP Q SRTIR EEA+ I+L +
Sbjct: 775 LLDRQSNCAPCRDPSTLFDMYLNTCPFQRSRTIRAEEAVPITLALLR 821
>gi|378731584|gb|EHY58043.1| hypothetical protein HMPREF1120_06061 [Exophiala dermatitidis
NIH/UT8656]
Length = 373
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 136/309 (44%), Positives = 183/309 (59%), Gaps = 27/309 (8%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSII N +S E T LAGQIARA+ +F +DE+V+FD++S R + A
Sbjct: 54 TLSVALPGSIIANAKSHEQKTYLAGQIARALAVFCVDEIVIFDDESPDVQRRRPAIAEHD 113
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + FL LL YLETP +LRK LF MH +LR G LP LD PHHLR +EW +
Sbjct: 114 YTAFSHPD--HFLAHLLSYLETPPHLRKTLFPMHPNLRTAGTLPSLDMPHHLRANEWCEY 171
Query: 182 REGVTLKENAP---NSVGTLVDVGLNK-HVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
REGV+L + S TLVDVGL++ + D + G RVTV + + A +
Sbjct: 172 REGVSLGPTSSPRGTSTATLVDVGLSEPREIPDSDIPRGTRVTVELEEPPSKLATA---- 227
Query: 238 VPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----------DIV 286
V P P+E+ G YWGY VR ++S+VF C + GGYD IGTSE G +
Sbjct: 228 VSPDTPREAKGYYWGYAVRQCSSLSNVFTECPFDGGYDISIGTSERGVPLSEVFARPEFA 287
Query: 287 NSSDLTLPTFRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSR 341
++ LP F+H+LI FGG+AGLE + + D G+ KN +++F + N P QGSR
Sbjct: 288 DADSKPLP-FQHVLIVFGGVAGLETAAKNDPQLQEMGITPKNVKDLFDYWVNVLPGQGSR 346
Query: 342 TIRTEEAIF 350
TIRTEEA++
Sbjct: 347 TIRTEEAVW 355
>gi|313212170|emb|CBY17791.1| unnamed protein product [Oikopleura dioica]
Length = 458
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 124/301 (41%), Positives = 181/301 (60%), Gaps = 17/301 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSI+DN QS EL T LAGQIARA +F +DE+V+FD + + +
Sbjct: 158 TLSIALPGSILDNAQSFELRTYLAGQIARACVVFNVDEIVIFDESGEAAKDTTGEFSGVK 217
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + N++ L R+LQYLE PQYLRK LF +H L+ G+L PLD PHHLR +E++ +
Sbjct: 218 KKGNPNQT----LGRILQYLECPQYLRKDLFPVHKDLKNAGLLNPLDTPHHLRANEFSKY 273
Query: 182 REGVTLKENAPN----SVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD--SPR 235
REG+ ++ P+ G+ ++ G+ V V++ RVTV + + D + +
Sbjct: 274 REGIV--QDRPSGKKKGKGSWINCGIEHDVRVEKEAPAFSRVTVKLDDEQREDGRFLTGK 331
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
V +ESG YWGY VR A ++ VF + ++G YD +IGTSE G N +
Sbjct: 332 LVTAAQVREESGTYWGYNVRLADSLGQVFMHSPWEGNYDTIIGTSERG--TNIKKMPPQK 389
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
+H +I FGG+ GLE S E+D LK ++ ++F + NTCP+QGSRTIRTEEAI ++
Sbjct: 390 AKHYMIVFGGVKGLEHSFEQDPELKENGIESVTDLFDMWLNTCPNQGSRTIRTEEAILVT 449
Query: 353 L 353
L
Sbjct: 450 L 450
>gi|346465421|gb|AEO32555.1| hypothetical protein [Amblyomma maculatum]
Length = 284
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 120/271 (44%), Positives = 163/271 (60%), Gaps = 4/271 (1%)
Query: 95 FRIDEVVVFDNKSSSDNYSRSSAANRS-NRSDENESGAAFLVRLLQYLETPQYLRKALFS 153
F +DE+VV+D+ S+ + +A + E+ G L R+LQYLE PQYLRK LF
Sbjct: 6 FVVDEIVVYDDDGSTVRGKKGAAVDGEFGGLGESGQGVVQLARVLQYLECPQYLRKHLFP 65
Query: 154 MHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVL 213
+H L+F G+L PLD PHHLR E +P+REG+ + G+ V+ GL+K V +D+ L
Sbjct: 66 LHRDLQFAGLLNPLDCPHHLRATEESPYREGIVARLPIKEGSGSYVNCGLSKEVKIDRCL 125
Query: 214 DPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGY 273
PGVRVTV + V P E+G+YWGY+VR A ++ +V C + GGY
Sbjct: 126 QPGVRVTVKLKEGTPGXXLRGTAVSPSCPRTEAGLYWGYEVRVAGSLGAVLAECPFPGGY 185
Query: 274 DHLIGTSEHGDIVNSSDLTLP-TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYF 332
D +GTSE G V+ +L LP F+H L+ FGGL GLE ++E D+ L + +F Y
Sbjct: 186 DITLGTSERGTPVD--ELELPHCFQHALVVFGGLKGLEAALEADEALDVDDPAYLFSHYL 243
Query: 333 NTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
NTCP+QGSRTIRTEEAI ISL + I+ A
Sbjct: 244 NTCPNQGSRTIRTEEAILISLSTLRPKIAVA 274
>gi|388857119|emb|CCF49334.1| uncharacterized protein [Ustilago hordei]
Length = 484
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 145/362 (40%), Positives = 193/362 (53%), Gaps = 76/362 (20%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----KSSSDNYS-RSS 116
T+SVA+PGSI+ N QS EL +RLA IARA IF +DE+VVF+ +SD Y R +
Sbjct: 114 TISVAIPGSIVLNAQSPELQSRLAAHIARACAIFNVDEIVVFNEGELRSETSDPYQHRHN 173
Query: 117 AANRSNRSDENESGAA----------------------------FLVRLLQYLETPQYLR 148
+R + NE A FL R+LQYLETPQYLR
Sbjct: 174 QQHRGTKRRYNEPQNAKTQQGEGEVQEGGEGEEEGKGGGFEPHTFLARVLQYLETPQYLR 233
Query: 149 KALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL--------KENAPNSVGTLVD 200
KALF MH LR G++PPLD PHHLR + + +REGVT+ + V+
Sbjct: 234 KALFPMHRDLRLAGLMPPLDCPHHLRFEDESEYREGVTVDTPHWARNRGAGGGGNSVYVN 293
Query: 201 VGLNKHV----VVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVR 255
+GL + L+ GVRVTV M N P +V P +P E G YWGY VR
Sbjct: 294 IGLRDPIEASLPTTAKLESGVRVTVRMPIN----TYEPGSIVSPREPVERLGWYWGYSVR 349
Query: 256 YAPNISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLTLPT------------------ 295
A ++SSV +C + GYD +IGTSE G V+ +DL + T
Sbjct: 350 LASSLSSVLTSCPFSSSSGYDLVIGTSERG--VSLADLAIATSAQTPVALPSGETIQPLP 407
Query: 296 --FRHLLIAFGGLAGLEESIEEDDGLK--RKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
F H L+ FGGL+GLE ++E+D+G+K R A+++F ++ N +QGSRT+RTEEAI I
Sbjct: 408 SSFNHALVVFGGLSGLEVAVEQDEGIKLDRHTAKDLFDAWVNVVENQGSRTVRTEEAIMI 467
Query: 352 SL 353
+L
Sbjct: 468 AL 469
>gi|322703851|gb|EFY95453.1| deoxyribose-phosphate aldolase 2 [Metarhizium anisopliae ARSEF 23]
Length = 339
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 178/298 (59%), Gaps = 13/298 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+P SI+ N + + + G+IARA+ +F +DEVVVFD+ +S ++ A +
Sbjct: 39 TVSVAIPTSILTNLATADQRMSIPGRIARALAVFAVDEVVVFDDSPASSRPRQTDTAGYT 98
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+D A FL +L +LE P ++RKALF +H +LR +LP LD PHH +W +
Sbjct: 99 GDTDP----AHFLTHILSFLEAPPFMRKALFPLHPNLRLTALLPSLDMPHHPNPKDWISY 154
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGVT+ GTLV+VGL++ V +++ + P R+T+ ++ + P+ V P +
Sbjct: 155 REGVTIPGKTSTGSGTLVEVGLDEPVEIEEDIPPKTRLTLLFPDDQ---SQYPQCVDPQA 211
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-FRHLL 300
E G YWGY VR P++SSVF Y+ GYD IGTSE G ++ + + P F+HLL
Sbjct: 212 PRTEGGYYWGYTVRKCPSLSSVFTESPYEDGYDFSIGTSERGTPISKAFSSKPLKFKHLL 271
Query: 301 IAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
I FGG GLE + D+ L + + RE+F + N P QGSRTIRTEEA+FI+L
Sbjct: 272 IVFGGPRGLEFASMNDEELVKMDIQGGRTRELFDQWVNVLPGQGSRTIRTEEALFIAL 329
>gi|397575650|gb|EJK49808.1| hypothetical protein THAOC_31275, partial [Thalassiosira oceanica]
Length = 532
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 139/366 (37%), Positives = 197/366 (53%), Gaps = 80/366 (21%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS-----DNYSRS 115
PTVS+A+PGS++ N Q+ EL T+LAGQIARA +FR+DEVVV+D+ S NY R
Sbjct: 148 PTVSIAIPGSVVSNAQTRELQTQLAGQIARAAAVFRVDEVVVYDDGLGSTLKTMSNYRRG 207
Query: 116 SAANR-------------SNRSDENESGA---------------AFLVRLLQYLETPQYL 147
+ R + R ++ G FL R+LQY E
Sbjct: 208 NQRRRDGEGGDDEGKNTIAKRKEDPRQGVDTEKPAHLQASTDPHTFLARILQYCE----- 262
Query: 148 RKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGL-NKH 206
+H L+F G+LPPLDAPHHLR+ + A +REG+ + N +LVD G+ N+
Sbjct: 263 ------LHPDLQFCGLLPPLDAPHHLRRGDVATYREGIVVDNEDANDSTSLVDCGVPNRL 316
Query: 207 VVVDQVLDPGVRVTVAM-------GTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAP 258
V +D+ + PGVR TV + G ++ +VV P+ P+ E G+YWGY R A
Sbjct: 317 VKIDRKVPPGVRCTVRLEPKAYETGKKGHMKG----EVVSPTCPRDEEGVYWGYTTRMAS 372
Query: 259 NISSVFKNCSYKGGYDHLIGTSEHGDI-VNSSDLTL--------------PT-----FRH 298
I ++F C Y G YD +GTSE GD+ ++ L P+ F H
Sbjct: 373 TIDAIFAECPY-GSYDMKVGTSERGDVSIDDPKFCLRKRRENDGKPARRPPSAESEGFDH 431
Query: 299 LLIAFGGLAGLEESIE--EDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
LLI FGG+AG+EES++ E L +++R++F + N CP+QGSRTIR+EEA+FI+L
Sbjct: 432 LLIVFGGVAGIEESVDAHESMHLSGRDSRKLFDVWVNVCPYQGSRTIRSEEAVFITLARL 491
Query: 357 QEPISR 362
I+R
Sbjct: 492 SPYIAR 497
>gi|156057771|ref|XP_001594809.1| hypothetical protein SS1G_04617 [Sclerotinia sclerotiorum 1980]
gi|154702402|gb|EDO02141.1| hypothetical protein SS1G_04617 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 383
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 177/321 (55%), Gaps = 31/321 (9%)
Query: 62 TVSVAVPGSIIDN-----TQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSD 110
TVSVA+PGSII QS +L T +AG IARA+ +F +DEV+VFD+ K
Sbjct: 54 TVSVALPGSIISKYVASLAQSHDLKTSMAGHIARALAVFCVDEVIVFDDGNARSPKKPRQ 113
Query: 111 NYSRSSAANRSNRSDENESGAA----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP 166
+ + A S++ +G FL LL YLETP LRK LF MH +LR G LP
Sbjct: 114 SPCQPPAIQPKAPSEDKYTGTLDPDHFLTHLLSYLETPPNLRKYLFPMHENLRTAGTLPS 173
Query: 167 LDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM--- 223
+D PHHLR EW +REG TL + GT V+ GL V V + + RVT+
Sbjct: 174 IDLPHHLRADEWCMYREGTTLP--GADEHGTFVEAGLRIPVTVPEQIPENNRVTLKFDLE 231
Query: 224 ----GTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGT 279
+++ + V P +E G YWGY VR A +S VF C Y GGYD IGT
Sbjct: 232 AEKAAKDKSYEVIKAEAVRPEEPREEGGYYWGYNVRRAGCLSDVFTECGYDGGYDMTIGT 291
Query: 280 SEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYF 332
SE G D+ + + + F+HLL+ FGG+AGLE +++ D+ L + A++VF +
Sbjct: 292 SERGVDVQDLYDDKEQKVGKFKHLLVVFGGVAGLEVAVKNDEELHKLGVTEAKDVFDRWI 351
Query: 333 NTCPHQGSRTIRTEEAIFISL 353
N CP QGSRTIRTEEA +I L
Sbjct: 352 NVCPGQGSRTIRTEEAAWIGL 372
>gi|320169134|gb|EFW46033.1| hypothetical protein CAOG_04001 [Capsaspora owczarzaki ATCC 30864]
Length = 543
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 188/348 (54%), Gaps = 48/348 (13%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGS++ QS EL T +AGQIAR++++F +DE+V++++ S S S
Sbjct: 182 TVSVAIPGSVLHLAQSAELKTYMAGQIARSLSLFNVDEIVIYNDGSGGKQASSSIVTTEG 241
Query: 122 NRSDENESGAA----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
+ + FL R+LQYLETPQYLR+ LF MH L+F G+L P+++ HH ++ E
Sbjct: 242 EFEGAGKQATSNPDVFLARVLQYLETPQYLRRQLFPMHRDLKFAGLLAPIESAHHPKQDE 301
Query: 178 WAPFREGVTLK---------------------------ENAPNSVGTLVDVGLNKHVVVD 210
+ FREG+ +AP + VDVGL ++
Sbjct: 302 FVEFREGLVQSRSKGQRGNAHSHGPASAWDDQEGGEQHHSAPLPTSSFVDVGLRTEAKIN 361
Query: 211 QVLDPGVRVTVAMGT----NRNLDADSPRQ-------VVPPSKPKE-SGMYWGYKVRYAP 258
L PG RVTV N++ P VV PS P+E G+YWGY VR P
Sbjct: 362 VALQPGTRVTVQFPQWSVDNKDGHFSQPSHHRYPSGNVVTPSLPRELRGLYWGYSVRLVP 421
Query: 259 NISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLT-LPTFRHLLIAFGGLAGLEESIEE 315
+S +C + +G YD LIG + GD V+ D++ LP F+H LI FGG++GL +IE
Sbjct: 422 TLSRALVDCPFNERGRYDLLIGAGDEGDSVD--DMSELPEFKHALIVFGGVSGLPAAIEA 479
Query: 316 DDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
D+ ++ N R VF + +TCP +G R++R +E++ ISL + ++ A
Sbjct: 480 DESMEAHNPRAVFSKFLSTCPSRGCRSVRPDESLLISLAALRPHLAHA 527
>gi|343426289|emb|CBQ69820.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 455
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 146/349 (41%), Positives = 190/349 (54%), Gaps = 67/349 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----KSSSDNYSRSSA 117
T+SVA+PGSI+ N QS EL +RLA IARA IF +DE+VVFD D Y
Sbjct: 103 TLSVAIPGSIVLNAQSPELQSRLAAHIARACAIFNVDEIVVFDEGEVRTGEGDLYQHRH- 161
Query: 118 ANRSNRSDENESGA--------------------AFLVRLLQYLETPQYLRKALFSMHSS 157
N+ NR + A +L R+LQYLETPQYLRKALF MH
Sbjct: 162 -NQQNRGTKRRFLADGEQEQQQEGEGEGKGFDPHTYLARVLQYLETPQYLRKALFPMHRD 220
Query: 158 LRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSV-----GTLVDVGLNKHVVVD-- 210
LR G++PPLD PHHLR + + +REGVT+ +AP+ V+VGL +
Sbjct: 221 LRLAGLMPPLDCPHHLRFEDHSAYREGVTV--DAPHWATRARDSVYVNVGLRDPIEASLP 278
Query: 211 --QVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFKNC 267
+D G RVTV M L++ QVV P +P E G YWGY VR AP++SSV
Sbjct: 279 AGAHVDAGTRVTVKM----PLNSYDRGQVVSPREPVEHRGWYWGYSVRLAPSLSSVLTTN 334
Query: 268 SYKG-GYDHLIGTSEHGDIVNSSDLTL--------------------PTFRHLLIAFGGL 306
+ YD +IGTSE G V +DL L P F H L+ FGGL
Sbjct: 335 PFASEPYDLVIGTSERG--VPLTDLALATASATPVALASGETVQPLQPRFAHALLVFGGL 392
Query: 307 AGLEESIEEDDGLK--RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+GLE ++E+D+G++ R A+E+F ++ N +QGSRT+RTEEAI I+L
Sbjct: 393 SGLEVAVEQDEGIRLDRHTAKELFDAWVNVVENQGSRTVRTEEAIMIAL 441
>gi|225555276|gb|EEH03568.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 380
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 186/338 (55%), Gaps = 54/338 (15%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSII N QS + T LAG IARA+ +F +DEVV+FD+ + R N +
Sbjct: 41 TLSIALPGSIIANAQSHDQKTFLAGSIARALAVFCVDEVVIFDDDA------RHIHLNGN 94
Query: 122 NRSDENESGAA-----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
+ +E+E A FL +L YLETP YLRK LF MH +LR G LP LD PHH+R +
Sbjct: 95 HNINEDEYTAYSDPSHFLAHVLSYLETPPYLRKYLFPMHKNLRTAGTLPSLDMPHHIRAN 154
Query: 177 EWAPFREGVTLK--------------------------------ENAPNSVGTLVDVGLN 204
EW +REGVTL +++P S T V+ GL
Sbjct: 155 EWCEYREGVTLSAAEEQGEVGMAGEHDNGRHGKKKGKRNKDNTGKDSPTSY-TSVNTGLP 213
Query: 205 KHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISS 262
+ V V + P RVTV G+ ++D +VV P P+ E+G YWGY VR ++S+
Sbjct: 214 EKVRVPYIPTPENTRVTVKFGS--SIDPSDGAEVVSPFAPREETGYYWGYSVRRCASLSA 271
Query: 263 VFKNCSYKGGYDHLIGTSEHG-DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
VF C + GGYD GTSE G + + +P F HL++ FGG+AGLE ++ D L+
Sbjct: 272 VFTECPFDGGYDLSFGTSERGCPLSEAVAGEVPKFEHLIVVFGGVAGLEAAVTADKDLRD 331
Query: 322 K-----NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
K +A VF + N P QGSRTIRTEEA+++ L
Sbjct: 332 KGLGPSDAERVFDYWVNVLPGQGSRTIRTEEAVWLGLM 369
>gi|209879415|ref|XP_002141148.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209556754|gb|EEA06799.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 397
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/330 (40%), Positives = 183/330 (55%), Gaps = 43/330 (13%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS--DNYSRSSAAN 119
TVSVA+P SII N QSLEL L GQIAR + IF IDE+VV+++K + DN R S
Sbjct: 55 TVSVALPSSIISNAQSLELKAYLVGQIARILAIFGIDEIVVYEDKYKNIVDNSERQSDYI 114
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
N S E F V+ L+YLETPQYLRK+LF + L+F G+ P+DAPHH+R EW
Sbjct: 115 GFNCSRSME----FFVKNLRYLETPQYLRKSLFKFEADLKFAGLQNPIDAPHHMRISEWL 170
Query: 180 PFREGVT---------LKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM------- 223
P+R+GV L ++ G+ V+ GL +D+ L R+T+ M
Sbjct: 171 PYRQGVVVSKPRLFKGLIKSEHKKSGSWVNCGLPVEAWIDESLPDNTRITIKMIDNSENL 230
Query: 224 -----------GTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKG 271
G N + +VV P+ + G+YWGYKVR A ++ SV N YK
Sbjct: 231 HKKLCDEFSQKGKNEDSKTYFIGKVVSDETPRTKKGIYWGYKVRPANSLKSVLFNSPYKK 290
Query: 272 GYDHLIGTSEHGDIVNSSDLTL------PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAR 325
GYD +IGTSE G + + + L +F+HLLI FGGL GLE+ + + L + +
Sbjct: 291 GYDLVIGTSERGCCI-TDNFKLEKSPDKKSFKHLLIVFGGLGGLEDVLGDPQSLFNSDTK 349
Query: 326 --EVFHSYFNTCPHQGSRTIRTEEAIFISL 353
++F Y N CP Q SRTIRTEEA+ ++L
Sbjct: 350 PEDLFDMYVNICPEQKSRTIRTEEALGLTL 379
>gi|222628479|gb|EEE60611.1| hypothetical protein OsJ_14024 [Oryza sativa Japonica Group]
Length = 175
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 104/163 (63%), Positives = 129/163 (79%), Gaps = 1/163 (0%)
Query: 202 GLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNI 260
G +V+V+Q+L+PG RVT+AMGTNR++ R++V PS P+ E +YWGYKVRYA N+
Sbjct: 12 GKRMNVLVEQMLEPGKRVTIAMGTNRDITTACKRKIVSPSSPRDEMELYWGYKVRYASNL 71
Query: 261 SSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLK 320
VF + YK GYD++IGTSEHG I++SS+L LP+FRHLLIAFGGLAGLEE IEED LK
Sbjct: 72 GGVFSDSPYKEGYDYIIGTSEHGKIISSSELILPSFRHLLIAFGGLAGLEECIEEDRNLK 131
Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
K +VF++Y NTCP QGSRTIRTEEA+ ISLQYFQ+PI RA
Sbjct: 132 GKGVDDVFNTYLNTCPSQGSRTIRTEEALLISLQYFQDPIRRA 174
>gi|85074739|ref|XP_965737.1| hypothetical protein NCU00597 [Neurospora crassa OR74A]
gi|28927550|gb|EAA36501.1| conserved hypothetical protein [Neurospora crassa OR74A]
gi|38567310|emb|CAE76598.1| conserved hypothetical protein [Neurospora crassa]
gi|336465334|gb|EGO53574.1| hypothetical protein NEUTE1DRAFT_133922 [Neurospora tetrasperma
FGSC 2508]
Length = 349
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 126/299 (42%), Positives = 173/299 (57%), Gaps = 14/299 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVAVP S+I + + E T AG+IARA +F +DE+V++D+ S SR ++ +
Sbjct: 48 TISVAVPTSVITDCVTREQRTTWAGRIARACAVFSVDEIVIYDD---SPVESRPASTDHE 104
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + + A FL LL YLETP ++RK LF MH +LR G+LP LD PHH K EW P+
Sbjct: 105 GYTGDVDP-AHFLEHLLNYLETPPFMRKLLFPMHPNLRSQGLLPNLDMPHHPYKGEWMPY 163
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG+TL+ GT+VD+GL V +D+ L P RVT+ M + SP V P
Sbjct: 164 REGLTLEAQPKGGKGTVVDIGLEDTVTIDEELPPKTRVTLKMPVG---EYGSPEPVHPAV 220
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--DLTLPTFRHL 299
E G +WGY VR A +S VF Y+ GYD IGTSE G ++ + + F HL
Sbjct: 221 PRTEGGYFWGYTVRKAATLSDVFTQSPYEEGYDLSIGTSERGQSLHRAFPEHKEANFTHL 280
Query: 300 LIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L+AFGG GLE + D G+ +E+F + N P+QGSR IRT+EA+ I+L
Sbjct: 281 LVAFGGPRGLEYAAMNDPELSKIGIHGPRTKELFEHWVNVLPNQGSRGIRTDEAMLIAL 339
>gi|336275401|ref|XP_003352453.1| hypothetical protein SMAC_01287 [Sordaria macrospora k-hell]
gi|380094341|emb|CCC07720.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 349
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 125/299 (41%), Positives = 174/299 (58%), Gaps = 14/299 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVAVP S+I + + E T AG+IARA +F +DE+V++D+ S SR ++ +
Sbjct: 48 TISVAVPTSVITDCVTREQRTTWAGRIARACAVFSVDEIVIYDD---SPVESRPASTDHE 104
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + + A FL LL YLETP ++RK LF MH +LR G+LP LD PHH K EW P+
Sbjct: 105 GYTGDVDP-AHFLEHLLNYLETPPFMRKLLFPMHPNLRSQGLLPNLDMPHHPYKGEWMPY 163
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG+TL+ GT+VD+GL + V +D+ L P RVT+ M + SP + P
Sbjct: 164 REGLTLEAQPKGGKGTVVDIGLEETVTIDEELPPKTRVTLKMPVG---EYGSPEPIHPAV 220
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--DLTLPTFRHL 299
E G +WGY VR A +S VF Y+ GYD IGTSE G ++ + + F HL
Sbjct: 221 PRTEGGYFWGYTVRKAATLSDVFTQSPYEEGYDLSIGTSERGQSLHRAFPEHKEANFTHL 280
Query: 300 LIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L+AFGG GLE + D G+ +E+F + N P+QGSR IRT+EA+ I+L
Sbjct: 281 LVAFGGPRGLEYAAMNDPELSKIGIHGPRTKELFEHWVNVLPNQGSRGIRTDEAMLIAL 339
>gi|350295631|gb|EGZ76608.1| DUF171-domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 349
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 126/299 (42%), Positives = 173/299 (57%), Gaps = 14/299 (4%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVAVP S+I + + E T AG+IARA +F +DE+V++D+ S SR ++ +
Sbjct: 48 TISVAVPTSVITDCVTREQRTTWAGRIARACAVFSVDEIVIYDD---SPVESRPASTDHE 104
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + + A FL LL YLETP ++RK LF MH +LR G+LP LD PHH K EW P+
Sbjct: 105 GYTGDVDP-AHFLEHLLNYLETPPFMRKLLFPMHPNLRSQGLLPNLDMPHHPYKGEWMPY 163
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG+TL+ GT+VD+GL V +D+ L P RVT+ M + SP V P
Sbjct: 164 REGLTLEAQPKGGKGTVVDIGLEDTVTIDEELPPKTRVTLKMPVG---EYGSPEPVHPAV 220
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--DLTLPTFRHL 299
E G +WGY VR A +S VF Y+ GYD IGTSE G ++ + + F HL
Sbjct: 221 PRTEGGYFWGYTVRKAATLSDVFTQSPYEEGYDLSIGTSERGQSLHRAFPEHKEANFTHL 280
Query: 300 LIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L+AFGG GLE + D G+ +E+F + N P+QGSR IRT+EA+ I+L
Sbjct: 281 LVAFGGPRGLEYAAMNDPELSKIGIHGPRTKELFEHWVNVLPNQGSRGIRTDEAMLIAL 339
>gi|358057838|dbj|GAA96340.1| hypothetical protein E5Q_03006 [Mixia osmundae IAM 14324]
Length = 526
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 188/312 (60%), Gaps = 28/312 (8%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKS---SSDN---YSRS 115
T+S+A+PGSI+ N Q+ EL T LAGQIAR+ IF +DE++VF + SS+N R
Sbjct: 213 TLSIALPGSILLNAQTPELRTILAGQIARSCAIFNVDEIIVFHEPTFSPSSENGHEQKRM 272
Query: 116 SAANRSNRSDENESGAA---FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH 172
R +E++S A LV++L YLETPQYLRKAL MHS+LR G+LPPLD PHH
Sbjct: 273 RGKYRQTAEEESQSTAQADEMLVKILTYLETPQYLRKALIPMHSALRQAGILPPLDMPHH 332
Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRN 228
LR+ + +PFRE V L + + +D+GL V + + G RVT A TN
Sbjct: 333 LRRDDPSPFREAVVLSHDD-DGRSVQIDIGLRDPVRARTKSSALPERGTRVT-AHYTNAE 390
Query: 229 LDADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGG---YDHLIGTSEHGD 284
+V P P+E G+ WGY+VR+ IS +F + ++ YD +IGTSE G
Sbjct: 391 WS------LVSPLLPRELLGISWGYRVRHVSQISRIFLDPAHGQPNEEYDLVIGTSERGQ 444
Query: 285 IVNSS-DLTLPTFRHLLIAFGGLAGLEESI--EEDDGLKRKNAREVFHSYFNTCPHQGSR 341
+ S+ D P +H+LI GGLAGLE +I +ED GL + A ++F + N CPHQGSR
Sbjct: 445 PLQSALDSFPPHPKHVLIVLGGLAGLELTIGADEDTGLTAETALDLFDHWINLCPHQGSR 504
Query: 342 TIRTEEAIFISL 353
TIR+EEA+ I+L
Sbjct: 505 TIRSEEALPIAL 516
>gi|326426771|gb|EGD72341.1| hypothetical protein PTSG_00362 [Salpingoeca sp. ATCC 50818]
Length = 484
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 184/322 (57%), Gaps = 18/322 (5%)
Query: 36 SSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIF 95
SS+ +++ +K D P TVS+A+PGS+I QS E+ + LAGQIARA +F
Sbjct: 150 SSTARDERLPSKKAKVKQDHGPGRHYTVSIAIPGSLIAGVQSQEVRSHLAGQIARAAVLF 209
Query: 96 RIDEVVVFDNKSSSDNYSRSSAA----NRSNRSDENESGAAFLVRLLQYLETPQYLRKAL 151
++DE+VV+ + S + + + N AFL R+L+YL+ P +LRK
Sbjct: 210 QVDEIVVYRESTISLDNTVTEGIPVLHNEHGTGKMRGDPCAFLARVLEYLDCPPFLRKHF 269
Query: 152 FSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ 211
F +H L+F +LPPL+APHH +PFREGVTLK+ G+ VD+GL + +D+
Sbjct: 270 FGIHRDLKFASLLPPLEAPHHTVADAPSPFREGVTLKKYVKPGKGSFVDIGLTRPAKIDR 329
Query: 212 VLDPGVRVTVAMGTNR--NLDADSPR-QVVPPSKPKES-GMYWGYKVRYAPNISSVFKNC 267
L PGVRVTVA+ + + R + V PS P+E GMYWGY VR A + + V C
Sbjct: 330 HLAPGVRVTVALDVTQYHKKQVHTFRGKAVSPSAPREQKGMYWGYSVRVAESAADVLSEC 389
Query: 268 SYKGGYDHLIG-TSEHGDIVNSSDLTLP-TFRHLLIAFGGLAGLEESIEEDDGLKRKNAR 325
+K GYD +G T++ G + +T+P +F+H L+ FGG GL+E +E R+ +
Sbjct: 390 PWKTGYDLCMGITTKRGKAPEA--VTMPKSFKHALVIFGGAEGLDEFVE-----SRQAQK 442
Query: 326 EVFHSYFNTCPHQGSRTIRTEE 347
V H NTC H G+R IRTEE
Sbjct: 443 RVLHR-INTCKHAGTRNIRTEE 463
>gi|261193745|ref|XP_002623278.1| hypothetical protein BDBG_06717 [Ajellomyces dermatitidis SLH14081]
gi|239588883|gb|EEQ71526.1| hypothetical protein BDBG_06717 [Ajellomyces dermatitidis SLH14081]
gi|239613796|gb|EEQ90783.1| hypothetical protein BDCG_05903 [Ajellomyces dermatitidis ER-3]
Length = 380
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 136/337 (40%), Positives = 186/337 (55%), Gaps = 54/337 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN---KSSSDNYSRSSAA 118
T+S+A+PGSII N QS + T LAG IARA+ +F +DEVV+FD+ ++ S Y ++
Sbjct: 41 TLSIALPGSIIANAQSHDQKTFLAGSIARALAVFCVDEVVIFDDDARQTRSKPYHNANED 100
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
+ SD + FL +L YLETP YLRK LF MH +LR G LP LD PHH+R +EW
Sbjct: 101 EYTAYSDPSH----FLAHVLSYLETPPYLRKYLFPMHRNLRTAGTLPSLDMPHHIRANEW 156
Query: 179 APFREGVTL---KENA----------------------------PNSVGTLVDVGLNKHV 207
+REGVT+ +E A ++ TLV+ GL +
Sbjct: 157 CEYREGVTVSASEEQADARTAREYGNTNHERKKGKKNKEKKGQGSSTSYTLVNTGLPEKS 216
Query: 208 VVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFK 265
V + P RVTV G ++ D +VV P P+E G YWGY VR ++S+VF
Sbjct: 217 RVPHIPAPENTRVTVKFGASK--DPSEVAEVVSPVTPREEMGYYWGYSVRRCSSLSTVFT 274
Query: 266 NCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED----- 316
C ++GGYD GTSE G D++ +P HL+I FGG+AGLE +++ D
Sbjct: 275 ECPFEGGYDLSFGTSERGAPLSDVIAGH---VPKSEHLIIVFGGVAGLEAAVKADKELQE 331
Query: 317 DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
GLK A ++F + N P QGSRTIRTEEA+++ L
Sbjct: 332 KGLKPSEAEKMFDYWINVVPGQGSRTIRTEEAVWLGL 368
>gi|240275221|gb|EER38736.1| hypothetical protein HCDG_07605 [Ajellomyces capsulatus H143]
gi|325094582|gb|EGC47892.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 380
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 187/338 (55%), Gaps = 54/338 (15%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSII N QS + + LAG IARA+ +F +DEVV+FD+ + R N +
Sbjct: 41 TLSIALPGSIIANAQSHDQKSFLAGSIARALAVFCVDEVVIFDDDA------RHIHLNGN 94
Query: 122 NRSDENESGAA-----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
+ +E+E A FL +L YLETP YLRK LF MH +LR G LP LD PHH+R +
Sbjct: 95 HNINEDEYTAYSDPSHFLAHVLSYLETPPYLRKYLFPMHKNLRTAGTLPSLDMPHHIRAN 154
Query: 177 EWAPFREGVTLK--------------------------------ENAPNSVGTLVDVGLN 204
EW +REGVTL +++P S TLV+ GL
Sbjct: 155 EWCEYREGVTLSAAEEQGEVGMVGEHDNGRHGKKKGKRNKDNMGKDSPTSY-TLVNTGLP 213
Query: 205 KHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISS 262
+ V V + P RVTV G+ ++D +VV P P+ E+G YWGY VR ++S+
Sbjct: 214 EKVRVPYIPTPENTRVTVKFGS--SIDPSDGAEVVSPFAPREETGYYWGYSVRRCASLSA 271
Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNSSDL-TLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
VF C + GGYD GTSE G ++ + + F HL++ FGG+AGLE ++ D L+
Sbjct: 272 VFTECPFDGGYDLSFGTSERGCPLSEAIAGEVLKFEHLIVVFGGVAGLEAAVTADKNLRD 331
Query: 322 K-----NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
K +A VF + N P QGSRTIRTEEA+++ L
Sbjct: 332 KGLGASDAERVFDYWVNVLPGQGSRTIRTEEAVWLGLM 369
>gi|401404268|ref|XP_003881686.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325116099|emb|CBZ51653.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 821
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 132/347 (38%), Positives = 185/347 (53%), Gaps = 59/347 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+P SIIDN Q+ EL L GQIAR +T+F +DE++V+++ ++ A +R
Sbjct: 471 TLSVALPASIIDNAQTAELRAALVGQIARTLTVFGVDEIIVYEDVAA--------AISRG 522
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
N D + F VR L+YLETPQ+LRK+LF +HS LRF G+ PLDAPHHLR++EW P+
Sbjct: 523 NAEDGHSRALEFFVRNLRYLETPQFLRKSLFPIHSDLRFAGLQNPLDAPHHLRRNEWLPY 582
Query: 182 REGVTL---------------------------------KENAPNSVGTLVDVGLNKHVV 208
REGV + K+ N G V+ GL V
Sbjct: 583 REGVVVASSKTRQERQHAKATECDRAKGPGDDVKLTAAEKKRLKNKQGAWVECGLPALVW 642
Query: 209 V-DQVLDPGVRVTVAMGTN-RNL--------DADSPRQ----VVPPSKPKESGMYWGYKV 254
+ + L+ G+RVTV + + R L DA+ P V P P ++G+YWGY+V
Sbjct: 643 IPNSRLEDGMRVTVRLDASVRQLQRQPQERGDAEPPLMRGVAVSPDEPPVKAGLYWGYRV 702
Query: 255 RYAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEES 312
R A + VF +C + G YD +GTSE G V +RH+L+ FGGL GLE
Sbjct: 703 RIAQHFQDVFSSCPFSADGRYDLTVGTSERGACVGRGFEFPRNYRHMLLVFGGLQGLEAV 762
Query: 313 I--EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
+ + + ++ +F Y NTC Q SRTIR EEA+ I+L +
Sbjct: 763 LLDRQSNCAPCRDPSTLFDMYLNTCAFQRSRTIRAEEAVPITLALLR 809
>gi|449302535|gb|EMC98544.1| hypothetical protein BAUCODRAFT_23290 [Baudoinia compniacensis UAMH
10762]
Length = 382
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 133/302 (44%), Positives = 168/302 (55%), Gaps = 46/302 (15%)
Query: 79 ELATRLAGQIARAVTIFRIDEVVVFDNKS------SSDNYSRSSAANRSNRSDENESGAA 132
EL T LAGQIARA +F +DEVVVFD+ + D Y+ +D N
Sbjct: 86 ELKTVLAGQIARACAVFCVDEVVVFDDGAVKARPPDEDGYT--------GYTDPNN---- 133
Query: 133 FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLK--EN 190
FLV++L YLETP +LR+ L H+ + G+ P LD PHHLR EW FREGV L EN
Sbjct: 134 FLVQMLTYLETPPHLRRTLIPKHADFQCAGVAPSLDMPHHLRSDEWCIFREGVALHPGEN 193
Query: 191 APNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR---------QVVPPS 241
+ GTLVD GL + V V L+ RVTV + D P+ + V P
Sbjct: 194 RDGNPGTLVDCGLQQKVFVGVPLEERSRVTVKL-------PDQPQAGRQDMLHGEAVSPD 246
Query: 242 KPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV------NSSDLTLP 294
P+ E+G YWGY VR A ++ +VF C Y GGYD IGTSE G V +S P
Sbjct: 247 TPREEAGYYWGYSVRQASSLGAVFTECQYDGGYDISIGTSERGMSVQSVLSADSVSCVQP 306
Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKN---AREVFHSYFNTCPHQGSRTIRTEEAIFI 351
T++HLLI FGG+AGLE ++ D L+ A+EVF + N P QGSRTIRTEEA +I
Sbjct: 307 TWQHLLIVFGGVAGLEAALSADAELQAAGVSEAKEVFDRWLNVLPQQGSRTIRTEEATWI 366
Query: 352 SL 353
+L
Sbjct: 367 AL 368
>gi|327350021|gb|EGE78878.1| deoxyribose-phosphate aldolase [Ajellomyces dermatitidis ATCC
18188]
Length = 380
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 135/337 (40%), Positives = 186/337 (55%), Gaps = 54/337 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN---KSSSDNYSRSSAA 118
T+S+A+PGSII N QS + T LAG IARA+ +F +DEVV+FD+ ++ S Y ++
Sbjct: 41 TLSIALPGSIIANAQSHDQKTFLAGSIARALAVFCVDEVVIFDDDARQTRSKPYHNANED 100
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
+ SD + FL +L YLETP YLRK LF MH +LR G LP LD PHH+R +EW
Sbjct: 101 EYTAYSDPSH----FLAHVLSYLETPPYLRKYLFPMHRNLRTAGTLPSLDMPHHIRANEW 156
Query: 179 APFREGVTL---KENA----------------------------PNSVGTLVDVGLNKHV 207
+REGVT+ +E A ++ TLV+ GL +
Sbjct: 157 CEYREGVTVSASEEQADARTAREYGNTNHERKKGKKNKEKKGQGSSTSYTLVNTGLPEKS 216
Query: 208 VVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFK 265
V + P RVTV G ++ D +VV P P+E G YWGY VR ++S+VF
Sbjct: 217 RVPHIPAPENTRVTVKFGASK--DPSEVAEVVSPVTPREEMGYYWGYSVRRCSSLSTVFT 274
Query: 266 NCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED----- 316
C ++GGYD GTSE G D++ +P HL++ FGG+AGLE +++ D
Sbjct: 275 ECPFEGGYDLSFGTSERGAPLSDVIAGH---VPKSEHLILVFGGVAGLEAAVKADKELQE 331
Query: 317 DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
GLK A ++F + N P QGSRTIRTEEA+++ L
Sbjct: 332 KGLKPSEAEKMFDYWINMVPGQGSRTIRTEEAVWLGL 368
>gi|226294556|gb|EEH49976.1| deoxyribose-phosphate aldolase [Paracoccidioides brasiliensis Pb18]
Length = 380
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 137/335 (40%), Positives = 182/335 (54%), Gaps = 50/335 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSII N QS + T LAG IARA+ +F +DEVV+F++ D R S A+
Sbjct: 41 TLSVALPGSIIANAQSHDQKTFLAGSIARALAVFCVDEVVIFED----DVRQRQSNAHYD 96
Query: 122 NRSDENES---GAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
DE + + FL +L YLETP YLRK LF MH +LR G LP LD PHH+R +E
Sbjct: 97 LSKDEYTAYSDPSHFLAHVLSYLETPPYLRKFLFPMHKNLRTAGTLPSLDMPHHIRANES 156
Query: 179 APFREGVTLKENAPNSVG-------------------------------TLVDVGLNKHV 207
+REGVT+ N + G TLVD GL V
Sbjct: 157 CEYREGVTVPANEEHGFGQSNAENHHNQHKRKKGKKNMEEGVQDTVMSYTLVDTGLPNKV 216
Query: 208 VVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFK 265
+ + P RVTV + ++ D +VV PS P+E G YWGY VR ++S VF
Sbjct: 217 CIPHIPVPENTRVTVKLCSSE--DPSQGAEVVSPSTPREEMGYYWGYSVRCCDSLSDVFT 274
Query: 266 NCSYKGGYDHLIGTSEHGDIVNSSDLT--LPTFRHLLIAFGGLAGLEESIEEDD-----G 318
C + GGYD GTSE G + S +T +P F HL++ FGG+AGLE +++ D G
Sbjct: 275 ECPFDGGYDLSFGTSERGSPL-SQVITEGVPNFEHLIVVFGGVAGLEVAVKADKELLARG 333
Query: 319 LKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L+ + +F + N P QGSRTIRTEEA+++ L
Sbjct: 334 LRPETVGNMFDYWVNVLPGQGSRTIRTEEAVWLGL 368
>gi|115402537|ref|XP_001217345.1| deoxyribose-phosphate aldolase 2 [Aspergillus terreus NIH2624]
gi|114189191|gb|EAU30891.1| deoxyribose-phosphate aldolase 2 [Aspergillus terreus NIH2624]
Length = 661
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 145/377 (38%), Positives = 194/377 (51%), Gaps = 72/377 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSS-SDNYSR------ 114
T+SVA+PGSI+ N S+E T LAG IARA+ +F +DEVV+FD++ S NY
Sbjct: 285 TLSVAIPGSIVANAHSIEQKTFLAGIIARALAVFCVDEVVIFDDEEHRSRNYRHYDDGYD 344
Query: 115 SSAANRSNRSDENESG-------AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
S A S+ + + G + FL +L YLETP YLRK LF MH +LR G+LP L
Sbjct: 345 SPIAKTSDELNASTKGYTAYSDPSHFLAHVLSYLETPPYLRKHLFPMHPNLRGAGLLPSL 404
Query: 168 DAPHHLRKHEWAPFREGVTLKEN--------------APN-------------------- 193
D PHHLR HEW +REG+ L +PN
Sbjct: 405 DMPHHLRAHEWCDYREGIVLSSTPRDGRGRSGSQMSLSPNRHSYRNLSPSSNSSQASTNP 464
Query: 194 SVGTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWG 251
+ T+VD GL++ VV+ + P RVTV + + D + V PS P+ E+G YWG
Sbjct: 465 TTATVVDTGLSEKVVLPDIQLPEHARVTVRFSHHGH---DHYAEPVHPSAPRAEAGYYWG 521
Query: 252 YKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-----------SDLTLPTFRHLL 300
Y VR ++SSVF C + GGYD GTSE G V P ++HLL
Sbjct: 522 YYVRRCRSLSSVFTECPFDGGYDLSFGTSERGAPVGEVLEDEAPYHRPPHTHRPDYKHLL 581
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAR-----EVFHSYFNTCPHQGSRTIRTEEAIFI---S 352
+ FGG+AG+E +I D L+ + R ++F + N P QGSRTIRTEEA+++ S
Sbjct: 582 VVFGGVAGIEAAIRNDPQLRDMDIRPTDAGKLFDYWINLLPGQGSRTIRTEEAVWLGLTS 641
Query: 353 LQYFQEPISRALRRVEY 369
L+ E RA R Y
Sbjct: 642 LRGLVEGTHRAKRSYRY 658
>gi|258573617|ref|XP_002540990.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237901256|gb|EEP75657.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 369
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 131/334 (39%), Positives = 182/334 (54%), Gaps = 48/334 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGSII N +S + T LAG IARA+ +F +DE+V+FD++ + + +
Sbjct: 40 TLSIALPGSIIANAKSHDQKTYLAGYIARALAVFCVDEIVIFDDEPRAQGH-------EN 92
Query: 122 NRSDENESGA-----AFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
+ ENE A FL +L YLETP +LR+ LF MH +LR G LP LD PHHLR +
Sbjct: 93 KQPFENEYTAFSDPSHFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHLRAN 152
Query: 177 EWAPFREGVTLK----------------------ENAPNSVGTLVDVGLNKHVVVDQV-L 213
EW +REGVT+ EN ++ TL + G + V + + +
Sbjct: 153 EWCEYREGVTIPGENYGSEESTSNKKKNKRSNKGENGSETL-TLANAGFSHPVRISNLSI 211
Query: 214 DPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGG 272
P R+T+ T D P V PS P+ E+G YWGY +R ++SSV C + GG
Sbjct: 212 PPHTRLTLKFKTENPSDGADP---VAPSAPREEAGYYWGYSIRRCSSLSSVLTECPFDGG 268
Query: 273 YDHLIGTSEHGD---IVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK-----NA 324
YD GTSE G V +P +RHLL+ FGG+AGLE +++ D L K N
Sbjct: 269 YDLSFGTSERGTPLATVLQCPEEIPQYRHLLLVFGGVAGLETAVKADKELADKGIGPGNV 328
Query: 325 REVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
E+F + N P QGSRTIRTEEA+++ L +E
Sbjct: 329 GELFDYWVNVLPGQGSRTIRTEEAVWLGLMGLRE 362
>gi|134079630|emb|CAK40846.1| unnamed protein product [Aspergillus niger]
Length = 843
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 142/367 (38%), Positives = 190/367 (51%), Gaps = 76/367 (20%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK---SSSDNYSR---- 114
T+SVA+PGSIIDN S+E T LAG IARA+ +F +DEVVVFD+ + D Y
Sbjct: 463 TLSVAIPGSIIDNAHSVEQKTFLAGVIARALAVFCVDEVVVFDDNEYGAHCDYYDHRVSY 522
Query: 115 -SSAANRSNRSD----------ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
S A ++ D N + FL +L YLETP YLRK LF +H +LR G+
Sbjct: 523 DSPIAKTRDQVDGDTSSPKAYTANSDPSHFLTHVLSYLETPPYLRKHLFRIHPNLRTAGL 582
Query: 164 LPPLDAPHHLRKHEWAPFREGVTLKENA-------------------------PNSVGTL 198
LP LD PHHLR +EW FREG+ + +A N T+
Sbjct: 583 LPSLDMPHHLRANEWCDFREGIVVPSSARKSNPASKITQMSKDHPFHRRNSSTNNGPATI 642
Query: 199 VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSP-RQVVPPSKPK-ESGMYWGYKVR 255
VD GL++ +V+ + P RVTV R L + P + V PS P+ E+G YWGY VR
Sbjct: 643 VDTGLSQKIVLPNIHLPEHARVTV-----RFLQHEFPYAEPVHPSTPRAEAGYYWGYYVR 697
Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS--------------------SDLTLPT 295
++SSVF C Y GGYD GTSE G V++ D P
Sbjct: 698 RCRSLSSVFTECPYDGGYDLSFGTSERGVPVSAVLEEKPQQAHDRYKQGHSFCRDQISPD 757
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKR-----KNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
F+H+L+ FGG+AG+E ++ D L+ +A ++F + N P QGSRTIRTEEA++
Sbjct: 758 FQHILVVFGGVAGIEAAVRNDPQLRDMAIRPSDAGKLFDYWVNFLPGQGSRTIRTEEAVW 817
Query: 351 ISLQYFQ 357
+ L Q
Sbjct: 818 MGLTSLQ 824
>gi|317033146|ref|XP_001394931.2| deoxyribose-phosphate aldolase [Aspergillus niger CBS 513.88]
Length = 456
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 142/367 (38%), Positives = 190/367 (51%), Gaps = 76/367 (20%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK---SSSDNYSR---- 114
T+SVA+PGSIIDN S+E T LAG IARA+ +F +DEVVVFD+ + D Y
Sbjct: 76 TLSVAIPGSIIDNAHSVEQKTFLAGVIARALAVFCVDEVVVFDDNEYGAHCDYYDHRVSY 135
Query: 115 -SSAANRSNRSD----------ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
S A ++ D N + FL +L YLETP YLRK LF +H +LR G+
Sbjct: 136 DSPIAKTRDQVDGDTSSPKAYTANSDPSHFLTHVLSYLETPPYLRKHLFRIHPNLRTAGL 195
Query: 164 LPPLDAPHHLRKHEWAPFREGVTLKENA-------------------------PNSVGTL 198
LP LD PHHLR +EW FREG+ + +A N T+
Sbjct: 196 LPSLDMPHHLRANEWCDFREGIVVPSSARKSNPASKITQMSKDHPFHRRNSSTNNGPATI 255
Query: 199 VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWGYKVR 255
VD GL++ +V+ + P RVTV R L + P + V PS P+ E+G YWGY VR
Sbjct: 256 VDTGLSQKIVLPNIHLPEHARVTV-----RFLQHEFPYAEPVHPSTPRAEAGYYWGYYVR 310
Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS--------------------SDLTLPT 295
++SSVF C Y GGYD GTSE G V++ D P
Sbjct: 311 RCRSLSSVFTECPYDGGYDLSFGTSERGVPVSAVLEEKPQQAHDRYKQGHSFCRDQISPD 370
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKR-----KNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
F+H+L+ FGG+AG+E ++ D L+ +A ++F + N P QGSRTIRTEEA++
Sbjct: 371 FQHILVVFGGVAGIEAAVRNDPQLRDMAIRPSDAGKLFDYWVNFLPGQGSRTIRTEEAVW 430
Query: 351 ISLQYFQ 357
+ L Q
Sbjct: 431 MGLTSLQ 437
>gi|380494662|emb|CCF32981.1| hypothetical protein CH063_05257 [Colletotrichum higginsianum]
Length = 335
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 127/304 (41%), Positives = 180/304 (59%), Gaps = 27/304 (8%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+P SI+ + + E T +AG+IARA+ +F +DE+V+FD D+ S N
Sbjct: 37 TVSVAIPSSILTDAITAERRTAVAGRIARALAVFSVDEIVLFD-----DSPVESRPKNVD 91
Query: 122 NRSDENE-SGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
++ + G FL LL YLE P ++RK LF +H++LR LP L+ PHH +W P
Sbjct: 92 TKAYTGDVDGCHFLEHLLGYLEVPPFMRKMLFPLHANLRLANQLPALEMPHHPNPADWLP 151
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
+REGV K P GT++DVGL HV+++ + P RVT++ + N D + P
Sbjct: 152 YREGVASK-GVPQ--GTMIDVGLKNHVLIEDEVPPNTRVTLSY-PDYNTDK---AEACDP 204
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT---- 295
S P+ E G YWGY+VR A ++SSVF C Y+GGYD IGTSE G ++ + P+
Sbjct: 205 SAPREEGGYYWGYQVRRANSLSSVFTECKYEGGYDVSIGTSERGRSLSDA---FPSAAPL 261
Query: 296 -FRHLLIAFGGLAGLEESIEEDDGLK-----RKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
F+HLLI FGG G+E + D L+ ARE+F + + P QGSRTIRT+EAI
Sbjct: 262 DFKHLLIVFGGPRGIEYAALNDKELEGMEMTPAKARELFDHWVDVLPGQGSRTIRTDEAI 321
Query: 350 FISL 353
+++L
Sbjct: 322 WVAL 325
>gi|452836847|gb|EME38790.1| hypothetical protein DOTSEDRAFT_139652, partial [Dothistroma
septosporum NZE10]
Length = 331
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 129/305 (42%), Positives = 172/305 (56%), Gaps = 19/305 (6%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+A+PGS+I N + + T LAGQIARA +F +DEVVVFD D + A +
Sbjct: 20 TLSIALPGSVIANAVTHDQKTSLAGQIARACAVFCVDEVVVFD-----DGQAEIRAPEQG 74
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ ++ + FL +L YLETP LRK+LF +H LR G LP LD PHHLR EW F
Sbjct: 75 GYTAYSDP-SFFLYHVLTYLETPPNLRKSLFPIHPDLRTAGALPSLDMPHHLRSDEWCQF 133
Query: 182 REGVTLKENAPN-SVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADS--PRQVV 238
REG+T+ + GTLVD GL V + ++P RVTV + N + + V
Sbjct: 134 REGITIGPSRNGREGGTLVDCGLATKVYIPDEIEPQTRVTVQLPDNGLIHHAEIIIGEAV 193
Query: 239 PPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV------NSSDL 291
P P+E G YWGY VR A ++ SVF C + GGYD IGTSE G + S+
Sbjct: 194 SPDTPREDGGYYWGYAVRQASSLGSVFTECPFDGGYDVSIGTSERGRPLRSITEQGSATY 253
Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN---AREVFHSYFNTCPHQGSRTIRTEEA 348
PT+ H+++ GG++GLE + D L+ A +VF + N QGSRTIRTEEA
Sbjct: 254 VEPTWNHMIMVLGGVSGLEAGLRADKELQAAGVSEAAQVFDFWINLVLGQGSRTIRTEEA 313
Query: 349 IFISL 353
I+I L
Sbjct: 314 IWIGL 318
>gi|403355602|gb|EJY77383.1| hypothetical protein OXYTRI_00986 [Oxytricha trifallax]
Length = 395
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 120/299 (40%), Positives = 170/299 (56%), Gaps = 16/299 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+ VP SIIDN QSLEL T L GQIA+A F ++E++V+ N ++
Sbjct: 92 TISIVVPSSIIDNAQSLELKTYLVGQIAKACGHFGVNEIIVYSNDRQQ--------HMKN 143
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
D + F V+ L+Y+ETPQYLRKALF HS+LR+ G+ PL+ PHHL+ EW +
Sbjct: 144 LVGDTRTTTTEFFVKNLEYIETPQYLRKALFPRHSALRYTGLTNPLEGPHHLKITEWCAY 203
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ--VVP 239
REGV + + G+ V++GL K VD L+ G RVTV + +N D VV
Sbjct: 204 REGVVINRPTAHGKGSWVNIGLPKDCQVDIQLEEGTRVTVKLN-EKNFDNTRYYSGVVVS 262
Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSY-KGGYDHLIGTSEHGDIVNSSDL-TLPTF 296
++P E G++WGY VR A +F C + + GYD +G SE G + D F
Sbjct: 263 SNEPTEKLGLFWGYTVRVANKFEDIFDECPFEETGYDLKLGVSEQGQEIEHVDFRQHQNF 322
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKN--AREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+H LI FGG+ G+E IE+D+ K ++ F S N P +G+RTIRTEE+I +++
Sbjct: 323 KHALIFFGGMEGIEGIIEQDERTKMSTEAIKKKFDSLINPVPERGTRTIRTEESILVTM 381
>gi|393908513|gb|EJD75089.1| deoxyribose-phosphate aldolase [Loa loa]
Length = 365
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 23/318 (7%)
Query: 50 DQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKS-- 107
D +N+ +S+A+PGSI++N QS EL T LAGQIAR+ IF +DE++V+D +
Sbjct: 46 DNMNETVNGRFLMLSIALPGSILNNAQSPELRTYLAGQIARSAAIFCVDEIIVYDETARM 105
Query: 108 ---SSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGML 164
D+Y N L R+L+++E PQYLRK LF + +LR+ G+L
Sbjct: 106 KPQQRDDYCSGKWCPDLPVHSGNVECNFHLARILEFMECPQYLRKTLFPLQKTLRYAGIL 165
Query: 165 PPLDAPHHLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ--VLDPGVRVTV 221
PLD PHHLR + + P+REGV L + G + ++GL K V +++ VL G RVTV
Sbjct: 166 NPLDCPHHLRASDLSVPYREGVVLDKPVKAGRGPICNIGLYKEVQINEEVVLQAGTRVTV 225
Query: 222 AMGTNRNLDADSPRQ------VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH 275
+ LD S R+ V +E+G+YWGYKVR A ++ G YD
Sbjct: 226 KV-----LDIYSERKRYHGCLVNSQQIKRETGLYWGYKVRIALSLHDALN----AGEYDV 276
Query: 276 LIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTC 335
+IGTSE G V+ ++ L +LI FGGL GLE +IE D + E+F Y N
Sbjct: 277 IIGTSERGKPVSKFEMPLCERNRILIVFGGLEGLEAAIEADKNISCSTPEELFEHYLNVV 336
Query: 336 PHQGSRTIRTEEAIFISL 353
P QGSR IRTEEAI I+L
Sbjct: 337 PGQGSRIIRTEEAIPITL 354
>gi|444721244|gb|ELW61988.1| Kynurenine--oxoglutarate transaminase 1 [Tupaia chinensis]
Length = 982
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 113/235 (48%), Positives = 150/235 (63%), Gaps = 4/235 (1%)
Query: 134 LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPN 193
L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + FREGV +
Sbjct: 748 LARVLQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEFREGVVVDRPTRP 807
Query: 194 SVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWG 251
G+ V+ G+ K V +D+ L+PG+RVTV + + + + R +VV P+ ++G+YWG
Sbjct: 808 GHGSFVNCGMKKEVKIDKNLEPGLRVTVQLSQKQLPECKTYRGKVVSSQDPRTKAGLYWG 867
Query: 252 YKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEE 311
Y VR A +S+VF ++ GYD IGTSEHG V S+ LP+FRH L+ FGGL GLE
Sbjct: 868 YTVRLASCLSAVFAEAPFEDGYDLTIGTSEHGSDVGSAQ--LPSFRHALVVFGGLQGLEA 925
Query: 312 SIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
++ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q ++RA R
Sbjct: 926 GVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPGLNRAGAR 980
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/144 (47%), Positives = 92/144 (63%), Gaps = 4/144 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD + +++ +
Sbjct: 544 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQD---AKTVEGEFT 600
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ ++ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 601 GVGKKGQACVQ-LARVLQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 659
Query: 182 REGVTLKENAPNSVGTLVDVGLNK 205
REGV + G+ V+ G+ K
Sbjct: 660 REGVVVDRPTRPGHGSFVNCGMKK 683
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 37/45 (82%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK 106
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 484 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEE 528
>gi|324509698|gb|ADY44067.1| Unknown [Ascaris suum]
Length = 388
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 177/301 (58%), Gaps = 15/301 (4%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD------NYSRSS 116
+S+A+PGSI++N QS EL T LAGQIARA +F +DEVVVFD + +S S
Sbjct: 82 LSIALPGSIMNNAQSPELRTYLAGQIARAAAVFCVDEVVVFDETARMKPEELKAYHSGSW 141
Query: 117 AANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
A + R D E + R+L++LE PQYLRK LF + +L+ G+L PLD HHLR
Sbjct: 142 YAEKPIREDNVECNF-HMARILEFLECPQYLRKTLFPLQRALKNAGVLNPLDCMHHLRST 200
Query: 177 EWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLD-PGVRVTVAMGTNRNLDADSP 234
+ + P+REGV L + G L +VGL K + +D +D P T +LDA+
Sbjct: 201 DLSIPYREGVVLNKPVKKGRGPLCNVGLEKELQLDSEIDLPPGTRVTVRVTTTDLDANRL 260
Query: 235 RQVVPPSKP--KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT 292
R + ++ ESG+YWGY VR A ++S Y YD +IGTSE G+ ++ +L
Sbjct: 261 RGSLISARRVRDESGIYWGYNVRVASSLSDAI----YGAKYDIIIGTSERGEAIDKFELP 316
Query: 293 LPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
L +L+ FGGL GLE ++E D+ ++ + ++F Y N+ P QGSRTIRTEEAI I+
Sbjct: 317 LSEGSRILLVFGGLEGLEAAVEADESIRESDPAQLFKHYLNSVPDQGSRTIRTEEAIPIT 376
Query: 353 L 353
L
Sbjct: 377 L 377
>gi|119192166|ref|XP_001246689.1| hypothetical protein CIMG_00460 [Coccidioides immitis RS]
Length = 370
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/342 (38%), Positives = 185/342 (54%), Gaps = 51/342 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSII N +S + T L G IARA+ +F +DE+V+F+++ S + S A
Sbjct: 28 TLSVALPGSIIANAKSHDQKTYLVGSIARALAVFCVDEIVIFEDEPESQQFGNRSRAENE 87
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + S FL +L YLETP +LR+ LF MH +LR G LP LD PHH+R +EW +
Sbjct: 88 YTAFSDPS--HFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHIRANEWCEY 145
Query: 182 REGVTL------KENAP----------------------------NSVGTLVDVGLNKHV 207
REGVT+ E+ P N+ TLV+ G+ + V
Sbjct: 146 REGVTIPNDNYGAEDGPSERSNKKKKSKKSKRNQQGGQEHENSSSNNTFTLVNTGIPEPV 205
Query: 208 VVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFK 265
+ + + P R+T+ T D P V PS P+ E+G YWGY +R ++SSVF
Sbjct: 206 RIPNLSILPHTRLTLKFKTQDYSDGADP---VAPSTPREEAGYYWGYSIRRCTSLSSVFT 262
Query: 266 NCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
C ++GGYD GTSE G I+ + +P ++HLL+ FGG+AGLE + + D L
Sbjct: 263 ECPFEGGYDLSFGTSERGLPLAHILQHPE-KIPQYKHLLLVFGGVAGLETAAKADKELAA 321
Query: 322 KN-----AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
K E+F + N P QGSRTIRTEEA+++ L +E
Sbjct: 322 KGIGAGEVGELFDFWVNVLPGQGSRTIRTEEAVWLGLMGLRE 363
>gi|302895721|ref|XP_003046741.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256727668|gb|EEU41028.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 340
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/307 (38%), Positives = 168/307 (54%), Gaps = 30/307 (9%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+P S++ N + + G+IARA+ +F +DEVVVFD+ S ++ + +
Sbjct: 39 TVSIAIPSSVLTNLATADQRMTQPGRIARALAVFSVDEVVVFDDSPLSTRPRQTDPNSYT 98
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+D F +L YLE P ++RK LF +H +LR +LP LD PHH EW +
Sbjct: 99 GDTDP----CHFFTHILSYLEAPPFMRKTLFPLHPNLRLTALLPSLDMPHHPHPKEWVSY 154
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGVT+ GTLV+VGL++ V +++ + P R+T+ + + P V P +
Sbjct: 155 REGVTIAGQTSTGSGTLVEVGLDQPVEIEEEIPPKTRITLKFPDD---ETQYPEPVHPAA 211
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSE----------HGDIVNSSDL 291
E G YWGY VR A ++S VF C Y+ GYD IGTSE HG VN
Sbjct: 212 PRTEGGYYWGYTVRKANSLSGVFTECPYENGYDISIGTSERGVPASRAFPHGKRVN---- 267
Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTE 346
F HLLI FGG GLE + D+ L + +E+F + N P+QGSRTIRT+
Sbjct: 268 ----FNHLLIVFGGPRGLEFASMNDEELSSMDIQGSKTKELFDHWINVLPNQGSRTIRTD 323
Query: 347 EAIFISL 353
EA+FI+L
Sbjct: 324 EAVFIAL 330
>gi|392864074|gb|EAS35126.2| hypothetical protein CIMG_00460 [Coccidioides immitis RS]
Length = 382
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/342 (38%), Positives = 185/342 (54%), Gaps = 51/342 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSII N +S + T L G IARA+ +F +DE+V+F+++ S + S A
Sbjct: 40 TLSVALPGSIIANAKSHDQKTYLVGSIARALAVFCVDEIVIFEDEPESQQFGNRSRAENE 99
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + S FL +L YLETP +LR+ LF MH +LR G LP LD PHH+R +EW +
Sbjct: 100 YTAFSDPS--HFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHIRANEWCEY 157
Query: 182 REGVTL------KENAP----------------------------NSVGTLVDVGLNKHV 207
REGVT+ E+ P N+ TLV+ G+ + V
Sbjct: 158 REGVTIPNDNYGAEDGPSERSNKKKKSKKSKRNQQGGQEHENSSSNNTFTLVNTGIPEPV 217
Query: 208 VVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFK 265
+ + + P R+T+ T D P V PS P+ E+G YWGY +R ++SSVF
Sbjct: 218 RIPNLSILPHTRLTLKFKTQDYSDGADP---VAPSTPREEAGYYWGYSIRRCTSLSSVFT 274
Query: 266 NCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
C ++GGYD GTSE G I+ + +P ++HLL+ FGG+AGLE + + D L
Sbjct: 275 ECPFEGGYDLSFGTSERGLPLAHILQHPE-KIPQYKHLLLVFGGVAGLETAAKADKELAA 333
Query: 322 KN-----AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
K E+F + N P QGSRTIRTEEA+++ L +E
Sbjct: 334 KGIGAGEVGELFDFWVNVLPGQGSRTIRTEEAVWLGLMGLRE 375
>gi|66363284|ref|XP_628608.1| Mth1. SpoU superfamily - SPOUT methylase [Cryptosporidium parvum
Iowa II]
gi|46229613|gb|EAK90431.1| Mth1. SpoU superfamily - SPOUT methylase [Cryptosporidium parvum
Iowa II]
Length = 402
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/350 (37%), Positives = 191/350 (54%), Gaps = 53/350 (15%)
Query: 53 NDDAPIEVP--------TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD 104
NDD ++P T+SVA+P SI N QS EL L GQIAR +++F +DE+++++
Sbjct: 43 NDDILSKIPFAGSPSKATLSVAIPASIAGNAQSFELRAYLVGQIARIISVFGVDEIIIYE 102
Query: 105 NKSS--SDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
+K +DN +RSS + F V+ L+YLETPQYLRK LF + +F G
Sbjct: 103 DKCKDVTDNENRSS---KEWVEFACSKWMEFFVKNLKYLETPQYLRKTLFKFDNDFKFAG 159
Query: 163 MLPPLDAPHHLRKHEWAPFREGV----------TLKENAPNSVGTLVDVGLNKHVVVDQV 212
+ P+DAPHH+R EW P+R+GV T+ N G+ V+ GL ++
Sbjct: 160 LQNPIDAPHHMRISEWLPYRQGVIVPGPKLFKGTISPEHKNK-GSWVNCGLPVEAWIETK 218
Query: 213 LDPGVRVTVAMGT-----NRNLDADSPR-------------QVVPPSKP-KESGMYWGYK 253
L+ R+T+ M ++ L +D + ++V S P ++ G+YWGYK
Sbjct: 219 LENNTRITIKMSKESESLHKKLCSDFNKFGRYSEIKSYFTGKLVDDSTPYRKKGIYWGYK 278
Query: 254 VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--------DLTLPTFRHLLIAFGG 305
VR A ++ SVF + Y+ GYD IGTSE G+ V+++ D +F+H+LI FGG
Sbjct: 279 VRPANSLKSVFNDSEYEEGYDLKIGTSERGEPVDNNFKLSTKGKDSKSQSFKHILIVFGG 338
Query: 306 LAGLEESIEEDDGL--KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L GLE+ + + K KN +F Y N CP Q SRTIRTEEAI ++L
Sbjct: 339 LGGLEDVLSDPQCSLDKVKNPSSLFDMYINICPEQRSRTIRTEEAIGLTL 388
>gi|67601352|ref|XP_666391.1| DKFZp566D143.1 [Cryptosporidium hominis TU502]
gi|54657381|gb|EAL36163.1| DKFZp566D143.1 [Cryptosporidium hominis]
Length = 402
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 132/349 (37%), Positives = 193/349 (55%), Gaps = 51/349 (14%)
Query: 53 NDDAPIEVP--------TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD 104
NDD ++P T+SVA+P SI N QS EL L GQIAR +++F +DE+++++
Sbjct: 43 NDDILSKIPFAGSPSKSTLSVAIPASITGNAQSFELRAYLVGQIARIISVFGVDEIIIYE 102
Query: 105 NKSS--SDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
+K +DN +RSS + F V+ L+YLETPQYLRK LF + +F G
Sbjct: 103 DKCKDVTDNENRSSKEWVEFACSK---WMEFFVKNLKYLETPQYLRKTLFKFDNDFKFAG 159
Query: 163 MLPPLDAPHHLRKHEWAPFREGVT------LKENA-PN--SVGTLVDVGLNKHVVVDQVL 213
+ P+DAPHH+R EW P+R+GV K N P+ + G+ V+ GL ++ L
Sbjct: 160 LQNPIDAPHHMRISEWLPYRQGVIVPGPKFFKGNILPDHKNKGSWVNCGLPVEAWIETKL 219
Query: 214 DPGVRVTVAMGT-----NRNLDADSPR-------------QVVPPSKP-KESGMYWGYKV 254
+ R+T+ M ++ L +D + ++V S P ++ G+YWGYKV
Sbjct: 220 ENNTRITIKMSKESESLHKKLCSDFNKFGRYSEIKSYFTGKLVDDSTPYRKKGIYWGYKV 279
Query: 255 RYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--------DLTLPTFRHLLIAFGGL 306
R A ++ SVF + Y+ GYD IGTSE G+ V+++ D +F+H+LI FGGL
Sbjct: 280 RPANSLKSVFNDSEYEEGYDLKIGTSERGEPVDNNFKLSTKGKDSKSQSFKHILIVFGGL 339
Query: 307 AGLEESIEEDDGL--KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
GLE+ + + K KN +F Y N CP Q SRTIRTEEAI ++L
Sbjct: 340 GGLEDVLSDPQCSLDKVKNPSSLFDMYINICPEQRSRTIRTEEAIGLTL 388
>gi|310797700|gb|EFQ32593.1| hypothetical protein GLRG_07607 [Glomerella graminicola M1.001]
Length = 335
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 128/304 (42%), Positives = 180/304 (59%), Gaps = 27/304 (8%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+P SI+ + + E T +AG+IARA+ +F IDE+V+FD D+ S N
Sbjct: 37 TVSVAIPSSILTDAITAERRTAVAGRIARALAVFSIDEIVLFD-----DSPVESRPKNVD 91
Query: 122 NRSDENE-SGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+S + G FL LL YLE P ++RK LF +H +LR LP L+ PHH +W P
Sbjct: 92 TKSYTGDIDGCHFLEHLLGYLEVPPFMRKMLFPLHPNLRLANQLPALEMPHHPNPTDWLP 151
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
+REGV K P GTL++VGL V+++ + P RVT++ + N D + P
Sbjct: 152 YREGVAGK-GTPQ--GTLIEVGLKDPVLIEDEVPPNTRVTLSY-PDYNTDK---AEACDP 204
Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT---- 295
S P+ E G YWGY+VR A ++S+VF C Y+GGYD IGTSE G ++ + P+
Sbjct: 205 SAPREEGGYYWGYQVRRADSLSAVFTECKYEGGYDVSIGTSERGRTLSEA---FPSAAPM 261
Query: 296 -FRHLLIAFGGLAGLEESIEED---DGLKRKNA--REVFHSYFNTCPHQGSRTIRTEEAI 349
F+HLLI FGG G+E + D +G++ A RE+F + N P QGSRTIRT+EAI
Sbjct: 262 DFKHLLIVFGGPRGIEFAALNDKELEGMEMTPAKTRELFDHWVNVLPGQGSRTIRTDEAI 321
Query: 350 FISL 353
+++L
Sbjct: 322 WVAL 325
>gi|350631644|gb|EHA20015.1| hypothetical protein ASPNIDRAFT_39432 [Aspergillus niger ATCC 1015]
Length = 867
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 138/347 (39%), Positives = 184/347 (53%), Gaps = 55/347 (15%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS---------DNY 112
T+SVA+PGSII N S+E T LAG IARA+ +F +DEVVVFD+ D+Y
Sbjct: 506 TLSVAIPGSIITNAHSVEQKTFLAGVIARALAVFCVDEVVVFDDNEYGAHCDYYDHRDSY 565
Query: 113 SRSSAANRSNRSDENESGAA---------FLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
+ A NR + S A FL +L YLETP YLRK LF +H +LR G+
Sbjct: 566 NSPIAKNRDQVDGDTSSPKAYTANSDPSHFLTHVLSYLETPPYLRKHLFRIHPNLRTAGL 625
Query: 164 LPPLDAPHHLRKHEWAPFREGVTL----KENAPNSVGTLV--DVGLNKHVVVDQVLDP-G 216
LP LD PHHLR +EW FREG+ + ++N P S T + D+ +V+ + P
Sbjct: 626 LPSLDMPHHLRANEWCDFREGIVVSSSARKNNPASKITQMSKDILSTVKIVLPNIHLPEH 685
Query: 217 VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDH 275
RVTV + A+ V PS P+ E+G YWGY VR + SSVF C Y GGYD
Sbjct: 686 ARVTVRFPQHEFPYAEP----VHPSTPRAEAGYYWGYYVRRCRSHSSVFTECPYDGGYDL 741
Query: 276 LIGTSEHGDIVNS--------------------SDLTLPTFRHLLIAFGGLAGLEESIEE 315
GTSE G V++ D P F+H+L+ FGG+AG+E ++
Sbjct: 742 SFGTSERGVPVSAVLEEKPQQAHDRYKQGHSFCRDQISPDFQHILVVFGGVAGIEAAVRN 801
Query: 316 DDGLKR-----KNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
D L+ +A ++F + N P QGSRTIRTEEA+++ L Q
Sbjct: 802 DPQLRDMAIRPSDAGKLFDYWVNFLPGQGSRTIRTEEAVWMGLTSLQ 848
>gi|341038421|gb|EGS23413.1| hypothetical protein CTHT_0001020 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 333
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 116/302 (38%), Positives = 173/302 (57%), Gaps = 23/302 (7%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+AVP SI+ + + E T G+IARA IF IDE+VVFD+ SR + +
Sbjct: 35 TVSIAVPTSILTDRVTREQKTTTVGRIARAAAIFSIDEIVVFDDSPPD---SRLPSVDPD 91
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + + G F+ LL YLETP ++RK LF +H +L+ G+LP +D PHH K EW P+
Sbjct: 92 AYTGDTDPGH-FMEHLLNYLETPPFMRKVLFPLHPNLKSQGLLPNMDMPHHPHKDEWMPY 150
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG+T++ GT VD+GL V + + + P RVT+ M + +P + P
Sbjct: 151 REGMTIEGKG---KGTTVDIGLPTPVTISESIPPRTRVTLKMPVT---PSGTPEPIHPAQ 204
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-----F 296
E+G +WG+ VR AP++S++F + GYD IGTSE G +N++ P+ F
Sbjct: 205 PRTEAGYFWGFTVRRAPSLSAIFTTSPFPDGYDMSIGTSERGKPLNAA---FPSYEQADF 261
Query: 297 RHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
+HL++ FGG GLE + D G++ RE+F + N P QG+R I+TEEA+ +
Sbjct: 262 KHLIVVFGGPRGLEYAAMNDPELVQMGIQGGRTRELFDHWVNVLPGQGTRGIKTEEAVLV 321
Query: 352 SL 353
+L
Sbjct: 322 AL 323
>gi|429853814|gb|ELA28862.1| deoxyribose-phosphate aldolase [Colletotrichum gloeosporioides Nara
gc5]
Length = 334
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 170/298 (57%), Gaps = 16/298 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+P SII + + E T +AG+IARA+ +F IDEVV+FD+ +R +R
Sbjct: 37 TVSVAIPTSIIADAVNAERRTAVAGRIARALGVFGIDEVVIFDDSPVE---TRPKTVDRK 93
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + + G FL LL YLE P ++RK LF +H +LR G LP LD PHH +W P+
Sbjct: 94 AYTGDIDPGH-FLEHLLGYLEVPPFMRKMLFPLHPNLRLAGQLPSLDMPHHPNPADWMPY 152
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGV K+ P GT +D+GL V+++ + P RVTV + D + P +
Sbjct: 153 REGVA-KKGTPK--GTELDLGLKTPVIIEDEVPPNTRVTVQFS---DYDTEKAEACDPAA 206
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-FRHLL 300
+E G YWGY VR A ++S+++ C Y+ GYD IGTSE G V S + P F+HLL
Sbjct: 207 PREEGGYYWGYSVRRAASLSALWTECKYEDGYDISIGTSERGRSVGQSFPSTPLDFKHLL 266
Query: 301 IAFGGLAGLEESIEEDDGLKRKNA-----REVFHSYFNTCPHQGSRTIRTEEAIFISL 353
I FGG G+E + D L + +E+F + N P Q SR IRT+EA+FI L
Sbjct: 267 IVFGGPRGIEYAAVNDKELDMVDVSGGKTKELFDHWINVLPGQNSRQIRTDEALFIGL 324
>gi|303313045|ref|XP_003066534.1| hypothetical protein CPC735_057590 [Coccidioides posadasii C735
delta SOWgp]
gi|240106196|gb|EER24389.1| hypothetical protein CPC735_057590 [Coccidioides posadasii C735
delta SOWgp]
Length = 370
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/341 (37%), Positives = 182/341 (53%), Gaps = 49/341 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSII N +S + T L G IARA+ +F +DE+++F+++ S S A
Sbjct: 28 TLSVALPGSIIANAKSHDQKTYLVGSIARALAVFCVDEIIIFEDEPESQQSGHRSRAENE 87
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + S FL +L YLETP +LR+ LF MH +LR G LP LD PHH+R +EW +
Sbjct: 88 YTAFSDPS--HFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHIRANEWCEY 145
Query: 182 REGVTL------KENAP----------------------------NSVGTLVDVGLNKHV 207
REGVT+ E+ P N+ TLV+ G+ + V
Sbjct: 146 REGVTIPSDNYGAEDGPSERSNKKKKSKKSKRNQQGGEEHENCSSNNTFTLVNTGIPEPV 205
Query: 208 VVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFK 265
+ + + P R+T+ T D P V PS P+ E+G YWGY +R ++SSVF
Sbjct: 206 RIPNLSIQPHTRLTLKFKTQDYSDGADP---VAPSTPREEAGYYWGYSIRRCTSLSSVFT 262
Query: 266 NCSYKGGYDHLIGTSEHG---DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK 322
C ++GGYD GTSE G V +P ++HLL+ FGG+AGLE + + D L K
Sbjct: 263 ECPFEGGYDLSFGTSERGLPLAHVLQHPEQIPQYKHLLLVFGGVAGLETAAKADKELAAK 322
Query: 323 N-----AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
E+F + + P QGSRTIRTEEA+++ L +E
Sbjct: 323 GIGAGEVGELFDFWVDVLPGQGSRTIRTEEAVWLGLMGLRE 363
>gi|320036589|gb|EFW18528.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 382
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/341 (37%), Positives = 182/341 (53%), Gaps = 49/341 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSII N +S + T L G IARA+ +F +DE+++F+++ S S A
Sbjct: 40 TLSVALPGSIIANAKSHDQKTYLVGSIARALAVFCVDEIIIFEDEPESQQSGHRSRAENE 99
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + S FL +L YLETP +LR+ LF MH +LR G LP LD PHH+R +EW +
Sbjct: 100 YTAFSDPS--HFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHIRANEWCEY 157
Query: 182 REGVTL------KENAP----------------------------NSVGTLVDVGLNKHV 207
REGVT+ E+ P N+ TLV+ G+ + V
Sbjct: 158 REGVTIPSDNYGAEDGPSERSNKKKKSKKSKRNQQGGEEHENCSSNNTFTLVNTGIPEPV 217
Query: 208 VVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFK 265
+ + + P R+T+ T D P V PS P+ E+G YWGY +R ++SSVF
Sbjct: 218 RIPNLSIQPHTRLTLKFKTQDYSDGADP---VAPSTPREEAGYYWGYSIRRCTSLSSVFT 274
Query: 266 NCSYKGGYDHLIGTSEHG---DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK 322
C ++GGYD GTSE G V +P ++HLL+ FGG+AGLE + + D L K
Sbjct: 275 ECPFEGGYDLSFGTSERGLPLAHVLQHPEKIPQYKHLLLVFGGVAGLETAAKADKELAAK 334
Query: 323 N-----AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
E+F + + P QGSRTIRTEEA+++ L +E
Sbjct: 335 GIGAGEVGELFDFWVDVLPGQGSRTIRTEEAVWLGLMGLRE 375
>gi|358381224|gb|EHK18900.1| hypothetical protein TRIVIDRAFT_43974 [Trichoderma virens Gv29-8]
Length = 345
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 181/322 (56%), Gaps = 35/322 (10%)
Query: 62 TVSVAVPGSIIDNTQSLELAT---RLA--GQIARAVTIFRIDEVVVFDNKSSSDNYSRSS 116
TVSVA P S I + LAT R+A +IARA+ IF IDEV+VFD+ S SR
Sbjct: 39 TVSVAFPSSTIATYNAPSLATAEQRIAVPSRIARALAIFSIDEVIVFDD---SPVKSRPH 95
Query: 117 AANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
++S + + + FL +L YLE P ++RKALF +H +LR LP LD PHH
Sbjct: 96 NTDKSAYTGDTDP-CHFLTHVLSYLEAPPFMRKALFPLHPNLRLTSSLPGLDTPHHPHIK 154
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
E +REGVT+ +S GTLVDVGL+K V + + + P RVT+ + D P +
Sbjct: 155 ESMIYREGVTIAGMTKSSDGTLVDVGLDKPVEIKEDIPPKTRVTLKISE----DESEPAE 210
Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP- 294
V P P+ E+G YWGY VR A ++S+VF Y+GGYD IGTSE G V P
Sbjct: 211 CVHPHAPRTEAGFYWGYTVRQANSLSAVFTESPYEGGYDVSIGTSERGSPVGKE---FPY 267
Query: 295 ----TFRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRT 345
F H+LI FGG G+E + D+ G++ RE+F + + P+QGSRTIRT
Sbjct: 268 GKRVQFNHVLIVFGGPRGIEYAAMNDEELGGMGIQGGKTRELFDHWVDVMPNQGSRTIRT 327
Query: 346 EEAIFISLQYFQEPISRALRRV 367
EEA++I+L AL+R+
Sbjct: 328 EEAMYIALT--------ALKRI 341
>gi|195332993|ref|XP_002033176.1| GM21175 [Drosophila sechellia]
gi|194125146|gb|EDW47189.1| GM21175 [Drosophila sechellia]
Length = 401
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/288 (41%), Positives = 168/288 (58%), Gaps = 5/288 (1%)
Query: 85 AGQIARAVTIFRIDEVVVFDNK--SSSDNYSRSSAANRSNRSDEN-ESGAAFLVRLLQYL 141
GQIARA IFR+++V+VFD+ +++ RS A+ S S + L R+LQYL
Sbjct: 12 TGQIARAACIFRVNDVIVFDDVGIATARETKRSYEADAEGSSTGTVRSSSLQLARILQYL 71
Query: 142 ETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDV 201
E PQYLRK F +H L++ G+L PLD PHHLR+ +REGV + A + +V
Sbjct: 72 ECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSKFRYREGVICDKKAKEG-HSYANV 130
Query: 202 GLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNI 260
GL V+VD+ ++PGVRVTV M +V P +P+ E+G+YWGY+VR A ++
Sbjct: 131 GLLNDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLVSPDEPRRETGVYWGYQVRIAHSM 190
Query: 261 SSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLK 320
S +F Y GYD +GTS+ G V+ +F+H+LI FGGL GLE ++ D+ L
Sbjct: 191 SEIFTKSPYANGYDVTVGTSDRGTNVHEVPNRSYSFKHMLIVFGGLQGLESALANDEKLT 250
Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVE 368
+ +F Y N P QGSRTIRTEEA+ I+L QE + + VE
Sbjct: 251 VDDPELLFDHYLNVLPRQGSRTIRTEEALLIALAALQEKLQPQVADVE 298
>gi|340515208|gb|EGR45464.1| predicted protein [Trichoderma reesei QM6a]
Length = 346
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 129/324 (39%), Positives = 176/324 (54%), Gaps = 37/324 (11%)
Query: 62 TVSVAVPGSIIDNTQS--LELAT---RLA--GQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
TVSVA P S I +Q LAT R+A +IARA+ IF +DEV+V+D+ +
Sbjct: 38 TVSVAFPSSTIATSQGQKYSLATAEQRIAVPSRIARALAIFSVDEVIVYDDSPVKTRPHK 97
Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
+ + +D FL +L YLE P ++RKALF +H +LR LP LD PHH
Sbjct: 98 TDKTAYTGDTDP----CHFLTHVLSYLEAPPFMRKALFPLHPNLRMTSSLPGLDTPHHPH 153
Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
EW +REGVT+ + GTLVDVGL K V + + + P RVT+ + P
Sbjct: 154 IKEWMQYREGVTIAGMTRSGEGTLVDVGLEKPVEIKEDIPPKTRVTLKVAEEEG----EP 209
Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+ V P+ P+ E+G+YWGY VR A ++S+VF Y+ GYD IGTSE G V+
Sbjct: 210 AECVHPAAPRTEAGIYWGYTVRQANSLSAVFTESPYENGYDLSIGTSERGSPVSKE---F 266
Query: 294 PT-----FRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTI 343
P F H LI FGG G+E + D+ G++ RE+F + + P+QGSRTI
Sbjct: 267 PASKRVQFNHALIVFGGPRGIEYAAMNDEELGGMGIQGGKTRELFDHWVDVMPNQGSRTI 326
Query: 344 RTEEAIFISLQYFQEPISRALRRV 367
RTEEA+FI L AL+R+
Sbjct: 327 RTEEAVFIGLM--------ALKRI 342
>gi|116192931|ref|XP_001222278.1| hypothetical protein CHGG_06183 [Chaetomium globosum CBS 148.51]
gi|88182096|gb|EAQ89564.1| hypothetical protein CHGG_06183 [Chaetomium globosum CBS 148.51]
Length = 348
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/349 (36%), Positives = 188/349 (53%), Gaps = 33/349 (9%)
Query: 23 HESQNELSLA--------NGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDN 74
+E Q++L L +G + + N K R AP E TVS+AVP S+I +
Sbjct: 5 YEHQSKLMLGQRRRLDTEDGGAGNGSNFKAYPR----FTPTAPREW-TVSIAVPTSLITD 59
Query: 75 TQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFL 134
+ E T G++ARA+ IF +DEVV++D+ + + + + D A F+
Sbjct: 60 CVTREQRTTNVGRVARALAIFSVDEVVIYDDSPTDKRITNADPDAYTGDVDP----AHFM 115
Query: 135 VRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNS 194
+L YLETP ++RK LF +H +LR G+LP LD PHH K EW P+REG+TL+ A
Sbjct: 116 EHILTYLETPPFMRKVLFPLHPNLRSQGLLPSLDMPHHPHKDEWLPYREGMTLELPAKGG 175
Query: 195 VGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKV 254
GT+VD+G+ V + + + P RVT+ M + + +P V P + E G +WG+ V
Sbjct: 176 KGTVVDIGMPNTVTITESIPPKTRVTLKMPDD---PSGTPEPVHPAAPRTEDGYFWGFSV 232
Query: 255 RYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP-----TFRHLLIAFGGLAGL 309
R A ++S+V Y GYD IGTSE G ++S+ P F HLL+ FGG GL
Sbjct: 233 RKASSLSNVLTESPYADGYDMSIGTSERGMPLSSA---FPGHEQANFNHLLVIFGGPRGL 289
Query: 310 EESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
E + D G+ +E+F + N P+QG+R IRT+EA+ I++
Sbjct: 290 EYAAMNDPQLGEMGISGARTKELFDHWVNVLPNQGTRGIRTDEALLIAM 338
>gi|389646437|ref|XP_003720850.1| deoxyribose-phosphate aldolase 2 [Magnaporthe oryzae 70-15]
gi|86196595|gb|EAQ71233.1| hypothetical protein MGCH7_ch7g640 [Magnaporthe oryzae 70-15]
gi|351638242|gb|EHA46107.1| deoxyribose-phosphate aldolase 2 [Magnaporthe oryzae 70-15]
gi|440474736|gb|ELQ43461.1| deoxyribose-phosphate aldolase 2 [Magnaporthe oryzae Y34]
gi|440484507|gb|ELQ64569.1| deoxyribose-phosphate aldolase 2 [Magnaporthe oryzae P131]
Length = 347
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/360 (35%), Positives = 187/360 (51%), Gaps = 26/360 (7%)
Query: 21 ENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLEL 80
E H S+ GD + + R + P + TVSVA+P SII + E
Sbjct: 5 EQHPSKKRKLGRYGDGADLSDTITTFRPSARFAPTVPRDW-TVSVAIPSSIITDCARDEQ 63
Query: 81 ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQY 140
+IARA+ +F IDEVVVFD+ S + SR A + + + + A ++ LL Y
Sbjct: 64 RINAPSRIARALAVFSIDEVVVFDD---SPHDSRPPAVDTDAYTGDVDP-AHYIWHLLNY 119
Query: 141 LETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVD 200
+E P ++RK LF +H +LR G+ P LD PHH + EW P+ E +TL GTLVD
Sbjct: 120 MECPPFMRKTLFPLHPNLRLQGLFPGLDMPHHPHRDEWLPYMEAMTLPGKPEGGKGTLVD 179
Query: 201 VGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPN 259
+G+ K + + + + P R+T+ +A++ V P+ P+ E G YWG+ VR +
Sbjct: 180 IGMRKPITIAESIPPKTRLTLRFPGESYDNAEA----VDPAAPRTEGGYYWGFGVRRCAS 235
Query: 260 ISSVFKNCSYKGGYDHLIGTSEHGDIVNSS---DLTLPTFRHLLIAFGGLAGLEESIEED 316
+S V ++GGYD IGTSE GD+ ++ P F+HLLI GG GLE + D
Sbjct: 236 LSEVLTESPFEGGYDLSIGTSERGDVSANAFPHGKPPPQFKHLLIVLGGPRGLEFAAMND 295
Query: 317 D-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVEYKS 371
G+ REVF + N P QGSR I+TEE++FI+L LRR+ Y++
Sbjct: 296 PQLSEIGIGTGRVREVFDHWINVLPDQGSRNIKTEESLFIALT--------DLRRLWYRT 347
>gi|346977837|gb|EGY21289.1| deoxyribose-phosphate aldolase [Verticillium dahliae VdLs.17]
Length = 355
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 128/326 (39%), Positives = 181/326 (55%), Gaps = 46/326 (14%)
Query: 76 QSLELATRLAGQIARAVTIFRIDEVVVFDN-----------------KSSSDNYSRSSAA 118
++L+ G+IARA+ IF +DEVVV+++ K DN S
Sbjct: 38 KTLDQRMTAPGRIARALAIFNVDEVVVYNDLPPLTNEGYQEQPLPHKKDRRDNRSHGGHN 97
Query: 119 NRSNRSDENESGAA-------FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
+RS+ +++ES FL +L YLE P ++RKALF +H +LR G+LP LD PH
Sbjct: 98 SRSSHRNDSESSITADVDPCHFLTHVLSYLEAPPFMRKALFPIHPNLRLAGLLPGLDMPH 157
Query: 172 HLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
H + A P+REGVTL+ APN+ GTLVD+G V + + P RVTV + N
Sbjct: 158 HPNPIDIALPYREGVTLEAMAPNAKGTLVDIGNRAPVHLKDEVPPKTRVTVHLPEGDN-- 215
Query: 231 ADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
P + V P+ P+E G YWGY VR A ++S+VF+ ++GGYD IGTSE GD VN +
Sbjct: 216 --GPAEAVHPAAPREEGGYYWGYTVRRAASLSAVFEESPHEGGYDLSIGTSERGDSVNRA 273
Query: 290 --DL-TLPTFRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSR 341
DL F+HLLI FGG G+E + D G+ + R++F + N P QGSR
Sbjct: 274 FPDLKKTAKFKHLLIVFGGPKGIEHAAYNDSQLYEMGIGGQGTRQLFDHWVNVLPGQGSR 333
Query: 342 TIRTEEAIFISLQYFQEPISRALRRV 367
T+RT+EA+++++ LRRV
Sbjct: 334 TMRTDEAVYVAMA--------GLRRV 351
>gi|225685225|gb|EEH23509.1| deoxyribose-phosphate aldolase [Paracoccidioides brasiliensis Pb03]
Length = 684
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 136/360 (37%), Positives = 183/360 (50%), Gaps = 75/360 (20%)
Query: 62 TVSVAVPGSII-------------------------DNTQSLELATRLAGQIARAVTIFR 96
T+SVA+PGSII ++ QS + T LAG IARA+ +F
Sbjct: 320 TLSVALPGSIIAKCALNAGCPSKRIKKRNLPDTSSRNSAQSHDQKTFLAGSIARALAVFC 379
Query: 97 IDEVVVFDNKSSSDNYSRSSAANRSNRSDENES---GAAFLVRLLQYLETPQYLRKALFS 153
+DEVV+F++ D R S A+ DE + + FL +L YLETP YLRK LF
Sbjct: 380 VDEVVIFED----DVRQRQSNAHYDLSKDEYTAYSDPSHFLAHVLSYLETPPYLRKFLFP 435
Query: 154 MHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG----------------- 196
MH +LR G LP LD PHH+R +E +REGVT+ N + G
Sbjct: 436 MHKNLRTAGALPSLDMPHHIRANESCEYREGVTVPANEEHGFGQSNAENHHNQHKRKKGK 495
Query: 197 --------------TLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPS 241
TLVD GL V + + P RVTV + ++ D +VV PS
Sbjct: 496 KNTEEGVQDTVMSYTLVDTGLPNKVCIPHIPVPENTRVTVKLCSSE--DPSQGAEVVSPS 553
Query: 242 KPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LPTFRH 298
P+E G YWGY VR ++S VF C + GGYD GTSE G + S +T +P F H
Sbjct: 554 TPREEMGYYWGYSVRCCDSLSDVFTECPFDGGYDLSFGTSERGSPL-SQVITEGVPNFEH 612
Query: 299 LLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L++ FGG+AGLE +++ D GL+ + +F + N P QGSRTIRTEEA+++ L
Sbjct: 613 LIVVFGGVAGLEVAVKADKELLARGLRPETVGNMFDYWVNVLPGQGSRTIRTEEAVWLGL 672
>gi|302420647|ref|XP_003008154.1| deoxyribose-phosphate aldolase [Verticillium albo-atrum VaMs.102]
gi|261353805|gb|EEY16233.1| deoxyribose-phosphate aldolase [Verticillium albo-atrum VaMs.102]
Length = 355
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/326 (39%), Positives = 181/326 (55%), Gaps = 46/326 (14%)
Query: 76 QSLELATRLAGQIARAVTIFRIDEVVVFDN-----------------KSSSDNYSRSSAA 118
++L+ G+IARA+ IF +DEVVV+++ K DN S
Sbjct: 38 KTLDQRMTAPGRIARALAIFNVDEVVVYNDLPPLTNEGYQEQPLPHKKDRRDNRSHGGHN 97
Query: 119 NRSNRSDENESGAA-------FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
+RS+ +++ES FL +L YLE P ++RKALF +H +LR G+LP LD PH
Sbjct: 98 SRSSHRNDSESSITADVDPCHFLTHVLSYLEAPPFMRKALFPIHPNLRLAGLLPGLDMPH 157
Query: 172 HLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
H + A P+REGVTL+ APN+ GTLVD+G V + + P RVTV + N
Sbjct: 158 HPNPIDIALPYREGVTLEAMAPNAKGTLVDIGNRAPVHLKDEVPPKTRVTVHLPEGDN-- 215
Query: 231 ADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
P + V P+ P+E G YWGY VR A ++S+VF+ ++GGYD IGTSE GD VN +
Sbjct: 216 --GPAEAVHPAAPREEGGYYWGYTVRRAASLSAVFEESPHEGGYDLSIGTSERGDSVNRA 273
Query: 290 --DL-TLPTFRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSR 341
DL F+HLLI FGG G+E + D G+ + R++F + N P QGSR
Sbjct: 274 FPDLKKTAKFKHLLIVFGGPKGIEHAAYNDAQLYEMGIGGQGTRQLFDHWVNVLPGQGSR 333
Query: 342 TIRTEEAIFISLQYFQEPISRALRRV 367
T+RT+EA+++++ LRRV
Sbjct: 334 TMRTDEAVYVAMA--------GLRRV 351
>gi|367043134|ref|XP_003651947.1| hypothetical protein THITE_2112761 [Thielavia terrestris NRRL 8126]
gi|346999209|gb|AEO65611.1| hypothetical protein THITE_2112761 [Thielavia terrestris NRRL 8126]
Length = 320
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/302 (39%), Positives = 170/302 (56%), Gaps = 20/302 (6%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVAVP SII + + E T G++ARA+ IF +DEVVVFD+ S + +
Sbjct: 19 TVSVAVPTSIITDCVTREQRTTNVGRVARALAIFSVDEVVVFDDSPVDQRISNADPDAYT 78
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
D A F+ LL YLETP ++RK LF +H +LR G+LP LD PHH K EW P+
Sbjct: 79 GDVDP----AHFMEHLLTYLETPPFMRKVLFPLHPNLRSQGLLPSLDMPHHPHKDEWLPY 134
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG+TL+ GT VD+G+ V + + + P RVT+ M ++ +P V P +
Sbjct: 135 REGMTLETPPKGGKGTEVDIGMPSTVRIAESIPPRTRVTLKMPDDQ---YGTPEPVHPAA 191
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-----F 296
E+G +WG+ VR A ++S+V ++ GYD IGTSE G ++ + P+ F
Sbjct: 192 PRTEAGYFWGFSVRRASSLSNVLTESPFEDGYDMSIGTSERGVPLSKA---FPSYEPANF 248
Query: 297 RHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
HLLI FGG GLE + D G+ +E+F + N P+QG+R I+T+EA+ I
Sbjct: 249 NHLLIVFGGPRGLEFAAMNDPQLGEMGITGPRTKELFDHWVNVLPNQGTRGIKTDEALLI 308
Query: 352 SL 353
+L
Sbjct: 309 AL 310
>gi|391340918|ref|XP_003744780.1| PREDICTED: uncharacterized protein C9orf114-like [Metaseiulus
occidentalis]
Length = 313
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/301 (40%), Positives = 177/301 (58%), Gaps = 30/301 (9%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AV S+++N + R+ QIARA IFR++EV+V D+ + + S +N
Sbjct: 34 TISIAVCASMLENLKGEGSKHRVIAQIARAAAIFRVNEVIVVDDAYKNPGFDSSKDSN-- 91
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+L+YL+ PQYLRK+LF M L+ +G+ PPLDA HH+R + P+
Sbjct: 92 -----------LFQLILEYLDCPQYLRKSLFPMCLELKHIGVAPPLDASHHVRMEDEVPY 140
Query: 182 REGVTLKENAPNSVGTL-VDVGLNKHVVVDQ--VLDPGVRVTVAMGTNRNLDADSPRQVV 238
REG+ + S+G++ +DVGL++ + ++ + G+RVTV M + D+PR
Sbjct: 141 REGIVTR----VSLGSVWLDVGLHEDIKIESRAEVKQGLRVTVNMTEKSIVGPDTPR--- 193
Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
K+ G+YWGY+ R A +IS V C ++ GYD IGTSE G+ + + L FRH
Sbjct: 194 -----KKDGLYWGYRTRVAKSISEVIALCPFEKGYDMTIGTSERGESILELE-ALGKFRH 247
Query: 299 LLIAFGGLAGLEESIEEDDGLKR-KNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
LLI FGG GLE ++E+D L + K + VF Y NTCP QGSRTIRTEEA+ ISL Q
Sbjct: 248 LLIVFGGPKGLEHALEQDKALTQVKEPKNVFDKYLNTCPLQGSRTIRTEEAVLISLSALQ 307
Query: 358 E 358
+
Sbjct: 308 K 308
>gi|403177364|ref|XP_003888780.1| hypothetical protein PGTG_22495 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375172839|gb|EHS64743.1| hypothetical protein PGTG_22495 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 441
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 144/359 (40%), Positives = 192/359 (53%), Gaps = 64/359 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK-----------SSSD 110
TVS+A+PGSI++N Q+ EL T L GQIARA IF ++E+++FD S
Sbjct: 98 TVSIALPGSIVNNAQTWELKTALVGQIARACAIFSVNEIIIFDESVGEADPSAKPHQPSS 157
Query: 111 NYSRSSAANRSNRSDENESG--------AAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
Y R A R+ D +G + F+ R+L+YLE PQYLRK+LF +H LR G
Sbjct: 158 TYGR--AKYRTPAEDLELAGPEDGPFQPSHFVARILEYLECPQYLRKSLFPIHPDLRLAG 215
Query: 163 MLPPLDAPHHLRKHEWAPFREGVTL-KENAPNSVG-----------------TLVDVGLN 204
+LPPLD PHH RK P+REG L + N V + DVGL
Sbjct: 216 LLPPLDLPHHFRKDHETPWREGCILPADKVENYVAGGHYQKKKNRKGDDRKTSWADVGLA 275
Query: 205 KHVVVDQVLDPGV------RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAP 258
+ V V LD GV RVTV M A R V P + +G+YWGY +R A
Sbjct: 276 EPVQVQ--LDEGVSLPDWTRVTVQMPQGEGKLA---RLVSPRLPTQTTGIYWGYSIRLAS 330
Query: 259 NISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED 316
+IS VF Y +GGYD IGTSE G+ V+ + F+H+L+ GGL+GLE +I D
Sbjct: 331 SISKVFTETPYASEGGYDLTIGTSERGENVDDVVDGIGDFKHMLLVLGGLSGLELTIASD 390
Query: 317 DGLKRK----NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVEYKS 371
D L+ + +A +F + NT P QGSRT+RTEEA+ +SL ALRR+ ++S
Sbjct: 391 DTLEPRLTAADAPLLFDHWLNTLPFQGSRTVRTEEALLVSLG--------ALRRLLFRS 441
>gi|443711334|gb|ELU05162.1| hypothetical protein CAPTEDRAFT_182660 [Capitella teleta]
Length = 234
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 104/231 (45%), Positives = 136/231 (58%), Gaps = 3/231 (1%)
Query: 134 LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPN 193
+ R+LQYLE PQYLRK+ F H LR+ G+L PLD+PHH++ E +REGV +
Sbjct: 1 MARILQYLECPQYLRKSFFPQHDDLRYAGVLNPLDSPHHMKADETPKYREGVIVNAQIKE 60
Query: 194 SVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGY 252
+ G+ K+V +D+VL PG+RVT+ M VV P P+ G+YWGY
Sbjct: 61 GRNSYALAGILKNVEIDRVLMPGMRVTIQMDYASESKKKVKGYVVSPDTPRTRDGLYWGY 120
Query: 253 KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEES 312
+ R A ++S VF + + GYD IGTSE G V+ D LP F HLL+ FGGL GLE +
Sbjct: 121 QTRLAASLSEVFTDSPFSSGYDLTIGTSERGSSVD--DAELPHFNHLLVVFGGLRGLESA 178
Query: 313 IEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
+E D L + +FH Y NTCP QGS TIRTEEAI IS+ + I A
Sbjct: 179 LEADPKLNMDDPSLLFHQYLNTCPDQGSGTIRTEEAILISMAALRTKIKAA 229
>gi|384493430|gb|EIE83921.1| hypothetical protein RO3G_08626 [Rhizopus delemar RA 99-880]
Length = 595
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 134/340 (39%), Positives = 193/340 (56%), Gaps = 37/340 (10%)
Query: 24 ESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVP---TVSVAVPGSIIDNTQSLEL 80
ES+ + N + S D K++ K+ + E P TV++A+P S+I N +LEL
Sbjct: 3 ESKKRRAFENKELSGPDYGKERVVKKQDVKKPLVTEKPRKHTVTIAIPSSVISNAPTLEL 62
Query: 81 ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQY 140
T LAGQIARA+ +F +DEVV++++K + +++ + N FL RLLQY
Sbjct: 63 KTILAGQIARALVLFCVDEVVIYEDK----------VPHPTSKVNPN----LFLARLLQY 108
Query: 141 LETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVD 200
+ETP YLRKAL + S L+F G+LP LD PHH + + +REGVTL N + TLVD
Sbjct: 109 METPPYLRKALVPISSDLKFAGLLPSLDVPHHPARDDMTLYREGVTL--NKADDQSTLVD 166
Query: 201 VGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-------QVVPPSKPKE-SGMYWGY 252
VGL + +D+ + P VRVTV + + AD+ + +VV P P+E SG+YWGY
Sbjct: 167 VGLFRRARIDRPVQPNVRVTVEL-SQVVAAADTKKGQKPIQAKVVSPKAPREKSGLYWGY 225
Query: 253 KVRYAPNISSVFKNCSYKGGYDHLIGTSE-HGDIVNSSDL--TLPTFRHLLIAFGGL-AG 308
+R A + S V + Y+ YD L+G S+ GD V + + F H+LIAFG G
Sbjct: 226 SIRLASSFSKVMTDSPYE--YDFLVGVSDCKGDNVYQPSIKDKVKPFEHILIAFGAPNGG 283
Query: 309 LEESIEEDDGLK--RKNAREVFHSYFNTCPHQGSRTIRTE 346
LEE+IE D+ LK +NA E+F + G+R+IR E
Sbjct: 284 LEEAIEADEDLKVGEENANELFDLFIQPL-TSGTRSIRLE 322
>gi|367021134|ref|XP_003659852.1| hypothetical protein MYCTH_2115401 [Myceliophthora thermophila ATCC
42464]
gi|347007119|gb|AEO54607.1| hypothetical protein MYCTH_2115401 [Myceliophthora thermophila ATCC
42464]
Length = 340
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 115/306 (37%), Positives = 173/306 (56%), Gaps = 28/306 (9%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+AVP S+I + + E T G++ARA+ IF +DEVVV+D+ + + +
Sbjct: 39 TVSIAVPTSLITDCVTREQRTTNVGRVARALAIFSVDEVVVYDDSPMDKRITNADPDAYT 98
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
D A F+ +L YLETP ++RK LF +H +LR G+LP LD PHH K EW P+
Sbjct: 99 GDVDP----AHFMDHILNYLETPPFMRKVLFPLHPNLRSQGLLPSLDMPHHPHKDEWLPY 154
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR---QVV 238
REG+TL+ GT+VD+G+ V + + + P R+T+ M D PR + V
Sbjct: 155 REGMTLETPPKGGKGTVVDIGMPNTVTIAESIPPKTRLTLKM-------PDDPRGTPEPV 207
Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-- 295
P+ P+ E G +WG+ VR A ++S+V Y+ GYD IGTSE G ++ + P+
Sbjct: 208 HPTAPRTEGGYFWGFSVRRASSLSNVLTESPYEDGYDLSIGTSERGVPLSKA---FPSYE 264
Query: 296 ---FRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEE 347
F+HL++ FGG GLE + D G+ +E+F + N P+QG+R IRT+E
Sbjct: 265 QADFKHLIVIFGGPRGLEYAAMNDPQLSEMGISGSRTKELFDHWVNVLPNQGTRGIRTDE 324
Query: 348 AIFISL 353
A+ I++
Sbjct: 325 ALLIAM 330
>gi|295663196|ref|XP_002792151.1| deoxyribose-phosphate aldolase [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279326|gb|EEH34892.1| deoxyribose-phosphate aldolase [Paracoccidioides sp. 'lutzii' Pb01]
Length = 399
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 134/354 (37%), Positives = 183/354 (51%), Gaps = 69/354 (19%)
Query: 62 TVSVAVPGSII-------------------DNTQSLELATRLAGQIARAVTIFRIDEVVV 102
T+SVA+PGSII ++ QS + T LAG IARA+ +F +DEVV+
Sbjct: 41 TLSVALPGSIIAKYALKEISKRNLPDTNSLNSAQSHDQKTFLAGSIARALAVFCVDEVVI 100
Query: 103 FDNKSSSDNYSRSSAANRSNRSDENES---GAAFLVRLLQYLETPQYLRKALFSMHSSLR 159
F+++ R S A+ DE + + FL +L YLETP +LRK LF MH +LR
Sbjct: 101 FEDEVRQ----RQSNAHCDLSKDEYTAYSNPSHFLAHVLSYLETPPHLRKFLFPMHKNLR 156
Query: 160 FVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG----------------------- 196
G LP LD PHH+R +E +REGVT N + G
Sbjct: 157 TAGTLPSLDMPHHIRANELCEYREGVTFPANEEHGFGQSNAENHNNQHKRKKGKKNMEEG 216
Query: 197 --------TLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES- 246
TLVD GL V + + P RVTV + +++ D +VV PS P+E
Sbjct: 217 AQDTVMSYTLVDTGLPNKVRIPHIPVPENTRVTVKLCSSK--DPSQVAEVVSPSAPREEM 274
Query: 247 GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LPTFRHLLIAFG 304
G YWGY VR ++S VF C + GGYD GTSE G + S +T +P F HL++ FG
Sbjct: 275 GYYWGYSVRCCDSLSDVFTECPFDGGYDLSFGTSERGSPL-SQVITEGVPNFEHLIVVFG 333
Query: 305 GLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
G+AGLE +++ D GL+ + +F + N P QGSRTIRTEEA+++ L
Sbjct: 334 GVAGLEVAVKADKELLARGLRPETVGNMFDYWVNVLPGQGSRTIRTEEAVWLGL 387
>gi|358396408|gb|EHK45789.1| hypothetical protein TRIATDRAFT_152499 [Trichoderma atroviride IMI
206040]
Length = 344
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 120/311 (38%), Positives = 172/311 (55%), Gaps = 26/311 (8%)
Query: 62 TVSVAVPGSIIDNTQSL----ELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSA 117
TVSVA P S I SL E + ++ARA+ IF +DEV+V+D+ +S R+
Sbjct: 39 TVSVAFPSSTIATYASLLPTAEQRMAVPSRVARALAIFSVDEVIVYDDSPASSRPHRTDT 98
Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
+ + +D FL +L Y+ETP ++RKALF +H +LR LP LD HH E
Sbjct: 99 SVYTGDTDP----VHFLTHVLSYMETPPFMRKALFPLHPNLRLTSSLPGLDTAHHPHPKE 154
Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
P+REGVT+ + GTLVD+G K V + + + P RVT+ + D P +
Sbjct: 155 SMPYREGVTIVGMTKSGNGTLVDIGQEKPVEIKEDIPPKTRVTLKLSE----DESEPCEC 210
Query: 238 VPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT- 295
V P P+ E+G YWGY VR A ++S+VF Y+ GYD IGTSE G + + P+
Sbjct: 211 VHPHAPRTEAGFYWGYTVRKAASLSAVFTESPYENGYDLSIGTSERG---SPASKEFPSS 267
Query: 296 ----FRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
F H+LI FGG G+E + D+ G++ RE+F + + P+QGSRTIRTE
Sbjct: 268 KRVQFNHVLIVFGGPRGIEYAAMNDEELGGMGIQGGKTRELFDHWVDVLPNQGSRTIRTE 327
Query: 347 EAIFISLQYFQ 357
EA++I+L +
Sbjct: 328 EAMYIALTVLK 338
>gi|169594860|ref|XP_001790854.1| hypothetical protein SNOG_00159 [Phaeosphaeria nodorum SN15]
gi|111070534|gb|EAT91654.1| hypothetical protein SNOG_00159 [Phaeosphaeria nodorum SN15]
Length = 443
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 144/412 (34%), Positives = 205/412 (49%), Gaps = 73/412 (17%)
Query: 9 GLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKK---RKRDQLNDDAPIEV----- 60
G+ + ++A+T H++QN G+SSS KK++ ++ ++++ P V
Sbjct: 25 GVGSKKRKASTVVIHQNQN------GESSSSPMKKRRANEDKEEEEIDTSRPTAVFTPKG 78
Query: 61 ---PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSS------SDN 111
TVS+A+PGS ++N + + T G+IARA +F +DE+VVFD+ +
Sbjct: 79 ARDWTVSIALPGSWLNNAKKPDHKTAQVGRIARAAAVFCVDEIVVFDDDPVNVPSVVAPK 138
Query: 112 YSRSSAANRSNRS------DENESGA---AFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
Y R S + +E+E FL +L Y E P +LR LF H +L G
Sbjct: 139 YMRKKGKQLSKQEVMDSILEEDEPWQNPDQFLYHVLSYAECPPHLRMKLFPRHKNLEQAG 198
Query: 163 MLPPLDAPHHLRKHEWAPFREGVTL---KENAPNSVG---------TLVDVGLNKHVVV- 209
MLP +D PHHLR HEW +REG+ + AP G V+VGL + V
Sbjct: 199 MLPSMDMPHHLRSHEWCQYREGMVIGPAPAPAPTQKGKKSQDTLDYAYVEVGLPYPIRVP 258
Query: 210 ---DQVLDPGVRVTVAMGTNRNL---------DADSPRQVVPP-SKPK-ESGMYWGYKVR 255
++ G+R TV L D DS + S P+ E G YWGY VR
Sbjct: 259 IPAGSPVEEGMRTTVRFSNPDPLASWPNMSKSDCDSIDAIACANSLPREEGGYYWGYTVR 318
Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHG---------DIVNSSDLTLPTFRHLLIAFGGL 306
AP++SSVF C Y GYD+ IGTSE G + +S + L F+HLL+ FGG+
Sbjct: 319 RAPSLSSVFSECEYPSGYDYTIGTSERGIPIHSILPDNASSSHNKVLEPFKHLLLVFGGV 378
Query: 307 AGLEESIEED-----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
AG+E ++ D GL +KNA +F + N P QGSRTIRTEEA+ L
Sbjct: 379 AGIEPAVANDPELVAKGLGKKNADTLFDKWVNLVPGQGSRTIRTEEAVEFGL 430
>gi|402075402|gb|EJT70873.1| deoxyribose-phosphate aldolase 2 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 485
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 184/344 (53%), Gaps = 25/344 (7%)
Query: 34 GDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVT 93
GD + + K + + P + TVS+AVP SII E +IARA+
Sbjct: 18 GDGADTSEEIKSFKPSTRFTPTVPRDW-TVSIAVPSSIITEASRDEQRIGAPSRIARALA 76
Query: 94 IFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFS 153
+F +DE+VVFD+ + SR A + + + + ++ LL YLE P ++RK LF
Sbjct: 77 VFGMDEIVVFDDSPTD---SRPPAVDTDAYTGDVDP-CHYIWHLLTYLECPPFMRKTLFP 132
Query: 154 MHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVL 213
+H +L+ G+LP LD P H + EW P+ E +TL GTLVD+G+ K + + + +
Sbjct: 133 LHPNLKLQGLLPNLDMPSHPHRDEWLPYMEAMTLPGKPDGGNGTLVDIGMRKPITISESI 192
Query: 214 DPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGG 272
P R+T+ N++ + Q V P+ P+ ++G WG+ VR ++S V C Y+GG
Sbjct: 193 PPKTRLTLHF-PNQSYEG---AQAVDPTAPRTDAGYAWGFNVRRCASLSDVLTECPYEGG 248
Query: 273 YDHLIGTSEHGDIVNSSDL-----TLPTFRHLLIAFGGLAGLEESIEEDD-----GLKRK 322
YD IGTSE G+ NS+ + P F+HLLI FGG GLE + D G+
Sbjct: 249 YDLSIGTSERGE--NSASVFPYSSAPPDFKHLLIVFGGPRGLEYAAMNDPQLSEIGIGAG 306
Query: 323 NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ---EPISRA 363
REVF + N P QGSR I+T+EA++I+L + +P+ RA
Sbjct: 307 RVREVFDHWINVLPDQGSRAIKTDEAVYIALTESRRRDQPLGRA 350
>gi|408392531|gb|EKJ71885.1| hypothetical protein FPSE_07986 [Fusarium pseudograminearum CS3096]
Length = 342
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 116/301 (38%), Positives = 164/301 (54%), Gaps = 18/301 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+P SI+ N + + +IARA+ +F +DEVVVFD+ S ++ + +
Sbjct: 41 TLSVAIPSSILTNLATADQRMTQPARIARALAVFSVDEVVVFDDSHVSTRPRQTDPNSYT 100
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+D FL +L +LE P ++RK LF +H +LR MLP LD PHH ++ +
Sbjct: 101 GDTDP----CHFLAHILSFLEAPPFMRKTLFPLHPNLRLTAMLPSLDMPHHPNPRDYLSY 156
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
EGVT+ GTLV+ GL V ++ + P R+T+ + N P V P +
Sbjct: 157 AEGVTVTGATSTGSGTLVECGLENPVEIEADIPPKTRITLKFPEDEN---QYPEPVHPAA 213
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT----FR 297
E G YWGY VR A ++S VF Y+ GYD IGTSE G V +S PT F
Sbjct: 214 PRTEGGYYWGYTVRKANSLSKVFTESPYENGYDVSIGTSERG--VPASKAFPPTKKVNFS 271
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
HLLI FGG G+E + D+ L + +E+F + N P+QGSRTIRT+EA+FI+
Sbjct: 272 HLLIVFGGPRGIEFAAMNDEELSSMDIQGSKTKELFDHWINVLPNQGSRTIRTDEAVFIA 331
Query: 353 L 353
L
Sbjct: 332 L 332
>gi|342883458|gb|EGU83952.1| hypothetical protein FOXB_05535 [Fusarium oxysporum Fo5176]
Length = 340
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 115/301 (38%), Positives = 163/301 (54%), Gaps = 18/301 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVAVP SI+ + S + +IARA+ F +DEVVVFD+ S + + +
Sbjct: 39 TISVAVPSSILSDLASADQRMNQPARIARALAAFSVDEVVVFDDSPLSARPRHTDTNSYT 98
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+D FL +L YLE P ++RKALF +H +LR MLP LD PHH +W +
Sbjct: 99 GDTDP----CHFLTHILTYLEAPPFMRKALFPLHPNLRLTAMLPTLDMPHHPNPKDWISY 154
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG+ + G+LV+VGL + V V+ + P R+T+ + + P V P +
Sbjct: 155 REGIVIPGETSTGSGSLVEVGLGEPVEVEAKIPPKTRITLKFSED---ETQYPEPVHPAA 211
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT----FR 297
E G YWGY VR A ++S+VF Y+ GYD IGTSE G + +S + P+ F
Sbjct: 212 PRTEGGYYWGYTVRRASSLSNVFTESPYENGYDVSIGTSERG--IPASRVFPPSKRVNFN 269
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
HL+I FGG GLE + D L + +E+F + N P QG R IRT+EA+FI+
Sbjct: 270 HLIIVFGGPRGLEFAAMNDQELSSMDIQGPRTKELFDHWVNILPGQGCRNIRTDEAVFIA 329
Query: 353 L 353
L
Sbjct: 330 L 330
>gi|331246527|ref|XP_003335896.1| replication factor C subunit 1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1470
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 181/338 (53%), Gaps = 56/338 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK-----------SSSD 110
TVS+A+PGSI++N Q+ EL T L GQIARA IF ++E+++FD S
Sbjct: 908 TVSIALPGSIVNNAQTWELKTALVGQIARACAIFSVNEIIIFDESVGEADPSAKPHQPSS 967
Query: 111 NYSRSSAANRSNRSDENESG--------AAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
Y R A R+ D +G + F+ R+L+YLE PQYLRK+LF +H LR G
Sbjct: 968 TYGR--AKYRTPAEDLELAGPEDGPFQPSHFVARILEYLECPQYLRKSLFPIHPDLRLAG 1025
Query: 163 MLPPLDAPHHLRKHEWAPFREGVTL-KENAPNSVG-----------------TLVDVGLN 204
+LPPLD PHH RK P+REG L + N V + DVGL
Sbjct: 1026 LLPPLDLPHHFRKDHETPWREGCILPADKVENYVAGGHYQKKKNRKGDDRKTSWADVGLA 1085
Query: 205 KHVVVDQVLDPGV------RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAP 258
+ V V LD GV RVTV M A R V P + +G+YWGY +R A
Sbjct: 1086 EPVQVQ--LDEGVSLPDWTRVTVQMPQGEGKLA---RLVSPRLPTQTTGIYWGYSIRLAS 1140
Query: 259 NISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED 316
+IS VF Y +GGYD IGTSE G+ V+ + F+H+L+ GGL+GLE +I D
Sbjct: 1141 SISKVFTETPYASEGGYDLTIGTSERGENVDDVVDGIGDFKHMLLVLGGLSGLELTIASD 1200
Query: 317 DGLKRK----NAREVFHSYFNTCPHQGSRTIRTEEAIF 350
D L+ + +A +F + NT P QGSRT+RTEEA+
Sbjct: 1201 DTLEPRLTAADAPLLFDHWLNTLPFQGSRTVRTEEALL 1238
>gi|46135861|ref|XP_389622.1| hypothetical protein FG09446.1 [Gibberella zeae PH-1]
Length = 344
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 117/303 (38%), Positives = 164/303 (54%), Gaps = 20/303 (6%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLA--GQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
T+SVA+P SI+ N R+ +IARA+ +F +DEVVVFD+ S ++ +
Sbjct: 41 TLSVAIPSSILTNPSLATADQRMTQPARIARALAVFSVDEVVVFDDSPVSTRPRQTDPNS 100
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
+ +D FL +L +LE P ++RK LF +H +LR MLP LD PHH ++
Sbjct: 101 YTGDTDP----CHFLAHILSFLEAPPFMRKTLFPLHPNLRLTAMLPSLDMPHHPNPRDYL 156
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
+ EGVT+ GTLV+ GL V ++ + P R+T+ + N P V P
Sbjct: 157 SYAEGVTVAGATSTGSGTLVECGLENPVEIEADIPPKTRITLKFPEDEN---QYPEPVHP 213
Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT---- 295
+ E G YWGY VR A ++S VF Y+ GYD IGTSE G V +S PT
Sbjct: 214 AAPRTEGGYYWGYTVRKANSLSKVFTESPYENGYDVSIGTSERG--VPASKAFPPTKKVN 271
Query: 296 FRHLLIAFGGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
F HLLI FGG G+E + D+ L + +E+F + N P+QGSRTIRT+EA+F
Sbjct: 272 FNHLLIVFGGPRGIEFAAMNDEELSSMEIQGSKTKELFDHWINVLPNQGSRTIRTDEAVF 331
Query: 351 ISL 353
I+L
Sbjct: 332 IAL 334
>gi|170589912|ref|XP_001899717.1| Uncharacterized ACR, COG2106 family protein [Brugia malayi]
gi|158592843|gb|EDP31439.1| Uncharacterized ACR, COG2106 family protein [Brugia malayi]
Length = 391
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 118/316 (37%), Positives = 166/316 (52%), Gaps = 18/316 (5%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+A+PGSI++N QS EL T LAGQIAR+ +F IDE+VV+D + + R N
Sbjct: 80 LSIALPGSILNNAQSPELRTYLAGQIARSAAVFCIDEIVVYDETARMKSQQREDYCNGQW 139
Query: 123 RSD-----ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
D N L ++L++++ PQYLRK LF + +L++ G+L PLD PHHLR +
Sbjct: 140 YPDLPVHSGNVECNFHLAKILEFMDCPQYLRKTLFPLQKTLKYAGILNPLDCPHHLRASD 199
Query: 178 WA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
+ P+REGV L + G + +VGL K D+ + V N L R+
Sbjct: 200 LSVPYREGVVLDKPIKAGRGPICNVGLYKEPRKDEAISEFCPVFGLGSFNVPLKTAGIRK 259
Query: 237 --------VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS 288
V +E+G+YWGYKVR A ++ YD +IGTSE G V+
Sbjct: 260 RKRYHGCLVNSKQIKQEAGLYWGYKVRLALSLYDALN----AEEYDVIIGTSERGKPVSK 315
Query: 289 SDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
+++ +LI FGGL GLE ++E D + E+F Y N P QGSR IRTEEA
Sbjct: 316 FEMSFSGKNRILIVFGGLEGLETAVEADKNINCSTPEELFEHYLNVVPGQGSRIIRTEEA 375
Query: 349 IFISLQYFQEPISRAL 364
I I+L + I L
Sbjct: 376 IPITLATLRPMICTDL 391
>gi|407918289|gb|EKG11560.1| hypothetical protein MPH_11053 [Macrophomina phaseolina MS6]
Length = 436
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 143/388 (36%), Positives = 197/388 (50%), Gaps = 99/388 (25%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD-------NKSSSDN--- 111
T+S+A+PGSII N QS +L T +AG IARA +F +DE+V+FD NK D+
Sbjct: 49 TLSIALPGSIIANAQSHDLKTAMAGAIARAAAVFTVDEIVIFDDGQSQVLNKGRRDSANG 108
Query: 112 --YSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDA 169
Y S A+ + SD + FL LL YLETP LRKALF +H +LR G LP LD
Sbjct: 109 HMYGVSDYASYTGYSDPDY----FLYHLLSYLETPPNLRKALFPLHPNLRTAGTLPSLDM 164
Query: 170 PHHLRKHEWAPFREGVTL----------------------------------KEN----- 190
PHHLR EW +REGV L K+N
Sbjct: 165 PHHLRADEWCQYREGVVLGASVYDDTSSVTGSYAGSYTGSPPAPNKPSSKKHKKNRKAYA 224
Query: 191 APNSV------GTLVDVGLNK--HVVVDQVLDPGVRVTVAM------GTNRNLDADSPRQ 236
AP+++ TLVD GL H+ + + P RVT+ G NL+A Q
Sbjct: 225 APDTMPGAGAPPTLVDAGLPSPIHLDLPSPIPPHTRVTLKFACAEDPGEYDNLEA----Q 280
Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDL 291
+ P+ P+ E+G YWGY+ R A ++S+VF C + GGYD +GTSE G I+ S +
Sbjct: 281 AIDPALPREEAGYYWGYQTRQASSLSAVFTECPWDGGYDVSVGTSERGVPLSQILGSGGI 340
Query: 292 ---------------TLPT-FRHLLIAFGGLAGLEESIEED-----DGLKRKNAREVFHS 330
+LP F+HL++ FGG++GLE + D G+ KN ++F +
Sbjct: 341 PKEDAGGGGVEGGGSSLPAGFKHLVLVFGGVSGLEVAAMADKELVAKGVTGKNVGDIFDA 400
Query: 331 YFNTCPHQGSRTIRTEEAIFISLQYFQE 358
+ N P+QGSRTIR EEA+++ L F +
Sbjct: 401 WVNLVPNQGSRTIRCEEAVWLGLMGFSD 428
>gi|391863037|gb|EIT72351.1| hypothetical protein Ao3042_01340 [Aspergillus oryzae 3.042]
Length = 700
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 142/374 (37%), Positives = 188/374 (50%), Gaps = 67/374 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF--------DNKSSSDNYS 113
T++VA+PGSI+ N S+E T LAG IARA+ +F +DEVVVF D NY
Sbjct: 327 TLTVAIPGSIVANAHSVEQKTLLAGIIARALAVFCVDEVVVFDDDENSPRDTYHGQGNYH 386
Query: 114 RSSAANRSNRSDENESGAA----------FLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
S + N+S A FL +L YLETP YLRK LF MH +LR G+
Sbjct: 387 EFPIDKTSGALNGNDSSAKRYTAYSDPSNFLAHILSYLETPPYLRKHLFPMHPNLRGAGL 446
Query: 164 LPPLDAPHHLRKHEWAPFREGVTL---------------------KENAPNS----VGTL 198
LP LD PHHLR +EW +REG+ + + ++P+S T+
Sbjct: 447 LPSLDMPHHLRANEWCEYREGIVVSSSDGGGQRRHSTQMSNYHNNRRHSPSSPTNFSATV 506
Query: 199 VDVGLNKHVVVDQVLDPG-VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRY 256
+D GL K VV+ + P RVTV + + Q V PS P+ E+G YWGY VR
Sbjct: 507 IDTGLPKKVVLPDIQLPEYARVTVRF---PDYGREHYAQPVHPSTPRSEAGYYWGYYVRR 563
Query: 257 APNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-------------SDLTLPTFRHLLIAF 303
++SSVF C + GGYD GTSE G V S P ++HLLI F
Sbjct: 564 CRSLSSVFTECPFDGGYDLSFGTSERGAPVYSVLEEDRQEHDNYDRRKIPPDYKHLLIVF 623
Query: 304 GGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
GG+AG+E ++ D L + A ++F + N P QGSRTIRTEEA+++ L
Sbjct: 624 GGVAGIEAAVHNDPQLCDMDIRATEAGKLFDYWVNLLPGQGSRTIRTEEAVWLGLTSLWG 683
Query: 359 PISRALR-RVEYKS 371
+ R R YKS
Sbjct: 684 LVEGTHRPRPSYKS 697
>gi|358369151|dbj|GAA85766.1| deoxyribose-phosphate aldolase 2 [Aspergillus kawachii IFO 4308]
Length = 430
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 133/389 (34%), Positives = 183/389 (47%), Gaps = 78/389 (20%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIF-----------------RIDEVVVFD 104
T+SVA+PGSII N S+E T LAG IARA+ +F E D
Sbjct: 46 TLSVAIPGSIIANAHSVEQKTLLAGVIARALAVFCVDEVVVFDDDENSAYCDFHEHYYHD 105
Query: 105 NKSSSDNYSR-SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
N S+ ++ S A+ N + FL +L YLETP YLRK LF +H +LR G+
Sbjct: 106 NSPSAKTRNQMDSGASSPKAYTANSDPSHFLTHILSYLETPPYLRKHLFPIHPNLRTAGL 165
Query: 164 LPPLDAPHHLRKHEWAPFREGVTLKE-----------------------------NAPNS 194
LP LD PHHLR +EW FREG+ + + N
Sbjct: 166 LPSLDMPHHLRANEWCDFREGIVVSSPGRYSARNRHSTSKITQRSNDHQFHHHSSSTDNG 225
Query: 195 VGTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGY 252
T+VD GL++ +V+ + P RVTV + + A+ V S P+ E+G YWGY
Sbjct: 226 PATIVDTGLSQKIVLPNIHLPEHARVTVRFPQHESPYAEP----VHSSTPRVEAGYYWGY 281
Query: 253 KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS--------------------SDLT 292
VR ++SSVF C Y GGYD GTSE G +++ +
Sbjct: 282 YVRRCRSLSSVFTECPYDGGYDLSFGTSERGMSISTVLEEKPQQAHDRYEQDHPPDREQH 341
Query: 293 LPTFRHLLIAFGGLAGLEESIEEDDGLKR-----KNAREVFHSYFNTCPHQGSRTIRTEE 347
LP F+H+L+ FGG+AG+E ++ D L+ +A ++F + N P QGSRTIRTEE
Sbjct: 342 LPDFQHILVVFGGIAGIEAAVRNDPQLRDMAIRPSDADKLFDYWVNFLPGQGSRTIRTEE 401
Query: 348 AIFISLQYFQEPISRALRRVEYKSKPEDG 376
A+++ L R P DG
Sbjct: 402 AVWMGLTSLHGLFDGTHRLKRAYRYPHDG 430
>gi|114052745|ref|NP_001040575.1| uncharacterized protein LOC499770 [Rattus norvegicus]
gi|68534765|gb|AAH98818.1| Similar to LOC495800 protein [Rattus norvegicus]
Length = 242
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 102/224 (45%), Positives = 139/224 (62%), Gaps = 4/224 (1%)
Query: 145 QYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLN 204
+YLRKA F H L+F G+L PLD+PHH+R+ E + FREGV + G+LV+ G+
Sbjct: 19 RYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEFREGVVVDRPTKAGHGSLVNCGMK 78
Query: 205 KHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWGYKVRYAPNISS 262
K V +D+ LDPG+RVTV + + + + + VV P+ ++G+YWGY VR A +S+
Sbjct: 79 KEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSSQDPRTKAGLYWGYTVRLASCLSA 138
Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK 322
VF ++ GYD IGTSE G V S+ LP FRH L+ FGGL GLE ++ D L+
Sbjct: 139 VFAEAPFQDGYDLTIGTSERGSSVASTQ--LPNFRHALVVFGGLQGLEAGVDADPNLEVA 196
Query: 323 NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
+ +F Y NTCP QGSRTIRTEEAI ISL Q +++ R
Sbjct: 197 DPSVLFDFYVNTCPSQGSRTIRTEEAILISLAALQPGLTQGGAR 240
>gi|68067088|ref|XP_675514.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56494743|emb|CAH99398.1| conserved hypothetical protein [Plasmodium berghei]
Length = 392
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 193/367 (52%), Gaps = 58/367 (15%)
Query: 40 DNKKKKKRKRDQLNDDAPIEVPT--------VSVAVPGSIIDNTQSLELATRLAGQIARA 91
+N+KK K+ +++ + P +P VSVA+P +II+N + + + ++ +AR
Sbjct: 31 ENEKKIKKINEEIKEFIPKRIPNISRDLSVDVSVAIPATIINNKNDV-IKSYISSYLARI 89
Query: 92 VTIFRIDEVVVFDN------------------KSSSDNYSRSSA-ANRSNRSDENESGAA 132
TIF I ++ ++D+ K++ D+ S SS N +N+ E
Sbjct: 90 FTIFSISKIYIYDDGFELNRNERNEIKNSISQKTNKDHPSCSSTEKNENNKKYEYSYLCK 149
Query: 133 FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAP 192
+L LQYLETPQYLRK +F + + L+ G++ P+DAPHHLR EW PFREGV +K+N+
Sbjct: 150 YLHFNLQYLETPQYLRKHIFPITNFLKHSGLMSPVDAPHHLRSDEWLPFREGVVIKKNSN 209
Query: 193 NSVGTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTNRNLDADSPR-------QVVPPSKP 243
N +VDVGL +V+V+ + ++ G RVT+ ++ + + +V+ PS P
Sbjct: 210 N---IIVDVGLFSNVLVENIYNINVGTRVTILFNSDGFNNFKNKNTKNLFIGKVIHPSMP 266
Query: 244 KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAF 303
K +YWGY ++ NI+ VF D +IGTSE GD + + + +LI F
Sbjct: 267 KLYNIYWGYTIQILKNITDVF-----DIQVDCIIGTSERGDPIQDLKTQIKNAKSILIVF 321
Query: 304 GGLAGLEESIEEDDGLKRKNAR-------------EVFHSYFNTCPHQGSRTIRTEEAIF 350
G G+E+ ++ +K+ + F + NTCP Q SRTIRTEEAI
Sbjct: 322 GNRDGVEDLFIKEREIKKNKMYTGKKRIKVLNKILKKFDYFINTCPKQTSRTIRTEEAIS 381
Query: 351 ISLQYFQ 357
I+L FQ
Sbjct: 382 ITLSLFQ 388
>gi|83773413|dbj|BAE63540.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 719
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 142/374 (37%), Positives = 187/374 (50%), Gaps = 67/374 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF--------DNKSSSDNYS 113
T++VA+PGSI+ N S+E T LAG IARA+ +F +DEVVVF D NY
Sbjct: 346 TLTVAIPGSIVANAHSVEQKTLLAGIIARALAVFCVDEVVVFDDDENSPRDTYHGQGNYH 405
Query: 114 RSSAANRSNRSDENESGAA----------FLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
S + N+S A FL +L YLETP YLRK LF MH +LR G+
Sbjct: 406 EFPIDKTSGALNGNDSSAKRYTAYSDPSNFLAHILSYLETPPYLRKHLFPMHPNLRGAGL 465
Query: 164 LPPLDAPHHLRKHEWAPFREGVTL---------------------KENAPNS----VGTL 198
LP LD PHHLR +EW +REG+ + + ++P+S T+
Sbjct: 466 LPSLDMPHHLRANEWCEYREGIVVSSSDGGGQRRHSTQMSNYHNNRRHSPSSPTNFSATV 525
Query: 199 VDVGLNKHVVVDQVLDPG-VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRY 256
+D GL K VV+ + P RVTV + + Q V PS P+ E+G YWGY VR
Sbjct: 526 IDTGLPKKVVLPDIQLPEYARVTVRF---PDYGREHYAQPVHPSTPRSEAGYYWGYYVRR 582
Query: 257 APNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-------------SDLTLPTFRHLLIAF 303
++SSVF C + GGYD GTSE G V S P ++HLLI F
Sbjct: 583 CRSLSSVFTECPFDGGYDLSFGTSERGAPVYSVLEEDRQEHDNYDRRKIPPDYKHLLIVF 642
Query: 304 GGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
GG+AG+E + D L + A ++F + N P QGSRTIRTEEA+++ L
Sbjct: 643 GGVAGIEAVVHNDPQLCDMDIRATEAGKLFDYWVNLLPGQGSRTIRTEEAVWLGLTSLWG 702
Query: 359 PISRALR-RVEYKS 371
+ R R YKS
Sbjct: 703 LVEGTHRPRPSYKS 716
>gi|317151462|ref|XP_001824673.2| deoxyribose-phosphate aldolase [Aspergillus oryzae RIB40]
Length = 419
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 142/374 (37%), Positives = 187/374 (50%), Gaps = 67/374 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF--------DNKSSSDNYS 113
T++VA+PGSI+ N S+E T LAG IARA+ +F +DEVVVF D NY
Sbjct: 46 TLTVAIPGSIVANAHSVEQKTLLAGIIARALAVFCVDEVVVFDDDENSPRDTYHGQGNYH 105
Query: 114 RSSAANRSNRSDENESGAA----------FLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
S + N+S A FL +L YLETP YLRK LF MH +LR G+
Sbjct: 106 EFPIDKTSGALNGNDSSAKRYTAYSDPSNFLAHILSYLETPPYLRKHLFPMHPNLRGAGL 165
Query: 164 LPPLDAPHHLRKHEWAPFREGVTL---------------------KENAPNS----VGTL 198
LP LD PHHLR +EW +REG+ + + ++P+S T+
Sbjct: 166 LPSLDMPHHLRANEWCEYREGIVVSSSDGGGQRRHSTQMSNYHNNRRHSPSSPTNFSATV 225
Query: 199 VDVGLNKHVVVDQVLDPG-VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRY 256
+D GL K VV+ + P RVTV + + Q V PS P+ E+G YWGY VR
Sbjct: 226 IDTGLPKKVVLPDIQLPEYARVTVRF---PDYGREHYAQPVHPSTPRSEAGYYWGYYVRR 282
Query: 257 APNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-------------SDLTLPTFRHLLIAF 303
++SSVF C + GGYD GTSE G V S P ++HLLI F
Sbjct: 283 CRSLSSVFTECPFDGGYDLSFGTSERGAPVYSVLEEDRQEHDNYDRRKIPPDYKHLLIVF 342
Query: 304 GGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
GG+AG+E + D L + A ++F + N P QGSRTIRTEEA+++ L
Sbjct: 343 GGVAGIEAVVHNDPQLCDMDIRATEAGKLFDYWVNLLPGQGSRTIRTEEAVWLGLTSLWG 402
Query: 359 PISRALR-RVEYKS 371
+ R R YKS
Sbjct: 403 LVEGTHRPRPSYKS 416
>gi|159482322|ref|XP_001699220.1| hypothetical protein CHLREDRAFT_152464 [Chlamydomonas reinhardtii]
gi|158273067|gb|EDO98860.1| predicted protein [Chlamydomonas reinhardtii]
Length = 396
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 123/344 (35%), Positives = 175/344 (50%), Gaps = 51/344 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV +PG+++D S+E A AGQI RA F++DEVVV+D+ ++ + + +
Sbjct: 53 TLSVGLPGTLLDAPSSVEFAVATAGQIHRAAANFQVDEVVVYDDSLAATTFQPGQPLSAA 112
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALF-----SMHSSLRFVGMLPPLDAPHHLRKH 176
+ + G A L R+LQYL+TP +LR ++ H LR LP L PHH R
Sbjct: 113 SVT----PGTALLARILQYLDTPPHLRPNMYPEEVLQQHPELRLAASLPVLSPPHHARPS 168
Query: 177 E-WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG----TNRNLDA 231
W P+REGV LK A G+ VDVGL++ V ++ L G RVTV +G R A
Sbjct: 169 AVWMPYREGVVLKAEA--GTGSYVDVGLDRMVYLEHSLQQGARVTVHLGDMEPVTRFFAA 226
Query: 232 DSPRQ----VVPPSKPKES-GMYWGYKVRYAPNISSVFKNC--SYKGGYDHLIGTSEHGD 284
S VVPPS+P+E G+YWGY VR A + VFK+ S G YD IG+S G
Sbjct: 227 YSETMIVGTVVPPSEPRERLGLYWGYTVRIALGLQRVFKDAAVSKTGSYDLTIGSSPAGA 286
Query: 285 IVNSSDLTLPTFRHLLIAFG---------------------GLAG-------LEESIEED 316
+ L +P F+H L+ FG + G L++ +
Sbjct: 287 DTDPGSLVMPRFKHALMVFGGGSPVAAAAAVAEGGAKAKPKAVPGQAVPQQDLDQMVNRV 346
Query: 317 DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
+ K ++VF+ Y NT PH G+ IRTE+AI ++L Y P+
Sbjct: 347 PDWEHKTPQDVFNLYLNTTPHLGTLRIRTEDAIPVALSYLIMPL 390
>gi|70950204|ref|XP_744446.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56524401|emb|CAH78880.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 357
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/340 (35%), Positives = 185/340 (54%), Gaps = 50/340 (14%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----------------- 105
VSVA+P +II+N + + + L+ +AR TIF I + ++D+
Sbjct: 27 VSVAIPATIINNKNDV-IKSYLSSYLARIFTIFSISTIYIYDDGFELNRNERNENRNNIP 85
Query: 106 -KSSSDNYSRSSA-ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
K++++ + SS N +N+ E +L LQYLETPQYLRK +F + + L+ G+
Sbjct: 86 PKTNNERPTISSTEKNENNKKYEYSYLCKYLHFNLQYLETPQYLRKHIFPITNFLKHSGL 145
Query: 164 LPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQV--LDPGVRVTV 221
+ P+DAPHHLR EW PFREGV +K+N N +VDVGL +V+V+ + ++ G RVT+
Sbjct: 146 MSPVDAPHHLRSDEWLPFREGVVIKKNLNN---IIVDVGLFSNVLVENINNINVGTRVTI 202
Query: 222 AM---GTNRNLDADSPR----QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYD 274
G NR + ++ +++ PS PK +YWGY ++ N++ VF D
Sbjct: 203 LFDSEGFNRFKNKNTKNLFIGKIIHPSMPKLYNIYWGYNIQILKNLTDVF-----DVKVD 257
Query: 275 HLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLK-------RKNAREV 327
+IGTSE GD + + + R +LI FG G+E+ + +K +K A+ +
Sbjct: 258 CIIGTSERGDPIQDLKTQIKSARSILIVFGNRDGVEDLFIREREIKKNKTYTGKKRAKVL 317
Query: 328 ------FHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPIS 361
F + NTCP+Q SRTIRTEEAI I+L FQ +S
Sbjct: 318 NKILKKFDYFINTCPNQTSRTIRTEEAISITLSLFQSILS 357
>gi|396469502|ref|XP_003838421.1| hypothetical protein LEMA_P113570.1 [Leptosphaeria maculans JN3]
gi|312214989|emb|CBX94942.1| hypothetical protein LEMA_P113570.1 [Leptosphaeria maculans JN3]
Length = 436
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 131/370 (35%), Positives = 177/370 (47%), Gaps = 79/370 (21%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN-------KSSSDNYSR 114
TVS+A+PGS N + + T G+IARA +F +DE+VV+D+ K S+ Y+R
Sbjct: 54 TVSIALPGSWTLNAKKPDHKTMQVGRIARAAAVFSVDEIVVYDDDPCNIDPKVVSEKYTR 113
Query: 115 SSAANRSNR-----------SDENE---SGAAFLVRLLQYLETPQYLRK-------ALFS 153
+ + ++E+E + FL LL + E P +LR +LF
Sbjct: 114 AKGGRGKQQKKSKQEILDSIAEEDELWQNPDQFLYHLLAFAECPPHLRYDRDQPGLSLFK 173
Query: 154 MHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL-KENAPNSVGT--------------- 197
H +L++ G LP +D PHHLR HEW +REGV + AP T
Sbjct: 174 EHQNLKWTGCLPSMDMPHHLRSHEWCQYREGVVVGPAPAPTPSSTAKSSKKAKMAGDGGL 233
Query: 198 ---LVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR-----------QVVP 239
V GL V V + ++ G+R TV TN P P
Sbjct: 234 EYAYVKCGLPYPVRVPIPPESPIEEGMRTTVRF-TNATEPPSWPHLSPEACTNLSATACP 292
Query: 240 PSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLP 294
S P+E SG YWGY VR AP++S+VF C Y GYD+ IGTSE G I+ P
Sbjct: 293 SSTPREESGYYWGYTVRRAPSLSAVFSECEYPAGYDYTIGTSERGVPVHSILPGGAKPCP 352
Query: 295 T------FRHLLIAFGGLAGLEESIEED-----DGLKRKNAREVFHSYFNTCPHQGSRTI 343
T F HLLI FGG+AG+E ++ D GL + A E+F ++ N P QGSRTI
Sbjct: 353 TTRAPASFNHLLIVFGGVAGIEPAVANDPELVAKGLGKATAHELFDAWVNLVPGQGSRTI 412
Query: 344 RTEEAIFISL 353
RTEEA+ L
Sbjct: 413 RTEEAVEFGL 422
>gi|449019335|dbj|BAM82737.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 380
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 144/383 (37%), Positives = 188/383 (49%), Gaps = 99/383 (25%)
Query: 36 SSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIF 95
+S+C ++K R T++VA+PGS++ N Q+ L T LAGQIARA+ IF
Sbjct: 8 ASNCTKSEQKGRSW------------TLAVALPGSVLANAQTPALQTYLAGQIARALAIF 55
Query: 96 RIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAF---LVRLLQYLETPQYLRKALF 152
+DEV++F + R++ A + + S AF L RLLQY E PQYLR+ALF
Sbjct: 56 EVDEVIIFAERV------RATIAG----TVSSPSSTAFDTHLARLLQYAECPQYLRQALF 105
Query: 153 SMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGT-----LVDVGLNKHV 207
L++VG+L PLD PHH R+HE P+REGV + AP++ T V VG + V
Sbjct: 106 PHTPELQYVGLLNPLDTPHHPRRHEGTPYREGVVVP--APDTGATSDRSRYVHVGWDAPV 163
Query: 208 VV--DQVLDPGVRVTVAM-------------GTNRNLDADS------------PRQV-VP 239
V D L G RVTV G+NR DS R V V
Sbjct: 164 AVDTDMPLPVGQRVTVRFEKLERSGHYVAEQGSNRKRKRDSHPSLHMQGCWTWTRAVMVA 223
Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV----------NS 288
+P+E G+YWGYKVR A N+ G D IGTSE G + +
Sbjct: 224 RDEPRERLGLYWGYKVRLAANLVQALST----SGADVKIGTSERGTDLFATLWGSSAERA 279
Query: 289 SDLT----------------LPT--FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHS 330
+D T +P FR LL+AFGG+ GLE S D K +F +
Sbjct: 280 TDSTAAETEIDGASARRASFIPPGGFRKLLVAFGGIHGLEASYGSPDVAK------LFDA 333
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
Y N CPHQGSRTIRTEEA+ + L
Sbjct: 334 YVNACPHQGSRTIRTEEALLVVL 356
>gi|320589534|gb|EFX01995.1| deoxyribose-phosphate aldolase [Grosmannia clavigera kw1407]
Length = 341
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 108/302 (35%), Positives = 168/302 (55%), Gaps = 18/302 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+P SI+ + + E ++ARA+ F IDE+V++D+ S SR A +
Sbjct: 38 TVSVAIPTSILSDCTTSEQRINAPSRVARALAAFSIDELVIYDD---SPMESRPKAVDHD 94
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ + + + +L LL Y+E P ++RKALF +H +LR G+ P L+ PHH + E P+
Sbjct: 95 GYTGDTDP-SHYLFHLLSYIEAPPFMRKALFPLHPNLRLQGLFPNLEMPHHPHRDEHLPY 153
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
EG+T+ TLV++GL K V + + + RVT+ D ++ V P+
Sbjct: 154 MEGLTVAGKPDAGHKTLVELGLRKPVSIKEEIPVKTRVTLRFPRESFEDGEA----VDPA 209
Query: 242 KPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
P+ E+G YWGY++R ++S+VF +Y+ GYD IGTSE G + + F
Sbjct: 210 APRTEAGYYWGYQLRQCSSLSAVFTESAYENGYDVSIGTSERGLPTSQVFPRATTGTLDF 269
Query: 297 RHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
+HL++ GG GLE + D G+ + N RE F + N P+QG+R IRT+EA+FI
Sbjct: 270 QHLIVVLGGPRGLEYAAMNDPDLAGVGITKANVRETFDHWVNVLPNQGTRNIRTDEAVFI 329
Query: 352 SL 353
+L
Sbjct: 330 AL 331
>gi|452821207|gb|EME28240.1| hypothetical protein Gasu_42420 [Galdieria sulphuraria]
Length = 269
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/267 (38%), Positives = 153/267 (57%), Gaps = 26/267 (9%)
Query: 44 KKKRKRDQ------LNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRI 97
KKKR + Q ++ + + T+SVA+P SI++N Q +L T + GQ+AR +TIF++
Sbjct: 6 KKKRAKVQPKLTKGIHLGPQLTLGTLSVAIPCSILENIQKAQLQTYVIGQLARTLTIFKV 65
Query: 98 DEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
DE++++ ++ +E AFL LL+YLETPQYLR+ LF M +
Sbjct: 66 DEIILY---------------SQDRNKEEAAPSIAFLKLLLEYLETPQYLRRHLFPMSEN 110
Query: 158 LRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVV--DQVLDP 215
L++ G+L P+DAPHHLRK EW P+REGV L+E LVDVGL++ V +
Sbjct: 111 LKYAGLLNPIDAPHHLRKDEWLPWREGVILEEKQGGVY--LVDVGLDRPATVKAQHNVKK 168
Query: 216 GVRVTVAMGTNRNLDADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYD 274
G R+TV+ ++ + + P+E G+YWGY+VR A + ++ C Y+GGYD
Sbjct: 169 GERITVSFDQSKETSDPLMGSLCFKATPREKGGLYWGYQVRIATALCKTWEECPYEGGYD 228
Query: 275 HLIGTSEHGDIVNSSDLTLPTFRHLLI 301
+IGTSE G ++ L RHLL+
Sbjct: 229 KVIGTSERGKSIDDPSFRLTPCRHLLL 255
>gi|399216712|emb|CCF73399.1| unnamed protein product [Babesia microti strain RI]
Length = 367
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 125/338 (36%), Positives = 171/338 (50%), Gaps = 61/338 (18%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+P SI+ N Q+ EL + G IAR IF I E+V+F++ S N+
Sbjct: 38 TVSIAIPISILANCQTEELRAYVVGNIARTCAIFGIAEIVLFNDFESLSENELKRYKNKP 97
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFS-------MHSSLRFVGMLPPLDAPHHLR 174
+ + S + +L+YLETPQYLR+++F + L+F G+L PLDAPHH+R
Sbjct: 98 HM--QPHSLMNYFTLILEYLETPQYLRRSIFDYEKGKLVCYLDLKFAGLLNPLDAPHHMR 155
Query: 175 KHEWAPFR-------EGVTLKENAPNSVGTLVDVGLNKHVVVD------------QVLDP 215
+EW PFR +G+ L + G VD GL V+VD QV++
Sbjct: 156 ANEWLPFRYINHIYRQGILLPGYSME--GGSVDCGLPVPVLVDNKKVNIENVKGGQVIED 213
Query: 216 G-----VRVTVAMGTN--------RNLDADSPRQV------VPPSKPKE-SGMYWGYKVR 255
+VTV + + D +V VP S P G+YWGY VR
Sbjct: 214 HKSNRYQQVTVQLDIKSLSLCAKLKYTDGQCQEEVQFSGKIVPNSLPTTLMGLYWGYTVR 273
Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEE 315
+F + +Y+ YD +IGTSEHG+++ S LP F HLLI FGGL GLE I++
Sbjct: 274 QCQTFEEIFTSVNYE--YDLVIGTSEHGEVI-SGQFKLPKFEHLLIVFGGLKGLEVVIDK 330
Query: 316 DDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ + F Y N C HQ SRTIRTEEAI I L
Sbjct: 331 PE--------KKFKYYINICEHQHSRTIRTEEAIMICL 360
>gi|429328463|gb|AFZ80223.1| hypothetical protein BEWA_030760 [Babesia equi]
Length = 388
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 125/374 (33%), Positives = 184/374 (49%), Gaps = 80/374 (21%)
Query: 34 GDSSSCDNKKKKKRKRD------QLNDDAPIEVP--------TVSVAVPGSIIDNTQSLE 79
G+ + + K+ K +K D QL + P ++P T+SVA+P S++ N QS E
Sbjct: 32 GEVAKPEGKQAKHKKEDDGLSSIQLFKEIPQKIPFMGRPREHTISVALPASLVQNVQSEE 91
Query: 80 LATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQ 139
L + G IAR +TI+ I+EVV++++ +E L+
Sbjct: 92 LRAYVIGNIARTLTIYGINEVVLYND-----------------MGNEGTKWQDHFALNLR 134
Query: 140 YLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL---KENAPNSVG 196
YLETPQYLR+ L+ M+ +L++ G+ PLDAPHHLR +EW P+REGV K
Sbjct: 135 YLETPQYLRRYLYPMNYALKYAGLQNPLDAPHHLRSNEWLPYREGVIKLIPKSQCGKHNK 194
Query: 197 TLVDVGLNKHVV------------VDQVLDPG-----VRVTV-----------------A 222
+ GL ++V VD V D G RVT+
Sbjct: 195 FFAECGLFRNVEIKNISALSSIYGVDHVTDDGDEEIYQRVTIRLDGSSLTQCKQYWKNNK 254
Query: 223 MGTNRNLDADSPRQVVPPSKP-KESGMYWGYKVRYAPNISSVFKNCSYK--GGYDHLIGT 279
+G+ + + +V P +P + +G+YWGY VR A + V C + G YD+ IGT
Sbjct: 255 LGSTDSGNYSLSGYIVSPEEPLQRAGLYWGYTVREADSFEQVLVECPFNDSGEYDYKIGT 314
Query: 280 SEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQG 339
SE G++ + LP +++LL+ FG + GLE +E N + F Y+N C HQ
Sbjct: 315 SERGELF-GPNTKLPKYKNLLVVFGPVNGLEHIME--------NPSDKFDKYYNFCLHQK 365
Query: 340 SRTIRTEEAIFISL 353
SRTIRTEEA+ ISL
Sbjct: 366 SRTIRTEEALSISL 379
>gi|389585659|dbj|GAB68389.1| hypothetical protein PCYB_132630 [Plasmodium cynomolgi strain B]
Length = 475
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 122/360 (33%), Positives = 178/360 (49%), Gaps = 74/360 (20%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD------------ 110
V VA+P SII+N + + + L +AR T+F + +V ++D++ +++
Sbjct: 121 VCVAIPSSIINNRSDV-IKSYLTSYLARIFTVFSVSKVYIYDDQMANERCEQQLNSGNST 179
Query: 111 --------------NY----------SRSSAANRSNRSDENESG-------AAFLVRLLQ 139
NY + +++ S++ E E +L LQ
Sbjct: 180 HGTAHPHKTHEGDQNYHIDRVGSKHQGKRPSSHSSSQEGEGEKSTPQYSYLCQYLHYNLQ 239
Query: 140 YLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLV 199
YLETPQYLRK LF + L+ G++ P+DAPHHLR EW PFREGV +K+ S G +V
Sbjct: 240 YLETPQYLRKHLFPITHFLKHSGIMNPVDAPHHLRSDEWLPFREGVVIKK---KSNGVIV 296
Query: 200 DVGLNKHVVVDQV--LDPGVRVTVAMGTN-------RNLDADSPRQVVPPSKPKESGMYW 250
DVGL H ++ V +D G RVTV N RN +A +++ PS PK +YW
Sbjct: 297 DVGLFAHAHIENVTHVDIGTRVTVLFHPNSFLLFQRRNGNALFSGKLINPSTPKLYNLYW 356
Query: 251 GYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLE 310
GY V +S VF G D+++GTSE G + + + +LI FG GLE
Sbjct: 357 GYSVELLKRLSDVF-----DLGVDYIVGTSERGHCMGDAADAIRGVSSILIVFGNKQGLE 411
Query: 311 ESIEEDDGLKRKNAREV-------------FHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
+ + ++ ++K + V FH + NTCPHQ SRTIRTEEAI I+L F
Sbjct: 412 DLLIKEREERKKKSYAVEKRSKVLNKMLKKFHLFVNTCPHQTSRTIRTEEAIAITLSLFH 471
>gi|341900584|gb|EGT56519.1| hypothetical protein CAEBREN_07270 [Caenorhabditis brenneri]
Length = 378
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 110/325 (33%), Positives = 175/325 (53%), Gaps = 20/325 (6%)
Query: 45 KKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD 104
KK K ++++ P T+S+AVPG ++N QS EL T +AGQIARA T++R+DE++++D
Sbjct: 53 KKAKIEKVDHTPPF---TISIAVPGQFLNNAQSSELRTYMAGQIARAATLYRVDEIIIYD 109
Query: 105 NKSSSDNYSRSSAANRSNR-----SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLR 159
N + ++ N + + ++ N G +L ++L+YLE PQYLRK LF + L+
Sbjct: 110 ESCRMTNEAVNAYYNGTWQGTLLPAETNYEGCFYLAKILEYLECPQYLRKDLFPIQKPLK 169
Query: 160 FVGMLPPLDAPHHLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ---VLDP 215
+ G+L PLDA HHL+ E FREGV LK+ + G + ++GL+K +D L P
Sbjct: 170 YAGLLNPLDAQHHLKSDEKTFRFREGVVLKKRSKEGRGPICNIGLDKEFEIDSDAVSLPP 229
Query: 216 GVRVTVAMGTNRNLDADSPRQVVP--PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGY 273
RVTV + N N R + + +ESG YWGY VR + V + +
Sbjct: 230 YTRVTVEI-KNLNEQCKLYRGSITSGAAVTRESGQYWGYSVRLMTGLQKVLEG----KKF 284
Query: 274 DHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFN 333
D + G S G + + D+ + +L+ FGG+AG++ ++E ++ + + A + F
Sbjct: 285 DVVAGVSPRGKLASQMDVCILNKPKILLVFGGVAGVDAAVESEELAEWRRAEDAFDVLIR 344
Query: 334 TCPH-QGSRTIRTEEAIFISLQYFQ 357
T GSR+ R EE + L Q
Sbjct: 345 TTSLPNGSRSERVEENVLSVLAQVQ 369
>gi|449669826|ref|XP_002157979.2| PREDICTED: uncharacterized protein LOC100206864 [Hydra
magnipapillata]
Length = 767
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 131/218 (60%), Gaps = 12/218 (5%)
Query: 85 AGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETP 144
A IARA IF +DE+++F N+S S + + SD N L R+LQYLE P
Sbjct: 557 AVLIARAAVIFNVDEIIIF-NESLSKKTNEGLFKGSTKSSDPN----ILLARILQYLECP 611
Query: 145 QYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLN 204
QYLRK F +H L++ G+L PLD PHH+R + +REGVTL G+ V+VG+
Sbjct: 612 QYLRKEFFPVHQDLKYAGILNPLDCPHHMRADDIVAYREGVTLNRPVKKGEGSFVNVGIM 671
Query: 205 KHVVVDQVLDPGVRVTVAMGTNRNLDADSPR----QVVPPSKPK-ESGMYWGYKVRYAPN 259
K V +D+ L P +RVTV + + R +VV PS P+ E+G+YWGY VR A N
Sbjct: 672 KDVKIDKHLKPNIRVTVKLDPYEQETIEKKRYLTGRVVTPSTPREEAGIYWGYSVRLAEN 731
Query: 260 ISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
SSVF NC YKGGYD IGTSE+G +NS + +P+++
Sbjct: 732 FSSVFTNCPYKGGYDLTIGTSENGGDINS--IEIPSYK 767
>gi|156086160|ref|XP_001610489.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797742|gb|EDO06921.1| conserved hypothetical protein [Babesia bovis]
Length = 373
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 144/407 (35%), Positives = 195/407 (47%), Gaps = 104/407 (25%)
Query: 3 NKKKRGGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIE--- 59
KKKR +L++ A E + Q E + N S D + R ++ P +
Sbjct: 12 TKKKRKDAR-KLRKEAPQELRKKQKESTSKNTIPKSTDGEMSMVRLLREIPQRIPFQGKP 70
Query: 60 -VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
T SVA+P SI+ N Q+ EL + G IAR++TI+ I EVV++++
Sbjct: 71 SKKTFSVALPVSILQNIQTDELRAYIIGNIARSLTIYGIHEVVIYND------------- 117
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
LETPQYLRK +F + LR G+ PLDAPHHLR +EW
Sbjct: 118 ----------------------LETPQYLRKYMFPIVPHLRNTGLQNPLDAPHHLRSNEW 155
Query: 179 APFREGVTLKE--NAPNSVGTL---VDVGLNKHVVVD--QVLDP--GV------------ 217
P+REGV E +A S+G L VD GL + VD + L GV
Sbjct: 156 LPYREGVVKLEEKSASLSMGNLDMMVDCGLFGRIKVDNPEALSEIYGVERFCLDNEDSVY 215
Query: 218 -RVTVAMGTNRNLDADSPRQ------------------------VVPPSKPKE-SGMYWG 251
RVTV LDA+S R+ +VPP +P+ G+YWG
Sbjct: 216 QRVTVM------LDANSLRECKRRWKEQLNGPSYSDSCTMLSGKLVPPDEPQSLVGLYWG 269
Query: 252 YKVRYAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGL 309
+ VR A + VF +C + G YD+ IGTSE GDI D+++P F+H+L+ G + GL
Sbjct: 270 FVVREAESYEGVFSDCPFSTDGKYDYTIGTSERGDIY-PKDVSVPHFQHMLMVLGPIKGL 328
Query: 310 EESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
ESI+E N RE +Y N CP+Q SRTIRTEEA+ I L F
Sbjct: 329 -ESIKE-------NPREDLDAYVNFCPNQRSRTIRTEEALTICLSQF 367
>gi|221060084|ref|XP_002260687.1| hypothetical protein, conserved [Plasmodium knowlesi strain H]
gi|193810761|emb|CAQ42659.1| hypothetical protein, conserved [Plasmodium knowlesi strain H]
Length = 460
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 136/425 (32%), Positives = 200/425 (47%), Gaps = 95/425 (22%)
Query: 13 ELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAP--IE--VPTVS---- 64
E+K AA E + + +G KK ++ + LN+D IE +P VS
Sbjct: 47 EIKAAALREGDDPSTQAHQTDG------TKKSPEKAKAVLNEDIKKCIEKRIPNVSRNLQ 100
Query: 65 ----VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR------ 114
VA+P SII+N + + + L +AR T+F + +V ++D++ +D + R
Sbjct: 101 VDLYVAIPSSIINNRSDV-IKSYLTSYLARIFTVFSVSKVYIYDDQMMNDRFERREQQLN 159
Query: 115 ------------------------------------SSAANRSNRSDENESGAAFLVRLL 138
S ++++ D++ ++L + L
Sbjct: 160 SGNSTHGTSHRQTTHEGDKSYHNGRVGSMHLGKRPSSHSSSQEGEGDKSNPEYSYLCQYL 219
Query: 139 ----QYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNS 194
QYLETPQYLRK LF + L+ G++ P+DAPHHLR EW PFREGV +K+ S
Sbjct: 220 HYNLQYLETPQYLRKHLFPITHFLKHSGIMNPVDAPHHLRSDEWLPFREGVVVKK---KS 276
Query: 195 VGTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTN-------RNLDADSPRQVVPPSKPKE 245
G +VDVGL ++ V +D G RVTV N RN + +++ PS PK
Sbjct: 277 NGVIVDVGLFAPAHIENVNHVDIGTRVTVLFHPNSFVLFQRRNANVLFSGKLINPSTPKL 336
Query: 246 SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGG 305
+YWGY V +S VF D+++GTSE G + + + R +LI FG
Sbjct: 337 YNLYWGYSVELLKRLSDVF-----DIDVDYIVGTSERGHSMGDATDAIRGVRSILIVFGN 391
Query: 306 LAGLEESI--EEDDGLKRKNARE-----------VFHSYFNTCPHQGSRTIRTEEAIFIS 352
GLE+ + E ++ K+ A E FH + NTCPHQ SRTIRTEEAI I+
Sbjct: 392 KQGLEDLLIKEREERKKKSYAMEKRNKVLSKMLKKFHLFVNTCPHQTSRTIRTEEAIAIT 451
Query: 353 LQYFQ 357
L F
Sbjct: 452 LSLFH 456
>gi|451851982|gb|EMD65277.1| hypothetical protein COCSADRAFT_35351 [Cochliobolus sativus ND90Pr]
Length = 443
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 128/360 (35%), Positives = 179/360 (49%), Gaps = 69/360 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-------- 113
TV++A+PGS N + + T G+IARA I+ +DE+VV+D+ S+ +++
Sbjct: 73 TVTIALPGSWTLNAKKPDHKTIQVGRIARAAAIYCVDEIVVYDDDPSNIDHNVVEQKYIR 132
Query: 114 --RSSAANRSNRSDENESGA---AFLVRLLQYLETPQYLRK-------ALFSMHSSLRFV 161
R+ + S+++E+ FL LL + E P +LR ++F H +L++V
Sbjct: 133 KGRTKQQVLDSISEQDEAWQNPDQFLYHLLAFAECPPHLRYDREDPRLSIFKEHQNLKWV 192
Query: 162 GMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGT-----------LVDVGLNKHVVV- 209
G LP +D PHHLR HEW +REGV L +AP G+ V GL + V
Sbjct: 193 GNLPSMDMPHHLRSHEWCQYREGVFLG-SAPPPGGSKSKKVAREDYAYVKCGLPYPIRVP 251
Query: 210 ---DQVLDPGVRVTVAMGTNR------NL---DADS-PRQVVPPSKPKE-SGMYWGYKVR 255
+ ++ G+R TV NL D DS S P+E G YWGY VR
Sbjct: 252 IPKEAPVEQGMRTTVRFANQEAPRGWPNLSQQDCDSLEATACAASLPREEGGYYWGYNVR 311
Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS----------SDLTLPT-------FRH 298
A ++ +VF C Y GGYD IGTSE G V S D++ T F H
Sbjct: 312 RAASLGAVFSECEYPGGYDMSIGTSERGQSVYSFLSNTGGVSPGDMSSSTVSGAPESFNH 371
Query: 299 LLIAFGGLAGLEESIEED-----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
LLI FGG AG+E ++ D GL + A ++F + N P QGSRTIRTEEA+ I L
Sbjct: 372 LLIVFGGQAGIEPAVASDPELTSKGLDKSTAHQLFDVWVNLVPRQGSRTIRTEEAVIIGL 431
>gi|451997668|gb|EMD90133.1| hypothetical protein COCHEDRAFT_1178547 [Cochliobolus
heterostrophus C5]
Length = 443
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 132/419 (31%), Positives = 193/419 (46%), Gaps = 84/419 (20%)
Query: 15 KEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPT------------ 62
+ A + E + +L + DS KK+K ++Q +D PT
Sbjct: 17 RNAKSNEKENGKKRKTLGDNDSPM---KKRKGDGQEQSVEDIDTSRPTAVFNPTKGRNWT 73
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS-------DNYSRS 115
V++A+PGS N + + T G+IARA I+ +DE+VV+D+ S+ Y R
Sbjct: 74 VTIALPGSWTLNAKKPDHKTIQVGRIARAAAIYCVDEIVVYDDDPSNIDSKVIDQRYIRK 133
Query: 116 SAANR------SNRSDENESGAAFLVRLLQYLETPQYLRK-------ALFSMHSSLRFVG 162
+ S + + ++ FL LL + E P +LR ++F H +L++VG
Sbjct: 134 GRTKQQVLDSISEQDEAWQNPDQFLYHLLAFAECPPHLRYDREDPRLSIFKEHQNLKWVG 193
Query: 163 MLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGT-----------LVDVGLNKHVVV-- 209
LP +D PHHLR HEW +REGV + +AP GT V GL + V
Sbjct: 194 NLPSMDMPHHLRSHEWCQYREGVFVG-SAPAPGGTKSKKVAQEDYAYVKCGLPYPIRVPI 252
Query: 210 --DQVLDPGVRVTVAMG-----------TNRNLDADSPRQVVPPSKPKESGMYWGYKVRY 256
+ ++ G+R T+ + ++ D +E G YWGY VR
Sbjct: 253 PKEAPVEQGMRTTIRFANYEAPKGWPNLSQQDCDGLEATACAASLPREEGGYYWGYNVRR 312
Query: 257 APNISSVFKNCSYKGGYDHLIGTSEHGDIV-------------NSSDLTLP----TFRHL 299
A ++ +VF C Y GGYD IGTSE G V N S T+ +F HL
Sbjct: 313 AASLGAVFSECEYPGGYDMSIGTSERGQSVYSFLSNTGGVSPGNMSGSTVSGAPESFNHL 372
Query: 300 LIAFGGLAGLEESIEED-----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
LI FGG AG+E ++ D GL + A ++F ++ N P QGSRTIRTEEA+ I L
Sbjct: 373 LIVFGGQAGIEPAVASDPELTSKGLDKSTAHQLFDAWVNLVPRQGSRTIRTEEAVIIGL 431
>gi|82594085|ref|XP_725278.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480219|gb|EAA16843.1| Uncharacterized ACR, COG2106, putative [Plasmodium yoelii yoelii]
Length = 502
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 177/345 (51%), Gaps = 59/345 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----KSSSDNYSRSSAA 118
VSVA+P +II+N + + + L+ +AR TIF I ++ ++D+ + N +++ +
Sbjct: 163 VSVAIPATIINNKNDV-IKSYLSSYLARIFTIFSISKIYIYDDGFEINCNERNEIKNTIS 221
Query: 119 NRSNRSD------------------------ENESGAAFLVRLLQYLETPQYLRKALFSM 154
++N+ E +L LQYLETPQYLRK LF +
Sbjct: 222 QKTNKEHFSCSSRENNEKNENNKNNKNNKKFEYSYLCKYLHFNLQYLETPQYLRKHLFPI 281
Query: 155 HSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQV-- 212
+ L+ G++ P+DAPHHLR EW PFREGV +K+N+ NS+ +VDVGL +V+V+ +
Sbjct: 282 TNFLKHSGIMSPVDAPHHLRSDEWLPFREGVVIKKNS-NSI--IVDVGLFSNVLVENIYN 338
Query: 213 LDPGVRVTVAMGTNRNLDADSPR-------QVVPPSKPKESGMYWGYKVRYAPNISSVFK 265
++ G RVT+ + + + +V+ PS PK +YWGY + N++ VF
Sbjct: 339 INIGTRVTILFDSYAFNNFKNKNTKNLFIGKVIHPSMPKLYNIYWGYTIHILKNLTDVF- 397
Query: 266 NCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAR 325
D +IGTSE GD + + + + +LI FG G+E+ + +K+
Sbjct: 398 ----DIQVDCIIGTSERGDPIQDLKTQIKSAKSILIVFGNRDGVEDLFIREREMKKNKMY 453
Query: 326 -------------EVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
+ F + NTCP Q SRTIRTEEAI I+L FQ
Sbjct: 454 TGKKRIKVLNKILKKFDYFINTCPKQTSRTIRTEEAISITLSLFQ 498
>gi|268576392|ref|XP_002643176.1| Hypothetical protein CBG24142 [Caenorhabditis briggsae]
Length = 433
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 104/308 (33%), Positives = 165/308 (53%), Gaps = 17/308 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AVPG ++N QS EL T +AGQIARA T++R+DE++++D N S ++ N +
Sbjct: 67 TISIAVPGQFLNNAQSSELRTYMAGQIARAATLYRVDEIIIYDESCRMTNESVNAYYNGN 126
Query: 122 NR-----SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
++ N G +L ++L+YLE PQYLR+ LF + L++ G+L PLDA HHL+
Sbjct: 127 WEGSLLPAESNYEGCFYLAKILEYLECPQYLRRDLFPIQKPLKYAGLLNPLDAQHHLKSD 186
Query: 177 EWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ---VLDPGVRVTVAMGTNRNLDAD 232
E FREGV LK+ + G + ++GL+K +D L P RVTV + N N
Sbjct: 187 EKTFKFREGVVLKKRSKEGRGPICNIGLDKEFEIDTDTITLPPYTRVTVEI-KNLNEQCK 245
Query: 233 SPRQVVPPSK--PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
R + +E+G YWGY VR + V + +D + G S G + + D
Sbjct: 246 LYRGSITSGTRVTRETGQYWGYSVRLMTGLQKVLEG----KKFDIVAGVSPRGKLASQVD 301
Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPH-QGSRTIRTEEAI 349
+ + +L+ FGG++G++ ++E ++ + + A + F T GSR+ R EE +
Sbjct: 302 VCILNKPKILLVFGGVSGVDAAVESEELAEWRRAEDAFDVLIRTTSLPNGSRSERVEENV 361
Query: 350 FISLQYFQ 357
L Q
Sbjct: 362 LSVLAQMQ 369
>gi|346323916|gb|EGX93514.1| deoxyribose-phosphate aldolase 2 [Cordyceps militaris CM01]
Length = 402
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 171/335 (51%), Gaps = 45/335 (13%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA-NR 120
T+SVAVPGSI+ N + E ++AR++ IF +DEV+V+D+ +++ SRSS R
Sbjct: 60 TISVAVPGSILHNLPTAEQRLATVARLARSLAIFCVDEVIVYDDDAAAVPSSRSSTTITR 119
Query: 121 SNRSDENESG----AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR-K 175
+ S + +G F+ ++L +LE P ++R+ LF +H++LR +LP +D PHH R K
Sbjct: 120 LDGSPDAYTGDSDPCHFVAQVLSFLECPPFMRRRLFPLHANLRHTALLPAVDMPHHPRPK 179
Query: 176 HEWAPFREGVTLKENAPNSVGT----LVDVG-LNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
W P+ EGVT+ A G +VD G V V + P RVT+ +GT+
Sbjct: 180 AAWLPYVEGVTVPPAAEEEGGGGSGSVVDTGPCGGRVCVPDDIPPHTRVTLHLGTHAA-- 237
Query: 231 ADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---- 286
A P P + E G YWGY VR ++S VF C + GYD +GTSE G +
Sbjct: 238 AQQPVCADPHAPRAEGGYYWGYAVRRCASLSRVFTECPHAAGYDLSVGTSERGTPLAQVM 297
Query: 287 -----------------------NSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGL---- 319
+ + P+F+HL++ FGG G+E + + D L
Sbjct: 298 AEHQQKRAQQKRRGYGHHQADSDDGAAAATPSFQHLVVVFGGPRGIELAADHDPALAAMH 357
Query: 320 -KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ RE+F + + P QGSR I T+EA+ I+L
Sbjct: 358 IRGPRTRELFDHWVDVLPGQGSRAIATDEAVPIAL 392
>gi|417401122|gb|JAA47457.1| Hypothetical protein [Desmodus rotundus]
Length = 448
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 128/205 (62%), Gaps = 4/205 (1%)
Query: 161 VGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVT 220
G+L PLD+PHH+R+ E + FREGV + G+ V+ G+ K V +D+ L+PG+RVT
Sbjct: 241 TGLLNPLDSPHHVRQEEESEFREGVVVDRPTRPGRGSFVNCGMKKEVQIDKNLEPGLRVT 300
Query: 221 VAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIG 278
V + + + + R +VV P+ ++G+YWGY VR A +S+VF ++ GYD IG
Sbjct: 301 VRLNQEQLPECKTYRGKVVSSQDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIG 360
Query: 279 TSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQ 338
TSE G V S+ L P+FRH L+ FGGL GLE ++ D L+ +F Y NTCP Q
Sbjct: 361 TSERGSDVASAQL--PSFRHALVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYLNTCPGQ 418
Query: 339 GSRTIRTEEAIFISLQYFQEPISRA 363
GSRTIRTEEAI ISL Q +++A
Sbjct: 419 GSRTIRTEEAILISLAALQPSLTQA 443
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 59/101 (58%), Gaps = 4/101 (3%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF ++E+VVFD +
Sbjct: 74 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVEEIVVFDEEGQDAKTVEGEFRGVG 133
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
+ L R+LQYLE PQYLRKA F H L+F G
Sbjct: 134 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAG 170
>gi|218194448|gb|EEC76875.1| hypothetical protein OsI_15076 [Oryza sativa Indica Group]
gi|218194449|gb|EEC76876.1| hypothetical protein OsI_15077 [Oryza sativa Indica Group]
gi|222628481|gb|EEE60613.1| hypothetical protein OsJ_14027 [Oryza sativa Japonica Group]
Length = 212
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 86/122 (70%), Positives = 98/122 (80%), Gaps = 11/122 (9%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDEVVVFD+ SS +N
Sbjct: 68 TVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEVVVFDSNSSVEN---------- 117
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
D+ ESGA FLVR+LQYLETPQYLR+ LF MH++L+FVG+LPPLDAPHHLRKHEW+ F
Sbjct: 118 -SGDDVESGARFLVRILQYLETPQYLRRRLFPMHNNLKFVGLLPPLDAPHHLRKHEWSEF 176
Query: 182 RE 183
RE
Sbjct: 177 RE 178
>gi|308463009|ref|XP_003093783.1| hypothetical protein CRE_24777 [Caenorhabditis remanei]
gi|308249389|gb|EFO93341.1| hypothetical protein CRE_24777 [Caenorhabditis remanei]
Length = 378
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 172/325 (52%), Gaps = 20/325 (6%)
Query: 45 KKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD 104
KK K ++++ P T+S+AVPG ++N QS EL T +AGQIARA T++R+DE++++D
Sbjct: 53 KKAKIEKVDHTPPF---TISIAVPGQFLNNAQSSELRTYMAGQIARAATLYRVDEIIIYD 109
Query: 105 NKSSSDNYSRSSAANRSNR-----SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLR 159
+ + ++ N + + ++ N G +L ++L+YLE PQYLRK LF + L+
Sbjct: 110 ESCRMTDEAVNAYYNGTWQGNLLPAENNYEGCFYLAKILEYLECPQYLRKDLFPIQKPLK 169
Query: 160 FVGMLPPLDAPHHLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ---VLDP 215
+ G+L PLDA HHL+ E FREGV LK+ + G + +GL+K +D L P
Sbjct: 170 YAGLLNPLDAQHHLKYDEKTFRFREGVVLKKRSKEGRGPICSIGLDKEFEIDSDAVSLPP 229
Query: 216 GVRVTVAMGTNRNLDADSPRQVVP--PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGY 273
RVTV + N R + + +E+G YWGY VR + V + +
Sbjct: 230 YTRVTVEI-KNLTEQCKLYRGSITNGAAVTRETGQYWGYSVRLLTGLQKVLEG----KKF 284
Query: 274 DHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFN 333
D + G S G + ++ + +L+ FGG+AG++ ++E ++ + + A + F
Sbjct: 285 DMVAGVSPRGKLATEMNVCILNKPKILLVFGGVAGVDAAVESEELAEWRRAEDAFDMLIR 344
Query: 334 TCPH-QGSRTIRTEEAIFISLQYFQ 357
T GSR+ R EE + L Q
Sbjct: 345 TTSLPNGSRSERVEENVLSVLAQVQ 369
>gi|124809062|ref|XP_001348481.1| conserved protein, unknown function [Plasmodium falciparum 3D7]
gi|23497376|gb|AAN36920.1| conserved protein, unknown function [Plasmodium falciparum 3D7]
Length = 444
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 181/361 (50%), Gaps = 80/361 (22%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----------------- 105
V VA+P +II+N + + + L +AR TIF I ++ ++D+
Sbjct: 86 VCVAIPSTIINNKSDV-IKSYLTSFLARIFTIFSISKIYIYDDYLEQEHLINKQKNYGDY 144
Query: 106 -------------------------KSSSDNYSRSSAANRSNRSDENES-GAAFLVRL-- 137
K S+DN + +N ++ ES ++L +
Sbjct: 145 QNEKYKKDIKNNSNNVNKNMHMNLKKDSNDNDHNTFDNIVNNEKNKEESPKYSYLCKYMH 204
Query: 138 --LQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSV 195
LQYLETPQYLRK LF + + L+ G++ P+DAPHHLR EW PFREGV L++ +S
Sbjct: 205 YNLQYLETPQYLRKHLFPITNFLKNSGIMNPVDAPHHLRSDEWLPFREGVVLQK---SSN 261
Query: 196 GTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTN-------RNLDADSPRQVVPPSKPKES 246
G +VDVGL +V+++ V ++ G RVTV +N +N + +++ PS PK
Sbjct: 262 GIIVDVGLFSNVLIENVNYINIGTRVTVLFQSNSYELFQKKNPNVLFTGKLIHPSVPKLY 321
Query: 247 GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGL 306
+YWGY + N+S +F K D+LIGTSE G ++ S ++ + +LI FG
Sbjct: 322 NLYWGYSIEILKNLSDIF-----KLQVDYLIGTSERGKFIDDSMVSFRDAKSVLIVFGNK 376
Query: 307 AGLEESI--------------EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
GLE+ + E+ + K ++ F + NTCP Q SRTIRTEEAI I+
Sbjct: 377 DGLEDLLIKEKEKKKNKVFSPEKKIKVLNKMLKK-FDLFINTCPCQTSRTIRTEEAITIT 435
Query: 353 L 353
L
Sbjct: 436 L 436
>gi|17551846|ref|NP_498602.1| Protein B0361.6 [Caenorhabditis elegans]
gi|1730629|sp|Q10950.1|YMP6_CAEEL RecName: Full=Uncharacterized protein B0361.6
gi|373253800|emb|CCD61821.1| Protein B0361.6 [Caenorhabditis elegans]
Length = 378
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 166/308 (53%), Gaps = 17/308 (5%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AVPG ++N QS EL T +AGQIARA T++R+DE++++D + + ++ N +
Sbjct: 67 TISIAVPGQFLNNAQSAELRTYMAGQIARAATLYRVDEIIIYDESCRMTDEAVNAYYNGT 126
Query: 122 NR-----SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
+ ++ N G +L ++L+YLE PQYLRK LF + L+ G+L PLDA HHL+
Sbjct: 127 WQGNLIPAETNYEGCFYLAKILEYLECPQYLRKDLFPIQKPLKNAGLLNPLDAQHHLKYD 186
Query: 177 EWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ---VLDPGVRVTVAMGTNRNLDAD 232
E FREGV LK+ + + G + ++GL K +D L P RVTV + N
Sbjct: 187 EKTLRFREGVVLKKRSKDGRGPICNIGLEKEFEIDSDAVQLPPYTRVTVEI-KNLTEQCK 245
Query: 233 SPRQVVP--PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
R + + +E+G+YWGY VR + V + +D + G S G + + D
Sbjct: 246 LYRGSITSGATVTRETGLYWGYSVRLMTGLQKVLQ----AKKFDIVAGVSPRGKLASQMD 301
Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCP-HQGSRTIRTEEAI 349
+ + +L+ FGG+AG++ ++E ++ + + A + F T GSR+ R EE +
Sbjct: 302 VCILNKPKILLVFGGVAGVDAAVESEELAEWRRAEDAFDVLIRTTSLSNGSRSERVEENV 361
Query: 350 FISLQYFQ 357
L Q
Sbjct: 362 LSVLAQVQ 369
>gi|156101914|ref|XP_001616650.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805524|gb|EDL46923.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 484
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 123/373 (32%), Positives = 173/373 (46%), Gaps = 87/373 (23%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD------------ 110
V VA+P SII+N + + + L +AR T+F + V ++D++ S+
Sbjct: 117 VCVAIPSSIINNRSDV-IKSYLTSYLARIFTVFAVARVYIYDDQMGSERDGRHERRDSHE 175
Query: 111 ---------NYSRSSAANRS-----------------------NRSDENESG-------- 130
N S S+A + ++S +E G
Sbjct: 176 QREQQLKRGNSSHSTALPQKTHEREKTPHSDRDGSNHQRKRPISQSSSHEGGEGENPPPE 235
Query: 131 ----AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVT 186
+L LQYLETPQYLRK LF + L+ G++ P+DAPHHLR EW PFREGV
Sbjct: 236 YSYLCQYLHYNLQYLETPQYLRKHLFPITHFLKHSGIMNPVDAPHHLRSDEWLPFREGVV 295
Query: 187 LKENAPNSVGTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTN-------RNLDADSPRQV 237
+K+ S G +VDVGL H ++ V +D G RVTV N RN D ++
Sbjct: 296 IKK---KSNGVIVDVGLFAHAHIENVHHVDIGTRVTVLFHPNSFLLFQRRNADVLFSGKL 352
Query: 238 VPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
+ P PK +YWGY V +S VF D+++GTSE G + +
Sbjct: 353 INPGTPKLYNLYWGYSVELLKRLSDVF-----DLDVDYIVGTSERGRFMGEVADAVRGVS 407
Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREV-------------FHSYFNTCPHQGSRTIR 344
+LI FG GLE+ + ++ ++K + V FH + NTCPHQ SRTIR
Sbjct: 408 SILIVFGNKQGLEDLLIKEREEQKKKSYAVEKRSKVLSKMLKKFHLFVNTCPHQTSRTIR 467
Query: 345 TEEAIFISLQYFQ 357
TEEAI I+L F
Sbjct: 468 TEEAIAITLSLFH 480
>gi|125547448|gb|EAY93270.1| hypothetical protein OsI_15078 [Oryza sativa Indica Group]
Length = 204
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 84/150 (56%), Positives = 108/150 (72%), Gaps = 8/150 (5%)
Query: 205 KHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSV 263
++V+V+Q+L+PG RVT+AMGTNR++ R++V PS P+ E +YWGYKVRYA N+ V
Sbjct: 13 RNVLVEQMLEPGKRVTIAMGTNRDITTACKRKIVSPSSPRDEMELYWGYKVRYASNLGGV 72
Query: 264 FKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI-------AFGGLAGLEESIEED 316
F + YK GYD++IGTSEHG I++SS+L LP+FR GGLAGLEE IEED
Sbjct: 73 FSDSPYKEGYDYIIGTSEHGKIISSSELILPSFRVAAFGALAPFNCIGGLAGLEECIEED 132
Query: 317 DGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
LK K +VF++Y NTCP QGSRTIRTE
Sbjct: 133 RNLKGKGVDDVFNTYLNTCPSQGSRTIRTE 162
>gi|189202400|ref|XP_001937536.1| deoxyribose-phosphate aldolase 2 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187984635|gb|EDU50123.1| deoxyribose-phosphate aldolase 2 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 440
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 133/406 (32%), Positives = 182/406 (44%), Gaps = 88/406 (21%)
Query: 33 NGDSSSCDNKKKKKRKRDQLNDDAPIEVPT------------VSVAVPGSIIDNTQSLEL 80
N +S KK+K ++Q +D PT V++A+PGS N + +
Sbjct: 25 NSHDNSSPTKKRKSDNQEQSMEDVDTSKPTAIFNPTKGRDWTVTIALPGSWTLNAKKPDH 84
Query: 81 ATRLAGQIARAVTIFRIDEVVVFDN------------KSSSDNYSRSSAANRSNRSDEN- 127
T G+IARA I+ +DE+VVFD+ K S+ + DE
Sbjct: 85 KTIQVGRIARAAAIYCVDEIVVFDDDPVNIDPRVIDPKYIRKGRSKQQVLDSILEQDEAW 144
Query: 128 ESGAAFLVRLLQYLETPQYLRK-------ALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
++ FL LL + E P +LR ++F H +L++VG LP +D PHHLR HEW
Sbjct: 145 QNPDQFLYHLLSFAECPPHLRYDREDHRLSIFKEHQNLKWVGNLPSMDMPHHLRSHEWCQ 204
Query: 181 FREGVTLKENAP--------------NSVG-TLVDVGLNKHVVV----DQVLDPGVRVTV 221
+REGV + P + G V GL V V D ++ G+R TV
Sbjct: 205 YREGVFVGPAPPLPSTPKSKKSKKSKDEQGYAYVKCGLPYPVRVPTPKDVPIEEGMRTTV 264
Query: 222 AMGT--------------NRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNC 267
NLDA + +P +E G YWGY VR A +S V+ C
Sbjct: 265 RFANIDPPNNWPHLSQVDCDNLDATACAASLPR---EEGGYYWGYTVRRAACLSDVYSEC 321
Query: 268 SYKGGYDHLIGTSEHGDIV--------NSSDLTLP--------TFRHLLIAFGGLAGLEE 311
Y GGYD IGTSE G V S +T +F+HLLI FGG AG+E
Sbjct: 322 EYPGGYDFSIGTSERGQSVYSFISGVGPSQSVTWTGDVAKAPESFKHLLIVFGGQAGIEP 381
Query: 312 SIEED----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
++ D L + A +F ++ N P QGSRTIRTEEA+ I L
Sbjct: 382 AVANDPVLGKELNKSTANSIFDAWINLVPAQGSRTIRTEEAVTIGL 427
>gi|330922064|ref|XP_003299680.1| hypothetical protein PTT_10728 [Pyrenophora teres f. teres 0-1]
gi|311326537|gb|EFQ92220.1| hypothetical protein PTT_10728 [Pyrenophora teres f. teres 0-1]
Length = 437
Score = 173 bits (439), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 126/365 (34%), Positives = 168/365 (46%), Gaps = 76/365 (20%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS-DNYSRSSAANR 120
TV++A+PGS N + + T G+IAR I+ +DE+VVFD+ + D+ R
Sbjct: 63 TVTIALPGSWTLNAKKPDHKTIQVGRIARTAAIYCVDEIVVFDDDPVNIDSRVIDPKYIR 122
Query: 121 SNRSDEN------------ESGAAFLVRLLQYLETPQYLRK-------ALFSMHSSLRFV 161
RS + ++ FL LL + E P +LR ++F H +L++V
Sbjct: 123 KGRSKQQVLDSILEQDEAWQNPDQFLYHLLSFAECPPHLRYDREDPRLSIFKEHQNLKWV 182
Query: 162 GMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG---------------TLVDVGLNKH 206
G LP +D PHHLR HEW +REGV + P S V GL
Sbjct: 183 GNLPSMDMPHHLRSHEWCQYREGVFVGPAPPLSSTPKSKKSKKPTDEQEYAYVKCGLPYP 242
Query: 207 VVV----DQVLDPGVRVTVAMGT--------------NRNLDADSPRQVVPPSKPKESGM 248
V V D ++ G+R TV NLDA + +P +E G
Sbjct: 243 VRVPTPKDVPIEEGMRTTVRFANVNPPRNWPHLSQVDCDNLDATACAASLPR---EEGGY 299
Query: 249 YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-------SDLTLPT------ 295
YWGY VR A +S V+ C Y GGYD IGTSE G V S S PT
Sbjct: 300 YWGYTVRRAACLSDVYSECEYPGGYDFAIGTSERGQSVYSFISGVGPSQSFTPTGGITKA 359
Query: 296 ---FRHLLIAFGGLAGLEESIEED----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
F+HLLI FGG AG+E ++ D L + A +F ++ N P QGSRTIRTEEA
Sbjct: 360 PESFKHLLIVFGGQAGIEPAVANDPVLGKELNKSTANSIFDAWVNLVPAQGSRTIRTEEA 419
Query: 349 IFISL 353
+ I L
Sbjct: 420 VTIGL 424
>gi|71032507|ref|XP_765895.1| hypothetical protein [Theileria parva strain Muguga]
gi|68352852|gb|EAN33612.1| hypothetical protein, conserved [Theileria parva]
Length = 396
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 132/392 (33%), Positives = 196/392 (50%), Gaps = 91/392 (23%)
Query: 23 HESQNELSLANGD-SSSCDNKKKKKRKRDQ------------LNDDAPIEVP-------- 61
H+S + L+N + + DNK K+ KR + L + P +P
Sbjct: 26 HKSNIHVELSNKNLDNVTDNKVLKENKRPKKDTNKYNLNYVRLFKEIPQRIPFLGKPRDY 85
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF---DNKSSSDNYSRSSAA 118
T+SVA+P SI++N QS +L + G IAR +TI+ ++EVV++ D+KSS N+ +
Sbjct: 86 TISVALPVSIMENIQSDDLRAYVIGNIARTLTIYGVNEVVLYNDLDDKSS--NWMEYFSL 143
Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
N L+YLETPQYLRK LF M ++L+F G+ P++APHHLR EW
Sbjct: 144 N------------------LRYLETPQYLRKFLFPMDNALKFAGLQNPINAPHHLRSTEW 185
Query: 179 APFREGVT--LKENAPNSVGTLVDVGLNKHV----------------VVDQVLDPGVRVT 220
P+REGV +++++ N+ D G+ V + D+ D RVT
Sbjct: 186 LPYREGVVGLIRKDSTNTT-LYADCGVFSSVKIHNKEDLKEFYGIEYMTDEDYDVYQRVT 244
Query: 221 VAMG------TNRNL-DADSPRQ---------VVPPSKP-KESGMYWGYKVRYAPNISSV 263
V + ++N + S Q +V P +P K +G+YWGY VR +
Sbjct: 245 VRLDDESIDKCHKNWKEGKSLSQSETGSLSAYLVHPEEPLKVAGLYWGYIVRECNTVVES 304
Query: 264 FKNCSY--KGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
K C + YD +GTSE G++ S + LP F+++LI FG + GLE ++E +
Sbjct: 305 LKGCPFNENARYDLKVGTSERGELY-SLNTKLPKFKNMLIYFGPVLGLEHVMQEPE---- 359
Query: 322 KNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ F YFN C Q SRTIRTEEA+ I L
Sbjct: 360 ----KKFDKYFNFCNQQKSRTIRTEEALLIVL 387
>gi|301113670|ref|XP_002998605.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111906|gb|EEY69958.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 293
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/182 (47%), Positives = 123/182 (67%), Gaps = 9/182 (4%)
Query: 44 KKKRKRDQLNDD--APIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVV 101
+KK+++DQL+++ A T+S+A+PGSI+DN Q+ EL T LAGQ+ARA IF++DE+V
Sbjct: 96 EKKQQQDQLDNERAAKGRAYTLSMAIPGSILDNAQTKELKTYLAGQVARAAVIFQVDEIV 155
Query: 102 VFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFV 161
VFD++ +N S + R++ F+ R+LQYLETPQYLR+ALF MH+ L
Sbjct: 156 VFDDQLG-NNVSTGAPKKRTH------DCHVFMARILQYLETPQYLRRALFPMHTDLSCA 208
Query: 162 GMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTV 221
G+L PLD PHHLR EW+ +REGV + G+ V VGL + VVVD+ + PG+RVTV
Sbjct: 209 GLLNPLDCPHHLRAQEWSVYREGVVTDRPLKDKEGSHVYVGLQREVVVDKRIQPGIRVTV 268
Query: 222 AM 223
+
Sbjct: 269 KI 270
>gi|171684083|ref|XP_001906983.1| hypothetical protein [Podospora anserina S mat+]
gi|170942002|emb|CAP67654.1| unnamed protein product [Podospora anserina S mat+]
Length = 368
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/317 (36%), Positives = 160/317 (50%), Gaps = 64/317 (20%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+S+AVP SII + + E T AG IARA+ IF +DEVV+FD+ S + R+ +
Sbjct: 101 TISIAVPTSIITSCVTREQRTTAAGSIARALAIFSVDEVVIFDD-SPIEQRPRNFDPDAY 159
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
E A FL LL YLETP ++RK LF +H +L+ G+L LD PHH K EW P+
Sbjct: 160 TGDIEP---AHFLEHLLNYLETPPFMRKVLFPIHPNLKSQGLLHGLDMPHHPHKDEWLPY 216
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG+TL+ AP G P V P
Sbjct: 217 REGLTLE--APPRSG------------------------------------KPEPVNPAE 238
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP------T 295
E G +WGY VR A ++S VF + +Y+ GYD IGTSE G ++ + P T
Sbjct: 239 PRTEGGYFWGYSVRKAKSLSDVFTSSAYEDGYDLSIGTSERGVPLSKA---FPNHNQTAT 295
Query: 296 FRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
F H+LI FGG GLE + D G++ +E+F + N P+QG+R IRT+E++
Sbjct: 296 FNHMLIVFGGPRGLEFAAMNDPDLGQMGIQGARTKELFDHWVNVLPNQGTRGIRTDESLL 355
Query: 351 ISLQYFQEPISRALRRV 367
I+L ALRR+
Sbjct: 356 IALT--------ALRRL 364
>gi|312087671|ref|XP_003145563.1| hypothetical protein LOAG_09988 [Loa loa]
Length = 282
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 98/258 (37%), Positives = 141/258 (54%), Gaps = 23/258 (8%)
Query: 50 DQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKS-- 107
D +N+ +S+A+PGSI++N QS EL T LAGQIAR+ IF +DE++V+D +
Sbjct: 30 DNMNETVNGRFLMLSIALPGSILNNAQSPELRTYLAGQIARSAAIFCVDEIIVYDETARM 89
Query: 108 ---SSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGML 164
D+Y N L R+L+++E PQYLRK LF + +LR+ G+L
Sbjct: 90 KPQQRDDYCSGKWCPDLPVHSGNVECNFHLARILEFMECPQYLRKTLFPLQKTLRYAGIL 149
Query: 165 PPLDAPHHLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ--VLDPGVRVTV 221
PLD PHHLR + + P+REGV L + G + ++GL K V +++ VL G RVTV
Sbjct: 150 NPLDCPHHLRASDLSVPYREGVVLDKPVKAGRGPICNIGLYKEVQINEEVVLQAGTRVTV 209
Query: 222 AMGTNRNLDADSPRQ------VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH 275
+ LD S R+ V +E+G+YWGYKVR A ++ G YD
Sbjct: 210 KV-----LDIYSERKRYHGCLVNSQQIKRETGLYWGYKVRIALSLHDALN----AGEYDV 260
Query: 276 LIGTSEHGDIVNSSDLTL 293
+IGTSE G V+ ++ L
Sbjct: 261 IIGTSERGKPVSKFEMPL 278
>gi|344255280|gb|EGW11384.1| Uncharacterized protein C9orf114-like [Cricetulus griseus]
Length = 193
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 84/193 (43%), Positives = 119/193 (61%), Gaps = 4/193 (2%)
Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
+R+ E + FREG+ + G+LV+ GL K V +D+ L+PG+RVTV + + +
Sbjct: 1 MRQDEESEFREGIVVDRPTKAGHGSLVNCGLKKEVKIDKKLEPGLRVTVRLNQKQLPECK 60
Query: 233 SPRQVVPPSK-PK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+ + V S+ P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+
Sbjct: 61 TYKGTVVSSQDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSNVASAQ 120
Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
L P+FRH L+ FGGL GLE ++ D L+ +F Y NTCP QGSRTIRTEEAI
Sbjct: 121 L--PSFRHALVVFGGLQGLEAGVDADPNLEVSEPSVLFDFYVNTCPSQGSRTIRTEEAIL 178
Query: 351 ISLQYFQEPISRA 363
ISL Q +++A
Sbjct: 179 ISLAALQPGLTQA 191
>gi|440632788|gb|ELR02707.1| hypothetical protein GMDG_05656 [Geomyces destructans 20631-21]
Length = 345
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/340 (35%), Positives = 172/340 (50%), Gaps = 40/340 (11%)
Query: 41 NKKKKKRKRD-QLNDDAPIEVP------TVSVAVPGSIIDNTQSLELATRLAGQIARAVT 93
N KK+ R D + + DA I P T S+A+PG+IID SLE + +A QIARA++
Sbjct: 7 NNKKRNRNGDVKGSQDAAIYGPGKGRSWTTSLAIPGNIIDMNMSLESRSIIASQIARAIS 66
Query: 94 IFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALF- 152
FRIDEV++ S+S + S+AA+ + A FL+R+L YLETP Y+R LF
Sbjct: 67 TFRIDEVIIL---STSTSPLDSTAASTDS--------APFLLRVLSYLETPPYIRDRLFQ 115
Query: 153 ----SMHSSLRFVGMLPPLDA--PHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKH 206
++H L + L PHH E +REGVTLK + LV+ GL
Sbjct: 116 PGKSAIHPDLAHISTLSAASTALPHHASFAETL-YREGVTLKAQEEGTYHRLVNCGLPNG 174
Query: 207 VVVDQVLDP---GVRVTVAMG--TNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNI 260
++ + DP RVTV + T P +VV PS+P+E G Y GY+ R +P++
Sbjct: 175 ILA-SIPDPIPKSTRVTVKLSSQTPSGDRTSIPAEVVAPSEPREEIGQYLGYETRTSPSL 233
Query: 261 SSVFKNCSYKGGYDHLIGTSEH-GDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED--- 316
S F++C Y GGYD I S+ GD + F+H+L+AFGG EE + D
Sbjct: 234 VSFFEDCPYDGGYDISILVSDSDGDDAGNISAANSKFQHILLAFGGAENFEELTKNDPKL 293
Query: 317 DGLKRKNAREVFHSYFNTC---PHQGSRTIRTEEAIFISL 353
L N ++F N P+ ++R EE + + L
Sbjct: 294 GSLDLDNFADLFDHTVNVAKGLPYVNPGSLRVEELVLVGL 333
>gi|349806181|gb|AEQ18563.1| hypothetical protein [Hymenochirus curtipes]
Length = 166
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 86/205 (41%), Positives = 108/205 (52%), Gaps = 39/205 (19%)
Query: 148 RKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHV 207
R++ F H L+F G L PLD+PHH+R E + +REGV L G+ V+ G+ K V
Sbjct: 1 RRSFFPKHPDLQFAGALNPLDSPHHVRIDEESEYREGVVLDRPTKPGRGSFVNCGMRKEV 60
Query: 208 VVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNC 267
+D+ WGYKVR A +S+VF C
Sbjct: 61 QIDK-------------------------------------QWGYKVRLAACLSAVFTEC 83
Query: 268 SYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREV 327
YK GYD IGTSE G V + ++LP FRH +I FGGL GLE S+E D L +
Sbjct: 84 PYKDGYDLSIGTSERGSSVET--VSLPKFRHAVIVFGGLKGLEYSVEGDQTLDISEPSII 141
Query: 328 FHSYFNTCPHQGSRTIRTEEAIFIS 352
F+ Y NTCP+QGSRTIRTEEAI IS
Sbjct: 142 FNHYLNTCPNQGSRTIRTEEAILIS 166
>gi|149039119|gb|EDL93339.1| rCG45649, isoform CRA_b [Rattus norvegicus]
Length = 233
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 108/173 (62%), Gaps = 4/173 (2%)
Query: 196 GTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK-PK-ESGMYWGYK 253
G+LV+ G+ K V +D+ LDPG+RVTV + + + + + V S+ P+ ++G+YWGY
Sbjct: 61 GSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSSQDPRTKAGLYWGYT 120
Query: 254 VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESI 313
VR A +S+VF ++ GYD IGTSE G V S+ L P FRH L+ FGGL GLE +
Sbjct: 121 VRLASCLSAVFAEAPFQDGYDLTIGTSERGSSVASTQL--PNFRHALVVFGGLQGLEAGV 178
Query: 314 EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
+ D L+ + +F Y NTCP QGSRTIRTEEAI ISL Q +++ R
Sbjct: 179 DADPNLEVADPSVLFDFYVNTCPSQGSRTIRTEEAILISLAALQPGLTQGGAR 231
Score = 37.7 bits (86), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 18/25 (72%), Positives = 21/25 (84%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAG 86
T+SVA+PGSI+DN QS EL T LAG
Sbjct: 35 TLSVALPGSILDNAQSPELRTYLAG 59
>gi|118396877|ref|XP_001030775.1| hypothetical protein TTHERM_01016150 [Tetrahymena thermophila]
gi|89285089|gb|EAR83112.1| hypothetical protein TTHERM_01016150 [Tetrahymena thermophila
SB210]
Length = 1437
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 151/289 (52%), Gaps = 30/289 (10%)
Query: 51 QLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD 110
QLN+ I++ +++ +P SII QS EL + ++AR IF +DE+++ + S
Sbjct: 303 QLNE--KIKLKHLAICIPDSIISIPQSSELRSYFISELARTAAIFCVDEIIILKDNS--- 357
Query: 111 NYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAP 170
Y S +N ++VR LQYLETPQYLRK LF +H L+ G++ P++
Sbjct: 358 -YIAKS---------QNFQQGPYIVRNLQYLETPQYLRKQLFPIHPDLKNAGLMNPIECH 407
Query: 171 HHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
HHL+ ++ P+REGV + + G+ V++GL K + L RVT+ + LD
Sbjct: 408 HHLKSDQYCPYREGVVMNRPTKENEGSWVEIGLKKQAKIPYKLVTNTRVTLRINE-PTLD 466
Query: 231 ADSP---RQVVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG-DI 285
+ VV +PKE G+YWGY+VR A +S VFK G D+L + +
Sbjct: 467 IEQKYYSADVVSQKEPKEKMGLYWGYQVRLADTLSQVFK------GEDNLRDHQQFSEES 520
Query: 286 VNSSDLTLPTFRHLLIAFGGLAGLEESIEEDD--GLKRKNAREVFHSYF 332
++ ++L L +++LI F G LE I+ D+ L K+A F + F
Sbjct: 521 ISETELDLED-KNVLIVFSGYNNLESIIDSDEYTKLSGKDALNKFDANF 568
>gi|148676494|gb|EDL08441.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
CRA_a [Mus musculus]
Length = 241
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 109/173 (63%), Gaps = 4/173 (2%)
Query: 196 GTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK-PK-ESGMYWGYK 253
G+LV+ G+ K V +D+ LDPG+RVTV + + + + + V S+ P+ ++G+YWGY
Sbjct: 61 GSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSSQDPRTKAGLYWGYT 120
Query: 254 VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESI 313
VR A +S+VF ++ GYD IGTSE G V S+ L P+FRH L+ FGGL GLE ++
Sbjct: 121 VRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQL--PSFRHALVVFGGLQGLEAAV 178
Query: 314 EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
+ D L+ + +F Y NTC QGSRTIRTEEAI ISL Q +++ R
Sbjct: 179 DADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPGLTQVGSR 231
>gi|145532104|ref|XP_001451813.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419479|emb|CAK84416.1| unnamed protein product [Paramecium tetraurelia]
Length = 321
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 157/312 (50%), Gaps = 35/312 (11%)
Query: 58 IEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSA 117
+E V++A+P SII N QS EL + Q+AR IF++DEV++ + S
Sbjct: 33 VEQRHVTIAIPDSIISNAQSSELRSYFMSQLARMFAIFQVDEVIILRD---------YSY 83
Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
+S + D A+++VR LQYLETPQYLRK LF +HS L+ VG++ P+++ HHL +
Sbjct: 84 IPKSKQFD----VASYVVRNLQYLETPQYLRKYLFPIHSDLKNVGLMNPIESKHHLLTEQ 139
Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGT-NRNLDADSPRQ 236
PFREGV + E + V++GL + V+++ L PG RVT+ + N +P
Sbjct: 140 VCPFREGVVV-ERPSKGDTSWVEIGLKRQVLINYPLQPGTRVTLRLDDPNSTQLTGTP-- 196
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGT-----SEHGDIVNSS 289
V KE G YWGY V I S F + +D LI T ++ N S
Sbjct: 197 -VSSQDAKEEGYYWGYTV----TIESKFHRLLERDDFDMKILIETEKSDEAQQTTTFNPS 251
Query: 290 DLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVF----HSYFNTCPHQGSRTIRT 345
L+ LL+ F GL + + E D+ K N +++ H Y G + +R
Sbjct: 252 QLSQQP-LKLLLLFSGLQKITDFTENDEKSKLSN-EDIYGKFDHVYRYGNNDFGVKQLRL 309
Query: 346 EEAIFISLQYFQ 357
EE +F+ LQ Q
Sbjct: 310 EEQMFMFLQSIQ 321
>gi|325958530|ref|YP_004289996.1| hypothetical protein Metbo_0773 [Methanobacterium sp. AL-21]
gi|325329962|gb|ADZ09024.1| protein of unknown function DUF171 [Methanobacterium sp. AL-21]
Length = 274
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 150/307 (48%), Gaps = 58/307 (18%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ VP SI+ T+ L + T G I R+ IFR D +VV++ DN
Sbjct: 6 LSIFVPASILSETKDLRIKTYKIGLIGRSAAIFRADRIVVYN-----DN----------- 49
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
SDE E F+ +L Y+ TPQYLRK +F + LR VG+LPPL PHH HE
Sbjct: 50 -SDEKE--VKFISDVLTYMNTPQYLRKKVFPITRELRNVGILPPLRTPHHPTGELHE-GD 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVV------VDQVLDPGVRVTVAMGTNRNLDADSP 234
+R+G+T+K GT+VD+G +K + V++VL V + +G LD D P
Sbjct: 106 YRQGLTIKRTKK---GTVVDIGADKFALCKEKLSVNKVLSFKV---IKLGKEIILDPDVP 159
Query: 235 RQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSD 290
YWGYKV A N K G D +IGTS + +++
Sbjct: 160 ------------DFYWGYKV-LATNKDLYNSILMMKPGPDLVIGTSRYAPSITSVLDEVK 206
Query: 291 LTLPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
L +H I FGG +GLEE I + +N +E+ NT P QG++T+RTEEA+
Sbjct: 207 TKLKGSKHTAILFGGPYSGLEELISD------QNEKEIIDLEVNTVPEQGTKTVRTEEAV 260
Query: 350 FISLQYF 356
+L F
Sbjct: 261 LATLSVF 267
>gi|145473603|ref|XP_001462465.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430305|emb|CAK95092.1| unnamed protein product [Paramecium tetraurelia]
Length = 334
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 158/313 (50%), Gaps = 37/313 (11%)
Query: 58 IEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSA 117
+E V++A+P SII N QS EL + Q+AR IF++DEV++ + S
Sbjct: 46 VEQRHVTIAIPDSIISNAQSSELRSYFMSQLARMFAIFQVDEVIILRD---------YSY 96
Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
+S + D A+++VR LQYLETPQYLRK LF +HS L+ VG++ P+++ HHL +
Sbjct: 97 IPKSKQFD----VASYVVRNLQYLETPQYLRKYLFPIHSDLKNVGLMNPIESKHHLLTEQ 152
Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGT-NRNLDADSPRQ 236
PFREGV + E + V++GL + V+++ L PG RVT+ + N +P
Sbjct: 153 VCPFREGVVV-ERPSKGDTSWVEIGLKRQVLINYPLQPGTRVTLKLDDPNSTHLTGTP-- 209
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYD-HLIGTSEHGD------IVNSS 289
V K+ G YWGY V I S F + +D ++ +E D N S
Sbjct: 210 -VSSQDAKKEGYYWGYTV----TIESKFHRLLERDDFDMKILVETEKSDEAQQTTTFNPS 264
Query: 290 DLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTC-----PHQGSRTIR 344
L+ LL+ F GL + + E D+ K K ++E + F+ G + +R
Sbjct: 265 QLSQQP-LKLLLLFSGLQKITDFTENDE--KSKLSKEDIYGKFDHVYRYGNSEYGVKQLR 321
Query: 345 TEEAIFISLQYFQ 357
EE +F+ LQ Q
Sbjct: 322 LEEQMFMFLQSIQ 334
>gi|400594356|gb|EJP62211.1| deoxyribose-phosphate aldolase [Beauveria bassiana ARSEF 2860]
Length = 456
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 93/263 (35%), Positives = 136/263 (51%), Gaps = 41/263 (15%)
Query: 62 TVSVAVPGSI------------------IDNTQSLELAT---RLA--GQIARAVTIFRID 98
T+S+AVPGSI I NT L T RL ++AR+++IF +D
Sbjct: 58 TISIAVPGSILSKYVPHHTAPPFSVFKSITNTHICSLPTAEQRLVTVARLARSLSIFCVD 117
Query: 99 EVVVFDNKSSS--DNYSRSSAANRSNRSDENESGA---AFLVRLLQYLETPQYLRKALFS 153
EV+V+D+ + S + +R A+ S+ + A F+ ++L +LE P ++R+ LF
Sbjct: 118 EVIVYDDDAPSARNTTTRFDMADGSSNTGIYTGDADPCHFVAQVLSFLECPPFMRRRLFP 177
Query: 154 MHSSLRFVGMLPPLDAPHHLR-KHEWAPFREGVTLKENAPN---------SVGTLVDVGL 203
+H +LR +LP LD PHH R K W P+ EGVT+ + S G+LVD G+
Sbjct: 178 LHPNLRHTALLPALDMPHHPRAKAPWLPYAEGVTVTAAGEDADTLLLPDGSGGSLVDTGI 237
Query: 204 NKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR--QVVPPSKPK-ESGMYWGYKVRYAPNI 260
V V V+ P RVT+ + SP V P +P+ + G YWGY VR + ++
Sbjct: 238 GGLVRVADVIPPNTRVTLRFDGPNDAQQQSPPPPMCVHPDEPRTKGGYYWGYAVRRSASL 297
Query: 261 SSVFKNCSYKGGYDHLIGTSEHG 283
S VF C + GGYD +GTSE G
Sbjct: 298 SGVFTECPHAGGYDLSVGTSERG 320
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 5/66 (7%)
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
P+F+HLL+ FGG G+E + D L + RE+F + N P QGSR I T+EA
Sbjct: 382 PSFQHLLVVFGGPRGIELAAAHDPHLAAMQIRGAKTRELFDHWVNVLPGQGSRDIATDEA 441
Query: 349 IFISLQ 354
+ I+L
Sbjct: 442 VPIALM 447
>gi|51593245|gb|AAH78441.1| D2Wsu81e protein [Mus musculus]
Length = 179
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/144 (49%), Positives = 89/144 (61%), Gaps = 4/144 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD + S
Sbjct: 35 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 90
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 91 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 150
Query: 182 REGVTLKENAPNSVGTLVDVGLNK 205
REGV + G+LV+ G+ K
Sbjct: 151 REGVVVDRPTKAGHGSLVNCGMKK 174
>gi|426363303|ref|XP_004048782.1| PREDICTED: uncharacterized protein C9orf114 homolog [Gorilla
gorilla gorilla]
Length = 242
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 102/173 (58%), Gaps = 4/173 (2%)
Query: 196 GTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWGYK 253
G L G + V +D+ L+PG+RVTV + + D + +VV P+ ++G+YWGY
Sbjct: 70 GELGSWGGPQEVKIDKNLEPGLRVTVRLNQQQLPDCKTFHGKVVSSQDPRTKAGLYWGYT 129
Query: 254 VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESI 313
VR A +S+VF ++ GYD IGTSE G V S+ L P FRH L+ FGGL GLE
Sbjct: 130 VRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQL--PNFRHALVVFGGLQGLEAGA 187
Query: 314 EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
+ D L+ +F Y NTCP QGSRTIRTEEAI ISL Q +++A R
Sbjct: 188 DADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPGLTQAGAR 240
>gi|47202262|emb|CAF88010.1| unnamed protein product [Tetraodon nigroviridis]
Length = 196
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 87/145 (60%), Gaps = 4/145 (2%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
PTVSVAVPGS++DN QS EL + LAGQIARA +F +DE+VVFD D S
Sbjct: 55 PTVSVAVPGSVLDNAQSPELRSYLAGQIARACVVFCVDEIVVFDE----DGEDLKSVEGE 110
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+ + L R+LQYLE PQYLRK F H L++ G+L PLD+PHH+R +
Sbjct: 111 FKGVGKKGNACIQLARILQYLECPQYLRKWFFPKHQDLQYAGLLNPLDSPHHMRTDDECE 170
Query: 181 FREGVTLKENAPNSVGTLVDVGLNK 205
+REGV L G+LV+ G+ K
Sbjct: 171 YREGVVLDRPTKAGQGSLVNCGMRK 195
>gi|6841442|gb|AAF29074.1|AF161459_1 HSPC109 [Homo sapiens]
Length = 388
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/252 (37%), Positives = 131/252 (51%), Gaps = 21/252 (8%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+ T L+ + R+ E V S R + R
Sbjct: 74 TLSVALPGSIL---------TMLSRRSFAPTWPVRLPEPVPSSVWMRSWCLMRRARMPRL 124
Query: 122 NR--SDENESGAAFLVRLL----QYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRK 175
R S E E GA VR E PQYLRKA F H L+F G+L PLD+PHH+R+
Sbjct: 125 WRGNSQELEEGAG--VRTAGPDPAVPECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQ 182
Query: 176 HEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
E + FREG+ + G+ V+ G+ K V +D+ L+PG+RVTV + ++ D +
Sbjct: 183 DEESEFREGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYH 242
Query: 236 -QVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+VV P+ ++G+YWGY VR A +S+VF ++ GYD IGTSE G V S+ +
Sbjct: 243 GKVVSSQDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASA--SF 300
Query: 294 PTFRHLLIAFGG 305
PT LL G
Sbjct: 301 PTSGMLLWCSGA 312
>gi|297302932|ref|XP_001119071.2| PREDICTED: uncharacterized protein C9orf114-like, partial [Macaca
mulatta]
Length = 147
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/144 (46%), Positives = 86/144 (59%), Gaps = 4/144 (2%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 4 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 63
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ L R+LQYLE PQYLRKA F H L+F G+L PLD+PHH+R+ E + F
Sbjct: 64 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 119
Query: 182 REGVTLKENAPNSVGTLVDVGLNK 205
REG+ + G+ V+ G+ K
Sbjct: 120 REGIVVDRPTRPGHGSFVNCGMKK 143
>gi|402581918|gb|EJW75865.1| D2Wsu81e protein [Wuchereria bancrofti]
Length = 247
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 94/266 (35%), Positives = 129/266 (48%), Gaps = 45/266 (16%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+A+PGSI++N QS EL T LAGQIAR+ +F +DE+VV+D + + R N
Sbjct: 2 LSIALPGSILNNAQSSELRTYLAGQIARSAAVFCVDEIVVYDETARMKSQQREDYCNGQW 61
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA-PF 181
D +HS +L PLD PHHLR + + P+
Sbjct: 62 YPD--------------------------LPVHSG-NVECILNPLDCPHHLRASDLSVPY 94
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVV--DQVLDPGVRVTVAMGTNRNLDADSPRQ--- 236
REGV L + G + +VGL K V + D L G RVTV + LD S R+
Sbjct: 95 REGVVLNKPIKAGRGPICNVGLYKEVQINEDVTLKTGTRVTVKI-----LDIYSERKRYH 149
Query: 237 ---VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
V +E+G+YWGYKVR A ++ YD +IGTSE G V+ ++
Sbjct: 150 GCLVNSKQIKQEAGLYWGYKVRLALSLYDALN----AEEYDVIIGTSERGKPVSKFEMPF 205
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGL 319
+LI FGGL GLE +IE D +
Sbjct: 206 SGKNRILIVFGGLEGLETAIEADKNI 231
>gi|148642823|ref|YP_001273336.1| hypothetical protein Msm_0763 [Methanobrevibacter smithii ATCC
35061]
gi|148551840|gb|ABQ86968.1| conserved hypothetical protein Msm_0763 [Methanobrevibacter smithii
ATCC 35061]
Length = 275
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 156/303 (51%), Gaps = 47/303 (15%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV +P S + ++ L++ T G + RA+ +F+ID VV+++ D+Y ++
Sbjct: 6 LSVFIPNSFLSESKDLKVRTFKVGILGRALAVFQIDNVVIYN-----DDYIKN------- 53
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
+ E F+ +L Y+ TPQYLRK+ F + L+ VG+LPPL APHH + +
Sbjct: 54 --ENGEEDGEFIAEVLNYMNTPQYLRKSAFPIRPELKHVGILPPLRAPHHPVNSQPDVGD 111
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV-VP 239
+R+G T+K N GT VD+G++K + ++TV N + + ++V V
Sbjct: 112 YRQGFTVKRNKK---GTFVDIGMDKLAFCKE------QLTVKKIFNFKITKIAKKEVIVT 162
Query: 240 PSKPKESGMYWGYKVRYAPNISS--VFKNCSYKGGYDHLIGTSEHGDIVNSS----DLTL 293
P KP + +YWGY V ISS KN D ++ T+ +GD ++S L +
Sbjct: 163 PDKPDD--IYWGYNV-----ISSNKSLKNSLKLIKPDFIVETTRYGDYIDSIFDELKLKV 215
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
F+++ I FGG SI ED + N H NT P+QG+ T+RTEEA+ +L
Sbjct: 216 DEFKNIAILFGGPYS---SIPED--VSSGNWE---HIKLNTVPNQGTETVRTEEAVVATL 267
Query: 354 QYF 356
F
Sbjct: 268 SLF 270
>gi|288560107|ref|YP_003423593.1| hypothetical protein mru_0850 [Methanobrevibacter ruminantium M1]
gi|288542817|gb|ADC46701.1| hypothetical protein mru_0850 [Methanobrevibacter ruminantium M1]
Length = 271
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 150/300 (50%), Gaps = 43/300 (14%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
VSV +P S + T+ L+L T G I RA+ +F +DEVV++ + S D S
Sbjct: 6 VSVFIPNSFLAETKDLKLKTSKVGLIGRALAVFEVDEVVIYKDLSIPD----------SE 55
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
++D+ + F+ +L Y++TPQYLRK + + LR VG+LPPL PHH K E +
Sbjct: 56 QTDDGD----FIAEVLNYMDTPQYLRKKAIPIKAELRHVGILPPLRVPHHPTGKPELGDY 111
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
R+G T+K N GT VD+G++K + ++TV + + + +V P
Sbjct: 112 RQGFTVKRNKK---GTFVDIGMDKLAFCKE------QLTVNKIFSFKITKFAKEVIVTPD 162
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT----FR 297
+P + +YWG+K + + KN D ++ T+ + + ++S L T
Sbjct: 163 EPDD--IYWGFKTL---STNKSLKNSLKLVNPDLVVETTRYAETIDSIFNELKTKVESSN 217
Query: 298 HLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
H+ I FGG + + E++E K NT P+QG+ T+RTEEA+ +L F
Sbjct: 218 HIAIVFGGPYSSISENVESSKWETVK---------INTIPNQGTETVRTEEAVISTLAIF 268
>gi|222445054|ref|ZP_03607569.1| hypothetical protein METSMIALI_00671 [Methanobrevibacter smithii
DSM 2375]
gi|222434619|gb|EEE41784.1| hypothetical protein METSMIALI_00671 [Methanobrevibacter smithii
DSM 2375]
Length = 275
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 154/303 (50%), Gaps = 47/303 (15%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV +P S + ++ L++ T G + RA+ +F+ID VV++++
Sbjct: 6 LSVFIPNSFLSESKDLKVRTFKVGILGRALAVFQIDNVVIYNDDHI-------------- 51
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
+++ E F+ +L Y+ TPQYLRK+ F + L+ VG+LPPL APHH + +
Sbjct: 52 KNENGEEDGEFIAEVLNYMNTPQYLRKSAFPIRPELKHVGILPPLRAPHHPVNSQPDVGD 111
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV-VP 239
+R+G T+K N GT VD+G++K + ++TV N + + ++V V
Sbjct: 112 YRQGFTVKRNKK---GTFVDIGMDKLAFCKE------QLTVKKIFNFKITKIAKKEVIVT 162
Query: 240 PSKPKESGMYWGYKVRYAPNISS--VFKNCSYKGGYDHLIGTSEHGDIVNSS----DLTL 293
P KP + +YWGY V ISS KN D ++ T+ +GD ++S L +
Sbjct: 163 PDKPDD--IYWGYNV-----ISSNKSLKNSLKLIKPDFVVETTRYGDYIDSIFDELKLKV 215
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
F+++ I FGG SI ED + N H NT P+QG+ T+RTEEA+ +L
Sbjct: 216 DEFKNIAILFGGPYS---SIPED--VSSGNWE---HIKLNTVPNQGTETVRTEEAVVATL 267
Query: 354 QYF 356
F
Sbjct: 268 SLF 270
>gi|261350377|ref|ZP_05975794.1| conserved hypothetical protein [Methanobrevibacter smithii DSM
2374]
gi|288861160|gb|EFC93458.1| conserved hypothetical protein [Methanobrevibacter smithii DSM
2374]
Length = 275
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 154/303 (50%), Gaps = 47/303 (15%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV +P S + ++ L++ T G + RA+ +F+ID VV++++
Sbjct: 6 LSVFIPNSFLSESKDLKVRTFKVGILGRALAVFQIDNVVIYNDDHI-------------- 51
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
+++ E F+ +L Y+ TPQYLRK+ F + L+ VG+LPPL APHH + +
Sbjct: 52 KNENGEEDGEFIAEVLNYMNTPQYLRKSAFPIRPELKHVGILPPLRAPHHPVNSQPDVGD 111
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV-VP 239
+R+G T+K N GT VD+G++K + ++TV N + + ++V V
Sbjct: 112 YRQGFTVKRNKK---GTFVDIGMDKLAFCKE------QLTVKKIFNFKITKIAKKEVIVT 162
Query: 240 PSKPKESGMYWGYKVRYAPNISS--VFKNCSYKGGYDHLIGTSEHGDIVNSS----DLTL 293
P KP + +YWGY V ISS KN D ++ T+ +GD ++S L +
Sbjct: 163 PDKPDD--IYWGYNV-----ISSNKSLKNSLKLIKPDFVVETTRYGDYIDSIFDELKLKV 215
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
F+++ I FGG SI ED + N H NT P+QG+ T+RTEEA+ +L
Sbjct: 216 DEFKNIAILFGGPYS---SILED--VSSGNWE---HIKLNTVPNQGTETVRTEEAVVATL 267
Query: 354 QYF 356
F
Sbjct: 268 SLF 270
>gi|408381923|ref|ZP_11179470.1| hypothetical protein A994_05701 [Methanobacterium formicicum DSM
3637]
gi|407815371|gb|EKF85956.1| hypothetical protein A994_05701 [Methanobacterium formicicum DSM
3637]
Length = 273
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 150/305 (49%), Gaps = 53/305 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P S + T+ L+L T G I R+ +FR+ ++VV+ + S
Sbjct: 6 ISIFIPSSFLRETKDLKLKTYKVGLIGRSAALFRVTKIVVYSDTEDS------------- 52
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
E++ G F+ +L Y+ TPQYLRK +F + LR VG+LPPL PHH + F
Sbjct: 53 ---EDQKGVKFISDILTYMNTPQYLRKKVFPITRELRNVGILPPLRTPHHPTGELHQGDF 109
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
R+G+TLK GT+VD+G ++ + + +++V + +D ++ P
Sbjct: 110 RQGLTLKRTKK---GTIVDIGADRDALCKE------KLSVNRVLSFRVDKLGKEIIITPD 160
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCS-----YKGGYDHLIGTSEHGD----IVNSSDLT 292
+P+ +YWGY ++ S +K+ K D ++GTS + + ++N
Sbjct: 161 EPQ---VYWGY------DVLSTYKSLDDSIDMLKPRPDLVVGTSRYAEPITSVLNEVREG 211
Query: 293 LPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
L +H I FGG +GL E + N +V NT P+QG++T+RTEEA+
Sbjct: 212 LRGSKHAAILFGGPYSGLHELM--------GNPGDVLDLEVNTIPNQGTKTVRTEEAVLA 263
Query: 352 SLQYF 356
+L F
Sbjct: 264 TLSVF 268
>gi|340506128|gb|EGR32344.1| hypothetical protein IMG5_087030 [Ichthyophthirius multifiliis]
Length = 491
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 80/247 (32%), Positives = 130/247 (52%), Gaps = 34/247 (13%)
Query: 37 SSCDNKKKKKRKRDQLNDDAP-----------IEVPTVSVAVPGSIIDNTQSLELATRLA 85
+ NKKK ++ ++ P +++ +S+ +P SII QS EL +
Sbjct: 87 TEIQNKKKDQKNSNEKKQQIPQIINFSQNPTKVKLNQISICIPDSIISIPQSSELRSYFI 146
Query: 86 GQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQ 145
++AR IF +DE+++ DN +S + N E G ++ R LQYLETPQ
Sbjct: 147 SELARTAAIFCVDEIIIL-----KDNSYKSKSTNF-------EQGP-YIARNLQYLETPQ 193
Query: 146 YLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG---TLVDVG 202
YLRK LF +H L+ G++ P++ HHL++ ++ +REGV + + P G + V++G
Sbjct: 194 YLRKQLFPIHPDLKNAGLMNPIECHHHLKQDQYCKYREGVVV--SRPIKGGEDSSWVEIG 251
Query: 203 LNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR---QVVPPSKPKES-GMYWGYKVRYAP 258
L K V+ L R+T+ + + LD + +VV +PKE G++WGY VR A
Sbjct: 252 LKKQAKVNYRLAANTRITLKLEDD-TLDIQNKYYNAEVVSQKEPKEKMGLFWGYSVRLAD 310
Query: 259 NISSVFK 265
N++ VFK
Sbjct: 311 NLTQVFK 317
>gi|410722183|ref|ZP_11361492.1| hypothetical protein B655_1961 [Methanobacterium sp. Maddingley
MBC34]
gi|410597419|gb|EKQ52038.1| hypothetical protein B655_1961 [Methanobacterium sp. Maddingley
MBC34]
Length = 273
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/305 (28%), Positives = 148/305 (48%), Gaps = 53/305 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P S + T+ L+L T G I R+ +FR ++V++ +
Sbjct: 6 ISIFIPSSFLRETKDLKLKTYKVGLIGRSAALFRATKIVIYSD----------------T 49
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
E+ G F+ +L Y+ TPQYLRK +F + LR VG+LPPL PHH + F
Sbjct: 50 EDPEDRKGVKFISDILTYMNTPQYLRKKVFPITRELRNVGILPPLRTPHHPTGELHQGDF 109
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
R+G+TLK GT+VD+G ++ + + +++V + +D ++ P
Sbjct: 110 RQGLTLKRTKK---GTIVDIGADRDALCKE------KLSVNRVLSFRVDKLGKEIIITPD 160
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCS-----YKGGYDHLIGTSEHGD----IVNSSDLT 292
+P+ +YWGY ++ S +K+ K D ++GTS + + ++N
Sbjct: 161 EPQ---VYWGY------DVLSTYKSLDDSIDLLKPQPDLVVGTSRYAEPITSVLNEVKEG 211
Query: 293 LPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
L +H+ I FGG +GL E + N +V NT P+QG++T+RTEEA+
Sbjct: 212 LRGSKHVAILFGGPYSGLHELM--------GNPGDVLDLEVNTIPNQGTKTVRTEEAVLA 263
Query: 352 SLQYF 356
+L F
Sbjct: 264 TLSVF 268
>gi|154286758|ref|XP_001544174.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150407815|gb|EDN03356.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 206
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/192 (39%), Positives = 109/192 (56%), Gaps = 13/192 (6%)
Query: 182 REGVTLKENAPN---SVGTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQV 237
++G T K+N N + TLV+ GL + V V + P RVTV G++ +D +V
Sbjct: 13 KKGKTNKDNMGNDSPTSYTLVNTGLPEKVRVPYIPTPENTRVTVKFGSS--IDPSDGAEV 70
Query: 238 VPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT-LPT 295
V P P+E +G YWGY VR +S+VF C + GGYD GTSE G ++ + +P
Sbjct: 71 VSPFAPREETGYYWGYSVRRCATLSAVFTECPFDGGYDLSFGTSERGCPLSEAIAGEVPK 130
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRK-----NAREVFHSYFNTCPHQGSRTIRTEEAIF 350
F HL++ FGG+AGLE ++ D L+ K +A VF + N P QGSRTIRTEEA++
Sbjct: 131 FEHLIVVFGGVAGLEAAVTADKDLRDKGLGPSDAGRVFDYWVNVLPGQGSRTIRTEEAVW 190
Query: 351 ISLQYFQEPISR 362
+ L + + R
Sbjct: 191 LGLMGLRGVVER 202
>gi|383319752|ref|YP_005380593.1| hypothetical protein Mtc_1321 [Methanocella conradii HZ254]
gi|379321122|gb|AFD00075.1| hypothetical protein Mtc_1321 [Methanocella conradii HZ254]
Length = 271
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 146/300 (48%), Gaps = 55/300 (18%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P S+ + T + T GQ+ARA +IFR+DE+VV+ D+
Sbjct: 10 LSLLIPASLTEETADPRIKTYKVGQVARAASIFRVDEIVVYKTPKRDDS----------- 58
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
F+ +L+Y ETPQYLRK +F M +LR +G++PPL P H + E +R
Sbjct: 59 ---------RFISTVLRYAETPQYLRKEIFPMQGALRHIGVIPPLRIPSHTSEEE---YR 106
Query: 183 EGVTLKENAPNSVGT----LVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDAD-SPRQ 236
EG+ K VGT VDVG + ++ D ++ G RVTV + + R L + R
Sbjct: 107 EGIVTK------VGTDGNAWVDVGSDSPAMLPDAKVEKGQRVTVRIYSRRPLTVELVKRS 160
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS---SDLTL 293
VP +YWGY+VR A + + + I TS G + S++
Sbjct: 161 DVP--------LYWGYEVRIADTLHDALETDGLR------IATSRLGHPLACEMLSEIKS 206
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ +AFG + E + D+G K ++ + N+ P+QG+ T+R EEAI+++L
Sbjct: 207 KIRDKVSVAFGSPSKGLEQLLHDEGHKLEDHSDCV---VNSIPNQGTATVRAEEAIYVTL 263
>gi|305664237|ref|YP_003860525.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
gi|304378806|gb|ADM28645.1| Protein of unknown function DUF171 [Ignisphaera aggregans DSM
17230]
Length = 297
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/300 (30%), Positives = 144/300 (48%), Gaps = 41/300 (13%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
P +S+A+P SI+ + R G I RA IFR++E+ ++
Sbjct: 15 PYISIAIPSSIVSESPDPREKVRKIGYIGRASAIFRVEEIFIY----------------- 57
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW-A 179
D++E ++ LL YLE P YLR+ L SLR+VG+LPPL PHH+R+ +
Sbjct: 58 ---MDDSEENLNYIYELLSYLEIPPYLRRKLVPYKPSLRYVGLLPPLKTPHHVRREIFNT 114
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKH-VVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
REGV + N + +++D+GL+K V + + G RVTV + D +V
Sbjct: 115 NLREGVVIDS---NDIRSIIDIGLDKKGVAYGRYIPRGSRVTVKI-VREQRDV----YIV 166
Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF-- 296
KE +YWGYKV ++ + + S K Y +IG S+ G + + +
Sbjct: 167 DIVDEKEIDIYWGYKVYRFKSMKEMLE-YSIKNSY-IVIGASKKGQPLYCLESEIKDILS 224
Query: 297 --RHLLIAFGGLAGLEESIEEDDGLK-RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ ++I FGG + I ++G+ + +R + N P QG ++RTEEAIF L
Sbjct: 225 GDKRIVIVFGGPRLDIDEIAVNEGIDIYRYSRYI----INFIPRQGVESVRTEEAIFAVL 280
>gi|238505443|ref|XP_002383948.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220690062|gb|EED46412.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 249
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 87/248 (35%), Positives = 121/248 (48%), Gaps = 49/248 (19%)
Query: 170 PHHLRKHEWAPFREGVTL---------------------KENAPNS----VGTLVDVGLN 204
PHHLR +EW +REG+ + + ++P+S T++D GL
Sbjct: 2 PHHLRANEWCEYREGIVVSSSDGGGQRRHSTQMSNYHNNRRHSPSSPTNFSATVIDTGLP 61
Query: 205 KHVVVDQVLDPG-VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISS 262
K VV+ + P RVTV + + Q V PS P+ E+G YWGY VR ++SS
Sbjct: 62 KKVVLPDIQLPEYARVTVRFP---DYGREHYAQPVHPSTPRSEAGYYWGYYVRRCRSLSS 118
Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNS-------------SDLTLPTFRHLLIAFGGLAGL 309
VF C + GGYD GTSE G V S P ++HLLI FGG+AG+
Sbjct: 119 VFTECPFDGGYDLSFGTSERGAPVYSVLEEDRQEHDNYDRRKIPPDYKHLLIVFGGVAGI 178
Query: 310 EESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRAL 364
E ++ D L + A ++F + N P QGSRTIRTEEA+++ L + +
Sbjct: 179 EAAVHNDPQLCDMDIRATEAGKLFDYWVNLLPGQGSRTIRTEEAVWLGLTSLRGLVEGTH 238
Query: 365 R-RVEYKS 371
R R YKS
Sbjct: 239 RPRPSYKS 246
>gi|312137074|ref|YP_004004411.1| hypothetical protein Mfer_0851 [Methanothermus fervidus DSM 2088]
gi|311224793|gb|ADP77649.1| Protein of unknown function DUF171 [Methanothermus fervidus DSM
2088]
Length = 268
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 87/307 (28%), Positives = 145/307 (47%), Gaps = 57/307 (18%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
V +S+ +P S+ ++ ++ T G I R IFR++E++++ ++
Sbjct: 2 VSILSIFIPDSLTIESKDKKIKTYKVGIIGRTAAIFRVNEIIIYHDEFG----------- 50
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEW 178
N F+ +L Y++TPQYLRK +F + ++LR+VG+LPPL PHH + +
Sbjct: 51 -------NSKEGKFIKDILTYMDTPQYLRKDIFPLTNNLRYVGVLPPLRTPHHPTGQPKV 103
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTV----AMGTNRNLDADSP 234
R+G+TLK GT+V++G + + + L P +V +G ++ D P
Sbjct: 104 GELRQGLTLKSTKK---GTIVNIGAKRLALCKEKL-PIYKVYTFRIKKLGKRILVERDIP 159
Query: 235 RQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD----IVNSSD 290
V YWGYKV+Y I+ + YD +I TS G ++ +
Sbjct: 160 ENV-----------YWGYKVKY---INKPIEKAIKMKKYDSIIATSRKGVPITFVLEEIE 205
Query: 291 LTLPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
+ HL I FGG GL +++ + +K NT P+QG+ T+RTEEA+
Sbjct: 206 NIIHNTHHLAILFGGPYKGLPTTLDREIDMK-----------INTIPNQGTETVRTEEAV 254
Query: 350 FISLQYF 356
+L F
Sbjct: 255 LATLSIF 261
>gi|281206162|gb|EFA80351.1| DUF171 family protein [Polysphondylium pallidum PN500]
Length = 531
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/249 (32%), Positives = 133/249 (53%), Gaps = 37/249 (14%)
Query: 40 DNKKKKKRKRDQLNDDAP---IEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFR 96
D K++KK + L AP + + S+A+P +I+D+ S E+AT L I+R V +F
Sbjct: 153 DIKEQKKEVQSILESSAPKIFAKHASKSIAIPANILDDITSEEMATYLVEMISRMVVLFA 212
Query: 97 IDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHS 156
+DE++V+ +S+ +N S + L++LL+Y+ETP+ +R LF +
Sbjct: 213 VDEIIVY----------QSALSN---------STSDKLMKLLEYIETPRNIRYHLFDLDD 253
Query: 157 -SLRFVGMLPPLDAPHHLRKHEWAPFREG-VTLKENAPNSVGTLVDVGL--NKHVVVDQV 212
+FV L ++A HH + W +REG VT K A S L+DVGL K + D+
Sbjct: 254 PDYKFVDKLKKMEAAHHNTTNRWTRYREGVVTDKVEAGTS---LIDVGLGHGKEALADKK 310
Query: 213 LDPGVRVTVAM---GTNRNLDADSPR----QVVPPSKPKESGMYWGYKVRYAPNISSVFK 265
L PG+RVT+ M +R + + + ++V P + KE+G YWGY +R+ ++ S+
Sbjct: 311 LQPGIRVTLEMEEDSKSREQNNSNSKYKKGKLVSPRQVKEAGHYWGYTIRHVKSLDSLDA 370
Query: 266 NCSYKGGYD 274
Y G YD
Sbjct: 371 ESPY-GSYD 378
>gi|66807919|ref|XP_637682.1| DUF171 family protein [Dictyostelium discoideum AX4]
gi|60466114|gb|EAL64179.1| DUF171 family protein [Dictyostelium discoideum AX4]
Length = 467
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 158/317 (49%), Gaps = 44/317 (13%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+P + + QS E+ T + I+R VT+ +IDE++VF +D+Y+ +
Sbjct: 161 TVSIAIPSNALSGIQSEEMKTYIIEMISRQVTMNKIDEIIVF----KTDDYNEEDS---- 212
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF-VGMLPPLDAPHHLRKHEWAP 180
+ +S L+++L+Y+ETP YLR LF+ F V L +++ + ++
Sbjct: 213 ----QQQSDLNLLLKVLEYIETPSYLRDELFNTLEKDYFNVDKLNSINSCQQQQPNDEQL 268
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNL----------- 229
+REGV ++ N ++V+VGL K V++ + L+PG R+T+ + + N
Sbjct: 269 YREGVVTNQHWKNQ--SIVNVGLEKLVLIPKRLNPGTRLTIDIKNSSNGDDDGTGGGGDG 326
Query: 230 -----DADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKG--GYDHLIGTSEH 282
D +++ PS K+ G YWG+ +R +++ V C Y+ YD++I S++
Sbjct: 327 NRETNDKYINGKIISPSDIKKEGYYWGHHIRLVDSLAKVASTCQYEDTQNYDYIIMHSQY 386
Query: 283 GDIVNS-SDLTL---PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVF--HSYFNTCP 336
G+ S +D+ + + HLL+ F A ++ + L EV ++ NT
Sbjct: 387 GEQFPSYTDVVINNAKKYNHLLVIFAD-ANASSTVNPEFKL----GSEVLVPDTHINTLV 441
Query: 337 HQGSRTIRTEEAIFISL 353
+ +R EE + I+L
Sbjct: 442 DFNIKKLRFEETLSITL 458
>gi|11499510|ref|NP_070751.1| hypothetical protein AF1926 [Archaeoglobus fulgidus DSM 4304]
gi|2648620|gb|AAB89330.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
Length = 256
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/293 (31%), Positives = 150/293 (51%), Gaps = 53/293 (18%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+A+P S + N + ++ T G IARA +FR+DE++++ +
Sbjct: 2 LSIAIPSSALINERDEKIKTYKVGLIARAAAVFRVDEIIIYYDPIL-------------- 47
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH-EWAPF 181
DE++ F+ +L+YLETPQYLRK +F + SL++ G+LPPL P H KH +
Sbjct: 48 --DESD----FIAEILEYLETPQYLRKHIFPLKKSLKYAGLLPPLAIPSHRSKHLKVGEL 101
Query: 182 REGVTLKENAPNSVGT-LVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
REGV ++ AP+ GT VDVG++ + + G RVTV + + L +
Sbjct: 102 REGV-VRHVAPD--GTRWVDVGIDALAPMKSDAEKGARVTVRICSKTPLRVEE------- 151
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
++P+E YWGY+V+ + + K +L+ TS ++V L R +
Sbjct: 152 AEPQE---YWGYRVKKL-RLEELVKR-------KNLVVTSRKCEVVKPPALR----RDVT 196
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ FG EE + E + ++ E+ +N P+QG++T+R EEAIF +L
Sbjct: 197 LVFG---NPEEGVFE---IAKRLGVEMEAECWNVVPNQGTKTVRLEEAIFATL 243
>gi|333987680|ref|YP_004520287.1| hypothetical protein MSWAN_1472 [Methanobacterium sp. SWAN-1]
gi|333825824|gb|AEG18486.1| protein of unknown function DUF171 [Methanobacterium sp. SWAN-1]
Length = 275
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 144/301 (47%), Gaps = 45/301 (14%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P SII T+ L + T G I R+ I++ D +VV+ + S+ +
Sbjct: 6 LSIFIPASIIAETKDLRIRTYKVGLIGRSAAIYKADRIVVYSDDSTKEE----------- 54
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
F+ +L Y+ TPQYLRK +F + L+ VG+LPPL PHH + +
Sbjct: 55 --------VKFISDVLTYMNTPQYLRKKVFPLTRELKNVGILPPLRTPHHPTGELSLGDY 106
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
R+G+T+K GT+VD+G + + + L ++ + + D+ ++ P
Sbjct: 107 RQGLTIKRTKK---GTMVDIGAERLALCKEKLSVNKILSFKV---EKIAKDN--ILIVPD 158
Query: 242 KPKESGMYWGYK-VRYAPNISSVFKNCSYKGGYDHLIGTSEHGD----IVNSSDLTLPTF 296
P+ YWGYK + NI + K D +IGTS + + +++ L
Sbjct: 159 IPE---FYWGYKTISTYKNIYQSIRMLDPKP--DLVIGTSRYAEPITSVLDEVKERLKGS 213
Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
+H I FGG +GL + I + + REV NT P QG++T+RTEEA+ +L
Sbjct: 214 KHTAILFGGPYSGLHDLISD------QKEREVIDLEVNTVPSQGTKTVRTEEAVISTLSV 267
Query: 356 F 356
F
Sbjct: 268 F 268
>gi|14591536|ref|NP_143618.1| hypothetical protein PH1779 [Pyrococcus horikoshii OT3]
gi|3258214|dbj|BAA30897.1| 258aa long hypothetical protein [Pyrococcus horikoshii OT3]
Length = 258
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 139/298 (46%), Gaps = 56/298 (18%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ T L+L GQIARA IF ++ + ++ A R R
Sbjct: 5 VFIPDSLLEETSDLKLKAYKVGQIARACAIFGVEHIWIY------------KAGGRDGR- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP---- 180
+ LL+Y+ETPQYLRK +F + LR+ G++PPL P H K + +P
Sbjct: 52 --------LIKTLLEYVETPQYLRKRIFPIMPELRYAGVMPPLQIPSH--KEKTSPRIGE 101
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
REG + D+GL++ +V + + G GT + + P +VVP
Sbjct: 102 IREGFAFRRGK----KIFADIGLDRPALVKGIAEEG------RGTFKIISV-KPLRVVPA 150
Query: 241 SKPKESGMYWGYKVRYA-PNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P YWGYKV ++ +++ KN D +I TS G V+ D +P +
Sbjct: 151 EPPG----YWGYKVHWSKKSLAKTLKNAD----LDVVIATSRRG--VDVRDAEVPLEGEV 200
Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
I FG G+ E +EE K K F NT P+Q +RT+RTEEA+ +L F
Sbjct: 201 GIVFGSPRKGVFEILEE---FKEKYE---FDLVLNTIPNQKTRTVRTEEAVLATLAIF 252
>gi|328871124|gb|EGG19495.1| DUF171 family protein [Dictyostelium fasciculatum]
Length = 509
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 70/224 (31%), Positives = 117/224 (52%), Gaps = 29/224 (12%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T S+A+P I+D+ S E+ T L IAR + +F IDEV+V+ RS
Sbjct: 182 TRSIAIPNDILDDFASEEMKTYLVEMIARTLVLFSIDEVIVY----------------RS 225
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHS-SLRFVGMLPPLDAPHHLRKHEWAP 180
+S S L +LL++++ P+ +R L+ ++ +FV L +++ H + W
Sbjct: 226 TKS--TNSTTDKLCKLLEFIDLPKNIRHHLYDLNDIDYQFVDKLKRVESGQHATSNRWTK 283
Query: 181 FREGVTLKENAPNSVGTLVDVGL--NKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-- 236
+REG+ L + + +LVDVGL K + +D+ L G+RVT+ M ++ Q
Sbjct: 284 YREGIVL--DKLSGKFSLVDVGLGDGKEIKIDKELQAGIRVTLEMEEEHGKQINNANQHR 341
Query: 237 ---VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
VV P++ + G YWGY+VR +I +VF++ S+ YD+ I
Sbjct: 342 KGKVVSPAEVRHDGHYWGYQVRSVDSIDNVFQDSSF-NAYDYKI 384
>gi|413952712|gb|AFW85361.1| hypothetical protein ZEAMMB73_404324 [Zea mays]
Length = 73
Score = 114 bits (285), Expect = 7e-23, Method: Composition-based stats.
Identities = 55/71 (77%), Positives = 62/71 (87%)
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
RHLLIAFGGLAGLEESIEED LK K A +VF SYFNTCP+QGSRTIRTEEA+ ISLQYF
Sbjct: 3 RHLLIAFGGLAGLEESIEEDTNLKGKRADDVFTSYFNTCPNQGSRTIRTEEALLISLQYF 62
Query: 357 QEPISRALRRV 367
Q+PI RA +++
Sbjct: 63 QDPIRRAEQKL 73
>gi|119719149|ref|YP_919644.1| hypothetical protein Tpen_0231 [Thermofilum pendens Hrk 5]
gi|119524269|gb|ABL77641.1| Protein of unknown function DUF171 [Thermofilum pendens Hrk 5]
Length = 273
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 97/300 (32%), Positives = 141/300 (47%), Gaps = 56/300 (18%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
+VAVP S N +A +ARAVTIFR++EV ++
Sbjct: 12 AVAVPSSTFSNLSEPARTFHVA-LLARAVTIFRVEEVAIY-------------------- 50
Query: 124 SDENESG--AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
ESG A L R+L LE PQYLRK L R++G++PPL +P HL + E + +
Sbjct: 51 ---QESGHPCAELKRILDALEVPQYLRKYLVPRSREYRYLGLVPPLRSPSHLLRGEESEY 107
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG + + LVDVGL V + D G RVT+ + R V
Sbjct: 108 REGYVVSRRGTKA---LVDVGLGNPVEAEVPTDAGRRVTL------RREGAGWRYV---- 154
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
P+E +YWGY+ + ++ ++ +G L+GTS V+ L L +
Sbjct: 155 SPEEVKVYWGYRTKCYSSLREALEHYRSRGFL--LVGTSRKAAPVS---LVLEEIKREAT 209
Query: 302 AFGGLA--------GLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
A G+A GL+E I E++G + ++ + NT P QG +TIRTEEA++ISL
Sbjct: 210 AKNGVAVFFGTWNKGLQE-ISEEEGFELESFLDFI---VNTAPIQGVKTIRTEEAVYISL 265
>gi|14520559|ref|NP_126034.1| hypothetical protein PAB2119 [Pyrococcus abyssi GE5]
gi|5457775|emb|CAB49265.1| Hypothetical protein PAB2119 [Pyrococcus abyssi GE5]
gi|380741086|tpe|CCE69720.1| TPA: hypothetical protein PAB2119 [Pyrococcus abyssi GE5]
Length = 258
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 134/298 (44%), Gaps = 56/298 (18%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ T +L GQIARA IF ++ + ++ A R R
Sbjct: 5 VFIPDSLLEETNDPKLKAYKVGQIARACAIFGVEHIWIY------------KAGGRDGR- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP---- 180
+ LL+Y ETPQYLRK LF + LR+ G++PPL P H K + +P
Sbjct: 52 --------LIKTLLEYAETPQYLRKRLFPIMPELRYAGVMPPLQIPSH--KEKGSPRIGE 101
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
REG + D+GL++ +V + D G GT + + P +VVP
Sbjct: 102 IREGFAFRRGK----KLFADIGLDRPALVKGIADEG------RGTFKIISV-KPLRVVPA 150
Query: 241 SKPKESGMYWGYKVRYA-PNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P YWGYKV ++ ++ KN D +I TS G V+ D +P +
Sbjct: 151 EPPG----YWGYKVHWSRKSLVKTLKNAD----LDVVIATSRRG--VDVRDAEVPLEGEV 200
Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
I FG G+ E LK N R F NT P Q ++T+RTEEA+ +L F
Sbjct: 201 GIVFGSPRKGVFEI------LKEFNERYEFDLVLNTIPGQKTKTVRTEEAVLATLAIF 252
>gi|282164889|ref|YP_003357274.1| hypothetical protein MCP_2219 [Methanocella paludicola SANAE]
gi|282157203|dbj|BAI62291.1| conserved hypothetical protein [Methanocella paludicola SANAE]
Length = 276
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 133/281 (47%), Gaps = 44/281 (15%)
Query: 78 LELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRL 137
L + T GQ+ARA +IFR+DE+V++ D + F+ +
Sbjct: 28 LRIKTYKVGQVARAASIFRVDEIVIYKTPIRDD--------------------SKFISTV 67
Query: 138 LQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGT 197
L+Y ETPQYLRK +F M LR +G++PPL P H E +REG+ K + +
Sbjct: 68 LRYAETPQYLRKEIFPMQDDLRHIGVIPPLRTPAHTVT-ENEEYREGIVTKVGSDGN--A 124
Query: 198 LVDVGLNKHVVVDQV--LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVR 255
VDVG + ++ + G RVTV + + R L V K + +YWGY VR
Sbjct: 125 WVDVGSDSPAMLRDAKNVRKGQRVTVRIYSRRPL-------TVHLVKRSDVPLYWGYDVR 177
Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS---SDLTLPTFRHLLIAFGGLAGLEES 312
+ + + I TS G+ + S++ T + +AFG + E
Sbjct: 178 IVNTLHDALETEGLR------IATSRLGEPLACEKLSEIKSNTRDKVCVAFGSPSMGLEQ 231
Query: 313 IEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ D+G K ++ + N+ PHQG+ T+R+EEA++I+L
Sbjct: 232 LLHDEGHKLEDHSDCV---VNSIPHQGTATVRSEEAVYITL 269
>gi|330796272|ref|XP_003286192.1| hypothetical protein DICPUDRAFT_150135 [Dictyostelium purpureum]
gi|325083862|gb|EGC37304.1| hypothetical protein DICPUDRAFT_150135 [Dictyostelium purpureum]
Length = 471
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 83/312 (26%), Positives = 150/312 (48%), Gaps = 40/312 (12%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+P + + +S E+ T + I+R VT+ +IDE++V+ +
Sbjct: 171 TVSIAIPSDCLKDIESEEMKTYVIEMISRQVTMNKIDEIIVY------------KTDDYD 218
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALF-SMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+ + +S L+++L+Y+ETP YLR+ LF ++ +V L L++ +
Sbjct: 219 DTDNTQQSNLNLLLKILEYIETPSYLREELFNTLDKDFFYVDKLNRLNSTQEEVNADDKL 278
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR----- 235
+REGV + N +++ VGL K V++ + L+PG RVT+ M + + +
Sbjct: 279 YREGVVTDQQFRNQ--SIISVGLEKQVLISKRLNPGTRVTIDMKQENFTEEELSKIINTT 336
Query: 236 -------QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKG--GYDHLIGTSEHGDIV 286
+V+ P K+ G YWG+ VR ++ + C YK YD+ I S+ G+
Sbjct: 337 DEQYTLGRVISPQNIKKEGYYWGHHVRLVNSMDEIASTCEYKDTKTYDYSILLSDFGEQF 396
Query: 287 NS-SDLTLPT---FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVF-HSYFNTCPHQGSR 341
S +D L + HLLI F + ++ + R N+ + ++ NT + +
Sbjct: 397 PSPTDPMLNNAKRYNHLLIIFAN------NHKKFNPADRLNSEVLIPDTHINTLFNFNNV 450
Query: 342 TIRTEEAIFISL 353
IR EE +F++L
Sbjct: 451 PIRFEENLFVTL 462
>gi|84489709|ref|YP_447941.1| hypothetical protein Msp_0910 [Methanosphaera stadtmanae DSM 3091]
gi|84373028|gb|ABC57298.1| conserved hypothetical protein [Methanosphaera stadtmanae DSM 3091]
Length = 279
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 148/295 (50%), Gaps = 47/295 (15%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ VP S + ++ ++ T G I R +FR + +V++++ +SD SR A
Sbjct: 12 LSIFVPNSFLAESKDSKIRTYKVGLIGRYAALFRANNIVIYND--NSDGGSRDDAL---- 65
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAP 180
++ +L+Y++TPQYLRK +F + L+ VG+LPPL PHH E
Sbjct: 66 ----------YMKTILEYMDTPQYLRKQVFPITPELKNVGILPPLRTPHHPATDEANVGD 115
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
FR+G+T K GT+VD+G+ + + + +++V + ++ ++ P
Sbjct: 116 FRKGLTTKRVRK---GTMVDIGVGRLALCKE------KLSVNKVLSFRIEKLGKEILIEP 166
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCS-YKGGYDHLIGTSEHGDIVNS----SDLTLPT 295
+P +YWGY V + S+++ + + K D +IGTS++ +NS ++
Sbjct: 167 DEP--DSVYWGYDVITSD--SNLYDSITMMKNKPDLVIGTSKYAPNINSILDEVQTSIQQ 222
Query: 296 FRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
H+ I FGG +G+ I E RK EV NT P QG+ T+RTEEA+
Sbjct: 223 ANHVAILFGGPYSGINSLINE-----RKLDYEV-----NTVPKQGTETVRTEEAV 267
>gi|213407844|ref|XP_002174693.1| DUF171 family protein [Schizosaccharomyces japonicus yFS275]
gi|212002740|gb|EEB08400.1| DUF171 family protein [Schizosaccharomyces japonicus yFS275]
Length = 326
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 111/228 (48%), Gaps = 23/228 (10%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+A+ + + +L L T L +IA + F + EV++ + K +
Sbjct: 39 TVSLAIDTGCVLHETNLLLRTALVWRIAGLLLTFNVTEVILVEPKKEAAG---------- 88
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH---LRKHEW 178
+ E ++ F+ LL YLE P YLRKALF ++ +LRF P L PHH +
Sbjct: 89 --ALEVKNCTEFVETLLSYLEAPPYLRKALFPLNPALRFTSSFPLLAFPHHSTLTTSAQS 146
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM------GTNRNLDAD 232
+REGV TLV GL+K V V G RVTV++ +L++D
Sbjct: 147 VRYREGVVEGSEPYVRNKTLVYAGLDKPVRVPGKYPRGTRVTVSLKQVPKKTAKSSLNSD 206
Query: 233 SPR-QVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIG 278
VV S P+E G+YWGY +R A + +CSYKGGY H+IG
Sbjct: 207 CFEGDVVSSSAPREKGGLYWGYTIRRAQSWKDAVSSCSYKGGYSHVIG 254
>gi|20093577|ref|NP_613424.1| hypothetical protein MK0137 [Methanopyrus kandleri AV19]
gi|19886433|gb|AAM01354.1| Uncharacterized conserved protein [Methanopyrus kandleri AV19]
Length = 255
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 140/292 (47%), Gaps = 48/292 (16%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
V+VA P S+ ++ G +ARA+ I+R++EV ++ + R N
Sbjct: 2 VAVAFPWSLFSEETDPKIYAYRVGTLARALAIYRVEEVYLYGD----------GVGTRRN 51
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
A L +LL+Y E PQYLRK +F + LR+ G++PPL APHH + +
Sbjct: 52 --------AERLRKLLEYQECPQYLRKRVFRLDRDLRYAGVMPPLRAPHHKVHSPKEGEV 103
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REG ++ + G LVDVG ++ P RVTV + + L+ + P+
Sbjct: 104 REGYVVRRSRN---GALVDVGADRLARTRWRFKPHERVTVRIVSEDPLEVE-------PA 153
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
+P+E YWGY+VR ++ V + +K G +I TS +G+ V + P +
Sbjct: 154 EPEE---YWGYRVRIVNELNEVLRE--FKEG---IIVTSRYGEDVREVEFKSPV---KCL 202
Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
FG + ++ D G++ + + N P+QG + +RTEEA+ +L
Sbjct: 203 VFG--SSEVSVLDVDPGVRDE------YPVINFVPNQGVQVVRTEEAVHTTL 246
>gi|19112370|ref|NP_595578.1| DUF171 family protein [Schizosaccharomyces pombe 972h-]
gi|74626596|sp|O13641.1|YGWG_SCHPO RecName: Full=Uncharacterized protein C8D2.16c
gi|2257535|dbj|BAA21429.1| pi049 [Schizosaccharomyces pombe]
gi|2950503|emb|CAA17831.1| DUF171 family protein [Schizosaccharomyces pombe]
Length = 315
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 135/271 (49%), Gaps = 26/271 (9%)
Query: 48 KRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKS 107
K ++ P T+S+A+P S ++ +L+L T +I+R V+++ IDE+++ ++
Sbjct: 3 KESEIQSLPPTRRYTISLALPISSLNVAYNLQLKTSFVWKISRIVSLYGIDEIILLEDPE 62
Query: 108 SSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLP-- 165
N + ++ + D + FL LL Y ETP ++RK LF ++ L++ P
Sbjct: 63 YVQNTQVHTLSSDAYLKDPTK----FLTDLLCYFETPFFMRKELFPLNPHLKYTSCFPLL 118
Query: 166 PLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVD--QVLDPGVRVTVAM 223
PL + P+REG+ + +++ GL+ +V+V VL P RVTV
Sbjct: 119 PLRNDKASTVNIEFPYREGIVTHPSPQAKNKYIINAGLSHNVIVSSPSVLAPRTRVTV-- 176
Query: 224 GTNRNLDADSPRQ-------VVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDH 275
L A SP + +V S P+E G YWGYKVR + S + K+ YKGGYD
Sbjct: 177 ----RLKAQSPNEEGQLQGDIVSFSAPREKGGHYWGYKVRSCLS-SQLCKSSPYKGGYDF 231
Query: 276 LIGTSEHGDIVNSSDL--TLP-TFRHLLIAF 303
++ + + S +L +LP +FRH ++
Sbjct: 232 VVQINSQTSAITSKELEASLPSSFRHAVLVL 262
>gi|302841751|ref|XP_002952420.1| hypothetical protein VOLCADRAFT_93008 [Volvox carteri f.
nagariensis]
gi|300262356|gb|EFJ46563.1| hypothetical protein VOLCADRAFT_93008 [Volvox carteri f.
nagariensis]
Length = 224
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 76/224 (33%), Positives = 112/224 (50%), Gaps = 46/224 (20%)
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADS----PR 235
P+REGV LK G+ VDVGL++ V ++ L R+TV +G NL+ ++ P
Sbjct: 2 PYREGVVLKSEP--GAGSYVDVGLDRMVWIEHELPQATRLTVHLG---NLEPETRFMTPY 56
Query: 236 -------QVVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSY---KGGYDHLIGTSEHGD 284
+VVPP++P+E G+YWGY VR A + VFK+ + G YD IG+S G
Sbjct: 57 SETMIVGKVVPPARPREQLGLYWGYSVRLALGLQRVFKDAAVTRGSGKYDLTIGSSPLGA 116
Query: 285 IVNSSDLTLPTFRHLLIAFGGLAG--------------------------LEESIEEDDG 318
+ L LPTF+H L+ FGG AG L++ +
Sbjct: 117 HTDPLSLVLPTFKHALLVFGGGAGGASGGAGGPGSTSSSSSSTRAVPEQDLDQLVPRVPD 176
Query: 319 LKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISR 362
K ++VF+ Y NT PH G+ +RTE+A+ I+L Y P+ +
Sbjct: 177 WGHKQPQDVFNLYLNTAPHLGTLRLRTEDAVPIALSYLITPLLK 220
>gi|57641478|ref|YP_183956.1| hypothetical protein TK1543 [Thermococcus kodakarensis KOD1]
gi|57159802|dbj|BAD85732.1| hypothetical protein, conserved, DUF171 family [Thermococcus
kodakarensis KOD1]
Length = 256
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/295 (30%), Positives = 140/295 (47%), Gaps = 52/295 (17%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ T ++ T GQ+ARA +IF ++ + ++ A R +
Sbjct: 5 VFIPDSLLEETDDPKIRTYKVGQVARACSIFGVEHIWIY------------RAGGRDGK- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWAPFR 182
F+ +L+Y ETPQYLRK LF + L++VG++PPL PHH K R
Sbjct: 52 --------FIKTVLEYAETPQYLRKRLFPLMPELKYVGVIPPLRTPHHKLKGRPRVGEIR 103
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
EG + T D+GL++ VV+ ++ R T + + R P +VV P+K
Sbjct: 104 EGFAFRRGK----RTYADIGLDELAVVEGNVEG--RATFKIVSTR------PLRVV-PTK 150
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P++ YWGY+V S+ K K G D +I TS G + ++ P +
Sbjct: 151 PED---YWGYRVHLTRK--SLAKTLK-KAGLDLVIATSRKGRDIREVEIP-PLEGEVGFV 203
Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
FG G+ E + E+ + FH NT P+Q + T+RTEEA+ +L F
Sbjct: 204 FGSPRKGVMELLGEE--------KFEFHLILNTIPNQRTETVRTEEALLATLAIF 250
>gi|389853118|ref|YP_006355352.1| hypothetical protein Py04_1708 [Pyrococcus sp. ST04]
gi|388250424|gb|AFK23277.1| hypothetical protein Py04_1708 [Pyrococcus sp. ST04]
Length = 258
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 130/295 (44%), Gaps = 50/295 (16%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ T +L GQIARA IF ++ + ++ R+
Sbjct: 5 VFIPDSLLEETNDPKLKAYKVGQIARACAIFGVEHIWIY----------------RAGGK 48
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWAPFR 182
D + LL+Y ETPQYLRK LF + L++ G++PPL P H K R
Sbjct: 49 D-----GKLIKTLLEYAETPQYLRKRLFPLMPELKYAGVMPPLQIPSHKPKTTPRIGEIR 103
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
EG ++ D+GL++ ++ + + G GT + + P +VVP
Sbjct: 104 EGYAFRKGK----KLFADIGLDRPALIKGIAEEG------RGTFKIISV-KPLRVVPSDP 152
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P YWGYKV ++ K D +I TS G V+ D +P + I
Sbjct: 153 PD----YWGYKVHWS---RKTLAKTLKKAELDVVIATSRRG--VDVRDAEVPLEGEVGIV 203
Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
FG G+ E +E + N + F NT P Q ++T+RTEEA+ +L F
Sbjct: 204 FGSPRMGIMEILE------KYNEKFEFDLIVNTIPGQKTKTVRTEEAVLATLAIF 252
>gi|337283643|ref|YP_004623117.1| hypothetical protein PYCH_01360 [Pyrococcus yayanosii CH1]
gi|334899577|gb|AEH23845.1| hypothetical protein PYCH_01360 [Pyrococcus yayanosii CH1]
Length = 298
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 128/295 (43%), Gaps = 50/295 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T V +P S+++ T +L GQIARA IF ++ + ++ A R
Sbjct: 42 TWHVFIPDSLLEETHDPKLRAYKVGQIARAAAIFGVEHIWIY------------RAGGRD 89
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWA 179
R F+ LL+Y ETPQYLRK LF + LR+ G++PPL P H K
Sbjct: 90 GR---------FIKTLLEYAETPQYLRKRLFPLMPELRYAGVMPPLQIPSHKPKESPRIG 140
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
REG ++ D+GL++ +V V G GT + + R V
Sbjct: 141 EIREGFAYRKGKK----LFADIGLDRPALVKGVAQEG------RGTFKVISVKPLRVV-- 188
Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P+KP+ YWGYKV K D +I TS G V +D+TL +
Sbjct: 189 PTKPEG---YWGYKVHLT---RMTLAKTLKKANLDVVIATSRKGVDVRRADVTLEG--DV 240
Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
I FG G+ E + E N F NT P Q ++T+RTEEA+ +L
Sbjct: 241 GIVFGSPRRGVMEILREF------NEDYEFDVILNTIPGQKTKTVRTEEAVLATL 289
>gi|223477732|ref|YP_002582390.1| methylase [Thermococcus sp. AM4]
gi|214032958|gb|EEB73786.1| methylase [Thermococcus sp. AM4]
Length = 255
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 135/300 (45%), Gaps = 63/300 (21%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ T ++ T GQ+ARA IF ++ + ++ A R +
Sbjct: 5 VFIPDSLLEETDDPKIRTYKVGQVARACAIFGVEHIWIY------------RAGGRDGK- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LR-KHEWAPFR 182
F+ +L+Y+ETPQYLRK LF + LR+VG++PPL PHH L+ K + R
Sbjct: 52 --------FIKTVLEYMETPQYLRKRLFPLMPELRYVGVVPPLRTPHHKLKGKPKVGEIR 103
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
EG ++ D+GL++ +V+ ++ R T + + R L V P+K
Sbjct: 104 EGYAFRKGR----KIYADIGLDELAIVEGNVEG--RATFRIVSTRPLK-------VVPTK 150
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P E YWGYKV + K D I TS G V +L P + IA
Sbjct: 151 PVE---YWGYKVHLS---GKPLAKTLKKAHLDLAIATSRKGRDVRKVNLP-PLEGEVGIA 203
Query: 303 FGGLAGLEESIEEDDGLKRKNAREV------FHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
FG RK E+ F NT P+Q ++T+RTEEA+ +L F
Sbjct: 204 FGS--------------PRKGVMELLGGEYEFDFVLNTIPNQKTKTVRTEEALLATLAIF 249
>gi|393796609|ref|ZP_10379973.1| hypothetical protein CNitlB_09876 [Candidatus Nitrosoarchaeum
limnia BG20]
Length = 270
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 140/299 (46%), Gaps = 48/299 (16%)
Query: 63 VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+S+A+P S + + +SL+L TR +ARA +IF+ID + V+ NY
Sbjct: 3 ISIAIPESALSD-ESLKLDKTRKISVLARACSIFKIDTIYVYQ----EGNY--------- 48
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
+ D N LV +L+YLETPQ+LR+ LF + L+F G+L PL P H +K
Sbjct: 49 -KEDGN-----LLVTILKYLETPQFLRRRLFPKMNELKFAGVLLPLRIPSHATPANSKKI 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REGV + VDVG+N+ + D G RVTV + P
Sbjct: 103 NIGDVREGVVISIKGKR----FVDVGINEAIPFFGKTDVGKRVTV------QFKSGYPDF 152
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LP 294
V + E+ +YWGY V+ N+ S+ ++G + I TS G LT L
Sbjct: 153 SVKEIQRNETPLYWGYTVKERSNLFSLL--TEWRG---NTIITSRKGRTATKEQLTKYLS 207
Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ +L+ FG E+ + E G K KN + N P+Q + T+R EEA+ +L
Sbjct: 208 SELPILVVFG---SPEKGVHEILGGKMKNIQNA--KTLNFFPNQATETVRLEEALLGTL 261
>gi|21064729|gb|AAM29594.1| RH40775p [Drosophila melanogaster]
Length = 250
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 58/136 (42%), Positives = 80/136 (58%), Gaps = 1/136 (0%)
Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
+V P +P+ E+G+YWGY+VR A ++S +F Y GYD +GTS+ G V+
Sbjct: 15 LVSPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYN 74
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
F H+LI FGGL GLE ++ D+ L + +F Y N P QGSRTIRTEEA+ I+L
Sbjct: 75 FNHMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAA 134
Query: 356 FQEPISRALRRVEYKS 371
QE + + VE S
Sbjct: 135 LQEKLQPQVADVEIDS 150
>gi|315229855|ref|YP_004070291.1| methylase [Thermococcus barophilus MP]
gi|315182883|gb|ADT83068.1| methylase [Thermococcus barophilus MP]
Length = 258
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 134/309 (43%), Gaps = 72/309 (23%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+ + +P S+++ T ++ T GQIARA IF ++ + ++ A +
Sbjct: 2 TLHIFIPDSLLEETADPKIRTYKVGQIARAAAIFGVEHIWIY------------KAGGKD 49
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWA 179
+ F+ +L+Y ETPQYLRK LF + L++VG++PPL PHH K +
Sbjct: 50 GK---------FIKLILEYAETPQYLRKTLFPIRKELKYVGVIPPLRTPHHKLKGRPKLG 100
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNR---NLDADSPRQ 236
REGV +K+ D+GL++ +V+ G R + + +P +
Sbjct: 101 EIREGVIIKKGK----RLYADIGLDELALVE-----------GSGEGRMTFKIVSVNPLK 145
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
VV P+KP E YWGY+V + K D I TS G+ V
Sbjct: 146 VV-PAKPAE---YWGYRVHLT---NKPLAKTLKKARLDLAIATSRKGEDVRKVK------ 192
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSY---------FNTCPHQGSRTIRTEE 347
L LE I G RK E+ + NT P+Q ++T+RTEE
Sbjct: 193 ---------LPPLEGDIGFVFGSPRKGVMEILRDFNEDYPFDLILNTIPNQKTKTVRTEE 243
Query: 348 AIFISLQYF 356
A+ +L F
Sbjct: 244 AVLATLAVF 252
>gi|330508499|ref|YP_004384927.1| hypothetical protein MCON_2704 [Methanosaeta concilii GP6]
gi|328929307|gb|AEB69109.1| conserved hypothetical protein [Methanosaeta concilii GP6]
Length = 279
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 132/299 (44%), Gaps = 46/299 (15%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
++ +P S+ ++ + T GQIARA +FR+ +V++ ++ +D
Sbjct: 8 AILIPSSLTMESRDTRVNTLKVGQIARAAAVFRVSRIVIYRDQEYND------------- 54
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-----LRKHEW 178
+ F+ +L+Y+ETPQYLRK L LR VGMLPPL HH +
Sbjct: 55 -------SRFISMILRYMETPQYLRKLLIPRKEELRHVGMLPPLRTSHHPINSKSESLKI 107
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
FR G + E+ + G V++G+N+ + L G + V N + + P
Sbjct: 108 GEFRVGAVV-ESVGSDGGVWVEIGINRPI----PLRTGEKYPVGQRLNVRIFSQEPLAAE 162
Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF-- 296
P + K+ YWGY+ ++ S ++ ++ S G + L
Sbjct: 163 PVDQ-KDIPHYWGYETEVVDSLESYLESMRDT----RIVLASRSGRAITPQALLELVQSG 217
Query: 297 --RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
R L +AFG A ++ + +KR + NT PHQG+ T+R EEA+F +L
Sbjct: 218 NKRDLAVAFGSPARGLDAFLSKESMKR-------YEMINTIPHQGTETVRVEEAVFATL 269
>gi|352682603|ref|YP_004893127.1| hypothetical protein TTX_1417 [Thermoproteus tenax Kra 1]
gi|350275402|emb|CCC82049.1| conserved hypothetical protein [Thermoproteus tenax Kra 1]
Length = 270
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 87/293 (29%), Positives = 130/293 (44%), Gaps = 44/293 (15%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+ VA+P ++ E R G IARA IFR+++++++
Sbjct: 1 MDVAIPHDVLSEAPDEESKVRKLGYIARAAAIFRVEKIIIY------------------T 42
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
D + G + LL+Y TP +LR+ +F + LR G+LPPL P HL R
Sbjct: 43 YGDVDHEGVEEMRLLLEYAATPPHLRRKVFRLDKRLRLAGLLPPLKIPSHLAPRSPAVGD 102
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
EG+ + + S +V +G K V + G R+ V + R D D R VV
Sbjct: 103 VVEGIVERWDGYYS---MVYIGAGKWAKVPKPYPVGSRLIVKLEAQR--DKDVFRGVVLK 157
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
+ P YWGYKV V GYD++I T + G V L P + L+
Sbjct: 158 NPPD----YWGYKV-------EVKGLRELADGYDYVILTGKEGRSV-CEILPRPEGKTLV 205
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ G AG++E + ++ E+ + N P QG+ TIRTEEA+FI L
Sbjct: 206 VFGGPRAGVDE-------IMKREGVELRGLFLNFAPKQGTETIRTEEAMFIVL 251
>gi|332158025|ref|YP_004423304.1| hypothetical protein PNA2_0383 [Pyrococcus sp. NA2]
gi|331033488|gb|AEC51300.1| hypothetical protein PNA2_0383 [Pyrococcus sp. NA2]
Length = 258
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 132/298 (44%), Gaps = 56/298 (18%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ T +L GQIARA IF ++ + ++ A + R
Sbjct: 5 VFIPDSLLEETNDPKLKAYKVGQIARACAIFGVEHIWIY------------KAGGKDGR- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP---- 180
F+ LL+Y ETPQYLRK LF + LR+ G++PPL P H K + +P
Sbjct: 52 --------FIKTLLEYAETPQYLRKRLFPIMPELRYAGVMPPLQIPSH--KEKTSPRIGE 101
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
REG + D+GL + +V + + G GT + + P +VVP
Sbjct: 102 IREGFAFRRGK----KLFADIGLERPALVKGIAEEG------RGTFKIISV-KPLRVVPA 150
Query: 241 SKPKESGMYWGYKVRYA-PNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P YWGYKV + ++ KN D +I TS G V+ + +P +
Sbjct: 151 EPP----TYWGYKVHLSRKTLAKTLKNAD----LDVVIATSRLG--VDVREAEVPLEGEV 200
Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
I FG G+ E L+ N F NT P Q ++T+RTEEA+ +L F
Sbjct: 201 GIVFGSPKKGVFEI------LRSFNEEFEFDLVLNTIPGQKTKTVRTEEAVLATLAIF 252
>gi|240104061|ref|YP_002960370.1| hypothetical protein TGAM_2004 [Thermococcus gammatolerans EJ3]
gi|239911615|gb|ACS34506.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
Length = 255
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 135/300 (45%), Gaps = 63/300 (21%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ T ++ T GQ+ARA IF ++ + ++ A R +
Sbjct: 5 VFIPDSLLEETDDPKIRTYKVGQVARACAIFGVEHIWIY------------RAGGRDGK- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LR-KHEWAPFR 182
F+ +L+Y+ETPQYLRK LF + LR+VG++PPL PHH L+ K + R
Sbjct: 52 --------FIKTVLEYMETPQYLRKRLFPLMPELRYVGVVPPLRTPHHKLKGKPKVGEIR 103
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
EG ++ D+GL++ +V+ ++ R T + + R L V P+K
Sbjct: 104 EGYAFRKGR----KIYADIGLDELAIVEGNIEG--RATFRVVSTRPLK-------VVPAK 150
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P E YWGY+V + K D I TS G V +L P + IA
Sbjct: 151 PVE---YWGYRVHLS---GKPLAKTLKKAHLDLAIATSRKGRDVRKVNLP-PLEGEVGIA 203
Query: 303 FGGLAGLEESIEEDDGLKRKNAREV------FHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
FG RK E+ F NT P+Q ++T+RTEEA+ +L F
Sbjct: 204 FGS--------------PRKGVMELLGGEYEFDFVLNTIPNQKTKTVRTEEALLATLAIF 249
>gi|386876304|ref|ZP_10118426.1| hypothetical protein BD31_I1465 [Candidatus Nitrosopumilus salaria
BD31]
gi|386805880|gb|EIJ65377.1| hypothetical protein BD31_I1465 [Candidatus Nitrosopumilus salaria
BD31]
Length = 270
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 138/299 (46%), Gaps = 48/299 (16%)
Query: 63 VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+SVA+P S + + +SL++ TR +ARA IF+ID + V+ ++S++
Sbjct: 3 LSVAIPDSALSD-ESLKIDKTRKISVLARACAIFKIDTIYVYQEGNNSND---------- 51
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
+ +V +L+YLETPQ+LR+ LF + L+F G+L PL P H+ +K
Sbjct: 52 ---------GSLMVMILKYLETPQFLRRRLFPKMNDLKFAGVLQPLKIPSHITPTNSKKI 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + VDVG+N+ + G RVT+ P+
Sbjct: 103 NAGDVREGIVVSVKGKR----FVDVGINQLIPFFGDTSIGKRVTL------RFKEGYPKL 152
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
V ES YWGY V+ N+ S+ +KGG +I TS G + T
Sbjct: 153 SVKEIDRSESPDYWGYTVKERSNLYSIL--ADWKGG---IIITSRKGKTATKEQIAKYTK 207
Query: 297 --RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ +LI FG E+ + E G K KN + N P+Q + T+R EEA+ +L
Sbjct: 208 SDQPVLIVFG---SPEKGVHEIIGGKMKNVQNA--KALNFFPNQATETVRLEEALLGTL 261
>gi|45552535|ref|NP_995790.1| CG12128, isoform B [Drosophila melanogaster]
gi|45445612|gb|AAS64882.1| CG12128, isoform B [Drosophila melanogaster]
Length = 250
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/133 (42%), Positives = 79/133 (59%), Gaps = 1/133 (0%)
Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
+V P +P+ E+G+YWGY+VR A ++S +F Y GYD +GTS+ G V+
Sbjct: 15 LVSPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYN 74
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
F H+LI FGGL GLE ++ D+ L + +F Y N P QGSRTIRTEEA+ I+L
Sbjct: 75 FNHMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAA 134
Query: 356 FQEPISRALRRVE 368
QE + + VE
Sbjct: 135 LQEKLQPQVADVE 147
>gi|212223210|ref|YP_002306446.1| hypothetical protein TON_0065 [Thermococcus onnurineus NA1]
gi|212008167|gb|ACJ15549.1| hypothetical protein, conserved, DUF171 family [Thermococcus
onnurineus NA1]
Length = 305
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 134/295 (45%), Gaps = 52/295 (17%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
+ +P S+++ T ++ T GQIARA IF I+ + ++ A + R
Sbjct: 54 IFIPDSLLEETDDPKIRTYKVGQIARAAAIFGIEHIWIY------------RAGGKDGR- 100
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LR-KHEWAPFR 182
F+ +L+Y ETPQYLRK LF + L++VG++PPL PHH L+ K + R
Sbjct: 101 --------FIKLILEYAETPQYLRKRLFPLMPELKYVGVIPPLRTPHHKLKGKPKVGEIR 152
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
EG + D+GL++ +V D R T + + R L V P+K
Sbjct: 153 EGFAFRRGR----RVYADIGLDELALVGG--DVEGRATFRIVSTRPLR-------VIPAK 199
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P+E YWGY+V S+ K K G D I TS G + L P +
Sbjct: 200 PEE---YWGYRVHLTKK--SLAKTLK-KAGLDLAIATSRKGQDIREVKLP-PLEGEVGFI 252
Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
FG G+ E + E+D F NT P+Q + T+RTEEA+ +L F
Sbjct: 253 FGSPRKGVMELLGEED--------YSFDLILNTIPNQRTATVRTEEAVLATLAVF 299
>gi|386001186|ref|YP_005919485.1| hypothetical protein Mhar_0477 [Methanosaeta harundinacea 6Ac]
gi|357209242|gb|AET63862.1| hypothetical protein Mhar_0477 [Methanosaeta harundinacea 6Ac]
Length = 279
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 136/300 (45%), Gaps = 49/300 (16%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P S ++ + T G IARA +FR+D +V++
Sbjct: 8 SILIPSSATMESRDERVRTLKVGMIARAAAVFRVDRIVIY-------------------- 47
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP--- 180
DE + F+ R+LQY ETPQYLRK LF +LR+VG+LPPL PHH + + +
Sbjct: 48 RDEEFDDSRFISRVLQYAETPQYLRKRLFPRERALRYVGILPPLRTPHHPKSSKVSTLKV 107
Query: 181 --FREGVTLKENAPNSVGTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTNRNLDADSPRQ 236
+R G+ + E + G V++G+ + + + G R+ V + + + L A+
Sbjct: 108 GEYRVGLVVDEVGSDD-GAWVEIGVERPLPLRTAKRFSAGQRLNVRVFSLKPLAAE---- 162
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
P E YWGY+ P++ ++ +GT D++ + +
Sbjct: 163 ---PVDRSEIPHYWGYETEVIPDLDEHVRSSGRLVVATSRLGTPATPDLL-ARVVRQGQG 218
Query: 297 RHLLIAFGGLA-GLEE--SIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
R L + +G A G++ S E +G NT PHQG+ T+R EEAIF +L
Sbjct: 219 RGLELVYGSPARGVDAFLSSETMEGC----------CVINTIPHQGTETVRLEEAIFATL 268
>gi|327400873|ref|YP_004341712.1| hypothetical protein Arcve_0986 [Archaeoglobus veneficus SNP6]
gi|327316381|gb|AEA46997.1| protein of unknown function DUF171 [Archaeoglobus veneficus SNP6]
Length = 261
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 144/299 (48%), Gaps = 57/299 (19%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+A+P S + N + ++ GQIARA IFR+DE++++ R +
Sbjct: 3 SIAIPSSALINERDEKIKAYKVGQIARAAAIFRVDEIIIY----------------RDPK 46
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH-EWAPFR 182
DE++ F+ +LQYLETPQYLRK L + SL++ G+LPPL P H K + R
Sbjct: 47 LDESD----FIADVLQYLETPQYLRKYLIPIKPSLKYAGVLPPLRIPSHRPKDLKIGEVR 102
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQ-----VLDPGVRVTVAMGTNRNLDADSPRQV 237
+GV ++ P+ VDVG+ ++ + L G RVTV + + + L +
Sbjct: 103 DGV-VRRVGPDGTA-WVDVGVKALALLKREKSRYKLKSGARVTVRVCSTKPLVVEE---- 156
Query: 238 VPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG---DIVNSSDLTLP 294
+P+E YWGYKVR A + V + + ++ TS+ I + L P
Sbjct: 157 ---VEPREVEEYWGYKVRKA-ELDEVLRR-------EEVVLTSKRCHKPGIGEIASLKSP 205
Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
T + FG EE + E + + ++ +N P+QG+ T+R EEA+ +L
Sbjct: 206 T-----LVFGSP---EEGVHE---IAARLGVKLPSRCWNIVPNQGTETVRLEEAVIAAL 253
>gi|329765070|ref|ZP_08256654.1| hypothetical protein Nlim_0412 [Candidatus Nitrosoarchaeum limnia
SFB1]
gi|329138447|gb|EGG42699.1| hypothetical protein Nlim_0412 [Candidatus Nitrosoarchaeum limnia
SFB1]
Length = 270
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 139/299 (46%), Gaps = 48/299 (16%)
Query: 63 VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+S+A+P S + + +SL+L TR +ARA +IF+ID + V+ NY
Sbjct: 3 ISIAIPESALSD-ESLKLDKTRKISVLARACSIFKIDTIYVYQ----EGNY--------- 48
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
+ D N LV +L+YLETPQ+LR+ LF + L+F G+L PL P H +K
Sbjct: 49 -KEDGN-----LLVTILKYLETPQFLRRRLFPKMNELKFAGVLLPLRIPSHATPANSKKI 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REGV + VDVG+N+ + D G RVTV + P
Sbjct: 103 NIGDVREGVVISIKGKR----FVDVGINEVIPFFGKTDVGKRVTV------QFKSGYPDF 152
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LP 294
V + E+ +YWGY V+ N+ S+ ++G + I TS G LT L
Sbjct: 153 SVKEIQRNETPLYWGYTVKERSNLFSLL--TEWRG---NTIITSRKGRTATKEQLTKYLR 207
Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ +L+ FG E+ + E G K KN + N P Q + T+R EEA+ +L
Sbjct: 208 SDLPILVVFG---SPEKGVHEILGGKMKNIQNA--KTLNFFPSQATETVRLEEALLGTL 261
>gi|304314252|ref|YP_003849399.1| RNA-binding protein [Methanothermobacter marburgensis str. Marburg]
gi|302587711|gb|ADL58086.1| predicted RNA-binding protein [Methanothermobacter marburgensis
str. Marburg]
Length = 266
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 138/301 (45%), Gaps = 56/301 (18%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P S+ T L++ T G I RA IF ++ +V++
Sbjct: 6 LSIFIPDSLTAETGDLKIKTYKVGLIGRAAAIFGVNRIVIY------------------- 46
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
D+ + A F+ +L Y++TPQYLR+ +F + L+ VG+LPPL PHH K +
Sbjct: 47 -HDDADGEAGFIRDILNYMDTPQYLRRKVFPIMKELKHVGILPPLRTPHHPTGKPVAGEY 105
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
R+G+T+K GTLVD+G +K + + ++TV + + ++ P
Sbjct: 106 RQGLTVKRVKK---GTLVDIGADKLALCRE------KLTVNRVMSFRVIRLGKEILIEPD 156
Query: 242 KPKESGMYWGYKVRYAPN-ISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
+P++ YWGY+V +S K D ++ TS + I++ +
Sbjct: 157 EPEDR--YWGYEVLSTEQGLSKSLKTVD----ADVVVATSRYASPITSILDEVKSKVEGA 210
Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
R + I FGG GL E I+ D + NT P Q + T+RTEEA+ +L
Sbjct: 211 RRVAILFGGPYKGLPE-IDAD-------------VWVNTIPGQRTETVRTEEAVLATLSV 256
Query: 356 F 356
F
Sbjct: 257 F 257
>gi|325969276|ref|YP_004245468.1| hypothetical protein VMUT_1765 [Vulcanisaeta moutnovskia 768-28]
gi|323708479|gb|ADY01966.1| hypothetical protein VMUT_1765 [Vulcanisaeta moutnovskia 768-28]
Length = 292
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 133/298 (44%), Gaps = 38/298 (12%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
P + +A+P + + A R G I R IF++ ++V+ ++ + D
Sbjct: 14 PIIDIAIPSDFLAESPDNREAIRKIGYIGRGAAIFQVSRIIVYRHRLAGDR--------- 64
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR--KHEW 178
A F+V+ L+YL TP YLRK LF + L++VG+LPPL P+H K E
Sbjct: 65 --------DKAEFIVKNLRYLSTPPYLRKDLFKLDRDLKYVGLLPPLKTPNHAPEGKPER 116
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
+REG+ +K + S+ +G + V + + G RV V + D VV
Sbjct: 117 GEYREGIVIKWDGYYSIAK---IGDGIYAKVPRPMPLGTRVVVQIDAPTTRDDTYRAHVV 173
Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR- 297
P + +YWG++ ++S +F SY ++ T + G + ++ + F
Sbjct: 174 PRDR---LSIYWGFESEVM-DVSKLFDEYSY------VVLTGKEGISIKNAIEQINDFLS 223
Query: 298 --HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L+ FG + I +G++ + + + N P QG T+RTEEA+ L
Sbjct: 224 GDRFLVVFGSPYHGVDEILRAEGMENTLKK---YPFINFIPGQGVETVRTEEAVIAVL 278
>gi|375083295|ref|ZP_09730321.1| hypothetical protein OCC_12206 [Thermococcus litoralis DSM 5473]
gi|374742026|gb|EHR78438.1| hypothetical protein OCC_12206 [Thermococcus litoralis DSM 5473]
Length = 258
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 136/301 (45%), Gaps = 62/301 (20%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
+ +P S+++ T ++ T GQI RA IF ++ V ++ A + +
Sbjct: 5 IFIPDSLLEETSDPKIRTYKVGQIGRAAAIFGVEHVWIY------------KAGGKDGK- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWAPFR 182
F+ +L+Y+ETPQYLRK+L + LR+VG+LPPL PHH K + R
Sbjct: 52 --------FIKLILEYMETPQYLRKSLIPLTKELRYVGVLPPLRTPHHKLKGRPKLGEIR 103
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNR---NLDADSPRQVVP 239
EG+ +++ D+GL++ +V+ G R + + P +VV
Sbjct: 104 EGIVIRKGK----RLYADIGLDELALVE-----------GSGEGRMTFKIVSLKPLKVV- 147
Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
PSKP+E YWGY+V S+ K K + I TS G+ V +L P +
Sbjct: 148 PSKPEE---YWGYRVHLTRK--SLAKTLK-KAKLNLAIATSRKGEDVRKVNLP-PLEGEV 200
Query: 300 LIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
FG G+ + ED F NT P+Q ++T+RTEEA+ +L
Sbjct: 201 GFVFGSPRKGIMEILRDFNED---------YPFDLILNTIPNQKTKTVRTEEAVLATLAI 251
Query: 356 F 356
F
Sbjct: 252 F 252
>gi|242398278|ref|YP_002993702.1| hypothetical protein TSIB_0287 [Thermococcus sibiricus MM 739]
gi|242264671|gb|ACS89353.1| hypothetical protein TSIB_0287 [Thermococcus sibiricus MM 739]
Length = 261
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/299 (28%), Positives = 133/299 (44%), Gaps = 58/299 (19%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
+ +P S+++ T ++ T GQI RA IF ++ + ++ A R +
Sbjct: 8 IFIPDSLLEETSDPKIKTYKVGQIGRAAAIFGVEHIWIY------------KAGGREGK- 54
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPFR 182
F+ +L+Y+ETPQYLRK L + L++ G+LPPL PHH K E R
Sbjct: 55 --------FIKLVLEYMETPQYLRKTLIPLTKELKYAGILPPLRTPHHKLKREPKVGEIR 106
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP--- 239
EGV +K+ D+GL++ +V+ G+ V+P
Sbjct: 107 EGVIIKKGK----RLYADIGLDELALVE-------------GSGEGRMTFEIISVIPLKV 149
Query: 240 -PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
P+KP+E YWGY+V S+ K K + I TS G+ V +L P
Sbjct: 150 APTKPRE---YWGYRVHLTR--MSLAKTLK-KAKLNLAIATSRMGEDVRKVNLP-PLEGE 202
Query: 299 LLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
+ FG G+ E L+ N F NT P+Q ++T+RTEEA+ ++L F
Sbjct: 203 VGFVFGSPRKGIMEI------LRDFNEDYPFDLILNTIPNQKTKTVRTEEAVLVTLAIF 255
>gi|341582064|ref|YP_004762556.1| hypothetical protein GQS_04890 [Thermococcus sp. 4557]
gi|340809722|gb|AEK72879.1| hypothetical protein GQS_04890 [Thermococcus sp. 4557]
Length = 256
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/295 (31%), Positives = 132/295 (44%), Gaps = 52/295 (17%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
+ +P S+++ T ++ T GQIARA IF ++ V V+ A R R
Sbjct: 5 IFIPDSLLEETDDPKIRTYKVGQIARAAAIFGVEHVWVY------------RAGGRDGR- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPFR 182
F+ +L+Y ETPQYLRK LF + LR+VG++PPL PHH K + R
Sbjct: 52 --------FIKTILEYAETPQYLRKRLFPLMPELRYVGVIPPLRTPHHKLKGKPRVGEIR 103
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
EG ++ T D+GL+ V+ D R T + + R L V P+K
Sbjct: 104 EGFAFRKGR----RTYADIGLDDLAAVEG--DVEGRATFRIVSVRPLR-------VIPAK 150
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P+E YWGY+V S+ K K D I TS G + L P +
Sbjct: 151 PEE---YWGYRVHLTRK--SLAKTLK-KARLDLAIATSRKGRDIREVKLP-PLEGEVGFV 203
Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
FG G+ E + E++ F NT P+Q + T+RTEEA+ +L F
Sbjct: 204 FGSPRKGVMELLGEEE--------YDFDLILNTIPNQRTATVRTEEAVLATLAVF 250
>gi|15678032|ref|NP_275146.1| hypothetical protein MTH1 [Methanothermobacter thermautotrophicus
str. Delta H]
gi|2621037|gb|AAB84510.1| conserved protein [Methanothermobacter thermautotrophicus str.
Delta H]
Length = 268
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 85/301 (28%), Positives = 139/301 (46%), Gaps = 56/301 (18%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P S+ T L++ T IARA +IF + +V++
Sbjct: 6 LSIFIPDSLTAETGDLKIKTYKVVLIARAASIFGVKRIVIY------------------- 46
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
D+ + A F+ +L Y++TPQYLR+ +F + L+ VG+LPPL PHH K +
Sbjct: 47 -HDDADGEARFIRDILTYMDTPQYLRRKVFPIMRELKHVGILPPLRTPHHPTGKPVTGEY 105
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
R+G+T+K GTLVD+G +K + + ++TV + + ++ P
Sbjct: 106 RQGLTVKRVKK---GTLVDIGADKLALCRE------KLTVNRIMSFRVVRLGKEILIEPD 156
Query: 242 KPKESGMYWGYKVR-YAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
+P++ YWGY+V N++ K G D ++ TS + I++ +
Sbjct: 157 EPEDR--YWGYEVLDTRRNLAESLKTV----GADVVVATSRNASPITSILDEVKTRMRGA 210
Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
R I FGG GL E I+ D + NT P Q + T+RTEEA+ +L
Sbjct: 211 REAAILFGGPYKGLPE-IDAD-------------IWVNTLPGQCTETVRTEEAVLATLSV 256
Query: 356 F 356
F
Sbjct: 257 F 257
>gi|288931509|ref|YP_003435569.1| hypothetical protein Ferp_1135 [Ferroglobus placidus DSM 10642]
gi|288893757|gb|ADC65294.1| Protein of unknown function DUF171 [Ferroglobus placidus DSM 10642]
Length = 251
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 132/297 (44%), Gaps = 62/297 (20%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+ +A+P SI++N ++ T G I RA IFR++++ ++ + S
Sbjct: 1 MEIAIPSSILENESDDKIKTFKVGIIGRAAAIFRVEKIYIYKDPS--------------- 45
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH-EWAPF 181
R D N F+ +L+Y+ETPQYLRK LF LR+ G+L PL P H KH +
Sbjct: 46 RDDSN-----FISEILRYMETPQYLRKYLFPRSEKLRYAGVLQPLQIPSHKPKHLKVGEI 100
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGV +K + D+G+ + G RVTV + + L VV +
Sbjct: 101 REGVIVKVADGTA---WADIGMKALALFRGKARKGARVTVRVCSTNPL-------VVEEA 150
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL-PTFRHL- 299
KP+E YWG+KV + CS K + D V +S L P+ +
Sbjct: 151 KPEE---YWGFKV----------EKCSLK-------KLLKREDAVVTSKLGYHPSVDEIK 190
Query: 300 ---LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L+ FG + + I + GL E +N P QG R++R EEA+ L
Sbjct: 191 KVNLLIFGSPSKEVKEIADILGL------EFEAKMWNLIPKQGVRSVRVEEAVISCL 241
>gi|21465769|pdb|1K3R|A Chain A, Crystal Structure Of The Methyltransferase With A Knot
From Methanobacterium Thermoautotrophicum
gi|21465770|pdb|1K3R|B Chain B, Crystal Structure Of The Methyltransferase With A Knot
From Methanobacterium Thermoautotrophicum
Length = 268
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/301 (28%), Positives = 139/301 (46%), Gaps = 56/301 (18%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P S+ T L++ T IARA +IF + +V++
Sbjct: 6 LSLFIPDSLTAETGDLKIKTYKVVLIARAASIFGVKRIVIY------------------- 46
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
D+ + A F+ +L Y++TPQYLR+ +F + L+ VG+LPPL PHH K +
Sbjct: 47 -HDDADGEARFIRDILTYMDTPQYLRRKVFPIMRELKHVGILPPLRTPHHPTGKPVTGEY 105
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
R+G+T+K GTLVD+G +K + + ++TV + + ++ P
Sbjct: 106 RQGLTVKRVKK---GTLVDIGADKLALCRE------KLTVNRIMSFRVVRLGKEILIEPD 156
Query: 242 KPKESGMYWGYKVR-YAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
+P++ YWGY+V N++ K G D ++ TS + I++ +
Sbjct: 157 EPEDR--YWGYEVLDTRRNLAESLKTV----GADVVVATSRNASPITSILDEVKTRMRGA 210
Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
R I FGG GL E I+ D + NT P Q + T+RTEEA+ +L
Sbjct: 211 REAAILFGGPYKGLPE-IDAD-------------IWVNTLPGQCTETVRTEEAVLATLSV 256
Query: 356 F 356
F
Sbjct: 257 F 257
>gi|390960795|ref|YP_006424629.1| hypothetical protein CL1_0629 [Thermococcus sp. CL1]
gi|390519103|gb|AFL94835.1| hypothetical protein CL1_0629 [Thermococcus sp. CL1]
Length = 256
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 136/295 (46%), Gaps = 52/295 (17%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ T ++ T GQIAR+ IF ++ V ++ A R R
Sbjct: 5 VFIPDSLLEETDDPKIRTYKVGQIARSCAIFGVEHVWIY------------RAGGRDGR- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPFR 182
F+ +L+Y ETPQYLRK LF + LR+VG++PPL PHH K + R
Sbjct: 52 --------FIKTILEYAETPQYLRKRLFPLMPELRYVGVIPPLRTPHHKLKGKPRVGEIR 103
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
EG ++ D+GL++ +V+ D R T + + R P +VV P+K
Sbjct: 104 EGFAFRKGR----RVYADIGLDELALVEG--DVEGRTTFRIVSVR------PLRVV-PAK 150
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P++ YWGY+V S+ K K D I TS G V L P +
Sbjct: 151 PEK---YWGYRVHLTRK--SLAKTLK-KARLDLTIATSRRGRDVREVKLP-PLEGEVGFV 203
Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
FG G+ E + E++ F NT P+Q ++T+RTEEA+ +L F
Sbjct: 204 FGSPRKGVMELLGEEE--------YDFDLILNTVPNQRTKTVRTEEALLATLAVF 250
>gi|294659916|ref|XP_462356.2| DEHA2G18788p [Debaryomyces hansenii CBS767]
gi|199434333|emb|CAG90863.2| DEHA2G18788p [Debaryomyces hansenii CBS767]
Length = 382
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 159/386 (41%), Gaps = 90/386 (23%)
Query: 47 RKRDQLNDDAPIEVPT-------VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRI 97
+++ + ++ P++ P S+ VP +II N ++LE T +A QIA+A TI+ +
Sbjct: 3 KRKTEAKEEKPVKKPVKAIPETQYSICVPSTIISAANAKNLEQITSIAYQIAKAATIYNV 62
Query: 98 DEVVVFDNKSSSD--------------------------NYS-----RSSAANRSNR--- 123
EVVV D +SS N+S + +A ++
Sbjct: 63 AEVVVLDVPTSSKKQDLAEKEAAKVVELGSNKGGKKIKFNFSDEDILKPTAVEKTKEPET 122
Query: 124 -----SDENESGAAFLVRLLQYLETPQYLRKALFS---MHSSLRFVGMLPPLDA-PHHLR 174
SD NE+ LLQY TP YL K++F+ H+ ++ LP + P
Sbjct: 123 PVDLSSDLNENNTLLFATLLQYFMTPPYLVKSVFANNQFHTKFKYAQKLPKISTLPFMSN 182
Query: 175 KHEWAPFREGVTLKENAPN--------------SVGTLVDVGLNKHVVVDQVLDPGVRVT 220
H F+EG+T+ + P SV V++G + V+D + VRVT
Sbjct: 183 NHVSKDFKEGLTIPKATPKVTKKNKKVSALKKLSVTKYVNIGDAEPFVLDHEVPVNVRVT 242
Query: 221 VAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI--- 277
V + +N SP Q K S +GY VRY N SS+F CS+ GY I
Sbjct: 243 VDL---KNKKIVSPLQAYGLIGSKSS---FGYHVRYCANFSSIFTQCSFPEGYTSTIYVN 296
Query: 278 --------GTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDG--LKRKNAREV 327
T E + +N + P +L+ FG ++ S + D +NA ++
Sbjct: 297 CDDYFNSNDTIEARNSINKNPDLSPNDGKILLLFGNYNDIDYSFQNDKANLPGVENASQM 356
Query: 328 FHSYFNTCPHQGSRTIRTEEAIFISL 353
F + ++ E+A ISL
Sbjct: 357 FDGELPI-----PKGVKIEDATLISL 377
>gi|307595040|ref|YP_003901357.1| hypothetical protein Vdis_0916 [Vulcanisaeta distributa DSM 14429]
gi|307550241|gb|ADN50306.1| Protein of unknown function DUF171 [Vulcanisaeta distributa DSM
14429]
Length = 286
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 139/303 (45%), Gaps = 48/303 (15%)
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
PT+ +A+P + + A R G + R IF++ +++V+ +K + + +R
Sbjct: 8 PTIDIAIPSDFLAESPDDREAIRKIGYLGRGAAIFQVSKIIVYRHKLAGER-------DR 60
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
+N F+++ LQYL TP YLRK LF + L++ G+LPPL P+H E P
Sbjct: 61 TN----------FIIKNLQYLVTPPYLRKDLFKLDRDLKYAGLLPPLKTPNH--APEGMP 108
Query: 181 ----FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
+REG+ +K + S+ + +G + V + + G RV V +
Sbjct: 109 QRGEYREGIVVKWDGYFSI---IKIGEGVYAKVPKPMPLGTRVIVQIDAQTTRGDTYRAH 165
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
VVP + +YWG++ +S +F + YD++I T + G +N D +
Sbjct: 166 VVPRDR---LSVYWGFEAEVV-ELSRLFSD------YDYVILTGKEG--MNIKD-AMEQL 212
Query: 297 R------HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
R +L+ FG + I +G++ + + + N P QG T+RTEEA+
Sbjct: 213 RSALSRDRVLVVFGSPYHGVDEILRAEGMEEALRK---YPFINFIPGQGVETVRTEEAVI 269
Query: 351 ISL 353
L
Sbjct: 270 AVL 272
>gi|340344792|ref|ZP_08667924.1| hypothetical protein COG2106 [Candidatus Nitrosoarchaeum koreensis
MY1]
gi|339519933|gb|EGP93656.1| hypothetical protein COG2106 [Candidatus Nitrosoarchaeum koreensis
MY1]
Length = 270
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 139/299 (46%), Gaps = 48/299 (16%)
Query: 63 VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+SVA+P S + + +SL+L TR +ARA IF+ID + V+ NY
Sbjct: 3 ISVAIPESALSD-ESLKLDKTRKISVLARACAIFKIDTIYVYQ----EGNY--------- 48
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-----LRKH 176
R D N LV +L+YLETPQ+LR+ LF + L+F G+L PL P H +K
Sbjct: 49 -REDGN-----LLVTILRYLETPQFLRRRLFPKMNELKFAGVLYPLKIPSHSTPANSKKI 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REGV + +DVG+N+ + + G R TV T P
Sbjct: 103 NVGDVREGVIVSLKGKK----FIDVGINELIPFFGKENVGKRATVQFKTG------YPDF 152
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LP 294
V + E+ +YWGY V+ ++ S+ ++G ++I TS G LT L
Sbjct: 153 SVKEIQRNETSLYWGYAVKERASLFSLL--TEWQG---NIIITSRKGRTATKEQLTKYLS 207
Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ +L+ FG E+ + E G K KN + N P+Q + T+R +EA+ +L
Sbjct: 208 SDLPILVVFG---SPEKGVHEILGGKMKNVQNA--KTLNFFPNQATETVRLDEALLGTL 261
>gi|119608244|gb|EAW87838.1| chromosome 9 open reading frame 114, isoform CRA_a [Homo sapiens]
Length = 236
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/105 (48%), Positives = 60/105 (57%), Gaps = 4/105 (3%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SVA+PGSI+DN QS EL T LAGQIARA IF +DE+VVFD +
Sbjct: 135 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 194
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP 166
+ L R+LQYLE PQYLRKA F H L+F L P
Sbjct: 195 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAACLAP 235
>gi|18978198|ref|NP_579555.1| hypothetical protein PF1826 [Pyrococcus furiosus DSM 3638]
gi|397652526|ref|YP_006493107.1| hypothetical protein PFC_09460 [Pyrococcus furiosus COM1]
gi|18894007|gb|AAL81950.1| hypothetical protein PF1826 [Pyrococcus furiosus DSM 3638]
gi|393190117|gb|AFN04815.1| hypothetical protein PFC_09460 [Pyrococcus furiosus COM1]
Length = 260
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 132/295 (44%), Gaps = 50/295 (16%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ ++ GQ+ARA IF ++ + ++ A + +
Sbjct: 5 VFIPDSLLEEANDPKIKAYKVGQVARACAIFGVEHIWIY------------KAGGKDGK- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREG 184
F+ LL+Y ETPQYLRK +F + L++ G++PPL P H K AP R G
Sbjct: 52 --------FIKLLLEYAETPQYLRKRIFPITPELKYAGVIPPLQIPSHKPK---APPRIG 100
Query: 185 VTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
++E G D+GL++ ++ V G GT + + + R + P++
Sbjct: 101 -EIREGYAYRKGKKLFADIGLDRPALIKGVAREG------RGTFKIISVNPLRVI--PAE 151
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P E YWGYKV + K D +I TS G V S++ +P + I
Sbjct: 152 PSE---YWGYKVHLS---RKTLAKTLKKANLDVVIATSRKGVDVRSAE--VPLEGEVGIV 203
Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
FG G+ E LK N F NT P Q ++T+RTEEA+ +L F
Sbjct: 204 FGSPKKGVIEI------LKEYNEDYEFDLILNTIPEQKTKTVRTEEALLATLAIF 252
>gi|407464764|ref|YP_006775646.1| hypothetical protein NSED_04490 [Candidatus Nitrosopumilus sp. AR2]
gi|407047952|gb|AFS82704.1| hypothetical protein NSED_04490 [Candidatus Nitrosopumilus sp. AR2]
Length = 268
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 139/306 (45%), Gaps = 62/306 (20%)
Query: 63 VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+SVA+P S + + +SL++ TR +ARA IF++D + ++ S
Sbjct: 3 LSVAIPQSALSD-ESLKIDKTRKISVLARACAIFKVDTIYIYQEGS-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
N+S + +L+YLETPQ+LR+ LF + L+F G+L PL P H+ +K
Sbjct: 48 -----NKSDGGLMAMILKYLETPQFLRRRLFPKMNDLKFAGVLHPLKIPSHITPVNSKKI 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
+ REG+ + VDVG+N+ V G RVTV P+
Sbjct: 103 KTGDIREGIVVSIKGKK----FVDVGINQLVQYYGSTSTGKRVTV------RFKEGYPKL 152
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL--- 293
+ E YWGY V+ N+ SV +KG ++I TS G +++ L
Sbjct: 153 SIKDIDKSEVPTYWGYSVKERANLFSVL--SEWKG---NIILTSRKGKPISTEQLIKYAK 207
Query: 294 ---PTFRHLLIAFGGLAGLEESIEEDDGL---KRKNAREVFHSYFNTCPHQGSRTIRTEE 347
PT L+ FG E+ I E G K +NA+ + N P+Q + T+R EE
Sbjct: 208 SDEPT----LVVFG---SPEKGIHEIIGGRMNKVQNAKSL-----NFFPNQATETVRLEE 255
Query: 348 AIFISL 353
A+ +L
Sbjct: 256 ALLGTL 261
>gi|167043663|gb|ABZ08356.1| putative uncharacterized ACR, COG2106 [uncultured marine
crenarchaeote HF4000_APKG2O16]
Length = 248
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 122/267 (45%), Gaps = 39/267 (14%)
Query: 88 IARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYL 147
IARA IF+I+++ ++ + +N++ A L L+YLETPQY
Sbjct: 2 IARACAIFKINQIFIYQD------------------GKQNKNDLALLSTSLKYLETPQYF 43
Query: 148 RKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG-TLVDVGLNKH 206
RK +F L++ G L PL+ HL + + G T N G +D+G+NK
Sbjct: 44 RKDIFPKTQLLKYAGALQPLNISSHLTTSDQKMIKIGDTRDALIINYKGKKFLDIGINKL 103
Query: 207 VVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKN 266
+ + G R+ + + T + P V K+ YWGY V+ N+ S+
Sbjct: 104 IQYFGKMKSGTRIAIQIKTTQ------PELTVKEISRKDIKDYWGYSVKERANLLSIL-- 155
Query: 267 CSYKGGYDHLIGTSEHGD---IVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN 323
++KG +I TS+ G I+++ +T P +LI FG I D K +N
Sbjct: 156 STWKG---KIILTSKKGKNFTILDAKKMTEPD-EPILIVFGTTNKGIYDILGTDIKKIQN 211
Query: 324 AREVFHSYFNTCPHQGSRTIRTEEAIF 350
A+ FN P+Q ++T+R EEAI
Sbjct: 212 AK-----IFNFFPNQATQTVRLEEAIL 233
>gi|312079981|ref|XP_003142406.1| hypothetical protein LOAG_06822 [Loa loa]
Length = 117
Score = 97.4 bits (241), Expect = 1e-17, Method: Composition-based stats.
Identities = 51/110 (46%), Positives = 65/110 (59%), Gaps = 4/110 (3%)
Query: 244 KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAF 303
+E+G+YWGYKVR A ++ G YD +IGTSE G V+ ++ L +LI F
Sbjct: 1 RETGLYWGYKVRIALSLHDALN----AGEYDVIIGTSERGKPVSKFEMPLCERNRILIVF 56
Query: 304 GGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
GGL GLE +IE D + E+F Y N P QGSR IRTEEAI I+L
Sbjct: 57 GGLEGLEAAIEADKNISCSTPEELFEHYLNVVPGQGSRIIRTEEAIPITL 106
>gi|327311990|ref|YP_004338887.1| hypothetical protein TUZN_2118 [Thermoproteus uzoniensis 768-20]
gi|326948469|gb|AEA13575.1| hypothetical protein TUZN_2118 [Thermoproteus uzoniensis 768-20]
Length = 272
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 127/293 (43%), Gaps = 42/293 (14%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+ +AVP ++ E R G IAR +FR + ++++ N
Sbjct: 1 MDIAVPHDVLSEAPDEESKVRKLGYIARGAAVFRAENLIIY---------------TYGN 45
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAP 180
D E L LL+Y TP +LRK LF + LR G+LPPL P H E
Sbjct: 46 DVDWEEVERMRL--LLEYASTPPHLRKKLFKLDRRLRLAGLLPPLKIPSHTPPKEPSVGD 103
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
EGV + + S LV +G ++ + + G R+ V + + D R VV
Sbjct: 104 VIEGVVERWDGYYS---LVYIGGRRYAKIPKPYPIGSRLLVKIEAETD-RPDIYRAVVVK 159
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
P YWGYKV P FK + GYD +I T G V L P R L+
Sbjct: 160 KPPD----YWGYKVEVKP-----FKQLA--DGYDTVIYTGREGRSV-CEGLPRPVGRTLV 207
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ G AG++E I + +G++ K + N P QG+ T+RTEEAIF L
Sbjct: 208 VFGGPRAGVDE-IAKIEGVELKGL------FLNFIPRQGTETVRTEEAIFAVL 253
>gi|284162461|ref|YP_003401084.1| hypothetical protein Arcpr_1362 [Archaeoglobus profundus DSM 5631]
gi|284012458|gb|ADB58411.1| Protein of unknown function DUF171 [Archaeoglobus profundus DSM
5631]
Length = 254
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 133/292 (45%), Gaps = 50/292 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SVA+P S + N ++ T G IARA IFR+ E++++ R N S
Sbjct: 2 ISVAIPSSALINENDPKIKTFKVGLIARACAIFRVGEIIIY----------RDPKLNESQ 51
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
F+ +L+Y ETPQYLRK + + +L++ G+LPPL P H K
Sbjct: 52 ----------FIKDVLEYAETPQYLRKYI-PIKETLKYAGVLPPLKIPSHKPKRLKVGEV 100
Query: 183 EGVTLKENAPNSVGT-LVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
+ P+ GT VD+G+ + G RVTV + + L VV +
Sbjct: 101 REGVVVRVGPD--GTRWVDIGVKALAPLKSKAKRGARVTVRVCSTNPL-------VVEEA 151
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
KP+E YWGYKVR A + V ++++ TS + ++ + + + +
Sbjct: 152 KPEE---YWGYKVRIA-ELKDVLSK-------ENVVLTSRKCKVPKIEEIK--SLKDVTL 198
Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
FG EE + E + R+ + +NT P QG T+R EEAI+ +L
Sbjct: 199 VFG---SPEEGVFE---IMRRLGIDTDARCWNTIPMQGVETVRLEEAIYATL 244
>gi|407462404|ref|YP_006773721.1| hypothetical protein NKOR_04455 [Candidatus Nitrosopumilus
koreensis AR1]
gi|407046026|gb|AFS80779.1| hypothetical protein NKOR_04455 [Candidatus Nitrosopumilus
koreensis AR1]
Length = 270
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 86/299 (28%), Positives = 137/299 (45%), Gaps = 48/299 (16%)
Query: 63 VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+SVA+P S + + +SL++ TR +ARA IF+I+ + V+ +
Sbjct: 3 LSVAIPESSLSD-ESLKIDKTRKISVLARACAIFKIETIYVYQEGN-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
N+ D N +V +L+YLETPQ+LR+ LF + L+F G+L PL P H+ +K
Sbjct: 48 NKQDGN-----LMVMILKYLETPQFLRRRLFPKMNDLKFAGVLQPLRIPSHVTPANPKKI 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + VDVG+N+ + G RVT+ P+
Sbjct: 103 TRGDVREGIVVSVKGKR----FVDVGINQLIPFFGKTPIGKRVTI------QFKEGYPKF 152
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
+ E+ YWGY V+ N+ S+ +KG ++I TS G + T
Sbjct: 153 SIKEITRNEAPEYWGYAVKERANLFSLL--SEWKG---NIIITSRKGKTATKEQIAKYTK 207
Query: 297 --RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ L+ FG E+ + E G K KN + N P+Q ++T+R EEA+ +L
Sbjct: 208 SDQPTLVVFG---SPEKGVHEILGGKMKNVQNA--KSLNFFPNQATQTVRLEEALLGTL 261
>gi|409096185|ref|ZP_11216209.1| hypothetical protein TzilA_05992 [Thermococcus zilligii AN1]
Length = 256
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 132/301 (43%), Gaps = 64/301 (21%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S+++ ++ T GQIARA IF ++ + ++ A + +
Sbjct: 5 VFIPDSLLEERDDPKIRTYKVGQIARACAIFGVEHIWIY------------GAGGKDGK- 51
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR--KHEWAPFR 182
F+ +L+Y ETPQYLRK LF + L++ G++PPL PHH K E R
Sbjct: 52 --------FIKTILEYAETPQYLRKKLFPLMPELKYAGVIPPLRTPHHKLEGKPEVGEIR 103
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
EG ++ D+GL++ +V+ D R T + + R P +VV P+K
Sbjct: 104 EGFAFRKGR----RVYADIGLDELAMVEG--DIEGRATFRIVSLR------PLRVV-PAK 150
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
P E YWGY+V S+ K K D +I TS G + ++ LP
Sbjct: 151 PAE---YWGYRVHLTEE--SLAKTLK-KARLDLVIATSRRGR--DIREVKLPP------- 195
Query: 303 FGGLAGLEESIEEDDGLKRKNAREV-------FHSYFNTCPHQGSRTIRTEEAIFISLQY 355
LE + G RK E+ F NT P Q + T+RTEEA+ +L
Sbjct: 196 ------LEGEVGFVFGSPRKGVMELLGGEEFEFDLILNTIPDQRTETVRTEEALLATLAI 249
Query: 356 F 356
F
Sbjct: 250 F 250
>gi|161528318|ref|YP_001582144.1| hypothetical protein Nmar_0810 [Nitrosopumilus maritimus SCM1]
gi|160339619|gb|ABX12706.1| Protein of unknown function DUF171 [Nitrosopumilus maritimus SCM1]
Length = 270
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 136/299 (45%), Gaps = 48/299 (16%)
Query: 63 VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+SVA+P S + + +SL++ TR +ARA IF+I+ + V+ +
Sbjct: 3 LSVAIPESSLSD-ESLKIDKTRKISVLARACAIFKIETIYVYQEGN-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
N+ D N +V +L+YLETPQ+LR+ LFS + L+F G+L PL P H +K
Sbjct: 48 NKQDGN-----LMVMILKYLETPQFLRRRLFSKVNDLKFAGVLQPLRIPSHATPANPKKI 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + VDVG+N+ + G RVTV P
Sbjct: 103 NKGDVREGIVVSVKGKR----FVDVGINQLIPFFGKTLMGKRVTV------QFKEGHPNF 152
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
+ E+ YWGY V+ N+ S+ +KG ++I TS G + T
Sbjct: 153 SIKEINRNEAPDYWGYTVKERSNLFSLL--SEWKG---NIILTSRKGKTATKEQIAKYTK 207
Query: 297 --RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ L+ FG E+ I E G K KN + N P+Q ++T+R EEA+ +L
Sbjct: 208 SDQPTLVVFG---SPEKGIHEILGGKMKNVQNA--KSLNFFPNQATQTVRLEEALLGTL 261
>gi|347830143|emb|CCD45840.1| hypothetical protein [Botryotinia fuckeliana]
Length = 193
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 69/114 (60%), Gaps = 10/114 (8%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSVA+PGSII N QS +L T +AGQIARA+ +F +DEV+VFD+ ++ +S
Sbjct: 61 TVSVALPGSIIANAQSHDLKTSMAGQIARALAVFCVDEVIVFDDGNARPAKKPKHPPTQS 120
Query: 122 NRSD-----ENESGAA-----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLP 165
+ S E+E A FL LL +LETP LRK LF +H +LR G LP
Sbjct: 121 HNSQQKPPLEDEYTAKSDPDHFLTHLLSFLETPPNLRKHLFPIHENLRTAGALP 174
>gi|116755017|ref|YP_844135.1| hypothetical protein Mthe_1729 [Methanosaeta thermophila PT]
gi|116666468|gb|ABK15495.1| Protein of unknown function DUF171 [Methanosaeta thermophila PT]
Length = 277
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/304 (28%), Positives = 131/304 (43%), Gaps = 52/304 (17%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P S T L L T G IARA +FRID +VV+ R
Sbjct: 8 SILIPSSYTMETADLRLRTAKVGLIARAAAVFRIDRIVVY-------------------R 48
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFRE 183
DE + + F+ +L+Y ETPQYLRK LF LR G+LPPL HH + +
Sbjct: 49 DDEFDD-SRFISTVLRYAETPQYLRKLLFPRMRELRHAGVLPPLRTAHHPVGSRSSTLKV 107
Query: 184 GVT----LKENAPNSVGTLVDVGLNKHVVV--DQVLDPGVRVTVAMGTNRNLDADSPRQV 237
G + E+ + G V+VGL++ V + + G R+ V + + L A+
Sbjct: 108 GEIRVGMVVESVGSDGGAWVEVGLDRPVPLKSKRRFQKGQRLNVRIFSLSPLAAE----- 162
Query: 238 VPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG-----DIVNSSDLT 292
P E YWGY+ + ++ + ++ TS G D++N + +
Sbjct: 163 --PVDRSEIPGYWGYETIVVDSAEEYLRSRD-----EFVVATSRKGTPVSCDLLNHIERS 215
Query: 293 LPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
R L + FG A ++ + L+R NT PHQG+ T+R EEA+ +
Sbjct: 216 --GSRGLAVVFGSPARGVDAFLSREMLER-------CCVINTIPHQGTETVRVEEAVIST 266
Query: 353 LQYF 356
L
Sbjct: 267 LALL 270
>gi|345005656|ref|YP_004808509.1| hypothetical protein [halophilic archaeon DL31]
gi|344321282|gb|AEN06136.1| protein of unknown function DUF171 [halophilic archaeon DL31]
Length = 277
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 132/309 (42%), Gaps = 62/309 (20%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
++ VP S++ + ATR G +ARA I+R++ +VVF +R
Sbjct: 6 TLVVPSSLVREAEDKREATRKIGYVARAAVIYRVERIVVF-----------------PDR 48
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLD-APHHLRKHEWAPFR 182
E G F+ +L Y TP YLRK ++ LR+ G+LPPL +P E R
Sbjct: 49 EGERRWGGEFVRTVLAYAATPPYLRKEVWDQRDELRYAGVLPPLRVSPRTGSASERPESR 108
Query: 183 EGVTLKENAPNSVGTLVDVGLNK----HVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
EG+ + + V V+ GL HV + G RVTV + + + A R V
Sbjct: 109 EGIVTEVGPDDRV--RVNCGLQHPISLHVPSSMTVVEGERVTVRVSSREPVRA---RLVD 163
Query: 239 PPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
P+ GY V A +I +N + IGTS HGD DLT+
Sbjct: 164 EPTA--------GYVVEAA-DIQETLARQNAGVR------IGTSRHGD-----DLTVSRL 203
Query: 297 RHL---------LIAFGGLA-GLEE--SIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIR 344
L I FG G+ E + DD L + F + NT P+QGS +R
Sbjct: 204 AELGSQCRDDGATIVFGSPGRGIPEILGVSPDD-LPVEPDSPGFDLWLNTVPNQGSEVVR 262
Query: 345 TEEAIFISL 353
TEEA+F +L
Sbjct: 263 TEEAVFATL 271
>gi|444318731|ref|XP_004180023.1| hypothetical protein TBLA_0C07130 [Tetrapisispora blattae CBS 6284]
gi|387513064|emb|CCH60504.1| hypothetical protein TBLA_0C07130 [Tetrapisispora blattae CBS 6284]
Length = 384
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 133/302 (44%), Gaps = 35/302 (11%)
Query: 1 MGNKKKRGGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEV 60
M +K R G +P ++ TG+ +Q + ++ + K+++K + +
Sbjct: 1 MASKNTRKGEKPS--DSKTGKTETTQKKKQAKT--NAKINKKQREKPISKKKTLKVKSKT 56
Query: 61 PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
S+ +P +IID +LE T + QIA++ T+F + E+V+ + S +DN N+
Sbjct: 57 LNYSLCIPNNIIDQCTNLEQITHIMYQIAKSATMFNVGEIVILETSSKTDN-------NK 109
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFV--GMLPPLDAPHHLR--KH 176
S + D S + + LLQY TP YL K++F F LP L +R K
Sbjct: 110 SKKKDGKLSNSMLMASLLQYFVTPPYLLKSVFKKEYWKYFTVASKLPRLSVLPFMRYLKE 169
Query: 177 EWAPFREGVTLKENAPNSVGT-------LVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
+ +REG+ ++ PN+ +++G + + + L P VRVTV +
Sbjct: 170 DEGRYREGLAIRMEKPNAKSNKEFKQTKYINIGKSDALELKSQLVPINVRVTVDTVERKV 229
Query: 229 LDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS 288
+ D +K +GY VR A + VF C++ GGY + VNS
Sbjct: 230 VSPDEAYGDFVGAKAS-----YGYHVRVAKQFADVFTECAFPGGYSQAVW-------VNS 277
Query: 289 SD 290
D
Sbjct: 278 GD 279
>gi|145592077|ref|YP_001154079.1| hypothetical protein Pars_1876 [Pyrobaculum arsenaticum DSM 13514]
gi|145283845|gb|ABP51427.1| Protein of unknown function DUF171 [Pyrobaculum arsenaticum DSM
13514]
Length = 265
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 129/292 (44%), Gaps = 42/292 (14%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+A+P + T R G +ARA ++F++ V+++ + R
Sbjct: 3 SIAIPHDFLSETPDEPSKIRKLGYLARAASVFKVSTVIIYYYGAPL-------------R 49
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
D + + +L+YL TP YLRK ++ + LR G+LPPL P H+ E
Sbjct: 50 EDIDLAKT-----VLEYLVTPPYLRKRVYKIDKRLRLAGLLPPLKIPSHVVPKEPRIGEV 104
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGV + + S+ V +G K+ + + G R+ V + + D+ R +
Sbjct: 105 REGVVERWDGYYSI---VYIGGGKYAKIPKPYPVGARLLVRIEAPTS-RPDTYRAAIYKG 160
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
P YWGYKV P I S+ G+D +I T G + L ++ L+
Sbjct: 161 PPPA---YWGYKVEVRP-IQSL------TDGFDAVILTGREGKPICEVSPKL--GKNTLV 208
Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
FGG + I ++ GL E+ N P QG+ TIRTEEA+FI L
Sbjct: 209 VFGGPRRGVDEIFKESGL------ELPGDLINFAPGQGTETIRTEEAVFIVL 254
>gi|379003216|ref|YP_005258888.1| hypothetical protein Pogu_0254 [Pyrobaculum oguniense TE7]
gi|375158669|gb|AFA38281.1| hypothetical protein Pogu_0254 [Pyrobaculum oguniense TE7]
Length = 265
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 129/292 (44%), Gaps = 42/292 (14%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+A+P + T R G +ARA ++F++ V+++ + R
Sbjct: 3 SIAIPHDFLSETPDEPSKIRKLGYLARAASVFKVSTVIIYYYGAPL-------------R 49
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
D + + +L+YL TP YLRK ++ + LR G+LPPL P H+ E
Sbjct: 50 EDIDLAKT-----VLEYLVTPPYLRKKVYKIDKRLRLAGLLPPLKIPSHVVPKEPRIGEV 104
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
REGV + + S+ V +G K+ + + G R+ V + + D+ R +
Sbjct: 105 REGVVERWDGYYSI---VYIGGGKYAKIPKPYPVGARLLVRIEAPTS-RPDTYRAAIYKG 160
Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
P YWGYKV P I S+ G+D +I T + G + L ++ L+
Sbjct: 161 PPPA---YWGYKVEVRP-IQSL------TDGFDAVILTGKEGKPICEVSPKL--GKNTLV 208
Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
FGG + I ++ GL E+ N P QG+ TIRTEEA+ I L
Sbjct: 209 VFGGPRRGVDEIFKESGL------ELPGDLINFAPGQGTETIRTEEAVIIVL 254
>gi|448704115|ref|ZP_21700596.1| hypothetical protein C446_00624 [Halobiforma nitratireducens JCM
10879]
gi|445796504|gb|EMA47008.1| hypothetical protein C446_00624 [Halobiforma nitratireducens JCM
10879]
Length = 294
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 130/323 (40%), Gaps = 68/323 (21%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR+D ++V+ +
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRVDRLIVYPDSD-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
D G F+ +L+Y TP YLR ++ M L +VG+LPPL A ++
Sbjct: 48 --GDTGRFGDGFVSTVLRYAATPPYLRNEVWGMRDELEYVGVLPPLRAASRTGSESNDSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ G+ + + D + G RVTV + + R + A
Sbjct: 106 SLRQGIVTEVGPEGRV--RVNCGMQHPISLNTPPDMEVSEGERVTVRISSRRPVRAK--- 160
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH---LIGTSEHGDIVNSSDLT 292
R P +S + S G + I S +G+ + L
Sbjct: 161 ----------------LVDRPLPGLSVERTDLSAALGREDAGVCIAASRYGETLTVGRLE 204
Query: 293 LPTFR----HLLIAFGG-------LAGLEESI-----------EEDDGLKRKNAREVFHS 330
R + +AFG + G+E S DDG++ A F
Sbjct: 205 TLAGRVEADGMTVAFGAPERGLPDILGIEASDVASAHAGDGPESADDGVEPTPADPGFDL 264
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ NT P QGS +RTEEA+F +L
Sbjct: 265 WLNTVPDQGSDVVRTEEAVFATL 287
>gi|448737861|ref|ZP_21719894.1| hypothetical protein C451_10022 [Halococcus thailandensis JCM
13552]
gi|445802823|gb|EMA53124.1| hypothetical protein C451_10022 [Halococcus thailandensis JCM
13552]
Length = 276
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 133/311 (42%), Gaps = 50/311 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ AT G +ARA T+FR+D V+VF + +
Sbjct: 2 TRSVLVPSSLCREAADKREATHKVGLVARAATVFRVDRVIVFPDPDGDRRW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
G F+ +L Y T YLRK + L + G+LPPL A +
Sbjct: 53 --------GGGFVSTVLAYAATAPYLRKEAWGTRDELEYAGVLPPLRAAARTGSESNGSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + + + +D G RVTV + + + A
Sbjct: 105 SLRQGI-VTEVGPDGR-VRVNCGLQHPISLVAPPEMAVDEGERVTVRISSREPVRA---- 158
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
++ S P G+ V A ++S+ ++ ++ + TS HG+++ + L
Sbjct: 159 RITDESPP-------GFAVERA-DLSAALGREDAGFR------VATSRHGEVLTTERLGT 204
Query: 294 PTFRH---LLIAFGGLA-GLEESIEEDDG-LKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
R L +AFG GL ++ D + + E F + NT P QGS +RTEEA
Sbjct: 205 LAGRADDGLTVAFGAPERGLPAMLDIDAASVASDSPHEGFDCWLNTVPKQGSEVVRTEEA 264
Query: 349 IFISLQYFQEP 359
+F +L P
Sbjct: 265 VFATLAPLTLP 275
>gi|84999294|ref|XP_954368.1| hypothetical protein [Theileria annulata]
gi|65305366|emb|CAI73691.1| hypothetical protein TA21040 [Theileria annulata]
Length = 232
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/120 (42%), Positives = 69/120 (57%), Gaps = 12/120 (10%)
Query: 237 VVPPSKP-KESGMYWGYKVRYAPNISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLTL 293
+V P +P K +G+YWGY VR +++ K C + G YD IGT E G++ S + L
Sbjct: 113 LVHPEEPLKVAGLYWGYVVRECKSVAESLKGCPFNESGNYDLKIGTCERGELY-SLNTKL 171
Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
P F+++LI FG + GLE +EE + F YFN C Q SRTIRTEEA+ I L
Sbjct: 172 PKFKNMLIHFGPVLGLEHVMEEPEN--------KFDKYFNFCNQQKSRTIRTEEALLIVL 223
>gi|383625046|ref|ZP_09949452.1| hypothetical protein HlacAJ_17003 [Halobiforma lacisalsi AJ5]
gi|448697502|ref|ZP_21698542.1| hypothetical protein C445_11277 [Halobiforma lacisalsi AJ5]
gi|445781455|gb|EMA32311.1| hypothetical protein C445_11277 [Halobiforma lacisalsi AJ5]
Length = 296
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 133/325 (40%), Gaps = 70/325 (21%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TI+R+D ++V+ ++
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATIYRVDRLIVYPDRDG------------- 48
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
+ G F+ +L+Y TP YLR + M L +VG+LPPL A ++
Sbjct: 49 ---ETGRFGDGFVSTVLRYAATPPYLRNEAWGMRDELEYVGVLPPLRAVSRTGSESNDSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ G+ + + D ++ G RVTV + + R
Sbjct: 106 SLRQGIVTEVGPEGRV--RVNCGMQHPISLNTPPDMEVEEGERVTVRISSRR-------- 155
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHL---IGTSEHGDIVNSSDLT 292
P + K R P +S + S G + I S HG+ + L
Sbjct: 156 ----PVRAK-------LVDRPLPGLSVERTDLSAALGREDAGVRIAASRHGEELTVGRLE 204
Query: 293 LPTFR----HLLIAFGG-------LAGLEESI-------------EEDDGLKRKNAREVF 328
R L +AFG + G+E S + DDG++ A F
Sbjct: 205 TLAGRIEDDGLTVAFGAPERGLPDILGIEASAVAVDPDPSDRGDSDADDGVEPTPADPGF 264
Query: 329 HSYFNTCPHQGSRTIRTEEAIFISL 353
+ NT P QGS +RTEEA+F +L
Sbjct: 265 DLWLNTVPDQGSDVVRTEEAVFATL 289
>gi|374326111|ref|YP_005084311.1| hypothetical protein P186_0607 [Pyrobaculum sp. 1860]
gi|356641380|gb|AET32059.1| hypothetical protein P186_0607 [Pyrobaculum sp. 1860]
Length = 264
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 88/295 (29%), Positives = 125/295 (42%), Gaps = 48/295 (16%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+A+P + R G +ARA IFR++ VV++ +Y R A +
Sbjct: 3 SIAIPHDFLSEAPDEASKVRKLGYLARAAAIFRVEYVVIY-------HYGRPLAED---- 51
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
D ++ +L+YL TP YLRK +F + LR G+LPPL P H E
Sbjct: 52 IDLAKT-------VLEYLVTPPYLRKKVFKIDRRLRLAGLLPPLKIPSHRAPSEPRVGEV 104
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTV---AMGTNRNLDADSPRQVV 238
REG+ + + S LV +G K+ V + G R+ V A T R+ S +
Sbjct: 105 REGIVERWDGYYS---LVYIGAEKYAKVPKPYPIGTRLYVKIEAPTTRRDTYRASVYRGP 161
Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
PP P GY+V P S G+D LI T G + ++ +
Sbjct: 162 PPYYP-------GYRVEVRP-------VQSLAEGFDTLILTGREGK--SICEVKSKIGKK 205
Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L FG + I E+ G+K + N P QG TIRTEEA+FI L
Sbjct: 206 TLAVFGSPRKGVDEIMEETGVKLAG------EFINFAPGQGVETIRTEEAVFIVL 254
>gi|18314060|ref|NP_560727.1| hypothetical protein PAE3431 [Pyrobaculum aerophilum str. IM2]
gi|18161641|gb|AAL64909.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
Length = 270
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 127/293 (43%), Gaps = 44/293 (15%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+A+P + R G +ARA IF+ + ++++ + R
Sbjct: 3 SIAIPHDFLSEIPDEAGKVRKLGYLARAAAIFKAEYIIIYHYGTP-------------RR 49
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
D + F +L+YL TP YLRK ++ + + L+ G+LPPL P H E
Sbjct: 50 EDID-----FAKTVLEYLVTPPYLRKKVYKIDNRLKLAGLLPPLKIPSHTVPVEPRIGEI 104
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM-GTNRNLDADSPRQVVPP 240
REG+ + + S L+ +G K+ V + G R+ V + GT D+ R V
Sbjct: 105 REGIVERWDGYYS---LIYIGGGKYAKVPKPYPIGTRLLVRIEGTTSR--PDTYRAAVYR 159
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
P + YWGYK+ P +N S G+D +I T + G + + + + L
Sbjct: 160 GAPPD---YWGYKIDVRP-----LQNLS--EGFDTVILTGKEGKSICEAKPKIG--KKTL 207
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ FGG + I + GL+ N P QG TIRTEEA+FI L
Sbjct: 208 VVFGGPRKGVDEIFREAGLEPPG------ELINFAPGQGVETIRTEEAVFIVL 254
>gi|118576185|ref|YP_875928.1| hypothetical protein CENSYa_0992 [Cenarchaeum symbiosum A]
gi|118194706|gb|ABK77624.1| conserved hypothetical protein [Cenarchaeum symbiosum A]
Length = 265
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/295 (27%), Positives = 125/295 (42%), Gaps = 49/295 (16%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SVA+P S + + + +R A IARA ++F + V V+ +
Sbjct: 3 LSVAIPDSCLADESTQMDKSRKASVIARACSVFGVSAVYVYKDGGGDGR----------- 51
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKHE 177
L +L+YLETP +LRK LF + L++ G+L PL P H +
Sbjct: 52 ----------LLTTILRYLETPPFLRKRLFPRINELKYAGVLSPLRIPSHTASSDPKSVS 101
Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
REGV L + VD+GL + V P RVTV + P V
Sbjct: 102 RGDVREGVVLSKGGRK----FVDIGLGRPVPYSGQTIPPKRVTV------RFERAPPDPV 151
Query: 238 VPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL-TLPTF 296
+ +E+G YWGY+V+ + + +KG +I TS G ++ S + +
Sbjct: 152 IREIPREEAGAYWGYRVKERGRLGELLSG--WKG---RIILTSRKGSAISQSGIRSYAGT 206
Query: 297 RHLLIAFGGL-AGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
+L+ FG G+ + G + +NAR N P Q S T+R EEA+
Sbjct: 207 GEVLVVFGSTDKGIHGMLGGGTG-RVQNAR-----VLNFFPGQSSETVRLEEAML 255
>gi|322372169|ref|ZP_08046710.1| hypothetical protein ZOD2009_21752 [Haladaptatus paucihalophilus
DX253]
gi|320548178|gb|EFW89851.1| hypothetical protein ZOD2009_21752 [Haladaptatus paucihalophilus
DX253]
Length = 280
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/312 (28%), Positives = 127/312 (40%), Gaps = 61/312 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S++ + ATR G +ARA T+FR D +VVF +
Sbjct: 2 TVSILVPSSLVREAEDKREATRKIGYVARAATVFRADRLVVFPDGEG------------- 48
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
E + GA F+ +L+Y TP YLRK +F L + G+LPPL A
Sbjct: 49 ----ERKWGAGFVETILKYAATPPYLRKEVFGRRDELEYAGILPPLRATSQTGSESEGSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLN---KHVVVDQV-LDPGVRVTVAMGTNRNLDA---D 232
R+G+ + V V+ GL VV Q+ + G RVTV + + R + A D
Sbjct: 105 SLRQGIVTEVGPEGRV--RVNCGLQHPISLVVPSQMTVGEGERVTVRISSRRPVRAKLVD 162
Query: 233 SPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHL---IGTSEHGDIVNSS 289
+P P ++ + S D I TS +G+ ++
Sbjct: 163 AP----------------------VPGLTVTRMDLSAALDRDDAGVRIATSRYGEPLSVG 200
Query: 290 DLTLPTFRH----LLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSR 341
L R + +AFG GL + E F + NT P+QGS
Sbjct: 201 RLEEVVTRTNRDGMTVAFGSPGRGLPEIFGVPPEGISAVESGVPARFDLWLNTVPNQGSE 260
Query: 342 TIRTEEAIFISL 353
IRTEEA+F SL
Sbjct: 261 VIRTEEAMFASL 272
>gi|171185760|ref|YP_001794679.1| hypothetical protein Tneu_1306 [Pyrobaculum neutrophilum V24Sta]
gi|170934972|gb|ACB40233.1| Protein of unknown function DUF171 [Pyrobaculum neutrophilum
V24Sta]
Length = 263
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 125/294 (42%), Gaps = 42/294 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+ S+AVP + R G +ARA IFR++EVV++
Sbjct: 2 SFSIAVPHDFLLEAPDEAGKVRKLGYLARAAAIFRVEEVVIY------------------ 43
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA-- 179
+ + F +L+Y+ TP YLR+ ++ + LR G+LPPL P H E A
Sbjct: 44 HYGEPLREEIDFAKTILEYMVTPPYLRRRVYKLDRRLRLAGLLPPLKIPSHTVPREPAVG 103
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
REGV + + S LV +G ++ + + G R+ V + D+ R V
Sbjct: 104 EIREGVVERWDGYYS---LVYIGGGRYAKIPRPYPIGTRLLVKIEAPTG-RPDTYRASVY 159
Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P YWG++V + S+ + G+D ++ T G + L R
Sbjct: 160 RGAPPN---YWGFRV----EVRSLRELAE---GFDAVVLTGREGRSICEVKPRLG--RRT 207
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L+ FGG + I ++G+ E+ N P QG TIRTEEA+FI L
Sbjct: 208 LVVFGGPRKGVDEIYREEGV------EMPGELINFVPGQGVETIRTEEAVFIVL 255
>gi|448399309|ref|ZP_21570611.1| hypothetical protein C476_07478 [Haloterrigena limicola JCM 13563]
gi|445669216|gb|ELZ21829.1| hypothetical protein C476_07478 [Haloterrigena limicola JCM 13563]
Length = 291
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 82/303 (27%), Positives = 133/303 (43%), Gaps = 31/303 (10%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV VP S+ + ATR G +ARA TIFR D ++V+ +R
Sbjct: 2 TISVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLIVY--------------PDRD 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
+ + + G F+ +L+Y TP YLR ++ M L +VG+LPPL A +
Sbjct: 48 GETGQFDGG--FVATVLRYAATPPYLRNEVWGMRDELEYVGVLPPLRAVSQTGSESNGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + + D +D G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNTPPDMEVDEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---NSSD 290
+ +P +++ + G + +S F G + L G E + + +
Sbjct: 164 EPLPGLSVEQTDLSAALGREDAGVCIAASRFGEELTVGRLETLAGRVEDDGMTVAFGAPE 223
Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
LP L I + ++E+ + D A F + NT P+QGS +RTEEA+F
Sbjct: 224 RGLPDI--LGIEASAVGAVQETATDGDDGVEPTADPGFDLWLNTVPNQGSEVVRTEEALF 281
Query: 351 ISL 353
+L
Sbjct: 282 ATL 284
>gi|119872478|ref|YP_930485.1| hypothetical protein Pisl_0966 [Pyrobaculum islandicum DSM 4184]
gi|119673886|gb|ABL88142.1| Protein of unknown function DUF171 [Pyrobaculum islandicum DSM
4184]
Length = 263
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 124/298 (41%), Gaps = 50/298 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
+ S+AVP + R G +ARA IF ++ ++++ Y S
Sbjct: 2 SFSIAVPHDFLSEAPDEASKVRKLGYLARAAAIFNVETIIIY--------YYGSPLREEI 53
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WA 179
+ F +L+Y+ TP YLRK LF + L+ G+LPPL P H E
Sbjct: 54 D----------FAKTILEYVVTPPYLRKRLFKLDQRLKLAGLLPPLKIPSHTVSLEPRVG 103
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV- 238
REGV + + S LV +G K+ + + G R+ V + D+ R V
Sbjct: 104 EIREGVVERWDGYFS---LVYIGGGKYAKIPKPYPVGSRLLVKIEAPTE-RPDTYRASVY 159
Query: 239 ---PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
PP+ YWGYKV P S G+D +I T + G V + +
Sbjct: 160 RGTPPA-------YWGYKVEVRP-------LQSLTEGFDAIILTGKEGRSVCEAKPKV-- 203
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ L+ FGG + I ++ G++ N P QG +IRTEEA+FI L
Sbjct: 204 GKKTLVVFGGPRKGVDEIYKEAGIEPPG------ELINFVPRQGVESIRTEEAVFIVL 255
>gi|385805568|ref|YP_005841966.1| hypothetical protein FFONT_0524 [Fervidicoccus fontis Kam940]
gi|383795431|gb|AFH42514.1| hypothetical protein FFONT_0524 [Fervidicoccus fontis Kam940]
Length = 278
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 138/308 (44%), Gaps = 52/308 (16%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
+ ++ + +P ++N T A IARA +IF+++ + ++ ++S +
Sbjct: 1 MKSIEILLPYCSLENEPDQIRITLKANIIARAASIFKVERITLYRSRS-----------D 49
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHE 177
+ + D+ E L LL+Y TP YL+K +F L++VGM PL P H +K +
Sbjct: 50 KKSCKDQKE----ILGLLLRYFVTPPYLKKKIFGKRKELKYVGMAYPLQIPTHSLSKKAK 105
Query: 178 WAPFREGVT--LKENAPNSVGTLVDVGLNKHVVV---DQVLDPG--VRVTVAMGTNRNLD 230
R G+ +KE+A D+G+ K V + + L G + V V G L+
Sbjct: 106 EGDIRVGLVERVKESA-----IFADIGVGKQVKISNSNYELKRGDLIFVKVKKGDLSELE 160
Query: 231 ADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---- 286
K +Y GY VR N K G +IGTS+ GD +
Sbjct: 161 I-----------IKNPDIYVGYSVRECENPLDYLKEKKKDG---LVIGTSKFGDPIWENL 206
Query: 287 NSSDLTLPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
+ L + +++ FG G+ E ++ G N ++FH +N QG++TIR
Sbjct: 207 KEVEEKLKSSDKVVVVFGEPYRGIFEIFKQLGG----NTDDLFHGIYNLVRDQGTKTIRM 262
Query: 346 EEAIFISL 353
EEA+FI+L
Sbjct: 263 EEALFIAL 270
>gi|448415420|ref|ZP_21578220.1| hypothetical protein C474_05545 [Halosarcina pallida JCM 14848]
gi|445681078|gb|ELZ33519.1| hypothetical protein C474_05545 [Halosarcina pallida JCM 14848]
Length = 278
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 130/317 (41%), Gaps = 72/317 (22%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV VP S++ + ATR G +ARA T+FR + +VVF
Sbjct: 2 TLSVLVPSSVVREAEDKREATRKLGYVARAATVFRANRLVVF-----------------P 44
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL--DAPHHLRKHEWA 179
+R EN G F+ +L+Y TP YLRK ++ L++VG+LPPL + ++
Sbjct: 45 DREGENRWGGEFVETVLRYAVTPPYLRKEVWGHRDELQYVGVLPPLLVSSTTGSESNDSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
++G+ + V V+ GL V + L+ G RV + + + + A
Sbjct: 105 SLQQGIVTEVGPEGRV--RVNCGLQHPVSLYAPSGTELEEGERVAIRISSREPVRARIVD 162
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
+ VP G+ + + + + I TS HG+ LT+P
Sbjct: 163 EPVP-----------GFSI-----ARTDLEEALGRSDAGVTIATSRHGE-----SLTVPK 201
Query: 296 FRHL---------LIAFG----------GLAGLEESIEEDDGLKRKNAREVFHSYFNTCP 336
L + FG G+ E ++E DG F + NT P
Sbjct: 202 LAELSPRIDREGATVVFGSPGRGLPDILGINAEEVTVEPSDGPG-------FDLWLNTIP 254
Query: 337 HQGSRTIRTEEAIFISL 353
QGS +RTEEA+F SL
Sbjct: 255 RQGSEVVRTEEAMFASL 271
>gi|257387875|ref|YP_003177648.1| hypothetical protein Hmuk_1827 [Halomicrobium mukohataei DSM 12286]
gi|257170182|gb|ACV47941.1| Protein of unknown function DUF171 [Halomicrobium mukohataei DSM
12286]
Length = 282
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 134/316 (42%), Gaps = 66/316 (20%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S++ + ATR G +ARA +FR+D + +F + + + +
Sbjct: 2 TTSVLVPSSLVREAEDRREATRKLGYVARAAVVFRVDRLTIFPDPAGAGKW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
E G F+ +L+Y TP YLRK ++ L +VG+LPPL +R +
Sbjct: 53 ------EDG--FVETVLRYAATPPYLRKEVWGKRDELEYVGVLPPL----RVRSQTGSGS 100
Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVVD-----QVLDPGVRVTVAMGTNRNLDA- 231
+L++ VG V+ GL + + +V G RVTV + + R + A
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGLQHPISLPVPSAMEVPGEGERVTVRVSSRRPVRAK 160
Query: 232 --DSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
D P GY V A ++ + + G+ I S HG+ +
Sbjct: 161 LVDEPLT--------------GYVVDTA-DVDTALERSD--AGF--AIAASRHGEELGVD 201
Query: 290 DLTLPTFR-----HLLIAFG-------GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPH 337
L R H+ +AFG + GL+ DG +A F + NT P+
Sbjct: 202 RLGQLVERRDDAGHMTVAFGAPERGLPAIFGLDPDDAVPDGTGDDDAG--FDLWLNTVPN 259
Query: 338 QGSRTIRTEEAIFISL 353
QGS +RTEEA+F +L
Sbjct: 260 QGSEVVRTEEAMFATL 275
>gi|336252441|ref|YP_004595548.1| hypothetical protein Halxa_1033 [Halopiger xanaduensis SH-6]
gi|335336430|gb|AEH35669.1| protein of unknown function DUF171 [Halopiger xanaduensis SH-6]
Length = 292
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/320 (27%), Positives = 133/320 (41%), Gaps = 64/320 (20%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR D +VV+ ++ +
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRADRLVVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
R D F+ +L+Y TP YLR + M L + G+LPPL A ++
Sbjct: 51 GRFD-----GGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAVSQTGSESNDSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + + D +D G RVTV + + R + A
Sbjct: 106 SLRQGIVTEVGPEGRV--RVNCGLQHPISLNTPPDMEVDEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+ +P G V ++S+ ++ + I S HG+ + L
Sbjct: 164 EPLP-----------GLSVEQT-DLSAALSREDAGVR------IAASRHGERLTVGRLET 205
Query: 294 PTFR----HLLIAFG-------GLAGLEESI-------EEDDGLKRKNAR--EVFHSYFN 333
R L +AFG + G+E S + DDG A F + N
Sbjct: 206 LAGRIDRDGLTVAFGAPERGLPAILGIEASDVAAAYGGDADDGNVEPTADPGPGFDLWLN 265
Query: 334 TCPHQGSRTIRTEEAIFISL 353
T P QGS +RTEEA+F +L
Sbjct: 266 TVPDQGSEVVRTEEALFATL 285
>gi|156844294|ref|XP_001645210.1| hypothetical protein Kpol_1060p5 [Vanderwaltozyma polyspora DSM
70294]
gi|156115869|gb|EDO17352.1| hypothetical protein Kpol_1060p5 [Vanderwaltozyma polyspora DSM
70294]
Length = 377
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/234 (28%), Positives = 107/234 (45%), Gaps = 32/234 (13%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P SI+DN +LE T QIA+ TI+ + E+V+ SDN + S +++ +
Sbjct: 51 SICIPTSILDNCHNLEQITSTLYQIAKTATIYNVGEIVILQISKKSDNNNNDSGTSKNQK 110
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
S + + LLQY TP YL +F H ++ LP L A +R +
Sbjct: 111 K---LSDSVLIASLLQYFVTPPYLVNTVFKKDYHKYFKYASKLPRLSALPFMRYLDEDNG 167
Query: 180 PFREGVTLKENAPNSVGT-----------LVDVGLNKHVVVDQVLDP-GVRVTVAMGTNR 227
+REG+T++ + P+S V+VG + + + L P VRVTV
Sbjct: 168 RYREGLTIRMSKPSSNNNSNKGKEFKQTKYVNVGKPEALELKTQLVPVNVRVTVDT---- 223
Query: 228 NLDADSPRQVVPPSKP----KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
++VV P + + +GY VR A + +VF CS+K GY +
Sbjct: 224 -----IEKKVVSPQEAYGDFYGAQFSYGYHVRIAQSFGNVFTECSFKNGYSQAV 272
>gi|126459649|ref|YP_001055927.1| hypothetical protein Pcal_1036 [Pyrobaculum calidifontis JCM 11548]
gi|126249370|gb|ABO08461.1| Protein of unknown function DUF171 [Pyrobaculum calidifontis JCM
11548]
Length = 273
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 127/294 (43%), Gaps = 46/294 (15%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+A+P + + R G +ARA IF+ ++++ + R
Sbjct: 3 SIAIPHDFLSESPDEASKVRKIGFLARAAAIFKAPLIIIYHY-------------GKPLR 49
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
+ + + LL+YL TP YLRK ++ + L+ G+LPPL P H+ E
Sbjct: 50 EEID-----LMKTLLEYLATPPYLRKKVYKLDKRLKLAGLLPPLKIPSHVAPEEPRVGEV 104
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM--GTNRNLDADSPRQVVP 239
REGV + + S LV +G ++ V + G R+ V + T R+ D+ R V
Sbjct: 105 REGVVERWDGYYS---LVYIGGGRYAKVPKPYPIGTRLLVKIEAPTQRS---DTFRASVF 158
Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
P YWGY+V P + ++ + G+D ++ T + G + +++ L
Sbjct: 159 KGPPP---AYWGYRVEVRP-LETLTE------GFDTVVLTGKEGKSICETNIRL--GEKT 206
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
LI FG + I + GLK N P QG TIRTEEA FI L
Sbjct: 207 LIVFGSPRKGVDEIYREAGLKPPG------DLVNFVPGQGVETIRTEEAAFIVL 254
>gi|303283876|ref|XP_003061229.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457580|gb|EEH54879.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 313
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/205 (31%), Positives = 102/205 (49%), Gaps = 21/205 (10%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF----DNKSSSDNYSRSSAANR 120
+AV ++ ++ SL T LA + A A + +EVVV+ ++ + +D + ++
Sbjct: 3 LAVDAAVANSATSLAEVTHLAWRAACAAIKVKANEVVVYGGVKESVAKNDGHYSGPPSHA 62
Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP----LDAPHHLRKH 176
+NR A + R+L+Y++TP YLR AL + H L VG LP A HHLR H
Sbjct: 63 ANRP------AVMIARVLEYVDTPPYLRNALLARHPDLALVGSLPRGGVLKGARHHLRIH 116
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD---- 232
+ + +REG+ + S T +VGL + +D+ + GVRVTV + NR +D
Sbjct: 117 DVSEYREGIVVAAANGASKSTRANVGLLSDLELDKHVKVGVRVTVRL-DNRCRASDGSTT 175
Query: 233 --SPRQVVPPSKPKESGMYWGYKVR 255
+ R V P +G W + VR
Sbjct: 176 PLTGRVVSPEEAALSNGGSWAFNVR 200
>gi|448726795|ref|ZP_21709187.1| hypothetical protein C448_09064 [Halococcus morrhuae DSM 1307]
gi|445793841|gb|EMA44412.1| hypothetical protein C448_09064 [Halococcus morrhuae DSM 1307]
Length = 276
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 130/311 (41%), Gaps = 50/311 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ AT G +ARA T+FR+D V VF + +
Sbjct: 2 TRSVLVPSSLCREAADKREATHKVGLVARAATVFRVDRVTVFPDLDGDRRW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
G F+ +L Y T YLRK + L + G+LPPL A +
Sbjct: 53 --------GGGFVSTVLAYAATAPYLRKEAWGTRDELEYAGVLPPLRAAARTGSESNGSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + + + +D G RVTV + + + A
Sbjct: 105 SSRQGI-VTEVGPDG-RVRVNCGLQHPISLVAPPEMAVDEGERVTVRISSREPVRA---- 158
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
++ S P G+ V A ++S+ ++ ++ + TS HG+++ + L
Sbjct: 159 RITDESPP-------GFAVERA-DLSAALGREDAGFR------VATSRHGEVLTTERLGT 204
Query: 294 PTFRH---LLIAFGGLA-GLEESIEEDDG-LKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
R L +AFG GL ++ D + F + NT P QGS +RTEEA
Sbjct: 205 LAGRADDGLTVAFGAPERGLPAMLDIDAASVASDTPHAGFDCWLNTVPKQGSEVVRTEEA 264
Query: 349 IFISLQYFQEP 359
+F +L P
Sbjct: 265 VFATLAPLTLP 275
>gi|347524197|ref|YP_004781767.1| hypothetical protein Pyrfu_1659 [Pyrolobus fumarii 1A]
gi|343461079|gb|AEM39515.1| protein of unknown function DUF171 [Pyrolobus fumarii 1A]
Length = 286
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/301 (28%), Positives = 136/301 (45%), Gaps = 40/301 (13%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+ + VP S++ SL+L T AG I RA +FR +V++ +++ + R R
Sbjct: 14 LEIVVPASVLSVEPSLQLKTVKAGVIGRAAAVFRASRIVLYVDRAEA---WRDLETFR-- 68
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMH-SSLRFVGMLPPLDAPHH-LRKHEWAP 180
+LL+YL TP YLRK ++ LR+ G+LPPL P H + +
Sbjct: 69 -------------KLLEYLATPPYLRKRVYPPGVPELRYAGLLPPLQIPTHGVGGPKEGE 115
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
RE L++ + V VD GL + V VD L GV+V + + +P +
Sbjct: 116 IREAYVLRKRGRSVV---VDAGLEEEVEVD--LPRGVQVNRGERILVKIVSLTPPILEYV 170
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
P +Y GY+V ++ ++ LI TS G ++V+ L
Sbjct: 171 ENPP---VYTGYRVATFDSLGDYLRSARKD---TLLIATSRRGRRIDEVVDELRKKLGEK 224
Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
+ + + FG GL E I E +G+K E + N P+QG+ T+RTEEA++ +L
Sbjct: 225 KRVALLFGSPREGLYE-IAEREGIKLD---ESVDAVVNVVPNQGTLTVRTEEAVWAALSL 280
Query: 356 F 356
Sbjct: 281 L 281
>gi|408404335|ref|YP_006862318.1| hypothetical protein Ngar_c17280 [Candidatus Nitrososphaera
gargensis Ga9.2]
gi|408364931|gb|AFU58661.1| protein of unknown function DUF171 [Candidatus Nitrososphaera
gargensis Ga9.2]
Length = 280
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 131/315 (41%), Gaps = 48/315 (15%)
Query: 60 VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
+P + VA+P S + + Q+ + Q ARA IF++ + ++ + S
Sbjct: 1 MPNLWVAIPDSSLSDEQTRRDKSIKIAQFARACAIFQVKRIYIYHDSLS----------- 49
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-----LR 174
+ R D N L +L+YL+TPQYLRKAL+ L + G+L P+ APHH ++
Sbjct: 50 QFERDDPN-----LLKTILRYLDTPQYLRKALYPRMHQLEYAGILHPIKAPHHKPPEDIK 104
Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHV-VVDQVLDPGVRVTVAMGTNRNLDADS 233
+ R GV K V+ GL V V Q + G N+ S
Sbjct: 105 RVRAGDVRTGVITKVKG----QLFVEAGLGSLVPFVGQGFE---------GKKVNVKFIS 151
Query: 234 PRQVVPPSKPKESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
P + + E + YWGY+V+ P++ + + ++ TS G + +
Sbjct: 152 PYPNLKAIEASEEDIFEYWGYEVKEVPSLGKLLLASAEN---TEVVITSRKGSYFKNVEA 208
Query: 292 TL----PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEE 347
L +++L+ FG I +G K + N P+QG+ T+R EE
Sbjct: 209 KLIERAKNAQNILVVFGAPKHGVYDILAKEGASIKP----YEFVVNMFPNQGTETVRLEE 264
Query: 348 AIFISLQYFQEPISR 362
A+ +L I +
Sbjct: 265 AVLGTLAILNHSIGK 279
>gi|448311981|ref|ZP_21501734.1| hypothetical protein C493_08771 [Natronolimnobius innermongolicus
JCM 12255]
gi|445603602|gb|ELY57564.1| hypothetical protein C493_08771 [Natronolimnobius innermongolicus
JCM 12255]
Length = 294
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/326 (26%), Positives = 133/326 (40%), Gaps = 74/326 (22%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TI+R+D ++V+ +++
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATIYRVDRLIVYPDRA-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
+ G F+ +L+Y TP YLR + M L +VG+LPPL A +
Sbjct: 48 --GETGRFGDGFVSTVLRYAATPPYLRNEAWGMRDELEYVGVLPPLRAVSQTGSESNGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDA---D 232
R+G+ + V V+ GL + + D + G RVTV + + R + A D
Sbjct: 106 SLRQGIVTEVGPEGRV--RVNCGLQHPISLNTPPDMEVSEGERVTVRISSRRPVRAKLVD 163
Query: 233 SPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH---LIGTSEHGDIVNSS 289
P P ++ + S G + I +S +G +
Sbjct: 164 EP----------------------LPGLAVERTDLSTALGREDAGVCIASSRYGIELTVG 201
Query: 290 DLTLPTFR---HLLIAFGG-------LAGLEES-----------IEEDDGLK-RKNAREV 327
L R L +AFG + G+E S DDG++ ++A
Sbjct: 202 RLETLAGRVGDGLTVAFGAPERGLPDILGIEASAIDADPSGQLDTAADDGVEPTQSADPG 261
Query: 328 FHSYFNTCPHQGSRTIRTEEAIFISL 353
F + NT P QGS +RTEEA+F +L
Sbjct: 262 FDLWLNTVPDQGSEVVRTEEALFATL 287
>gi|297527411|ref|YP_003669435.1| hypothetical protein Shell_1448 [Staphylothermus hellenicus DSM
12710]
gi|297256327|gb|ADI32536.1| Protein of unknown function DUF171 [Staphylothermus hellenicus DSM
12710]
Length = 279
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 134/295 (45%), Gaps = 38/295 (12%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+ VA+P SI+ SL L T Q+ R +IF + E+V F D ++ S N+
Sbjct: 7 ILVALPTSILSTESSLLLKTMKIYQVIRYSSIFGVLEIVFF-----RDPFTDFSQHNKY- 60
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWAP 180
+A + ++ +YL TP YLR+ L L+FVG+LPP L+ R
Sbjct: 61 --------SALIEKIWRYLLTPPYLRRKLIPKDPDLKFVGLLPPLRLNIFDVSRNGYVGE 112
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVP 239
R G KE L D+GL K ++ +PG +G +D ++ R +
Sbjct: 113 KRLGFIYKEKN----KLLADIGLLKPYRIEAGNCEPG-----DIGYVEIVDMNTRRAICL 163
Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
+P Y G + +A ++ V + Y D +I TS +G I N +L +
Sbjct: 164 DEEP-----YRGPVLAFADSLREVLE--EYGRIVDLIIATSRYGRIPNYKELAGVKGKTT 216
Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+I FGG GL E I E +G +N + +NT P Q +TIR+EEA+ +L
Sbjct: 217 IILFGGPHRGLYE-IAEKEGFALENKVD---KVWNTIPEQMVKTIRSEEALISTL 267
>gi|448630851|ref|ZP_21673306.1| nucleic acid methylase [Haloarcula vallismortis ATCC 29715]
gi|445755225|gb|EMA06615.1| nucleic acid methylase [Haloarcula vallismortis ATCC 29715]
Length = 281
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 81/310 (26%), Positives = 137/310 (44%), Gaps = 55/310 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ + ATR G +ARA ++R+D ++V+ + + +
Sbjct: 2 TTSVLVPSSLTREAEDRREATRKLGYVARAAAVYRVDRLIVYPDPDGAGKW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
E G F+ +L+Y TP +LRK ++ L +VG+LPPL +R +
Sbjct: 53 ------EDG--FVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPL----RVRSQTGSGS 100
Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADS 233
+L++ VG V+ G+ + + D ++ G RVTV + + R + A
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGMQHPISLPVPADMDVEQGERVTVRVSSRRPVRA-- 158
Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+ V P+ G+ V +++ + I +S +G+ V S+ L
Sbjct: 159 -KLVDAPTT--------GFDV-----VAADLDAALSRDDAGLTIASSRYGEPVTSTRLGQ 204
Query: 294 PTFRH-----LLIAFG----GLAG-LEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTI 343
R + +AFG GL L+ + + G + + E F + NT P+QGS +
Sbjct: 205 LAERRDAEGGMTVAFGAPERGLPSILDVAPDAVGGDQTSDEPEGFDLWLNTVPNQGSEVV 264
Query: 344 RTEEAIFISL 353
RTEEA+F SL
Sbjct: 265 RTEEALFASL 274
>gi|448298464|ref|ZP_21488493.1| hypothetical protein C496_02927 [Natronorubrum tibetense GA33]
gi|445591660|gb|ELY45861.1| hypothetical protein C496_02927 [Natronorubrum tibetense GA33]
Length = 290
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 130/306 (42%), Gaps = 38/306 (12%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S+ + ATR G +ARA TIFR D ++V+ ++ +
Sbjct: 2 TVSILVPSSLTREAEDKREATRKLGYVARAATIFRADRLIVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
R D F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 51 GRFD-----GGFVSTVLRYAATPPYLRNEAWGMRDELEYAGILPPLRAMSQTGSESDGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ ++ G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMEVEEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
+P + + + VR A +S F G + L G E +
Sbjct: 164 DPLPGLSVERTDLSAALGREDAGVRIA---ASRFGENLTVGRLETLAGRVERDGMTVAFG 220
Query: 288 SSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEE 347
+ + LP L I +A + + + DDG++ A F + NT P QGS +RTEE
Sbjct: 221 APERGLPDI--LDIEASAVAPADGTGDADDGVE-PTADPGFDLWLNTVPDQGSEVVRTEE 277
Query: 348 AIFISL 353
A+F SL
Sbjct: 278 ALFASL 283
>gi|15790631|ref|NP_280455.1| hypothetical protein VNG1688C [Halobacterium sp. NRC-1]
gi|169236370|ref|YP_001689570.1| hypothetical protein OE3386F [Halobacterium salinarum R1]
gi|68068034|sp|Q06847.2|Y1688_HALSA RecName: Full=Uncharacterized protein VNG_1688C
gi|10581158|gb|AAG19935.1| conserved hypothetical protein [Halobacterium sp. NRC-1]
gi|167727436|emb|CAP14224.1| conserved hypothetical protein [Halobacterium salinarum R1]
Length = 283
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 78/312 (25%), Positives = 129/312 (41%), Gaps = 57/312 (18%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S++ + ATR G +ARA +FRID VVVF ++
Sbjct: 2 TRSVLVPSSLVREAEDKREATRKLGYVARAAAVFRIDRVVVFPDEDG------------- 48
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWA 179
E + G F+ +L+Y TP YLRK F L + G+LPP L +
Sbjct: 49 ----ERQWGGGFVETVLRYAATPPYLRKEAFDTRDELAYAGVLPPLRLSSWTGSDSSGSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ----VLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + + V V+ G+ + + + + G RVT+ + + R + A
Sbjct: 105 SLRQGIVTQVGSEGRV--RVNCGMQHPISLHEPPGMAVSEGERVTIRVSSRRPVRAKLVD 162
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
+P + +G+ + + I TS HG+ ++ + +L
Sbjct: 163 DPLPGFSVERTGL----------------GDALDRSDAGVRIATSRHGEPLSVA--SLGG 204
Query: 296 FRH------LLIAFGG--------LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSR 341
+R + +AFG L +++ E +A F ++ NT P QGS
Sbjct: 205 YRERIARDGVTVAFGAPERGLPPMLGVSADAVNESVTDSSADAPARFDAWLNTIPDQGSE 264
Query: 342 TIRTEEAIFISL 353
+RTEEA+ +L
Sbjct: 265 VVRTEEAVLATL 276
>gi|344231046|gb|EGV62931.1| DUF171-domain-containing protein [Candida tenuis ATCC 10573]
gi|344231047|gb|EGV62932.1| hypothetical protein CANTEDRAFT_115920 [Candida tenuis ATCC 10573]
Length = 358
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 137/338 (40%), Gaps = 59/338 (17%)
Query: 64 SVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-RSSAANR 120
S+ +P S+I N +LE T +A Q+A+A TI+ + E+++ D + Y S AN+
Sbjct: 26 SICLPSSLISYANASNLEQITNIAYQVAKAATIYNVPEIIILDVPTIDAQYEILESKANK 85
Query: 121 SNRSD--------------------------ENESGAAFLVRLLQYLETPQYLRKALF-- 152
+ + D EN + LLQ+ TP YL KA+
Sbjct: 86 AVKIDKKIKFNEILEPPVQQQIVPNQPQNVQENHNNHELFESLLQFFITPPYLVKAMLKD 145
Query: 153 -SMHSSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAPNS--------VGTLVDVG 202
++ LP L P F+EG+++ + +P + V V++G
Sbjct: 146 NKYTKKFKYAQTLPKLSTLPFMGNNGTHNDFKEGLSIPKKSPGNKKQRNKLKVTKYVNIG 205
Query: 203 LNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISS 262
+ + Q + VRVTV + +N SP K S +GY R A N +
Sbjct: 206 HAQPFELTQEVPVNVRVTVDL---KNKTIVSPLHAYGVIGYKSS---FGYHTRIARNFNE 259
Query: 263 VFKNCSYKGGYDHLI-------GTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEE 315
+F ++ GY I +SE D N+ L +L+ FG LE S ++
Sbjct: 260 IFTKSAFPTGYTSSIYVNCANYFSSEPADTTNNIQLNSSENNRVLLVFGNPKDLEFSFKQ 319
Query: 316 DDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
D L + ++F P TI+ E+A+ I+L
Sbjct: 320 DKSLSVPSFTDLFDQKLCIPP-----TIKVEDAVLIAL 352
>gi|210076192|ref|XP_504197.2| YALI0E20669p [Yarrowia lipolytica]
gi|199426946|emb|CAG79792.2| YALI0E20669p [Yarrowia lipolytica CLIB122]
Length = 384
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/353 (23%), Positives = 149/353 (42%), Gaps = 79/353 (22%)
Query: 62 TVSVAVPGSIIDNT--QSLELATRLAGQIARAVTIFRIDEVVVFD------------NKS 107
+V++ VP + I T +S + A A QIARA + +DE+V+++ KS
Sbjct: 46 SVTLVVPSTCISETVVKSYQQAVHTAYQIARAAVAYNVDEIVIYEPSERPDIKQEKTKKS 105
Query: 108 SSDNYSRSSAANRSNRSDENE----SGAAFLVRLLQYLETPQYLRKALFSMHSS---LRF 160
+ ++ + S+ S++++ A L LLQ+ TP YLRK++F+ ++ +
Sbjct: 106 ETSSFGKPKIVFNSDASNDHKPELSERAVTLASLLQFFVTPDYLRKSIFTPETTTTNFEY 165
Query: 161 VGMLPPLDAPHHLRKHEWAPFREGVTL---------------KENAPNSVGTLVDV---- 201
LP L P + A ++EG+++ K N + +++
Sbjct: 166 AKKLPKLLLPFMESRD--ARYKEGMSIPMILNKNVKSKISKQKRNVKKAEAKGIELQPPT 223
Query: 202 --GLNKHVVV----------DQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMY 249
G ++ + Q + GVRVTV N+ + A + G
Sbjct: 224 VDGATPYINIGAAEPHELEYGQTVPVGVRVTVDTKENKVVSAT-----------EAYGQD 272
Query: 250 WGYKVRYAPNISSVFKNCSYKGGYDHLI----GTSEHGDIVNSSD----LTLP-TFRHLL 300
GY VR A + S +F S GYD+ + G ++ + L LP T + L
Sbjct: 273 TGYSVRVATDFSHIFTMPSTPEGYDYTVWCPGGEFRQSELAPETQPKPILALPRTVKRPL 332
Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
FG +E++++ D L A+E+F S + R +R E+ +F++L
Sbjct: 333 FVFGQWDHVEDAVKHDTALGVSGAKELFDSKVD-----FKRPVRVEDGVFVAL 380
>gi|55378385|ref|YP_136235.1| nucleic acid methylase [Haloarcula marismortui ATCC 43049]
gi|344212414|ref|YP_004796734.1| nucleic acid methylase [Haloarcula hispanica ATCC 33960]
gi|448636988|ref|ZP_21675436.1| nucleic acid methylase [Haloarcula sinaiiensis ATCC 33800]
gi|448648022|ref|ZP_21679500.1| nucleic acid methylase [Haloarcula californiae ATCC 33799]
gi|448671479|ref|ZP_21687418.1| nucleic acid methylase [Haloarcula amylolytica JCM 13557]
gi|448681420|ref|ZP_21691553.1| nucleic acid methylase [Haloarcula argentinensis DSM 12282]
gi|57015353|sp|P20571.2|Y1612_HALMA RecName: Full=Uncharacterized protein rrnAC1612
gi|55231110|gb|AAV46529.1| nucleic acid methylase [Haloarcula marismortui ATCC 43049]
gi|343783769|gb|AEM57746.1| nucleic acid methylase [Haloarcula hispanica ATCC 33960]
gi|445765294|gb|EMA16433.1| nucleic acid methylase [Haloarcula sinaiiensis ATCC 33800]
gi|445766082|gb|EMA17219.1| nucleic acid methylase [Haloarcula amylolytica JCM 13557]
gi|445767953|gb|EMA19046.1| nucleic acid methylase [Haloarcula argentinensis DSM 12282]
gi|445775892|gb|EMA26887.1| nucleic acid methylase [Haloarcula californiae ATCC 33799]
Length = 281
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/310 (26%), Positives = 136/310 (43%), Gaps = 55/310 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ + ATR G +ARA ++R+D + V+ + + +
Sbjct: 2 TTSVLVPSSLAREAEDRREATRKLGYVARAAAVYRVDRLTVYPDPDGAGKW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
E G F+ +L+Y TP +LRK ++ L +VG+LPPL +R +
Sbjct: 53 ------EDG--FVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPL----RVRSQTGSGS 100
Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADS 233
+L++ VG V+ G+ + + D ++ G RVTV + + R + A
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGMQHPISLPVPADMDVEQGERVTVRVSSRRPVRA-- 158
Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+ V P+ G+ V +++ + I +S +G+ V S+ L
Sbjct: 159 -KLVDAPTT--------GFDV-----VAADLDAALSRDDAGLTIASSRYGEPVTSTRLGQ 204
Query: 294 PTFRH-----LLIAFG----GLAG-LEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTI 343
R + +AFG GL L+ + + G + + E F + NT P+QGS +
Sbjct: 205 LAERRDAEGGMTVAFGAPERGLPSILDVAPDAVGGDQTSDEPEGFDLWLNTVPNQGSEVV 264
Query: 344 RTEEAIFISL 353
RTEEA+F SL
Sbjct: 265 RTEEALFASL 274
>gi|302348986|ref|YP_003816624.1| hypothetical protein ASAC_1188 [Acidilobus saccharovorans 345-15]
gi|302329398|gb|ADL19593.1| hypothetical protein ASAC_1188 [Acidilobus saccharovorans 345-15]
Length = 277
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 131/306 (42%), Gaps = 65/306 (21%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
V +P S++ Q L T AG I+RA+ IFR+DEV +F ++ S+
Sbjct: 17 VLLPASVLSVEQDLRDKTYKAGLISRALAIFRVDEVRIFLDEDST--------------H 62
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREG 184
D+ E+ L LL Y P +L+K + + L++ G++PPL+ P+HL + EG
Sbjct: 63 DDQET----LSELLNYQVVPPHLKKRVVGLSEKLKYAGIMPPLNLPNHLPPRD---LNEG 115
Query: 185 VTLKENAPNSVGTLVDV-------GLNK--HVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
+ G+ V GL K H V+ ++ RV G L+ S
Sbjct: 116 DLIDVLIITRKGSQCSVYMGEAGEGLLKPCHFDVNDIVT--ARVIRRSGRQFELEPSS-- 171
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
+Y GYKV + + + + +G L+GTS++G S+D L
Sbjct: 172 ---------WGNIYTGYKVLRSGKLLTELQELKEQG--VALVGTSKYG----STDYWL-- 214
Query: 296 FRHLL-----IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
R LL + GG GL + +R+ F N P QG+ T+R+EEA+
Sbjct: 215 LRGLLGRPIALVLGG---------PKSGLLQYTSRKAFDLLINAAPLQGTETLRSEEALL 265
Query: 351 ISLQYF 356
SL
Sbjct: 266 ASLTLL 271
>gi|448735136|ref|ZP_21717353.1| hypothetical protein C450_17783 [Halococcus salifodinae DSM 8989]
gi|445798749|gb|EMA49140.1| hypothetical protein C450_17783 [Halococcus salifodinae DSM 8989]
Length = 279
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/314 (27%), Positives = 130/314 (41%), Gaps = 53/314 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S+ AT G +ARA T+FRID V VF + +
Sbjct: 2 TVSLLVPSSLCREAADKREATHKVGLVARAATVFRIDRVTVFPDPDGDRRW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
G F+ +L+Y T YLRK + L G+LPPL A
Sbjct: 53 --------GGGFVSTVLRYAATAPYLRKEAWGTRDELEQAGVLPPLRAVSRTGSESQGSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + + D LD G RVTV + + + A
Sbjct: 105 SSRQGI-VTEVGPDGR-VRVNCGLQHPISLVDPTDVGLDEGERVTVRISSREPVRA---- 158
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
++V P G+ V A ++S+ ++ + I TS HG+ + + L +
Sbjct: 159 RIVDEPAP-------GFAVERA-DLSAALGREDAGLR------IATSRHGEALTTDRLGV 204
Query: 294 PTFR---HLLIAFGGLA-GLEESIEEDDGLKRKNAR----EVFHSYFNTCPHQGSRTIRT 345
R + +AFG GL + D G + F + +T P+QGS +RT
Sbjct: 205 LAGRVEDGMTVAFGAPERGLPAMLGIDTGSVASGSETDEPAGFDRWLDTVPNQGSEVVRT 264
Query: 346 EEAIFISLQYFQEP 359
EEA+F +L P
Sbjct: 265 EEAVFATLACLTLP 278
>gi|50295076|ref|XP_449949.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529263|emb|CAG62929.1| unnamed protein product [Candida glabrata]
Length = 367
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/231 (29%), Positives = 108/231 (46%), Gaps = 31/231 (13%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
+V +P S++DN Q+L T + Q+AR +T+F + EVVV D K N + + +
Sbjct: 43 AVCIPTSVLDNCQNLSQKTYVVYQLARTLTLFNVAEVVVLDLKQEKQN--SIIEKDDTKK 100
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF--VGMLPPLDAPHHLRKHEWAPF 181
+ S A L LLQY TP YL K++F F LP L A +R E +
Sbjct: 101 EKKRMSDAMLLASLLQYFVTPPYLIKSVFKKEYLPYFKEASKLPRLSALPFMRYMEENRY 160
Query: 182 REGVTLKENAPNSVG-------TLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADS 233
REG+ ++ + P+ +++G + + + L P VRVTV + +D
Sbjct: 161 REGLAIRMSKPDPKSKKEFKQTKYINIGESDALELKSQLVPINVRVTVDI-----ID--- 212
Query: 234 PRQVVPPSK-------PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
++VV P + PK S +GY VR A + + +F S+ GY +
Sbjct: 213 -KKVVAPEEAYGDYVGPKAS---YGYHVRIAKSFADLFMGSSFPSGYTQTV 259
>gi|307102254|gb|EFN50592.1| hypothetical protein CHLNCDRAFT_135058 [Chlorella variabilis]
Length = 86
Score = 79.7 bits (195), Expect = 2e-12, Method: Composition-based stats.
Identities = 42/101 (41%), Positives = 53/101 (52%), Gaps = 20/101 (19%)
Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK 322
+ + C + GGY +GTSE G+ + L L FRHLL+ FGG GLE
Sbjct: 1 MLEQCPFYGGYSLTVGTSERGERTPACQLDLGAFRHLLVVFGGPQGLE------------ 48
Query: 323 NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
Y NTC QGSRTIR+EEAI ISL + Q ++ A
Sbjct: 49 --------YLNTCFDQGSRTIRSEEAILISLAFLQPAVTAA 81
>gi|335437514|ref|ZP_08560290.1| hypothetical protein HLRTI_10401 [Halorhabdus tiamatea SARL4B]
gi|334895862|gb|EGM34025.1| hypothetical protein HLRTI_10401 [Halorhabdus tiamatea SARL4B]
Length = 301
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 134/333 (40%), Gaps = 81/333 (24%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T S VP S+ + ATR G +ARA T+FR+D ++V+ + + +
Sbjct: 2 TQSALVPSSLCREAEDKREATRKIGYVARAATVFRVDRLIVYPDPDGAGRW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
G F+ +L+Y TP YLRK ++ L VG+LPPL +
Sbjct: 53 --------GDGFVETVLRYAATPPYLRKEVWDRRDELECVGVLPPLRVATQTGSGSDDSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDA---D 232
R+G+ + + V V+ G+ + + D+ + G RVTV + + R + A D
Sbjct: 105 SLRQGIVTEVGSDGRV--RVNCGMQHPISLAVPGDRTIREGERVTVRISSRRPVRAKIVD 162
Query: 233 SPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL- 291
P PP GY+V A N+ + + I S HG + + L
Sbjct: 163 QP----PP----------GYQVDRA-NLDAALS----RPDAGVRIAASRHGQSLTTERLG 203
Query: 292 ----TLPTFRHLLIAFGG-----------------------LAGLEESIEEDD----GLK 320
T+ + + +AFG LA ++ E +D G
Sbjct: 204 RLVGTIASAGDMTVAFGAPERGLPAILGVEPGAVDPCTVDELAAAVDAAERNDETVTGSG 263
Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
R F + NT P+QGS +RTEEA+F +L
Sbjct: 264 SDGPR--FDLWLNTVPNQGSEVVRTEEAMFATL 294
>gi|406602263|emb|CCH46156.1| hypothetical protein BN7_5744 [Wickerhamomyces ciferrii]
Length = 355
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 121/274 (44%), Gaps = 51/274 (18%)
Query: 46 KRKRDQLNDDAPIEVPTV-----SVAVPGSIID--NTQSLELATRLAGQIARAVTIFRID 98
K+KR Q +D ++ P S+ +P SII+ N ++L+ T + QIA+ +I+ +
Sbjct: 4 KQKRSQQDDLTSVKKPKTRSINYSLCIPTSIINRSNAKNLQQVTNILYQIAKVASIYNVG 63
Query: 99 EVVVFDNKSSSD-----------NYSRSSAANRSNRSDE---NE--SGAAFLVRLLQYLE 142
E+++ D ++ D N + + N S E NE A + +LQY
Sbjct: 64 EIIIMDIQTPPDEKVNLNDVQDPNSKKKIKFDDDNTSTEKAKNEVSEDAMTIATILQYFV 123
Query: 143 TPQYLRKALF--SMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL------------K 188
TP YL K LF S+ ++ P L + + +REG+T+ K
Sbjct: 124 TPPYLTKTLFKKKFFSNFQYAKNFPKLTTLPFMNPEVSSKYREGLTVTMGKTTKSQNKSK 183
Query: 189 ENAPNSVGTLVDVGLNKHVVVD-QVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESG 247
++ P V++G +K++ + Q + VRVTV T ++VV P + E
Sbjct: 184 KSQPLKNTKFVNIGYDKYLELQGQQVPVNVRVTVNTET---------KKVVSPIEAYEEQ 234
Query: 248 M----YWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
+ +GY VR A +S+F SY GY I
Sbjct: 235 VGAKSSYGYHVRIAKKFTSIFTESSYPDGYSQSI 268
>gi|429190345|ref|YP_007176023.1| hypothetical protein Natgr_0316 [Natronobacterium gregoryi SP2]
gi|448326257|ref|ZP_21515624.1| hypothetical protein C490_12702 [Natronobacterium gregoryi SP2]
gi|429134563|gb|AFZ71574.1| hypothetical protein Natgr_0316 [Natronobacterium gregoryi SP2]
gi|445612914|gb|ELY66631.1| hypothetical protein C490_12702 [Natronobacterium gregoryi SP2]
Length = 288
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 132/316 (41%), Gaps = 60/316 (18%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIF +D +VV+ ++
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATIFGVDRLVVYPDRD-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
+ G F+ +L+Y TP YLR ++ M L +VG+LPPL A +
Sbjct: 48 --GETGRFGDGFVSTVLRYAATPPYLRNEVWGMRDELEYVGVLPPLRATSQTGSESNGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDA---D 232
R+G+ + V V+ GL + + D + G RVT+ + + R + A D
Sbjct: 106 STRQGIVTEVGPEGRV--RVNCGLQHPISLNTPPDVEVAEGERVTIRISSRRPVRAKLVD 163
Query: 233 SPRQVVPPSKPKESGMYW--GYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
P + + S VR A +S F G + L G E GD
Sbjct: 164 CPLPGLSIERTDLSAALGREDAGVRIA---ASRFGEHLTVGRLETLAGRIE-GD------ 213
Query: 291 LTLPTFRHLLIAFGG-------LAGLEESIEE------DDGLKRKNAREVFHSYFNTCPH 337
+ +AFG + G+E S DDG++ A F + NT P+
Sbjct: 214 -------GMTVAFGAPERGLPDILGIEASAISSSDEGGDDGVE-PTADPGFDLWLNTVPN 265
Query: 338 QGSRTIRTEEAIFISL 353
QGS +RTEEA+F +L
Sbjct: 266 QGSDVVRTEEALFATL 281
>gi|448329358|ref|ZP_21518658.1| hypothetical protein C489_09466 [Natrinema versiforme JCM 10478]
gi|445614097|gb|ELY67778.1| hypothetical protein C489_09466 [Natrinema versiforme JCM 10478]
Length = 291
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 134/319 (42%), Gaps = 63/319 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR D +VV+ +R
Sbjct: 2 TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLVVY--------------PDRD 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
+ + + G F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 48 GETGQFDGG--FVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ ++ G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+ +P +++ + VR A +S F G + L G ++ +
Sbjct: 164 EPLPGLSIEQTDLRAALGREDAGVRIA---ASRFGEHLTVGRLETLAGRTKRDGMT---- 216
Query: 291 LTLPTFRHLLIAFG-------GLAGLEESIEE---------DDGLKRKNAREVFHSYFNT 334
+AFG + G+EES E D+G++ A F + NT
Sbjct: 217 ----------VAFGAPERGLPAILGIEESAIEASSGQGDAADNGVE-PTADPGFDLWLNT 265
Query: 335 CPHQGSRTIRTEEAIFISL 353
P QGS +RTEEA+F +L
Sbjct: 266 VPDQGSEVVRTEEALFATL 284
>gi|126465912|ref|YP_001041021.1| hypothetical protein Smar_1015 [Staphylothermus marinus F1]
gi|126014735|gb|ABN70113.1| Protein of unknown function DUF171 [Staphylothermus marinus F1]
Length = 279
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 135/297 (45%), Gaps = 42/297 (14%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+ VA+P SI+ SL L T Q+ R +IF + E+V F D ++ S
Sbjct: 7 ILVALPTSILSTESSLLLKTIKTYQVIRYSSIFGVSEIVFF-----RDPFTDFS------ 55
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL-----DAPHHLRKHE 177
++ + + ++ +YL TP YLR+ L L+FVG+LPPL D + R E
Sbjct: 56 ---QHRKYSVLIEKIWRYLLTPPYLRRKLIPKDPDLKFVGLLPPLRLSTFDVSRNGRVGE 112
Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQV-LDPGVRVTVAMGTNRNLDADSPRQ 236
R G+ +E L D+GL + ++ PG +G +D ++ R
Sbjct: 113 K---RLGLIYREKN----KLLADIGLPRPYRIEAGNCKPG-----DIGYVEIVDVNTRRA 160
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
+ +P Y G + +A ++ V + Y+ D +I TS++G I + +L
Sbjct: 161 ICLDVEP-----YRGPILAFADSLQEVLEE--YRKTVDLIIATSKYGKIPSHKELAGVKG 213
Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ ++I FGG I + +G +N + +NT P Q +TIR+EEA+ +L
Sbjct: 214 KTIIILFGGPHRGLYDIAKKEGFILENKVD---KVWNTIPEQMVKTIRSEEALISTL 267
>gi|320580950|gb|EFW95172.1| hypothetical protein HPODL_3544 [Ogataea parapolymorpha DL-1]
Length = 310
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 114/261 (43%), Gaps = 38/261 (14%)
Query: 62 TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
++++ +P S I + ++LE AT A QIARA + + E+V+ D + + A
Sbjct: 4 SITLCIPSSCISRRSCKNLEQATFTAYQIARAACTYNVGEIVILDVPENIE-------AV 56
Query: 120 RSNRSDENESGAA------FLVRLLQYLETPQYLRKALF--SMHSSLRFVGMLPPLDAPH 171
SN++ E G + L LLQ+ TP YL K +F S+ + LP +
Sbjct: 57 ESNKTKFEEKGTSASDSSRLLAALLQFFTTPPYLVKTVFQASVIEYFKIAKKLPLISTLP 116
Query: 172 HLRKHEWAPFREGVTL------KENAPNSV--------GTLVDVGLNKHVVVDQVLDPGV 217
+++ + FREG+++ K+NA V V++G K + +++ +
Sbjct: 117 YMQNDAASCFREGISIPKKSVVKKNAEGKVLKKKKPTTTKFVNIGERKMLELEKEIPINT 176
Query: 218 RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
RVTV T R + D K +GY VR A SS+F Y GYD I
Sbjct: 177 RVTVNTKTKRVVSPDDAYGKAGALKT------FGYHVRVATKFSSLFTEPGYTEGYDRCI 230
Query: 278 GTSEHGDIVNSSDLT-LPTFR 297
S N T LP+++
Sbjct: 231 YVSSGDYFSNDQPATGLPSYQ 251
>gi|315425962|dbj|BAJ47611.1| conserved hypothetical protein [Candidatus Caldiarchaeum
subterraneum]
gi|343484754|dbj|BAJ50408.1| conserved hypothetical protein [Candidatus Caldiarchaeum
subterraneum]
Length = 278
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 129/298 (43%), Gaps = 43/298 (14%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
+V +P S S AT G + R + FR+DE+V++ ++ +
Sbjct: 5 AVFLPHSFTAEVPSFREATMKIGFVGRVLATFRVDELVLYPDEPGAPRLR---------- 54
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFRE 183
A + LL YL T YLR ++ + SLR+ G+LPPL+ P H +E
Sbjct: 55 ------NALLIKELLDYLVTAPYLRSRVYPLKPSLRYAGLLPPLNIPTHPEADALK--KE 106
Query: 184 GVTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
G ++ + G +L++ G+ K + + + L VT+ + R + VV
Sbjct: 107 GTHYRQALVTASGQTSLLEAGMGKPLKIPRKLPKNTIVTLRV---RVKEGRVKYSVV--- 160
Query: 242 KPKESG-MYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT----LPTF 296
K+SG Y G++ R ++ K +++ + TS G+ +N T L
Sbjct: 161 -GKKSGETYTGFRTRVVKSLEEALKPYTFR------LATSRLGEPINQVYATVAQRLKES 213
Query: 297 RHLLIAFGGL-AGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ FG GL E I GL+ +F NT P+QG +TIRTEEA+ +L
Sbjct: 214 TQTCVVFGAADKGLHE-IARAHGLEYNR---LFTHTVNTAPNQGVKTIRTEEALAYTL 267
>gi|313125789|ref|YP_004036059.1| hypothetical protein Hbor_10220 [Halogeometricum borinquense DSM
11551]
gi|448285628|ref|ZP_21476869.1| hypothetical protein C499_02614 [Halogeometricum borinquense DSM
11551]
gi|312292154|gb|ADQ66614.1| uncharacterized conserved protein [Halogeometricum borinquense DSM
11551]
gi|445576264|gb|ELY30721.1| hypothetical protein C499_02614 [Halogeometricum borinquense DSM
11551]
Length = 278
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 128/317 (40%), Gaps = 72/317 (22%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV VP S++ + ATR G +ARA T+FR D ++VF
Sbjct: 2 TLSVLVPSSLVREAEDKREATRKLGYVARAATVFRADRLIVF-----------------P 44
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL--DAPHHLRKHEWA 179
+R E G F+ +L+Y TP YLRK ++ L +VG+LPPL + +
Sbjct: 45 DREGERRWGRKFVETVLRYAVTPPYLRKEVWGHRDELEYVGVLPPLLVSSTTGSESDDSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
++G+ + E P+ V+ G+ + + D ++ G RV + + + + A
Sbjct: 105 SLQQGI-VTEVGPDG-RVRVNCGMQHPISLYTPSDMMVKEGERVAIRISSREPVRARIVD 162
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
+ VP G+ V + + I TS G+ LT+P
Sbjct: 163 EPVP-----------GFDVSRMD-----LEEALGRADAGITIATSRFGE-----SLTVPK 201
Query: 296 FRHL---------LIAFG----------GLAGLEESIEEDDGLKRKNAREVFHSYFNTCP 336
L + FG G+ E ++E DG F + NT P
Sbjct: 202 LADLSPRIEREGATVVFGSPGRGLPDILGIDAEEVTVEPSDGPG-------FDLWLNTIP 254
Query: 337 HQGSRTIRTEEAIFISL 353
QGS +RTEEA+F SL
Sbjct: 255 RQGSEVVRTEEAMFASL 271
>gi|448358803|ref|ZP_21547477.1| hypothetical protein C482_12774 [Natrialba chahannaoensis JCM
10990]
gi|445644483|gb|ELY97496.1| hypothetical protein C482_12774 [Natrialba chahannaoensis JCM
10990]
Length = 314
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 84/325 (25%), Positives = 135/325 (41%), Gaps = 52/325 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR+D ++V+ +++
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRVDRLIVYPDRA-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WA 179
D G F+ +L+Y TP YLR + M L +VG+LPPL A
Sbjct: 48 --GDTGRFGDEFVSTVLRYAATPPYLRNEAWGMRDELEYVGVLPPLRATSQTGSESDGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + ++ + G RVT+ + + R + A
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMAVSVGERVTIRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPN-------------ISSVFKNCSYKGGYDHLI 277
+ +P + + + VR A + ++ + + + GY
Sbjct: 164 EPLPGLTVERTDLQAALSREDAGVRIAASRFGAELTVGRLGTLAGHTERDTTRDGYTIAF 223
Query: 278 GTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIE------EDDGLKRKNARE---VF 328
G E G + +L T L LA E+ +E +DD + + F
Sbjct: 224 GAPERG-LPAILELEDTTVESLSAWPSWLAEDEDGVEPTSAAGDDDTVAAETDAADPGGF 282
Query: 329 HSYFNTCPHQGSRTIRTEEAIFISL 353
+ NT P QGS +RTEEA+F +L
Sbjct: 283 DLWLNTVPDQGSDVVRTEEALFATL 307
>gi|452206449|ref|YP_007486571.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
gi|452082549|emb|CCQ35807.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
Length = 277
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 82/308 (26%), Positives = 130/308 (42%), Gaps = 54/308 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ + ATR G +ARA T+FR+D ++V+ ++ D
Sbjct: 2 THSVLVPSSLTREAEDKREATRKLGYVARAATVFRVDRLIVYPDQGGDDGRFGDGFVG-- 59
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+L+Y TP YLRK + L + G+LPPL +R +
Sbjct: 60 --------------TVLEYAATPPYLRKEAWGKRDELEYAGVLPPL----RVRTQTGSGS 101
Query: 182 REGVTLKENAPNSVGT----LVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADS 233
+L++ VG+ V+ GL + + + G RVT+ + + + A
Sbjct: 102 DGSGSLRQGIVTEVGSDGRVRVNCGLQHPISLPVPPSMAVSAGERVTIRVSSREPVRA-- 159
Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS---SD 290
+ V PS G+ V A +C + I S HG+++ +
Sbjct: 160 -KLVADPSP--------GFAVERA-----TIDDCLARPDAGVRIAASRHGEVLTVDRLGE 205
Query: 291 LTLPTFRHLLIAFGGLA-GLEESIEEDDGLK----RKNAREVFHSYFNTCPHQGSRTIRT 345
L+ T L +AFG GL E ++ + + NAR F + NT P+QGS +RT
Sbjct: 206 LSGRTADGLTVAFGAPERGLPEILDCEPRAEAIGDEPNAR--FDLWLNTVPNQGSEVVRT 263
Query: 346 EEAIFISL 353
EEA+F +L
Sbjct: 264 EEALFATL 271
>gi|448345584|ref|ZP_21534473.1| hypothetical protein C485_07297 [Natrinema altunense JCM 12890]
gi|445633517|gb|ELY86704.1| hypothetical protein C485_07297 [Natrinema altunense JCM 12890]
Length = 301
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 137/328 (41%), Gaps = 71/328 (21%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR D +VV+ +R
Sbjct: 2 TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLVVY--------------PDRD 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
+ + + G F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 48 GETGQFDGG--FVQTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ ++ G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+ +P +++ + VR A +S F + G + L G +
Sbjct: 164 EPLPGLSIEQTDLQAALGREDAGVRIA---ASRFGDELTVGRLETLAGRVQRD------- 213
Query: 291 LTLPTFRHLLIAFG-------GLAGLEES-IE----EDDGLKRKNARE------------ 326
+ +AFG + G+EES IE +DD ++ + R+
Sbjct: 214 -------GMTVAFGAPERGLPAILGIEESAIEPSSGQDDAIEPSSDRDDAADNGVEPTAD 266
Query: 327 -VFHSYFNTCPHQGSRTIRTEEAIFISL 353
F + NT P QGS IRTEEA+F +L
Sbjct: 267 PGFDLWLNTVPDQGSEVIRTEEALFATL 294
>gi|257053358|ref|YP_003131191.1| hypothetical protein Huta_2291 [Halorhabdus utahensis DSM 12940]
gi|256692121|gb|ACV12458.1| Protein of unknown function DUF171 [Halorhabdus utahensis DSM
12940]
Length = 301
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 133/333 (39%), Gaps = 81/333 (24%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T++ VP S+ + ATR G +ARA T+FR+D ++V+ + + +
Sbjct: 2 TLNALVPSSLCREAEDKREATRKIGYVARAATVFRVDRLIVYPDPDGAGRW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
G F+ +L+Y TP YLRK + L VG+LPPL +
Sbjct: 53 --------GDGFVETVLRYAATPPYLRKEAWERRDELGCVGVLPPLRVATQTGSGSDDSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDA---D 232
R+G+ + + V V+ G+ + + D+ + G RVTV + + R + A D
Sbjct: 105 SLRQGIVTEVGSDGRV--RVNCGMQHPISLAVPGDRTIKEGERVTVRISSRRPVRAKIVD 162
Query: 233 SPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL- 291
P PP GY+V A N+ + I S HG + + L
Sbjct: 163 QP----PP----------GYQVDRA-NLDAALSRPDAG----VRIAASRHGQSLTTERLG 203
Query: 292 ----TLPTFRHLLIAFGG-----------------------LAGLEESIEEDD----GLK 320
T+ + + +AFG LA ++ E +D G
Sbjct: 204 RLAGTIASAGDMTVAFGAPERGLPAILGVEPGAVDPSTVDELAAAVDAAERNDETVTGSG 263
Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
R F + NT P+QGS +RTEEA+F +L
Sbjct: 264 SDGPR--FDLWLNTVPNQGSEVVRTEEAMFATL 294
>gi|448350180|ref|ZP_21538999.1| hypothetical protein C484_11446 [Natrialba taiwanensis DSM 12281]
gi|445637687|gb|ELY90835.1| hypothetical protein C484_11446 [Natrialba taiwanensis DSM 12281]
Length = 299
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 132/312 (42%), Gaps = 41/312 (13%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA T+FR D +VV+ ++ +
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW--A 179
R D F+ +L+Y TP YLR ++ M + L + G+LPPL A
Sbjct: 51 GRFD-----GGFVSTVLRYAATPPYLRNEVWGMRNELEYAGVLPPLRAMSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + ++ LD G RVT+ + + R + A
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMALDEGERVTIRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
+P + + + VR A +S + G D L G + +
Sbjct: 164 DPLPGLSVERTDLQAALSREDAGVRIA---ASRYGTELTVGRLDTLAGRVDRDGMTVAFG 220
Query: 288 SSDLTLPTFRHLLI--AFGGLAGLEESIEEDDGLKRKNAR----EVFHSYFNTCPHQGSR 341
+ + LP A L+ ++ DG++ + F + NT P+QGS
Sbjct: 221 APERGLPDILEFESEDAVDTLSSWPPRSDDADGVEPTSGDVDPGRGFDLWLNTVPYQGSD 280
Query: 342 TIRTEEAIFISL 353
+RTEEA+F +L
Sbjct: 281 VVRTEEALFATL 292
>gi|433638934|ref|YP_007284694.1| hypothetical protein Halru_1963 [Halovivax ruber XH-70]
gi|433290738|gb|AGB16561.1| hypothetical protein Halru_1963 [Halovivax ruber XH-70]
Length = 291
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 129/319 (40%), Gaps = 63/319 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP SI + ATR G +ARA TIFR D ++VF + +
Sbjct: 2 TVSVLVPSSIAREAEDKREATRKLGYVARAATIFRADRLIVFPDGD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
R D F+ +L+Y TP YL+ + L +VG+LPPL A +
Sbjct: 51 GRFD-----GGFVETVLRYAATPPYLKTEAWGKRDELEYVGVLPPLRASSRTGSESNGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDA---D 232
R G+ + V V+ GL + ++ V+D G RVTV + + R + A D
Sbjct: 106 SLRRGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMVVDEGERVTVRISSRRPVRAKLVD 163
Query: 233 SPRQVVPPSKPKESGMYW--GYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
P + + S VR A SS + G + L G E
Sbjct: 164 EPLAGLSIERTDLSAALGREDAGVRIA---SSRYGEELTTGRLETLAGRVER-------- 212
Query: 291 LTLPTFRHLLIAFGG-------LAGLEESIE---------EDDGLKRKNAREVFHSYFNT 334
+ +AFG + G+EES DG++ + F + NT
Sbjct: 213 ------DGMTVAFGAPERGLPDILGIEESRAGASSSGDGGTADGVE-PSGDPGFDLWLNT 265
Query: 335 CPHQGSRTIRTEEAIFISL 353
P+QGS +RTEEA+F +L
Sbjct: 266 VPNQGSEVVRTEEAMFATL 284
>gi|448352596|ref|ZP_21541378.1| hypothetical protein C483_01224 [Natrialba hulunbeirensis JCM
10989]
gi|445642158|gb|ELY95228.1| hypothetical protein C483_01224 [Natrialba hulunbeirensis JCM
10989]
Length = 314
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 138/328 (42%), Gaps = 58/328 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV VP S+ + ATR G +ARA TIFR+D ++V+ +++
Sbjct: 2 TISVLVPSSLTREAEDKREATRKLGYVARAATIFRVDRLIVYPDRA-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WA 179
D G F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 48 --GDTGRFGDGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRATSQTGSESDGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + ++ + G RVT+ + + R + A
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMAVSEGERVTIRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPN-------------ISSVFKNCSYKGGYDHLI 277
+ +P + + + VR A + ++ + + + GY
Sbjct: 164 EPLPGLTVERTDLQAALSREDAGVRIAASRFGAELTVGRLGTLAGHTERDTTRDGYTIAF 223
Query: 278 GTSEHG--DIVNSSDLTLPTFRHLLIAF-GGLAGLEESIE------EDDGLKRKNARE-- 326
G E G I+ D T+ + L A+ LA E+ +E +DD + +
Sbjct: 224 GAPERGLPAILELEDTTVES----LSAWPSWLADDEDGVEPTSAAGDDDTVAAETDAADP 279
Query: 327 -VFHSYFNTCPHQGSRTIRTEEAIFISL 353
F + NT P QGS +RTEEA+F +L
Sbjct: 280 GGFDLWLNTVPDQGSGVVRTEEALFATL 307
>gi|448338135|ref|ZP_21527187.1| hypothetical protein C487_10519 [Natrinema pallidum DSM 3751]
gi|445623310|gb|ELY76732.1| hypothetical protein C487_10519 [Natrinema pallidum DSM 3751]
Length = 301
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 138/328 (42%), Gaps = 71/328 (21%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR D +VV+ +R
Sbjct: 2 TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLVVY--------------PDRD 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
+ + + G F+ +L+Y TP YLR + M + L + G+LPPL A
Sbjct: 48 GETGQFDGG--FVQTVLRYAATPPYLRNEAWGMRNELEYAGVLPPLRAMSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ ++ G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+ +P +++ + VR A +S F + G + L G +
Sbjct: 164 EPLPGLSIEQTDLQAALGREDAGVRIA---ASRFGDELTVGRLETLAGRVQRD------- 213
Query: 291 LTLPTFRHLLIAFG-------GLAGLEES-IE----EDDGLKRKNARE------------ 326
+ +AFG + G+EES IE +DD ++ + R+
Sbjct: 214 -------GMTVAFGAPERGLPAILGIEESAIEASSGQDDAIEPSSNRDDAADNGVEPTAD 266
Query: 327 -VFHSYFNTCPHQGSRTIRTEEAIFISL 353
F + NT P QGS +RTEEA+F +L
Sbjct: 267 PGFDLWLNTVPDQGSEVVRTEEALFATL 294
>gi|448685259|ref|ZP_21693251.1| nucleic acid methylase [Haloarcula japonica DSM 6131]
gi|445781870|gb|EMA32721.1| nucleic acid methylase [Haloarcula japonica DSM 6131]
Length = 281
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/310 (26%), Positives = 136/310 (43%), Gaps = 55/310 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ + ATR G +ARA ++R+D + V+ + + +
Sbjct: 2 TTSVLVPSSLAREAEDRREATRKLGYVARAAAVYRVDRLTVYPDPDGAGKW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
E G F+ +L+Y TP +LRK ++ L +VG+LPPL +R +
Sbjct: 53 ------EDG--FVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPL----RVRSQTGSGS 100
Query: 182 REGVTLKENAPNSVGTLVDVGLN---KHVVVDQV-----LDPGVRVTVAMGTNRNLDADS 233
+L++ VG V +N +H + V ++ G RVTV + + R + A
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGMQHPISLPVPAAMDVEQGERVTVRVSSRRPVRA-- 158
Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+ V P+ G+ V +++ + I +S +G+ V S+ L
Sbjct: 159 -KLVDAPTT--------GFDV-----VAADLDAALSRDDAGLTIASSRYGEPVTSTRLGQ 204
Query: 294 PTFRH-----LLIAFG----GLAG-LEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTI 343
R + +AFG GL L+ + + G + + E F + NT P+QGS +
Sbjct: 205 LAERRDAEGGMTVAFGAPERGLPSILDVAPDAVGGDQTSDEPEGFDLWLNTVPNQGSEVV 264
Query: 344 RTEEAIFISL 353
RTEEA+F SL
Sbjct: 265 RTEEALFASL 274
>gi|401625618|gb|EJS43618.1| YGR283C [Saccharomyces arboricola H-6]
Length = 342
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 109/247 (44%), Gaps = 30/247 (12%)
Query: 35 DSSSCDNKKKKKRKRDQLNDDAPIEVPT----VSVAVPGSIIDNTQSLELATRLAGQIAR 90
D+S + K KK KR+ LN I + + S+ +P +IIDN +LE + A QIA+
Sbjct: 8 DNSKSNFKVTKKIKRNILNLKKIINIKSGVINYSICIPTTIIDNCCNLEQVSFTAYQIAK 67
Query: 91 AVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKA 150
+F I E++V D S+ R RS E S L LLQY TP L
Sbjct: 68 TAVLFNIQEIIVLD-------MSKDKKHERQPRSKETISDCLLLATLLQYFVTPPNLVDV 120
Query: 151 LFSM--HSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKEN------APNSVGTLVDVG 202
F S L++ PPL + + + EG+++ E + ++ +LV +G
Sbjct: 121 TFKKKHRSYLKYAFTFPPLGQLPFMNEPADETYIEGLSIAEQTSEKDISDENLTSLVYIG 180
Query: 203 LNKHV-VVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNIS 261
N + + Q + VRVTV D + V P K++ + GY VR A ++
Sbjct: 181 KNNTIKLSSQNIPKTVRVTV--------DTLRKKVVSPRDAYKDTQL--GYYVRMASSLD 230
Query: 262 SVFKNCS 268
V C+
Sbjct: 231 EVSGGCT 237
>gi|448362305|ref|ZP_21550916.1| hypothetical protein C481_09647 [Natrialba asiatica DSM 12278]
gi|445648826|gb|ELZ01774.1| hypothetical protein C481_09647 [Natrialba asiatica DSM 12278]
Length = 300
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/327 (26%), Positives = 137/327 (41%), Gaps = 70/327 (21%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA T+FR D ++V+ ++ +
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLIVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW--A 179
R D F+ +L+Y TP YLR ++ M + L + G+LPPL A
Sbjct: 51 GRFD-----GGFVSTVLRYAATPPYLRNEVWGMRNELEYAGVLPPLRAVSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + ++ LD G RVT+ + + R + A
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMALDEGERVTIRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+ +P + + + VR A +S + G D L G H V+
Sbjct: 164 EPLPGLSVERTDLQAALSREDAGVRIA---ASRYGTELTVGRLDTLAG---H---VDRDG 214
Query: 291 LTLPTFRHLLIAFG----GLAGLEESIEED----------------DGLKRKNAR----E 326
+T +AFG GL + E ED DG++ +
Sbjct: 215 MT--------VAFGAPERGLPDILEFESEDAVDTLSSWPPRPDDDTDGVEPTSGDVDPGR 266
Query: 327 VFHSYFNTCPHQGSRTIRTEEAIFISL 353
F + NT P+QGS +RTEEA+F +L
Sbjct: 267 GFDLWLNTVPYQGSDVVRTEEALFATL 293
>gi|448309693|ref|ZP_21499549.1| hypothetical protein C494_18167 [Natronorubrum bangense JCM 10635]
gi|445589550|gb|ELY43780.1| hypothetical protein C494_18167 [Natronorubrum bangense JCM 10635]
Length = 300
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 130/336 (38%), Gaps = 88/336 (26%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S+ + ATR G +ARA TIFR D +VV+ ++
Sbjct: 2 TVSILVPSSLTREAEDKREATRKLGYVARAATIFRADRLVVYPDR--------------- 46
Query: 122 NRSDENESG---AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKH 176
E E+G F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 47 ----EGETGRFDGGFVSTVLRYAATPPYLRNEAWGMRDELEYAGILPPLRAMSQTGSEST 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDA- 231
R+G+ + V V+ GL + ++ +D G RVTV + + R + A
Sbjct: 103 GSGSSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMEVDEGERVTVRISSRRPVRAK 160
Query: 232 --DSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH---LIGTSEHGDIV 286
D P P +S + S G + I S G+ +
Sbjct: 161 LEDKP----------------------LPGLSVERTDLSAALGREDAGVCIAASRFGEEL 198
Query: 287 NSSDLTLPTFR----HLLIAFGG-------LAGLEESI------------------EEDD 317
L R L +AFG + G+E S DD
Sbjct: 199 TVGRLGTLAGRIDRDGLTVAFGAPERGLPDILGIEASAVNASSGQGDAAESSTGDDTADD 258
Query: 318 GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
G++ A F + NT P+QGS +RTEEA+F +L
Sbjct: 259 GVE-PTADPGFDLWLNTVPNQGSEVVRTEEALFATL 293
>gi|448377805|ref|ZP_21560501.1| hypothetical protein C479_14683 [Halovivax asiaticus JCM 14624]
gi|445655749|gb|ELZ08594.1| hypothetical protein C479_14683 [Halovivax asiaticus JCM 14624]
Length = 291
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 129/319 (40%), Gaps = 63/319 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP SI + ATR G +ARA TIFR D ++VF + +
Sbjct: 2 TVSVLVPSSIAREAEDKREATRKLGYVARAATIFRADRLIVFPDGD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
R D F+ +L+Y TP YL+ + L +VG+LPPL A +
Sbjct: 51 GRFD-----GGFVETVLRYAATPPYLKTEAWGKRDELEYVGVLPPLRASSRTGSESNGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDA---D 232
R G+ + V V+ GL + ++ V+D G RVTV + + R + A D
Sbjct: 106 SLRRGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMVVDEGERVTVRISSRRPVRAKLVD 163
Query: 233 SPRQVVPPSKPKESGMYW--GYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
P + + S VR A SS + G + L G E
Sbjct: 164 EPLAGLSIERTDLSAALGREDAGVRIA---SSRYGEELTAGRLETLAGRVER-------- 212
Query: 291 LTLPTFRHLLIAFGG-------LAGLEESIE---------EDDGLKRKNAREVFHSYFNT 334
+ +AFG + G+EES DG++ + F + NT
Sbjct: 213 ------DGMTVAFGAPERGLPDILGIEESRAGASSSGDGGAADGVE-PSGDPGFDLWLNT 265
Query: 335 CPHQGSRTIRTEEAIFISL 353
P+QGS +RTEEA+F +L
Sbjct: 266 VPNQGSGVVRTEEAMFATL 284
>gi|448386088|ref|ZP_21564296.1| hypothetical protein C478_18131 [Haloterrigena thermotolerans DSM
11522]
gi|445655986|gb|ELZ08828.1| hypothetical protein C478_18131 [Haloterrigena thermotolerans DSM
11522]
Length = 291
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/306 (26%), Positives = 131/306 (42%), Gaps = 37/306 (12%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S+ + ATR G +ARA T+FR D ++V+ ++ +
Sbjct: 2 TVSILVPSSLSREAEDKREATRKLGYVARAATVFRADRLLVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
R D F+ +L+Y TP YLR + M L + G+LPPL A +
Sbjct: 51 GRFD-----GGFVETVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAVSQTGSESNGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ ++ G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLED 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
+P + + + VR A +S F G + L G E +
Sbjct: 164 VPLPGLAVERTDLSTALGREDAGVRIA---ASRFGEELTVGRLETLAGRIEGDGMTVAFG 220
Query: 288 SSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEE 347
+ + LP + + G A + + EDDG++ A F + NT P QGS +RTEE
Sbjct: 221 APERGLPDILEIEASAVG-APKDAAAGEDDGVE-PTADPGFDLWLNTVPDQGSEVVRTEE 278
Query: 348 AIFISL 353
A+F +L
Sbjct: 279 ALFATL 284
>gi|435848749|ref|YP_007310999.1| hypothetical protein Natoc_3480 [Natronococcus occultus SP4]
gi|433675017|gb|AGB39209.1| hypothetical protein Natoc_3480 [Natronococcus occultus SP4]
Length = 291
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/328 (27%), Positives = 135/328 (41%), Gaps = 81/328 (24%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ + ATR G +ARA T+FR D +VVF ++
Sbjct: 2 TASVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVFPDR--------------- 46
Query: 122 NRSDENESG---AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
E E G F+ +L+Y TP YLR ++ M L + G+LPPL A
Sbjct: 47 ----EGERGRFDGEFVDTVLRYATTPPYLRNEVWGMRDELEYAGILPPLRAVS------- 95
Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVD--------------QVLDPGVRVTVAMG 224
R G ++ + G + +VG ++ V V+ L+ G RVTV +
Sbjct: 96 ---RTGSESNDSGSSRQGIVTEVGPDQRVRVNCGLQHPISLNVPPSMELEEGERVTVRIS 152
Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD 284
+ R + A ++V S P G V + +++ + + I S G
Sbjct: 153 SRRPVRA----KLVDDSLP-------GIDVEWTDLEAALGREDA-----GVRIAASRFGA 196
Query: 285 IVNSSDL-TLPTFRH---LLIAFG----GLAGL----EESIEE-------DDGLKRKNAR 325
+ L TL R + +AFG GL + EE+++ D G +
Sbjct: 197 ELTVGRLGTLAGRRERDGMTVAFGAPERGLPAILGMDEETVDALSRWDGVDGGGVEPTSD 256
Query: 326 EVFHSYFNTCPHQGSRTIRTEEAIFISL 353
F + NT P QGS +RTEEA+F +L
Sbjct: 257 PGFDLWLNTVPDQGSDVVRTEEALFATL 284
>gi|448369940|ref|ZP_21556393.1| hypothetical protein C480_16265 [Natrialba aegyptia DSM 13077]
gi|445650380|gb|ELZ03304.1| hypothetical protein C480_16265 [Natrialba aegyptia DSM 13077]
Length = 299
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 132/320 (41%), Gaps = 57/320 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA T+FR D +VV+ ++ +
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW--A 179
R D F+ +L+Y TP YLR ++ M + L + G+LPPL A
Sbjct: 51 GRFD-----GGFVSTVLRYAATPPYLRNEVWGMRNELEYAGVLPPLRAMSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDA---D 232
R+G+ + E P+ V+ GL + ++ +D G RVT+ + + R + A D
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMAVDEGERVTIRISSRRPVRAKLVD 163
Query: 233 SP-------RQVVPPSKPKESGMYWGYKVRYAPNIS----SVFKNCSYKGGYDHLIGTSE 281
P R + + +E RY ++ + G G E
Sbjct: 164 DPLPGLSVERTDLQAALSREDAGVRIAASRYGTELTVGRLGTLAGRVERDGMTVAFGAPE 223
Query: 282 HG--DIV--NSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAR----EVFHSYFN 333
G DI+ S D A L+ ++ DG++ + F + N
Sbjct: 224 RGLPDILEFESED-----------AVDTLSSWPPRSDDADGVEPTSGDVDPGRGFDLWLN 272
Query: 334 TCPHQGSRTIRTEEAIFISL 353
T P+QGS +RTEEA+F +L
Sbjct: 273 TVPYQGSDVVRTEEALFATL 292
>gi|397773844|ref|YP_006541390.1| hypothetical protein NJ7G_2073 [Natrinema sp. J7-2]
gi|448341391|ref|ZP_21530352.1| hypothetical protein C486_06988 [Natrinema gari JCM 14663]
gi|397682937|gb|AFO57314.1| hypothetical protein NJ7G_2073 [Natrinema sp. J7-2]
gi|445628073|gb|ELY81384.1| hypothetical protein C486_06988 [Natrinema gari JCM 14663]
Length = 301
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/315 (26%), Positives = 136/315 (43%), Gaps = 45/315 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR D +VV+ +R
Sbjct: 2 TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLVVY--------------PDRD 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
+ + + G F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 48 GETGQFDGG--FVQTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ ++ G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
+ +P +++ + VR A +S F + G + L G + +
Sbjct: 164 EPLPGLSIEQTDLQAALGREDAGVRIA---ASRFGDALTVGRLETLAGRVQRDGMTVAFG 220
Query: 288 SSDLTLPTFRHL-LIAFGGLAGLEESIE--------EDDGLKRKNAREVFHSYFNTCPHQ 338
+ + LP + A +G +++++ D+G++ A F + NT P Q
Sbjct: 221 APERGLPAILGIEESAIDASSGRDDAVDPSSNRDDAADNGVE-PTADPGFDLWLNTVPDQ 279
Query: 339 GSRTIRTEEAIFISL 353
GS +RTEEA+F +L
Sbjct: 280 GSEVVRTEEALFATL 294
>gi|448411358|ref|ZP_21575823.1| hypothetical protein C475_15889 [Halosimplex carlsbadense 2-9-1]
gi|445670546|gb|ELZ23145.1| hypothetical protein C475_15889 [Halosimplex carlsbadense 2-9-1]
Length = 304
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 132/325 (40%), Gaps = 62/325 (19%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ + ATR G +ARA IFR+D + V+ + +
Sbjct: 2 TTSVLVPASLTREAEDKREATRKVGYVARAAAIFRVDRLTVYPDPDGVE----------- 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+ GA F+ +L+Y TP YLRK ++ L + G+LP LR W
Sbjct: 51 ------KRGAGFVETVLRYAATPPYLRKEVWDRRGELEYAGVLP------ALRVRSWT-- 96
Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTN--------------- 226
G E+ G + +VG ++ V V+ L V + V G
Sbjct: 97 --GSGSDESGSLRQGIVTEVGADERVRVNCGLQHPVSLPVPSGMEVGEGERVSVRVSSRR 154
Query: 227 ----RNLDADSPRQVVPPS-------KPKESGMYWGYKVRYAPNISSVFKNC---SYKGG 272
+ +D + VV + +P ++G+ RY ++ + +GG
Sbjct: 155 PVRAKLVDEPAAGFVVDSTDIDDALARP-DAGLTIAAS-RYGEELTVTKLDQLVERLRGG 212
Query: 273 YDHLIGTSEHG--DIVNSSDLTLPTFRHLLIAFGGLA-GLEESIEED-DGLKRKNAREVF 328
D E G D ++SD L +AFG GL + ED D + + + F
Sbjct: 213 DDGNASGVETGGSDESDASDDEAAPLGDLTVAFGAPERGLPPILGEDPDTVAAETGDDRF 272
Query: 329 HSYFNTCPHQGSRTIRTEEAIFISL 353
+ NT P+QGS +RTEEA+F +L
Sbjct: 273 DLWLNTVPNQGSEVVRTEEAVFATL 297
>gi|289579897|ref|YP_003478363.1| hypothetical protein Nmag_0209 [Natrialba magadii ATCC 43099]
gi|448281137|ref|ZP_21472445.1| hypothetical protein C500_01488 [Natrialba magadii ATCC 43099]
gi|289529450|gb|ADD03801.1| Protein of unknown function DUF171 [Natrialba magadii ATCC 43099]
gi|445579461|gb|ELY33855.1| hypothetical protein C500_01488 [Natrialba magadii ATCC 43099]
Length = 316
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 139/330 (42%), Gaps = 60/330 (18%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR+D ++V+ ++
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRVDRLIVYPDRD-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WA 179
D G F+ +L+Y TP YLR + M L +VG+LPPL A
Sbjct: 48 --GDTGRFGDEFVSTVLRYAATPPYLRNEAWGMRDELEYVGVLPPLRATSQTGSESDGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + ++ + G RVT+ + + R + A
Sbjct: 106 SLRQGI-VTEVGPDQ-RVRVNCGLQHPISLNVPPKMAVSVGERVTIRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPN-------------ISSVFKNCSYKGGYDHLI 277
+ +P + + + VR A + ++ + + + G+
Sbjct: 164 EPLPGLTVERTDLQAALSREDAGVRIAASRFGAELTVGRLGTLAGHTERDTTRDGFTIAF 223
Query: 278 GTSEHG--DIVNSSDLTLPTFRHLLIAF-GGLAGLEESIE------EDDGLKRKNAREV- 327
G E G I+ D T+ + L A+ LA E+ +E +DD + + +
Sbjct: 224 GAPERGLPAILELEDTTVES----LSAWPSWLAEDEDGVEPTSAVGDDDTVAAETDADAA 279
Query: 328 ----FHSYFNTCPHQGSRTIRTEEAIFISL 353
F + NT P QGS +RTEEA+F +L
Sbjct: 280 DPGGFDLWLNTVPDQGSDVVRTEEALFATL 309
>gi|448323164|ref|ZP_21512628.1| hypothetical protein C491_19399 [Natronococcus amylolyticus DSM
10524]
gi|445600350|gb|ELY54363.1| hypothetical protein C491_19399 [Natronococcus amylolyticus DSM
10524]
Length = 290
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 83/313 (26%), Positives = 130/313 (41%), Gaps = 52/313 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA T+FR D +VVF ++
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVFPDRE-------------- 47
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
+ G F+ +L+Y TP YLR + M L + G+LPPL A +
Sbjct: 48 --GERGRFGGEFVDTVLRYATTPPYLRNEAWGMRDELEYAGILPPLRAVSQTGSESNGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + ++ L+ G RVTV + + R + A
Sbjct: 106 SSRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPGMELEEGERVTVRISSRRPVRAKLED 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD---IVN 287
+ +P + + + VR A +S F G L G E +
Sbjct: 164 EPLPGLDVERTDLQAALGREDAGVRIA---ASRFGAELTVGRLGTLAGRRERDGMTVVFG 220
Query: 288 SSDLTLPTFRHLLIAFGGLAGLEE----SIEEDDGLKRKNAREV---FHSYFNTCPHQGS 340
+ + LP + G++E ++ DG+ + F + NT P QGS
Sbjct: 221 APERGLPD----------ILGMDEEAVDALSNWDGVDGDGVEPIDPGFDLWLNTVPDQGS 270
Query: 341 RTIRTEEAIFISL 353
+RTEEA+F +L
Sbjct: 271 DVVRTEEALFATL 283
>gi|448302865|ref|ZP_21492815.1| hypothetical protein C495_01155 [Natronorubrum sulfidifaciens JCM
14089]
gi|445593872|gb|ELY48039.1| hypothetical protein C495_01155 [Natronorubrum sulfidifaciens JCM
14089]
Length = 300
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 132/314 (42%), Gaps = 44/314 (14%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S+ + ATR G +ARA TIFR D ++V+ ++
Sbjct: 2 TVSILVPSSLTREAEDKREATRKLGYVARAATIFRADRLIVYPDR--------------- 46
Query: 122 NRSDENESG---AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKH 176
E E+G F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 47 ----EGETGRFDGGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSEST 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDAD 232
R+G+ + V V+ GL + ++ +D G RVTV + + R + A
Sbjct: 103 GSGSSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMEVDEGERVTVRISSRRPVRAK 160
Query: 233 SPRQVVPPSKPKESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
+ +P + + + G + +S F G + L G + +
Sbjct: 161 LENKPLPGLAVERTDLSAALGREDAGVCIAASRFGEELTVGRLETLAGRIDRDGLTVAFG 220
Query: 288 SSDLTLPTFRHL-LIAFGGLAGLEESIEE-------DDGLKRKNAREVFHSYFNTCPHQG 339
+ + LP + A +G ++ E DDG++ A F + NT P+QG
Sbjct: 221 APERGLPDILGIEASAVNASSGQGDAAESSTGDDTADDGVE-PTADPGFDLWLNTVPNQG 279
Query: 340 SRTIRTEEAIFISL 353
S +RTEEA+F +L
Sbjct: 280 SEVVRTEEALFATL 293
>gi|433590193|ref|YP_007279689.1| hypothetical protein Natpe_0869 [Natrinema pellirubrum DSM 15624]
gi|448332401|ref|ZP_21521645.1| hypothetical protein C488_03535 [Natrinema pellirubrum DSM 15624]
gi|433304973|gb|AGB30785.1| hypothetical protein Natpe_0869 [Natrinema pellirubrum DSM 15624]
gi|445627505|gb|ELY80829.1| hypothetical protein C488_03535 [Natrinema pellirubrum DSM 15624]
Length = 291
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 130/303 (42%), Gaps = 31/303 (10%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S+ + ATR G +ARA T+FR D ++V+ ++ +
Sbjct: 2 TVSILVPSSLSREAEDKREATRKLGYVARAATVFRADRLLVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
R D F+ +L+Y TP YLR + M L + G+LPPL A +
Sbjct: 51 GRFD-----GGFVETVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAVSQTGSESNGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ ++ G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLED 163
Query: 236 QVVPPSKPKESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---NSSD 290
+P + + + G + +S F G + L G E + + +
Sbjct: 164 VPLPGLAVERTDLSTALGREDAGVCIAASRFGEELTVGRLETLAGRIEGDGMTVAFGAPE 223
Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
LP + + G A + + EDDG++ A F + NT P QGS +RTEEA+F
Sbjct: 224 RGLPDILEIEASAVG-APKDAAAGEDDGVE-PTADPGFDLWLNTVPDQGSEVVRTEEALF 281
Query: 351 ISL 353
+L
Sbjct: 282 ATL 284
>gi|448315111|ref|ZP_21504764.1| hypothetical protein C492_01898 [Natronococcus jeotgali DSM 18795]
gi|445612370|gb|ELY66096.1| hypothetical protein C492_01898 [Natronococcus jeotgali DSM 18795]
Length = 290
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 131/322 (40%), Gaps = 70/322 (21%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA T+FR D +VVF ++
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVFPDR--------------- 46
Query: 122 NRSDENESG---AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKH 176
E E G F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 47 ----EGERGRFDGEFVATVLRYATTPPYLRNEAWGMRDELEYAGILPPLRAVSQTGSESD 102
Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDAD 232
+ R+G+ + E P+ V+ GL + ++ L+ G RVTV + + R + A
Sbjct: 103 DSGSSRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPGMELEEGERVTVRVSSRRPVRAK 160
Query: 233 SPRQVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+P + + + VR A +S F G L G E
Sbjct: 161 LVDGPLPGLSIERTDLEAALGREDAGVRIA---ASRFGAELTVGRLGTLAGRRERD---- 213
Query: 288 SSDLTLPTFRHLLIAFG-------GLAGLEESIEEDDGLKRKNARE---------VFHSY 331
+ +AFG + G++E E D L R++ + F +
Sbjct: 214 ----------GMTVAFGAPERGLPAILGMDE--EAVDALSRRDGVDGDGVEPTDPGFDLW 261
Query: 332 FNTCPHQGSRTIRTEEAIFISL 353
NT P QGS +RTEEA+F +L
Sbjct: 262 LNTVPDQGSDVVRTEEALFATL 283
>gi|374109570|gb|AEY98475.1| FAGL342Cp [Ashbya gossypii FDAG1]
Length = 424
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 78/293 (26%), Positives = 125/293 (42%), Gaps = 58/293 (19%)
Query: 41 NKKKKKRKRDQLNDDAPIEVPTVS-------VAVPGSIIDNTQSLELATRLAGQIARAVT 93
NKK +K + P +V VS + +P SI+DN +LE T QIA+A T
Sbjct: 32 NKKAQKASKPAARPKRPKKVLKVSSKTLNYTLCIPTSILDNCANLEQVTHALYQIAKAAT 91
Query: 94 IFRIDEVVVFDNKSSSDNYSRSSAANR--------------------SNRSDENESGAAF 133
+F + E+VV D + +D + AA+ + +S SGA
Sbjct: 92 LFNVGEIVVLDLGTDTDAAAPHLAASELEPRKRKIKFEDVTEKTETSNKKSASRLSGAML 151
Query: 134 LVRLLQYLETPQYLRKALFSMHSSLRF--VGMLPPLDAPHHLR--KHEWAPFREGVTL-- 187
+ LLQY TP+YL ++F S F LP L A +R + + +REG+++
Sbjct: 152 IATLLQYFVTPRYLVSSVFKKKYSSYFNCAAKLPRLSALPFMRHLQTDQGRYREGLSVNM 211
Query: 188 ----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
P++ +++G + + + Q + VRVTV N+N VV P++
Sbjct: 212 AKPAAAKKPSTRTKYINIGKDTMLELRSQQVPVNVRVTVDT-VNKN--------VVSPAE 262
Query: 243 P----KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
+ +GY VR A + +F + GGY + VNS+D
Sbjct: 263 AYGDYSGANAAYGYSVRVARTFADIFLGAGFAGGYTQSLW-------VNSADF 308
>gi|238881614|gb|EEQ45252.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 371
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 144/350 (41%), Gaps = 66/350 (18%)
Query: 62 TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD----NKSSSDNYS-- 113
T+S+ +P ++I N +L+ T + QIA+A TI+++ E++VFD N D+ S
Sbjct: 25 TISICIPSTVISSKNAYNLQQITNIIYQIAKACTIYKVAEIIVFDVPQSNNDKEDDKSTV 84
Query: 114 -------------RSSAANRSNRSDEN---ESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
+ AN S +S+ N S + + LLQ+ TP YL K +FS H +
Sbjct: 85 VVGSKVKFVEDEPEKTLANPSKKSNVNGDKVSPSLLVASLLQFFITPPYLVKTMFSSHLN 144
Query: 158 LRFVGMLPPLDAPHHLRKHEWAP----------FREGVTLKENAP--------------N 193
+F +LP L K P F+EG+ + P
Sbjct: 145 SKFKNILPKFTYAFKLPKITTLPFMQNNQVYKDFKEGMIIPRETPLMKKKNKKTKSDHKI 204
Query: 194 SVGTLVDVGLNKHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWG 251
+V V++G + + +D + + RVTV + +N SP Q K + +G
Sbjct: 205 TVSKYVNIGEKEALKLDIKREIPIYSRVTVDL---KNKTVVSPLQAYGVVGHKAA---FG 258
Query: 252 YKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT----LPTFR----HLLIAF 303
Y VR A + +F GY I + N++ ++ L TF+ ++L+
Sbjct: 259 YHVRMASEFNKIFTQSPISDGYSSTIYVNCDDYFGNNNKISELDNLSTFKEATGNVLMVL 318
Query: 304 GGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
G L+ + D +N V H + + R E+AI ISL
Sbjct: 319 GNYKDLQAGFKADTSNVFENIDSVAHLFDSKL--NIPDGCRIEDAILISL 366
>gi|68466013|ref|XP_722781.1| hypothetical protein CaO19.12032 [Candida albicans SC5314]
gi|68466308|ref|XP_722636.1| hypothetical protein CaO19.4563 [Candida albicans SC5314]
gi|46444625|gb|EAL03898.1| hypothetical protein CaO19.4563 [Candida albicans SC5314]
gi|46444780|gb|EAL04052.1| hypothetical protein CaO19.12032 [Candida albicans SC5314]
Length = 371
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 138/344 (40%), Gaps = 54/344 (15%)
Query: 62 TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD----NKSSSDNYS-- 113
T+S+ +P ++I N +L+ T + QIA+A TI+++ E++VFD N D+ S
Sbjct: 25 TISICIPSTVISSKNAYNLQQITNIIYQIAKACTIYKVAEIIVFDVPQSNNDKEDDKSTV 84
Query: 114 -------------RSSAANRSNRSDEN---ESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
+ AN S +S+ N S + + LLQ+ TP YL K +FS H +
Sbjct: 85 VVGSKVKFVEDEPEKTLANPSKKSNVNGDKVSPSLLVASLLQFFITPPYLVKTMFSSHLN 144
Query: 158 LRFVGMLPPLDAPHHLRKHEWAP----------FREGVTLKENAPNSVGTLVDVGLNKHV 207
+F +LP L K P F+EG+ + P + +
Sbjct: 145 SKFKNILPKFTYAFKLPKITTLPFMQNNQVYKDFKEGMIIPRETPLMKKKNKKTKSDHKI 204
Query: 208 VVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMY----------WGYKVRYA 257
V + ++ G + + + R + S V +K S + +GY VR A
Sbjct: 205 TVSKYVNIGEKEALKLNIKREIPIYSRVTVDLKNKTVVSPLQAYGVVGHKAAFGYHVRMA 264
Query: 258 PNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT----LPTFR----HLLIAFGGLAGL 309
+ +F GY I + N++ ++ L TF+ ++L+ G L
Sbjct: 265 SEFNKIFTQSPISDGYSSTIYVNCDDYFGNNNKISELDNLSTFKEATGNVLMVLGNYKDL 324
Query: 310 EESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ + D +N V H + + R E+AI ISL
Sbjct: 325 QAGFKADTSNVFENIDSVAHLFDSKL--NIPDGCRIEDAILISL 366
>gi|45200755|ref|NP_986325.1| AGL342Cp [Ashbya gossypii ATCC 10895]
gi|44985453|gb|AAS54149.1| AGL342Cp [Ashbya gossypii ATCC 10895]
Length = 424
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/293 (26%), Positives = 125/293 (42%), Gaps = 58/293 (19%)
Query: 41 NKKKKKRKRDQLNDDAPIEVPTVS-------VAVPGSIIDNTQSLELATRLAGQIARAVT 93
NKK +K + P +V VS + +P SI+DN +LE T QIA+A T
Sbjct: 32 NKKAQKASKPAARPKRPKKVLKVSSKTLNYTLCIPTSILDNCANLEQVTHALYQIAKAAT 91
Query: 94 IFRIDEVVVFDNKSSSDNYSRSSAANR--------------------SNRSDENESGAAF 133
+F + E+VV D + +D + AA+ + +S SGA
Sbjct: 92 LFNVGEIVVLDLGTDTDAAAPHLAASELEPRKRKIKFEDVTEKTETSNKKSASRLSGAML 151
Query: 134 LVRLLQYLETPQYLRKALFSMHSSLRF--VGMLPPLDAPHHLR--KHEWAPFREGVTL-- 187
+ LLQY TP+YL ++F S F LP L A +R + + +REG+++
Sbjct: 152 IATLLQYFVTPRYLVSSVFKKKYSSYFNCAAKLPRLSALPFMRHLQTDQGRYREGLSVNM 211
Query: 188 ----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
P++ +++G + + + Q + VRVTV N+N VV P++
Sbjct: 212 AKPAAAKKPSTRTKYINIGKDTMLELRSQQVPVNVRVTVDT-VNKN--------VVSPAE 262
Query: 243 P----KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
+ +GY VR A + +F + GGY + VNS+D
Sbjct: 263 AYGDYSGANAAYGYSVRVARTFADIFLGAGFAGGYTQSLW-------VNSADF 308
>gi|367016219|ref|XP_003682608.1| hypothetical protein TDEL_0G00300 [Torulaspora delbrueckii]
gi|359750271|emb|CCE93397.1| hypothetical protein TDEL_0G00300 [Torulaspora delbrueckii]
Length = 357
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 109/257 (42%), Gaps = 38/257 (14%)
Query: 42 KKKKKRKRDQLNDDAPIEVPTV--SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDE 99
+KKK + +L I TV S+ +P +++ N ++LE T QIA+ TIF + E
Sbjct: 12 QKKKVKPVAKLKKSIKILSKTVNYSLCIPTTVLANCRNLEQITFAIYQIAKTATIFNVGE 71
Query: 100 VVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLR 159
+VV + S+ +S + + + S A + LLQY TP YL +F +
Sbjct: 72 IVVLNLGDRSE---------KSKKHENSLSDAMLIASLLQYFVTPPYLVNTVFKKQYTRY 122
Query: 160 F--VGMLPPLDAPHHL--RKHEWAPFREGVTLKENAPNSVGT-----------LVDVGLN 204
F LP L A + R + +REG+ ++ P +++G
Sbjct: 123 FQAAAKLPRLSALPFMRHRNEDKGRYREGLAIRMTKPGQASIKKKNKEFKQTKFINIGKA 182
Query: 205 KHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKE---SGMYWGYKVRYAPNI 260
+ + + L P VRVTV D R V P + + +GY VR A N
Sbjct: 183 EPLELKSQLVPVNVRVTV--------DTVEQRVVSPQEAYGDFVGANASFGYHVRVANNF 234
Query: 261 SSVFKNCSYKGGYDHLI 277
S+F C++ GY I
Sbjct: 235 GSIFTECAFPNGYSQAI 251
>gi|409731025|ref|ZP_11272575.1| hypothetical protein Hham1_17426 [Halococcus hamelinensis 100A6]
gi|448724783|ref|ZP_21707288.1| hypothetical protein C447_16559 [Halococcus hamelinensis 100A6]
gi|445784992|gb|EMA35788.1| hypothetical protein C447_16559 [Halococcus hamelinensis 100A6]
Length = 275
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 128/313 (40%), Gaps = 54/313 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ AT G +ARA T+FR+D V VF + + +
Sbjct: 2 TRSVLVPSSLCREASDKREATHKVGLVARAATVFRVDRVTVFPDDGGNRRW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
G F+ +L Y T YLRK + L +VG+LPPL + + P
Sbjct: 53 --------GGGFVATVLAYAATAPYLRKEAWGTREELEYVGVLPPL---RPVPRTGSGPQ 101
Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVVDQVLDPGV------RVTVAMGTNRNLDA 231
G +L++ VG V+ GL V + V+ PGV RVTV + + + A
Sbjct: 102 SSG-SLRQGIVTEVGPDGRVRVNCGLQHPVSL--VVPPGVEVGEGERVTVRISSREPVRA 158
Query: 232 DSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS--- 288
++V P G+ V +++ + + + TS HG+ + +
Sbjct: 159 ----RIVDEPAP-------GFAVDRTDLPAALGREDA-----GLRVATSRHGEALTADGL 202
Query: 289 SDLTLPTFRHLLIAFGGLA-GLEESIEED-DGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
L L +AFG GL + D + + F + NT P+QGS +RTE
Sbjct: 203 GALAGGPGGDLTVAFGAPERGLPAMLGIDVESVTSAEPNGGFDRWLNTVPNQGSEVVRTE 262
Query: 347 EAIFISLQYFQEP 359
EA+F +L P
Sbjct: 263 EAVFATLACLTLP 275
>gi|366992812|ref|XP_003676171.1| hypothetical protein NCAS_0D02290 [Naumovozyma castellii CBS 4309]
gi|342302037|emb|CCC69810.1| hypothetical protein NCAS_0D02290 [Naumovozyma castellii CBS 4309]
Length = 378
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/269 (26%), Positives = 112/269 (41%), Gaps = 31/269 (11%)
Query: 41 NKKKKKRKRDQLNDDAPIEVPT----VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFR 96
NKKK K I+V + S+ +P +I+ N +LE T + QIA+A TIF
Sbjct: 14 NKKKSKVLEPTKKSKTSIKVLSKKLNYSLCIPTTILANCTNLEQITHVVYQIAKAATIFN 73
Query: 97 IDEVVVFD-NKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMH 155
E+V+ D + + ++D S + + LLQY TP YL +F
Sbjct: 74 AGEIVILDMGDRKETKITEEKEEKKKKKTDARLSDSMLIASLLQYFVTPPYLVSTVFKKQ 133
Query: 156 SSLRFV--GMLPPLDAPHHLR--KHEWAPFREGVTLKENAPNSVGT---------LVDVG 202
F LP L A +R + +REG+ ++ P S +++G
Sbjct: 134 FMKYFTVASKLPRLSALPFMRYLNEDNGRYREGLAIRMTKPGSNSNSKKEFKQTKYINIG 193
Query: 203 LNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNIS 261
K++ + + L P VRVTV NR + SP + ++ +GY VR A
Sbjct: 194 KEKNLELKKQLVPVNVRVTVDTIENRVV---SPEEAYGDYVGAQAS--YGYHVRIAKTFG 248
Query: 262 SVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+F C++ GY + VNS D
Sbjct: 249 DIFTQCAFPSGYSQTVW-------VNSGD 270
>gi|255720905|ref|XP_002545387.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240135876|gb|EER35429.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 334
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/307 (24%), Positives = 131/307 (42%), Gaps = 57/307 (18%)
Query: 59 EVPTVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVF-----------DN 105
E ++S+ +P S+I N +LE T + QIA+A TI++I E++VF DN
Sbjct: 9 EETSISICIPSSVISSKNAYNLEQITNIIYQIAKAATIYKISEIIVFKAPKEEEEEEDDN 68
Query: 106 KSS-----------SDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSM 154
KS D ++SS N + + E ++ LLQ+ TP YL K++FS
Sbjct: 69 KSKEKSKKIVFEEDDDTNNKSSMNNNTQKEQEEDNSTLLAASLLQFFITPPYLVKSMFSP 128
Query: 155 HSSLRFVGMLPPLDAPHHLRKHEWAP----FREGVTLKENAP--------------NSVG 196
H + +F +LP L K ++EG+ + P +V
Sbjct: 129 HLNKKFSNILPKFKYAFKLPKITTIDMDKNYKEGMIIPRETPLIKKKNKKVKADHKITVS 188
Query: 197 TLVDVGLNKHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKV 254
V++G ++ + +D + + RVTV + +N S ++ S K + +GY V
Sbjct: 189 KYVNIGESEAIKLDIKREIPIYSRVTVDL---KNKTIVSAKKAYGNSGHKNA---FGYHV 242
Query: 255 RYAP--NISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR----HLLIAFGGLAG 308
R + + VF S GY + I + D ++ + LP ++L+ G L
Sbjct: 243 RMIDDGDFNKVFIQSSIPDGYSNTIYVN-CNDYFGNNKIDLPEINEIKGNILVVLGNLKD 301
Query: 309 LEESIEE 315
+ + +
Sbjct: 302 YQPDVHQ 308
>gi|284165491|ref|YP_003403770.1| hypothetical protein Htur_2217 [Haloterrigena turkmenica DSM 5511]
gi|284015146|gb|ADB61097.1| Protein of unknown function DUF171 [Haloterrigena turkmenica DSM
5511]
Length = 301
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 127/314 (40%), Gaps = 43/314 (13%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR D ++V+ ++ +
Sbjct: 2 TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRADRLIVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
R D F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 51 GRFD-----GGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ ++ G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
+ +P + + + VR A SS + G + L G E +
Sbjct: 164 EPLPGLSVERTDLQTALGREDAGVRIA---SSRYGEELTVGRLETLAGRIEGDGMTVAFG 220
Query: 288 SSDLTLPTFRHL-LIAFGGLAGLEESIEEDDGLKRKNAREV-------FHSYFNTCPHQG 339
+ + LP + A +G ++IE V F + NT P QG
Sbjct: 221 APERGLPAILGIEASAIEASSGQGDAIESSSDRDDAADDGVEPTADPGFDLWLNTVPDQG 280
Query: 340 SRTIRTEEAIFISL 353
S +RTEEA+F +L
Sbjct: 281 SEVVRTEEALFATL 294
>gi|398366411|ref|NP_011799.3| hypothetical protein YGR283C [Saccharomyces cerevisiae S288c]
gi|1723785|sp|P53336.1|YG5X_YEAST RecName: Full=Uncharacterized protein YGR283C
gi|1323517|emb|CAA97314.1| unnamed protein product [Saccharomyces cerevisiae]
gi|285812471|tpg|DAA08371.1| TPA: hypothetical protein YGR283C [Saccharomyces cerevisiae S288c]
Length = 341
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 94/217 (43%), Gaps = 25/217 (11%)
Query: 55 DAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
D V S+ +P ++IDN +LE T A QIAR +F + E++V D S+
Sbjct: 32 DIESNVVKYSICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SK 84
Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHH 172
+ +RS E S L LLQY TP L F + L+ PPL+
Sbjct: 85 DKKHEKKSRSKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPPLNQLPF 144
Query: 173 LRKHEWAPFREGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTN 226
+ ++EG+++ K + +++ LV +G N+ + + +Q + RVTV
Sbjct: 145 MNASAEQHYKEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----- 199
Query: 227 RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
D+ R+ V G GY VR A ++ V
Sbjct: 200 -----DTERKEVVSPIDAYKGKPLGYHVRMASTLNEV 231
>gi|241954318|ref|XP_002419880.1| unnamed protein product [Candida dubliniensis CD36]
gi|223643221|emb|CAX42095.1| unnamed protein product [Candida dubliniensis CD36]
Length = 371
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 144/356 (40%), Gaps = 78/356 (21%)
Query: 62 TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD----NKSSSD----- 110
TVS+ +P ++I N +L+ T + QIA+A TI+++ E+VVFD N D
Sbjct: 25 TVSICIPSTVISPKNAYNLQQITNIIYQIAKACTIYKVAEIVVFDVPELNNEQEDVKSTV 84
Query: 111 -------------NYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
S +S +S+ + ++ S + + LLQ+ TP YL K +FS H +
Sbjct: 85 VVGSKVKFVEDEPEKSLASPNKKSSVNGDDVSPSLLVASLLQFFITPPYLVKTMFSSHLN 144
Query: 158 LRFVGMLPPLDAPHHLRKHEWAPF----------REGVTLKENAP--------------N 193
+F +LP L K PF +EG+ + P
Sbjct: 145 SKFKNILPKFTYAFKLPKITTLPFMQNNQVYKVLKEGMIIPRETPLMRKKNKKTKSDHKI 204
Query: 194 SVGTLVDVGLNKHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWG 251
+V V++G + + ++ + + RVTV + +N SP Q K + +G
Sbjct: 205 TVSKYVNIGEKEALKLEIKREIPIYSRVTVDL---KNKTVVSPLQAYGVIGHKAA---FG 258
Query: 252 YKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL----TLPTFR----HLLIAF 303
Y VR A + +F GY I + N++ + L TF+ ++L+
Sbjct: 259 YHVRMASEFNKIFTQSPISDGYSSTIYVNCDDYFGNNNKIKELDNLSTFKEATGNVLMVL 318
Query: 304 GGLAGLEESIEEDDG---LKRKNAREVFHSYFNT---CPHQGSRTIRTEEAIFISL 353
G L+ + D +N ++F S N C R E+AI ISL
Sbjct: 319 GNYKDLQAGFKADSSNVFQNVENVAQLFDSKLNIPDGC--------RIEDAILISL 366
>gi|151943555|gb|EDN61866.1| conserved protein [Saccharomyces cerevisiae YJM789]
gi|349578483|dbj|GAA23649.1| K7_Ygr283cp [Saccharomyces cerevisiae Kyokai no. 7]
Length = 341
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 94/217 (43%), Gaps = 25/217 (11%)
Query: 55 DAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
D V S+ +P ++IDN +LE T A QIAR +F + E++V D S+
Sbjct: 32 DIESNVVKYSICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SK 84
Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHH 172
+ +RS E S L LLQY TP L F + L+ PPL+
Sbjct: 85 DKKHEKKSRSKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPPLNQLPF 144
Query: 173 LRKHEWAPFREGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTN 226
+ ++EG+++ K + +++ LV +G N+ + + +Q + RVTV
Sbjct: 145 MNASAEQHYKEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----- 199
Query: 227 RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
D+ R+ V G GY VR A ++ V
Sbjct: 200 -----DTERKEVVSPIDAYKGKPLGYHVRMASTLNEV 231
>gi|448392362|ref|ZP_21567136.1| hypothetical protein C477_13040 [Haloterrigena salina JCM 13891]
gi|445664452|gb|ELZ17161.1| hypothetical protein C477_13040 [Haloterrigena salina JCM 13891]
Length = 301
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 128/328 (39%), Gaps = 71/328 (21%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVSV VP S+ + ATR G +ARA TIFR D ++V+ ++ +
Sbjct: 2 TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLIVYPDRD-----------GET 50
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
R D F+ +L+Y TP YLR + M L + G+LPPL A
Sbjct: 51 GRFD-----GGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + V V+ GL + ++ +D G RVTV + + R + A
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVDEGERVTVRISSRRPVRAKLVD 163
Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
+ +P + + + VR A +S F G L G E GD
Sbjct: 164 EPLPGLSVERTDLQTALGREDAGVRIA---ASRFGEELTVGRLGTLAGRIE-GD------ 213
Query: 291 LTLPTFRHLLIAFG-------GLAGLEES-IEEDDGLK-----------------RKNAR 325
+ +AFG + G+E S IE G A
Sbjct: 214 -------GMTVAFGAPERGLPAILGIEASAIEASSGQGDVIEPSSDRDDAADDGVEPTAD 266
Query: 326 EVFHSYFNTCPHQGSRTIRTEEAIFISL 353
F + NT P QGS +RTEEA+F +L
Sbjct: 267 PGFDLWLNTVPDQGSEVVRTEEALFATL 294
>gi|354610538|ref|ZP_09028494.1| protein of unknown function DUF171 [Halobacterium sp. DL1]
gi|353195358|gb|EHB60860.1| protein of unknown function DUF171 [Halobacterium sp. DL1]
Length = 283
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 133/312 (42%), Gaps = 57/312 (18%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV VP S++ + ATR G +ARA +FR D +VVF ++
Sbjct: 2 TLSVLVPSSLVREAEDKREATRKLGYVARAAAVFRADRLVVFPDEDG------------- 48
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWA 179
E + G F+ +L+Y TP YLRK F L +VG+LPP L +
Sbjct: 49 ----ERKWGGGFVETVLRYAATPPYLRKEAFDTRDELEYVGVLPPLRLSSWTGSDSSGSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGV------RVTVAMGTNRNLDADS 233
R+G+ + + V V+ G+ + + Q PG+ RVT+ + + R + A
Sbjct: 105 SLRQGIVTQVGSEGRV--RVNCGMQHPISLHQ--PPGMEVVEGERVTIRVSSRRPVRAKL 160
Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+ +P G+ V + + + I TS HG ++++ L
Sbjct: 161 VDEPLP-----------GFSVE-----RTTIGDALDRPDAGVRIATSRHGTPLSAASLGG 204
Query: 294 PTFR----HLLIAFGG-------LAGL-EESIEEDDGLKRKNAREVFHSYFNTCPHQGSR 341
R L +AFG + GL E+ + E +A F ++ NT P QGS
Sbjct: 205 YVERIARDGLTVAFGAPERGLPPMLGLTEDEVSEAVADPNSDAPARFDAWLNTIPSQGSE 264
Query: 342 TIRTEEAIFISL 353
+RTEEA+F SL
Sbjct: 265 VVRTEEALFASL 276
>gi|300710374|ref|YP_003736188.1| hypothetical protein HacjB3_05025 [Halalkalicoccus jeotgali B3]
gi|448294698|ref|ZP_21484777.1| hypothetical protein C497_03402 [Halalkalicoccus jeotgali B3]
gi|299124057|gb|ADJ14396.1| hypothetical protein HacjB3_05025 [Halalkalicoccus jeotgali B3]
gi|445586375|gb|ELY40657.1| hypothetical protein C497_03402 [Halalkalicoccus jeotgali B3]
Length = 285
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 17/108 (15%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S++ + ATR G +ARA T+FR D ++VF
Sbjct: 2 TVSILVPSSVVREAEDKREATRKLGYVARAATVFRADRLIVF-----------------P 44
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDA 169
+R E G F+ +L+Y TP YLR ++ M L +VG+LPPL A
Sbjct: 45 DRDGERRWGGGFVEAVLRYAATPPYLRTEVWGMRDELEYVGVLPPLRA 92
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 323 NAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+ E F + NT PHQGS T+RTEEA+F SL
Sbjct: 248 DGSERFDLWLNTIPHQGSETVRTEEAMFASL 278
>gi|159040854|ref|YP_001540106.1| hypothetical protein Cmaq_0268 [Caldivirga maquilingensis IC-167]
gi|157919689|gb|ABW01116.1| Protein of unknown function DUF171 [Caldivirga maquilingensis
IC-167]
Length = 275
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 85/301 (28%), Positives = 135/301 (44%), Gaps = 52/301 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+A+P ++ + + + E A R G I RA IFR+ ++++ K
Sbjct: 3 ISIAIPYNVTEESTTEEEAVRKIGYIGRAAAIFRVRSILIYTFKG--------------- 47
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAP---HHLRKHEWA 179
DE A+F+ + L+YL TP YLRK LF + LR G+L PL P H
Sbjct: 48 --DEPLRKASFIKKNLEYLVTPPYLRKDLFGIDPDLRLAGLLQPLTLPVFGHRKSNPRTG 105
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ--VLDPGVRVTV-AMGTNRNLDADSPRQ 236
R G+ ++ S+ V VG + +V V + + + V++ +M +NR
Sbjct: 106 DVRIGLVIRWEGYYSI---VKVGEDTYVKVPKPYPIKSMLLVSIDSMISNRFYRG----H 158
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
VV K S +Y G Y+ NI + + +LI T + G+ V + L
Sbjct: 159 VV-----KSSKVYAG----YSANIIELRDLVKLR----NLILTGKEGNSVINEINKLRAL 205
Query: 297 R--HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHS--YFNTCPHQGSRTIRTEEAIFIS 352
+ +++ FG + DD LK + +V +S + N P+QG TIRTEEAI
Sbjct: 206 KDEDVVVVFG-----SPRMGVDDILKGEGLGDVLNSSLFVNFVPNQGLVTIRTEEAIIAV 260
Query: 353 L 353
L
Sbjct: 261 L 261
>gi|323304774|gb|EGA58534.1| YGR283C-like protein [Saccharomyces cerevisiae FostersB]
Length = 297
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 93/217 (42%), Gaps = 25/217 (11%)
Query: 55 DAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
D V S+ +P ++IDN +LE T A QIAR +F + E++V D S+
Sbjct: 32 DIESNVVKYSICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SK 84
Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHH 172
+ +RS E S L LLQY TP L F + L+ PPL
Sbjct: 85 DKKHEKKSRSKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPPLKQLPF 144
Query: 173 LRKHEWAPFREGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTN 226
+ ++EG+++ K + +++ LV +G N+ + + +Q + R+TV
Sbjct: 145 MNASAEQHYKEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARLTV----- 199
Query: 227 RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
D+ R+ V G GY VR A ++ V
Sbjct: 200 -----DTERKEVVSPIDSYKGKPLGYHVRMASTLNEV 231
>gi|256272682|gb|EEU07659.1| YGR283C-like protein [Saccharomyces cerevisiae JAY291]
Length = 341
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 91/208 (43%), Gaps = 25/208 (12%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P ++IDN +LE T A QIAR +F + E++V D S+ + +R
Sbjct: 41 SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 93
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHHLRKHEWAPF 181
S E S L LLQY TP L F + L+ PPL + +
Sbjct: 94 SKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPPLKQLPFMNASAEQHY 153
Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPR 235
+EG+++ K + +++ LV +G N+ + + +Q + RVTV D+ R
Sbjct: 154 KEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----------DTER 203
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSV 263
+ V G GY VR A ++ V
Sbjct: 204 KEVVSPIDSYKGKPLGYHVRMASTLNEV 231
>gi|366998671|ref|XP_003684072.1| hypothetical protein TPHA_0A05640 [Tetrapisispora phaffii CBS 4417]
gi|357522367|emb|CCE61638.1| hypothetical protein TPHA_0A05640 [Tetrapisispora phaffii CBS 4417]
Length = 397
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 67/277 (24%), Positives = 109/277 (39%), Gaps = 48/277 (17%)
Query: 38 SCDNKKKKKRKRDQLNDDAPIEVPT-----------VSVAVPGSIIDNTQSLELATRLAG 86
S K+KK++ + ++ PT S+ +P SI D+ ++LE T +
Sbjct: 19 SVTQTKQKKKQNHAAKKEVVLKKPTKSLKVSSKNLNYSICIPTSITDDCKNLEQTTNVIY 78
Query: 87 QIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQY 146
IA+ TIF + EV++ D + + + S + LLQY TP Y
Sbjct: 79 HIAKTATIFNVGEVIILDLGKETPIEVTEDIKGTKGETKKRLSRPMLIATLLQYFVTPPY 138
Query: 147 LRKALFS--MHSSLRFVGMLPPLDAPHHLRKHE--WAPFREGVTLKENAPNSVGT----- 197
L +F H ++ LP L +R +E FREG+ ++ + P + +
Sbjct: 139 LVNTVFKKDYHKYFKYACKLPRLSVLPFMRYYEEDKGRFREGLAVRMSKPEATNSNQSNS 198
Query: 198 ----------LVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES 246
+++G ++ + + L P VRVTV D R V P +
Sbjct: 199 KKAKEFKQTKYINIGKSECLELKSQLVPVNVRVTV--------DTVEKRVVAP---QEAY 247
Query: 247 GMY------WGYKVRYAPNISSVFKNCSYKGGYDHLI 277
G Y +GY VR SVF ++ GY I
Sbjct: 248 GDYVGAQSSFGYHVRVCKKFGSVFTESAFPEGYSQAI 284
>gi|190346247|gb|EDK38287.2| hypothetical protein PGUG_02385 [Meyerozyma guilliermondii ATCC
6260]
Length = 355
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 140/329 (42%), Gaps = 68/329 (20%)
Query: 63 VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD---------------- 104
V++ +P ++I N ++L+ AT +A QIA+A T++ + E+VV
Sbjct: 25 VALCIPSTVISHSNARNLQQATFIAYQIAKAATLYNVSEIVVLSIPDAEAVAEISGPKVV 84
Query: 105 ------NKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALF---SMH 155
K D + S + N + N+ + L LLQY TP YL KA+F
Sbjct: 85 SAATGGQKIMFDEVANSPKVEQ-NHENRNDDDSLLLATLLQYFVTPPYLVKAVFKDSKFR 143
Query: 156 SSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAPNSV-----GTLVDVGLNKHVVV 209
++ LP + + P FREG+++ + P V + + + K+V +
Sbjct: 144 KKFKYAEKLPKIPSLPFMANNDVHKDFREGLSVAKKTPKIVKKNKKVSGKKLSVTKYVNI 203
Query: 210 --DQVLD---PGVRVTVAMGTN-RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
D++L+ P + V V + + +N SPR S K + +GY VR + + +S+
Sbjct: 204 GGDELLELAGPDIPVNVRVTVDVKNKKVVSPRDAYGVSGNKST---FGYFVRVSRSFTSI 260
Query: 264 FKNCSYKGGY---------------DHLIGTSEHGDIVNSSD-LTLPTFRHL-----LIA 302
F ++ GY D I GDI L + ++HL + A
Sbjct: 261 FTESTFPSGYTKSYYISCDDFYGKPDTGIKKETFGDISEGKILLVMSNWKHLEASFSVEA 320
Query: 303 FGGLAGLEE----SIEEDDGLKRKNAREV 327
F G+A + E IE + G++ ++A V
Sbjct: 321 FEGVAKVSELFDGEIEMEQGIRVEDAVMV 349
>gi|146417374|ref|XP_001484656.1| hypothetical protein PGUG_02385 [Meyerozyma guilliermondii ATCC
6260]
Length = 355
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 140/332 (42%), Gaps = 74/332 (22%)
Query: 63 VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD---------------- 104
V++ +P ++I N ++L+ AT +A QIA+A T++ + E+VV
Sbjct: 25 VALCIPSTVISHSNARNLQQATFIAYQIAKAATLYNVSEIVVLSIPDAEAVAEISGPKVV 84
Query: 105 ------NKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALF---SMH 155
K D + S + N + N+ + L LLQY TP YL KA+F
Sbjct: 85 SAATGGQKIMFDEVANSPKVEQ-NHENRNDDDSLLLATLLQYFVTPPYLVKAVFKDSKFR 143
Query: 156 SSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAPNSV-----GTLVDVGLNKHVVV 209
++ LP + + P FREG+++ + P V + + + K+V +
Sbjct: 144 KKFKYAEKLPKIPSLPFMANNDVHKDFREGLSVAKKTPKIVKKNKKVSGKKLSVTKYVNI 203
Query: 210 --DQVLDPG-------VRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNI 260
D++L+ VRVTV + +N SPR S K + +GY VR + +
Sbjct: 204 GGDELLELAGPDIPVNVRVTVDV---KNKKVVSPRDAYGVSGNKST---FGYFVRVSRSF 257
Query: 261 SSVFKNCSYKGGY---------------DHLIGTSEHGDIVNSSD-LTLPTFRH-----L 299
+S+F ++ GY D I GDI L + ++H L
Sbjct: 258 TSIFTESTFPSGYTKSYYISCDDFYGKPDTGIKKETFGDISEGKILLVMSNWKHLEASFL 317
Query: 300 LIAFGGLAGLEE----SIEEDDGLKRKNAREV 327
+ AF G+A + E IE + G++ ++A V
Sbjct: 318 VEAFEGVAKVSELFDGEIEMEQGIRVEDAVMV 349
>gi|118430942|ref|NP_147061.2| hypothetical protein APE_0226.1 [Aeropyrum pernix K1]
gi|116062268|dbj|BAA79138.2| conserved hypothetical protein [Aeropyrum pernix K1]
Length = 270
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 123/295 (41%), Gaps = 45/295 (15%)
Query: 65 VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
+A+P SI+ +L L T AGQI R I+R+DE+ VF++ SS SR +
Sbjct: 1 MALPSSILRVEPTLLLKTLKAGQIGRVAAIYRVDEIAVFEDPDSSPRLSRLLQLLLN--- 57
Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREG 184
Y TP +L+K +F + L++ ++PPL P H E + G
Sbjct: 58 ---------------YQATPPHLKKRVFPLRRELKYASLMPPLKIPSHTVPAE---PKVG 99
Query: 185 VTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
L + G V +G + V+ PG VTV +G + R V+ K
Sbjct: 100 AILDGYVESCSGKSCRVFLGSMGYGVLRGRARPGDVVTVRIG-----EISGGRIVL--EK 152
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL---PTFRHL 299
+Y+G+ V + + +G ++ T ++G S L P+ + L
Sbjct: 153 ASWGELYYGFSVSRFKRLEEAVTSFRRRG--YRVVATDKNGLCPPESWKILAEIPSAKTL 210
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
++ FGG G+ ++ N P QG+ T+RTEEA+ +L+
Sbjct: 211 VV-FGG---------PYKGVLEYTNPYLYDGVVNVIPRQGTETVRTEEALLATLE 255
>gi|296242582|ref|YP_003650069.1| hypothetical protein Tagg_0844 [Thermosphaera aggregans DSM 11486]
gi|296095166|gb|ADG91117.1| Protein of unknown function DUF171 [Thermosphaera aggregans DSM
11486]
Length = 272
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 129/305 (42%), Gaps = 62/305 (20%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+ VAVP SI+ +L L T Q+AR + IF + + + + + + R A
Sbjct: 4 IEVAVPTSILRVESTLLLKTLRIHQVARILGIFGVSRIFFYRDFETDPSTHREYAR---- 59
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL-----DAPHHLR--- 174
+ + +Y TP YLR+ L + L+ VGMLPP+ D P L+
Sbjct: 60 ----------LIRKQWEYFFTPPYLRRRLVPRNPLLKHVGMLPPIRLEWFDVPRRLKPGD 109
Query: 175 KHEWAPFREGVTLK-----ENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNL 229
+ F+EG K N+VG + GL V +++DPG ++ +G +
Sbjct: 110 ERVGYVFKEGGGFKVYVDSSRVFNAVGECRE-GLG----VIRIVDPGEKMAECVGKD--- 161
Query: 230 DADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
Y G + R+A + FK + ++ TS +G +
Sbjct: 162 ------------------FYKGPEPRFADS----FKQLLEESRGARIVATSRYGRVPGFE 199
Query: 290 DLT-LPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
DL+ L + +LI FG + I +G + + +V +NT P Q +T+RTEEA
Sbjct: 200 DLSALSSSNRVLILFGSPSRGLHDIAGAEGFRLEELGDV----WNTVPGQRVKTVRTEEA 255
Query: 349 IFISL 353
+ I+L
Sbjct: 256 LIITL 260
>gi|254579457|ref|XP_002495714.1| ZYRO0C01276p [Zygosaccharomyces rouxii]
gi|238938605|emb|CAR26781.1| ZYRO0C01276p [Zygosaccharomyces rouxii]
Length = 351
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 105/228 (46%), Gaps = 30/228 (13%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P +++ + ++LE T Q+A+A T+F + E+VV D ++S
Sbjct: 40 SLCIPTTLLSHCKNLEQVTYTVYQVAKAATMFNVGEIVVLDLGTTS-----------GTT 88
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMH--SSLRFVGMLPPLDAPHHLR--KHEWA 179
++ S + + LLQY TP YL +F S R LP L + +R + +
Sbjct: 89 KEKKLSDSMLIASLLQYFVTPPYLVNTVFKQKYLSYFREACKLPRLPSLPFMRYLQADEG 148
Query: 180 PFREGVTLKENAPNSVG---------TLVDVGLNKHVVVDQVLDPG-VRVTVAMGTNRNL 229
+REG+ ++ + P + G V+VG ++ + + L P VRVTV + + +
Sbjct: 149 RYREGLAIRMSKPGTQGKKTKEFKQTKYVNVGKSEALELKAQLVPANVRVTVDVVERKVV 208
Query: 230 DADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
SP++ +S +GY+VR A N VF ++ GY +
Sbjct: 209 ---SPKEAYGDFVGAQSS--YGYQVRIAKNFGDVFMESAFADGYSQAL 251
>gi|365990481|ref|XP_003672070.1| hypothetical protein NDAI_0I02590 [Naumovozyma dairenensis CBS 421]
gi|343770844|emb|CCD26827.1| hypothetical protein NDAI_0I02590 [Naumovozyma dairenensis CBS 421]
Length = 395
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 101/233 (43%), Gaps = 26/233 (11%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P S+I N +LE T QIA+A T F E+++ D S + + +
Sbjct: 60 SLCIPTSVISNCTNLEQITYTIYQIAKAATFFNAGEIIILDLTSPPNTDDTQLKTKKKST 119
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF--VGMLPPLDAPHHLRKHEWAP- 180
+ ++S + LLQY TP YL ++F F LP L A +R +
Sbjct: 120 TKLSDS--MLIASLLQYFVTPPYLVNSVFKKQYRQYFHEASKLPRLSALPFMRYLDHGEG 177
Query: 181 -----FREGVTLKENAPNSVGT----------LVDVGLNKHVVVDQVLDP-GVRVTVAMG 224
+REG+ ++ P + T +++G + ++ + L P VRVTV
Sbjct: 178 NDNHRYREGLAIRMTKPGKISTSNKKEFKQTKFINIGKDINLELKSQLVPINVRVTVDTI 237
Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
R + SP + ++ +GY VR A + +F +C+++ GY +
Sbjct: 238 EKRVV---SPEEAYGDFVGAQAS--YGYHVRIAKSFGDIFTHCAFRNGYSQSV 285
>gi|259146785|emb|CAY80042.1| EC1118_1G1_6238p [Saccharomyces cerevisiae EC1118]
Length = 341
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 90/208 (43%), Gaps = 25/208 (12%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P ++IDN +LE T A QIAR +F + E++V D S+ + +R
Sbjct: 41 SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 93
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHHLRKHEWAPF 181
S E S L LLQY TP L F + L+ P L + +
Sbjct: 94 SKETISDCLLLATLLQYFVTPPNLLDTTFKKKTKLYLKCASTFPSLKQLPFMNASAEQHY 153
Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPR 235
+EG+++ K + +++ LV +G N+ + + +Q + RVTV D+ R
Sbjct: 154 KEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----------DTER 203
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSV 263
+ V G GY VR A ++ V
Sbjct: 204 KEVVSPIDSYKGKPLGYHVRMASTLNEV 231
>gi|260941594|ref|XP_002614963.1| hypothetical protein CLUG_04978 [Clavispora lusitaniae ATCC 42720]
gi|238851386|gb|EEQ40850.1| hypothetical protein CLUG_04978 [Clavispora lusitaniae ATCC 42720]
Length = 371
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 133/337 (39%), Gaps = 69/337 (20%)
Query: 63 VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD---------------- 104
V++ VP S+I N ++LE T +A QIA+A +++ + E+VV D
Sbjct: 25 VALCVPSSVISKSNARNLEQITNIAYQIAKAASVYNVSEIVVLDIPTADEIQKKLEAEAN 84
Query: 105 ----------NKSSSDNYSRSS-----------AANRSNRSDENESGAAFLVRLLQYLET 143
NK N S AA + +ENE+ A L LLQ+ T
Sbjct: 85 KAVQLEGDKGNKKIKFNISNEEIVAEAKSEEGPAAEKLVEKEENENNALLLATLLQFFVT 144
Query: 144 PQYLRKALF---SMHSSLRFVGMLPPL-DAPHHLRKHEWAPFREGVTLKENAP------N 193
P YL K +F + ++ LP L + P F+EG+T+ ++ P
Sbjct: 145 PPYLVKGVFRASKYNKKFKYAEKLPKLSNLPFMNNNGVIEDFKEGLTVPKHTPKIQKKNK 204
Query: 194 SVGTLVDVGLNKHVVVDQV---------LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK 244
V L + + K+V V Q + VRVTV + +N SP++ + K
Sbjct: 205 KVSALKKLKVTKYVNVGQSSLLELNGTEVPVNVRVTVDV---KNKKIVSPQEAYGVTGSK 261
Query: 245 ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR-----HL 299
S +GY VRYA + SS+F S+ GY + + D L T +
Sbjct: 262 AS---FGYFVRYAKSFSSIFTESSFPSGYTESVFVQADDFFGKAKDSKLDTVDKVGKGEV 318
Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCP 336
L L LE S ++ + E+F S P
Sbjct: 319 LFVVSNLKDLEYSFSQEKIPGVEQVTEMFDSQVKVPP 355
>gi|76803056|ref|YP_331151.1| rpl operon protein [Natronomonas pharaonis DSM 2160]
gi|76558921|emb|CAI50517.1| conserved hypothetical protein [Natronomonas pharaonis DSM 2160]
Length = 278
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 130/310 (41%), Gaps = 57/310 (18%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ + ATR G +ARA T+FR+D +VV+ ++ D +
Sbjct: 2 TRSVLVPSSLTREAEDKREATRKLGYVARAATVFRVDRLVVYPDRDGDDGRFGDGFVST- 60
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
+L+Y TP YLRK ++ L + G+LPPL +R +
Sbjct: 61 ---------------VLEYAATPAYLRKEVWDRRGELEYAGILPPL----RVRSQTGSGS 101
Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADS 233
+L++ VG V+ G+ + + + G RVT+ + + + A
Sbjct: 102 DGSGSLRQGIVTEVGPDGRVRVNCGMQHPISLPVPPSMEVGKGERVTIRVSSREPVRA-- 159
Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT- 292
++V S P G+ V A +S+ + G I S HG+ ++ L
Sbjct: 160 --KLVDESPP-------GFAVDCATIEASLARP---DAGVR--IAASRHGETLSVDRLDG 205
Query: 293 ---LPTFRHLLIAFGGLA-GLEESI-----EEDDGLKRKNAREVFHSYFNTCPHQGSRTI 343
T +AFG GL E + EE G AR F + NT P+QGS +
Sbjct: 206 IVGRTTANGFTVAFGAPERGLPEILGCEPREEAIG-DEPEAR--FDLWLNTVPNQGSEVV 262
Query: 344 RTEEAIFISL 353
RTEEA+F +L
Sbjct: 263 RTEEALFATL 272
>gi|190406716|gb|EDV09983.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
Length = 341
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 92/217 (42%), Gaps = 25/217 (11%)
Query: 55 DAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
D V S+ +P ++IDN +LE T A QIAR +F + E++V D S+
Sbjct: 32 DIESNVVKYSICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SK 84
Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHH 172
+ +RS E S L LLQY TP L F + L+ P L
Sbjct: 85 DKKHEKKSRSKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPSLKQLPF 144
Query: 173 LRKHEWAPFREGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTN 226
+ ++EG+++ K + +++ LV +G N+ + + +Q + RVTV
Sbjct: 145 MNASAEQHYKEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----- 199
Query: 227 RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
D+ R+ V G GY VR A ++ V
Sbjct: 200 -----DTERKEVVSPIDSYKGKPLGYHVRMASTLNEV 231
>gi|323354916|gb|EGA86749.1| YGR283C-like protein [Saccharomyces cerevisiae VL3]
Length = 311
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 90/208 (43%), Gaps = 25/208 (12%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P ++IDN +LE T A QIAR +F + E++V D S+ + +R
Sbjct: 11 SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 63
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHHLRKHEWAPF 181
S E S L LLQY TP L F + L+ P L + +
Sbjct: 64 SKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPSLKQLPFMNASAEQHY 123
Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPR 235
+EG+++ K + +++ LV +G N+ + + +Q + RVTV D+ R
Sbjct: 124 KEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----------DTER 173
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSV 263
+ V G GY VR A ++ V
Sbjct: 174 KEVVSPIDSYKGKPLGYHVRMASTLNEV 201
>gi|344305174|gb|EGW35406.1| hypothetical protein SPAPADRAFT_58623 [Spathaspora passalidarum
NRRL Y-27907]
Length = 379
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 85/362 (23%), Positives = 139/362 (38%), Gaps = 82/362 (22%)
Query: 62 TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS---------- 109
++S+ +P S+I N +L+ T + QIA+A TI+ + E+VV D
Sbjct: 25 SISICIPTSVISQKNAYNLQQITTIVYQIAKACTIYNVSEIVVLDVPEKKEEEPEPTVEV 84
Query: 110 ---------------DNYSRSSAANRSNRSDENESGAAFLV-RLLQYLETPQYLRKALF- 152
D+ S+ A + + + E A L+ LLQ+ TP YL K +F
Sbjct: 85 VSGDKGGKKLKFNFDDDLSKDVPAKKEIQEETTEDDKAILLASLLQFFITPPYLVKTMFS 144
Query: 153 --------SMHSSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAP--------NSV 195
S+ + ++ LP + P + F+EG+ + + P V
Sbjct: 145 PALNPNFKSILAKFKYAFKLPKITTLPFMNNNQTYKDFKEGIIIPKETPKVRRKTSVKKV 204
Query: 196 GTLVDVGLNKHVVVD--QVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESG--MYWG 251
V ++K+V + Q L+ ++ + + + +D + V P G +G
Sbjct: 205 KAANKVTVSKYVNIGEAQALELNIKREIPINSRVTVDLKNKTIVSPLQAYGIVGHKAAFG 264
Query: 252 YKVRYAPNISSVFKNCSYKGGY--------DHLIGTSEHGDIVNSSDLT-LPTFR----- 297
Y VR S +F C GY D G++E SDLT LP
Sbjct: 265 YHVRVVKQFSKLFTECPIPEGYSSSVYVNCDVYFGSNEK-----ISDLTNLPQLEVNSAD 319
Query: 298 --HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT----EEAIFI 351
++L+ G L+ S EDD + V + G TI T E+A+ I
Sbjct: 320 KNNVLVVVGNYKYLQTSF-EDDKTNLEGVESVSQMF------DGKLTIPTGVKIEDAVLI 372
Query: 352 SL 353
SL
Sbjct: 373 SL 374
>gi|2425174|dbj|BAA22268.1| unnamed protein product [Halobacterium salinarum]
Length = 226
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 17/106 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S++ + ATR G +ARA +FRID V+VF ++
Sbjct: 15 TRSVLVPSSLVREAEDKREATRKLGYVARAAAVFRIDRVLVFPDEDG------------- 61
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
E + G F+ +L+Y TP YLRK F L + G+LPPL
Sbjct: 62 ----ERQWGGGFVETVLRYAATPPYLRKEAFDTRDELAYAGVLPPL 103
>gi|323337420|gb|EGA78671.1| YGR283C-like protein [Saccharomyces cerevisiae Vin13]
Length = 311
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 90/208 (43%), Gaps = 25/208 (12%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P ++IDN +LE T A QIAR +F + E++V D S+ + +R
Sbjct: 11 SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 63
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHHLRKHEWAPF 181
S E S L LLQY TP L F + L+ P L + +
Sbjct: 64 SKETISDCLLLATLLQYFVTPPNLLDTTFXKKNKLYLKCASTFPSLKQLPFMNASAEQHY 123
Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPR 235
+EG+++ K + +++ LV +G N+ + + +Q + RVTV D+ R
Sbjct: 124 KEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----------DTER 173
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSV 263
+ V G GY VR A ++ V
Sbjct: 174 KEVVSPIDSYKGKPLGYHVRMASTLNEV 201
>gi|448099375|ref|XP_004199133.1| Piso0_002543 [Millerozyma farinosa CBS 7064]
gi|359380555|emb|CCE82796.1| Piso0_002543 [Millerozyma farinosa CBS 7064]
Length = 380
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 121/296 (40%), Gaps = 72/296 (24%)
Query: 48 KRDQLNDDAPIEVPT-------VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRID 98
K+ + + P +VP+ ++ +P SII N ++LE T +A Q+A+A TI+ +
Sbjct: 3 KKGEAKKEVPTKVPSKTSPSRRFALCIPSSIISASNARNLEQITYVAYQVAKAATIYNVV 62
Query: 99 EVVVFD------------------NKSSSDNYSRSSAAN------------RSNRSDENE 128
EVV+ D SSSD + N + +++D++
Sbjct: 63 EVVILDIPSPEEQERLAEQEATTVVSSSSDKGGKKIKFNLNDKDAVETPGEKDSQADDHT 122
Query: 129 SGA----------AFLVRLLQYLETPQYLRKALFS---MHSSLRFVGMLPPLDA-PHHLR 174
S + LLQ+ TP YL KA+F S L++ LP + P
Sbjct: 123 SSSNQHVADSNNGLLFATLLQFFMTPPYLVKAVFGGSEFSSKLKYARKLPKISTLPFMSN 182
Query: 175 KHEWAPFREGVTLKENAPNSVGT-------------LVDVGLNKHVVVDQVLDPGVRVTV 221
+ +A F+EG+T+ + A V++G + +V+ + VRVTV
Sbjct: 183 NNVFADFKEGLTVPKKALKIKKKNKKPVSRKFNSTKYVNIGSDAPLVLSSEVPVNVRVTV 242
Query: 222 AMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
+ +N SP Q + +G+ VR+ S+VF CS GY I
Sbjct: 243 DI---KNKKVVSPAQAYGVVGAHSA---FGFHVRFCKKFSAVFTECSIPAGYTQSI 292
>gi|110668742|ref|YP_658553.1| rpl operon protein [Haloquadratum walsbyi DSM 16790]
gi|109626489|emb|CAJ52950.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
Length = 303
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 17/106 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV +P S++ + ATR G IAR+ T+FR+D ++VF
Sbjct: 2 TLSVLIPSSVVREAEDKREATRKLGYIARSATVFRVDHLIVF-----------------P 44
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
+R E G F+ +L+Y TP +LRK ++ L +VG+LPPL
Sbjct: 45 DRGGERRLGGEFVETVLRYAATPPHLRKKIWGRRDELEYVGVLPPL 90
>gi|385804242|ref|YP_005840642.1| hypothetical protein Hqrw_3231 [Haloquadratum walsbyi C23]
gi|339729734|emb|CCC41012.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
Length = 303
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 17/106 (16%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV +P S++ + ATR G IAR+ T+FR+D ++VF
Sbjct: 2 TLSVLIPSSVVREAEDKREATRKLGYIARSATVFRVDHLIVF-----------------P 44
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
+R E G F+ +L+Y TP +LRK ++ L +VG+LPPL
Sbjct: 45 DRGGERRLGGEFVETVLRYAATPPHLRKKIWGRRDELEYVGVLPPL 90
>gi|6323970|ref|NP_014041.1| hypothetical protein YMR310C [Saccharomyces cerevisiae S288c]
gi|2497224|sp|Q04867.1|YM91_YEAST RecName: Full=Uncharacterized protein YMR310C
gi|984683|emb|CAA90828.1| unknown [Saccharomyces cerevisiae]
gi|285814316|tpg|DAA10211.1| TPA: hypothetical protein YMR310C [Saccharomyces cerevisiae S288c]
gi|392297485|gb|EIW08585.1| hypothetical protein CENPK1137D_355 [Saccharomyces cerevisiae
CEN.PK113-7D]
Length = 317
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 129/302 (42%), Gaps = 47/302 (15%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P +++ + ++L T Q+A+ ++F + EVV+ ++ S D
Sbjct: 21 SLCIPTTLVSDCRNLSQITHKVYQVAKFASLFNVSEVVILEDNSQVDA------------ 68
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
+ + S A ++ LLQY TP YLR +F+ L LP L R K +
Sbjct: 69 TKKKISTAKLILALLQYFVTPPYLRNTVFNEKFRPYLTAASKLPRLSTLPFTRYQKQDHG 128
Query: 180 PFREGVTLKENAPN----SVGTL------VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
+REG+T+K P +G + +++G +K + + L P RVT+ T
Sbjct: 129 RYREGLTIKMQKPTLARKKIGKVFKQTKYINIGKSKALALQSQLVPINARVTIDTIT--- 185
Query: 229 LDADSPRQVVPPSKP--KESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIG---TSE 281
R++V P + +G+ +GY R A + + +F K GY + T+
Sbjct: 186 ------RKIVSPQEAYGDFTGLDSQYGYYTRIASSFTDLFMKGPLKEGYTQSVYVPLTTR 239
Query: 282 HGDIVNSSDL-TLPTFRHLLIAFGGLAGLEES--IEEDDGLKRKNAREVFHSYFNTCPHQ 338
I S L T T H+L+ F L + +++D + + +E F + CP
Sbjct: 240 DTSIPELSSLPTAETNPHILLVFSTWDTLARAFKLDQDQFVDCQGPQEFFDAQLP-CPVS 298
Query: 339 GS 340
S
Sbjct: 299 NS 300
>gi|124028156|ref|YP_001013476.1| hypothetical protein Hbut_1301 [Hyperthermus butylicus DSM 5456]
gi|123978850|gb|ABM81131.1| conserved archaeal protein [Hyperthermus butylicus DSM 5456]
Length = 280
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 79/307 (25%), Positives = 134/307 (43%), Gaps = 52/307 (16%)
Query: 66 AVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSD 125
AVP S L T G + R ++R+DE +V+ ++ + Y R + R
Sbjct: 15 AVPASYTSTEGDLRHKTIKVGLLGRLFAVYRVDEAIVYIDRPA---YWRDAELMR----- 66
Query: 126 ENESGAAFLVRLLQYLETPQYLRKALFSMH-SSLRFVGMLPPLDAPHH-LRKHEWAPFRE 183
++L+Y+ T YLRK ++ + LR+ +LPPL P H + R+
Sbjct: 67 ----------KILEYMVTAPYLRKKIYPPNVPELRYASLLPPLQLPTHGVGGPVDGEVRQ 116
Query: 184 GVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKP 243
+ L+ + + LV+ GL + V V G R+ V + R +V P +
Sbjct: 117 ALVLRASGGRA---LVEAGLEEPVWVRGRFHRGQRILVRI-------VSVERGIVEPVE- 165
Query: 244 KESGMYWGYKVRYAPNIS---SVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH-- 298
+ +Y G+KV +S S ++N + I TS G +V++ L+ R
Sbjct: 166 -DELVYNGFKVTIVEGLSRLLSKYRNYTR-------IATSRKGRLVDAELLSQIGSRSSS 217
Query: 299 ---LLIAFGGL-AGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
+L+ FG GL E + E + LK ++ + NT P QG++T+R EEA+ +L
Sbjct: 218 GPGVLVLFGAPDLGLYE-LAELENLKLED---LVDYVVNTIPQQGTKTVRVEEAVAATLA 273
Query: 355 YFQEPIS 361
+S
Sbjct: 274 LLNLVLS 280
>gi|255711846|ref|XP_002552206.1| KLTH0B09636p [Lachancea thermotolerans]
gi|238933584|emb|CAR21768.1| KLTH0B09636p [Lachancea thermotolerans CBS 6340]
Length = 389
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 110/258 (42%), Gaps = 44/258 (17%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P S+IDN ++LE AT + QIA++ +F + E+VV D R+ + +R
Sbjct: 62 SLCIPVSVIDNCKNLEQATHVVYQIAKSAVLFNVGEIVVLD-------LGRAEDVPKESR 114
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLRK-HE-WA 179
S + LLQY TP YL K++F + LP + A +R HE
Sbjct: 115 HRFQMSPVLLVASLLQYFVTPPYLVKSVFKKEYQECFKVAAKLPRITALPFMRHLHEDNG 174
Query: 180 PFREGVTLKENAPNSVG-----------TLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNR 227
+REG+ ++ P V+VG + + + Q++ VRVTV +
Sbjct: 175 RYREGLAVRMQRPAQGARPAKKKAYDQTKYVNVGKDSALELRGQLVPANVRVTVDL---- 230
Query: 228 NLDADSPRQVVPPSKPK----ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG 283
+D +VV P++ + +G+ VR A + +F GY +
Sbjct: 231 -IDC----KVVSPAEAYGDFVGAQASYGFHVRVAKTFADIFTQSPAPNGYTQTVW----- 280
Query: 284 DIVNSSDLTL-PTFRHLL 300
VNS D PT + L
Sbjct: 281 --VNSGDFYFDPTTKKSL 296
>gi|256272178|gb|EEU07175.1| YMR310C-like protein [Saccharomyces cerevisiae JAY291]
gi|349580602|dbj|GAA25762.1| K7_Ymr310cp [Saccharomyces cerevisiae Kyokai no. 7]
Length = 317
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/298 (24%), Positives = 127/298 (42%), Gaps = 39/298 (13%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P +++ + ++L T Q+A+ ++F + EVV+ ++ S D
Sbjct: 21 SLCIPTTLVSDCRNLSQITHKVYQVAKFASLFNVSEVVILEDNSQVDA------------ 68
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
+ + S A ++ LLQY TP YLR +F+ L LP L R K +
Sbjct: 69 TKKKISTAKLILALLQYFVTPPYLRNTVFNEKFRPYLTAASKLPRLSTLPFTRYQKQDHG 128
Query: 180 PFREGVTLKENAPN----SVGTL------VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
+REG+T+K P G + +++G +K + + L P RVT+ T +
Sbjct: 129 RYREGLTIKMQKPTLARKKTGKVFKQTKYINIGKSKALALQSQLVPINARVTIDTITRKI 188
Query: 229 LDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIG---TSEHGDI 285
+ SP++ ES +GY R A + + +F K GY + T+ I
Sbjct: 189 V---SPQEAYGDFTGLESQ--YGYYTRIASSFTDLFMKGPLKEGYTQSVYVPLTTRDTSI 243
Query: 286 VNSSDL-TLPTFRHLLIAFGGLAGLEES--IEEDDGLKRKNAREVFHSYFNTCPHQGS 340
S L T T H+L+ F L + +++D + + +E F + CP S
Sbjct: 244 PELSSLPTAETNPHILLVFSTWDTLARAFKLDQDQFVDCQGPQEFFDAQLP-CPVSNS 300
>gi|151946019|gb|EDN64251.1| conserved protein [Saccharomyces cerevisiae YJM789]
gi|190408536|gb|EDV11801.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|259148902|emb|CAY82147.1| EC1118_1M3_5292p [Saccharomyces cerevisiae EC1118]
gi|323332207|gb|EGA73618.1| YMR310C-like protein [Saccharomyces cerevisiae AWRI796]
gi|323336010|gb|EGA77286.1| YMR310C-like protein [Saccharomyces cerevisiae Vin13]
gi|323346996|gb|EGA81273.1| YMR310C-like protein [Saccharomyces cerevisiae Lalvin QA23]
gi|323353230|gb|EGA85530.1| YMR310C-like protein [Saccharomyces cerevisiae VL3]
gi|365764004|gb|EHN05530.1| YMR310C-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 317
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 128/302 (42%), Gaps = 47/302 (15%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P +++ + ++L T Q+A+ ++F + EVV+ ++ S D
Sbjct: 21 SLCIPTTLVSDCRNLSQITHKVYQVAKFASLFNVSEVVILEDNSQVDA------------ 68
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
+ + S A ++ LLQY TP YLR +F+ L LP L R K +
Sbjct: 69 TKKKISTAKLILALLQYFVTPPYLRNTVFNEKFRPYLTAASKLPRLSTLPFTRYQKQDHG 128
Query: 180 PFREGVTLKENAPN----SVGTL------VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
+REG+T+K P G + +++G +K + + L P RVT+ T
Sbjct: 129 RYREGLTIKMQKPTLARKKTGKVFKQTKYINIGKSKALALQSQLVPINARVTIDTIT--- 185
Query: 229 LDADSPRQVVPPSKP--KESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIG---TSE 281
R++V P + +G+ +GY R A + + +F K GY + T+
Sbjct: 186 ------RKIVSPQEAYGDFTGLDSQYGYYTRIASSFTDLFMKGPLKEGYTQSVYVPLTTR 239
Query: 282 HGDIVNSSDL-TLPTFRHLLIAFGGLAGLEES--IEEDDGLKRKNAREVFHSYFNTCPHQ 338
I S L T T H+L+ F L + +++D + + +E F + CP
Sbjct: 240 DTSIPELSSLPTAETNPHILLVFSTWDTLARAFKLDQDQFVDCQGPQEFFDAQLP-CPVS 298
Query: 339 GS 340
S
Sbjct: 299 NS 300
>gi|399577345|ref|ZP_10771098.1| hypothetical protein HSB1_31370 [Halogranum salarium B-1]
gi|399237728|gb|EJN58659.1| hypothetical protein HSB1_31370 [Halogranum salarium B-1]
Length = 276
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 128/304 (42%), Gaps = 52/304 (17%)
Query: 67 VPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDE 126
+P S++ + ATR G +ARA T+FR D +VVF +R E
Sbjct: 1 MPSSLVREAEDKREATRKLGYVARAATVFRADRLVVF-----------------PDREGE 43
Query: 127 NESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLD-APHHLRKHEWA-PFREG 184
+ G F+ +L+Y TP YLR+ + + L +VG+LPPL +P + + + +G
Sbjct: 44 SRWGGEFVTTVLEYAVTPPYLRQEAWDVRDELEYVGVLPPLRVSPQTGSESDGSGSLTQG 103
Query: 185 VTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
+ + V V+ G+ + + + G RVTV + + + R +
Sbjct: 104 IVTEVGPEGRV--RVNCGMQHPISLITPDGMSVGEGERVTVRISSREPV-----RARIVD 156
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR--- 297
P GY V ++ ++ + I TS +G+ ++ S L+ R
Sbjct: 157 EAPS------GYVVERMDLAEALTRDDA-----GVRIATSRYGEKLSVSRLSSLVGRLDD 205
Query: 298 HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFN----TCPHQGSRTIRTEEAI 349
+ + FG GL + + ED A S F+ T P QGS +RTEEA+
Sbjct: 206 GMTVVFGSPGRGLPDILDVAVEDLPTSTGGASVGPDSGFDLWLNTIPRQGSEVVRTEEAM 265
Query: 350 FISL 353
F SL
Sbjct: 266 FASL 269
>gi|448578202|ref|ZP_21643637.1| hypothetical protein C455_11768 [Haloferax larsenii JCM 13917]
gi|448592778|ref|ZP_21651825.1| hypothetical protein C453_14938 [Haloferax elongans ATCC BAA-1513]
gi|445726743|gb|ELZ78359.1| hypothetical protein C455_11768 [Haloferax larsenii JCM 13917]
gi|445730804|gb|ELZ82391.1| hypothetical protein C453_14938 [Haloferax elongans ATCC BAA-1513]
Length = 280
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 130/311 (41%), Gaps = 58/311 (18%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T+SV VP S++ + ATR G +ARA +FR DE+V+F ++
Sbjct: 2 TLSVLVPSSLVREAEDQREATRKLGYVARAAAVFRADELVIFPDEDG------------- 48
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWA 179
EN+ G F+ +L+Y TP YLRK ++ LR+ G+LPP + + E
Sbjct: 49 ----ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVSSTTADDSDELP 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGV------RVTVAMGTNRNLDADS 233
REG+ + + V V+ G+ + + + PG+ RV V + + + A
Sbjct: 105 ALREGIVTEVGPDDRV--RVNCGMQHPISL--FVPPGMDATEGERVAVRISSREPVRARI 160
Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDL 291
+ +P G+ V S G D I TS +G+ ++ L
Sbjct: 161 VDEPLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPRL 202
Query: 292 TLPTFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRT 342
T R + + FG GL + ED + F + NT P QGS
Sbjct: 203 GELTARIADADGMTVVFGSPGRGLPDMLGMTPEDVADVEPSVGPGFDLWLNTIPRQGSDV 262
Query: 343 IRTEEAIFISL 353
+RTEEA+F SL
Sbjct: 263 VRTEEAMFASL 273
>gi|126274487|ref|XP_001387555.1| hypothetical protein PICST_5340 [Scheffersomyces stipitis CBS 6054]
gi|126213425|gb|EAZ63532.1| hypothetical protein PICST_5340, partial [Scheffersomyces stipitis
CBS 6054]
Length = 368
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/290 (24%), Positives = 116/290 (40%), Gaps = 69/290 (23%)
Query: 63 VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD---------- 110
VSV +P ++I N ++LE T + QIARA TI+ + E+++ D S+
Sbjct: 12 VSVCIPSTVISSKNARNLEQITSIVNQIARAATIYNVVEIIILDIPESAAEDENQKVVEL 71
Query: 111 -----------NYS--------RSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKAL 151
N+S + + A S ++ + A L LLQ+ TP YL K +
Sbjct: 72 GGSKGGKKLKFNFSDEEILKDKQETVAQTSEDVQDSSANAFLLAGLLQFFVTPPYLVKTI 131
Query: 152 FS-----------MHSSLRFVGMLPPLDAPHHLRKHE-WAPFREGVTLKENAPN------ 193
FS M ++ LP + + +E + F+EG T+ + P
Sbjct: 132 FSPAINPSPLNKDMLKKFKYAYKLPKITTLPFMSNNEVFRDFKEGFTVPKETPKVVSKKD 191
Query: 194 -----------SVGTLVDVGLNK--HVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
SV V++G + + + + + RVTV + +N SP Q
Sbjct: 192 KSKKVKAEKKISVTKYVNIGEAELLELNIKREIPAYSRVTVDL---KNKTIVSPLQAYGV 248
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
K S +GY VR+ SS+F S GY I + D +S+D
Sbjct: 249 MGNKSS---FGYYVRFCKKFSSIFTESSAPEGYSSSICV-QSDDFYSSAD 294
>gi|389860646|ref|YP_006362886.1| hypothetical protein TCELL_0323 [Thermogladius cellulolyticus 1633]
gi|388525550|gb|AFK50748.1| hypothetical protein TCELL_0323 [Thermogladius cellulolyticus 1633]
Length = 274
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 131/296 (44%), Gaps = 42/296 (14%)
Query: 63 VSVAVPGSIIDNTQSLELATRL-AGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
V V +P S++ + L+ L A QI R +IF +++ V F + S+S+
Sbjct: 5 VHVVLPSSVL-TVEPPGLSRLLKAYQILRYTSIFGVEKAVFFKDFSTSER---------- 53
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDA-PHHLRKHEWAP 180
++ + AF+ ++ Y +TP YLR+ L + LR VG+LPPL H+ K +
Sbjct: 54 ----DHGTARAFIEKVWNYYKTPPYLRRKLIPIDRDLRGVGILPPLRLESFHVSKRPYTG 109
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
+K +A G D+G+ + V+ GVR + D ++VV
Sbjct: 110 QVRPALVKRSAD---GCFADIGVGEDFAVEGECREGVRQVRVV--------DPVKKVV-- 156
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN-SSDLTLPTFRHL 299
+ E+ +Y G + + ++ V + + G L+ T G + + +D + + +
Sbjct: 157 -ELTETKLYTGPETAFKNSLREVLEEYA---GKSFLLATDRRGRVPSLVTDAKRMSGKKV 212
Query: 300 LIAFGGLAG--LEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
++ G + E S+ E N R +NT P Q T+RTEEA+ I+L
Sbjct: 213 VVLLGSPSHDLFEMSVAEG-----FNLRMFVDDVWNTIPGQKVVTVRTEEALIITL 263
>gi|448452122|ref|ZP_21593142.1| hypothetical protein C470_10435 [Halorubrum litoreum JCM 13561]
gi|445809766|gb|EMA59804.1| hypothetical protein C470_10435 [Halorubrum litoreum JCM 13561]
Length = 288
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 128/323 (39%), Gaps = 80/323 (24%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF +
Sbjct: 7 LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 49
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
R + G ++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 50 REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP+E G++V A ++S G + TS HG+ ++
Sbjct: 154 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGVAVATSRHGESLS 201
Query: 288 SS---DLTLPTFR--HLLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
S DL + +AFG E + E GL R F
Sbjct: 202 VSRLGDLVSDARKAGSYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 258
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ NT P Q S +RTEEA+F +L
Sbjct: 259 WLNTIPAQASEVVRTEEALFATL 281
>gi|156937544|ref|YP_001435340.1| hypothetical protein Igni_0751 [Ignicoccus hospitalis KIN4/I]
gi|156566528|gb|ABU81933.1| conserved hypothetical protein [Ignicoccus hospitalis KIN4/I]
Length = 276
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 95/224 (42%), Gaps = 21/224 (9%)
Query: 130 GAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAP---HHLRKHEWAPFREGVT 186
G+ L+ +L+YL P YLRK + + L++ G+LPPL + E F +
Sbjct: 54 GSDVLMSVLEYLSYPTYLRKLIVPLKPELKYAGVLPPLAVKALNEGFKDREEGLFFKFGL 113
Query: 187 LKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES 246
+ E S+ T +DVG V V V V +G N D +V+P +
Sbjct: 114 IVECKRGSIAT-IDVG-EPERVTHNVKKCKKNVLVLVGFN---DKKRVVKVLPAKRGIWR 168
Query: 247 GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGL 306
G Y G++V Y NI + + YKG +GTS G L + ++ + FG
Sbjct: 169 GEYLGFEVNYFDNIYELVE--FYKGAGFKALGTSRRGTWPGKLRDYLGS--NVGVLFG-- 222
Query: 307 AGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
D GL K + N P QG RT+R EEA++
Sbjct: 223 -------SPDKGLLDKYPDLELDALVNLFPCQGVRTVRLEEAVW 259
>gi|448426508|ref|ZP_21583357.1| hypothetical protein C473_10123 [Halorubrum terrestre JCM 10247]
gi|448484333|ref|ZP_21606040.1| hypothetical protein C462_11693 [Halorubrum arcis JCM 13916]
gi|445679388|gb|ELZ31855.1| hypothetical protein C473_10123 [Halorubrum terrestre JCM 10247]
gi|445820108|gb|EMA69937.1| hypothetical protein C462_11693 [Halorubrum arcis JCM 13916]
Length = 288
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 128/323 (39%), Gaps = 80/323 (24%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF +
Sbjct: 7 LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 49
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
R + G ++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 50 REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP+E G++V A ++S G + TS HG+ ++
Sbjct: 154 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGVAVATSRHGESLS 201
Query: 288 SS---DLTLPTFRH--LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
S DL + +AFG E + E GL R F
Sbjct: 202 VSRLGDLVSDARKAGGYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 258
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ NT P Q S +RTEEA+F +L
Sbjct: 259 WLNTIPAQASEVVRTEEALFATL 281
>gi|448489439|ref|ZP_21607662.1| hypothetical protein C463_03472 [Halorubrum californiensis DSM
19288]
gi|445694811|gb|ELZ46929.1| hypothetical protein C463_03472 [Halorubrum californiensis DSM
19288]
Length = 288
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 128/323 (39%), Gaps = 80/323 (24%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF +
Sbjct: 7 LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 49
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
R + G ++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 50 REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPNGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP+E G++V A ++S G + TS HG+ ++
Sbjct: 154 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGLAVATSRHGESLS 201
Query: 288 SSDL--TLPTFRH---LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
S L + R +AFG E + E GL R F
Sbjct: 202 VSRLGDLVSDAREAGGYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 258
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ NT P Q S +RTEEA+F +L
Sbjct: 259 WLNTIPAQASEVVRTEEALFATL 281
>gi|448606194|ref|ZP_21658773.1| hypothetical protein C441_12325 [Haloferax sulfurifontis ATCC
BAA-897]
gi|445739611|gb|ELZ91118.1| hypothetical protein C441_12325 [Haloferax sulfurifontis ATCC
BAA-897]
Length = 280
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 83/304 (27%), Positives = 130/304 (42%), Gaps = 46/304 (15%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++
Sbjct: 3 LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
EN+ G F+ +L+Y TP YLRK ++ LR+ G+LPP L A + +P
Sbjct: 49 ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
REG+ + + V V+ G+ + + + PG+ +T + + P +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISL--FVPPGMDLTEGERVAVRISSREPVRARIV 161
Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLPTFR- 297
+P+ G+ V S G D I TS +G+ ++ L T R
Sbjct: 162 DEPRP-----GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDLTARI 209
Query: 298 ----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
+ + FG GL + ED + F + NT P QGS +RTEEA+
Sbjct: 210 TDAGGMTVVFGSPGRGLPDMLGMPPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRTEEAM 269
Query: 350 FISL 353
F SL
Sbjct: 270 FASL 273
>gi|410083202|ref|XP_003959179.1| hypothetical protein KAFR_0I02650 [Kazachstania africana CBS 2517]
gi|372465769|emb|CCF60044.1| hypothetical protein KAFR_0I02650 [Kazachstania africana CBS 2517]
Length = 351
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 113/257 (43%), Gaps = 43/257 (16%)
Query: 40 DNKKKKKRKRDQLNDDAPIEVPTVSV----AVPGSIIDNTQSLELATRLAGQIARAVTIF 95
D+ K KK PI+V + S+ +P SI+++ +L+ T QIA+A T+F
Sbjct: 15 DDLKIKKPNLPLKKSKKPIKVLSKSINYSLCIPTSILNSCSNLDQITHTLYQIAKAATLF 74
Query: 96 RIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMH 155
+ E+V+ D S + S+ + + + LLQY TP YL K++F
Sbjct: 75 NVAEIVILDQGSRN--------------SESKITDSMLMASLLQYFVTPPYLIKSVFKKQ 120
Query: 156 --SSLRFVGMLP-PLDAPH--HLRKHEWAPFREGVTLKENAPNSVGTL-----VDVGLNK 205
+ LP L+ P +L +E +REG++++ P++ +++G +
Sbjct: 121 YLGYFKIASKLPRILNLPFMRYLEANE-GRYREGLSIRMTHPDAKKEFKQTKYINIGKAE 179
Query: 206 HVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK----ESGMYWGYKVRYAPNI 260
+ + L P VRVTV + ++VV P + + +GY VR A +
Sbjct: 180 TLELKSQLVPTNVRVTVDI---------VEKKVVSPIEAYGDFVGANSSYGYHVRIAKSF 230
Query: 261 SSVFKNCSYKGGYDHLI 277
+F ++ GY I
Sbjct: 231 GDIFTESAFSKGYSQAI 247
>gi|401839560|gb|EJT42732.1| YGR283C-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 341
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 92/219 (42%), Gaps = 25/219 (11%)
Query: 53 NDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNY 112
N D V S+ +P S IDN +LE T A QIA+ +F I E++V D
Sbjct: 30 NMDIESAVAQYSICIPTSAIDNCFNLEQTTFTAYQIAKTAVLFNIQEIIVLD-------L 82
Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAP 170
S+ + RS E S L LLQY TP L F + L+ P L
Sbjct: 83 SKDRRHGKRPRSKETISDCLLLATLLQYFVTPSNLLDITFKRKNMLYLKHAFAFPKLHQL 142
Query: 171 HHLRKHEWAPFREGVTL-KENAPNSV----GTLVDVGLNKHV-VVDQVLDPGVRVTVAMG 224
+ + +REG+++ ++NA + LV +G + + + +Q + VRVTV +
Sbjct: 143 AFMNESAEHIYREGLSIAQDNAEKEIERDQTNLVYIGKGEIIELSNQNIPRTVRVTVDL- 201
Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
+ + SPR V GY VR A + V
Sbjct: 202 --QRKEVVSPRDVY-------KNQTLGYHVRVASTLDEV 231
>gi|389848018|ref|YP_006350257.1| rpl operon protein [Haloferax mediterranei ATCC 33500]
gi|448618115|ref|ZP_21666460.1| rpl operon protein [Haloferax mediterranei ATCC 33500]
gi|388245324|gb|AFK20270.1| rpl operon protein [Haloferax mediterranei ATCC 33500]
gi|445747670|gb|ELZ99125.1| rpl operon protein [Haloferax mediterranei ATCC 33500]
Length = 280
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 132/309 (42%), Gaps = 56/309 (18%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++
Sbjct: 3 LSVLVPTSLVREAEDQREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWAP 180
EN+ G F+ +L+Y TP YLRK ++ LR+ G+LPP + + E
Sbjct: 49 ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVSSTTAGDSDETPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGV------RVTVAMGTNRNLDADSP 234
REG+ + + V V+ G+ + + + PG+ RV V + + + A
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISL--FIPPGMDIAKGERVAVRISSREPVRARIV 161
Query: 235 RQVVPPSKPKESGMYWGYKVRYAPNISSVFK-NCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+ +P G+ V ++ + + + I TS +G+ ++ L
Sbjct: 162 DEPLP-----------GFDVSRMDLTEALGRPDAGVR------IATSRYGESLSVPRLGD 204
Query: 294 PTFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIR 344
T R + +AFG GL + ED + F + NT P QGS +R
Sbjct: 205 LTARITDAGGMTVAFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVR 264
Query: 345 TEEAIFISL 353
TEEA+F SL
Sbjct: 265 TEEAMFASL 273
>gi|448103215|ref|XP_004199984.1| Piso0_002543 [Millerozyma farinosa CBS 7064]
gi|359381406|emb|CCE81865.1| Piso0_002543 [Millerozyma farinosa CBS 7064]
Length = 380
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 70/273 (25%), Positives = 111/273 (40%), Gaps = 65/273 (23%)
Query: 64 SVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD----------------- 104
++ +P SII N ++LE T +A Q+A+A TI+ + EVV+ D
Sbjct: 26 ALCIPSSIISASNAKNLEQITYVAYQVAKAATIYNVVEVVILDIPSPEEQERLEEREATK 85
Query: 105 -NKSSSDNYSR-------------------SSAANRSNRSDE---NESGAAFLVRLLQYL 141
SSSD + S A + ++RS + + + LLQ+
Sbjct: 86 VVSSSSDKGGKKIKFNLNDKEAIEPPREKDSQADDYTSRSSQHVADPNNGLLFATLLQFF 145
Query: 142 ETPQYLRKALFS---MHSSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAPNSVGT 197
TP YL KA+F S L++ LP + P + ++ F+EG+T+ + A
Sbjct: 146 MTPPYLVKAVFGSSEFLSKLKYARKLPKISTLPFMGNNNVFSDFKEGLTVPKKALKIKKK 205
Query: 198 -------------LVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK 244
V++G + +V+ + VRVTV + +N SP Q
Sbjct: 206 NKKPVSRKFNSTKYVNIGSDAPLVLSSEVPVNVRVTVDI---KNKKVVSPAQAYGVVGAH 262
Query: 245 ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
+ +GY VR S+VF CS GY I
Sbjct: 263 SA---FGYHVRLCKKFSAVFTECSIPAGYTQSI 292
>gi|292656686|ref|YP_003536583.1| hypothetical protein HVO_2565 [Haloferax volcanii DS2]
gi|448290690|ref|ZP_21481836.1| hypothetical protein C498_08115 [Haloferax volcanii DS2]
gi|291372324|gb|ADE04551.1| conserved protein [Haloferax volcanii DS2]
gi|445578061|gb|ELY32476.1| hypothetical protein C498_08115 [Haloferax volcanii DS2]
Length = 280
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 84/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++ +N+
Sbjct: 3 LSVLVPSSLVREAEDQREATRKLGYVARAAAVFRANELVVFPDEDGENNW---------- 52
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
G F+ +L+Y TP YLRK ++ LR+ G+LPP L A + +P
Sbjct: 53 -------GGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + + V V+ G+ + + L G RV V + + + A +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
+P G+ V S G D I TS +G+ ++ L
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205
Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
T R + + FG GL + ED + F + NT P QGS +RT
Sbjct: 206 TARITDAGGMTVVFGSPGRGLPDMLGMPPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265
Query: 346 EEAIFISL 353
EEA+F SL
Sbjct: 266 EEAMFASL 273
>gi|429217689|ref|YP_007175679.1| hypothetical protein Calag_1531 [Caldisphaera lagunensis DSM 15908]
gi|429134218|gb|AFZ71230.1| hypothetical protein Calag_1531 [Caldisphaera lagunensis DSM 15908]
Length = 276
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 83/330 (25%), Positives = 140/330 (42%), Gaps = 72/330 (21%)
Query: 38 SCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRI 97
SC+ KKR + L + +P ++V S+ID T AG I+R ++I+++
Sbjct: 2 SCNEFPPKKRNKKLL-----VLLPISVLSVEDSLIDKTYK-------AGFISRILSIYKV 49
Query: 98 DEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
DEV+++ + + SN DE FL +L Y P +++K +
Sbjct: 50 DEVILYRD-------------DESNSFDE-----KFLQLILNYQIIPPHIKKKIIKKSKL 91
Query: 158 LRFVGMLPPLDAPHH--LRKHEWAPFREGVTLKENAPNSVGTLVDVG---LNKHVVVDQV 212
L++ G+LPPL +H +K E +G K + + L +G L+K V D++
Sbjct: 92 LKYAGILPPLRLVNHDPPKKAEIGKILDGYVEKCDENYCIVYLGMIGNGYLDKKEVDDKI 151
Query: 213 LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNIS-SVFKNCSYKG 271
+ ++ S + + K +Y GYKV N+ FK+ Y
Sbjct: 152 ITVKIK--------------SIKNKIMLEKASWGNLYTGYKVSKIKNLDIKKFKDEGY-- 195
Query: 272 GYDHLIGTSEHGDIVNSS-----DLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNARE 326
++GTS+ G+ N D L + ++ FGG + N +E
Sbjct: 196 ---LIVGTSKLGECFNDKIKEEIDNKLRNSKGIVFVFGGPHA-----------ELYNEKE 241
Query: 327 VFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
F NT +QG+ T+RTEEA+ +L F
Sbjct: 242 -FDYILNTVLNQGTLTVRTEEALASTLAIF 270
>gi|448569069|ref|ZP_21638481.1| hypothetical protein C456_04136 [Haloferax lucentense DSM 14919]
gi|448600512|ref|ZP_21656008.1| hypothetical protein C452_16704 [Haloferax alexandrinus JCM 10717]
gi|445725219|gb|ELZ76844.1| hypothetical protein C456_04136 [Haloferax lucentense DSM 14919]
gi|445735229|gb|ELZ86781.1| hypothetical protein C452_16704 [Haloferax alexandrinus JCM 10717]
Length = 280
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 84/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++ +N+
Sbjct: 3 LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDGENNW---------- 52
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
G F+ +L+Y TP YLRK ++ LR+ G+LPP L A + +P
Sbjct: 53 -------GGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + + V V+ G+ + + L G RV V + + + A +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
+P G+ V S G D I TS +G+ ++ L
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205
Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
T R + + FG GL + ED + F + NT P QGS +RT
Sbjct: 206 TARITDAGGMTVVFGSPGRGLPDMLGMPPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265
Query: 346 EEAIFISL 353
EEA+F SL
Sbjct: 266 EEAMFASL 273
>gi|448517688|ref|ZP_21617262.1| hypothetical protein C466_01182 [Halorubrum distributum JCM 10118]
gi|445705903|gb|ELZ57791.1| hypothetical protein C466_01182 [Halorubrum distributum JCM 10118]
Length = 288
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 127/323 (39%), Gaps = 80/323 (24%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF +
Sbjct: 7 LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 49
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
R + G ++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 50 REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP+E G++V A ++S G + TS HG+ ++
Sbjct: 154 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGVAVATSRHGESLS 201
Query: 288 SS---DLTLPTFRH--LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
S DL + + FG E + E GL R F
Sbjct: 202 VSRLGDLVSDARKAGGYTVVFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 258
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ NT P Q S +RTEEA+F +L
Sbjct: 259 WLNTIPAQASEVVRTEEALFATL 281
>gi|448508629|ref|ZP_21615580.1| hypothetical protein C465_08536 [Halorubrum distributum JCM 9100]
gi|445697255|gb|ELZ49325.1| hypothetical protein C465_08536 [Halorubrum distributum JCM 9100]
Length = 300
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 127/323 (39%), Gaps = 80/323 (24%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF +
Sbjct: 19 LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 61
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
R + G ++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 62 REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 111
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 112 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 165
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP+E G++V A ++S G + TS HG+ ++
Sbjct: 166 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGVAVATSRHGESLS 213
Query: 288 SS---DLTLPTFRH--LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
S DL + + FG E + E GL R F
Sbjct: 214 VSRLGDLVSDARKAGGYTVVFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 270
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ NT P Q S +RTEEA+F +L
Sbjct: 271 WLNTIPAQASEVVRTEEALFATL 293
>gi|448565514|ref|ZP_21636381.1| hypothetical protein C457_13294 [Haloferax prahovense DSM 18310]
gi|445715258|gb|ELZ67014.1| hypothetical protein C457_13294 [Haloferax prahovense DSM 18310]
Length = 280
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++
Sbjct: 3 LSVLVPSSLVREAEDQREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
EN+ G F+ +L+Y TP YLRK ++ LR+ G+LPP L A + +P
Sbjct: 49 ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + + V V+ G+ + + L G RV V + + + A +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
+P G+ V S G D I TS +G+ ++ L
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205
Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
T R + + FG GL + ED + F + NT P QGS +RT
Sbjct: 206 TARITDVGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265
Query: 346 EEAIFISL 353
EEA+F SL
Sbjct: 266 EEAMFASL 273
>gi|448583469|ref|ZP_21646825.1| hypothetical protein C454_09616 [Haloferax gibbonsii ATCC 33959]
gi|445729698|gb|ELZ81293.1| hypothetical protein C454_09616 [Haloferax gibbonsii ATCC 33959]
Length = 280
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++
Sbjct: 3 LSVLVPSSLVREAEDQREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
EN+ G F+ +L+Y TP YLRK ++ LR+ G+LPP L A + +P
Sbjct: 49 ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + + V V+ G+ + + L G RV V + + + A +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
+P G+ V S G D I TS +G+ ++ L
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205
Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
T R + + FG GL + ED + F + NT P QGS +RT
Sbjct: 206 TARITDAGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265
Query: 346 EEAIFISL 353
EEA+F SL
Sbjct: 266 EEAMFASL 273
>gi|401626168|gb|EJS44127.1| YMR310C [Saccharomyces arboricola H-6]
Length = 319
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 124/298 (41%), Gaps = 45/298 (15%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P +++ + +L T Q+A+ ++F + E+V+ +N D++ ++ A
Sbjct: 21 SLCIPTTLVSDCCNLSQITHKIYQVAKFASLFNVSELVILEN----DSHVQTGA------ 70
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
S + S A ++ LLQY TP YL +F+ L LP L R K +
Sbjct: 71 SKKKISTAKLILALLQYFVTPPYLCNTVFNEKFKPYLIAASKLPRLSTLPFTRYQKQDHG 130
Query: 180 PFREGVTLKENAPNSV----------GTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
+REG+T+K P +++G + + + L P RVT+ T
Sbjct: 131 RYREGLTIKMQKPTLTRKKTGKEFKQTKYINIGKPEALALQSQLVPINARVTIDTIT--- 187
Query: 229 LDADSPRQVVPPSKP----KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI----GTS 280
R++V P + + +GY R A + + +F K GY I T
Sbjct: 188 ------RKIVSPQEAYGDFTGTDSLYGYYTRIASSFTDLFMKSPLKEGYTQSIYVPLATR 241
Query: 281 EHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIE--EDDGLKRKNAREVFHSYFNTCP 336
E SS + T ++L+ F LE + E +D L+ + +E F + CP
Sbjct: 242 ETNIPALSSLPAIGTPSNILLVFSAWDTLERAFELDKDQFLECQGPQEFFDAQLP-CP 298
>gi|401838346|gb|EJT42026.1| YMR310C-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 317
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 101/233 (43%), Gaps = 40/233 (17%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P +++ + ++L T +A+ ++F + E+V+ + DN RS+
Sbjct: 21 SLCIPSTLVADCRNLSQITHKVYHVAKVASLFNVSELVILE-----DNPQRST------- 68
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
S + S A ++ LLQY TP YLR +F+ L LP L R K +
Sbjct: 69 SKKKISNAKLILALLQYFVTPPYLRNTVFNEKFRPYLVAASKLPRLSTLPFTRYEKQDHG 128
Query: 180 PFREGVTLKENAPNSV----------GTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
+REG+T+K P S +++G + + + L P RVT+ T
Sbjct: 129 RYREGLTIKMQKPTSARKKTGKEFKQTKYINIGKPEALALQNQLVPINARVTIDTIT--- 185
Query: 229 LDADSPRQVVPPSKP--KESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
R++V P + +G+ ++GY R A + + +F K GY I
Sbjct: 186 ------RKIVSPQEAYGDFTGLDSHYGYHTRIASSFTDLFMKGPLKEGYTQSI 232
>gi|448623079|ref|ZP_21669728.1| hypothetical protein C438_11873 [Haloferax denitrificans ATCC
35960]
gi|445753587|gb|EMA05004.1| hypothetical protein C438_11873 [Haloferax denitrificans ATCC
35960]
Length = 280
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++
Sbjct: 3 LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
EN+ G F+ +L+Y TP YLRK ++ LR+ G+LPP L A + +P
Sbjct: 49 ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + + V V+ G+ + + L G RV V + + + A +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
+P G+ V S G D I TS +G+ ++ L
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205
Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
T R + + FG GL + ED + F + NT P QGS +RT
Sbjct: 206 TARITDAGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265
Query: 346 EEAIFISL 353
EEA+F SL
Sbjct: 266 EEAMFASL 273
>gi|365760478|gb|EHN02195.1| YGR283C-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 343
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 91/219 (41%), Gaps = 25/219 (11%)
Query: 53 NDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNY 112
N D V S+ +P S IDN +LE T A QIA+ +F I E++V D
Sbjct: 30 NMDIESAVAQYSICIPTSAIDNCFNLEQTTFTAYQIAKTAVLFNIQEIIVLD-------L 82
Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAP 170
S+ + RS E S L LLQY TP L F + L+ P L
Sbjct: 83 SKDRRHGKRPRSKETISDCLLLATLLQYFVTPSNLLDITFKRKNMLYLKHAFAFPKLHQL 142
Query: 171 HHLRKHEWAPFREGVTL-KENAPNSV----GTLVDVGLNKHV-VVDQVLDPGVRVTVAMG 224
+ + +REG+++ ++NA + LV +G + + + +Q + VRVTV +
Sbjct: 143 AFMNESAEHIYREGLSIAQDNAEKEIERDQTNLVYIGKGEIIELSNQNIPRTVRVTVDLQ 202
Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
+ SPR V GY VR A + V
Sbjct: 203 RK---EVVSPRDVY-------KNQTLGYHVRVASTLDEV 231
>gi|448545987|ref|ZP_21626314.1| hypothetical protein C460_16402 [Haloferax sp. ATCC BAA-646]
gi|448548061|ref|ZP_21627405.1| hypothetical protein C459_03800 [Haloferax sp. ATCC BAA-645]
gi|448557084|ref|ZP_21632519.1| hypothetical protein C458_11715 [Haloferax sp. ATCC BAA-644]
gi|445703333|gb|ELZ55264.1| hypothetical protein C460_16402 [Haloferax sp. ATCC BAA-646]
gi|445714763|gb|ELZ66521.1| hypothetical protein C459_03800 [Haloferax sp. ATCC BAA-645]
gi|445714953|gb|ELZ66710.1| hypothetical protein C458_11715 [Haloferax sp. ATCC BAA-644]
Length = 280
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/308 (27%), Positives = 128/308 (41%), Gaps = 54/308 (17%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++ +N+
Sbjct: 3 LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDGENNW---------- 52
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
G F+ +L+Y TP YLRK + LR+ G+LPP L A + +P
Sbjct: 53 -------GGEFVETVLRYAATPPYLRKEAWGKRDELRYAGILPPVLVASTTADDSDESPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
REG+ + + V V+ G+ + + L G RV V + + + A +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163
Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
+P G+ V S G D I TS +G+ ++ L
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205
Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
T R + + FG GL + ED + F + NT P QGS +RT
Sbjct: 206 TARITEAGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265
Query: 346 EEAIFISL 353
EEA+F SL
Sbjct: 266 EEAMFASL 273
>gi|448611748|ref|ZP_21662178.1| rpl operon protein [Haloferax mucosum ATCC BAA-1512]
gi|445742509|gb|ELZ94003.1| rpl operon protein [Haloferax mucosum ATCC BAA-1512]
Length = 280
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 127/303 (41%), Gaps = 44/303 (14%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR DE+VVF ++
Sbjct: 3 LSVLVPTSLVREAEDQREATRKLGYVARAAAVFRADELVVFPDEDG-------------- 48
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWAP 180
EN+ G F+ +L+Y TP YLRK + LR+ G+LPP + + E
Sbjct: 49 ---ENKWGGEFVETVLRYAATPPYLRKEAWGKRDELRYAGILPPVLVSSTTAGDSDETPA 105
Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
REG+ + + V V+ G+ V + PG+ V+ + + P +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPVSL--FTPPGMNVSKGERVAVRISSREPVRARIV 161
Query: 241 SKPKESGMYWGYKVRYAPNISSVFK-NCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR-- 297
+P G+ V ++ + + + I TS +G+ ++ L T R
Sbjct: 162 DEPLP-----GFDVSRMDLTEALGRPDAGVR------IATSRYGESLSVPRLGDLTARIT 210
Query: 298 ---HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
+ + FG GL + ED + F + NT P QGS +RTEEA+F
Sbjct: 211 DAGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSDVVRTEEAMF 270
Query: 351 ISL 353
SL
Sbjct: 271 ASL 273
>gi|365758876|gb|EHN00699.1| YMR310C-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 317
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/233 (24%), Positives = 99/233 (42%), Gaps = 40/233 (17%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P +++ + ++L T +A+ ++F + E+V+ + DN RS+
Sbjct: 21 SLCIPSTLVADCRNLSQITHKVYHVAKVASLFNVSELVILE-----DNPPRST------- 68
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
S + S A ++ LLQY TP YLR +F+ L LP L R K +
Sbjct: 69 SKKKISNAKLILALLQYFVTPPYLRNTVFNEKFRPYLVAASKLPRLSTLPFTRYEKQDHG 128
Query: 180 PFREGVTLKENAPNSV----------GTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
+REG+T+K P S +++G + + + L P RVT+ T
Sbjct: 129 RYREGLTIKMQKPTSARKKTGKEFKQTKYINIGKPEALALQNQLVPINARVTIDTIT--- 185
Query: 229 LDADSPRQVVPPSKPKESGM----YWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
R++V P + + ++GY R A + + +F K GY I
Sbjct: 186 ------RKIVSPQEAYGDFIGLDSHYGYHTRIASSFTDLFMKGPLKEGYTQSI 232
>gi|402586177|gb|EJW80115.1| hypothetical protein WUBG_08978, partial [Wuchereria bancrofti]
Length = 67
Score = 59.7 bits (143), Expect = 2e-06, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%)
Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
+LI FGGL GLE +IE D + ++F Y N P QGSR IRTEEAI I+L
Sbjct: 2 ILIVFGGLEGLETAIEADKNINCLTPEKLFEHYLNIVPGQGSRIIRTEEAIPITL 56
>gi|222480858|ref|YP_002567095.1| hypothetical protein Hlac_2450 [Halorubrum lacusprofundi ATCC
49239]
gi|222453760|gb|ACM58025.1| Protein of unknown function DUF171 [Halorubrum lacusprofundi ATCC
49239]
Length = 288
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 132/320 (41%), Gaps = 74/320 (23%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF ++
Sbjct: 7 LTICVPSSVVREAEDAREATRKLGYVARAAAVFRADRLVVFPDREGERRRG--------- 57
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 58 --------GEYVRTVLGYAATPPELRKDLWGERDELRYVGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP+E G++V A ++S G + TS HG+ ++
Sbjct: 154 RVSSREPVRARITGKPEE-----GFQVVEA-DLSEAL------AGDGLAVATSRHGEELS 201
Query: 288 SSDL--TLPTFRH---LLIAFGGLA-GLEESIE-EDDGLKRKNA--REV-----FHSYFN 333
S L + R +AFG GL E + DG++ A R V F + N
Sbjct: 202 VSRLGDIVSDARKAGGYTVAFGSPERGLPEMLGLSPDGIRAAVADGRSVEPDPGFDLWLN 261
Query: 334 TCPHQGSRTIRTEEAIFISL 353
T P Q S +RTEEA+F++L
Sbjct: 262 TIPAQASEVVRTEEALFVTL 281
>gi|448460078|ref|ZP_21596998.1| hypothetical protein C469_14736 [Halorubrum lipolyticum DSM 21995]
gi|445807796|gb|EMA57877.1| hypothetical protein C469_14736 [Halorubrum lipolyticum DSM 21995]
Length = 288
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 132/321 (41%), Gaps = 76/321 (23%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF ++
Sbjct: 7 LTICVPSSVVREAEDAREATRKLGYVARAAAVFRADRLVVFPDREGERRRG--------- 57
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 58 --------GEYVRTVLGYAATPPELRKDLWGERDELRYVGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHL-IGTSEHGDIV 286
+ + P + KP+E G++V V + S D L + TS HG+ +
Sbjct: 154 RVSSREPVRARITGKPEE-----GFQV--------VGADLSEALAGDGLAVATSRHGEEL 200
Query: 287 NSSDL--TLPTFRH---LLIAFGGLA-GLEESIE-EDDGLKR--KNAREV-----FHSYF 332
+ S L + R +AFG GL E + DG++ + R V F +
Sbjct: 201 SVSRLGGIVSDAREAGGYTVAFGSPERGLPEMLGLSPDGIRAAVADGRSVEPDPGFDLWL 260
Query: 333 NTCPHQGSRTIRTEEAIFISL 353
NT P Q S +RTEEA+F++L
Sbjct: 261 NTIPAQASEVVRTEEALFVTL 281
>gi|392299539|gb|EIW10633.1| hypothetical protein CENPK1137D_3251 [Saccharomyces cerevisiae
CEN.PK113-7D]
Length = 129
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 7/94 (7%)
Query: 64 SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
S+ +P ++IDN +LE T A QIAR +F + E++V D S+ + +R
Sbjct: 41 SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 93
Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
S E S L LLQY TP L F +S
Sbjct: 94 SKETISDCLLLATLLQYFVTPPNLLDTTFKKKTS 127
>gi|448475311|ref|ZP_21603029.1| hypothetical protein C461_11528 [Halorubrum aidingense JCM 13560]
gi|445816782|gb|EMA66669.1| hypothetical protein C461_11528 [Halorubrum aidingense JCM 13560]
Length = 288
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 87/321 (27%), Positives = 132/321 (41%), Gaps = 76/321 (23%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF ++
Sbjct: 7 LTICVPSSVVREAEDAREATRKLGYVARAAAVFRADRLVVFPDREGERRRG--------- 57
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
++ +L Y TP LRK L+ LR+ G+LPPL P+R
Sbjct: 58 --------GEYVRTVLGYAATPPELRKDLWGERDELRYAGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHL-IGTSEHGDIV 286
+ + P + KP+E G++V V + S D L + TS HG+ +
Sbjct: 154 RVSSREPVRARITGKPEE-----GFQV--------VGADLSEALAGDGLAVATSRHGEEL 200
Query: 287 NSS---DLTLPTFRH--LLIAFGGLA-GLEESIE-EDDGLKRKNA--REV-----FHSYF 332
+ S DL + +IAFG GL E + DG++ A R V F +
Sbjct: 201 SVSRLGDLVSDARKAGGYVIAFGAPERGLPEMLGLSPDGIRAAVADGRPVEPDPGFDLWL 260
Query: 333 NTCPHQGSRTIRTEEAIFISL 353
NT P Q S +RTEEA+F++L
Sbjct: 261 NTIPAQSSEVVRTEEALFVTL 281
>gi|1181552|gb|AAA86857.1| ribosomal protein [Haloarcula marismortui]
Length = 214
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 83/184 (45%), Gaps = 32/184 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
T SV VP S+ + ATR G +ARA ++R+D + V+ + + +
Sbjct: 2 TTSVLVPSSLAREAEDRREATRKLGYVARAAAVYRVDRLTVYPDPDGAGKW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
E G F+ +L+Y TP +LRK ++ L +VG+LPPL +R +
Sbjct: 53 ------EDG--FVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPL----RVRSQTGSGS 100
Query: 182 REGVTLKENAPNSVGTLVDVGLN---KH-----VVVDQVLDPGVRVTVAMGTNRNLDA-- 231
+L++ VG V +N +H V D ++ G RVTV + + R + A
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGMQHPISLPVPADMDVEQGERVTVRVSSRRPVRAKL 160
Query: 232 -DSP 234
D+P
Sbjct: 161 VDAP 164
>gi|448436251|ref|ZP_21587052.1| hypothetical protein C472_12365 [Halorubrum tebenquichense DSM
14210]
gi|445682919|gb|ELZ35329.1| hypothetical protein C472_12365 [Halorubrum tebenquichense DSM
14210]
Length = 300
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 126/323 (39%), Gaps = 80/323 (24%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF +
Sbjct: 19 LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVFPDSEG-------------- 64
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
+ G ++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 65 ---DGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 111
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 112 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLGEHPISLLVPSGMEVEQGERVTI------ 165
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP+ G++V A ++S G + TS HG+ ++
Sbjct: 166 RVSSREPVRARITGKPEA-----GFQV-VAADLSEALA------GDGLAVATSRHGEELS 213
Query: 288 SSDL--TLPTFRH---LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
S L + R +AFG E + E GL R F
Sbjct: 214 VSRLGDLVSDAREAGGYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 270
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ NT P Q S +RTEEA+F +L
Sbjct: 271 WLNTIPAQASEVVRTEEALFATL 293
>gi|448503713|ref|ZP_21613342.1| hypothetical protein C464_14720 [Halorubrum coriense DSM 10284]
gi|445691914|gb|ELZ44097.1| hypothetical protein C464_14720 [Halorubrum coriense DSM 10284]
Length = 300
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 125/317 (39%), Gaps = 68/317 (21%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF +
Sbjct: 19 LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 61
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
R + G ++ +L Y TP LRK L+ LR+VG+LPPL P+R
Sbjct: 62 REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 111
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVD--------QVLDP-GVRVTVAMGTNRNLDADS 233
G T + G + +VG V V+ +L P G+ V + +
Sbjct: 112 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVVRGERVTIRVSSRE 171
Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL-- 291
P + KP+ G++V A ++S G + TS HG ++ S L
Sbjct: 172 PVRARITGKPEV-----GFQV-AAADLSEALA------GDGLAVATSRHGGELSVSRLGD 219
Query: 292 TLPTFRH---LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHSYFNTCP 336
+ R +AFG E + E GL R F + NT P
Sbjct: 220 LVSDAREAGGYTVAFG---APERGLPEMLGLSPDRIRSAVADGRPVEPDPGFDLWLNTIP 276
Query: 337 HQGSRTIRTEEAIFISL 353
Q S +RTEEA+F +L
Sbjct: 277 AQASEVVRTEEALFATL 293
>gi|448730376|ref|ZP_21712684.1| hypothetical protein C449_11338 [Halococcus saccharolyticus DSM
5350]
gi|445793544|gb|EMA44116.1| hypothetical protein C449_11338 [Halococcus saccharolyticus DSM
5350]
Length = 280
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 130/315 (41%), Gaps = 54/315 (17%)
Query: 62 TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
TVS+ VP S+ AT G +ARA T+FR+D VVVF + +
Sbjct: 2 TVSLLVPSSLCREAADKREATHKVGLVARAATVFRVDRVVVFPDPDGDRRW--------- 52
Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
G F+ +++Y T YLRK + L VG+LPPL A
Sbjct: 53 --------GGGFVSTVMEYAATAPYLRKEAWGTRDELEQVGVLPPLRAVSRTGSESQGSG 104
Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
R+G+ + E P+ V+ GL + + D LD G RVTV + + + A
Sbjct: 105 SSRQGI-VTEVGPDGR-VRVNCGLQHPISLVDPTDVGLDEGERVTVRISSREPVRARIVD 162
Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
+ +P G+ V A ++S+ ++ + I TS HG + + L +
Sbjct: 163 EPLP-----------GFVVERA-DLSAALGREDAGLR------IATSRHGQALTTERLGV 204
Query: 294 PTFR---HLLIAFGGLAGLEESIEEDDGLKRKNAREV------FHSYFNTCPHQGSRTIR 344
R + +AFG ++ D +A E F + +T P+QGS +R
Sbjct: 205 LAGRVEDGMTVAFGAPERGLPAMLGIDTASVASADETGSDPGGFDRWLDTVPNQGSEVVR 264
Query: 345 TEEAIFISLQYFQEP 359
TEEA+F +L P
Sbjct: 265 TEEAVFATLACLTLP 279
>gi|50310397|ref|XP_455218.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49644354|emb|CAG97926.1| KLLA0F03058p [Kluyveromyces lactis]
Length = 409
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 129/322 (40%), Gaps = 76/322 (23%)
Query: 29 LSLANGDSSSCDNKKKKKR------KRDQL------NDDAPIEVPTV--SVAVPGSIIDN 74
+S+ S N KK KR KRD L + TV S+ +P +++D+
Sbjct: 1 MSVKRSKEQSKQNYKKVKRDTKVTEKRDHLAQFKRAKKVVKVSTKTVNHSICIPTTVLDS 60
Query: 75 TQSLELATRLAGQIARAVTIFRIDEVVVFD-----NKSSSD--NYSRSSA------ANRS 121
++LE T QIAR+ I+ + E+V+ NK SD N + A N +
Sbjct: 61 CKNLEQITYTLYQIARSACIYNVAEIVILKAESETNKGESDRKNDKKQGAKIKFDDGNNT 120
Query: 122 NRS-------DENESGA------AFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPP 166
N + E E G + LLQ+ TP YL ++F + LP
Sbjct: 121 NNASTEYLEDSEKEDGKKRLSKPMLMASLLQFFITPPYLVNSVFKKDYMKYFTYAKQLPR 180
Query: 167 LDAPHHLRKHE--WAPFREGVTLKENAPNSVG---------TLVDVGLNKHVVVDQVLDP 215
+ + +R ++ +REG+ ++ + P G +++G +K++ + L P
Sbjct: 181 IPSLPFMRFYQDNNGRYREGLAIRMSKPGERGKSKKSFDQTKYINIGKDKNLELKGQLVP 240
Query: 216 -GVRVTVAMGTNRNLDADSPRQ-VVPPSKPK----ESGMYWGYKVRYAPNISSVFKNCSY 269
VRVTV D+ Q VV P + + +GY VR A + + +F +
Sbjct: 241 INVRVTV----------DTIEQKVVSPEEAYGDFVGAKASFGYHVRVAKSFADLFTQSPF 290
Query: 270 KGGYDHLIGTSEHGDIVNSSDL 291
GY + +NS D
Sbjct: 291 PQGYTQTVW-------INSGDF 305
>gi|448535709|ref|ZP_21622229.1| hypothetical protein C467_10476 [Halorubrum hochstenium ATCC
700873]
gi|445703210|gb|ELZ55145.1| hypothetical protein C467_10476 [Halorubrum hochstenium ATCC
700873]
Length = 300
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 128/323 (39%), Gaps = 80/323 (24%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF +
Sbjct: 19 LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVFPDAEG-------------- 64
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
+ G ++ +L Y TP LRK L+ LR+VG+LPPL + W R
Sbjct: 65 ---DGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPL-------RVLW---R 111
Query: 183 EGVTLKENAPNSVGTLVDVG----------LNKH-----VVVDQVLDPGVRVTVAMGTNR 227
G T + G + +VG L +H V D ++ G RVT+
Sbjct: 112 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSDMEVEQGERVTI------ 165
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP+ G++V A ++S G + TS HG+ ++
Sbjct: 166 RVSSREPVRARIAGKPEA-----GFQV-VAADLSEALA------GDGLAVATSRHGEELS 213
Query: 288 SSDL--TLPTFRH---LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
S L + R +AFG E + E GL R F
Sbjct: 214 VSRLGDLVSDAREAGGYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 270
Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
+ NT P Q S +RTEEA+F +L
Sbjct: 271 WLNTIPAQASEVVRTEEALFATL 293
>gi|448441450|ref|ZP_21589113.1| hypothetical protein C471_06403 [Halorubrum saccharovorum DSM 1137]
gi|445688859|gb|ELZ41106.1| hypothetical protein C471_06403 [Halorubrum saccharovorum DSM 1137]
Length = 288
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 130/320 (40%), Gaps = 74/320 (23%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ +P S++ + ATR G +ARA +FR D +VVF +
Sbjct: 7 LTICIPSSVVREAEDAREATRKLGYVARAAAVFRADRLVVFPDGEGERRRG--------- 57
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
++ +L Y TP LRK L+ LR+ G+LPPL P+R
Sbjct: 58 --------GEYVRTVLGYAATPPGLRKDLWGERDELRYAGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP++ G++V A ++S G + TS HG+ ++
Sbjct: 154 RVSSREPVRARITGKPED-----GFQVVEA-DLSEALA------GDGLAVATSRHGEELS 201
Query: 288 SSDL--TLPTFRH---LLIAFGGLA-GLEESIE-EDDGLKRKNA--REV-----FHSYFN 333
S L + R +AFG GL E + DG++ A R V F + N
Sbjct: 202 VSRLGTIVSDAREAGGYTVAFGSPERGLPEMLGLSPDGIRAAVADGRSVEPDPGFDLWLN 261
Query: 334 TCPHQGSRTIRTEEAIFISL 353
T P Q S +RTEEA+F++L
Sbjct: 262 TIPAQASEVVRTEEALFVTL 281
>gi|448463229|ref|ZP_21598007.1| hypothetical protein C468_03543 [Halorubrum kocurii JCM 14978]
gi|445817224|gb|EMA67100.1| hypothetical protein C468_03543 [Halorubrum kocurii JCM 14978]
Length = 288
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 131/320 (40%), Gaps = 74/320 (23%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+++ VP S++ + ATR G +ARA +FR D +VVF ++
Sbjct: 7 LTICVPSSVVREAEDDREATRKLGYVARAAAVFRADRLVVFPDREGERRRG--------- 57
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
++ +L Y TP LRK L+ LR+ G+LPPL P+R
Sbjct: 58 --------GEYVRTVLGYAATPPGLRKDLWGERDELRYAGVLPPLR----------VPWR 99
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
G T + G + +VG V V+ L + G RVT+
Sbjct: 100 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153
Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
+ + P + KP++ G++V A ++S G + TS HG+ ++
Sbjct: 154 RVSSREPVRARITGKPED-----GFQVVEA-DLSEALA------GDGLAVATSRHGEELS 201
Query: 288 SSDL--TLPTFRH---LLIAFGGLA-GLEESIE-EDDGLKR--KNAREV-----FHSYFN 333
S L + R +AFG GL E + DG++ + R V F + N
Sbjct: 202 VSRLGTIVSDAREAGGYTVAFGSPERGLPEMLGLSPDGIRAAVADGRSVEPDPGFDLWLN 261
Query: 334 TCPHQGSRTIRTEEAIFISL 353
T P Q S +RTEEA+F++L
Sbjct: 262 TIPAQASEVVRTEEALFVTL 281
>gi|354547624|emb|CCE44359.1| hypothetical protein CPAR2_401610 [Candida parapsilosis]
Length = 403
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 92/411 (22%), Positives = 155/411 (37%), Gaps = 109/411 (26%)
Query: 34 GDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIID--NTQSLELATRLAGQIARA 91
+S++ + K KK R Q P++S+ VP S+I N +L+ T +A QIA+A
Sbjct: 6 AESATSEAKSTKKPTRQQT-------APSISICVPSSVISSKNAYNLQQKTMIAYQIAKA 58
Query: 92 VTIFRIDEV-------------------------------VVFDNKSSSDNYSRSSAANR 120
I+ + E+ VVFD+ +D S + N
Sbjct: 59 SLIYDVAEIIVLTESTTKQEGKGGQSEEEKKDISSSIGKKVVFDDNGENDIGSSAPVENP 118
Query: 121 SN---------RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
++ R ++++ A L LLQ+ TP YL K +FS + +F +LP
Sbjct: 119 TDTELDENTDERVNDDKDDALLLASLLQFFITPPYLIKTMFSPRLNPKFGQVLPKFKYAF 178
Query: 172 HLRKHEWAP----------FREGVTLKENAPN-----------------SVGTLVDVGLN 204
L K P F+EG+ + P +V V++G +
Sbjct: 179 KLPKIPSLPFMQNNNVHEHFKEGIIIPRETPKIKSKTDKQKLVPSPHKVTVSKYVNIGES 238
Query: 205 KHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPN--I 260
+ + +D + V RVT+ + RN S +Q + K S +GY VR +
Sbjct: 239 EAMKLDIAREVPVYSRVTIDV---RNKTIVSAKQAYGINGHKSS---FGYHVRMVNDNQF 292
Query: 261 SSVFKNCSYKGGYDH--LIGTSEHGDIVNS-SDLTLPTFR----------HLLIAFGGLA 307
+ VF N GY + ++ D + D ++P F +LLI G
Sbjct: 293 NKVFTNSPLSDGYSQTIFVNCDDYFDKWSKLVDESVPLFEKKGDANGNNDNLLIILGNYG 352
Query: 308 GLEESIEEDDGLKRKNAREVFHSYFNT-----CPHQGSRTIRTEEAIFISL 353
++ + + D+ +F N C + R E+A I+L
Sbjct: 353 TIQRNFQSDEA-----KSSIFEGLDNVSKLFDCRLDIPKGCRIEDATLIAL 398
>gi|433431321|ref|ZP_20407623.1| hypothetical protein D320_16455, partial [Haloferax sp. BAB2207]
gi|432194053|gb|ELK50713.1| hypothetical protein D320_16455, partial [Haloferax sp. BAB2207]
Length = 228
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 17/105 (16%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV VP S++ + ATR G +ARA +FR +E+VVF ++ +N+
Sbjct: 3 LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDGENNW---------- 52
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
G F+ +L+Y TP YLRK ++ LR+ G+LPP+
Sbjct: 53 -------GGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPV 90
>gi|403215892|emb|CCK70390.1| hypothetical protein KNAG_0E01240 [Kazachstania naganishii CBS
8797]
Length = 352
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 104/227 (45%), Gaps = 32/227 (14%)
Query: 61 PTVSVAV-PGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
P VS+ V +++ N Q+LE T ++ARA +++ + EVVV S + R++ +
Sbjct: 25 PRVSLCVCAAAVVGNCQNLEQITHKVYELARAASMYNVREVVVL-----SLDPPRTAGTS 79
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--- 174
+ E S + + LLQY TP YL +++F H + LP L +R
Sbjct: 80 KG----EKLSDSMLVATLLQYFVTPPYLVRSVFKKRFHKYFQHAEKLPRLAQLPFMRQDT 135
Query: 175 -KHEWAPFREGVTLKENAPNSVG--------TLVDVGLNKHVVVDQVLDP-GVRVTVAMG 224
+ + +REGV++ P + V+VG ++ + + L P VRVTV +
Sbjct: 136 PREGKSRYREGVSVAMERPAKMSKQKQFQQTKYVNVGASELLELRAQLVPLNVRVTVDVV 195
Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKN--CSY 269
++ + SP + E+ +GY VR A + +F C+Y
Sbjct: 196 EHKIV---SPAEAYGDHVGAEAA--FGYHVRVADTLVDLFLKSPCNY 237
>gi|390938994|ref|YP_006402732.1| hypothetical protein Desfe_1289 [Desulfurococcus fermentans DSM
16532]
gi|390192101|gb|AFL67157.1| protein of unknown function DUF171 [Desulfurococcus fermentans DSM
16532]
Length = 276
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/292 (23%), Positives = 121/292 (41%), Gaps = 32/292 (10%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P SI+ +L L + Q+AR +IF ++ VV + +S
Sbjct: 6 LSIVLPTSILSVEPTLFLKSLRIHQVARWSSIFGVNRVVFYREYETS------------- 52
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
DE + +Y TP YLR+ L + LR+VG LPP+ +
Sbjct: 53 -RDEFREHREIISIHWRYFFTPPYLRRRLVPRNPLLRYVGALPPIRLSEFNVSGKPVDGE 111
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
E + + +D + K+ +V +PG G +R DS R++ ++
Sbjct: 112 ERIGFITLEEGKLTAYLD-NVEKYSLV----NPG---GCKQGFSRVRVVDSRRKL---AE 160
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL-TLPTFRHLLI 301
++ Y G + + ++ + K +I T + G+ + S L +L + L++
Sbjct: 161 CIDTEYYIGPSLVFRDSLREALDDAEEKV---FIIATDKTGEAPSLSKLVSLKERKELML 217
Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
FG I +G N E + +NT P Q ++RTEEA+ I+L
Sbjct: 218 LFGSPERDLFEISRGEGF---NLLEYVDAVWNTVPGQHVVSVRTEEAVIITL 266
>gi|218884489|ref|YP_002428871.1| hypothetical protein DKAM_1178 [Desulfurococcus kamchatkensis
1221n]
gi|218766105|gb|ACL11504.1| DUF171 domain containing protein [Desulfurococcus kamchatkensis
1221n]
Length = 276
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 73/298 (24%), Positives = 125/298 (41%), Gaps = 44/298 (14%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+S+ +P SI+ +L L + Q+AR +IF ++ VV + +S
Sbjct: 6 LSIVLPTSILSVEPTLFLKSLRIHQVARWSSIFGVNRVVFYREYETS------------- 52
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
DE + +Y TP YLR+ L + LR+VG LPP+ +
Sbjct: 53 -RDEFREHREIISIHWRYFFTPPYLRRRLVPRNPLLRYVGALPPIRLSEFNVSGKPVDGE 111
Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
E + + +D + K+ +V +PG G +R DS R++ ++
Sbjct: 112 ERIGFITLEEGKLTAYLD-NVEKYSLV----NPG---DCKQGFSRVKVVDSRRKL---AE 160
Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH------LIGTSEHGDIVNSSDL-TLPT 295
++ Y G P++ VF+N S + D +I T + G+ + S+L +L
Sbjct: 161 CIDTEYYIG------PSL--VFRN-SLREALDEAGEKVFIIATDKTGEAPSLSNLASLKG 211
Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
L++ FG I +G N E + +NT P Q ++RTEEA+ I+L
Sbjct: 212 RMELMLLFGSPERDLFEISRGEGF---NLLEYVDAVWNTVPGQHVVSVRTEEAVIITL 266
>gi|448529670|ref|XP_003869886.1| hypothetical protein CORT_0E01660 [Candida orthopsilosis Co 90-125]
gi|380354240|emb|CCG23753.1| hypothetical protein CORT_0E01660 [Candida orthopsilosis]
Length = 411
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 82/379 (21%), Positives = 143/379 (37%), Gaps = 107/379 (28%)
Query: 34 GDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIID--NTQSLELATRLAGQIARA 91
+S+S + K KK + Q P+VS+ +P S+I N +L+ T +A QIA+A
Sbjct: 6 AESASSEPKPTKKPTKHQ-------STPSVSICIPSSVISYKNAYNLQQKTMIAYQIAKA 58
Query: 92 VTIFRIDEVVVF-------------------------------------DNKS------- 107
I+ + E++V DN++
Sbjct: 59 SLIYDVSEIIVLKESPIKHEKEEEKEKHGNGGGKKEAMSAVGKKVVFNEDNENGASSSLQ 118
Query: 108 ---SSDNYSRSSAANRSNRSDENESGAAFLVRLL-QYLETPQYLRKALF---------SM 154
S+D + + + +D NE+ A L+ L Q+ TP YL K +F S+
Sbjct: 119 PVPSTDTKPKEETPSEAVNNDANENDDALLLASLLQFFITPPYLIKTMFLPRLNPKFTSI 178
Query: 155 HSSLRFVGMLPPL-DAPHHLRKHEWAPFREGVTLKENAPN-----------------SVG 196
S ++ LP + + P + + F+EG+ + P +V
Sbjct: 179 LSKFKYAFKLPKIPNLPFMQNNNVYEHFKEGIIIPRETPKIKSKTDKSKLVPSPHKVTVS 238
Query: 197 TLVDVGLNKHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKV 254
V++G + + +D + V RVT+ + RN S R+ K S +GY V
Sbjct: 239 KYVNIGEPEAMKLDIAREVPVYSRVTIDV---RNKTIVSAREAYGTHGHKSS---FGYHV 292
Query: 255 RYA--PNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLPTFR-----------HL 299
R S VF N GY + ++ D + + +P+F +L
Sbjct: 293 RMVNETQFSKVFTNSPLSDGYSQTIFVNCDDYFDKWSKTVDQVPSFEEKKSRGNDGNDNL 352
Query: 300 LIAFGGLAGLEESIEEDDG 318
LI G ++++ E D+
Sbjct: 353 LIILGNCGTIQQNFENDEA 371
>gi|70606413|ref|YP_255283.1| hypothetical protein Saci_0598 [Sulfolobus acidocaldarius DSM 639]
gi|449066625|ref|YP_007433707.1| hypothetical protein SacN8_02905 [Sulfolobus acidocaldarius N8]
gi|449068899|ref|YP_007435980.1| hypothetical protein SacRon12I_02895 [Sulfolobus acidocaldarius
Ron12/I]
gi|68567061|gb|AAY79990.1| conserved Archaeal protein [Sulfolobus acidocaldarius DSM 639]
gi|449035133|gb|AGE70559.1| hypothetical protein SacN8_02905 [Sulfolobus acidocaldarius N8]
gi|449037407|gb|AGE72832.1| hypothetical protein SacRon12I_02895 [Sulfolobus acidocaldarius
Ron12/I]
Length = 250
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 54/117 (46%), Gaps = 19/117 (16%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
+SV + SI D + LE G I R + IFR+ EV+ D+ + N +R + SN
Sbjct: 10 LSVVLFTSIFDTEKKLEEIIYKLGFIIRTLVIFRVSEVIWLDDLKNKKNITR-IIKDVSN 68
Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
Y TP Y +K F + +L VG++PP++ P H+ +++
Sbjct: 69 -----------------YALTPPYGKKY-FPIKRTLSKVGLIPPINIPSHVVSNDYV 107
>gi|320101369|ref|YP_004176961.1| hypothetical protein [Desulfurococcus mucosus DSM 2162]
gi|319753721|gb|ADV65479.1| protein of unknown function DUF171 [Desulfurococcus mucosus DSM
2162]
Length = 274
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 48/106 (45%), Gaps = 16/106 (15%)
Query: 63 VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
VS+ +P SI+ +L L + Q+AR +IF + EVV + Y +
Sbjct: 3 VSIVLPSSILSVESTLLLKSLRIHQVARWTSIFGVSEVVFYRE------YGTPHGEYMGH 56
Query: 123 RSDENESGAAFLVRL-LQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
R L+ L +Y TP YLR+ L LR+VG LPP+
Sbjct: 57 RQ---------LIELHWRYFFTPPYLRRRLVPRSPLLRYVGALPPI 93
>gi|323303496|gb|EGA57290.1| YMR310C-like protein [Saccharomyces cerevisiae FostersB]
Length = 265
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 62/246 (25%), Positives = 100/246 (40%), Gaps = 35/246 (14%)
Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--K 175
RS + S A ++ LLQY TP YLR +F+ L LP L R K
Sbjct: 13 RSMPPKKKISTAKLILALLQYFVTPPYLRNTVFNEKFRPYLTAASKLPRLSTLPFTRYQK 72
Query: 176 HEWAPFREGVTLKENAPN----SVGTL------VDVGLNKHVVVDQVLDP-GVRVTVAMG 224
+ +REG+T+K P G + +++G +K + + L P RVT+
Sbjct: 73 QDHGRYREGLTIKMQKPTLARKKTGKVFKQTKYINIGKSKALALQSQLVPINARVTIDTI 132
Query: 225 TNRNLDADSPRQVVPPSKP--KESGMY--WGYKVRYAPNISSVFKNCSYKGGYDHLIG-- 278
T R++V P + +G+ +GY R + + +F K GY +
Sbjct: 133 T---------RKIVSPQEAYGDFTGLXSQYGYYTRIXSSFTDLFMKGPLKEGYTQSVYVP 183
Query: 279 -TSEHGDIVNSSDL-TLPTFRHLLIAFGGLAGLEES--IEEDDGLKRKNAREVFHSYFNT 334
T+ I S L T T H+L+ F L + +++D + + +E F +
Sbjct: 184 LTTRDTSIPELSSLPTAETNPHILLVFSTWDTLARAFKLDQDQFVDCQGPQEFFDAQLP- 242
Query: 335 CPHQGS 340
CP S
Sbjct: 243 CPVSNS 248
>gi|385301686|gb|EIF45859.1| DEHA2G18788p-like protein [Dekkera bruxellensis AWRI1499]
Length = 258
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 75/176 (42%), Gaps = 35/176 (19%)
Query: 128 ESGAAFLVRLLQYLETPQYLRKALF----------SMHSSLRFVGMLPPLDAPHHLRKHE 177
+S L LLQ+ TP YL +++F + L + MLP + A H
Sbjct: 2 DSDTMLLAGLLQFFVTPPYLVESVFGNSPIDLKKFNYAKKLPTIPMLPFMQANDG---HG 58
Query: 178 WAPFREGVTL-------KENAPNSVGTL---------VDVGLNKHVVVDQVLDPGVRVTV 221
++EG+T+ K+ A V + V+VGL K + + + + G RVTV
Sbjct: 59 TKKYKEGMTIQNLSKSRKKTASGKVKKIKKADRMTKYVNVGLKKMLELKEAVPKGARVTV 118
Query: 222 AMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
M + + SP S S +GY+VR A ++ ++ + GYD ++
Sbjct: 119 DMKKHIVV---SPYVAYGTSSTNNS---YGYQVRVATSLEALLGQTPFPDGYDMIL 168
>gi|198423892|ref|XP_002121436.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 1860
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 32/58 (55%)
Query: 15 KEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSII 72
+E NH+S ++ L NGD +S D K +R+R + D + P V +A+P S++
Sbjct: 404 EEKLVNGNHDSHDDTRLMNGDDASHDLKGIYERQRLKHEQDESAKTPIVDIAIPNSVV 461
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.132 0.379
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,016,339,570
Number of Sequences: 23463169
Number of extensions: 256169958
Number of successful extensions: 529151
Number of sequences better than 100.0: 512
Number of HSP's better than 100.0 without gapping: 445
Number of HSP's successfully gapped in prelim test: 67
Number of HSP's that attempted gapping in prelim test: 527268
Number of HSP's gapped (non-prelim): 652
length of query: 378
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 234
effective length of database: 8,980,499,031
effective search space: 2101436773254
effective search space used: 2101436773254
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 78 (34.7 bits)